ID Gypsy123-I_DR repbase; DNA; ZEB; 5414 BP. XX AC chr16; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy123-I_DR; KW Gypsy123-LTR_DR; Gypsy123_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5414 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1282-1282 (2008). XX DR Genome; chr16; Positions 15705351 15710764. XX CC Positions [2923-3399] - Integrase core CC 'CTGCTG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 733..5163 FT /product="Gypsy123-I_DR_1p" FT /translation="MTSIEVGEFIENPSEQIFNLLTKDQLIEIAAHYGIVL FT TTNDKKLKENVRTVLKMAFIEKGIMSSEVGVSVTRYPSTTFTFEQQKELVL FT LQLEQRKLEFKMEIEKRKLEQETILAQQSLENRKLDLQQQRLDLMKDGKVR FT LNDDDLGEGAEGGLPSVNFNVAANLRLMPLFNEEDVDTFFLLFERVADNQS FT WPEAERSLMLQCVLTGKAQKAYSALSLADSQNYDKVKKAVLKAYELVPEAY FT RQKFRGLRKNFTQSYVEVARELQLCFTRWCNSASVKLFKDLVELVVLEQFK FT NILPNQIAIFIAERNVLSIAEAAALADEYDLIHRVGSDVGCLKEKPTFVQR FT PELPTVTRYGNTKIAGKNDPVLICNYCLGHGHWKKECPALQMKGRGRYIQP FT KPTFLAAPVARSVESESSFIPCSNEMRESSFSKCNFSPFVSDGFVSLPEGE FT RVPVKILRDTGASASFISESVLPFSKESFTGKSMLICGIGLTRMSVPLHRV FT TLQSKLIHGDVELGLRPALPVEGVSVILGNNLAGGRVWGESAPPLIVTSKP FT ISVADRDSGLDYHDVFPACAVTRSASRMSIMKNEVPFQIDGGVSWPASLLS FT VSQDELIKQQKADLTLNELFSQVLSDANAAQGYFMCEGVLIRKWTPQAKAS FT IGTPYVQVVIPTPLRSMVLKLAHDDQSGHLGVRKTYDRLMRNFFWPRLKRD FT VSSYIKTCPVCQLIGKPNQNPKPVPLRPIPAVSEPFSHLIVDCVGPLSRSR FT SGSEYLLTVMCQVTRYPAAFPLRSITAKSVLKALSQFMSVFGIPKIIQSDR FT GTNFTSGVFEEVLRQLNVKHNISTAYHPQSQGAIERFHQTIKSLLRAYCTE FT LDRDWEEGLPWLMLAAREVTHESTGYSPNELVFGHNVRGVLAALRDDWISL FT EPPKKLAEYVNGFKRRLFEAVKLAKANLMQAQLKMKQAYDRRTEHRSFSVG FT DQVMALLPVTTSPFKARFAGPYVIHRKESDENYVIATPERRTKTQVCHVNL FT LKPFYGRSGSVCCQVSPVLSVHPATSDELFLDYEKTMEPDDSIMCPRLSNS FT EILSDLPAFLQHLPTERRLELVRLIEDYVCLFSDVPNQTHVVQHDIDVGDS FT LPVRQRFYRVPWEKRKQLESEVKYLLENNLAVPSSSSWSSPCLLVKKKDGT FT LRFCTDYRKLNAITKPDSFPLPRIEDCVDKVGAARYLSKFDLLKGYWQIPL FT SPRAREVSAFITPDGAYSYNVMSFGLRNAPATFQRLMNRVITDLEGCAVYL FT DDLIVYSDSWEEHLRRIRQLFERLKEAGLTVNLAKCEFARATVTYLGKVVG FT HGVVRPVSAKIEAIDRYPPPRTKRELMRFLGMVGYYRSFCNNFSETVAPLT FT DLLKVSRVFEWNSICQNAFENVKALLTNAPVLAAPQWDRPFKIHVDACEIG FT AGAVLLQDDGDGVERPVCFFSKKFLPYQINYSVIEKEALALVWALQHFDVY FT VGGGWIAGGYLL" XX SQ Sequence 5414 BP; 1454 A; 890 C; 1312 G; 1758 T; 0 other; aatgggggct cgtctaaaac atttttctta gactggttaa atttggtatt gtttgtataa 60 tttttgttat tgttttggat tttgttaaat ttgttttggt taattggtta tgatgataaa 120 actgaaattt gatttattat tgggagaatc gagtgcaatt gtattgaaag aatagactta 180 ttttgtgtaa ttccatgcac agctggttga catttcaaat agccctgtga attaatttcg 240 ttgttaggat taagtgatat gtctcttggg acatgtcaat tttttttttt tagtttgtta 300 tttttgagca gatcggtggt tagagctatg tctcgggagc acgccgaggt gtgagagaat 360 cgccaatata attggttgtt gttctcagcc tgcggtattt ggtgcataaa aacccaccac 420 tgtatgcttg aagactaggg tctacttagt tagatttatg tatagttata ggttgcgaaa 480 ttaatataca gggcgcaaat atatttgttg tctccagtgg gtgtggcaag gaggtaaatt 540 acttaagggg tttattgatt attatttttc ttttattttg tatatgtgtg tgtatatata 600 tatatatata tatatatata tatatatata tatatatata tatatatata tatgtatgta 660 tgtatatgta tatagacata tatagatttt tttttttctt tctctttcgt tgtgcccctt 720 ctgttcctga ttatgacgtc aattgaagta ggtgagttta ttgagaatcc ttctgagcaa 780 atatttaatt tgttaacaaa agaccaattg atcgagattg cagcccacta tggtattgtc 840 cttacgacaa atgataaaaa actgaaagaa aatgttagga cagttttaaa gatggctttt 900 atagagaaag ggattatgtc ttcagaagta ggggtttcgg ttacgaggta tccctctact 960 acctttacct ttgaacaaca gaaagagttg gtacttctcc agttagaaca gcgaaaattg 1020 gagtttaaaa tggaaattga gaaacgaaaa ttagaacagg aaactatttt ggcacaacaa 1080 tctttggaaa atagaaaatt agatcttcaa cagcaacgtt tggatctaat gaaggatgga 1140 aaagtgagat taaatgatga tgatttaggt gaaggagcag agggggggct accttctgtt 1200 aattttaatg tagcagcaaa cttgcgatta atgcctttgt ttaatgaaga agatgtggat 1260 acattttttt tgttatttga acgagttgcg gacaatcaga gttggccgga ggccgaacga 1320 tcgttaatgt tacagtgtgt attgacgggt aaagcacaga aagcttattc tgctttgagt 1380 ttggccgata gtcaaaatta tgataaagtt aaaaaggccg ttttgaaggc atacgaattg 1440 gttccagaag cgtatcgcca aaaattccgt ggtttaagaa agaactttac ccagtcatat 1500 gtagaagtag cacgtgaact gcaattgtgt tttacacgct ggtgtaattc tgcctctgtc 1560 aaattgttta aggatctggt ggaattggtt gttttggagc agtttaagaa catactgcca 1620 aatcaaattg ctatatttat tgcagaacgt aatgttttgt caatagcaga ggcagcagca 1680 ttagcagacg agtatgattt aattcataga gtggggagtg atgtgggttg tcttaaagag 1740 aaacctacat ttgtgcagcg tccagagtta ccgacagtca cgcgttatgg aaacacgaaa 1800 attgctggta aaaacgatcc agttttaatc tgtaattatt gtttaggtca tgggcattgg 1860 aagaaagagt gtcctgcgtt acagatgaaa ggtcggggcc gttacatcca gccaaaacct 1920 acattcttag ccgcaccggt ggctagatct gttgaatctg agtcctcctt cataccatgt 1980 agtaacgaaa tgcgggagag ttccttttct aagtgtaact tttcaccttt tgtttctgat 2040 ggttttgtgt ctcttccaga aggcgagcgt gtgccggtga aaattctgcg ggacacaggg 2100 gcatctgctt ctttcatttc tgagtccgta ttacctttct caaaagaatc atttacgggc 2160 aagtcaatgt tgatctgtgg aattgggtta actaggatgt cggtgccact gcatagggtt 2220 acgttgcagt ctaaattgat tcatggtgat gtggaactag ggctgagacc tgcccttcca 2280 gtggaaggag tttctgtcat tcttgggaat aatttggctg gtggacgagt ttggggtgag 2340 tcagcaccac cattaattgt aacgtcaaaa cccatatccg ttgcagaccg ggatagtgga 2400 ttggattatc atgatgtttt tcctgcttgt gcagtcacgc gctcagcaag tcggatgagt 2460 attatgaaaa atgaggtacc atttcaaata gatgggggtg tttcttggcc agcgtcgttg 2520 ctttctgtat cgcaggatga attgattaag cagcaaaaag cagatctgac gttgaatgaa 2580 ttgttttcac aggtgctctc cgatgctaat gcagcccagg ggtatttcat gtgtgaaggg 2640 gttttgataa gaaaatggac tccacaagct aaagcctcta tagggacgcc atacgtacag 2700 gtagtaattc caaccccttt aagatcgatg gtgttaaagc tcgctcatga tgatcagtct 2760 gggcatttgg gcgtgaggaa gacgtatgat cgtttaatgc ggaatttttt ttggcctcga 2820 ctaaaacgcg atgtgtcgag ctatattaaa acttgtcctg tttgccagct aataggtaaa 2880 ccaaatcaga atccaaaacc tgttccgtta cgtcctatcc ctgcggtgtc tgaaccattc 2940 agtcatttaa tagtggactg tgtgggtcct ttgagtcgat caaggtctgg tagtgagtat 3000 ttgctgactg ttatgtgtca ggttacacgg tacccggcag cttttccact gcgatcaatt 3060 acagcgaagt ctgttcttaa ggcgctctcc caatttatgt cagtctttgg gatacctaaa 3120 atcatacagt cagatcgggg tactaatttt acttcgggag tgtttgagga agttttgcga 3180 cagcttaatg ttaagcataa tatatctacg gcgtatcatc cgcagagtca gggcgcgatt 3240 gaacgcttcc accagactat taaatccctg ttacgtgctt attgtacaga actggatcgt 3300 gactgggagg aggggttacc ttggctcatg ttagctgcca gagaggtaac ccatgaaagt 3360 actggttata gcccaaatga actggttttc ggtcataatg tccgtggggt actagctgcg 3420 ttacgggatg attggatttc tcttgagcca ccgaaaaaat tagcggagta tgtaaacggg 3480 tttaaaagac gactctttga ggctgttaag ttggctaaag caaacttaat gcaggcacaa 3540 ttaaaaatga agcaagcgta cgatcgtaga actgagcacc gatcttttag tgtcggggat 3600 caggtaatgg cgttattgcc ggtgactact tctccattta aagctagatt tgctggcccg 3660 tatgtgattc accgtaaaga gtcggatgaa aattatgtta ttgcgacacc agaaagaagg 3720 acaaaaactc aagtatgtca tgttaactta cttaaacctt tttatggccg ttcaggctca 3780 gtttgttgtc aggtatcccc tgtattgtcc gttcatcctg cgacatctga tgaattattt 3840 ttggattatg aaaagacaat ggagccagat gacagcatta tgtgtcctcg attgagtaat 3900 tcagaaattc tgagtgactt accagccttt ttgcaacatc ttcctactga gagacgttta 3960 gagttggtca gattgattga ggattatgta tgcctttttt cagatgttcc taatcagacg 4020 catgttgtac aacacgatat tgatgtgggg gattcccttc cagtgcgtca gcgattctat 4080 agagtgccgt gggaaaaaag gaagcaattg gaatcggagg ttaagtatct gttggagaat 4140 aatttggctg ttccatcttc ttctagttgg tcatctcctt gcttgctagt aaaaaagaaa 4200 gatggtacct tgagattttg taccgattat agaaaattaa atgcaatcac aaaacctgat 4260 tcttttcccc ttccccgtat cgaagattgt gtggacaaag tgggtgctgc cagatacctc 4320 agtaaatttg atttattgaa gggttattgg cagattccgc tgagtccgcg cgcccgtgaa 4380 gtctctgcgt ttattacacc tgatggggca tattcttata atgtgatgag tttcggattg 4440 cgaaatgcac cggccacatt tcagaggcta atgaatcgtg tgataactga tttggaaggt 4500 tgtgcagttt acttggatga tttgattgta tatagcgatt cttgggagga acatctacgg 4560 cgaattaggc agctatttga gcgtctgaag gaagcagggc ttactgtgaa cctagcgaag 4620 tgtgagttcg ctagggccac ggtaacctat ttgggtaagg ttgttggtca tggagtagtg 4680 cgtcccgtga gtgctaaaat tgaggcaatt gacagatatc ccccaccccg taccaagcga 4740 gaattgatgc gttttcttgg gatggtggga tattatcgga gtttttgtaa taatttttca 4800 gaaaccgtcg ctccgttgac cgatttgttg aaagtgtcac gtgtgtttga gtggaattca 4860 atctgtcaaa atgcctttga aaatgtgaag gcgttgttga caaatgctcc agttcttgct 4920 gcgccgcagt gggacagacc atttaaaata catgtagatg cgtgtgagat tggggcaggg 4980 gctgtgctcc tacaggatga tggtgatggg gtggaacgtc cagtatgttt cttctccaaa 5040 aaatttctgc cttatcagat taattattct gtaatcgaaa aggaagcctt ggcattggtt 5100 tgggcactcc agcattttga tgtctatgtt gggggggggt ggattgccgg tggttatcta 5160 ttgtgaccac aaccctttga cgtttttgac aactttacaa aatcctaacc aacgtttaat 5220 gcgatggtct ttatttttac agccgtacaa tttggagatt caccacatca aggggaagga 5280 aaatattttg gcggacgccc tttctcgtgc accggtctct tgaagtgata gagatatgtc 5340 gatttgtgtt ctgtttctta tttgttgatt gtgtataagt tttttttttt tttttttttt 5400 tggggaggaa ggtg 5414 // ID BEL18-I_DR repbase; DNA; ZEB; 6170 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL18-I_DR; KW BEL18-LTR_DR; BEL18_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6170 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1383-1383 (2008). XX DR Genome; chr13; Positions 18129699 18135868. XX CC Positions [5108-5680] - Integrase core CC 'ATAAT' target site duplication CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(152..2074,2078..5764) FT /product="BEL18-I_DR_1p" FT /translation="MEMDEDDQESERPLQMENEGNMETDIKGAIEQKDKNT FT SEEQCHVQESTELPRRSERPHIPTEKMLAYQKEECRKKEKKLTTLYEQWKL FT DARKARQDLKADISDKQLADIADSLEDKRNGIMKIFIDIREFITPAADLRR FT KIDACDAVTNDIVKIVLEKISNVDEYDAERERGRLSELLAHSYARSIFGSS FT ASLMSVNSHSGSSRTARKRADAAAELAAKEAEYKMMQAERQQKEKLRNIEE FT QFKKDLETQRFELECLQVEKDIEVARARVKSYDEEIKEETRNQSIKSEKQG FT PRNERQLQESYVLQSASLSEVSHLAQAVQDSIAINRLPLPEPTVFNGDPIQ FT FVEWKSSFMSLIGKKGISAADKLYYLKKYVSGSAHRYLEGTFYRNDEEAYK FT DAWDKLNKRYGQPFVIQRAFREKLSKWPKIQSKDAEGLRTFSDFLNACLQA FT IPHIKGLEILNDCEENQKLVQKLPDWIASRSNRQVTKTLMDDKEFPSFHDF FT VNFMTTEAEIMCNPITSLHALHSCESPYERRMSKETKRSKAIVFKTQTSVN FT NDTQSTVKGKFTMTKTQCMLCQDDSHQLHKCLKFMEKSLKDRRAYVKDNKL FT CYGCLKSSHHAKECRRRHTCDVCKLRHPTCLHDYNYDKDGIREPLPMVSNV FT HATESETTNAMSLNITRESQSVITSMIVPVWVSSVKNPSNEQLVYALLDTQ FT SDTTFIDKDISNTLQVESHPVKLKLTTMMGDNEIIRSNRVTDLQVRGYSSS FT VHISLPPVYTKECIPANYDHIPTQKTAMNWSHLKGIADEMAPLLNCEVGLL FT LGYNCPRELAPRQVILGKNEEPYAILTDLGWSIVGCSIPSINESTRFSLCQ FT RIATKEIPAMTPGDAVRILESDFKDITVDDKTVSQEDIMFLDKLKVGIKKN FT YLGHYEMPLPFKQRPSLPDNRELASIRLQHLNKKFSKNEKYKTDYFTYMND FT IIDRGDAEVVNEDEGTSGEKWYIPHHGIYHPKKPDKLRVVFDCSAKFKGTS FT LNEHLLSGPDMINNLTGVLVRFRRHPIAVICDIEKMFHQFHVQEKDRDYLR FT FLWWKDGDTNSVLQDYRMKVHLFGAVSSPGCANYGLRYLANEYSQSHALGA FT QFITRDFYVDDGVTSAETVEKAIQLTKEARELCAKGGLRLHKFVSNDNTVL FT QSIPESERAVNFETKDLTFNEMPLERALGIHWNIQSDGFKFHVPLKGQPTT FT RRGILSTVASIFDPLGFVAPFVLNGKRILQEMCRQGTGWDDTLYPALKPRW FT ERWQNDLVNLENVNIPRCYVPADFGKVVKTEMHHFSDASTCGYGQCSYLRM FT INEDDEIHCSFLIGKARVSPLKIITIPRLELTAAVVSVSMSNILSEELNLI FT DVDKFFWTDSKVVLGYINNDARRFHTFVANRVQKIRQSTTPNQWLYVPTYE FT NPADNASRGRRINELLSSNWFSGPSFLWEKEITTTNVVPDLPVGDPEVKKI FT QTLHTKTMENKDFVDRLSKFSSWSKTIKAVARLLRWIKKDKTNSPASVSEQ FT EDAKCLILKGVQRQVLSQGKRLPSHSKLHHLDAFIDKDGIIKVGGRLDHSS FT YTHAFKHPWILPKEHHVTKLIIAHCHECVKHQGKGFTINNIRSCGYWIPGI FT SRVVTSYILNCVTCRRMRRSAEGQRMSDLPSERVEPSPPFMYCGMDCFGPF FT MTKEGRKNHKRYGLLLTCLCSRAIHIEMLEDMSTDAFINSLRCFIAIRGTV FT RQIKCDQGTNFIGAKNELNSALQELDPERLTTFLANKQCDFVLNAPHSSHA FT GGVWERQIKTVRSVLNATITLSAGRLNDASLRTVFYEAMAVVNSRPLTIDN FT LNDPKSLEPLTPNHLITMKPTTALPPPGKFIREDMYTKKRWRRVQYLC" XX SQ Sequence 6170 BP; 2130 A; 1166 C; 1345 G; 1529 T; 0 other; gtaagagctt caccttcgct gattaagatc ctcgcggcag gcgcaggcgg acggatataa 60 cgcgacgacg gaaagggcat tgcagtgagt tataatcagc atagagcgtg atggtctacg 120 gagctgagat gtaaaagggt acgagttcaa tatggaaatg gatgaagatg accaagaatc 180 tgagaggccg ctccaaatgg aaaatgaagg taatatggaa acagatataa aaggagcaat 240 tgaacaaaaa gacaaaaata catcagaaga acagtgtcat gtacaagaga gcacagaact 300 ccctcgacgt tcagaacgtc ctcatatacc aacagagaaa atgcttgctt atcaaaaaga 360 ggaatgtaga aagaaagaaa aaaaattaac tacattgtat gaacaatgga agttagatgc 420 tcgtaaagca aggcaggatt taaaagctga catatctgac aagcagcttg cagatatcgc 480 tgattcactt gaggataaaa ggaatggtat catgaaaatc ttcatcgata taagggaatt 540 tattactcct gctgctgact taagacgtaa aattgatgca tgtgatgctg taaccaatga 600 cattgtcaaa atagtgttag aaaaaatatc aaatgtagat gagtatgatg cagaaagaga 660 aagggggcgt ctaagtgaac ttttagcaca cagttacgct cgctctatat ttggttccag 720 tgcttcacta atgagtgtca acagccattc tggatcctct cgtacagcaa gaaaacgagc 780 agacgctgca gcagagctgg ctgcaaagga ggctgaatac aagatgatgc aagcagaaag 840 gcaacaaaag gaaaaattaa gaaatataga agagcaattc aaaaaagatt tagaaactca 900 aaggtttgaa ctggaatgtt tgcaagtaga aaaggatata gaggttgctc gtgccagagt 960 aaaatcctat gatgaagaaa taaaggagga aacaagaaat cagtcaataa aaagtgaaaa 1020 gcaggggccg agaaatgaaa ggcagcttca ggaatcttat gttcttcagt cagcatcgct 1080 cagtgaagtg tctcaccttg ctcaagcagt tcaggacagc atagccataa acaggttacc 1140 cctacctgag cctacagtgt tcaacggaga cccaatacag tttgtggaat ggaagtcatc 1200 atttatgtca ttaataggta aaaagggcat ttcagcagct gacaagcttt attatttgaa 1260 aaagtatgtg agtggctcag cgcataggta tcttgagggc accttttatc gaaatgatga 1320 agaagcgtac aaagacgctt gggacaagct taataaaagg tacggccagc cctttgttat 1380 acaaagagcg tttagagaaa agctgtctaa gtggcctaaa atacagtcca aagacgcaga 1440 aggtttgaga acattttcag actttttaaa tgcatgtttg caagccatac cacatataaa 1500 aggtctggaa atattaaatg actgtgaaga aaaccagaag ctggtgcaaa aactacctga 1560 ctggatagct tccagatcga accgtcaggt cacaaaaacc ctcatggatg acaaagagtt 1620 tccaagtttc catgactttg tcaatttcat gacaacagaa gcagaaatta tgtgtaatcc 1680 catcacttct ttacatgctc ttcattcatg tgagtcacca tatgagagaa gaatgtcaaa 1740 ggaaactaaa agaagtaaag caattgtttt taagacacaa acaagtgtaa acaatgacac 1800 acaatcaaca gtcaaaggaa agtttacaat gactaaaaca cagtgtatgt tgtgtcagga 1860 tgacagtcat caacttcata agtgtctgaa attcatggag aagtctctga aagacagaag 1920 agcctatgtg aaagataaca aactctgcta cgggtgttta aaatcgtccc accatgctaa 1980 ggagtgtcgt cgtcgacaca catgtgatgt atgcaagtta cgacatccca catgtcttca 2040 cgattacaat tatgacaagg acggaatcag agaatgacca ttgcccatgg taagcaatgt 2100 gcatgcaact gaaagtgaaa ctactaatgc catgtcactt aatatcacaa gagaaagtca 2160 gtcagtcatt acatcgatga tcgtaccagt atgggtgtct tcagttaaaa atccatccaa 2220 cgaacagctt gtgtatgcat tgctagatac acagagtgac actacattta ttgataaaga 2280 cattagtaat acactgcaag tagagagtca cccagtgaaa ctcaaactaa ccaccatgat 2340 gggagacaat gagattatta gaagcaatcg agtaacggat cttcaagtga gagggtacag 2400 ttcctcagta catatcagtc ttcctcctgt atacacaaag gagtgtattc cagctaatta 2460 tgatcacata ccaacacaga aaacggcaat gaactggagt cacttgaaag gaattgctga 2520 cgaaatggca cctctactca attgtgaagt aggcttacta cttggctata actgccctag 2580 agaactagct ccaagacaag tcatactagg taaaaatgag gaaccatatg ccattttaac 2640 agatttaggg tggagtattg tgggctgctc cataccaagt atcaatgaat caacaaggtt 2700 cagtctttgt caaaggatag ctacaaagga gataccagct atgacacctg gggatgcagt 2760 ccgtatactt gaatcagact ttaaggacat cacagtagat gacaagacag tctctcaaga 2820 ggacatcatg ttcctagaca aacttaaagt aggaataaag aaaaattatc taggacatta 2880 tgaaatgcct cttccattta agcaacgacc atccttgccg gacaacagag aacttgcatc 2940 gatcagactc caacatctga acaagaagtt ttccaaaaat gagaaataca aaacagacta 3000 cttcacttac atgaatgaca ttattgacag aggtgatgca gaagtggtaa atgaagatga 3060 gggaaccagt ggtgaaaaat ggtacatacc tcaccatggg atctaccacc caaaaaagcc 3120 agataagcta cgggttgtat tcgattgctc tgcaaaattc aaaggtacaa gtctcaatga 3180 acatttacta tcaggaccag acatgataaa caacctcacg ggagtccttg taaggttcag 3240 aaggcatcct attgcagtga tatgtgacat tgaaaaaatg tttcaccaat ttcacgttca 3300 ggaaaaggat cgtgattatc tgcgcttctt atggtggaaa gatggagaca caaattcagt 3360 tcttcaagat tacagaatga aagtgcatct atttggtgca gtctcttcgc ccggttgtgc 3420 aaactatggg ctaagatatt tagccaatga gtacagccaa tcacacgcat taggcgctca 3480 gttcatcaca agggattttt atgttgatga tggagttaca agtgcagaaa cagtagaaaa 3540 ggctatacag ttgacaaagg aagcacggga gctgtgtgca aagggtggtc ttcgacttca 3600 caagttcgta tcaaatgaca acacagtttt gcaaagcata ccagaatcag aacgtgctgt 3660 aaactttgaa acgaaagatc tcaccttcaa tgaaatgcca ctcgaaagag ctctgggaat 3720 tcactggaac atacagagtg atggtttcaa attccatgtt ccattaaagg gtcagcctac 3780 aacacgtcgt ggcatattat ccactgttgc ctccatattt gatcctttgg gatttgtagc 3840 tccatttgtt ctcaatggca agagaatcct acaggaaatg tgtcgccaag gaactggctg 3900 ggatgataca ctttaccctg cactcaagcc acggtgggag agatggcaaa atgaccttgt 3960 caatctagaa aatgtgaaca taccacgatg ttatgtacct gctgatttcg gaaaagtagt 4020 gaaaacagaa atgcatcatt tctcagatgc gagtacctgt ggttacggtc aatgttctta 4080 tctaaggatg ataaatgagg atgatgaaat tcattgctct tttcttattg gcaaagcccg 4140 ggtttctcct ttaaagatta tcacaatacc caggttagag ctaactgcag ctgtagtatc 4200 agttagtatg agcaatatac tcagtgaaga gcttaacctc attgatgtcg acaaattctt 4260 ttggactgac tcaaaggtag ttttgggtta cattaataac gatgcccgtc gtttccacac 4320 atttgtagcc aatagagtcc aaaaaatacg tcaaagtaca acacccaatc aatggctgta 4380 tgtcccaaca tatgaaaacc cagcagacaa cgcttcaaga gggagaagaa tcaatgagct 4440 gctttcatct aattggttct ctgggccctc atttttgtgg gaaaaggaaa taacaaccac 4500 aaatgttgtg ccagacctcc cagtcggaga tccagaggta aagaaaattc agacattaca 4560 cacaaagacc atggaaaata aagattttgt agaccgattg tccaagttct catcttggtc 4620 taagactatc aaagctgttg cacgccttct gagatggatt aagaaagata aaaccaattc 4680 accagcatcc gtaagtgaac aagaagacgc taaatgcttg attctaaaag gagtacaaag 4740 gcaagtgttg agtcaaggaa aaagacttcc aagtcacagt aaacttcatc atctggatgc 4800 ctttatagac aaagatggta taatcaaggt gggaggcaga ttagatcact catcatatac 4860 tcacgcattc aagcatccat ggatcctgcc aaaggaacac catgttacaa agttgattat 4920 tgcacactgc catgaatgtg taaagcatca aggcaagggt ttcacaatca ataatatcag 4980 atcatgcggc tattggatac ctggaataag cagagtggtc acatcttaca ttctcaactg 5040 tgtgacttgt agaaggatga gaagatctgc tgaaggccag aggatgagtg atctcccctc 5100 agaacgtgta gagccttcgc cgccttttat gtattgtggc atggactgct ttggtccatt 5160 catgactaaa gaagggagaa aaaatcacaa gaggtacggt ctgcttctaa catgcttgtg 5220 ttcaagagct attcatattg aaatgttaga agacatgtcg acggatgcat ttattaacag 5280 tctgagatgc tttattgcaa ttcgaggcac agtaaggcaa ataaaatgtg accaaggtac 5340 caatttcata ggcgcaaaga atgaattgaa ttcagcattg caagaacttg atccagaacg 5400 cctcacaact tttcttgcaa acaaacaatg tgactttgtt ttgaatgctc ctcattccag 5460 tcatgcaggg ggtgtatggg agcgtcaaat caaaactgtc cggagcgttc ttaatgctac 5520 catcaccctt tcagcaggca ggcttaatga tgcctcattg cgcactgtat tttacgaagc 5580 catggcagta gtcaatagtc gccctcttac gattgacaat ctgaatgacc ccaagagctt 5640 agaaccactc acacccaatc atctgataac catgaagccc accacagcat tgcctccacc 5700 aggaaagttc attagagagg atatgtatac aaaaaagaga tggagacgtg tacagtacct 5760 ttgctgaaca gttttggagc agatggaaaa aggagtatct acacaatatc attgcaaggc 5820 aacgctggca tgctccaaaa aggaaccttc aaattggtga tgtggtcatg gatatagaag 5880 aaacattgcc taggagtgaa tggagactag gaaggattat agagactgtt ctgagcccgg 5940 atggacttgt aagaaaagcc aaagtagcac ttggcgacaa gagattaaac aagaaaggaa 6000 aagaatgcat aaaatctcaa tagtagatag acccgctcaa aagttggtct tgttgttgga 6060 agcagagtga atgcatttac aatgggactt tattgcactt tatacacttc caaagagggt 6120 tataattgga aattagttat gtttgtggaa cattgataat ttggtgggag 6170 // ID DNA-3-6_DR repbase; DNA; ZEB; 585 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 3-bp; KW DNA-3-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-585 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1229-1229 (2008). XX DR [1] (Consensus) XX CC TIR is 32-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 585 BP; 171 A; 144 C; 133 G; 137 T; 0 other; ggggccgatc acaccgaacg cgcttttacg ttccaaaaac gcgaggcgca ccacactgcc 60 ttttttgttg acaagaaaaa aagaagcgcg gtgcgctttt tatgtcgcta ggcaacgact 120 gaatcagctg ggtattgtgc gagagtgttg ctgttgatat taatataata ttaatataat 180 tttaatattt aataatattg tgatattcaa gatttgtgaa gtcatacagc tccggataac 240 tcaaaacagc aaccacaagt ctctcctcca tcttcaaaag tctccgtagt cgtcttgaca 300 acacaaacac tgcagtcacc tcaacggaaa ccccgcctct gctttcattt gattggagaa 360 tgaaaaagac gcgactgacg taacgtgctt tttccgctca gagttgactt ttttcaactg 420 cgagcgcaca gcacgcaacg gcaaaaacgc gaggcgcagc aggcggttaa aacgcgaggc 480 gcgcagggcg cataagcagc gcgcaaaaac gctcgcggcc gttagtaaat cattcaaaaa 540 aggctgcctc tcaacgcaaa aagcgcgttc ggtgtgatcg gcccc 585 // ID Gypsy58-I_DR repbase; DNA; ZEB; 4221 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy58-I_DR; KW Gypsy58-LTR_DR; Gypsy58_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4221 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1431-1431 (2008). XX DR Genome; chr16; Positions 11954172 11949952. XX CC 'AATCA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 45..3992 FT /product="Gypsy58-I_DR_1p" FT /translation="MSLLQPPSVLSLQGNLAENWKVWLQKFELYLIASGVA FT EKSETVQCATFLHIAGEEAIKVYNTLQFTAEEVNKIDALKRKFKEYCEPRK FT NLPYIRHMFFTRTQSTGETIDAYVTDLKHKAKDCEFGDLTDSLIRDRIVCG FT IKDDNVRARLLREADLTLEKAIDTCRANEITSNQMKMLVEEVDVNKISSVK FT NTKRRGKTVQFEQKSRQHALENQSKTHVKCSRCGYQHEYQKCPAYGQVCKS FT CNKRNHFAKMCKTQVSTKKIQSIDENEQSDDERSDFFVGTIELQTNQRVKN FT VHTVTTEAQNLKDNWTESLRINNKSVTFKLDTGAACNVMPHNVFTTLETQR FT KLKKSNCHLVTYSGHKMSPVGQVQLTFTYRKQKHNIQFQVVDGNVPAILGR FT TTCEKLKMVKRIHEISESENKADPLKGFDDLFCGLGCLPTVHHIKTDPDVS FT PVIHPPRKVPIALKDKIVEELHKMESMGVIVKQTEPTDWVNSMVTVVKPNK FT IRICMDPQDLNKAIKREHYPLLTVEEVVSSMPNAKIFSVLDANQGFWQIKL FT DEESSKLCTFNTPIGRYRFLRLPFGVSSASEVFQRAIAQMLEGLKGVVNVI FT DDILVWGETTEEHDHRLIKLLERARECNLKLNKSKCQIRTSEIRYLGHIIS FT EEGLKPDEEKVRAIVKMPPPENKQALMRFMGMVQYLSKFIPNLSDISAPLR FT KLLEEDIQWHWEEEQQKSFTKMKNLLANAPTLQFFNVQKPVTLSVDASSEG FT IGAVILQDKKPVAYGSRALTDAQKNYAQIEKELLAIVYGCEKFHQYLYGKE FT VEVESDHKPLEIIFKKPLHQAPPRLQRMLLRLQRYILKVVYKPGKEMHIAD FT ALSRAYLNECTEDLLGQQLELCCITTYLPVSEEKLNVFRKATATDQEMQLL FT CKMTMDGWPKEKSEVPKQIQTYWTFKEEISFTSGLLFKSAKIIVPSQLRKE FT MLTKIHESHLGVVKCKERARDVLYWPNMSTQIEEFVSQCPVCNAHKNSNPR FT EPMISHPIPGRPWSKIGTDLFHFDGSNYLLSVDYYSKFTEISKLSDTTAQS FT VIMAMKSMFARYGIPDTVVSDNGPQYASTEFSSFAKAWEFEHVTSSPGHAQ FT SNGQAERAVQTIKNLLKKAQCGKNDPYISLLEYRNTSLDGIGYSPAQLLMG FT RRLKSKIPTSVTLLTPENNIQIHHKLKQRQHKQKSYFDRQTRLLPNVHVGE FT TVRVQQGDVWKPAIITRKHEQPRSFIIQTPEGRTYRRNRKHILKTGEKSFT FT TVTDNTVSDVIDSHVPDTSSMLDNDTQETVVTESQSHEYITRSGRQVKLPS FT RYKD" XX SQ Sequence 4221 BP; 1572 A; 782 C; 869 G; 998 T; 0 other; tggtgtcaga aaagggattg taaacggaaa gcacaactga gaagatgtca ctactgcaac 60 caccatcagt gttgtctctt caaggcaact tggctgagaa ttggaaggta tggcttcaaa 120 aatttgaatt atacctcatt gctagcggag tggcggagaa atcagaaaca gtacagtgtg 180 caacctttct tcatatcgcg ggtgaagaag cgatcaaagt ttataacaca ctccagttca 240 ccgcagaaga agtgaataag attgacgcac ttaaaaggaa atttaaagag tactgtgagc 300 caagaaagaa tctgccatat attcgtcaca tgtttttcac gagaacacaa agtaccggag 360 agacaattga tgcatacgtc acggatctta aacacaaggc gaaagactgt gaattcggtg 420 atttgactga ctcgctaata agggacagaa ttgtttgcgg gataaaagat gacaatgttc 480 gcgcaaggct gttgcgggag gcagatctca cgttggaaaa ggcaatagac acctgcagag 540 ccaatgaaat tacatcaaat caaatgaaaa tgcttgttga agaagttgac gtgaataaaa 600 taagcagtgt taaaaataca aagagaagag gcaaaactgt gcagtttgaa cagaaaagta 660 gacaacacgc tttggaaaat cagtccaaaa cacacgtgaa atgttcacga tgtggatatc 720 aacatgaata tcaaaagtgc ccagcgtatg gacaagtgtg taaatcctgt aacaaaagga 780 atcactttgc aaagatgtgc aaaactcaag tatcaacaaa gaaaatacaa tcaatagatg 840 aaaatgagca aagtgatgat gaacgaagtg atttttttgt aggaactatt gagctgcaaa 900 caaatcaaag agtgaaaaat gtgcacactg tcaccactga agcacaaaat ctgaaggaca 960 actggactga aagcttgaga ataaacaaca agagtgtcac attcaagtta gacacaggag 1020 cagcctgtaa tgtaatgccc cacaatgtgt tcacaacgct agaaacacaa agaaaactga 1080 agaaatctaa ctgtcaccta gttacatatt ctggccacaa gatgtcgcct gtgggtcaag 1140 tacagctaac tttcacctac agaaaacaaa aacacaacat tcagtttcaa gtggtggatg 1200 gaaacgtacc tgcaatactt ggacgcacaa cctgtgaaaa actaaaaatg gtcaaaagaa 1260 ttcatgaaat aagtgaaagt gaaaataaag ccgacccttt aaaaggattt gatgacttgt 1320 tttgtggtct tggttgctta ccaactgtgc atcacattaa aacagaccct gatgtgtcac 1380 cagtgattca tccaccaaga aaggttccaa tagcccttaa agacaaaata gttgaagaat 1440 tacacaagat ggaaagtatg ggagttatag taaagcagac agaaccgaca gattgggtga 1500 atagtatggt aactgtggtg aaaccaaaca aaatccgtat atgtatggat ccgcaggacc 1560 tcaataaagc aataaagcgt gagcattatc cattactgac agtggaagaa gtggtatcca 1620 gcatgccaaa tgccaagata ttttcagtcc tggacgctaa tcaaggtttc tggcaaataa 1680 agctagatga agaaagctcc aaactctgta cattcaatac accaataggc aggtatagat 1740 tccttcgctt accgtttggt gtgtcttcag ccagtgaagt gttccagcgt gcaattgcac 1800 agatgttaga aggtcttaaa ggtgttgtca atgtcataga tgacatccta gtatggggag 1860 aaacaacaga agaacatgat cacagactca tcaagctact ggaaagagca agagagtgta 1920 atctaaaact gaacaagagc aagtgccaaa taagaacttc agaaataaga tatcttggac 1980 atatcatctc agaggaaggt ctcaagccag atgaggaaaa agtaagagct attgtaaaaa 2040 tgccaccacc tgaaaacaaa caagcactca tgagatttat gggcatggtg caatacctgt 2100 ccaaattcat tccaaatcta tccgatatca gtgctcctct cagaaaactg cttgaagaag 2160 atattcagtg gcactgggag gaagagcaac aaaaaagctt cacaaaaatg aagaatttgt 2220 tagcaaatgc accaacacta caattcttca atgtacagaa acctgtcacg ctttcagttg 2280 atgccagctc agaaggaatt ggagcagtta ttctacaaga caaaaagcca gtggcatatg 2340 gatcacgtgc gctaacagat gctcaaaaga attacgcaca aattgaaaaa gagttgcttg 2400 ctattgttta tggatgtgaa aaatttcatc aatatctcta tggcaaagaa gtggaggttg 2460 aaagtgatca taaaccattg gagatcattt ttaagaaacc actgcaccaa gctccaccta 2520 gattgcagcg catgctcctc agacttcaaa gatacatcct gaaggtagtt tacaagccag 2580 gtaaagaaat gcacattgct gatgcattaa gtcgtgcata cctgaatgag tgtacagaag 2640 atttgcttgg tcaacagctg gaattgtgct gcataacaac atacttacct gtttcagaag 2700 aaaaactaaa tgtttttcgc aaagctacag ctactgatca agaaatgcaa ttactgtgca 2760 aaatgacaat ggatggatgg cctaaagaaa aaagtgaagt tccaaaacag attcagacat 2820 actggacttt caaagaagaa ataagtttca cttctggact tctttttaaa tcggcaaaga 2880 tcattgttcc aagtcaattg agaaaagaaa tgctaacaaa aatacacgaa tcacatcttg 2940 gagttgtaaa gtgtaaggaa cgagctaggg atgttctgta ttggccaaac atgtcgactc 3000 aaattgaaga gtttgtctca caatgtccag tgtgcaatgc acacaagaac agtaatccaa 3060 gagagccaat gatatcacat ccaataccag gaagaccttg gtccaaaatt ggcacagacc 3120 tgttccattt tgatggttca aattacttac tttctgttga ttactactct aaattcacag 3180 aaatcagcaa actgtctgac acaactgctc agagtgtaat tatggcaatg aagtcaatgt 3240 tcgcaagata tggaataccg gataccgtgg tctcagacaa tggtccacag tatgccagca 3300 cagagttcag cagttttgca aaagcatggg aatttgaaca tgtgacatcc agcccgggac 3360 acgcacagtc gaatggacag gctgagagag cagtccaaac cataaaaaac ctgctgaaga 3420 aagctcagtg tggtaaaaat gatccttaca tttctttatt ggaatacaga aacacatcgt 3480 tggatggtat tgggtactca ccagcacaat tattaatggg acgcagactc aaatccaaga 3540 tacctacctc agtcactttg ttgacacctg aaaacaacat acaaattcat cataaattga 3600 aacaaagaca acacaagcag aagtcttact ttgacagaca aacaagacta cttccaaatg 3660 tccatgttgg agaaactgtg agagtgcaac aaggagatgt ttggaaacct gcaataatca 3720 cacgaaaaca tgaacaaccc agatccttca tcatacaaac tccagaagga agaacataca 3780 ggagaaacag gaaacatatt ctcaagactg gagagaaaag tttcacaact gtgacagaca 3840 acacagtcag tgatgtgata gactcgcatg tccctgacac gtcatcaatg ctggacaatg 3900 acacacaaga aactgtagtc actgagtctc agtcacatga gtacattacc cggtctggga 3960 gacaagtgaa attgccttca agatacaaag actgaaaatc tacagcatgt ttagctctga 4020 tttgaattag aaacttcatc tctgtttaca attaaagaaa taaagaaaga atagtgtaga 4080 taaataaaag ttctatttag tttatgcagt tatttaaagc atacataata aggagtttac 4140 aaatacagta tggaaaaaga aaaagaaaaa aaagtttata cattgctgaa gattgcatag 4200 tattaaactt aagagaaggg a 4221 // ID DNA-2-24B_DR repbase; DNA; ZEB; 4304 BP. XX AC . XX DT 26-FEB-2009 (Rel. 14.02, Created) DT 26-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-24_DR; DNA-2-24B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4304 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 527-527 (2009). XX DR [1] (Consensus) XX SQ Sequence 4304 BP; 1443 A; 691 C; 741 G; 1428 T; 1 other; cactcttaaa aaaaaaggtt ccttgaggtt ctatatagaa ccttggggtt ctatacttgg 60 ccagtagaac cttttaccca aaaaatggtt ccatttagaa ccctttgact gcaaagaacc 120 tttttgatta aaatggttct ataatttttg aatcataaca taattatgta gtaaagatta 180 taaaataatt ataagttata acacagagag acaaagcttt ttgagtcaaa acctaataaa 240 atgaatgctg tggtcaaaaa cttaatcagt gtacagtatg atgggatttt tttcatgcag 300 gtttgtacaa tagcatataa cttataggcc tagtttatat atactcaatg tttaacattt 360 ttgacaaatt attttgtgaa ataaataaaa taaccaattt atttagactg tgcagtaaca 420 tgttgttatg gtttaataat ttatttgaaa ctaataatca taaaaaagcc ctctggtgac 480 gtcagtcatt gacaaggaca tcgtcatgga ccagtggtga agagcctcga gagtgaaggt 540 taatttttcg aaaggagaag tccactattt cagcttcatc attcaggtat gttagtacat 600 ttaatatata tagtgaaata ttttgtatta tgaagtaagg tttctatttg atagaaaatc 660 cctcgtgtta tatggtgata cagattgaat gggggyaact ttagcgttaa tatggataaa 720 cactgtgctt gtgcgatgac ctcggtcgtc acccttcggt tcagaaatct gttaagttga 780 atttgttctg ctttttattc gaaataagtt acaatatcgc tatacttgtt tgtcagtact 840 actaattaaa ttaccgataa cagtacacag ctgtgttatt tgaaggcgag cttcgagaga 900 gcgtgaatga agcatgtgca cgttatttta aaattaacta acgttagatt tgagccgatt 960 tcatcgtgtt tttcgtgttg tttttagctc caaccttgaa caaacgctca cttttagctt 1020 ggttcgtgtg tgttttatca gtgattgagc taaactctgc aaagtagtgg acattactgc 1080 ccgcttttga gacgttcccg ctagtctgca taaaaccgca agatagcagt gcaccgctcg 1140 tgcccgcaca ggcagattcg tgcccgagtc tgtgcacgcg cgggaggcca gtcacggtgg 1200 caagttctcc ctatggccag tccgcccctg aatattattg aaacctttaa aaagccaaac 1260 atattttaac aatccataga tgaaagatca ggtcttattt ataaaagaca caaaaatgca 1320 gactaaatgt attgtgaaat attaaatata tatatatata tatatatata tatatatata 1380 tatatatata tatatatata tatatatata tatatatata tatatatata tatatagtgc 1440 aaaataagtt taaaataaat attttaatgt atattttatt tttcttcaaa cagcctggaa 1500 actgaactgc atcataaaac ttgtgcttgg tatgtatcat tgttttaaaa tccaaacaaa 1560 ctgtattgtg gtcagtaaaa tgaaataaat attataaaag acttcaagat tgtatttatc 1620 attacagaaa agtgtcatgg gacccatatt aagtttccaa atttgatctg ctgcacacaa 1680 gtagctttga agaagaagga tcccagctta aagaaaaaac acgtgtcaca tccataattg 1740 tttgacatca tcttcaagca cacctggtaa agtattgtat tttataacat gcataattca 1800 gacttgttgg ttgttgtgta ctgaaaaaat atatttatca caaaaaatat tctattactt 1860 actactgctg ttttaataca gtgagaatat taaggtcgtt gcttaataaa aagtcctgca 1920 gagaagaaat atgtattgaa aggtgtaaat ttgcctacag gattccttgt tgcaatgtct 1980 tcaaaacaaa tgtaaaaatg aaaggatttc agtggtgagc tcataatgat gggatcaaaa 2040 ggaagtgaac tggactctga gggacttaac acaaggcaag ttcatttcaa tacattttat 2100 gtcaagatta attctgtgat ttaaattcat atttacacat tctacaaatg ctgtcttcct 2160 acctcagatt gtgatggacc tagaatgcga caccgtttac gaataagtca agaaccaaat 2220 gaaacacact attttccaga gtacagatta tggaggggtc tgcaaaacta atggagcagt 2280 ttcttcttct tggagtgcca cagcttgtga gtaaccaaac actgttacat tgtgtgccaa 2340 atagatatga aaaatctgtg tatatgacat tttaaagctc atttggtaaa gcaatgcatt 2400 agcaatgcca aatgtgttcc aggtctcaca tattaataaa tgtatagatt aaacacaatc 2460 taagtccctt tggataaaag tatctgccaa atatgatgta tgttatttga taaacagtta 2520 ttttgttttg ttcttattct aattttattg ttttatttaa tgttcatact gaattgaggt 2580 cactcagaaa tgtcaatttc attaaattct gtaacaaatg aaatgcagat taaatagaaa 2640 atatattaca gctgtctaca ctgtatttct gtatttttga tgactttaaa gtacttgtat 2700 tgctgaaaat gtgtcacttg gaaacttaca caaagtaaac aaaatgtttt aatttaggat 2760 gctgcatata aggacttagt gaagaacaca gaaacatctt taaaagaaat ggcccaatag 2820 aagggaagga aagaaggaag tactctatgc tagtctcatt ggacagcaga ggaaaccaca 2880 gaaggtaact acactacaca aaacacacac acacacttta attgtattaa ttataactgt 2940 agttctattt gacttatgtt cattggaaaa gtactcagca ttgaaatatt gttttacaga 3000 aatccaaagg tcctaacagc tctcctgatt ccgtcaaggc ttgaagtgtc agcattacga 3060 tggccaacac tcgctcacta tccttctgct ccctcattta tcagaggcac cacagtagaa 3120 tgaattagca attactcttc actttccagc cgcaacccag cactgggaaa cacccataca 3180 ctctctcata cacctgcaca cattcactca ctgtggccaa ttcaccaata ccgtacgcat 3240 gcctttggac tgtgggggca actggagcac ccagaggaaa cccacacaaa cactgggaga 3300 acttgcaaac ttcacacaga aacaccaact gacccagtca ggactcgaac cagtgacctt 3360 cttgctgtga tgtgacagtt ctaaccagtg agccactgtg ccacccatgg acaaaactat 3420 ttacgttata agagaggccg gcaaacatgg acatgctaca ggttgtggaa tggctgttct 3480 cagtgttttg tctaatcatt ttgcaaagtt ttgtgtgcga ttaagcactt tcttcaataa 3540 cttttattga aaattatgta taaaaatgga tgttgccatg accaatacat acatgtaccg 3600 tcaacttgtt gaaattgtag tgttatttaa tagatgtttg tgtattgttt ctatctgcat 3660 tgtaatacat ttttcatcat tttagagatg tttgaaagtt aagttttgta tttgttcaat 3720 cagtaagcaa gcttacaaat gtttattaaa aagtgaaatc attagaatta gattgttgaa 3780 tgccattaag ctgttaatta tgtttactaa ttaaatagta aaaatgtata atgtttagag 3840 gtagatctta tacaaagtac gcagaggtgc aattttaaat attatttaaa actttatttt 3900 gtgaagtaaa ttgaataaat ttaaacaatt tgaattttct gcattggttt tattttttat 3960 tcaataataa ataacattaa ttcaataata aataaagaac cctcaaatgg ttctaaatag 4020 aaccatgtaa cttgaattca aagaaccatt tggggttcta tttaatgaac ccttaaaatg 4080 gttctattta taaccttttt ggggttccat atatgaagcg cttgaaagaa ccattttttt 4140 tggttctata tagaaccgtt tattttaaga gtgaaagaac catttggggt tctatttaat 4200 gaacccttaa aatggttcta tttagaacct ttttggggtt ccatatatga agcgcttgat 4260 agaaccattt ttggttctat atagaaccgt ttcttttaag agtg 4304 // ID Gypsy51-LTR_DR repbase; DNA; ZEB; 937 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy51-I_DR; KW Gypsy51-LTR_DR; Gypsy57_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-937 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1418-1418 (2008). XX DR Genome; chr1; Positions 13298365 13297429. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 937 BP; 247 A; 126 C; 190 G; 374 T; 0 other; tgtggccgat tcatttaaga gatttattta tttacttatt tatttttata tgcaagtaat 60 agttaacttc agcaaaactt gcatctaatg ttttgtttta ttcacttcaa gagttcctat 120 gtttgttgta ggtggggctg taaatagtgt aacaccgggt gaaagttaaa gcttccctct 180 gtgtgcccgc gctcagtcgc gtggtgcacg cagatggtga agcagaggta attttcgctc 240 tcctctgatc ctacaattat tatccctttg cagtcacaat tattttattt tgacactaaa 300 atgctgttta gggtttctgt tgtaatgtgt attgttgtga tgagtatatt ttactttaaa 360 tggtgtgttt attgtgattt tagagttttc atttgcacgt gtatttttca tgtaggattg 420 agagggtttc cagtgttaaa ctatactgac tgtgcctcac tgttttgtta aggtatgttg 480 ttttggttat tggcttgtta ttttaaggat tcaaatatgt aaatcagttt aaattacagt 540 aatggtgtca tttaattgtg tcatggcaga ttttcaacta ataatttaat taatgcaaac 600 aaaaagggtt ttattttatc agtgatcaaa agaaaattaa tgcaatttac aattaacagt 660 tatttaatgt tacatatttt atttatatta tgcaatttat gtgattttgt ttttattggc 720 atttaaatgt ttaaaatagt aaaatacata tttctctgtg tgcctgctca gtcgcgtggt 780 gcacgcagat ggtgaagcag aggattgaga gggtttccag tgttaaacta tactgactgt 840 gcctcactgt tttgttaagg gaataaatca gtgtttctcc acctcatctg tgtgtgtgac 900 cgtgtttgag agggaagtcc atttgaagtc tgctaca 937 // ID Kolobok-N5_DR repbase; DNA; ZEB; 485 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE putative nonautonomous Kolobok transposons from zebrafish - a DE consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-N5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-485 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 8(10), 1349-1349 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 485 BP; 155 A; 78 C; 104 G; 148 T; 0 other; aggtcccgtg aagtgctttg aaatgtgcat ttttattcga tgtttgacgt aatctcaacc 60 gaaacacgaa gagagggtgg gacatagtgt agctcctccc ctttttaaaa aacagccaat 120 agcgttttgt ttttatcaca gctctgccag tgagagtggt tgagctcaag tgcatcaaat 180 gaaaagcaaa tgagaagcgt cttgaagggg gcggggcatg tcagatacta gagagcattt 240 gattggttat aatatgaaga ttgatttgat gagaaactga agtatgaggt gacgtgaata 300 aaaccgttga tccatttagg cggaagtgac aaactacaag ctttacatgt ttatatcagt 360 tttatatctt ctaaacgcga attttgtcac tgttttggag cacactagct tatagatatc 420 ctaaaaacta acaatactat atgatactaa catctaaaaa actttatttt aatttcatgg 480 gacct 485 // ID Gypsy72-LTR_DR repbase; DNA; ZEB; 596 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy72-I_DR; KW Gypsy72-LTR_DR; Gypsy72_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-596 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1460-1460 (2008). XX DR Genome; chr14; Positions 54393137 54392542. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 596 BP; 85 A; 170 C; 113 G; 228 T; 0 other; tgtcaggaca ctggcttgat ctttctcttc tacctcttcc tgcttttcat tcttgcagcg 60 tgcactcagc tgatcattat tacgaatcag tgcgcacacg gcttgctgca cctgttactg 120 gtctctgtta atttccttgc tatttaagct ctccttttcc cactttctgg tcgctagtgt 180 gtcctgtcat tattcaggct atctcgctgc gttatcttga ttcctgtcat tgtcatgcat 240 tacttcatgt tctgattctg tttttctctt ctgcccagtt tttgctcggt tgcctgtgtt 300 ttttctgttt cattttggat tctccccggc cagccaccta ccttcgactt cgcctgtccg 360 acgctgaact tgactttcct ccgttatccc atctgcctcc cgatcgatgc ttgcctgcct 420 gacctctctc acgcccgctg atttggacac tgttgccgag ttgcctttcg ctcccaccga 480 gcgatttcat tttcatttga gtcacgagcc ttggctcgtc atttgttttg agaagttaat 540 aaaatccttt tgtgttaaca gcatttgcgt ttggatccgt ctctccgggt gtgaca 596 // ID Gypsy73-I_DR repbase; DNA; ZEB; 4229 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy73-I_DR; KW Gypsy73-LTR_DR; Gypsy73_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4229 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1461-1461 (2008). XX DR Genome; chr14; Positions 17571841 17567613. XX CC 'CTCTT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 437..2518 FT /product="Gypsy73-I_DR_1p" FT /translation="MLARTCSFGDIKDSLIRDRIVCGINDAGLRERLLRER FT DLTLEACVRICRAAELSRQNSKAITGAATEEVHGVRQATRKEQYQTGNIIT FT CKFCGKKHEMNKRKCPAFGKKCKNCGKENHFAATCRSQLKQWRKRTVHNVN FT EQDSDEYEDILTVSATDSETVNKIESKREKKTQLFAGMLIGKDVVKFQIDC FT GASCNVIPINLLNPDTKLEATKTGLVMYNQSKVKPLGKCKLKIRNPRNQKL FT YRLEFQVVDVNCTVPLLGKRASEAMNLIKVQYENILAIDSIVTTEQPSTRP FT WSIQHIRTEYADVFTGDGCLEGAYKIEIDDTVKPVQLPKRRVPVAIMKPLK FT EELQDLQRREIIAPVERSTDWISSMVVVQKANGRPRVCIDPRPLNKALKHS FT HFPLPTIDDILPDLSKARVFTVCDVKSGFWHVKLEEESSYLTTFSSPFGRF FT RWLRMPMGISPAPEIFQRKLTLALEGLPGIYIIADDVLITGQGETQEAAER FT DHDAKLKAFLTRCRKKGIKLNADKFKLRQREVSYIGHLLTEDGLKIDPDKV FT KAITYMPRPMDVKGIQRLLGMVNYLSKFCAHLSDQCEVLRHLTHKDSEWKW FT TIKQEEAFSKIKETIAHAPVLKYYNPDEELTVQCDASDTGLGAALMQRGRP FT IAYASRALTHTERGYAQIKKGVFGSTFQHGKISPLYLWAESDSTK" FT CDS 2469..3824 FT /product="Gypsy73-I_DR_2p" FT /translation="MEKFHHYTYGRKVTAQSDHKPLENILRKPLLSAPKRL FT QRMALRLQKYDVDVVYVPGRDMLLADTLSRAYLPDSVAANSVEAEIETINM FT TQHLTISEDRLTKIRYATKDDKTLQSLINTIQQGWPRNKTETPQEIRHYYL FT FQEELSYQDGIVFRGERAVIPAKLCREIIQRIHSSHIGVEGCLRRARECVY FT WQGMNEQIKAYVQRCDTCRSVDFKQQKETLIPHETPNRPWAKVGTDLFTFD FT KKDYLITVDYYSNFWEIDYLQDTKSNTIIKKLKAHFARQGIPDIVFSDNGP FT QFSSQEFQNFSRHWEFLHKTSSPGYPQSNGKAEAAVKMAKRLMLKAKASRQ FT DPYLAILDHRNTPSQGLSTSPAQRLLSRRTKTLLPTKISLLRPKVQNVLGE FT IKANQHRQQTYYDRSAKDLSTLKQGDIVRVQPFEPHAMWRKGTVLEHVDPR FT SYKVQLD" XX SQ Sequence 4229 BP; 1495 A; 824 C; 968 G; 942 T; 0 other; tggtggcagc ggtgagatga tctagacatc tggagattaa gagtctagac ggctgataaa 60 taactttaaa gagatttttt tttctaatag cgagtatgga cggactaaaa cctccactgc 120 cattctgcat ggatgccgac aatcttgcaa aatcctggaa atttggaaag accagtttca 180 actctacatg gaactcacta tgtcggacgt agagggaaaa aaaacaaagt caagctgttg 240 cactatttaa tcggcgagag tggccgggaa ctgctggaaa ctttaatgat tgacgttgca 300 acggatgcaa gaatggtaac aaacatcatt gcaaggttag acgaacagtc taaatgagac 360 agtagaacgc tatcgattct tcacgaaaac tcagggagtt gatcagtcga ttgataatta 420 tgtaacggag ctaagaatgc tcgcgaggac atgcagtttc ggcgacataa aagactcatt 480 aatacgcgac agaattgttt gtggaattaa cgatgcaggg ctgagagaga gactgcttcg 540 tgagagagat ctgactttag aagcatgcgt gcggatttgt agagcagcgg agctatcgcg 600 gcaaaacagc aaagccatta caggggcggc aacagaagag gtgcacggag tacgacaagc 660 aacgcggaaa gagcagtatc aaaccggtaa cataattaca tgcaaattct gtggaaaaaa 720 acacgaaatg aacaaaagga agtgtccagc ttttggaaag aaatgtaaaa attgcggaaa 780 agaaaaccac tttgcagcca catgtagatc tcagttaaaa caatggagaa aaagaacggt 840 gcacaatgtc aacgaacaag acagtgatga gtatgaggat attctcactg ttagcgcaac 900 agactctgag actgtaaata aaatagagag caaaagagaa aagaaaactc agttgtttgc 960 aggaatgctc attggaaagg atgttgtgaa atttcagatt gactgtgggg caagctgcaa 1020 tgttattcct ataaacttac tgaatccgga cacaaagcta gaagccacca agactgggct 1080 agtaatgtac aaccaaagca aagtgaagcc gcttggaaaa tgtaaactaa agataagaaa 1140 tccaagaaac caaaagcttt acaggcttga atttcaggtg gtggatgtta attgtactgt 1200 gccattattg ggcaaaagag caagtgaagc catgaatttg ataaaagttc aatatgaaaa 1260 catcctagca atagacagca tagtgaccac agaacagccc tccaccagac catggtcaat 1320 acagcacata agaacagaat atgctgatgt atttacgggt gatggctgtc tagagggagc 1380 atataaaatt gagatagatg acacagtaaa accagtacag ctccccaaaa gaagagttcc 1440 ggtggccata atgaaaccct taaaagaaga actacaagat cttcagagga gagaaattat 1500 tgcaccagta gaaaggagta ctgattggat cagtagtatg gtggttgtac agaaggctaa 1560 tggcagacct agggtatgta tagatcctag acccctaaat aaagctttaa agcacagcca 1620 ttttccacta ccaaccatcg acgatatact cccagaccta tcaaaagcca gagtctttac 1680 agtttgtgat gttaaaagtg ggttttggca tgtcaagctg gaagaagagt ccagttatct 1740 taccacattc tcttctccct ttggaaggtt taggtggctc cgtatgccaa tgggcataag 1800 tccagctcct gaaatctttc agcgcaaatt gacacttgcg ttagaagggc taccagggat 1860 atatataata gcagatgatg tgcttattac aggacaggga gagacgcaag aagcagcaga 1920 gagagaccat gatgcaaaat taaaagcatt cttgacgcga tgcagaaaga aaggtatcaa 1980 attgaatgca gacaaattta aactgcgaca aagagaagtc tcctacatag gacacctgct 2040 gacagaggac ggactgaaaa tcgacccaga taaagttaaa gcaataacat acatgccaag 2100 accaatggat gtgaaaggta ttcaaagatt gttggggatg gttaattacc tctcaaagtt 2160 ttgtgcacat ctgtcggacc agtgtgaagt tttgagacat ctgacacaca aagacagtga 2220 atggaaatgg actataaaac aagaggaggc attcagcaaa ataaaggaaa ccattgctca 2280 tgcaccagta ctgaaatact acaacccgga tgaagaactc accgtccaat gtgatgcctc 2340 agacacgggc cttggtgctg cactaatgca gagaggtagg ccaatagcat atgccagtag 2400 agcgctaaca cacacagagc ggggttacgc acaaataaaa aaaggagttt ttggctctac 2460 ttttcagcat ggaaaaattt caccattata cttatgggcg gaaagtgaca gcacaaagtg 2520 atcacaaacc actagaaaat attcttcgta agccgctact gagtgctcca aaaagacttc 2580 aaaggatggc gctgagactt caaaagtatg acgtagatgt tgtgtacgtt ccaggtcgtg 2640 acatgctgtt agccgacacc ttaagcagag catacttgcc tgatagtgtt gctgctaact 2700 ctgtggaagc agaaatcgaa acaattaaca tgacacagca cttaacaatt tcagaggaca 2760 gactcacaaa gattcgttat gcaacaaaag atgacaaaac attacagtcc ctaatcaaca 2820 ccattcaaca agggtggcca aggaacaaga cagaaacacc acaggagatc agacactatt 2880 atctgtttca ggaggagctg agctaccagg acggcattgt attcagaggt gaacgagctg 2940 tcattcctgc taaactctgc agggagataa tacaacgtat acactcctct cacatcggtg 3000 tggagggatg ccttaggagg gccagagagt gtgtttactg gcagggaatg aacgagcaaa 3060 tcaaagcata tgtgcaaaga tgtgacactt gtcgttcagt tgactttaag cagcagaagg 3120 agacactgat tccacatgaa acaccaaata gaccctgggc caaggtaggc acagatttgt 3180 ttacatttga caagaaggac tatctaataa cagtagatta ttactccaat ttctgggaaa 3240 tagactatct gcaggacacc aaatcaaaca ccataatcaa gaagctgaaa gctcatttcg 3300 cccgacaggg cattccggat attgtatttt cagataacgg cccgcagttt tcatctcaag 3360 aatttcagaa tttcagtcgc cattgggaat ttcttcacaa gacctcatcc ccggggtacc 3420 ctcaaagtaa tggtaaagct gaagcggctg ttaaaatggc caagagactt atgctgaaag 3480 caaaagcatc aaggcaggac ccatacttag ctatcctgga ccacagaaac acaccatcac 3540 agggattatc aaccagtcca gctcagagat tacttagccg tcgaactaaa acattactac 3600 ccacaaaaat tagtctacta cgcccaaaag ttcaaaatgt cttgggggaa ataaaagcca 3660 accagcatcg tcaacaaact tactatgaca gatcagctaa ggacctaagc actctgaaac 3720 aaggggacat tgtgagagtc caaccatttg agccccatgc gatgtggaga aagggcacag 3780 tgctggaaca cgtcgaccca agatcctata aagtccaatt agattaggga ggtgtaataa 3840 ggcgaaacag gagacacctc aggaaggctt ctgagatgac accaaagaac aaaacaatga 3900 ctgctcattg tgagggtgct gaacatgcca gtgtagctgg cagtttacca acgctagcca 3960 acaatgcaga acaccagatg caaaatgcag acaaatctgc tgcatcacct gttgtcacaa 4020 aagcaggacg tgttatgatc aaaccaaaat acctaaagga ctatgttaca tgaggaaaaa 4080 aaaaaaagag ggaagaaaag atgaacaatt gacattcata agcataagtt acatgttagt 4140 gatatgttat ttcagtctgt ttaatttaaa gcacaaatta tgtgctaggg gggtttaatg 4200 tctttatttc gggaaaaaaa gagaaagga 4229 // ID Gypsy6-I_DR repbase; DNA; ZEB; 5410 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE Gypsy6-I_DR is an internal portion of the Gypsy6_DR LTR DE retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy6-I_DR; Gypsy6-LTR_DR; Gypsy6_DR; KW endogenous retrovirus; gag; pol. XX NM Gypsy6-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5410 RA Kapitonov V.V.; RT "Gypsy6_DR LTR retrotransposons from zebrafish."; RL Repbase Reports 4(10), 261-261 (2004). XX DR [1] (Consensus) XX CC Gypsy6-I_DR is an internal portion of the Gypsy6_DR LTR CC retrotransposon, whose LTR is deposited in Repbase as CC Gypsy6-LTR_DR. This retrotransposon belongs to the Gypsy CC superfamily. Some copies of Gypsy6_DR are flanked by identical CC LTRs. The internal portion encodes the 354-aa Gypsy6_DR1p gag CC (pos. 160-1221) and the Gypsy6_DR2p 1418-aa pol (pos. CC 1092-5345) polyproteins. The polyprotein is composed of the CC protease, reverse transcriptase and integrase domains. XX FH Key Location/Qualifiers FT CDS 1092..5345 FT /product="Gypsy6_DR2p" FT /note="Polyprotein" FT /translation="MSSARPFCEKLRWWADVGPCTISPNNFCFNKTRTVIL FT QPAVGKLVPTKLQGHNLVGKRVGSNDHMPCLMSTCPKLVVTIGGVQVPCLV FT DTGSMVSTITESCFMTNFGLWGQEQLRSCHWLQLRAANGLSIPYIGYLELD FT IELCGRVVSGCGVLVVKDPPGGMCAQTPGVLGMNVLSRCYQELFGQHGTGL FT FDLPAVSQAPSFIFQALQNCHQAGVQPVKDAEGQVRVRGRRACRIPGGTVK FT FVATTCSVQYSGNAVLFEPPISGLPAGLLASPALLKVDGGTVYVPIVNVGT FT MDVVLYARTIVGVVSKVDVVALPPEVAEVNMVAARVSAQSSPSVQEQIATL FT DLSKLPEVEQGKVRALLLKYLPVFSSYDGDLGCTNLISHDIPLLDEIPVRQ FT RFRRIPPSEYEVVKAHINQLLETQVIRESSSPYASPIVLVKKKDGGLRMCV FT DYRRLNAKTRKDAFPLPRIEETLDSLAGACWFSTMDLASGYNQVPVTEKDK FT PKTAFCTPFGLFEWNRMAFGLCNAPSTFQRLMERLFGDQQCQSLLLYLDDI FT IVFSSSIDEHLARMEVVLSRLQREGLKAKLSKCAFFQREVRYLGHVISSEG FT VSTDPGKVEAVANWPCPTSVTELRSFLGFASYYRRFVEGFSKLAAPLHRLV FT AQLANPKQRKGNAHDFAASWSTECQSSFEGLKIKLTSAPVLAYADFSLPFI FT LEIDASQGGLGAVLSQEQQGKVRPIAYGSRSLRPTERNPSNYSSMKLEFLA FT LKWSMTEKFREYLLGQKCIVFTDNNPLSYLNSAKLGVMEHRWAAHLSAFDF FT EIKYRSGRSNRNADTLSRQNFSSTGEVQGLCPGVAVPAVLQQVAPDGLVTQ FT VNQVTAFPCPSGSDMGALQEADTVIGEVLVFWRRKLLPTSEERKQLSRLAV FT ILLRQWGRLVEIEGVLYRRVSRPDGGEEVLQVLLLAVMKTEVLTQLHQQHG FT HQGVERTSQLVRQRCYWPGMFADIARWCQECERCQCAKGTPSAPSSFMGHL FT LASRPNEILALDFTLMEPSRSGLENVLVMTDIFTKYTLAIPTRDQRAETVA FT QVLVAEWFCKFGVPGRIHSDQGRNFESTLIQQLCGLYGVVKSRTTPYHPAG FT NGQCERFNRTLHDLLRSLPPSKKSDWPLCLPQVLFAYNTTPHQSTGESPYY FT LMFGQEPKLPIDFLLGRVKEPSAGSVHGWIVEQQDRLQVAFEGARERLGAA FT ADRRKARHDLQVREAPLKEGQLVYLRDHSVRGRCKIQDLWSSVVYQVLKAP FT TESGVVYTIAPVADLSKTKHVHRSLLKAQLGPNLLPSPPEPAMGDGMQPSD FT EECDGDLLVLVPETPELRGRSRSQGSVPPVPRVVDEELEGRATGETVQPEV FT VPSNPVRAAEVVVRRTVRSTAGQHSNLHHLPRAVGTGTVSSIATNPFGTFF FT RPWD" FT CDS 160..1221 FT /product="Gypsy6_DR1p" FT /note="Gag-protein" FT /translation="MEDELRELRELVTQLKADNERLRQEQVPAALPGPSNI FT SIPVVSDPPLIDAGPSSTDRFVFVPRDRKCPKFSGRSGIDINEWVEEAEAC FT MRLRHLSSADRAFFLFDHLEGEAREEIRYRPQSEREDPKRVIQVLRDLYGC FT TKSYVALQESFFSRRQQEGETLEFSLALMSLLEMVKGQSPHGMPNAEILLR FT DQFVENVNDCTLRRELKQFVRRQPTATLVDVRGEALRWEREGMPGGARGRS FT QSVPSVYGIQYGVQGNRSVSSGAKSEMTELRDMLLKQQQQLNQLVQSMSLL FT QSCSPSLPPPKVNPVICRRCRQPGHFARNCDGGRMSVRAPSAQITSALTRR FT EQSSSSQPSEN" XX SQ Sequence 5410 BP; 1239 A; 1187 C; 1482 G; 1502 T; 0 other; ttctggcgta gtcggcagga ttccccctca ttgtgtgcag agacgtgtgt ttgtttttgt 60 atattttcat ttttgtagga attgaattgc ggcagtctcc ctcccctatt ctttttggtg 120 agtcaacaaa cgttgacttt ttgcttgtac gttgtagcaa tggaagatga attgcgtgag 180 ttaagggagt tggtaactca attaaaagct gataatgagc gactacggca agagcaggtg 240 ccagctgcac tgccgggtcc atctaatatt tctattcctg ttgtttcaga tcctcccctc 300 attgatgctg gtccctcgtc aaccgaccga tttgttttcg ttcctcgaga ccgtaaatgt 360 ccaaaattca gtggccggtc cggaattgac atcaatgagt gggtggaaga agcagaggct 420 tgtatgcgtc ttcgccattt gtcttcagct gatcgggcat ttttcttgtt tgatcacctg 480 gagggagagg caagagagga aattcgttat aggccacaga gtgaaaggga ggatccaaag 540 cgggtgattc aggtattgcg cgatctatat ggctgtacta agtcttatgt agctcttcag 600 gagtcattct tttccagaag acagcaggaa ggggagactt tggagttttc cttagccctg 660 atgagccttc tggaaatggt taaaggtcag tcacctcatg gcatgcctaa tgcagaaatt 720 ttactgcgag atcagtttgt ggaaaacgtt aatgattgca cccttcgtcg tgaacttaag 780 cagtttgttc ggcgtcaacc tactgccaca ttggttgacg tacgtggtga agcacttcgg 840 tgggaaagag agggcatgcc tgggggagcg cggggccgaa gtcagtctgt tccatcagtt 900 tatggtattc agtatggggt gcaggggaat cgaagtgtta gtagtggggc aaagtctgaa 960 atgactgaat tgcgggacat gttgctgaag cagcagcaac aattaaatca actagttcaa 1020 agtatgtctc tgcttcagag ttgttcaccc agtttgccgc cacctaaagt taaccctgtt 1080 atttgcagaa gatgtcgtca gccaggccat tttgcgagaa attgcgatgg tgggcggatg 1140 tcggtccgtg caccatcagc ccaaataact tctgctttaa caagacgcga acagtcatcc 1200 tccagccagc cgtcggaaaa ctagttccca ccaagttaca gggccataac ttggttggga 1260 aaagggtagg ctctaatgat catatgccgt gtttaatgtc tacatgtccg aagctcgttg 1320 taactatagg tggggttcag gtcccttgtt tggttgacac cggttccatg gtgtccacca 1380 ttactgagag ttgtttcatg actaattttg ggctgtgggg tcaagaacag cttagatcat 1440 gtcattggtt acagcttaga gctgcaaatg gtctttcaat tccttatatt ggttatttgg 1500 aattagatat agagctttgt gggcgagtag tttcaggctg tggcgtgctg gttgttaagg 1560 atcctcctgg gggcatgtgt gcacaaacac ctggtgtatt gggtatgaat gtgttgagcc 1620 gctgctacca ggagctattc ggccagcatg gtacaggcct ttttgattta ccggcagtat 1680 cacaggcccc tagttttatc tttcaggcct tacaaaattg tcatcaggct ggagttcagc 1740 cagttaaaga tgcagaagga caagtcagag tgcgtggacg tcgggcgtgt cgcatcccag 1800 gtggcactgt aaaatttgtt gctacaactt gttcggtgca gtattctggt aatgctgtac 1860 tgtttgaacc tccaatttct ggtctccctg caggtttgct ggcctctcct gcgctcctaa 1920 aggtggatgg tggtacggtc tatgtgccca tagtcaacgt gggcactatg gatgtggtat 1980 tgtacgccag aactattgtg ggcgttgtga gtaaggttga tgtagttgcg ttacccccag 2040 aggtcgcaga ggtaaacatg gtggccgcta gggtaagtgc acagtcttct ccttctgtgc 2100 aggagcaaat agccacttta gacctgtcaa aactgcctga ggtagagcag ggtaaagtta 2160 gggcattgct gttgaagtat ttgcctgtgt tttccagtta cgatggtgat ttgggttgta 2220 caaacctgat atctcacgat ataccattgt tagatgagat ccctgtcagg cagcggttca 2280 ggcgcatccc tccgtctgag tatgaggtgg taaaggcaca tatcaaccaa ctgctagaga 2340 cccaggtgat tagagaaagt tccagtcctt atgcttcgcc cattgtcctg gttaagaaaa 2400 aagacggtgg tctgcgcatg tgcgtagact accgtcgttt gaatgcgaaa accagaaagg 2460 atgcattccc tctaccacgt attgaggaaa ctttggactc gctggctggg gcctgttggt 2520 tttccaccat ggacctagcc agtgggtata accaggtgcc tgtaactgag aaggacaagc 2580 ctaagactgc cttctgtacc ccttttggcc tttttgagtg gaataggatg gcgtttggac 2640 tgtgtaatgc cccaagcacc ttccaacgat tgatggaacg gttgtttggg gatcaacagt 2700 gccaatccct cctcctgtat ttggatgata ttattgtctt ttcctcctct atagatgaac 2760 atctggcacg gatggaggtt gtcctgagcc gtctgcagag ggaagggttg aaggccaagt 2820 tatccaagtg tgctttcttt caaagggaag tgcgttattt gggtcacgtc atttcgtcag 2880 agggggtctc taccgatcca ggtaaagtgg aggcagtggc caactggcct tgcccgacca 2940 gcgttaccga gttgcgctca tttttggggt ttgctagcta ctaccgtcgt tttgtggagg 3000 ggttttccaa attggctgcc cctctccata ggctggtggc tcagcttgca aacccaaaac 3060 agcgaaaggg caatgcccat gactttgcgg cttcttggtc cacagaatgt caaagtagct 3120 ttgaggggtt gaaaattaag ctaactagtg ctccagtgtt ggcctatgct gatttttctc 3180 tgccttttat tttagagatc gatgccagtc agggaggctt gggggcagtc ctctcacagg 3240 aacaacaagg caaggtgcga ccaatagcat atggcagccg cagtcttagg cccaccgagc 3300 gtaatccatc taattatagt tcaatgaaat tggagttttt ggcactcaag tggtccatga 3360 ctgaaaagtt cagggagtat ttgttaggcc aaaaatgtat tgtctttact gataacaacc 3420 cccttagcta cctgaattct gccaagttag gtgtaatgga gcatcgttgg gctgctcact 3480 tgtccgcatt tgactttgaa ataaagtata gatcgggtag gagcaatcgt aacgcagaca 3540 ctttgtcccg gcagaacttt tccagtacag gagaagttca aggcctgtgt ccaggggtgg 3600 ctgttccagc tgtgttgcag caagtggccc cagatggatt ggtgacccag gtcaatcaag 3660 ttactgcctt tccctgcccc tctggcagtg acatgggtgc tctgcaggaa gcagacacag 3720 tcattggtga ggtgttggtg ttttggaggc gtaagttgct ccctacctcg gaggagcgta 3780 agcagctctc tcgcttggca gtcatcttgc ttcgccaatg gggccgcctt gtggaaattg 3840 aaggggtgct ctatcggcgt gtgtcacggc cggatggcgg agaggaagtt ctccaggtgt 3900 tactactagc cgtcatgaag actgaggtct tgacccagct gcatcagcag catggacatc 3960 agggggttga gcgtacttcc cagctggttc gtcagagatg ttactggccg ggtatgtttg 4020 ctgacattgc acgctggtgc caggagtgtg agcgctgcca gtgtgctaaa ggcaccccct 4080 ctgctcccag tagttttatg ggacatcttc tggcttctcg gcctaacgag attttagctc 4140 tcgatttcac tttgatggag ccttcaagat ctggcctaga aaatgtgttg gtcatgactg 4200 acatatttac aaagtacacc ttagctatac ccactagaga ccaacgggcc gagactgtag 4260 cccaggtcct tgtggccgaa tggttttgta aatttggggt gccaggtcgc atccactccg 4320 accaaggtcg taattttgag tccactttga ttcagcagct gtgtgggttg tatggagttg 4380 taaagtcccg tactactcca taccaccctg ctggaaatgg tcaatgtgag cgtttcaata 4440 gaacgttgca tgatctgctg cgttctcttc caccttctaa aaagagtgat tggccacttt 4500 gtctccctca ggttctcttt gcgtacaaca ctactcctca ccagtcaact ggagaatccc 4560 cgtattattt gatgtttgga caggaaccta aacttcccat tgatttcctc ttgggtagag 4620 ttaaagagcc gtcagccggc agtgttcatg ggtggattgt ggaacagcaa gatcggttac 4680 aagtcgcttt tgaaggtgct cgtgagcgat tgggtgccgc agccgaccgg cggaaggccc 4740 ggcatgatct tcaggtaagg gaagcaccac tgaaggaggg tcaactagtt taccttcgcg 4800 atcatagtgt gcgaggtaga tgtaaaatcc aagacctgtg gagctcagtg gtctaccaag 4860 tcctaaaagc acctactgaa agtggagtgg tctatactat tgctcctgtt gcagacctca 4920 gtaaaaccaa acatgtacac agatccttgt tgaaggccca gcttggtccg aatctgcttc 4980 ctagtccacc tgaacctgca atgggtgatg gcatgcagcc ttcagatgag gagtgtgatg 5040 gagacttgct ggttctagtt cctgagacac cagagcttag aggaaggtca aggtcacagg 5100 gttctgtccc ccctgttcct cgggtggtag acgaggaact ggagggtcga gcaactgggg 5160 agactgtgca gcctgaagtg gtgccctcta accctgtaag agcagcagaa gttgtggtaa 5220 ggagaactgt gagaagtacg gcgggtcagc attctaacct gcaccacctc ccacgagctg 5280 taggtacagg gacagtgtcc agtatagcga ctaacccttt tggtacattc tttcgccctt 5340 gggattaggc tcttcgatta atgatgttta gttcaccgtc ggggcgacgt tgcagaaatt 5400 gggggtggaa 5410 // ID Copia-9-LTR_DR repbase; DNA; ZEB; 946 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Copia-9-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-946 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 693-693 (2009). XX DR [1] (Consensus) XX CC Solo LTR from Copia retrotransposon. XX SQ Sequence 946 BP; 140 A; 203 C; 164 G; 439 T; 0 other; tgtcatggtc ctgacctgac agtcttgtct gtcctgtcag tgtttaatgt ttgcttgttt 60 gtcctgtgct tatttacatt gctaagtctt aatttgtgtt ggccatgtgc tcctcttgtc 120 tttcctcctt gctttcagtt accccgccct cttatttaga ccttgttatc ctgattgttt 180 tcacctgtct ctcgtgtgct ttgttctata taatgtgctg tgtgttctca tttctttgct 240 ggttcgttcc atgtgagctt gtcttgtatc cttgccttgc atctgtacta agcttttggt 300 cttgcctgtt ctagttcgct tgtctattta ttgtttgttt gtagctatta ctttctttcg 360 tttgtttcta gtgtttactt aattttgtgt tctaatcttt attttacttt tacccttaga 420 gccattgttc catttttctg tcagtcccag tcttcttagt agataattat tagtagttgt 480 ttgatttgtt taattactct ctttaagatt ctagaagtct ttatttcagt tttgttttat 540 cagattgctc aagtctgtct ttctgtcttc tagtatttag attttatttt atttcagttt 600 tagtttattt tattgtatat tattactgcc tgtttgttgt cctgtcttgt gtcttataat 660 tttccagtcc tattccgtgt cctgacctcc ctggtctgac agtctggagt ttacgtggcc 720 tgctccagga tctagatctg gttttgacca ctgttgctcc tactcctggc tggctgttca 780 ttttagtacc atcttgtcac taccactggc ttccttgtgt tgttgcccct accgtttgga 840 ccttatcatc cccctgtgtg tctacttgtg ccttattctg tcaataaacc cctttgtatt 900 gttcattgct gcaactgggt ctgttttctt gcaatcaacc gtgaca 946 // ID piggyBac-N3_DR repbase; DNA; ZEB; 438 BP. XX AC . XX DT 28-OCT-2008 (Rel. 13.1, Created) DT 28-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE piggyBac-N3_DR is a putaitve piggyBac nonautonomous DNA DE transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW TSD TTAA; piggyBac-N3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-438 RA Bao W. and Jurka J.; RT "piggyBac-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1618-1618 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 438 BP; 138 A; 71 C; 71 G; 157 T; 1 other; cctcttaagg cccaaggtgt ttttttacat gcattttttt atttctcttt gctatttggg 60 cttattggaa cctaattaga ataaaaacct aagtatcatc ttttgatatg atgtactttt 120 agagaaaaat gatgtccata tatgtggact cgtggtccga atttacataa aacacttttt 180 cctgactgtt ttttaatgca tatataacat akaaattttt ttgaacatgt tttgactatc 240 agagagtaaa aacaacattt tttttccccc attttggaca gttaaaaata gtgtttggga 300 catttcatat gctgcaaaac agttgcagga tgacactgta tgtctgtaac aaaatataaa 360 acattcaaag tattttaaat agccacttaa gagctaaaat gtgctgtcca cgtatgtggc 420 cactcaagcc ctaggagg 438 // ID hAT-N30_DR repbase; DNA; ZEB; 982 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N30_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-982 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1300-1300 (2008). XX DR [1] (Consensus) XX CC TSD is 8-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 982 BP; 278 A; 204 C; 227 G; 273 T; 0 other; taagggtgtc acgatcctcc aaatcctcga ttcgattaca ttttcgattc taaaggcacg 60 attcgattcg attttcgatt atgaataatt aattaattaa tgaccaatta attatttgta 120 gcctaccgtt taaactacct gacctgcatg gtctttgttt tacccataaa caaatcatac 180 agtaaatgaa taaaggcaag atacacacat aattaccacc tgtcaatcac tttttctgcg 240 ggactcgtga ataggcagtg atctgtgtcg ttataatggc gtcggtaaaa aagacgcaca 300 accaacagga accagccaac agtatctgag gtgttcgcta aaatgactaa gtacaagtga 360 gtgaaagatt gaagcagtcc tctgactgcc tggacctgct gtctcgagcg catggctgtg 420 tgtgcgtctg tgtctgtgtg gtcacgtgat gtgcattttc agaggtagag aggaaggagg 480 gctgctcaga aatgctacac gccactgtgg atgtcaaatc gttgtcgttc taaaatgcca 540 tttaaaaaca aagacagtgt aaacagggcc tgagtgtgta cttttcgcga gcggatttgc 600 aacgggggcg ggcggaggat cgcgatgccg gtgttgtcta tcggacgaac cgtacgtaat 660 acgtacatag cagagcttgc aaaactgtgt ttttttttgt cgacaacacg aacgttgtca 720 acataactca ctggaaatcc aaaatgctta cacaccggcg acttcattga aagaggagag 780 ggtttaagtt ctgtcgacgg gtctcctgct tctgcagttt aacaagcact tcaacaagcc 840 tgtttttttc ccgcttggca agccaagctg acgtgacatg ggggcgtggc agcatcgacg 900 attctatttt ttgattcgat aatcgaaatt gagcataaat ttcgatcgat ttcgattaaa 960 aatcgaaatc gtgacacccc ta 982 // ID ERV2-LTR_DR repbase; DNA; ZEB; 586 BP. XX AC AL591210; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE ERV2-LTR_DR is a long terminal repeat of the ERV2_DR endogenous DE retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV2-I_DR; KW ERV2-LTR_DR; LTR retrotransposon; class I ERV; gag. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-586 RA Kapitonov V.V. and Jurka J.; RT "The ERV2_DR endogenous retrovirus from zebrafish."; RL Repbase Reports 2(5), 14-14 (2002). XX DR Genbank; AL591210; Positions 32437 31852. XX CC ERV2-LTR_DR is a long terminal repeat of the ERV2_DR endogenous CC retrovirus. An internal portion of the retrovirus is deposited as CC ERV2-I_DR. XX SQ Sequence 586 BP; 167 A; 118 C; 100 G; 201 T; 0 other; tgtggaaata taaatttcta tcaactaatt ccaaatgaac aataatctac gtataactga 60 aaagttatat tctttggttt ttaattatat aatttcatta acattccagc taggagggac 120 attaaccttt gtctttctga ttacaagcta agccaaagta atgcaaattg tcagtttcac 180 agcatggggt gtctattgtt ttctcttttc gatcaaatgg tcaggcggtt tctgtctcca 240 gaacatgatg agatcaatgc ctgaaagagc agtttgccct gccgaccaca actcttaatt 300 tgcatacttt gtttagctat atcccctcct cctaagaaca caatatagcc atgaaatctt 360 tgtcttaatt cagactaatt cagacttgtg tgagacttgt gagagacttg tgaaagaact 420 tgcagactgc ggacctctag taggctcttg cagacacgga catcttaaag acgctgtatg 480 actgcagatt aaccaacacg tctcaataaa ggaattctgc ttgtttcgag tctcctggac 540 tgggttctct cttctttttc actcatttat tttttttttt acaaca 586 // ID DNA-6-N6_DR repbase; DNA; ZEB; 2309 BP. XX AC . XX DT 25-NOV-2008 (Rel. 13.11, Created) DT 25-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2309 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1718-1718 (2008). XX DR [1] (Consensus) XX CC The TSD is mostly 6-bp long, but 8-bp TSD is also observed. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2309 BP; 753 A; 433 C; 409 G; 712 T; 2 other; cagggctgat agtggaaaaa attcctgagc ctgaactttt tttccttgcc cctgaggtga 60 gtaggggtgg gggtactatg tatgcaacgc acttttaaca cataacttgt gggcccctgg 120 gcccaaatat attaacagcc tatgtgtaac cctgatcatc attattgtca agtggctctt 180 tttagacaca tgccagtaat gtcagtgaca catgccagtg tcagtgatct tgtgacatgt 240 gccagtaggt gtcagtataa gcacgttata agcacattaa tcttataagt ctctttgctt 300 taatatttaa aaatcattag gcaaatgaca cctgttatct tacatgcata tatgttatga 360 gttttcaaca tgcattttat tataactttt taaaggatat tatttaaaat accttaaaat 420 gaaaataagt taaataataa aataaatgaa tgaattaaaa acatagagaa gtccatattg 480 tagggaacaa aggaactgcc aggatgcaat aatatacagt acaacagata agtgtaaccc 540 ctcccataca taagcatgcc acaataaatt tgactgacat gtactgtcaa atgactatat 600 tacgtacgga agtatgacta cgaatacata aaataaatgc gtttaaatta aaacatcgtg 660 gagtagctca gcaaataaac taacttgcga cgcgataaat taggttaaaa gtcgcgtaat 720 gattaaatat ggttttgctc gtgttaatta atgtcataag cttcgccggg tgtcgatgtc 780 actgcagtga catgcagcgc gacactctga gtgacacagg tcactacaca tgccactagc 840 gagtgacaca tgacagtgga aaaccggacg tgccaccgga agtgtcacta cactcagtgg 900 catgtgccag tgtcatctgt acgtcagatg tcgtgtcact ccatgttaaa actgctactg 960 tgaatccgcg ttcaagcgca cacaataagc taaaactcgc cccaagcacc ggcaaaaaca 1020 aataacaatg aatgtgtcga ggaaagagta aggacatatc ttaattattt attaataatt 1080 ttattttatt wttttattat ttattatttt aatttattat ttacttattc tgagtcagtg 1140 tcgcaaagat cctgactggc aatctcctct tggactcgcc tttagaraga aattcatctt 1200 tacggattac atgaaggagt ttttttgttt tcgatttgtt ttatttcgtt aagtaataat 1260 ttaaagcttt ctataaatat atttattatg tctgtgaggc atatacattt tgcttcattt 1320 tgttaagcgc tcctgttcaa aaaccagacg acagaaagcg cataattttg gttttcttta 1380 ttttataaaa acgctgtaac gttttttgat gtattactcg tactttgtga gcaaaaatga 1440 cggataactt taaattgctc ccactgaaaa aatgcaagtg attgctctgg cgcctcgatg 1500 tcctgcaagc tttttataag caaactatgc gcctaatagc acacattctt taacgtttaa 1560 gctaccattg ttttatactt gaactgtttt atatctttta ttttaggcaa gtcgtgatga 1620 tctgagaagg caaactgatc aaacagacaa tgatggtctt ccgctgggcg ctgttcgtta 1680 aaaaaaaacc ttatttttaa cgaataaaaa atgctccaaa cgtttttcta aataagctga 1740 ataaaaaaac gaaactaaat ataaacactt tgccattagg ctacatacaa aactgaacta 1800 tttcatagct gaaacactaa gttgtttaag gagaaggggg aaatatattt ttgtcccgtc 1860 tattgcttca ccgttatttc gagaataaac ccagcgtaaa tatgcagatg tcattcccaa 1920 aacatatcag cgtatatgtg ccgaaaacga aagtttcata caaattctga atggattata 1980 gcctggaaag tgcaaagcac gctcctctta ttatcactaa aaagcgtcac taatcttagt 2040 tcatagagat ctcacaaaga tccgttataa tttataaagc tctctaaatt acacaattaa 2100 tcttttattt acatggcgtc tacactcaaa agatgattca cgagcacaca aaatggcact 2160 taataaaaac caatccggcg ctttcagcag tgagtgaatt ctgtgaatct agagaaatga 2220 cagatcaata tccgtgaacc tgattttttt tcccccgcag ccacagtacg ccggaaatcc 2280 ggcgtggtgc cggaacgcta tcacccctg 2309 // ID Gypsy-116-LTR_DR repbase; DNA; ZEB; 738 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-116-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-738 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1788-1788 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 738 BP; 128 A; 118 C; 176 G; 315 T; 1 other; tgttacgagt agttgtagtg tagtgtaggc ttgtgtgtgt gtgtgtgtgg tgtttctctc 60 tctctctctc tttctctctc tctctttctc actctaagta ttcgcaggtg cagcggatta 120 cgttgggtcg gatcatgatt ggcgcttgga ttggcggcgg ctgtatataa acctgcagag 180 agcatggatt cgatggctcc ccgctctctc tcgatttccc cgcttgcgct cagtggagat 240 tattgyttaa gtgatcaaat gagtgtttgt tgcttatatc tcttttgtgg cgttttgtga 300 gatgccgatg tgttaatgtt gttaatttta accggcgata gaagcccggt tttctgtctc 360 cctgtctggc ttggagcatg gtagggaggt ttggtgcctt ttatttgtta ttttcgtttt 420 ctcctgtgtt tgttagtagg gaggtaagtt ttctgtattt tattttatta tcttgttttc 480 tagggaagtt taatctaatt tagttagctt ggtttattgt catttatttt ggctgaaccc 540 ctcccgatgt tatttttgct ccttatttct gtaaataaat catattttct tgttaattct 600 cagtggttgt gtttttggtt tatattttgg ggagagagag cgcgggggca ccagggatta 660 tatttttaat ccctgttatt ttgttacgtt ttggttaaac ctctttttga ttaaggattt 720 ttaatttaat tcgtaaca 738 // ID TZF28 repbase; DNA; ZEB; 1613 BP. XX AC U51227; XX DT 10-APR-1997 (Rel. 2.03, Created) DT 27-JUL-2000 (Rel. 5.06, Last updated, Version 2) XX DE Danio rerio transposon Tzf.28. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW (clone Tzf.28) zebrafish; KW Danio rerio (zebrafish) Tc1-like element; TZF28; KW Tc1-like transposon; Tdr2; Tzf.28; subfamily Tdr2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1613 RA Lam L.W., Lee S.T. and Gilbert W.; RT "TZF28."; RL Direct Submission to Genbank (13-MAR-1996)Wan L. Lam, Molecular RL and Cellular Biology, Harvard University, 16 Divinity Avenue, RL Cambridge, MA 02138, USA. XX RN [2] RA Lam L.W., Seo P., Robison K., Virk S. and Gilbert W.; RT "Discovery of amphibian Tc1-like transposon families."; RL J. Mol. Biol 257(2), 359-366 (1996). XX RN [3] RA Lam L.W., Lee S.T. and Gilbert W.; RT "Active transposition in zebrafish."; RL Proc. Natl. Acad. Sci. U.S.A 93(20), 10870-10875 (1996). XX RN [4] RA Ivics Z., Izsvak Z. and Minter A.; RT "Identification of functional domains and evolution of Tc1-like RT transposable elements."; RL Proc. Natl. Acad. Sci. U.S.A 93(10), 5008-5013 (1996). XX RN [5] RA Izsvak Z., Ivics Z. and Hackett B.P.; RT "Repetitive elements and their genetic applications in RT zebrafish."; RL Biochem Cell Biol 75(5), 507-523 (1997). XX RN [6] RA Ivics Z., Izsvak Z. and Hackett B.P.; RT "Genetic applications of transposons and other repetitive RT elements in zebrafish."; RL Methods Cell Biol 60, 99-131 (1999). XX DR GenBank; U51227; Positions 1 1613. XX SQ Sequence 1613 BP; 531 A; 305 C; 337 G; 440 T; 0 other; tacagtgtat ccgcaaagta ttcatagcgc ttcacttttt ccacattttt tatgttacag 60 ccttattcca attccaaaat ggattaaatt aatttatttc atcaacattc tacccacaat 120 accccataat gacaatgtga aaaaaatttt tttttttaat tattgcataa aaaaaaaaaa 180 gctgaaaaat cacatgtaca taagtattca cagcttttgc agtgaagcta aattgagctc 240 aggtacgttc tgtttcaagt gttcattctt gaaatgtttc agacaggtta attggaattt 300 cacctgtggt aaattaagtt gatttggaca tgatttgaaa aggcatacac ctgtctatat 360 aaggtcccag ggttgacagt gcatgtcaaa gcacaaacca aacatgaaga caaaggaatt 420 gtctgcagac ctccgataca ggattgtcgt caaggcacaa ggctggggaa ggttacagaa 480 aaaatttctg ctgctctgaa agttccaatg agcacagcga cctccatcat ccatgtggaa 540 gatgtttgga accaccagga ctcttactag agctggccag ccatctaagc taagtgatca 600 ggagagaagg gccttagtta gggaggtgat caataactca atggtcactc tgtctagctc 660 cagccatctt ctatggagag aggagaacct tacagaagga caaccatctg tgcagcaatc 720 caccaatcag gcctgtatgg tagattagcc agtgttaacc actcctcgtc tggaatttgc 780 aaaaaggcat ctgaaggatt ctcacaccat aagaaacaaa attctctggt ctgatgagac 840 taaaattgaa ctctttagag tgaatgccag gcgttacttt tggagaattc aggcaccgct 900 catcaccagg ctaacaccat cactacagtg aagcatggtg gtggcatgca tcaagctgtg 960 gggatgtttt ttcagcagca tgaaatggaa gactagtcag aatagaggga agatgaatga 1020 agcaatgtac agagacatcc tgaagtgaaa accttcttca gagtaatctt gatttcagac 1080 tggggtgacg gtttatcttc cagcaggaca atgaccgaaa gcacactgcc aaaatatcag 1140 tggagtggct tcacaacaac tcgatgaata ccttaagtga accagccaga gcccagacct 1200 aaatcctttt gaatatctct gaagagatgt aaaatggctg tacaccgtcg cttcccatcc 1260 aacctgatag agtttgagag gtactgcaaa gaggaatggg cacaaattct caatgacagg 1320 tgagccaagc tgtggcatga tattcaaaaa gagttgaggc tgtagttgct gccaaagatg 1380 catcaacaaa gtattgagca aagactgtac atttttatat gaacgtgatt ttttgaccct 1440 tttaatttta ataaatttga aacaatttca aaaaatgttt tttcacattg gcattatggg 1500 caactgtgtg tagaatgttg aggaaataaa tgaaattaat ccgtttttga ataaggtaaa 1560 aaggtggaaa aaaatgaggt agaattatga tatcactttc ccgatgcact gca 1613 // ID CR1-31_DR repbase; DNA; ZEB; 3533 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-31_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3533 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2128-2128 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 4..3063 FT /product="CR1-31_DR_1p" FT /translation="MTKMAARAHAAASLSPDKTVFLCSSSCTSQCFCASTS FT RLSVRNRAYSRTFLLDIGRNKLYKLNSADTEELQDLGLLRMPTHPPTPTSP FT PRPQWRRFKRRERKQKRGKRGGIRTRLAANPHKPAIPSIILANVRSLDNKL FT DYIRLLRSSQRTVKDCCVFVFTETWLNDNVPDSAIQLDQLTCYRADRAIVA FT GGKTRGGGLCVYINESWCRDAVVVRKHCSPLVEFMIIKCRPFYLPREYTAI FT LLFAVYIPPSSNANNRSEALNDLYLHISEQQTGHPDAFLILAGDFNHADLK FT SVFPKIQQHIDFPTRGNNTLDFVYTTQRGAYKAFPLPHLGASDHLTVMLMP FT AYRPIVKVTKPVCKEIQVWPEGSSEALQDCFNTTDWDMFKQAATYNNTTDL FT QEYAETVTAYINKCTEDVTVTKTITVRPNQKPWLTGEVFNLLKARNTAFRA FT GDEASLKSARANLSRGIREAKRQYSRGIAHRFNDSRDTRSLWRGIQSITDY FT KPQPQTCDSTIPLLNELNKFFARFEVQNSTIAQKMPPPPGDQVMTLSPDSV FT RRSLSRINARKAPGPDNIPGRVLRDCASELTNVFTDIFNISLTQAVVPTCF FT KATTIIPVPKNSSPSCFNDYRPIALTPIIMKCFERLVMQHIKSVLPPSLDP FT FQFAYRPNRSADDAISTALHTALTHLDKKDSCVRMLFIDFSSAFNTIIPQQ FT LSQKLLQLGLNTSLCNWLLDFLTGRPQAVRAGSNTSSTITLNTGAPQGCVL FT SPILFTLLTHDCAPSHNSNLFIKFADDTTVVGLINKNDEENYRSEVSRLAE FT WCRDNNLFLNVEKTKEIVVDFRKEHTLHAPLTINGATVERVSSTKFLGVHI FT TEDLSWTKNSTALAKKSQQRLYFLRKLKRARAPAPIMYTFYRGTIESILTS FT CITVWFGTCNASCRKTLQRIVRTAEKIIGVSLPSLQDIYSTRLTRKALCIT FT ADATHPMHSFFSLLPSGRRLRSLQARTNRLKDSFIHQAVRKLNSLPALLPA FT PSPRTSSPRTSGL*" XX SQ Sequence 3533 BP; 945 A; 1017 C; 704 G; 867 T; 0 other; tatatgacca agatggcggc gcgtgcacac gctgcggctt ctctctctcc cgacaaaacg 60 gtgtttttgt gttcttcgtc ttgtacgtcg cagtgttttt gtgcgtcaac gtcacgttta 120 tctgtccgta accgcgcata tagtcgaaca tttctgctag acatcggtag aaacaaactt 180 tacaagctaa actccgcgga cactgaggag ctgcaagatc tcggcttgct ccggatgcct 240 acccatcctc cgacccccac ctcaccacct cgcccacaat ggaggcgctt caagcggcgc 300 gagaggaagc agaagagggg taagcgcggg ggtatccgaa ctaggctagc ggctaacccc 360 cacaaacccg ctatcccctc catcatccta gcgaacgtac gctcgctaga caacaaactg 420 gactacatcc gtctacttcg ttcatcacag aggactgtaa aggactgttg tgtttttgtg 480 tttacggaaa cttggctcaa cgacaacgtt ccggacagcg ccattcagct cgaccagcta 540 acatgctaca gagcagacag agctatcgtc gcgggaggta agacccgcgg tggcgggctc 600 tgtgtttaca tcaatgagtc atggtgccgc gatgccgttg tagtccgcaa acactgctca 660 ccactggtgg agttcatgat tattaagtgc cgtccatttt atctaccgag ggaatacaca 720 gccattctgc ttttcgctgt ttacatccct cccagctcca atgcaaacaa caggagtgag 780 gcactaaatg acctgtacct gcacattagt gagcagcaga caggccaccc cgatgctttt 840 ctcatcctgg caggggactt taaccatgca gaccttaaga gtgtgtttcc aaaaatacaa 900 caacacatag actttccaac acggggcaat aacacactgg actttgttta caccacacag 960 agaggagctt acaaagcctt ccccctcccc caccttggtg cctcagacca cttaactgtc 1020 atgctaatgc ctgcttacag accgatcgtt aaagtcacca aaccggtttg caaggagata 1080 caagtgtggc cagaaggttc ttcagaggct ttacaagact gcttcaatac cacagactgg 1140 gatatgttta aacaggctgc cacttacaac aacaccactg acctccagga gtatgctgag 1200 actgttacag cctacataaa caaatgcact gaggatgtaa cagtcaccaa aaccatcaca 1260 gtccgtccca atcagaagcc atggctaaca ggagaggtct tcaacctcct gaaggctagg 1320 aacactgcct ttagagctgg agatgaggcg agcctaaagt cagctagagc caacctttcc 1380 cgtggcatca gagaagctaa gagacagtac tccagaggta tagcccatcg ctttaatgac 1440 agcagggaca cacggagtct gtggagagga atacagtcca tcacagacta caaaccccaa 1500 ccacagacat gtgacagcac catccccctg ctgaacgagc taaacaaatt cttcgctcgt 1560 tttgaggtac aaaacagcac cattgcacag aagatgccac ctccccctgg tgaccaggtg 1620 atgactctgt ctccggacag tgtgaggagg tctctcagca ggattaacgc acgcaaagct 1680 ccgggccctg acaacatccc aggtcgtgta ctgagagact gtgcatcaga actcactaat 1740 gtcttcacag acatttttaa catctctcta acccaggctg ttgtccccac ctgttttaaa 1800 gccactacca tcattccggt cccaaaaaac tcatctccct cctgctttaa tgactaccgt 1860 ccaattgcac tgactcctat catcatgaag tgctttgaac ggcttgttat gcagcacatc 1920 aagtctgtcc tccccccctc cctggacccg tttcaatttg cttatcggcc caaccgctcg 1980 gctgatgatg ccatctccac tgcacttcac acagcactca cacatctgga caaaaaagac 2040 tcctgcgtcc gtatgctgtt catagacttc agttcagcat tcaacactat catcccccaa 2100 cagctctctc aaaaactgct ccagctgggg ctgaacacct ctctgtgtaa ctggctgctg 2160 gattttctca ctggaagacc acaagcagta cgggccggca gtaacacatc cagcaccatc 2220 actttaaaca ctggggcccc tcagggatgt gttctgagcc ctatcctctt caccctgctc 2280 actcatgact gtgcaccatc acacaactcc aatctgttca tcaagtttgc ggatgacaca 2340 actgtggtgg gtctcatcaa caaaaacgat gaggaaaact acagaagtga ggtgagccgt 2400 ctggccgagt ggtgcagaga caacaatctc tttttgaatg tggagaagac gaaagagatt 2460 gttgttgact tcaggaaaga gcacactctc catgctcccc tgaccatcaa cggtgcgact 2520 gtggagcgtg tgagcagcac caagttcctg ggtgtacaca tcactgagga tctctcctgg 2580 accaaaaact ccactgcact ggccaaaaaa tcacagcagc gtctctactt cctccgcaaa 2640 ctcaaaagag caagagcacc agccccaatc atgtacacct tctacagagg cactattgag 2700 agcatcctga ccagctgcat cactgtgtgg tttggaacct gcaatgcatc ctgcaggaaa 2760 acacttcaac gcatagtgag aacagctgag aagatcattg gtgtctctct cccctccctt 2820 caggacattt acagcacacg tcttacccgt aaagccctct gcataacagc agatgccacc 2880 cacccaatgc acagcttctt cagtctgctg ccatcaggga ggagactgcg cagtctccag 2940 gccaggacca acagactgaa agacagcttc atccatcagg ctgtcaggaa gctgaactct 3000 ctcccagctc tgctccccgc tccctcccca cgcacctcct cccctcgcac ctctggactc 3060 tgacacacac acacacacac acacacacac acacacacac acacacacac acacactcat 3120 ctgcactggt cactttatgc agacacatat actacctccc aatatgtcac tgtcacttta 3180 tcacaattat catgtttaca agttcttttt gcactatact gttttttatc tgcacaactc 3240 tggtttactt tgcactcatt gctatatgcc cttaatgtca ctgtattgtt tacaattttt 3300 tttttatatt acatatgtat atttcttaga ccacatttat gtgcaataat gtatatactg 3360 taaattttct aaattttctt tttttcttaa actctactat tttttatata tatattttat 3420 attaatgtta agcaccttgg gtctgagagt aacgcaattt cgattctctg tatgtcttgt 3480 acatgtggct gaattgacaa taaagctgac tttgactttg actttgactt gac 3533 // ID I-5_DR repbase; DNA; ZEB; 4772 BP. XX AC . XX DT 08-JUL-2009 (Rel. 14.07, Created) DT 08-JUL-2009 (Rel. 14.07, Last updated, Version 1) XX DE I-5_DR is a non-LTR retrotransposon from the I clade - consensus. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; I; KW reverse transcriptase; endonuclease; RNaseH; I clade; I-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4772 RA Kapitonov V.V. and Jurka J.; RT "I-5_DR, a family of I non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(7), 1396-1396 (2009). XX DR [1] (Consensus) XX CC This is a young family of Nimb non-LTR retrotransposons. The CC consensus is incomplete at its 5' terminus. XX FH Key Location/Qualifiers FT CDS 864..4604 FT /product="I-5_DR_1p" FT /note="AP endonuclease, RT, RNAse H." FT /translation="MPTILQWNARSLLANGQEFKKFIEDLSNKPEIICIQE FT TWLKIHLDFRITGYIDVRHDRKEEIGGGCVTFIKENLSFREVEKGKDQEYI FT VIAVWIKGVELIVINYYNPCKKMDISKLRQIKEINNRNVIICGDFNAHSTL FT WGGEKLDENGEILEELLEEMNLVCINDGSGTRLDLYRGKLSALDLTIVSKN FT LGDICNWEVWEESTLGSDHFPVISKLAWKGEEEQRGRIEKWIFSKAQWDKF FT MYLCSVRSEEIDLNDEIEEIDRKIREVITTVAEQTIPKNKSKMKKRAVPWW FT TDGCNKAVRERNKMFKILRKTHNIQNLIKYKKAQALVKKTIKEAKKQSWRD FT YCSKIGRTVPIGEVWGMIKSMRGIKKQWQYPVLKAGEELASTEEGKAEMIA FT REIVKVYSSNNLTEKGLRSRERTKAKYPGISDKKEKTDSSMDAPFTLGELK FT RALETFQATAPGKDNICYEMVKHLDVLAMGKLLGLYNKVWEEGKIPMSWKE FT AIIIPIRKPGKDATNPSNYRPIALTSHIGKIMERMITERLNHYVEGRGFIS FT MYQSGFRRGRSTMDPIVYLETEIRKAQVNKEVVLAVFIDIEKAYDMVWKEG FT VLIKLNKLGITGRIFNWIKDFLFGRIIQVRIGKVLSKRYVVENGTPQGSVI FT SPILFTIMINDIFSNIQSDIGRSLFADDGALWKRGRNVKHVKQKIQEAVKL FT VEEWSYSWGFKLSVEKTKTVLFTRKKICDIEINMYGQKLEQVKSFKFLGMW FT LDTKLTWNEHINQLVNKCKKVLNIMRCLSGTEWGASRNALRNIYVALVRSV FT LDYGCFIYGRASKSNLQKIEVIQAQALRICCGAYKTTPISALHVEMGEIPL FT ELRRKQIMMNYWINLQGQEINSNPAKRVLIPRWEYEKNKTLFFGRDSMEVA FT KEMELIDKGFNKTVPIATTPPWLYPLGLVDLELLSNSQDYRNIANMSGIIE FT DKIMTKYKEYTEVYTDGSVNLDLGRTGFGVFVPKLDISIKKRTTNYLTIYT FT VELMAIIAALYWIEESGLKKVVICSDSYSALSSIKSMTSKNRQDCIYEIYE FT ILFRLQRANIRIVFMWIPAHIGIKGNETADFLAKEACKLDEIMNILLSKTE FT AKVIVKKDILKQWQYNWDRDITGRHYYRLQEKVGNWRKGKGNNKIEGIITR FT LKMGHTGLNKTLHLIGKHPTGLCDNCQEEESVEHVLCYCKKFIIEREILKK FT EIIKQNFSEISVKIILGNDNNESLVKYLRKTGLINRI" XX SQ Sequence 4772 BP; 2001 A; 493 C; 1063 G; 1214 T; 1 other; aattaaacta actacaatat tgaagaatca agtaggagaa attataatgg ctaaggtact 60 gagggatgga aatctactaa ttgtgtgcaa aagtgaggaa caaagagaaa gagccatgaa 120 aatcaaggaa ataggcaaat acaaagtgtc aaatatgagc aaaatcggag taggaaataa 180 actgagtaaa ggagtaatct ggggagtgcc agttggagta aatatagagg atctaaaatc 240 aaatttgcga gggggaaaga tcacagtgag agatggtaca aggagtgata gtgaaggaat 300 tctgattttt tttgatgatg aagttttacc atcaaaagtg acactaggat atctgagcta 360 cagagtgaga gaatacatac ctagaccagt gaggtgctat aactgccaaa gatttgggca 420 tgtggcgaaa atatgcaaag ggaagaaaag atgtgcaagg tgtgggggga atcatggata 480 tcatatggat gtgtagtaat gaagaaagag agggaaattc aacaaataaa aactcaaaat 540 aaaataacat atgcagaagc agttaggaga ataaaacaaa gagagagaac tgaggaaaat 600 ccagttctag tcagacaaaa tgtggaaatg caggaggaag tgggtgatga gaaaatattg 660 atagaagtta aaaaattggt aacatttatt gcaggagtaa taaatgcaac gatggaaatc 720 aagtcaaaaa cagaaagaat tcaggtcata gtgaaggctg ctgtccatca tctgggtgta 780 agtgggataa catgggagga agtgagaaat gatctgaatg caaaaacaag ccaagatata 840 acttagagaa tacataaaat attatgccta ctatactaca atggaatgct agaagtcttt 900 tggcaaatgg acaggaattt aaaaaattta ttgaggattt atcaaataaa cctgaaatta 960 tatgtattca agaaacatgg ttaaaaatac acttggattt taggattaca ggatatatag 1020 atgtaagaca tgataggaaa gaagaaatag gaggcggttg tgtaacattt attaaagaaa 1080 atttatcttt cagagaagtg gaaaaaggaa aagatcagga atatattgtg atagcagtat 1140 ggattaaagg agtagagcta atagtaataa attattataa cccttgtaag aaaatggata 1200 taagtaagtt gagacaaatt aaagaaatta acaatcggaa tgtaattatt tgtggtgatt 1260 ttaatgcaca tagcacacta tggggagggg aaaagcttga tgaaaacgga gaaatattag 1320 aggaactatt agaagaaatg aatttggttt gtataaatga tggtagcgga actagattag 1380 atctatatag aggaaaacta tcagcattag atttaactat tgtatcaaag aatttaggag 1440 atatatgtaa ttgggaagta tgggaagaat caacattagg tagtgatcat tttccagtga 1500 ttagtaagtt agcatggaaa ggggaggaag agcaaagagg aagaatagag aaatggatat 1560 tcagtaaagc acaatgggat aaatttatgt atttatgtag tgtgagaagc gaagaaattg 1620 atttaaatga tgaaatagaa gaaatagata gaaagatcag agaagttata acaacagttg 1680 ctgagcaaac aatacctaaa aataaaagta aaatgaagaa gagagcagtt ccatggtgga 1740 cagatggttg taataaggct gttagggaga gaaataaaat gttcaagatt ttaagaaaaa 1800 cgcacaatat tcaaaattta ataaagtata aaaaagcaca agctttagtg aaaaagacta 1860 ttaaggaggc aaagaagcaa agttggagag attattgtag taaaatagga agaacagttc 1920 cgattggaga agtgtggggt atgataaaga gtatgcgagg aattaaaaaa caatggcaat 1980 atccagtgtt aaaagcaggg gaggaattag cgagtacaga agaaggaaaa gcagaaatga 2040 ttgcaagaga aattgttaaa gtatatagtt caaataattt gacagaaaag ggactgagaa 2100 gtagagaaag aacaaaagca aaatatccag gcataagtga taaaaaagag aaaacagata 2160 gttcaatgga tgctccgttt actttaggag aactgaagag ggcattagaa acatttcagg 2220 caacagcacc aggtaaagat aatatatgct atgagatggt aaaacattta gatgtattag 2280 ctatgggtaa gttgctagga ttgtataata aggtatggga ggaaggaaaa attcctatga 2340 gctggaaaga ggctattatt attcctatta gaaaaccagg aaaggacgct acaaatccaa 2400 gtaattatag accaatagca cttacttctc acattggaaa aattatggaa agaatgataa 2460 cagaaagact aaaccattat gtagaaggca gaggttttat atcaatgtat caaagtggat 2520 tcagaagagg aagaagtact atggatccta tagtttattt agaaacagaa attagaaaag 2580 ctcaggtaaa caaagaggta gtattggcag tatttattga tatagagaag gcatatgata 2640 tggtatggaa agaaggagta ttgattaaat taaacaaatt aggaattaca ggaagaatat 2700 ttaactggat aaaagacttt ttatttggaa gaattattca agttagaata ggtaaagtat 2760 tatcaaaaag atatgtggta gagaatggaa ccccacaagg gagtgtaata agcccaattc 2820 tctttacaat aatgataaat gacatatttt ccaatattca aagtgatatc ggacggtcac 2880 tgtttgcaga tgacggagct ctgtggaaga gaggaaggaa tgttaaacat gttaaacaaa 2940 agatacaaga agctgtaaag ttagtggagg aatggtcata ttcatggggt ttcaagttat 3000 cagttgaaaa gacaaaaact gtattattta ctaggaaaaa aatctgcgac atagaaatta 3060 atatgtatgg acagaaactg gagcaggtaa agtcatttaa atttttggga atgtggcttg 3120 atacaaaatt aacttggaat gaacatatta accagctagt taacaaatgc aaaaaagtat 3180 taaatataat gagatgtctg tctggaactg aatggggagc gagtagaaat gcgttaagaa 3240 acatatatgt agcattggtt cgatcagtat tagactatgg gtgtttcata tatggaagag 3300 cttctaagtc taatctgcaa aaaatagaag ttatacaagc ccaggcatta agaatatgtt 3360 gtggagctta taaaactact ccaatctcag cacttcatgt agaaatggga gaaataccac 3420 tggagctaag aaggaaacaa ataatgatga attattggat aaatctacaa gggcaggaaa 3480 taaatagtaa tccagcaaaa agagttctta ttccaagatg ggaatatgaa aaaaacaaaa 3540 ctctcttttt tggaagagat agcatggaag tagctaaaga gatggaatta attgataaag 3600 gctttaacaa aacagtacca atagcaacaa caccaccatg gttatatcct ttaggattag 3660 ttgatttgga attactaagc aatagccagg actatagaaa catcgctaat atgtctggaa 3720 taattgaaga taaaattatg acaaaatata aagaatatac tgaagtatac acagatgggt 3780 cagtaaattt ggatttgggc agaacagggt ttggtgtatt tgtaccaaaa ttagacatat 3840 caattaagaa aagaacaacc aattacctaa caatatatac agtagaactg atggctataa 3900 ttgctgcgct ttattggata gaggaaagtg gattaaagaa agtagttata tgttcagatt 3960 cctattcagc attaagttca atcaagtcaa tgacatcaaa gaacagacag gattgcatat 4020 atgaaatata tgaaatttta tttagacttc agagagcaaa tattagaata gtatttatgt 4080 ggattccagc acatatagga attaaaggaa atgaaacagc agatttttta gctaaagaag 4140 catgcaaact agacgaaatt atgaatatct tgttaagtaa aacagaagca aaagttatag 4200 tgaagaagga tatattaaaa cagtggcagt ataattggga tagggatata acgggtagac 4260 attactatag attacaagag aaagtaggaa attggaggaa aggtaaagga aataataaaa 4320 tagaaggaat aataacacgg ttaaaaatgg gacacacagg cttaaataaa acattacatt 4380 taataggaaa acatccaact ggattgtgtg ataattgtca ggaggaggaa tcagttgaac 4440 atgttttatg ttactgcaag aaattcataa tagaaagaga aatattaaag aaagaaatta 4500 taaaacaaaa cttttcagaa atcagcgtaa aaattatact tggaaacgat aacaatgaaa 4560 gtttagtaaa atatctaagg aaaacaggat taattaacag gatttaaaag aataaagatg 4620 taggtaaaaa aaaaaaaaat attattattt tttttttttt tttytctctc cgggaggtag 4680 caaactcagg ctcacactcc agtacagtag gtggcggtaa tgcaccaaaa gttggttgcc 4740 gactgccata aaacagcaaa gaagaagaag aa 4772 // ID DIRS-15_DR repbase; DNA; ZEB; 7069 BP. XX AC . XX DT 07-JUN-2010 (Rel. 15.11, Created) DT 07-JUN-2010 (Rel. 15.11, Last updated, Version 2) XX DE A family of DIRS-type LTR retrotransposons - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-15_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7069 RA Kojima K.K. and Jurka J.; RT "DIRS-type LTR retrotransposons from zebrafish."; RL Repbase Reports 10(11), 1843-1843 (2010). XX DR [1] (Consensus) XX CC This consensus is generated from 8 sequences with >99% identity. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 200..3562 FT /product="DIRS-15_DR_1p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MEKHQTLSTTTSSSNNNNNNNNNNLHQANESGAILET FT SAEDQHQLTQSGNQLEPEATVRGRRPIRSTATRLSRRLQSSSPYATNLHPP FT SPASSYASATSSPTINRNMTANELRQSITDAGISIPSRCNKTELLKLYTSI FT TSSTPPTRNSRNTRSRHAPYPQHSASPPASNQPGPRKATGKTKNTKKPQAQ FT HHQDNRSITGTTSNQTKDNNPDINATQQNNATFFWPQAPQSSATPSPHLSA FT TNPLQFSLPSNLPSTSTNLIHSHEPTNISTNAPLHSSSFSPSNIPSFMPSS FT SLHQALASNTNPPPHPPLPTNISSTHPPFTLATAVPLPIPQNAPALEPPPV FT SNTIRNLIISGADIDLSTLLSPIAPPTAERQIDCGEFALILKPSTSSQTRT FT LSLAEFHVAFSRYTEVICSVFPHRRRELNDYMAIISELALSYGGTHFYTYH FT KLFSAKCAIRVSQWNQIQYWGAIDFDLHNRVFLGCRNLSCAVCRSSLHSTT FT SCPFVIPPSDTVPPNPRSTSYVPRTSTSAIPALLPSQSSSRNQSSHNPCFN FT FNSGRCYRHPCKFLHICNYCGGAHARIVCQVQKANKKSKQYLSTPVNIPQL FT AHELSLHPDSNFSDFLITGLSNGFHPGVSTLPSYNLACPNLMSATAEPEVV FT DQLIKKEIDNNFMIGPFLAPPFRVYRISPIGIATRKFSGKKRLIIDLSSPH FT NSCFSSINSIIPPEEYALNYHDIDQAISLIKLVGRNAWLAKVDISSAFKVM FT PLHPDYWHLFGINWRSKFYFAVRLTFGCRSSPKIFDMLSEAICWILSNNYG FT IAHILHLLDDFLIISPPSNPATEHLTITKTVFDNLGIPLAEEKTSGPGTSL FT EFLGIKLDSNKFQASLPKEKIDRIIALSSIFLENQNCSKRELLSILGHLNF FT AMRIIPQGRPFVTHLLQLAASVPGLDDSLSLSDQCRHELSLWISFLKCWNG FT CSFFYSDLIESPIDIQLYTDAAPSIGFGGYYQGRWFASSWPHQMIEIPPHH FT QSSALFELYPLVAASILWGDEWSASSILVHCDNEAVVQCINKRRSHSPALM FT PLLRRLIWTSAKKQFIITAKHVPGFHNQIADSLSRFLFQKFRMLAPEADLH FT PHPVPPYSEMIFL" FT CDS 3574..4551 FT /product="DIRS-15_DR_2p" FT /note="tyrosine recombinase." FT /translation="MHHLHQTSISLILNAVAPRTLQTYLTAWNTFKHFHFL FT YNTSFPDFSLLTITSFITYLHSYKNIQASSIRSYLSGIQFFHKLIHGSTSD FT AIANSQTSLLIKGVQKTHPSLPDSRLPITHHILSKCISTLRKGYVSTHTDR FT TLDAMFILSFFGFLRCSEISITSKFVPSIHPTISDLTLVDGETISFLIKQS FT KTDQSRKGHLIYIFDIPSSTFPFQTLLAYIHYRKTLNPTPLSPLFIDDLSR FT PVTRFWFQKHLKAVLRLSGISPEPYSSHSFRIGAATTAAHRGLSQQHIQTL FT GRWTSDAFKSYIRLSQSHLKEAQKTMSMSHNTNP" XX SQ Sequence 7069 BP; 1918 A; 2186 C; 1099 G; 1866 T; 0 other; ttaagtgaag tttcccaaac taatttcgag aggagcacgt gatagaattg actacagctg 60 atccgtatca ctaatcacta actagccaat cggatcactc taaatttaat ataaatatcc 120 agcctaaact ccattcctca tcttcgtttt tggaagaccc ggaaacagct agcgactcaa 180 gcaagcaaaa atctgcaaaa tggaaaagca ccaaacgctc tccaccacca ccagcagcag 240 caataacaac aacaacaaca acaacaacaa tctccaccaa gctaatgaaa gcggcgctat 300 cctcgaaact tcagccgagg accagcatca acttacccag agtgggaacc agcttgagcc 360 cgaggctaca gtgagaggcc gaaggccaat ccgttcaaca gcaacaaggt tatctcgccg 420 tcttcaatcc tcatccccgt atgctacaaa tctccatcca ccttcaccag cctcttcgta 480 cgcttccgca acatcttcac caaccatcaa caggaacatg accgcaaacg aactccgcca 540 aagcatcaca gacgccggca tttccattcc aagccgatgc aataagactg aactcctcaa 600 actgtacaca tccattacct catcgactcc gcccaccagg aactccagga acactcgctc 660 ccgccacgcc ccgtatcccc agcacagcgc ttctccgcca gcctctaacc aacccggacc 720 aagaaaagca actggaaaaa ccaagaacac caaaaaacca caagcccaac accaccaaga 780 taataggagc attactggaa ccacatctaa tcaaaccaag gacaataacc cagacattaa 840 cgctacccag caaaacaacg ccaccttctt ctggccacaa gctcctcaat ctagcgcaac 900 tcccagtcct catctttctg ctactaatcc cttacagttt tctcttcctt ccaaccttcc 960 atctacatct accaacctca tccactcaca cgagcctaca aatatcagta ctaatgctcc 1020 tcttcattct tcttcttttt ctccctccaa tattccttca ttcatgccct cttcatccct 1080 ccaccaagca ctcgcatcta atactaaccc tcctccgcat cctcctctcc ctactaacat 1140 ttcttccact caccccccct tcaccctagc cactgccgta cctcttccca tccctcagaa 1200 tgctccagct ctggaacctc cacccgtctc caataccatc aggaatctca tcatctcagg 1260 tgctgacata gatctctcca cacttctttc acccattgca cctcccacgg cagagcgcca 1320 gatagattgc ggcgaatttg cactcatact gaaaccatcc accagttctc aaacccgcac 1380 tctttcacta gcagaatttc acgtagcttt ttccagatac acagaggtaa tttgctcggt 1440 ttttcctcat aggaggcgcg agctcaacga ttacatggcc attatttcgg agctcgcgct 1500 ctcctatggg ggaacacatt tttacaccta tcataagtta ttctcagcaa aatgcgcaat 1560 tcgcgtctca caatggaatc aaattcaata ttggggggct atcgactttg atctccataa 1620 cagagtcttt ctcggttgcc gtaatctatc ctgcgcggtc tgccgttcca gtctccactc 1680 caccacctcc tgtcccttcg tcatccctcc ctctgataca gttccaccta atccaagatc 1740 caccagctac gttcctcgta catccacctc cgctatccct gctcttcttc cttctcagtc 1800 ttcttcccga aatcaatctt ctcataatcc ctgcttcaac ttcaattccg gaagatgcta 1860 ccgacaccca tgcaaatttc tccacatctg taactactgc ggaggcgcgc acgcacgcat 1920 tgtttgccaa gtccaaaaag ctaataaaaa atccaagcag tatctgtcga ctcctgttaa 1980 tattcctcaa cttgctcatg aattaagttt gcaccctgat tctaacttct ctgattttct 2040 cattacaggt ttatccaacg gatttcaccc cggcgtctct actcttccgt cctacaacct 2100 ggcctgccct aatttaatgt cagccaccgc agaaccagaa gtcgtagatc aactgatcaa 2160 gaaagagatc gataataatt ttatgatcgg tccctttctt gctcccccct tcagagtata 2220 ccgaattagt ccaatcggca tagctaccag aaaattttct ggcaaaaaac gcctcataat 2280 cgacctttca tcccctcata attcttgctt ctctagcatt aacagcatca ttcctccaga 2340 ggaatacgcg ctaaactacc acgatataga ccaagcgata tccctgatca aattagtcgg 2400 ccgcaacgct tggctcgcta aagtagacat ttcctcggca ttcaaagtta tgccgttaca 2460 ccccgactac tggcatcttt ttggcattaa ttggagatct aaattctact tcgccgtacg 2520 ccttaccttc ggatgcagaa gtagtccaaa aatatttgac atgctttcgg aagctatatg 2580 ctggatcctg tcaaataact acggcatcgc gcacatacta catctcctcg acgatttcct 2640 cattatctca cccccgagta accccgcaac agaacacctg acaataacaa aaacagtttt 2700 cgacaatctc ggcattccgc tcgcagagga aaaaacttcg ggacctggta cttcactaga 2760 atttttaggc atcaaattgg attcaaacaa attccaagcc tccttaccta aagaaaaaat 2820 cgaccgcatc atcgcattat cctcaatatt tttggaaaat caaaattgct caaagcgcga 2880 attgttgtcc attttaggac atttaaattt cgccatgcgc atcattcccc aaggacgtcc 2940 ctttgtaact caccttctcc aattagcagc atcggtcccc ggattagatg attctctctc 3000 tctatccgac caatgccgtc acgaactcag cctctggatt tccttcctaa agtgctggaa 3060 tggctgttcc tttttctata gcgacctgat agaatccccg atcgatattc aactatacac 3120 agacgccgct ccctcaatag ggttcggtgg atactaccag ggtcgctggt tcgcatcttc 3180 atggcctcac caaatgatcg aaattccccc tcaccatcaa tcgtcagcac tattcgaatt 3240 atatcctcta gtcgccgctt ccatattgtg gggagatgaa tggtccgcct ccagcattct 3300 agtccattgc gacaatgaag ccgttgttca atgcattaac aaaaggcgct cacactctcc 3360 cgcgctaatg ccgctcctcc gccgcctcat ttggacctcc gccaaaaaac agtttatcat 3420 aacagctaaa catgtaccag gttttcataa tcaaattgct gactctctct ctcgttttct 3480 gtttcagaaa tttcgaatgc tcgcaccaga agcagatcta catccgcatc cagtgcctcc 3540 ttattcagag atgatattcc tataaaccac cctatgcatc acctgcatca gacgtccatc 3600 tcccttattt taaatgctgt tgctcctaga actttacaaa catatcttac tgcatggaat 3660 acattcaaac atttccattt cctgtacaac acatcatttc cagatttttc cttgcttaca 3720 attacctcat tcatcacata cctccattca tacaaaaaca tacaagccag ctccatcaga 3780 agctatttaa gtggcatcca attctttcac aagttaattc atggctctac ttcagatgct 3840 atagcaaatt cgcaaaccag ccttcttatt aaaggcgttc aaaaaaccca cccctccctc 3900 ccagactcta ggctgcccat cacacaccat atactgtcta aatgtatctc cacccttcgt 3960 aaaggctatg tatccacaca cacagatcgt accctggatg ctatgttcat actttctttt 4020 ttcggattct taagatgttc cgaaatttct ataacttcaa aattcgtgcc cagcatccac 4080 cccaccatat cagatctaac tttggttgac ggggaaacaa tatccttcct catcaagcaa 4140 agcaagaccg atcaatcacg caaaggacac ctaatttaca tattcgacat cccctcctct 4200 acattcccgt ttcaaacact cctagcatac attcactaca gaaaaactct caatcccacc 4260 cctctctccc cccttttcat agatgattta tctcgcccag taacacgctt ctggtttcaa 4320 aaacacctta aagcagtcct tcgcctttct ggcatttcac cagaaccata ctccagccac 4380 tcatttagaa ttggagccgc tactaccgca gctcatagag gactatctca acagcatata 4440 cagacactgg ggaggtggac atctgatgcc ttcaagtcat acattcgcct aagccaaagt 4500 catcttaaag aagcccagaa gaccatgtca atgagtcata ataccaaccc ataacgtaac 4560 tatccccccc cccaccccca cccccgccaa tcccctccca gcatcaccac caccccgctc 4620 agaggagtct ccatttcctc cctccttccc cgactcctgc gaagccagcc tttgatagct 4680 caactcccga gctgtgacta cgagcacgag tcacgtcact gccccccccc atcagcccta 4740 gcttctttta tttatcttat tttgctttga ccatttatat acgcatatat gaggagcgag 4800 tatagcacgc ccacataaat atatgcctat atgaataaca gaatcatatg cccgcacgcg 4860 cccacgcatg cggatatata catttttaca tagcgctgtc acccctaagc tctggccacc 4920 gcaggagcgg tcccagagct taaaccccct tttttccctc tagaaggagt ctccatcggc 4980 caaattcccc ccttcccgac tccttctaga accagccttt gatagctcaa cttcccgagc 5040 tgtgactacg agcacgagtc acgtcactgc ccccccccac cagccctagc ttcttttatt 5100 tatcttattt tgctttgatc atttatatac gcatatatga ggagtgtgta tgacacgcct 5160 acaaatatat gcctatatga ataacagaat catatgcccg catgcgccca cgcatgcgga 5220 tatatacatt tttacatagt gctgtcaccc ctaagctctg gccaccgcag gagcggtccc 5280 agagcttaaa accccttttg ccctctagaa ggagtctcca tcggccaaat tcccccttcc 5340 cgactccttc tagaaccagc ctttgatagc tcaactgccc gagctgtgac tacgagcacg 5400 agtcacgtca ctgccccccc ccccaccagc cctagcttct tttatttatc ttattttgct 5460 ttgatcattt atatacgcat atatgaggag tgtgtatggc acgcctacaa atatatgcct 5520 atatgaataa cagaatcata tgcccgcatg cgcccacgca tgcggatata tacattttta 5580 catagtgctg tcacccctaa gctctggcca ccgcaggagc ggtcccagag cttaaaaccc 5640 ccctttttcc tctagtagga gtctccatcg gccaaatttc ccccttcccg actccttcta 5700 gaaccagcct ttgatagctc aacttcccga gctgtgacta cgagcacgag tcacgtcact 5760 gccccccccc ccaccagccc tagcttcttt tgtttattta ttttgctttg actgtttata 5820 tacgcatata tgaggaatgc gcaacatgcc tacaaatata tgcctatatg aataatataa 5880 tcatatgccg cgtgcgccta cgccgcgcat atacattttc actagtgccg tcacccctaa 5940 gctcttcccg aactgtgact acgagcacga gtcactccac gagcttaatt tgttttactt 6000 tacctattca tatgtgcata ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa agcacggcac 6060 acgtgcgtat atggatatat gcccatatga atagataatt atatgcccgc atgcggacat 6120 atacattatt tcataacatt tgcctatttt tacactacct ccatcggaga tagagtccca 6180 tcttttccca tcaaggctct tacaagcctg catctcctct acctctctaa tttcttctca 6240 aatcactata tccagcagcc ggatatagca gttctagcct tttggggggc gttcttcgaa 6300 tacacggctg ctgtcccgac acaatggcat ttatggggag ctatcgagaa ctacctgatc 6360 tcatatccct cctaatgctt attgaccagg cgggagcctt gggctcatct atcaccgagc 6420 tcagggttct ctcccgggac agcatgccaa actagcttaa atagtcaagc actatctaag 6480 tgtgaactct tgaagtgaag tttcccaaac taatttcgag aggagcacgt gatagaattg 6540 actacagctg atccgtatca ctaatcacta actagccaat cggatcactc taaatttaat 6600 ataaatatcc agcctaaact ccattcctca tcttcgtttt tggaaaaccc ccccatcctt 6660 cccttctccc tcctcctcag ttcgggcgtc acggcggctc agtggctagc actgttgcct 6720 cacagcaagg aggtcattgg ttcaacacta gctgagccaa ccgatactct ctgtgcggag 6780 tttgcatgtt ctccccgtgt tcgcgtgggt tttctccggg tactccggtt tcctcccaca 6840 cccaaaaaac catgcaaagt taaagtcaca agcacatact acaaattagc gccaatagtg 6900 atttattttg ggagctatcg agaactacct gatctcatat ccctcctaat gcttattgac 6960 caggcgggag ccttgggctc atctatcacc gagctcaggg ttctctcccg ggacagcatg 7020 ccaaactagc ttaaatagtc aagcactatc taagtgtgaa ctcttgaaa 7069 // ID DNA-1-3_DR repbase; DNA; ZEB; 837 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-837 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1212-1212 (2008). XX DR [1] (Consensus) XX CC TSD is 1 bp (see the comment for DNA-1-3B_DR). TIR is 43 bp CC long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 837 BP; 183 A; 241 C; 210 G; 202 T; 1 other; atgctgtgtt cacaccagac gcggaacgcg cggataaatc gcgctattcg cgcgtaaata 60 gccgcgtgaa catttgagtt tactcgcttc attcgcgcgt caaaccccgc ttcattcgcg 120 cgtcaaatcc gcttctgctt cattcgcgcg tcaaatccgc ttcattcgcg cgtcaaattc 180 acttcagaac agacgcggat tcgcgtgatg ggcagggctt ctgtctgccc ggtgactcta 240 gcttcatagc taaatggcta acatggattt tatgaagaaa ataacagtgt ttatgtgctt 300 tatgaaggct gaaaaacagc gtcgatacgt ttagggccgt gtctgagtcc actacatcct 360 ttcagaggtg catccagctc tgtgagctca taaactcctc cagaaactga acctggatga 420 cggaggcttt cagcggtgct tctgactgag cccagcccag tttgatgaac ttcttgtcgg 480 tgtcggctgg aggatttccc ccgggacacc racaacaggt cgatacgtca caatcacgcc 540 cccacaagag caagctcctg attggttaac gcggcgcgaa tgtccgctga agttcagatt 600 ttcgaactcg agcgattcgc gcgaaacgcg cgttaagcgc gtcaaacgcg caaaacgctc 660 aattcgcgcc gcgccattcg cgcgtatcgc gccattcgca ccgcgcaatt cgcgtcattc 720 gcgccgcgcc attcgcgcgt atcgcgccgc aggatgtcta ttcgcgcgtt tgcattgact 780 taacatgtaa atcactcgcg cttgacgcgc gttccgcgtc tggtgtgaac gcagcat 837 // ID ZFERV_LTR repbase; DNA; ZEB; 692 BP. XX AC AF503912; XX DT 23-MAR-2006 (Rel. 11.03, Created) DT 23-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE Danio rerio endogenous retrovirus ZFERV (LTR portion). XX KW Endogenous Retrovirus; Transposable Element; ZFERV_LTR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-692 RA Shen C.H. and Steiner L.A.; RT "Genome structure and thymic expression of an endogenous RT retrovirus in zebrafish."; RL J Virol 78(2), 899-911 (2004). XX DR EMBL/GenBank/DDBJ; AF503912; Positions 1 692. XX SQ Sequence 692 BP; 180 A; 99 C; 166 G; 247 T; 0 other; tgagggaatt atatttttga gggaatcata ttttttaatg ttttatatat gttcagaatt 60 gatattaagt tttttcatat ttttgatgtt ttatatatgt ttaacattga tattaagttt 120 tgctattttt ttactgatga aatgctcgtt gtttgaaata tgagcctatt gcgttgtttg 180 aaatgtgagg ctgtcgctta aagcttcgct tctagagaaa cggggttttt ccacagaagt 240 aagacgagag atctgagctc atggaaagct gtgctttggt acagggatgt ggtttaacac 300 ctgcaaggaa ccagcatttc ataggcgtat ctcagtctgt gtagaggtgc atgtgtgttt 360 tgtgtgtgtt tgtgtgtgtg tacaggctgc tggctgattc taactggttg aaatgagatc 420 tgatgatggg aagcgaaata atggattgga ggctcatcga gagtggggag gagacaacca 480 agaataaaaa ctattgtttt atgattgtac ggcactactg gagcgcccag tgtgccttat 540 gcatagtggc tattcagtat tcctgagctc agacattgtt gatctgattc tctctctttt 600 ttattaaatt ttgattactc gataaagtta agtttcctgc gtctactttc aagcgaatcc 660 tgaactcggg gagggaaaaa ccattcccaa ca 692 // ID Tc1-5_DR repbase; DNA; ZEB; 1234 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 27-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc1-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1234 RA Jurka J.; RT "Mariner-type families from zebrafish."; RL Repbase Reports 8(9), 934-934 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1234 BP; 343 A; 284 C; 300 G; 306 T; 1 other; caggggttgg acaatgaaac tgaaacaytg gccaatttag tgttggaggt ttcatggcta 60 aatttgacca gcctggtggc caatcttcat tgattgcaca ttcaccagta agagcagagt 120 gtgaaggttt aattagcaga gtaatagcac agttttgctt aaaatattgc aattcaaaaa 180 aggacaaatt gttggtgcgc gtctcgctgg cgcatctgtg accaagacag caagtctttg 240 tgatgtatca agagccacgg tatccagggt aatgtcagca taccaccaag aaggagtaac 300 tgtggacgca agaggaagct gtctgaaagg gatgtccggg tgctaacccg gattgtatcc 360 aaaaaacata aaaccacagc tgcccaactc actgcagaat taaatgtgca cctcaactct 420 cctgtttcca ccaaaacttg ttcgtcggga gctccacagg gtcaatatat tatacatggc 480 caggctgcta tagccaaacc tttggtcact cgtgccaatg ccaaacgtcg gtttcaatgg 540 tgccagcagc gaaaatcttg ggctgtggac aatgtgaaac atgtattgtt ctctgatgag 600 tccaccttca ctgtctttcc cacatccggg agagttacgg tgtggagaag ccccaaagaa 660 gcgtaccacc cagactgttg catgcccaga gtgaagcatg ggggtggatc agtgatggtt 720 tgggctgcaa tatcatggca ttccctaggc cccaaatata cttgtgctag atgggcgcgt 780 cactgccaag gactaccgaa ccattctgga ggaccatgtg cacccaatgg ttcaaacatt 840 gtatcctgaa ggcggtgccg tgtatcagga tgataatgca ccaatacaca cagcaagact 900 ggtgacagag tggtttgatg aacatgaaag tgaagttgaa catctcccat ggcctgcaca 960 gtcaccagat ctaaatatta ttgagccact ttggggtgtt ttggaggagc gagtcaggaa 1020 acgttttcct cctcaccagc atcatgtcat agtgacctgg ccactattct gcaagaagaa 1080 tggctcaaaa tccctctggc cactgtgcag gacttgtatc tgtcattccc aagacgaatt 1140 gatgctgtat tggccgcaaa aggaggccct acaccatact aatgaattat tgtggtctaa 1200 aaccaggcgt ttcagtttca ttgtccaacc cctg 1234 // ID DNA-TTAA0-7_DR repbase; DNA; ZEB; 1255 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.01, Created) DT 13-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW DNA-TTAA0-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1255 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 14-14 (2009). XX DR [1] (Consensus) XX CC These transposons insert specifically between 5'-TT and AA-3', CC and do not produce TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1255 BP; 389 A; 225 C; 225 G; 416 T; 0 other; gtattacagt ttttctcaat tggtttggta catttctcac atcagaattg aaattctcaa 60 aagttcttgt tcaattgtca catcatcatg ccatttgtgc agatgaaaaa ggcagtttct 120 cattgcattg aacaaattgc aaatgctttt gtccatccat gcaaatgatt gtgtacaatt 180 ctcagttttt tcgtacatta tcaattgctt ttgtcatgct aatcaaaatg ctttgttata 240 ggaatctatc aaatagtctc tttccccaaa acatttaggc tatagttcat catataagtc 300 ttcacatgca aaatgattta ccaagccatc ataacatgtc aagcatatat acatttccat 360 aagacatttg tccgtaaata tgatctaaat tggtaaattc ctcccaggtg aatctagact 420 ttctctaatg aaaagagttg atcaaccaat gatcgacagt tttctgaatg agctcaaggg 480 agcatctcat gaacaatcta ctggggagta tataggtaac cagagcacaa tagaaagtta 540 cagcagattc cgcagagcat ggattaggca tgcaagaaga tactatccac gctgccttgc 600 acgggaggac agtaggtgtg acgtggatga gaatttgtgg cctaacagac aggaacggga 660 agaagctgaa gatcaggatt gagattagaa tttggggagt tgttcatatt tttttgttta 720 cctactgtat gacctatgtt tatatttttg acctatgttt acttacgtat tgtatttcac 780 ctttgttatg tgcagtgctg tctttttttt ttctttacgc actgcaatgt caatgtttgc 840 caataaagaa gtcattctgt aaatgtgaat cttgtctttt ccaatcaatc ttacacatat 900 acattatatg aggtcaaatg tacaacactt gtcatcacac ctaaaccaca ttttacatca 960 acatgtccct tcaaagtgca ctattcaatt gccaacatga ctaatgaatt tgactgtctt 1020 atccgtacac aatgacacaa tgattaaaca ttctgatggc actgacaagt atattgacac 1080 aaaaacttgc ttttgaggga tggactaagg attttgagca agagactggc ttttgcaggt 1140 gatccacggt gttttgctat ttgttcaaac tgttttgaga aatgcacttg atcttctgca 1200 aattgcaaga atgattggaa aaatgtacca aagtgattga gaaaaactgt aatac 1255 // ID Kolobok-N2_DR repbase; DNA; ZEB; 1371 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous Kolobok transposons from zebrafish- a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1371 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 8(10), 1346-1346 (2008). XX DR [1] (Consensus) XX CC The TSDs are TTAA specific. The termini are similar to known CC Koloboks. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1371 BP; 421 A; 312 C; 276 G; 362 T; 0 other; agagccccta ttttgcatta taaaaggtca tattttggtt ttgggggtct ccaacaacag 60 gctgatatgc atgcaaggtc aaaaaacact ttcattgtct tataatatgc atttattttt 120 acctaattat cccaacgact cccatatgat tcgttcagcg attcatttgt tcccaaaccc 180 ctccttagcg cgaagctaat ctgcgctgat tggtccgatg acccagtctg ttgtgattgg 240 tcgactgcgt tcagcgcgag acggagagaa atgcccacca cggctatgaa gtagcacaga 300 gtatatgtga gagcccaatg caggaatgca ataaagcaat gcagttaaac accagcatat 360 tactctactc ttaaccctaa ccccaagtaa taacaacgac acacattcag tattaatcca 420 cacagtggca aaagttgaac tattttgaaa attgaccgcg ccgcgcgtgt gaggaacagc 480 tgatggtggc catagcaaag acaaacagca gaggatcgcg agttcagaaa cgcatttaaa 540 tcggtaaagg aagaagcacg cgtcgcgttt tcaacgtggt tttggatgca caatatgtga 600 atagccccat aaagatttaa cctgagagat acagtacaag cactgatcta taatagataa 660 gtgattacaa gccgcgcgga gcgattacaa cttataacac acaattaaac acatattttg 720 caaactacac aaaacgaggc aacaatttta attacactta catgttatga tccggaggaa 780 gaagaaactg gtccatatga actgtaacag ttactgacaa agtccctgtc aaagtctgac 840 aaagtcccat acataatagt ctctgctgct ccttccttta atagcaaacg gccgacgaat 900 cccgcgttgc agcgtaggtt attgtatcaa aaatcagaaa aatgacagga acaaacacag 960 cactaggttg gctatactgt ggtattgttg tgaaaatgaa ctttaaccct ttattccccg 1020 cttgtcagcc gaatctccat ctgtcagtga tcgtcatctt cgcgtcatga tcagtcccgt 1080 gatcccccgc gctccgctag tgtctgaagg gaaaggcgcg ttcacgtttt accggatctg 1140 gaactacata tttggtgatg taccgtatcg acgtcgacgc gttcaagcaa aagatccgaa 1200 cccaacacga ttcatttatt cgactctgag tcgactattt cgttaaagaa tcaatagttt 1260 taaacacggt gcactttcag atttaaacct cagctggatg ttttcattca cttagtgctg 1320 tgttacacac tgcatggaag gtcattttca aaaacccata ataggggctc t 1371 // ID DNA-5-1_DR repbase; DNA; ZEB; 739 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 5-bp; KW DNA-5-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-739 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1233-1233 (2008). XX DR [1] (Consensus) XX CC The consensus was built from copies ~95% identical to it. TIR is CC 21 bp long. The TSD is 5-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 739 BP; 244 A; 135 C; 143 G; 217 T; 0 other; cagggttcaa cgctaaggat tttttctact ggtccgatcg ggcaagtggt ttagattttt 60 acttgccctg ccaaaatttt cactggcccc accaaaaaaa aaaaaaggga agttaatagc 120 tattttttag ccacatattt taaataatgt gtcaaaaata aagtctgtga atctagaatt 180 tcaatactta aaaaaaaatt gtaaataaaa gtgttatgca aacaaaaaga gcagtatgga 240 aaatgtggag gtattttatt gcagtttgaa attatttaac aaaaggtggc tgacttgtca 300 aactgacggc aaactatgca taacatcact tcatttttat tttgtcgtgt tgttcccatg 360 taaatgtcta cttccaagac gttagaaaca gacattttct tttagcctta taatttgatg 420 ttgccgccat gttgccgttt ctacgctgta ctggttgtta ccaggttacg aaaaaaaata 480 gtagcatggt caaatcatca aatcagataa gaggaaccgg aaagcaatat gttgtttacg 540 acattacaga gaaggtagcg tttaaagact taaaagcaca aacttaaaat gcacgcaatt 600 gacaaatagt cacagacaaa ttaaatgtta gtgacaaggc agcactgtcc caatcgggcc 660 agtaatgatc cggtctactg tcccaagtgt ctctcacgct ggccctgggc caccgggcag 720 tccttattgt tgagccctg 739 // ID PROTOP1_DR repbase; DNA; ZEB; 384 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE PROTOP1_DR is a nonautonomous P-like DNA transposon - a DE consensus. XX KW P; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; P superfamily; PROTOP1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-384 RA Kapitonov V.V. and Jurka J.; RT "PROTOP1_DR, a nonautonomous P-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(4), 23-23 (2002). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed based on 10 copies CC present CC in sequenced DNA that represents ~1% of the genome. Approximately CC 1000 copies of PROTOP1_DR are expected to populate the complete CC zebrafish genome. PROTOP1_DR copies are 7% divergent from the CC consensus sequence. CC This element is characterized by 33-bp terminal inverted repeats CC and 7-bp targets site duplications. We classify this element as a CC nonautonomous DNA transposon from the P superfamily. PROTOP, CC PROTOP_A CC and PROTOP_B DNA transposons found previously in D. melanogaster CC are characterized also by 7-bp TSD and 30-bp TIRs similar CC to those in PROTOP1_DR. XX SQ Sequence 384 BP; 115 A; 76 C; 88 G; 105 T; 0 other; caaagagcat agaatcacca agaggcgaca ctctagtgca atttgggaaa cagccactag 60 atggcgcggc ggccattttg gaatgaaaac tccaatagaa caacagcata ttataagtct 120 gtaaaataaa ctattaaaag tgctgatgat tgtgatagta agtgttgtat tgtcgtcttt 180 caggttgtat ctcagcttta atgcgctttt taaataaata aataaaaaac aaagcagctg 240 cttgccatcg cgacagcaat aagatccaat ggacggccga tcgctttcac tccaaaatgg 300 cggaatccgg ggctgttgct gggcgctgct gttgcaatgg aacgttctat tgagtgtcgc 360 ctcttggtca ttctaagctc tttg 384 // ID Gypsy-19-I_DR repbase; DNA; ZEB; 6544 BP. XX AC chr13; XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-19_DR LTR retrotransposon - a DE fossilized sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-19-I_DR; Gypsy-19-LTR_DR; Gypsy-19_DR; KW endogenous retrovirus; gag; integrase; protease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6544 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-19_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 11-11 (2005). XX DR Zebrafish.; chr13; Positions 7939002 7932459. XX CC Gypsy-19-I_DR is an internal portion of the Gypsy-19_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-19-LTR_DR. The genome contains one copy of Gypsy-19-I_DR CC that is flanked by 2% diverged LTRs. The internal portion CC contains two ORFs encoding Gypsy-like gag and pol proteins. CC The first ORF contains one frame shift and encodes the 601-aa CC Gypsy-19_DR1p gag (pos. 94-1892). The second ORF encodes the CC 1566-aa Gypsy-19_DR2p pol protein (pos. 1847-6544) composed of CC the protease, reverse transcriptase, and integrase domains. XX FH Key Location/Qualifiers FT CDS 1847..6544 FT /product="Gypsy-19_DR2p" FT /translation="AEGQTSCMGSTEFFKLRSVSVVGRTETGGNKQCPKKH FT SYKQNVNSTSKLPARLVGTRCTASIQIEGKVVSCLLDTGSQVTTIPMSYYK FT KHLSRHPMKSVEHLLEVEGANGQSVPYLGYVELNLKFPKEFLGVEVEIPTL FT ALIVPDLTTVPQILIGTNSLDVLYTNYTEGETSACKSNFYGYQAVINVLER FT RNQQASIETVGSVRLKGKQEIIPAGCVVVLEGVMQLKNPLFKKVVSVESPM FT SSHLPGGLLVANSLHTLPARQRVAQLSILVKNETQTDLIIPSRAIIAEAHA FT VERIVEREISKKSAENEKMELNEQKIPIDFGNSPLSPEWKKRITSVLNSMP FT DVFALNDLDYGHTDKVKHRIKLNDETPFKHRARPIHPQDVDAVRKHLQELL FT EAGIIRESESPFSSPIVIVRKKNNSVRLCIDFRKLNSQTIKDAYALPNLEE FT AFSVLTGSRWFSVLDLKSGYYQIEMEEADKQKTAFVCPLGFWEFNRMPQGI FT TNAPSTFQRLMERCMGDLNRKEVLVFIDDLIVFSETLEEHETRLMQVLKRL FT KEFGLKLSPEKCKFCQTSVKYLGHIVSQHGVETDPSKVEALKTWPRPNNLK FT ELRSFLGFSGYYRRFVQDYSKIVKPLNDLTAGYPPLQKHCNKKRDETKQYF FT NPKEQFEERWTLECQQAFDTVIEKLTSAPVLGFADPKLPYVLHTDASTTGL FT GAALYQEQDGQKRVIAFASRGLTKSEAKYPAHKLEFLALKWAVTAKFNDYL FT YGADFTVVTDSNPLTYVLTSAKLDATSYRWLSSLSTFNFKIQYRAGNRNQD FT ADGLSRRPQEAMPDDLETTKERERIRQFAYHHLTDASSAVVSSEAIKAICE FT RHEVCQSNSHTDSLYQPVTLVESLAVGEKVLPQSFQQEETCRSGEIFLLSE FT QDLRERQRADPEIGIIIKHLEHEEKPCLKTIPSHLALWLREWNRLVMKNGV FT LFRKRNEQGSESYQLALPVNLRDVVIKELHDEMGHLGIERTLDLVRSRFFW FT PKMSLTIEQKVKTCERCIRRKTPPEKAAPLVNIKTSRPLELVCMDFLSIEP FT DQSNIKDILVITDHFTKYAVAVPTRNQKAQTVAKTLWDNFLVHYGFPEKLH FT SDQGPDFESRTIKELCRVAGISKTRTTPYHPRGNPVERFNRTLLQMLGTLN FT TKEKSNWKDFVKPLVHAYNCTRNDVTGFSPYELMFGRQPRLPIDLAFGLPV FT SSQPESHSKYVQSLKNRLEESYMVATKNASKVAARNKTRYDQRVVSSTLDV FT GDRVLVRNVRLRGKHKLADKWESDVYVVVKKAVELPVYTVKPEGKDGPLRT FT LHRDLLLPCGFLLMDKPKEPLDKEVPVKPRTRASSAKELEKCESQSEHSED FT DDLSESYAPERTLEIETRIVSNTNPVACIRNRQTKNLPSVETAIEPSTECP FT IQHEPINDNLPEPVERNIPYLPEAEDQDLLCESERENVEELINHQSTENEI FT NYISSQKDHTPCQSPLIDQDEMPQPEKTQRRSQRSHEPPKRLQYPQLGNPL FT SMVIQSLLSGLSTAFTNSLEENLEQFNSVPTIVTVQPQMQRDLHAFRRGE" FT CDS join(94..603,600..1892) FT /product="Gypsy-19_DR1p" FT /translation="MEIVKRENIEIQNSVIVSGLTLSALDEELEDYLKRYG FT SVKSNLLIDNPESEYHHNTIVEFDCDTAMSNLEPSLPMDIQSTQDSNAVFH FT VRSLGSVYASAASSSLTEGYLESLVAIAQSSGKPLQDVLQMELQKMNALKL FT PASDSPRPDTPVQMFQNLKTLSRKAHQISHPPYYSISNNASTADTISLNLP FT TSALNPPGIQRVVLEHVVKTSDAVSSSNAAYRLKAFSGRSPRQNTEPDFET FT WRASIDFLINDQSLSDSHKTRKILDSLLPPASDVVRHVGPDVSPTKCLELL FT ESVYGSVEDADELLVKFISTLQNQGEKPSAYLHRLHVMLSATIRRGGVTEA FT ERKGYLLKQFCRGCWDNNLISDLQLERRKAAPPSFAELIVLIRTAEDKLLL FT KEERMKKHLGINKHVSVPVKFRTATHQQSVYYADAIDEPRVENNHESLRQK FT SAKPKSKSDQSELDALKKEVAKLQAKISSMMADPIRENKKKASSDANEFNQ FT LKKQVAELQAHLVPVVQGRCFEKSPASKNFPVRHRPNVMGPEMHKEERPTW FT NSTNRPRPGYCFQCGEDGHLAVHCKNAPNPQRVEEKRRELRDRQAAWDLQN FT SLN" XX SQ Sequence 6544 BP; 2104 A; 1372 C; 1424 G; 1644 T; 0 other; gaaaagttgg tgagccagtt aggagtgtaa atagctaaat ttcaatcgtt ttaattatca 60 tttattgcac tttagcattt aaaataaata gaaatggaaa ttgtcaagcg tgaaaatatt 120 gaaattcaga attctgttat tgtcagtggc ttaacactct ctgctctaga tgaagagtta 180 gaagattatc tgaaaagata tggatcagtc aaaagtaact tacttattga taatccagaa 240 tcagaatatc atcataatac aattgtggag ttcgactgtg atactgcaat gagtaatttg 300 gagccctctc tccctatgga tatccaaagt acacaggatt caaatgctgt ctttcatgtt 360 cgcagcctag ggagcgtgta tgcctctgca gccagcagca gtctaactga aggttacttg 420 gagagtctgg tagccattgc ccaatccagc ggaaagcctc tgcaggatgt tctacaaatg 480 gagctgcaaa agatgaatgc tttgaagctt ccagcatctg actcccctag acctgataca 540 cctgtgcaaa tgttccagaa cctgaaaacc ctgtcaagga aagcacacca gataagtcac 600 cctactattc tataagcaac aatgctagta ctgcagacac catatccttg aatctaccta 660 caagtgctct gaatccccct ggcattcaac gagtagtctt agaacatgtt gtgaaaacca 720 gtgatgccgt ttcttcttca aatgctgctt accgtctcaa agccttctca ggaagaagcc 780 caagacagaa cactgaacca gattttgaaa cttggagggc cagcattgac tttctgatta 840 atgaccagtc cctttctgat tcgcacaaaa cccgaaaaat tcttgacagt ctgctgcccc 900 cagcctcaga tgttgtccgg catgttggtc cagatgtttc accaaccaaa tgccttgaac 960 ttttagaatc tgtgtatggc tcagtggagg atgctgatga gctgctagtg aagtttataa 1020 gcactctgca gaaccaggga gaaaaaccgt cagcctacct gcatcgtctt catgtaatgt 1080 taagcgctac aatccgacga ggtggtgtta cagaggctga acgcaaaggc tatcttctca 1140 aacaattctg tcgtggctgc tgggacaaca atttgatttc agatctgcag ctggagcgaa 1200 gaaaagccgc acccccctct tttgctgaac tcatcgtact catccgtact gcagaagaca 1260 aactgttgtt aaaagaagaa agaatgaaaa aacatcttgg aataaataag catgtttctg 1320 ttccagtcaa gttccgaaca gccactcacc aacagtctgt ctattatgct gatgcaattg 1380 atgagcccag agttgagaac aatcatgaga gtctgaggca gaagtcagct aagcctaaaa 1440 gtaaatctga tcagtctgaa ttggatgcac tgaagaaaga agtcgcaaag ctgcaggcca 1500 agatttcttc aatgatggct gaccctatca gagaaaataa aaagaaagct tcctctgatg 1560 ccaatgaatt taatcaactg aagaaacaag ttgctgaact acaggctcat cttgtaccag 1620 tagtgcaagg gagatgtttt gagaaatctc cagcttctaa aaatttccct gtgagacaca 1680 gacccaatgt aatgggacca gagatgcata aagaggaaag acccacctgg aactctacca 1740 atagacctcg tcctggttac tgctttcagt gtggagaaga tggtcatcta gcagtccact 1800 gcaaaaatgc tccaaacccc caaagagttg aagaaaagcg ccgtgagctg agggacagac 1860 aagctgcatg ggatctacag aattctttaa actgagatca gtctcagttg tggggcgaac 1920 agagactgga ggaaacaaac aatgcccaaa gaaacacagc tacaagcaaa atgttaacag 1980 cacttctaaa ttacctgcaa gattggttgg aacaaggtgc actgcaagta ttcaaattga 2040 gggaaaagta gtcagctgtc ttctagatac cggatcgcag gtgaccacaa ttccaatgtc 2100 ctattataaa aagcatctat ccagacaccc catgaagtca gtggagcacc tactcgaagt 2160 tgaaggagcc aatggccagt ctgtacccta cctgggttat gtggagctca atttaaagtt 2220 tcccaaagaa tttcttggtg tagaagttga aatcccgacc ctggctttga tagttcctga 2280 tctaacaact gtacctcaaa tacttattgg cacaaactca ctggacgtgc tgtacaccaa 2340 ttatactgaa ggagaaacat ctgcctgcaa gtcaaacttt tatggctatc aagcggtcat 2400 aaatgtactg gagaggagga accaacaagc ttccattgaa actgtgggat ctgtaaggtt 2460 aaaaggaaaa caagagataa taccagcagg atgcgttgta gtgttggagg gagtcatgca 2520 gttaaaaaat cctctgttca agaaagtggt gtcagtggag tcacctatgt catctcactt 2580 acccggtggt cttctagtgg caaacagtct ccacacttta cctgcaagac aacgtgttgc 2640 acagttgtca attctggtga aaaacgaaac ccaaactgac ctgatcattc catcaagggc 2700 tattattgct gaagctcatg ctgtagaacg aatcgtagag agagaaattt caaagaaaag 2760 tgctgaaaat gagaagatgg aattaaacga acaaaagatc ccaattgatt ttggaaactc 2820 cccactttca cctgagtgga agaagagaat aacctctgtg ttgaactcca tgcctgatgt 2880 ctttgcttta aatgatttgg attacggaca cacagataaa gtaaaacaca gaataaagct 2940 taatgatgaa acaccattca agcatcgagc tcgtccaatt catccccaag atgtggatgc 3000 agtcaggaag catctccaag aactgttgga ggcaggtatc atcagggagt cagaatctcc 3060 attttcgtct ccaatagtca ttgtacggaa gaaaaacaac tctgtgcgcc tgtgtattga 3120 cttccgcaag ttaaactccc agaccataaa ggatgcgtac gcccttccta acctggaaga 3180 agctttttca gtcctaactg gatcaagatg gttttcagtc cttgatttaa agtcaggcta 3240 ttatcagatt gagatggaag aggctgataa gcaaaaaacc gcctttgtgt gcccactagg 3300 tttttgggaa tttaaccgga tgccacaggg aatcacaaac gcaccaagca cattccaacg 3360 gctcatggaa cgttgtatgg gagatttgaa ccggaaggaa gttcttgtgt tcatcgacga 3420 cctcattgtt ttctcagaaa ctttggagga acatgagacg agattaatgc aagtgcttaa 3480 aagacttaag gagtttggat taaagctttc accagaaaag tgcaagtttt gtcaaacctc 3540 tgttaaatac ttgggccaca ttgtttctca acatggagtg gaaactgatc catctaaagt 3600 agaagcccta aaaacctggc caagaccgaa taatctaaag gagttaagat ctttccttgg 3660 attttcaggg tactacagaa gatttgtgca agattactct aaaatagtaa aaccactcaa 3720 tgatcttact gcaggatatc caccactgca gaagcattgt aacaaaaaga gagatgagac 3780 caaacagtac tttaatccaa aggaacagtt tgaagaaagg tggactctgg aatgtcagca 3840 agcgtttgac actgtcattg aaaaacttac atctgctcca gtcttaggat tcgctgatcc 3900 aaagttgcca tatgtgttgc acacagatgc cagtaccacc ggccttggtg cagccctata 3960 tcaagagcaa gatgggcaga agagagtcat tgcctttgca agcagaggtc ttaccaagag 4020 tgaggcgaaa tatccagctc ataagttaga gttcctggcg ctgaagtggg cagtcactgc 4080 taaattcaat gattatctgt atggagctga cttcacagtg gtgacggaca gtaatccgct 4140 gacatacgtt ttaacctctg ccaaactgga tgcaaccagc tacaggtggt tgtcgagtct 4200 gtcaactttt aactttaaaa ttcagtacag agctggaaac agaaatcagg atgcggatgg 4260 tttatccaga cgtccacaag aagcaatgcc tgatgatcta gaaacaacaa aagagagaga 4320 gagaatccga cagtttgctt atcatcacct cacagatgcg tcaagtgcag ttgtttcatc 4380 agaagcaata aaagccatct gtgaacgtca tgaagtttgc cagtcaaaca gtcacactga 4440 ttccttgtac caaccagtga cccttgtaga atctcttgct gttggtgaaa aagtcttacc 4500 tcaatcattc cagcaggaag aaacttgtcg atctggtgag atattccttc tgtcagagca 4560 agatttaagg gaaagacaga gagcagatcc agaaattgga atcattatca aacacctaga 4620 acatgaggaa aagccttgtc ttaaaaccat accttctcat ttagccctgt ggttaagaga 4680 atggaatcgc ctggtaatga aaaatggagt gcttttcagg aaaaggaacg agcaaggaag 4740 tgagtcatat cagttggcat tacctgttaa tcttcgtgat gtagttataa aggaactgca 4800 tgatgagatg ggacacttag gcattgaacg aacccttgat ctagttcgat ccagattctt 4860 ctggcctaaa atgtctttga ccatagagca gaaagtgaaa acctgtgaac gatgtattcg 4920 acgaaaaact cctcctgaga aagccgctcc attagttaac atcaaaacca gcaggccatt 4980 ggagttagtt tgtatggatt tcctgtcaat tgagcctgac caaagtaaca ttaaggacat 5040 actggtcata accgaccatt tcacaaagta tgctgtggct gtgccgacga gaaatcagaa 5100 ggcccagacg gtggccaaga ctctgtggga caactttctg gtgcattatg gctttccaga 5160 aaaactccat agtgaccaag gtccagattt cgagtctcga acgataaaag agttgtgtag 5220 agtggcagga atttccaaaa caagaactac tccttaccat ccaagaggta atccggtaga 5280 gcggtttaac cggactctcc ttcaaatgct gggaactttg aatacaaagg agaagtcaaa 5340 ctggaaggat ttcgtaaaac ctctcgtcca tgcatacaat tgcacaagga atgatgtgac 5400 tggattttca ccatatgaac ttatgtttgg aagacaacct agattaccta ttgacttggc 5460 ttttggttta ccggtcagca gtcagcctga atcccattca aaatatgtgc aatctctaaa 5520 aaatcgtttg gaagagagct acatggttgc tactaaaaat gcttcaaaag tggctgcacg 5580 taacaagaca aggtatgacc aacgtgttgt ttcctcaact ttggatgtgg gtgaccgcgt 5640 gcttgtgaga aatgtccgtt tgagaggaaa acacaagcta gcagataaat gggaatctga 5700 tgtgtatgtt gtagtcaaga aggctgttga gttgcccgtc tatacagtga agcctgaagg 5760 gaaggatgga cctcttcgaa ctttacatcg agatctgtta ttgccttgtg gatttctact 5820 gatggacaaa cccaaagaac cacttgataa ggaagtgcct gtgaaaccca gaaccagagc 5880 cagttctgcc aaagaacttg agaaatgtga atcccaaagt gaacattcag aagatgacga 5940 tctaagtgag agctatgctc ctgaaaggac tttggaaatt gaaacaagga ttgtgtcaaa 6000 tacaaatcct gttgcttgta tcagaaatcg acaaacaaag aacttaccaa gtgtggaaac 6060 tgcgatagag ccaagcacag aatgtcctat ccagcatgaa cccattaatg acaacttacc 6120 tgaacctgtg gaaagaaata taccatactt acctgaagct gaagatcaag atctcctatg 6180 tgaatctgaa agagaaaatg tagaggaatt aattaaccac caatctacag agaatgaaat 6240 aaattatatt tcttcacaaa aggaccatac cccttgccag agtcctctga tagatcaaga 6300 tgaaatgcct caacctgaaa aaactcaaag acgcagtcaa agatcccatg aaccccccaa 6360 aagacttcaa tatcctcagt tgggaaatcc tctcagtatg gtgatccagt ccttgctatc 6420 aggcctgagc acagcgttta ccaactccct ggaagaaaac ctggaacagt ttaatagtgt 6480 gcccaccatt gtgactgttc aaccccagat gcagagggac ctgcatgcat tcaggagggg 6540 agag 6544 // ID DNA-2-28_DR repbase; DNA; ZEB; 387 BP. XX AC . XX DT 09-JAN-2009 (Rel. 14.01, Created) DT 12-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-2-28_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-387 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 6-6 (2009). XX DR [1] (Consensus) XX CC 2bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 387 BP; 106 A; 65 C; 72 G; 144 T; 0 other; cacagcaaaa tcctctgagt aaaatttact cagttcagag agtatttggt ccctctttaa 60 ataagggcaa acaggttggt aataagtaaa gctgctgttt gtggagttgc ttaatgatcc 120 tctgctgaga ttttgagaga ttcgatgtat tatatttcag ctgatttcat ctaaggctgt 180 gactgaagtc agttgtagtt ttgtgtttct cttctgtttc tgtgtttctg cttattaaca 240 gcaggtgttc atcatcagtg ctcaatcatc acttaattaa tcctttcatt gtctcattaa 300 ctttaactcg agtaacttta actcaattta gagagggacc aaatactatc taaactgagt 360 aaattttact cagaggattt tactgtg 387 // ID L1-4_DR repbase; DNA; ZEB; 5548 BP. XX AC AL807749; XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 19-FEB-2010 (Rel. 15.03, Last updated, Version 3) XX DE L1-4_DR is a non-LTR retrotransposon from the Tx1 clade. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clade; L1-4_DR; DewaDr1. XX NM L1-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5548 RA Kapitonov V.V. and Jurka J.; RT "L1-4_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 24-24 (2002). XX RN [2] RP 1-5548 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Molecular Biology and Evolution 21(2), 207-217 (2004). XX DR Genbank; AL807749; Positions 100605 95058. XX CC L1-4_DR is a family of L1-like non-LTR retrotransposon. This CC family was active recently (no stop codons in ORF1; CC a few stop-codons in ORF2). CC It encodes two proteins: CC the 459-aa L1-4_DR1p (positions 171-1547) and the 1294-aa CC L1-4_DR2p CC (positions 1577-5461, a conceptual translation). CC These proteins are most close to corresponding CC proteins encoded by other L1-like elements. L1-4_DR1p is a CC putative CC RNA/DNA binding protein, and L1-4_DR2p is composed of the CC AP endonuclease (aa positions 1-200) and reverse transcriptase CC domains. CC This is the same as leucine tRNA-gene spacer-specific non-LTR CC retrotransposon family DewaDr1 [2]. XX FH Key Location/Qualifiers FT CDS 171..1547 FT /product="L1-4_DR1p" FT /translation="MDTLVKELELKMDYGSMDKATPENNNSRDSNNGIDDN FT DTWATVVARRRKTKSDTSREGSGEMQQTKGKANAEHLENSQQSSHRNEMIN FT KLKSQARFQRQYKKETTLTMTVKDPENITVTMIIKAVEDKTGIGKLFGLRK FT KSNFDYELTMENETDCDHLMDGLMINQQFCEVSKLCATERMVSFLNLPNYI FT QDSEIIQKLVDWGVSPILPLRRRYHPGTTVADGTRFIRVKFPKEVMSLPYN FT VKFDTEEGPKYFRVIHDQQIKTCRLCGSAEHEKKDCPQFVCRECLEQGHFT FT RDCKAPRCQGCKKTILWCRCESDEEETGVMETNKQMEKSSNEEREEEVQEL FT PPDDLNEQEEEQAMSEEDEGDTADTEAQDLMKDDGHDMEEEQGAAGENTES FT RIEEEVSDDDDNEEINIGTKDRTIDSINRRRRKTVQLNIQQVLKKQKLRKE FT AKAKLKTERTDLRF" FT CDS join(1577..3517,3516..4724) FT /product="L1-4_DR2p" FT /translation="MDCFIWIFFCLDGLFSFIFLMNNLCLVSINVRGLSSK FT VKFENVIALTKKCDVICIQETGWNENIVNDLKKCWDGEILYNNDPNKKKGM FT AILIRRGIGYTFDVLFKDNYGRILTIKIMNKDEEIRICNIHAPNEDLERVT FT FFKDLSVLMSGWNNVIVLGDFNTVLERIDVDDHMVFRADVGRRELKHMIEK FT HKYVDVWRERNRAKREYSRRQWVNTVLKQSRLDYVLCTRNVESFISNIFYK FT IFSCSDHDFLYVMMDFSGVERGPGVWVFNTELLKNDFYKIEMENIIINSVN FT DELYDEEISVWWDNVKLEAKRFSIECSKKMQKAKRAKERQLNKEWENEMEK FT ITEGNMDIRRIVILEEKLKKLEEEKCMGARIRSKIKNTVEGERSTKFFYDL FT EKTRQKADLIKNVSTKEKTVKDKESILRTVKDFYETLFKAKGVHEEDKDFL FT LNQIKVKVSEEDKKLCDSDITEEEINEAITQLSNGKSPGLDGLSSEFYKTF FT KDVLIPILKDLFIAIFKKGQLSESMKKGMIKIIYKNKGDKDYLQNYRPLSM FT LNTDYKILAKILANRLKKVVPTLITTNQAYGVIGRDIADTVTSIRDLIWYI FT KEKKDEGFLFSIDLEKAFDRVEHSYLFDIIQKFGFGENFIKWIKCFLYRDI FT FSCFKINGFLTDYMEISRSIRQGCPLSALLYTLVAEPLGLAINGEKKIKGF FT KIEINRTEQKIYQYADDTTLFLKDFKSVGKAMEIFDKYCRGSVAKVNKEKT FT EYMKMGKVDVQQGNWEYKEQKKYINILGITLGYDENKTREIIWDELINKME FT KRLCFWKQRVLFLKGKVLVLNSLFLSKMWYVLSVVSLPTWVYKKLKTMILN FT FLWDDKPSKIAYNTIIGKVDEGGLRLIDPWIRIKSMRIKTLKKFLNEDNIL FT WKSIMSYFINKCGQIRDDFLWMAFKDRMIENIPEFYEELLRTWKCFYNNIQ FT TEIEGRKLYLQQPLFLNQNNKSKKQMFYENWYAVGFRQVKDILYEIKPGFL FT PTQAIIDTLEEIEDVDDKEKIEDQYKKLRLALPDHWIKTIEENE" XX SQ Sequence 5548 BP; 2269 A; 553 C; 1134 G; 1592 T; 0 other; tgctttcaag gaagtgtgag gtggcagtag ggagagaaag gctctcccat tttgatttgc 60 tttattgttt tttttcttga ttttgcttaa attaaattgt attaattttg tttagtttta 120 gttaaacccc agacagtgtt atctgtttgg ggttaaaacg ttttgaaagg atggacactt 180 tggtaaagga actggaacta aaaatggact atggcagcat ggacaaagct actcctgaaa 240 acaacaattc aagagattca aacaacggca tcgacgataa tgacacatgg gcaactgttg 300 tggcaagaag gaggaaaact aaatcagaca caagtagaga aggaagtgga gaaatgcaac 360 aaactaaagg taaagcaaat gctgaacact tggaaaacag ccaacaatca agtcatagaa 420 atgagatgat aaacaaactg aaaagtcaag ctagatttca gcgacaatat aaaaaagaaa 480 caactctgac aatgactgtg aaagatcctg aaaatatcac tgtaacgatg attataaagg 540 ctgtggaaga taagactgga attggaaaat tgtttggact gaggaaaaaa tccaattttg 600 actatgaact tactatggaa aatgaaacgg actgtgatca cttaatggat ggactaatga 660 ttaaccaaca attttgtgaa gtatcaaaac tctgcgcaac tgagagaatg gtttcttttt 720 tgaacttacc caactatatt caagatagtg aaatcatcca aaagctggtg gactggggag 780 tttctccaat tctcccactg agaagaagat atcatccagg aacaactgtg gctgatggaa 840 caaggtttat cagagtgaaa tttccaaaag aagttatgag tcttccttac aatgtaaagt 900 ttgatacaga ggaaggacca aaatatttta gagtgataca tgatcagcag ataaaaacat 960 gcagattatg tggaagtgct gaacatgaaa aaaaagactg cccacaattt gtgtgtagag 1020 aatgtctgga gcaggggcat tttacgcggg actgtaaagc cccacggtgc caaggctgca 1080 aaaagacaat attgtggtgc agatgtgaat cggatgagga agagactgga gttatggaaa 1140 caaataaaca aatggagaaa tcaagcaatg aagaacggga agaggaagta caggaattac 1200 caccggatga tttgaatgaa caagaagagg agcaggccat gagtgaagag gatgaaggag 1260 atacagcaga cactgaggca caagatctaa tgaaagacga tggacacgac atggaagaag 1320 aacaaggagc agcaggtgaa aatacagaaa gcagaattga ggaagaggta agcgatgatg 1380 atgataatga agaaataaat attgggacta aagacagaac aatagacagc ataaacagaa 1440 gacgcagaaa aactgtacaa ttaaatattc agcaagtgct taagaaacaa aaattacgaa 1500 aggaagcaaa agcaaaacta aaaactgaaa gaactgatct aagattttag atcataaaaa 1560 agactacaaa tgattgatgg attgtttcat ttggattttc ttttgtttgg atggcttgtt 1620 ttcttttata tttctaatga acaacttatg tttggtttca attaatgtaa gagggctgtc 1680 atccaaagtg aagtttgaaa atgtaattgc tttaacaaaa aaatgtgatg ttatatgtat 1740 acaagagact ggatggaatg aaaacattgt taatgattta aaaaaatgtt gggatgggga 1800 aatattgtat aataatgacc caaataagaa aaaaggcatg gcaatattaa ttagaagggg 1860 aataggatat acatttgatg ttttatttaa agataattat ggaaggattt taactattaa 1920 aattatgaat aaggatgaag aaataagaat atgtaatata catgctccaa atgaagactt 1980 ggaaagagtt acttttttta aagatctaag tgttttaatg agtggatgga ataatgttat 2040 tgttttagga gattttaata ctgttttaga aagaatagat gtagatgatc atatggtgtt 2100 tagagcagat gttggaagaa gagaactgaa acacatgatt gaaaaacata aatatgtaga 2160 tgtatggaga gagagaaata gagccaaaag agaatactca agaaggcagt gggtgaatac 2220 agttttaaaa caaagcagat tagactatgt tttatgtaca agaaatgtag aatcttttat 2280 ttcaaatatt ttttacaaga tttttagctg tagtgaccat gattttctgt atgtaatgat 2340 ggatttcagt ggagttgaaa gaggaccagg tgtatgggtg tttaatacag agcttttaaa 2400 gaatgatttt tataaaattg aaatggaaaa cattattatt aatagtgtga atgatgagtt 2460 atatgatgaa gaaataagtg tgtggtggga caatgtaaaa ttagaggcca aaagattttc 2520 aatagaatgt tcaaagaaaa tgcagaaagc caaaagagct aaagaaagac aattaaacaa 2580 agaatgggag aatgaaatgg aaaagataac agaaggaaat atggatatta ggagaatagt 2640 gatattagaa gagaaactga aaaaactaga agaggaaaaa tgtatgggag ctagaataag 2700 aagcaagata aaaaatacag tggaaggaga aagaagtaca aagttctttt atgatctaga 2760 aaaaacacga caaaaagcag atttgataaa gaatgtctca acaaaagaga aaactgtcaa 2820 agataaagaa agtattttaa gaacagttaa agatttctat gaaactttgt ttaaagcaaa 2880 aggagttcat gaagaagata aggatttttt attgaatcaa ataaaggtta aagtaagcga 2940 agaggataaa aaactgtgtg atagtgatat aactgaagag gagatcaatg aagctataac 3000 acaattaagt aatgggaaaa gccctggttt agatggtttg tcatctgaat tttataagac 3060 ttttaaagat gttttaattc caattttaaa agatcttttt attgctattt ttaaaaaagg 3120 acagttgagt gagagtatga agaaaggaat gattaaaatt atttataaaa ataaaggtga 3180 taaagattat ttgcaaaatt atagaccttt aagtatgctt aatacagatt ataaaatatt 3240 agcaaagatt ttagcaaaca gacttaaaaa ggtagttccc actcttatta ctactaacca 3300 ggcttatggt gttataggta gagatatagc agacacagta acaagcatca gagatttaat 3360 ctggtacata aaagaaaaaa aagatgaagg atttttattc agcatagatc tagaaaaggc 3420 ttttgataga gttgagcata gctatttatt tgacataata cagaaatttg gctttggtga 3480 gaattttatt aagtggataa aatgtttttt atacagatat ttttagttgt tttaaaataa 3540 atggattttt aaccgactac atggagattt ctagatctat aagacaagga tgtcctttat 3600 cagcgttatt atacacatta gttgctgaac cattaggctt agctataaat ggagaaaaga 3660 aaattaaagg gtttaaaata gaaatcaata gaacagagca gaaaatttac cagtatgctg 3720 atgataccac tctattttta aaagatttta aaagtgttgg aaaagctatg gaaatatttg 3780 ataaatattg tcgaggatcg gtagcaaaag taaataaaga aaaaactgaa tatatgaaga 3840 tgggaaaagt agacgttcaa caaggaaatt gggaatataa agaacaaaaa aaatacataa 3900 atatcttagg cattacactg ggatatgatg aaaataaaac tagagaaata atttgggatg 3960 aacttataaa taaaatggaa aaaagattat gtttttggaa acagagagta ttgtttttaa 4020 aaggaaaagt actggtatta aattctcttt ttctatctaa gatgtggtat gttttaagtg 4080 ttgttagtct acctacgtgg gtgtataaga aattaaaaac tatgatttta aactttttat 4140 gggatgataa accatctaaa attgcatata acactatcat tggaaaggtg gatgagggag 4200 gactaagact tatagatcca tggataagaa taaaaagcat gagaattaaa acattaaaaa 4260 agtttttaaa tgaagacaat attctatgga aaagcataat gagttatttt attaataaat 4320 gtggacaaat aagagatgat ttcttatgga tggcatttaa agaccgcatg atagaaaata 4380 ttcctgagtt ttatgaagag ttgttgagaa catggaaatg cttttataat aatatacaaa 4440 ctgagattga agggagaaaa ctttatttac agcaacccct atttttaaat cagaacaata 4500 aaagcaaaaa gcaaatgttt tatgagaact ggtatgcagt gggttttaga caagtaaaag 4560 acattttata tgaaataaaa cctgggtttt taccgactca agcaataata gatacactgg 4620 aggaaataga agatgtagat gataaagaaa aaattgaaga tcaatacaag aagttaagat 4680 tagcattacc agatcactgg atcaaaacta ttgaagagaa tgaaaaaaga aactgaaaat 4740 agaaaaataa aagttttttt taaaaatgga tgaagataag attagtatca atgattgtcc 4800 tatcaagatg ttttatacat gtttgtgtaa cactgtgttt aaaaaaacct aaatcaagag 4860 aattttggga aaagttattt gaaaactttg atacttcaaa tatatggaaa aatgtaagat 4920 caattttaaa aagtccagca ttggaaaact tagattttat gttaagacac aactgcataa 4980 tgacagagat tatctttaaa aagattgggg tatcacaaga tgatttgtgt aaagtgtgtt 5040 tggaaaaaaa ggaaggcgtg ttacacctat ttttaaattg taaaaagttg agtgatttta 5100 tgaagatgtt gaaaacaatg gtatgcaatt ttctgtatga tgaaaacatt attttagaag 5160 aatgggatac actgttttat ttggttttaa tgggaaaaca aaaaataagt ttgctcttaa 5220 ttatatgttg actcttgcaa gatatacaat atggaaaaga agaaatatta tgaaacaaaa 5280 gaaaaaagaa attccattgg ttttgttgta taaacagatt gtgactgagg aaataatggt 5340 tatatatgac tattgcaaaa tgtatgaaaa gatggacatt tttgaaaaat gtattagaaa 5400 aaataatcca tatattgtac aaacttggac tggttttaaa gttttcttac ctggagattt 5460 ttaaatattt tatcttttaa atatttgtat gtataaatgt catgattgtt gatgatgtat 5520 tttttaagaa agaaaaaaaa aaaaaaaa 5548 // ID Gypsy54-I_DR repbase; DNA; ZEB; 6215 BP. XX AC chr17; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy54-I_DR; KW Gypsy54-LTR_DR; Gypsy54_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1423-1423 (2008). XX DR Genome; chr17; Positions 10827573 10821359. XX CC Positions [2883-3386] - Reverse transcriptase CC Positions [5146-5376] - Integrase core CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 424..1650 FT /product="Gypsy54-I_DR_1p" FT /translation="MLDCEGLPNSHVSNASMEQQGVFSSTPFPNSQPQNLV FT AAMSDIISQVGQQIADSIVTRLNQTHSVETHPVIDSPKHVTTDNTSSQMLD FT LLQSQMSFSRNVKEPPSFRGKMSDTVDLNEWIDIVRDYIKRNNLRKEQQAE FT EIMVHLRGKARDVVKFGIRNSDVNIVHNPDAIFSILRKHFEATPCSPLPLA FT DFYTTLPGPDEGAYEYWLRLNRAVDVAADRLKEQGKVLDCPGTEVTRMFIR FT HCPCKELAITFRSKTMDQWTVQEVHNILNEYHSETSLVAAVRNTASIKIPV FT NRTEVELPTAALPPTSNVQQSSSLEVSALERVMNLLEKVLLKEANSNEPSR FT RSKPNVSLPRIKGLNDLPCTVCLSPDHSGLTHCREKKLCFQCHSPDHSRLG FT CPERVKSLSSHGPGN" FT CDS 1866..5204 FT /product="Gypsy54-I_DR_3p" FT /translation="MIDSGSMACTLSETAERQLLQHKPNLQKLPTNDVVII FT GCGGQRVIPKAMYELELTVYACTMVVPVLVVPGQTDDLILGSNAIKWLIQK FT MKETDGYWRLVSSPTNPDDAECHQFLSLLSNVERWKGGDMPDKVGTAKLRE FT KITLQPNHEHLVWAQLPVSTTVSIGSTVIIEPTLSKCRPRNVIIGRVITPM FT WGSRWVPFKIINPTEKVIILKKNTKIVDVFTCIAVEELSVPESIKCNVQSA FT ESPIVHVHSSEERNAVLDKLGLKDLHLEACEVSDEWKDRLLNLIEKYESTF FT SRSKMDCGEATDFVHKIHLVDEKPFRLPYRRVAPCYYDKLRTVLSEMELKG FT IIRKSQSEYASPLVLVWKKDGDLRICTDFRWLNAKTVKDAHPLPHQSDALA FT ALGGNVFFSTMDLTSGFYNVPLFEEHKKYTAFSSPFGLHEYNRLPQGLTNS FT PATFMRMMMSIFGGENFSSLLCYLDDLMVFAPNEHIALDRLELVFSRLSRH FT NLKLAPKKCKFLRRSVKFLGHVVTEDGVQTDSEKVKVISDIQIVDLMEADG FT ITPSQKKIRSFLGMILYYQHFIQDCSAKAKPLFNLLSNQAQNKARRKSGRL FT RKSLNVVKLSHEDWTSECNTAFETLKQDLLHSVTLAHPDFSHPFILSVDAS FT FDGIGAVLSQVPPGEKIARPVAFASKTLSKSQINYPAHRLEFLALKWAICD FT KFSHWLKGRHFTAWSDNNPLTYILTKPRLDACEQRWVAKLAAYEFNLKYVP FT GAKNVVADALSREPFVKSCVSHRLLKEPYISLLDEVNGVVIGTVQDAFRVS FT NHCQNVLACNNDQNENEMQDVNNYCSGSFGADEVSAALSAYSGDVVGLLPA FT ANPTLPQFSSEDPSVTIPLSRLSNLQEQDSILGRVINYVLRSRRPSKRERT FT MEPKSVTFLLKHWKKLKIRNHVLYRVKRDRLMNRKVFQYVVPASLRLDVLR FT GVHDASGHQGCARTLSLAAERFFWSGMSKDIKSYVKNCQRCIVGKTPEPDA FT RAPLERIRTSEPMELVCIDFWSAEQTSGKAVDVLVVTDHFSKMAHAFPCHN FT QSAKQVARRLWNDFFCVYGFPKRIHSDQGANFESKMIRSLLEMAGIQKSHT FT IPPNG" FT CDS 5200..6213 FT /product="Gypsy54-I_DR_2p" FT /translation="MGNGLAERFNRTLGNMIRALPVESKAKWPQLLQTLTF FT SYNCTVHETTGFAPFYLMFGRIPRLPIDIMFQHVLCDDRVVSHHEFVTTLR FT RDLSTAAVIARKHSLREQNRHTILYNRKVKGAPLVVGDRVLLANRALKGMK FT KVADKWDSVVYEVQSVRPEINVYRIKDSQTGREKVVHRNQLLPVSFLSWDV FT DEKESVLSNPTCATRGSPLDPALMENADSVSKTSEWLLRMDDSQEDDERDT FT NCGASDVHIEMTTSECSEIDQHSLEPVQSTKNLPVTESHESSSCVKVPENR FT TVSVICNQPTSTHASHITTRLGRPVRPPDRLICEIDQQRLVESDPN" XX SQ Sequence 6215 BP; 1827 A; 1272 C; 1363 G; 1753 T; 0 other; ttggtgccgt gactcgttgt ttcttcaatc aacatggcgt ctgaagcggc cgaggattca 60 cctaaaagtt tcattgggag tcaggtggac tgttttgctc aagaactgtg atcatttatt 120 tgaaaagtcg aaattttatt cactcagtga ctgcatttat cagattttac tattaaggac 180 aatttttctt ttttttatca gctgcatatt tgtggatttt aatgtttttt ttattaattt 240 tgggttaaga tttttgatca gtcattcaca atgagcaata ttgatggttt tcatacacct 300 gtatgttgaa gtaaaggtag aggatggggt ttgttaaaag tgccacagtc tttacaggtc 360 agtgagccta atgtacagca cgatgcacat gcaaatacag aaaatgattt accaaattta 420 gatatgttgg attgtgaggg cctacctaat agtcatgtgt caaatgcttc aatggagcaa 480 cagggcgtgt tttcatctac cccatttcct aactcacagc ctcagaattt agtcgcagca 540 atgtctgaca taattagtca agttggtcag caaattgctg atagtatagt gactagatta 600 aaccaaacac attcagttga gacacatcca gtgattgatt ccccaaaaca tgtgactaca 660 gataatacga gctcacaaat gctggactta ttacaaagcc agatgtcatt ttccagaaat 720 gtgaaagaac ctccaagctt cagaggtaaa atgtctgaca ctgttgactt aaatgagtgg 780 atagatatcg tcagggacta cattaagcgg aacaatctga ggaaagaaca gcaagctgag 840 gagattatgg tccatctgag aggcaaagca agagatgtgg ttaagtttgg aataaggaac 900 agtgatgtga acattgtgca taaccctgat gcaattttta gtattctcag gaaacatttt 960 gaagctactc cctgctctcc tctaccgctg gcagatttct ataccacact gcctggacca 1020 gatgagggtg cttatgagta ctggttgagg ttaaaccgtg cagttgatgt tgctgctgat 1080 cgcctaaagg aacagggaaa agtgctggat tgtccaggaa ctgaggtaac acgcatgttc 1140 atcagacatt gtccttgtaa agagcttgcc ataacttttc gatctaaaac gatggatcaa 1200 tggactgtac aagaagttca taatattctt aatgaatatc actctgaaac cagtcttgtc 1260 gctgcagtca gaaatacagc atccattaaa attcctgtta acagaactga agtggaactc 1320 ccaacagcag cacttccacc tacctctaat gttcagcagt ccagttcctt ggaagtttct 1380 gcattagagc gagtcatgaa tttactggaa aaagtcctgt taaaagaagc caacagcaat 1440 gagccaagtc gacgatcaaa gccaaatgtg agtttacccc gaataaaagg tttaaatgac 1500 ctaccatgca ctgtctgttt aagtcctgat cactctggcc tcacacactg tcgggaaaaa 1560 aagctgtgtt ttcagtgtca ctcacctgac cattcccgcc ttggttgtcc tgagagagta 1620 aaatctctgt catctcatgg gccaggaaac tagcagatct acactcaggg gaggatggtg 1680 tagatcataa aaacattcct cccaatcagt gttgtatgag tgacccggtc tcagtatata 1740 atcttactaa ttctgttgag cctgcaggaa aaattatcat ttatcagaac acgcacaaag 1800 tacagaaaat tgatagtctg ttttacacac ctgctacgat atggagtgac ctttaactga 1860 aagctatgat tgacagtggt tccatggcat gtacactgag cgaaactgct gaacgtcagc 1920 tgttgcagca caaacccaac ttacaaaaac tccctaccaa cgatgttgtc attatcggtt 1980 gtggcggaca acgtgtcatc ccaaaagcaa tgtatgaact tgaacttaca gtttatgcct 2040 gtacaatggt tgttccagtt ttagttgtcc caggccaaac tgatgattta attctgggta 2100 gcaatgctat caaatggcta attcaaaaaa tgaaggaaac agatggatac tggagacttg 2160 tttccagccc aactaaccct gacgatgctg aatgtcacca gtttctgtcc ctgctttcaa 2220 atgttgaacg ctggaaagga ggagacatgc cagacaaggt gggcacagca aagttaaggg 2280 agaaaataac tctacaacca aatcatgaac atctggtttg ggctcagctt cctgtatcaa 2340 ccactgtttc tatagggagc acagttatca ttgaaccaac cctatctaaa tgccgtccaa 2400 gaaatgttat catagggaga gtcatcacac ctatgtgggg cagtagatgg gtgccattca 2460 aaataattaa tccaacagag aaagtgatca tccttaagaa gaatactaaa atcgtggatg 2520 ttttcacttg tatagctgtt gaagagttgt ctgtacccga gtccattaag tgcaatgttc 2580 agtctgctga gagtcccata gtccatgtgc attccagtga ggaaagaaat gctgtgctgg 2640 ataagttggg attgaaagat ctacatctgg aagcttgtga agtatctgat gagtggaagg 2700 acagacttct gaatctcatt gagaaatatg agtcaacatt ctcacgtagt aaaatggact 2760 gtggagaggc cactgacttt gtccacaaaa ttcatctggt cgacgaaaag ccatttagac 2820 tcccttatcg ccgtgtggct ccctgctact atgataagct gcgtactgtc ttgagtgaaa 2880 tggaattaaa gggcattata aggaagtctc agagtgaata tgcatctcca ctggttttag 2940 tatggaaaaa ggatggtgac ctccgtattt gtactgattt caggtggctg aatgcaaaga 3000 ccgtaaagga tgcccaccca ttaccacacc agtcggatgc actcgcagct ctgggaggta 3060 atgttttctt ttctaccatg gacttgacct caggcttcta taatgttcct ttgtttgaag 3120 aacataagaa gtacaccgct ttctcgtctc cttttggcct ccatgaatat aaccgtttgc 3180 ctcagggcct gactaacagt ccagcaacat tcatgaggat gatgatgtcc attttcggtg 3240 gtgaaaactt ctctagcctc ctttgttatt tggacgattt gatggttttt gctccaaatg 3300 agcatatagc cctagacaga ctagaacttg tcttttcacg cttgtctagg cacaatttga 3360 aactagctcc aaaaaagtgc aagttcctca gaaggtccgt aaaattcttg ggacacgtcg 3420 tcactgagga tggtgttcag actgactctg aaaaggtgaa agtaatcagt gatattcaaa 3480 ttgttgatct tatggaagca gatggtataa ctccttctca aaagaaaatc aggtctttct 3540 tgggcatgat tttatactat cagcatttca tacaggactg ttctgctaaa gcaaaacctc 3600 tttttaatct cctatcaaat caggctcaaa ataaagctcg caggaagtct ggacggttga 3660 ggaagtcact caacgtagtt aagctttcac atgaggattg gactagtgaa tgcaacacag 3720 cttttgaaac tttgaagcag gacctacttc acagcgttac tcttgcacac cctgatttca 3780 gtcatccctt cattctctct gttgacgctt catttgatgg aataggagct gtgttgtccc 3840 aagtgccacc tggtgagaaa attgcacgtc cagttgcttt tgcaagtaaa acactttcta 3900 agtcccagat aaactaccca gcgcatagac tcgaattcct ggctttaaag tgggctattt 3960 gcgacaagtt tagccattgg ctaaaaggaa gacatttcac agcatggtcc gataataacc 4020 ctttgaccta cattctaact aaacccagat tggatgcgtg tgaacagcga tgggtggcaa 4080 agcttgctgc atacgaattc aatctaaagt atgtaccagg tgcaaagaac gttgtagcag 4140 atgctttaag tagagagccc tttgtcaagt cttgtgtcag ccatcgtctg ttgaaagaac 4200 catacatttc tcttctggat gaggtcaatg gtgttgttat tggtaccgtg caagatgcct 4260 ttcgtgtttc caaccactgt caaaatgtgc tagcttgcaa caacgatcag aatgaaaatg 4320 aaatgcagga tgtgaacaac tattgctctg gatcttttgg agctgatgaa gtttctgctg 4380 cattaagtgc atattctggg gatgtggttg gtcttttacc agcagccaat ccaacgttac 4440 cccaattttc ttctgaagat ccatctgtaa caatcccact ttctagactt tccaacctcc 4500 aagagcagga cagtatactt ggaagagtca ttaactatgt tctgagaagt aggcgtccct 4560 caaagagaga aaggacaatg gaacctaaat ctgtgacatt cttgttaaag cactggaaaa 4620 agcttaaaat tcgtaatcat gtgctgtaca gagtaaaaag ggatcggcta atgaatagaa 4680 aagtctttca gtatgttgta cctgcttcct tgagacttga cgttctgcgt ggcgttcatg 4740 atgcctctgg tcaccagggg tgtgcgagaa ctctctctct tgcagctgaa agattctttt 4800 ggtcagggat gagtaaggat attaaatcat atgtaaagaa ctgtcaaaga tgtattgtcg 4860 ggaaaactcc agagccagat gctcgtgcac ctcttgagcg aattcgaacc tctgagccta 4920 tggaacttgt ctgtattgac ttttggagtg ctgaacaaac ttcaggaaag gctgtcgatg 4980 tcttagttgt gacagatcat ttctcaaaaa tggcacacgc cttcccgtgt cacaaccagt 5040 ctgcgaagca agttgcccgt cgcctttgga atgatttctt ttgtgtttat ggattcccaa 5100 agagaataca ttcggaccaa ggtgcgaatt ttgaaagcaa aatgatcagg agcctgcttg 5160 aaatggctgg tatccaaaaa tcgcacacca taccacccaa tgggtaatgg tttagcagaa 5220 cgttttaata gaactctggg caatatgata agagcacttc cagttgaatc caaggcaaag 5280 tggccgcaac ttctccagac tttaaccttc agctacaatt gtactgtaca tgaaacgact 5340 gggtttgcac ccttttatct catgtttggg cgtattccaa ggctgccaat cgacattatg 5400 tttcagcatg tcctttgtga tgacagagtg gtcagccatc atgagtttgt caccacactg 5460 aggagagact tgagcacagc tgctgtaatt gcccgaaagc acagcctcag agaacagaac 5520 cgccatacca tcctttacaa tcggaaggtc aaaggtgcac ccttagtagt gggagataga 5580 gttctgcttg caaatcgtgc attgaaaggc atgaagaagg ttgcagacaa atgggactct 5640 gttgtgtacg aagtgcagtc agtgagacca gagataaatg tgtacaggat taaggattct 5700 caaactggca gagaaaaagt tgtgcacagg aatcagttgc ttccagtgag tttcctctct 5760 tgggatgttg atgaaaagga gtctgttttg tcaaatccca cttgcgctac tcgtggcagt 5820 cctctagatc ctgctctgat ggagaatgcc gattctgttt caaagacatc agaatggttg 5880 ttaagaatgg atgattcaca ggaagatgat gaaagagata ccaactgtgg agcttctgat 5940 gtccacattg aaatgactac ttcagaatgt tctgaaattg accaacacag tcttgaacct 6000 gttcaaagta caaagaactt acctgttact gagtcgcatg agtcctcaag ctgtgtcaaa 6060 gtccctgaaa acagaactgt ttctgtcatt tgcaatcagc ccactagcac acacgcaagc 6120 cacataacca ctcgcttagg gagaccagtt agaccacctg acagactgat ttgtgaaatt 6180 gatcagcaaa gacttgtaga gagtgatcca aacag 6215 // ID DNA13TA1_DR repbase; DNA; ZEB; 318 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE DNA13TA1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA13TA1_DR; KW DNA13TA1a_DR; Dr000073; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-318 RA Bao Z.; RT "Dr000073, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-318 RA Kapitonov V.V. and Jurka J.; RT "DNA13TA1_DR, an ancient family of nonautonomous DNA transposons RT from zebrafish."; RL Repbase Reports 2(6), 2-2 (2002). XX DR [2] (Consensus) XX CC This element is characterized by 13-bp terminal inverted repeats CC and the TA targets site duplications. CC Its classification is not certain yet, although it CC is expected to be a member of the Mariner/Tc1 superfamily. CC There are several thousand copies of DNA13TA1_DR present in the CC zebrafish genome; they are ~85% identical with the consensus CC sequence. CC Unclassified version of DNA13TA1_DR was identified as Dr000073 CC [1]. XX SQ Sequence 318 BP; 84 A; 64 C; 55 G; 115 T; 0 other; ttgaccttat tcttcatgta caagcgggca cagccattgg aatctttttg gctcgagact 60 tccggtctca tttacttcca ttgattttta gacgttaaaa acagcttgtt atgctgcttg 120 atgttgcaaa ctgatatttt cttattatat tattctactt tttatgtata gtcatgaaca 180 cacttgtttg tagagcaagt agtttgaccg ttttctgccg tttattattc ctagtcattt 240 ctcccatagg caactgaatc ggaagttcta aaacaatcgc aaaaacgagc gcacttccgc 300 attgcagaat aaggtcaa 318 // ID Copia2-I_DR repbase; DNA; ZEB; 2248 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia2_DR; KW Copia2-LTR_DR; Copia2-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2248 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1586-1586 (2008). XX DR Genome; chr18; Positions 31970791 31973038. XX CC 'CATAG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 32..2248 FT /product="Copia2-I_DR_1p" FT /translation="MSITIVRPTTKEGRLALKRGLAQLRQQGLEPRAFAQF FT FWTMARGLDYDDCALKKAFNRSLDDPLPEWEMEQLQILNFWDFSDYVHYRK FT DWQILNPPDDVSSDHPTIPPSSNQNHHHLPAPTVKRKIKKQVTQVAACPRV FT SSARIPIVESTESTTIASGETKSLELNLTSREVSQSTSEFMSVKSDPVTLQ FT SVMSSVDSSLFVPAQATQDVKDPSTSRPGRRTGRRARAREARALRLKTVSA FT PAPERPPVSAPAPERPPVSAPAPERPPVSAPAPERPPVSAPAPERPPVSAP FT APERPPVSAPAPERPPVSAPAPERSSVAVPFRPLALPAPHRCLALPAPAWL FT PDQPAPHGPPDQPAPHGPPDQPAPALPAPLRPPVQPAPALPAPLRPPVQMA FT PALPAPHRPPAQLAPHRPPALPAPHRPQVHLHSSIPVPLKTPVLPIPVDPL FT CIVPPAPPWSSTWTKDPSKLAPNSVYVAVKDINSLAPMRIPATLEGIMDPY FT PFPASSALGCSSDSATQPLHVASDEISDSTHVTVPATPETLTELTSNPVSA FT ALEGISDKDPEPVLTALEYASDLAPVPVPTALEVFTDLAPQPALPVQVGSK FT SILKPVPAALKGKPKFNPDPRSRPKVFHTKRLNVSKGASTFPIPPIPPSLL FT TSSSGFQRSAPPWLPQNNFPGLFPPRPAAPPVFPVYPGLAPHSPSQLLPPV FT FPALSSPSSFLSHPLSQLLLHPLPLPRLDAWRRPFGGGY" XX SQ Sequence 2248 BP; 452 A; 794 C; 493 G; 509 T; 0 other; gtacgaaccg gccaaacaag aacccagcag aatgagcatc actattgtcc gtcccaccac 60 taaggaaggg agattggccc tcaagcgggg actggctcag cttcgccagc agggtctgga 120 acctcgggca tttgcccaat ttttctggac catggccagg ggcctcgact atgatgactg 180 tgctctcaag aaggccttta accgctccct agatgaccct ttgccagagt gggagatgga 240 gcaattacaa attctaaatt tctgggactt ctctgattat gttcattatc gtaaggactg 300 gcaaatccta aatcctcctg atgatgttag cagtgaccac cctaccatcc ctccctcctc 360 aaatcaaaac caccatcatc ttccggctcc cactgtgaaa cggaaaatca agaagcaggt 420 tacccaagtt gctgcctgcc ccagggtgtc ctctgcacgg attcctattg ttgagtcaac 480 tgaatccacc acaattgcct ctggggaaac caagtccctt gaactcaatt taacttccag 540 agaagtttcc cagtcaacgt ctgaattcat gtctgttaag tctgacccag ttacattgca 600 gtcagtcatg tccagtgttg atagctcttt gtttgtgcct gcccaagcaa cccaagatgt 660 gaaagacccg tcaacctccc gccctgggag gcgtacgggt aggagagccc gtgccaggga 720 ggccagagcc cttcgcttga agacagtgtc ggctccagcc ccagagcgcc cgccagtgtc 780 ggctccagcc ccagagcgcc cgccagtgtc agctccagcc ccagagcgcc cgccagtgtc 840 ggctccagcc ccagagcgcc cgccagtgtc ggctccagcc ccagagcgcc cgccagtgtc 900 ggctccagcc ccagagcgcc cgccagtgtc ggctccagcc ccagagcgcc cgccagtgtc 960 ggctccagcc ccagagcgca gctcagtggc agtcccgttc cgaccccttg ccctaccggc 1020 accacacaga tgcctagccc tgccggcccc agcctggctc ccagaccagc cggctcccca 1080 cgggcctcct gaccagccgg ctccccacgg gcctcctgac cagccggctc cagccctgcc 1140 ggctcccctc aggcctcctg tccagccggc tccagccctg ccggctcccc tcaggcctcc 1200 tgtccagatg gctccagccc tgccggctcc ccacaggcct ccagcccagc tggctcccca 1260 caggcctcca gccttgccgg ctcctcacag gcctcaagtt catttgcact cttccattcc 1320 tgtcccactc aagacccctg tcctacctat tcctgtggat cctctctgta ttgttcctcc 1380 ggctccaccc tggtcctcaa cctggactaa agatccttca aagttagccc ctaactcagt 1440 ttatgtagcc gtgaaggata taaatagttt agctcctatg cggattcctg caaccctgga 1500 ggggatcatg gacccatacc cttttccagc ttcatcagca ctggggtgct cctcagattc 1560 tgccactcag cctttacacg tggcttcaga tgagatctct gattcaaccc atgtaactgt 1620 ccctgcaact ccagaaaccc ttacagaatt gacctctaat ccagtatctg cagccctcga 1680 gggtatatca gataaagacc ctgagcctgt cctgactgcc ttggagtacg cctcggattt 1740 agcccctgtt ccagttccca cagcattgga ggtctttaca gatttggcac ctcagccagc 1800 tttaccagtc caagtgggct ctaagtcaat cctgaagcct gtgcctgcag ctctgaaggg 1860 caaacccaaa tttaacccag accctaggtc ccgtcccaag gttttccaca ctaagagact 1920 gaatgtatct aaaggagcct cgacttttcc tatccccccc atccccccct ctttgttgac 1980 aagctctagt ggtttccaaa gatctgcacc tccctggttg ccccaaaaca acttcccagg 2040 actgtttcct ccccgtccag cagcacctcc agtttttcca gtctacccag gtcttgctcc 2100 ccactctccc tcccagttgt taccccctgt gtttccagct ctctccagtc cttcaagctt 2160 tctttcccac cccctttccc agttgttgtt acatcccttg cctctgccaa gattggacgc 2220 ctggaggcgt ccgttcgggg gagggtac 2248 // ID HATN12_DR repbase; DNA; ZEB; 579 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN12_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000096; HATN12_DR; nonautonomous DNA transposon; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 7-389 RA Bao Z.; RT "Dr000096, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-579 RA Kapitonov V.V. and Jurka J.; RT "HATN12_DR, an ancient family of nonautonomous hAT-like DNA RT transposons from zebrafish."; RL Repbase Reports 2(6), 11-11 (2002). XX DR [2] (Consensus) XX CC HATN12_DR is a family of nonautonomous DNA transposons that CC belongs CC to the hAT superfamily [2]. CC More than 10,000 copies of HATN12_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~16% divergent from the consensus sequence. CC HATN12_DR is characterized by 8-bp target site duplications and CC perfect 16-bp terminal inverted repeats. XX SQ Sequence 579 BP; 205 A; 82 C; 91 G; 201 T; 0 other; tagggctgca caatattgga aaaatctgat attgcaatat tttatttttc tgcaataaat 60 attgcgatat taatacagtt tcacaagatg gtttgaatag ctctatttga cagttttctg 120 gggagtctaa cagtattcag gtacagaaat tgaataatca caatgcaaaa aaaacttttc 180 ttttcttact ttgctttgtc tcgttttagt ccaaatatca aaaaaattct tagatcaagt 240 aaaaatattg ttttgtttca acttcagaag aaataagtca aaattaagag tttttcctta 300 aaacaagcaa aattatctgc cagtggggta agtaaaataa tcttgttttc gctttgaaat 360 gtagatattt ggactagaaa caagacaaaa attctaagta gaaaagtatt ttttgcataa 420 attaatcgta taaattctgt ataaatacag taattaaata caattctgta gctcctggtc 480 aactataatt cagacttaac attgcatatc ttgcgatgtg actattgcgg atgcgcacat 540 tgcgatatcg atgctgaaac gatatattgt gcagcccta 579 // ID Copia2-LTR_DR repbase; DNA; ZEB; 1150 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia2_DR; KW Copia2-I_DR; Copia2-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1150 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1587-1587 (2008). XX DR Genome; chr18; Positions 31969641 31970790. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1150 BP; 176 A; 267 C; 174 G; 533 T; 0 other; tgtcaggatc cagcccttac agccctgtca ctcctgtctg tgtttaatgt ctctttgttt 60 gttttgtgta ttagcacatg gttatctctt ttgttgtgtt ggccatgtgc tcctcttgtc 120 ctgcctcctt gttttcctat gcctcgcccc cttgtttagt ccttgtttgt ctaattatgt 180 tcacctgttc cttgtgctat ctactccctt tatattgagt tcattttcca cttgtctttg 240 ccggttcgtc atactatctt tcgtgtgctt gtctagctta tcttgtttag tgcatgatag 300 aggttttttc catatagtct agttttgttt cctttgtcta gtctagtaca agtctttttg 360 atctttatct tgaattaatt tatccttttt gattcggtta atttatttat ttattttatt 420 tattttttct ttcaataagg ttgctgtcat tttcaacttt atttctgttt gcagctcaag 480 attaacttct agttttcatt ttgctatctt tgtgctcctt agcctcagtc tagccctagt 540 ttttagataa ttctttattt tttaaattcc tgttcctgtg ttttgcattc tgttgtattt 600 ctagttatag cttatttctt taagcctttt gtttaagttt cagtagtttc atttagttcc 660 ttatttgctt catttttgtt ttttgagctc ttatttcccc tcttaatttt ttctcatatt 720 tacttagttc tagttttttt tttttttttg cttgctcttc tttttaattt ttttgtttcc 780 ccagccgtct agtcctgttg tgttgcttgt ctcacttccc tcctgagtca tcgctggttg 840 tggatcccct tcccttgatg atcgatctgt ttgccacccc tgtccttaac cagaggttcg 900 gctagctacc ttttgactgc ttggcctgaa atttggaatt cccctttcct gaccagcctg 960 tgttaccctc ttaagacact tccctctgta tcatcctgaa gtcctgagtc tcctcccctc 1020 accagcctgt tctcctgaaa tcgtcctgca ctattgactg tcctcctgca tagccttgtt 1080 tacctcctga aaataaacct tgtgttctct tgtcaattct gcatttgggt cctacttgcc 1140 atccgtgaca 1150 // ID DNA-X-2B_DR repbase; DNA; ZEB; 439 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE This is a nonautonomous DNA transposon, - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-2_DR; DNA-X-2B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-439 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1250-1250 (2008). XX DR [1] (Consensus) XX CC This element is another version of DNA-X-2_DR. TSD is CC unidentified. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 439 BP; 119 A; 108 C; 83 G; 129 T; 0 other; caaagtccct ttaaggcaag tcatttcact cggcggccat ctttgaaaca cctctcgggc 60 agtatgctcg ggcattctgt ctgaatgggg aaacatcaaa ttctccaaaa ctgcttgcca 120 agcttacgat taaatttcat attaggaatc accaataaaa ttaaacaaca actgtctctt 180 tagtttcatt tctaaacgtc caaatcacac aaaatctgca gaaactcacg tctggtccga 240 gcccctcccc cggagtatcg tcagtctata gcgatcgatg attggctcct gtactagaag 300 gcgggcttta ttcgccatat agcgatcgat gattggctcc tgtactagta ggcggggctt 360 tattcaccat attgaccgtt acacttttcc ccattcaaaa agatacgagt gacatgtctt 420 gtgtattcta tagtctttg 439 // ID DNA-X-4_DR repbase; DNA; ZEB; 1245 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE A nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1245 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1253-1253 (2008). XX DR [1] (Consensus) XX CC Copies are ~87% identical to the consensus. The consensus CC contains imperfect subterminal inverted repeats (38-71, CC 1195-1228), and the TSD is not characterized. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1245 BP; 341 A; 241 C; 259 G; 402 T; 2 other; caacagttct gtctggttct cgaatctgat tggctgatag ccgtgcgata ttctgcaata 60 tcagaactcg tacagcctcc tcacccttgt gtattactcc gcccacatag agtgacaagc 120 agatsaataa actcactaca gtttgacaaa tattgcagct gttggacaac ataatgtact 180 tttgaggctt ttttaggcga gaatgtagtt gtttagattg caactatgca gtttatttat 240 aaggatagtg cctattttaa atatttataa tttcggagat acagcgcgtc ggcatgccat 300 tagcctgtca tactgagcag agcaaagacg gttgacgttg tccatccaca agatggcgac 360 agagaccgca taataagccc ttagaggaga aaaactcagc gtaaaatttc aaactacagc 420 tgatcaaatc attattaaac tggtaagtga cattctaagt cgatctctct cttttgtatg 480 ttgtagtgct gtatttatac catagtaatt gtagtgtatt gagtgtgaga tgggactcgc 540 atatcaggaa tgtatgtatt ttgtgttggc cactctttgt ataaccctga gttacttttt 600 ggttggccat ctgtttgttt ctaatgagtc tcctgttttc gttactgcag actagttcag 660 taaaaagaaa gaaagaagaa atgcccgcat gtgtttagcg tttttcctta tcgcaaacac 720 aacagtaatc tggtagtgtt tgcgttgctt tggctttttc ggggttaatt attgtgatat 780 cccgattgca acagagaaat actgggaaat crtctgtatc agactgatgg catttcatgt 840 tttcatgctg ttcagcctta taatcttaaa atgtgagcaa aatacacctg ttttgtcatc 900 actttagaca ttacgctaga gaatcattca aatactagct ctaaagtgac gttggtgaag 960 tagcaacggt ttctgctgtt ctgacgtcag ctgcagatgt gaatgaatgg cggaagaaag 1020 tagttcctca tacaaaagga tttttagact ctccgtgttt gattttcttt tttatataca 1080 cgattatgcc gtcaaactgt tgtataatat gcaatatcac actcgtagca gtgcgatatg 1140 gctgtatatc gtcactggtg ggacactaag gcactctgcc aacgcacgcc tcccaccagt 1200 gccgatatac agccatatcg cactgctact cgtgtgatat tgctc 1245 // ID HATN13_DR repbase; DNA; ZEB; 1119 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN13_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000152; Dr000342; HATN13_DR; nonautonomous DNA transposon; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 832-1 RA Bao Z.; RT "Dr000342, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 152-1119 RA Bao Z.; RT "Dr000152, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [3] RP 1-1119 RA Kapitonov V.V. and Jurka J.; RT "HATN13_DR, an ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 12-12 (2002). XX DR [3] (Consensus) XX CC HATN13_DR is a family of nonautonomous DNA transposons that CC belongs CC to the hAT superfamily [3]. CC Several thousand copies of HATN13_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~13% divergent from the consensus sequence. CC This element is characterized by 8-bp target site duplications CC and CC perfect 15-bp terminal inverted repeats. XX SQ Sequence 1119 BP; 369 A; 202 C; 202 G; 346 T; 0 other; tagggctgca cgattaatcg aaaaaagatc gcgatctcga ttcgacccta cacacgatct 60 taattcagct tttctacgat tcagccaatt atattttcaa gttcaggaga gaagcataaa 120 ggcggccgca caagtcttca cattgtttta tatacgttgc tcagcaacat ggacacctcc 180 aaatggtgtt aaaagtgtca catactgtat ttataaggta taattcaccc aaaaatgtca 240 tttctgtctt catgtaatga tgtattcgcg gccgcaaaat cacacgtgag ctgacgcact 300 gtttgtttac tacagagagg acgcgcgtgc acgccaatga tgtctacttt agtgaacaga 360 acctgcatag gctgtgaata acaggtaata aagttgtaaa taataacgtt cagtttgttg 420 cacagagtga tcgtttggga gccgcgcggg aagtatgatt tgcatgtatg tttttttttc 480 aaagtgacgg cagccatgac atgagcgcac tcagcagtga aagtgaaacc gaaagcgcgc 540 acatcattta aatgatcata tcgcatttta gagttatgat tacgacaagc atttgatatg 600 tttttttctt tcatagcgaa cacaaagtgt tgtgcatgaa aataaatgtt tacagtgtca 660 gtataactac tctagcctat ataatgttga atataatgca agagggaatc tgataatgac 720 aaatgtctca ttttaccagt gaataaaatg gtctaataat gttgtcaatg acagatgaca 780 agcatatgtc tcaaacataa taattcaact tcagaattat gaatagttgt attctgatgt 840 catcacttta ctgtgaatta acaaccagga catatttaaa gtctgttcaa gtccaccatt 900 caaagtctat acaaaattta gcattttaac caacagtaga cctacacata ggcctaatat 960 tattattgtt gttttaccat atgtttgaga aataacaata ataatagcct actcatatta 1020 acctttattt tacatctaca gaatcgtgag aaaatcgtga tcttgatttt aagcaaaaaa 1080 aattgtgatt ctcattttag ccagaatcgt gcagcccta 1119 // ID Gypsy153-I_DR repbase; DNA; ZEB; 4399 BP. XX AC chr14; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy153-I_DR; KW Gypsy153-LTR_DR; Gypsy153_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4399 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1567-1567 (2008). XX DR Genome; chr14; Positions 10039563 10035165. XX CC Positions [3188-3697] - Integrase core CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 204..1205 FT /product="Gypsy153-I_DR_2p" FT /translation="MATVFGNMCPFDESSEQWSSYTERFEYFVAANSIADD FT KLVPTFLSVIGPKTFTLLRNLLQPEKPGEKTYEQIVNTLKAHFSPKPLVIA FT ERFRFHRRNQLEGETVTMFVAVLKKLAEHCEFGDTLNDTLRDRLVCGLRCE FT GIQKRLLTESNLTLQRAIELSVAMELAAKEAQQLSSNSKVYKMVTEKQTEM FT KGPCFRCGKTGHTPVTCWFKDMECRSCKKKGHIERACRNKTEKGKMQHKTT FT FRRKQKKQVFTVDQENKKNTDTSDDEFSLHVLTVAGGNKSYKVTALVEGQP FT VKMEIDTAAAVSLVSDVVYREILSHLPLKPPDVPLKLTLANQ" FT CDS 1211..4129 FT /product="Gypsy153-I_DR_1p" FT /translation="MKGFLQVTVELNKQIKKLPLYVVSGDYPSLMGRSWLE FT QLQVNWQVFHMMTSNTLDLEDVLQKNSDVFKKELGSMQGIKVKLTVEPECQ FT PKFLKARPLPYALKPKVEASLTELVKDGVLEPVSVSKWATPIVPVLKKDGS FT IRICGDFKVTVNPVLSVEQYPLPLINDIFAGLTGGQKFSKIDLNQAYLQMH FT VEEQSRELLTINTHKGLFRYQRLPFGITSAPSLFQRAMDQILAGLPGVVCY FT LDDILVTGTDDESHLQHLDTTLKRLKEYGLRVRKDKCEFFQSAVEYLGHVI FT DASGLHTSPAKVKAIVDAPTPKNVTQLRSFLGLLNYYGRFIPNIATLLKPL FT HKLLCQENSWRWTTECQETFQRAKETMLKSEVLTHFDPSLPIQLACDASPY FT GVGAVLSHIMPNGQEKPIAFASRSLSKAEANYAQIEREALSIVFGVRKFYQ FT YIFGRKFTLLTDHRPLTAIFGPYHGIPCLAASRMQRWALLLSAHTYDIKYR FT KSELHRNADGLSRLPLADQVKETKVAEIFYFSQVERTPITAAQVRKGTRND FT PVLSKVMDIVMTGKDENHDLELKPYLTRRHELSVQTGCLLWGRRVIIPPTL FT RKSVLKQLHAGHFGMVRMKEIARSYFWWPGVDREIEEKARTCTSCQSIRNV FT PQLAPLHPWEYPEKPWHRIHADFAGPVEDKMPLVVIDAHSKWPEVAIMKSR FT SAEKTIEKLGEIFSRFGPPVQFVSDNGPQFISHEMGTFLLANGVQHIKSSP FT YHPATNGLAERFVQTMKHALKASIGQGTLHQRLHNFLLCYRSTPHATTKVS FT PAYLLFNRELRTSFELLKPATLKETVLQQQENQVQRRKLRAKDRSFLIGSS FT VLARNYGTGPKWVPATVETQTGPVSYRVKTAGNLLWRRHTDQLLSGASTFT FT DLPEGTDVCDSVGSSPASATVTTEPSKITDSPPTKELSPPVVSTENGDQVS FT RRYPIRERRQPQRFLDYT" XX SQ Sequence 4399 BP; 1360 A; 929 C; 1041 G; 1069 T; 0 other; cggatataat aaagtggcga cgaggattga agcacttgcg cgtgtgttat gacaacgcgg 60 aagtgaagca gcgtgagtga cacgagagcg agagagaaaa aaaaagtacg agccggctgc 120 tttcacgtct tctaaaggga gaggaccgac agcgattcag ataagtgact gaattgattg 180 ttgatgaccg aaaacgggac aaaatggcaa cagtttttgg caatatgtgt ccgtttgatg 240 aaagcagtga acagtggagt tcttatacag aacgatttga gtactttgtg gcagcaaatt 300 caattgctga tgataaattg gtacccacct ttttaagtgt gataggtcca aagacattca 360 cactactgcg taacctccta caaccagaga aaccggggga gaaaacgtat gagcaaatag 420 taaacactct caaagctcat ttctccccga agccattagt gatagcagag agatttcggt 480 ttcaccgacg aaatcagcta gaaggagaaa ctgttacaat gtttgtggca gtgttaaaaa 540 aattagctga acattgcgaa tttggtgata cattaaatga cacactacgg gacagattag 600 tgtgtggact tcgctgtgag ggaattcaaa aaaggctatt aacagagagc aacttgacac 660 tacaaagagc gattgaactg agtgtggcta tggaattggc tgctaaagaa gctcagcaac 720 tgagctcaaa cagtaaagtg tataaaatgg tcacggaaaa acaaacagaa atgaagggcc 780 cttgttttcg ttgtggaaaa actggacata cacctgtcac gtgttggttt aaggacatgg 840 agtgccgcag ctgtaaaaag aaaggacaca tagagcgtgc atgtagaaat aaaacagaaa 900 aagggaaaat gcagcacaaa acaacattta gaagaaaaca aaagaaacaa gtattcacag 960 ttgaccaaga gaacaagaag aatacagaca catcagatga tgaattttct ttgcatgtgc 1020 tcaccgtggc agggggaaac aagtcataca aggtcacagc gctggtggaa ggacaaccag 1080 tgaagatgga gattgacacc gctgctgcag tgtcgcttgt gtcagatgtg gtataccgtg 1140 aaattctgag ccacctgcca ctaaaaccac ccgatgtgcc cctaaaactt acactggcga 1200 atcagtgacc atgaaagggt ttctccaggt taccgttgaa ctaaacaaac aaataaaaaa 1260 gctgccattg tatgtggttt cgggtgatta cccctcactg atgggacgtt cctggctgga 1320 acaactgcag gtgaactggc aggtctttca catgatgaca tctaatacat tggatttaga 1380 ggatgttctt cagaaaaaca gtgacgtttt caaaaaggag ttaggcagca tgcaggggat 1440 caaagtgaaa ttgaccgttg agccggaatg ccagccaaaa ttcttaaagg cccgaccttt 1500 gccatatgca ctcaaaccca aagtggaggc aagcttaact gaacttgtaa aagatggtgt 1560 cctggaacct gtcagtgtaa gcaagtgggc aacacctatt gttcccgttt taaagaaaga 1620 cggtagcata agaatttgcg gagattttaa agtcactgtg aacccagtgt tgtcggtgga 1680 acaatacccc ttaccgctca taaatgacat ctttgcagga ctgactggag gacaaaaatt 1740 cagtaaaatt gatctcaacc aggcatacct gcaaatgcat gttgaggagc agtccagaga 1800 actattgacc ataaatacac acaagggcct atttcggtat cagcgcttac catttggaat 1860 cacttcggct ccgtccctgt ttcagcgagc gatggaccag attctggcag gactcccggg 1920 agtagtatgt tacttagacg acattcttgt gactggtact gacgatgaat cacacctgca 1980 gcatctggac actaccctaa aaagactgaa agagtatgga ttgagagtgc ggaaggacaa 2040 atgtgaattt ttccagtctg cagttgagta cctaggccat gtcattgatg cctctggcct 2100 gcatacgtcc ccagcaaagg ttaaggccat agtggatgct ccgacaccta aaaatgtcac 2160 ccaactgcgt tcattcctcg gcttgcttaa ttattatgga cgttttattc caaacatcgc 2220 tactctgttg aaacccctgc acaaattgtt gtgtcaggag aatagctgga ggtggaccac 2280 tgaatgtcag gagactttcc aaagagcaaa ggagaccatg ctgaagtcag aagtcctcac 2340 tcattttgat ccttctctgc caattcagct ggcctgcgac gcctccccat atggtgtggg 2400 agctgtcttg tcccacatca tgccgaacgg tcaagagaaa ccgatcgcat ttgcttctag 2460 aagtctgagc aaagccgaag caaattatgc tcaaattgaa cgtgaagcat tgagcattgt 2520 gtttggagtg cgcaagtttt accagtatat tttcggaagg aagtttactc tgctgaccga 2580 tcatcgccca ctgacagcga tttttggtcc ttatcatggc atcccctgtc ttgctgctag 2640 taggatgcaa agatgggctt tgttattgtc tgctcatacc tatgacatca aataccggaa 2700 gtcagaattg cacagaaacg cagatggact gtccagattg cctctcgctg accaagtaaa 2760 agaaacaaaa gtagctgaaa ttttctactt tagccaagtg gaaagaaccc ctataacagc 2820 tgcacaagtg cgcaagggca cacgtaatga tcctgtcttg tccaaggtca tggatatcgt 2880 catgacagga aaagatgaga accatgacct agagctaaaa ccatatctta ctcgacgtca 2940 tgaactctca gtgcagacag gctgtttatt gtggggaagg agagtgatca ttccaccaac 3000 attgcgcaaa tcagtgttaa aacagctgca tgcgggacac tttggaatgg tccgcatgaa 3060 ggagattgcc aggagttact tctggtggcc aggagtggat cgagaaattg aagagaaggc 3120 aagaacatgt acgtcatgtc aaagcatacg caacgtacca caactggcac cgctgcaccc 3180 ctgggaatat ccagagaaac cgtggcatcg cattcatgct gattttgcag gtccagtcga 3240 ggacaaaatg ccgctggtcg tcattgatgc acacagtaag tggcctgaag tagccataat 3300 gaaatctagg tcagcagaaa agaccattga gaagttggga gaaattttca gtagatttgg 3360 accaccagtg cagtttgttt cagacaatgg acctcaattt atttcgcatg aaatgggcac 3420 atttctactt gcaaatggag tgcaacatat caagtcatca ccctaccatc ctgcaacaaa 3480 tgggcttgcg gagaggttcg tgcagaccat gaaacatgct ttaaaagctt caattggaca 3540 aggcacattg catcaacgcc ttcataactt cttgctgtgc tatcgcagta ctccacacgc 3600 aactaccaaa gtgtcaccag cctacttgtt gtttaatcga gagctcagga caagctttga 3660 gctacttaaa ccagcaacac ttaaagaaac ggtcctacaa caacaagaaa atcaggtaca 3720 gcgacggaaa ctgcgagcca aagacagatc ctttttgata ggctcatcgg tgctggcccg 3780 gaactatggc accggtccaa aatgggttcc agctactgtg gaaacccaga cgggtcccgt 3840 gtcctacagg gttaaaactg ctggtaatct tctatggagg agacatactg atcagttact 3900 gagtggagcc agtacattca ctgaccttcc tgaaggtacc gacgtctgtg attcagttgg 3960 cagctcacca gcttctgcga ctgtcactac agagcctagc aagattacag attccccacc 4020 tactaaagaa ctgtcacctc ctgttgtttc gactgaaaat ggggaccaag taagccgccg 4080 gtatccaata agagagcgac gtcaacctca gcgttttttg gattatactt agttaggggc 4140 tacccttaca gtatttgagt tggggcagaa taatccccta agggtaagcc agggtttgag 4200 gtagtccacc ctcattccca gggatttagt tgtgttaaat tactaaggac atttaagaca 4260 tttaatcaag aaacgttcaa ataaataaat gtgtgtttac agaatgttat aatggtaaac 4320 tgaatgttat acattcattc tgcatggtgc tagttcaaaa ttaaaatgtt gaccttcaaa 4380 tttaagggga aggaaatgt 4399 // ID CR1-8_DR repbase; DNA; ZEB; 3590 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-8_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3590 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1203-1203 (2008). XX DR [1] (Consensus) XX CC The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 247..3459 FT /product="CR1-7_DR_1p" FT /translation="LFWLRGLVLTKLQDLNMWLFRLLLWIILCSAVISAAP FT LMYSAGDLYKLRPATSQPPLLLHNHPDLTRRPRPKYIHRGSRRNVSSTTNH FT DRNGIIRSFWSNTRLPRFARRGVNHDVLSSLPRSDYMSNVTVRFMLLNVRS FT LSNKTSYIHELIEDKSIDFMCLSETWQKPNDFFILNQTVPSGYKYLCKPRC FT TGKGGGLALIYRDCFKICPIQIAEPTSFEVLAVLLKGPTPTVLVIVYRPPK FT VSALNIFFNELSAILTHVCTLSPNIMLMGDFNIHFDNTANTQYNDFKLVLD FT SADLVQYVHFPTHSKGHILDLVCCSGVVPNNISSTDHLISDHKSVFFNVSL FT SLGRVKPPRSVSFRNIKNIDHHVLEGMIDYSVNQCLSTSTSDLVNFYNNSL FT SQILDTVAPLRTRIVSYTNPSPWFTPELRHQKALGRRLERLWNKTGLEVHK FT QMYSEHLASYKAEIERAKSTFYSNLITSGHSNTRTLFSTVAKLLKPHDSVS FT LDPIQCVPYSNFLHAKTDXIHHELLNSVLIAPNYQYDYSHXSGSVSILSNF FT ELPSVDFITREIMKSKTSTCSLDPLPTVLVKSCINSLSSFITSIVCSSLST FT ATVPSSLKMASITPILKKTGADTNDLNNFRPIANLPFVSKILERTVAAQLH FT SHLLTNNIYEHFQSGFRPKHSTETALTKVVNDLLLAADSGYLSILILLDIT FT AAFDTISHSLLLERLASIGIADNALCWFSSYLSDRKQFVQIKNRRSDPVSV FT LHGVPQGSVLGPLLFIIYILPLGHIFRSFGIHFHFYADDTQVYISTKPTTS FT HPPSDLTKCLNEINTWMTNNFLKLNASKTEALLVGSKSVLSKAQSFTLFVD FT NSPVNFSTHVKSLGVILDGMLSFSSQISNISRSAFFHLRNIARLRPSLSQQ FT STEVLVHALVTSRIDYCNSLLSGIPDKQLHRLQLIQNSAARIVTCSRTSDH FT ITPILFHLHWLPVHYRIQYKILLLTYKALHNLAPHYLSDLLQLYTPSRSLR FT SSSAELLSVPHFKMKSFGGRAFSCTAPKLWNSLPLHIRQMDSISNFKSQIK FT TFLFRIAFNDLLV*" XX SQ Sequence 3590 BP; 941 A; 796 C; 603 G; 1242 T; 8 other; tgatttccgg ttgagtgaac gcaggcgcgt tttggctgcc gctgctcacc gtgatcactt 60 ttaaatactt ttatctgaaa gttttatctg aatattttaa tttggaattc acttcatctg 120 atactatttc gtctgaaacc atctgaaacc ttacccgaat tctcacccka aagctgtgtg 180 actgctgttt gacactttgc gcactggatg attgacgtgg acgtttgctg acggacttac 240 tgataacttt tctggctccg tgggctcgtc ctaactaaat tacaggatct caacatgtgg 300 ctgtttaggc tgcttctatg gattatcctt tgctctgctg tgatcagtgc cgctcctctg 360 atgtattcgg cgggggacct ttacaaactt agaccggcaa cttcgcaacc tccgttatta 420 ctccacaatc acccggacct tacccgccgg ccgagaccga aatacataca tcggggatct 480 cgtcggaatg tttcttcaac caccaatcat gataggaatg gtataattcg gtcgttttgg 540 tctaatacaa gactgcctag attcgccagg aggggtgtta accatgacgt gctttcatcg 600 ctgcccaggt cagattatat gtctaacgtt actgttcgtt ttatgcttct aaatgttcga 660 tctctcagca ataaaacatc ctacattcat gaactcattg aggacaaaag tatagatttt 720 atgtgtcttt ctgagacttg gcagaagccc aatgactttt ttattttgaa tcaaactgta 780 ccttccggat ataagtattt atgtaaacct cgctgtaccg gtaaaggggg aggtcttgca 840 ctaatttacc gtgactgttt taagatctgt ccgatacaga tagcagagcc tacatctttt 900 gaagtgttgg ctgtactact taaaggtccg actcctactg ttctagttat agtctataga 960 ccaccaaagg tatctgcatt gaatatcttt tttaatgaac tttctgctat tttaacacat 1020 gtgtgtactt tatctccgaa cattatgttg atgggagatt ttaatattca ttttgataac 1080 actgccaaca ctcagtataa tgatttcaaa ctggtcctgg acagtgctga ccttgtgcag 1140 tatgtccact ttccaacaca ctctaagggg catattcttg atctagtatg ttgttctggt 1200 gttgtcccta ataatatatc ttcaactgat cacttaatct cagatcataa atctgtattt 1260 tttaatgttt ctctttcttt gggaagagtt aaacctccac gctctgtctc atttagaaat 1320 attaagaaca ttgaccatca tgtcctggaa ggaatgattg attattctgt aaaccagtgt 1380 ctgtccactt ctacttctga cttggtaaat ttttataaca atagcctgtc tcaaatatta 1440 gacactgtgg ctcctctaag aacccgcatt gtatcttata ccaatccctc tccatggttt 1500 actcctgaac ttcgtcatca aaaagctcta ggtcgtcgcc tggagagact ctggaataaa 1560 actggtctag aagtccataa gcagatgtat tctgagcatc ttgcttctta caaggctgaa 1620 atagaaagag caaaatcaac attctactca aatctcatca ccagtggaca ctcaaataca 1680 agaactttgt tcagtactgt tgccaaattg ttgaaaccac atgattctgt ttcccttgat 1740 cctattcagt gtgttcccta ctctaatttt ctccatgcta agactgatcr tattcatcat 1800 gaattgctwa attctgtttt gattgctccc aattatcagt atgattatag ccatrttagc 1860 gggtcagtra gtattctctc taattttgaa ctgccctcag ttgattttat tactagggag 1920 atcatgaagt ctaagacctc tacctgctca cttgatcccc tacctactgt tttagttaag 1980 tcttgcatta attcactatc ctcttttatt acatctatag tgtgttcttc actatcaact 2040 gctactgttc cttcatcttt aaaaatggcc tctatcactc caatactcaa aaagactggg 2100 gctgatacta atgacctaaa caattttagg ccaattgcca accttccgtt tgtttcaaaa 2160 atactggaaa gaacagtagc agctcaactt cactcacatc tgcttactaa taacatctat 2220 gagcactttc aatctgggtt tcgccctaaa cacagcacag aaacagccct cactaaggtt 2280 gtaaatgatc ttcttttagc tgctgattct ggttacctgt ccatacttat tcttttagat 2340 attactgcgg ctttygatac tattagtcat tcactattat tggaamgact ggcttctata 2400 ggaatagctg acaacgcact ttgctggttc tcttcctatt taagtgacag aaaacagttt 2460 gtgcagataa agaatagacg ctcygatcct gtttcagtcc ttcatggtgt tcctcaaggg 2520 tcagtattgg ggccactttt atttattatt tatatcctac ctttaggtca tatatttcgt 2580 tcttttggca tacattttca tttttatgca gacgatacac aagtttatat ctccacaaaa 2640 cctacaacat cgcacccccc ttctgacctt accaaatgtc taaatgagat taatacatgg 2700 atgactaata atttcttaaa gttaaatgcc agtaaaaccg aagccctctt ggttggctca 2760 aaatctgttt tatccaaagc tcaatctttt accctgtttg ttgataactc cccagtcaat 2820 ttctcaaccc acgtgaaaag tcttggggtt atactggatg gcatgctctc atttagttct 2880 caaattagta atatctcacg ttctgccttt tttcatttgc gtaacattgc aagacttcgc 2940 ccttcactat ctcaacaaag tactgaagtg cttgttcatg ctttggttac atcacggatt 3000 gattactgta actcgcttct ctctggtatt cctgataaac aacttcatag actacagttg 3060 attcagaatt cagcggctag gatagttacc tgctcgcgta cttctgatca cataactcca 3120 attctctttc acctgcactg gcttcctgtt cactatcgta tacaatacaa aatcctcctc 3180 ttaacataca aagctcttca taacctggcc cctcactatc tttctgatct tcttcaattg 3240 tacactcctt cacgctcact gagatcatct tctgctgaac tgttatctgt cccacacttt 3300 aagatgaagt catttggagg cagggctttt agttgtacag caccaaagtt atggaattcc 3360 ctgcctttgc acattcgcca gatggactct atttccaatt ttaaatctca gataaagacg 3420 tttttgttta ggatagcttt taatgatttg ctggtttaat tttatgctga ttgtcatttt 3480 tactgcttgc atttttatga tgtgcttgta atctgctgta aggtgtcctt gagtgctttg 3540 aaaggcgcct aaaaataaaa ggtattatta ttattattat tattattaat 3590 // ID Gypsy62-LTR_DR repbase; DNA; ZEB; 475 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy62-I_DR; KW Gypsy62-LTR_DR; Gypsy62_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-475 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1440-1440 (2008). XX DR Genome; chr14; Positions 6787407 6786933. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 475 BP; 100 A; 140 C; 83 G; 152 T; 0 other; tgtcacagac tcggtcccag ttaatcccct caacaaccag cagaggtccc gctcacccga 60 ctactgaagt tttaagaact ccattaccca caagaccccg cgggaccttg attactgtcc 120 agctgaaacc catacacaca aacgcttata agcccttctc gcctgtcatt ctgtgcgaag 180 tcttgttttg ctctgcattc aattccaagc gttttcccta cattatctga ctgatttctg 240 ttgccgaccc ggactgtttg tttacctgtg attctatgcc gcctgcccag atccttgcct 300 gtatcctcgt atatgtatct ttgctacctg cctacaactc tgcttgtgac ccgactttga 360 gtgagttccc tgatctgatc ttctcgtttg tgaaacgtac atttgcttct ttatttgtgt 420 tgtgttcaat aaagacgctg caaatggatc ccacttcgtc tgactcttca tcaca 475 // ID Gypsy104-I_Dr repbase; DNA; ZEB; 4283 BP. XX AC . XX DT 22-APR-2009 (Rel. 14.05, Created) DT 22-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy-104_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW gag; reverse transcriptase; integrase; Gypsy104-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4283 RA Dib M.R. and Naveira H.F.; RT "Gypsy104_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 944-944 (2009). XX DR [1] (Consensus) XX CC Gypsy104-I_DR is an internal portion of the Gypsy104_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy104-LTR_DR. CC Gypsy104_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 305-aa gag CC Gypsy104_DR1p (pos. 17-931) and 1138-aa polyprotein (pos. CC 868-4281, conceptual translation) composed of the protease, CC reverse transcriptase, and integrase domains. CC Some insertions fairly recent, according to the hight identity CC between their flanking LTRs. CC Consensus obtained after the alignment of at least three CC independent insertions bearing at least 85% Homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 17..931 FT /product="Gypsy104-I_Dr_1p" FT /note="Gag-protein." FT /translation="MDTAGQDPVRSAVAQQGALLGQHETRLSNTTREVEFL FT SNQVADLTARIQDLQLEAAQGGRFPRHEPEPRCNNPPTYDGDPNSCRAFLS FT QCSVVFALQPRTYAAEETRVAFVLTLLTGRARDWGTSVWESRSPCCASFDE FT LRLEMMRLFDRSARGQEAADQLARLRQGSRSVTEYAIQFKTLAASCDWNEG FT ACRAMFRAGLEEEIQDELATHDLPHDFDDLINLALRVEVRLRRRHRRRMVH FT SSATMDDHSSLTASATLSDPEPMQVGRLRLTPQEKQQRLARGLCLYCGKPG FT HFAASCPAKAKAH" FT CDS 868..4281 FT /product="Gypsy104-I_Dr_2p" FT /note="Polyprotein." FT /translation="MPLLWQAWTFCSILSSKSQGSLVKRGILVSTIPLFYS FT PSSRTLLPVSVQYGDSSNCCSALIDSGAEGNFLDRSMAAQWGIPAIPLPKP FT IPARSLDGTLIATVSHTSPSVSLIVSGNHREVTTLYLLDSPSAPIVLGHPW FT LVQHGPHVDWSGNSVLSWSHSCLESCLGPAPFPGSVSSVLQVEPDDLTGVP FT AEYRDLSLVFSKSRATSLPPHRPYDCAIELLPGTSPPKGRLYSLSSPEREA FT MDKYINESLKAGLIRPSSSPAGAGFFFVKKRDGSLRPCIDYRGLNDITVKN FT RYPLPLMSSAFELLQGAKVFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHF FT EYRVLPFGLTNAPAVFQALVNDVLRDMVNQFVFVYLDDILIFSPSLQVHTQ FT HVRQVLQRLLENQLFVKAEKCVFHAKSVSFLGFVISAGEIKADPSKVRAVA FT EWPTPDSRKALQRFLGFANFYRRFIRNFGLIAAPLTALTSPKVLFRWGDKA FT QEAFDKLKSRFISAPVLSVPDPKRQFIVEVDASEVGVGAVLSQRSPRDGKV FT HPCAFFSHRLTPTERNYDIGNRELLAVRLALGEWRHWLEGAVQPFLVWTDH FT KNLQYLSSAKRLSARQARWSLFFSRFNFTLSYRPGSKNIKPDSLSRLFEIP FT EREVSNDTILPKGVVVASLSWDIERQIQEALRECRVPRGVPAGKLFVPTAL FT RSQVVQWGHSSRMVCHPGVRRSLAAIRQRFWWPSMARDVRQFVVACSVCAQ FT NKSSNSPPTGLLHPLPIPSRPWSHLALDFVTGLPESRGNTVILTVVDRFSK FT AAHFIPLPKLPSAKETAQVVVDHVFRIHGLPVNVVSDRGPQFVSRFWREFC FT RQIGASTSLSSGFHPQTNGQSERANQDLERALRCLASHNPSSWCQQLSWVE FT YAHNTLPVSTTGMSPFQCSVGYQPPLFPSQEPDAAVPSALAFVQRCRRTWR FT RAKEALARASRRTKAAADRHRTLAPHYVCGQRVWLSTKDLPLRVASRKLAP FT RFIGPYRITKVLSPVAVRLKLPPTLGRVHPVFHVSRVKPVYSSPLVPSIPT FT PSPPPPRLVDGSPVYTVRRLLDVRRRGRGFQYLVDWEGYGPEERSWIPARD FT ILDRGLIADFRRRRGEPPPGAPSGARREGG" XX SQ Sequence 4283 BP; 841 A; 1232 C; 1111 G; 1099 T; 0 other; gaacaatctg accaggatgg acacagcagg ccaagacccc gtcagatctg ccgtggccca 60 gcagggagcc ctgctcgggc aacatgagac cagactctcc aacactacaa gagaggtaga 120 gtttctgtct aatcaggtgg ctgacctcac tgctcgcata caggaccttc agctagaagc 180 agcacaagga ggccgttttc cccgtcatga gcctgaaccc cgctgtaaca acccacccac 240 ctatgacggc gatcccaact catgccgggc cttcctgtcc caatgttcgg ttgtgtttgc 300 acttcaacca cgcacgtatg cagcagaaga gacaagagtg gcctttgttt tgactcttct 360 cacaggcaga gcccgtgatt ggggtacctc tgtttgggaa tccagatctc catgttgtgc 420 ctcttttgac gaactcagac tggagatgat gaggcttttt gacaggtcag ccaggggaca 480 ggaggctgca gaccagctag cacgacttcg ccaggggagt cgctctgtta ctgaatacgc 540 aatccagttc aagaccctgg ctgcctcgtg cgactggaat gagggagctt gccgggcaat 600 gtttcgtgct ggcctagaag aggagatcca ggatgaactg gctacccatg acttgcccca 660 tgacttcgat gacctcatca acctggcact tcgcgtcgag gtccgtcttc gccgacgaca 720 ccgccgaagg atggtccatt cctcagccac gatggatgat catagttcac tcactgcctc 780 tgctactttg tctgaccccg aacccatgca ggtggggcgc ctccgcctca caccccagga 840 gaaacagcaa cgactggctc ggggcctatg cctctactgt ggcaagcctg gacattttgc 900 agcatcctgt ccagcaaaag ccaaggctca ctagtgaaac gggggatcct ggtgagcact 960 atcccactct tctattcccc ctcttcccgt accctacttc ctgtgtcagt ccagtatggg 1020 gactcctcta actgttgttc agccctcatt gattctgggg cggagggcaa ttttttggac 1080 cgctctatgg ctgctcagtg gggtattcct gccatccctc tccctaaacc cattcctgct 1140 cggtccctag atggtaccct gattgccaca gtctcccata cttcccccag tgtaagtctt 1200 atagtttccg gcaatcaccg tgaggtcacc acgttgtacc ttttagattc tcccagtgct 1260 cccattgttt tgggtcaccc ctggttggtg cagcatggtc ctcacgtgga ttggtccgga 1320 aactctgtct tgtcttggag tcattcctgt cttgagtctt gtcttggtcc tgccccgttt 1380 cctggttctg tctcttctgt gttgcaggtg gagccggatg atctgaccgg ggttccggcg 1440 gagtaccgtg atttgtccct ggtgttcagt aagtcccggg ccacctctct gcctccgcat 1500 cgaccctacg attgtgccat agagctcctc ccaggcactt ctccgccaaa gggtcgctta 1560 tattcccttt ccagtccaga aagagaggct atggacaagt acattaacga atcccttaag 1620 gctggcctca tccgcccttc ctcatctcct gctggtgctg ggttcttctt cgttaagaag 1680 agggatggct cccttcgacc ctgtattgat tatcgagggc tgaatgacat cactgttaag 1740 aacaggtacc ccttgccgct tatgtcttca gcctttgaac tcctgcaggg agctaaggtc 1800 ttcaccaagc ttgacctccg taatgcctac cacctcattc gcatacggga gggggatgag 1860 tggaagacag ccttcaacac acccacggga cactttgaat atcgggtcct tccctttggt 1920 ctgaccaatg ccccggctgt ctttcaggcc ctggtcaacg acgtgctgag agacatggta 1980 aaccagtttg tctttgtgta ccttgatgat atcttaatat tctctccatc tctacaggta 2040 cacactcagc atgttcgcca ggtgctgcaa cgactactag agaaccagct gtttgttaag 2100 gcggagaagt gcgtcttcca tgccaagtct gtgtcgttcc tggggtttgt gatttcggcg 2160 ggtgagatca aagctgaccc ctctaaggta agggctgttg ccgagtggcc aactcctgac 2220 tctcgtaaag cattgcagcg gttcctggga tttgccaact tttatcggcg attcatccgg 2280 aactttggtc tgattgctgc acctttaact gcgctcacct ctcccaaggt actgtttagg 2340 tggggcgata aggctcagga ggcctttgat aaattaaagt cccgttttat ctctgctcct 2400 gtcttatctg ttccagatcc taaacgacag ttcatagttg aggtggacgc ttctgaggtc 2460 ggagtaggcg cagtcctgtc acaacgatct ccccgggatg ggaaagtgca tccttgtgct 2520 ttcttctccc accgcctgac ccccacagag cgaaactatg acattggtaa tcgggagttg 2580 ctggcagtca gattggcttt gggtgagtgg cgccactggt tagagggtgc agttcaacct 2640 tttttggtct ggacagacca caagaacttg caatatctaa gttcagccaa aaggcttagt 2700 gcaagacagg ctcgctggtc cctcttcttc agccggttca actttacgct ctcgtacagg 2760 cctgggtcta agaacatcaa acctgactct ctctcccgct tgttcgagat cccagagagg 2820 gaggtttcaa atgacaccat cttgcccaag ggggtagtgg tggcctctct ctcctgggat 2880 attgagagac agatacaaga ggccttgcga gagtgcagag taccgagagg ggtcccagcg 2940 ggcaagttat ttgttcccac tgctctgcgc tctcaggttg tccagtgggg tcactcatcc 3000 agaatggtct gtcacccagg agttcggaga tcactggctg ccatccgtca gcgattttgg 3060 tggccatcca tggccaggga tgtcaggcag ttcgtggtag cctgctcagt ttgcgcccag 3120 aacaagagtt ccaattcgcc tcccactggt ctgcttcacc ccctacccat tccttcccgc 3180 ccttggtcac atttagccct tgactttgtc actggtttgc cggaatcgag gggtaacacc 3240 gtgatcctta cggtggtgga ccgattctcc aaagcggccc atttcattcc cctccccaag 3300 ctcccctcag ccaaggagac agctcaagtg gtggtagacc acgtattccg gattcatggt 3360 cttccggtta atgtggtttc cgataggggt ccccagtttg tctcccggtt ctggagggaa 3420 ttctgccgac agatcggggc ctctacgagt ctgtcgtcgg gattccatcc tcagaccaat 3480 gggcagtctg agcgggcaaa ccaggatttg gaacgtgctc tccgctgcct ggcatcccat 3540 aaccctagct cctggtgcca acaactgtcc tgggtagagt atgcccataa tactctacca 3600 gtttctacta caggtatgtc accattccaa tgttctgttg gctatcaacc acctctgttc 3660 ccctcacagg aacccgatgc tgcggtcccg tctgccttgg cctttgtcca acggtgtcgg 3720 cgcacctgga gaagagctaa ggaggccttg gcccgggcta gtagacggac caaagcagcg 3780 gctgatcgtc atcggactct ggctcctcac tatgtatgtg gtcaaagggt atggctctcc 3840 actaaggacc tgcctctcag ggttgcttcc cgtaagttgg ctcccaggtt cattggtcca 3900 taccggatca caaaggtgtt gagtcctgtg gcggttcggc tcaagcttcc tcctacgctt 3960 ggtcgggtac acccagtgtt tcatgtctct agggttaagc ctgtgtattc ctcccctctt 4020 gttccctcta ttccgacccc ttccccccct cccccccgtc tagtggatgg ctctccagtc 4080 tatactgtta ggagattact ggatgtcaga cgtaggggtc gtggatttca atacctagtg 4140 gactgggagg gctatggccc ggaggagagg agttggattc cggccaggga tattctggac 4200 agggggctga tcgcggactt ccgtcggcga cgaggtgagc cccctcctgg agcgcccagt 4260 ggcgcccgta gggagggggg tac 4283 // ID hAT-N67_DR repbase; DNA; ZEB; 1546 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N67_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1546 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1336-1336 (2008). XX DR [1] (Consensus) XX CC TIR is 14-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1546 BP; 483 A; 269 C; 278 G; 514 T; 2 other; catgggcgta aatcccgggg gggacggggg acatgtcccc tcccatccga gggttgcccc 60 cccccccccw ccccaccacc aaaaaaaaat taattattaa aaataaaaac tctcaattgt 120 gaaaaatata tgtatatata cctaaaataa ttgtctaagt agaaagaaaa ccgcatttat 180 catttaaaat gcatttagaa acagttgagt tccccctccc cttcctcara gtggttcggt 240 ccactgtgct ttttttccct catctcacct actcacacac acgcacacgt caggtgtgcg 300 gctgcggctg agttaatgtt gaatttcagt aagtaagtag ggggttttta atccacctta 360 aataaagcga ttctcttcaa tgtagttaat acagactcgt tcataaatta gttgcggcaa 420 aaatgctgat tacggatgta attgtccatg aatctgctgt attagcgatt aaaattactt 480 atctagttaa cgttagcttg ttgctaacgc taaagccgtt tcccatgcat tgtttacatt 540 tgtgacgact tggcataatg ttaacttaac attaacgacc acaattagca tattgtttag 600 ttgtgtattt gctaaatgag cactgtgcac actaacccca ctgtattttt tgtataatca 660 gtttctattc tgttttaagt gactcaatta atttgaaatt aaaatgttaa ccttttttaa 720 ttctttgttt ttattgttgt gattattttt atgatagttt tgcattcttt atgtaaagca 780 ctttgaatta ccattgtgta taaaatgtgc tacagtataa ataaacttgc cttgcagtgt 840 catgtattgg ataaaatgac agagaaaatg gacataagat catttttcag tgcagctaaa 900 cgccaagtac aaaaagttca catattaagg caatttggct ttaagaatga gtgcttatga 960 aaatactaat gtcacaatat caatcacagg caaaggagaa agaaatgatt gaggaacagg 1020 cagagcaggg tcagtcagat gtacagtgtc aggtaagggt gttgtgcttt attgaaggga 1080 taatgacaat tcttttgggg ggtgaacaat tttttttaat gttttctgag ttgtggctgt 1140 ttagcagatg aacttcacag gctcactgat ttacaatatg atcggtcatt ttaacaagtg 1200 taatgtaaac aagttagtta ttatgagtat ggaaaatgtc ttttcctgca tttattctgc 1260 ttcaggagtc agtctgttga ctgtaaacaa cacaacaaac aatgcaataa atagttttga 1320 agaaaaattt gctttgtcgt tgaacctgag aaaaactttg aaaataatca taatgacgta 1380 gtctccatgc tataataata attatgatga aaccattttt aactgcgggg acatatcttt 1440 attagtacaa ttcggctgta ttatttgtgt ccccttcaaa aaaatgctcg aaagaaattt 1500 tatatttatt gtccccccct actgttaaat cagatttacg cccatg 1546 // ID Gypsy120-I_DR repbase; DNA; ZEB; 4522 BP. XX AC chr19; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy120-I_DR; KW Gypsy120-LTR_DR; Gypsy120_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4522 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1276-1276 (2008). XX DR Genome; chr19; Positions 7009380 7004859. XX CC Positions [2175-2651] - Integrase core CC 'CACTC' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 84..4520 FT /product="Gypsy120-I_DR_1p" FT /translation="MSRQQKKGPMKPVSVEQEEEQGRPAEEEMEHKDKDEM FT ASREPTLADLTSLFSAHMAKMDSREAVRKQEHIDQERRFKALQHQFSLLQM FT EVQARTSPTSDACHIDTHSLDVHGAQADTDNQQFTSPNVFHSNADLIDDRF FT PPPRFREPRLEKLADSDDIEHFLITFERIATACRWPKADWAFHLIPLLTSR FT ARAAYVQMDLDDSTEYDKVKAAILVKYDINAETYRLRFRSLTVLPSEAPKE FT LYTRLKDLYGKWIKPKGKTTEEVGEIIILEQYLRMLSPELQVWIRERDPTT FT AAEAAALADVFVAARGKNQSWGRKTSSEARRPPFPQFQPKSMAGLGKTPKG FT GSPEMKPAMTLRRPVVCYLCGQEGHTKPVCPRNVVKPTNLCYVPRGWEDKA FT QKPNSLLHAITVKVNGKEFPALIDTGSDQTLVSRKVVAPSSVNASNKLSIC FT CVHGDERLVPTADLYLAVNNQPYLLEVGVVDNLPYPVVLGRDLPVLLDLIQ FT PMSQCNMVGYPSYAKAKQQDEPAQTLSMLPFYNADLETSPPKPKTTRRQRR FT HAKLQYNASRLPAGEVDGLNPQMKLPTNISELQQNDTSLVAYFEQANQMEM FT GDKVIDDNCERFMISENILYRQIGSVKQLIVPKEARELVLHLSHSIPWAGH FT LGVRKTIERIKKYFYWPGLKADVVQYCKSCPECQLVSLHHPPRVPLQPLPL FT ISTPFERLGMDIVGPVEKSSLGNRFLLVITDYATRYPEVFPLKSMKAKNVA FT TCLIQFCSRVGLPREILTDQGTNFMSTLLKQVYQLLGIKSLRTTPYHPQTD FT GLTERFNQTLKQMLRKFVDETGKNWDQWLPYILFAYREVPQASTGFSPFEL FT LYGHEVRGPLTLLKEMWEGIKGDREPSNVVAYVLQMRERLEKMRALARTHM FT AEAQKRQRTWYDQRAREREFVSGEKVLVMLPSKENKLLAKWQGPFNIKSRL FT GPTTYEVATPGQEHASRILHVNLLKKWVPRSEQVLLVCRVQGDEECEDQYL FT PQPTSADIRLDHLPEKQRRQVSPLCNPQIFSELPGHTSLIQHDVILKPDVA FT VRRMSYRIPEKLQEALKEEVDFMLSLGIIEPSQSEWCHPVVLVPKKDGNIR FT FCIDFRYLNSVSQFDCYPTPRIDSLIDRLGKAVYLTTLDLSKGYWQIPLTE FT RARPLTAFRTPWGLFQFRFLPFGLHGAPATFQRLMDQVLQGLTFAAAYLDD FT IIIYSTTWEEHMQHLHEVFQRLQRAGLTANPAKCAIARKEAEYLGFVIGNG FT VVRPQIKKIQALEECPLPQTRKELRSFLGMAGFYNRFIPNFSSRAATLTDM FT VGVRCPNQCQWTEERMAAFKDIQTALTTNTVLYNPDFTKEFIVQTDASERG FT LGAVLLQGSPGERRPVVFISRKLFPRETRYSTIEKECLAVKWALDSLRYYL FT LGREFILETDHKALQWLERMRDTNGRITRWYLAMQPFRFKVHHVPGKANVT FT ADYLSRCASETPEGRG" XX SQ Sequence 4522 BP; 1281 A; 972 C; 1112 G; 1157 T; 0 other; tatggtgtca gaaggtggat tttaatagag gacacaggtt tttttttgtt tccctctcca 60 tagcctcagt acattgagtt gccatgagcc gtcagcaaaa gaaggggccc atgaagccag 120 tttctgtgga gcaagaggag gagcaaggaa ggcctgcaga ggaggaaatg gagcacaagg 180 ataaagatga gatggcaagt agagagccaa ctttggctga tttaacaagt ttgtttagtg 240 cgcatatggc aaaaatggat tccagggagg cggtcagaaa gcaggaacac atagatcaag 300 aacgacggtt taaagcatta caacatcagt tcagtttatt acaaatggaa gttcaggccc 360 gcacctcacc tacatctgat gcctgccaca ttgatacaca ttctctagat gtgcatggtg 420 ctcaagcaga tactgataat cagcagttta cctccccaaa tgtcttccac tcaaatgctg 480 accttattga tgatcgcttc ccaccaccaa gatttcggga acctcgatta gagaagttgg 540 ctgattctga tgatatagag cacttcttaa taacttttga acgaatcgct acagcctgtc 600 gctggcctaa agctgactgg gcatttcatt tgatccctct attaactagc agggcaagag 660 ctgcatatgt ccaaatggac ctggatgact ctaccgagta tgataaggtt aaagctgcca 720 tcttggtaaa gtatgacata aatgctgaaa cttataggct gagatttcgt tccctcactg 780 ttctcccttc tgaggctccc aaggagttgt ataccagact taaagacctg tatggaaagt 840 ggattaagcc caaaggaaaa actactgaag aggtaggaga aataattatt ttagaacaat 900 atctgcgaat gctttctcct gaacttcagg tctggataag ggagcgtgac cccacgacag 960 cagcagaggc tgctgcctta gcggatgtct ttgtggcagc aagggggaag aatcagtcct 1020 ggggcaggaa aacaagcagt gaggcacgaa gaccaccctt tcctcaattt caaccaaaat 1080 ccatggctgg attgggtaag actcctaagg gaggttcccc agaaatgaaa cctgctatga 1140 ctctgagaag accagtggtc tgctatctct gtggacaaga ggggcacaca aaacctgttt 1200 gtccaaggaa tgtagttaaa ccgactaatc tgtgttatgt tcctagaggt tgggaggata 1260 aagcccagaa gccaaattcc ttgctgcatg ccatcactgt gaaagtgaat ggtaaagagt 1320 ttcctgctct gattgatact gggagtgacc agacactggt gagtcggaag gttgtggctc 1380 catcttcagt taatgcaagt aataagcttt ccatttgctg tgttcacggt gatgagagat 1440 tggtgcctac tgcagatcta tatctggctg ttaacaacca accctacttg ctagaggtgg 1500 gggtagtgga taatcttccc tatcctgtag tgttaggacg agatttgcca gtgcttttgg 1560 atcttatcca gcccatgtca cagtgcaata tggtacttac tagagccaag gcaaaacaac 1620 aagatgaacc agcacaaact ctgtcaatgt tgccatttta taatgctgat ctagaaacca 1680 gtcctccaaa acccaagacg actagacgac aaaggagaca tgcaaagttg caatataatg 1740 cctctcgtct gcctgcaggt gaagtagatg gtcttaatcc acaaatgaaa ttacccacca 1800 acattagtga attgcagcaa aatgacacca gtcttgttgc atatttcgag caagcaaacc 1860 agatggagat gggcgataaa gtgattgatg ataactgtga gcgatttatg atttcagaga 1920 acatactcta ccgtcagatt ggatcagtga aacagctgat tgttccaaag gaggctcggg 1980 aacttgttct tcacttgagc cactcaatcc catgggctgg ccacctaggg gtaaggaaaa 2040 ccatagaacg gatcaagaag tatttctact ggcctggtct gaaggctgat gttgttcagt 2100 attgcaagag ctgccctgag tgtcaactgg tatctctcca tcatccacct cgagtaccac 2160 tacagccact tcctttgatt agtactccgt ttgaaagact gggcatggat atcgttggcc 2220 cagtagaaaa aagcagcttg ggtaatagat ttctgctggt tatcacagat tatgctacaa 2280 ggtacccaga ggtattccct ttgaaatcta tgaaagcaaa gaacgtggca acctgtttaa 2340 tccagttctg ttccagagtc gggttacccc gtgaaattct tactgaccaa gggactaact 2400 ttatgtctac tctgttgaag caggtctacc agttgctggg tattaagagt ctgcgcacaa 2460 ctccatacca tccgcagact gatgggctca ctgagcggtt taatcaaacc ttgaagcaga 2520 tgctcagaaa gtttgttgat gagactggta agaattggga ccagtggctt ccctatatac 2580 tttttgctta tcgggaggtt ccccaagcat caactgggtt ttctcccttt gagcttttgt 2640 atgggcatga agttcgggga cctttgaccc tacttaaaga gatgtgggaa ggtattaaag 2700 gagaccggga gccttctaat gttgtggctt atgtcttgca aatgcgggag cgtctggaga 2760 aaatgagagc tttggcaaga acccatatgg cggaggccca gaaacgccaa cgaacatggt 2820 atgatcagcg agcacgtgaa agagaatttg tgagtgggga aaaggtgttg gtgatgttgc 2880 ctagcaagga gaacaagctt ttggccaagt ggcagggccc ttttaatatc aagagccggc 2940 taggccctac aacttatgaa gtggcaactc caggtcagga gcatgccagc agaatactgc 3000 atgtgaatct gttaaaaaag tgggtgcccc gttctgaaca ggtactgctt gtatgccgtg 3060 tccaaggaga tgaggagtgt gaggatcaat atctgcctca gccaacatct gctgacatta 3120 ggctggacca tctgcctgaa aaacaacgaa gacaggtgag ccctctttgc aatcctcaaa 3180 tattttctga gttgccgggt cacacttctc tgatccaaca tgatgtgatc ctgaaacctg 3240 atgttgctgt tagacgtatg agttacagaa tacctgagaa attgcaagag gccttgaagg 3300 aggaagtgga ctttatgctg agcctaggga tcatagaacc ttcccagagt gagtggtgcc 3360 atccagtggt cctggttcca aaaaaagatg gaaacatacg attttgcatt gatttccggt 3420 acttgaattc tgtatcacag tttgattgct atcctactcc ccgtattgat tcattaatag 3480 atcgattggg caaagcagta tacctgacca cactggatct ttctaaagga tactggcaaa 3540 tacctttgac tgaacgggcc cggcctttga cagcatttag gacaccatgg ggcctattcc 3600 agttccgatt tcttcccttt ggcctgcatg gggcaccggc aacatttcag aggcttatgg 3660 accaggtatt gcaagggtta acgtttgctg cagcttatct tgatgatatt ataatttaca 3720 gtacaacctg ggaggagcac atgcaacatc tccatgaagt cttccaacgg ctccaaagag 3780 ctggtctgac tgctaatcct gcgaagtgtg ccatagcccg gaaagaggca gaatacctgg 3840 gctttgtcat tggcaatgga gtggtgcgac cacagattaa gaagattcag gcactggaag 3900 aatgtccttt accacagaca cgtaaggagt taagatcatt tcttggaatg gcaggcttct 3960 ataaccgctt tattcctaac ttctccagta gagctgctac cttgactgac atggtgggtg 4020 tccggtgccc gaatcaatgc caatggactg aagagaggat ggctgcattc aaagacatac 4080 agactgcatt gacaaccaac actgtactgt ataatccaga cttcaccaaa gaatttattg 4140 tgcagacaga cgcctctgaa agaggtttag gagcagtact gttacaggga tccccaggtg 4200 agcgacgacc tgtagtcttc attagccgca aactctttcc aagagaaacc cgttattcca 4260 caattgagaa ggaatgtttg gcggtgaaat gggccttaga ctccttgaga tattatttgc 4320 ttggaagaga attcatactg gaaacagacc acaaagcact ccaatggctg gaaagaatga 4380 gagatactaa tgggcgaatc acaaggtggt acctggccat gcaaccattc cgatttaagg 4440 tccaccacgt tcctggcaag gcaaatgtaa ctgcagacta tctttcccgc tgtgccagcg 4500 agacgcccga agggaggggg tg 4522 // ID DNA-X-11_DR repbase; DNA; ZEB; 1356 BP. XX AC . XX DT 08-DEC-2008 (Rel. 13.12, Created) DT 08-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1356 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2159-2159 (2008). XX DR [1] (Consensus) XX CC TSD is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1356 BP; 395 A; 255 C; 285 G; 421 T; 0 other; gtggtgggcc gttatcggcg ttaacgtgct gcgttaacgc gagactctta tcgggcgata 60 aaaaaatatc gccgttaatc tattctcaaa gttgggttgg gagctgggtc tattctacgc 120 aagctatgat gactttcacc ttgatatttt agcgcggatg tatacctggc cgaattgcac 180 tgtaggtggc gagaacgagt cttcgaacct gtgtgtattc ctactgtgaa attaccacat 240 caaacgtgac gtgctaacat ggatgcagtt cactagagtg aatctgatta atgttagctc 300 cacatctaac tatcaggcac ctgtagagtt tatgcgttcg agtaaaacat atacaaataa 360 aatggaccgg gtgcttttta aaatgaatga cggtgaatga aagtcaaacc ggtgagccgt 420 ctgacaaaaa aagtgccctt ctgaatcaaa tcagcaggat gcccttctgg atctttcact 480 tacttcgaag acactactga ctacgcttac atggacatct gtaatctagt tatttgcctt 540 aatagacaat aatataatta aggtgtttac gtgaagtgct ttcatgtaag agtttcctct 600 gtaattttgg gtgactttaa ctgcagttcg gcagtttcac tttcactcat gaacatttca 660 ttcatgcccc cgtgacaaac tggggtatta gacgcaaata aggagtaagg actggtgaga 720 gtgttatgga atttaataac gcacgccaaa tggaaagaaa aaaacttccg catttcgcga 780 tgtgtgtgtg tgtgtgtgtg tggtccttta ctgacagccg cgtgtgtgga tcttgtcgga 840 aaatatggcg aaaagtccta catgacggta atagtttgat tgcggtgttt acttcaataa 900 tgccactaat atctgtgcat actccacatg tcttaattcc atttctgttt agttcagtta 960 tgactttagt cggattaagg tgatcaaaaa tcgctgtttg gagcttatgt aggcctacag 1020 ttcaaaattc atgtttaact gaataaacag ttagtaaaca caagtacatc ttattgaaca 1080 tcatttattt tcatcaccaa ttatcatagt agaacagttt ctcaagcagt ttgtgatgca 1140 ttttggaaac aggagatgag cccctggtct aatgcgccac ctggcttgag aaacccgttc 1200 tcaaagactt acttttagtc attatttggg tagcacacat attctgaatg ccttcggcag 1260 aattcaaatg agccatttta atctagatta atctagatta attccaagat tacagtgaga 1320 ttaatctaga ttaaaaaaat aatctatgcc caccta 1356 // ID DNA-8-33_DR repbase; DNA; ZEB; 563 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 04-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-33_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-563 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 533-533 (2009). XX DR [1] (Consensus) XX SQ Sequence 563 BP; 140 A; 146 C; 95 G; 182 T; 0 other; taggcttggg cggtatccaa attttgatac cgtcaaacct cctccctatt ttacctcggt 60 atacggtatt accgtgcata ataaaaaaaa ttatgacgta aggctcagac agcgtcacca 120 aactgttggc ttgtgcctaa accattcaga aactgaatac cgtatatgca accttaggtt 180 tgaggtattt ccatctctcg cggtcacctt tttgttgcac aatttgcaaa ttgcctcgtc 240 cgtatttacc gcctctccct tctcgttcgg tcgaaacccg aaatactgcc aaactgcaga 300 agttgtgttc tttttcgaga ccaggtcact tggtgcgcga ctcgccatct ttccccacct 360 ctctctttct cctccacgct gtcccgtgat gacaaccaca catacgcatt tttaggcatt 420 aaataaatta tatttaatat ttaatccttg tctcctatat aagtgcattg ttttttatga 480 cggtataacg gtattgaaac tgataccgtt gctattttta gatcccgcgg tataccatat 540 taccgtatta ccgcccaagc cta 563 // ID Gypsy100-LTR_DR repbase; DNA; ZEB; 1727 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy100-I_DR; KW Gypsy100-LTR_DR; Gypsy100_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1727 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1510-1510 (2008). XX DR Genome; chr19; Positions 11706883 11708609. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1727 BP; 554 A; 265 C; 300 G; 608 T; 0 other; tgtaacccag atcataaatc taggtcacta ataatcattt tagaaagtat tattattatt 60 atttatttat attattttat tattaatact ttaataatta gtgaataaaa agcagaaaga 120 tcgctgtgta attaaaggta ttaacaaata ttttcgtacc tgaacttgat tggagaagtg 180 agcagttagc tccgcccccg cacatgcgac ggaaagcaca agcagatagt gccaacaaac 240 aaacaagagc tagagaagta gcaagttgac aagcaaccga gagaactaga gatatagtgc 300 taacaaacaa ccgagaacta gagagatatt aagttaacaa gcagtgaaaa ctagagagct 360 gacgagcaca cgagagctgg agagctaacg agcacacgag agctaacgag cacatgagag 420 ctggagagct aacgagcagt cgagagctag tgatctaaca accagttgag agatagtaca 480 tcaatagaaa actgacaaaa tagacttgat tcaactttat ccttcactaa ttactgtctt 540 tggtgagtac caatgccttt atgagaaatt tattaatatt ttattgtgaa tatgttgggt 600 tgtaaagtga aacagtagta aaaaaccaaa gttcttgtaa atgttttcaa aattactctt 660 aaaattgctt ttaaaagaat cctacataat atttctagtg tttataattg tgtttacatc 720 tctgtaaaat attttgtatt gaaagtactc atttcttaaa gagacatgca cgccatgcgg 780 tgctagcctt gctaacactg tgcattagca gtgtgattag ctgttatgtt ttatcaatgc 840 acaattgatt gtaacagaca atttatttaa ttttaactgg tttatttaga tacttgacac 900 taatgtttat tctggctttg ttgttacaag gccgggtttt tttttgtatg tgttttttgt 960 cctcctcact actgccggtg ttgttcttga ttgtgaacgg ccatcctgag atcgtcctgg 1020 ctgaatattc atattctggt ctggactggc gagccatcct actgatacgt gcctgaacta 1080 agtaaaagag agtgaaatct gctgatctgg taggaccaaa ctaatttgtt aagagagttg 1140 ttttttttct ctcctcgtgt tttctacaag gatatttttc atgggaacaa tttaaatatt 1200 ttttttcctt ggactgtttt ttttcttctt ccaaattttt gaaaggagac atttaactgg 1260 acactttgac ttcaaacaaa gactttatat tgaaagttca tctttacttt taacaaagac 1320 tctgcttttg aaatctgaaa tttagggatt gtttggaact ttgaaactga actttgcatt 1380 tattttaata gcagtttgtt gttgaattta ctattcacat tttctattgt attatactgt 1440 tttatattgg ttttatttgt tttcttttaa aatctcaaaa gtacattaag caaaacaata 1500 caacatattt tgcattaatt tatttgttgt gaattatatg attcatttat taatcctaaa 1560 attagattac actattatca ctataagtat tgttcttttg ctccagtgag tcgaatctta 1620 cctttaaact ctggtccaaa aagaattcta attgttagcg ttggtctccg ttataaaaat 1680 acactagtta acatatataa taggagccac acaagggtag tgttaca 1727 // ID DIRS-12_DR repbase; DNA; ZEB; 5860 BP. XX AC . XX DT 07-JUN-2010 (Rel. 15.11, Created) DT 07-JUN-2010 (Rel. 15.11, Last updated, Version 2) XX DE A family of DIRS-type LTR retrotransposons - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5860 RA Kojima K.K. and Jurka J.; RT "DIRS-type LTR retrotransposons from zebrafish."; RL Repbase Reports 10(11), 1840-1840 (2010). XX DR [1] (Consensus) XX CC This consensus is generated from 8 sequences with >96% identity. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 394..1878 FT /product="DIRS-12_DR_1p" FT /translation="MSLSKSAASPEHSSRACPAACGALIAAKDPHPFCVVC FT LGLKHAQEALETPENCSHCLKLPKKLLRRRMKVAATQCVEFSDSDGGHGDD FT SAQPGDSRGAMSLDWADQPNPAFPEEDIFAGVPPFADEPAGSGDDDDAGLL FT GVSEDEEAIPPSGAPQIHTPATVPQSILLDVCERAAARLNIEWPAPQSATD FT QERDIYDGKLLGPPPGPRKQLFPVLPACAKHMKHYWGDPLNLKQGLAGLEV FT KDMASIGMGDPPVIEPSIARHLSPIQGGLLAPPKPVLPNKMDRFSASVHQA FT AYRSSAVTVRALNVSSLLSAYQAELLDDLGQQLDKGTPSPSLWKEILTVND FT LVLRNARQAVQACGRSMALSVVGERALWLNLSGLPDSEKRRIAGAPVEPGQ FT ALFGPAVALMQQRCDDKKKEDEAFKLCLPRKAAPRQVPPARAPIPPATGRN FT FHQGKDKPRTRPPPRQGNHPSAKPWGRTTPAAAGKPPAQTPSDPKRKRPA" FT CDS 1637..4189 FT /product="DIRS-12_DR_2p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MKPSNYVFLGKRLHARCPQRAPLFPLPQGGISIRARI FT NLAPDPLRDRATILPQNLGEEQRPLPRENPLPKPPRIRNASGRPDVQPMGA FT PSLCSRATSPTDYFPAPELLSKRRRIAQKEVVHPMCSVDVVCSRPRFKGSV FT VRKSVQAVSPHRSNFVPLTQIKASAHVFPNTKHTKYTSKQTNQMNSFPKEH FT ISVNPCMSPLVCPLEGAIAPEMNKPAEFAVSSSPVCVGSLANQVTAWQSVS FT APEWVIHTITRGYRLQFAKRPPRFNGVLSSQTEESSAHILKDEISSLLQKG FT AVRVVPPHLIGQGFYSRYFLVPKKDGSLRPILDLRVLNKHLRKYSFRMLTH FT GSLLRSVNQNDWFTSIDLKDAFFHISIYPPHMKFLRFAYQGVCYEFTVLPF FT GLSLSPRTFCLCAEAGLAPLRTAGLKILTYIDDWLIIADSREKVIQNTFRV FT LTHIKALGFRVNVSKSSFTPSQNVTFLGLELNSITMRARLSQERILSLLNC FT LSQFREGARVQYRTCLRLQGLMASAIHVLPLGLLRMRAFMKWVLSLHLSPL FT CDLSLYVTVSRSCTTALRHWRNAEFYAHGSPLGAVMMRKVVTTDASLTGWG FT ATQEGRIVNGVWSHTLHSAHINYLELLTVWKALKHFLPRLQGHHVLVRCDN FT TTAVAHINRQGGMRSSKLHALAHXLLVWSRQHFLSLRATHVPGVLNRGADL FT LSRGNPLYGEWRLHPQIVGLLWKRFGQADVDLFASRENAHCPMFFSLQDED FT APLGVDALAHPWPNVLLYAFPPLCLITPTLFRVKEQGLRLILIAPRWPRAP FT WLAEIIPLLHAQPWPLPLRTDLLSQANGEIYHPRPDRVALWAWPVKGQT" FT CDS 2368..5211 FT /product="DIRS-12_DR_3p" FT /note="tyrosine recombinase at C-terminus." FT /translation="MGNSHNNQGIQASVCQKTPAFQRSSLFSNRRELCSHS FT ERRNLLSPAEGGGSSGTPAPDRAGVLLSLFPSSKERRLSSSYFGSQSVEQT FT FEEIQLQNVNTRFSLTLSKSERLVHIDRSERCFLPHKHLSSAHEISPFRLP FT GRLLRVHSSSVRALTKPEDVLSVCGGRACAAENSWSQDPDIHRRLADHSRF FT ARESNSKYFPCAHSHQSTRFPGECEQKQFHSIPKCDFSGTGAEFNYDARAP FT VARAHSLSSELSLTIQGRSKGTVSYMSQTTRSHGLGYSCPATRPFENEGVH FT EVGFISSSQPVMRSLPVCHSLALVHNSAAPLEERRVLRSWEPPRGCHDAQS FT GNDRCLADRVGRHTGGQDSERRVVPHTSLSPHKLFGASHCVESSKTFSAPP FT AGASCASAVRQHYSCSTYQPPRGDAILQAPRLSSXAFSMEQATFSVVTGDP FT CPRRSEQGCRPFVKGEPTLRGVASPSSNSGPALEEIRPGRRRSLRLARKRP FT LSYVLLATGRGCPPRRGRTGSSMAQCAALCVPSSLPNNPNPIQGEGTRLET FT DPDSTEVAQGSMAGRDNPSSACTAMAPPITYGPPVPSEWGNLPPSSRQGGS FT LGLARERANLNTLGLSPRVIATIQNARASSTRSLYDCKWRVFEEWCEERRL FT ISYQCSVADILGFLQGLMDCGKSFSTIKVYLAAITACHVGFDGSTAGQHPL FT IRRFMKGARRSLPVTRRVVPEWDLSMVLDALTHHPFETLEAISLKHLSFKT FT ALLLALASAKRVSDLHALSVHPSCIKFSLSGEKVSLRPNPAFMPKCFPNFS FT CEGVELSAFHPPPFSSTEDQRLNALCPVRALRAYINRTSSFRRSDQLFISW FT APPNRGNPISKQRLSHWLVEAISLAYESKGVRPPGGLRAHSTRGMAASWAL FT FRGVSLQDICAAASWASPHTFVRYYRLDVTQTPVAHSVLGVGSS" XX SQ Sequence 5860 BP; 1295 A; 1665 C; 1452 G; 1447 T; 1 other; gttctctttc atcatctcgt gttcgagatc cacctatggg aaggcatccg tgcctgacct 60 ctgcagaagc atccaattgc accaagtctg gcttgacaga cagaagcgcg tgcccaatgg 120 caggtaaaga accgcccctt tacctgtgcg cataaatcat tctgcacgcg ctgctattcc 180 tcagttaaca ttcgcttctc gctgactctg caactctaca gctcggttgg attactgctg 240 ctcatttacg tgtcttcagg aggacatatc tttcgatcag cctccaccag acgtgacagt 300 ctgttatccg accaggttag ttgtggtttt cgcatcccga gctgcccacg ctcctcttac 360 acctgtccct gtccacggct gcccgcctcg gatatgtcac tttctaaatc tgccgcttct 420 cccgagcact catcacgggc ctgcccggcc gcgtgtggcg cgcttattgc tgcaaaggac 480 ccacacccct tctgtgttgt ctgcttgggc ctcaagcacg ctcaagaggc gttggaaacc 540 ccggagaatt gcagccattg tcttaaatta ccgaaaaagc ttctgcggcg tcgtatgaaa 600 gttgcagcga ctcaatgtgt tgaattttcc gattcggatg ggggacatgg tgacgacagc 660 gcgcagccgg gggactcccg gggcgctatg tctctcgact gggctgacca gcccaatcct 720 gcgttccccg aagaggacat cttcgcgggt gttccgccgt tcgctgatga accggccggt 780 tctggcgacg atgacgacgc gggtcttctg ggtgtttcag aggacgagga ggccatcccg 840 ccctctggcg ctccccagat tcacactccc gcgaccgtac cccagtccat cctcctagat 900 gtgtgtgagc gagcggccgc tcgcctcaac atcgaatggc cagctccaca gagtgccacc 960 gaccaggaga gggacattta tgatgggaag ttgctggggc ctccaccagg cccgaggaaa 1020 caactttttc ccgtcctccc agcgtgtgcg aagcacatga agcactactg gggcgacccg 1080 ctcaacctta aacaaggtct cgcgggtctg gaggtaaagg atatggcatc gatcggcatg 1140 ggtgacccac ccgttatcga gccatcaatc gctcgacacc tcagccccat tcagggcggg 1200 ctactcgccc ccccgaaacc ggtcctgcca aacaaaatgg accgcttttc tgcctctgtt 1260 caccaggccg cctataggtc ctcagccgta actgtgaggg ctcttaatgt ctcctcgctc 1320 ctctccgctt accaggcgga gctgttggat gacttgggac agcaactgga caagggaacc 1380 ccttccccat cactgtggaa ggagattctc acagttaacg acctcgtcct ccgtaacgca 1440 cgtcaggccg tccaagcctg cgggcgttcc atggcgcttt cggtggtagg agagcgcgcg 1500 ctctggttga atctgtcggg cctccctgac agtgaaaaga ggcggattgc gggcgccccg 1560 gtagagcccg gacaggctct ctttggcccc gctgttgcat taatgcaaca acgctgcgac 1620 gacaaaaaga aggaggatga agccttcaaa ttatgtcttc ctaggaaagc ggctccacgc 1680 caggtgcccc cagcgcgcgc ccctattccc cctgccacag ggcggaattt ccatcagggc 1740 aaggataaac ctcgcaccag accccctccg cgacagggca accatccttc cgcaaaacct 1800 tggggaagaa caacgcccgc tgccgcggga aaaccccctg cccaaacccc ctcggatccg 1860 aaacgcaagc ggccggcctg atgttcagcc aatgggggcg ccgagtctgt gttcacgggc 1920 cacaagccct acagactatt ttcctgcccc agagcttctg tccaaaagac ggagaatagc 1980 tcaaaaggag gtcgtccacc ccatgtgttc agtggacgtt gtgtgttctc gccccaggtt 2040 caaggggtct gttgtgagga aaagtgttca agcagtgtca ccacaccgtt caaactttgt 2100 tcccctcact caaataaagg cttcggccca tgtgttccca aacacaaaac acacaaaata 2160 cacctcaaaa caaacgaatc aaatgaattc gttcccaaaa gaacacattt ctgtgaaccc 2220 ctgcatgtca cccctagttt gtccactaga gggcgccatt gcaccagaaa tgaacaaacc 2280 ggccgaattt gcagtaagca gttctccggt gtgtgtgggg tcacttgcga atcaggtaac 2340 agcctggcag tctgtgtcag cacccgaatg ggtaattcac acaataacca ggggatacag 2400 gcttcagttt gccaaaagac ccccgcgttt caacggagtt ctctcttctc aaacagaaga 2460 gagctctgct cacattctga aagacgaaat ctcctctctc ctgcagaagg gggcggttcg 2520 agtggtaccc ccgcacctga tcgggcaggg gttttactct cgttatttcc tagttccaaa 2580 gaaagacggc tctcttcgtc ctattttgga tctcagagtg ttgaacaaac atttgaggaa 2640 atacagcttc agaatgttaa cacacggttc tctcttacgc tcagtaaatc agaacgattg 2700 gttcacatcg atcgatctga aagatgcttt cttccacata agcatctatc ctccgcacat 2760 gaaatttctc cgtttcgctt accagggcgt ttgctacgag ttcacagttc ttccgttcgg 2820 gctctcacta agcccgagga cgttctgtct gtgtgcggag gcagggcttg cgccgctgag 2880 aacagctggt ctcaagatcc tgacatacat agacgattgg ctgatcatag ccgattcgcg 2940 agagaaagta attcaaaata ctttccgtgt gctcactcac atcaaagcac tcggtttccg 3000 ggtgaatgtg agcaaaagca gtttcactcc atcccaaaat gtgacttttc tgggactgga 3060 gctgaattca attacgatgc gcgcgcgcct gtcgcaagag cgcattctct ctcttctgaa 3120 ttgtctctca caattcaggg aaggagcaag ggtacagtat cgtacatgtc tcagactaca 3180 aggtctcatg gcctcggcta ttcatgtcct gccactaggc cttttgagaa tgagggcgtt 3240 catgaagtgg gttttatctc ttcatctcag cccgttatgc gatctctccc tgtatgtcac 3300 agtctcgcgc tcgtgcacaa cagcgctgcg ccactggagg aacgcagagt tttacgctca 3360 tgggagcccc ctcggggctg tcatgatgcg caaagtggta acgacagatg cctcgctgac 3420 agggtggggc gccacacagg agggcaggat agtgaacggc gtgtggtccc acacacttca 3480 ctcagcccac ataaattatt tggagcttct cactgtgtgg aaagctctaa aacattttct 3540 gccccgcctg caggggcatc atgtgctagt gcggtgcgac aacactacag ctgtagcaca 3600 tatcaaccgc caagggggga tgcgatcctc caagctccac gccttagctc acmagctttt 3660 agtatggagc aggcaacatt ttctgtcgtt acgggcgacc catgtcccag gcgttctgaa 3720 caggggtgca gaccttttgt caagggggaa cccactctac ggggagtggc gtctccatcc 3780 tcaaatagtg ggcctgcttt ggaagagatt cggccaggca gacgtcgatc tcttcgcctc 3840 gcgcgaaaac gcccattgtc ctatgttctt ctcgctacag gacgaggatg cccccctcgg 3900 cgtggacgca ctggctcatc catggcccaa tgtgctgctc tatgcgttcc ctcctctttg 3960 cctaataacc ccaaccctat tcagggtgaa ggaacaaggc ttgagactga tcctgatagc 4020 accgaggtgg cccagggctc catggctggc agagataatc cctcttctgc atgcacagcc 4080 atggcccctc ccattacgta cggacctcct gtcccaagcg aatggggaaa tctaccaccc 4140 tcgtccagac agggtggctc tctgggcttg gcccgtgaaa gggcaaacct aaatacactg 4200 gggctttctc cacgtgttat tgcaactatt cagaacgcta gagcctcctc cacacggtcc 4260 ctttatgact gcaagtggcg cgtgtttgag gaatggtgtg aggagcgcag actgatttca 4320 tatcagtgct cagtcgctga tattttgggc tttctgcaag gccttatgga ctgtggaaaa 4380 tcattttcta caatcaaggt ctatctagca gccatcactg catgtcatgt agggtttgat 4440 ggcagcacgg ctgggcaaca ccctcttatt cgtagattta tgaagggtgc ccgtcgttcc 4500 cttccagtca ctagaagagt tgttcctgag tgggacctct ccatggtgct ggatgctttg 4560 actcaccatc cttttgagac cctagaggct atttctctga agcatctgtc tttcaagaca 4620 gctctgctgc tggccttggc ttcagctaaa cgtgtcagtg atctacatgc actctctgtt 4680 cacccctcgt gcattaaatt ctccctcagt ggagaaaaag tttctctcag gcctaacccg 4740 gcctttatgc caaaatgctt ccctaatttt tcttgtgagg gggtagagct ttccgctttt 4800 caccctcctc cattctcctc cacggaggat cagaggctta atgctttatg tcctgttcga 4860 gctctgcggg cctatatcaa caggaccagc tcttttagaa gaagtgacca actcttcatt 4920 tcttgggccc ccccaaacag ggggaatccc atttctaaac aacgcctctc tcattggctt 4980 gtggaagcca tctctctggc atatgaatct aagggagtgc ggcctccagg gggcctcaga 5040 gcacattcca ctagaggcat ggctgcttct tgggctctat ttagaggggt ttccttgcag 5100 gacatttgtg ccgctgcaag ctgggcctct ccacatacat ttgtcagata ctaccgtctt 5160 gatgttactc agactccggt agctcattct gtcttaggtg tgggctcttc gtagccctgt 5220 gccgtatgct ttttacaact ttttgagcct agttggtcta acttggctta aatttgtccc 5280 tagtgttatc ataccctcac ctgacacaaa ttatatacat tttatatgta caaatacata 5340 tattgtatat catatatata tatatgctct agcatattgt gttttcaggt gtccttaaac 5400 gcatgttacc acatgctttg tgacgtgtca tgcaccgcct ttcaggagac cgtccgctgg 5460 cttggtgggc ctcgctgtga gtgtattggg caaccgggga gctgtccata tctcccatag 5520 gtggatctcg aacacgagat gatgaaagag aacaataggt tactgtcgta accccggttc 5580 tctgaaacat cgagtggaga gatccaccaa gtttgccccg cttgctgcac gagaagcgaa 5640 tatgctcact gaggaatagc agcgcgtgca gaatgattta tgcgcacagg taaaggggcg 5700 gttctttacc tgccattggg cacgcgcttc tgtctgtcaa gccagacttg gtgcaattgg 5760 atgcttctgc agaggtcagg cacggatgcc ttcccatagg tggatctctc cactcgatgt 5820 ttcagagaac cggggttacg aacagtaacc tattgtttcc 5860 // ID TDR14 repbase; DNA; ZEB; 274 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR14. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-274 RA Jurka J. and Drazkiewicz A.; RT "TDR14: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 24-24 (2002). XX DR [1] (Consensus) XX CC Contains 32 bp TIRs. XX SQ Sequence 274 BP; 103 A; 43 C; 42 G; 85 T; 1 other; gaggtaaagt tggttttata ttcacacatt cagactttct ccttaataat ataatttcag 60 aaaangtatt ctttgcaaat tctaaatagg gaaatcatat tagcagccaa tgactatcaa 120 agtacaacat tgttcacagt caaataaata gtgttaatgt tgtttagaag tcatacttgc 180 atgctatttc agaccgaata ttcaaagtag catggtaaac aatataggga ctgcaaaatg 240 aacgaatgtg cgaatataaa accaacttta cctc 274 // ID DNA2-2_DR repbase; DNA; ZEB; 786 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 07-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE DNA2-2_DR is a family of nonautonomous DNA transposons. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW DNA2-2_DR; nonautonomous DNA transposon; KW TA target site duplication; hAT-N18_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-786 RA Kapitonov V.V. and Jurka J.; RT "DNA2-2_DR, a family of nonautonomous DNA transposons."; RL Repbase Reports 4(11), 290-290 (2004). XX DR [1] (Consensus) XX CC The genome harbors several thousand copies of DNA2-2_DR that CC are ~95% identical to their consensus sequence (~100 copies CC were transposed as a portion of the hAT-N18 element). CC DNA2-2_DR elements are characterized by the TA target site CC duplications and 14-bp terminal inverted repeats. CC Additionally, the 3' terminus contains a 63-bp palindrome CC (pos. 718-780). DNA2-2_DR elements are also characterized by CC a high target-site specificity. They are inserted preferentially CC into the cawTAwtg target-sites (TA -is a target site CC duplication). CC Classification of DNA2-2_DR is not certain yet, although it is CC expected to be a member of the Mariner/Tc1 superfamily. XX SQ Sequence 786 BP; 240 A; 175 C; 158 G; 213 T; 0 other; gcccctttca cacagtgata ccggtaaata tccggaaaat ttccggaacg actttaccgg 60 tatattcaaa aaagcgctgt tcacacaggc gaggacgtta cggaattttt ccggaaaaga 120 gcattcacac atccattcca aaataccggt aaattctgac atcattcacc acaaatgagc 180 tttaaacggc tgcgcttgtg tttgtaaaca tttgactaaa ttacaaactc tgtggatgga 240 tcaatattgt gaacaacttc aatgaaaaca tatcgcagga tcactttcgc atgtcgagat 300 gttcataata tgtgtgtgtg ctggcgctca caggctgttt cacaggcaca cgcaaagctt 360 gaaggtaaac aaacaacggc ttatcataag catctcatcg atgattattt acacagttgg 420 cattaagaag aacatataaa cgtgatctga ctaacttcta gcagctaaat gtgtctggaa 480 aaatattcaa aggcttttat tctcataaac cgcgcggacg taaatgcgtc tgactgttgt 540 gattggctaa agcagacgtc tcacgtcagc acgttctaga cgtgcacgcg cttattacgg 600 caatcttcct tctgcattca cacagcgcag cattccggca aattgccggt aatgttacaa 660 cttctctttc cggaaaatag ccagaacgaa tttaccggta ttttcaaaaa ggacctgttc 720 acacatacaa cctttccgga aaattgccgg taattttccg gaaaggtctg tatgtgtgaa 780 aggggc 786 // ID HARBINGERN1_DR repbase; DNA; ZEB; 650 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE HARBINGERN1_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger superfamily; HARBINGER1_DR; HARBINGERN1_DR; KW nonautonomous DNA transposon; TDR10. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Jurka J. and Drazkiewicz A.; RT "TDR10: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 20-20 (2002). XX RN [2] RP 1-650 RA Kapitonov V.V.; RT "HARBINGERN1_DR, a nonautonomous DNA transposon from Danio RT rerio."; RL Direct Submission to Repbase Update (23-APR-2002). XX CC This element was identified originally [1] as unclassified CC TDR10 nonautonomous DNA transposon. CC It has been renamed as HARBINGERN1_DR [2] because it is a CC a nonautonomous derivate of the HARBINGER1_DR DNA transposon. CC HARBINGERN1_DR copies are flanked by 3-bp target site CC duplications CC (TAA) generated upon their integration in the genome [2] (not by CC TA as it was reported previously [1]). There are many subfamilies CC of HARBINGERN1_DR-like elements present in the genome. XX SQ Sequence 650 BP; 161 A; 175 C; 149 G; 153 T; 12 other; gggcgtactc acactakgya cagttgcctt gaaccgkgcc gaagcacgct tgtcccccct 60 cccctctccc ccgacggcct gcactcacat tgcattcgag cccgagcacg cttacgtcat 120 cgatgatgcg ctgttcagtt taanagaaga gaagcgctct cgctcagcac agtggagatt 180 gctttagtta tattgtttta gtcgtttgat atgcagtgac acgcagtcaa atattttgct 240 gaacagatca accacttttg acgctcataa ataatcataa aagtcctcgt gctgcaggwa 300 ttaggaggtt tgctgaaggt gcagctgtca tgcagtgagg ggtttgcgtc tttaataawc 360 tacgacagtt tgcgttcatt gaamagtaag aatgattaat aaatccatat gaaacagtcc 420 cttaaaagts acgtckcgtc ttcagtttcg ggctcaggcg cgctttgcac tcacactaca 480 agcgtaccgc gccaaagccc aagtgaaccr cgctctggca cacctcttcc aaccgggcca 540 gggccggcca astgaaccat gcctgagccc aattcagagc actcacactt ctcaaacgaa 600 ccgggaaacg ggcctgggca cngttcggat agcatagtgt gagtacgccc 650 // ID Helitron-N1_DR repbase; DNA; ZEB; 2581 BP. XX AC . XX DT 31-OCT-2006 (Rel. 11.1, Created) DT 31-OCT-2006 (Rel. 11.1, Last updated, Version 1) XX DE A family of non-autonomous Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Helitron-N2_XT; Helitron-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2581 RA Kapitonov V.V.; RT "Helitron-N1_DR, a family of non-autonomous Helitrons from RT zebrafish."; RL Repbase Reports 6(10), 495-495 (2006). XX DR [1] (Consensus) XX CC This transposon is usually inserted into A|TT target sites (no CC TSDs). Subterminal TIRs: pos. 3-15 and 2550-2538; a palindrome at CC pos. 2552-2571. The Helitron-N1_DR consensus sequence is 76% CC identical to the consensus sequence of the Helitron-N2_XT from CC the frog genome. Therefore, horizontal transfer was involved in CC evolution of these transposons. XX SQ Sequence 2581 BP; 748 A; 775 C; 471 G; 584 T; 3 other; ttaatggggt ttgccatagg caaagtccat tcttacaatt gtgcatggct tattattctt 60 cttcttcctt cttccgtacc atttttcggc gcgtaactag tcccgcagct ttcgtcccag 120 acccttgaaa gtgggctcaa atcgtgcggt cttatcggga atggtgtgct atgactttta 180 taaggggtgg ggggtttttt ggccccgcag gggccaaaaa acccccccaa aaatcccata 240 gacttaacat tgagccgaac tttgacgaat cacagcgccg agtgtgaatt tcatagaaat 300 atgggattca cagcatctgt agggggttgc agcctaagtg agaacatacc tcgcaagggg 360 gtataagttg cacccctggg gctctaggac gtcccaaagt gctcccattg acttacattg 420 gccccattga ctcccattca tttttctagg agagcactaa acgcgccgcg aaatgggacg 480 accttttagg cgttatagcg ccaagtgtga aattcgtaga aatatgggat tcgcaacatc 540 tgtagggggt tgcagcctgt atgagaacat accccaaaaa ggggtataag ttgtacccct 600 ggggtgctag gacgtcccaa aggggtcccc attgacttta cattgcccat tgactcccat 660 tcatttcttg actcacatgt caatcacatt acatagcagt gtcatactga cttgggggtt 720 ggcttatttg actgaggcaa ccaatcagca tctcaatatg attttgaagc tcacaagcca 780 cgccccccca aaccatttaa agacccttaa taactgcccc attgacttaa catggggtgg 840 gatgtcccat tgcagatccc attgacttcc attataaagg tcacacatct ataacattac 900 ataggagtgt catagagaca tgggggtggg ctcatttgac tcaggcaacc aatcagcatc 960 tcagtatgat aatgaagctc acaagccacg cccccaaact ggtcccatag actgccatta 1020 taactggcct acatgcatat ctttgcttag cagtgtccya ttggcttggg ggatggctca 1080 tctgactcag acaaccaatc agcatctcaa tatgataatg aagctcacaa gccacgcccc 1140 caaattggtc ccatagactg ccattataac tggcctacat gcatatcttt gcatagcagt 1200 gtcctattga cttgggggtt ggctcatctg actcagacaa ccaatgagca tctcaatatg 1260 ataatgaagc tcacaagcca cgcccccaaa ctggtcccat agactgccat tataactggc 1320 ctacatgcat atctttgctt agcagtgtcc tattgacttg ggggttggct catttgactt 1380 ggacagccaa ccacattcaa gttcactctg taacctcgcc catagcaaca aaacagagta 1440 ccctagcaac cattcatcaa cagctatatc tcagcatcgg aacatcgtag agacttgggg 1500 attggctcgt ttgactcatg ctagcaaacg gaacttccta tatgctacac atgctagcag 1560 tgactagcta catgctaata ttgactagcc aagtactgta acttgctaga aatgcttact 1620 aagtataaat atttcatcag gcatgtatgt gaggcttgcc tagtaaccac ccagggtacc 1680 ctagcaactg cctagcaacc acccaaatta ccctagcaac cgcctagcaa ccacctagca 1740 accgcctagt aacgccttag taagggccta gcgaccactc aggacaccct agcaaccgcc 1800 tagcaacgcc ttagcaacca ctcagaatac cctagcaacc acctagcaac accttagcaa 1860 ccacctagca accactcagg acaccttagc aaccgcctag caacacctta gcaaccacct 1920 agcaaccact cagaacaccc tagcaaccac ctagcaacca ctcaggacac cctagcaacc 1980 gcctagcaac accttagcaa ccacttagca accactcaga acaccctagc aaccacctag 2040 caacacctta gcaaccacct agcaaccact taggacacct tagcaaccgc ctagcaacac 2100 cttagcaacc acctagcaac cactcagaac accctagcaa ccgcctagca accactcagg 2160 acaccctagc aaccgcctag caacgcctta gcaaccactt aggacaccct agcaaccgcc 2220 tagcaacacc ttagcaacca cctagcaacc acttaggaca ccttagcaac cgcctagcaa 2280 caccttagca accacctagc aaccactyag racaccctag caaccgccta gcaacacctt 2340 agcaaccacc tagcaaccac tcaggacacc ctagcaaccg cctagcaaca ccttagcaac 2400 cacctagcaa ccactcagaa caccctagca accattcaga acaccctagc aaccaaaaca 2460 aatacatttc accgcatttc tgcatttcct gctattttgc ccagtttagt accattttgt 2520 accccatcga tttgccacgc aaaccccatt cacatttcct ttaggaaatg tacaattcta 2580 g 2581 // ID DNA-5-3_DR repbase; DNA; ZEB; 533 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 5-bp; KW DNA-5-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-533 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1235-1235 (2008). XX DR [1] (Consensus) XX CC Copies are ~92% identical to the consensus. The element is CC characterized by 5-bp TSD. The subterminal inverted repeats CC (42-89, 470-518) are imperfect. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 533 BP; 173 A; 70 C; 96 G; 194 T; 0 other; cagggtgtcc gcggggtctt aaaaagtatt aaaagttgat aaatcaatgt agagaaattt 60 aaggccctta aaaagtatta aaaagtctta aatgctattt tgcaaggtat taaattttat 120 atcatttttg attatgcaat gtatggttgt atgctaaagt ttgcctgaat taaatctgcg 180 aatatcagga tgctgtgtag tttatgaaat caatgaaatc ctactagatt tgacatcatg 240 ctgctttgtt tactgcagta accaaggcaa caccattatt tctgcagttc tataggcgcc 300 aacctgctga attagctaga tttaatagat ttttattcag tatatgaata atgttttagt 360 tttggattag tttcttacat taatatttag taaatatact gtaagttaaa atctcgccag 420 tgtttccggt tgaaacctaa agtggttata aggtcttaaa atgtatggaa agagtcttaa 480 aaaaggtctt aaaaggtatt gaatttcact ctctgattcc tgtatatact ctg 533 // ID MuDR-N1_DR repbase; DNA; ZEB; 896 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE putative MuDr-type nonautonomous DNA transposon from zebrafish - DE consensus. XX KW MuDR; DNA transposon; Transposable Element; Nonautonomous; KW TSD 9-bp; MuDR-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-896 RA Bao W. and Jurka J.; RT "MuDR-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1615-1615 (2008). XX DR [1] (Consensus) XX CC The members of this family are 97% identical to the consensus. CC The TSD is 9-bp long, and the TIR is ~70-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 896 BP; 286 A; 183 C; 189 G; 238 T; 0 other; catagatata tacactagat atcgcatagg gaccctgagc atgcgtcaat agcgccgcca 60 cattggtaca gtgctcccag gacaaatgtc attcaaccgc actagtcaag acagtgttat 120 tacgtgaaga tgcgggactt tagcgctgtc tacgggtgta gtaacgagca aacaaagaaa 180 acaaagcaca aaggcagaac atttcatagg taatattaag tttttttctg tttttgtatg 240 ttctgaactt ttgtgctaat caggtaacgt tattgatgat aacagtcact tactgcattc 300 accatacggc aaagcagctc caactcgcac taaacactcg gcttatgcta gttttgttga 360 ataaaatcag caaacaatgc aaaagaaata tgacaacgag atgctgcgct gccagaaact 420 tgtattattg tcggctaacg ttagtgaaag agtcgttcgg gggattcatt cacaaacgaa 480 tcgctccctc cgtcagtatg agaagtgaaa gcaggagagg agctgtgttt caggacatga 540 ttagatcaaa tttaacaggg agggtgaata gtacatttcc gtacacacaa acacaagctt 600 tttgtcagga atgcccgtgc ggtcactgat ccatcaatgt agaaaagtga tgtaaaatta 660 taattttcgt aattagaaaa aaaaaattac atactaacat ccaggaaaac tcccgatcat 720 agatatatgt gtatatgtgt atatctctgg ctttggatgg ccacagtcct ccactgtacc 780 ttggtcccgc attcatttca aaggagcgct accctgtagc aagatggcgg cgctattgac 840 gcattccgtc caatagacaa caataggcca ggcgacatct aatgtatata tctatg 896 // ID GYPSY3LTR_DR repbase; DNA; ZEB; 542 BP. XX AC AL591172; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE GYPSY3LTR_DR is a long terminal repeat from GYPSY3_DR. XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYPSY3I_DR; KW GYPSY3LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-542 RA Kapitonov V.V. and Jurka J.; RT "The GYPSY3_DR LTR retrotransposon from zebrafish."; RL Repbase Reports 2(5), 17-17 (2002). XX DR Genbank; AL591172; Positions 65447 65988. XX CC GYPSY3LTR_DR is a long terminal repeat from the GYPSY3_DR LTR CC retrotransposon. XX SQ Sequence 542 BP; 103 A; 107 C; 101 G; 231 T; 0 other; tgtcagggtt tttggttaat ttctcatgta atgtttttta ttcgccacta gatgtcgcca 60 tttcaacttt tctttttttg gttacctgta atcaccgaga tcagtttcac atgttccttg 120 tttagttgtt tgtataaaag taaccctttt ttcctctgtt ctttgctcgg ttattgttga 180 tccatgcctg tcacctacag tgagtttttc cttcttgtta tttaagctag tctcttgtat 240 ctaatattcc ttggagcttt ttggattttt tctgttcaga agtttttgcc tgtctaagtt 300 gatgtacctt ttggaacctg aagtttcctg tttttgttta cttgaatttt ttgaaacctg 360 aagtttcctg tttttgttaa ctgtttttcg tcttccattt tgaagattga tttttgttgc 420 tgataaaact cctcgaggaa cctgatagtg tctcattgct gggtcttcaa ctaccagtgg 480 ctcagtggta aggaatctac actcacacgc cagggacccg ggttcgatcc ccgagcctga 540 ca 542 // ID Gypsy157-I_DR repbase; DNA; ZEB; 10900 BP. XX AC chr6; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy157-I_DR; KW Gypsy157-LTR_DR; Gypsy157_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-10900 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1575-1575 (2008). XX DR Genome; chr6; Positions 41149821 41160720. XX CC Positions [4444-4866] - Reverse transcriptase CC Positions [6292-6768] - Integrase core CC LTRs are 89% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1517..3319 FT /product="Gypsy157-I_DR_5p" FT /translation="MSHRYANSTQNKQSSFSGTSSHLPPVEHVQNDPMPPL FT ERQTPPLPTNSFFAGQYVTPTQAFRLPGTGASGQVPPLCFDNSPDTRPGGH FT AERAVNTSVCKHSDYGPLGTMTTQSSEHQSFAKMPAEQLLPRPLPGASHHP FT VQSNRPSPQAYSVYSNPFPEPTKSLQGGPAAHITVKHVTMGQAHQMYSPEP FT TLLTGQVQSYSGSTPLPPFANARVPSMTLYNPQVAVDQANLDAPFQMSPPP FT QQNMQVGALAQLPLSTSASQSAYPHVTQFRPPDQVATNQCPPMAAHGGFMQ FT THQVKNVQVFTGNPDCRILVEDWIRDLQYLLEAIELPMHLRFSTVVRHLGG FT EARNLILNLPPHDQTPERAFEELRAEYSDTRGSLDPLADFYERSQYSGESA FT CSYAIALDAKLRTVEEKERGGRPFLDRDSKLTRQFMRGLTDEEVYTRVAPM FT APRLLSFRELQAELRSLARETKKFHTQSTAKKTFAQVHVASRGGENTKMDR FT PKHASELSELTEMVKKLALIQEEQMAKLSQLEMRMASPSLASPLAVQPGKG FT KASQSVGFVCYRCGEQGHTARVCRAVLPESSLAEAQQPKTSPDWHLHFAQN FT LNA" FT CDS 8035..9129 FT /product="Gypsy157-I_DR_3p" FT /translation="MLAFPVVLGLDHLFFSGLQMDIRKNIFWFQADQQYSS FT QNNDAFMHDWHNSSSLALFSAVPPSLASEEINLLWEACHNTCLDEEGKEHF FT LNQLQQNSDVCTNLIGKTSVLTHQIFVNQDVPIKQKPYRVSPTKQRVIKQL FT IDEMLEADIIEPSSSAWASPVVVIPKKTGGYRFCVDFRKLNSVCHNDAYPL FT PTIQEILEFLNGAVVFSTLDLNSGYWQVPIEKKSQYKTAFICSQGLFHFKV FT MAFGLQNAPATFQRLMERVLGELQRKNCFVYLDDIIIYYPSVEQHFLNIQA FT VLDKLKEAHLTVNMKKTHFFRTSLKFLGHIVSAEGIQADPKKTQAVLEFPV FT PKNIKEVKRFLGMTICHLHRCK" FT CDS join(3328..4305,4309..7002) FT /product="Gypsy157-I_DR_1p" FT /translation="MAAGASMGTQQGPHSPVMRISTKGGDTALVGPRNEGE FT VEVNGKACKALIDSGSQITSITHDYWQSHPALQKQKLQSSTIAIEGAAGQT FT VPYHGVLHIDLKVLGKEFKAVPTFVVPDSDYRSSVPLLVGTNVIRASRTHL FT QAAYRLQFLHQVKEKHPEWYTALLKVGSTKQRGVHGIVGPAVYTGHTIHIP FT GGKEMDLRCKVRTGPQRKTYTALIEGHPSLKLPQDILVAKVLVNVKRGCAA FT VRVMNLSQQVVTIKQHTRLASAVLVDKVVEFPDKKHGEAGHSGACLSLDQV FT VASCGVDLSKASVESEDQRVLLKDLLDKNADVFSHPLDYGHTTTVQHEIPL FT VDPRPFRMPYRKIPPSQWQDVRRLLMDMETAGVIRPSKSPYASPVVVVTKK FT DGSLRLCIDYRKLNSCSTRDAFPLPRIEEALEALGQAKYFSTLDLTSGYWQ FT VEVAEQDKPKTAFSTPMGLFEANRMPFGLQNAPSTFQRLMTCFGDLNFTHL FT LIYLDDIIIFSKSFKEHLERLQMVFDRLREHGLKLKPSKCQLVRKEVQYLG FT HLVSAEGIRTDPEKINKVKDWQRPTNCVEVLQFLGFTGYYRRYVKGYSNLV FT SPLYRLTSGDPKRKKRGKQKNPGPHQPFLWTAECEEAFQSLKEKLISVPVL FT GYPDYSQPFLLQTDASGTGLGAVLVQVQNGAERVIAYASRGLSPPETRYPA FT HKLEFLALKWAVTEKFHDHLYGRKFSVLTDNNPLKYVMTSAKLDATGQRWV FT SHLSIFDFDIQYRRGQDNSNADALSRMSNQEVVETLQSCPQQLRISEPSPE FT EAHTTQKSEGVIGEDTVLIEETESCQETREVYMDVGMEALPAMTILELRAM FT QREDGVIGPVLHFKSRNHKPSHSERIEVGATGGLLLKEWKRLVVKKGILYR FT SIRDCHKGVIEQLILPEKLRETIKKALHNDAGHLGFERTLQMIRERFYWPR FT MFQEIKAWCEQCERCCLRKTPTANVRAPLVSIHTSAPMDLVCVDFLMLEKS FT KGGIENVLIVTDHFSRYAQAYPTRDQKASTVARTLWRNFFCRFGFPARLHA FT DQGRNFESAIVKELCKCTGITKTHTTPYHPQGNGTTERFNRTLMNMLGTPE FT PHLKSRWHEHIDAMTHAYNCSQHDSTGYSPYYLMFGRHPRLPVDLIFGSPT FT TNQPCEYSEYVQTLYDSLKQAYALANKTSQLAKGQQKRFPAYIYIYFFFCL FT TYLTVAYCLATQNY" XX SQ Sequence 10900 BP; 3215 A; 2348 C; 2400 G; 2937 T; 0 other; aaaagcttta gcctaacccc acatagcctc ctcttttaca aggattcgtg agtaatacag 60 ctaatgactc tctgagactg ttgtttctaa tttacacttt gagttttttt tcttcaggga 120 cttgttgaac taagttataa ttaaactatt tttgaccatt ttgagtttgt ttcaatatat 180 gccttcctga aaagttatat aagtgtgtgt ttgttgagtt ttgcattgag atgttttttt 240 tgttgttgta ttgttgtttt tttgaacatg actgatggta atgaaagagt gttttgagtg 300 tttttttccc tttatgatta caacttgtgg gtctgtatat gaacagaagt tagacagacc 360 tttgtaagtg tagcgcagtg tttgttttgt ccacaagatg tcgctgtgac actgttttat 420 ttaggtattt ggacgtttct tatgttgttt gagccttttg gcacaccgta gtgttgaata 480 tgtgcgcagt gcgccgtaat ttcctcagtt tgtgttatca acactgatag gtaagacttg 540 ttcatgatag caggatcctt atttaaaata catctttgtt attaatgctc aagagttttt 600 tttactgaga tattctgtgt actaaattgt aaggagcaaa tggagagttg tatagggagt 660 taaaaggttg tttaattttt tattttttgt gatagagtga ttgagggagt gtgcggctca 720 tgcaaagcca ttgtgttgca gtagttaata atactgaatg ttttgtgttg atatttgact 780 gagtttgtgt tatcaacact gataggtgcc cttctgacaa ataaatggtc gagtcacgat 840 cgcgaacctc tctttgcctg catctccagt tgaccacccc acaacacata caacacaacg 900 cagagtagca gcggtggtgc tagaggccag cgagcacgca aagagcgcgt gggttacatt 960 ttggtgccgt gacaacccgg attgtggcaa aattacaccg accaggaaga gtatttaaca 1020 cccagtgatt tgattgctga actctgtaat agatgaaatt taatcactaa actggtgatt 1080 ttggggatac aacaccggca tctcgatatc ctttaaagcc tgcttgcaat ctgaactgtt 1140 tatgttttaa tgtgagtgat ctgttgtcat gaaaaatatt tttttgtgaa aaaaaaaaaa 1200 aagactattc agaattgaga tttcttccat ttgtttctga gattgtattg catctaatta 1260 atatttagtg agaaatacta caggcctttg agtataaagt gacaagtcat ctcagtttag 1320 gaatttttgg cataagtgtt ttttttctgt gtatatatct gacatagctt attattttta 1380 ttattttttt atgaaatttt ttttttctca ttcctacaaa attgaacatt ttacgtaaat 1440 acaacacatt gatacaatta aattacacta caaattaacc tgcttgcaca tagtaaacat 1500 ttttacaaaa tttacaatgt cacacagata tgcaaactca actcaaaata agcagagctc 1560 cttttctggc acatccagtc acctgccacc agttgagcat gtccaaaatg acccaatgcc 1620 acccttagaa agacaaactc caccactccc tacaaactcc ttttttgctg gccaatatgt 1680 gacaccaact caggcattta ggctgccagg cactggggcc tcagggcagg taccgccact 1740 ctgctttgac aatagcccag atacaaggcc aggtggacat gcagaaaggg cagtcaatac 1800 ctctgtttgc aaacacagtg actatgggcc attagggacc atgactacgc aaagctctga 1860 gcaccagagc ttcgctaaaa tgccagctga gcagctattg cccagacccc ttccaggtgc 1920 ttcacatcat ccagtacagt caaatagacc gagcccccag gcttactctg tgtattcaaa 1980 tccctttcct gagcctacaa agagtcttca gggagggcca gcagcacata ttacagtcaa 2040 acacgtaact atggggcagg cccatcaaat gtattcccct gaacccacgc ttctcactgg 2100 tcaagttcaa agctatagtg gatccacccc attaccacca tttgctaatg ctagggttcc 2160 cagtatgact ctatacaacc ctcaagtagc agtggatcag gcaaatcttg atgctccttt 2220 tcaaatgtct cctcctcccc aacaaaatat gcaggttggt gctcttgctc aacttcccct 2280 atctacatcg gccagtcagt cagcctaccc acatgtgaca cagttcaggc cccctgatca 2340 agtagccaca aatcagtgtc ctccaatggc agctcatggt ggtttcatgc aaactcatca 2400 agtaaaaaat gttcaagttt tcacaggaaa tccagactgt aggatactgg tggaagactg 2460 gattcgagat ttgcagtacc ttctggaggc aatagagctc ccaatgcacc ttcgattttc 2520 aactgttgtg cgacatctgg gtggtgaagc cagaaatctc atcttaaacc tacctcccca 2580 tgatcagacc cctgagagag cattcgagga actgagagct gaatacagtg acacaagagg 2640 ctctctagat cctctggctg atttctatga gcggagccag tattcaggag agtctgcatg 2700 ttcttatgct attgcacttg atgcaaagtt acggacggta gaggagaagg aaagaggagg 2760 taggcccttt ctagatcgag atagcaaatt aacccgacag ttcatgcggg gtcttacaga 2820 tgaggaagtg tacacaagag tcgctccaat ggcacccagg cttttgagct tcagagagct 2880 acaagctgag ctccgaagtt tagccagaga gactaagaag ttccatacac agagtacagc 2940 aaagaaaaca tttgctcagg tacatgtagc ttcgagaggg ggggaaaaca ccaaaatgga 3000 tagaccaaag catgcatctg aattatcaga gttgacagaa atggtaaaga agttagccct 3060 aattcaggaa gaacaaatgg ctaagttgtc ccaactagaa atgagaatgg cctctccatc 3120 ccttgcatcc ccattagctg tccaaccagg gaagggaaaa gcaagccaga gtgtgggttt 3180 tgtgtgttac cggtgtggag aacaaggcca cacagccaga gtgtgccgag cagtacttcc 3240 tgaatcaagt ctggctgagg cccaacagcc taaaacttct cctgattggc acctgcactt 3300 tgctcaaaat ttaaacgctt agtgcccatg gcagctgggg catccatggg tacacagcaa 3360 ggaccccact caccagtaat gaggatttcc actaaaggag gggacactgc cctagtgggt 3420 cccaggaatg aaggggaagt ggaagtaaat ggcaaggcat gcaaggctct cattgattct 3480 ggctctcaga taaccagtat tacccacgat tactggcaga gtcacccagc actacaaaaa 3540 cagaagctac agtcttctac aatagctata gagggtgcag caggtcagac agtaccctat 3600 catggagtcc tacacattga cctgaaagtg ctgggaaaag aattcaaggc agtacctact 3660 tttgttgttc ctgattccga ctatcgctct tcagtccccc tgttagtggg aaccaacgta 3720 atacgagcct ctcggaccca tcttcaagca gcttatagat tgcagtttct gcaccaggtc 3780 aaagaaaagc atcctgagtg gtacacagct ttactgaagg tggggagcac caagcaaagg 3840 ggagtgcatg gcattgtggg tcctgctgtt tacactggtc atacaataca catccctggt 3900 gggaaagaga tggatttaag gtgtaaggtc agaacaggcc ctcaaaggaa gacctacaca 3960 gctcttattg aaggtcatcc ctccttaaag ctccctcaag acatcttagt tgccaaagta 4020 cttgtgaatg tgaaaagagg ttgtgccgct gttagagtga tgaatctttc ccagcaagtg 4080 gttacaatca aacaacacac aagactggcc agtgcagtcc tagtggacaa ggtggtagag 4140 ttccctgata aaaagcatgg tgaagctgga cacagcgggg catgtctcag tctagaccaa 4200 gtagtagcca gctgcggtgt tgacctaagt aaagcttcag tggaaagtga ggaccaacgt 4260 gtcctactta aggacctgct ggacaagaat gcagatgtgt tttcctaaca tcctctggac 4320 tatggtcaca ccacaacagt acagcatgag attcccttgg ttgacccaag gccgttccga 4380 atgccatacc gcaaaatacc cccctcccag tggcaagatg taaggcggtt attgatggat 4440 atggagacag caggagttat tcgccccagt aaaagtcctt atgcatcgcc tgtggtggtt 4500 gtgacaaaga aagatgggtc actgcgattg tgtattgact acaggaaact aaactcctgc 4560 agcacacgag atgcattccc tttgcctagg atagaggagg ctctagaggc tttaggacag 4620 gcaaagtatt tctcgaccct cgaccttacc tcaggatatt ggcaggtgga ggtggcggaa 4680 caagataagc ccaagacggc attcagcact cctatgggtc tgtttgaagc caataggatg 4740 ccttttgggt tgcaaaatgc accctccaca tttcaaagac tgatgacctg ttttggcgat 4800 ttaaatttca cccacctcct tatttatctt gacgacatca taattttctc taagtccttt 4860 aaagaacact tggaaaggct ccagatggtt ttcgataggc ttcgagaaca tggtttaaag 4920 cttaaaccct ctaagtgcca gcttgtgagg aaggaagtgc agtacttggg tcacttggtg 4980 tcagcagagg gcatcaggac agacccagag aagatcaata aggtcaagga ttggcaaagg 5040 ccgactaact gtgtagaagt gttgcagttt ttgggattca caggctacta taggaggtat 5100 gttaagggct actctaactt agtttctcct ttataccgcc ttacctccgg cgaccctaaa 5160 agaaagaaaa gaggaaagca gaaaaatcct ggcccacacc agccatttct ttggactgct 5220 gaatgtgagg aggcttttca gtctttaaaa gaaaagctga taagtgttcc agtgctaggc 5280 tatccagatt acagtcagcc ctttctgctg cagactgacg cctcaggaac agggcttggc 5340 gcagttctgg tgcaggtcca aaatggggca gaaagggtaa ttgcatatgc gagcagaggc 5400 ttgagcccac ctgagacaag gtatccggca cacaagcttg agtttcttgc tctcaagtgg 5460 gcagttacag agaagtttca tgaccacctc tatggacgca agttctctgt tctgacagac 5520 aataaccctc tgaaatatgt aatgacctca gccaaactgg acgccaccgg tcagagatgg 5580 gtttctcatc tatccatatt tgattttgac attcagtatc ggaggggaca ggataattca 5640 aatgctgatg ccctctcgcg aatgtcaaat caagaggtgg tggaaacact gcagtcctgc 5700 ccacagcagt tgaggataag tgaaccaagt cctgaggagg cacataccac ccagaagagt 5760 gagggggtca taggagagga cacagttctt atagaggaaa cggaaagctg tcaggagact 5820 agggaggtgt atatggatgt ggggatggag gcactgcctg ctatgactat actggagctc 5880 cgagcgatgc agagggaaga tggggttatc ggtcctgtcc tccacttcaa gagtaggaac 5940 cacaaaccaa gtcacagtga gaggatagaa gtgggagcta ctgggggtct tcttttaaaa 6000 gagtggaaga ggctggtggt aaaaaaagga atcctatacc gcagtatcag agactgccac 6060 aaaggggtga ttgaacaact aatactgcct gaaaagctgc gagaaaccat taaaaaagcc 6120 cttcacaatg acgcagggca tctgggattt gagaggacac tacagatgat aagagaacga 6180 ttttactggc caagaatgtt ccaggaaatt aaggcttggt gtgaacagtg tgagaggtgc 6240 tgcctaagga aaactcccac agccaatgtc agggcacccc ttgtcagtat tcataccagt 6300 gccccaatgg acctggtctg tgtggacttc ttgatgctgg agaagtccaa gggtggtata 6360 gagaatgtgc ttatagttac ggatcacttt tcccgttacg cacaagccta ccccactaga 6420 gaccaaaagg caagcacagt agccagaacc ttgtggagga atttcttctg ccgatttggg 6480 ttccctgcaa gattacatgc tgatcaaggc cgtaattttg aaagtgcaat tgtgaaggag 6540 ttatgcaagt gtacgggcat cactaaaact cacaccacgc cctaccaccc ccagggaaat 6600 gggaccactg agagatttaa ccgtactctg atgaacatgt taggaacccc agagcctcac 6660 ttgaagtccc gatggcatga gcacattgat gccatgacac atgcatataa ctgctcacag 6720 catgactcca ctggctatag cccatactac ctaatgtttg gcaggcatcc gaggctccca 6780 gtggatctca tttttggttc ccccacaaca aaccagccat gtgaatacag cgagtatgtt 6840 cagactctgt atgactccct gaagcaggcc tatgccctgg ctaataagac atctcagctg 6900 gcaaaaggac aacagaaaag gtttcctgct tatatatata tatatttttt tttttgtttg 6960 acttatttga cagtggcata ctgtctggca acccagaatt attaaggaaa cgttacataa 7020 aaatggcaac ccaggaggga cagaaaactt atctatgagt tgtgagtaac aggaaaacac 7080 ttcactattc agcatcactg ccacaccaca aaatggctgc cacttcaaat gattcctcaa 7140 atgactcact tcataagatg gagactatgt taatcaatca attcaataaa tttcagaaga 7200 tgatcttatg aacacaacag tttcagaaac tgaggacatt aaaatccctt tgcccaaatc 7260 acttgaagga ttgccagctt tttcaactca agctacactg ttggctatgg agctcacaga 7320 agtttttgat caccaacctc ccttatcacc aagctcccca tccacactga aagaaactgg 7380 tcttcaagat accctgacaa ttcaacaatt gagcaaaatg atgcaaaccc aatatcagag 7440 tctgaaggac acattgttaa aggaatcaaa tgacataaag cagtctgtgg acacatttac 7500 aataaatgtt aataatgcat ctgaaaaaca attgactgaa atgaaacagc ttttgactga 7560 tcacttcgag cacttcagga aggaggttaa caacatgttg cattggaggt tgaaaaatca 7620 tctgtctgaa atactgaaag attttaaatc tgttatgcaa ccaatggctg ctaccttaaa 7680 tctcattcaa aatgaaaccg atcaatgtgt acaacacatg aataccttta ttgcggagtc 7740 aaattcccaa tcaactttac agaatgaggt acagcagtgt acacaaaaac ttcaacaatt 7800 aacacaggac atcaaaagtc tgaaatcatc gccgcagatt tccacaaaag caaatgcttc 7860 agtgcaaaca gttctcttta catcttctaa tgtggtagat tctcctgtgg ttaaccctag 7920 gttacagcgg gttcattcca ctttacaatc tggaatgggt gattcagatt ttcatttcat 7980 atcaccgtac aaacttctac agtatactta cctgtggtgg ttttggcttc tcagatgctg 8040 gctttccctg ttgtgttggg tttggatcac ttatttttca gtggcctcca aatggatatc 8100 agaaaaaaca tcttctggtt tcaggcagat caacaatatt cttcccagaa taatgatgcc 8160 tttatgcacg attggcacaa ttcctcatct ttagctctct tttctgcagt tcctccaagt 8220 ttagccagtg aagaaatcaa tcttttgtgg gaggcatgcc acaacacctg tctggatgaa 8280 gaaggtaagg aacactttct aaaccaactg cagcagaatt ctgatgtttg taccaatttg 8340 attggaaaaa ctagtgtttt gactcaccag atttttgtca accaagatgt accgataaaa 8400 caaaagcctt acagagtgtc accaacaaaa cagagagtga tcaagcaact gattgatgag 8460 atgttagagg ctgatatcat tgagccgtcc tcttctgctt gggcctctcc ggtagtcgtg 8520 ataccaaaga aaactggtgg ctatcgtttc tgcgtggatt tcagaaagct taattctgtt 8580 tgtcataatg atgcttatcc actccctacc atccaagaaa ttcttgaatt cttaaatggt 8640 gctgttgttt tttctaccct ggacctcaac agtgggtact ggcaggtgcc aattgagaaa 8700 aagagtcaat acaaaactgc cttcatctgc agccaaggat tatttcactt caaagtaatg 8760 gcttttggtc tgcaaaatgc acccgcaacc tttcaaaggt tgatggagag ggtcctaggt 8820 gagttacaaa ggaagaactg ctttgtatac ttggatgaca tcatcattta ttatccctca 8880 gtggaacagc acttcttgaa cattcaagct gttttggata aactgaaaga agcacacctt 8940 actgtgaaca tgaaaaagac ccacttcttc cgtacatcac tgaagttcct tggacacatt 9000 gtgtcagcgg agggaattca agcagaccct aaaaaaaccc aagctgtgct agaatttcct 9060 gtaccaaaaa acatcaagga agtaaaacga tttttaggta tgactatttg tcatttacac 9120 cgatgcaagt gaagtcggcc tgggtgcagt actagtgcag caaactgggt tgggcacaga 9180 ggaagttctt gcttatgcca gtcgaacttt gaatccagca gaacgaaatt ataccactac 9240 tgagcaggaa tgcttagcga ttgtgtgggc tctcgaaaaa tggagatatt ggaaattgtt 9300 acagatcact cttcttcaaa ttggatcttt aaaactcaaa agcataacac tcgcttaatt 9360 tgatgggcac tacatctcca ggagttcaat ttcactgtgg aatatcggaa agtataacac 9420 tgtgccggat gctttgtcta tggctccttt ggaacataag gaagcagagg gatcagtagt 9480 tctcacctgt tttaccgctt tgtctcgtaa gccaggaaaa cctgaacatc ttccctttcc 9540 aatcaccaat tgtgacatct ggaaagctca acagacacat caacacattt acaaccaaat 9600 agtggaagca ggggaactga ctgtgaattc ctctacaaaa tttactatct tggaagacaa 9660 agtctacaga gttgttcagt ttccccataa cactgtatgc caagtatctt acagaaggtg 9720 ttccaacatc aggtactcca gaatcttcac aaagatcctc tcgctgggca tcttggtcag 9780 tttaaaacgt gcaagagact acaagctttg gtttattggc ccaacttgaa ccaaaatatc 9840 aaggagtttg tgcagaattg tcatttgtgc cagagataca agccagaatg ccgcaaacca 9900 cctggctcct tgcaacaaac cattgtgcag cgtccatggg aaatgtttgg agtacacctg 9960 atgggtccat ttcctagaag taattccggg aatgtttttc ttctagtttt tgtggactac 10020 tactctcggt gggtagaact tttcagtcta cgtaaagcca cagctgagac tgtttcacag 10080 attctggtcc gagaaattct cacccgttgg ggagttccag attacatctt gtcggatcga 10140 ggacctcagt ttgtgtcttc tgtttttcag gaactctgta aaacatggaa catcgggcac 10200 aaaatgacaa atgcatacca cccacagaca aatcacactg aaaggttcaa tcgtacactc 10260 aagacaatgg tggcatcata tgtcacagac aatcacaaac attgggacaa gttcttacca 10320 gagttcaggt ttgctattaa ttctgccgta aatgagtcca ccggagtgac tccagctgag 10380 ctaaacctca atagagctct tcgaggaccc atggatgtgt tattgcaacc gcgagatgtc 10440 tatcctgatg atttctgtta tgacaaggta acagagttac ataagacgaa gaattatgtt 10500 cagaagagac tgaatagagc tcgtgaatgg caaaaacgaa attatgacaa gaatcgaaga 10560 gaagctgtgt ttctctttct aaggcagaga agtccttttc tgtaagcttg ctccaaggtg 10620 gctgggtccg tatagaatca tttggaaact aggatccctg aactatgaaa ttgttctgga 10680 agcctcagga gaagatttaa gggtagttca cgtctccaag ctcaaaccct gttttccatt 10740 tgcactagaa ctggaatcca tacaaaaact tcgactacaa gaaatgtttc aagaggacag 10800 tgatgatgaa gactttttag gatttacgga gaaagagatt taacaatcca ggatttaacg 10860 cagtttccct acagtctggc ttttcatatg gaggggggag 10900 // ID CR1-24_DR repbase; DNA; ZEB; 2247 BP. XX AC . XX DT 21-NOV-2008 (Rel. 13.11, Created) DT 21-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-24_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2247 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1698-1698 (2008). XX DR [1] (Consensus) XX CC The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(141..533,455..2083) FT /product="CR1-24_DR_1p" FT /translation="LRSYAINTFWRKAKTSKNMSDWQSFRAIRNKCTQHIR FT QAKSSYFKQYLAGNGPDSQRFWNIIKTMENETVSSHLPILIKHDGVIITDK FT LNMVKHFNRHFVNAGCSSRIALLLTRHQLFTLLPQVLTHFLSGLKNSSPVN FT QTSTLHSPASGPHTFSFRLIKETEVLGELLNLDTHKXAGLDDLDPSFLKKA FT AYIIAEPITCIFNLSLQTGVLSRDWKSAAVTPLFKGGDKTDLNCYRPISIL FT PCLSKILEKLVNKQLIYYLETHNVLNQAQSGFRSGHSCVTATLKVLNDIIC FT AIDKKEYCVAAFVDLAKAFDSVDHEILLDRLRYIGLSESCLVWFRNYCSGR FT LQSVRVEGFLSDPLLLQKGVPQGSILGPTLFNIYINDIASAAGDSYIHLYA FT DDTIIYTSSSSLTTALSSLQASFLSIQRAFTNLHLVLNTNKTKCXVFNRNL FT PHVEIPPKISSLDGTEIDYVDCYKYLGIWLDGKLAFETHINELLKKVKARI FT GFLYRNKSSFTHSSKQHLVKLTVLPILDYGDLIYRSASKTLLHKLDVIYHA FT AIRFVTGAPFNTHHCQLYSSLNWPSLQSRRQIHWFLFIYKTLIRKTPQYLQ FT SLLNIHETCRNLRSSNFINLCIPKVCTSFGRYSFQFSAADAWNHLQQTLKL FT STFIPLSSFKNSIQSIVQDHCACF*" XX SQ Sequence 2247 BP; 636 A; 432 C; 389 G; 786 T; 4 other; cagaaattgg gcttatacca gatgttgaag aggcttgggc atattttact cttttgttta 60 gaactgtttt aaacaaacat gcccctttta aaaaattaga gtaaagaatc gtcttagccc 120 ttggttcagc acagatttag ctgagatcat acgccataaa caccttctgg cgtaaagcaa 180 agacttctaa aaatatgtct gattggcaat cttttagggc tataagaaat aaatgtactc 240 aacatattag acaggcaaaa tcaagttatt ttaagcaata tttggctggt aatggtcctg 300 attctcagag gttttggaat attatcaaaa caatggaaaa tgaaacagtt tcctcacatc 360 ttccaatttt aattaaacat gatggggtca ttattacaga caaacttaat atggtgaagc 420 attttaatag acattttgtc aatgctggtt gtagctcaag aatagctctc ctgttaacca 480 gacatcaact cttcactctc ctgcctcagg tcctcacaca ttttctttcc ggttgataaa 540 agagactgag gtacttgggg aactgctcaa tttggatact cacaaawcag ctggattgga 600 tgatctagat ccatcctttt taaagaaagc agcctacatc attgcagagc ctatcacatg 660 tatttttaac ctttctctcc aaacyggtgt cctatcaaga gactggaaat ctgctgctgt 720 aactccactt tttaaaggtg gtgataaaac agatttaaat tgctacagac ccatctctat 780 tttgccctgt ctctctaaaa tattagagaa gctggtcaat aagcagctca tttattattt 840 ggagacacat aatgttctca atcaggctca gtctggtttt agatctggtc atagttgtgt 900 aactgctact cttaaggttc ttaatgatat tatttgtgcc attgacaaaa aggagtattg 960 tgtagctgca tttgttgatc tggcaaaagc ttttgactct gtggaccatg aaattctact 1020 ggataggcta agatatattg gactatctga gagctgtctg gtctggttca ggaactactg 1080 ttctggtcgt ctacaatcag taagggtgga ggggtttctg tcagatcccc tgctgcttca 1140 gaaaggggtc ccacaaggta gtatattggg gcctacactt tttaatattt atattaatga 1200 tattgcttct gctgcaggtg attcttacat acatctatat gctgatgata ccatcattta 1260 tacttccagt tcttctctga ctacagcact ttcttcattg caagccagtt ttttaagtat 1320 ccaacgcgct tttaccaatc ttcacttagt tttaaatact aacaaaacaa aatgtwtggt 1380 ttttaataga aatttaccac atgttgagat tcctcctaaa atttcctcat tggacggtac 1440 tgagattgat tatgttgatt gctataaata tcttggtatc tggttagatg ggaaactggc 1500 gtttgaaact cacataaatg aactgctcaa aaaggttaaa gcccgcattg gttttcttta 1560 caggaacaaa tcttccttta ctcactcttc aaagcagcat ttggtcaaat taacagtatt 1620 accaattctg gattatgggg atttaattta cagatcagct tcaaaaactc ttcttcacaa 1680 actggatgtt atttatcatg cagcaattcg ttttgttact ggtgcaccat ttaatactca 1740 tcactgtcaa ctgtattctt cactgaattg gccttctctt caatcacgtc gacaaattca 1800 ttggtttctg ttcatttaca aaactcttat tagaaaaact ccacaatatt tacagtcact 1860 tcttaacatt catgaaacat gccgaaatct gcgctctagt aatttcatta acctttgcat 1920 acccaaagtt tgcacttcat ttggtcgcta ctcttttcag ttttctgctg ctgatgcttg 1980 gaatcatctg cagcagacct taaagctcag cactttcatt ccgctatcat cttttaaaaa 2040 ttctattcag tcaatagttc aagatcactg tgcttgtttt tagatttttg ttggttttat 2100 atgctattat ctgcatttgt gttgttattg tattgtgtcc ctastgcttt gtactgcatc 2160 tgcttttcat gttgcctctt ggccaggtcg tcattgtaaa tgagaactgg ttctcaattg 2220 acttacctgg ttaaataaag gttatta 2247 // ID BEL13-I_DR repbase; DNA; ZEB; 5209 BP. XX AC chr11; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL13-I_DR; KW BEL13-LTR_DR; BEL13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1375-1375 (2008). XX DR Genome; chr11; Positions 10718495 10713287. XX CC 'ACTGT' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 813..2636 FT /product="BEL13-I_DR_2p" FT /translation="MVPDDIALQYSRQRGNSDEWKVAEIMKFLQAEILSRE FT RTVQLLKPSNYKDNQSYQKPFKKQDTAFEFKQRKHHIPSAAVLQTTSHRPQ FT SCVFCDSTTHKTELCAETDIATRKEKLMKMGRCFLCLGPKHIARFCKSKVS FT CNSCGSKHHSAVCEKKEKNAPDSVDKDNVVSSFTSHSLKIQPTKQDHSVLL FT QTVRAWAEGPGGRISVRCLLDGGSQRSFISERAVRALKLPVIKRETFTLHT FT FGCAEPVTTKRNTVKVTLQNTQQREQKIEIEALQTPQVCTALMKIPGEHIQ FT AELRRKGLQLADYQDDNACDTELSVLIGADYYWHIVSGHVERITDALVAVE FT SIFGWSVQGPVKMSSVADAACMHSQVNEDTLVSDHLKAFWEIESLGITMKQ FT TDSAEEEESLYKFEKTTQYKNGRYEVELPWRPDKPELPDNYRIAKKRFEGL FT KKRLQSDVVLYHRYNEVVCDYLEQGIVEEVAEDQSSQPTAKYYMPHHAVLR FT EDKVTTKLRVVFDASSHETDSPSLNDCLLTGPNLNPDLLSILIKFRLNAIA FT FTADIKKAFLQISLAEKDRDAVRFLWLAVPPHENKGEKLRVLRMTRVVFGV FT SPAHFSLQLP" FT CDS 2780..5179 FT /product="BEL13-I_DR_1p" FT /translation="MSTAGMDLCKWTTNCHVLKEKWKTVRNDVVPDKEMPG FT SVLKVLGLVWRTETDDFVFDLTALLDAVAKGENTKRAILKLSARIFDPIGF FT LTPFTVRVKCLFQEMWTRGLGWDEELPADIEKEWRKWCSELAQIHHIVIQR FT WYGITSEHRHDAQQLHVFCDASERAYSAAAYLLLELNDGTKTTCLVASKSR FT VAPLKKMSLPCHELMGAVIGARLGNSLLKPMNMDPQSLHMWTDSMIVLQWI FT QSPAHKWKQFVSNRVAEIQSITNPAMWSHCRSKDNPADLPTRGQTVTNLKE FT NELNSCEDNSGMESLLNLAKYSKLNRVLRITAWIKRFMHNSKSSSKRRGEL FT TADEMFEAEKYWIKVTQDSNFRHEITLLKAHKGLNADSKIKELKPFLDDDE FT LICVGGRLQQSKFSYREKHPWILPKRARYCELLVQHNHELTMHSGLRDTLV FT QIRSKYWILQGRQLVKTILSKCAVCKRFKAKPSQQDTVPLPRDRIVETPPF FT EVTGIDFAGPLYVKNDRVLCKAYIALFTCAVTRAVHLELVSSQSTESFLLA FT LKRFVSRRGLCKIIYSDNAKTFKRANQDLSELWQTIKDPQVLEYFSGKGIT FT WRFIVERAAWWGSFWERLVRSVKTCLRKVLGRASLTFEEMCTLLTEVEAVL FT NSRPLTFVNNELDDPQPLTPAHFLVGERLTALPPKPFPADYDHPTVNKEDM FT TWRWRYRNRLITVLWNRWRKDYLLDLKSAHSCRIQKPTELKIGDIVLIGEA FT NMPRQSWKLGKIELLFPGRDGKVRACAVRTSAETVLRRPIQLLYALEIE" XX SQ Sequence 5209 BP; 1696 A; 1025 C; 1235 G; 1253 T; 0 other; gaaagtttgg tgccgaaacc cgggaaacta gcaagcaata tccgaaatgg cggaacagct 60 ggagagatta ctccgaaaac ggagagctgt acgcggatcc acaacgagac ttttgcagga 120 cattgaaacg gaggttagca aagaagatcc gattgttgat catttgcgta aactgttagc 180 cctgttatct gcaaaagagg aaacgctgct ggagttggat attaagatag aagaaggaac 240 ggaaatggat gatttggaga atgaagttga gaatgttgag gaatataaag gaaggataat 300 cactgtgaaa acccgcgcgc acagagtgat acagaagagc cgcgaaagca gcaggcagag 360 caacacagac cccccgcaac aaggacagac agttaagtta ccgaagttaa ttattaacaa 420 gtttaacgga gaaattggcc aatggcagga cttttggaat caatttgaca cagctattca 480 taaaaatgac gcgctgagca aaacggaaaa gtttaactac ttaaagacat atttaaccgg 540 cgctgcttca aaggccatag cggggctgat gttaacagac agtagttacg accatgctat 600 tgatctgctg ctgaatcgat tcggacgaaa agatctgctt attaacgcgc atatgaacaa 660 agctgctaag tctttgtccc gtaaggaaat cccatgacgt cactgctctg agacagctat 720 atgacgagtg tgaagtgcag atccgcagct tagaattgct gggtgtcgta tcagatgcct 780 atggaagcct gttatgtccc gttctgctac agatggttcc agatgatatt gctctgcagt 840 acagtcgtca gaggggaaac agtgatgaat ggaaagttgc agaaataatg aaattcctgc 900 aagctgagat tttaagcaga gagagaactg tgcagctgct taaaccaagt aattacaaag 960 acaatcaaag ctatcagaaa ccattcaaaa aacaagatac tgcatttgaa tttaagcaga 1020 gaaagcatca cattccatca gctgcagtac ttcaaactac cagtcataga ccacagagct 1080 gtgttttctg tgacagcaca acacacaaaa cagaactgtg tgcagaaact gatattgcta 1140 cacgaaaaga aaaattgatg aaaatgggac gatgtttttt gtgtctagga ccaaaacata 1200 tcgcaaggtt ctgtaaaagc aaagtgagct gcaattcatg tggaagcaaa catcacagtg 1260 ctgtatgtga gaaaaaagag aaaaatgcac ctgacagtgt agacaaagat aatgtggttt 1320 cctcatttac ctcacattca ttaaagatac agcccactaa acaagaccac agtgtgttgc 1380 tgcagacagt cagagcttgg gctgaagggc ctggtggacg aataagcgtt cggtgcttat 1440 tagatggagg cagccagaga agctttatca gtgaaagggc tgtcagagct ttaaaattac 1500 ctgtaataaa acgagaaaca ttcactctac atacctttgg ttgtgcagaa ccagtgacaa 1560 caaagcgaaa tacagtgaag gtaactctcc agaacaccca acaaagggag cagaaaattg 1620 aaatagaagc acttcagaca ccccaggtgt gcactgctct gatgaaaatt ccaggtgagc 1680 acatacaagc agaactgaga agaaaagggc tgcagttagc agattaccaa gatgacaatg 1740 cttgtgacac agagttatct gtgttgattg gtgctgacta ttactggcac atagtgtcag 1800 gccatgtgga aagaataaca gatgctcttg tagctgttga aagcattttc ggatggtcag 1860 tgcagggccc ggtcaaaatg tccagtgttg ctgatgcagc atgtatgcat agtcaagtca 1920 atgaagacac actggtttca gatcacctaa aagccttctg ggaaattgag tctcttggta 1980 taaccatgaa gcagacagac agtgctgagg aagaggaatc tctttacaaa tttgaaaaga 2040 caactcagta caaaaatggc cgatatgaag ttgagttgcc atggcgacct gacaaaccag 2100 aactccctga taactacagg attgctaaga agagatttga gggcctgaaa aaaaggttgc 2160 agtcagatgt agttttgtat cacagatata atgaagttgt gtgtgattac cttgagcagg 2220 gcattgtcga agaagttgcg gaggatcagt catctcagcc cactgcaaaa tattacatgc 2280 ctcatcatgc tgttctgcgt gaagacaagg taacaacaaa gctcagagtg gtgtttgatg 2340 catcatcaca tgaaactgat tctccatcac taaatgactg tttgctaact ggtcctaatc 2400 tgaacccaga tcttctcagc atattaatca aattcagact gaatgcaatc gcatttacag 2460 cagacatcaa gaaggctttc ttgcagattt ctcttgctga gaaggacagg gatgcagtac 2520 gatttctctg gcttgctgtg ccaccacatg agaataaagg cgagaagctg cgtgtgctac 2580 ggatgacgag agtagtgttt ggagtgtccc cagcccattt ctccttgcag ctaccgtgag 2640 aaagcatctg aagaaatatg aagagcagaa tcctgaggta gtaaacataa taaaagaatc 2700 actatatgtt gatgatttca tctcaagtgc aagtgatgtt gaaaaggcct tctccataac 2760 aactactgca aaacagatta tgtcaactgc tggcatggac ctgtgcaaat ggacaaccaa 2820 ctgtcatgtg ttaaaggaaa aatggaaaac agtgagaaat gacgttgttc cagataaaga 2880 aatgccagga tctgtattga aggtattagg cctggtttgg agaactgaga cagatgactt 2940 tgtctttgat ttgactgcgt tactagatgc tgtagcaaag ggagagaaca caaaaagagc 3000 cattctgaaa ctctctgctc gcatattcga cccaatagga ttcctcactc ccttcactgt 3060 gcgagtcaaa tgcctatttc aagagatgtg gacgcgtggt ctcggctggg atgaggagct 3120 gcctgcagac attgaaaaag aatggagaaa atggtgctca gagctagcac agatccacca 3180 catagtcatc cagcgctggt atggaataac atccgagcac agacatgatg cacagcaact 3240 tcacgtgttc tgtgatgcga gtgaacgagc ctatagtgct gctgcctacc tgttactaga 3300 gctgaacgat ggaacaaaga ctacttgcct tgtcgcttcc aagtccaggg tagcgccttt 3360 gaagaaaatg tcactgccat gtcatgagct tatgggtgcg gtgattggtg caagactagg 3420 aaacagcttg ttaaaaccga tgaatatgga tccccaatca cttcacatgt ggactgattc 3480 tatgatcgta ctacagtgga ttcaaagtcc agctcacaag tggaaacagt ttgtgtcaaa 3540 cagagtggct gaaatacagt ctataactaa ccctgcaatg tggtcacact gtaggagcaa 3600 agacaatccc gcagaccttc ccactagagg tcagactgtg actaatctaa aggaaaatga 3660 acttaacagt tgcgaggaca acagtggaat ggaatctttg ttaaatttag caaagtacag 3720 taaactgaac agagtattgc gaataactgc ttggatcaag aggttcatgc acaacagtaa 3780 atcaagctct aaaagacgcg gtgagttaac tgcagatgaa atgtttgagg ctgaaaagta 3840 ctggataaaa gtgacacaag acagcaactt caggcatgag attactttgc taaaagctca 3900 caaaggccta aatgcggact caaaaataaa agaactgaaa cctttcctgg atgacgatga 3960 attaatctgt gttggaggca gactgcaaca atcaaaattc agttacaggg aaaagcaccc 4020 atggattcta cctaaaaggg ccagatactg tgagctgcta gtgcagcaca accatgaatt 4080 aactatgcat tcaggtttga gagacactct agtacaaatt aggagtaaat actggatttt 4140 acaaggccga caacttgtga aaactattct gtcaaaatgt gctgtgtgca aaagatttaa 4200 agcaaagccc tcacaacagg acactgtccc cttgccccga gataggatag tagagacccc 4260 tccatttgaa gtgactggca ttgattttgc aggtcctctg tatgtaaaaa atgatcgtgt 4320 gttatgcaag gcctacattg cattgttcac atgtgccgtc accagagcag tgcacttgga 4380 gctggtttca agccaatcaa cagaaagctt cctgctagca cttaagcgat ttgtatctag 4440 aagaggactg tgcaagataa tttattcaga caatgccaaa acattcaaaa gagccaatca 4500 agatcttagt gagctgtggc aaacaattaa agatcctcaa gttctagaat atttctcagg 4560 gaagggcatc acctggcgtt tcatcgtaga gcgagcagcc tggtggggca gcttttggga 4620 gaggttagtt cgatctgtga agacgtgtct gagaaaggtg cttgggagag catcactaac 4680 tttcgaagag atgtgtacac tacttacaga agtagaagct gtacttaact ccagaccact 4740 tacctttgtc aataatgaac ttgatgaccc acagccactt acacctgcac actttctagt 4800 tggcgagcga ctaactgcac tgccaccaaa accatttcct gcggactatg atcaccccac 4860 tgtgaacaaa gaggatatga catggagatg gagatacagg aatcgactca taaccgtcct 4920 gtggaaccgc tggagaaagg actatctgtt agacctgaaa tctgcacatt cgtgccgcat 4980 acagaaacct actgagctga aaattgggga cattgttctc attggagaag ctaacatgcc 5040 aaggcaatct tggaagttag gaaaaattga acttttgttt ccagggcgag atggtaaagt 5100 tcgagcatgt gctgttcgca catctgcaga gactgttttg agaagaccaa ttcaattgct 5160 atatgcatta gagattgagt agatgaacag ttgttcatgg ggtggagga 5209 // ID ERV1-N7-LTR_DR repbase; DNA; ZEB; 688 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, long terminal DE repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR; ERV1-N7-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-688 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2176-2176 (2008). XX DR [1] (Consensus) XX CC The 5'- end of the LTR is 5'-TA. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 688 BP; 267 A; 122 C; 128 G; 171 T; 0 other; tataggagaa tttctgatgc atatagagga acttttatta tgtaatcact atataatctg 60 ccactattaa aatatagatt actattaaaa tacactgtta aaacatatag aaatacattc 120 aagtgatgca gaggaaaaca cacagacatg cagggcctta taattgacga aaacacacac 180 aggcatacag ggccttataa ttgagggcct cacaattgaa gcacagcacg tcatgcgcct 240 aaatgagaac attcagatag agcgtatgca atacatttag acaaaagtat gtattgacca 300 agacaaaaac aacccttgac aaaatagcat acataaggag aatcatttag aaagtatgga 360 aacaacatga tcatgtgcaa acacatgatt gtaagcacat gctatatgac cttaagatat 420 gtaaaaacag aaaaacatac agccattaat taaaaagcta ggatactatt ggagccggga 480 aatcgcgtga tgctatacga catgtcagga tgactatggg aaaaaactgt atataaactg 540 tgggtttttg aacattggga ggagagagat cgatcgacca actctgctgt aacaataaac 600 tgcttcaact tgagaacttc gaagacctca gcctcttttt tgaaacttag aagatgtttg 660 tcgagatcca gcgcaacttt ccacaaca 688 // ID Copia-8-I_DR repbase; DNA; ZEB; 1620 BP. XX AC . XX DT 20-FEB-2009 (Rel. 14.02, Created) DT 20-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Copia-8-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1620 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 554-554 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1620 BP; 289 A; 640 C; 427 G; 261 T; 3 other; gaagactcgg ccaagaacat ggatccagcg ggtattcaaa tcatgcgcct ccgtcaggat 60 gcccggtcaa ttgaggagta cgtggaggac ttyattaatc tggcccaccg tacaaccmtg 120 agtgaggtat gtctaatgat tttctttcat ggaggacttt ctgagccact gtactccaac 180 atgccactgc acaatcccca ctggacactg gcacaattta ttgatctggc tctccaaatg 240 agtggatctt ccttcaccat aggagaggtg gacgaaaatc caaataatct tttggggggg 300 gggggctgta gagagcaaga cccgctggct acggggcttg cgacccattg tccacagcca 360 gctcacaaga tggccgctcc cagtcagcca gctcacaaga tggcttccag ttcaaagtct 420 gttcctgctc ctgaaaggcc atctccagag ctgccattcc ctccagtgct gccagatcct 480 ccagagctgc cagaccctcc agtgctgcca gaccctccag tgctgccaga ccctccagtg 540 ctgccagayc ctccagtgct gccagaccgt ccagtgctgc cagaccctcc agtgctgcca 600 gaccgtccag tgctgccaga tcctctagag ctgccagagt cactgccacc gccagagctg 660 ccaccgccag agctgccacc gccagagctg ccagagccga cgccgctgcc agcgcctcag 720 ccagagccga cgccgcagcc agcgcctcag ccagagccga cgccgcagcc agcgcctcag 780 ccagagccga cgccgcagcc agcgcctcag ccagagccga cgccgcagcc agcgcctcag 840 ccagagccga cgccgcagcc agcgcctcag ccagagccga cgccgcagcc agcgcctcag 900 ccagagccga cgccgcagcc agagccgacg ccgcagccag cgcctcagcc agagccgacg 960 ccgcagccag cgccagagcc gacgccgcag ccagcgcctc agccagagcc gccgccgcct 1020 gagcttcccg agtggtcgcc gccgccgcct gagcttcccg agtggtcgcc gccgccgcct 1080 gagcttcccg agtggtcgcc gccgccgcct gagcttcccg agtggtcgcc gccgccgcct 1140 gagcttcccg agtggccgcc gccgcctgag cttcccgagt ggtcgccgcc tgagcttccc 1200 gagtgtccgc cgctgcctga gcttcccgaa cggccacagc tcctcgccct gtcggcgcca 1260 tccagactcc tcgtcctgtc ggcgccatcc agacagctca cccagccggc gtcatccaga 1320 ctcctcgcct tgccgatccc agcccggctc ctcgccctgc cggctcccta caagcctcca 1380 accctgccgg ctcctcgcag tactccaggt ccttcgctgc cacacgatcc aggaccactg 1440 cagctccact ctccgggtct tccgcagctc cagtctccag gccctccgca gctccacgcc 1500 ccaggtacgc cccagctgca tggtcctggc cctccatccc tccccctgtt ccgcctccgc 1560 tccgcctccc gcctgaacag taagtggagt gtctggaatc cactcctaga gggggggcta 1620 // ID Gypsy-14-I_DR repbase; DNA; ZEB; 6455 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-14_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-14-I_DR; Gypsy-14-LTR_DR; Gypsy-14_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6455 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-14_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 1-1 (2005). XX DR [1] (Consensus) XX CC Gypsy-14-I_DR is an internal portion of the Gypsy-14_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-14-LTR_DR. Gypsy-14_DR is characterized by 4-bp CC target site duplications. The internal portion encodes two CC proteins: the 572-aa gag Gypsy-14_DR1p (pos. 100-1815) and CC 1518-aa Gypsy-14_DR2p polyprotein (pos. 1845-6398) composed of CC the protease, reverse transcriptase, and integrase domains. CC PBS is complementary to Arg-tRNA. The internal portion CC is flanked by 99% LTRs. XX FH Key Location/Qualifiers FT CDS 1845..6398 FT /product="Gypsy-14-I_DR2p" FT /translation="MQSHSSKIKDGKLPRGLVGERCIAQVTILGQKCNCLI FT DTGSQVTTIPLSFYDQHLSETPINSISDLLEVEGANGLSVPYIGYIELNIT FT FPAEFIGTSVELPTVALVVPEPESCTMPLIIIGTNTLDTLYDEHLKTSSMT FT FQPSLFGYRTVLNTIRLRHKHNAGALGQIRLRSKVPEVMAAGQTVVLEGIA FT HMHGFPTDKWIVVEHPSISFLPGGIVVKNCLLTLPSRRYSHLPVVLTNETD FT HNITIPQRCVIAEIHALESVLSSDNAIPKQGFQSDEPKVIKEEEFILNFGD FT SPLSQEWKDRITKKLREMPEVFAQHDLDFGHTQKMKHCIKLHDESPFKHRA FT RPIHPRDIGAVRKHLQELLLSGVIRESESPFSSPIVVVRKKNGDVRLCIDY FT RKLNLQTIKDAYALPNLDETFSALNGSKWFTVLDLKSGYYQIEVAEEDKPK FT TAFVCPLGFWEFNRMPQGVTNAPSTFQRLMEKCMGDINLKEVLVFLDDLIV FT FSASLEEHERRLLQVLARLKEYGLKLSPEKCRFFQTSVRYLGHVVSEQGVA FT TDPEKIEAIKTWPCPRNLKELRSFLGFSGYYRRFIKDYSQIAKPLNELTSG FT YPPSRKSCKKAEKEYQYHDPKEPFGGRWTATCQRAFEILIEKLSTAPVLGF FT ADPKLPYVVHTDASTTGLGAALYQEQNGKQRVIAFASRGLSYSEARYPAHK FT LEFLALKWAVTEKFHDYLYGGTFTVITDSNPLTYLLTTAKLDATSYRWLAA FT LSTFSFRLQYRAGKQNVDADSLSRRPHESLPEDQLSVKEFERINKFVEYHT FT MDTENPQNTDRDIIRAICEKHSVQQSDDSGVTLVESLAIHPDAVPECFEQE FT DSLNGLAALPLLMKEDLAERQRCDPVIREVVMCMESGEKPLPALRLEYPDL FT PFYLREWDRLELIDGVLYRRRREGDSVNFQLVLPEDLRSFALNSLHDQMGH FT MGTERTLDLVRSRFYWPRMGTEVEHKIKTCSRCVRRKSLPQKSAPLVNIQA FT TRPLQLVCMDFLSLEPDKSNTRDILVITDFFTKYAVAIPTPNQKARTVAKC FT LWDQFIVHYGFPERLHSDQGPDFESHTIKELCAIAGIQKIRTTPYHPRGNP FT VERFNRTLLNMLGTLEEKDKSHWKDFVKPLVHAYNCTKHESTGFSPYELMF FT GRQPRLPLDLAFGLPCKDCKQVSHSQYVQSLKSHLKESYEVASQNARRIAE FT KNKIRYDKHVTNSSLEVGDRVLVRNVRIRGKHKLADRWESEIYVVVKQAAN FT LPVYTVRPETKDGPLRTLHRDLLLPCGFLPASKIPDPVVSKPIVRHNTQKN FT PDSQILDSSVSVLEMDNDIPEMMCGAYPIDETRFTTVHQVLRPNVSDEVPH FT NPEVIHSRSADKVAQSDQSSDLPADVPEQAPPTENIREHLPVGVPTDSLPD FT VSEITESSPIPDAKSVNDQENSQEECSGQTQTSDEEGNDLLEDLPRRSTRQ FT RKGPKMLTYPQLGNPLMHVVHSFFQGLNTVLLNALDNSESLNDLPGKSNIQ FT DV" FT CDS 100..1815 FT /product="Gypsy-14-I_DR1p" FT /translation="MDVVMQEGIKIPNAIIISGETETADDEEFLKILQKHG FT SIVRSVKIQDPESEFNQDVIIEFDSGSALQSLEPLLPYTYQLTSDPNITYL FT VRSLSSVYTQQLGGSATKSYLKGLKEIAKLSGANFEAMLSEMLTEMSAVVL FT PASSVSEAPDKPPITPSGQEPISGKTSPVMAQPSNEGNQEMHQTAAHVADS FT SVLTSPTILNPPEVQRLVVEHVVRSGEGVAQAHIPMRLRLFSGRKPRPANE FT TDYDTWRSSVDLVLKDPAISDLHGSRKILDSLLPPAADVIKQLSADAAPSA FT YLQLLDSAFGTVEDGDELFAKFMNTLQDAGEKPSAYLYRLQAALNIAIKRG FT GVLASEADRHLLKQFCRGCWDDGLIADLRLAQKRDDPPTFAQLLLMLRTEE FT DKHTAKTIRMKQHLGSSKQRALMNTQRIWVCDEPKQSTDCNVLSLATEAKE FT LKKQIATLQSQLAKLATKSEAPKKLASQTVDRKASFKDKKSTKPSVESTMP FT MHYKQTDKPRPWYCFQCGEDGHIASSCDSEPNPALVAEKRKLLREKQSQWE FT SQNNMAKAALNQSQFLLRDSQGLKH" XX SQ Sequence 6455 BP; 1925 A; 1338 C; 1426 G; 1766 T; 0 other; taaaattggc gagccagcca ggaggtggtg ctgtattgct gagaactgac aggtgattaa 60 aatactattg gtgtttggaa taatcatttc actgataaaa tggatgtagt tatgcaagaa 120 ggtatcaaaa tcccaaatgc tattataatt agtggggaaa ctgagacagc agatgatgaa 180 gaatttctga aaatcttgca aaaacacggt tctattgtta gatcggttaa aattcaagac 240 cctgaatctg agtttaatca ggatgtgatt attgaatttg acagtggcag tgctttacag 300 tctttagagc ccctgttgcc gtatacttac cagttgacta gtgatccaaa tatcacttac 360 ttagtgagat ccttatccag tgtgtacact caacagttag ggggtagtgc taccaaatca 420 tacctaaagg gattgaaaga aatcgcaaaa cttagtgggg caaattttga agcaatgttg 480 agtgaaatgc taacagagat gagtgctgta gttcttcctg caagctctgt atctgaagct 540 cctgacaaac cccctattac accatctggt caagagccaa ttagtggcaa aacatctcca 600 gttatggctc agcctagcaa cgaaggaaac caggaaatgc atcaaactgc tgctcatgtc 660 gctgattctt cagttcttac ttcccccaca atacttaacc caccagaagt acagaggttg 720 gttgttgaac atgtggtgag gagtggagaa ggtgttgccc aggcacacat acccatgcga 780 ctcagactgt tttctggtag gaaacctcgc cctgctaatg agacagatta tgacacatgg 840 cgttctagtg tcgatctcgt tttgaaggat cctgcaatat ctgatcttca tggttcaaga 900 aagattctag atagcctttt gcccccagct gctgatgtca ttaaacaact gagcgctgac 960 gctgctccat cagcttacct gcagttgcta gattctgcct ttggtacagt cgaagatggt 1020 gatgaacttt tcgctaaatt catgaatacc ttgcaggatg caggagaaaa accttcggcc 1080 tacctgtata ggcttcaggc agccttgaac atagccatca agcgtggtgg tgtgttggcc 1140 agtgaagcag accgacacct tttgaaacaa ttttgtcggg gctgctggga tgatggttta 1200 atagctgacc tgcgattggc acaaaaacga gatgatcctc ctacttttgc gcagctatta 1260 ttgatgttac gaaccgagga agataaacat actgcaaaaa ccattcgtat gaagcaacat 1320 ctaggtagtt ccaagcagcg tgcactgatg aacacccaaa ggatctgggt atgcgatgaa 1380 cctaagcaaa gtacagattg caatgtgttg tcattagcaa cagaagctaa agagcttaaa 1440 aaacagatag ccactctaca gagccaatta gctaaactcg ctactaaatc tgaggcccca 1500 aagaaactgg catcacaaac cgtagatcgg aaagcttctt tcaaagacaa aaaatcaacc 1560 aagccaagtg ttgaatctac aatgcctatg cattataaac aaactgacaa acccaggcca 1620 tggtactgtt ttcaatgtgg ggaagatggc cacattgcct cctcctgtga ttcagagccc 1680 aacccagcac ttgtggctga aaagcgtaaa cttctgaggg agaaacagtc ccagtgggaa 1740 tctcaaaaca atatggcaaa ggctgcttta aaccagagcc agttcctgtt gcgggacagc 1800 cagggactga aacattaaag tgtcccccta aaactaaagc ttctatgcaa agccatagtt 1860 cgaaaataaa agatggcaaa ttacccagag ggttggttgg tgaaagatgc atcgcacaag 1920 tcactatttt agggcagaaa tgtaactgtt tgattgacac aggctcccaa gttacaacta 1980 ttcccttatc cttttatgac caacaccttt cagaaacccc catcaactcc atttctgatt 2040 tgttggaagt tgaaggtgcg aatggccttt ctgtgcccta cattgggtac atagaactga 2100 atataacttt tcctgcagag tttattggga caagtgttga attacctact gtcgctttgg 2160 tggttccaga gccagagtct tgcaccatgc ctttaattat tatagggaca aatactttgg 2220 acacccttta tgatgaacac ttgaaaacga gttctatgac cttccagcct tccttgtttg 2280 gctacagaac tgtcctgaat accatcagat tgcggcacaa acataatgct ggtgctcttg 2340 gtcaaataag gttgaggagt aaggtaccag aagtcatggc ggctggccaa actgtcgttt 2400 tagagggcat tgcccatatg catggctttc caactgataa atggattgtt gtggaacacc 2460 cctccatatc tttcttaccc ggtggaattg tggtgaaaaa ttgcttactt actctccctt 2520 ctagacgata tagccatctt cctgtggttc tcacaaatga gactgatcac aatatcacta 2580 tcccacagag gtgtgtcatt gcagagatcc atgctttgga gtcagtgctt tcttctgaca 2640 acgctatacc taaacaggga ttccaaagtg atgagcccaa agtaatcaaa gaagaagagt 2700 tcattttaaa ttttggtgat tctccattat cacaggaatg gaaggatcga atcacaaaga 2760 aacttcgtga gatgccagaa gtctttgctc aacatgatct ggattttgga catacacaaa 2820 aaatgaaaca ttgcatcaaa ttacatgatg aatccccgtt caaacaccgc gctagaccaa 2880 ttcatccacg tgatattggg gctgtaagaa aacaccttca ggaacttctt ttaagtggag 2940 tcatacgcga gtcagaatcc cctttttctt cgccaattgt tgtagtcaga aagaaaaatg 3000 gagatgtacg actctgcatc gattatcgga agttaaatct tcaaacaatc aaggacgcat 3060 atgctttgcc gaaccttgat gagacatttt cagctttaaa tggatccaag tggttcactg 3120 tactagactt aaaatctggc tactatcaga ttgaagttgc agaggaagac aaaccgaaaa 3180 cggcctttgt gtgtcccctg gggttttggg aatttaatcg tatgccccag ggggtaacca 3240 acgccccaag cactttccaa aggctaatgg aaaagtgtat gggtgacatt aacctgaaag 3300 aagtgctagt gtttttggat gatttgattg tcttttctgc atcccttgag gaacacgaga 3360 gacgtttgct acaagttctt gctcgtctga aggagtatgg gctaaaactt tcaccagaaa 3420 agtgccgttt tttccagacc tctgttagat acttgggtca tgtagtatcc gagcaaggcg 3480 tggcaacaga ccctgaaaag attgaggcta taaagacctg gccttgtcca aggaacctta 3540 aggagttaag gtcattttta ggcttttcag gctactaccg taggtttatt aaagactact 3600 cacaaatagc aaagccctta aatgaactga cttctggtta tccaccctcg aggaagagtt 3660 gtaagaaagc tgaaaaggaa tatcaatacc atgatccaaa agagcctttt ggcggacgtt 3720 ggacagctac ttgccagaga gcatttgaaa tcctaattga aaagctttct actgctcccg 3780 tcttgggatt tgcggacccc aaacttcctt atgttgttca caccgatgct agcaccacag 3840 gattgggagc tgcattatat caagagcaaa atgggaagca acgtgttatt gcatttgcta 3900 gtagaggctt gtcgtatagc gaggcccgat acccggccca taaactcgag ttccttgccc 3960 ttaaatgggc tgtaactgag aaatttcatg attatcttta tggcggcaca tttactgtca 4020 taacagatag caatccattg acataccttc tgacgacagc taaattggat gctaccagct 4080 accggtggct tgctgcactt tcaacattct catttcggct gcagtaccgg gctggtaaac 4140 aaaatgttga tgcagacagt ttgtctaggc gaccacatga atcattgcca gaagaccagt 4200 tatcggtaaa ggagtttgaa aggatcaata agttcgtgga gtatcatacc atggatactg 4260 agaatccaca gaatactgat agagatataa tccgagctat ttgtgagaag cattctgtcc 4320 aacaaagtga tgattctggt gtcactctag ttgaatcttt ggcaatccat cctgatgctg 4380 tgccggaatg tttcgaacaa gaggatagtc taaacggatt agctgcttta cctctcctta 4440 tgaaggaaga cttggctgaa cggcagagat gtgatcctgt catccgtgaa gtagtgatgt 4500 gcatggaatc tggtgaaaaa cctctgcctg cattacgact agaatatcca gatcttccat 4560 tttacttacg agaatgggac cgtcttgagt tgatagatgg agttctgtac agacggcgac 4620 gagaaggaga ctcagtcaat ttccaactgg ttttgccaga ggatcttaga agttttgctt 4680 tgaacagtct acatgatcaa atgggtcata tgggaacaga aaggacgcta gatttggtcc 4740 gttctaggtt ctattggcca aggatgggaa ctgaagtgga gcacaaaatt aagacttgta 4800 gtcgttgcgt acgtaggaaa tccttgccgc agaaaagtgc gcccttagta aacattcaag 4860 ccacaagacc cttacaactt gtttgcatgg attttctttc tcttgaacca gataagagca 4920 atactcgtga tatcttggtt attaccgatt tctttacgaa gtatgcagtg gccataccaa 4980 cgccgaatca gaaagcaaga actgttgcaa agtgcctatg ggatcagttc attgtgcatt 5040 atggtttccc tgagcggtta catagtgacc aaggaccaga tttcgaatct cataccatca 5100 aggaactttg tgcaattgcg ggaatccaaa agattagaac aacaccatat catcctcgag 5160 gaaatcctgt ggaacggttt aaccggactt tgttaaatat gctcggaacg ttggaagaaa 5220 aggacaaaag tcattggaag gattttgtca aacccttggt acatgcatat aattgtacta 5280 aacacgagtc tacaggattt tctccatatg agttgatgtt tggccgtcaa ccacggttgc 5340 cgcttgatct tgcttttgga ttgccatgta aagactgtaa acaagtgtct cattcacagt 5400 atgttcaaag tcttaaatct catctaaagg aaagttatga ggttgcttca caaaatgcac 5460 ggaggattgc tgaaaagaac aaaatccggt atgataaaca tgtaactaat tcttctttgg 5520 aagtcggaga tcgtgttcta gtaaggaatg tacgcatccg tggaaagcat aagctagctg 5580 accgatggga atctgaaatc tatgttgttg tgaaacaagc tgctaactta ccagtttaca 5640 ctgtccggcc agaaacaaaa gatggtccac ttcgtactct tcatagagat cttttgcttc 5700 cctgtggatt cttacctgct tccaagatcc cagatccagt tgtgagtaaa cctattgtaa 5760 gacataatac tcagaagaat cctgattcac agatccttga cagttcagtt tctgttcttg 5820 agatggataa tgacattcct gaaatgatgt gtggagctta tccaattgat gagacaagat 5880 ttactacggt tcaccaagtg cttaggccga atgtctctga tgaggtccct cataaccctg 5940 aggttattca ttcacgctca gcggacaaag tagctcagag tgatcaaagt agtgacttac 6000 ctgcggatgt tccagagcaa gctccaccta ctgaaaatat aagagaacac ttacctgttg 6060 gtgttccaac agactcctta cctgatgtct cggagatcac tgagtcaagt ccaattcctg 6120 atgcaaagtc agttaatgat caagaaaatt ctcaagagga atgtagtgga cagacccaga 6180 cctctgacga ggaggggaat gatcttttag aagaccttcc cagacgatca actagacaaa 6240 gaaagggtcc taaaatgctt acctatccac agcttggaaa ccctttaatg cacgtagtgc 6300 attccttttt ccaaggctta aacactgttt tgcttaatgc attggataat agtgaaagtc 6360 tgaatgattt acctgggaaa tccaacatac aggatgtata atatgttaag ttattctaaa 6420 ctgtcacgag acgtgcacag ttaagtgggg gagga 6455 // ID Gypsy46-LTR_DR repbase; DNA; ZEB; 331 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy46_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy46-I_DR; KW Gypsy46-LTR_DR; Gypsy46_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-331 RA Dib M.R. and Naveira H.F.; RT "Gypsy46_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 822-822 (2007). XX DR [1] (Consensus) XX CC Gypsy46-LTR_DR is a long terminal repeat of the Gypsy46_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy46-I_DR. XX SQ Sequence 331 BP; 62 A; 64 C; 69 G; 136 T; 0 other; tgttacatgt ccatgtattt attatttatt tatttgttac tttgtcgccc tcttgtgttg 60 tgttgatggt tgtgttttca tttagagttg atttcctccg cacctgcaga caattcatcg 120 tcggctattt aagagctcgt gctgcagtga ttcagagacg gggatttgag cggccgtcca 180 gacgtggagt tgtgctcctt atgttctcag cctgtgttta ttgtgtttcc tccccacact 240 ttatttatat ttgattgttt aattaataaa acccctagac tgatccattt gtgtttttcc 300 ctcatttaat gttgtgactg gggtcgtaac a 331 // ID DNA-2-20_DR repbase; DNA; ZEB; 8652 BP. XX AC . XX DT 18-NOV-2008 (Rel. 13.11, Created) DT 18-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-20_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-8652 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1711-1711 (2008). XX DR [1] (Consensus) XX CC TSD is 2-bp long, and TIR is ~62-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 8652 BP; 2704 A; 1279 C; 1466 G; 3200 T; 3 other; cactgtaaaa aaaatctgta aaataacagt taaatgactg tatattttaa cagatttttt 60 ccgtatacac gtattacggg tgtttttccg tttatataca ttttcgaatt gcattgtggg 120 aaggggatcg cagctcagac agcgtttgat ggacaccggg acatatttct tcctctgttt 180 tttccctggt aagtacgatt aataaaacat ataacaatat tttagttaaa agactgcttt 240 attacctttt tatcgttatt taaaatgatt tttgtcttgt atgaagcgcc catggaaagt 300 acagcagtga attacgttta atattgttaa taagatcaga ctttatttca ttaagatcaa 360 actgagctat gagtggttaa atgtaattta aacaagcatt tacctaatga ttttttacct 420 ttttatgcgt tttcctcatc agtaaatatg atatctgttc atttacagtt gtttgtccgt 480 atttttctaa actgccgtgc gcgggaagtg acgtgacgtc ccaatcaccc actcactccc 540 tctgcacctg cgtgagtctg actggattaa cggtaaaacg tcagcttgtc tgtgagagtg 600 taaaaggtat gttaattatt ttatagaaat ataaatgttt tctttgtaca gtgcattgaa 660 caagggatag cattcaaagt ttgtgacact gtgtagtttg gttgatatat agtgctgaag 720 gtgtatatcg atgactggtt aacgttacct ttaatgaaca ctaacgttaa cgtgtttcaa 780 acgctcgtaa cacctgaccg tctgtgtttg tttgcatatt aataactaat ttcatatttt 840 tactgacgtt atcaaatgtt agctttagtt tgaccgacgt attaaggctg tgtgaccgag 900 tttcagtgga acagctttgc tggtttttaa gtaagttata agttagactc gcgtgaattg 960 gaggcattgt gtaactcggt atgaggggaa atacacactc gtaaagttat tattagtggt 1020 gggccgttat cagcattaac gtgctgcgtt aacgcgagac ttttatcgcg cgataaaaaa 1080 aaatatatat cgccgttaat ctattctcaa cgttgggttg ggagctgggt gactttcata 1140 ctttgatatt ttagcgcgga tgtatacctg accggtgagc cgtctgacaa acaagtgccc 1200 ttmtgaatca aatcagcagg atgcctgact ttaactgcag ttcggcagtt tcactttaac 1260 tcatgaacat ttcattcatc ccgcgtgaca aactggggta atagacgcaa ataagaagca 1320 agaactgatg agagtgttat ggaatttaat aacgctcgcc aaatggaacg aaaaaaaaaa 1380 cttccgcatt tcgcgacgcg tgtttgtgtg tacagagata acagagtgat acagtaagtg 1440 tgtgacattt aaaagtattt atagtgttta cactgtcatg cgttttcggc ctcaaataca 1500 acgtgtacct taaagcacac aattaaatgt ttgttgtagt cgagttgaac tgtcattgat 1560 attatattta tgaacgtctt gtcaacttgc agattcagcc aagttcactt ttaagaacct 1620 caagggcagc taatatgaag atataaagat gagttgttct tccacagtct taaaattctt 1680 caaactaatt gtcagttcga ttcattatag caaataatac tttctttact attaggagat 1740 caggagtttt attttaatga aaccttgata atagagtgaa ctacacatgg tgttatcgca 1800 aatgaaatct ttttatgtag attttgagtc tgagttttat ttgcacatat aaggttcact 1860 gatattttgc tttgttgact ttcaggttct ttcattttgg ttataagtac atgttacagg 1920 taaattctta gattcaatcc aaactgaatt tgaattaata cggacaaatg aaataaaata 1980 cacctgggat tttaaatgtt ttcatttttt tattattcac aacgtttgtg aagtctaaat 2040 gtagactgtc tcaaactgaa tgagtgttat ttttgttgta ggtgtgttgg tgtaaaaatt 2100 gatgctgact ggttagtact gcatttttaa ccataacaat aatgatttat atttagcatt 2160 gcatctattg tcattttacc aagaatgtgt ttgctctttc agaactcagg cagttgtgct 2220 ggaccttctg taccttatac aggagatgac agtaaacttt caagtttaag aggtaaaatg 2280 ttcagggttt ttctgtttga ttttgagtgt tttgtagata taaggttcac tgatatatga 2340 tataagattt gctttatgtt tatgttgtca ttttggttat aagtacaagg tacaggtaca 2400 ttttcacagc aatctaaatt aaaatcttaa atcttacatt caatccaaac tgaatttgaa 2460 ttaatatgga caaatgaaat aaaatacacc tgggatttta aatgttttca tttttttgtt 2520 atttacaaag tttgtaaagt ttaaaagtag actacgtgct gcatgtttca aactgaatga 2580 gtgttatttt tgttgtaggt gtgttggtgt aaaaagtgat gctgactggt cagtactgca 2640 ttttttaacc aaaaaaataa acaatgattt atatttagca ttgcatctat tgtaatttta 2700 ccaagaatgt gtttgctctt tcagaactca ggcagttgtg ctggaccctc tgtaccttat 2760 acaggagatg acggtaaact ttgaagttta agaggtaaaa tgttcagcgt tttttctgtt 2820 tggttttaag tgttttgtac atataaggtt cactgatatt ttgctttgtt gayttccagg 2880 ttctgtaatt ttggttataa gtacaaggta caggtacatt ttcacagcaa tctaaataaa 2940 aatcttaaat cttacgttca atccaaacta aatttgaatt gatatagaca aatgaaataa 3000 aatacagctg gggttttaaa tgttttcatt tttttttatt attcacaaag tttgtaaagt 3060 ataaaagtag actacgtgct gcatgtttca aactgaatga gtgttatttt tgttgtaggt 3120 gtgttggtgt aaaaattgat gctgactggt cagtactgca tttttataaa taatacaaaa 3180 aattaaaata aaaaatgatt tatatttagc attgcatcta ttgtcatttt accaagaatg 3240 tgtttgctct ttcagaactc aggcagttgt gctggaccat ctgtacttaa tacaggggat 3300 gacagtaaac ttccaagttt aagaggtaaa atggtcaggg tttttgtttg atattgatgc 3360 ttttattttt attatttggc aaaaccaatt acttgtacct atatttttac taagtgttta 3420 catgaacaca ttttgcttca gtaagttttg aattttgtta acatgtattc aaactaaaaa 3480 gtactgggtt gatgtgggtg ttttatctgg gttttttaag ctgctactgt aatctgttaa 3540 atgaaccaag cagttattta ttattattat ttatttttgc atcatttgaa atgttacaag 3600 ttctttattt tggtaatgtc tgtttaaagt gataaggaaa taactggtcc actgctgaat 3660 ctcttttctt ttctctctta gccacctgat gatgagagag acaatttcaa tgtgggcatc 3720 ttgatggtga gccaggcctt catgctgatt ttgtgaccgt ttaatctatt tagcgtcatt 3780 gtaataactt tatttcagga tgcaaaactg gctaaaaatc tactttaaat ttcaacatta 3840 attgtatgat agtgtggttg atttaacatt ttatttaatg ttgggttcat tgtaaaaaaa 3900 taaagattta attttggttg aataaatgct gattttggga cagttttgtc tatttatcat 3960 catagaatta ccattatttc agggtggaaa actgcctaaa aatcagtggt gactttttaa 4020 cgttgtttca atgatagttt gattgatgtt ttaacattta ttcaatgtta atttaacgta 4080 ggtttgctct ctgggtaaga actaacttaa aaaatgcgaa aaacttatgt atgtgccact 4140 tgaattttgt atcacatgtt attggcatgt tatggaattt aactatggca aataagtgag 4200 aggagatgga aagctatgat gtctacacgt tttcctgaaa catttgcaag gtcaattctt 4260 tcttgtataa agatagtgga aaagaaagaa atgtaaaaca aaatggtttg taattttttt 4320 acattttttt ttctttcctt tattgttttt ttttttgcct tatttttgtt cttgcactat 4380 aattatttct tgtaatattt gattaagtaa atatatattt tttaatttct tttaaattat 4440 gtagcagaca tgtatttgaa taattttaca taataattat aatatttata atattattaa 4500 taatattaat aatttaacat ataggtgtgc atatatgtac ccatatatgt acatgtaata 4560 taggaaaaca gccaatttta tatttgtcac atatttcaaa aacctaaatc aaaacctata 4620 ttaaaccata tgtttatata ggttctttcc atgtggacgt gtaccagacc caacacacaa 4680 ctcacttttc ttttcttttt ttcagaagaa gagtccacag tcacaataac aagatcattg 4740 agctgtcccc taattgtggt gagctacatg tttagtctct tatgtgctaa acattatgta 4800 gcctaaggaa gcagccctga ctctggagtt tatatgaagg tatgttttta catttttaag 4860 tttatttcat ttttataaga caagtaagtt gatacatgtt tgttgtattt acaacaggag 4920 catgctgagt aactatccag aatgtaggtc caaagcagaa aaaagtgcac caagcagaaa 4980 cataccatct aaactccttc gattcctgtc agaccagaaa gactttcaga atccttggaa 5040 gatttaaagg taattaagca caccggtaat actttataat aactaaacac tattgattaa 5100 tttattaagc attagcaaaa agttctgtta agctttaact ctacattaac agaccttagt 5160 atacagttta taactacagc tgcaaatgct gtaatcttga cttataagca catctataat 5220 gtgcttaatg attttgtttt tatattttgt taatgattaa tttttcatta ctaaatttag 5280 tatagcatta tttacaaacc agtgatttaa gaatagttgg tgtatttttt ttctaatcat 5340 tcagaatgtg taagtaaatg gttaataaac tattcaaact aacatttata tatcttatta 5400 ttcaggcata tagtaatagt tgggcagcac agtggctcag tggctagcac tgttgcctca 5460 cagcaagaag acctccgggt gctccggttt cccccacagt ccaaagacat aagctatagg 5520 gaaattgatt aacttaattg gctgttatgt atgagtgtgt ttgtaaataa gtgtgtatgg 5580 gtgtttccaa gggcgtactc atgctatgca cagttgcctt gaaactggaa gcgtgcttgt 5640 ctcccccgac gacccgcact cacattgcat tcgagcctga gcatgcttac atcatcaata 5700 acgcactgtt cagtttaaca agtgctctca ctcagcacag tggagatttc tttatattgt 5760 tatatcgttt tagttgttca atatgcagtg acacgcagtc aaatatttca ccgaacagat 5820 cagccacttt tgacgctcat aaacaatcaa agtcaacgtg ctgcaggtac taggaggttt 5880 gctaaaggtg cagctgttgt gcagtgaggg gtttgtgtct ttaatctatg acagtttgca 5940 agggactttg atttggataa attaaccact attactgttc aatgaacgca aaccccttac 6000 tgcaccacag ctgcgccttt agcaaacctc ctattcctgc agcacaagga gtttatgatt 6060 gtttattaac atcaaaagtg gctgatctgt tcggcgaaat atttgacagc gtgttactgc 6120 atcccaaacg actaaaacaa tataactaaa gaattctcct ccgtgctgag agagagcact 6180 tcctaaacta aacagcgcat catcgatgat gtaagagtgc tcaggctcga atgcaatgtg 6240 agtgcgggac gtcaggggag acgggaggga ggacaagagt gcttcggcct ggttcaaggc 6300 aactatacat agtgtgagta cacactaagc tgaaggaaaa tacatgaatg aatagtaatt 6360 gttatacttg tatataaagt ataaaaaata agtatataaa gtatatacaa tacttatttt 6420 aataagtgct tttgttaact tcatccagtt ttgtagccta aagtgaggat tgtttatgct 6480 ttgtaaatcc ccttataaat cacaattaaa gattcagtta tattctaaaa atgaaattca 6540 ttttattgta ttaaatgaaa aataaatctt atctaaaata aaattactgt acagtttaaa 6600 catagctaaa taacactgaa atgttgcatc gtaatatatc attaaattat tttattgttg 6660 ttttatccag ttttatgtta tattttggca ctcctgctat tcagcaaggt ttcatattta 6720 cagtaattac attttttaga taagactgta atgatttatt gttcatttga tactgagcct 6780 ttaactgtcg tttataagga tttataaatc ataaataatc ctcactttag attaggccac 6840 caaactggaa gaarttttgt aaataatgca atattaaata taatattata tatatatata 6900 tatatatata tatatatata tatatatatt ttttttttta atctataaat agtaatgtaa 6960 aatcaatcat ttaagtttga taacacaatt attaaactgt agctgtagca aattttgcag 7020 ctgtatttat aaccaactgc taacatctat taatgtagag ttaatgctta acagataatg 7080 aattcactat ttgctaatgc ttcataaatg aatcatagcc cgtagttatt atagtgttac 7140 cagcatagtt tgtttatatt actgtttctc tcttgcatta ttgttccgaa tattgtcaaa 7200 agtacctaca ctttattttg aaaaatctta aagtgatttc tcacccaaaa ctgaataatt 7260 gatcattcat aatttaccta ctctcttgac attttaaacc tgtatgactt tgtttcctct 7320 gcaaaacaca aaatatcata ttttaaagaa tgttgctcct atttctccta ttttgttctg 7380 ttgaagtatg tcagtcagac aggcttgaaa tgacaagagt gagacatgaa gatgatgata 7440 aaactttcat ttgtgtaaac tattgacatg caaggaatca ccccagttgt aagtgaaatg 7500 tcattagtaa tgtttttttt tttttttttt ttttacagcc attgaagtat cagtattgga 7560 gcggaagaag acccagttgt atcgtctgtt tcctgccatc ttgcagttaa cctggtttct 7620 taatgtaaag tacttgtaat ttgcccactt gtcctcaatt atcttaaagc taacatttaa 7680 aactaatttg ctatgagatt atttgccaag acttctcact gtgtgattca catgtcaata 7740 tgtatttatt gtgtgtataa ctagctattt ggaattcaaa tgatttgtat tcattaaaga 7800 atctgtttta attgcaaagt tgtttttgta ctaatcattg taagttcaca gttaacaaac 7860 tatttactgc ttctctacac atacagtata gaatacatgt tttgttttac tatggtaaac 7920 cattaatagt gtatttccca tatgtatatg tttatcctta ccaactgcta tttctatgct 7980 gcttgaagag cattaaatta gtaaaataat atttttttta cagtaggtat cccattaaaa 8040 caaagaatct gttcattttt tcactgtata tatactatta aaaaacaaaa aagtcctctt 8100 tttacagtag gtatacaatt aaaacacaga atctgtctgt ttttttacag taggtatact 8160 attaaaacac agaatctgtc tgttttttta cagtaggtat actattaaaa cacagaatct 8220 gtctgttttt ttacagtagg tatactatta aaacacagaa tctgtctgtt tttttacagt 8280 aggtatacta ttaaaacaca gaatctgtct gtttttttac agtaggtata ctattaaaac 8340 acagaatctg tctgtttttt tacagtaggt atactattaa aacacaaaat atttccgctt 8400 ttttactgtt ggtataccct taaaaacaca gaaaataacc gtttttttat atagaaaacc 8460 cttaaattac tataaaatgg ttcgttattt tacagtagtt taaccgtcaa aaaacggata 8520 ttttttagtt tttttaatgt ggacacactg tcaattaaca gaagttttcc gtttttaatt 8580 tacggtaaaa tatttgtgta atgagctgcc agtacatttt accgtttttt tacggaattt 8640 ttttttagag tg 8652 // ID DIRS-1D-I_DR repbase; DNA; ZEB; 6041 BP. XX AC . XX DT 19-FEB-2009 (Rel. 14.02, Created) DT 19-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE internal portion of DIRS-like LTR retrotransposon family - a DE consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; phage integrase; DIRS-1D-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6041 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 9(2), 540-540 (2009). XX DR [1] (Consensus) XX CC This is the internal portion of DIRS LTR retrotransposon. The CC associated LTR portion is DIRS-1D-LTR_DR. This sequence was CC derived from sequence data generated by the Danio rerio CC Sequencing Group at the Sanger Institute. XX SQ Sequence 6041 BP; 1512 A; 1727 C; 859 G; 1935 T; 8 other; taaaaagctt tagctctaaa agcttttaca gcattgcact ttttagccca tcaatatgga 60 aataaccaac aatatctaca actgcaacca gagccctgct ctccctgaga cttcagccac 120 tgcaatgaag ccaaactctc accgctcttc gcggtccacc tgtggtaacg tagtatgctt 180 taaggggaaa cttctctctt taaggagaaa ctactctcca acaagactaa atatttctta 240 tgtctttaga aaagcaagac tctcataaat aaaaaaataa tttaaaccat tttaaagaca 300 caaacactat ggacttaacg tcctcctata atctgtaaac aagcagagat tataacaaac 360 caaataaacc cacctttcaa gcttgctgtc tgttcacagg tccataacat tttaacagcc 420 tggctgattt gactgcaagg atcaaagcaa tccaatccat aaaactttag tccacaacat 480 gtttacatgt ttataaacat acatggatac actgctgtca tccaaacttg ttgctggagt 540 gctttcagta aagtagtatc tttcagacaa aaacacttgt attatggtct aacaaaatgg 600 ctgccgacct tttgattgac acttcattga gccaatagct gaaaagagaa gtgtcaccat 660 ccaagagctg tcagaacatg agatggtccc gccctctctt acgatagtta tgaacagact 720 tggttgcaaa gactttgtga atgaattggg ccaattatgc ttctgatttc attcatgtta 780 agattccaaa tgtaatgtta tttaatcaat cagattaaag taaagcccat agtaaacatt 840 taattgacac agtttaataa gtattgttca agtgcaaatt aaatgacaac atcattttag 900 aagttaaata aataaataaa ttaaacaaat aactaacata aataaataaa taaacatgca 960 tataaataaa caaaccctgc cacttccaat agctatttaa cttctaaaat aaaattaccc 1020 acctaatgac aataagcatc acatcgatca tagaagtccg atcgtaatat gtcacaggaa 1080 tttaaactta tcacttcacc tactccactt taaagaggta tttttaagat caccagccaa 1140 gccatgctca atatccttgg ccctccacta tcctgggggg gaatgaacgt ctaccaaaac 1200 atttaaaaca ctatctgtct gtcgattcct cacctcgcta gagaactgcg ttctcaccac 1260 taattttttc tgattttctc atttcaggtc taacctatgg attccacccg ggcacctcag 1320 ctctcccttc ctatcctaat attcagtcag cttttgctga ttgaaccaga aattgtggat 1380 catcttattt aaaaagagat cgacaaccac tttatgtttg accattctct gccccaccac 1440 tcaaggtttc acgcatttrt cccataagca gtgcaactcg caaattctcc atcactggct 1500 tctttctaac aatcgcagaa tcccccacct tatccaactt tagatgatat tctaatcacc 1560 ccccccccac attgcctcca gccaaacacc tagagatcac caaaaggttt tcgccaatct 1620 aggtattctg ctcgcagaag gctggaccca atacatttat cgaatttcta ggcatcaact 1680 tcgatttgct caaattccaa gcttctcttc ctaaagagaa aattggctga ataatttctc 1740 tttctcttgt attataagaa agagacttga attaaacaca tactgttatc tatcctcggc 1800 cacctaaaat tcacagtgca aatttacctc aaacttgcct actccagctc cctaccacaa 1860 gtcacgattt aaaaggcaca atttctctct caaaacccac ccacaatgaa ctccgcttat 1920 ggattgcttc cttaagctat ggaacggctg ttccttcttc tgcagagatt gttcgcaccc 1980 ccattagata tttgtctatt tactggtgct accccatcag ttgggttcca agggtactca 2040 gaggccactg gttgccacca catggccccc tcaatctatt tccactctag aaaccattgt 2100 tcatcagctt tcttcaaact gtacccaata gtcgctgccg ccatactatg gggagacgaa 2160 tggtccacct ctaggattct cattyattgt gttgaggaag aaacaatata atgcatagac 2220 aaaggcgttc ctactctcta agctttaatg ctacttttaa gatgtcttat ttgacattag 2280 tcaatttatc atgtctgcat aatacgtgcc aggttgcaaa accaaatcac tgactctctc 2340 tcattttctt tccagaaact ttggtcactg gctacagaag tggaccccaa ctcaacgcct 2400 gtccatccct ttccacaaat gacattacct taagccaccc ccattataac ctccaccaaa 2460 ctccatatca cttaccatta aagtagtagc cttcagaccc ttcagtcaca catcacagca 2520 tgaaacattt aaactcaacc ctcactccaa cccaccatag cagatctaat tttgtgtgat 2580 gaggaaagaa agggtttcct cgtcctgcaa agtaagatag atctattgac aaagaacatt 2640 tcatttacat cttcaacatt cctgccctca cccacccagt ccaaactctc tagtatacct 2700 acactcgagg attataagaa acaaaccccc tagcccccct ttttactgat gacgctaatc 2760 gtccagtatc acatctctgg ttccagaaac acctcaaagt tgtccacctc tcaggatttc 2820 ctcccgaacc aatctccagc cattcattca gaattggcat tattaaaagg ctttcccagc 2880 accagatcca gacccttggt cattggttct ctgaagcctt tcaaatccta cattcgcctc 2940 agtcttttca tctcaaagaa cccaagttgt ttctttcaaa cctctcagcc caatcttggc 3000 caccccttta ttctcaagcc aaagccatct atctgtccat ccctagcaaa ccccctcacg 3060 gctcagactc ccgctggagt aaatatgagc ttcagctctc gcagagctta acatgaccct 3120 ctttctgttc cccctgccag cattgagttc ctctgctgtt tcctcctagt tctttactcc 3180 ctgcctttga ctctcatgta atctccttgc tacatcagca tcaccctaag ctcagatttc 3240 gcaggggtca acatgagctt tggctctgac aagagcctac cctatcaccc tctcctttct 3300 gttccccctg ccagcgttga gtaattccgc tgttccctcc tagctcttta ccccttgcct 3360 ttggctctct ggtaacctcc ttgctccatc tgcatcaccc taagctcaga ctctcgcaag 3420 ggtcaacacg agcttcggct ctcgcaagag cctcaccttt caccctctcc tttctgttcc 3480 tcctgccagt gttgagtgct tccactgttc cctcctagct ctttacccct cgcctttggc 3540 tctcttgtta ccaccttgct caatctgctt aaagctcaga ctcctgtagg ggtcaacaca 3600 agtttcggct ctcccaagag cctccccttt caccctctct tttctgttat ccctgccagc 3660 gttgagtact tccgctgttt cctcctagct cttcactcct tgcctttggc cctctcataa 3720 cctccttgct ctatctgcat cacccaagct cagactctcg caagggtcaa cacgagtttc 3780 ggctctcgca agagcctcac ctttcaccct ctcctttctg ttcctcctgc cagtgttgaa 3840 tgcttccact gttccctcct agctctttac ccctcgcctt tggctctctt gttaccacct 3900 tgctcaatct gcttcaagct cagactcctg taggggtcaa cacaagtttc ggctctccca 3960 agagcctccc ctttcaccct ctcttttctg ttatccctgc cagcgttgag tacttccgct 4020 gtttcctcct agctcttcac tccttgcctt tggccctccc ataacctcct tgctctatct 4080 gcatcaccct aagctcagac tcctgcaggg gtcaacgcaa gctgtggctc tcataagaac 4140 ctcacctatc aacctctcct ttctgtatsc cctgccagcg ataagcgctt ctgctgttcc 4200 ctcctaattc ttagctctta actccttgcc ttgactctca actaacctct ttgctccatc 4260 ttagcatcac cctaagctca gacccccaca ggggtcaaca cgagcttcgg ctctcgcaag 4320 agcctcccct atcaccctca ccttcctgtt tccactgcca gaaatgagtg cttatgctgc 4380 atcattttat ttattctgta ttgtcattca tcagtttcta cattttgttt aatctgtata 4440 gtagttcatt agcattacat gttatcatat tctgtatttt gcatagtaat ctcgttatct 4500 ccaacagctc tccccttttt tctttccccc ctgtcagcgt tgtatgctcc ctctgcttta 4560 gtatcttatt gcataatttg tatattctgt atagtaattg tgttggatat ttgtgtttct 4620 tacagctgct ctcccccctt ttwcctgccc tgcagcatca agcacctttg cagctccaat 4680 taatttattc tgcataatgg tttattgtta tatcttgtat tgtatcattt gcataaaaat 4740 tgtttagtat atctgttact tccagccacc ttgcccattt cttttcccat tccagtgtgg 4800 agtgcttccg ctgcaccagt ttatttgttt tccatatagt tattgataaa atgtctacat 4860 tgtgttcatt ctgcatagta gtccatttgt atttcatgta ttatattgtg tatttctgga 4920 taataatcat rttactccca gcagctctcc cctttccact tgccagaatt gagtgcctta 4980 ctgttttcta ccagcycttt actcattgtt tttgactctc atccaacctc cttcctccac 5040 attaacatca ccctaagctc tgactcccgc cggagtcaat aggagttttg gctctcgcaa 5100 gagcctcacc tatcgccttc accttccagt ttcccctgcc agtgtcgagt gcttccgctg 5160 catcagttta tttattcttt atagttattc accagtatcg gcattracat gtttattctg 5220 catagtagtt cattagcatt tcatgttctg tattttgcat agtaaccttg ttacttcaac 5280 agcacccccc tttttctctc cccttaccag cgttgtgggc tcccgctgca ttagtatctt 5340 tcttattgat tattttgcay attatgcata gtaattgtat agcatatttt tgttaattac 5400 agctgttttt cttctgccag cgctaagtgc ttccgctgct tactgcattt tattatgcat 5460 agtatttccc taatgtttca tgtttttatt ttattgttaa tatattaaat ctgtgttaat 5520 accagcttcc ttcccctctc tttctccctg ccagtttcta gcaacatgcc agcattgagt 5580 gcttctgctg catagcagtt gtatattcca tacagtaatt tatatagcat tgtttcctcc 5640 agcttctctc ctttgtttac ccccaacagc tttttatgct tccgctacca aatctcttcc 5700 ctcctcttaa acagccagct ccacaccctt cattattaca tctcctcaca aaccttctct 5760 taagcactga ctcccgtaga gggcccattt aagccccaat ttacacacga gtctccaccc 5820 accctctaag ccactcagac tcccttagac ttcctaactt tatttcttag ccaccaaacc 5880 caccctgccc ttcacacact ccctagaaag cttttccccc actttttatc agttcttcac 5940 atatatatcc agcagccgga tatagcactt tttctttgca ctttggggag ttcctgtgta 6000 aacacgcggc tgctgtcccg agcgcttatc attggctttt t 6041 // ID hAT-N62_DR repbase; DNA; ZEB; 908 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N62_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-908 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1331-1331 (2008). XX DR [1] (Consensus) XX CC TIR is 11-bp long, TSD is 8-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 908 BP; 279 A; 148 C; 146 G; 334 T; 1 other; cagtggcgca aaaagggggt atgcagtata tgcggcgcat aggggcgcca cacatggggg 60 gcgccattgt gccaaaacta tttttgaaat tatccatatt aaaagtttca aataatatat 120 atatatattt ttgttaaaaa tgtttattta gcattttata cgcgtataat attgtatttt 180 attaaattaa aaaatcattc cacccctccc ccccattttt cataccatat acatgttttt 240 atgaatttat tatgtatcat attgattcat caggcctttt ggtaacttct tagctcaaaa 300 ctattttaat tttagctaca aacaaagtta taacatttca attaaataga acatttcaca 360 cagtttgttt ctgttttttt tttttttaca ctaaacattt gacagtcttt tttatttcac 420 ttttagcagt ttcagtcaat tgttcttgca aagacagacc ttacgcttct cacagtatgc 480 atgtgagaag taatcgcaat gttcttttgc cctttaggac tcccatgtct atcattttgt 540 tttgacaaat gtagtacaaa aagtccaaaa gcaaggctag tcaaggtgct ataatcattt 600 agtgagggaa aattaaattg agaagaaatt tggggtggta tgagatgcat ttctttgcga 660 cttaacatgc ttaatgtaat tatacctttt actacttcaa aaaatactat ttcrcagttg 720 tgttcagaac tgcgaaactg tattttttga agcatattga tatctttatg aaaaaatatc 780 tttaatttat tgatggaaat attttccaca ttttatcact tgcctgtttt aaaatatagt 840 tggattgaag gggggcgcgc aaaagttgaa gccgcatacc catcagtaaa tgtgcagttg 900 cgcccctg 908 // ID TE-X-6_DR repbase; DNA; ZEB; 1554 BP. XX AC . XX DT 12-MAR-2009 (Rel. 14.03, Created) DT 12-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE nonautonomous transposable element from zebrafish - a consensus. XX KW Transposable Element; Nonautonomous; TE-X-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1554 RA Bao W. and Jurka J.; RT "transposable elements from zebrafish."; RL Repbase Reports 9(3), 702-702 (2009). XX DR [1] (Consensus) XX CC No obvious TIRs are detected, and TSD is not detected. XX SQ Sequence 1554 BP; 525 A; 290 C; 249 G; 486 T; 4 other; aaatccttta acgcacggca agccgttgat tatctcgctt attccatggt catttgccaa 60 ggtaggcata agtttaaact agttttgctc tttgcagcgc aatatttgtc cgaatttttg 120 tcagttttta catgttagat tttctagtcc ctccactgaa tcagctgtga caggctgaca 180 gacagggaga cagagagaga taatgtgcgt gctgtttata cctgctaata ttaagatgcg 240 atttcatcaa tccgatcaca ggtggacaaa taagacattc ccagtcacac ctggagtgtg 300 tttccagtca cacatgtttc tattgtctac tttctaccac ttctctgatg agggaggatc 360 tgtatgtata ttggctttct cttaacatta cctttaatgc tgcgaaagct gggctaccta 420 tagcagaatt tacatttcat aataaacata caaatcttat taaaagtacc atatcaatta 480 aatacgcttt aggaatatgt tactttatgc attggctaat gaaattaaag ataatttatt 540 tcctatcaac tttaagcctt cacactttag ttttaacatg ctgttttttt aaataaaaat 600 ccctttaaca aatatttttg aacatctaaa gtaaataaat catgttatta tgcctctttc 660 gaatcaaatt attaggctag gcaaattgat ctcatggaat tgatctaaca aaaagactgc 720 gatcgttgct ttaagtagaa taacrttacg attattaaaa cacattaatc tatacaagtc 780 tatacacatg aaataaaaca ttcacaaagc tgtatcttat gaaacaccag ttcaagggca 840 cataagatgc attttaaaga gaaagcagcc aaagcctatt tcagaaatct ttgctgttaa 900 atatgtaggt aaatttagca aaagaaaaaa aacatatcag gacaggtaag agaacattct 960 gctttatttt tcttctttaa aacaaaagca atcagccaaa cacgcaggat cagtgtaata 1020 ttacattatt tacggtcaaa ttaattatga aacttatcty tgcgatctgc caaaaaaaat 1080 acacctgaga acaaataata ggttcaaaag tccaaagact gaccgttaga tacaccaaag 1140 attaattata tatcacrttt taacaacaga gatgcaagag agaggcattc gcggagctgc 1200 tgctgaccgc ctggagctta caamattttt aaagacatgc tatccttata cataaactgc 1260 cgatttgact ttaaaataac tatattctcg cttgaaaaac tcttaaaact tgattttgtg 1320 gcataattta tgagtatttt tctaattttg cggattcctg tccaccatta acagtcctca 1380 tcggtttcct gtgcagcaca tgcgagttgc tatggtaact acaaacgttt gggagaagcg 1440 catttatttc gaactgtgat caaaactgac tggaactacc tgtgcgtttc aacgaaaata 1500 atgcacaccc tccagccaat cagaatcaag aatttcaaca gaccatggaa taag 1554 // ID Gypsy152-I_DR repbase; DNA; ZEB; 4450 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy152-I_DR; KW Gypsy152-LTR_DR; LTR63_DR; LTR_Retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4450 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1565-1565 (2008). XX DR Genome; chr4; Positions 37022708 37018259. XX CC Positions [1892-2203] - Integrase core CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 291..1988 FT /product="Gypsy152-I_DR_2p" FT /translation="MFSLEEFVKSLSLELLDVCRKQDLFLIADHFKLVVNR FT QDRKAEIKQKLTEQLVELNVLITPNIEMDETADHASPGDEVVTSLDMGDQN FT VCSGSTDAEAGVEAAVDTNVGLPPFEPFSPFSIGSRDDARLKVRVARLYYD FT AQEHTRRAEIDLRLQIRRLEIEADTQVRLRQLELDASKKAVEAIQPVPASS FT SSSYVSPVSTSSTSSFDLSKHIALVPQFRESEVDSYFNAFERIATSLHWPP FT EFWSLLLQCKLVGKAMEVYSTFSLEESLKYDSVKSAILRAYELVPEAYRQK FT FRTHRKTSNQTFVEFAREKGTLFDKWCVASDVADFKSLRELVLLEEFKNCL FT PEKVVVYLNEQKVNSLSHAAVLFDEFILTHKNIFVHSRVENAPAVAILSNT FT QFVRSKNNAPKVGDERNCYYCHVKGHVIVDCPRLKRKQQSQAKPVGFVKTF FT HQSVAENSKDKVEETFRPFILKGLISLTGRDEDQKEITMLRDTGSMQSFIV FT ESAIKFSDETSCHSSVVVQGIEMGCVKRPLHNLHLKSDLCSGLVRVAVCSS FT LPTRALISCPSCSNRSCKI" FT CDS 2692..4449 FT /product="Gypsy152-I_DR_1p" FT /translation="MKRTFDKKAVARSFKPGDAVLVFMPIPGSSLSAKFFG FT PYEVKEKLSQTNYVICTPDRRRKTRTCHVNMLKLYHSRAQDDAEKMTDNAV FT ISVVAVTENPVDANEDGLMFPSISQSTVSLSNSEILKDLNSHFKNLTVDQK FT RDLVTLIFEFKCLFNDVPTLTNVIQHDIDVKDARPVRQHAYRVNSVKRSVM FT QQEVDYLLDNGLAKPSSSPWSSPCILVPKSDGSFRFCTDFRKVNALTVPDN FT YPLPRMEDCVDNVGNANFVSKLDMLKGYWQIPLSSRASDIAAFITPDAFAQ FT YTAMAFGLRNAPATFQRLVNTVLAGIPNCSAYLDDVVVYSRDWSEHLNSLR FT DVFQRFASASLTLNLAKCEFGQAKVTYLGKEVGHGQVRPVEAKVEAISEFP FT VPTTRRELRRFLGIAGYYRSFYRNFSSVVSPLTLLLSPSKSFVWDEKCQYA FT FESVKILLSSSPVLAAPDLNRPFTLEVDASNVGAGAVLLQDDEQGIAHPVS FT YFSRKFNKNQLNYSVIEKETLALIFALQHFEVYVGCSVQPLIVFTDHSPLT FT FLSKLYNHNQRLTRWSLFLQSYNLDIRHKKGVENILADA" XX SQ Sequence 4450 BP; 1272 A; 787 C; 994 G; 1397 T; 0 other; tgaaatgggg cctcgttccg gatatttatc tgagacatta aataagatca attaaggtaa 60 gataaaataa atcatctgga ataaattgaa ttcatcaaga aaataatttt aaaaagtaat 120 ttttgaaatt aatatttctt aaatttatca gatggtaatc tttgagaaaa atgaccaaag 180 aaaagctgtg ttgatggcaa cagaaattgc tgtatagtag ttgacattta ggtaagttgc 240 agaagtagca gttggcagca ctttttgagc acgcgacact ttaatttaaa atgttcagtt 300 tagaggaatt tgttaagagt ctttcattgg agctgcttga cgtttgtcgg aaacaagact 360 tgtttttgat tgctgatcat tttaaactgg tagttaacag acaggatcgt aaagctgaaa 420 ttaagcaaaa gttaacagag cagttggttg agcttaatgt tcttatcacg ccaaatattg 480 aaatggatga gacggctgat catgcaagcc ctggtgatga ggtggtgact tctttagata 540 tgggagatca gaacgtgtgt agtggcagta cagatgcaga agcaggggtg gaggccgctg 600 ttgatactaa tgttggtttg ccaccgtttg aaccgttctc cccattctcc ataggttcac 660 gagatgatgc gcgactgaaa gttcgcgtgg cacgtcttta ttatgatgcg caagagcaca 720 cccgtcgggc tgaaattgac ttgcgtttgc agatacgcag actcgaaatc gaggctgata 780 cgcaggtgag actgcgccag ctcgagttgg acgcttcaaa gaaggctgtg gaggctattc 840 agccagttcc agcttcttcc tcttcttcat atgtatctcc ggtgtcgact tcatccactt 900 caagttttga cttgagtaag cacattgcgc ttgttccaca attcagagaa tctgaagttg 960 attcttattt taatgcgttt gaacgcattg cgacttcatt gcactggcca cctgagttct 1020 ggtctttgtt actgcaatgt aaacttgtag gcaaggctat ggaagtttat tctacttttt 1080 cacttgaaga gagtttaaag tatgattcag ttaaatcagc aatacttcgt gcatacgaat 1140 tagttcccga agcttataga cagaagttta ggacgcatag gaaaacttcc aatcaaactt 1200 ttgtggagtt tgctagagaa aagggcacgc tttttgataa atggtgtgta gctagtgatg 1260 tggcagattt taaatcactg cgtgagcttg ttcttttaga ggagttcaag aattgtttgc 1320 ctgagaaagt ggtagtttat cttaacgaac agaaggtaaa ttcattgtca catgccgctg 1380 ttctctttga tgaattcatt ttgactcaca agaacatatt tgttcactca cgtgtggaga 1440 atgcacctgc agttgcaatt ttatctaaca cacagtttgt gcgctcgaaa aataatgcac 1500 caaaagttgg cgatgaacgt aactgttact attgccatgt aaaaggtcat gtaatagttg 1560 actgcccccg acttaaacga aaacaacaat cacaagctaa acctgtaggt tttgtaaaga 1620 cttttcatca gagtgtggct gaaaactcaa aagataaggt ggaggagacg ttccgtcctt 1680 ttattcttaa ggggttaatt tctctgactg gtcgtgatga ggatcaaaag gaaatcacaa 1740 tgcttcgtga tacaggctca atgcaaagtt ttattgttga aagtgcgatc aaattttctg 1800 atgagacgtc atgtcattct agtgtagtag tgcaaggaat tgaaatggga tgtgtaaaga 1860 gacctttgca taatctgcat ttaaaaagtg atctatgttc tggacttgtc cgtgttgcag 1920 tttgttcttc actgccgacc agggcactaa tttcatgtcc aagctgttca aacaggtcat 1980 gcaaaattta ggaatcacac atcgcacctc cagtgcttac catccagaaa gccaaggagc 2040 tctggagcgt tttcaccaaa ctttgaagtc gatgttgagg aaatattgca tgcagactgg 2100 taatgagtgg gatgaaggta ttccaatgct gatgttcgcg attagagaga gtgttcagga 2160 gtcactcggc tttagcccag ctgagttggt ctatggatac caattggata cctcttcttc 2220 atatgtatct ccggtgtcga cttcatccac ttcaagtttt gacgtgagta agcacattgc 2280 gcttgttcca caattcagag aatctgaagt tgattcttat tttaatgcgt ttgaacgcat 2340 tgcgacttca ttgcactggc cacctgagtt ctggtctttg ttactgcaat ataaacttgt 2400 aggcaaggct atggaagttt attctacttt atcacttgaa gagagtttaa agtatgattc 2460 agttaaatca gcaatacttc gtgcatacga attatttccc gaagcttata gacagaagtt 2520 taccagggct atggatacca attgcgcgga cctttgaagg ttctgaaaga tcaaattctg 2580 gaaactgaca attcaaaaac gaatgtttta gagtacgtaa caaagtttcg agcaaggttg 2640 cagaccgctt gttcaattgc tagagattta cttgcccgtg cgcaaaaaga aatgaaacga 2700 acatttgaca aaaaggcagt tgcgcgttct tttaagcctg gtgacgcagt tcttgttttt 2760 atgcctatac ctggttcttc actgtcagct aaatttttcg gaccgtacga agtgaaagaa 2820 aaattgagcc agactaatta tgtgatctgt acacccgacc gaagacgaaa aacccgcact 2880 tgtcatgtga atatgcttaa actttatcat tcgagggcac aggatgatgc tgagaaaatg 2940 acggataatg ctgttatttc tgttgtggct gtgacagaaa atcctgttga tgctaatgaa 3000 gatgggctta tgtttccaag tatttctcag tctaccgtga gtttgtccaa ttctgagatt 3060 ttaaaagatc ttaattcaca ttttaaaaac ttgactgttg atcagaaacg cgatcttgtg 3120 acattaattt ttgaatttaa gtgtctgttt aatgatgtgc caacactcac aaatgtcatt 3180 caacatgaca ttgatgtcaa agatgctcgt ccagtccgac agcatgctta cagagtaaat 3240 tctgtaaaac gttcagttat gcaacaagag gttgactatt tgctggataa tggtcttgct 3300 aagcctagct ctagtccatg gagttctcca tgtattttag tgccaaaatc tgatggttct 3360 tttagattct gcacagattt tagaaaagtg aatgcactaa cagttcctga caactatcca 3420 cttcctagaa tggaggattg tgttgacaat gtaggaaatg ccaactttgt gagtaaatta 3480 gacatgctga aaggttattg gcaaatacca ctttcgtcta gagcatctga tattgctgct 3540 tttatcactc cagatgcgtt tgcacaatat acggccatgg cgtttggact ccgaaacgca 3600 cctgctacct ttcagaggtt agtaaacact gttttagctg gtattcctaa ttgtagcgct 3660 tatttggatg atgttgttgt ttattccaga gattggagtg aacatttgaa ttcattacgt 3720 gatgtatttc aacgttttgc aagtgcttca ttgaccctta acttagcaaa atgtgaattt 3780 ggtcaggcga aggtgactta tttgggaaaa gaagtgggac atggtcaagt tcgacctgtg 3840 gaagcgaagg ttgaggctat atctgagttc ccagttccta cgacgcggcg tgaactccga 3900 agatttctcg gaatcgctgg ttattatcgc agcttctata gaaatttctc gtctgtggtt 3960 agtccattaa ctttactctt aagcccttct aaatcgtttg tgtgggatga aaaatgccaa 4020 tatgcatttg agagtgtgaa aatactgttg agtagttcac cagttcttgc agcccctgat 4080 ttaaatcgtc cattcacact tgaggttgat gcaagtaatg ttggcgctgg tgctgtgtta 4140 ctgcaagatg atgagcaggg tatagctcat ccggtgagtt atttttcgag gaagtttaac 4200 aagaatcagt taaactattc tgtgattgaa aaggagacac ttgctttaat ttttgcattg 4260 caacactttg aagtatatgt tggatgtagt gtacagcctt tgatagtttt tacggatcat 4320 agtccattga catttctttc aaagttgtat aatcacaatc agagattgac acggtggtca 4380 ctcttcttgc aaagttacaa cctggatatt aggcacaaga agggagttga gaatattcta 4440 gcggatgcac 4450 // ID ERV1-N2-I_DR repbase; DNA; ZEB; 2731 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, internal DE portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N2-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2731 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1598-1598 (2008). XX DR [1] (Consensus) XX CC The TSD flanking this LTR retrotransposon is 4-bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 241..1155 FT /product="ERV1-N2-I_DR_1p" FT /translation="MEGFISKESDVFFALWTKKVEPTLCKLNKEGLELTDS FT VSTLFAWWDIIGKNGKKSKRGDIIEALNYMCYSKLRNVMSEIPSIIKARCE FT EFDDVQKSEAEKSEAKILDLKLQLDNLLAEQLALAEKCKSQKDTIADLKAM FT LKTQKFVSPKMLNSDGIKKHVHFSDEVTCNESEMICSEAVAKKSLPVSVIK FT VMPHHSAAKTHGVEALESLKNPQTSDDHSNQPSGKQNRLCMTCKKFGHREE FT QCWTSGQGRPPPYFLRKRKSLKSKNQWSSRNKNSSANNTLSELTALLIQGL FT QLLTSFAXGLAA*" XX SQ Sequence 2731 BP; 868 A; 488 C; 486 G; 880 T; 9 other; gatggtgccg aaacccggga tcctttgcag tgagtttttc tgaaatttac tttagccggt 60 cttagctgga ttttaacagc tgacctggtg aatggattaa aattggcctt ctaaccacaa 120 gagtgggtaa gtctactttt attgtttctc catttgctta aattgcatga aacagaattt 180 acaagactgt aatttgtttc aatagttcaa tttggtaaat ttcagattac ctcctcaagg 240 atggaagggt tcatttctaa agaaagtgat gtattctttg ctttatggac caaaaaagtt 300 gaacctacat tgtgcaaact taacaaagag ggacttgaat taacagactc agtctctaca 360 ttatttgcat ggtgggatat aattggcaaa aatggaaaga aatcaaaacg tggtgatata 420 attgaagcac taaattacat gtgttactca aaactgcgya aygtaatgtc agaaatacct 480 tcaattataa aggctagatg tgaagaattt gatgatgttc aaaaatcaga ggcagaaaag 540 tcagaggcaa aaattctaga tctcaagtta cagcttgata atttactagc tgaacagtta 600 gccctggctg aaaaatgcaa aagtcaaaaa gacaccattg cagatttaaa agctatgcta 660 aaaacacaga agtttgtgtc tcccaaaatg cttaacagcg atgggattaa aaagcatgtt 720 cacttctcag atgaagttac ttgtaatgaa tctgaaatga tttgttcaga ggccgtagct 780 aaaaagagtt tgcccgtttc agtaattaag gtgatgccac accatagcgc agcaaagacg 840 catggtgtag aagcattaga gagtcttaaa aaccctcaaa cctcagatga tcacagcaat 900 caaccctcag gcaaacaaaa caggctttgt atgacctgta aaaagtttgg tcacagagag 960 gaacaatgtt ggacatcagg acagggaaga cctccacctt attttctgag aaaacggaaa 1020 tcactaaaga gcaaaaatca gtggtcttca cgtaacaaaa actcatctgc taataacaca 1080 cttagtgaac taactgcatt gttaatacaa ggcctgcaac tgttaacttc ctttgctrgc 1140 ggcttagcag cctaatagca ataaaagtta ataattgctg atcctgtttg catttccaaa 1200 tttgttatat ttgtgttatt cacattttca tttccataga acacaatggt attgtgtgat 1260 actacaygta cacatagact gagtaattcc ttaaaacatg attttcttac tgtaggtgta 1320 aaaacactca ggtctgaggt tttagaagcc acagatgcat tctagaccgc atgtcacgta 1380 taaagaagga attaactgta tgctaaaatt cattaatttt aaggtacttg tgtttatttg 1440 ttggatgagt ttcatctcct aaaagtggta aactgtgcta tcctcaatat agctgcagtt 1500 taaactacaa acaggagtat agarttgcct tgtggctcca gtgtaataac tgcagtaaca 1560 atgaaaatgc tttcttacag gattcagcct tgaacattgc actttatcat gctyctttaa 1620 aataaaacat tgatttggaa gttttttgag ccagatcaca ttcacatctc atgctcaaat 1680 tccatgctgg ctccctgtta caacaggact aaattttaca cactaccgta aaacttccac 1740 cttttgttag gagcatgttt acacaaacac atagcctcac agctggaaca ctgccacaca 1800 gagaacaatg tttcaaatct tcacacacaa tagtcaaatt cactattttg tctaacaagc 1860 ttaatataga ttttacatgt tatttaagtc ttttcagtca gactttattt taactataga 1920 tttttcatct tgtgttgtta ttttggtttc ttttgatttt taaccagtta cttatttttt 1980 gtttaaacaa agttatatca aaatgtaaat tgtgattcta aatccaaaat gctgaattgt 2040 tgttgctttt gtaacacttt tgacatctct acaacaaatg ttatttgtgt ttacttgtat 2100 ttaatccatt taaatgcagt tgtgattgtc tttgaatatt gattcgttta attcactgta 2160 tttggttaca ctgaatattt tgctgaaact cttaaccagc tattgttgat tttgttgtaa 2220 agcaattatt ttgatgtttc cagatattgt gtaaagagct tgtttgcttt tattttgtct 2280 taatcttgac catattttaa ttacattaat ttaaggrcat tttgcttgag gagaactcaa 2340 actgaccaca tcaatccttt gatcaatgac aaatgccatg tgtttgctty tgctgtttgt 2400 tgtgccattt gctgttttcc aatacaggat ccatggagaa aaggttcacg tcttgtgatc 2460 aacttcaaaa gaagagcagg ggttccctgc attcctcagc acctattctt gtgggggtat 2520 gaaggacaat ttgcatataa gcagtgcaac accattcgat taaagcctct cacacagggc 2580 tgraggacaa caaagactga ccaatctaat tcctacaacc agcctggaca ggaagtccga 2640 agatgacagc agcacacatc acaatgcata ttttcatcat taatgtcacc tttttgcctc 2700 tttatgtgtc acaacggtgc caggggagga t 2731 // ID hAT-N79_DR repbase; DNA; ZEB; 2089 BP. XX AC . XX DT 13-FEB-2009 (Rel. 14.02, Created) DT 13-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N79_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2089 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(2), 545-545 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2089 BP; 626 A; 373 C; 392 G; 698 T; 0 other; tagagctggg cgatatggca aaaaaaatta tcacgataat ttttttcata tcagtcgata 60 tcgatattta tcacgataaa tgtaaaatct ttatatctat caattatata agattttcag 120 catgccaatg gctttgtgca gctgatttaa cacatttagt gaaatgtttt agctgtctga 180 caatataaat aataaaataa taaacaaaag aacaatctta attcagcatt tacttgttca 240 agttttcaac aaccaaagac attacctaca agttattgga tattattgtt caacatcaga 300 aaaaaaaata aataaataaa taaataaata aaacaaaccc ggtttggaac aacgtgagtt 360 aatagtcaca gaattttcag ttttggatga actctccttt taatggtgag tatggtgaat 420 ttagggtgaa atatacttta atgtatttta tttagggatg ctaatgatta cttgatctat 480 tgattgtcga taacgcttta attgatagac cttatcgatg actgattaag catggacatg 540 taaaggttta gttatgcttt gcgctgtgag acgcatccac gcattacatc atcaagtgtg 600 tgcagtttac cttacttatg ctgtcaccct cttgacatcc catattgcgg gacgcataaa 660 accttgttat gcgcatatct ccgatatatc cgcgcgtgtt ttaaccaaat ttgtctgaca 720 gactagcacg aataggtcta tgcagactct cattttacaa atcgcgcatc acaacgaaaa 780 catgcgggga gccagccggg gagcctctcc atttacacag cgagcttctg agctccatcc 840 aagtttattc ttccgaggta acccaagaaa tctatcagtg ctgcgaaaag actgtccagt 900 tcatgctgct cagaccgcgc ttagttaatg aattaaaatg aaaccgtgaa aatagtttga 960 tggattgttt tggcgctaaa gcatatagcg ggcttgtgtt aaagcgcttc acctcagatg 1020 ttattcgttc ctcttgttta tgctgtagat aactgaccaa caaaatatac atcaaaatta 1080 agttacacat tatacaaaac gaaattcgtt acaaaaagat acattttcaa ggaaatatgt 1140 ttatatttat ttagttcgtg tccgccttaa acaactgcgg gtggatgatg cttcgctccg 1200 tggaaaaaaa ggatttaatt aatgatccac tgctgttgta aacgtctgtc atatatttgc 1260 tgcattttta atatgtcatt attttaaaga ttatgtcttg agcatctgat ttctcctctg 1320 tgattatgtc ctttaaatat ataaaggctg gttcttttac cgtcgatgag agacaatagt 1380 ggtattacag atttcaaaac aaatatttta atattaaaaa ggaaacataa gcttgacatc 1440 aaaagatcct attattgtct taacaaatac ccttaaatga tgcagcctgt agtttacagc 1500 aagcatcaag tgttcattga gatgtgtttg agattgtgtc ctgtctgtca tatctttcgt 1560 ctgtcttttc ggggtttttt tctcgtaagg cacggctttc gaaaatgatg acatgagact 1620 ctgctgggta gctgcgctgg tttaccaaca cttgtttata ggtctgtaat ttcttagcat 1680 cttcataagg gtggtcttgt tttacatggt gaaacaggtt tgtggtgttg cctgcttttg 1740 atgacacgag tcgtctgcgc agtttgcggt gcacgttgct ctgttttgtg tcggatagca 1800 aaaaactaag tagctcaaga ctgaagtcta gtgacgtttc ttgtcaacaa tgtctgtgac 1860 tctgaccttt tttaccttat ttctttctta ctcctttaag tgcaactaac actgtatggc 1920 tgtaaacttg tacggcgtgc ggcatccgga agggcaggct gcaaaatgct tgcgcaacgt 1980 tacgcatagt gcgtaaatat tatcgagcaa ttatcgaact gtcattatcg ttttatcgaa 2040 aaattttata tcgtgataat tatcgatatc gaattatcgc ccagcccta 2089 // ID Gypsy55-I_DR repbase; DNA; ZEB; 4474 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy55-I_DR; KW Gypsy55-LTR_DR; Gypsy55_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4474 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1425-1425 (2008). XX DR Genome; chr16; Positions 28711419 28706946. XX CC Positions [3210-3725] - Integrase core CC 'ATGGG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 114..4169 FT /product="Gypsy55-I_DR_1p" FT /translation="MADCFRRPDPLSFEGNVADNWRIFEQEYDIFIAAAHH FT DKPAKTRAYILLNLAGPEAIERERSFVYAAEVRAPGENGAVLTPAESKEDP FT ECLKRKFREICNPQQNKTLERHKFHSRNQKQGETIESFISDLRIKAKACQF FT GDLTDELICDRIVCGIKSEGLRKALLRDSDLTLIKAISICRIYEMTEESNK FT TLAMPQTATNVDAVRPVSTRRRQYTGRKPQTEQNDSLIINNCKNCGNSHLA FT KKEKCPAFGQLCHNCKKMNHFKKCCRQRCQYVQKKANYKKAVQELEIEKPP FT AFDDTFYVDGVEFDKCVDTVNSFMAEQEEGFVTLHINKTPIEVKVDTGAKC FT NVMSQTTFKRIATNIQPVEQENTPNLVAYGGSKIETKGLVTLQCNLKGQRH FT SLPFFLVDQDVQPLLGFRACLDLGIVTMSPHVHLVSMEKSTEQVLEEYKDL FT FTDELGELPITYSMTIDPSVQPVVRPAHRIPLAMQDRVKAELDRMETLGVI FT TPVSEPTDWVSSMVVTHKKDKQEIRLCINPKDLNTALKRPHHPMRSVEEVA FT SQMSGATVFSVLDAKNSFWQIRLDRKSSMLTTFSTPFGRYRFLRMPFGINS FT ASEVFQRSMEQLFSGYPCSIIVDDIIIGGRDAAEHDANLKRVLNRARNVKL FT KLNPAKCKFRLDRVSYVGHVFTSEGLKADPSKTKAIFEMPAPKNVPALQRF FT LGMANYLGKFIPNLSDVAAPLRKLTHKDTAWCWYHQHQEAFDTLKSCLTTP FT PVLSYYDVKKPVTLTCDASCFGLGAACMQDGRPVAYASRTLTDTETRYAQI FT EKELLAVVFACTKFRDYVYGKSVVVETDHQPLVTILKKPIHTAPARLQRMM FT LRLQCFDITLVYKKGKHMYLADTLSRAPNTKVPPDAESDTFEVMSVSYIST FT ARLEELQKETAEDKVLQTLSTVIQQGWQNKKHTLSPDLHPYFPYRDELVVE FT NGIVMKGHKTVIPRSLQKEYIKIVHRGHPGLDATKRRARSIIFWPTMTADI FT TAELITCSVCNSTKPHQQKEPLKPYPVPDLPWSTVATDMFEWRGQNYMVLV FT DSYSGWYEIDLLRDTTSSAVIRKLKRHFSVHGTPHTLISDNARQYTSQQFR FT DFAKQWDFVHVTSSPEYPQSNGLAERAVRSAKQLMEKSHRDGTDVFLNLLN FT LRNTPRDETLGSSAQRLLSRQTRSAIPVNSKLLEPAPKQAEQVAAQLFKKR FT MAQKRYYDISSSPLQPLAEGQVVRMQTPKGYDRLGTVKEVNKEPRSYTIQY FT NGKTYRRNRRHILPVAEPAPSRFDPDDSDIHKTITHTQDSVPPEPQTHFPQ FT TKSTGTDLPKHTVLSPTKVGSTPYRTRSGRISKPNPKYNQ" XX SQ Sequence 4474 BP; 1417 A; 1011 C; 951 G; 1095 T; 0 other; tggtgtcaga aaacggcacg gatttctcgc ctgtgtttaa agccattttg atggaaagca 60 aagactgctc ataaagtttt tttttctctt tatcaaagcc gcatagtgtg actatggcgg 120 attgctttcg gagacccgac ccgctctctt tcgaaggaaa tgttgctgac aactggcgaa 180 tctttgagca ggaatacgac attttcatcg cggcagcaca ccacgataag cctgccaaga 240 caagggccta tatcctcctc aatctcgcag gaccggaggc catagaacga gagcgttcat 300 tcgtctatgc agcagaggtg cgtgctccgg gtgagaatgg cgctgtttta actcccgctg 360 aatcaaagga ggatcctgaa tgcctgaaaa ggaagtttcg agaaatttgc aacccacagc 420 aaaacaaaac tttggagaga cacaagtttc actctagaaa ccagaaacaa ggtgagacta 480 ttgagtcatt tattagcgat ctgaggatca aagcaaaagc atgccaattt ggtgatttga 540 cagacgagct gatctgtgac aggatagttt gtggcatcaa aagtgaaggc ttaaggaaag 600 cactgttaag agacagtgat ctaacactga tcaaagccat ctccatatgt cgcatatatg 660 aaatgactga agaaagtaat aaaacattgg cgatgccaca gacggccact aacgtggatg 720 cagtacgtcc agtttccacc aggaggcgcc aatacacagg acgaaagcct caaacagaac 780 agaatgattc actgattata aataactgta aaaactgtgg aaacagccat ttggcaaaga 840 aagaaaaatg tccagctttt ggacagctgt gccacaactg caagaaaatg aatcatttca 900 aaaaatgttg cagacagcgt tgccaatatg tacaaaagaa agcaaactac aaaaaggcag 960 ttcaggagct tgaaatagag aagccccctg catttgacga cactttttat gttgatggcg 1020 ttgaatttga taagtgtgtc gacactgtta actctttcat ggctgagcag gaagaaggat 1080 ttgtaacctt acacattaac aaaactccaa tagaggtgaa agtcgacact ggggcaaaat 1140 gtaacgtcat gtcacaaaca acattcaagc gaattgctac taatatacaa ccagtggagc 1200 aagaaaacac acctaacctt gtggcttatg gtgggagtaa aattgagaca aaaggcttag 1260 ttacactgca gtgtaattta aaaggacaaa gacactcact cccgtttttc ttggtggatc 1320 aagatgtaca accactttta ggatttcgtg catgcctgga tttagggatt gtcacgatga 1380 gcccacatgt tcatctggtc agtatggaaa aatccacgga acaagtgctt gaagaataca 1440 aagacttgtt tactgacgaa cttggagaac ttccaatcac atactccatg acaattgacc 1500 caagcgtgca gccggtagtt cgccctgcac atcgaattcc actggccatg caggatcgtg 1560 ttaaagcaga gctagaccgc atggaaactt tgggtgtcat aacaccagtc tctgagccaa 1620 cagactgggt ttcatccatg gtcgtgacgc ataaaaaaga caaacaagaa atcagactgt 1680 gtataaatcc taaagatctc aatactgccc taaaaagacc tcatcacccg atgcgcagtg 1740 tagaggaagt agcatcccag atgtcgggag caacagtgtt ttctgtgctg gacgcaaaga 1800 actctttctg gcagatacgt ctagaccgca agtcctctat gttgactaca ttcagcacac 1860 cgtttgggcg ctataggttt cttcgtatgc cctttggcat aaactccgca agcgaagtgt 1920 tccagcgctc aatggagcag cttttctctg gctacccctg ttctatcatc gttgatgaca 1980 tcatcatcgg aggccgtgat gctgctgaac acgatgctaa tctgaaaaga gtgttgaatc 2040 gagcacgcaa tgtcaaactc aaactaaacc ctgcaaaatg taagttccgt ctggaccgag 2100 tgagttatgt aggccatgtc tttactagtg aaggcttaaa ggctgatccc tcaaaaacaa 2160 aagcaatttt tgagatgcca gctccaaaaa atgtcccagc attgcagcgc ttccttggta 2220 tggcaaatta cctagggaag tttattccaa acttaagtga tgttgctgca cctctaagaa 2280 agttaacgca caaagacact gcatggtgct ggtaccacca acaccaagaa gcatttgata 2340 ctctcaaatc atgtttgaca acgccacctg tcttgtcata ctatgatgta aagaaacccg 2400 tcacattaac ctgtgatgcc tcatgttttg gactcggtgc tgcatgtatg caggatggca 2460 gaccagtcgc gtatgcatct cgcacactca ctgacacaga aactcgatac gcacagatcg 2520 aaaaggagct gctagctgtt gtattcgcct gcaccaaatt cagggattat gtgtatggga 2580 aatctgtggt tgttgaaaca gaccaccagc ccctggtgac gattttaaag aaacccattc 2640 acactgcccc tgcaagatta cagagaatga tgctcaggtt acagtgcttt gacatcactc 2700 tcgtgtacaa gaaaggcaaa cacatgtatc ttgcagacac cctttcccgg gctccaaaca 2760 caaaagtccc tccagatgct gaaagtgaca cgttcgaggt catgtctgtc agttacatct 2820 ccactgcacg actggaggag ttgcaaaagg aaacagcaga ggataaggtg ttacagactc 2880 tcagtacggt gatccaacaa ggttggcaaa ataaaaaaca taccttgtct ccagatttgc 2940 atccatattt tccctacagg gatgaactag tcgtggaaaa tggaatagtg atgaaaggcc 3000 acaaaacagt cattcctcgc tcattacaga aggaatacat caaaatagtc cacagaggac 3060 atcctggatt ggatgccacc aagcgcagag caagaagtat cattttctgg ccaacaatga 3120 cagctgatat cacagcagaa ctcatcacat gctcagtctg taatagcaca aagccacatc 3180 aacagaaaga accacttaaa ccatacccag ttccagactt gccctggtct acagtggcaa 3240 cagacatgtt cgagtggcgt ggtcagaact acatggtgct tgtggactca tactcaggtt 3300 ggtatgagat tgatctgctc cgtgatacga catcctcagc tgtaatcaga aaactgaaaa 3360 gacatttctc agtgcatggc acaccgcaca cactaatttc tgacaatgcc agacagtaca 3420 caagtcaaca atttagagac tttgctaaac agtgggactt tgtacacgtc actagcagtc 3480 cagaataccc gcagtcaaat ggtttggcag aaagagcagt acgcagtgct aaacagttga 3540 tggagaaatc ccacagggat gggactgatg tattcctcaa cctcctaaat ctgagaaaca 3600 ctcctcgtga cgagacgtta ggctcatctg cgcaacggct tctgtcacga cagacacggt 3660 cagccattcc tgtcaactct aagctcttag agccagctcc caaacaagcc gagcaagtcg 3720 cagcacaact cttcaaaaag aggatggctc aaaagcgata ctacgacatc tcaagttcac 3780 cactgcaacc acttgcagag ggacaagttg tcagaatgca aaccccaaag ggttatgacc 3840 gcctgggcac agtaaaggaa gtaaacaagg agcctcgttc ctacaccatt caatataatg 3900 gaaaaacata cagaaggaac agacgccaca ttcttccagt cgctgagcct gctccttcga 3960 ggtttgaccc tgatgacagt gacattcaca aaacaattac acacacacag gacagtgttc 4020 cccctgaacc acaaacacac tttccacaaa caaaatccac aggtactgac ttgccaaagc 4080 acacagtact atcccctacc aaggttggta gtacacctta tagaacacgt tcaggtcgta 4140 tttctaagcc aaaccctaaa tataaccagt gagttttact aatagtagac tctgtggcac 4200 tctgattgtc gttctgaata tgtcccaaat gttaactata gttattgtga tgctgacaat 4260 attacatttt gttgatttgt ttttttacac ttaaagaaaa aaaaaaagga aaaaaaggcc 4320 tgatattata tatttttttc tgcattagtc acatggttgt tattatagtg atccttattt 4380 tcattaatga ttgagttaaa gtcatgcaga aaatttgttt attgttatat aagttttctt 4440 tgtttcgtaa gatgttacat ctaaagaaag ggga 4474 // ID Gypsy99-I_DR repbase; DNA; ZEB; 4190 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy99-I_DR; KW Gypsy99-LTR_DR; Gypsy99_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1507-1507 (2008). XX DR Genome; chr19; Positions 14219394 14223583. XX CC Positions [3087-3419] - Integrase core CC 'ACACC' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 26..1588 FT /product="Gypsy99-I_DR_2p" FT /translation="MAEGFRRPDPLIFDGNIAENWRKFELEFDIFIAAAHS FT DKPEKTKAYILLNLAGPEAIERERSFTYAPALLAADGQDVITPAESREDPE FT CLKRKFREVCNPETNVTMERHSFNTRNQKPGETIEAYVSTLRNKAKTCNFG FT ALTDELIRDRLVCGIDNDSVRRVLLRESDLTLTKAIKICQISELTEQHTKV FT LAAPRSGLASVDAIHFNAKGRKKAVEHQQRKHMSGTKNIVNCKNCGGNHPA FT KRDQCKAFGQQCHLCKKFNHFMSQCKSASYDNFKSIHQITDANTDSSEESF FT HIDGLCLGASEIDAVSHNRINKQEIHCSLTVNGKPLELKIDTGAKCNVIST FT ETYQTLRHDEKIQEKSKTTLVAYGGTVIQTQGTVMMQCWTSKQKSHMLEFH FT VVDKNVKSLLGLPDCLKMDLISLKEEVHGIDLNKNPNLPQEIFTEYADLFD FT DKLGTLPVTYSMKINPDITPVVRPPRRIPVAMRDKVEAELKNMTKLGVITP FT ILEPTQWCPQWLPHIRKTLTRFVYA" FT CDS 1543..2643 FT /product="Gypsy99-I_DR_1p" FT /translation="MVATHKKDTDKIRLCIDPRDLNEALMRPHHPMRTVEE FT VASQMSNANVFSVLDAKSSFWQIKLDHESSLCTAFATPFGRYINSASEVFQ FT RAMEQIFAGFPCAIIVDDIIVGGKGEKEHDENLKKVLNRARQVNLKLNPQK FT CKFKLREVSYVGHLFTEQGLKPDPAKIQAITEMQPPSDKAALQRFLGMANY FT LCKFIPNFSELATPLRQLLHRDVAWCWSHQQQEAFNMLKKCITFPPVLRYF FT DVSKPVTITCDASQYGLGAACLQDDQPISYASRTLTETEMRYAQIEKELLA FT VVFACQKFYDYIYGKPVLVETDHQPLVTILNKPLQSAPARLQRMILKLHKF FT DLTLTYKKGKQLTWLTLCHVLPER" FT CDS 2691..4016 FT /product="Gypsy99-I_DR_3p" FT /translation="MISVNRLQELREHTEKDSSLQILCNTIKHGWPRKIQS FT VPVQLKPFFPFRDELSVGDGIITKGTRVLVPHSLQSEYLQILHKGHAGAEA FT TKRRARDAVFWLTITQDIDNFVQSCSICNALKPHQQKEPLHLHEIPELPWS FT VVATDIFDWNSQQYLVLVDSYSGWFEINSLTNISSQNIISKLKRHFSVHGI FT PQKLITDNGTQFTSQIFRDFASSWDFCHITSSPEYPQANGLAERAVRSAKR FT LLETTKRDGTDLYLNLLNIRNIPRDKILGSPAQRLMSRVTRTNLPISKQML FT KPRTKDTVKVKTQLSKKRQMQKMSYDKKSKPLSPLHESQVVRMQTTKGHDK FT IGVVRTPLAKPRSYIVESDGKEYRRNRRHVLPVKEPLPHCRQADLEDLLSQ FT LHSMPDSNSPPKDAVRTEENIPPKPTDQKLPYRTRYGRVSKPNPKYFD" XX SQ Sequence 4190 BP; 1408 A; 910 C; 907 G; 965 T; 0 other; tggtgtcaga agtaaagaag aaataatggc cgagggattt cgccgaccag acccgctgat 60 attcgacggc aatatagctg aaaactggcg aaaattcgaa ttggagttcg acatattcat 120 cgccgcagct cacagcgata agccagaaaa aaccaaagcc tacatactcc tcaatcttgc 180 gggaccagag gcgatcgaga gagagcgttc gttcacatac gctccagcgt tactggcagc 240 cgatggtcaa gatgtaatca cgccagcgga gtcaagagag gatccagaat gtttaaaacg 300 gaaattcaga gaagtgtgta atccggagac aaatgtgacg atggagagac attcatttaa 360 taccaggaat caaaaaccag gagaaactat cgaagcttat gtgagtacac ttagaaataa 420 agcaaaaaca tgtaactttg gtgccctcac agacgaatta atcagagaca gactagtgtg 480 tggcatagac aatgatagcg tgagaagagt gctgctcaga gaaagtgatc tgacgcttac 540 aaaagcgatt aagatatgcc aaatcagcga attgaccgaa cagcacacga aagtgctggc 600 cgcaccgcgt agcgggcttg cgagcgtcga cgcaatacac ttcaatgcaa aggggagaaa 660 aaaggcagtt gagcaccagc agcgaaaaca catgagtggg acaaagaaca ttgtcaactg 720 taaaaattgt ggtggaaatc atccagcaaa acgtgatcaa tgcaaagctt ttggacaaca 780 atgtcacctt tgtaaaaagt ttaatcattt catgtcccaa tgcaaatctg cgtcatacga 840 caatttcaaa tccattcatc aaatcactga cgcaaacaca gacagtagcg aagagtcatt 900 tcatatcgat ggactgtgtc tgggtgccag tgaaatagat gcagttagtc ataacaggat 960 aaacaagcaa gaaatacatt gcagcctgac agtaaatggc aagccacttg agcttaaaat 1020 tgacaccgga gcaaagtgta atgtcatctc aactgagaca taccaaacac tgagacatga 1080 tgagaaaatc caggaaaagt caaaaacaac attagtggct tatggaggca ctgtgattca 1140 aacacaagga acggtgatga tgcaatgctg gactagcaaa caaaaatcac atatgttgga 1200 gtttcacgta gttgataaaa atgtcaaatc attgctgggc ctgccagact gtttaaaaat 1260 ggacttgata tcccttaagg aagaagtgca tggaattgat ctcaacaaaa atccaaattt 1320 gccacaagaa atatttacag aatatgctga cctctttgat gataagctag gcaccttacc 1380 agtgacatat tcgatgaaga ttaacccgga cattactcct gtggtccgcc cccctcgccg 1440 aatacctgtg gcgatgcggg ataaagtgga agctgaatta aaaaacatga caaagctggg 1500 ggtaataaca cccattttag agcccacaca gtggtgtcct caatggttgc cacacataag 1560 aaagacactg acaagattcg tttatgcata gatccacggg atctaaacga agcgctcatg 1620 cgccctcacc atcccatgcg cacagtagaa gaggtagctt cacaaatgtc caatgccaat 1680 gtcttttcag tgcttgatgc taaaagctca ttctggcaaa ttaagttaga tcacgagtcc 1740 tccctttgca ctgcttttgc aactccattt ggcagataca ttaattcggc atctgaggtg 1800 ttccagaggg caatggaaca gatctttgca ggatttccat gtgccataat tgtagatgac 1860 atcatcgtcg gaggaaaagg ggaaaaagag catgatgaaa acttaaagaa ggttctaaac 1920 agggcacggc aggtgaatct aaaactcaac ccacagaagt gtaagttcaa gctcagagaa 1980 gtcagttatg tgggccatct gttcacagaa caaggactaa aaccagaccc agccaaaata 2040 caggccataa cagaaatgca accaccaagt gacaaagctg ctttgcaacg ttttttgggt 2100 atggctaact acctctgcaa attcattcca aattttagtg aactggcaac accactacga 2160 cagctgctcc acagggacgt agcttggtgc tggtcgcatc aacagcaaga ggcatttaac 2220 atgctaaaaa agtgcatcac attcccacca gtgctgagat attttgatgt aagcaaacct 2280 gtaactatca catgtgatgc ttctcaatat ggattgggcg cagcatgtct acaggatgat 2340 cagcctatta gctatgcctc acgcactctc actgagactg aaatgagata tgcacaaatc 2400 gagaaagagc tattggctgt ggtctttgcc tgtcaaaagt tctacgatta tatctacgga 2460 aaaccggttt tggtggaaac agaccatcag ccccttgtta caattctgaa caaacctcta 2520 caatcagctc cagcacgact acagcgaatg atactgaagc tgcacaagtt cgatttgacc 2580 cttacctaca aaaaaggaaa acagcttact tggctgacac tctgtcacgt gctcccagaa 2640 aggtgacaac aaaaaatgat gaggaggagt ttgatgtctt ggctgttcag atgatttcag 2700 taaaccggct acaagagctc agagaacaca ctgaaaaaga cagctctctg caaattctat 2760 gtaacacgat taagcatggt tggccaagaa aaattcaaag tgttccagtg caattgaaac 2820 ccttctttcc tttcagggat gagctcagtg ttggggatgg aatcatcaca aagggaacaa 2880 gagttctagt tcctcacagc ctgcagagtg aatatttgca gattttgcac aaaggccatg 2940 caggagcaga agcgaccaaa cgcagagcac gcgatgctgt tttctggttg actataacac 3000 aggacataga caattttgta cagtcctgca gcatatgcaa tgctctgaag ccccatcaac 3060 aaaaagagcc attgcaccta cacgaaatac cagaactgcc ttggtctgtt gttgccactg 3120 acatatttga ctggaacagc caacagtacc tagtgctagt tgattcatac tctggctggt 3180 ttgagataaa ttcacttaca aacatatcct cacaaaacat catcagcaaa ttaaaaaggc 3240 acttctctgt gcatggcatt ccccaaaagc ttatcactga taatggcacg cagttcacaa 3300 gccagatttt cagagacttt gcaagttcat gggacttttg tcacatcact agtagtcctg 3360 aataccctca ggcgaatggc ctcgcagaga gagctgttcg tagtgcgaag cgcctgcttg 3420 aaactaccaa gagggatggg acggacttgt atttgaattt gctaaacatc cgcaacattc 3480 ctcgggataa gatcctgggt tcaccagcac agagactgat gtccagggta acacgtacca 3540 atttgccaat cagcaaacag atgctgaaac caaggacgaa ggacacagtg aaggtaaaaa 3600 cacaactttc aaaaaagaga caaatgcaaa aaatgtccta cgacaaaaag agcaagccgc 3660 tctccccact acatgaatcc caagtagtaa ggatgcaaac aacaaaagga cacgacaaaa 3720 ttggagttgt cagaacccct cttgccaagc ccagatccta tattgtggag tccgatggca 3780 aagagtacag gagaaatcgt cgacatgtgc ttcctgtaaa agaaccacta ccacactgca 3840 gacaggccga tctggaggac ttgttgtcac aacttcattc tatgccagac agcaacagcc 3900 cacctaaaga tgctgtgcgt acggaagaga acattccacc taagccaact gatcaaaagt 3960 tgccttatcg tactagatat ggtcgggttt caaagcctaa tccaaagtac tttgactaaa 4020 tgttgttaaa aatcaacata aagaacatat tattgcagag agtaattctg atagtttatt 4080 gttattttca gttgcaatcc aagttttgct atttctataa gttaccaaga catttgtaaa 4140 gttgtaaaag tattctctta gagggagaac aaaaaactaa agaaagggga 4190 // ID Gypsy71-I_DR repbase; DNA; ZEB; 6672 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy71-I_DR; KW Gypsy71-LTR_DR; Gypsy71_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6672 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1457-1457 (2008). XX DR Genome; chr1; Positions 50992090 50985419. XX CC Positions [2928-3467] - Reverse transcriptase CC Positions [4938-5414] - Integrase core CC 'TGTG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 343..1689 FT /product="Gypsy71-I_DR_1p" FT /translation="MSYTYAADDVEDDVYIGGGRGRFFRDIVHAPGAMGFV FT DSPVVSSTRIKTDCQVPRSDTPSTDLSHLITQLAHEIGQSISAQLCKGNEK FT GDKVSNNSNVGLETPSAESHSLNMTGVKLVMHSDVKEPPSFRGDGTDKLSI FT HEWEEQIDVYLRKRGVPKGEQAQEMISRLGGRARDVIKVTLRSNSSLKPDE FT DPKVVIDILKQHFSELTYSAMPLADFYNTLPAVGENAMDYWIRLNKAVDVA FT EHCLKRQGRNLEDPSKEVTMMFVKHCPDPALSAVLKFKTADKWKACEIQEH FT LDDHHTQLKIRQQQAKSRRPGVERYATANVQATVIDESVQVGASTRLQNQT FT EMVGDVRTVSDTTSLQNLITQLDQMLKQNNQAAIRTPYMSGGYRSVRKPCR FT VCKATDHSTTMHCRREGLCMSCCMPGHWKRDCPKARYGASVPNLNMSQEQQ FT QLLN" FT CDS 1704..6338 FT /product="Gypsy71-I_DR_2p" FT /translation="MERECAGEQNKTLNQQCEIDWERWYESASSAQYEDSQ FT VVMQNSQKIELFSELFYAPVIINKTFQVKGMLDSGSMACTVSEDTASRLME FT KNILTYDKQLTERVVLVGCGGHQTRPKCMYEVEIEVYGVQCIVPILVVPGQ FT KDDLILGSNVIKYLMHEMKSSDDYWRVATQSCDSSLAPDASQFLDMMAGLT FT RWQGAETPNKIGTVKLTQAVTLMAKQEYLVWGRLPKNTPMSPGSTVVVEPT FT SSKSMQRNILVARIITPMWGDGYIPMKIMNLSDQPVTLKRNCKLADVSPCV FT AVEDFTAFQNTSQVDRKEQVLANTESNSEDLKKKLVEVGLKDLDIDLCQVS FT NSTKQELVQLLVNYNDIFSKHALDCGEAKGFSHRIRLTDERPFRLPYRRVP FT PAHYEKLRQVLTEMENQGIIRKSISEFASPLVMVWKKDGSLRICTDFRWLN FT ARTLKDAHPLPHQSDCLASLGGNVYFSTMDLTSGFYNIPMHEEDKKFTAFT FT TPLGLHEYNRMPQGLCNSPASFMRMMLSIFGDLNFTKLLCYLDDLLVFAAT FT EKEALSRLEEVFQRLRQNNLKLSPKKCHLLRTSVKFLGHIIGGGGVSVDPE FT KVDVISRMNKSDLMDNDGVTPSVKRIKSFLGMVFYYQHYIANCSAIAKPLF FT ALTAGQKRKGKGRITKQAGTYRKLKSSDWTAECDASFNSLKEKLLNCVILA FT HPDFSKPFILSIDASLDGLGAVLAQIPQGEDKARPVAFASKTLSTSQKRYP FT AHRLEFLALKWSVCEKFSHWLKGNTFTVWTDNNPLTYIMTKPKLDACEQRW FT VAKLAPYTFDIRHIAGSKNVVADVLSRDPFAKTVSNRLLNEFYKCLLSEAE FT EVCEERIQDAFRLKVQCHRKESESDPKEILSVANSDSCDANEVNALCEAHN FT NWEQAAESRAMHLIQSVQQFTPSNLDTLTELSLQELQQRQEQDTSIATIAP FT FVSRKRRPSRREKANFNLGALFLSKQWERLTILDGILYRVSKDPTCKQKRY FT QYVLPQSLKAKALSGIHDLAGHQGQARTLYLARQRFFWPKMEQDVKAYVKC FT CQRCILAKSPEPSARAPLESIRTSSPMELVCLDFWSAEDSKQNSVDVLVVT FT DHFTKLAHAFPCANQTAKQVARKLWDHVFCVYGFPERIHTDQGANFESGLI FT AELLKLSGVVKSHTTAYHPMGNGGTERFNRTLGAMLRSLPLKAKHQWPQQI FT QTLTFAYNATVHETTGFPPFYLMFGRVPRLPVDVMFKQVLRDPVVVDYSTY FT AKTLLSHLHEAAAIAQQHSAKEQQKQAQGYNKKVKGTHLNIGDHVLIANKG FT ERGRRKLADKWESTIYKVIKRNPRTHTYVVQDNNGARRVVHRNLLLDISFL FT PAQMEFGGASDLWLEERDEEESVATSKTQDLSVCLESSCSKERTSLWVLDG FT TEHSMDHSTPSETGSGWFYTDGGQQSQDDDCSERFFEDESHVTEPRDSQTD FT CFNLDSDRNTSIVNTVELPNHETEDRDTQNIGNTQRQVTRAGRVVKKVNRL FT IESMAQKQFNFKSITNTLGRKSQSLLTLF" XX SQ Sequence 6672 BP; 2065 A; 1311 C; 1516 G; 1780 T; 0 other; ttggtgccgt gaccttcgaa aggatcaaca tcagattgac caccggatgt tgctgtatgc 60 tgcattcttc acaccaagga ttcggtttaa cttttccctt cttgacttat ctgctggaga 120 gaggtacaaa gactcattgt gaaacatcag tgaagataat ttgacagaga ccaaaaaaaa 180 aaaaaaaaaa aaaaaaaaaa aaagtttaca tagttttttt ctattggaca cctgcaaaga 240 cttttgacag aacatttttt tttctctttt tgcttaatcg taataaaata atagcaatat 300 ttgaataata attatagtta taataatttt aaagaaaaga aaatgtcata cacttacgct 360 gctgacgatg ttgaggatga tgtgtatatt ggaggaggaa ggggtagatt ttttagagat 420 atagttcatg caccaggagc tatgggtttt gttgactctc cagtggttag ttctacacgc 480 ataaaaactg attgtcaagt tcccaggtca gacacacctt ccactgattt gagccatctc 540 attacacaat tagcacatga gataggacaa tcaatctcag cacagttgtg caagggaaat 600 gaaaaggggg acaaagtgtc gaacaattca aatgtaggtt tagaaacacc ctcagcagaa 660 tcacattctt tgaacatgac tggtgtaaaa ctagtgatgc attctgacgt aaaagagcca 720 ccaagctttc gtggagatgg cactgataaa ctttctatcc atgagtggga agaacaaatt 780 gatgtttatc ttagaaagag aggtgtgccc aaaggagagc aggctcaaga gatgatatct 840 agacttgggg ggagagcaag agacgtaatt aaagttacat tgcgtagtaa ttcatcttta 900 aaacctgatg aagacccgaa agttgttatt gacattctta agcagcactt tagtgaactg 960 acatactctg caatgccact cgctgatttc tataacactc tgcctgcagt aggagaaaat 1020 gcaatggact attggattcg tttgaataag gctgttgatg ttgcagagca ctgcttaaaa 1080 agacagggac gtaaccttga agatccaagt aaggaggtaa caatgatgtt cgtaaaacac 1140 tgcccagatc cagctctttc tgctgtgcta aaatttaaaa ctgcagacaa gtggaaagct 1200 tgtgagattc aggagcacct tgatgatcat catacgcagt tgaagattcg gcagcagcag 1260 gccaaatcaa gacgtcctgg tgttgagaga tatgccactg cgaatgtgca ggccactgta 1320 attgatgaat ctgtgcaggt gggcgcatca acaagactcc aaaatcaaac tgaaatggtg 1380 ggtgatgttc gcacagtgtc tgatactaca agcttgcaga atctcattac tcaacttgat 1440 caaatgttaa agcagaacaa tcaagcagca atccgtaccc catatatgtc aggtggatat 1500 cggtcagtcc ggaaaccatg tagagtctgc aaagcaactg atcactcaac taccatgcac 1560 tgccgacgag aaggtctgtg tatgtcatgc tgtatgcctg gccactggaa aagagactgc 1620 cctaaggcaa gatatggagc cagtgttccc aatttgaata tgagtcaaga acaacaacag 1680 ctgttaaact gacggacctg catatggaga gggaatgtgc aggtgaacaa aataaaaccc 1740 tcaaccaaca atgtgagatt gattgggagc gatggtatga aagtgcttcc tcggctcaat 1800 atgaagatag tcaagtagtc atgcaaaatt cacaaaaaat tgaactgttt agtgaactgt 1860 tttatgctcc agtaatcata aacaaaacgt tccaagtaaa gggaatgctg gactcaggct 1920 ccatggcctg tacagtaagc gaagatacag cgtctcgatt gatggaaaaa aacatactca 1980 cctatgacaa acaactaact gagcgagtcg tcctggttgg gtgtggaggt catcaaaccc 2040 gccctaagtg catgtatgaa gtggagatag aagtgtacgg tgttcagtgc attgttccca 2100 ttcttgtggt cccaggacaa aaagatgatc tcatccttgg atctaatgtc ataaagtacc 2160 tgatgcatga gatgaaaagc agtgatgact actggagagt ggccactcag agctgtgatt 2220 cctcattggc ccctgatgct tctcagtttc ttgatatgat ggctggattg acacgatggc 2280 agggtgctga aactcctaat aagattggaa ctgtgaaact gacacaagct gtgaccctca 2340 tggcgaaaca ggagtatctt gtttggggca gactcccgaa aaacactcct atgtctccag 2400 gcagcacagt ggttgttgag cccacgtcct caaaatccat gcaacgaaac attctggttg 2460 cccgcattat cacacctatg tggggtgatg gatatatccc tatgaaaatt atgaaccttt 2520 ccgatcagcc agtaacttta aagagaaatt gcaagctagc agatgtttcg ccctgtgtag 2580 ctgttgaaga tttcacagct ttccagaaca ctagtcaggt tgataggaag gaacaagtgt 2640 tagctaacac tgaatctaac tctgaggatc tcaagaaaaa acttgtagag gttggcctta 2700 aggatttaga cattgatttg tgccaagtaa gcaactctac taagcaagag ttagttcagc 2760 tattagttaa ttataatgac atcttttcga aacatgcttt ggactgtggc gaggcaaaag 2820 gcttctctca tcggatccgt ctgacagacg aacgcccatt tcgccttcct tacagaagag 2880 ttcctcctgc tcattacgaa aaactgagac aagtcttgac agagatggaa aatcagggca 2940 taataaggaa gtccattagt gaatttgcct caccactagt catggtatgg aaaaaagatg 3000 gcagtttgag aatttgcacc gactttagat ggctcaatgc caggacttta aaagacgcgc 3060 acccactccc acatcagtca gactgtttgg catccttggg tgggaatgtc tattttagca 3120 caatggactt gacttcgggt ttctataata tacccatgca tgaggaagac aagaaattta 3180 ctgcattcac caccccactt ggtctgcatg aatacaaccg catgccacaa ggcttatgca 3240 atagccccgc ttcgtttatg cggatgatgc tgagtatatt tggtgacctg aattttacta 3300 agctattgtg ctatttagac gacctactag tttttgcagc cactgagaag gaagcattga 3360 gtaggttgga ggaagtattc caaagacttc gccagaacaa tctgaaactg agtcccaaaa 3420 agtgtcacct attgcgaaca tcagtgaagt tcttggggca cataatcgga gggggtggag 3480 tttctgttga cccagaaaag gtggatgtaa tctcccgtat gaacaagtca gatcttatgg 3540 acaatgacgg agttactcct tctgttaaaa gaataaaatc ctttctggga atggtctttt 3600 attatcagca ttacatagct aattgctctg ccattgcaaa acctttgttt gccctcactg 3660 ctggacagaa gaggaaagga aagggaagga tcacaaaaca ggctggtacc tataggaaac 3720 taaaatcttc cgattggacg gcagagtgtg atgcaagttt caactcactg aaagagaagt 3780 tgttaaactg tgtaatactg gcacatcctg atttttcgaa acccttcatc ctgtccatag 3840 atgcttcttt agatgggttg ggggcagtat tagctcaaat acctcagggg gaagacaaag 3900 ctaggcctgt tgcattcgca agcaaaacac tcagtacctc acagaaacga tatccagccc 3960 accgccttga atttttggcc ttaaagtgga gtgtttgcga gaagtttagt cattggctta 4020 aggggaacac atttaccgta tggacggata acaatccgtt gacatacata atgactaaac 4080 caaagctaga tgcctgcgaa caacgttggg tcgccaagct agccccatat accttcgata 4140 tacgtcacat agcaggaagt aagaatgtgg tggctgatgt tcttagtcgt gatccttttg 4200 caaagacagt cagtaatagg ctgctcaatg agttctataa gtgcttgctg tctgaggctg 4260 aagaagtttg tgaggaaagg atccaggatg cattcagact gaaagttcag tgtcaccgca 4320 aggagtctga gtctgacccc aaagagattc tttcagtagc taattcagat tcttgtgatg 4380 ctaatgaagt taatgctctt tgtgaagctc ataataactg ggaacaagca gcagagtcta 4440 gggcaatgca tttaattcaa tctgttcagc agttcacacc atctaatctg gacaccttaa 4500 cggaactctc cttacaagaa cttcaacaac gacaggaaca ggataccagt atagccacaa 4560 ttgccccttt tgtgtcccgc aaaagaaggc cttcaaggcg agaaaaggct aacttcaacc 4620 tgggcgcact gtttctcagc aaacagtggg agagacttac aattctcgat gggatacttt 4680 atcgagtgtc taaggaccct acatgcaagc agaagcgata tcagtatgtg ctgccccaga 4740 gtctcaaggc taaagcattg agtggtattc acgaccttgc aggacaccag ggtcaagcaa 4800 gaacacttta tttggcaagg cagcgatttt tttggcctaa gatggagcag gacgtcaagg 4860 cctatgtcaa atgttgccaa agatgcattc ttgctaaatc acctgaacca tccgctagag 4920 ccccattgga gagtatcaga acttcgtcac ccatggaact tgtatgcctg gacttttgga 4980 gtgcagaaga ctcaaagcaa aactctgtcg acgtgttagt agttacggac cacttcacca 5040 aattagccca tgcgtttccg tgtgccaatc agacagctaa gcaagtggca aggaagctgt 5100 gggatcatgt tttttgtgtt tatggattcc cggaacgtat acacacagac caaggtgcta 5160 attttgagag tggtttaatc gcagagttgc ttaagttgtc aggagttgtt aaatcacata 5220 ccacagcata tcatccaatg ggcaacggag gaactgaacg atttaatagg actttaggag 5280 caatgctgcg ctctttgcca ttaaaagcta aacatcaatg gcctcaacaa attcagaccc 5340 taacatttgc ctataatgcc actgtacatg aaacaactgg gtttccgccg ttttacctta 5400 tgtttggccg cgtgccaaga ctccctgtag acgtgatgtt caaacaggta ttacgggatc 5460 ctgtagtagt ggattatagt acttacgcca agacgttact ctctcatctc catgaagcag 5520 cagccattgc ccaacagcat tctgctaaag agcaacaaaa acaagctcag ggatataaca 5580 aaaaagtaaa gggcacgcat ttgaacattg gagatcatgt actgatcgct aataagggag 5640 agagaggaag gaggaaactt gctgataagt gggagtccac tatctataaa gtcatcaaac 5700 ggaaccctcg gacacataca tatgtggtac aagacaataa cggagctagg agagttgtac 5760 atcgcaatct tttattggac atcagctttt tacctgctca gatggagttt ggaggagcca 5820 gtgatttgtg gttagaggaa cgtgatgagg aagagtccgt tgctacctca aagactcagg 5880 atctctcagt ttgtttggag agcagttgtt caaaggaacg aacaagtctt tgggtgctgg 5940 atggaacgga acattcaatg gatcatagca caccaagtga aactggatct gggtggttct 6000 acactgatgg aggacagcaa agtcaggatg atgattgctc tgagaggttc tttgaggatg 6060 aaagtcatgt tacagagcct cgtgatagtc agacagactg tttcaatctc gatagtgaca 6120 gaaacacatc cattgttaac acagtagaac tgcctaatca tgagactgaa gatcgagaca 6180 ctcagaacat tggaaacaca caaagacagg tcacacgtgc tggtagagta gtcaaaaagg 6240 ttaatcgtct cattgagtct atggcacaga aacagttcaa ctttaaatca attacaaaca 6300 ccttgggtag aaagtcacag tcattactta ccttgtttta gacatttatt taatcgtcca 6360 attttatgtc ctctaatata cacttgagaa atagcttgat ccctataaat gtaaattaat 6420 tcatttgaca tggtttaaat aggcataata ttgttgactt aatgatttgg tatttggaat 6480 ctggtatggt gtaaaaggca tcatttattc tgagaggctt ttaaaggtct gatcatcctt 6540 taaatttttc tcaacctttt tgtaaggtag cattttcaag ctgagtaggt tcagagattg 6600 ttcgaataat ttagtcaact atttgtaaat gcttaagttt tagggatttt ggtgaattta 6660 gaagggggtg aa 6672 // ID Gypsy146-LTR_DR repbase; DNA; ZEB; 808 BP. XX AC chr25; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy146-I_DR; KW Gypsy146-LTR_DR; Gypsy146_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-808 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1554-1554 (2008). XX DR Genome; chr25; Positions 21221424 21220617. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 808 BP; 195 A; 114 C; 177 G; 322 T; 0 other; tgttatgttg aaaattgacc atggcccctt taagagaagt ctgtccttgt ctcgcgatgt 60 gtgtgtgtgt gtgttggtgg gcaggcgcgt gcacgcacct gcgtgactga gcagagaagc 120 catgtttgca tcgtggtgag ttgtgtgtga tgagctgtga tcgtcgtgga aaagcgttgt 180 tttgggatga cttggcttgt gtacaatata gtggactcca tgttgaacag taatcaggat 240 ttctgatgta agtttattat ttcatgttat atactaagtt catttactgt ttctcggaat 300 ttgatagagt attattgttg tcctataaaa taaggtatct tgtgtggtga tacatgtgtg 360 tataatgtat acttcattga aaatgttcaa attgtgtgat ataaggttaa attggttgtg 420 ctaacatgct tttatttctt tttttgtaca atagcactgc cgtatttcag tgtaggataa 480 agaaaacatt tttgtattcg atttttgtat aaatattttt atttattggg gcactatagt 540 acccgaagtt ttgtgctctt gcatttttct taacattgtg ctattgagat ttgcataaaa 600 gtaggatttt gagttttatt tctgtttaac tgttcaggat tatttattat tgttattttt 660 tgggaaaaag tgttttcaag gcgcttattc cgctgtgctc ccttcgaccg ctgtaatatt 720 ttcttattaa acagcagtta gataaaaccc ccatttaaag caatctttta catgcttgtt 780 ttgtgtcttg atttgccacc cggctaca 808 // ID Gypsy40-I_DR repbase; DNA; ZEB; 4979 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy40_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy40_DR; KW Gypsy40-LTR_DR; Gypsy40-I_DR; endogenous retrovirus; gag; KW integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4979 RA Dib M.R. and Naveira H.F.; RT "Gypsy40_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 810-810 (2007). XX DR [1] (Consensus) XX CC Gypsy40-I_DR is an internal portion of the Gypsy40_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy40-LTR_DR. CC Gypsy40_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1586-aa polyprotein CC Gypsy40_DR1p (pos. 160-4917) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at CC least nine independent insertions bearing at least 85% Homology CC over at least 1000bp. XX FH Key Location/Qualifiers FT CDS 160..4917 FT /product="Gypsy40_DR_1p" FT /note="ORF." FT /translation="MEVFSLEEFVTKPTIEQINVCRKQDLLLIADHYQIVV FT NKQSRKAEIKGKILLKLHEKSVLLLSNVVEETGENISQDDVAVAHLSPGAE FT DDPNGQVDVGAEAKAGLPVFDPFSPISSGYRDDARLKVRLARLQYEAQEKA FT HARQAEINLRLEIRKLEIEAEKVIKLRQLDLEAMKISSGSAAQPNSAPASV FT SSDVAPPTDPAPTAFDIGKHIALVPPFRESEVDSYFCAFERIAISLRWPKE FT VWSLLLQCKLFGKAQEVCSTLSVEDSLKYENVKSAILRVYELVPEAYRQRF FT RACKKTSAQTFVEFAREKGILFDKWCSASKVADFNSLRELMLMEEFKNCLP FT ERVVVYLNEQKVTSLSQAAVLADEFILTHKNVFATARSEKSQPNSSQTSRP FT KSSTTREERECYYCHKQGHVIADCLVLKRKQQPQVKSVGFLKTHTAFNASD FT DDKIDDNYKPFILEGFISLSENGEDQIKVKILRDTGALQSFIVADKLKLSD FT DTFCGSSVIVQGIEMNCIKAPLHRIHLKSDLCTGFVRVAVRPSLPVKGIDV FT ILGNDLAGGKVMPVLEVTEKPDLCAQSDLTDAYPDVFPACAVTRAQTRKLG FT NVIELADSFFSPLLAGEKIPDSETSREKQNTQNVDVVTDTETLKLPVSREK FT IMVAQKEDKSLAPIFDSAVTKDVIKEMKRGFFIDNELLMRKWCTNPGSDFD FT WDAVYQIVVPSPYRQHVLYLAHDHQLAGHLGITKTYNRILKHFFWPGLKKD FT VVKYCRVCHTCQIVGKPNQVIPPAPLTPIPVIGDPFEHIIVDCVGPLPKTK FT SGNQFLLTIMCAATRFPEAVPLRKITAPVIVKALIKFFSTFGLPKIIQSDQ FT GTNFLSKLFKQILQSLDITHRVSSAYHPESQGALERFHQTLKSMLRKYCLS FT TDKNWDEGVPLVLFAARETIQESLGFSPAQLVFSHEIRGPLKVLKERMLSV FT ETSKKTNILDYVSQFRDRLHSACKLAKESLASAQKGMKRKYDRKAVARSFA FT VGDKVLVLLPVPGSSLSARFAGPYEIKKKLSETDYMIHTPDRKRQTRVCHI FT NMLKVYHSKETGKDAPVSKPESVVPTVVAVAEQTSAVMSDADDGVILRDLP FT QSSVRLVNSEILSDLNSYLKHLPADQKTDIIKLISDFNCLFGDVPTQTNVL FT KHDINVDGARPIKQHAYRLNMAKRSIMRQEVNYLLENKLAKPSSSPWSSPC FT LLVPKPDGTYRFCTDYRKVNAVTVPDSYPLPRMEDCVDNVGSARFVSKLDM FT LKGYWQIPLTDRASDISAFVTPDSFMQYNSMAFGLRNAPATFQRLVNTVLA FT DIPNCSAYLDDIVVYSMSWRDHVHSLREIFTRLSNASLTLNLAKCEFGQAT FT VTYLGKEVGQGQVRPVDAKVTAISQFPVPTTRRELRRFLGMAGYYRSFCKN FT FSTVVNPLTSLLSPSTSYVWSDACQSAFDSVKALLCNAPVLAAPDCASPFK FT LEIDASAVGSGAVLLQEDETGIDHPVSYFSRKFNKHQLNYSTIEKETLALL FT FALQHFEVYVGSSCLPVIVYTDHNPLVFLSRMYNHNQRLMRWSLLVQNYNL FT DIRHKKGIDNVMADALSRA" XX SQ Sequence 4979 BP; 1393 A; 970 C; 1117 G; 1499 T; 0 other; tttaaatggg ggctcgtccg ggatctcttt ttaataaaag aaacctgaaa ttgtgataag 60 ttttgtgttt gttggtggca attggtgtta gtttgtgtat tgtacataat agatgtttgc 120 ggaagggtat cttgctgttt gtttcgtgta gtctgtatca tggaggtttt tagtttggag 180 gaatttgtta caaaaccaac catagagcag attaatgttt gtaggaagca ggatttatta 240 ctgattgctg atcattatca gattgttgtt aacaagcaga gtcgtaaggc agaaataaaa 300 ggtaagatac tgttaaagtt acatgagaaa agtgttcttc ttctgtcaaa tgttgtggag 360 gagactggtg aaaatattag ccaggatgac gttgctgtgg cgcatctcag tccgggtgct 420 gaggatgatc ctaatggtca agttgatgtc ggcgcggagg cgaaggctgg cttgccagtg 480 tttgatccgt tctcccctat ttcctctggt tacagagatg atgcgcgact gaaagtccgc 540 ttagcccgtc tgcagtacga ggcacaagaa aaagcgcatg cacgtcaggc tgaaataaac 600 ttgcgtttag aaatacgcaa gctagaaata gaggctgaaa aagtgatcaa gctgcgacag 660 ctagatctgg aggcgatgaa gatttcttct ggctcagctg cgcagcccaa ttcggctcct 720 gcctctgtgt cctctgatgt ggccccgcct actgatccag ctccgaccgc atttgacata 780 ggtaagcaca ttgcattagt cccaccattt agagaatcag aagtagattc ttatttttgt 840 gcttttgagc ggatagccat ttctttacgt tggcctaaag aggtttggtc actattgctg 900 cagtgcaagt tgtttggtaa agctcaagag gtttgctcta ctctgtcagt tgaggatagc 960 ttgaaatatg aaaatgtcaa gtctgcaatt ttgcgagttt atgaacttgt tccagaagcc 1020 tatagacagc gatttagagc atgtaagaaa acatctgctc agacttttgt tgaattcgca 1080 agagagaaag gaatcttgtt tgataagtgg tgtagcgcta gtaaggtggc tgattttaac 1140 tcactgcggg aactgatgtt aatggaagaa tttaaaaatt gtttgcccga gcgagttgtt 1200 gtttatctaa atgaacagaa ggtgacttca ttgtcgcaag cagctgttct agcagacgag 1260 ttcattttaa ctcacaagaa tgtttttgca actgcacgtt ctgagaagtc ccagcctaat 1320 tcttcacaaa ccagtagacc aaaatctagc acaactcgtg aagaacgtga gtgctattat 1380 tgccacaaac aaggtcatgt catagccgat tgtttggttt taaaacggaa acagcagccg 1440 caagtgaaaa gtgtagggtt tttaaaaact cacactgctt tcaatgcatc tgatgatgac 1500 aaaattgatg acaattataa accttttatt ctggaaggat ttatttcgtt gtctgaaaat 1560 ggtgaggatc agattaaggt aaaaatcctt agggacactg gtgctttgca atcttttatt 1620 gttgcagaca agttaaagtt gtcagatgac actttctgtg gctccagtgt gattgttcag 1680 ggcattgaga tgaattgcat caaagcaccg ctacatcgta tacacttgaa gagtgatctg 1740 tgtacaggat ttgtgagggt ggctgtgcgc ccatcacttc ctgtaaaggg catagatgtt 1800 attcttggaa atgatctggc aggaggaaaa gtaatgccgg tgttagaggt aacagaaaaa 1860 ccagatttgt gtgctcaatc ggatttgact gacgcttacc ctgatgtttt ccctgcatgc 1920 gcagttacac gtgcacagac acgcaagttg ggtaacgtaa ttgaattggc tgattcattt 1980 ttctctcctc tcctagctgg tgagaaaatt cctgattctg agacatccag agaaaaacaa 2040 aacacacaaa atgttgatgt agtgactgat actgaaacgt taaagttacc tgtatcacga 2100 gagaaaataa tggttgctca aaaagaggat aaatctcttg cacctatttt cgattcagct 2160 gtcacaaaag atgtgataaa agaaatgaaa aggggatttt tcattgacaa cgagttgctg 2220 atgaggaaat ggtgtacaaa tccaggtagt gattttgatt gggatgctgt ttatcaaatt 2280 gttgttccat ctccttatcg tcaacacgta ctatatctgg cgcatgacca tcagctagct 2340 ggccatctag gaatcacaaa gacatacaac cgaattttga agcatttctt ttggccaggt 2400 ttaaaaaaag atgtggttaa atactgtcgt gtttgccata catgtcagat tgtaggaaag 2460 ccgaaccaag tgattccgcc tgcacccttg actcccatac ctgttattgg cgatcctttt 2520 gaacacataa ttgtggactg tgtggggccg ctccctaaaa caaaatctgg taatcagttt 2580 ttactaacta taatgtgtgc cgcaactagg tttccagagg ctgttcccct gagaaaaatc 2640 accgcacctg tgattgttaa agctctgata aaattcttct ctacatttgg cctgcccaaa 2700 atcattcaaa gcgatcaagg tacgaatttt ctatcgaagc tttttaagca aattttacaa 2760 tcactcgaca tcacccatcg tgtttcaagc gcatatcatc ctgagagtca gggtgcgctt 2820 gaacgctttc atcagacact caagtccatg ctccgcaaat actgcctgag tacagataag 2880 aactgggatg aaggagtccc tttggtttta tttgctgcga gagaaacaat tcaagaatcg 2940 cttggattta gcccagctca gctagtgttc tctcacgaaa ttagaggtcc gttaaaagta 3000 ctaaaagaac gcatgttgtc tgttgagacc agtaagaaaa ctaacatatt agattatgtt 3060 agtcaattcc gtgatcgttt acactccgcg tgtaaactgg ctaaagaatc tttggccagt 3120 gcgcagaaag ggatgaagcg caagtacgac agaaaggctg ttgcgcgctc attcgctgtt 3180 ggagataagg ttctcgtact tttaccagta cctggttctt cactttctgc tcgtttcgcc 3240 ggtccgtacg aaataaaaaa gaaactgagt gagacggatt atatgattca tacacccgat 3300 agaaaacgcc aaacccgtgt atgtcatatt aatatgctta aagtatacca ttccaaagag 3360 acaggtaagg atgccccggt gtcaaagcca gaatccgttg ttcccactgt tgttgcagtt 3420 gcggaacaga cttctgctgt gatgtcagat gctgatgacg gtgtaattct tcgtgacctt 3480 ccccaatcat ctgtgaggtt agtcaattca gaaattttga gtgatctaaa ttcctatctc 3540 aaacatttgc ctgctgatca gaaaacagat atcattaagt taatatctga ttttaactgt 3600 ttgtttggtg atgttccaac ccaaacaaat gtgttaaaac acgatattaa tgtggatggt 3660 gcacgtccaa taaagcagca cgcatatcgt ctgaatatgg ctaaaagatc tattatgcgt 3720 caagaagtga actacttgct tgaaaataaa ttggcaaaac cgagcagtag cccttggagt 3780 tcaccgtgtt tgctggtgcc aaaaccagat ggaacgtaca gattttgtac tgattaccga 3840 aaggttaacg cagtcactgt cccagacagc tatcctctac cacgcatgga ggactgcgtt 3900 gacaatgtcg gttcggcacg ttttgtctct aaattagaca tgctaaaggg atattggcaa 3960 attccgctta ctgatagagc ttctgacatc tccgcattcg tcactccaga ttcatttatg 4020 caatacaatt cgatggcttt cggtcttaga aacgcacccg cgacgtttca acgtctagta 4080 aatacagtgt tggccgacat accaaactgt agcgcgtatc tcgatgatat agtagtttac 4140 tccatgagct ggcgtgatca tgtgcattcg ttaagagaaa tatttacacg actttctaac 4200 gcatctctaa cgctaaattt agctaaatgt gagtttggcc aagcgactgt cacatacctg 4260 ggcaaggagg ttggacaagg tcaagttcgc cctgtagatg cgaaagtaac cgctatttcg 4320 cagtttccag tccccaccac tcgccgagaa ctacggaggt tcttgggtat ggctggttat 4380 taccgaagtt tttgcaaaaa cttttctact gtggttaacc ccttaacatc tttgcttagc 4440 ccttctacat catatgtgtg gtctgatgct tgtcaaagtg cctttgatag tgtgaaagca 4500 cttttatgta atgcacctgt tcttgctgct cctgattgtg catcaccatt taagcttgag 4560 attgatgcaa gtgctgtagg ttctggtgcc gtacttcttc aggaggatga aactggtatt 4620 gaccatcctg tgagctattt ctctcgtaag tttaataaac atcagctcaa ttattctacg 4680 attgaaaagg aaacccttgc tctgttgttt gctttgcaac actttgaggt ctacgttggt 4740 tccagctgtt tacctgtaat tgtatatacg gatcacaatc ctctagtatt cctatccagg 4800 atgtacaatc ataaccaacg actgatgaga tggtctctgc tagtccagaa ttataatttg 4860 gatattcgtc ataaaaaagg cattgataat gttatggcgg atgccctgtc tagagcttaa 4920 tttttttttc tctctctctg gtttatttta acaaacaggt ttgttcttga gggtcaggg 4979 // ID Gypsy90-LTR_DR repbase; DNA; ZEB; 385 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy90-I_DR; KW Gypsy90-LTR_DR; Gypsy90_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-385 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1492-1492 (2008). XX DR Genome; chr18; Positions 17919239 17919623. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 385 BP; 111 A; 115 C; 76 G; 83 T; 0 other; tgtgacaagc atcccatgaa gctatcagca tcgtcctgga gaggaacgag gaacacctgg 60 gccctctcat tcccaactaa tcagcaccag ctgcaactca tccaagaccg gctacataag 120 ccacaccaaa agaaccagaa ggggagaagc atacgaagga agcagagcag taacacttat 180 tctccctttc tttcctctag acagcagttg gtgaccgatc ggggtcctag acccatctca 240 cagacactga gcaccccacg gatgaaagaa catcacccgc tggagcaccc ctttggatct 300 tgcactcatg cacctttgta aataaatcca ccctctggga tctaaacttt aattcttttg 360 tcgtgtgatt cttcagccca tcaca 385 // ID HATN5A_DR repbase; DNA; ZEB; 1501 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE HATN5A_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HATN5A_DR; HATN5_DR; nonautonomous DNA transposon; TDR8; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1501 RA Kapitonov V.V. and Jurka J.; RT "HATN5A_DR, a composite nonautonomous hAT-like DNA transposon RT from zebrafish."; RL Repbase Reports 2(5), 25-25 (2002). XX DR [1] (Consensus) XX CC HATN5A_DR is a nonautonomous DNA transposon that belongs to the CC hAT superfamily. HATN5A_DR is a composite transposon, it carries CC the TDR8 element (positions 278-1221). CC Several hundred copies of HATN5A_DR are expected to populate the CC zebrafish genome. They are ~5% divergent from the consensus CC sequence. CC This element is characterized by 17-bp terminal inverted CC repeats and the 8-bp target site duplications. XX SQ Sequence 1501 BP; 461 A; 278 C; 305 G; 444 T; 13 other; tagggatgta acggtattgt aaataccgtc ataccgcaat attaattttt ttcgatatta 60 ccgaagtcgc atgactcggt aaaactatag gtcttctgag aaaatttgct caggcgaatg 120 aagcgaacgg gaggtagcga aaactacaat wsccatcagc ccatgcttga ccatcatccc 180 ttgcggtctg ttgtcgctac agatccagta atgcggaaat ggagtgtgct gctagaagcg 240 gggatgaaaa gagctggaaa actctaaagc aggtcgcaca ccaatgcgtt ttagttygaa 300 aacgcataag ttttgctacr gttacaccaa ccgtycacac tacgcnggmg ctctcgagcg 360 ccgaaaacgg agcgtttcga aatgtgcgga taacgtctca gtgtggataa tggaaaactg 420 agacatctga aaacggargc ggggctgcag acattcgcct ctctgattgg ggcttttcct 480 gaatattaag tagcctaaca cgcacagttc agtcctgcat tatctccgtg taagttcaga 540 cttcgcaagt ttgatcaagg ctgcagtctc ttcttctcag tttgatatgg aaaagcagat 600 tataccgagg acacgggtaa atcttcaaag ggaacagtgt actttataac ttcattcaca 660 tcaccctggc ttcgttgttt cactttmtca acaataaaat gtaaacatga tttaaggaac 720 tgcctatttt cattttaata ttagcaactt agacagcaga matgttgagg cgtcgtgctg 780 catawnatga gcgtcatctt cactgtgtgg atatttataa caaaacggag ccgataacaa 840 ctgcctcctt tcaatttcag tgaaaatacg aaacacagcc tctcttttgc tgagtatcag 900 ttttaagaat cgataatggc cattttaaaa gtataacata caataagttt atacattata 960 ggaaataaag gcaagcgatc agtcaatata cagaatgtac gtggttacat taatcattaa 1020 cttatctttg cgctcagcca aaacacgtta cctgagaaca agtaatagat tccaatgmcc 1080 aaagtcaggg aatatgtcgt tagataaaga caacaagata aatgaaatat cccgtttaat 1140 aaatatagtg agattagatc cagcgggaga tgcttgatga gaagtccgac tagcagagct 1200 ctcatctggg tagatgggct gagtgtgatt aaatgttgaa tgtgatgagt tttcaacaat 1260 actaaattga aactttattg tttttacatg gtttaataat tttttgttat taaaattgaa 1320 gttcctgttt caaagcttac agatagatgg ctaatttgta tgtcattgac acttttggca 1380 cttttttgga gtattttcat aagttttgtt ttttcctgta aatgattcaa taaataccgt 1440 accgtgacat tcataccgag gtattaccgt accgtgaaat tctgataccg ttacatccct 1500 a 1501 // ID DNA-6-N3_DR repbase; DNA; ZEB; 1478 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1478 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1715-1715 (2008). XX DR [1] (Consensus) XX CC TIR = 292 bp, TSD =6 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1478 BP; 429 A; 273 C; 304 G; 472 T; 0 other; tacggtggcc gggaagtgca aaacaacatt acaaagtttg aaacactttt acaaagctcg 60 agacaaattt acattttgaa aaacattttt acctatcatc agacacaatt acataggaaa 120 aacaatttta ccaaggacga aacaaattta cattttagaa aacaaattaa caagacgcaa 180 aacactttta caaatcccga aacaaattta caattacaga ttccctacgg aaagggaatg 240 taccacaccc cggaagtgac gtagaatgta ccacacaccg gtagtgacgc ggggaaaagt 300 tgttgttgtt gctggtgagc gcggtaaatt cgcgctgtgg agtacatggc gtaaacaaag 360 tttatggtga ccgaacatgg acagcggtcg agggttgttt tgcccgtttt gtggcaaaca 420 catgagcagc ttaacacggt tttgctttgc gtgtggtcgg tgtttagagt ttataaagga 480 cgcggaccag acggaaacac cagacatgct gcatcaatgt gttcaatatt tttacgagat 540 ccactcgtac gctgtaatcg tgaacatgat gtcaagtcta cacggtgtaa acctctgctt 600 gaggactctt aacagtaaac tgaaggaagc ctgtgttatg gtttaactgg tgtccgtgtt 660 aactgaggcc ccttttcaga tgtgcaaact attcacgttt gcagattggc agattcgtca 720 ctttttcaca gcaacaaaac atgcccatac cgaataaaga ttcttattac ttggtgtcag 780 tgtaaacatt gttgatttta atatttgtgt ggcagaacag cataaaccaa aattaatcta 840 aagaaataat tataattgac atgtcatcaa cgggattcga acccaagtca aaaggttctg 900 cacgagtatc aaccgactta tccaactgag ctactccggt gtacaggttg agattggttt 960 tggtatcttt atcagtcaac atgtgccgtg ctttaattgt ttccatgtac ttgcatttac 1020 atgtttctat gcactctcac gctgatattt tctcagaata gctctaatat gttttatcgc 1080 agttagtcaa accatgattt ttatagttgt agacttgtgg aaacctgtct acaaagtcac 1140 acacgagctg gatattattg atattttcca gtgtgtgtgt tacattccgg gtcactaccg 1200 gtgtgtggta cattctacgt cacttccggg gtgtggtaca ttccctttcc gtagggaatc 1260 tgtaattgta aatttgtttc gggatttgta aaagtgtttt gcgtcttgtt aatttgtttt 1320 ctaaaatgta aatttgtttc gtcctttgta aaattgtttt tcctatgtaa ttgtgtctga 1380 tgataggtaa aaatgttttt caaaatgtaa atttgtctcg agctttgtaa aagtgtttca 1440 aactttgtaa tgttgttttg cacttcccgg ccaccgta 1478 // ID DNA-2-1_DR repbase; DNA; ZEB; 2550 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2550 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1217-1217 (2008). XX DR [1] (Consensus) XX CC The TSD is 2-bp, mostly TA. The TIR is ~30 bp long.The 5'-CACT CC terminus suggests that this family belongs to the EnSpm CC superfamily; 2-bp TSDs also occur to EnSpm elements. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2550 BP; 953 A; 415 C; 338 G; 841 T; 3 other; cactcaaaaa aatgatatgt tggcgttaat taactttttt ttgtccactg gttccacgta 60 accgagttaa gttatcttta agaaagaata ttaattttag tgaacttaag ttagttacat 120 aaaggtaatg taatgttgtt tagttcaatc aacgtaacat ttatagttta gctttaatca 180 atgtagttat ttaagttcaa ttattctaaa attgatattg ttatgtttaa ccaaataaaa 240 aaactgtttc cgtttcattg agtttattca atagcatttt tacaatttta cacatttcca 300 gttttacaca ttttccagca atcaaatgat tatcaaaatc taaatattgt ctttatctgt 360 atcttcactt attacaataa gttaaaatat taatactatt aaaataatag taatacaagt 420 atgaagtaaa ttaaaaacaa taaatgattt acaaaaatat gcttacaaaa gcatcataaa 480 atacactttt caaaatgata cttttcatat attacaacaa aacattcaaa aataatattc 540 tgccttcaaa tgatttacaa aaatatactt cgaaaaatta agtcttcttt tgtaaactat 600 actgtacctt ctttacactg aaaaagcaga taacacttca gttgttcaaa gaaagattaa 660 cattttcagg caaagtaaca cgatcagcac tacaccataa attagaccaa ccatcacaga 720 acatgcaata gctacacaca ttcatctctg acatttgcaa catttctgca catttgcacc 780 agacactttc ttacaaccac tccctatatc taaaaaaaaa aaaaaaaaca caaaaaaaac 840 attgtgcatg aagtctgctt cacacaggta agtgggcttg acaaaccaca tgtaaaagca 900 ttcttaattc ttaacattct ttgtataatt agcacatctt gggtgtattg cctcttcttg 960 ttgatttgct gaatgtctcc tcaattgtaa gtcactttag acaaaagagt ctgctaaatg 1020 ttaatgtaac tactgtagat tcataaggtg gtgttgcaca tcaaggtttg tgcttcatca 1080 tgcccagcat cctaaaaatg tcagataaaa aaawtaaaca tttaccaaca gatcaaaggt 1140 gaaaggagga attggtttaa gaggaaaata caaaatgttt ttaaaaaata ataaaataaa 1200 atgtaaatag cttaccatgt actccctcac ttaggggctt tgaggatgca ttcacacctt 1260 tttttccact ggtgatgttc taaatgacat gaaatgacaa ttagtaattt actaaataat 1320 tagtagttaa aattactagt aatttttact gtacatcatt ttaatgtgtg tataggcatc 1380 taaaggcata gtacctggat gatccacagt ctcctcagcc agatatggga ctttagaggt 1440 tgagctgtaa acctgtgagg gagaaaatag atgcgagtga gcaaagattt gtaaaaataa 1500 taaaatataa taatataaat atatagcatt tagaataata tatttataat aataaaaatg 1560 aaaaaaawaa aacactgcaa actactaaac ttaaaagttt aaatgtgtta gatgaccaaa 1620 caaataagct tacattaaat atgtcatcat gctagatgtc atcctattta agtgtcacat 1680 atgcttgtca agtttttttt ttttttttta agttaacagg cttgaataat aacaatgtgc 1740 caacaccacc ttttaattaa tattatgtaa gttatcaaaa ctagcattca aacttgacaa 1800 acagcttgac actcatttaa aatagagact atcaaatcaa atcaaaaatc aaatcacttt 1860 tattgtcaca ttatcagcag cacgtgtgct atgataagtg aaatgcttag actattctaa 1920 catgactatt ccttactacc attgtttgca actttatttt gggggagtta atatayacaa 1980 agtataaaat ctcacaggaa tgcaaaaaaa ctacaaattt atctgtcatt aatgtgaaaa 2040 agaaacaggg aacatggcaa tttcataaca atactataca caaacacaac agagtattac 2100 aagggagtac attaatgact caaatgttgt cacagcctca atataaatac ataaaaagta 2160 aataaaatct cacctgtgtg tccacattgg gcaaagcttt gtgcacgttc tcaatttgta 2220 ctgtaatctt catcttttga ttcccataat atggcgccga gctgagagtg atcgttgtgg 2280 tttgaatctt cttacttcct gtgttgtttt gaaactgcgc atgtgcagat attactccac 2340 ctattgacca aaatgcagta ctgcagaagc aacacaactc cctttcattc atgttaatta 2400 agcgagctgt ttaaatgtaa aaactaactt tttctaattt tttccaacac aacactttta 2460 tttttaatac acaaacataa gcgaattgtt taatttcaat atagttacat tctttggacc 2520 aactaaagcc ataattcatt tttttgagtg 2550 // ID DNA-2-31_DR repbase; DNA; ZEB; 852 BP. XX AC . XX DT 13-JAN-2009 (Rel. 14.01, Created) DT 16-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-31_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-852 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 9-9 (2009). XX DR [1] (Consensus) XX CC TSD is 2-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 852 BP; 283 A; 153 C; 131 G; 285 T; 0 other; cagtttttac agacgctagg acacatttct caatacttag gtcacttttg caaaactctt 60 cacacaattc tcctaaccga ctttcagctt ggcaaagcag ttcatttcac attcaaaatg 120 cactacaact accaaaacac tttattcagg tctcaaataa actcattctt ccagaacact 180 agcaaaggtt gacagccgac aaacacactt tgtcacccac aaaacaatga cctaaaaaac 240 actaacaaca tgtagcatta cacagtgttt tcttgtgtaa aacaaggaca catctctgtt 300 tataattgca atatatattg tttataactg caatatatgt tactggaatg aatcagacat 360 gatgcagaat tcctcaatat tttatgtcgt ttatttattt ttattttttt caactaagta 420 attccaaaat acaagaattt gtatacacac catccactga tacagataca atagtaatgc 480 taatctggct ggttaaactg catttttaga tttgaaatat gtttcaataa aatattgctg 540 ttgaattttg ctgtcttatt cagttttgca ttatgttatt gttttgaaca taagtttaac 600 agttttgaaa acagtatgta agcacgtgca aaatgtcctg tacgtacaaa gatttttggc 660 agttgttgtg tctgagtgag aaaagaattc atgaaatttg agagatgtag tcattgaatg 720 cattttgtgc caaaacaatg ataattgatc ctcagtttag cccacataga cttctgttgt 780 gctcactgtg tgaagagttt tgcaaaagtg acccaagtat tgagaaatgt gtcctagcgt 840 ctgtaaaaac tg 852 // ID Copia1-I_DR repbase; DNA; ZEB; 4057 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia1_DR; KW Copia1-LTR_DR; Copia1-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4057 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1584-1584 (2008). XX DR Genome; chr1; Positions 26847616 26851672. XX CC Positions [1465-1977] - Integrase core CC 'ACGCT' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 103..3993 FT /product="Copia1-I_DR_1p" FT /translation="MRLLGLKETVLSEASAHADDDEGPDKNEEAYAELIQY FT LDDKSLSLVMRDAADDGRKALRILRAYYAGKGKPRIISLYTELTSLQKKPS FT ESVTDYIIRAETAITALRNAEETLSDGLLIAMILKGLPDSFKPFAIHITQS FT DTEITFAEFKTKLRNFESTENFHSVASEDNVMKANASPSGRKGREGLAYSD FT ITCYNCQQKGHKMRTCPALNCKRQWCNYCKSSTHKDENCRRKRRDDIKLAS FT DKNDDEEHTFIFKASDYRSYEIQRKGLMVDTGATSHIITDLKKFKQFDDKF FT QPEKHYIELADGTKASGVAQRRGDAEVHLIDSAGHKVKTILKNALYIPSYP FT QNIFSVKAATTNGASISFQQGQDKLTHKDGTKFSIHEYNRLYYLNTANGDG FT DTDSCNSCYDIQTWHEILGHCNYEDISKLQNVVKGMKITGKIDKSNLNCEI FT CTQGKFVQSRNREPDTRAKAALELVHTDLAGPIDPEAKDGFKYALAFTDDY FT SGAVFVYFLKAKSDTARATEKFIADTAPYGRIKCVRSDNGTEFTAKEFQSL FT LSKNGIRHETSAPYSPHQNGTAERNWRTLFKMARCMILESGLPKMLWTYAV FT MTAAVIRNRCYNSRIHQTPYYALTGKEPDLSKMRIFGSVCYAYKHDKKKLD FT ARCEKGIFVGYDKYSPAYLIYYPDTGKVLKHRLVKFITRCVADHQTQTEIT FT MSDDDSHRERYVSPQSTHNITNEMPEKTTETESQSSAQTESGEQCARYPKR FT NRRAPQYLTDYVSDVESDDQILTNVDYCYRMCDLPQTFEEAMNSNESKMWV FT KAMEDEISSLQENNTFTLTTLPEGKHAVGGRWVYSIKSNIDETETYKARYV FT AKGYSQVMGVDYIETFSPTANITSIRALMQMAAQYDLELHQMDVKTAYLHA FT PIDCEIYMEQPKGFEVKSNRGDKLVCKLNKSLYGLKQSGRNWNKMLHVYLT FT ENDFIQNLADYCVYSKRSGKDRVIIIIWVDDLIIAASDNSLLKDVKEMLTS FT KFKMKDLGKLKHFLGIDFDQSKGTVRVNQKRYILRILERFNMLDCKPRSTP FT CELKLDYDSDSDPIDPKRYREAVGSLIYLMTCTRPDLSWVLSKLSQYLSKP FT NEQLWSTVKHVMRYLRGTIDQELCYKKSDEKLKLVAYCDADFASDLNDRRS FT TTGYCFSLSANGPLISWKSKKQSTVALSTCEAEYMALTVTTQESMYLVQLL FT DSMENDCMYMPVQIFEDNQGAIALSKNPVCRQRCKHVDVRYHFIRSALSDG FT KVTVENCPTEDMVADVMTKPVTKFKLEKFKGFLFGM" XX SQ Sequence 4057 BP; 1400 A; 719 C; 930 G; 1008 T; 0 other; ggttatgggc ccagaagaga tgtgagtaac cgatggaata gattgtgttt cgacggagat 60 gagaaaaact atgaactttg ggaaacaaag tttttaggac acatgcggtt gctaggattg 120 aaagagacgg tgctatcaga ggcctccgca catgctgatg atgatgaagg acctgataaa 180 aatgaagaag cgtatgcaga attaatccag tacctggatg acaaaagttt gtcacttgtc 240 atgagagatg ccgcggatga cggtaggaaa gctctgcgga ttttacgggc ctactatgct 300 ggcaaaggga agccgcgaat catttctctg tataccgagt taacttcttt acaaaagaaa 360 ccaagtgaga gtgtcacaga ctatattata agagctgaaa ccgcaatcac agcactaaga 420 aatgcagaag aaacccttag tgatggactt ctaattgcga tgattttgaa aggtttgccg 480 gactcattta aaccgtttgc aatacatatc acacagagtg atacagaaat aacgtttgct 540 gagtttaaaa cgaagctgag aaattttgag agtactgaaa acttccacag tgtcgccagt 600 gaagataacg tgatgaaagc aaatgcctcc ccctcgggac gcaaggggag agaagggctc 660 gcatactcgg acattacgtg ttacaactgc caacagaaag ggcacaaaat gcgtacatgc 720 ccagccttaa actgcaaacg acagtggtgt aattactgca aaagctcaac acacaaagac 780 gagaactgta gacgaaaaag gagagatgat attaaactgg ctagcgacaa gaacgatgat 840 gaggaacaca catttatctt caaagccagt gactacagat catacgaaat acagcgaaaa 900 gggctgatgg tcgacactgg cgcaacctcg catatcatta ctgatcttaa gaaattcaaa 960 cagtttgatg acaaattcca acctgagaaa cactacattg agttggcgga cggaacaaag 1020 gcgagtggag ttgcgcagag aagaggtgac gcagaggtgc acttgattga cagcgcggga 1080 cataaagtga aaacgatact aaagaacgcc ttatatatac catcgtatcc acaaaacatt 1140 ttttctgtta aagcagcaac aacaaacgga gcttcgatca gctttcaaca aggacaagac 1200 aaactgactc ataaggacgg tacaaaattt agcatacatg agtacaatag actgtattat 1260 ctgaacactg ctaatggaga tggtgatact gatagttgta actcatgtta tgatatccaa 1320 acctggcacg aaattcttgg gcactgcaat tatgaagaca tttccaaatt gcaaaatgta 1380 gtgaagggga tgaaaataac agggaagatt gataaatcta atttaaattg tgagatttgt 1440 acacagggta aatttgtaca gagccgaaac agggagcctg atacaagagc taaagctgca 1500 ctcgaactag tgcacacaga tctagctggt ccaattgacc cagaagctaa agacggcttc 1560 aagtatgctt tagcattcac tgatgattac tctggtgcag tatttgttta ctttttaaaa 1620 gctaaaagtg acactgcaag agcaactgaa aagttcattg ctgacacagc accttatggg 1680 cgcataaagt gtgttaggtc agataatgga acagagttca cagcaaaaga gtttcagtct 1740 cttttaagta agaatggtat acgccatgag acttcagcgc catattcgcc ccatcaaaat 1800 gggactgccg agagaaattg gcgaacactt tttaagatgg caagatgcat gatcttagag 1860 agcggcctgc ccaagatgtt gtggacctac gcggtaatga ctgctgcagt aattcgcaac 1920 aggtgctaca atagccgtat acaccaaacc ccatactatg cattgactgg gaaggagcct 1980 gatctctcta aaatgagaat atttggttca gtatgctacg catacaaaca cgacaaaaag 2040 aagctggatg ctagatgtga aaaagggatt tttgtaggat atgataaata cagccctgca 2100 tatttgattt attatccaga cacaggaaaa gttctgaagc acagacttgt gaaattcata 2160 acaagatgtg tggctgatca tcagacacag acagaaatca caatgtcaga tgatgattct 2220 catagagaga gatatgtatc tccccagtca acacataaca ttactaatga gatgccagaa 2280 aagacaactg agacagagag tcagagtagt gctcaaactg agagtggtga acaatgcgca 2340 cgttatccta agagaaatag gagagcccca cagtatttga cagattatgt atctgatgtg 2400 gagagtgatg atcagattct tactaatgtt gattactgtt acagaatgtg cgatttacca 2460 cagacttttg aggaagctat gaattccaat gagtctaaga tgtgggttaa ggccatggaa 2520 gatgaaatta gctcactaca agaaaacaat acatttaccc taactacact accagaaggt 2580 aaacatgcag tggggggtag atgggtctac tctattaaga gtaatataga tgagacagag 2640 acatacaagg ccagatatgt tgctaagggc tatagtcaag tgatgggtgt agattacata 2700 gagactttct caccaacagc aaatatcaca tcaattcgag ccttaatgca aatggcagca 2760 caatatgatc ttgaacttca ccaaatggat gtgaaaacgg cttacttaca cgctccaata 2820 gattgtgaga tttacatgga gcagccaaaa ggatttgaag tcaaatcaaa cagaggtgat 2880 aaactagtct gtaaattgaa caaatcactg tatggtttaa aacaatcagg acgaaattgg 2940 aacaaaatgt tacatgtgta tctcactgaa aatgatttca tacagaatct ggctgattac 3000 tgtgtgtaca gtaagcgaag tggtaaagac agggtgatca taataatctg ggtagatgat 3060 ctcatcatag ctgctagtga caacagttta ctgaaggatg tgaaagaaat gctgacatca 3120 aaattcaaaa tgaaagattt gggaaagctc aaacattttc tgggaattga tttcgatcaa 3180 agtaagggaa ctgtaagggt aaaccagaaa agatacatat tgcgaatact ggagagattc 3240 aatatgttag actgtaaacc cagatcaaca ccatgtgagc tgaaattgga ttacgacagt 3300 gatagtgatc ctattgatcc taaaaggtat cgtgaggcag taggtagttt gatctaccta 3360 atgacatgta ctagacctga tctaagttgg gttcttagta aactatcaca gtatttgtcc 3420 aaaccaaacg aacagctctg gtctacggtt aaacatgtta tgaggtactt gaggggtaca 3480 attgatcagg agctatgtta caagaaaagt gatgaaaaac ttaaacttgt ggcatattgt 3540 gatgctgatt ttgcctcaga tctaaatgac agaagaagca caacagggta ttgtttcagc 3600 ctttctgcaa atggtccttt gatctcatgg aagtcaaaga aacagtccac ggtagctcta 3660 tcgacatgcg aggcagagta tatggcactg actgtaacca cacaagaaag catgtacctt 3720 gtacagttat tggacagtat ggaaaatgac tgcatgtata tgcctgtaca gatttttgag 3780 gacaatcaag gtgccattgc actatcaaag aatccagtgt gccgtcaaag atgcaaacat 3840 gtggatgtac ggtatcattt cattcgatcg gcactgagcg atggaaaagt gactgttgag 3900 aactgtccca cagaagacat ggtagccgat gtcatgacta aaccagtgac aaagtttaag 3960 ctggaaaaat tcaaaggttt tttgtttggt atgtaaatcg aacatttgtg ttaccatata 4020 cagtttgtat actgtacagt atgagtgcaa gtggggg 4057 // ID Gypsy-168-LTR_DR repbase; DNA; ZEB; 539 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-168-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-539 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 697-697 (2009). XX DR [1] (Consensus) XX SQ Sequence 539 BP; 115 A; 176 C; 94 G; 154 T; 0 other; tgtcaccgac tcggtcccag tcattcccct cgctggccag cagaggtcat catcaccgga 60 ctattgacat tacatcatcc acttagactg attgcactca cacctgaagc acatccagca 120 atcacctgcc acacacatat aagcagcact cacacacagc ttcattgcga agtcttgttc 180 tgccccggct aacactactg agcgttctca tccttgcctg ctttcccgtt gtgatcctgc 240 ttcgtttact gacccagtct tgcctgccgc ctgccttgta cttcagcctg aatcccgact 300 ttgatcctag ccgcctgcct ttgaactcta gcctgcttac cgactctgat actagccgcc 360 tgcctctgac ccatgcctgc atactcatcc tgtgtttacc tgccgccagc ccaacgacta 420 catactactg tgtttgatgt gagtttgcgc acgcacaaca tctgtgtgct attttttatt 480 aaactgtgtt taataaaaat actgcaaatg gatccctctg tgtcagccac ttcgttaca 539 // ID DIRS-4N2-LTR_DR repbase; DNA; ZEB; 387 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE nonautonomous DIRS-like LTR retrotransposon family , LTR- a DE consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; phage integrase; DIRS-4_DR; KW DIRS-4N2-LTR_DR. XX NM DIRS-4N2-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-387 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(10), 1270-1270 (2008). XX DR [1] (Consensus) XX CC The element is the solo 3'- LTR portion of an assumed DIRS LTR CC retrotransposon, derived from recombination. It contains two CC split LTRs which show similarity to those of DIRS-4_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 387 BP; 108 A; 107 C; 73 G; 99 T; 0 other; gtgatgtttt ataaacaaat ttcgggagga gcacgatcag tctttgacga ggctaattca 60 tcacacacac aatcattcta ttatccaatc agctcaagag caaaccccta taaatagtca 120 aacacgtcat acctccgttt tctcttgact tcagcgtccc tccaccaccc caactccaca 180 ctattaaacc agatacctat ttaaatctga ggggggagcg ttctggagcc gggctagaca 240 ctgcgctcgg accctatctc tctttatcct gataagggga ataacacgag ttagggtgtc 300 ttcccgagct cagagccctc tccccggaca gcacgccaaa tacgcatatt ctattcagtc 360 aaatatctgt gagtgtgaac tcgtgaa 387 // ID Gypsy53-LTR_DR repbase; DNA; ZEB; 222 BP. XX AC chr17; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy53-I_DR; KW Gypsy53-LTR_DR; Gypsy53_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1422-1422 (2008). XX DR Genome; chr17; Positions 36952436 36952657. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 222 BP; 44 A; 55 C; 44 G; 79 T; 0 other; tgtagtgcat tgatccttat gactgcgtat atgggatctg tattgttctg tgtagctcct 60 ccccctctac ctcgctctca ctctgtcaca cagtcattca gtatgtatgt gcgtcgatac 120 taagtgaagc tgctgtgtgt cttctgtgcc cgctgttatt aaacactgat tagtttgaac 180 cctgcttctc tatcagttta atgcaagttc tccgagtcaa ca 222 // ID Gypsy43-LTR_DR repbase; DNA; ZEB; 794 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy43_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; LTR; Gypsy43-I_DR; Gypsy43-LTR_DR; KW Gypsy superfamily; Gypsy43_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-794 RA Dib M.R. and Naveira H.F.; RT "Gypsy43_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 817-817 (2007). XX DR [1] (Consensus) XX CC Gypsy43-LTR_DR is a long terminal repeat of the Gypsy43_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy43-I_DR. XX SQ Sequence 794 BP; 158 A; 154 C; 188 G; 294 T; 0 other; tgttacatat tgagtaacat taattgtgac aatgcaagtg tgtattgttc ctgtgtgttt 60 cctcccatta tctatctctt tctctctctc tctccacatg agcggagatc aggtgcagct 120 gtgagagggg cggcgccagc tataaaggca cgccgttgac gctcctcatc gagttctctt 180 cctgtttccc ctcgccattg tgtgtgtgtg tctgagcagt tcgctctgag ttctctgtgc 240 ggcagtgatc tgtctgaaac ctttcttttc aaagtcatct ggtgagatac tctgttaaat 300 acggtaatag gacggtaacc tgcattagct gtggctaatt gtttaatgcc tttctccaac 360 cctgtttcaa agtcatctgg tgagatactc tgttaaatac ggtactagga cggtatcatg 420 tgttagctgt ggctaagatt taccttaatg ctgaatgatc aacgggaagg ccctttttct 480 tttgtccagt ttttctcctg ttttgtggtt agttatggag gttagaattg tatgctttat 540 ttatggaatt ttatttttgg tataggtaag aggtttaata ttaggtagcg taccttgttt 600 tgtttgttgt tttggccttt tgttccctga ttctggtttt gatctttttg tttaataaat 660 ctgatacccc ttattgacaa cttggcctgt gtgtgtgtgc ttgggggaac aggaacgggg 720 actccgggtg accacttttt atagcggtca tccttaaact tatgtgcgta gcggggtgca 780 tccgctacgt aaca 794 // ID Gypsy104-LTR_DR repbase; DNA; ZEB; 870 BP. XX AC . XX DT 22-APR-2009 (Rel. 14.05, Created) DT 22-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy104_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; KW endogenous retrovirus; Gypsy104-I_DR; Gypsy104_DR; KW Gypsy104-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-870 RA Dib M.R. and Naveira H.F.; RT "Gypsy104_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 945-945 (2009). XX DR [1] (Consensus) XX CC Gypsy104-LTR_DR is a long terminal repeat of the Gypsy104_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy104-I_DR. XX SQ Sequence 870 BP; 126 A; 231 C; 157 G; 356 T; 0 other; tgtcatgtcc tcggtttgat catctgggcc actgggtgtt gttgttgttt tgtttgtatt 60 atctatgtgc ctttccctag catgttttgt tttgatcctc cgccacatgc cttctgttta 120 ctctctaatt ccacgcccat cctcctgttt aagctattac gttattaagt taatgtcact 180 cacctgttct tcttgatttc tccctctatt tattccttcc cttagcgctg tcctgtgtca 240 gattgttgtt tgcccttctc gtgtttccct cgatctcccg gtcctgtcca gtcagccttt 300 ttcctgttgt atctgaatcc cggtctacat ctttgatttt gcctttttct gctaccaggc 360 ccgttcccgt ttagtttcat tcctgctgct cgcctgtctc ttctctccgc ttcagcgatc 420 cattttccgg cctcctccag ccccgcagcg ctctctcttc ctcgcgggtg tcgcggccca 480 ggaaacgcga cccagcagta tcgccccctg ccggcggtac ctattattac accctctggg 540 agaagcctgt cctctctacg aagaggtttt tgttccttgt tttttttgtt ggatttttcg 600 gacatttttg ttacattttt ttcccttttt tctgacttta gttttctaag acttttttcc 660 tctacggaga gtttaagttt aagttttttc cctctggatg tttttgagaa ttctccatag 720 tgaagctatt gaacatctta ttttcctctt cggagctgtt tttgttgaag aacctttttc 780 ccatctgagt ttgctatttt gcttatattt gaataaactg tgaaggacct gcatttgtgt 840 cttctgtttg ccttgatcag atccctgaca 870 // ID Gypsy116-I_DR repbase; DNA; ZEB; 7004 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy116-I_DR; KW Gypsy116-LTR_DR; Gypsy116_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7004 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1519-1519 (2008). XX DR Genome; chr20; Positions 51691869 51684866. XX CC Positions [3479-3904] - Reverse transcriptase CC Positions [5436-5912] - Integrase core CC 'CAAA' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 2384..3967 FT /product="Gypsy116-I_DR_1p" FT /translation="MLDTGSMACTLSDSVVTKLREHGVLDTVTEKETDVII FT IGCGGKRVFPKSVVELKLDVYGCDVIVPCLVVPGQDDELILGTNVIKYLVH FT RLKGSDRYWELISTPSGPSSEGDEFLSMLAGIRRWRGETVPDVVGTVKLHS FT AVTLSPGQEHLVWGKLPAKSCGQVGSTVMVEPSRVHSASKKVLVGRVVSPL FT WGDGWIPLKLINPSDKPVTLRRNTKVADVYPCIALEDVCEAASESTVRLCV FT QQSAVEPDRSDAPSSRLQKCGLESVDLDACEVSDFWKAKLCDLVLEYESIF FT SRHSLDCGEVRDFVHRIRLVDERPFRLPFRRVPPSQYQKLRVALNEMEERG FT IIRKSVSEFASPLVLCWKKNGDLRICTDFRWLNARTVKDAHPLPHQEDCLA FT ALGGNSFFSTMDLTSGFYNVPLHEDDKKYTAFTTPVGLHEYNRLPQGLCNS FT PASFMRMMTKIFGDQNFLSLLCYLDDLMVFAPSEEVALQRLEMVFGMLKKH FT NLKLSPKKCNFLRRSVKFLGHLITEEGVSTDP" FT CDS 4011..6746 FT /product="Gypsy116-I_DR_2p" FT /translation="MCPDGATPSVKRIQSFLGLVMWYQRFIENCSSLARPL FT FALTSGVRKARGVGRRGVSITRKLTPDDWSPACEAALSALKEALLSNVVLA FT HPDFSRPFILATDASTEGLGAVLSQLSPGENRARPIAFVSRSLSRAQARYP FT AHRLEFLALKWAVCDKFSHWLKGHRFTAWTDNNPLTHILTKPRLDACEQRW FT VAKLAAFEFDIKYVPGSKNTIADTLSRQPFVDSRVSHRIVREPYEKLIAES FT RNHTCATVQDLFRLSNACQRVVGSAEVSADACDRMGVAPDVDGPGSFSNAE FT VAAVFSNHLSWDNATRMRTIPLVGFVQQITDVGFDALPSYSHQELRDSQLQ FT DEHLQRVMFYVDQKRRPSRRVRMGEPSAVRKLLKQWDKLSLADGVLYRVSK FT DPLTHHKRFQFVVPPSLKDEVLRGCHNEAGHQGQYRSLHLCRQRFYWTGIE FT QDVREYVKVCSRCVLAKTPEPEGRALLESVKTSAPLELVCIDFWTAEDSRN FT KSVDVLVVTDHFTKLAHAFCCSNQSARSVAKKLWDNYFCYYGFPERVHSDQ FT GANFCSELIRHLLEFSGVKKSQTTPYHPMGNGTPERFNRTLGNMIRTLEPR FT PKHDWPQMLHTLTFLYNCTSHETTGFPPFYLMFGRVPRLPIDVIFKSVLRD FT DLNVALPKYVESLSRDLKDALAVAKTNASKEQSHQARVYNRRNKGVNIEVG FT DRVLLANKSERGKRKLSDKWDSVVYTVVDCDPKTHIYKIKHPVSDQVKVVH FT RNLLLCVNFLPVCDAVGTNDDGSVISVDDGVSDGCSVSSDDSMDNGDGLVV FT ADNVSQMNELCDISNSAEIVDVPSHNAGSENTSMDSMILQDGPETSNACDP FT DFVPNGNVDSTELPGRRSRFGRLLKPVNRLISSMSQQTLFSSRKQSLGKWS FT FSIVSLKH" XX SQ Sequence 7004 BP; 1831 A; 1386 C; 1807 G; 1980 T; 0 other; atggtgccgt gaagcctgtt cgtgatcgac gcgacgctga gtgcagtgtg ttatcactgt 60 ggcgtggggc tcgggttacc agtgtgctgg ttgagctgtt tataagctac acaacgcaga 120 tcatcgaggg agagacgaga agcgagtgtg acgggactgt cgcgggtttc cagcttcacg 180 tgaagatcga cgagcgatta acgaggagcg agtgttacgg gaccgtcgcg ggtttccagc 240 ttcacgtgaa aatcatcgag ggattgacga gaagcgagtg tgacgggacc gtcgcgggtt 300 tccagcttca cgtgaaaatc atcgagggag aggcgaggag cgagagtgac ggggccgtcg 360 cggcttccca gcttcacgtg aaaatcatcg agggagagtc gaggagcgag agtgacgggg 420 ccgtcgcggc tttccagctt cacgtgaaaa tcatcgaggg agaggcgagg agcgagagtg 480 acggggccgt cgcggctttc cagcttcacg tgaaaatcaa cgagggatta acgaggagag 540 tgtgacgtca caggctttcg ggttcgcgtg aaaaccatcg agacactggc ggagtgcatc 600 tgaaacaccg cggggttctg atggacgatt tgcctgctgt ttaaggacgt tgcgattgaa 660 aaaaaaaaaa agaaaacttt cttttctgac cagcctttga gcaacgggga tatcgacgga 720 ctgcagctaa gtgactgtta ttgactgaat ctttcctgat tagtttattt actttttgag 780 tttcatgata aaagtatttt caaatggctg agggtggtgc agatagctgg ttgttcgagg 840 ctgggcgggg aaaatacctg atgcaatatt ctaatgctaa attggggcag agtggaatga 900 aagttaagtc cgatgaccat tcagtgaaag agaatttaat ggcgcatgat gtcaaccaaa 960 tcttccctgt cacgtctact cctaagccag gtaaattggg aacatctagt gatcgtgatg 1020 tgggcgatct taatgtttta gttgaggaat tgggtcgaaa aattggtgaa tcagttactg 1080 ctaaacttct atcagagggt ggattcgctg ggagctgtaa caggaatagc acttcgagtt 1140 cacaatgtgc taatcttgat ttatcacagc tgaatgtcat cttgaaatca gatgaaaaag 1200 aaccgcctgt gtttaggggc gacaatctgg gtagatgtga tgtccaagaa tgggttgaca 1260 tgatgcagct ttacctacaa aaaagaaatg ttgacattcc taatcaagca gacgaaataa 1320 tcagccatct tatgggtagg gcaagagatg ttgtaaaagt aactttgcgc agtaacccta 1380 cgttagacta caaatcacag ccaaatttaa tatacagcat acttaagcaa catttcagta 1440 acgtgtcata ttctagtatg ccacttgctg atttttacgg cacgttacct agatacggcg 1500 agaaccctgt tgactactgg ctgcgactga ataaagcagc agacattgct gacgagtgtc 1560 tcagggctca gggtaggtgt atggacaatt tgagttctga agttgcaatg atgtttgtgc 1620 ggaattgtcc agacactgat taagcgcgta cttttaaagt gcggccatta gaaaaatgga 1680 ccgcaaagga tgttcaggag ctattagaca gtttccagcg tgacagcaag tcgagagcaa 1740 ttacaaacga atatgcgatc ggtcaaaacg cgcagatttc tttgcacagc tgcgaacatg 1800 ttaatgctgg tcctgagcct gtgctagcat gttttaagca gggacccagc gtacctaatg 1860 actcaaaacc ccatacaaaa gatgagtcta gtttgaatcg gatggctgat atgctgacac 1920 agctgattga cattttgaaa aagaaggaag agagcgtttc agagtatgtg cctgcaggta 1980 aaatgcgaga aatgactgca gttgggcgaa aaaactgtag aatttgtggt gacacagcgc 2040 acactacaat tgaccattgt aaaactcatc gtttgtgctt taggtgccat actgctggtc 2100 attccaaacg agactgtaca aagaaattcg ctactacacg cttttcttca gattcacctg 2160 gcgatcagga aaactgaaac acccgcattc ggagaggggg agtgtgggtg ttgcagaaaa 2220 ttccctcgag gtggatactg atttcgaagc tttgtttctc tcgtttaaga atctttgtcc 2280 cgactctcag aaagtcgtct ttcagaatac acaacgtata gcttgctcag atacactctt 2340 ctacacacct gttcttatta atgactcact cactgttagc gccatgttgg acaccggctc 2400 tatggcttgt actctgagtg attcggtagt cactaagcta agggaacatg gtgtcctgga 2460 tactgttacg gaaaaagaaa ctgatgtcat cataatagga tgtggtggca aacgagtgtt 2520 cccaaagagt gtagttgaac ttaaactgga tgtttatggc tgtgatgtta ttgtcccatg 2580 tcttgtcgtt ccaggccaag atgatgagct cattttgggg acgaatgtca ttaagtatct 2640 cgtccacaga ctgaagggtt ctgacagata ttgggaattg atttcaaccc ccagtggacc 2700 tagttcagag ggtgacgagt ttttgtcgat gctggctggt atacgtcgct ggaggggtga 2760 aacagttccg gatgttgtcg gtacagtaaa acttcatagt gcagtcacac tttcgcctgg 2820 tcaggaacat cttgtctggg ggaaactacc cgctaagtca tgtggccagg taggcagtac 2880 tgtgatggtg gaaccatcta gagtccattc tgcttctaaa aaagttttag ttggtcgtgt 2940 agtttctcct ctttggggcg acggttggat accactaaag ttgattaatc ctagcgacaa 3000 gccagtaacg ctgaggcgaa acactaaagt agccgacgtt tatccttgta tcgctttaga 3060 ggatgtgtgc gaagcggcgt ctgagtctac tgtgcggctt tgtgttcagc agtcagcagt 3120 tgagccagat agatccgatg cacctagcag cagacttcaa aagtgcggtt tggagtctgt 3180 tgacttggat gcatgtgaag tttcagattt ttggaaggcg aagctgtgtg atttagtgct 3240 tgaatacgaa tccatttttt ctagacattc tcttgactgt ggcgaagtta gggactttgt 3300 gcacagaatt cgcctagttg acgaaagacc atttcgttta ccttttaggc gagtcccccc 3360 ttcgcagtat cagaagttgc gcgttgctct taatgaaatg gaggagaggg gcataattcg 3420 aaagtccgtt agcgaatttg catcgccttt ggttttgtgc tggaaaaaga atggcgattt 3480 gcgcatttgc acggactttc gttggcttaa cgcgcgtact gtgaaagatg cgcacccttt 3540 gccccatcag gaggattgtc ttgcggcact cggtggcaat tcatttttta gtacgatgga 3600 cttaacttct ggattctata atgttccact gcatgaagat gacaaaaagt acacagcatt 3660 cacgacgccg gtagggttgc acgaatataa caggctaccg caaggcctct gtaactcacc 3720 tgcgagtttc atgagaatga tgactaagat ttttggggat caaaatttct tgtctctgtt 3780 gtgctacttg gatgatttaa tggtatttgc acccagtgaa gaggttgcac tacagaggct 3840 cgaaatggtt ttcggaatgt tgaaaaagca caatttgaaa ttgtctccaa aaaaatgtaa 3900 ttttttaaga cgctctgtaa aatttcttgg tcatttaatc acagaagagg gtgtgtcaac 3960 ggacccctag taaagttgaa gcaattgcta atattacttc tgcagattta atgtgccctg 4020 atggtgcaac cccgtcggtt aagagaattc agtcctttct tggtctcgtc atgtggtacc 4080 aaaggttcat tgagaattgt tcgtcccttg cacggccact ttttgctttg actagtggag 4140 ttaggaaggc cagaggagtg ggacgcagag gggtttctat aactagaaag cttactcccg 4200 atgattggtc tccagcatgt gaggcagctt tgtctgcatt gaaagaagcc ctcctgtcta 4260 atgtagtcct tgcacaccct gacttttcta gaccttttat tcttgccact gatgcatcaa 4320 ctgaagggtt gggtgctgtg ctttcgcaac tgagtccagg agaaaacagg gctcggccca 4380 ttgcgtttgt gagtagatca ctgtccagag cacaagccag atatccagca catcgtcttg 4440 agttcctcgc gcttaagtgg gctgtatgtg acaagtttag ccactggcta aaagggcatc 4500 ggttcacagc ttggacagat aataacccat tgacgcacat tttgaccaaa cctaggctcg 4560 acgcctgtga acaaaggtgg gtggctaaac tcgctgcctt tgagtttgac atcaaatatg 4620 ttccagggtc gaagaacaca atagctgaca ctttgagtag gcaacctttt gttgacagtc 4680 gggttagcca tcgcatagtc agggaacctt atgaaaagct tattgcagag tcaagaaatc 4740 atacttgcgc tactgtccag gatcttttca gattgtccaa tgcctgtcag cgagtggttg 4800 gcagtgcgga ggtctcggcg gatgcttgtg atcggatggg ggttgctccg gacgtggatg 4860 ggccgggtag tttttcgaat gctgaggtgg ctgctgtttt ctccaatcat ctttcttggg 4920 ataatgcgac gagaatgcga actattcctt tggttggatt tgtccaacaa atcacagacg 4980 ttggatttga tgctcttccc tcatactctc accaagaact aagagacagt caactccagg 5040 atgaacattt gcagagggtg atgttctatg ttgaccagaa acgtaggcct tccaggcgtg 5100 tgcgtatggg agagcctagt gctgtacgga agcttttaaa gcagtgggac aaactatcct 5160 tggctgacgg tgtcctgtat cgtgtttcaa aagatccttt gacacaccat aaacgtttcc 5220 aatttgtagt gcctccgtca ctcaaagatg aagtgttgag gggttgtcac aatgaggctg 5280 gacaccaagg gcagtatcgt tctcttcacc tttgccgtca gcgtttttat tggacaggta 5340 ttgagcagga tgtacgtgag tatgtgaaag tgtgtagtcg ttgtgtgctt gcgaagacac 5400 cggagccaga gggacgggct ttactggaaa gtgttaagac ctctgctcca cttgaattgg 5460 tctgcattga cttttggacg gcagaggatt cacgaaataa atcagttgat gtattagtgg 5520 tcaccgacca ttttactaag cttgcccatg ccttctgttg ttcaaaccaa tctgctaggt 5580 cggtggcaaa gaagctctgg gataactatt tctgctacta tggtttccca gagcgtgttc 5640 attctgatca gggtgccaat ttctgtagcg agctaatcag acacttgctt gagttctcag 5700 gggttaagaa gtctcaaact accccatatc acccaatggg gaatggcacc cctgagcgat 5760 tcaaccgaac attgggaaat atgatcagaa ctcttgagcc tcgaccaaag catgactggc 5820 ctcagatgct tcacacattg acttttctgt acaattgtac ctcacatgag actaccggat 5880 ttcctccctt ctatctaatg ttcggccgtg tcccaagact accgattgat gtcatcttta 5940 agagtgtgtt gagggatgat ctgaatgttg ctcttcccaa gtacgttgag tctctgagta 6000 gggacttgaa ggatgctctc gctgttgcaa agactaatgc ttctaaagaa cagtctcacc 6060 aagcccgcgt atacaatcgc agaaataagg gtgtgaatat tgaagttggt gatagggttc 6120 ttttagccaa caagagcgag cgtggcaaac gtaaactctc agacaagtgg gacagtgtag 6180 tgtatactgt ggttgattgc gatcctaaaa ctcacattta caaaataaaa caccctgtgt 6240 ctgatcaagt caaagttgtt cacagaaacc tgcttttgtg tgtcaatttt ctccctgtat 6300 gtgacgctgt gggaacaaat gatgatggga gtgtgatttc tgtagatgat ggcgtttctg 6360 atggatgctc agtaagttct gatgattcaa tggacaatgg tgatggtctt gtcgtagctg 6420 acaatgtttc tcagatgaat gagttgtgtg acatttccaa ctcagctgaa attgttgatg 6480 tgccgtcaca taatgcgggt tctgagaata cttcaatgga ttctatgatt cttcaggacg 6540 gacctgaaac atctaatgcc tgcgatcctg attttgttcc taatggtaat gtggatagca 6600 ctgagttacc gggtcgcaga tcaaggtttg gcagattgtt gaagccagta aatcgcctta 6660 tttcttccat gagtcaacag accttgttta gttcaagaaa acaaagtctg ggaaaatggt 6720 ccttttcaat tgttagtttg aaacattgat atgtcagaat ttttttttgt atctgtctct 6780 gtagaaaaca agtgaaatga attgcaagtg tgatgtgtta aaggcacgtc atttgggtca 6840 tgatcaggat aagtgatatc cgcagcactt tttcctatca gttcttaatt tgaacagata 6900 aaacggatca aaagctttgt gctaagaatg gcaattcttg gttaatttct gcatttttgt 6960 gttttctaag gttgtcgcat tttgtcacat tcagtggggg agaa 7004 // ID LTR1_DR repbase; DNA; ZEB; 848 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE Long terminal repeat from some endogenous retrovirus - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; endogenous retrovirus; KW LTR1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-848 RA Kapitonov V.V. and Jurka J.; RT "LTR1_DR, a family of zebrafish long terminal repeats."; RL Repbase Reports 2(4), 21-21 (2002). XX DR [1] (Consensus) XX CC LTR1_DR is a long terminal repeat of unidentified endogenous CC retrovirus. The consensus sequence was reconstructed from CC 20 copies, which are ~88% identical to it. CC LTR1_DR copies are flanked by 5-bp target site duplications. CC There are multiple subfamilies of this LTR present in the genome; CC it harbors several thousand copies of LTR1_DR-like elements. XX SQ Sequence 848 BP; 104 A; 202 C; 207 G; 333 T; 2 other; tgtcagggtt ctgccactct ggtcttgtaa attcttgttt tggtggcaga gcccggacac 60 tagctctgtc tggtcctgtt tctgtctctg tgtgcgtgcg cgccgtcgtg ggtgtacgca 120 gagtgtgcgc gctcccgctt gatgcggccg cgcgcgctct gcgtgcctca gacgcgcgct 180 cttgtactcg tgtttgtgtt ttgttctgtc agcagcgcgc tgtttcattc tcagcgtctc 240 agtctagttg gtttcggttt tggttggcgc tgagatgaag catgcgcgtt gcgtatgtgt 300 gagcgcacgg taaggtgttc atctatcgtg tgctcgtgtc ttgcgtcttt tgtcaaagca 360 cgtggctcgg tgtttacatt gtggtcacgt gcttttgtcg tgtgcttcag tgttgtgctt 420 gtgttgtcat gaacatgcgg ttaatgagtt ctctcattgg ctgcatgttc ttgtcctgtt 480 ttatgtgagc gcatggcttg tgttgtctct ctgtgtcatg cgctctcccg tctattgtct 540 agtcccaccc tccttgttaa cccattatta gttwattatg ttcacctgtt tgtcaattta 600 ctttttgctt tataatcccc tcatgtttgc tgtcctgtgc cagttcgtcg tcaatgttgc 660 cctgtcytgt tgtttccagt cctgttttgt cctgccagcc cagtcaagtt tgtttgtgtt 720 ttgttttatt aatgttttcc ccctcggggt agtttgtttt gccttttatt tttattttta 780 ttattaataa atacccacat ttttctgcac ttgagtcctc gctccttttt ccctcaccga 840 ccgtgaca 848 // ID Gypsy52-LTR_DR repbase; DNA; ZEB; 1473 BP. XX AC chr17; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy52-I_DR; KW Gypsy52-LTR_DR; Gypsy52_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1473 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1420-1420 (2008). XX DR Genome; chr17; Positions 23230336 23231808. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1473 BP; 398 A; 261 C; 318 G; 496 T; 0 other; tgtaggggat ctacaaatcc gaatatgaga gaacttttat tttgaaattt gacatgggcg 60 ccgccattac tttctccggt atgtgtgtgg gtgacgatcg cattcaggca gagagaggga 120 ggagagaaag gcgagttgtg acagcagtgg tgagctaatg aagattgtgc agttttaaaa 180 ctttttgatc aacttaagaa gttaactgat gacatgaaac aaagctagat tacattgctt 240 caaagtctcg ttgcttttaa gtctccgtat agtttaatga gccagttatt aacagtgacg 300 cagcaagttt tttttgccgc gctcgtgagt gacgcgggat aataggcctg ctgttaaagt 360 cagtcggatt attcagtgct gaagttgaat actcccgtgt ttttttgtaa tttacatcca 420 gatttttgtt agttcaaacg aactcaaaac gtagtggatg ctgtgttatt tttgcgagga 480 tttttgcgtt ggaaagcatc gtgtgttcat tgtttgctct gattgatcgc cgttggacgg 540 gattgtggcc ttgttggatc gtgctgtgcg ttcaggatca cgagggcaat cgttggggat 600 caaagagttg aaggaacatt accctgcctt tccagactcc tttcagcgct ccgttctgga 660 cccaagagtg gtgatgtaca atattgagtt cacttttccc tgaatattgg taagacatct 720 tttcttttca ccaagtatcg aaacagaact gtgttttaaa ggaactctca ttcgaaaaac 780 tggaaattgt gcgtttttcc acaaagagac ttatttgttt accggtctat ggacattttc 840 actcgtgaat ggactgagta tatttttcct gaattttgaa agtgaactct aacttgattt 900 gaactcattt gattaatact gattgtgaat gtgaatcttg tcattgttta aggtgtgact 960 gaaacactaa cgttagcaca ttgattctat ttcatttatt attgtctata tttcatttca 1020 ttttgagctg attgagttta aaacaattta actttttatg tgaagaaaca taagaaaaga 1080 aaactcattc aattagtctc ataagactaa tactttttgt ttaaataaat atttgtttgt 1140 ttagttacat tgagttaact gaagtccttt atactaagga gtggggccgt ctcatctatc 1200 ctgttagatc tgggagggtg ggtttttctg agtttttaca cacttacata cgcacttcag 1260 gctaccaccg tgataaagat agatcctgaa gcccacctca ctttatctgt agctccatat 1320 agcaacacga tagacactca catataacca ctagttatac tgagacctaa agcattgggt 1380 ggggggacta agatcctgtt gctcactcag tcttccttag tgtaaagtat attatttgcc 1440 ccacaaggcg gtggcatatt cccctccgcc aca 1473 // ID hAT-N18_DR repbase; DNA; ZEB; 1519 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 07-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE hAT-N18_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW CHARLIE3; DNA2-2_DR; nonautonomous DNA transposon; KW hAT superfamily; hAT-N18_DR; horizontal transfer. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1519 RA Kapitonov V.V. and Jurka J.; RT "hAT-N18_DR, a family of nonautonomous DNA transposons from RT zebrafish."; RL Repbase Reports 4(11), 306-306 (2004). XX DR [1] (Consensus) XX CC hAT-N18_DR is a family of nonautonomous DNA transposons that CC belongs to the hAT superfamily. This transposable element is CC characterized by 8-bp target site duplications and 15-bp CC terminal inverted repeats . hAT-N18_DR is a composite CC transposon: its internal portion (pos. 607-1219) is a copy of CC the DNA2-2_DR transposon (pos. 1-615). The 70-bp termini of CC hAT-N18 and hAT-N19 are 90% identical to each other. The CC primate CHARLIE3 hAT transposon is 78% identical to the CC hAT-N18 consensus sequence, excluding the DNA2-2_DR copy. Such CC high identity indicates the horizontal transfer. XX SQ Sequence 1519 BP; 449 A; 337 C; 335 G; 386 T; 12 other; cagtggtccc caacccccgg gccgcggact ggtaccggtc cgtggatcaa tcagtaccgg 60 gccgcacaag aattaattat ctttaagttg gcagataaag tagctgcatt taaagccaaa 120 ctggatttgt ggggacggyg cgtgaacaga ggcatattgg acatgtttca aacattagcg 180 gggattttgt gtgagactga gcctgcgccc tcactcacca agctggtgca cgatcaccag 240 tcttcgctgt taaaggagtt tgagcgctac ttcccaacct caaaggaccc acaaacggcc 300 aaggaatgga taygtgaccc atttgtcaac aaaccagctg aatccagcat gtccgtgcaa 360 gaagaagacc aactgctgga gatcgcaaat gacggcggcc ttaaaacagt gttcgagaca 420 acaactctgc cggtgttctg gattaaagtc atggcagaay acccagagat cgctaacaca 480 gcgctgaaaa ccctgttgcc atttycaaca tcctatctgt gtgaagcggg gttttctgca 540 gtgacagcca ccaaaacaag gcaaaggagt agactggaca caagcaacac acttcgggtg 600 tcaatagccc ctttcacaca gtgataccrg taaatatatg gaaaatttcc ggaatgactt 660 naccggtaaa ttcaaaaaag agctgttcac acaggcgagg acgttacgga atttttccgg 720 aaaagagcat tcacacatcc attccaaaat accggtaatt ctgacatcat taaccagaaa 780 tgagctctaa acggctgcac ttgtatttgt aaacatctga ctacatnaca aactctgtgg 840 atggatcagt attgtgaaca acttcgatga aaacatayag agaaacactt tcgcatgtcg 900 agatgtacat gayatgtgtg tgtgctggcg ctnacgggct gtttcacggg cacacgcaaa 960 gcttgaaggt aaacaaacaa cggcttatca taagcatctt atcgataatt atttacacgg 1020 ttggcattaa gatgaacata taaacgttat ctgcctaatt tctagcagct aaatgtgtct 1080 ggaaaaatat tcaaaggctt ttattctcat aaaccgcgcg gacgtgaatg cgtctgactg 1140 ttctgattgg ctaaagcaga cgtctcacgt cagcacgttc tagacrtgca cgcgctcttt 1200 ccggcaatct tccttctgcc attaccccca gatgggacca tctcrttgca aagaaacaag 1260 ctcagggctc ccattgattt agcattacgg acagtttttg atgttataat atgttatatt 1320 gttggagcaa tgtgttaata aagttattac ttcctaatgt tataatagca ttgtaatgtt 1380 ggagcaatgt gctaataaag ttccatatga gtacacagtg gattcacgtt tattattatt 1440 attatttcaa ccccccggtc cgcagtaaaa ttgtcaagcg ttgaccggtc cgcagttaga 1500 aaaaggttgg ggaccactg 1519 // ID CR1-18_DR repbase; DNA; ZEB; 3428 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-18_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-18_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3428 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1692-1692 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(674..1027,1031..1336,1267..2310,2145..3200) FT /product="CR1-18_DR_1p" FT /translation="SSFIILQVMMQVFLIELKRLLSLVDDRCECMMFGDFN FT INWMDKNSKSQLKSNMEKLNFTQLIDRPTRITRKSETLIDLIFTNRPERVT FT KTDNLITGLSDHNLTLIVRKLTRKRLLHHNMENQMLITGFAKSKIAHFEYD FT LKEVNWGKVFKELDINQSVGHFATILSKLTSKYTKTWKSRYKKFSLPWFNS FT DILQLIKRRDLALKNLYYPKLILIILFTLGKARSCSQKSLLSKTNTDHLVY FT TGLRNKVVSELRKAKMKYFSKLIEEAHGSSSKLWQHINRLSNSNRQKHSKI FT KHLKVTNNMICSDEAMANAFNSFFIESVQELSSNFNFTKSYPEEPDYVYPD FT SFYIKEVSCEKVQKVIENMSNSTAKDIFNFHSELIKRYKHFLLEPITYLVN FT LSIQTKTFPDSWKTAIVIPIYKSGDKDVACNYRPIAILPVVSKVLEKIVAE FT QLMEHLHSKQLLYPQQYGFREKYSTETANCYLLEKIKGSLDKGSVVGAVFL FT DLKKAFDTVNHSILLNKLREFKMSSEALQWFGSYLGGRQQCVRVNGVKSSV FT RSTILEYLKGLSLAHCCSQCTIKRVQNVIRSLTVVWVLFGWETAMCEGQWG FT EVICAFNNIGVPQGSVLGPLLFTMYINDLPECCPGVNCQLYADDTVIYVSA FT KTPCRAGEQLTQALQHISRWLELSHLTLNVKKTVSMCFSIRNRPVQDTFEV FT LIKNKVIKEVNEFKYLGITLDINLKFDSHVKNLSKKVKPSLNCFRFIRRFL FT TYHTAKLYMHTMIFSHLTYCITSWSQASLTTLKPIDSIYKQAVKIMDLKPM FT RWHHCDVLCKHNLFNFDSFIQFSYLKLVFKCLNNHVPILFSKLINRHQTSH FT RVTRATANGDCLVPKCKTSFGQSVFSVKGAKLWNELPVSLKSESNLNIFSS FT ALKKWLKTKQQCLHK*" XX SQ Sequence 3428 BP; 1143 A; 514 C; 624 G; 1147 T; 0 other; gatcagatct tcatgctgtc atgccaaaag gctagtcatc cgaagctaat catgtcatat 60 cataatggta atgcagactc tgggttttca gcccatgctt gtaacaggtt ttctgatctt 120 cttgtcagat cagaaagttt attgtcttag catctgacaa ttcttgtggt ctcatctcgt 180 agaatttcca caacagtact catggtttga tattatgcag agacttagcc atccttacag 240 agttaaagat gaaagtgttt gtcaatcact gtttttatct tgtgtttgca tgttctcttg 300 agactcaaag tgacatttca tgttgcagca gtgcagtaca gactactaca cttttgcagt 360 ctgctgttaa caaatgtgat aataattaat aatatgtatt tgactgtgtg ttagtgttta 420 tttcttaaaa aatatgtaat tcattctttt gttattgagt ctattgctct gtttgtggtt 480 gtaaagaaat acatgttact cagcaatgat gcattaaaat aagcagacag aatttacagt 540 gtaattttga tcaaatgatg tatattaatt tatataagtg aaaatttgaa atgtttgcag 600 attgaaactc atgtgaatat ggagtgttta gcactaaatg taattctctc tccaaatatg 660 aaattttaca taatcgtcct ttataatcct ccaggtcatg atgcaagttt ttttaataga 720 attgaagaga cttttgtcgc ttgttgatga tcgttgtgaa tgtatgatgt tcggtgattt 780 taatattaat tggatggata aaaacagtaa atcacagttg aaatcaaata tggagaaatt 840 aaattttaca caattaattg atagacctac ccgaatcact agaaagagcg aaactcttat 900 agatctgatt ttcactaata gaccggagag agtaacaaaa acagataatt taatcactgg 960 tctttcagat cataatttaa cattaattgt aagaaagtta actcgaaagc ggttactgca 1020 tcataattag atggaaaacc aaatgttaat tacaggattt gcgaaatcaa aaatagctca 1080 ttttgaatat gatttaaagg aagttaattg gggaaaggtt tttaaagaat tagatataaa 1140 tcagagtgtg ggtcattttg caactatttt aagtaaatta acaagtaaat atacaaaaac 1200 ctggaagagt aggtacaaga aattttcact tccttggttt aatagtgata ttcttcagtt 1260 aattaaaagg cgagatcttg ctctcaaaaa tctttactat ccaaaactaa tactgatcat 1320 cttgtttaca ctgggttgag gaacaaagtg gtatctgaat tgcgaaaggc taagatgaaa 1380 tatttttcta aattaattga ggaggcccat ggcagcagtt ctaaattgtg gcaacatatc 1440 aatagactat ctaattcaaa caggcaaaaa cattctaaaa taaagcatct taaagttact 1500 aataatatga tttgcagtga tgaggctatg gcaaatgctt ttaatagttt ttttattgag 1560 tctgttcaag aactgtcaag taattttaac ttcactaaat cataccctga ggaaccagat 1620 tatgtttacc cagactcctt ttatattaag gaagtatctt gtgaaaaagt ccaaaaagta 1680 atagaaaata tgagtaattc tacggctaag gatattttta attttcactc tgaattaatt 1740 aagagatata aacattttct gttggagcca attacatatt tagtaaactt atcgattcaa 1800 acaaaaactt tcccagatag ttggaagaca gcaatagtta taccgattta taaatctggc 1860 gataaggatg ttgcttgtaa ctataggcct attgccattc tacctgtagt gtcaaaagtg 1920 ctagagaaaa ttgttgcaga acaattaatg gagcatttac actctaaaca gttgctttat 1980 ccacagcagt atggttttag agagaaatat tccacagaaa cagcaaattg ttatctgctt 2040 gaaaaaataa aagggtcact ggataaaggt agtgtggttg gggcagtgtt cctggaccta 2100 aagaaagcct ttgacacggt taatcacagt atcttactaa ataaattaag agagttcaaa 2160 atgtcatcag aagccttaca gtggtttggg tcttatttgg gtgggagaca gcaatgtgtg 2220 agggtcaatg gggtgaagtc atctgtgcgt tcaacaatat tggagtacct caagggtctg 2280 tccttggccc attgttgttc acaatgtaca taaacgatct tccagaatgc tgcccagggg 2340 ttaactgcca gttatatgca gatgacacgg tcatctatgt gtcagctaaa acaccttgcc 2400 gggcaggtga gcaactgacc caggctttac aacacatttc aagatggctt gaactgtctc 2460 atttaacgtt aaatgtcaag aaaacagttt ctatgtgttt ttctatacga aataggcctg 2520 tacaggatac atttgaggtg ctgataaaaa ataaagtaat taaagaagtg aatgagttta 2580 agtatcttgg cataactcta gacattaatt taaaatttga tagtcatgtt aaaaatttaa 2640 gtaagaaagt taagccaagt ctaaactgtt ttcgttttat tagaaggttt ttaacatatc 2700 acactgcaaa attatacatg cacacaatga tattctctca cttgacgtat tgcattacat 2760 cttggtcaca agcctcttta actacattga agcctattga ctcaatctat aaacaagcag 2820 ttaaaattat ggacttaaag cctatgcgat ggcatcattg tgatgtttta tgtaaacata 2880 acctgtttaa ttttgatagt tttattcaat ttagttatct taaattggtt tttaaatgtt 2940 taaataatca tgttccaatc ctgttttcta agctaatcaa tcggcatcaa acctcccaca 3000 gagttacacg ggcaactgct aacggtgact gtctagtgcc aaagtgtaag acctcgtttg 3060 gtcaatctgt cttctctgta aagggagcca aactgtggaa tgagttaccg gttagcttaa 3120 aatcagaaag taatctaaat atttttagta gtgctcttaa aaagtggtta aaaacaaaac 3180 agcagtgttt gcacaaatag tctactgtta gttggttgga tcatgtcgcc tttgcttttg 3240 cctgcagtaa ttttatgtaa agaattgtaa tgttgtgtat gttttattat atttctattt 3300 tgtcaatttt aagcctcctg tggacaggtg ttgaaaatta gcaagtttgc taaaacactt 3360 aaacaaatgc atttggttgt ccagtgtaat tgtgatgtcc atgtcaaata aataaataaa 3420 taaataaa 3428 // ID HARBINGERN2_DR repbase; DNA; ZEB; 674 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE HARBINGERN2_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW 16-bp TIRs; 3-bp TSD; Harbinger superfamily; HARBINGERN2_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-674 RA Kapitonov V.V.; RT "HARBINGERN2_DR, a nonautonomous DNA transposon from Danio RT rerio."; RL Repbase Reports 2(4), 13-13 (2002). XX DR [1] (Consensus) XX CC HARBINGERN2_DR copies are flanked by 3-bp target site CC duplications CC (usually TAA) generated upon their integration in the genome. CC It has perfect 16-bp terminal inverted repeats similar to other CC HARBINGERs. There are approximately 1000 copies of HARBINGERN2_DR CC harbored by the zebrafish genome. The consensus sequence was CC reconstructed based on 11 copies which are ~5% divergent from CC the consensus sequence. XX SQ Sequence 674 BP; 162 A; 165 C; 185 G; 162 T; 0 other; gggtgctttc acacctgtga atcgattcag ttgttccgaa acagagatta caattgttac 60 attgttgctc tttgctcttg gagcggttcg ctttcacact gcaaagtttc taatcggacc 120 aaaagagcta aaacaagtca cgtgcgagta aactctcctc acattggtca gagtgtcagg 180 gtttattttg cagcgtcccg ctcagctgtc aggagaggtg gtggtttggt ggtgattgac 240 agggtgcgcg cgcgcgacgt gtctgaggag agatgcggtg gggaggggtg agaagggtgc 300 gcgacgatgc ctatttgagg accgggaggg agacgcgaga ttaccgggag atcatcactc 360 gtttgcgggc atccggagac tcgcgaaact tcccgcccta ctcataattc tctcttcata 420 tagccgtatg cctattacat atccataaaa cactgtgata taaccgcgct cggatcggat 480 cgctttctca ctgcaatcga accgctccag ggttcgtttc aatcgagccg agaccacctc 540 attcaagcga tctcggagcg attactttgg cgcggaacag agcgcgattg ccctgttcac 600 atatgccaaa cgaaccgcgc taactgggca aacgagacac gttccgaaac aaaagtgtag 660 gtgtgaaagc accc 674 // ID DNA-2-19_DR repbase; DNA; ZEB; 1837 BP. XX AC . XX DT 18-NOV-2008 (Rel. 13.11, Created) DT 18-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-19_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1837 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1710-1710 (2008). XX DR [1] (Consensus) XX CC TSD is 2-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1837 BP; 655 A; 202 C; 236 G; 744 T; 0 other; cccaggtgaa aaaaaatata taattaaatg caattaaaat acacttaaat acccgcaaat 60 acatttttag tacattgtta tttttttgtg ttaaataaaa gtttgatggt tttacactat 120 aaatatacta attaaaagta tgtttatact tcagtaggac tttagtacac ttgacaaaag 180 tatactaaat accagtaata cttaaataaa tttttagtgt actacaataa agtacactac 240 agaaaaaaca tccaaaagag ttttattagt gtgtttttca aaagtgtgct ttaaatgctt 300 taaacattag ttgattttta gtacactgtt tacatactaa tcctaagttt aaaataagtt 360 aatatataaa aaatctatta gtaatgtatt gtagtagaag tacattttcc caaaagttgt 420 acttaagtac acttaatgtg aatgcaatat tatcactaac acttaaattg attttgagtg 480 tggtaatttt aagtttacat taaattgact ttattaaaaa tctattagta atgtattgta 540 gtagaagtac attttcccaa aatgtcttta cttttaatac atgtgctgta atgttgcatc 600 ttatttatat ttttgtgttc tcagtttttg ataaattttt gtttccttcc ataaggtatt 660 tttggacttg tggtttattg cacaaaaaga ttaaagtgtt ttgatgaagc acagatgctg 720 ttttctttgt acacacagta tccatttaaa atatattctt ttaataaatg tttattatgt 780 ttatattact attataccta tgtttattat gtttaaatta ttattatgtc ataaatatat 840 ttaaatgttt aaaagtaggg ttcaagtgta tttctagttg taactaaata tatattttta 900 aaatacagat atagtatgtt aaaagcacat tttagttcaa tttcatggtg tctcaaaata 960 gcacagttga gtacacttag atggtattaa gtttatctta acatatactt aatactatct 1020 tttagtacat taagtacaaa attagtgtgt gaaaatagag cacttttagt acattactga 1080 aaaagtatac ttagtgtact taaagtgtat tttagcacac tgttttcaca ttttgtatat 1140 tcttttaata tattactatt atactttaag ctattcttaa atatatttat aaatgtttaa 1200 aattagggtt caagtgtatt tctagttgta actaaatata tattttttaa aatacagata 1260 tagtatgtta aaagcacatt ttagttcaat ttcagggtgt ctcaaaatag cacagttgag 1320 tacacttaga tggtattaag tttatcttaa catatactta atactatctt ttagtacatt 1380 aagtacaaaa ttagtgtgcg aaaatagagc acttttagta cattactgaa aagtgtacta 1440 agtatactta aataaagtgc attttagcac actgctttca cattttgtat attcttttaa 1500 tatattacta ttatgcttta agctattctt aaatatattt ataaatgttt aaaattaggg 1560 ttcaagtgta tttctagttg taactaaata tatattttta aaatacagat atagtatgtt 1620 aaaagcacat tttagttcaa tttcagggtg tctcaaaata gcacagttga gtacacttag 1680 atggtattaa gtttatctta acatatactt aatactatct tttagtacat taagtacaaa 1740 attagtgtgc gaaaatagag cacttttagt acattactga aaagtgtact aagtatactt 1800 aaataagtgt attttagtgc actttttttc acctggg 1837 // ID Gypsy98-I_Dr repbase; DNA; ZEB; 5504 BP. XX AC . XX DT 19-APR-2009 (Rel. 14.05, Created) DT 19-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy98_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; endogenous retrovirus; Gypsy superfamily; KW gag; integrase; reverse transcriptase; Gypsy98-I_DR; KW Gypsy-98-LTR_DR; Gypsy98_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5504 RA Dib M.R. and Naveira F.H.; RT "Gypsy98_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 960-960 (2009). XX DR [1] (Consensus) XX CC Gypsy98-I_DR is an internal portion of the Gypsy98_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy98-LTR_DR. CC Gypsy98_DR is characterized by 4-bp target site duplications. The CC internal portion encodes two proteins: the 355-aa gag CC Gypsy98_DR1p (pos.199-1263) and 1402-aa polyprotein CC (pos.1218-5423, conceptual translation)composed of the protease, CC reverse transcriptase, and integrase domains.Some insertions CC fairly recent, CC according to the hight identity between their flanking LTRs. CC Consensus obtained after the alignment of at least five CC independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 199..1263 FT /product="Gypsy98-I_Dr_1p" FT /note="Gag-protein." FT /translation="MDEELRELRELVAQLRADNEKLRHEQVPVEQPGPSNI FT SIPAVVDPLPAVPSSSAAERFVFVPRDRNCPKFNGRSGIMIEEWVEEAQTC FT MRLRHLSTSDQAFFLFDHLEGEARDEIRYRPRDEREDPNKIFQALRELYGC FT TSSYVALQESFFSRKQQDGETLLEFSLVLLNLLEKVKRQSPHGMPNADILL FT RDQFVEHVRDGALRRELKQFVRRQPNARLLDVRSEAMRWEREGMPGGTRGR FT SQSVPSAYGIQYGVQGYQPPGGSNLASSELRELREMLKKQQQQLNQLTQNV FT AQIQNLQPRLQLSRSNQVICRRCQQPGHFARECDGERRPPRFAPARADSAS FT TRGVQSGQPSEN" FT CDS 1218..5423 FT /product="Gypsy98-I_Dr_2p" FT /note="polyprotein." FT /translation="FRFNKGCAVWATVGKLIPAELQGHSLVGRRTGSNVSA FT EVDFMSRFMASCPQLIVYMSGVKVQCLVDTGSMVSTITESCFVSNFGAWGQ FT EQLRSCHWLQLRAANGLSIPYIGYLELDVELCGQVVTGCGVLVVRDPPGSM FT CAQVPGVLGMNILGRCYRELFGRHGAQLFDLPAVSQAPTSVFQALQHCHLI FT EVQPDSDQGSRVKVRGRQVCRIPGGTIKLVAVTCSTQYSRGTVLFEPPESG FT LPAGLLASPALVQVDHGTAYVPVVNVSTADVVVYAGTVVGTVNQVYVVSLP FT PEISEVPVVSARVSIQVAQAVQSVQEQIGEIDLSVLSAEEQGKVRELLEKY FT ASVFSIHEGDLGCTSLISHEIPLTDDVPVRQRYRRIPPSEYEVVKAHINQL FT LEAQVIRESCSPYASPIVLVKKKDGSLRMCVDYRRLNAKTRKDAFPLPRIE FT ETLDSLAGACWFSTMDLASGYNQVPVGEADKPKTAFCTPFGLFEWNRMPFG FT LCNAPSTFQRLMERLFGDQQCQSLLLYLDDIVVFSSSVDQHLARMEVVLSR FT LLREGLKAKLSKCAFFKRKVQYLGHVISSEGVSTDPGKIEAVSQWPQPNSV FT SGLRSFLGFASYYRRFVEGFARLAAPLHRLVAELAAPKRSKHGPKEFEQAW FT SSECQSSFEELRCRLTTAPVLAYADFSLPFILEVDASHRGLGAVLSQEQQG FT KVRPIAYGSRGLRPTERNTSNYSSMKLEFLALKWAMTEKFREYLLGHKCVV FT YTDNNPLSYLTSAKLGAMEQRWAAQLAVFDFEVKYRSGKINRNADALSRQD FT SSVTEVQDLLPGLVVPAALCQASHRQLVTQVNQIVSFPCSSENDISVQQQA FT DSVIGELLVFWRRKLPPSAEERKKLSRSAVILLKQWDRLREIEGVLYRRVF FT RPDGGEEVLQVLLPLAMKSEVMTQLHQQHGHQGVERTTQLVWQRCYWPGMS FT GDIARWCQECERCQCAKGVQPIPVSFMGRLLAARPNEIVALDFTVLEPSHS FT GIENVLVITDIFSKYTLAVPTRDQRAETVAQVLVAEWFCKFGVPGRIHSDQ FT GRNFESVLIQQLCHLYGVEKSRTTPYHPAGNGQCERFNRTLHDLLRTLPLS FT RKGDWPSCLPQVLFAYNTTPHQSTGESPYFLMFGQEPRLPIDFLLGRIEEP FT LAGSVHRWVLEHQNRLQVAFEGARGRLEAAADRRKARHDLQVHNLPLREGQ FT KVYLRDCSVRGRHKIHDLWSSVVYQILKAPKEGGVVYTIAPTTDLSKIKHV FT HRSLLKPQVGRDPLPRVPDVPLVERVLPREMDSDEGDLFVLVPETPQVSSG FT QASQGSPPSVLQDAGGSEGEGGNTEMAAELPPVMNPPLGPVVRGESAVRRT FT GRSTAGQHSNVHHLPRSLGVGTVSSIVGSAAEVWFRPWS" XX SQ Sequence 5504 BP; 1267 A; 1172 C; 1551 G; 1514 T; 0 other; cattctggcg tagtcggcag gatcctcgta tggtatagtg catagacgtg agtattggta 60 gtgtggggct tgtttttttt tttttttttt tttttttttt tcctgcagga gcagatagcg 120 gcagcggcat attggtgttt cagtgtggag agtagatttc aagcccatag tggtgagtga 180 ttttgtagtt ttgtaactat ggatgaagaa ttacgtgagt tgcgagaatt ggtagcccaa 240 ttaagggcag acaatgaaaa attgcgacac gaacaagtgc ccgttgaaca acctggtccc 300 tctaatattt ccattccggc ggtggttgat cctcttcctg ccgttcctag ttcctcagca 360 gcggaacgat ttgtttttgt gccacgggat cgaaattgcc caaaatttaa tggtagatct 420 ggcattatga tcgaagagtg ggtggaggag gctcaaactt gcatgagact ccgccatctc 480 tctacttctg atcaggcatt ttttctgttt gatcatttgg agggagaggc gagggatgaa 540 attcgatatc ggccaaggga tgagagggag gatccaaaca aaatttttca ggctttgcga 600 gaattatatg gttgtactag ttcatatgta gctcttcaag agtccttttt ttccaggaaa 660 caacaggatg gggagaccct gttagagttt tccctagtcc tgttaaatct cctggaaaaa 720 gtaaagcgcc agtcgcctca tgggatgcca aatgcagata ttttgttgcg cgaccaattt 780 gttgaacacg ttcgtgatgg tgctcttcga cgtgaactca aacagtttgt ccgtcgccag 840 cctaacgcta ggttattgga cgttcgtagt gaagcaatgc gctgggaacg ggagggtatg 900 cctgggggaa cacgtggtcg tagccaatct gtaccatctg cttatggcat ccaatatggg 960 gtacagggat atcagcctcc gggaggtagt aatttagcaa gttctgaatt gcgcgaattg 1020 agagaaatgc tgaaaaaaca acaacagcaa ttgaatcagt tgactcaaaa tgtggctcaa 1080 attcaaaacc tccaaccacg tcttcagtta tcccggtcta atcaggttat ttgtcgtcgg 1140 tgtcagcagc ctggtcattt tgcccgggaa tgtgatgggg aacggagacc tcctcgtttc 1200 gcgccagcac gtgctgattc cgcttcaaca aggggtgtgc agtctgggca accgtcggaa 1260 aactaatacc cgctgagctg cagggccaca gcttggtcgg gagaagaaca ggctcaaatg 1320 tttcagctga ggtagatttt atgtctcgct ttatggcatc gtgtcctcaa cttattgtat 1380 atatgagcgg ggtcaaggta cagtgtttgg tggacactgg gtctatggtg tccaccatta 1440 ctgagagctg ttttgtgtca aattttgggg catggggtca ggaacaactt aggtcatgcc 1500 attggttgca gcttcgggcc gcgaatggcc tttcgattcc atatattggc tacctggagc 1560 tagatgtgga gttgtgtgga caggtggtta caggttgtgg cgtgctggtg gtcagagacc 1620 ctccgggtag tatgtgtgct caggttcctg gggttctggg aatgaatatc ctgggccgct 1680 gctatcggga gctctttggc cggcatggcg ctcagctttt tgatctacct gctgtgtcac 1740 aggctcccac ttctgtattt caggctttac aacactgcca cctcattgaa gttcagccag 1800 attcggatca gggaagtaga gttaaagtac gaggacgtca ggtgtgccgc atccccggag 1860 gcaccatcaa attggttgcc gtaacctgtt caacccaata ctccaggggc actgttttgt 1920 ttgagccacc agaatctggt cttccagcag gtttgttggc atcccctgca ctggtccagg 1980 tagaccatgg aactgcttac gtgcctgtgg tgaatgttag caccgcagat gtagtggtgt 2040 atgccggcac agttgtgggg actgttaacc aggtttacgt tgttagcttg cccccagaga 2100 tctctgaagt tccagtagtt tcagctaggg tgagtataca ggttgcacag gctgtccagt 2160 ctgtacaaga acaaattggg gaaattgatt tgtctgtact ttcagcggaa gaacaaggga 2220 aggtgaggga actattggag aagtatgcat ccgtgttttc tatacatgag ggtgatttgg 2280 gatgtaccag tctcatttcc catgagattc ccttgactga tgacgtacct gtgaggcagc 2340 gttatagacg cattcctcca tctgaatacg aggtagtcaa agcacacata aatcaactgc 2400 tcgaggctca agtaataaga gaaagttgta gtccatacgc gtcacccatt gtcttggtta 2460 aaaagaagga tggtagccta cgtatgtgtg tagactaccg tcgtttaaat gctaagacaa 2520 ggaaggatgc ttttccttta ccccgcattg aggagacttt agactcgctg gcgggggcat 2580 gctggttttc taccatggac cttgctagcg ggtataatca ggtccctgta ggtgaggcag 2640 ataaaccgaa gactgctttt tgtactccat tcgggctatt tgaatggaac cggatgccgt 2700 ttggcctttg taatgcccct agcacctttc agcgtttaat ggaacggttg tttggagacc 2760 aacagtgcca gtctctgctt ttgtacttgg acgatattgt ggtgttttcc agttcagtag 2820 atcagcatct agcccggatg gaggttgtct taagcaggtt gctcagagag ggcctgaaag 2880 caaaattgtc aaaatgtgct ttctttaagc gtaaggtaca gtacctgggt catgtgatct 2940 cttcagaagg ggtctctaca gatccaggga agattgaggc agtttcacag tggcctcagc 3000 ctaatagtgt ttcggggctg cggtcatttt tggggtttgc tagttactat cggcgctttg 3060 tggaggggtt tgcgaggttg gcggctcccc tccacaggct ggtggcagaa ctggcagccc 3120 caaaaaggtc caagcatggc cccaaagaat ttgaacaggc ctggtctagt gagtgccaga 3180 gtagctttga ggaattgagg tgtagactca ccacagcccc tgtactcgca tatgcggact 3240 tctcactccc ctttattttg gaggtggacg ccagtcatag ggggttaggt gcagtcctct 3300 ctcaagagca gcagggtaag gtgcgtccca tagcctatgg tagtcgtggt cttaggccca 3360 ctgagcgcaa tacctctaac tacagttcaa tgaagctgga gttccttgcg ctcaagtggg 3420 ccatgaccga gaagtttagg gagtaccttc tgggtcataa gtgtgtagtc tatactgata 3480 ataatcctct tagctacctg acctctgcca agttgggtgc catggaacag cgttgggccg 3540 cccagttagc ggtgtttgac tttgaggtca agtatagatc aggtaagatt aaccgaaacg 3600 ctgatgcttt gtcacgtcag gattcctctg tgacagaggt tcaggatctg cttcctgggt 3660 tggttgttcc agccgcatta tgtcaagcta gtcacaggca gttggtgacc caagtaaacc 3720 aaattgtatc ctttccatgt tcttctgaga atgatatcag tgtccagcag caggctgatt 3780 cagttatcgg agagttattg gtgttctgga ggcggaagtt gccaccaagt gcagaggagc 3840 gcaagaaact ctccaggtct gcggtgatct tacttaaaca gtgggatcgt ctgagggaga 3900 tcgagggagt gctttatcgg cgagtgtttc gccctgacgg tggggaagaa gtccttcaag 3960 tgttgctgcc attggccatg aagtctgagg taatgacaca gctgcatcaa cagcacgggc 4020 accagggggt agagcgtacc acccagcttg tgtggcagcg gtgctactgg ccgggcatgt 4080 cgggggacat tgcccgttgg tgccaggagt gtgaaaggtg tcagtgtgcc aagggtgttc 4140 agcccatccc agttagcttc atgggacgtt tgttggctgc aaggccaaac gagattgtgg 4200 ctttggactt cacagtgttg gagccttcac actctgggat agagaatgtt ttggttatta 4260 ctgacatctt ttccaagtac accttagctg tacctactag ggaccaacga gctgagactg 4320 tggcccaggt tctggtggcg gaatggttct gtaagtttgg ggtgccaggt cgtatccatt 4380 cagaccaagg ccgtaatttt gagtctgtcc tcattcagca gctgtgtcat ttatatgggg 4440 ttgaaaagtc acgcacaacc ccataccatc cagcaggaaa cggccagtgt gagcgcttta 4500 ataggacatt acatgatctt ttgcgcaccc ttccgttgtc taggaaaggg gattggccat 4560 cttgtctccc ccaggtcctt tttgcctata acactacacc tcaccaatct acgggggaat 4620 ccccctattt tctgatgttt gggcaagagc ctagactgcc aattgacttt ttgcttggta 4680 ggattgaaga accattagca ggtagtgttc accggtgggt tttagagcat cagaatcggc 4740 tgcaggttgc ttttgagggt gctcgtgggc gtttggaggc tgcagccgat cgtcgaaaag 4800 ctaggcatga cctacaggtg cataacttgc ctctaaggga ggggcagaaa gtgtatctcc 4860 gtgactgtag tgtgagaggt agacataaga tccatgacct ctggagctca gtggtgtacc 4920 agatactgaa ggcacctaag gaaggaggtg tagtgtacac cattgcgcca actacagatc 4980 taagtaagat aaagcatgtg caccgatccc tattaaaacc tcaggttggt agagaccccc 5040 tccctagagt gccagatgtt ccattagtgg aacgtgtgct gcctcgtgag atggattcag 5100 atgagggaga cttgtttgtt ttagtgcctg aaacgccaca ggttagtagt gggcaagcat 5160 ctcaaggtag tcccccttct gtccttcagg atgccggtgg ctctgaagga gaaggaggca 5220 atacagagat ggcagctgaa ttaccgcctg taatgaaccc tccattaggc ccggtggtga 5280 ggggtgagag tgccgtgcgt aggactggaa ggagtacggc cggacagcat tctaatgtac 5340 atcatctgcc acggtcactt ggtgtaggta cagtttctag tatagttggt agtgcagctg 5400 aggtatggtt ccgtccttgg agttaaaggg tttaggagac atcttttctt gtcattttct 5460 ttcagttcac catcggggcg acggtgcaga agcctggggt ggat 5504 // ID CR1-35_DR repbase; DNA; ZEB; 2600 BP. XX AC . XX DT 13-FEB-2009 (Rel. 14.02, Created) DT 13-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-35_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2600 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 521-521 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(399..1280,1142..2422) FT /product="CR1-35_DR_1p" FT /translation="QTSQSPKLGLAEXFRCYCPXQASYKKRKSALWHNSNT FT RLLRQACRKAERKWKKDKLQISYEMLKDSLSAYQSAAKSAKASYFTDLIVK FT NHSTPKALFSVIHSVVNPAVQSMPVASNALCESFLRHFTDKISILRTKLCS FT APXDPPPPTLXLVYWDAFDPISLQSLKNVIANTKLSFCPNDIIHPKFLKQI FT IDTVGPGLTSFFNKCLSTGSFPNELKMATVTPLLKKPSLDPSILNNFHPIS FT VLPFCSKILEKIVLDQLQSFLKKATVSLKFFSLGLKWLIALKQPSSEYLMI FT SCDFRKNCTRSTAIFFKKSNCITEIFQSGFKMAHSTETALLRVLNDILLAT FT DSGDSVVLVLLDLSAAFDTVDHSILLSRLESVVGLKGTVLKWFKSFLSERK FT FSVKLGNFSSSFADLTCGLPQGSVLAPSLFSLYMLPLGAILRSHGVSFHFY FT ADDTQIYLPIKRNDPLALSTLLRCLDEIKIWLSHNFLSLNEDKTEVIVFSP FT TDNVQASCLDLGCLSAFRSPRVRNLGVIXDESLKLDKQISLVIGSSFYQLR FT LLSKVKHFLNPTTLEMAVHAFITSQLDYCNALYCGMSKSQVSHLQLVQNAA FT ARFICNSRKYDHITPLLRGLHWLPVQFRIDFKVLLLXYKSLHNLAPSYLSE FT LLHLYTPVRSLRSSEQNLLSVPQSRLKXRGDRAFSVAGPRLWNTLPLEIRM FT APSLSIFKSLLKTYLFSLAF*" XX SQ Sequence 2600 BP; 650 A; 596 C; 422 G; 911 T; 21 other; aatattgcaa tgaaatatga tcgtttttta ttagtgggtg attttaacat tcatgtytgc 60 tgtcaggcaa accctttatc taaggagttt cttaacctta ttgactcttt tagtctggtt 120 caatggataa cagatccgac catgttttag gccataccct agacctygtt ctttctcacg 180 ggcttgctgt ctctgatata gttttctgac tktaygcttt ctgatcataa gccgatttta 240 ttttccatgt ctcttcctac tctcccttta tccactaatg ttaatgtaac actgtctcgt 300 tactaytctc ctacatttgg ggccaatttt aatcagtgtt ttttggaaac atgctctcat 360 cattctttgg atctaccact gtttgacctg gatgctgaca aacatctcag tctcctaaac 420 tcggcctggc tgagygtttt agatgttact gccccwttca agcctcatat aaaaaacgca 480 aatccgcttt atggcacaac tcaaacactc gtctgctccg ccaagcctgt agaaaggccg 540 agcggaaatg gaaaaaggat aaattgcaaa tttcttatga gatgctaaaa gacagtctct 600 cagcttatca gtcagctgca aaatctgcca aggcttctta ctttactgat ctcattgtga 660 aaaatcactc tacacctaaa gctctgttct cagttataca ctctgttgta aatccagctg 720 ttcaatcaat gcctgttgct tcaaatgctc tatgtgaaag ctttttgaga cacttcactg 780 acaaaatatc aattttaagg acaaaactct gctctgctcc crttgatcct cctcctccca 840 ctttgcmtct ggtctattgg gatgcttttg atcccatttc tctacaatca ctgaaaaatg 900 taattgctaa tactaaactt tccttctgtc ccaatgatat catccaccct aaatttctaa 960 aacaaatcat tgacactgtc ggcccaggct tgacttcctt tttcaataag tgcctctcta 1020 ctggctcttt tcccaatgaa cttaaaatgg ccactgtcac tccgcttctc aaaaagcctt 1080 cactggatcc ctctattctg aataattttc accctatctc tgtcttacca ttctgctcta 1140 agattttaga aaaaattgta ctcgatcaac tgcaatcttt tttaaaaaaa gcaactgtat 1200 cactgaaatt tttcagtctg ggtttaaaat ggctcatagc actgaaacag ccctcctcag 1260 agtacttaat gatatcctgc tagccactga ctctggcgat tctgtggtcc tggttcttct 1320 agatctgtca gcagcatttg atactgttga tcactctatt ctgctgtcca gattagagtc 1380 agttgtgggc ttgaaaggca cagtccttaa gtggtttaag tcttttcttt ctgaaagaaa 1440 gttttcagtt aaattaggta acttctcatc ctcatttgct gatctcactt gtggcctccc 1500 tcaaggctca gtcttagctc cctcactttt ttcattatat atgcttccgc taggggccat 1560 tttaagaagt catggggtgt ctttccattt ttatgctgat gacacccaga tttacttacc 1620 tattaagagg aatgaccctc tcgcacttag taccttacta aggtgcttag atgagatcaa 1680 aatctggctg tcccacaatt ttttatcatt aaatgaggat aaaacagagg tgattgtctt 1740 cagccctact gataacgttc aggcctcttg cctagatttg ggctgcttat ctgccttcag 1800 atcgccccgt gtccgcaatc tgggtgtcat cyttgatgag tcattgaaac tagacaaaca 1860 gatcagtttg gttattggct ccagttttta tcaactacgt ytgctgtcta aagttaaaca 1920 cttyttaaat cccaccactc tagagatggc tgtccaygct ttyattacat cacagttaga 1980 ctactgcaat gcactttact gtggcatgtc aaaatcccag gtttctcacc ttcagytggt 2040 ccaaaatgct gctgctagat ttatttgtaa tagcagaaaa tatgaccaca ttacccctct 2100 tctaagaggc cttcattggc tgcctgtgca attcagaatt gattttaaag tgcttttact 2160 grtttataaa tccctccata atctagcccc atcctatctc tcagagctgc ttcatcttta 2220 cactcctgtg agatctctta ggtcaagtga gcagaatctt ctttcagttc cacagtctag 2280 gctgaaackt agaggggaca gagctttctc tgtggcaggt cctcggttgt ggaacacctt 2340 gccccttgag attagaatgg ctccatcatt atctattttt aaatcacttc ttaaaacgta 2400 cctttttagc ttggcttttt aagattattt taatagttcw ttctacygct tttttacatt 2460 tttacagacg tttctcttgc yttatttcta tgtttttata ttctgtcttt ttattttttt 2520 accttgtctt atgtacagca ctttggtcag tcttgtgact gcttttaaat gtgctttata 2580 aataaatkaa cttgaactta 2600 // ID Gypsy75-LTR_DR repbase; DNA; ZEB; 830 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy75-I_DR; KW Gypsy75-LTR_DR; Gypsy75_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-830 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1466-1466 (2008). XX DR Genome; chr1; Positions 26254738 26255567. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 830 BP; 211 A; 135 C; 173 G; 311 T; 0 other; tgtaacgggt tataatatca cctgtgatat tgtctttttc attaaaatct ttttcattgt 60 tttagtgatt ttattttgac attttctatc caatatatgt gaatttagta ttgagacttt 120 tattttgaat tttgctttcg gcggcgcgtc gacgccattt tagattactc gcgcgcctct 180 cgccagttca ctttgcactg gctggggaga ggtaggtgtt ttgctgtgct cctcgttttc 240 tgagtaaaat aagtatctaa agtgtataaa atgtttccat ttaatgccgt tggtactgtt 300 tagatgtatt gcaatgttct gttgtatgat ttaagtgtgg atgaatcagg aatgtgtagt 360 tatatcgtgt ggatgtagta cattttgtat tagtgtaaat ctgtgattac agcaactaat 420 agacattatg ttatacagga atatactgct gttatattac ccacaagaga tttagccact 480 gtttgatgtc aatccacagg tatttccatt ttatcatctc agtgttttat gttcatttat 540 tgatttttat tgatatttac agtaggtttc attatgttaa tgatataatt ttcctgactg 600 ttaaagttaa gtagcagttc actttgcact ggctggggag aggaatatac tgctgttata 660 ttacccacaa gagatttagc cactgtttga tgtcaatcca cagagctatg gtggccagtc 720 catgttgggc ttgaataaac cttctccggt gaatgaagta gcgcgtgtcc ttgtcttcct 780 tacacacaac gcagaggcca gtaacagcgg gcgaaagcag accggttaca 830 // ID BEL12-LTR_DR repbase; DNA; ZEB; 575 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL12-I_DR; KW BEL12-LTR_DR; BEL12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-575 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1374-1374 (2008). XX DR Genome; chr14; Positions 49656282 49656856. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 575 BP; 144 A; 79 C; 126 G; 226 T; 0 other; tgttaataac tttgacatgg tgtctttgtt attacatctt ctctacaatg tttatattgt 60 gtgtgtgtgt atactttctc tttctctatt atatgtatat ttgatatttt ggtatttatg 120 ttatgaagtg ttcagtgtaa aaccggaagt cttttaattt gaattaatta ccattacgga 180 tataaacaga cacggaaaga tcacacatga tcatggcgtc ccagctgtgc ggtctgctac 240 ggactaaaag tttagcttct ggtttgattg gagaacaagg tactgacact tcgtttgata 300 tgtttgtttg tttgtattga gtttgtatat tgagtttaag tgatgtctgt ttgtttgtat 360 atggttttca tatttgctta catttggttt cattttctct ttgtgtacaa gctcttacgg 420 tgtttgccgt acggtgcaag atatacggtg tttggccttt ggtataaggt gttttgatgt 480 tgacgacggc gtcagtgaag tgaatgctgc actgaaataa attatggaaa aacaccagtg 540 aaagaggcaa tcctttattg aaccagggag ctaca 575 // ID TDR2 repbase; DNA; ZEB; 941 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 25-SEP-2007 (Rel. 12.1, Last updated, Version 2) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Tc1 superfamily; Tdr2. XX NM TDR2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-941 RA Gottgens B., Barton M.L., Grafham D., Vaudin M. and Green R.A.; RT "Tdr2, a new zebrafish transposon of the TC1 family."; RL Unpublished. XX RN [2] RP 1-941 RA Jurka J. and Drazkiewicz A.; RT "TDR2: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 30-30 (2002). XX DR [2] (Consensus) XX CC TA-target site duplication. XX SQ Sequence 941 BP; 294 A; 183 C; 190 G; 272 T; 2 other; tacactacct gacaaaagtc ttgtcgtcga tcccagttgt aagagcaaca aataataact 60 tgacttctag ttgatcattt ggaaaagtgt cagaaggtag atttttctca gatgaatcat 120 ctgttgaact gcatcccaat catcacaaat actgcagaag acctattgga acctgcatgg 180 acccaagatt ctcacagaaa tcagtcaagt ttggtgaagg aaaaamtcat ggtttggggt 240 tacattcagt atgggggcgt gcaagagatc tgcagagtgg atggcaacat caacagcctg 300 aggtatcaag acatttgtgc tgcccattac attacaaacc acaggagagg gcaaattctt 360 cagcaggata gcgctccttc tcatacttca gcctccacwa catcaaagtt cctgaaagca 420 aagaaggtca aggtgctcca ggattggcca gcccagtcac cagacatgaa cattattgag 480 catgtctggg gtaagatgga ggaggcattg aagatgaatc caaagaatct tgatgaactc 540 tgggagtcct gcaagaacgc tttctttgcc attccagatg actttattaa taagttattt 600 gagtcattgc agagatgtat ggatgcagtc ctccaagctc atgggagtca tacacaatat 660 taattctttt tccactgcac catgacttta tattctatac tgtacattat ttctgttaag 720 tgacaagact tttgtctaag caaagtcaga ccttactgtc ctaattaaat aattaaaaat 780 caaggcatga tcatatttta ttttggtaaa ataagtgtaa tctagaggcc tttgcctttc 840 atataagcca cttctgatac caaatgatca actagaagtc aagttattat ttgttgttcc 900 taaaacttgg ataggcgaca agacttttgt caggtagtgt a 941 // ID BEL19-I_DR repbase; DNA; ZEB; 5894 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL19-I_DR; KW BEL19-LTR_DR; BEL19_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5894 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1385-1385 (2008). XX DR Genome; chr13; Positions 40007449 40001556. XX CC Positions [4924-5484] - Integrase core CC 'GTTGG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 43..5850 FT /product="BEL19-I_DR_1p" FT /translation="MMSRNRTGVSEPDVIRQRRITRPPAYLRDYEVGYTSK FT HLTFADERSEMLTCIREMREENMQLRQEVRRISDVIANSPVLASQDPVAVQ FT QSRVKEAGVSSTPKSTGMLVSFQDYENPRQIEHTPLVPPRDESLLAVSQDR FT DRELIEELTESLKRVGRLSDSPSGSGISTPSYHDPHNSNSNSPPRNNASDA FT NIPVRQHNLTRFHYCNTDDSPVLQSSIPPSDAYRTTTDPYYHCVNDRTHYL FT SKNARDHLDYFSFDGKQEYNRENRHFLSPGSAYNPYSVPPHALYTQPVQYR FT GPTPTIPNFSRDDPREFAHLKLALDNVLPYDASEAFKFQILVDHLKLEDAL FT LVADSYSHSRTPFSDTMRALTDMYGQPHQLALQRITNLMEGPNIRSGDVKS FT FKSFALRVRALVGMLNQLGSTGWTELKCGSHVSRLLAKLPHDLRANFKRFI FT NPIQTPIPTLLEFADWLEYEVRVQVDGNQYGSYPEYEKHVPRKDKRPSFMP FT RKLTTVLHGGEQKEVAVENIKKEHAHVERPQDKPKKYCPFCDSFQHYMNQC FT SNFKLLSKEQVESWIKAGNRCWRCGRGHLSSKCTLKAKCKRCDRRHLDVLH FT EVNASAYTMIKSNQKIPQKSTDSEGPTSQTLYLDRPTCGKQVLLKLSRVII FT QSGIKSLETYAVLDDGSERTILMYEAAQRLGLQGDREDLALRTVKQEAQTI FT QGKVVSFSIVPFSFPDRSYQIKGAFTAKELALVQHTYPISDLQKKYRHLRD FT LPLHEINKAQPLVLIGSDYPHLITPLEPVRLGPPGGPAAVHTKLGWTLQGP FT SSLLKSCLRPQECLFISCLSPEAELLRQVENLWKLDILPYRSEKIISRSRQ FT DAEAIRILEEKTIRIEVQDVKRYATPLLWKQLPPPLNAPQEAVMGLLCSTE FT RKLSKEPEKAESYDFEIQKLLQSGYVTQLTPAESESSAQSWYIPHHMVQHN FT GKNRIVFNCSFEYKGQSLNEYLLPGPTLGSSLLGVLLRFREYPVAICSDVK FT GMFHQVRLLDEDKPFLRFLWRQGKTSEMPVVHQWEVLPFGTACSPCCAVFA FT LQSHVKGDLDPTDEVRQTVERNFYVDNCLSSVPTESQAKNLVNRLQCHLMD FT GGFELRQWACNIPEVIQHLPNDLKSKSGEIWLSQGIDNPQENALGLIWHCK FT TDNLVYKPYQREDGEITMRKIYRTLAKLYDPLGYLIPYTTRAKTVVQLLWD FT KKREWDDPNLPEDLLAIWHQWENELPNLSQVFLPRCYTSHIQCTVKHRCVH FT VFCDASERAYGSVAYLCSEDEQDNRHVSFIAARSRVAPRKQISIPRLELCA FT ALTGAQLGDVLKRELTLNINQFVYWSDSTTVLNWLQSDSCRYRVFVGVRVT FT EIQELSDPGSWRYIDSATNPADDVTRGLSLIQLAENSRWKLGPDFLRQPSS FT CWPKLPVGPASEEVQELKKSIFCGLVDDIKEQDEPDVHQFSSYSEFLEATA FT RSLHGAADSTFSPSADDFRQAELSILRQCQKHDFPEEYALLREGKAIPTRS FT RILKLAPEYDRDLDLIRVGGRLRRCPYLNEEVLHPIVLAPDHPVVKLLIKH FT YDAQLHHPGAGRVLAEIQRKYWILRGREAIKRHQHNCLDCRKWRGKPVVPK FT LSDLPPSSLRLFKPPFYSTGVDCFGPLMVKIGRRIEKRWGIIYKCLTTRAV FT HLDLLYHMDSDSFLMSLRRFIARRGKPHELLSDQGTNFKGGANELEDTFNQ FT IQPILRDMLAKDQIRFQFNPPSAPHFGGSWEREVRSVKTALRSTLGAQTVT FT EEVLRTILTEVENILNSRPLGYVSSDIADPDPVTPNSLLMGRPDSSLPQVV FT YSDSELLSKKRWRHSQVLSDHFWKHFIHDFLPSLQSRQKWHSERQNIVVGT FT VVLIVDEQLPRACWKVGTVSSVIPSADGRVRTVTVKVKDQTYTRPVVKLIE FT LPSLPEDLENSS" XX SQ Sequence 5894 BP; 1695 A; 1298 C; 1319 G; 1582 T; 0 other; tttaatggtc cttcgagccg gatatttatt actaccttaa caatgatgtc caggaatagg 60 acaggagttt cagagccaga tgttatacgt cagcgtagaa ttactagacc accagcatat 120 ctacgtgatt atgaagtagg ttacactagc aaacatctta catttgcaga tgagcggtcg 180 gaaatgctta catgtatacg tgaaatgagg gaagagaata tgcagttgag gcaggaggtg 240 agacgtatat cagatgtgat tgctaattct ccagttttag cttctcagga tccagtagct 300 gtgcagcagt ctagagttaa agaggcagga gtgtcatcta caccaaagtc tacaggtatg 360 ctggtgtcat ttcaggatta tgagaatcct cgccagatag aacatacccc tttggtgcca 420 cccagagatg aatccctcct cgcagtgtcg caggatcgtg accgagagct tattgaggag 480 ctaacagaga gtctaaagag agtaggcagg ttaagtgata gtccgtctgg ttcaggtata 540 tcaacaccct cttatcatga ccctcataat agcaatagca attcacctcc acgtaataac 600 gctagtgatg ctaatatccc tgttcgacaa cataatctga ctagattcca ttattgtaat 660 actgatgatt ctccagtgtt gcagtccagc ataccacctt ctgatgctta tcgcacaaca 720 actgatcctt actatcattg tgtgaatgac aggacgcatt acttatcaaa gaatgccaga 780 gatcatcttg actacttttc atttgatggg aagcaagagt ataatagaga gaatagacat 840 ttcctatcac ctggtagtgc atacaatcca tactctgtgc ctccacatgc tttatatact 900 caacctgtac aatatcgtgg ccctactcct accattccga attttagtag agatgacccc 960 cgcgagtttg ctcatcttaa acttgcatta gacaatgttc ttccctatga tgcttcagag 1020 gcattcaagt ttcaaatact tgttgaccat cttaagctgg aagatgcttt gcttgtagct 1080 gactcctata gtcacagccg tactccattc agcgacacaa tgagagcgct tactgatatg 1140 tatgggcagc cacatcaatt agccttgcaa aggattacaa atttgatgga agggccaaac 1200 attaggagcg gagacgttaa gtcatttaaa tcatttgctc tacgtgtccg agcattggtt 1260 ggaatgttaa accagttagg tagcacaggt tggacggaac taaaatgtgg ttcacatgta 1320 tctcgactct tagccaagtt gccacacgac ctaagagcta acttcaagag atttattaac 1380 cccattcaga caccgattcc aactctgctt gagtttgctg actggttgga gtatgaagtg 1440 cgtgtgcaag tagatggaaa tcagtatgga tcataccccg aatatgaaaa acatgttcca 1500 cgtaaagaca agcggcccag ttttatgcct cggaagttga ctacagtgct acatggagga 1560 gaacaaaagg aagtagcagt tgagaatata aagaaggagc atgctcatgt ggaacggcca 1620 caggacaagc caaaaaagta ctgtcctttc tgtgattcat tccagcacta tatgaatcaa 1680 tgttctaatt tcaaacttct ctccaaggag caggtagaaa gttggattaa agcaggcaat 1740 agatgctggc gttgtggacg aggacattta tcgtcaaaat gtacgttgaa agccaagtgc 1800 aagcgatgcg acagaaggca tcttgatgtc ctacatgagg tgaatgctag tgcatacaca 1860 atgattaaaa gtaatcagaa gattcctcaa aaaagtacag attcagaagg gcctacctct 1920 cagacattat acttggatcg ccctacttgt ggtaaacaag tgctgttaaa actgagccga 1980 gttattattc agagtggtat taagtcacta gagacatatg cagttcttga cgacgggtca 2040 gaacgtacta ttctaatgta tgaagccgct caacgtctag gtcttcaagg agacagggaa 2100 gacctggctc ttcggactgt gaaacaggag gctcagacca tacagggaaa agttgtttcc 2160 ttttccattg ttcccttttc attccctgat cgatcttatc aaattaaggg tgccttcaca 2220 gccaaagaac tcgcattggt ccagcataca tacccaattt ctgacctcca gaaaaaatac 2280 cgacatctac gtgacttacc cttgcatgag atcaacaaag ctcagcctct tgttcttatt 2340 gggtcagact atccgcactt gattacccca ttggagcccg tacgtttggg gccccctgga 2400 ggtcctgcgg ctgtccatac caagcttggc tggactttac aaggtccttc aagtcttctc 2460 aaatcttgtt tacgtcctca ggaatgtctc ttcatttctt gcctttctcc tgaagctgaa 2520 ttattgcgac aggttgaaaa tctttggaag ttggatatcc ttccttaccg ctcagagaag 2580 attatcagcc gatctcgtca ggatgcagaa gcaattcgta tcttagaaga aaaaacaatc 2640 aggatcgagg tacaggatgt aaagagatat gccacccctt tactctggaa gcagttaccc 2700 ccccctttga atgcacctca ggaagcggtc atgggtctgt tatgtagcac tgaacgaaaa 2760 ctgtctaaag agccagagaa agctgaatcc tatgatttcg agattcagaa actactgcag 2820 tcgggttatg tcactcagct gacacctgca gaaagtgaat caagtgcaca gtcgtggtac 2880 ataccacacc acatggttca acataacgga aagaaccgta ttgttttcaa ttgctccttt 2940 gaatacaagg gtcaatcctt gaacgaatat cttctgcctg gccctacttt aggctcaagt 3000 ctcttaggag tccttcttcg ctttcgtgaa tatcccgtag ctatctgtag tgacgtgaag 3060 ggcatgttcc accaggtccg cctgcttgat gaggacaagc catttttacg attcctgtgg 3120 cgacagggaa aaacatctga gatgccagtc gtgcatcaat gggaagtcct gccatttggc 3180 actgcttgca gtccctgttg tgcagttttt gcactccaat ctcatgtaaa gggggatctt 3240 gatcctactg atgaggtgcg tcaaacagtc gagcgaaatt tctatgtaga caattgtcta 3300 agcagtgttc caactgaatc tcaagccaaa aatctagtga accgactgca atgtcatctt 3360 atggatggtg gatttgagct cagacagtgg gcatgtaaca tccctgaggt gattcagcac 3420 cttcctaatg acttaaagtc caagagtggg gagatatggc tcagtcaggg aatagataac 3480 ccacaggaaa atgcccttgg attgatatgg cattgcaaaa cggacaactt agtttacaaa 3540 ccttaccagc gagaagatgg tgagattact atgcgaaaaa tctatcgcac tttggccaag 3600 ctctatgatc cgctaggata tctcatacca tatactacaa gagctaaaac tgttgtgcag 3660 ttgttgtggg ataagaaacg tgaatgggat gatcccaatc tccctgaaga tctgctggcc 3720 atttggcatc agtgggaaaa tgaactccca aacctgtctc aagtattctt gccaagatgt 3780 tacacttctc acattcaatg tacagtcaaa catcgctgtg tacatgtatt ttgtgatgct 3840 tctgaaagag cctatggttc tgttgcatac ttgtgctctg aagacgaaca agacaataga 3900 catgtatctt tcattgcggc aagatcacgg gttgcccccc gaaaacagat ctcaatccct 3960 cgtttagagc tatgtgctgc actaacggga gcacagcttg gagatgttct aaaacgagag 4020 ctgaccctaa acatcaatca atttgtttat tggtcagatt ccactaccgt ccttaactgg 4080 ctacagtctg attcgtgtag ataccgagtg ttcgtaggag tgagagtgac agaaattcag 4140 gaactttctg accctggctc ttggcgctac attgattcag ccacaaaccc tgctgatgat 4200 gtaactcgtg ggttgtcact aattcagcta gctgaaaatt caagatggaa acttggtcca 4260 gacttcttga ggcagcctag tagctgttgg ccaaaattac cagtaggtcc cgcttctgaa 4320 gaagttcagg agcttaagaa atccatcttt tgtggattag tggatgacat taaagagcaa 4380 gatgagcctg atgtccatca gttcagtagc tattctgagt tcttagaggc tactgcacga 4440 tcccttcacg gggcggccga tagcacattt tcaccaagtg cagatgattt cagacaagca 4500 gaactgtcta tcctcagaca atgtcagaaa catgacttcc cagaagaata tgctttgctc 4560 agagaaggta aagccattcc cacacgtagt cgaatactta aacttgctcc cgaatatgac 4620 agagacttgg atttgatcag agttggggga cgacttcgta gatgccctta cttgaatgaa 4680 gaagtattac accctatagt tctggcccca gaccatcccg tcgtcaagct tctaattaaa 4740 cattatgatg cacaattgca tcacccggga gctggaagag tcttggctga gatacaacga 4800 aagtattgga ttttgcgtgg aagagaggcg atcaaacggc atcagcacaa ctgccttgac 4860 tgccggaaat ggagaggcaa gccagttgtc ccaaaattat cagacctacc accatcaagt 4920 ctaagattat tcaaacctcc cttctactct actggagtag attgttttgg gccactgatg 4980 gtgaaaattg gacgccgcat agaaaaaagg tggggcatca tctacaagtg cctcaccaca 5040 cgagctgtgc atcttgatct gctgtatcac atggacagtg attcattttt gatgtctctg 5100 aggcgcttca ttgctcggag agggaagcct cacgaactcc tttccgatca aggaacaaat 5160 tttaagggtg gtgctaatga gttagaagac accttcaatc agattcagcc catccttcga 5220 gacatgttgg ccaaggatca aatcaggttc caattcaacc ctcccagtgc ccctcatttt 5280 ggagggtcat gggaaagaga ggtgaggtcg gtcaagactg ctttaagaag cactttgggt 5340 gcgcagactg ttaccgaaga ggtattaagg acaattttga ctgaggtcga aaacattcta 5400 aactccagac ctctcggtta cgtctcgagc gatatagcag atcccgaccc agtaactccg 5460 aactccttac taatggggag acctgactca tctttgcctc aggttgttta ttcagattct 5520 gaattgctca gtaagaaaag atggcgacac agccaggtcc ttagtgacca tttttggaag 5580 cattttattc atgactttct gccatcctta cagtcaagac aaaagtggca cagtgaaagg 5640 cagaacattg tggttggtac agtagtactc attgttgatg aacagcttcc aagagcatgc 5700 tggaaagttg gtacagtttc gtccgtcatt cccagtgcag atggtagggt cagaacggtg 5760 acggtcaaag tgaaagatca gacatacact agaccagttg tgaaactaat tgaactgcca 5820 tctttacctg aagacctgga gaattcatcc taaactcaat aatggcaaat ttgcttagca 5880 aatttgaggg cggc 5894 // ID Copia5-LTR_DR repbase; DNA; ZEB; 192 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia5-I_DR; KW Copia5-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-192 RA Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1593-1593 (2008). XX DR [1] (Consensus) XX CC 5 bp target site duplication. LTRs are 100% similar to each CC other. The closest elements to this family are from fungi and CC plants. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 192 BP; 55 A; 27 C; 53 G; 57 T; 0 other; tgttaacatg ttgttatctc tctactttgt tgacctctaa gaaggttatc cgggttacaa 60 atgagaaagc gcgcaggggc ggggttgatg agggaaaaag ttcagttgaa gatgtctcgt 120 gtgtgtgcga ggagggaatg aacgaaataa agtgtgctta ctaaataaac gtcgtgttta 180 ttcaacttta ca 192 // ID Gypsy134-I_DR repbase; DNA; ZEB; 5762 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy134-I_DR; KW Gypsy134-LTR_DR; Gypsy134_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5762 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1529-1529 (2008). XX DR Genome; chr3; Positions 52737804 52732043. XX CC Positions [2679-3182] - Reverse transcriptase CC Positions [4461-4595] - Integrase core CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 472..1542 FT /product="Gypsy134-I_DR_1p" FT /translation="MEDEMSELRQLVAQLKADNERLHLERTPALPGPSAAS FT PVSDAEPVLPQPADRNIGQPERFVFVPRDRKCPKFNGKTGIGINEWIEEAQ FT ACIRTRHLSIPDQAFFLFDHLEGEAREEIRYRSDIERGDPDKIFSALREVF FT GCSQSHVALQEAFFSRRQSEGESLLEFSLALMALMEKVKQGFSGVIPNAPV FT LLRDQFVEHVADNALRRELKQLVRRQPAATLLEVRSEAIRWEQEGMPGGAR FT ARSQSVPLSYGIQYGVQRGTHSNKAAPSQSELSEIREILKAQQRQLNQLAQ FT SFAQIQGVPPRSRSPRFGPVICRRCHHPGHFARECNASLPGPSSSVSTAVL FT PVEGPCQASQPSGN" FT CDS 1632..4595 FT /product="Gypsy134-I_DR_2p" FT /translation="MSGCPQLVVNIGGVPITCLLDTGSMVTTITESCFLEN FT YNPWGQERLKLCQWLQLRAANGLEIPYIGYLELDVELCGKIVPDCGILVVR FT DPPGGMCARLHGVLGMNVLSRCYQELFGQHGQALFDSPPVVGFPGAVMQAL FT QHCHEIGPQTVFGQVGKTRVRGRYQCRVPGGTMKIVAATCSEHYSGNTVLF FT EPLESGLPAGLLASPALVPVVRGTVYIPVLNVGTLDAVLFPNTLLGSLCSV FT YVVSLPPGVTEVKEVVATVSSQGCGIAPSVQEQVNSLDLSVLEMEEQDQVR FT TLLHRYQSVFSTHDQDLGCTNLISHDIPLTDDVPVRQRYRRLPQSEYDVVK FT THINQLLEAQIIRESCSPYASPIVLVKKKDDSLRMCVDYRQLNAKTRKDAF FT PLPRIEETLDALTGARWFSTLDLASGYNQVPVTEGDKAKTAFCTPFGLFEW FT NRMPFGLCNAPSTFQRLMERLFGDQRHSSLLLYLDDIVVFSSSVTQHLERL FT EVVLGRLKREGLKAKLEKCAFFKKQVAYLGHVVSSQGVSTDPSKVEVVANW FT GRPNNVTELRSFLGFAGYYRRFVEGFAKLAAPLHKLVAEFVEGKRKKPTRL FT GFASAWTGPCQSSFDALKEKLTTSPILAYADFSLPFILEVDASQRGLGAVL FT SQEQEGKVRPIAFASRSLRSTERNPVNYSSMKLEFLALKWAMTEKFREYLL FT GHKCIVFTDNNPLSHLATAKLGTTEQRWAAQLAVLDFDIRYRPGKNNGNAD FT ALSRQSAVDSEGVGFLLPATAVPTALQQAVRGEPVYAVTQAALKVFPSHSP FT ADLRGLQETDPDIGKVLLFWRSKKFPCLDERKLLSKSSVTILRQWDRLVEL FT NGVLHRRIFRPDGREEVFQVVLPVVLREEVLRHLHQEHGHQGIERTTELVR FT QRCYWPGMSGEVARWCRECERCQIAKEGPHGASSFMGHLLAARPNEILAID FT FTVLEPSCSGLENVLVMTDVFTKYTLAVPTRDQ" XX SQ Sequence 5762 BP; 1365 A; 1210 C; 1611 G; 1576 T; 0 other; ggtgttttag tgtgtgtata gtgtgttttt agtagatggg ttttattgaa tttaaacagt 60 atatacattg cccccctttt tgatgcattt ttgaaaaaaa taaagtttgt atgtattttt 120 tccaatccac gtctcttgcc atttactgca agcttacctg tgttgccttt tataatataa 180 aatataggtc cctggccata ttccagggtg gcgtagtcgt gctagctgtt gaacattatt 240 gaaccccttg gttatgccac attttggcgt agtcggcagg actgcagtaa tttaggcaaa 300 aacgtggttt gataaattag ggttttgggt cttgtttggt ggttttgttt cattgtgatt 360 ttagtgtaaa gcagcagcga cggttaggag tttggggaac atatatttgc ttcagtctac 420 tgacagtggg tccctttgtg gtggtaagtg gtaatttatt tttctttaaa gatggaagat 480 gaaatgagtg aacttagaca actggtggcc cagttgaagg cagacaatga gaggttgcac 540 ctggagcgta cccccgcatt gcctggtcct agtgcggcat ccccggtgtc cgatgcagag 600 cctgtactac ctcaacctgc tgacagaaat attggtcagc ctgaaagatt tgtgtttgtt 660 cctagggacc gtaaatgtcc taagtttaat ggtaaaacag gaattggaat aaatgagtgg 720 attgaggagg cacaggcctg cattaggact cgtcatttgt caattcctga tcaggcattc 780 tttttatttg accacctaga gggagaagca cgagaagaga tcaggtatcg ttctgatatc 840 gaacgagggg acccagacaa gatttttagt gcgttacgtg aagtttttgg ttgttctcag 900 tctcacgtgg cattacagga agcctttttc tccaggaggc agtcagaggg agagagctta 960 cttgaattct ctcttgcttt gatggccttg atggagaaag ttaaacaagg gttttcgggg 1020 gtcataccaa atgctccagt attattacgc gaccagtttg tggaacatgt agcagataat 1080 gcacttcgcc gtgagcttaa acaattagtc cgccgtcaac cggctgccac gttgctagaa 1140 gtgcgtagcg aggcgattag gtgggaacaa gaaggtatgc ctgggggggc gcgggctcgt 1200 agccagtctg tcccattgtc atatgggatt cagtatgggg ttcagagggg aactcattca 1260 aataaagcgg caccctcgca atctgaatta agcgaaataa gagaaatatt aaaagcacaa 1320 caacgtcagt tgaaccagct tgctcaaagt tttgcccaaa ttcagggggt acctccgcgg 1380 agtcgctccc cccgctttgg accagtgatc tgtaggcggt gccaccaccc aggtcatttt 1440 gccagggaat gtaatgcatc attgccaggt ccgtcgtctt ctgtttctac tgctgtgtta 1500 ccagtggagg ggccctgtca ggcaagtcag ccctcgggaa actaataccc accgggctgt 1560 tgagccacag ctcggatggg gtaattaggg gctcagatgt tctaattagt ggtagctcag 1620 tgtcccgttt aatgtcaggt tgccctcagt tggtggtcaa catcggtggt gttcctatca 1680 cttgtttgct tgatacaggt tccatggtga ccaccataac tgagagctgc tttctggaaa 1740 actataatcc gtggggtcaa gaacgtctaa aactgtgcca atggctccaa ctacgtgctg 1800 ctaatgggct tgaaatccca tatattggct atttggagtt agacgtggag ctgtgtggca 1860 aaatagttcc agattgtggg atattggtgg tcagagatcc tcctggtggc atgtgtgctc 1920 ggctccatgg ggtgctgggg atgaatgtgc tgagccggtg ctaccaggag ctctttggcc 1980 aacatgggca ggccttattt gattctccac cagtagtagg gttccctggt gctgtgatgc 2040 aggctttgca acactgtcat gagattgggc cccagacagt atttggccaa gtaggtaaga 2100 caagggtgcg aggtcgttac cagtgccgtg ttccaggtgg taccatgaag atcgttgctg 2160 ctacttgttc agagcactat tcagggaaca ctgtgttgtt tgaaccactg gagtcaggac 2220 ttccggcagg gcttttggcc tcaccagcct tggtgcctgt tgtgcgaggg acagtctata 2280 tcccagtgct caatgttgga acattagatg cagttctttt tccgaacacc ttactaggtt 2340 ctttgtgtag cgtgtatgtg gtgagtttac ctcccggggt gactgaagtg aaggaggtgg 2400 tggctactgt gagttctcaa ggttgtggga ttgcaccctc tgtacaagaa caggttaact 2460 ctctagactt gtctgttttg gagatggagg agcaggacca ggtgaggacc ctcttacaca 2520 gatatcagtc tgttttttcc actcacgacc aggacttggg gtgtactaat ctgatctccc 2580 atgatattcc cctcacagat gatgttccgg ttcggcagcg gtatagacgc ttgccccaat 2640 ctgagtatga tgtggtcaaa actcacataa accagttgtt ggaagcccaa attatacgtg 2700 agagctgtag tccatatgcg tctcctattg tcttagtaaa gaaaaaagac gatagtctac 2760 gcatgtgcgt agactaccgg caattgaatg caaagactag gaaggatgca tttccattgc 2820 cccgtataga agagactttg gatgcactga ctggagcccg ctggttctcc actttggatt 2880 tggccagtgg ctacaaccag gttccagtca ctgaggggga taaggccaaa actgcattct 2940 gtaccccgtt tggcctattc gaatggaaca ggatgccgtt tgggctttgt aacgctccga 3000 gtacattcca gcgtttaatg gagagattat ttggtgacca gcgacattct tcattacttt 3060 tatatcttga tgacattgtg gttttctcct cctcagtgac tcagcatctg gaacggttgg 3120 aggtggtctt gggtaggcta aagcgagagg gcctgaaagc caagttggaa aaatgtgcgt 3180 tcttcaaaaa gcaggttgca tatttggggc atgtagtgtc atctcagggt gtttccacag 3240 accccagtaa agtggaggtc gtagcaaatt gggggcgtcc aaataacgtg acagaacttc 3300 gttcattttt agggtttgct ggctattatc gccgctttgt ggaggggttt gccaagttgg 3360 cggcccctct tcataagcta gtggctgaat ttgttgaggg caagcgaaag aagcccacaa 3420 gactgggttt cgctagtgca tggactggcc cttgtcagag ctcctttgac gccttaaagg 3480 agaaacttac cacatcaccc atactcgctt acgccgactt ctcgttacct tttattctag 3540 aagtagatgc aagtcagcgg ggcctaggag ctgtactctc ccaggaacag gaggggaagg 3600 tgcgacccat tgcctttgcc agtcgcagtt tgcggtccac tgagcgcaac ccagtaaatt 3660 acagctcaat gaaactagag ttcttggcgc tcaagtgggc catgacggaa aaatttcgag 3720 agtacttgtt ggggcataaa tgtattgtct ttactgacaa caaccctttg agtcatctag 3780 cgacagcaaa gctggggacc actgaacagc gctgggctgc ccagctggct gttcttgact 3840 ttgacattcg ttaccgtccg ggtaagaata atggtaatgc agatgctctt tcgaggcagt 3900 ctgcagtaga tagtgagggg gtggggtttt tgttgcccgc tactgcagtt cctacagctt 3960 tacagcaggc tgtaagaggg gagccagtgt atgcagtcac ccaagcagca cttaaggtct 4020 tcccaagtca ttcaccagct gacttaaggg ggctgcaaga gactgaccca gatataggta 4080 aggtcttgtt gttttggcga agcaaaaagt ttccctgttt agatgaacgc aaattgttgt 4140 ctaaaagtag tgtaactata ctccgtcagt gggaccggct ggtagaactt aatggagttt 4200 tacaccgtcg aatcttccgc cccgatggga gggaagaagt ttttcaagtg gttctgcctg 4260 tggtattaag ggaggaagtt ttaagacact tgcaccagga gcatggtcac cagggcattg 4320 aacggaccac tgagttggta cgccagcgtt gctactggcc aggtatgtct ggggaggtgg 4380 cccgttggtg tagggagtgt gaacgctgcc aaattgcaaa agagggacca catggggcct 4440 ccagttttat ggggcatttg ttagccgcta ggcctaatga gatcttggcc attgacttta 4500 cagtgttaga gccatcctgt tcagggttgg agaatgttct ggtgatgact gacgttttca 4560 ccaagtacac cctggctgtt ccaacccgag atcaatgagc ggaaactgta gctcaggtgt 4620 tgctggtgga atggttctgc aagttcgggg ttcctggtca tatacattct gaccagggca 4680 gaaattttga atcttccctt atacaacagc tttgtagctt gtataacgta gagaagtcgc 4740 gaaccacacc ttaccaccct gcagggaacg gccagtgtga gaggtttaac cgaaccctcc 4800 acaacctgtt gcgcactctg cctccttcca gtaagcgaga ctaggtctta tggcttccac 4860 aggtgctgtt ttgttataac actacccccc atcaatcaac aggtgaatca ccgttttatt 4920 tgatgtttgg tcaggagcct cggctacctg tggactttct attaggtaac gtggaggagg 4980 cagctgcagg gacaatatat gagtgggtga gggagcatca agctcgactt caagtggcgt 5040 ttgaaggcgc aaggggacac ttgagagctg cggcagaaca gcgtaaggag ttgtatgact 5100 tgcgtgtacg agatgcacca ctgagcgagg ggcagttggt ctaccttcgc gatcttggtg 5160 tgaggggtca ggcaaagatc cgtgattgtt ggagcccagt ggtgtatcag gttgtgaaag 5220 cccctaaaga aggaggatca gtttattcga ttgccccgtt gaatgatttg gataaggtga 5280 ggcaggtaca tcgctcgctg ctgaagggcc gaatacagaa ggattgccct gacaacttgc 5340 cctcctgtag ttctggagtg gaacctgaac cccccttgga ggaaccggct ttggaagagt 5400 cagaacaggt tgatttgtgg gttttgttgc ctgcaactgc tccagtgagt gaggccgggc 5460 agggagtaga ttcaagacaa gtaagttcct ccactagcag tggggcaggg gtctctgtta 5520 gtgcagatca aacctgtggg aacgaggtgg agttacctgt agatgtagta ctacgacggt 5580 ctaggagggc gacagctggt cagcactcta atgttcatcg ccttccacga tctgtgggaa 5640 ggggtactga cagagaggag cctgtaggaa cctctagggc gattaccgct tggtttagac 5700 cctggaattg aagggtagaa gtgtttatcg tcggggcgac gatacagaag ctgggggtag 5760 at 5762 // ID Gypsy36-LTR_DR repbase; DNA; ZEB; 390 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy36_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy36-I_DR; KW Gypsy36-LTR_DR; Gypsy36_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-390 RA Dib M.R. and Naveira H.F.; RT "Gypsy36_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 802-802 (2007). XX DR [1] (Consensus) XX CC Gypsy36-LTR_DR is a long terminal repeat of the Gypsy36_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy36-I_DR. XX SQ Sequence 390 BP; 76 A; 73 C; 89 G; 152 T; 0 other; tgttaggtcc catgtgggac ctgtttgtat gtgtgtgttt tttgtgtgat ttttgcagag 60 cctgcatgca tgattgcaga ttggtgggtg ggtcatccca cctgagactc atcgtggggt 120 gttttataag gagtccggca ttcggtctag aggagagctc gattagccgg acgtctctgc 180 ttggcgcttt tgatttagtt ttgtgtttgg atcatgagct cagtacactt ccatacacat 240 gcttttcact actgactttc acctatactg acttggattg ctttaaatac tttttgttaa 300 ttatacattt tgttaataaa tcatttctct tttcaattta cctggtccgt attgtctctc 360 taatgttgca actttgagcc ggttgtaaca 390 // ID DNAX-1_DR repbase; DNA; ZEB; 693 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE This is a nonautonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNAX-1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-693 RA Kapitonov V.V. and Jurka J.; RT "DNAX-1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 260-260 (2004). XX DR [1] (Consensus) XX CC The genome harbors several thousand copies of DNAX-1_DR CC elements that are ~90% identical to the consensus sequence. CC DNAX-1_DR elements are characterized by 21-bp terminal CC inverted repeats. Despite the relatively young age of these CC elements, most of them are not flanked by target site CC duplications. XX SQ Sequence 693 BP; 206 A; 130 C; 136 G; 212 T; 9 other; cantagctat gtttccatcc acctattttt atgcgcattt tgcatatgcg cataaaaaaa 60 acngttgatg gaaacgccat gatgcgcata aattttgaaa atgcgcataa aaaacntatg 120 cgcataactg agtaggataa actttttatt cgataagaaa agatgcgcat aaactacgat 180 ggaaacactt ttaccgcaca aactccagca tgcgcattaa aaaaggtcat gtgattttgt 240 tawaagagat catgtgatga ttaaaatgtg tgtgaatgga caaaccakca ggctgagcac 300 attgtaaaac atctgaaatg ttgttttggt cattctaaaa cgccttaccg tttcagtatt 360 agtgttatta tattattaat gacctccaga atcaagagcg tctgtgctcc gcgtctgaca 420 ccttcaaacg ccaccgcgcg ttcactgckt gtcaggattg ycttctgagg cgcaagtcat 480 ttattagatg aagaaaagat tgaygcagct tctcctactg cagcaaattc agtttttact 540 gttgatattt ggcgccagtt aatcaggaag tgacgatttt gttcgctttg actcgttgga 600 tggaaacgct gctttattcg cacatctttt atgcgataat ccagttttgc gcataaagtt 660 cattcgcatt tttggatgga aacatagcta ntg 693 // ID Gypsy147-I_DR repbase; DNA; ZEB; 6930 BP. XX AC chr15; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy147-I_DR; KW Gypsy147-LTR_DR; Gypsy147_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6930 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1555-1555 (2008). XX DR Genome; chr15; Positions 37964124 37971053. XX CC Positions [3231-3770] - Reverse transcriptase CC Positions [5223-5699] - Integrase core CC 'TTTA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 580..2004 FT /product="Gypsy147-I_DR_1p" FT /translation="MFGDNDRDDFSPVKPSNVTFPMGRGGGLVNFHDEKVS FT EINPRCYGRGRGLANMSLGGMHTIDSAIGASAGPTASSTPITRPDAERAIS FT PESLGSIISDLAQKIGESISASLNLVHQPSPVQPNRLLSEQPSQSIDASTL FT KVIVQKDTAPPPFFRGDKSDSFTIHEWEDMMSSYLNRMECKTQSEVRDLIM FT SRLAGRARDVVKVSLRSRPELSASKLQSAVFDILKHNFSELAYSNMPMADF FT YNTVPRANEGPMDYWIRLNKAIDVADECLHRRNKSIEDPAAEAVMMFISHC FT PDPTLALSFQFKPAEKWTAAEVQERLDNHQRDLKRVSSRANHKLSLSACSQ FT CTVSTSYDCVSSPPRQPAPTPAQSLPPVSLNPKAVPFSPDTSFQHMVGMMD FT RVLSLCTASLTSPSQVHAGKAADFKNDLNPPCRVCKSKEHTTYLHCRRHRL FT CRLCFSPGHFKSDCPKSAYPVTPSDTAQLN" FT CDS 2034..6626 FT /product="Gypsy147-I_DR_2p" FT /translation="MGKFNVNSLTDPADAQNVFMNCCKLLPEDKTVLYVGS FT QRVNGASELFYAPVTVGGQHILKGLLDSGSMSCTLSEEGEAKLQASGLLPH FT QEIPKNVVLVGCGGLTTQPKCMYDLEVVIYGFKFVVPTLVVPGQKDEFIIG FT SNVIKCILQKMKSSEDYWKLASCDNRDPECEQLLELLTCISRWCGPKQPDK FT LGTVKLCQAVTLLPRREYLVWGKMPANVPVSPGSTVIVEPTTARSAPKNIL FT VGRVITPMWGDRWVPMKILNPNPTPVTLRRNSKLADVSSCLAVEDLTVTQG FT LSKTSCNVPNSFPAENVPSPDVVQMLKDYGLGDINIDACDVSMSWKRKLAA FT LLVSYHDVFSKDKLDCGEAKDFVHRIHLFDERPFRLPYRRVPPAHYQKLRE FT VLSEMEEKGIISKSVSDYASPLVMVWKKDGNLRVCTDFRWLNAKTVKDAHP FT LPHQSDCLAALGGNAFFSTMDLTSGFYNIPLHDSDRRYTAFTTPMGLYEYN FT RLPQGLCNSPASFMRMMLSVFGDLNFTSLLCYLDDLLVFAPSESEALNRLE FT IVFSRLRSSNLKLAPKKCHFLRKSVKFLGHVIDSTGVSVDEEKVKVISAFQ FT KEDLMKDDRVTPLQKKIRSFLGMVLYYQAFIPNCSRIARPLFNLTTGQKRS FT AKNAGGGRRAGTFRELTPQDWTPDCDSAFSELKTALVDSVVLAHPDFERPF FT LLCTDASLEGLGAVLSQVPVGEDKARPIAFASKALSRSQTKYPAHRLEFLA FT LKWAVCDKFSHWLKGHEFTVWTDNNPLTYIMTKPKLDACEQRWVSKLAPYS FT FEIKYVPGKLNVVADALSRDPFTRPVSQRLLSEPYSDLLRHVHSVDDGNVQ FT EVFRLTCQPQSVESGPVFGLDTSMSADDVSSILSSCDEWETVTECRAASIA FT EHLDSLINSDEVFHSLSLADLQDHQHKDPVIARVCYYIDRKRRPSRRERAN FT EHLYVLRILKQWDKLSVLNGILYRAIKDPLTKHKRFQFILPESLKSQALSG FT LHDLAGHQGQPRTLALARQRFFWLDMERDVRDYVKKCPRCVFSKTPEPAAR FT ATLESIKTTAPLELVCIDFWSAEDRHNKSVDVLVITDHFTKLAHAFPCQDQ FT TAKTVAKRLWDSFFCVYGFPQRIHSDQGASFESELMAELLALAGIDKSRTT FT PYHPMGNGGTERFNRTLGNMLRSLPPRPKQKWPQMVQTMTFVYNCTVHETT FT GFAPFYLMFGRVPRLPVDLMFQSVLRDDSICDYDQYVQSLISDLQTAMSVA FT QTNSAHKQRHQSSQYNKRAKGLPLAIGDQVLVANKGCRGKRKLADKWEPVV FT YTVVAAKPSIHVYKISDRAGIERTVHRNLLLQVNFLPLPNTDSDGPGLEVA FT GSLTCSVAPQSDLTGSDGLTARTNSDICSDVEEIEPAEDTDEGLNEATGLS FT QVDCMSNVSNAATCNDDRTSSWVHSQLPSQQVPNSPLPVPVADDCTVAGAG FT ATLSPPQSLPDCVLPVVQNPVVDPNAADHYSTRFGRIIKPVCRLIESMVQI FT ETLLSSDSGSAVIHV" XX SQ Sequence 6930 BP; 1782 A; 1562 C; 1620 G; 1966 T; 0 other; cgtggtgccc gtgaccattc cggatcatct gatcagctct tcgcagatcg ccgtggttga 60 gtgacgactc cgcatcgccg atccactccg ggtttttcct tcaccgccat tgctcgcagc 120 agctccgcgc tcgcgataag gagatcgcgc gcacgcgcgc gtgtgtgtgt gtggaacagc 180 gtttgacatc gcggggcaga ctgtttcgtc agagacgcgc taaactggct gatcgtcgtt 240 catcgttgcc ttttaatcgc gattgactgt gtgaggactg cgtgggacgc ttgagtgacc 300 tcagtgcatc atatccttta tccttgcaga aacaaattat tgaatgttac tgttttaaat 360 cagcaatgtg tctgactgac tttgaatgag gaaatattat actcttgtga cagagctttt 420 ttccctttga acactttttc catcttaacg tttgattgag tgagtttgat atcaaatttg 480 accatctttt atggtaaatt ttgcggtaaa tctattttgt agcgtttcaa tttgtgataa 540 ttcttaatgt tttatgtgtt tgcattttac aattgaaaca tgtttgggga taatgacaga 600 gacgatttta gtccagtgaa gccatcaaat gtgacctttc caatgggcag gggtggtggg 660 cttgtcaatt ttcatgatga gaaggtatct gaaataaacc ctcgttgtta tgggagaggg 720 agggggttag ctaatatgag tttgggcggt atgcatacta ttgattcagc aattggtgct 780 tcagcaggcc caacagctag ctccacccct atcacacgcc cagatgctga acgtgctatt 840 tcccctgagt ctttaggtag tataatttca gatttggctc agaaaatagg tgaaagtatt 900 tcagctagcc ttaatcttgt gcaccagccc agcccagtgc agcctaatcg tcttctgtca 960 gaacaaccca gtcagagcat tgatgcttcc accctaaaag ttatagttca gaaagacact 1020 gcacctccac cattcttcag aggcgataaa agtgattcgt ttaccatcca tgagtgggaa 1080 gatatgatga gcagctattt aaacagaatg gagtgcaaaa cacagtcaga agttcgtgat 1140 ctcattatga gcagactagc tggcagagcg agagacgttg ttaaggtgtc cctgcgaagc 1200 cgccctgaac tgagtgccag taaacttcaa agtgcagtgt ttgacatatt aaaacacaac 1260 tttagtgaat tggcttactc aaatatgccc atggctgact tttataatac tgtcccaaga 1320 gcgaacgaag gtcccatgga ttactggata cgccttaaca aggccataga tgttgctgat 1380 gagtgcttgc atagacgtaa taagagcatt gaggatcctg ctgctgaagc tgtcatgatg 1440 ttcatttcac attgtccaga cccaactctt gctctgtctt ttcagttcaa accggctgag 1500 aaatggactg ccgctgaagt tcaggagcgt ctggataatc atcagagaga tttgaaacga 1560 gtttcatcca gagcgaatca caaattgtct ctgtctgctt gtagccagtg cactgtttca 1620 actagctatg attgtgtgtc cagtccacct cgtcagcctg cacccacacc tgctcagtct 1680 cttccccctg tctctcttaa tcccaaagct gttccattct ctcccgacac cagctttcaa 1740 cacatggttg gcatgatgga tagagtccta tcactgtgta ctgcctcttt aacgtcaccc 1800 agtcaggttc acgcaggcaa agctgcagac ttcaagaacg acttaaatcc cccatgtaga 1860 gtctgtaagt caaaggaaca caccacatac ttgcattgta gacgtcacag actgtgtcgt 1920 ctctgtttca gccctggcca ttttaaaagc gactgcccta aatctgccta ccctgtgact 1980 ccctcagata ctgcccagtt aaactaacca gcccatgtga cgagaggggc aacatgggca 2040 aatttaatgt aaactccctc acagatccag cagatgcgca aaatgttttc atgaattgct 2100 gcaagttatt accagaagac aaaaccgttt tatatgtggg gtcacagaga gtgaatggag 2160 cgagtgagtt gttttatgct cctgttacag ttggtggcca gcacatattg aaggggttat 2220 tggactctgg tagtatgtct tgcacgctca gtgaagaggg tgaggccaag ctgcaagcta 2280 gtggtctcct tccacatcaa gagattccca agaatgttgt tctggttgga tgtggtggac 2340 taacaacgca gccaaagtgc atgtatgatt tggaagttgt catctatgga tttaagtttg 2400 tcgtcccaac tcttgtggtg cctggacaaa aagacgagtt cataattgga agcaatgtta 2460 ttaaatgtat actgcagaag atgaaatcga gtgaagacta ctggaaactt gcctcctgtg 2520 ataatagaga tcctgaatgt gagcaattgc ttgaacttct tacctgtatc tcccgctggt 2580 gtggccccaa acagcctgac aagctgggaa ctgtgaagct ctgtcaagca gttactcttc 2640 ttcccagacg cgagtatctg gtctggggta aaatgccagc taatgtccct gtttctccag 2700 gaagcactgt aattgttgag ccaaccactg cacgctcagc ccctaagaac attctggttg 2760 ggcgagtaat aacacccatg tggggtgatc gatgggtgcc aatgaaaata cttaacccta 2820 atccaacacc agtgacactt cgtcgcaact ctaaacttgc cgacgtttcc tcctgtctgg 2880 cggtagaaga tctcactgtc acacagggat tgagtaaaac ttcttgtaac gttcccaatt 2940 catttcccgc tgaaaatgtc ccatcccctg atgtggtgca gatgttgaaa gattatggtc 3000 tgggggacat taacattgat gcatgtgatg tttctatgtc atggaagaga aaactcgcag 3060 ctctcctggt gtcatatcat gatgttttct ccaaagataa gttggattgt ggagaagcaa 3120 aggattttgt gcacaggatt catctctttg atgagcgccc attccgtctc ccctatcgcc 3180 gcgtccctcc tgcgcactat caaaaactga gagaggtgtt gtcagagatg gaggaaaaag 3240 ggataatcag caagtctgtc agtgattatg cgtcaccttt agtcatggtc tggaaaaaag 3300 acggtaatct gagggtttgc acggattttc gctggctcaa tgctaaaact gtcaaagatg 3360 cccacccttt accacatcaa tcagattgtc tcgctgccct agggggtaat gccttcttca 3420 gcaccatgga tttgacttca gggttttaca acatcccact ccatgactct gaccggcgat 3480 acactgcctt cactactccc atggggctgt acgagtataa tcgtctccca cagggccttt 3540 gcaatagccc tgcgtctttt atgcgcatga tgctcagtgt ttttggtgac ctcaacttta 3600 ccagtctgct ctgttacctg gacgatttgt tagtctttgc tccatctgaa tcagaggcac 3660 taaatcgcct ggagatcgtg ttttctcgat tgagatccag caacctgaag ctagctccta 3720 aaaagtgtca ctttctgaga aagtctgtaa agtttctggg acatgtgatt gatagtactg 3780 gtgtttctgt tgatgaagag aaagtgaaag taatctctgc ttttcaaaaa gaggatttaa 3840 tgaaggatga cagagtcaca ccattacaaa agaaaatcag atcatttctt ggtatggtcc 3900 tgtattatca agccttcatc ccaaattgtt cacggatcgc caggccctta ttcaatttga 3960 ccacaggcca aaaacgttct gcgaaaaatg caggtggcgg taggagagca ggcacttttc 4020 gggagttgac acctcaggac tggaccccag actgtgattc ggcctttagt gaattaaaga 4080 cggcccttgt tgacagtgtg gtgctggctc atccagattt cgaacgtccc ttcttactct 4140 gtactgacgc ttcgttggaa ggcttaggag ctgtgctctc tcaagttcct gttggtgagg 4200 acaaggcgag accgatagcg ttcgctagca aggctctgag tcgcagccag acaaaatatc 4260 ctgcacacag attggaattc ttggctctga agtgggcggt gtgtgacaag ttcagtcact 4320 ggctgaaagg ccatgagttt acagtgtgga cggataacaa tccactgaca tacatcatga 4380 ccaagccaaa actggatgca tgtgagcagc gctgggtctc gaaactcgct ccatattcat 4440 ttgagatcaa gtatgttcct gggaagctta acgttgtggc agatgcactc agtagggatc 4500 cgttcaccag gccagtgagt cagcggcttt tgtcagagcc ctattcagat cttttgagac 4560 atgtgcacag tgttgacgat ggaaatgttc aagaagtgtt cagacttacc tgtcaaccgc 4620 aatcagtgga atcaggtcct gtttttgggc tggatacatc aatgtccgct gatgacgtgt 4680 cttccattct gtcctcctgc gatgaatggg aaacagttac tgaatgtaga gcagcatcaa 4740 ttgcagaaca tcttgactct ttgatcaaca gtgatgaagt gtttcattcc ctttctcttg 4800 ctgacttgca agaccatcag cacaaagacc ctgtcattgc tagagtgtgt tattacatcg 4860 acagaaagcg cagaccatca agacgtgaac gagctaatga acacctgtat gtgttacgga 4920 tccttaagca gtgggacaaa ctttcagtgc ttaatggaat cttgtacaga gcaattaaag 4980 acccgctcac gaaacataag aggtttcagt tcatcttacc tgaatctttg aaaagtcaag 5040 ctctttctgg ccttcacgac ctggctggtc atcaagggca accacgcact ttggcattgg 5100 cacgtcagag gtttttctgg cttgatatgg agcgtgatgt gcgggattat gtgaagaagt 5160 gtcccaggtg cgtctttagc aaaactcctg agcctgctgc gagagccact ttggagagta 5220 ttaaaaccac agccccattg gagctcgttt gcattgactt ctggtctgct gaggaccgac 5280 acaataaatc tgtggatgtt ttggtgatca cagatcactt cacgaaacta gcacatgctt 5340 ttccatgtca ggaccagaca gcaaaaacag tagccaagag actgtgggat agcttcttct 5400 gtgtttatgg atttccccaa cgcatccatt ctgatcaagg tgcttcattt gagagtgaat 5460 tgatggctga gcttcttgcg ctggcaggta tcgacaagtc tagaaccact ccttaccacc 5520 caatggggaa tggtggtact gagcgattta acagaacact aggtaacatg ctgaggtctc 5580 ttccacctag acctaagcag aaatggccgc agatggtaca gaccatgaca tttgtgtaca 5640 attgcacagt gcacgaaaca acggggtttg ccccttttta cttgatgttt ggcagagtgc 5700 cacggctgcc agtggaccta atgtttcaga gtgttctgcg agatgactcc atctgtgatt 5760 acgaccagta cgttcagtcg ctcatcagtg accttcagac tgctatgtcg gtagcacaga 5820 caaattcagc tcacaaacaa agacaccagt ccagtcaata caacaaaaga gcaaaaggcc 5880 tacctcttgc cattggtgac caagtgcttg tagccaacaa gggatgtagg gggaaacgaa 5940 aacttgctga caaatgggaa cctgttgtgt acactgttgt tgctgcaaaa ccatccatac 6000 atgtctataa gatcagtgac agagctggta ttgagcgtac agtgcatcgc aatctacttt 6060 tgcaagtgaa ctttctcccc ttgcctaata cagactctga tggtcctggt ctggaggttg 6120 ctggatcact tacctgcagt gtcgctccac agtcagactt aacaggctct gatggtctaa 6180 ctgcccgaac aaacagtgat atttgctctg atgttgaaga gattgagcct gctgaggaca 6240 cagatgaggg actgaatgag gctacgggcc tctcacaagt tgactgcatg tccaatgtct 6300 ccaatgctgc tacatgcaat gatgaccgca ctagttcatg ggtgcattct cagttgccta 6360 gccaacaggt gccaaattca cctcttccag tgcctgtggc cgatgactgc actgtggctg 6420 gtgctggtgc aaccctgtct cccccacagt ctctccctga ttgtgtctta cctgttgtac 6480 agaatcctgt agtagatcca aatgcagctg atcattactc taccaggttt gggaggatta 6540 ttaaaccagt gtgtagactt attgaatcaa tggttcaaat tgaaaccctg ttgagttcag 6600 attctgggtc tgcagttatt catgtctgaa tttaaatttc agccccaaat tcctgaaggt 6660 tttggaattg agtgtttttc tcctatcttt ctgatatctt tctttaacgg actgtggtat 6720 aagtttacaa ggtgtaatgg gcaccttgat aatgtctata aaatgtgctg aaatttggcc 6780 aatttctaaa catttttatc cttttatagg gtagctttta atgctgtctg tcttcacata 6840 tttttggtgt gtggaccaag taattcttat gaccagtgta cagagttttt catgttttct 6900 cagaattttg tgtaattcta ggggggtgag 6930 // ID Gypsy154-I_DR repbase; DNA; ZEB; 3735 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy154-I_DR; KW Gypsy154-LTR_DR; Gypsy154_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3735 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1569-1569 (2008). XX DR Genome; chr3; Positions 407418 403684. XX CC Positions [2618-3148] - Integrase core CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 750..2318 FT /product="Gypsy154-I_DR_2p" FT /translation="MPAFSCLPAPTPPPEPMEVGGLHLSTAERQRRLTNRL FT CMYCGAADHFFGKCPIKGASSPVDRGVLASTISLSPTPLRRTTLPVVLRHG FT GTSTSCEAFIDSGAEGNFMDETWALRHSIPSVEMDKPPPVYALNGQLLHQV FT RRITIPVSLTVSGNHQETIRFYLFDSVLSPLILGHPWLSKHNPHIVWSDGT FT VMSWSLTCHDKCLVSAVPSVSSVSVLQDEPGDLSGVPEEFHDLRAVFSRSR FT AVSLPPHRPYDCSFDLVPGSTPPRRRLYSLSKPEHEALEQYLSESLAAGTI FT VPSTSPAGAGFFFVKKKDGSLRPCIDYRGLNSITIKNCYPLPLLASAFEVL FT QGAKVFTKLDLRNAYHLVRIKAGHEWKTGFNMPLGHLEYRVLPFGLVNAPS FT VFQALVNDVLRDMLNIFVLVYLDDILIFSPSLETHNHHVRRVLQRLLENRL FT FVKAEKCIFHADSVPFLGYIVSAEGISIDPAKQSLTLPSLIVRVQKTSNQM FT RYHVCSTPLITTPPQNPFWLKAVLSEL" FT CDS 2519..3706 FT /product="Gypsy154-I_DR_1p" FT /translation="MIRDTRSYVMSCPVCAQSKSSNLPPSGLRRTLPIPSR FT PWSHIALDFVTGLPPSANNTIILTVVDRFSKSAHFIPAPKLPSAKETAVLV FT FKHVFRIHGLPVDMVSDRGLQFTSCFWREFCRQIGATASLSSGFHPQTNGQ FT AEHVNQILGRLLRTLAARNPSSWRAGLNTRKTPLQRHQLGFLLSTVVWVTN FT LRYSPTKSGRRQSLQYRRTLVAAVKLGRPYTPHYKNPEKTPVATLIINPIK FT TPRYIRGQRVWLSTRHLPLQSQSRKLAPKFIGPYTIIKIINLAAVQLRLPH FT LLCRIHPIFHVSCVKPVVRASNHPTPASAPILIDDVPAYKVKKILASRRRG FT RGIQYLIDWKGYGPEERSWVLARDILDCSLIDDFSHSCQSTSLGALLGGGV FT LS" XX SQ Sequence 3735 BP; 810 A; 1101 C; 842 G; 982 T; 0 other; gaataatctg accaaaacga tggaggtggc tgagagaggc tccatacact tcgcggtcgt 60 acaccaggga gcaatgctcg gctgccacga tgaaaaactc tctgccacca gacttgccgt 120 ggacagtctg tctgcacagg taacggagct taccaaccag ctgcacagat tttgccatgg 180 gggtccatct aatgctgaac agccgatgtc tgagccccga gtgaacaacc ccccacgtta 240 tacgagtcaa ccctctgagt gtcggtcttt tttgacccag tgtgaggtag tgttctcgct 300 acagccatct acctatgcca ccaatcgctc ccgagtggca tttattctct ctctgctcac 360 tggacaggcg agagaatggg gagccgccgt atgggaggct gaatctgact gctgctctca 420 ctacgcctct tttaaggagg agatgttgaa agtgttcggc cgatccgtgc tggggagaga 480 ggcttcccgt cagctcgcag cgctccgcca agggggtcga tccgcggctg actttgccat 540 ccagttccgc accctggcgg ccaccagtga atggaacgag ccagcccttg tcgctcgttt 600 tttggatggt ctctcctccg agttgaagga ggagctctat ttccatcgaa ctaccctatt 660 ctctggactc ccttatagag agggcgattc gtctggataa gcgattggag ttgcgtcacc 720 attccagagg ttcatctgag gagaggagga tgcctgcttt ctcttgtttg cccgctccca 780 ccccccctcc ggagcccatg gaagtgggtg gtctacatct ctctactgct gaacgtcaac 840 gccgcctgac caacaggctc tgcatgtact gtggggccgc tgatcatttc ttcggcaagt 900 gcccgataaa aggggcaagc tcgccagtag accgaggggt tctggcgagc acgatctcct 960 tatcccccac tccactgcgg cgaaccactc tccctgttgt actgcgacat ggtggcacct 1020 ccacttcctg cgaggcgttt attgactcag gagccgaagg gaacttcatg gacgagacgt 1080 gggcattacg tcatagtatt ccctccgtcg agatggacaa gccacctccg gtgtacgcac 1140 tcaatggaca actcttgcac caggttcgac gtataaccat acctgtgagt ctcactgtct 1200 ctggaaatca tcaagaaaca ataagatttt atttatttga ctctgtcctc tctccattaa 1260 tactcggcca cccctggcta tctaaacata atccccacat agtctggtcc gatggtactg 1320 ttatgtcatg gagtcttacc tgtcatgata aatgtcttgt ctctgctgtt ccttctgtct 1380 cctctgtctc tgtgttgcag gatgaaccag gtgatctttc gggcgtgcca gaggagtttc 1440 acgatctgcg agcggtcttc agtcgttccc gggccgtctc tcttcctcct caccgacctt 1500 atgactgtag cttcgatctc gttccaggta gtactccccc taggcggcga ttgtattccc 1560 tatctaagcc tgagcacgaa gcccttgagc aatatctctc cgaatcactc gcagctggca 1620 ccatagttcc ctccacgtcg cccgctggtg ctggtttctt ttttgttaag aaaaaagatg 1680 gctcgctgcg cccgtgtatt gactatcgag gtcttaacag tataaccatt aagaattgct 1740 atccgctgcc cctcctggca tccgcttttg aagtcttaca gggtgctaag gttttcacta 1800 aattagacct tcgcaacgcg tatcatttgg tccgtatcaa ggccgggcat gagtggaaga 1860 ccgggtttaa tatgccgctc ggtcacttag aatatagggt ccttcccttc gggctggtaa 1920 atgccccttc tgtattccaa gccctcgtta atgacgtact tagagacatg cttaacatct 1980 tcgttttggt ctaccttgac gacatcttaa tcttttcccc ttcgctcgag acccataacc 2040 atcacgttcg ccgcgtacta caacgcctgt tagagaatcg actgttcgta aaagcggaaa 2100 aatgtatttt ccatgcggac tcggttccgt ttctaggcta tatagtctcc gccgagggta 2160 ttagcataga cccagccaaa caaagcttga ctttaccatc tcttatcgtc cgggttcaaa 2220 aaacgtcaaa ccagatgcgc tatcacgtct gttcaacccc tctgataacg actccacccc 2280 agaacccatt ctggctaaag gctgtgttgt ctgagcttta gcctggggaa tcaaacgtaa 2340 ggttcaacgg gctctcacac gggttacaac cccccagaga tgccccgaga ataaattatt 2400 cgtccccaaa tctctacgtc cggctgtgct ccgctgggca cacgcttcca aattagtagc 2460 ccacccgggg gttaaaggca ccctaaccgc gatccgtcaa catttctggt ggccgtctat 2520 gatacgcgac acgcgttctt atgtcatgtc ctgcccagtt tgcgctcaat ctaagagtag 2580 taaccttcct ccctccggtc tccgcagaac tttacccatt ccctcccgcc cgtggtctca 2640 catcgcctta gacttcgtta ctgggttacc accttcagcg aacaatacca ttatccttac 2700 agtagtcgat cgcttctcga aatccgctca ttttattccc gctcccaaac tcccctctgc 2760 gaaagaaact gccgtactgg ttttcaaaca tgtctttaga atccatggtc taccagtaga 2820 catggtctct gacagaggtc tgcaattcac ttcgtgtttc tggcgagaat tctgccgaca 2880 gataggggct acagccagtc tgtcatcagg ctttcaccct caaactaatg ggcaagcgga 2940 acacgtgaat caaattctcg gtcgcctcct ccgcacctta gcagcgcgga atccctcctc 3000 gtggagggct gggctgaata cgcgtaaaac tccactccaa cgtcatcaac tgggctttct 3060 cctttccaca gttgtctggg ttaccaacct ccgctattct cccaccaaga gcgggaggcg 3120 tcagtccctt cagtacaggc gtacattagt cgctgccgta aaacttggaa gaccgtacac 3180 tccacattat aaaaatcccg agaaaacacc cgtcgcaacg ctaatcatca accccatcaa 3240 gactcctaga tacattcgcg gccaaagagt ctggctttcc acccgccacc ttcccctcca 3300 gtcccaatca cgcaaactag ctcccaaatt tattggtcct tataccatta tcaagatcat 3360 caatctcgct gcggttcaac tgcgcttacc tcatttgctt tgtcgcatcc acccgatatt 3420 ccatgtttcg tgtgttaaac ctgtcgtccg cgcttccaac cacccaaccc ctgcctccgc 3480 tcctatcctt attgacgacg ttcctgctta taaagtcaag aagatcttag cttcccgccg 3540 tcgcggacgt ggtattcagt atctgattga ctggaagggc tatggcccgg aggagagaag 3600 ttgggtcctg gctcgggaca tcctggattg ctcgctcatc gatgacttct ctcactcgtg 3660 ccagtcaact tctctgggag cgctcctcgg aggaggggtt ctgtcatgaa tctggtttgt 3720 taattgtcta tcttt 3735 // ID Gypsy-24-I_DR repbase; DNA; ZEB; 4529 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portion of the Gypsy-24_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-24-I_DR; Gypsy-24-LTR_DR; Gypsy-24_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy-24-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4529 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-24_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 21-21 (2005). XX DR [1] (Consensus) XX CC Gypsy-24-I_DR is an internal portion of the Gypsy-24_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-24-LTR_DR. Gypsy-24_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1464-aa Gypsy-24_DR1p (pos. 114-4505) CC composed of the gag, protease, reverse transcriptase, and CC integrase domains. The consensus sequence was built from CC several copies less than 4% diverged from each other. XX FH Key Location/Qualifiers FT CDS 114..4505 FT /product="Gypsy-24_DR1p" FT /translation="MEDRAASVDYSDEAFTHLRDPVANLIDTFSELYLDSE FT EEKESKDESPQQGQEDISKIDDFPSPPPPLDFHDDQLQFENDEHDQRIHSI FT EKHLADLEHRVSSFVTAETLNANLRACEDRINYYVQRELDRVQQKCYAKVE FT DLSRSIVDCLKRRDKQLEQQFKAIKPIMSTPMHSSIVTSHKTSQTPSRIDA FT TQDTTKQGTYLSPSSFPTSIKLELPTFGNADSEDPLDFIERFEEYDELRPL FT HHEEMLAALSVSLKGTAKSWWKAEKSSITDWLSFKEKFLFSFLNEDHKEVA FT AQKLADYKQKVNESIRDYAFNYRAMSLKINPAMSESELVQATLRNCNPRLA FT SLLRGTVKSIDDLVRLGTQIEKDWSESRKRWSQGKEEDQKKKSSAVKGQPN FT RLMLIDPCLCDNVLQAPVILNHSYFNAVIDTGSTFSLLQKKLWERLKKKDE FT QLTRSDQTFMLANGQSQKTLGKVLWACEIYGVKHEVTFHVMDDDSLAVPVI FT LGLDFLKKAKVTIDFNVSRIYLPDANSSHPVCFNKTTEHAAVKFYAAQEEV FT GVSHDERLKLIDQALENSHTTTKVKSQLKALMCDWPSVCTNKLGRTDLIKH FT VIKTTDDLPLRKRPYRVSKAKNDFIEEQIQELLQQKIIKPSTSPWASPVVV FT VEKKDGGSRLCIDYRGLNAKTFLDAYPMPQITDILDSLQGAKVFSTLDLKS FT GYWQLEMDPASMEKTAFVTASGLYEFSSLPFGLKNAAASFQRLMEQVLRDL FT KNKCCMVYIDDIIVYSPDVQTHLNHLEQVFHSLHKAGLTLNLKKCKFICAS FT LDYLGHTISADGVNVNSDKVEAIRTFPIPKTLKELQRFLGLAAWYHRFIPD FT FSSKTAPLHLLKRKDVKWNWSDECQRAFDVIKDELTRAPVLCTPNFDLSFK FT VQTDASDVGLGAVLTQEVEGQERVIAYASRLLRGAEKSYSASEKECLAVVW FT AVEKWHHYLEGRPFEVITDHASLVWLFQHPKPSSRLERWTIRLQGYHFTVR FT YRKGQCNIVPDVLSRREEVNSQAVLLHTPAKKNFSTVSCDLPLDLSQIACE FT QEKDTECQEIMVKAKSQRTTDLKRTHYICKNGVLFRSIPDSKEGQRLQVVI FT PEKLREVTLSYAHDSPLSGHLGRFKTLMRLLEFAYWPSIRTDVWEHCKICE FT KCQRYKPTNLKPAGDLQSVPIVEPGYMLGMDIMGPFPRSSRQNEYLLVIVD FT YFTKWVEVFPMRTAKSNTIVRILIEEIFTRWGTPAFIVSDRGRQFTSNLLD FT QLCKQWQITPKLTTAYHPQSNLTERVNRNLKTMIAMFVEQNHRTWDQWIYE FT FRFALNTAWHESTGYSPAEIALGRQLKGPLQRALHNPPDPNQPAYNTLERQ FT KILYDVVRDNVEKAQSKQRKYYNMKRRTQNFEEGDLVWVRTHPLSKADDAF FT MAKISPKWKGPARIVKKLGPVNYKVTMLSDVAQVDTYHTQNLKIWHGADF" XX SQ Sequence 4529 BP; 1467 A; 856 C; 984 G; 1222 T; 0 other; gatggcaccc gaacagggac attcgtgaat tgtattttac gaagtccttt gaattataaa 60 gagatatttc actgatactg aaattcttca caagcaaata ttacaaattt acaatggagg 120 acagagcagc tagtgtggat tattcagatg aagcctttac tcatctcaga gatcctgtag 180 ctaatcttat agatactttt agtgaattgt atctagattc agaagaagaa aaagaaagta 240 aagatgaatc tccacagcag ggacaggaag acattagtaa aattgatgac tttccatcac 300 cacccccacc ccttgacttt catgatgatc aattacagtt tgagaatgat gaacatgatc 360 agcgtataca tagcattgaa aagcacttgg ctgatcttga gcatagagta agcagttttg 420 tcaccgctga gacactgaat gcaaatttaa gagcatgtga agataggatt aactactatg 480 tacagaggga gttagatcgt gttcagcaaa aatgctatgc taaagttgag gatttgagta 540 ggagtattgt ggattgcttg aagcgtagag acaaacagct agagcaacaa tttaaggcta 600 tcaaaccgat catgtccact cctatgcatt ccagcatagt tacttctcac aaaacatctc 660 aaacaccctc tagaattgat gcaactcaag acaccacaaa gcaaggcact tacctttctc 720 catcgtcctt cccaaccagt attaaacttg aactgcctac ttttggaaat gcagattcag 780 aagaccccct tgatttcatt gagcggtttg aagaatatga tgaacttcga cctctacatc 840 atgaagagat gttagcagct ttatctgtaa gtcttaaagg tacagctaag agttggtgga 900 aggctgagaa gagcagtatt acagattggt tgtcattcaa agaaaaattc cttttttcat 960 tcttgaatga agatcacaag gaagtggctg ctcagaaatt ggctgattac aaacaaaaag 1020 tcaatgaaag tataagagac tatgctttta actatagagc aatgtcactg aaaataaacc 1080 ctgcaatgtc tgaatctgaa ttggtacaag caacattgag aaactgtaat cctagattgg 1140 cttcattatt aagaggaaca gtgaaaagta ttgatgatct ggttcgtctc ggtacacaaa 1200 tagagaaaga ttggtcagaa agtaggaaaa gatggagtca aggaaaggaa gaggatcaaa 1260 agaagaaatc ttcagcggtg aaaggacaac caaataggct catgcttatt gacccttgtt 1320 tgtgtgataa tgtactacag gctcctgtta tcttgaatca ctcatacttc aatgctgtaa 1380 ttgatacagg aagtacattt tctttgctgc agaagaagtt gtgggagaga ttgaagaaaa 1440 aagatgagca attgactaga agtgatcaaa cgttcatgct cgcaaatgga cagagtcaga 1500 aaactctagg taaagtatta tgggcatgtg aaatttatgg agtgaagcat gaagtcacct 1560 ttcatgtgat ggatgatgac agtttggctg ttcctgttat attaggcttg gattttctca 1620 aaaaggctaa ggtaaccatc gactttaatg tttcacgtat ctacctacct gatgctaaca 1680 gtagtcaccc tgtatgtttt aacaaaacaa ctgagcatgc tgctgtgaag ttttatgctg 1740 cacaggaaga agttggagta agccacgatg agaggttaaa actgattgac caagccttgg 1800 aaaattctca cactacgacc aaggtaaaga gtcaattaaa agctcttatg tgtgattggc 1860 catcagtatg tactaacaaa ctgggccgta cagaccttat caagcatgtg atcaagacca 1920 ctgatgactt gcctttaaga aagagaccat atagagtttc taaagccaag aatgatttta 1980 ttgaagaaca gatacaggag ttgcttcaac aaaaaatcat caaaccttct acatctcctt 2040 gggcttcacc tgtggtagtg gtagagaaaa aggatggggg atctagatta tgcattgact 2100 accgagggct taatgcaaaa acttttctag atgcttatcc tatgcctcaa atcacagata 2160 tactggactc tcttcaagga gctaaggtgt tcagcacgtt ggacttaaag agtggatact 2220 ggcagttaga aatggatcct gcaagtatgg aaaaaacagc ttttgtcact gcttcggggc 2280 tatatgaatt ctcgtctctt ccctttggcc ttaaaaatgc agctgcgtct ttccaacggc 2340 tgatggaaca ggtactgaga gatcttaaaa acaaatgttg tatggtttat atcgatgaca 2400 ttattgtata ctcacccgat gtccaaactc acctgaatca tcttgaacaa gtgtttcaca 2460 gcctacacaa agctggtctc acacttaacc taaagaaatg taagttcatt tgtgcttcac 2520 ttgactactt gggccatacc atctcagcag atggagtcaa tgtgaattca gacaaagtgg 2580 aggctatcag aacatttcca attcccaaga ccttaaagga attacaaaga tttttaggac 2640 tggcagcttg gtaccatcga tttattcctg atttctcctc caaaacagct cccttacacc 2700 tcttgaagag gaaagatgtg aagtggaatt ggtctgatga gtgtcaacgt gcctttgatg 2760 ttatcaaaga tgagctcact agagcacctg tgttgtgtac acctaacttt gacctttcct 2820 tcaaggtaca gactgacgca agtgatgtgg gtttaggggc tgtgctcact caagaagtgg 2880 aaggacaaga gagagttatt gcctatgcat ctcggttgct cagaggggct gagaagtcct 2940 attccgcctc agagaaagag tgtctggcag tagtgtgggc agtagagaag tggcatcatt 3000 accttgaagg tagaccgttt gaggtaatca ctgatcatgc ttccttagtc tggcttttcc 3060 aacatcctaa accttcatct agattggaaa gatggacaat cagactacaa ggataccatt 3120 ttactgtaag ataccgaaaa ggtcagtgta acatagtgcc cgatgtgttg tccaggagag 3180 aggaagtgaa ctcacaggct gtattactgc atacaccagc caaaaagaat tttagtactg 3240 tctcatgtga tctgcctttg gacttatcac aaattgcttg tgaacaggaa aaggataccg 3300 aatgccaaga gatcatggtt aaagccaaaa gccagagaac cacagatctg aagaggactc 3360 attatatttg caagaatgga gtcttattca ggagcattcc agattcaaag gaaggccaaa 3420 gactacaggt tgtaatccct gagaaattga gagaggtaac tttgtcttat gcgcatgaca 3480 gtcctttaag cggtcatttg ggtaggttca aaactcttat gcgactgcta gaatttgcat 3540 attggccatc catacgtact gatgtttggg aacattgcaa aatttgtgag aaatgtcagc 3600 ggtacaaacc tacaaaccta aaacctgctg gtgacctaca aagtgtgccc atagttgaac 3660 ctgggtatat gctgggtatg gacatcatgg ggccgtttcc acggagctca cgccaaaatg 3720 agtatctact agttatagtg gactatttca ctaagtgggt agaggttttt ccaatgagaa 3780 ctgccaagtc taacactatt gtacggattc tcatagaaga aatattcact agatggggaa 3840 ccccagcctt catcgtatcc gaccgaggta ggcagtttac ctccaacctg ttggatcagt 3900 tatgtaaaca gtggcaaata actccaaaac tcaccaccgc ctatcatcct caatcaaatc 3960 ttacagaaag agtcaatcga aatttgaaaa ccatgattgc catgtttgtt gaacaaaacc 4020 accgtacctg ggatcagtgg atatacgagt tcagatttgc tctgaatact gcttggcatg 4080 aaagtactgg ttattcacct gctgagatag cacttggacg acagttgaag ggacccctac 4140 agagagcctt gcacaatccc ccagatccta accaaccagc atataatacc ctagaacgcc 4200 aaaaaattct atatgatgta gtcagagaca atgtagagaa ggcacagagt aagcagagga 4260 agtattacaa tatgaagaga agaactcaga attttgagga gggagatttg gtatgggtaa 4320 gaactcaccc tctctctaaa gctgatgatg cattcatggc taaaatttct cctaaatgga 4380 aaggcccagc taggattgtg aaaaagctag gaccggtaaa ttacaaggtc actatgttat 4440 cagatgttgc gcaagtagat acttatcata cacaaaactt aaaaatttgg catggtgcag 4500 atttttaaaa aaacacgggg agggatgta 4529 // ID DNA-X-5_DR repbase; DNA; ZEB; 1221 BP. XX AC . XX DT 27-OCT-2008 (Rel. 13.1, Created) DT 27-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE A nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1221 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1254-1254 (2008). XX DR [1] (Consensus) XX CC TSD is unknown due to sequence divergence. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1221 BP; 413 A; 197 C; 229 G; 377 T; 5 other; cagaccccct agtttttcgc gattccgcga tcgctgaatc caacgcaaaa tcaacaaaaa 60 gtcgcaaatt tttgcgatcc tgcaaaattc ttaatttaaa cgcaaaataa tcgcaaaaaa 120 tgtaaataaa tctcataaaa catccaattc caaaccatat aatcaaatta tatttatttc 180 agtttgcaat taattatgtt ttacatgact tatagacgtt gcatacgcag cgcacgtgat 240 gtatttgagt gtctctcgaa aaatgacgtc tcacctgcgc cctcacaatc attacattac 300 atggaagggg gggggaattg ttttgcagcc tgtgcagtaa gcagacgcgg atgaagcgcg 360 agtgatttac atgtgaagtc aatgcaaaga tgggattaga catcctgtgg cgcgaattgg 420 gcgttttgtg catttgacgc gcttcagact gcaaaagaaa gtgggagcaa acatctaaca 480 aacggagcaa tggaacagac agaaaagcaa gcagcttaca atgatggaga ttggttcaag 540 gatgacggcc gctggacgtg accgaattga aggacattat ttgtgcttta catgtatggc 600 tggtattatg atgtgcgtca aatcgataaa tgatttcgtt taactctttc cctgcagtta 660 acgagagaaa acgcttccct gccattgacg agttttacgg caatccgtgt tttaggtgta 720 ttacggtagg ggaagcccta acgcatgtcc tgagtgaggt accatgatgt cagatgctcc 780 agggagtgaa atctgagaga aagtaagatc gtggataaaa ataagagtta tacaaccttc 840 ctttggctta atccctaccg ttttagatct gttttagatc ttgtcttgac aagagaccaa 900 aataaagaag ctttattttt atgatttgtg tgtcattgtc attgtgtcag attgcacacc 960 taacatagta ggctacctaa tatggtaaat aaratatata tatatatata tatatatata 1020 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 1080 tatatatata tatatatata tatatatata tatatatata tatatatata cataattmtt 1140 tttagcaaat ttctgggaat taccaccaca aaatcagtaa tttttatyac aaaaaaatat 1200 cktgaaaaam tagggggtct g 1221 // ID L1-17_DR repbase; DNA; ZEB; 5593 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE a non-LTR retrotransposon from the L1 clad - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-17_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5593 RA Bao W. and Jurka J.; RT "L1 non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 692-692 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 154..1053 FT /product="L1-17_DR_1p" FT /translation="MYTSRKQDSSPSTTPSGKKKMKLASTEANLTDVMAEA FT LSKQQISLETTVQNAIRAAMEDVKSSLIELRQEVQSQTNTVRDLISKVDKI FT QGETRQIKKDLNVCKTEQQNLTSKIAELEDRSRRNNVRLVGLPQGREGDNP FT IGFLQKMLPIWIPALRNKGPIEIDRAHRIYGSGKSRTLLFRVLRYPDRQAI FT LQGAREVMKTEAIRDQDHLLRFFADYSGFTSRKRQAFGSLQRELHALGIPN FT FLIYPAVLRVNHGGNQQSFDSVKKAEQFLEEVRNGHASGVRRKLSFQGPSA FT PSESMDE*" FT CDS 1612..5397 FT /product="L1-17_DR_2p" FT /translation="MDKYKIISLNVRGLNSPNKRAKLLIYLRRKRVDIALI FT QETHLKNSDVQRLQNKFYKLAAFSTDNTKTKGTIVLISRICNIKIDRQSQD FT QSGRLAYVCTNIRGKKIAFVSIYAPTIFEATFFLKMANHLISLIDYSLIIG FT GDMNAVIDPVLDRSSSCNQVQLNSSAALKNFVKALNLVDIWRSWNPSSRDY FT TFFSTRHLTYSRIDYLLCSKDLLSYFDQIMILPAILADHNPLMVDFNCNSF FT AERSLRWRFNTSLLLNSQFDVDFKAKLAEFLLFNIGSVEDPIFVWEATKGF FT IKDFSIAFATNLKKMQNQRIDYLEKECKSLEYSLKSSFSKLTVNKLQSSKT FT ELNDLLRRKAEFLLHRTRQNYYSNGARPSRLLALRLKQNQAKAYIDVINHP FT AKGPVRNTKEINSAFSDYFSKLYISECTLPSHNCASFLHSLILPQLNNEEA FT EQLGQPITLEELKEALQGAKKGKAPGLDGIPAELFLKYFDLLGPIFLAAIH FT EAVNTGAFHPQLNIALISLIPKKGKDHSNCANYRPISLLNTDIKMYARILA FT LRLQRYINKLVHPDQTGFMPGRLASDNIRRLLHVIHEARDCPTPAAVLSLD FT AEKAFDRLEWDYLWSVLEVFGLGTNFINMIKVLYRNPTASVITNGLHSSPF FT NLGRGTRQGCPLSPMLFALSLEPLAQLIRQEDICSFSVKSHKQCISLYADD FT ILLFISDLQTSFPLILNVFGKFSLFSGYKINWDKSSMLPLNSIAKHIKLPY FT NVPIRNSITYLGIAVQSSLQTIPDANYLQVLSRVEQDLRVWTSMPLSLHAR FT IASVKMNILPRINFLSSMIPLPAPKKFWHKLNSMIRKFIWNGKQPRLKFET FT LQRTKENGGLALPNFELYHRAFQIKCIHVWLNPLSRVPWRQIEMKLADKYR FT LQDILFSGLNNKQCIQAFGPIIGNTIVNFKLMEKYIHHSPKWHIHTPIWNN FT NDLRSGGAPFLSRKWMQKGIYTLKDISGSNGILSFQELCERYETEPSSLFL FT YLRLRSAMKAYGVPWGDNLPTHPIITWFDLSFTNHFTSWIYRKALEGTAKN FT LAIQRCWETDCQLEEGVTDWERVWQNIFTSSKNPNHQLIHYNFCHRTYWTP FT LKRHRINASFSPFCDKCPNQELGSFFHMMWECEKVQRFWNEICVILSKIIE FT YPVPLNPSVLLLSDDSTLNLTKLQKAVWLAGLTSAKKLLAQHWLPPHNLDM FT YKWLVQLQDIIMLELSTARVNLVQMSTLQIWTLAADKISLFIAQRNDQDPE FT *" XX SQ Sequence 5593 BP; 1713 A; 1058 C; 1061 G; 1761 T; 0 other; ttaaaaaatg taatatttca gggaggtcac gtgacccttg cacgcgatgg ttgagagaca 60 ctgatgctcc gtccaaaccc tcttcctttt ctagtttaac aagtttataa gtatcgacag 120 tcagtatttt cgacgtgtga aagtattact taaatgtata catccaggaa acaagactct 180 tctccgtcta caaccccatc gggaaagaaa aagatgaagc tagctagcac tgaagctaat 240 ctaactgacg ttatggcgga ggcgctttcg aaacaacaaa tctcgttgga aacaacggtt 300 cagaatgcta ttcgagcggc aatggaagat gttaaatcat ctctgataga gctccgccaa 360 gaagtacaat ctcaaacgaa cacggtacgt gacttgatat caaaagtgga caagatccag 420 ggggagacac gacagatcaa gaaagattta aatgtgtgca aaaccgagca gcaaaatctc 480 acctcgaaaa ttgccgagtt ggaggaccgg tctcgccgta acaatgtgcg cctggtgggg 540 ctcccacaag gccgggaagg agataatcca ataggatttc ttcaaaagat gctccctatc 600 tggataccag cgctgaggaa caagggtcca attgaaattg atcgagcgca cagaatttac 660 ggctctggta aatcccgcac tttactgttt cgagtgttac ggtacccaga ccgacaagcg 720 atacttcagg gtgcccggga agtaatgaaa accgaggcga tacgggatca agatcacctg 780 ttgcgtttct tcgctgatta cagtggattt acctcccgca aacggcaggc attcggaagt 840 cttcaaaggg agctgcatgc gctggggatc ccaaactttc tgatctaccc ggccgttctc 900 cgtgtcaacc atggcggtaa tcaacaatca tttgattcgg ttaaaaaagc ggagcaattc 960 ttagaagaag tccgtaatgg ccacgcatct ggtgtgagac gtaaactgtc ttttcaaggt 1020 ccatccgcgc cttcagagtc aatggatgaa taaaaagtcc ataaacgtgt ttggaataaa 1080 cacctctgct ctacatgtaa aacatacaat ttaatgttct gcttcagaga acttgagagt 1140 tgagacaagt tcatttattc acgtattatt tcataccttg taggggttgt tattattatt 1200 ttgttattat tttatttatt tatttattta tttttcatta gctaacttat aatttgtgga 1260 cttcgtgttt ttttttcttt ttcatattat ttaattatta attattataa atttaattat 1320 ttcaaatgtt taaattattt actctagtta tatatgactg tcattattgt tcatagtaag 1380 tatacagtta tgttctgtgg ttaatggaag agggtggtcg acgttgactt tctagtgaac 1440 gaaggtgtat gctgccacgg gcccttatga tgttggggaa ttttgcggtc tcagggggta 1500 tttgttcatt ttatttgtct gtactttttt gcattgtttt agctttattt aggcagttaa 1560 attctggttg tggaaaaact cttaaaactc gtattgtttt tcaaatagat tatggataaa 1620 tacaaaatta tatcacttaa tgttagagga cttaacagtc caaacaagag agctaaactt 1680 ttgatttatt tacggaggaa aagggttgat atcgctttaa ttcaagaaac acatctgaaa 1740 aattcagatg ttcagaggtt acagaataag ttttataaac ttgcagcttt ttctactgat 1800 aacactaaaa ctaaaggcac tattgttttg atatcgcgca tctgtaacat caagattgat 1860 agacaaagtc aagatcaatc tggtagactt gcctatgttt gcactaatat aaggggaaag 1920 aaaattgctt ttgtttctat ctatgcccct acaatattcg aggcaacatt ttttttgaag 1980 atggctaacc acttgatttc acttatagat tactcactga ttattggggg ggacatgaat 2040 gcagttattg atccagtatt ggacagatct tcctcatgta atcaagtcca gcttaactct 2100 tcggccgcac tcaaaaattt tgtaaaggca ttgaatttgg tcgatatttg gagatcttgg 2160 aacccctcaa gtagagacta cacatttttc tcaacaaggc atctcacgta ttcaagaata 2220 gattatttat tatgctcaaa agaccttttg tcatattttg accagataat gatactgcca 2280 gctattcttg cagatcataa tcctttaatg gtagatttca attgtaattc atttgcagaa 2340 agatctctgc ggtggcgttt taatacctcg cttttactga atagccagtt cgatgtagac 2400 ttcaaagcca aattggctga atttctttta tttaatatag gctctgttga ggaccccata 2460 tttgtgtggg aagctacaaa aggttttata aaagatttct ccatagcttt tgctactaat 2520 ttaaaaaaga tgcaaaatca gagaattgac tatttagaaa aagaatgtaa atcacttgaa 2580 tattccctaa aatcttcatt ctccaagttg accgttaata aacttcaatc ttctaaaact 2640 gaactaaatg atcttctgag aaggaaagca gaatttttac tccacagaac cagacagaat 2700 tattactcta atggtgctag accaagcaga cttttagccc ttagactgaa gcaaaatcaa 2760 gcaaaagcgt atattgacgt aatcaatcac ccggccaaag gaccagttag gaataccaaa 2820 gaaattaatt cagcattttc tgactacttt tctaaacttt atatatctga atgtactctc 2880 ccaagtcata actgtgcttc cttcttgcac tctctaattt taccacagct taataatgaa 2940 gaagcagaac agttagggca acctattact ctagaagaac ttaaagaggc attacagggt 3000 gctaaaaaag gcaaggcacc aggtttagac gggattcccg cagagttatt tcttaagtat 3060 tttgatcttc tcggtcctat ctttttggct gccattcatg aagcagtaaa tacgggggcc 3120 tttcaccctc agttgaatat agcacttatt tcgctgatcc caaaaaaagg aaaggaccac 3180 tcaaactgtg caaattacag acccattagc ctgttaaata cagatattaa aatgtatgct 3240 agaattttag ctttgcgatt acaacgatac attaataaat tagttcatcc agatcagacg 3300 ggctttatgc caggtagatt agcatcagac aacatccgga gattactaca tgtcatacac 3360 gaggcccggg attgtcctac tccagctgca gttctttcct tagatgcaga aaaggcgttt 3420 gaccggctgg agtgggacta tctctggtct gttttggagg tttttggcct tggtactaat 3480 tttattaata tgattaaggt attatatagg aacccaacag cttctgttat aactaatggt 3540 ctacattctt caccttttaa tttagggcgc ggcactagac aggggtgccc gttgtccccc 3600 atgctatttg ctttgtcgct tgagccacta gcacagctga taagacaaga agacatttgt 3660 tcattttcag ttaaatctca caaacaatgc atatctcttt atgccgatga tatactattg 3720 tttatctcag atttacaaac gtcatttccc cttatactga atgtatttgg aaagtttagt 3780 ttgttttccg gatacaaaat caactgggat aaatcctcta tgcttcctct aaactccata 3840 gctaaacata ttaagctgcc atataatgtc cccatacgaa attccataac ttatctaggc 3900 attgctgttc agtcatcttt gcaaactatc ccggacgcta actatcttca agtattatct 3960 agggtagaac aagatctccg ggtttggacc tcaatgcctt tatcattgca tgctagaata 4020 gcaagtgtta aaatgaacat cttgccgcga attaatttcc ttagttctat gattccgtta 4080 cccgccccca aaaagttttg gcacaaactt aactctatga ttcgtaaatt catatggaat 4140 ggtaagcaac ctagattaaa atttgaaact ctacaacgta caaaagaaaa tgggggtcta 4200 gctcttccaa atttcgaatt atatcacaga gcttttcaaa taaaatgtat tcatgtttgg 4260 ctgaacccac tatctagagt tccttggaga caaattgaaa tgaaactagc tgataaatat 4320 agattacaag atattctgtt ctccgggcta aataacaagc agtgtataca agcttttggt 4380 ccaattattg gcaatacgat agtaaatttt aaattaatgg aaaaatacat tcatcattca 4440 cctaaatggc acatacatac acctatctgg aataataatg atcttagatc tgggggtgca 4500 ccatttcttt caaggaaatg gatgcagaaa ggtatttata cactaaaaga tatttctggt 4560 tctaatggca tattaagttt tcaagaactt tgtgagcggt atgaaacgga accttcctct 4620 ctctttttat atctgcgatt aagatcggcc atgaaggcct acggggttcc ctggggggac 4680 aatcttccca ctcatccaat cattacttgg tttgatctgt cctttactaa tcattttacc 4740 tcatggatat acagaaaggc cttggagggt acagcaaaga atttagctat acagcgctgt 4800 tgggaaactg attgtcaact agaagagggt gtcacagact gggaaagagt atggcagaac 4860 atttttactt catctaaaaa tcccaatcac caactcattc actataactt ttgtcacaga 4920 acctactgga ccccactcaa aagacatcgt atcaatgcct cctttagtcc tttctgtgat 4980 aaatgtccta accaggaact tggatctttt ttccatatga tgtgggaatg tgagaaagta 5040 caaagatttt ggaatgagat ctgtgtcatc ctatcaaaaa ttattgaata tcctgtacca 5100 ttgaatcctt ctgttctatt actgagtgat gattctacat tgaaccttac caagctacaa 5160 aaagctgttt ggctggctgg cttaacttct gctaaaaaat tactagccca acattggctg 5220 cctccacata atttggacat gtacaagtgg ttggtacaac ttcaagacat tataatgttg 5280 gagttatcca cggcgcgagt gaacttggta caaatgagta cattacagat ctggacttta 5340 gctgctgaca aaatctcctt gtttattgca caaagaaatg atcaggatcc agaatgattg 5400 ccttgctttg tataattgat atttgatact gtattctgtt tttgtttttt tgttaacttc 5460 tattcattgt aaagcccagt ttcaagagac cgcaggggag gggggggggg tgggggttga 5520 attgatatgt ttcattatat aggtttttca gcctgtatgt tttgttaaca attaataaaa 5580 aaaaaaaaaa aaa 5593 // ID Gypsy151-I_DR repbase; DNA; ZEB; 6279 BP. XX AC chr14; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy151-I_DR; KW Gypsy151-LTR_DR; Gypsy151_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6279 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1563-1563 (2008). XX DR Genome; chr14; Positions 14030049 14036327. XX CC Positions [4693-5166] - Integrase core CC 'TTGG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 148..6216 FT /product="Gypsy151-I_DR_1p" FT /translation="MANERRNLVREIRKELCELSADELFLIAKDIEQIDKR FT ADSQELLEDEEGCFDYVSGFLCCKRLMGEEDEGMGVLLDLRDKVLQISQSK FT QIAAQENKTHDPSKGRAQGHIDHTTPDANTSTSHTHTNMVDTEYQQMLKMY FT EELGAKIQATNPHALPPRVSPLTLPFPTYPTLHSDGDQISRRELPFLSRRE FT FKVHGGQIGDSTSEISFNSLCRQIEEGLHEQFSESEIIRGVLKITKPGHFK FT EMLMEKENLTVRELKGLLQSHLGDKSSTELFQSLMCARQGENESPQQFLYR FT AIGLKQKIQFASKHSTADIRYDAKTIQEVFLHTVCQGLGPKQGELRREIKQ FT LVTAGDVTDDMLLRQLVKVTSEEEERQRRLQSLTRPKVTHARSAQIEGAPE FT IKAAAVTTEQTDKEIRKLTTQVEALTNLVTSLQQAKEKEQHCQCMSKAPEK FT KRSPSCANCIKQGTGEACNHCFLCGEEGHRAVGCLHRKTKVTNNTRFKSTP FT QTTVSDSRSPCHKYTVSVGSWNNRNSSARRPQPRQPEHSEERIVQFVGKKC FT LLDGEIGDKQVSMLLDTGAQVSIIDQSWREKYLPSHEVHPLSEIVGPSAGL FT EVFAINGEVIPFSGWVEATVSLPGQDGTRYSIQVPFLVSQLQLERPLLGFN FT VIAELIAGPIDSTNILTTLHSLMSSTEGNQDDLVKVSVGFISAHKVHADTA FT WVRVGPQSVTIQPGQVANVKCRVPPNFTTSDTRVLFEPSEDTPHLSHLDLG FT GGLVEINKSGRSYIRVPVGNHTKHEVTIPKRAILGEIAAIAKIIQTEQEDL FT NNPQKSTSPLPILPEDSDTPWHPPVDLTHLKSSEQAIVRQLLYEESAVFAH FT NETDIGNIPSLQMTLSLKDDIPVQKAYTSIPKPLLKEVKEYIQDLLAKGWI FT VKSRSPYAAPVVCVRKRDGTLRLCIDYRQLNKKTVPDRHPLPRIQDLMNTL FT GGYRWFSILDQGKAYHQGFMAEGSRHLTAFVTPWGLFEWVRIPFGLSNAPA FT AFQRSMEDMLESIRDECCLPYLDDVLCFAKSFEEYVEKLRVVFRALRHHGV FT KLRPAKCELFKHEVRYLGRLVSADGVRVDPKDLEAVRVLRDKVPQNVGEVR FT RLVGFLSYYRTYIQDFARLAKPIYELLQAHTAGAASQEGRRKEKKRGLQLP FT SRNPIVWEDKHQQSLDKLIDMLTNPPVLAYPNFDAPFVLHTDASEQGLGAV FT LYQRQEGKLRVIAYGSRTLSPAEKNYRLHSGKLEFLALKWAVCEKFRDYLF FT YAPYFTTYTDNNPLTYVMSSAKLNASGYRWVGELADFRFEIKYRPGKSNND FT ADMLSRSPLDMDHYISECTEGLSQAAIQKTWEGNLMWKNQDVAWVAALNLT FT APDMIILQNQSIPVISPDELSRAQREDLMIGPVMELKEAGKTLTPDVWRMA FT AGDTKKLLREWHKLVLENNIMYRKTMQRRQLVLPTKYRQMILQQLHNNMSH FT VGTEKVLQLARERFYWPGMRKSVEEYVTRQCPCITQKHPVTHVRAPMGGIT FT SSTPLELVCIDYLHLEASRGGYEYILMVIDHFTRFVQAYPTKNKSGKSAAE FT KIFNDYIPRFGFPAKLHHDQGREFENSLFHTLQKLSGVGHSRTTPYHPQGN FT PVERMNRTLLQMLRTLTEKDKQNWKEHLPKVIHAYNCTKHESTGYSPFYLM FT YGRHPRLPIDLLFGLAQDNDFNTQNEYAEHWADQMAEAYKIAAQSSKQRSL FT RNKEYYDQKASCIILRPGDRVLVRNLSQRGGPGKLRSFWEPTIYVVKEQLG FT DNFVYKVHPERDEHKVRTLHRNHLLLVNDLPTAESLDPRKQRVSKEMNVSM FT GKNTNRKVTPRDERREDEQSNSDEEEGYCWRRISNPKYSGCERKNLESELP FT QEELEEPDITVRDEIPRAVGQLPTETEDIRQEGDQSENLICVDIQETGQDM FT ESDCEGSKEQQTTEDQGQESHECEPYETSRRSLRERRPAQRMTYPLLGQPV FT FQTIPKINTVAANTIQPLTLMHTPVHTWLGHISSPYEEMLFPVMVPNY" XX SQ Sequence 6279 BP; 1940 A; 1335 C; 1553 G; 1451 T; 0 other; gttcttgggg gctcgtccgg gatcagcgcc acctgctgga cgagtggagc ttatcccaaa 60 gaatccgaag accagcagct ggggattctg aattggattt cctgctgtag tttggaaggt 120 gctaacagtg gagaaaaagt tagtactatg gcaaacgaga gaagaaatct ggtgagagag 180 atcaggaaag agctctgtga actttctgct gatgaactct tccttattgc taaagacatc 240 gaacaaatag acaaaagggc cgactcgcaa gagttattag aggatgaaga gggatgtttt 300 gattatgtta gtgggttttt gtgctgtaag cgtctaatgg gggaggaaga tgagggaatg 360 ggtgtgctac tggacttaag agacaaggtt ttacagattt ctcagtctaa acaaatagca 420 gcacaggaaa ataaaacaca tgacccaagt aaagggagag cccagggtca tattgatcac 480 acaacacctg atgcaaacac aagtaccagt cacacacaca caaacatggt agacactgag 540 tatcagcaga tgttgaagat gtatgaggag cttggagcaa aaattcaagc tacgaatccc 600 catgccctgc ccccacgagt cagccctctc acacttccct ttccaactta tcctacctta 660 cacagcgacg gagaccagat aagcaggagg gaactaccat ttctctcacg gagagagttt 720 aaagttcatg gaggccagat tggtgactct acttctgaga ttagtttcaa cagcttgtgc 780 cgacaaattg aggagggtct gcatgaacag tttagtgaaa gtgaaattat tcgaggggta 840 ctaaaaatca ctaaacctgg ccacttcaaa gagatgttaa tggagaaaga gaatttgaca 900 gtacgagagt taaaggggct tttacaatct cacttgggag ataagagtag cacagaactg 960 ttccaaagtt taatgtgtgc gcgccaaggg gaaaatgaaa gcccacagca atttctttat 1020 cgtgcaattg gacttaaaca aaagattcag tttgcttcca aacacagcac tgctgatata 1080 agatatgacg ctaaaaccat acaagaggta tttctccaca cggtttgcca aggtttaggg 1140 cccaagcaag gtgagcttcg tcgtgagatt aaacagcttg tgacggcagg agatgtcact 1200 gatgatatgc ttcttaggca gcttgtaaag gttacaagtg aggaggagga gaggcagcgt 1260 cggctgcagt cactgacacg acccaaggtc actcatgctc gtagcgccca gattgaaggg 1320 gctccagaaa ttaaagcagc tgctgtgact actgaacaaa ctgacaaaga gattcgcaaa 1380 ctcactacac aagtagaagc actcacaaat ttagttacct cgctgcagca ggctaaagag 1440 aaagaacagc attgtcagtg catgagtaag gcacctgaaa agaagcgaag tccaagttgt 1500 gctaactgta ttaaacaagg gacaggggag gcctgcaatc actgtttctt gtgcggcgaa 1560 gagggtcacc gtgctgtcgg ttgcctgcat agaaaaacaa aagtaacaaa caacaccagg 1620 tttaagagta caccgcagac aacagtttcc gattccaggt caccctgcca caaatataca 1680 gtcagtgttg gttcatggaa taataggaat agttctgcaa ggagaccaca gccaagacaa 1740 cctgagcact cagaggagag aatagtccag tttgtcggaa agaagtgtct actggatgga 1800 gaaattggag ataaacaagt aagcatgctc ctcgacactg gagcacaagt aagcattata 1860 gatcaaagct ggagggaaaa atacttaccc tcacatgagg ttcacccctt atctgaaata 1920 gtggggccca gtgctggtct ggaggtgttt gctatcaatg gagaagtaat cccatttagt 1980 ggatgggttg aggccacagt aagtttgcct ggacaagacg ggactcgtta ctctatccaa 2040 gtgccctttt tagtcagtca actacaatta gagaggcctc ttttgggatt taatgtgatt 2100 gcggagttga ttgcaggacc tatagatagt acaaacatcc tcactacctt acactctctc 2160 atgagtagca ctgaaggtaa ccaagatgac ctggttaaag tgtccgttgg cttcattagt 2220 gcacataaag tacatgcaga cacagcatgg gtgagagtag ggccgcaaag tgtcaccatt 2280 cagccaggac aggttgctaa tgtaaaatgt agagtacctc ccaacttcac cacttcagac 2340 acgagagtac tgtttgagcc aagtgaagac accccacatc tatcacatct ggatcttgga 2400 ggaggactgg tggaaattaa caaaagtgga cggtcatata taagagttcc agttgggaat 2460 catactaagc atgaggtgac cattccaaaa agagctatac ttggagaaat agcagccatt 2520 gctaagatta tccagacaga gcaagaggac ctaaacaacc cacagaaaag tacctcacca 2580 ctacccatct tgccagagga cagtgataca ccatggcatc cccctgtgga ccttactcac 2640 ctaaaatcaa gtgaacaagc cattgtgaga caacttcttt atgaagagtc agctgtgttt 2700 gctcataacg aaactgacat tgggaacatc cccagcctac agatgacttt gagcttaaag 2760 gatgatattc cagtgcagaa ggcttatact tccataccca aaccactctt aaaagaggta 2820 aaagagtata tacaagactt gctggccaag ggctggatag taaaatctcg ttcaccctat 2880 gctgctccgg tagtatgtgt ccgcaagaga gacgggacat tacgtttatg tattgactac 2940 cggcagctca ataagaaaac tgttccagat cgtcatcctc tgccaaggat ccaggacctc 3000 atgaacacct tgggtggtta ccgctggttc tccatacttg accagggaaa ggcctatcac 3060 caggggttta tggcagaagg gtcgagacat ttgacagcat ttgtgactcc ttggggcctc 3120 ttcgaatggg tgaggatacc ctttggtctg tcaaatgccc cagccgcttt tcagcggagt 3180 atggaagaca tgttagagtc tatccgtgat gagtgttgcc tgccctacct ggacgatgtg 3240 ctctgctttg ctaaatcatt tgaagagtat gtagagaaat tgcgggttgt gttcagggcc 3300 ctccggcatc atggagttaa gctgcgtcca gctaagtgcg agctgttcaa acatgaagtc 3360 cggtatttgg gtcgtcttgt gtcagcagat ggggtaagag ttgaccccaa agatctggag 3420 gctgtgaggg tacttcgtga caaagttccc caaaacgttg gagaagttcg tcgtttggtg 3480 ggcttcctga gctactacag gacctacatc caggactttg ccaggttagc taaacccatt 3540 tatgagcttc ttcaggcaca cacagctgga gctgcatcac aagaaggacg gagaaaggaa 3600 aagaaaaggg gtctacagct gccctctcga aatcctatag tgtgggaaga caaacaccaa 3660 cagagtctgg acaaattgat tgacatgctt accaacccgc ccgtgctggc ataccccaac 3720 tttgatgcac cgtttgtact acacaccgat gcttcagagc aaggattggg ggcggtgctt 3780 tatcagagac aagagggtaa attacgggtc attgcgtatg gctcaaggac attgtcgcca 3840 gctgagaaga attatagact ccattccggt aagctggaat ttctcgcact caagtgggca 3900 gtgtgcgaaa agtttcgcga ctaccttttc tatgccccgt atttcacaac atatacggac 3960 aataatcctc tcacttatgt gatgagttct gcaaagttga atgcgtcagg ctatcgttgg 4020 gtgggggaac ttgcagattt ccgatttgag ataaaatacc gccctggcaa aagtaacaat 4080 gatgctgaca tgttgtctcg gtcccccttg gacatggacc attacatcag tgagtgcaca 4140 gagggtttgt cacaagcggc catccaaaaa acctgggagg gtaatttgat gtggaaaaac 4200 caagatgtag cctgggttgc agctctaaat ttgacagctc ccgacatgat tatcctccag 4260 aaccagtcaa taccagtcat cagtcctgat gaattgagta gagcacagcg tgaggattta 4320 atgattggac cagtgatgga attaaaggaa gctggtaaga cattgacccc tgatgtttgg 4380 cgcatggcag ctggtgacac aaagaagttg ttaagagaat ggcacaagct tgtcctagag 4440 aacaacatta tgtataggaa gacaatgcaa aggagacagc ttgtactacc aacaaagtat 4500 cggcagatga tccttcaaca gctgcacaat aacatgagtc atgttggaac agagaaagta 4560 ttacagttgg ctcgagagag gttctactgg cctgggatgc ggaaaagtgt ggaggagtat 4620 gtgactcgcc agtgtccatg catcacacag aagcacccag tgacacatgt gcgtgcacca 4680 atgggtggga tcacatcaag cacccccctg gaattggtgt gtattgatta tttgcacctg 4740 gaggccagcc gcggtgggta cgaatatatc ttgatggtaa ttgaccactt taccagattt 4800 gttcaggcat accctaccaa gaacaaatct ggcaagtcgg cagctgagaa aatattcaac 4860 gattacatac ctcgttttgg attccctgca aaattacacc atgaccaggg gcgtgagttt 4920 gaaaactccc tcttccacac tctgcaaaag ctctctggtg ttggtcactc caggactact 4980 ccataccacc cccagggtaa tcctgtggag agaatgaacc gaaccttgct acagatgctg 5040 cgcactctga ctgaaaagga taaacaaaac tggaaggaac atctacccaa ggtaatccat 5100 gcctacaatt gcaccaaaca tgagtcgact gggtactctc cgttctacct catgtatggc 5160 cgtcacccac gactccccat tgacttactg tttgggttag cgcaagacaa tgatttcaac 5220 acccaaaatg agtatgctga acactgggct gatcagatgg cggaggccta taagattgca 5280 gcacagagca gtaaacagag aagcctacgt aacaaagagt attatgatca gaaagccagc 5340 tgtatcatcc taagacctgg agatcgtgtc ttggttagga atttgagtca gagaggcggc 5400 ccaggaaagc tgcgttcttt ttgggaaccc actatttatg tggtgaaaga gcagttaggt 5460 gacaactttg tgtacaaagt gcaccctgaa agggacgaac acaaggtcag aacactacac 5520 agaaaccacc tcctactcgt aaatgactta ccaacagctg agagcctgga cccaaggaaa 5580 cagcgagtct ctaaagaaat gaatgttagt atgggaaaaa acacaaatag aaaggttaca 5640 ccaagagacg agagaaggga ggatgaacag tctaactctg acgaagagga aggatactgt 5700 tggaggagga taagcaatcc aaagtattca ggatgtgaaa gaaaaaatct ggaaagtgag 5760 ttgcctcagg aagaactgga agagcctgat atcacagtga gggatgagat tccgagggct 5820 gtgggacaat taccaactga aactgaggac ataaggcagg agggtgacca atcagaaaac 5880 ttaatttgtg tggacattca agaaacagga caggacatgg agtcagactg tgaaggctca 5940 aaagagcagc agaccactga ggaccaaggg caagagtcgc atgagtgtga accctatgaa 6000 acaagtagac ggtcactccg ggaaaggaga cctgctcaaa gaatgactta cccattacta 6060 ggacagcctg tattccaaac aatacctaaa ataaacacag tagctgccaa cacaatacag 6120 ccattaacat tgatgcacac acctgtacat acatggttag gccatatcag tagtccatat 6180 gaagaaatgc tatttccagt tatggttcca aattattgaa agttatatat gagaatgaaa 6240 aaatatatat atggtattca ccttctgtgt gggggagag 6279 // ID BEL1-LTR_DR repbase; DNA; ZEB; 674 BP. XX AC chr17; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1-I_DR; KW BEL1-LTR_DR; BEL54_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-674 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1354-1354 (2008). XX DR Genome; chr17; Positions 14944897 14944224. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 674 BP; 194 A; 98 C; 133 G; 249 T; 0 other; tgtgctggaa catatttgat gaaagcagaa ataatgttta atctttgatt ggcaagataa 60 ttattgtgta ttgcaagggt aattctttta agtagtgatt tggtttcatt ttgtaaatgg 120 ttagtgttcc atctagtggt gaccttgagt tatagcaagg tagatcctgc tgttcactta 180 aaatcaggaa gtaaagtgct gcaagcacac ggcaatgact acgatgatca gcaatcgtca 240 tgtttttctg taatcttaac ccttgcaaca gcatcgcttg taagtacgtt tttgattgta 300 taaggtgttc attatattgt atgttttatg aataattgag ctcaagagtg acttcttagc 360 ttatcggcag agctaataca ttagccacaa tttgctttag ctttatattt cagtattttg 420 tatttacata agagaagata ttatttacta caaagcactc tgtatttcat gtaaaagcta 480 tgcttctgta gttttgttaa cttgtggaca tattatttat gtatcctatt gttttttatt 540 gtagtttcac tccataccca agaccattaa acctgttggc aaatgagttt cttgtcttca 600 gagtcgtgtg agggaaggga gtttatctaa ctgtcaaaac aatcactaag ggacagtatg 660 ttgaggacag taca 674 // ID HATN17_DR repbase; DNA; ZEB; 1547 BP. XX AC . XX DT 22-OCT-2004 (Rel. 9.09, Created) DT 22-OCT-2004 (Rel. 9.09, Last updated, Version 1) XX DE DNA transposon HATN17_DR. XX KW hAT; DNA transposon; Transposable Element; 8-bp TSD; HATN17_DR; KW hAT-type. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Schulte-Merker S., van Eeden J.F., Halpern E.M., Kimmel B.C. RA and Nusslein-Volhard C.; RT "no tail (ntl) is the zebrafish homologue of the mouse T RT (Brachyury) gene."; RL Development 120(4), 1009-1015 (1994). XX RN [2] RA Hammond S.; RT "HATN17_DR."; RL Direct Submission to Repbase Update (23-JUL-2003)Wellcome Trust RL Sanger Institute, Hinxton, Cambridgeshire, CB10 1SA, UK. E-mail RL enquiries: zfish-help@sanger.ac.uk Clone requests: RL clonerequest@sanger.ac.uk. XX RN [3] RA Gentles A., Kohany O. and Jurka J.; RT "DNA transposon HATN17_DR - a consensus."; RL Direct Submission to Repbase Update (MAY-2004). XX DR [1] (Consensus) XX CC HATN17_DR is a hAT-like non-autonomous DNA transposon from CC zebrafish. There are around 70 copies of HATN17_DR in the CC genome, with >96% similarity to the consensus. This transposon CC contains a partial DANA SINE element, and shows similarity to CC Dr00017, Dr000745, HE1_DR1, DANA, Dr000132, and TDR9. There is CC an 8-bp target-site duplication. XX SQ Sequence 1547 BP; 496 A; 281 C; 291 G; 479 T; 0 other; cagtggcgcc cccagaaaat tttcttaggg gtggccagaa gaggccgcac caaatcttgg 60 ggtggcacac aaaaaaaata atgaattcag tgtaatgtta tatttttgtt tctggtaaat 120 gaaataatgt agtttttatt catttaatta attctacttg aaatattgca atttattctt 180 tgaaataatt cagcaagtac atgtgcaaac caaataaaaa tctatgttta gtttaaaaac 240 acttttcata tactgtataa ataacttctt tttttacgtg cctttattgt acatcatacg 300 agatatgcca tgtctaaaat taatgaagat taatgagttt attattccca gtgatgggtt 360 gcagctggaa gggcatccgc tgtgtaaaaa tgtgctagat aagttggcgg ttcattctgc 420 tgtggtgacc ctggattaat aaagggacta agccgaaaag aaaatgaatg aaagaataat 480 gagtttatta attacccaaa caaatgaaca aactgaacaa aaggcattac tatacacact 540 cactatacct aataatatta tttatccata ttgctttttg aaccatttta ttaatgcagc 600 ttcttctatt gatatactgt agcttaaggt aaatattaaa tagccattgc tgtctattga 660 aggtgtgtgc gtgctgtcaa ggacgatcgg ggagggtagt ggtggttcta gtttaaatga 720 caccctgggc gaaccaccct atacaccccc acctctcttg aattgttaga tttgttattc 780 aataaatata acaatttatt tatatttatt ctaaataaat acaattaata ttaatataca 840 attattattc taaataaata acagaaatca gtcctaaata ttactggaaa acaacaactg 900 gagtgatatg ccaagatcac ttaagaaacc ttttgcatga aaattaatta tgacaattct 960 aaagaataca aaaaatgtat tatagaatta tctgtggtct tagaccagat catggctgag 1020 aaatcatgtt atgaagtctt acctccctga ctcattatcc ctcctttctg ctttaaaggc 1080 atgcttagtg aaagtaacat catcatcatc atcatcatca tattatataa actttaatcg 1140 actttcaaaa ctcgctgcgt gccgacacct ctgcacaaaa ggctgttact ccggtttcac 1200 ggcgcatgag cggtgcctat tatgtcggcg tgcatgcaaa caaccaaccg ggattcacac 1260 aggagtgcgt gaggcgcgcg ggaatggttt tcgcgcgcat gcgtcagttt gctttcacca 1320 gcattgaacc agagggggag agctacgtca gtaacaccaa caataattta gtgactgcat 1380 tttatttatt tatttatcta aatattggga atttataagt tcatttaaat caataatgtc 1440 aaccgcaaaa ttatattggg gtggccaaag gggtggccac aggggtggcc agagtttacc 1500 gaggggtggc cgtggccacc cctggccacc ccttggaggc gcccctg 1547 // ID Gypsy150-I_DR repbase; DNA; ZEB; 6102 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy150-I_DR; KW Gypsy150-LTR_DR; LTR67_DR; LTR_Retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6102 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1561-1561 (2008). XX DR Genome; chr4; Positions 40545868 40551969. XX CC Positions [4267-4662] - Integrase core CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1103..2668 FT /product="Gypsy150-I_DR_2p" FT /translation="MSATDFAKLRELLLIEEFKSCLPEKVVVYLNEQKVST FT VSQAAVLADEFVLTHKNVFTSVRPEKSSSGSHVQSQPSSSNSSPQTTKNDL FT ECYYCHKKGHKLSDCIIFKRKQQQPKSVGFVNSVTVLPSDACNSSFAPFLM FT KGWISLSGEPHDQKEVQILRDTGAAQSFILSDCLPLSDKSYCCASVLVQGF FT EMGIVNVPLHRVHLQCDLITGFVKVGVRSSLPVNGVDIILGNDLAGGKVMP FT VVEVLDSPTVSSSPDIFTKNYPEVFSACAVTRAQMRRTGEDAALDDTFMAS FT VFADQLPSPPSDSESSSKAHSSSVSVRAHAPLNSNALPLNITRDQMCAAQR FT KDPNLSKCFDSALDPEVAKKSQVAYFVEDGLLMRKWSPNNRGDLDWGAVYQ FT IVIPDLYRQQILSLAHDHHMSGHLGVTKTYDRIIRHFFWPGMKKNVAEYCR FT TCHTCQVTGKPNQVIPPAPLVPIPVIGEPFENVIVDCVGPLPKTKSGNQFL FT LTIMCRATRFPEAVPLRKIQRCIVTK" XX SQ Sequence 6102 BP; 1712 A; 1229 C; 1293 G; 1868 T; 0 other; ttaaatgggg gctcgtccgg aataaagaaa gattcccaaa aaaaaaaaaa aaaaaaaaaa 60 attggaagtg aaactatttg catccacaca tatatcttaa gtgttgggtg gtgttgtgtg 120 agatagctga cgctgtgtgc aggcattatg gtcgtatttg atttacaggc ttttattctc 180 aaaccgagtt gggaacaaat tgataagtgt agaaaagatg acttattgat tattgcaagc 240 cattatgaca tttctgttga aaaacaggct ttaaagaaag agatcaaacg cttggttgag 300 acacgcttgg ttgagctgaa aatactgcag gggagtgatg aaggcggtga gggcatgcga 360 cattccagca tggaggcgcg gcctgctgat tccttgaatg agctgggtga aggtttgggt 420 gaaggtttgg atgaagtaga tgccgaggcg gaggctgatg gtaagcctgc tcttcctccc 480 ttttatcctt tttcccctgt ttccactggg tctagagaag acgtacgttt aaaggtacgt 540 ctggcccgag tgcaaatgga gacgcaggaa aagatgcagg ctcgccaagc tgaattagac 600 atgcgtgtac aaatacgtaa gctggagatc gaggctgata cgcaagtaag gctgcgacag 660 cttgagctgg atgctgcgaa ggctgctgcc agcccagctg cgcagtcaag cactgcgcac 720 agtcctgtgt cccccccgcc gtctgataca tcctcgcttg cctttgatgt cagccgttat 780 atagctcttg ttccccagtt tagagaaagc gaagttgata catattttaa cgctttcgag 840 cgcattgcgt ctgctctgcg ttgcccaaaa gatctcttct cctccagtgt aagttgatag 900 ggaaggctca ggaagtctgc tcgactcttt cattagaaga tagtttgcag tatgacactg 960 tcaagtcagc agtccttcgt gcttgtgagc tggtaccaga agcttatcgg caaagattta 1020 gaaaccataa gaaaaatagc catcagacat ttgtggagtt ttcacgggag aaatcagtcc 1080 tgtttgataa gtggtgtgct gcatgtcagc aactgatttt gcaaagttac gagaattgct 1140 gttaattgag gagtttaagt cgtgtttacc agaaaaggta gtagtttatt taaatgaaca 1200 gaaagtgtcc actgtgtctc aagcagctgt gcttgcagat gagtttgtgc taacgcataa 1260 aaatgtgttt acttcagtgc gtcccgaaaa gagctcatct ggctcgcatg tgcagtcaca 1320 accatctagt tcgaattcca gtccacaaac tactaaaaat gaccttgagt gctactactg 1380 ccacaagaaa gggcacaaat tatctgactg tattattttt aaaagaaaac agcaacagcc 1440 aaaaagtgtt ggatttgtga attcggttac tgtgttaccg agtgatgctt gcaatagcag 1500 ttttgcgcct tttctaatga aaggttggat ttctttatct ggagagccgc atgatcagaa 1560 agaagttcaa atcttgcgtg acactggcgc cgctcagagc tttattcttt ctgattgttt 1620 accgctgtca gataaatcgt actgctgtgc cagcgtttta gttcaaggat ttgagatggg 1680 tattgtaaat gtgcctttac accgtgtaca tttgcaatgc gatctaatta caggatttgt 1740 gaaagttggt gtgcgttctt ctttgcctgt aaatggagtt gacatcattt taggaaatga 1800 tttagcagga gggaaagtga tgcctgttgt ggaagtgtta gacagtccga ctgtgtcctc 1860 ttcacctgac atttttacaa aaaactatcc tgaagtgttt tctgcgtgtg ctgtcacacg 1920 agcgcagatg cgcagaactg gcgaggacgc cgcgttggat gacactttta tggcttcagt 1980 ttttgcagac cagttacctt ctcctccctc tgattcagag tcaagttcaa aggctcactc 2040 aagttcagtg tcggtaagag cgcatgcgcc actaaattct aatgctttac cattaaatat 2100 caccagagat caaatgtgtg ctgcacaacg aaaagatcct aatttgagta aatgctttga 2160 ttcggctctg gatcctgagg ttgctaaaaa gagtcaggtt gcttattttg ttgaagatgg 2220 tctgctgatg agaaagtgga gtcctaataa tagaggagac ttagactggg gtgctgtgta 2280 tcaaattgtc atacctgatc tgtacagaca gcaaatattg agtttagctc atgatcatca 2340 catgtcagga catttggggg tgacaaaaac gtatgatcgt atcattagac attttttttg 2400 gccgggcatg aaaaagaatg tggctgaata ttgtcgcacc tgtcatactt gtcaggtcac 2460 tggaaagcct aatcaggtga ttcctcctgc ccctcttgtg cctattcctg tgattggtga 2520 gccatttgaa aatgttatcg ttgattgtgt cggacccttg ccaaagacta aatcaggaaa 2580 tcaatttctt ctaactatta tgtgccgcgc aacccgtttt cctgaagcag tgccactgcg 2640 taaaattcag agatgtatag taacgaagta gaactacttc actactgtac ttaagtacta 2700 aaaggcagta tctgtacttt actggagtat tgtttttttc tcctacttcc acttttaccc 2760 aagtacatat tttccatgag ttttatactt ttactccgat agatttttta tgtgctgcat 2820 cgttactcgt tactaggggt gtcaaaatga tcgatatcag ttcagtaata gatcataacc 2880 ggtgattaca tgatgacgtc atttatctcc tatgcgcgat gtcgcaatgg tctcaatatt 2940 atggcggagg acggcgcgag ggcgagtgaa ggcggcaaaa cacacgagcc cctatttcac 3000 tattagggag aaatgttgta aaacttcacc tctgcctctc tctttctctc actttggaca 3060 tttgacccgc tggccttttt gtaaggtaac ttttcctctc ttcttggctg ataaagctgt 3120 agattcagac acgagcgtgt ctgatgcgca ggttttctcc actgtgtcta ttacctgtga 3180 taatcgcgca ttcttcccgc catggctgaa caacgctttc atttctaaat ccctttctgt 3240 tgaggacaat aaccaatcat tggcgtgcaa tacctcgcct gtcagcgctc aaaaaggagg 3300 cgggataaaa tccacatgag tttattagct gtaaatgttc atgctgtctt ctgtgcatga 3360 gttttgtttg atacattcag aaagtgtgtt ttctttgagc aggtcaaatt aagggcacag 3420 ttcatatatc tgactgctgt attaaagaac aaaattctca caccaagtca caccaagaca 3480 aaagcaatag aattctttat tgtatatctt acctcctaat gtccctagtt aacacaatca 3540 atgcattttt ttcaacacat cccataattt ctaaaatatc agcctctact aaatggttga 3600 gaagttggtt tatggtaaaa gtaccaaaat taataaatat actacagaaa atattaagtg 3660 taagaccaat ttaattaaaa acattataaa aaaaaaaaac attttgaata ctaaaccata 3720 gccgtaagcc ataaaatatt tcagattatc atttaacagt aatatacacg ctttcttgtt 3780 tttgtttttt ttacaataaa atcaaaatca agtgaattag tgcaacagga tttagtttgc 3840 ttgcgcgcat taaatgcttt gctgcgcgcg cgagagagag agtatgtatg tgagtgaaca 3900 tgctaaatta tctcatcagt gaagggtttt tctggtaaaa gggcctgacg atgtgagttc 3960 ttgtaccata gaaactattt acagagaaaa acagtatcgt aagccctgtt tctttttgtt 4020 ctgcgtaatc acgtgcccca aaaaagatat ttaaaataaa caaaacaatt agtgtatttt 4080 gactgctttc tttaataact acattacaca atacttgtac ttttactttc agtacttgag 4140 tagtaaattt tgaaataaac tacttgcaat acttaagtac aaaaaatgtt gaatacttta 4200 ctacttccac ttaagtatgg tgcttaagga gcacttctac ttctactcaa gtcaattttt 4260 gattgagtac ttgtactttt acttaagtat gggtctctag tactttatac atctctggta 4320 aaattacagc ttctgccgtc tcaaaagcac tcgttaaatt tttttctaca ttcggacttc 4380 ctaaagttgt tcaaacagat caaggaacta attttctcta aattgttctc acaaattcta 4440 aaaacccttg gcatttccca ccgtcattct agtgcatatc accccgaaag tcagggtgcg 4500 cttgagaggt tccatcaaac ctcgaaaagc atgttaagaa aatatgcaat ggacactgaa 4560 aaagactggg acgagggtgt gccactggtt ttgttcgcag tgcgtgaaac cgttcaagaa 4620 tcgcttggat ttagccccgc agatctattt tatgcacacg ggtttcgggg acctttgaag 4680 attttgaagg aaaatatttt aaacactgac tcaaacgtta agtgtaatat tctggacttt 4740 gttagtcgat ttcgagagcg tttgcatgct gcgtgttctc tcgctcgaga atccctctct 4800 gttgcgcaat caggcatgaa gagacggtat gatgcaaatg ctctctctcg ttgttttcaa 4860 cctggtgacg aagttcttgt tctgctgcct attcctggct ctgctctttc tgctagattt 4920 tctggcccgt accgtgtttt ggaaaagata agtgatactg attacgttat tgacacccct 4980 gacagaagac gtcagtcacg cgtatgtcac atcaatatgc ttaaaccata tcactccaga 5040 acttcaccta ttacatccac agttcagatc gcagggcccg cgatctcttc tgctgcagtc 5100 tgtgatgtgt tatcctcctc ccttcatttg cccagtgcag atgatgatgg agtggttttg 5160 cgcaaaggat cagttcagtc accgcgctta agcaattcag atatgctgaa acgtttgtcc 5220 acgcaattag atcacttgtc tatcgaacaa caaaaagatg ttgagtgtct catttatgat 5280 tttccacaat tatttggaga cattcctacg cagaccacag tattaaagca tgacattaaa 5340 atcaactgct ctgctccgat aaagcaacat gcttatagag tgaataccac aaagagatcc 5400 ataatgcgaa ccgaagccaa atatttgctt taataatttg gcggttcata gcagtagtcc 5460 ttggagtagt ccgtgtttgc tcgttccaaa gccggatgga acgtatcgat tctgtacaga 5520 ctacagaaaa ataaatgccg tcactgtacc ggattcatat ccgttaccgc ggatggaaga 5580 ctgtattgat agccttggta ctgcacagtt tgtcacaaaa ttagacttac ttaaaggtta 5640 ttggcaaatt ccacttagtc cacaagcttc tgaaatttct gcattcgtta ctccggatta 5700 ttttcttcaa tattccgtta tggctttcgg catgcgaaac gcccccgcaa catttcagag 5760 actcgtcaat attgttttag ccggtgttcc aaattgcaac gcataccttg acgatttggt 5820 ggtatattca tctgactgga aagaacatat tgctctttta aggacagtgt ttgagaggct 5880 agcaaatgca tcactgaccc ttaatttagc aaaatgtgaa ttcggccaag ctaccgttac 5940 atatctggga aaggaagttg gtaaaggtca agttcgtcca gttgaagcta aggtcactgc 6000 catctctgag tttcccatcc caaaaaccag acgcgaactg cgcaggtttc ttggcatgtc 6060 tggctattac cgtagtttta atttctccaa ctctcccata tc 6102 // ID DNA-3-1_DR repbase; DNA; ZEB; 2005 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 3-bp; KW DNA-3-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2005 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1223-1223 (2008). XX DR [1] (Consensus) XX CC The members are ~97% identical to the consensus. TIR is 40 bp CC long, and TSD is 3 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2005 BP; 660 A; 354 C; 365 G; 626 T; 0 other; agccgctttt ccactatcgt gctgaatcgt tctttaaaca gaactgttcc attccgtgcc 60 aaaccgggcc agacagcacg gatacggttt cattttccac tgtcgtgctg cgacaaagct 120 ctttaaaaca ttacacaaaa cgcatcagta gttgccttgg taacgcaatg taaacggctc 180 cctttgttca attaaagtta aataagagcc gaaactagtt tttttttttt ttttcataaa 240 agcaaaagca accatgtgac taaacactgt tatgccgctc tatcaaatag gggtgctaaa 300 tataggtaca tgtgcaccct tatttattta tatgttgcgc agctctggct aaattttatt 360 tggagggaaa attaccaaaa tgtaacgtga tgattgaaaa taattggagg ggaaataaat 420 aatatttaaa atctctgaac ataacagaaa aataaacaat gataagacaa caataatagt 480 gcatttgcat tatagtggca gaaatataat tttggacagg ccttgtcaaa ggtgagcggg 540 cactttcact aataaaaaca cagcacataa attatttcat ttttaacaat taattaaatt 600 acactgcgtt tttatatcaa aagttgataa tgatataaaa tatactacat tttgatggta 660 aaatgaaacc aattaagcga tggctgttat ttgttttact attcaaacat taaataacga 720 atgcagcaag tgaaataaaa aacgaatatg aaacaacaca tatatagcag gatttaaagc 780 atataaagca ctattatatt tagcaaaatg ctgctctatt atctttttat tttgatcatt 840 ttatttttat tattataatt attttttgtg ttcataatta actaacaagc tgtaggcgaa 900 ttattgctag acgttttaga aaatatttgc gtagctatat taaagatcac caagaacaaa 960 aaaacaaaaa cacagaaaca tgttacatgc ctcataaatg tcatataact tatgtataac 1020 cgaatcatat aattgcctat agctgaaatg atgaaaattg ctcattttat ttttataaaa 1080 catttgtcaa aacgtattga tattttttga gtaaaagcaa tactgacatt taagaatcca 1140 tatttattta attcattcat tcttttgact tgtttattag tgcatatgct ataatataaa 1200 tcatacaaca gtaccttgga cagttacagt gccgctccgt aaagctttca agcaaagtct 1260 tttggacggc cgacattaca acacgaacac gccgtccttc aggtggtctc gccggtatta 1320 acaagctgtc ctggctttga attaaaaagt gcaattattt cctcgttgtt aaaagagtct 1380 gcaagttcgt gttccatgga tatagtgatg ttgttgtgga atttggacat cgacgatgca 1440 aagcgtgact ttattcgctt tattcgatgc gcagcaggaa aaaaaagtca gcagcgcaag 1500 tcgctggcat gtctctcccg gatcagagcg agttttgtgc tgtgaaatac agcgctgcgt 1560 gcaataaact cacattcagg caggcgaagt aggtctgtat tcaagtcaat aaaataataa 1620 ctttaatgaa taaaatatct tcgtaagaat gtgatgagtt attcacttct ctgccagaga 1680 ctgctctatg gttttaaaac ccttttcaaa ctgtatcgtt tagacctgga tcattatttt 1740 atattaaatg tactgtagtt tattgaggat gtctgatgac tagcgcgctt ctgctgagcc 1800 agctgtggta gcttagcaac cgagtcgcga cgtcaacaca cagaatctgt cagcacagtt 1860 cagcacggtt gtctaaccgt gccgagaatt ccgggccgag aacggtttgt aatcgtgccg 1920 cgccgttcca ggctcagtgg agaaacaacc gtaaccgtac cgaagtgttc ttagaacggt 1980 ttggcacgat agtggaaaag cagct 2005 // ID Gypsy-27-LTR_DR repbase; DNA; ZEB; 1449 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-27_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-27-I_DR; Gypsy-27-LTR_DR; Gypsy-27_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1449 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-27_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 28-28 (2005). XX DR [1] (Consensus) XX CC Gypsy-27-LTR_DR is a long terminal repeat of the Gypsy-27_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-27-I_DR. XX SQ Sequence 1449 BP; 339 A; 207 C; 343 G; 560 T; 0 other; tgtgacgatc gtcattagaa ttttccttat tattatatga gtgacgtctg attgggttac 60 tgctgtcagc tgatgcaacg ttatttagag cgccttggat gggtttctcg ctctctctct 120 cattcattcc ttatcacgca gtcagcatgc aggtgattcc gggtcactga gaggtcatgc 180 tgacggagtg tgtgtgtgtg tgttgagagc gcactgcgta cttgttgctt tgagtccgtt 240 ttttgtagct caatatgtgt tttcgtgctg cgtggtcttg ctctgaggta gtggcagtgt 300 tctttgaagt tggtcacggc atcgaaagtg cgagtatgtt tttgatgcgg acgttgaagc 360 ggcaacattt atgaatggaa gagtttatga ttttggcgga ggttgtattt gcgatgcatg 420 gtgaacgtat tatgcgctcg ggtttgtttg tggagtgagg cgcgggttta ttttgttttt 480 tggcgctcga gttgcctatt tgttttgggt tttttctttc tttttgtttg aatgtttgat 540 cactatcgtg atcgggattt gtgttgcagt tttctgggcc cagtagcatc gacgggtgat 600 cgggtaagca aatgtttctt ctttgattta tattgttttg gttaacttgt atatgtgtat 660 gtatgtccaa aagatttaat gagatggtag agtaaccata tgaatgtgca ttgcgttcac 720 gattgcttca acgtgcacaa atggaaaatc tactttagcg tatacgaatt gagagacatt 780 tcaagtgcat tgattacccg tttacttcag cgtgcacaaa tggaaaatct actttagcgt 840 ttacgaactg agagacattt cagagtgcat taattacccg attacttcag cgtgcacaaa 900 tggaaaatct actttagcgt atacgaacta agagacattt caagtgcatt gaatgtgcaa 960 ttgattcagc gtgcacaaat ggaatatcta ctatgtgtaa agtgtacaac tgagttctat 1020 aagttcttga atgtgtgatt gatttgattt gaacttgaaa tattatgttt gtgggaattt 1080 gattaatatt ggttgttgta tatttagaag tggtttattt gatttatttc tttatggttg 1140 gttgatgtat cacccgttga tggttttgtt tgtaaaaggt aattaaaagg tattgtgcct 1200 gggcctggaa atttaagttt gtttggttta tggatttact tacaatttgt tgaagttaaa 1260 gtttgttctt tttgtttttt tctttattaa ttcattgatc tgttgtatga ttgaatatct 1320 gatttggttt tattaaatca tgcatctgtt gaaacttaaa ttttcattta attgaattat 1380 ttaagaaact gtagtggggt ctttcaaccc ctggcgccca acccacttta ttagccaaat 1440 accgctaca 1449 // ID DNA-2-10_DR repbase; DNA; ZEB; 1227 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1227 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1701-1701 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1227 BP; 432 A; 162 C; 193 G; 438 T; 2 other; cactctcaga aaaaaaggta cacggtggta caaataatgt tccttaagga acagttttgt 60 acctttgtaa aagttgtacc ttatatggta cagaaaagac ctcttatgat actgttctgt 120 accttttatt ggtacaattg aaatgaaggt acacaattgt tctcataaaa aaggtacata 180 agtgttggac aactccttta ttacaatatc tatgaaaata aatattactg gaaaggcaaa 240 gtgattttat gaataatttc catattaaaa catgaatcat catttaaaag catatgttta 300 aagaaactca taaacattaa ttattaagtt ctataataca aaacaactgg taaaacaaaa 360 ctataggtat tgtaaactaa acatttaagg catttttaat aaacatttgg taagcatttt 420 ctgataaata cattttcatt attgatatct gcaccttttg gcgtttgctt tccactacgc 480 acgtgagctg gcaaaagtcc tgcatatgca aagtgttcat gcagacattt tagtattgta 540 aarctaatca aacattgtgc atatctcrtt acaatacttt tgcataattc tatttctatt 600 ttaaaattaa gtaaattaaa tgaactttta agtgattaca aaggtattgt gtgaaaattg 660 gagaaaaaag tttttatatt gtacatggga gaatgtattt ctgtgttttt ataacatttt 720 tgtgaaaagt gttgtgaaat gtttagcaaa aaatagatct tttgatttat gttaaagtat 780 tttttattct ttattgtaaa tggaaaaggt gcatttacac aaaaagaaac ttaaaaaaaa 840 aacattgttt aaaaatgtat ttgatgtttt atatttactg aaaataaatt gacacatttt 900 ggatatagca aaatgccttt aaatagttct tgaaggaaca catttgacac tgttaaggta 960 caaagtgtct tgtcactgta gtggtacctt catggatcag atttgtacct ttgatgaagg 1020 tacaatattg tatctgcagg ttttaaaaac caaaggtaca ttttggtttc ccatggtaca 1080 aatcatgtcc ttaaaggtac aaactgcaat ggtacaaatt tgtttctttg tcttaaggta 1140 caattctgtt ccataaaaag gtactgcccc agtgacaagg gtttgtacct tctttggtac 1200 aacattgtac cattttttct gagagtg 1227 // ID DIRS-4N2_DR repbase; DNA; ZEB; 5990 BP. XX AC . XX DT 12-FEB-2009 (Rel. 14.02, Created) DT 12-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; phage integrase; DIRS-4_DR; KW DIRS-4N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5990 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 9(2), 541-541 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5990 BP; 1493 A; 1939 C; 876 G; 1682 T; 0 other; gtgtggttta ttcataaact aatttcgaga ggatcacgtg cttataatca acacggctgg 60 ctccttgtta gctccgtaat cagccctatt agatgattac ggaagcatta taaataacct 120 gagttttcca ctccagttat cttcgtcttg aagctcgctt cccgctacaa aacgctccag 180 catcgcacca tttaacctat cgatacggaa gaacacacaa caacaacaac aacaacaaca 240 acaacaacaa caacaaacaa caactccagc cagagctcca ctctccctga gaattcagcc 300 gcagcctcgc caaaatcgag cctgaattcc cgcaaaaacg ccggtgctgc ccaccagctc 360 ttccatctcc acttcaaact gcctcacgca aacgcagtcc acaactccac ctaaactctc 420 taggcgtcga cgatccaaaa catcgcaaaa gcctaactca ctgcttccta gcggtgcacc 480 ctcggtaacg tatagtacgc ttttaagggg aagcatcgcg gaaacaacta atctccatca 540 aagactaaac ctatttacaa atcttttaaa aacaagactt gcgtgattag aagattcata 600 agaactcacc cttcaagccc gtccacaaat ccataagtaa atcactgcaa agatcacgac 660 attaatccac gacatattga tacgtcgatg gaccgcgctg ttatcatcca aactagtcgc 720 tggtgtgccg cagtgtcttc acacaacact ggtaaacaaa cttggcctcc gatcttgatt 780 gacgccccat cgagccaata gcgaaggagc agcgccctca tccaatgcgc ttttaaaact 840 cgatatggtt ccgccctctc aacaattcat gaatgaaccc agccataaaa gactcgtgaa 900 tgaactgaga cagacatgct tgcaaattaa gttaagatta agaattttag aagttaagat 960 tcctaacatg tatttaaaca aattaagtaa agcccataac aaacattcca ttcatacagt 1020 ttccactaag tagtgaatta agcgtttgat gttgggattt gatataataa ataaataaat 1080 tgacaaataa taataataat aataaatgaa tgaattaata ataataataa taataataat 1140 aataaataat aataaaataa taaataaata aaataaataa aaacctctgc cagatctgat 1200 agcatttaac ttctgtaaag actactaccc aaccagtgag aatttaacac aaaagtccaa 1260 tcgtaatacg tcaatgaccc atacttgcat atacagtgcc atatgtaact tataatggca 1320 cgccggctgg aggtgaaaat ggtaaagaaa ctaacttttc gcttgactgc atgaatagga 1380 catttaaata aatcttcaag aagaatttct aggaccacca gacaagctac accccatatc 1440 cccagctctg catccacctg ctatcctgtg ggggaacaaa tgcccacctc gatcatcctc 1500 gcagactgcg acaacgaagc tcacaattgt ccatgctctt attaagatgc cttgcattag 1560 atatcagcta aaacaatttt gtcatgtttg ctatacatgt atcatgttgt aaattaaatt 1620 tctgattttt tttttcttgt ttctcattcc agacattagg acagccggcc cagaagctga 1680 ccttcatcca acctattaca ctccacatcc attccaaatg caatcacaat ctacacaagg 1740 atgcatttcc aatcatacag cccacatcca taatgccata ttcacactag tcttcgggtt 1800 ttgagattct ctgaattacc atcagttgca atcccacaat ccacctcacc atatctgatc 1860 tgactggagg aacactatct ctctccatta agcaagcaaa ccgattattt cggcaaagca 1920 tgtcctatat acattttcga catcccttcc cccactcaca caattcaaat gttttaaaag 1980 tgttactaag tgtaagcaca acttaaacta tcccatacac ccatacattt tcactcacca 2040 cagttccaac ttatttaggg gttgcctaaa tttttctatt actgcagcag ggatgctcta 2100 agcaagctct atacacttac aagtccaata acactcgtca cactgccctg cactaataca 2160 gcagtaaatg cttctgcagg agcacacatt gtctccatac tgatcctctg cagcagcaga 2220 atcgctccac tgtgccatac tcacccattg cacctgtagc taagccctta ttcacttcct 2280 ctctacatat cactactaac agtatttaat attcttgcga gagcattcaa aactctccaa 2340 taccgatcac tactgcacct atccaatagc agtagcgctc cgctgagcta cactcaccat 2400 agcatcacaa caaccgcagt gacactctac ctgagcccct acttactttc atctccacac 2460 atcacttctg atatttaata tttttgtggg agcacctata atgctccaat actgactacc 2520 accgtacctg caacagcaga gatgctcagc caagccttat tctcccattg caccactgtc 2580 acagcgcagc tcctttcctt cccactcccc aacactgcac agcagttata cattttcggg 2640 aaagcgtacc cgcctcccga cattgacttc caccacacct ctgcaacagc agagaagctg 2700 cagagcttct tttccccccc cgcttcactg tcgcagcagt gacgctctgc ctgagctcac 2760 atacacttcc attcaacata ccactattga cacttcacaa cagtttaaac gcctccataa 2820 aagcgtttac tcgttattat cgtctagctc aattacacgt aacagcctca gcgtgatgca 2880 tgtatacatt tggagaggat atcaattcat cctcacagct cgcactcacg acatccacta 2940 tcttgaatac aaggacagaa atttaattct gcctgttccc caaccccccc cccccatccc 3000 cccccccctc tttcagcgct cccagttcag caaataatct ccccctatct ctctcccaag 3060 agtgcctgaa ccccgacccc aacaggagca aaactttttt ccccaaggct accagtataa 3120 aatcttttca gtcttcactt ccttcctgcg tcgagtttct ccgcacctcc ttcttctcct 3180 ttctagcgtt gagttcctcc gcacctcctt tccttccttc aagcgtcgaa tgcttccgct 3240 ctttcctcca tcccctccca ctctcccccc accacctcaa catcttttac tccctacccc 3300 tttccagcgt cgagtttctc cgctactttt ctcttttcag cgtcgagttg ctccgctact 3360 atctcttcct tctagtgttg ttcttccgct actctcatct cttcagccta agtatcacca 3420 ttcaaggcct gactcccccg gagtcaatca attaccccaa accccccccc ccctccacaa 3480 tcgacaactt tcttgttttc tagcgttgag ttcctccgca tctcctattc tatgttctaa 3540 agcgttgagt tcctccgcac ctcctttcct tccttcaagc gtcgaatgct tccgctcttt 3600 cctccatccc ctcccactct ccccccacca cctcaacatc ttttactccc tacccccttt 3660 ccagcgtcga gtttctccgc tacttttctc ttttcagcgt cgagttgctc cgctactatc 3720 tcttccttct agtgttgttc ttccgctact ctcatctctt cagcctaagt atcaccattc 3780 aaggcctgac tcccccggag tcaatcaatt accccaaacc cccccccccc ccccctccac 3840 aatcgacaac tttcttgttt tctagcgttg agttcctccg catctcctat tctatgttct 3900 aaagcgttga gttcctccgc acctcctttc cttccttcaa gcgtcgaatg cttccgctct 3960 ttcctccatc ccctcccact ctccccccac ctcctcaata tcttttactc cctaccccct 4020 ttccagcgtc gagtttctcc gcttcttttc tttttcagcg tcgagttgct ccgctactat 4080 cacttccttc tagcgttgag tattccgcta ctcttacctt tccagtctaa gcttagactc 4140 ccccggagtc ctgcccagcc cacctcctat aggagtctcc accaccccct ccccatttag 4200 actcctgtag gagcctaatc tttcagctct aactcccacg gagttacccc aagagctaag 4260 actcccacgg agtcccccct cccctctctc tctcgccccg gccaaatacg catctttcct 4320 accttctagc gttgagttcc tccgcatctc ctgtcctata ttccctagcg ttgagttcct 4380 ccgcatctct ttctcttcct tccagcgtcg aatgcttccg ctctttcctc tcccccctct 4440 tacaacttct ctcctctccc cacccttaac tcccaatccc cctttcagcg ttgagtttct 4500 ccgctacttt ctttcttcag cgttgagttt ctccgctact ttactatttt cagcattgag 4560 ttcctccgct actctttcta tatttcaaaa acatcactca ccaatctgac tcccaaggag 4620 tcttcccacg ttctgactcc cccggagtcc tgctcagccc accacctaca ggagtctcca 4680 cctctccctc ccttcccaga ctcctgtagt agtccaatcc tacagctcta actcccacgg 4740 agttgaccca gagctctgat ccccataggt ttcttttctt tacctttcca gcgtcgagtt 4800 cctccgcatc tcatgctctc ttcttcccag cgttgagttc ctccgcatcc tctttacttc 4860 cttccagcgt cgaatgcttc cgctcttctc tccaccccct ctacaacccc tctcccctac 4920 ccactacatt cccctttccc agcgttgagt tcctccgcta cttttcttct ttcagcgttg 4980 agttactccg ctactttcct atttctagca ccaaatccct ctgctactct cctttcttcc 5040 gttaggctcc ccaaccctgc ctccccgaac ccaactccaa cagagtcccc agcctctcct 5100 aagaatcagt ttcaactttt ctcctgcagt aacctatctc acttcaattt acattcagca 5160 gccggatatg gcactgacct ctcgcctttt ggggggcact tcaatacgcg gctgctgtcc 5220 cgagatgaaa acctttgcct tttggggagt tctcgagacc tacctgacct cgaattcccc 5280 tctcgccctt ctaacgggag ggagccccgg gctcgaggat attacgagct cagggctctc 5340 tcccgggaca gcatgccaaa tacgctttat tgattatcag ctaagtgtga actcttgaag 5400 tgtggtttat tcataaacta atttcgagag gatcacgtgc ttataatcaa cacggctggc 5460 tccttgttag ctccgtaatc agccctatta gatgattacg gaagcattat aaataacctg 5520 agttttccac tccagttatc ttcgtcttga agctcccccc cttccacccc tacatcctcc 5580 cactttttct gatcgggcga cacggtggcc cagtggctag cactgttgcc tcacagcaag 5640 aacaccgcta gtccaaccca tcgggctggt tggtgtttct gtgtggagtt tgcatgttct 5700 ccccgtgttc gcgtgggttt tccccgggtt ccccggtttc ctcccaccgt ccaaaaacat 5760 aaaccatagc caatcgacta aaacaaatta tcacccaata caacctgagt ttacacttct 5820 cacggtgaca agcaggggag ttctcgagac ctacctgacc tcgaattccc ctctcgccct 5880 tctaacggga gggagccccg ggctcgagga tattacgagc tcagggctct ctcccgggac 5940 agcatgccaa atacgcttta ttgattatca gctaagtgtg aactcttgaa 5990 // ID DNA-8-4_DR repbase; DNA; ZEB; 2568 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE DNA-8-4_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-8-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2568 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1719-1719 (2008). XX DR [1] (Consensus) XX CC TSD =8 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2568 BP; 823 A; 456 C; 459 G; 826 T; 4 other; tagggttgtg acggtgagga aatttcccca ccggttaatc gacatgtgac aacaccggta 60 ataccggtat caccgtggga gtgggcgttt cttcatttcc tctttttttt ctgcttttaa 120 atagcttata aatgcgtgcg tataatactt tcactttcag ttttgcggga gttgccaatt 180 cggcgtcaat tgtttgaatg gacgaaaagg aaactacaaa aaaaaaatcc ctgttattaa 240 acaaaacata acttttatta acataacgaa taaaacacaa caatgctaca ggctgaaaaa 300 actgtggaag ttggaaccat acaaataaca accataaacg cctatattat atattaacag 360 cttagtcctt atgaacaatc cgtttataaa atacattatt aacgaatcta taggctattg 420 aagaataaac cgaatatgaa atatgatcct tgcataatga tcacaactaa acaagckagc 480 actcatttta tatataaact ataaataaaa aaaaaatgtt aagaattgga acattatgta 540 aaaaatrcta aagactacag gctaaataaa agcgcctcgt cagagctaaa catcaccgca 600 cacgtgtcag tttacagctt ctggaaagat cagacaacct atacagatca cacaatctca 660 ataaacaaca tgtaaatgta taaatatttc gataagtatt gttatggaat atttataact 720 atatgatctt gctttttggt tatatcgata ccagctttga tctctcttac aagcggcagt 780 tacaggcgcw gcataagggc gtctcacact gtcagtggcg gaatacacac agcactggca 840 agacgtttta tttttagttt tatttgttgc agatttgcgc gttgttcctt ttaaaaaaaa 900 accttttatt ttttaaatct ctcaagttaa tgttgtctat cgagtaacta tcgagtccct 960 cggattggag ttcagtgcgc agtaagacta ttgaccaaca ccgccctgta agtttaaaca 1020 ctacatagac tgtaggtcca cagctctggt gtggtatgaa ttacatgttt taaatgacag 1080 cgagacatgc tacgtttagt ttttattgta aaatatacat tataacccag tacacagtaa 1140 aataatatgt tttgaatggc ttgtctattg acggcgtata gtagccgtta aacttaaata 1200 ggctgagcat attttcatta atattaaaac acattaatac aaattagcca ccctttgatt 1260 tttttttttg tttacgtata tttttatcaa acgaaaaatg caatgaattg ttaattttgg 1320 ttttcatttt attagaaata ataggcctat aatttatagg ctacacaact catatctttg 1380 gctccaattg cgtttttaca gatatcctac caatagactt ttttttggct caaagacatt 1440 tatgcctrtt aatttcagtg gtgtattttg acagatttcg taaaggcttc agctattcta 1500 cctgtcagct ggcacctgcc taagttttgc aacttgactt gtgttgcggc tcgatgtctg 1560 aatgagagag agaagagaga gagagagaga gagagagaga gagagagatg aaatcagtcc 1620 tcagattcga tgtgtggccg cacgtctcag ttactttatt gcaattttta atattggctc 1680 atctaaataa acttttcata tattaaaagc cgaaatattg ccagacagaa gaagcaactt 1740 tcggtttttg aagagagcgg cctcagttat aaaagatgcg cgcacacaca ggatagtcga 1800 cactcttgtc ggcctattca ttatggattt ttttaacatt aaacacataa taatcaagtg 1860 gaaaaaagaa cagctgaact tcggaaaaag tgctcacggc tgttgccacg atggtttgtt 1920 tctctgcagt tggattgtca atggcgcggc attgctaaat cactcgtttt aataaaaaaa 1980 taagataatt tattattaaa cattgtgaga tgatggtcgc gtgatttttt tttaaaatga 2040 gagtatgcgt tgcgtattta tggccattga tctatccgcg acccacccat agttaaacat 2100 caagttgaag gtgcgtgtat aaattagtta tgtttcctcc agatactgcc acctttgtca 2160 ggcaaagttt gcagattgcc tcattaacgt cttcaggttc tctttttgca tttggtttga 2220 acccaaaata ttgccaaaca ggcgcttttg cattgcgttt tgacaataaa tcgtccgcca 2280 tcctctttct gtaaattcga ctcctcttgt tctcctcacg tggtaaatga aatatgacgc 2340 actgtagcta tgtttagatt ttaattatag agcatgctct cgttttaagt caattattta 2400 taattatatt ttccatttat ttcaaataaa tgtttaataa aaaaaaataa aaaacgaata 2460 cttaaaaaaa acaaatgtgg ggacggtgtc acggtggaaa agtgatgtca ccggtgttgc 2520 gtcttaacac cggtatcacc gtcaacaccg tctatcgtgg caagccta 2568 // ID CR1-5_DR repbase; DNA; ZEB; 2227 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-5_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2227 RA Jurka J. and Bao W.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1200-1200 (2008). XX DR [1] (Consensus) XX CC The consensus was built from copies which are ~90% identical to CC it. The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(3..281,285..974,997..1383) FT /product="CR1-5_DRp" FT /translation="ARSYTXQIIKQLFSEVSEPVLNILNSSLLLGYVPKTF FT KLAXIKPLIKKTQLDPKELVNYRPISNLPFLSKILEKVVSAQLCSFLQRNN FT IYEKFSGFRAFHSTETALLRVTNDLLLLSDRGCISLLVLLDLSAAFDTIDH FT NILLDRLENYVGITGTALAWFRSYLSEXYQFVSVNEEASYRSKVHCGVPQG FT SVLGPLLFTLYMLPLGDIIKKHGVSFHCYADDTQLYISSQPDNLYEFTKLT FT DCIADIKNWMTNNFLLLNSDKTEVLLIGPKNLTCKNLEHCLTLDGFLVQSS FT SSVKNLGVIFDSNLSFEGQISSICKTAFFSSHMLSMSDAEKLIHAFMTSXL FT DYCNALLGGCPAGLINKLQLVQNAAARVLSRTKKYDHITPVLSSLHWLPIK FT YCINFKILLTTYKALNGLAPQYLRELLVYYSPSRPLRSINSGQLIIYSQNI FT KINCRR*" XX SQ Sequence 2227 BP; 647 A; 478 C; 390 G; 701 T; 11 other; atgctagatc ctataccrac caaattataa aacaattgtt ttcagaagtt tctgaacctg 60 ttcttaacat tttgaattca tccctacttt taggatacgt tcctaaaact tttaaactag 120 ctrtcattaa gcctcttatt aaaaaaacac agcttgatcc caaagaactw gtaaattata 180 gacccatttc aaacctgcct tttctatcaa agatactaga aaaagtygta tctgctcaac 240 tatgttcctt cctacaaaga aacaatattt atgaaaaatt ttaatctgga tttagagcat 300 ttcatagcac agaaacagca ctcctaagag ttacaaatga cctcctcctg ttgtccgatc 360 gtggttgcat ttctctttta gtgttattgg acctcagtgc tgcatttgat acaatcgatc 420 ataatattct ccttgatagg cttgaaaatt atgttggtat tacaggaact gcattggcct 480 ggttcaggtc ttatctatct gaacrttatc aatttgtatc agttaatgaa gaggcatcat 540 acagatcaaa agtacactgt ggggtaccac aaggctcagt tcttggtccc ttactgttca 600 cactatatat gctaccgcta ggtgatatca ttaaaaaaca tggagttagc ttccactgtt 660 atgctgatga tacacaactt tatatctcct ctcaacctga taacttgtat gaatttacga 720 aattaacaga ttgtatagct gatattaaaa attggatgac taataacttc ttattactaa 780 attcagataa aactgaggtc ttgcttattg gacccaaaaa cctcacatgc aaaaacctag 840 aacactgttt aacccttgat ggattcttag ttcagtcctc atcttcagtc aaaaatttgg 900 gggtgatttt tgatagcaac ctgtcatttg aaggccagat ttcaagcatc tgtaaaactg 960 catttttttc atcttaaaaa tgttgccaaa ctttgacata tgctatcaat gtctgatgcg 1020 gaaaagctta ttcatgcttt catgacctct ygattagatt attgtaatgc gttactaggt 1080 ggttgccctg ctggcctaat taacaaactt cagctagttc aaaatgcagc tgctagagtg 1140 ctttctagaa caaagaaata tgatcatatt actccagttc tttcatcatt gcattggctc 1200 cctattaaat attgtataaa ttttaaaatt ttattgacta cctacaaagc cctgaacggc 1260 ttagctcccc agtatttgag ggagctcctg gtgtattata gcccctcacg tccactacgc 1320 tcaattaatt ctggacaatt gattatttat tcccagaata tcaaaatcaa ctgtaggcgg 1380 tagatctttc tcatatctgg cacctaaact ctggaacagc cttcctagca cagttcggga 1440 agcagacaca ctctgtcagt ttaaaactag attaaagaca catctctttg cattagcata 1500 cacataaaac acaaatgctg ttgaaatcca aatcctctaa aggattgtta gtctgcatta 1560 tttagggcaa ccggagccag gaacacttcc cwaaagacat tataatttga acggcatctg 1620 cacttatgtt agtctttttr tttttattat ttccgaggtt tccataatcc tggaccaggc 1680 cgtatcctga gcagctgctg tggtggtcat ggaggagtgg agagcatgag actgattcct 1740 gtaagacccc agtgacagac gagtcctcgc attgatcctg aagggccagc ctgtacacca 1800 gccggtgacc tctcccacct gcagcttctc cacgatggac gtccagcgyt ctccagcctc 1860 cggcgcctag actgcagctt tgcacaagan gtttggccat aggagaaatg gtcgtgccca 1920 actgagcctg gtttctctca aggttttttc cttcactttc gtcaattggt gaagtttgtt 1980 cctcgccgct gtcgccactg gcttgcatgg ttcgggactt gtggagctgc gcatcgatgg 2040 atttgctctt cagtgtttgg actttcagca gtgaaaatta aaccacactg aactgaacta 2100 aactgaactt caactctgaa aactgractg acacagtttc aatttactat aatcttctat 2160 gttaagctgc tttgacacaa tctacattgt aaaagcgcta tagaaataaa gatgaattga 2220 attgaat 2227 // ID DNA-9-1_DR repbase; DNA; ZEB; 323 BP. XX AC . XX DT 12-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 9-bp; DNA-9-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-323 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2153-2153 (2008). XX DR [1] (Consensus) XX CC TIR is ~69-bp long. TSD is 9-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 323 BP; 93 A; 57 C; 67 G; 106 T; 0 other; cgccaacctg atttcaccaa gtaggacatg ttcctggatt aacatctatg ttgatcctgg 60 aacaacattc caatcagcca atcagaatta agggatatgt ttaccgttta tgttaagttt 120 aggcttacgg ttaggtttat gcacttctac atgattgtta tccaactatt attcccctct 180 gattttggga ataatttaca gttatggttg ggtttagggg tagggaatag gtatggatta 240 cattttcgaa caaaaatgat gttccaggat caacatagat gttaatccag gatcgcatcg 300 tacttggcaa aatcatgacg tgc 323 // ID DNA-CCGG-1_DR repbase; DNA; ZEB; 1834 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW DNA-CCGG-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1834 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2155-2155 (2008). XX DR [1] (Consensus) XX CC TSD is CCGG specific. TIR is ~700-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1834 BP; 589 A; 312 C; 319 G; 613 T; 1 other; tgttacggtt taactggtga ccgtgttaac tggtgtccct ttccagatgt aagctattca 60 cgtttgcaga ttggcagatt cgtcactttt tcacagcaac aaaacatgtc cataccaaat 120 aaagattctt attacttggt gtcagtgtca ttgtgtgcat tattgatttt aatatctgtg 180 tagcagaaca gtatataacc aaagtaaata aaaaaaataa ttagaattta catgtcttta 240 acaggattcg aacccgtgta gtgaaaaggt ttagcacacg tatcaatcaa cttatctaac 300 tgagctactc agaacttcaa acgaagcggt ctgttttaat atctttgtca attaaaacat 360 gccttgctgt ggttgtggac ccgtgttaac atgtttccat cgatgttaac atgtttacat 420 gttgtttagt tacatgtact cgcgttaacg tgtttctaca aactctcccg ctgatagaat 480 agctctctgt gactaaacga tgcttcatca gttgctcaaa ccatgttttt tcttagttct 540 agactcatgt ctacctgtgc ctttggagtc gtgttaactc gtgacccatg ctcatacatg 600 taacgtaaat aaagatttta tccaaaaaat gaatacaatg acgatattat tatgcaacat 660 attatgtcta atatatataa tatatatttc atatataata ttatgcaaca gaatattaat 720 tgaaaatcgc aacatttcaa tgtatgatct tggtcacatc tgtatcaagg tacttatgaa 780 catgatttgg ctacatacga taaaattact aaaagctgct aagcctwaca tttcatcagt 840 gcttgatagc tcaattagtt agcacgtatg ttcctttcat gcgtctcttt tgttagtagg 900 ttcgaatcca catacgagtt ttgtttatgt agcctgattt tttttaattc tctttaaaat 960 aaaatggggg gggactgcga tttataacgt atgcttattt agcggaacat cgtagtcatt 1020 tagagctatt ctgagaaaat atcagctcgt gtaactttgt agacacagac taacacgagt 1080 caatgggact attatattct gattgcataa tattctaata ttctgttgca taatattata 1140 tatgaaatat atattatata tattagacat aatatgttgc ataataatat cgtcattgta 1200 ttcatttttt ggataaaatc tttatttacg ttacatgtat gagcatgggt cacgagttaa 1260 cacgactcca aaggcacagg tagacatgag tctagaacta agaaaaaaca tggtttgagc 1320 aactgatgaa gcatcgttta gtcacagaga gctattctat cagcgggaga gtttgtagaa 1380 acacgttaac gcgagtacat gtaactaaac aacatgtaaa catgttaaca tcgatggaaa 1440 catgttaaca cgggtccaca accacagcaa ggcatgtttt aattgacaaa gatattaaaa 1500 cagaccgctt cgtttgaagt tctgagtagc tcagttagat aagttgattg atacgtgtgc 1560 taaacctttt cactacacgg gttcgaatcc tgttaaagac atgtaaattc taattatttt 1620 ttttgattta ctttggttat atactgttct gctacacaga tattaaaatc aataatgcac 1680 acaatgacac tgacaccaag taataagaat ctttatttgg tatggacatg ttttgttgct 1740 gtgaaaaagt gacgaatctg ccaatctgca aacgtgaata gcttacatct ggaaagggac 1800 accagttaac acggtcacca gttaaaccgt aaca 1834 // ID DNA-X-2_DR repbase; DNA; ZEB; 816 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE This is a nonautonomous DNA transposon, - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-816 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1251-1251 (2008). XX DR [1] (Consensus) XX CC The consensus is built from members which are ~90% identical to CC the consensus. The TSD is unknown. The terminal TIR is very CC short, 4-bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 816 BP; 217 A; 185 C; 163 G; 250 T; 1 other; caaagtccct ttaaggcaag tcatttcact cggcggccat ctttgaaayg cctctcgggc 60 agtatgctcg ggcattctgt ctgaatgggg aaacatcaaa ttctccaaaa ctgtttgcca 120 agcttacgat tacattacat atttggaatc accaataaaa ttaaacaaca actgtctcat 180 aagtttcgtt tctaaacgtt cgaatcaaac aaaatcggca tattttcagg ttgcccaagc 240 taatgcgcat gcgcactcga aaggaacgag atcacgacac caacctcatt tatggccgtt 300 ctacacatta tcatcctctg gataatgatc gtctgatcgc gctgctcttc tgaagtaatc 360 cacaacttgg tcttgatggc gaatctcctc cagaaatgac agcgactctt tgtttacaag 420 tttgtgggcg tttgagtagt tgctgtcatg tgatgtgcgt ttgacaggac ggactgtacc 480 tcgcgttcgt ttcatacaga ttacaaaacc aaaaaacttt tgttttcaag tgcacttggt 540 tcatttaaaa gtacagattt caagctttat gtggatatat ttcttatgtc tgtgaagcaa 600 gtattcgctg agattccagt gtgtttgttg accaccaaaa ctgtcgtaaa agcacacgtc 660 tggtccgagc ttctcccccg gagaaacgtc agtctatagc gatcgatgat tggctcctgt 720 actagtaggc ggggcttcat tcgccatatt gaccgttaca cttttcccca ttcaaaacta 780 tacgagtgac atgtcttgtg tattctatag tctttg 816 // ID CR1-17_DR repbase; DNA; ZEB; 3516 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-17_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-17_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3516 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1691-1691 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 5..3364 FT /product="CR1-17_DR_1p" FT /translation="MAPVFAACRLNPLQLFLRLILVLFFVAQHSEALLVYD FT RQTLLNIRNGMVNRLISEPSGSCTYNHPASLSLPAHLCRLPCILFRRRRRR FT KRGKRGGLLISLKVRLSDAGMNCRARFSSDLIPDFYRIIPYRTSDPPIQCL FT VSVFPTSCEASTRICSPRCRRRGVDLSNLCQLRRIPRENDHHQVPTRLTLV FT NVRSVLNKTFILNDFFTARDLDFLFLTETWAVDGDLSPFSELVPVNCTYFN FT NPRRSGRGGGLASIFKKTFNCQPVRADAFSSFELQMFQINERSGPVLCALI FT YRPPRSTKDFIKQFGDLLSSILPRYDNILILGDFNVHVCCPKDNLAKDFLN FT LIDSFNLTQHVQGPTHNQGHTLDCVLTYGLSICNVALDDMAFTDHKAVTFE FT ITFTHKISNSPIVFRRFRRFNVSSIDLFKDLYINEQHTTMLEDPPSYMDVD FT QLLELFNCSFSSILDFVAPEKTRTVKPRSEPWLNETTRASRQACRKAERKW FT KRDKLQVSYEIFKESLSNYQRSMKEAKRKYFSELISTHQNKXDVLFSCISS FT VINPGSNVVMEESCDTCEAFLNFFTEKISSIRSGIASFDIVFPEPDVSPAA FT LVSFTPVSLLSLTDTVGKIKLSSYPLDIFPSKFFKQAFEIIGPSILSIINR FT SLVSGVVPSQLKHAVVQPLLKKPSLDKSVLSNFRPISKLPFLSKILEKTVL FT TQLQSXLELNSIHEVFQSGFKAYHSTESALLRVLNDILLFRDSGHSVILML FT LDLTAAFDTIDHEILISRLEXLVGIQGSALKWFRSYLSDRSFCVNVGQTFS FT SSRPLTCGVPQGSILGPILFSLYILPLGSIFRKYGVAFHFYADDTQIYVPF FT KSRDKKGLDSLMRCLVEVKSWLSSNFLNFNEEKTELVWFGDPNPLHLSGFG FT ELSTYQKPVVRNLGFQFDSDLKFNKQVNTVVKSSFYHLRLVAKVKSFLSFN FT DLEKVIHAFILGRLDYCNSLYVGISQNALNRLQLVQNAAARLLTGTRKFEH FT ISPVLSSLGWLPVRSRIDFKLLIFVFKSLNNLAPSYLSDLLHVYNPGRSLR FT SSDEFILSVPRSRCKRSGDRAFAVVAPKLWNGLPPHIRHAPTLPVFKKRLK FT HHLCSLVFEPF*" XX SQ Sequence 3516 BP; 916 A; 694 C; 705 G; 1191 T; 10 other; caagatggcg ccagtgtttg cggcctgccg tctgaaccct ctacagttgt ttttaagatt 60 aatcttagtt ttattttttg tggctcaaca ttctgaggct ctgttggtct atgatcgcca 120 aacgctcttg aacattagaa atggtatggt aaaccggctg atcagcgagc cgagtggatc 180 ctgcacctac aatcaccctg cttcactgtc attaccagct cacctatgcc ggcttccatg 240 catcctcttt cggagaagac gccgcagaaa acgtggtaaa cgcggtggtt tgttaattag 300 tcttaaagtg cgtttgtcgg atgcagggat gaactgcagg gctcgtttct cctcggacct 360 cataccggat ttttacagga ttattcctta ccgcacatct gatcccccga tccagtgctt 420 ggtttctgtt tttccaacct cttgtgaagc aagcacgcgg atttgttcgc cgcgttgtcg 480 aaggcgagga gtggatttaa gtaatctttg ccagctacga cgaataccac gggaaaacga 540 tcaccaccag gtacctacaa gattaacttt ggtcaatgta agatccgttt tgaataagac 600 attcatatta aatgactttt tcactgccag agatttggat ttcttatttc tgactgaaac 660 atgggcagtt gatggggacc ttagtccatt ctctgaactt gtcccagtaa actgcactta 720 ttttaataat ccgaggcgct ctggcagagg gggaggtctt gcgtcgattt ttaaaaaaac 780 atttaattgc cagcccgtgc gtgctgatgc tttctccagc tttgagctac agatgtttca 840 gataaatgaa cgctcaggac ctgtcctatg tgcattaatt tatcgacctc ctaggtctac 900 taaagacttt attaaacagt ttggggactt gttgagcagt attttaccta gatatgataa 960 tattttaatt ttgggtgatt ttaatgttca tgtttgctgc ccaaaagaca atttagctaa 1020 ggatttctta aatcttattg actcttttaa tttgacacag catgtgcaag ggcccactca 1080 caaccagggg cacacattgg attgtgtttt gacatatggt ctatcaatct gcaatgttgc 1140 attagacgat atggcattta cagaccataa agcagtcact tttgagatca cttttaccca 1200 taaaataagc aactctccta ttgttttcag gcgatttcgt agatttaatg tttcttctat 1260 tgatctgttt aaagatttgt acattaatga acaacataca actatgctag aggaccctcc 1320 ctcttatatg gatgttgacc agttactgga attgttcaat tgctcttttt cttcaatctt 1380 agactttgtt gcccccgaaa aaactagaac tgttaagcct agatcagagc cttggctaaa 1440 tgagacaact cgggcatcta gacaggcctg cagaaaggct gaacgcaagt ggaagagaga 1500 taaactgcag gtgtcatatg agatctttaa ggaatctctc tctaattatc agcgttctat 1560 gaaggaggct aagaggaaat atttttctga attaatcagt actcatcaaa ataagsctga 1620 tgttttattc tcctgcataa gttctgtgat taaccctggg tctaatgttg ttatggagga 1680 gtcatgtgat acatgtgaag cttttttaaa cttctttacg gagaagattt cttcaattag 1740 gtctggratt gctagctttg atattgtttt tccggaaccg gatgtgtccc ctgctgcact 1800 tgttagtttc acacctgtct ctcttctatc tctgactgat acagtaggca agatcaaact 1860 tagcagctat ccattggata tttttccatc aaagtttttt aagcaagcct ttgagattat 1920 tggccctagt attttatcca ttataaatag gagtcttgtt tctggtgtgg tgccttctca 1980 gttgaaacat gccgtcgtgc aacctctttt gaaaaagcct agtttggata aatctgtgct 2040 gtccaatttt agacctattt ctaagctacc ctttttatct aagattttgg agaaaacagt 2100 tttaacacaa ctgcagagct wtctggagct yaacagtatt catgaagttt ttcaatcggg 2160 ttttaaggca tatcacagta ctgagtctgc actgctacgt gtgttgaatg acattctgtt 2220 atttagggac tcggggcact ctgtgatcct tatgcttctt gaccttactg cagcttttga 2280 cactatagay caygaaatcc ttatttctcg tcttgagara ttagtgggca ttcagggctc 2340 tgctcttaag tggtttagat cttatttgtc agacagaagc ttttgygtta atgttggtca 2400 gaccttttca tcctccagac ccctgacatg tggagttcca caagggtcta tacttggacc 2460 tattttattt tctttatata ttttaccttt aggatctatt tttcgtaaat atggggtggc 2520 ttttcatttt tatgctgatg acactcagat ttatgtccct tttaaaagta gggataaaaa 2580 ggggcttgac tcccttatga gatgcctagt cgaggttaag agctggcttt cgtcaaattt 2640 tttaaatttc aatgaagaaa aaactgagct ggtctggttt ggcgatccta accctctgca 2700 tttatcaggt tttggtgagc tttccactta tcaaaaacct gttgttagaa acctaggatt 2760 tcaattcgac agtgatctga aatttaacaa acaagtaaat acagttgtta aatccagttt 2820 ttaccattta cgacttgtgg caaaagtaaa gtcgtttctc tcttttaatg accttgagaa 2880 ggtgatccat gcttttattc tgggaagact tgactattgy aattctctgt atgtgggtat 2940 tagtcaaaat gccctaaaca gattgcaatt agtacaaaat gcagctgcga gacttctgac 3000 agggactcgc aagtttgagc acatatcccc agttctttcc tctctgggct ggctgcctgt 3060 taggtcaagr attgatttta aactattaat ctttgttttt aaatccttaa acaatctggc 3120 accatcatac ttatcagacc tgctacacgt atacaatcct ggtaggtcac tgaggtcttc 3180 tgatgaattt attctatctg taccaaggtc ccggtgtaag cgaagtgggg atcgggcttt 3240 tgctgttgtc gccccaaaac tctggaacgg gcttccgccc cacatcagac acgctccaac 3300 tctacctgtt tttaaaaagc gtttgaaaca ccatctttgt tccttagtat ttgagccttt 3360 ttaatgttga gggatttgcc ttatgtttta attgttattt atgtgatgta gatttcatgt 3420 ttagtattga ttgtgttcag cactttgggc aacgttgtgt tgtaaaaaat gtgctatata 3480 aataaactaa actaaactaa actaaacttt ataaat 3516 // ID DNA-6-N9_DR repbase; DNA; ZEB; 1159 BP. XX AC . XX DT 13-JAN-2009 (Rel. 14.01, Created) DT 13-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1159 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 10-10 (2009). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1159 BP; 368 A; 217 C; 223 G; 349 T; 2 other; tacggagacc cggaagggac gtgatggtgg ggggaaaaaa atgggtggaa gaaaaaaaat 60 tggtggaaga aaaaaataat tggttgggag aaaaaatgtt ttgcgttccc tcgcaaaact 120 gtttttccac aaacacttcc tgttcacttc atacatgtca accctcccgt ttttgccggg 180 attctcccgt attttaccat tctatcccac tatcatccta tcattaagta ttttcccata 240 tttctcatat atttttaatc ttgaaagcac gctgaaatta atataaaaat gtctgcattc 300 actttcttta cattaaagct gtgcgtcgat catcgccctt catatgcaac ctctgaatca 360 gtgaatgcat gtataagcgc tgactgacag acgcgctccg tatgataaac tgatcccaga 420 tcagcttctg tacacgccat ttaaagagga gcwaaagtgc aggacacttt gggattcggg 480 tgtgtgttta aacagacaaa tacactgaat aatatgtaaa catgtccgtc ctgcatgttt 540 tattttaaac aactaatttt ctcttaaatg agcacaaaca gttaaagtaa tggaatgctt 600 tcattataga tctgtgcatt cttacagtga cacctctgtt atcaaacaaa acaaaacgag 660 agattcattt gctgctcttc acttgtttga taacagaggt gtcactttaa atggcgtgta 720 cagaagctga tctgggatca gtttatcagt gcatatgaag ggcgatgatc gacgcacagc 780 tttaatggaa agaaagtgaa tgcagacatt tttatattaa tttcaacatg ctttcaagat 840 taaaaatacg tgagaaatat gggaaaatac ttaatgatag gatgatagtg ggatagaatg 900 gtaaaatacg ggagaatccc ggcaaaaacg ggagggttga catgtatgaa gtgaacagga 960 agtgtttgtg gaaaaacagt tttgcgaggg aacgcaaaac atctttgcat tctctcacaa 1020 aacatatttg caagataatg caaaacatct ttgcgaggga acgcaaaaac ttttaaaaat 1080 atattttcct atcactyttt tttttttttc tcccacccat ttttttcttc caccatcacg 1140 tcccttccgg gtctccgta 1159 // ID BEL21-I_DR repbase; DNA; ZEB; 6071 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL21-I_DR; KW BEL21-LTR_DR; BEL21_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6071 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1389-1389 (2008). XX DR Genome; chr12; Positions 22025133 22019063. XX CC 'TTCTT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 65..6022 FT /product="BEL21-I_DR_1p" FT /translation="MQFEGVNADGARGARPKRHVYPPAYLEDYDFTPLRHK FT QVSQSLMPHSTEQLWRYGPADAAQISPALSDRASSPVSQYSLNVVDEWPPA FT AVNKEYSYDRQRMGAPLQSGVNIPTVPSAVSSPFHRPWEETEYQLSPQTCL FT SPSHRPAQTKHHLESLPMPSSHALNTSQSFQLHNTVAAPLSSDLNFHREGN FT NSVLTSKQAVEQRSYSAMPSHPVKEPQTEHVLVDFLGKMLNQLHLMRDEIQ FT TGMGAKRGVPESFKTPAHHDTSSAHYKIESVHRNSCQLAPPPTDSRAPTHE FT HNWHSERTFKPYNVADMPQPYHSDEPRRPKNQTSLFRASTLHPDERESTYR FT GPAPNIPDFVTGDPSEFTRLKIALENLLPPDTTELFRYQILLDHLRLDEAR FT LVADSYLNSPFPYSDTMAALTERFGQPYKLALRRIARVMDAPDIRRGDTAA FT FDKFALQIRSLVGMLETLGHDGQAELRCGSHVERLLCKLPPEMRSEFRRHM FT FRRPGAVYNLLDFSEWLQYEAWCQSSESYVVERRRRVEQRTEKGRVINPPG FT RSATILHGAEEGPPKLPNVPSIVPPASSGILQTQIKAFCPYCDKTDHYLSQ FT CPTFKSFNKQQIAEWIQTNHRCWRCGRTHQAAKCTLKKPCSICKGRHLQIL FT HEVNFKPATEGSCLVNSATETLYLDKPTGCRKVLLKVVRVLLRHKDQTLDT FT YAVLDDGSERTILLPPAATKLGIHGPVESLALRTIRQDVQAVTGAAVSFQI FT SPVTQPQKVYRISAAFTAERLGLADHSYPLSILEKYQHLKNLPLQPFEGAH FT PLLLIGADNTHLITPISPVRLGPSGGPAAIQTRLGWTLQGPIRHLKDQLSS FT QQCLFLSLTPAERQVRKDVERLWQIDVMPYRCEKQVTRSKEDREAIHLLET FT KTTRVEVNGILRYATPLLRRKDFPLFCAPQDAVMPNLRGIERRLTRSPERA FT EVYKSEIKKLELAGAAIKLQNEEKESTREKWYIPHHMITHNEKNRLVFNCS FT FEYEGLNLNDSLLPGPVLSPSLLGVLLRFREHCVAISGDIRGMFHQVLLLP FT EDRPLLRFLWRDLRREEPPDTYEWQVLPFGTTCSPCCATFALQQHVALHST FT PDEDVRFSVNKCFYVDNCLQSLPSAQEARQLVDKLRALLSSGGFDIRQWAS FT NKGEVISHLPPEARSTKLELWLSQDKAEPRESTLGLSWHCKSDNLGFKHRP FT ITYNALTMRNIYRVLASQYDPLGVILPYTTRAKVIVQHLWVKHRDWDDPQL FT PGELQQEWMKWEEELKYLPEVTLPRCYYPYYMDHSDVRREIHIFSDASEKA FT YGAVAYLLTEDHQGELHLAFLLARSRVAPRKQQSIPRLELCAALIGAQLAK FT LLVNELTVKLDKFTYWTDSTIVLHWLCSESCRYKVFVGTRVAEIQELTDCA FT EWRYVNSEENPADDLTRGKNLQDLIGPNRWHNGPKFLMQPQEKWPSHPNSQ FT FSDAVDETELKRGTFCGITITSPTTLDGKQCRSWNELVEAIAQELHGAAAL FT DNSLTADAYKDAETVALRGIQMDCFPEDFQLLKANKPVRSNSRLLCLSPEF FT DSETQLIRVGGRLRRVEVLDPATIHPIVLDPHHPYTQLLIKDYDARLLHPG FT PERVFAELRRRVWILRGREAVKKYQRTCLDCCKWRSKPATQQMVDLPPPRL FT RLFKPAFYSAGMDCFGPLMVTLGRRTEKRWGLLFKCLTTRAVHIEVLTSMT FT SDAFLMALRRFIGRRGQPAELYSDQGTNFRGGETELRETFSTLGSDLQQLL FT AKQRINFHFNPPASPHFGGVWEREIRSIKMALRTALGSETVSEEVLQTVLI FT EVEAILNSKPLGYVSADLADLDPVTPNCLLMGRPDGSLPQVVYPESDLLTK FT RRWRHSQILADRFWTAFIKHYLPAMQTRGKWQRSTPDIEPGTVAMLVDPQL FT PRALWQIGKVTRVFPGADGHVRTAEIQIKNRTYTRPISRLIVLPEIPEEED FT HMGSKETQQPI" XX SQ Sequence 6071 BP; 1672 A; 1487 C; 1428 G; 1484 T; 0 other; aattggtcct tcgagccgga gctcagccat tgtctcctgg ataaagagga tcaacggtgg 60 caatatgcag tttgaagggg tgaatgcaga tggcgccaga ggagccagac ctaagcgtca 120 tgtttatcca cctgcctatt tagaggacta tgacttcaca cccttacgtc ataaacaggt 180 cagtcagtca ttaatgcctc attctacaga acaattatgg agatacggcc ctgcagatgc 240 tgcacaaatt tcacctgccc tgtctgaccg tgccagcagt cctgtgagtc agtattcatt 300 aaatgtggtg gatgagtggc cacccgctgc tgttaataaa gagtattcct atgacagaca 360 acggatggga gcacctttac agtctggagt taatataccc actgttccgt ctgctgtatc 420 ttcaccattt cacaggccct gggaagaaac agagtatcag ttatcaccac aaacttgctt 480 aagcccctca cataggcctg ctcagactaa gcaccacttg gagtctcttc caatgccatc 540 ttcccatgca ctaaatacca gtcagtcatt tcagttgcac aatactgtcg ctgctccatt 600 aagttcagat ctgaattttc atagagaagg taataactca gtacttacct caaagcaggc 660 tgtggagcaa agaagttatt ctgctatgcc tagccaccca gtgaaagagc cacagactga 720 acatgtactc gtagattttc tgggtaagat gcttaaccag cttcacttga tgagggacga 780 aatacagacc ggaatgggag ctaaacgagg cgtcccagag tcatttaaga ctcctgctca 840 ccatgatact tcttctgctc actataaaat tgaatctgtc catagaaatt catgccaatt 900 ggctcctccc cccacggatt ccagagcccc cactcatgag cacaactggc attctgagcg 960 aacattcaaa ccttataacg ttgctgatat gcctcaacct taccacagtg atgagccaag 1020 gagacctaaa aatcagacat ctttgttcag agcatccact ctgcatcctg atgagaggga 1080 aagcacttat cgtggtcccg ccccaaacat tccagacttc gttacagggg atcctagtga 1140 gttcaccaga ctaaagattg ctcttgagaa cctgttaccg cctgatacta ctgaactgtt 1200 ccgttatcag atattgctgg atcatctaag attagatgaa gcccgcctgg tagctgactc 1260 ctaccttaac tcaccgttcc cttatagtga cacaatggca gcattgactg agaggtttgg 1320 gcaaccttac aagttagccc tacgtcggat agctagagtg atggatgcgc ctgacattcg 1380 acgaggcgat accgcagcct ttgataagtt tgccttacaa attagatccc ttgttggtat 1440 gttggagact ttaggccatg atggtcaggc tgaattgaga tgtgggtcac atgtggaaag 1500 actgttatgc aaactacctc ctgaaatgag atctgagttc aggagacaca tgtttcgtcg 1560 acctggtgca gtgtataacc tccttgattt ctctgaatgg ctgcaatatg aggcctggtg 1620 tcagagcagt gagagttatg ttgtagaacg caggcgcagg gtggagcaga ggactgagaa 1680 aggaagagta ataaaccctc ctgggcggtc agctacaata ctccatggag ccgaggaggg 1740 ccctcctaaa ctacctaatg tacccagcat agtcccccca gccagctcag gtatcctcca 1800 aacacaaatc aaagctttct gcccatactg tgacaaaaca gaccattacc tcagtcaatg 1860 ccctacattc aagtccttta acaaacagca gatcgcagaa tggattcaaa caaatcaccg 1920 ctgctggcgc tgtgggagga cacaccaggc agcaaagtgc accctaaaaa agccatgcag 1980 tatttgtaaa ggtaggcacc tacagatcct tcatgaagtt aacttcaaac cagcgactga 2040 aggatcgtgt ttggttaatt cagcaacaga gactctgtat ctggataaac ctacaggatg 2100 taggaaggtt ctccttaagg tagtcagagt tttgcttcgc cacaaagacc aaactctaga 2160 tacctacgct gtccttgatg acggatcgga gaggacaatt ctcctgcccc ctgctgccac 2220 aaaacttggt atccatggtc cggtggaaag cttagcccta cgcactattc gtcaggatgt 2280 gcaagctgtc actggtgctg cagtctcttt ccagatttct ccagttaccc agccacagaa 2340 agtatacaga atctcagctg cctttactgc tgaacgtcta ggactagcag atcattcgta 2400 ccctttgagc atcctggaga agtatcagca tctaaagaac cttcccctgc agccctttga 2460 gggagcacac ccattgctgt taattggagc agacaatact cacctcatca ctcccatatc 2520 acctgtccgc ctgggcccct ctggtggacc agcagcaata cagaccaggc tgggatggac 2580 gttacaggga cccataagac acttaaaaga ccagttgtca tcacaacagt gcctcttctt 2640 gtccctgacc cctgctgagc gacaggtgag gaaggatgtg gaaaggctct ggcagattga 2700 tgttatgcct tatcgttgtg agaagcaggt gacacgttca aaagaagatc gagaggctat 2760 tcatctcttg gagactaaga ctactcgggt tgaggttaat ggcattctcc gctatgcaac 2820 acctctattg cgcagaaaag atttccctct cttttgtgcc ccccaagatg ctgtgatgcc 2880 taatctcagg ggaatagagc ggcggcttac tcggtctcca gagagggcag aggtttataa 2940 gtcagagata aagaagttgg agctggctgg ggctgccatc aagctgcaga atgaggagaa 3000 ggaaagcact agagaaaaat ggtatattcc ccatcacatg atcactcaca atgagaaaaa 3060 tcgccttgta ttcaactgtt ccttcgaata tgagggactg aacctgaatg actccttgct 3120 tcctggtcca gtcttaagtc cctctctcct gggtgtcctc ttgcgattta gagaacattg 3180 tgtagccatc agcggggata tacgtggcat gtttcatcag gtcctgctac tgcctgagga 3240 caggccactc ctgcgcttcc tgtggcgtga cctgcgacgg gaggaaccgc cagacacata 3300 tgagtggcaa gttcttcctt tcgggacgac atgtagcccc tgctgtgcga ccttcgccct 3360 acagcaacat gtggcattgc acagtacccc tgatgaggat gtaagattct cagtaaacaa 3420 gtgtttctac gtggacaact gtctacaaag cctaccctct gcacaggaag cacgacagtt 3480 agtggacaag ttacgcgctc tcctatcctc cggtggtttc gacatccgac agtgggcaag 3540 taataaaggg gaagttatca gtcacctacc acctgaggcc aggtccacta agctggagtt 3600 atggctttct caagacaaag ctgaaccacg tgagtcaaca ttaggcctga gttggcactg 3660 taaatcagac aatctgggct ttaaacaccg acctatcaca tataatgccc tgacaatgcg 3720 taacatttac cgggtgctgg cgagtcagta cgaccccctt ggagttatcc ttccatatac 3780 tactagggct aaggtgatag tgcagcacct ctgggtcaag cacagggact gggatgatcc 3840 tcaacttcct ggtgaacttc aacaggaatg gatgaaatgg gaagaagaac taaaatatct 3900 acctgaggtg acactaccta gatgctatta cccctactac atggatcact cagatgtgcg 3960 acgagaaatt catatcttta gtgacgcttc tgagaaggca tatggagctg ttgcttacct 4020 gctcacagaa gatcatcagg gagaactcca tctagccttc ctgctagccc gatcacgagt 4080 agctccacga aagcagcagt ccattccgag gcttgagctt tgtgctgctt tgataggtgc 4140 acagttagca aagttgctcg taaatgagct cactgtgaag ttggacaaat ttacttactg 4200 gactgattcc actattgtcc tccactggct atgttcagaa tcctgtaggt acaaggtctt 4260 tgtgggtacc agggtggcgg agattcagga gttgacggac tgtgcagaat ggagatacgt 4320 caattcggaa gaaaaccctg ccgacgacct gactcgaggc aagaacctgc aagaccttat 4380 tggacccaac agatggcata atggtccaaa gtttttaatg cagcctcaag aaaagtggcc 4440 ttctcacccc aactcccaat tttcggatgc tgtggatgaa actgaactta agagaggaac 4500 attttgtgga atcactatta cttccccaac gactttagat ggaaagcaat gcagaagctg 4560 gaatgagctg gtggaggcaa ttgcacagga gttgcacggg gcggctgccc ttgataattc 4620 actcacagct gatgcttaca aagatgcaga gactgtggct ttgagaggga tacagatgga 4680 ctgttttcct gaggattttc agctgctaaa agctaataaa ccagttcgct ctaacagtcg 4740 gctattatgt ctttccccag agtttgactc agaaacccaa cttatacgag tgggaggacg 4800 tctgagacga gtagaagtcc ttgacccagc cactattcat cccatagtac ttgaccctca 4860 tcatccatac acccaactgc ttatcaaaga ctatgatgct cgactccttc acccgggacc 4920 agagcgagtg tttgctgaac tccgcagaag agtgtggatt ctgaggggta gggaggcagt 4980 caagaagtac caaaggacat gccttgattg ttgtaaatgg aggtcaaagc cagctacaca 5040 gcagatggtc gatctacccc cgccaagact cagattgttt aagccagcat tctattcagc 5100 aggcatggat tgttttggcc cattgatggt gacattaggt cgaaggacag aaaaaaggtg 5160 gggcctcctg ttcaagtgtc tcaccacccg tgccgtgcac atagaagtgc taacctctat 5220 gaccagtgat gccttcctaa tggcattgag aaggttcatt ggccgacgtg gccagcctgc 5280 agaactatac tctgaccaag gcacaaactt tcgaggtggc gaaacagaac tcagagaaac 5340 cttcagcacc ctaggttcag acctacagca gttgttagcc aaacagagga tcaatttcca 5400 cttcaatcct cctgcgtctc cccactttgg tggtgtatgg gagagggaga ttcgttcaat 5460 aaaaatggct ctccgcacag ctctgggctc agagactgtg tccgaagaag ttctccagac 5520 tgttttgatt gaagtagaag ccatcttgaa ttctaaacct ttaggttatg tttcagcaga 5580 tctggctgac ctggacccag tcactcccaa ttgcctgctg atggggcggc cagatggttc 5640 cttaccccaa gtagtatatc ctgagtctga tctccttact aaacggagat ggaggcattc 5700 ccagatattg gctgacagat tctggactgc tttcattaaa cattacttac cagctatgca 5760 gacaagagga aagtggcaaa gatcaacacc tgacattgag cctggcacag tagctatgct 5820 agttgacccc cagctgccgc gagccctatg gcaaattgga aaagtaacta gagtatttcc 5880 aggagcagat ggacatgtca gaacagcaga gatccagata aaaaacagga catatacacg 5940 gccaatatct cgtctgattg tacttccgga aatccctgaa gaggaggatc atatggggag 6000 taaagaaaca cagcaaccaa tttgaggcaa atttaacaga gcaaatttac aaataaattt 6060 gggggcggca a 6071 // ID Gypsy138-LTR_DR repbase; DNA; ZEB; 137 BP. XX AC chr1; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy138-I_DR; KW Gypsy138-LTR_DR; Gypsy138_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-137 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1538-1538 (2008). XX DR Genome; chr1; Positions 32319692 32319556. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 137 BP; 44 A; 33 C; 25 G; 35 T; 0 other; tgtaacggtt agtactcgat ttcctctgca ttgtgtgtag tagagatgag atttgacacc 60 caatcacgcc tcaaacagat cactttattc tggcaaacac aatgaaaaca cagtgatcag 120 tcccagacag tccaaca 137 // ID hAT-N71_DR repbase; DNA; ZEB; 3314 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N71_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3314 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1738-1738 (2008). XX DR [1] (Consensus) XX CC The element contain another insertion, HE1_DR1, which is masked CC out. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3314 BP; 1027 A; 494 C; 465 G; 963 T; 365 other; gagaagcccc tgtctgcagc ccgcagatct gggcggggat gaatgtgggg cggggctgca 60 cattcagcca cgcccacctg taatttcatt ggttagtgga aacgtaatag cgtgaatgcc 120 cactacgtgg tgtcagtaaa caggaaatgc atgtacaatt tcttaaatat tgtttttatt 180 gttaataaac acaaaaatgc acaaaagtgg acattcttgc tgagtgtttt atatgttaat 240 tatgttaaca gttatgtgaa tttcataatg gtatttttcc tgctaaaaaa tattacaaaa 300 gactcaaaag actcttttca caatggtgtc acttcagaaa cgcagtgatt ttagttctga 360 tattttaatg gtttattatt ttgcttcaaa atgcaaatgc atattgcata tttacaacaa 420 ataaagatgg agtctgaaaa aataaataaa taaataaaaa tgcagtgttt ttagttcagt 480 ttaatttttt ccctttgaat gggagagttc tctggaaaat ttacaattga aaatattaaa 540 attatacttt gatttaaggc tattaaaaat attgggcatg tgtccaattg tttctctttc 600 aatactgcac tgctttgagt tttattaaga gcagggataa gctattattt attaagggaa 660 gtatatgttt gtaacctttt tacttattaa atcagattat ttcatcatat aatggattta 720 tttttacttt gatgctgcta tgaatataca tgtatatgat ctgttatatg taaatgttcc 780 agcatgagta aaatttttta ttaatttttt tcatattaca tggtgtgtca tctatgaatg 840 caatcaattt cattatttat tatttactaa tccattaaag ttttgttcag ttwctccatt 900 tgaatgaaga cagtgctctg ctgccaccta ctgggtggtt ttaatatctg ttctacaatt 960 taattaataa gttacaaata atttcataat ctaatattac tgaattaaaa tgcagtcttc 1020 ttggttttat ttttgtatta tttgtaaata cagggtaaat actttaatac cactttctaa 1080 caacccttct gtgtccattt ctgcactgta gatctttaaa aatatttttt ataatgattt 1140 tgtttggtta ttaagagtca catacaacat tgttattcta tttcatttaa ttagaatatt 1200 aaatacactt aatattatat taatagtgtg gcacaaacgg ccagttttat ttattactta 1260 tctatcagga cacaaattaa tgaaaataaa tttgacatta ataagtgcag cactcgtgta 1320 aacttctgat tttttttatt tgacaaatcc tgataaataa ataccaactt catcttcagt 1380 aaaaacacca acacaaagca atacagtcag ttgaaagtcc ctctttctcg gttcaagtaa 1440 aaatctggca gatctgtgtt ttttagatgt cagatgtctc acctacagaa catctgtgta 1500 tgcacatgca cacatgtaca aaaggaagac tgacaattta tagaaaataa taataataat 1560 ttaagataca gcagtttgtg attattatta cctttaataa acataaaaac aattatatat 1620 tttttacagg gaaaaatctg ctgagcaata tattgggaag tcatttgttg aacagaatgc 1680 ttgcaggacc ccactctcgc catacccaac agacacagca ttttgtcaaa gtatatggca 1740 ctctgattga tgacacctaa aaaaaatcaa gtacatattt aagcaagaca caatcataca 1800 aagttttaaa actaacggag tgcagtaaaa agtgcttggc atattctgta ctctttatca 1860 tgtgtacaca cacacacaca cacacacaca cacacacagc ctgcttcttg aattttggat 1920 tactgacaat ggctcxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 1980 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2040 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2100 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2160 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2220 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 2280 xxxxxxxxxx xxxxxxxxxt gaatgaatga atgaatgaca atggctcaga aggtcctgga 2340 tgtcagtgag agttgcagaa atgtcccatc accatcctgt gtgacaactc tcagtaggat 2400 acactgcagg atctcaggcg gcatctgtac aaaataacca acaatcatca acatgtcgct 2460 taacatacag tatattaaag ttaaactcaa agaattaaat atgtttatac taggcatgcc 2520 ccctcctcac attccccttt gcataagttt catgcatggg ttataaagta aacaaaaaaa 2580 aaaaaagaga acatgaagtg taaataaata tattaaaaat tattaaatgc acaaatctat 2640 taaaacaaaa agagagaaag aactgaggta aaataaaata aaacagagaa tatgctccac 2700 aggaatcata gtgatgacta tatatatcaa tgccaatgat ctgcaaacag aatgtagtat 2760 agatcagtaa gttaagaatc aagttttata ttgtacatat ttagcattaa cattaccaga 2820 gtatataaat tagagaacat gccacaaaga aaaatatatt ttgttatctt tgatattaat 2880 aaacttcatg tgatggactg ttttgctcaa aactctttaa agtttacatt attcttctaa 2940 gtgaataaaa ccaccaactg tacgatcaac ctggaaaacg ccaaccctaa cacagaaaac 3000 atttttttat aggctttaaa acttcatatc tccttaaagg cctttaatta aataaatgac 3060 acataatacg tccgtgaggt gtactgcact ataatttgac catttacatg acagaagatg 3120 tgtacttacc atgcagagat gaaaaccgga atcgaaagca tacattacat caacgtctgt 3180 ttccgtgaac caatgaaaga cttgtaggcg gagtgacatg ttcagtcccg cccatttatg 3240 ggcggggctg cacattcagc tccgccccat attcagcccc gccccgatct gcgggctgca 3300 gacgggtgta cttg 3314 // ID Gypsy144-I_DR repbase; DNA; ZEB; 6642 BP. XX AC chr6; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy144-I_DR; KW Gypsy144-LTR_DR; Gypsy144_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6642 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1549-1549 (2008). XX DR Genome; chr6; Positions 43207368 43200727. XX CC Positions [3121-3624] - Reverse transcriptase CC Positions [5015-5488] - Integrase core CC 'ATTT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1097..2020 FT /product="Gypsy144-I_DR_2p" FT /translation="MTLLQNQGERPSSYLHRLQVMLNATVRRGGISEAERD FT RSLLKQFIRGCWDNQLLVNLHLEQRKTNPPSFAELVVAIRAEENKQASKDE FT RMRSHLGTNKSSPSFFKSKAIVHQLSAHATEVGGGAHCKVETELNTQPALL FT KTLSSKKGQAKCQDTLEFSSLKAELNELRAQVQAMESAVPKRTSHIDSSAK FT ELAELRKQVADLKGQMLNMEAQSSQAQSLQSSRKRFETTQQGVSRFTQPQT FT NRTSLNRPRAGYCFRCGEEGHVVINCENDPNPSQVDRKRRLLREKQSQWDL FT QNQVKSQQLNFRQSLE" FT CDS 1963..4602 FT /product="Gypsy144-I_DR_1p" FT /translation="MGFAEPSEVTAVKLQAISGVGHAEIGTEFKSPKQCDR FT SKNVTSGTLPRGLIGTKCTAQLYIQGIQVNCLLDTGSQVTTIPSSFHKTYL FT AHLPVKPLNALLQVEGANGGSVPYLGYVDLTLIFPEEFLGVETEINTLALV FT IPDLVGVRQILIGTNSLDALYDVHVLERNVKPKPAFNGYRAVLNILNSRHK FT QRSAETLGSVTLEVDQAETVPAGGTTVVNGLVRMNSSFSENLAVIGPSQVS FT NLPGGLVIASGLHHLPSKRSFCVPVLVGNETQTDISISPKTVLAEMYSVQQ FT VMEIPQKTIPVDESNKKQIDFDFSDSPLPTEWKKRISDLLNAMPEVFALHD FT MDYGHTDKVKHRIRLSDETPFNHRPRPIHPHDISAVRKHLQDLLDAGIIRE FT SESPFASPIVVVRKKDNSVRLCIDFRKLNTQTIKDAYALPNLEEAFSVLSG FT SKWFTVLDLKSGYYQIEMDERDKEKTAFVCPLGFWEFNRMPQGITNAPSTF FT QRLMERCMGSLNRSEVLVFIDDLIIFSDSLEEHERRLMQVLNRLKEYGLKL FT SPDKCKFFQTSVRYLGHVVSQSGVSTDPSKIDAVKTWPKPQNLRELKSFLG FT FAGYYRRFVQDFSKIVRPLNDLTAGYPPVRRKAVCKQANRVYLNPKEPIGE FT RWTSECQNAFETIIERLTTAPVLGYANPKLPYLLHTDASTIGLGAALYQEQ FT EGHLRVIAYASRGLTRSEAKYPAHKLEFLALKWAVTTKFSDYLYGSEFTVV FT TDRNPLTYVLSSAKLDATSYRWLSSLSTFNFKIQYRAGAQNQDADGLSRKP FT NVQLVDDLETQKEQERIKQFALNHLSDARHVQQVMLPEVIKAICERHQAYQ FT PSSVALVESLGVSTEVRTWVACNFCIV" FT CDS 4616..6643 FT /product="Gypsy144-I_DR_3p" FT /translation="MRKQREDPEIREIIECLESDNGTPSLKGRPSDMVLWL FT KEWKKFEMRNGLLYRNRMDHGKVQRQLALPSNLREMVLTCLHDDMGHLGVE FT RTLNLLRSRFYWPRMANSVETKIRTCERCVRRKSPVQKAAPLVNIRTTRPL FT ELVCMDFLSVEPDNSNTKDILVITDHFTKYAIAIPTRNQKAQTVAKCLWEH FT FLVHYGFPEKLHSDQGPDFESRIIQELCKIAGIRKVRTTPYHPRGNPVERF FT NRTLLQMLGTLENKQKAHWREYVKPLVHAYNCTKNDVTGYSPYELMFGRQP FT RLPLDIAFGILNDESYQSHSKYVTSLKQKLEESYKLALENSNRVAERNKKR FT FDKNVIASNLEVGDRVLVRNVRLRGKQKLADKWESNVYIVIKKAQDVPVYT FT VCPEGKNSPVRTLHRDLLLPCGFLPAEVDASNSPKVHKKCRTRSQSATRHV FT DETENLDDEESSENESSYHFMPRESLEVVTRYILPESSFLQSENPPVAEEI FT EVLDTSQTNVDYLKEKKLEIIDENEQMCKVNENLPVQNEARFDCTNDGPGE FT HQVADTVVVSEEDELLNSVEELRDVQDEENEARTELILENEEKPQDEGLRR FT SSRQIRPPKMFHYPQIGNPLISVVQSLLQGLSNAFAQSEEQSEVFCDECVL FT VPGDPMSVLTTQPNACPRTCIRSKGEG" XX SQ Sequence 6642 BP; 2083 A; 1331 C; 1497 G; 1731 T; 0 other; aaaacttggc gagccagcca ggagaattta gagtgtcata caacaatcgt gacttctctg 60 tgtgatagaa cagcgaatag ttctactgac aataaccatg gatattgtga aacatgagaa 120 tgtaagcgtt tctgatgccg ttttagtgag cggtctaacg ttatccgaat ccgataaaga 180 tcttgaagca tggttgacac gttatggtag catcaaacga accctgctta ttgataaccc 240 agcctctgag tttcataacc aggccatcat agagtttgac aatagctcag ccatggaaat 300 attgcatcct ttgttaccat tacgaattgt gagtacatct aatgtagaca caatttttat 360 caagcgcgct ctgagcagtg tttattcaca gtcgactggt agcgatatca ctagagagta 420 tttggaggaa ctacaagaag tcgctagact tagtgggaaa ccctttcaag aagttattca 480 ggaggagcta aagaaactta atgctgctag ctcccttgct gaattctctt ccacatcaga 540 cattgaaaat gattgttcaa atttgattaa tcctcaggat cgtcacccaa aatctcaatc 600 acaaaatgta ggcaaagaga ccactgtacc cccgttgtca gataaggttt ctttgtctag 660 tgaaccacaa caacctaaaa atccaaattt gacaatgaca gatgaggcaa atgcagcacc 720 cacttatact gctttgccca tagacgtaat aaacccccct aacatacagc gagtagtggt 780 agaacatctt gtaagaacgt ctgatcctgt atctactcag cagataccag tgcgcttgag 840 gtcattttct gggaaaactc ctagaccgcc taatgaacct gattttgata cttggagggc 900 aagtgttgga ttctagcttg atgacccatc aatttctgat ttgtcacgca ccagaaatat 960 tttggatagt cttttgccac ctgcctcaga tgttgtgaag catgttaatc cccaagcact 1020 tccaacagtt tatttagatt tactggagtc agtttatggg tcagttgaag atggcgatga 1080 attgctcgca aaattcatga ctctcttaca gaaccaagga gagcgaccat caagttatct 1140 gcataggttg caagtgatgc tgaatgcaac ggtgcgaaga ggaggcattt cagaagctga 1200 acgtgaccgc agcttgttaa agcagttcat tagaggatgt tgggacaacc agttgctggt 1260 taatttgcat cttgagcaaa gaaagaccaa ccctccttca tttgctgaac tagtggttgc 1320 tattagagct gaggagaata aacaagcatc caaagatgag cgcatgcgaa gccatttagg 1380 aacaaacaag tccagtccaa gcttctttaa gtcaaaagct attgtacatc aactgtctgc 1440 tcatgctact gaagtaggag gaggtgcaca ttgtaaagtg gaaacagagt taaacacaca 1500 acctgctctg ctgaagactt tatcatccaa aaaaggacag gctaaatgtc aggacacttt 1560 agaattctca tcactgaaag ctgagttaaa tgagttgaga gctcaggtcc aggctatgga 1620 gtcagccgta cctaaaagaa cctcccacat tgattcaagt gctaaagaac tcgctgaatt 1680 aagaaaacaa gtagcagact taaagggtca aatgttaaat atggaagcac aaagcagtca 1740 agcacaatca ctacaaagtt ctaggaaaag gtttgagaca actcaacagg gtgtatcgag 1800 gtttacacag ccccaaacga acagaaccag tctaaatcgt cctcgtgctg gatactgttt 1860 tcgctgtggt gaagaaggcc atgttgtaat aaattgtgag aatgacccaa atccatctca 1920 agtagataga aagcgacgac tactgagaga gaaacaatct caatgggatt tgcagaacca 1980 agtgaagtca cagcagttaa acttcaggca atctctggag tagggcatgc agagattggg 2040 actgagttta aaagccctaa acagtgtgat cgctctaaga acgtaacatc aggaacattg 2100 ccaagaggat tgattgggac aaagtgcact gcccagttgt acattcaggg catacaggtg 2160 aattgcctac ttgatacggg atcgcaagtg accacgattc cgtcatcatt ccacaagacg 2220 tacctagcac acttacctgt gaagccattg aatgctctgt tacaagtgga aggagccaat 2280 ggaggatctg ttccctatct ggggtatgtg gacctaactt taatattccc agaagaattc 2340 ttaggtgtgg aaacagaaat caacactttg gctttggtaa ttcctgattt ggttggagtt 2400 cgacaaatat taataggcac aaactctctt gatgccttat acgatgttca tgttctagaa 2460 agaaatgtca agcccaagcc agctttcaat gggtatcgtg ctgtgcttaa cattctaaat 2520 tcaaggcata agcagaggtc tgcagaaact ctgggaagtg tgactctaga ggttgaccaa 2580 gctgaaacgg tgcctgctgg tggaacaaca gtagttaatg gactagttcg gatgaacagt 2640 tctttctctg aaaacctagc agtcattgga ccatctcaag tgtcaaactt accaggtgga 2700 cttgttatag ccagtggatt gcaccatctg cccagcaaac gttcattttg tgttcctgtg 2760 ctagtaggga atgaaactca aactgacatt tcaatttcac ctaagactgt gctagctgaa 2820 atgtattctg tacaacaagt gatggagatt cctcagaaga ccattcctgt tgatgaatca 2880 aataaaaaac agattgattt tgattttagt gactcaccct tgccaacaga atggaaaaaa 2940 aggatatctg atttgctaaa tgccatgcca gaagtatttg ccctgcatga tatggactat 3000 ggccatacag acaaggtgaa acatcgtatc agactcagtg atgaaacacc tttcaatcat 3060 agacctagac caatacatcc ccatgatatc agcgcagtta gaaagcactt gcaagacttg 3120 ttggatgccg gtataataag agagtctgaa tctccattcg cttctccaat agttgttgtg 3180 agaaaaaagg ataactctgt tcggctttgt atcgatttca ggaagttgaa cacacagacc 3240 attaaggacg cctatgctct tccaaatctt gaggaagcat tttctgtgct ttcagggtcc 3300 aaatggttca ctgttctcga cctgaaatca ggatactacc agatagaaat ggatgagcga 3360 gataaggaaa agaccgcctt tgtgtgtccc ctggggttct gggaatttaa caggatgccc 3420 caggggataa caaatgcgcc tagtactttt cagcgcctaa tggagcggtg tatgggaagt 3480 ttaaaccgga gtgaagttct agtcttcatt gatgacttaa tcatattctc agattctctc 3540 gaggagcatg aacgtcgtct aatgcaagtt ctcaatcggt taaaagagta tggtttaaaa 3600 ctttcccctg acaagtgcaa gttctttcaa acttcagtgc gatatcttgg tcatgttgtg 3660 tcccaaagtg gtgtgagtac tgatccaagt aaaattgacg ctgtaaaaac ctggccaaaa 3720 ccacaaaatt tacgagagtt gaaatccttt ttaggatttg ccggttatta ccgaagattt 3780 gtacaggact tctcgaaaat tgttagacca ctgaatgatc tcacggctgg atatcctcca 3840 gtgaggagaa aagctgtgtg taaacaggcc aatcgagtct acctcaaccc aaaagagcct 3900 ataggggaaa gatggaccag tgaatgtcaa aatgcatttg agacgatcat tgaaagattg 3960 acaactgcgc ccgttttagg ctatgctaat cctaagttac cttacctgtt gcacactgat 4020 gctagtacca tcgggctcgg tgctgcattg tatcaggagc aggaggggca cctaagggtc 4080 atcgcatacg ccagtcgtgg tttgactcgc agtgaagcca agtatccagc gcacaaactg 4140 gagtttctag ctttgaagtg ggcagtcacc actaagttta gtgactacct atatggaagc 4200 gaattcaccg tagtaacaga cagaaacccc ttaacttatg tcctaagttc agcgaagctt 4260 gacgctacta gttacaggtg gttgtctagt ctgtctactt tcaacttcaa aatccagtat 4320 cgggcaggtg cccaaaatca ggacgctgac ggattgtctc gaaagccaaa tgttcagctt 4380 gttgacgatt tagagactca gaaagaacag gaacgcataa agcagtttgc attaaaccat 4440 cttagtgacg ccagacatgt acagcaggtt atgcttccag aggtgatcaa agccatttgc 4500 gaaagacatc aagcttacca gccatcttct gtagccttgg ttgaatcttt aggtgtgtct 4560 accgaagtca gaacatgggt tgcctgcaat ttctgcattg tctgagaatg acttgatgcg 4620 caaacaaaga gaagaccctg agataagaga gattattgag tgtcttgagt ctgacaatgg 4680 aacacccagt ctgaaaggaa gaccatcaga tatggtattg tggctcaaag aatggaaaaa 4740 gtttgaaatg agaaatgggt tgctatatag aaaccgtatg gatcatggga aagtgcagag 4800 acagttggca ttgcctagca atctcagaga aatggttctt acctgtttgc atgatgatat 4860 gggccactta ggagttgaga ggactctcaa cctcctacgc tcacgatttt attggccacg 4920 gatggccaac tctgtggaga caaaaattag gacatgtgaa cgctgtgtca gaagaaaatc 4980 tccagttcag aaagcagctc cattggtaaa cattcgaacc accagacctt tagagttggt 5040 ctgtatggat tttctatctg tagaacctga caatagcaat acaaaggaca ttttagtaat 5100 aacagaccac tttacgaaat atgccatcgc catacccact cgaaatcaga aggcccagac 5160 agttgctaag tgcctttggg aacacttttt agtgcattat ggtttccctg aaaagctaca 5220 tagcgaccag ggaccagatt ttgagtcgcg tatcatacaa gaattgtgca agattgctgg 5280 cattcgaaaa gtaagaacaa cgccatatca cccgagggga aacccagtcg agcgttttaa 5340 tagaactcta ctgcagatgc tgggtaccct agagaacaaa caaaaagctc actggaggga 5400 gtacgtgaaa ccattagtgc acgcatataa ctgcaccaaa aatgatgtta caggttactc 5460 accttatgag ctgatgtttg gaaggcagcc tcgtttgcct ctagacattg cttttgggat 5520 tttgaatgat gagtcatatc aatctcattc caaatatgta acctctttaa agcaaaaact 5580 tgaagaaagt tacaaattgg ctttggaaaa cagcaataga gtagcagaac gaaataaaaa 5640 gagatttgac aaaaatgtga ttgcctccaa cttggaagta ggtgatagag tcttagtcag 5700 aaatgtccgg ttaagaggta aacagaaact ggcagacaaa tgggaatcca atgtttacat 5760 agtgataaag aaagcacaag atgtccctgt gtatactgtc tgtcctgaag ggaagaatag 5820 tcctgttcgt actcttcacc gtgatttgct cttaccttgc ggattcttac ctgctgaagt 5880 agatgcgtct aactcaccta aagtgcataa aaaatgtcga acgaggtcac agtctgccac 5940 aaggcatgtt gatgagacag aaaatctaga cgatgaggaa agttctgaga atgagtcttc 6000 atatcacttc atgcccagag aatcattgga agttgttaca cgttacatcc tacctgaaag 6060 ctcttttctt cagtctgaaa accctccagt tgcagaggag attgaagtat tggataccag 6120 ccagacaaac gttgattacc tgaaagaaaa aaagcttgag attattgatg aaaatgagca 6180 gatgtgtaaa gtaaatgaga acttacctgt gcagaatgaa gcgagatttg attgtacaaa 6240 tgatggacct ggagaacatc aagtagctga cactgtagta gtgtctgaag aggatgaatt 6300 gttgaattct gttgaagaac tgagagatgt tcaggatgag gaaaatgagg ccagaactga 6360 actgatcctt gagaatgagg agaaaccaca ggatgaggga cttagacgct cctcaagaca 6420 aatcagaccc ccaaaaatgt ttcactatcc tcaaataggt aatcccctaa tttcagtagt 6480 tcagtcctta ttgcaaggat taagtaatgc ttttgctcag tccgaagaac agtcagaagt 6540 attctgtgat gagtgtgtct tggtaccagg tgatcccatg tcagtgctga caacacaacc 6600 aaatgcatgc ccgaggacgt gcataagatc aaagggggag gg 6642 // ID SAT-1_DR repbase; DNA; ZEB; 828 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Satellite-like DNA - consensus. XX KW Satellite; Simple Repeat; SAT-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-828 RA Jurka J.; RT "Satellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2023-2023 (2010). XX DR [1] (Consensus) XX SQ Sequence 828 BP; 126 A; 351 C; 207 G; 144 T; 0 other; gccctccacc taggggcctc tgaccccagc aagctaccca ggggtccacc tctctccctg 60 gacttgcagt ataggcccca gggtgccccc tggccctcca cctaggggcc tctgacccca 120 gcaagctacc caggggtcca cctctctccc tggacttgca gtataggccc cagggtgccc 180 cctggccctc cacctagggg cctctgaccc cagcaagcta cccaggggtc cacctctctc 240 cctggacttg cagtataggc cccagggtgc cccctggccc tccacctagg ggcctctgac 300 cccagcaagc tacccagggg tccacctctc tccctggact tgcagtatag gccccagggt 360 gccccctggc cctccaccta ggggcctctg accccagcaa gctacccagg ggtccacctc 420 tctccctgga cttgcagtat aggccccagg gtgccccctg gccctccacc taggggcctc 480 tgaccccagc aagctaccca ggggtccacc tctctccctg gacttgcagt ataggcccca 540 gggtgccccc tggccctcca cctaggggcc tctgacccca gcaagctacc caggggtcca 600 cctctctccc tggacttgca gtataggccc cagggtgccc cctggccctc cacctagggg 660 cctctgaccc cagcaagcta cccaggggtc cacctctctc cctggacttg cagtataggc 720 cccagggtgc cccctggccc tccacctagg ggcctctgac cccagcaagc tacccagggg 780 tccacctctc tccctggact tgcagtatag gccccagggt gccccctg 828 // ID TDR22 repbase; DNA; ZEB; 1255 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish putative non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR22. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1255 RA Jurka J. and Drazkiewicz A.; RT "TDR22: a putative non-autonomous DNA transposon from Danio RT rerio."; RL Repbase Reports 2(3), 15-15 (2002). XX DR [1] (Consensus) XX CC 2bp-target site duplication. CC Several hundred copies in the genome. XX SQ Sequence 1255 BP; 382 A; 252 C; 272 G; 338 T; 11 other; atggcccgtt tccactgagt ggtacggtac ggttcggtat gcttttatgg ccgtttccac 60 tgtcaaaagg taccaaaaag cgaaccgtac cgtaccactt tttgggtacc ctttscaaag 120 ggtacctagc acaacaaaag ggtaccaaaa ggcggagcta gacgcgcagc tgaacgctat 180 tggtttacag agaaacgtca ctagcgcata cacaagcagg agaatgaaaa caaaggaacc 240 gccattttta aatacacagc cgagacatta caccgtaata atatatacat ataataacga 300 gccatggtcg acctgagctc aaacaaacct tgtcgtcgtc ttgatgaaca gccacaaagc 360 caagaagaac aaaatctgcc gtgtcctgtt tttgttttac gaggccgrtt ctaaaagcgc 420 gagcggtttc actttctcca gagagctcgc gatcgctcng ncgcgcgtct atatttgaaa 480 taacaaactt cttgagctga tgataataac gtgcgcgtga ttattgaagt gcttctgaca 540 tccgatcctt tcagaaacgg acaaacgcga gagtgaagcg cgaaaaaaaa nacaaaggag 600 aagcyggaaa naannaaaaa caaaggagca aatgattctt tcagcaacct aaaacatgaa 660 caaactgcca tgtttaacta ttatcatcrc cttttggact attatgaact cgggaatgac 720 ggaattactt tctaacagag gttacatgtg ctggtgaaga ttaaagatac agatgagagg 780 tttgcactga ctgtgggcta tgttgcgtgt tgtttttgaa cccaaataag gactaaatgt 840 atgctgtgtg tagttttttt ctgtaattgg taacatatcg gagactgtaa gggtctgtat 900 gtgttcatat atgttgcatt tatttatttt atataattgc agacgttaca gtaggctatt 960 tcgcactgtc attgatctgc agttataatc aaatcatgtt catagaaagg ttagtaatga 1020 acatttatac acaagtattt atgtgtataa agcatctgtt ttgtgagaag tgcttctcat 1080 atgatatgtg aacgacccgt acagctttac tttcctcgag cgagaatgac gtcgactgaa 1140 actttctgtc gtacaccact ggtacccttt tggcagtgga aacgcaagcc tgataaaggt 1200 gacccgtacc gaccngtacc gtaccgtact gtaccactca gtggaaacgg gccat 1255 // ID DNA-TA-7_DR repbase; DNA; ZEB; 993 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD TA; KW DNA-TA-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-993 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(3), 688-688 (2009). XX DR [1] (Consensus) XX SQ Sequence 993 BP; 355 A; 147 C; 176 G; 314 T; 1 other; ggggagcgcg gggcacaaag taacgcgggg ttaaatgtaa cacagagttt taaggtattt 60 gctcagggtt aaccatggca tgcttccgag gttttcacca cagtgtcagc agacgtcttc 120 ctgccaatta ttgaaaaagt ttggcgaaat ttggatgaga aacacaggag aaaccatttt 180 tgcagcataa aagtattttt tattatagtc aatatttttt attttaaaaa aatctgtgaa 240 agtatgaacg taaaatctta tattgttgtg atcaccgtct tctaggctaa aagatgaaac 300 cattttgaaa gatgtaaaaa acacacagtg actgctaagc cagttttgag gaaaacatga 360 cacagcgggg ttagttgtaa cagggtgtta caattaagcc acacactgtt ggacaccaat 420 taaactaaca aaataatttt tgaattttga gtgtaatgtt gagaactttt ttaaataaaa 480 tgtttttaaa aaaatttttt tttaatagat aaaaaaatat tttattgcta ataatgcaaa 540 taaatgcatt gtataataat ggataataat ccaaataaat tcaaatgtgt ttatccaata 600 aataaataaa taaaaataaa taaataaaca tactgtgcta tgtagaataa aaaaaataaa 660 atgagcyaag tactgaggaa atatagctac tatttaaagt aaactgaatt taaattaatt 720 gtgtgaaatc tgccttttta agcatgtgtt acaactaacc ctctgtctgt tacaacttgc 780 cccgcaggtg gggtaaatag taacattttt actcctggca cttttggcaa tactgcacag 840 aaaccatagg tccggcgatc ataatttcag tgctcatttg taggagaggc ttgtgtgttg 900 tattggtaaa aaacaaatgg tttgtctaac cccattactt tgtttattat ttgaccaaaa 960 ccaaaaagtg ttactttgtg ccccgctctc ccc 993 // ID Gypsy54-LTR_DR repbase; DNA; ZEB; 320 BP. XX AC chr17; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy54-I_DR; KW Gypsy54-LTR_DR; Gypsy54_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-320 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1424-1424 (2008). XX DR Genome; chr17; Positions 10821358 10821039. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 320 BP; 102 A; 66 C; 55 G; 97 T; 0 other; tgttgcacaa attttcacta acggttactg tattgaaata gtgatttgga tgagagatga 60 atagaggcag ttttatttaa ttttcatgta tgttttattt acatttataa ccactataca 120 tcacaatgta taagtttaga caaaatgtaa tgtaccagca gttctgagaa gcctctctgc 180 tgacactgat atcaccagag ctgcaaacca actaaatctt tggtgctgtg tttgtcccag 240 gacaataaaa gaagtttaag tgaccacacc tacctgactc gcagcattct tcagcatccc 300 agcgccaaca cacagtcaca 320 // ID CR1-1_DR repbase; DNA; ZEB; 4985 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 19-JUL-2005 (Rel. 10.08, Last updated, Version 2) XX DE CR1-1_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW AP endonuclease; CR1 clad; CR1-1_DR; CR1DR1; ORF1; ORF2; KW reverse transcriptase. XX NM CR1-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 311-4303 RA Jekosch K.; RT "CR1DR1: CR1-like repeat from Danio rerio."; RL Repbase Reports 2(2), 7-7 (2002). XX RN [2] RP 1-4985 RA Kapitonov V.V. and Jurka J.; RT "CR1-1_DR, a family of CR1-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(4), 6-6 (2002). XX DR [2] (Consensus) XX CC CR1-1_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish recently. The consensus sequence encodes CC two proteins, CR1-1_DR1p (position 311-1216) and CR1-1_DR2p CC (positions 1220-4300). CC The 1027-aa CR1-1_DR2p protein is composed of AP endonuclease CC (aa positions 72-225) and reverse transcriptase (aa positions CC 559-796). The 302-aa CR1-1_DR1p protein is distantly related to CC proteins encoded by ORF1 in known CR1-like elements from chicken, CC pufferfish and turtle. CC The consensus sequence [2] was reconstructed from 10 copies CC that are ~1% divergent from it. CC Approximately 1000 copies of CR1-1_DR are present in the CC zebrafish CC genome. XX FH Key Location/Qualifiers FT CDS 311..1216 FT /product="CR1-1_DR1p" FT /translation="MSLPSLSLCAGEASMEALELELEEVESQIRALVVRRS FT RLREQLLVVPNAKAVSSPKVRGNYNHIIPSTSTPRPSLSRPSAPGARLSQA FT SFTPTPGYHGAWVQPRKVLPRSRGRTSPPVFEISTENRFSPLRESGPDVAI FT IGDSIVRHVRAASSKGNKVRTFCFPGARVRNISTQIPTILGAAESPGAVVL FT HVGTNDTGLRQSEILKKDFRSLIETVRRTSPATQIIVSGPLPTYRRGNERF FT SRLLALNEWLITWCKEQKLLFANNWNLFWERPRLFRPDGLHPSRAGAELLS FT DNISRLLRTI" FT CDS 1220..4300 FT /product="CR1-1_DR2p" FT /translation="LAGKNSQNSHYSHLDSCSPHLNISNAYLANPIETVSV FT PRIIRLRNKRTVCSRKNLVRIKPEKPVESENTNFVKLGLLNIRSLAPKALI FT INEIITENNLNALCLTETWLKQNDYISLNEATPPGFLYKHEARQTGRGGGV FT ASIFSDFLNIKQRNGLMFSSFEVLSLNVQLPDTIQKPMLSLALITIYRPPG FT PYVKFLKEFSDFISDLLVKTDKMLIVGDFNIHIDDANDTLGLAFMDLIHSL FT GIKQNVVGPTHRLKHTLDLILSYGIEVIDVDIIPQSDDITDHYLLLYKLCL FT PEISKPAPILRPSRTIVPSTKDEFINNLPDLSLFRNAPANSNDLDVVTSSM FT DAIFTSTLNTVAPIKLKKAREIKTIPWYNSHTRALKTATRALERKWKKTNL FT EVFRIAYKDSMSSYRRALKSARTEHLRKLIENNHNNPRFLFNTISKLANNR FT SSLEQTTPPQISSDDFMNFFSNKIEGFRQKIGDAKLSAPAYTPNPVNISLN FT HNNNLHCFKIIEHEELVKIINSSKPATCMLDSIPTKLLKELLPAIGEPLLN FT IINSSLSIGHVPNSYKLAVIKPIIKKPQLDTNNLANYRPISNLPFMSKILE FT KVVSTQLCSFLQTNNIFEVFQSGFRAHHSTETALVKITNDLLLAADRGCVS FT LLVLLDLSAAFDTIDHNILINRLKSTGVQGQALQWFKSYLTDRYQFVNLNG FT QPSQICPVKYGVPQGSVLGPLLFTIYMLPLGDIIRRHGISFHCYADDTQLY FT ISTKPDETSELSKLTECIKDIKDWMTNNFLLLNSDKTELLLIGPKSCTQQI FT SQLNLQLEGYKVSFSSTIKDLGVILDSNLTFKNHISHVTKTAFFHLRNIAK FT LRNMLSISDAEKLVHAFMTSRLDYCNALFAGCPASSINKLQLVQNAAARVL FT TRSRKYDHITPILSSLHWLPVKFRIEFKILLLTYKALNNLAPVYLTNLLSR FT YKPTRSLRSQNSGLLVVPRIAKSSKGGRAFSFMAPTLWNSLPDNVRGSDTL FT SQFKTRLKTYLFSKAYTQCIT" XX SQ Sequence 4985 BP; 1464 A; 1202 C; 916 G; 1403 T; 0 other; cgtcactggc gtcactgtct ccgttcggtc acatactgcg tgtgcttgaa gtttggactt 60 gctatttact cgcaatttaa atcttaatca taaaattctt cctcattcgc taagtatttg 120 tctctcctac ttagggtgac caaaccctta atacatttat acaaacaaaa ctgctttaaa 180 aacggtctgt ccctcgagca tccgcctgtt gtttgtagct ttagcctgct agcgccgctg 240 gtcagctaaa gctaccgacc tcttttacca tacacttttg acttactggc tttgctcttt 300 accccgtaaa atgtcgcttc cgtctctgtc cttgtgtgca ggagaagcat cgatggaggc 360 gttggagctg gagctggaag aagtggagtc ccagatccgc gcgctggtgg tgagacggtc 420 gcggctacgg gaacaactcc ttgttgtacc taatgctaag gccgtctcat cacctaaggt 480 acgtggaaat tacaaccaca tcattccctc tacctcaacc ccgcgtcctt ctctgtccag 540 gcccagcgca cccggggcgc ggctcagcca ggcgtcgttc acgccgacac ccggctacca 600 cggcgcctgg gtgcagccgc gcaaggtgct tcccagatcc cggggcagaa cgtctcctcc 660 tgtgttcgag atctccacgg agaaccgctt ctcccctctc cgcgagtcgg gtcccgatgt 720 ggccatcatc ggtgactcga tcgttcgtca cgtccgtgcc gcctcctcaa aaggtaataa 780 agtacgtact ttctgctttc ctggtgcccg tgtgagaaat atttctacac agattccaac 840 catcctgggc gctgccgaga gccctggtgc cgttgtcctc cacgtgggga caaacgacac 900 cgggctccgg cagtcggaga tcctgaagaa ggacttcagg agcctgatcg agacggttcg 960 acgcacctcg cccgccacgc agatcatcgt ttctgggccg cttcctacct accgccgagg 1020 aaatgaaagg ttcagtagac ttttagctct gaatgaatgg ctaataacat ggtgtaaaga 1080 acagaaattg ctctttgcta ataactggaa tcttttctgg gagcgtccta ggctcttccg 1140 tcctgacggc ctgcacccca gtcgagccgg agctgaactc ctgtcggaca acatctccag 1200 attacttcgc accatctgac tagcaggtaa aaattcacaa aattcacact atagccacct 1260 agactcttgt tcaccccact taaacatcag taacgcatat ctggcgaatc ctatagagac 1320 tgtgtctgtt cctcgtatta ttagattaag aaataaacgt actgtgtgct ccagaaaaaa 1380 tctagtaaga atcaaaccag aaaaaccagt agaaagtgaa aatacaaatt tcgtaaaact 1440 tggtctccta aacatcaggt cacttgcacc taaagcactt atcattaatg aaataataac 1500 agaaaacaat cttaatgcac tctgtctcac tgaaacctgg ctgaaacaaa atgactatat 1560 tagcttaaat gaagcaactc ctccaggatt cttatataaa catgaggctc gtcaaactgg 1620 tcgtggtggt ggagttgcat caatctttag tgatttcctt aatattaaac agagaaacgg 1680 acttatgttt agctcctttg aagtattatc gcttaatgtt cagcttccag atactataca 1740 aaaacctatg ttatctctcg ctttaatcac catatataga cccccaggac cctatgtcaa 1800 atttctaaaa gaattttctg attttatttc tgacttacta gtcaaaactg ataaaatgct 1860 aattgtaggt gactttaaca tccacataga tgacgctaat gatacattag ggctcgcgtt 1920 tatggattta atacactcac ttgggataaa gcaaaacgtt gtgggtccaa cccatcgctt 1980 aaagcataca ttagatctaa ttctgtctta tggaatcgag gttattgacg tagacattat 2040 accacaaagt gatgatatta cagatcacta cctcttacta tataagctgt gtttacctga 2100 aatcagcaaa cccgctccaa tactccgccc tagtagaact attgttccgt caactaaaga 2160 tgaatttata aataacttac ctgatctttc tctatttcgt aatgcacccg caaactcaaa 2220 tgatcttgat gtagtaacca gcagtatgga tgccatcttt actagcacac taaatactgt 2280 ggcacccatc aaattaaaaa aggctagaga gattaaaact ataccatggt ataatagtca 2340 tactcgtgcg ctcaaaacag caacccgtgc cctggaacgt aaatggaaaa aaactaattt 2400 agaggtcttt agaattgcgt acaaagacag tatgtccagc tataggaggg ctctaaaatc 2460 tgccaggacc gagcacctgc gcaaactgat agaaaataat cataacaatc ctagattttt 2520 atttaacacc atctctaaat tagcaaataa tcggtcatcc ttggaacaaa ctactccacc 2580 gcaaattagt agtgatgact tcatgaattt tttcagtaat aaaatagaag gctttagaca 2640 gaaaatagga gatgccaaac tttctgcacc ggcttatact ccaaatcctg taaatatttc 2700 attaaatcat aataataacc tacactgctt caaaatcata gaacatgaag agttagtaaa 2760 aattataaat agctctaaac cagctacgtg tatgctggac tcaattccaa caaaattact 2820 gaaagagctg ctacctgcta taggagaacc tcttcttaac attatcaact cttctttatc 2880 tataggccat gttccaaact cttacaagct agctgttatt aagcctatta ttaagaaacc 2940 gcaactagac accaacaact tagctaacta taggcctatt tcaaatcttc catttatgtc 3000 taaaatacta gaaaaagttg tttccactca attatgctct tttctgcaga cgaacaatat 3060 ttttgaagtg tttcagtcag gtttcagggc tcaccacagt acagaaaccg ccttagtgaa 3120 aataaccaac gatttactct tagctgctga ccgagggtgc gtctcgctat tagttttact 3180 cgatcttagt gcggcatttg ataccattga ccacaatatc ctcataaatc gcttaaagtc 3240 tacaggtgtc cagggacagg ctctacaatg gtttaagtca tacttaactg accgctacca 3300 gtttgtgaat cttaatggac agccttcaca aatctgccca gtaaagtatg gggtgcctca 3360 aggatcagtt ttaggccctt tactgtttac aatttacatg ctacctctgg gagacattat 3420 tagaagacat gggatcagct ttcactgcta tgcagatgat actcaattat atatttcaac 3480 taaacctgac gagacgtctg aactttctaa actaactgag tgtatcaaag acatcaaaga 3540 ctggatgacc aacaattttc ttctcttaaa ctcagacaaa acagaattat tacttattgg 3600 gcctaaatct tgcacacagc agatctcgca actcaattta caattagagg gatacaaagt 3660 tagctttagc tctactataa aagatctggg tgtcatatta gacagcaatc taacttttaa 3720 aaaccatata tcccatgtca caaaaactgc cttctttcat ctgagaaata tcgctaaatt 3780 acgaaatatg ctatccatct cagatgcaga aaagctagtc catgctttta tgacttcgag 3840 actggattac tgtaatgctc tatttgctgg ctgcccagca tcctctatta acaaacttca 3900 attagtacaa aatgcagcag ccagagttct gaccaggtct agaaaatatg atcatataac 3960 cccaatttta tcctccttac actggctgcc tgttaagttt cgtattgaat ttaaaatatt 4020 acttctcacc tataaagctc taaataatct agctcctgtt tatctaacca accttctgtc 4080 tcgctacaaa ccaactcgct ctttaagatc tcaaaattca gggcttctgg tagtacctag 4140 aatagcaaaa tcaagtaaag gaggtcgagc cttctctttc atggctccta cactctggaa 4200 tagccttcct gataacgtcc gaggctcaga cacactctcc cagttcaaaa ctagattaaa 4260 gacctatctg tttagtaaag catacactca atgcatcacc tagcgggttc cacacaggct 4320 tctgcatctt gcttatatac actatgaaca gcagctacgc taattattct ctttattctc 4380 tattttcacc tggggatact catcccgagg tcctcagatt atgcggagtc actgattgga 4440 tccaagacca gcgacgtgat gatcccaagg attccatatc cgggaccagg ccatatcctg 4500 agctgctgct gcgctgatgg tcgtggggag tggagaacat gagtctgatt ccagcgacgc 4560 tccagggaca gacgagtctt cgctgaggcc atcttccagc ctaaaccacg gcgaatgaag 4620 ctctgcacaa gacttttggc cagcggagaa attaaaatgg tcgtgcccaa ctgagtctgg 4680 ttctctcaag gttttttttc ttcactccca tcaggtgaag ttttttttcc ctctccgctg 4740 tcgccactgc ctcgcatggt tcaggattgg tagagctacg catcgatgaa tttgctcttc 4800 agtgtttgaa ctctcagtaa tgattaaatc acactgaact gagctaaact gaactgaact 4860 gaacttaaac actaaaacct gaaccacact gttccagtta ctatgaccat ttatgtgaag 4920 ctgctttgac acaatctaca ttgtaaaagc gctatacaaa taaagctgaa ttgaattgaa 4980 ttgaa 4985 // ID DNA-6-N2_DR repbase; DNA; ZEB; 5056 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5056 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1241-1241 (2008). XX DR [1] (Consensus) XX CC Copies are ~98% identical to the consensus. TIR is ~1 Kb long. CC TSD is 6-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5056 BP; 1687 A; 787 C; 855 G; 1727 T; 0 other; ggtccgtgta tcatccgttc atttgattat tccttttatt acggaaaacg aaacatcaga 60 aaaacgatga atatttcgtt tttctgttta ttctgtaggc agaaaaaacg gaaactcatc 120 tgttattctg cttattaact taaaacgaaa tattaaaaca aacacaaaac caaaacgaaa 180 taatgagtca aaaaatggtc cgtgtacggt ccgtgtacgt tttgttcatt tgttttccat 240 tttcaaacgg aataaaggaa atggagaaaa cggccgtttt tccgtttttc ttttgctcac 300 gagaaaacga aaaaacgaga ttttggctgg attttcgttt tttggtttag ggtaggaaaa 360 cggataaacg actttaaaat tcattttaca catgtaggcg gtgctgaaac gcccactttc 420 ctctaattgg tcaaccaaat ctgcgctcgt gacgtcaccc tcaaaataaa agccttacac 480 gcaggctttt aattattatt atttaattat atatataaac aaagtttaca tattctatca 540 tttgaaccac acacagttag caggcttaca ttcattgcgt atgtataaat taaataaaaa 600 cgtaaatcag cagtattctt agacatttgt cattaaaata aggtcatagt tcactgccaa 660 gcttcttgct gctgtgcacc tgatgctgag caaagaaagc catttccgag cagcacctgg 720 tttacatgat tgtttcagag ctattgtagg cctaaataat agcctactct gtataaaata 780 ataatttatt attattatta ttattattat tattattatt attattatta ttattaggcc 840 tattattatt agtattgtta ttattattat ttacttagtt atttaatggt tgtaagaaca 900 cattggaccc tgcatggggc tataataaat ggtaccattt gtttctattg gattttctcc 960 tattttaatt taatatatgc ctttagccta ttgtaaatat ttatttgata attaatactt 1020 ttttattgca acgtcaaagt aggcctgtct tattattatt attttttgtt attattttgt 1080 cgttgttgtt gttattatta ttgtagcagg gacatagcca caggtgtggc agagtgtgct 1140 ggtgccaccc aaagtggcat cttgcacctg aaaatagatg ccttcgcgct caagcgcgcg 1200 caaaaaaact tgcacaggca aaagtaatga tgaatgtgtt gaaggaaaaa aaaaacattc 1260 taattatttg ttaataaaat agtgcaacat tattctcagt cagtgtaggc tgaaaaaatt 1320 aagatcgcca gcatttcaag ggtggtttta tttttagttc attgctgtga cgcgctatat 1380 ttcactaagg ctgcatgaga ctgcgcatct tgcttatttt ctctatttta catacagaac 1440 atatcataca gtccaggttc taccgttcta gtgagtgtgg ggcattgctt tacggtggta 1500 aagtggatta tttcttggca aattcataat gattcacttt gtttagtaat aaatttcata 1560 gttataaaat aactacattt caagaatatg ttaacacttt caactattta ggattattta 1620 ttgcatcaaa agtaatttca aagtaacatt aaatgtacgt ataaactgct taatttgtgt 1680 gtgtaaacac ctaagatgca aaaagcttat ttagaccaga ataggcctac taaatgcaac 1740 ggtcaatgta tgtaaactta tgacctaaaa tgtcttaaat gactaaattt aagtcttttt 1800 aagtttttaa ataatttaag tttaacataa atgatttaaa ctttaaactt caaattaaca 1860 gagtatggca aataaattgt ttattaaatg gaggtcaggt gtctatctta attataatta 1920 ttaaatatta tgacactata attcagtggt gtattggaga gtacgcacct tttaattaat 1980 taattttaaa taagtggttt atcctaaata aaatgcaatg ggatcggtgt gcaatgaata 2040 gtttgaaaac ccctggttca gactagacga gctagctgtc tgtgcgctgc gctaataact 2100 tggcatttgc tcttttcggt gctgccagat gcatttatag tgcataaaaa ataaaaaagg 2160 taattcggta aagcgcgcga tgcgcgcaga aataagcgca ctatatgcat atgtcgaatt 2220 aagttctctt tttctttaat caagactttt ccattcgagg aattacagtt attaatagct 2280 tagtccttat aaacgatcag tttataaaat acacaattta cggatcaagc ctaatgaaga 2340 ataagaagaa tatgcaacgc attatgatcc ttgaatgatc agagcgcatt tatataaact 2400 atatatttta tataatccat aaatattaaa aaacattata catttaaagt aatggaacat 2460 tatatgttat aaaatctaaa ctattattta ggctacaggc taaataaagg cgccttttcg 2520 gggctgttca gcgcgtgcgt caaagaaaag acatgacaca atctctataa aaactagttg 2580 tttatttaca atatctaaag ggatataaaa catctaacat gcatctccat tctcaacgag 2640 tttctgtgtt ctgcatcctt cacagaccga ggtctcattc cgaggtaatc tgtaaatatt 2700 ctcatttggt tctggcacat tcttgattaa aggtgaaact gaccggcgca acggatgatt 2760 tgtcatctag ggtgactaca gttataatac aggttatgga actatttcac tttaatttca 2820 tattattatt ttattaaaca aacaaaccag tttttgtttt ggctctgcat gcagcgtatt 2880 ggatctttga agcattgcac ttcaaatgtt attccttttt tttattttta ttgtattttt 2940 aatattttta attatacaac acaaaattgg ttctaaacag atacattttc ctataggcta 3000 tgatgaagtg tatatttttg ttcacacagg tcctatacaa aactgtgtgg atggatgatt 3060 tgtttctccg tgaaaatcac tcatttaata agcaaaacag gccaaaatgt tgtttagctt 3120 gcgtcaaact ggatggacaa tttacaaaca tattgaatac aaaggctggt tattttatgc 3180 aatgagggac aataggccta ttacaaatta aagaaaatta tttgcatatt aaaacgaaaa 3240 tatataatac aacaacagat cctacctcgt aagaaatgac tttagataat gtaaataagc 3300 atcaagtttt tttttttgag attgtgtgtc taattcagta gtgtagtgcg ttcgcgtggg 3360 tttcctccgg gtgctccggt tttccccaca gtccaaagac atgcggtaca ggtgaattgg 3420 gtaggctaaa ttgtccgtac tgtataagtg tgtgtgtatg tgtggatgtt tcccagagat 3480 gggttgtggc tggaagggca tccgctgcgt aaaaacttgc taaataagtt ggcggttcat 3540 tccgctgtgg cgaccccaga ttaataaagg gactaagccg acaagaaaat gaatgaataa 3600 tgtgcaatac taatttaaaa tacgtatcac taaccttata tgtgaaatag cagggcccgc 3660 tgtagcactt tctcaagaag agcaaatgtc aaattattag cgcaacgcac atgtagcgag 3720 ctcatctgtt tgtcggaact actagacaca atttttttta tcaactataa tgtgtttaat 3780 tagttaattt gaaatttatt tcattattcc agcaataatt gttgagtgaa agaatgcgct 3840 agaaatatgc attgcggctc cctttattgt acaggcttat tgcacttaaa cttttttagg 3900 ttcggctctc gaattagtaa ggttatgagt aaatgttatt taaaatattt aaagcctgcc 3960 atctagccta caataacagc aaatttaata ataaaaaaaa taataagaca ggcctacttt 4020 ggcgttgcaa taaaatagta ttaattatca aataaataat tacaatatat aattaaaata 4080 ggaagaaata aatgaaaaca aatggtaccc tttgttttta tagcagggcc caaagtgttc 4140 ttattctggt tctgctaaca actattaaaa agaaaaaaaa acacaccaaa caataaaaac 4200 actagtaact gataataata ataataataa taataataat aataataata ataataataa 4260 taattattat tattattatt attattatta ttattattat tattataaag cccttgctct 4320 gaaacaatca tgcaaaccag gtgctgctcg gaaatggcta tctttgctct gcatcaggtg 4380 cacagcagca agaagcttgg cagtgaacta tgaccttatt ttaatgacaa atgtctaaga 4440 atactgctga tttacgtttt tatttaattt atacatacgc aatgaatgta agcctgctaa 4500 ctgtgtgtgg ttcaaatgat agaatatgta aactttgttt atatatataa ttaaataata 4560 ataattaaaa gcctgcgtgt aaggctttta ttttgagggt gacgtcacga gcgcagattt 4620 ggttgaccaa ttagaggaaa gtgggcgttt cagcaccgcc tacatgtgta aaatgaattt 4680 taaagtcgtt tatccgtttt cctaccctaa accaaaaaac gaaaatccag ccaaaatctc 4740 gttttttcgt tttctcgtga gcaaaagaaa aacggaaaaa cggccgtttt ctccatttcc 4800 tttattccgt ttgaaaatgg aaaacaaatg aacaaaacgt acacggaccg tacacggacc 4860 attttttgac tcattatttc gttttggttt tgtgtttgtt ttaatatttc gttttaagtt 4920 aataagcaga ataacagatg agtttccgtt ttttctgcct acagaataaa cagaaaaacg 4980 aaatattcat cgtttttctg atgtttcgtt ttccgtaata aaaggaataa tcaaatgaac 5040 ggatgataca cggacc 5056 // ID HARBINGERN5_DR repbase; DNA; ZEB; 861 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HARBINGERN5_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW 16-bp TIRs; 3-bp TSD; Dr000025; Dr000168; Dr000182; KW Harbinger superfamily; HARBINGERN5_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 2-851 RA Bao Z.; RT "Dr000025, Dr000168 and Dr000182, unclassified and RT uncharacterized repeats."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-861 RA Kapitonov V.V. and Jurka J.; RT "HARBINGERN5_DR, an ancient family of Harbinger-like RT nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(6), 6-6 (2002). XX DR [2] (Consensus) XX CC HARBINGERN5_DR copies are flanked by 3-bp target site CC duplications CC generated upon their integration in the genome. CC It has perfect 18-bp terminal inverted repeats similar to other CC HARBINGERs. There are several thousand copies of HARBINGERN5_DR CC harbored by the zebrafish genome; they are ~12% divergent from CC the consensus sequence. XX SQ Sequence 861 BP; 239 A; 178 C; 187 G; 257 T; 0 other; ggcccaatcc caattctatt tttgtacccc taccccttcc ccttggccct tgaaacagag 60 tgtgaagggg aagggcttca aaatttaccc ctaagaaatg ggacagcact acagcacctg 120 cacacgtcat catatgtcat cgcgatctct tgcttcatat gagatcagac gatcgcgact 180 gctgtagtta ttccagttgc gttatttttt ggtatttatc ttcaggaaat cactgaaggc 240 atatatcatg ttatcataac gatctaatgt ggcaataaga tcgtaactgt actgtgcatt 300 tacacagtgg ccatattcat ctatgtaaac acacaaaaac aacattaaca ttatagcaga 360 cactgtaaaa agctcattcc cagccactag acttttctga cagggtattc gagtgtcatc 420 gagtgtcaga atgttgtggg actgctatac aggagttatt attatggatt atagatagcg 480 aaatttagca gtttttattt tagcgttttt tttttaaagc atgacggtaa aacacgaacg 540 cggttatgaa tgtattaaaa catgtgcttg tttgttgtaa aaattcgtaa taatgacaaa 600 aaaatactaa tttgtggatc tccttacttc cgggtgcagc catgctgccg ttgtggctgg 660 tgtattctgg gaaattttct taccccttgg tttcgagtgt ggtcctgaaa aatcttcgtt 720 cgaagggcta tctagccctt ccccttagcc ctacgccttc aagctaaaga gaattgggac 780 acccctaccc cttcacgtga acgcgcaaaa cgaggggtag gggtaagggg aagggctaag 840 gggtagaatt gggattgggc c 861 // ID Gypsy9-LTR_DR repbase; DNA; ZEB; 1450 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy9_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy9-I_DR; Gypsy9-LTR_DR; Gypsy9_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1450 RA Kapitonov V.V. and Jurka J.; RT "Gypsy9_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 324-324 (2004). XX DR [1] (Consensus) XX CC Gypsy9-LTR_DR is a long terminal repeat of the Gypsy9_DR LTR CC retrotransposon. Its internal portion is deposited in Repbase CC as Gypsy9-I_DR. XX SQ Sequence 1450 BP; 432 A; 197 C; 329 G; 492 T; 0 other; tgtagccctt gataaatctg attaataaat aaagaagaca acaatcaggg aagagagcaa 60 gttatgttta ttattttttt gtaaatatgt tttgattatt attttctctt cattttgatt 120 tattcttggt gttctagtaa aagaaacatg tatacatata cttaagcttt aaagaactgt 180 taagcatgtg tgtatgtact tgcattggtg caggaaatta atgtaaaatg agggaggaac 240 aaaaaaaaaa gggagagaga gagagcggag aaaaaaaaga gagactctcg tggggtcgga 300 agtgaggagc gcaggagtgg acgagcaaaa gcggagtttt gttttaatcc cgacgaaaaa 360 ggagtgttca aaaagaaacg tgtgcgaagg attgtgatgt ggtctatttt ccccgccaag 420 agcggagagt gaagtgactc gtgaattagt ttcatcacgt cgactggaag gacacaagtt 480 tccgtgcgcc tggccgactc ggggaaaaaa aagagaaagc ttttctggat tcggttgtta 540 gagcgaatgt gtggggatta accttcatca cgtttcgccc accgtgaaca tatcaacaaa 600 ggatatgtga gtacagtgcc accgagatct gttgacaaaa ctgaatcgtt caaaggaaaa 660 gggcaacaac ggtttcgcgc attctttgcg tcctttcaaa gactgagttt ggattttcag 720 tacctgtatt tgcatgatga ataatcagtg gtgtgtttaa acggacactg gtttgagaat 780 taacgtttgt aacgagtgta ttgattaacg ttgattcagt tcaaatgtaa tgaatcgtca 840 agtgtggcga gtcaaatgag tcagttcaag agtcactgaa tgttttgaag gtttatttgt 900 gagtttgact aagtaccact gtcggggtta agagttactt attagaattt ctattttcaa 960 agatagcata ttagtttgga ttaagagaga gagagattca caatttattt gttagtttta 1020 tttatatatg gggatgcttt gagaagtgtc atgtgaatct ttaaacactc taattttctg 1080 ttgggtaaat ttgtgggtta gtacttataa ttctgtcact ttattagttt ggtttcatat 1140 ttatatactt tgtatttaca cactcaattg agcttaattt tgtttattta tctattcatt 1200 ttatttatat ttttattttt tttttataaa cctgaaggtt agatagcact tttattgtca 1260 gggttcttta tacttctcaa tttgggaagt ataggtttct acaattggag gcaccgcgct 1320 aaataaaagc tcattgttgt acatcctggt atattttttt ttgggaaaag accagttatt 1380 acatattact accgctggga taccagggct cctatatagt tttgtttgta cccagctagg 1440 gagggctaca 1450 // ID DNA-8-10_DR repbase; DNA; ZEB; 234 BP. XX AC . XX DT 14-NOV-2008 (Rel. 13.11, Created) DT 14-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE DNA-8-10_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-234 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1725-1725 (2008). XX DR [1] (Consensus) XX CC TSD is 8-bp long. TIR is ~60-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 234 BP; 46 A; 60 C; 73 G; 55 T; 0 other; caagtggtgc tcgctgcaga gccatttgag gagagctgag ctccagcgag ggggagctta 60 agcttgagct ccaccttttt tgcacttctt ctacgagtga tgtcactggg ggtagggtta 120 ggggtggggt tggtgtacgc attaaaacag cttacaggag gaggagcgac agctcatgct 180 ccccctcgct ggagctcagc tctcctcaaa tggctctgca gcgagcaccc tctc 234 // ID DNA-8-14_DR repbase; DNA; ZEB; 1573 BP. XX AC . XX DT 23-NOV-2008 (Rel. 13.11, Created) DT 23-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-14_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1573 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1729-1729 (2008). XX DR [1] (Consensus) XX CC Copies are ~88% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1573 BP; 437 A; 328 C; 346 G; 461 T; 1 other; cagtggttcc caaccttttt tttgcctgag accccctttt tcccctggaa aatattgtaa 60 ggcccccctc cacatttaat gatataatcg ctcatataag tttgcagtaa tgatgacaaa 120 aaaatgttgt tttcaaacaa acggtatgtt tattactata tctagatatg tttatgcaca 180 aataatagcc taatgtttcc cagagatggg ttgcagctgg aagggcatcc gctgcgtaaa 240 acatgtgctg gataagttgg cggttcattc cgctgtggcg accccggatt aataaaggga 300 ctaagccgaa aagaaaatga atgaatgaat gaataatagc ctaatgtaaa atataaaagc 360 aaaacatcag taggccattt gtaactgaaa aacataagcc tttggccttt aaattgaccc 420 tatttgaaat tcaataacaa aaatatcagt attcacactg gtgtataagg gaaaaatata 480 atataggcta ttacaattta ctacttgcat gttatttgtt atgctctatc aattagtgtg 540 aggattgctg cgaggcgaat cagtgagact ctgtcttttt ttcttttttg tggaggacgg 600 attacaaatt tgtccatttt tggtcagcta gggcagtggt cctcaacctt tttatcacca 660 cgcaccggtc aacgcttgac aattttactg cggggggttg ggggtgtttc gtaggttgct 720 atgcgaccat caaccgtttt ctcagaggat gacaaactga caaaacattg ctgtcactct 780 ctgatacaag ttttatttat ggagaaaagt acaaaggact gcagtgtttg tgtgttctgt 840 gtttgtctga gaaaattagt cttaccgaaa tgtgaatatt ccatacaagc tgaagctgat 900 cggtcaattc ttgtcacgtg actcgcggtg cgctcgcggc ccggggttca ttggtgcgag 960 cgcgtcacgc cgcttgtgcg cgcaagccgc gcacctccat tggaaacaac aaacttgtgc 1020 aaactctaga aatacgcgat atgcgaacgc cccctaaaag gccgccgtca tttgcaatct 1080 ccatcagttg atcttcttgc acagccatgg tggattcacc tgatttgttg acaagtgggt 1140 tgcggatcta ttccttggca gttcgygggt cgttcactgg gcctttttcc cttagcaaag 1200 aaactttcca aagacgtctg tttcttactc attttgctgc ttgtgggtta aatttgggcg 1260 ctcaagtgac cgagatgtaa gcgagagaat acggtcattt ttcaaaataa aagacttttc 1320 aaaataaaag atcgttcaga ctcagataat aaataaaacg gaaataatta atgatttctt 1380 gtgcggcccg gtaccaattg atccacggac cggtggttga ggaccactga gctaggggat 1440 aaaatgaact aaggcaacat gatcattctc ctaattgtcc actgctaaga ttttttttta 1500 aatatgacgc gaccccccac aaagctcgct gaggccccct tgtgggccgc gacccccctg 1560 ttgggaaccg ctg 1573 // ID Gypsy172-LTR_Dr repbase; DNA; ZEB; 391 BP. XX AC . XX DT 24-OCT-2009 (Rel. 15.09, Created) DT 24-OCT-2009 (Rel. 15.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-172_DR LTR retrotransposon - DE a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; Gypsy superfamily; integrase; Gypsy-172-I_DR; KW Gypsy-172-LTR_DR; Gypsy-172_DR; Gypsy172-I_Dr; Gypsy172-LTR_Dr. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-391 RA Dib M.R. and Naveira H.F.; RT "Gypsy-172_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 10(9), 1211-1211 (2010). XX DR [1] (Consensus) XX CC Gypsy-172-LTR_DR is a long terminal repeat of the Gypsy-172_DR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion (Gypsy-172-I_DR) contains three ORFs encoding CC the 157-aa Gypsy-172_DR1p gag (pos. 125-595) and 458-aa CC Gypsy-172_DR2p pol proteins (pos. 709-2083) composed of the CC protease, reverse transcriptase, and the 736-aa Gypsy-172_DR3p CC pol proteins (pos.2004-4211) composed of the integrase domain. CC Some insertions fairly recent, according to the hight identity CC between their flanking LTRs. Consensus obtained after the CC alignment of at least three independent insertions. XX SQ Sequence 391 BP; 84 A; 108 C; 66 G; 133 T; 0 other; tgtcacagag acaaactccg tgattccctc ctctaaccag cagagggcac catcacctca 60 ttattaactg acagttccct cactacattt cccatctacc ccgtgtctgt ttgattacgg 120 gcccagctga aatcactctg ccatctattt aagcagactg gtttgtttca ttctttgcga 180 agtcttgttt gctactagtt gacatttctg agcgtttata ccctactttg attttcctgt 240 tgtgaccttg cctgtttacc gtctacgatc ccttgctgcc agccctgacc atttgcctgt 300 ttatgaccac gattctgttt ctctgccgtt gttgattatg tataataata aactgcacga 360 tggatcccaa tctccgtgac gaatcattac a 391 // ID Gypsy142-LTR_DR repbase; DNA; ZEB; 166 BP. XX AC chr15; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy142-I_DR; KW Gypsy142-LTR_DR; Gypsy142_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-166 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1546-1546 (2008). XX DR Genome; chr15; Positions 41200973 41201138. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 166 BP; 48 A; 24 C; 38 G; 56 T; 0 other; tgtgatatat tgatgtttgc atttttgtat ttgaaacaat agggactccg tagtacggat 60 gtagttgata tagttctggc agtttcggat gaaagcagaa agtaaagctg ttgcttaata 120 cggacgctcc gactcaagtg tatcttttac agtcagatat atcaca 166 // ID EnSpm-N3_DR repbase; DNA; ZEB; 2944 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE EnSpm-N3_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2944 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(10), 1263-1263 (2008). XX DR [1] (Consensus) XX CC The element is a composite transposon containing TR14 and CC CR1-5_DR elements (masked out). TSD is 2-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2944 BP; 929 A; 447 C; 496 G; 1071 T; 1 other; cactgttaga catttctgta aattacacag ttatttactg tatttcaccc agtgtaatac 60 tgcaaattcc ttttacggta aataactgta tttaccgttg cattatggga atttagtgtg 120 acgtcgaaca acatatacag cgatgtactg tatttgtaaa aattcgtgta ttatactgta 180 ttttccacaa ctgcttcaac gccaccttgc ggcgattcga ctactctgca ccacattttg 240 cctgaggacc ctggagcaat tctggaggac aatttattag tgtgcctgaa gaacatactg 300 gatttaacaa actttactct ggtaagctga ggcagtgttt cattttacag aatgttttgt 360 ggacgagaat agaacaaagt aaagtataaa gttggctatt attattttct ctggcttggc 420 gttatttcgc ccatttgaga aatatatcat ctattaagat gttaccttta tttaatttat 480 tacattaaca ctactattac tttaaataag actgtttatg actattagcc attgttctcg 540 tcatatcttt ttatttattt acataggaac cggtgttgca gcgggttttg gtttaggagg 600 gaaccagagg tatgcagaca gttttggagg acacttagca cgagacaacg gtccggcaga 660 gagtgttttc ccccagaaga atatgaatga aagaccaaca aataatccag gtaaaagcgc 720 gtctgtctgt gctgacaaca ctcgacattg atttgagcac ctctattagc atttgtttgc 780 tcggcaacat tggctgaagc gatcttaaaa tggtaatgtt aactgttagc taaactgagc 840 taagtttata ctgaattgga ctcaaactca ttttaacgtg caacgtttaa cttatattga 900 tttatactat atatattaga tgttgtgaac ttatttttct ctggtttaga gtaacttata 960 acgttagctt gagaaaataa catcgcgacg taacctataa gaacgttaac gtaaatctat 1020 aacgttacaa tatgcatata tgcattatgc taataaccac ctttttgcta tttattgttt 1080 gctaattatt tttttatcac aactcttgac atttggttta acataattgt taccattata 1140 tttttttctg tttagatatg gcactggcac agcagcaggt tttgtctgaa gaaacaaaca 1200 gcatttcaga ggaacatctt gatcagcaca gccttatgct taaaaaataa gcaagaccag 1260 taagcaacca aggtaaacaa ataaaaaaaa gttgtgtgga ggttgtggat tgtagcatgg 1320 tattcttatc tgtatttttg gcttttttat ggtctcaata tagaatactg ggagcctgcc 1380 tgtgcaaact ggacctcata caacagtaag tactgtacat ctatttgttt ctcaatatat 1440 taacgttact tttaatacaa aatgtgtcac atgtttagtt gctagctaaa taaaaaaatc 1500 gttttccctt ttttattaaa ccatttagta ggtttttttt ttcatctttg tttataatat 1560 ttttgagcag attgaggtct gtcatgggtc agacatttca aaatacaata attagtattg 1620 ggatacatgc atttgtctat gtattaaatg tctttttccc catttcatgt tgcctaaata 1680 ttttgtaata aataaaataa tacaattttt tgtataaata attccagagg atgaaacaaa 1740 atgcacagca acaaccagag tgagccagaa gatagcagga gaagtactga agagaatgac 1800 aaccatgaac aacagtgagt tatccttctt tcaccagcta actgagaact tggattaggt 1860 gagacactct caatagtttt atcagtattc agtcagtttt agtttattag gaatacataa 1920 aacaaatagt ctaatctgtc ctgcagtgaa tcaacccatc acaaatgcta caatgttttt 1980 gtaaaacctt gttagggaaa aaaataaaca ggacattctg gttagttatc ttttaaacat 2040 ggtagtgagc aggttatgag caggagccag ttgctcagta ccagctgaaa actaccccaa 2100 cctagctgcc atgcttcaaa ataattaacc agtatatgat ttattttttt taaacaagct 2160 tctgaacatg tgaattcaga ccttgttgag catttgacac cagaactagt aatattttaa 2220 agttgggttg tttctgagtc ggctaatagt tacaaatatt tatttttcca acaatatgca 2280 attcagaagt ttaaattttt attgtacatg tatgttctag tgctttgtgt gatttattta 2340 tttttaatgt gttgatttta ttgcttaatg taattttcac acattttcct taaatgcttt 2400 aagcattatt aatgttttaa agaatggaat attttgtaat tgtgtctggt ttaatgtcag 2460 tagtacttag tacaattggg tttttttttc tcaacaatat gcaggtcaga cacttaaatt 2520 agttttgtac atgtatgtta atgtgctttg tcattttcaa tgtttttttt tcttaaaata 2580 aaatattttg aatgattgaa atgtaatgtt tttacacatt tttaagcttg ttttaagcat 2640 ctccaatgtt ttaaataaag agtgtttatt ttgttaatat tttgtaattg tgtctttttt 2700 aggtctatag tattaaaata atattaataa aattatatta taaaatattt aggacaaaat 2760 taaaaggttt acagtagcta actgttaact taggttttta cagtagttaa ccattttcag 2820 acactacrgt aacatgctgt aaacggattt acagttgtat actgtaaatc taaaatacag 2880 taacttactg gcaacagtgt tgccagtaag ttactgtaaa aaccctttga aatgtctaac 2940 agtg 2944 // ID Gypsy73-LTR_DR repbase; DNA; ZEB; 168 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy73-I_DR; KW Gypsy73-LTR_DR; Gypsy73_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-168 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1462-1462 (2008). XX DR Genome; chr14; Positions 17567612 17567445. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 168 BP; 49 A; 19 C; 44 G; 56 T; 0 other; tgtgacaata ggaactaata atatctgtgt tgtgggagag ggaacctgtt tgagtgttgc 60 actgtcaatg ttgtgagtga cagagaagtt ttttggatta aactgagtta gacagttatc 120 atggtctcag tgagtctctt tactggtgaa gaaagtaatc tttaaaca 168 // ID DNA2-4_DR repbase; DNA; ZEB; 1336 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 28-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW DNA2-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1336 RA Jurka J.; RT "hAT-type families from zebrafish."; RL Repbase Reports 8(9), 932-932 (2008). XX DR [1] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1336 BP; 394 A; 248 C; 228 G; 466 T; 0 other; caggggtttt caaactgtgg gtcgcgaccc actagtgggt cgcacagtga acaaaggtgg 60 gtcgcgcaac gtcacatagc tgcagctata tttacattgc ggtgaatcaa aaccagtacg 120 attgacggca ataactcaaa aacctcttac cctaaaaata tctaaagtgt gtttcctaca 180 aaaagacaaa gctggttatg tttcacagct gtctttttct tttttttttt cgctcgacat 240 tacgagcact gctccgtttg agccctcgca atctgcgttt agccggtaga gctcgcgcag 300 agcggagctc tcagtaaggg accgtcggaa aagtgcttat ttttttttcg tttttttttt 360 ttcagttttt tttttttttt ttttgctccg tcctgcgtgt tttattttaa acactactaa 420 ttttctctta aatgagcaca aacagttact aaagtagtcg aatgcttcat tatagatctg 480 tgcattctta cattaacacc tctgttatca aacaaaacac aatgagagat tcatttgctg 540 ctcttcacta aataactacg gtaactttaa tcaatatgca aatacaatta aaagtgaaat 600 agattttata gctttattta atttctggat aggccattga ttttaatagg ctaaaccttt 660 tattttattg tcaatatttt ctaatgtttg ctatgtattc tatcatttga agctatttgt 720 tgtcccatat tttttacatc ccaacgttga cagattgcta atcaataaat agctatagtg 780 ctatctgtta gttttaacgt cagctaatgt tatggaaggg catagatgtt atggaaaata 840 gtaatagtaa tgtaactacc cccatcattc cttcctcaag caaatgtgta aatattagat 900 ctattcagca ataatgttaa ttgcattggc atatttatct gacgttttcc cagcttgtag 960 tagtcgatca aaaagctatt tagtttctta tgactataca ctagcagtgg aatttactgc 1020 gatgtgacgg ggcttgatca taatcaaggg gcttgatcaa aggttgatca tattctcttc 1080 taaaaaaaag aagagtatct ttttcaacaa aaaaatgatc acagttacag ccagcatccc 1140 acaaaaagca cttcgagcct cattttgtgc atatcaaatt cgcaaaatcc aaaaagccac 1200 acaccatata tttttgactg ctttgaatgg atacattttt tttacccatt tttccatatt 1260 aaaatattat ggtgggtctt gagattgaat tacttgccta ggtgggtccc ggaataaaaa 1320 agtttgggaa tccctg 1336 // ID Gypsy101-I_DR repbase; DNA; ZEB; 4305 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy101-I_DR; KW Gypsy101-LTR_DR; Gypsy101_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4305 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1511-1511 (2008). XX DR Genome; chr19; Positions 23445719 23441415. XX CC Positions [1673-2014] - Reverse transcriptase CC Positions [3146-3625] - Integrase core CC 'ATGAT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 218..4306 FT /product="Gypsy101-I_DR_1p" FT /translation="MTSGVRLAFPDKFSGNPAKCKGFLLQCKLFIAQQPHL FT FKDENGKIAFVCSLLTGKALDWATAVWPDSTPIFPSFNGFLKRFCTVFDHP FT EGGRNAGEELLCIQQGDRSAAEFALQFRTLAAQTGWADDPLVTLYRRALKP FT ELQREMACRDDGKTLDQLIELSIRLDTLLRTRNPLCSIASSPVSPETSAEP FT MQLGRTRLNPEERERRRRNHLCIYCGLPGHTKVLCPNKPLPKTLSVSATTM FT FTTTNNVVNLPVTLRNDGNEIETMAMIDSGAAGNFIDYTFATTHSIPLTSC FT DSSIAITAVDGRPLGEGRIKFQTLPVLLQTGSLHEEEISLLAINSPKHSVI FT LGLPWLQQHDPQVSWRTGEIIKWSNQCFTQCLHPVSPIQINIINKTEDSEL FT HHVPDAYHDLIEAFNKQKATKLPPHRDNDCAIELLPGTTPPRGRIFPLSQP FT ETEAMKKYISEELGKGFIRPSTSPASAGFFFVKKKDGSLRPCIDYRGLNEI FT TVKYRYPLPLVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTI FT DGHYEYLVMPFGLANSPSVFQAFVNEIFRDMLNKWVIVYIDDILIYSNSLS FT EHIQHVRAVLKRLIENQLYAKSSKCEFHQTCILFLGYIISPEGVAMDQQKV FT DSVTQWPQPETIRQLQRFLGFANFYRRFIRNFSSVAAPLTTMVKANNARLK FT WNPDAVRAFTQLKTRFSSAPILRHPDPEQPFVVEIDASNTGIGAILSQRSL FT VNKKLHPCAFYSRKLNSAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFI FT VITDHKNLEYIRSCKRLNPRQARWALFFTRFDFQVTYIPGSKNIKADALSR FT LSDDETSEISDEPIIKSPLIVAPIQWDIDQEILQAAESNPSQQPCPENKIF FT VPPSLRERLISEVHDHPSSGHPGSTATVQMIQSRYWWPSIHKDVINFINKC FT SPCQMSKHSRHRPAGLLQPLEVPHRPWSHIAIDFITDLPISQGKTTILTVV FT DRFSKSCRLIPISKLPTAMETAELLCECVFRYYGLPEDIVSDRGPQFTSRL FT WSTFFRNLQINVSLTSGYHPQSNGQTERLNQEIGRFLRTYCHSTQNEWAKI FT LMWAEYAQNSLRKASTGLTPFQCVLGFQPPLFPWSGETSELPAVDTWFKNC FT EEVWNAAHTHLSHAIRRFKEQADRHRRPGPMYSPGQWVWLSTHDLRLKLPC FT KKLSPRYVGPFQIEKQISPVSFRLTLPNHFRISPTFHVSLLKPAAGPAETD FT REVAAGEQGPPPLMIDGEEAYQIHEILRSRCRGGQLQYLVDWEGYGPEERS FT WINRKDILDTTLLEEFHSQHPEMPAPRPRGRPRRRELPHFRSRSLGGGGGS FT " XX SQ Sequence 4305 BP; 1110 A; 1254 C; 909 G; 1032 T; 0 other; gaagacttcg cccaacacgg atcccgcagc catccaggtt ttgtctcacg aagtcaccac 60 acaagctcag gtattgtcaa cccaccaaca acaactaaca cacctaactc agttaacaga 120 tgaactggtg aaatccctcc aaaacctgca agctgctgcc actgcgcaac ccaccgccaa 180 ctaccctcca agtccacttg ttgtggcaca acccccgatg acttccggag ttcgattggc 240 ttttcctgac aagttctcag gtaacccagc aaaatgtaaa ggctttctat tgcaatgcaa 300 actgtttatc gcccaacaac cccatctgtt taaggacgag aacgggaaaa ttgcttttgt 360 gtgctcactg ctcactggga aagcgctaga ctgggctact gcagtttggc cagacagtac 420 cccgatattt ccctcgttta atggctttct caaacgtttc tgcactgtgt tcgatcatcc 480 ggagggtggt cgtaacgctg gcgaggagct attgtgtatc caacaaggag atcgctccgc 540 agctgagttc gctctgcagt tccgcacctt ggcagcgcaa actggctggg ctgatgatcc 600 gctcgtcacg ctctacagaa gggctctaaa gccagaattg cagcgagaga tggcatgtcg 660 tgatgatggg aaaactctgg atcaacttat cgagctttcc ataaggttag acactctcct 720 ccgcactcgc aatccactct gctcaattgc ttccagtcct gtatcccctg aaacctccgc 780 tgaacccatg caattgggta gaacccgtct gaacccagaa gagcgagaac gacggagaag 840 aaatcacttg tgcatttact gcggacttcc aggtcatacg aaggttttat gtcccaacaa 900 gcctctccct aaaaccctct cggtgagtgc aaccactatg ttcactacca ctaataatgt 960 tgtgaatctg cctgtcactt tgagaaatga tggaaacgag attgagacta tggccatgat 1020 cgattcagga gccgctggta actttatcga ttatacgttt gccaccactc actccattcc 1080 tctaacctcc tgtgattcct ccatagccat cactgctgta gatgggcgcc ctttgggaga 1140 aggacgtata aaattccaga ccctgccagt cttgcttcaa acaggctctc ttcatgaaga 1200 agaaatttcc cttctcgcca tcaactcccc taaacactct gtgattctcg ggttaccatg 1260 gctacaacag catgaccctc aagtctcttg gagaactggt gagatcataa aatggagcaa 1320 tcaatgcttt acccaatgtc tgcatcctgt ttcccctatc cagatcaaca taatcaacaa 1380 gaccgaagac tctgaactcc atcatgttcc tgatgcttat cacgatctaa ttgaagcatt 1440 caataagcag aaagctacta agctccctcc tcatcgtgat aatgactgtg ccattgagct 1500 gctacccggc accacgcccc ctcgtggtcg catattccct ctctcacaac ctgaaactga 1560 ggctatgaag aagtacatct ccgaggagct agggaaagga tttattcgac catctacttc 1620 tcccgcttca gcagggtttt tcttcgtcaa gaagaaggat ggcagcttac gcccttgcat 1680 cgattaccga ggcttgaatg agatcacggt caaataccgc tacccattgc cattggttcc 1740 tgccgccctg gaacaactcc gatccgccca gtatttcact aagttggacc tccgtagtgc 1800 atataacctc atccgcatac gacagggaga cgagtggaaa accgggttct ctacaattga 1860 cggccactac gaatacctcg ttatgccctt cggcctagca aacagtcctt ccgtgttcca 1920 ggcattcgta aatgagatat ttagagacat gctcaacaaa tgggtaatag tatatattga 1980 cgacatcttg atctattcca attcgctatc tgagcatatc cagcacgttc gggcagtgct 2040 caaacgtctc attgaaaacc agttgtatgc gaagagctcc aagtgcgaat tccaccagac 2100 ctgcatctta ttccttggct atattatcag tccagaaggc gtggccatgg accaacaaaa 2160 ggttgactct gtgacacaat ggccacaacc cgaaaccatc aggcaactac aacgattcct 2220 gggctttgct aacttctaca gacgcttcat aaggaacttc agttcagtcg ctgccccact 2280 aaccaccatg gtcaaagcta ataatgctcg ccttaaatgg aaccccgacg ctgtccgagc 2340 atttacccag ttaaagactc gcttctccag tgcacctatt ctgcgtcatc ctgatcccga 2400 acaacccttt gtcgtcgaaa ttgacgcgtc aaacactgga attggagcca tcctgtccca 2460 gcgatcccta gtgaataaga agctccatcc ctgtgccttc tattctcgca agcttaactc 2520 cgctgaacgg aactacgatg tgggaaaccg agaactcctc gccatgaaag ctgcgctaga 2580 ggaatggaga cactggcttg agggtgcaaa acaccctttc atcgtcatca ctgatcacaa 2640 gaacctagaa tacatccggt cttgtaaacg tctgaatcct agacaggcaa gatgggcgct 2700 gtttttcacc cgatttgact tccaagtcac ctatatccct ggttcgaaaa acatcaaagc 2760 agatgctctg tcccgcctct cagacgatga gacctctgaa atctctgacg agcccataat 2820 caaaagtcct ctaattgtcg ctcccatcca gtgggatata gatcaggaga tcctccaagc 2880 cgccgaaagt aatccttctc agcagccttg tccggagaat aaaatctttg ttcccccgtc 2940 gcttcgagaa agactcatct ccgaggtaca tgaccacccc agttccggtc acccaggcag 3000 cacagccact gtgcaaatga ttcaatcccg ctactggtgg ccatccatac acaaggatgt 3060 aatcaacttc attaacaaat gctcaccctg tcagatgtcc aaacactccc gtcaccgtcc 3120 agctggatta ctccaacccc tggaggttcc acatcgtcct tggtcacaca tcgctataga 3180 tttcatcacc gatttaccca tatctcaggg caaaaccacc atcctcacag tagtggaccg 3240 attttcgaaa tcctgccgcc ttattcccat ttccaaactg cccacagcca tggaaacggc 3300 tgagttactc tgcgagtgtg tgtttcggta ctatggtctc cccgaagaca tcgtatcgga 3360 tagaggtccc cagtttacat cccgactctg gtcaacattc ttcagaaatc tccaaatcaa 3420 cgtcagccta acatccggct atcatcccca atccaacggg cagacagagc gcttgaacca 3480 agagatcggt cgattcctcc gcacgtattg tcactcgacc cagaacgaat gggccaaaat 3540 tttgatgtgg gcagaatatg ctcagaactc cctcagaaag gcatctactg gattaacccc 3600 atttcaatgc gtgctgggct tccaacctcc gctctttccc tggtccggcg agacctctga 3660 actccctgcg gtggatacct ggtttaaaaa ctgcgaagag gtctggaacg cagctcatac 3720 tcatctctct cacgccataa ggcgttttaa ggaacaagcc gatcgacacc gacgtcctgg 3780 tcccatgtat tccccaggac aatgggtatg gctttccacc cacgacttac gcctcaagct 3840 gccctgcaag aaactcagtc ccaggtacgt gggtcctttc cagatagaaa aacaaatatc 3900 tcctgtttcg ttcagactga cactccctaa tcattttcgt atttccccaa ctttccatgt 3960 ctctctgctc aagcctgctg ctggtccagc cgagacggat agggaggtgg cagccggtga 4020 acagggtccc ccgcctctca tgatcgatgg cgaagaggct taccagatcc acgagatcct 4080 gagatccaga tgccggggcg gtcaactcca atatctcgtt gattgggagg ggtacggtcc 4140 ggaggagaga tcttggatta accgcaagga catcctcgac accacactac tggaagagtt 4200 ccactcacaa catccggaga tgccggcccc tcgcccccgt ggaagacccc ggcgtcgaga 4260 attgcctcac ttcaggagcc gttcgttggg gggggggggg ggctc 4305 // ID CR1-9_DR repbase; DNA; ZEB; 3379 BP. XX AC . XX DT 22-OCT-2008 (Rel. 13.1, Created) DT 22-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-9_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3379 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1204-1204 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(385..1533,1487..3199) FT /product="CR1-9_DR_1p" FT /translation="TKFLTRHFALLNCRSISDKASYLNEIITDNKLTFFST FT ETWQAPAISPVKSSCSQWLQISRPSQTTWQGGGLAVIYRHDIRLKQLDFHN FT TNSFEYMVLKADCLTIILLYRPPKALSSFFSELSELLTLACSLSSSILLLG FT DFNIHVDTACSNTTQLLSVFECFNLTQHVCFPTHTRGHTLDLICTSKGDIT FT SISASETGISDHKLIDFSLNLPLRSNCPKTTITYRNLKAIDTVSFSTSIVS FT SNLSDVINISSPSMILSNYHSIISEVLQQQAPIKTRYVPATHSSPWLTPEL FT RTLKATGRQLERLYRRTGLTVHYLAFQDHLKSYKSALTLARSKFISXTINK FT SSNRPKSLFNTINKLTSPPPQRALASTDAVLLFLFGLSARLQLMLFCCSYL FT DYLLDKTNAVHQCVPTDTELSHPSCLSSQSLSCFSLTTPDSVSKLILKSSS FT SSCHLDPAPTSLLKLCHSVISPVISHFINSSLTHASFPLPLKTAAVTPVLK FT KPNLDPSILSNYHPISNLPFISKLLESTVATQLQSFLVDNNLLDPFQSGFR FT PLHSTETALVKVVNDLLLSGDSGSLSILLLLDLSSAFDTVSHDLLISRLSD FT LGVSGTALSWFSSYLSDRQFFISVMDFQSPTVPLKRGVPQGSVLGPLLFSV FT YLIPLGLILQHHGLNYHLYADDIQLYSVCTPTASAQIENISACVNDINDWL FT TSNSLKLNMDKTEIIFIGTPALTSNIPTSYSCEIAGLHIKTSTSVKNLGVI FT FDSTLSFQSHIKSVTKSAFFHLRRISQLRPFISSKDAETVIHAFVSSRIDY FT CNALFIGLPASLISRLQYIQNSAARILTHTKRSAHITPILYDLHWLPVAYR FT IKFKILLLAFRSLNNLAPSYLCNMLVPYTPTRSLRSADSGLLAVPRYRLSS FT MGGRSFSVIAPKLWNSLPHSLRSANNISDFKSLLKTHLFSECFASDLPN*" XX SQ Sequence 3379 BP; 808 A; 920 C; 485 G; 1164 T; 2 other; taactagttt ttgcctgtgt tgggctctca ctcgctctaa catgcccttt gcaaaagtta 60 tttatttttt ctctcttttc ttttttgcct ttttttgcac ctattctccg taatggatat 120 ggctatttac catcgcagtt aaaggcatta aactgccacc acgtcttgat tccatgacgt 180 atgaatactg cggagttagg gattctccgg cgccacgata tgtcatcgct cacggcgttc 240 gctcaggagt actttttttt ctcagcctga agccattcca tgcatctgga ctaatcaaga 300 ccgtcacatc cagactctgc ctgctgacca taccattcat cgatcactaa tttatcccac 360 tctcaaccag catcataccg ttaaactaaa ttccttactc gtcactttgc gcttttgaac 420 tgtcggtcaa tatcggacaa agcctcttat cttaatgaaa tcataactga caataagctg 480 acattttttt ctactgaaac ctggcaagcc ccggctattt ctccagttaa atcttcttgc 540 tcccaatggc tacaaatatc tcgcccgtcc cagactacat ggcaaggggg tggtctggct 600 gtcatttatc ggcatgatat tcgacttaaa caactggact ttcacaacac caattcattt 660 gaatatatgg ttctcaaagc tgactgcctc acaattattc tactctaccg tcctccaaaa 720 gccttgtcca gctttttttc tgaactcagt gaactgctaa ccctggcttg ctcattgtca 780 tcttctatcc tcctgctcgg ggacttcaat attcatgttg atactgcctg ctctaatact 840 acccagttac tgtctgtttt tgaatgtttt aatttaactc aacatgtctg ctttccaact 900 catacccgcg gtcacaccct tgatctaatt tgtacatcca agggggatat cacttccatt 960 tctgcttctg aaactggtat ttctgatcac aaacttattg actttagcct taatcttcct 1020 ttgcgttcta attgcccgaa gactactatc acataccgaa atttgaaagc tattgacact 1080 gtgtcatttt ctacttctat tgtttcttcc aacctctctg atgtcattaa tatctcttcc 1140 ccatctatga tactgtctaa ctatcactct attatttctg aagtcctgca gcagcaagct 1200 cctattaaga ccagatatgt ccccgccact cactcttctc cctggttaac tcctgagctc 1260 cgtactctca aggccactgg tagacagctt gaacgtctct ataggagaac cggtcttact 1320 gttcattatt tggcctttca agatcatctt aaatcatata aatctgctct caccttagct 1380 cgctctaaat tcatctctrc tacaattaat aaatcaagca ataggccaaa atcactattt 1440 aatacaataa ataagcttac tagcccccca cctcaaaggg ccttagcttc aactgatgct 1500 gttttgctgt tcctatttgg attatctgct cgataagact aatgckgtcc accagtgcgt 1560 tcctactgat acagagctca gccatccatc atgtctctcc agtcagtctc tgtcctgctt 1620 ttcactgacg acccctgatt ctgtctctaa gctaatattg aaatccagct catcatcatg 1680 tcaccttgat cctgctccta cctctctcct gaaactctgc cactctgtta tctcccctgt 1740 catctctcac tttatcaatt catctctcac ccatgcatca tttcctctgc cactaaaaac 1800 tgctgccgtc acacctgtgc ttaagaaacc caacctcgac ccctcgattc tatctaatta 1860 tcatcctatt tccaatctac ctttcatatc taaattactg gaaagtactg ttgctaccca 1920 actccagtct tttctagttg ataataatct cctggaccct tttcaatctg gttttcgccc 1980 cctccatagt actgaaactg cacttgtaaa ggtagtgaat gatctattgc tgtctgggga 2040 ttctggctcc ttgtctatat tacttctact cgacctcagt tctgcctttg atacagtctc 2100 tcatgattta cttatctcac gtctctctga tctgggtgtt tctggtactg ctctctcttg 2160 gttctcttca tatctctcag atagacaatt cttcatctct gttatggatt ttcagtcacc 2220 tactgtcccc ctgaagcgag gtgttcccca gggttctgtt cttggcccac tattgttctc 2280 agtataccta atcccacttg gtctgattct acaacatcac ggtcttaact atcatcttta 2340 tgctgatgat attcaacttt actctgtatg tactccaacc gcatccgccc aaattgaaaa 2400 tatctctgct tgtgtgaatg atattaatga ttggctcacc tcaaattccc taaaacttaa 2460 catggacaaa actgaaataa tttttattgg caccccagca cttaccagca atattcccac 2520 cagttattcc tgtgagattg ctggcttgca cattaaaaca tctacttctg taaaaaattt 2580 aggtgtaatc tttgactcca cactctcttt tcagtctcac attaaatctg tcactaaatc 2640 tgcattcttt cacttacgtc gcatttcaca actgcgtccc ttcatcagta gcaaggatgc 2700 tgagactgtc atccatgcat ttgtctcctc acgcatcgac tactgtaatg ctcttttcat 2760 tggcctgcct gccagcctca tctccagatt acagtatatt caaaactctg cagctagaat 2820 actcactcat accaaacgtt cggctcacat aaccccaatc ttgtatgatc ttcattggct 2880 cccggttgca taccggatta aattcaaaat tctcctcctt gcatttagat ccttaaataa 2940 ccttgcaccc tcctatctct gcaacatgct tgtcccctac accccaactc gctctttacg 3000 gtctgctgac tctggccttc ttgctgtccc tcggtaccgc ctctcttcaa tggggggcag 3060 atcatttagt gttattgcac caaaactctg gaactctcta ccacactcac ttcgctctgc 3120 taataatatc tctgatttca aatccttact taagactcac ttattttctg aatgctttgc 3180 atctgatcta ccaaactgac cacttaatct gatccaccac tatcctgttc atatgtctct 3240 taggtttcgt ttgtgttatt catgtttgtt atatttgacc tcatgtatgt ttgtttatct 3300 ttttgttgca ttccttgtaa agtgtccttg agctctggaa aggcgctata taaataaaaa 3360 ttattattat tattattat 3379 // ID Gypsy89-LTR_DR repbase; DNA; ZEB; 559 BP. XX AC chr10; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy89-I_DR; KW Gypsy89-LTR_DR; Gypsy89_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-559 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1490-1490 (2008). XX DR Genome; chr10; Positions 10913998 10914556. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 559 BP; 120 A; 141 C; 91 G; 207 T; 0 other; tgtaatatct cagcacttta tcgcatcctc caacccacca gggggtgcac atgttgccct 60 atactcctca acattagatg gcagtaatga ctcttgttcc tctaaactct cctgtaatca 120 gcgtaaccac caccacctgt tctttgctct atataagact gctttgtatg ctgctctatg 180 ttcagtcctg aactaaacct cttgctgcca gttaagttgt tctcaagact ttttgcttgt 240 tctcaagact tgtttgctat tttgctgttt gattgttttg agaccctcgt gtctatttta 300 ttttgtatgt attttcagga agctcggctt tcctattttt gttgtacttt tcaaccttgc 360 tgtagagaga tattatctct cctttttgtc aaatttacct cagtaaagac tatattattt 420 ttttggaaat tcatcagttg tggactcact ttgttcctcc tctgctcttg ggccaccagc 480 tgctgactta gctcatctag taagatctac tgctaaccac accgcagacc cgggttcgat 540 ccccactcag atcatcaca 559 // ID Gypsy-18-LTR_DR repbase; DNA; ZEB; 1648 BP. XX AC chr10; XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-18_DR LTR retrotransposon - a DE fossilized sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-18-I_DR; Gypsy-18-LTR_DR; Gypsy-18_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1648 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-18_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 10-10 (2005). XX DR Zebrafish.; chr10; Positions 4027019 4028666. XX CC Gypsy-18-LTR_DR is a long terminal repeat of the Gypsy-18_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-18-I_DR. XX SQ Sequence 1648 BP; 485 A; 249 C; 298 G; 616 T; 0 other; tgtaacccag atcaataatc tatcttacat gactgcttaa tttatttatt tactggtttt 60 aataataata cttaagaaga aatagaaatt taaagattga ttgctatgca agtaaaaaga 120 taggaagaaa aagaataaaa ctcccgagag acttttaact tgaagtcgga agttacccgg 180 aagttagctc cgcccctgtt gagcgcgcgc acagccagac agagagcgag agaacgcgat 240 tgaacttgaa cgagagagcg cgagcttaat agatgtacac ggcttaataa tccagcgatt 300 attaacaaat aacttcatat ttatttgata ctgtaagaaa ctaacagtaa ggcttgacag 360 tttgagacct agaaactgac aaaagtgaga ggagacctga cgaatctatt agactttaac 420 atctattccc tcagagagct gctattcctt ttttccctct ttcattatca tctactttct 480 tcttggtaag ttctgattct ttgcaagata tgtatattat tattttgagt tcatatactg 540 tgaattgtga ttaaaaacat tcagaacata agttatttaa aagtttaaaa gctttaactc 600 tgtgaaatgt gaagcaatga gtgtgaaatg tgtgtatata attatttaga ggtttgtgat 660 gtgttctagt tgaaaagttt tattgttgaa aatgattatc cagtgaaatg tgtattagat 720 ggatgctagc tttgctaact atgtacatag accacaatgt attacatgag cacatggtga 780 cttagcatat atgttttgtg tctgtgtgtg tgtgttaata atttagacac acaattcata 840 ttaggtttaa gatgatttat tcggacaaat gaaaataatg ctttattcgg ctttgttgtt 900 tcaaggccgg gttttttggt ttgttttttt ttttttttct ccactttgaa gattcaagag 960 acgtgattga gagtgattga gaactgtcca ttctgagcat tctgtccggt ttggactggc 1020 gagccacccc atcaaattgt gactggtgaa acacaaaggg atcagtcatc aaatctggta 1080 ggatcgaacc atttgattta tatatttttt ttacttttgc ttcctggatt tttgtggatt 1140 acacatatat ttttcccaat tttttggatt taaattttca catttgagga acattgctat 1200 atttttctac tttttttgcg aactgaactg aacattaaga gcccagactt ttggtatctg 1260 agccctaacg gaacactgga ctttgatgtt tgaatttaac tgtttcttat tgaatttttg 1320 aattttctga actgtgcatt taacttttga tatcttcttt attttgtggt cttactattc 1380 aaatgacttg ctgaaattat atttttttat ttctgcactt actgctaatt ttaagtatct 1440 gagtgagttt attattacag acaaattata aaaatttcag tgactggttt attctctttt 1500 taattgttat agagaaagta tacttatact tattctattt accttttttt tgctcttgag 1560 tcgactcctt tttcttctga ttgctggtcc aggcgatttc aaagtgctaa actagtagca 1620 aatcgagcac cactgagtac gcgctaca 1648 // ID TDR25 repbase; DNA; ZEB; 625 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR25. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-625 RA Jurka J. and Drazkiewicz A.; RT "TDR25: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(3), 18-18 (2002). XX DR [1] (Consensus) XX CC The TIRs are imperfect and may include TSDs. XX SQ Sequence 625 BP; 211 A; 113 C; 92 G; 201 T; 8 other; tttaaagcag gggtgcccaa actttttctt atgaagggcc aaaaaccaaa cttgattgag 60 ggctgtgggc cgaaggtaaa tataccaaac tatattacat taaagttgcc atgggtaatt 120 tcctaattta tttaataata tttaaaaata actagaaaac attactttaa atcgtattaa 180 ctaatgcagt ataatttttn nwtttaaaat tataacttat tacaataaaa acataaacaa 240 tcacatttat aacacaatgg agttcaatgc tgtannnaat acactagtca agctgaacct 300 gcctttgcct tgatttgctc actgatgtct tgtgcattgt cctctatttg tcgagtgaca 360 gtatttacat ttttwtaaaa aaatctgatt aatttacaac atttaaattg aaacatttaa 420 tttcagttag cttttaacaa taaaacaaac aaacaaaagg ttacattaaa ttagaaatga 480 cgatcnattc tctgttaaag gcattcgccc caaccatccc catcattctc cctctcttct 540 cagatgggat ggtgggccaa aatcaaaggt taccatgggc caactttggc ccgcgggccc 600 ttagtttggg catctctggt ttaaa 625 // ID DIRS-6_DR repbase; DNA; ZEB; 6526 BP. XX AC . XX DT 23-NOV-2008 (Rel. 13.11, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW reverse transcriptase RNase H; phage integrase; DIRS-6_DR. XX NM DIRS-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6526 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(11), 1737-1737 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(1874..2869,2746..3183,3084..3524,3439..3906) FT /product="DIRS-6_DR_1p" FT /translation="EFSSKKPLIIGLSAPHNSIFPSINSTIPSDEYSLNYH FT DIDQAISLISLAGRNAWLAKIDISSAFKIMPIHPDFWHLFGIHWRSQFYFA FT VRLTFGCKSSPKIFDMLSEALCWILSNNYEIPHIIHLLDDFLLISPPSSPP FT AKHLSITQKVFENLGIPLAEEKTAGPSTSIEFLGINLDSNKFQASLPKEKI FT DRIISLSQIFLEKQSCTKRELLSILGHFNFAMRIIPQGRPFITHLLQLSSS FT VPGLEDTIYLSKPSRNELSLWISFLKQWNGCSFFYSDLISSPVDINLFTDA FT APSVGFGGFYQGHWFASTWPPQMLSLPRNQQSSALSNSTPRCPLSRVRRLL FT PRSLVCFNVAPADAQSTQKSAIICAFELYPIVAAALLWGDEWSASSILVHC FT DNEATVYCINKGRSHALPIMPLLRRLVWTAAKKQFIMTARHVPVCKNQIAD FT SLSRFLFQKFRLLVPEADQHPTPVPLYSQXILPFSLSLSFPEISALGTGSR FT PASNTCSTLFTNXIAINHPLHRLHETSISLILHAVAPRTLESYLTAWKSYK FT YFHTLYQIQFPDFSLLTITSFISHLHTAKNIQASSIKSYLSGVQFFHKLIY FT GATSEAISNAQTSLLIKGIQKSPPPPSFTGPPQKPFLTLKLPSSLKVSRNH FT PPPLPDTRLPITLNILAKCIRTLRKGYLSLHTARTLDAMFTLAFFGFLRCS FT EMAITSNFNPAIHPTISDLTLLNAETLAFFIKQSKTDQTSKGHFIYIFNIF FT SPTQPFQTLLAFLHSRRAKESDPHAPLFY*" XX SQ Sequence 6526 BP; 1734 A; 1893 C; 1098 G; 1797 T; 4 other; gtgaagttta tgtataaaca aatttcgaga ggatcacgtg cttatgattg ctagcagctg 60 acccgcatta tccaattcac tacgatccaa tcagatgact cctaaactac tataaatacc 120 ctggggttta ttccattgct atcttcgttt tgaagaggca gcttcaccgt agctagctcc 180 gttgaagaac caactctgta ccagcatgga caaccaaagc agctaccagc taccatctac 240 cagctaccat ctaccagcta caatctacca gctaccatct acaaattacc agctataatc 300 tacaagctac tagctataat ttacaagcta caatctacag cagcagcagc aacaactgca 360 gaaacaacaa atgcagcaac aacaacaact actactatta ctacttcaaa acaactccaa 420 caattgcatc aacaacttca acaaaacatc aaaccttcca cttcaacgca tctgctgtgt 480 cttcaaccat attctccagc aaatgacagc caaaactcca aagccattgc aatgaaactg 540 aaccattacc tttttcctag cggtgcacat gctatgcaca tgaagggaag ctttgcaata 600 taactttacc aaacatttag actaaaacat ttataaatct tttgaataac aagactagcg 660 taaacaatga ttttaactaa agatacttgc ctcttgcccg tccactgatc cataacattt 720 aacagcttgt ttgctggaat gttcagcaga gcaacagctc aagtcataaa caatggtgaa 780 cttgatcaac aaaatggccg ccgggctttt tcacctttgg cacgcttgac tggctctcct 840 ttaagccaat agctgtaagg aaaagcgtca ccatccaata agctctctgg agaaggtccc 900 gccctctctc ttgactctgt tgcaagtctt atgaacgaat ttgggccgga cccgcttgtg 960 aatgaagtta agataccaac aaatgttttt catggattaa agtaaagcca attactaaca 1020 ttccattcac agttttacta agtggtgaca tcgcccttga agtcttatag agttcagagt 1080 ttgatgttag caatataaaa aaaataataa taataaaaat aaaaaaaaaa aaaataaata 1140 aaataaaaaa taaaaaataa ataaatttat acacagagca ctgaggtcta attacactag 1200 accagtkact catttattca tacactagtg tcttatgtaa gcttataatg gcaagtcagt 1260 tggatgtgaa acagcaacaa aacaaacttc ccgattggct gctatgaata ggaggacatt 1320 tctgacaacg gatcttcaat aaacacttcc aggaccacca gacaaaccat accccttatc 1380 ctcagctctc catccatttg ccattctgag gggagatgaa tgctcacctc taccatcctt 1440 gcacgtagcg acatcaagct atcagaccat gcattaataa aaagaccatg acatttggct 1500 gcagaacctg accctcatcc aacccattac attttatatg ctcttcaaat acaatcacac 1560 tctacataaa ggatgcattt ccactcacat ggctcacatc ctcgatgcca tatttctcaa 1620 attgctttga attgcattgt caccctgata ctaacttctg atttcaggtc taacccaagg 1680 atctaacccg ggcgtttcag ctctcccttc cttaacctta ttctgctcca atctacaatc 1740 tgcgactacc gaacctgcaa cagtgaattc gttaatcaaa aaagaaaaga attgataata 1800 aattcatgat agtaccattc ttgccccact gttcagcgtt tcttgtatta gtcctattga 1860 cattgcgact tgagaattct ctagcaaaaa acccttgatt attggtctct ccgctcctca 1920 caattctata tttcctagca ttaacagcac cattccatca gacgaatact cgctgaatta 1980 tcatgatata gaccaagcaa tttcgctcat cagtttagcc ggtcgcaatg cttggcttgc 2040 aaaaatcgac atttcatccg cctttaaaat catgccaatt cacccagact tctggcacct 2100 ttttggcatt cattggcgct cacaatttta ctttgcagtc cgattaactt tcggctgcaa 2160 aagtagtcct aaaattttcg atatgctttc agaagcatta tgctggattt tatctaataa 2220 ctacgaaatc ccgcatatca tccatctcct cgatgatttt ctcctcattt ccccaccttc 2280 ttcacctcca gctaaacacc tatcgatcac ccaaaaggtt ttcgaaaatc taggcatccc 2340 cctcgcagag gagaaaacag ccggtcccag cacttccata gaatttctgg gcatcaactt 2400 agattcgaat aaatttcaag catcccttcc caaagaaaag atcgaccgta taatctcttt 2460 atctcaaatt ttcctcgaga aacaatcatg caccaaacgc gaactgctat ctattctcgg 2520 gcatttcaat ttcgcgatgc gcataattcc ccaaggccgc ccgtttatta ctcacctcct 2580 tcagctctcc tcctcggtcc ccggtttaga agataccata tatctttcta aacccagtcg 2640 caatgaactc agcttatgga tctccttcct taagcaatgg aacggctgtt cctttttcta 2700 tagcgaccta atttcttccc cggtggatat taatttattc actgacgctg ccccctcagt 2760 cgggttcggc ggcttttacc aaggtcactg gtttgcttca acgtggcccc cgcagatgct 2820 cagtctaccc agaaatcagc aatcatctgc gctttcgaac tctaccccat agtcgcagca 2880 gcgcttttat ggggagatga atggtccgcc tctagcattc tcgttcattg cgacaacgaa 2940 gccaccgttt attgcattaa taaaggacgc tcgcacgcac ttccaattat gcccttacta 3000 agacgcctcg tttggacggc agccaaaaag caattcatta tgactgctag acatgttcca 3060 gtttgcaaaa atcaaattgc tgattctctc tctcgctttc ttttccagaa atttcggctc 3120 ttggtaccgg aagcagacca gcatccaaca cctgttccac tctattcaca aatkatattg 3180 ccataaacca cccattacac cgcctccatg aaacttccat atctctgatc cttcacgctg 3240 tcgcaccaag gacccttgag tcatatctca cagcatggaa atcttataaa tatttccata 3300 ccctgtatca aatacagttc ccagattttt cattgcttac catcacctca tttatttccc 3360 acctccatac agccaaaaat atccaggcaa gctccattaa aagctacctt agtggggtcc 3420 aattttttca taaattgatt tacggggcca cctcagaagc catttctaac gctcaaactt 3480 ccctcctcat taaaggtatc cagaaatcac cccccccccc ttcctgacac tagactaccc 3540 attacactca atattttagc aaagtgcata cgcacactgc gcaaaggata cctctcactc 3600 cacacagccc gtacactaga cgccatgttt accctggctt tttttggctt cctgaggtgt 3660 tcagaaatgg ctatcacatc caatttcaat cccgcaatac accccactat atctgatcta 3720 acactgctga atgctgaaac actcgccttt ttcattaagc aaagcaaaac agatcagaca 3780 agcaaagggc attttattta catatttaac attttttccc ccacacagcc attccaaact 3840 ctcctagctt ttctccattc aaggagagca aaggagtctg acccacatgc cccacttttt 3900 tactgatgac gctaaccgtc cagtaacccg tttttggttt caaaaacatc tgaaagaagt 3960 ccttcgcctt tcgggcgttt cccccgaatt gtattccagc cattcattta gaattggcgc 4020 agccaccaca gcagctcaca aaggtctatc ctcgcaccaa atccagaccc taggccgctg 4080 gtcctccgat gccttcaaag cttacattcg cctcagccga tcccacctca agacagccca 4140 gctagcccta atcagctaaa ttccaaactc caactagggg tacgagctcg accctaagta 4200 ggctggagta cgcgtgtgtg tgtgcttgtg tatgggtgcg catgggaatg tgtgtgattg 4260 ctcgcgcaaa tacagcgtgc gtttgcacac gtatacatgc ttgtttgcac gcggatatgt 4320 atataggtgc gggcgtacgt ctaagtatgc atatgtacgt gcaagcaaat gagtatgtga 4380 gcctaggggc attacatgtt gtgcaggttg catgtatgct cttaactttc ctttcaaacc 4440 tactatgctc cttccctcac catgctctga cccccgcagg ggtccaatcc gagtttcgac 4500 tctcgcgagt cacccctagc ccatccccgg ccaccccttc actaccacct ccagtaagag 4560 cctgcactac cccctccctc ttcccgactc ttactgcccc ccccccccca tagctctgac 4620 ccccgcaggg gtcgcgctga gcttcgactc tcgcaagagt caccatttgc gccgccccac 4680 ccaggccatt agaaatggta tgtacgcgca tgttatacgc atacatccga tgtatacctg 4740 ttccgatgcg tgtgcatgca catgccgaaa caggaataga cgcacgtgca tgctcacgca 4800 cgcaggttat tccctctctc cttttcttcc tttggcgtcg agttcctccg ccctctcttt 4860 tcttcctttc tcataggcca tcaatcctca gctctaactc ccgcaggagt gactaaaacg 4920 agcttcgact ctcgcaagag tcccgccccg gccacctcct gctgcagtct ccaccgcccc 4980 ctccctttcc cagacttcag caggagaaat gcccacagca gctctgaccc ccgcaggggt 5040 cgctccgagc ttcgactctc gcaagagtca atcaccgccc atgcccggcc cttaccatgt 5100 aataaccata tatattcata tatatttata tatgcatata tactcctata ttctcctttt 5160 cttctttctg gcgtcgagat tctccgcctg gcaattattt gccccttctt tcttccttac 5220 agcgtgagtg cttccgctac ttttaccctt ctctgatctt cccccttatc tccttaccct 5280 cctatttcct caatttcttt tccagcgttg agttctccgc taactgtctt tctttcagcg 5340 tcactctttc gctatgttct tcctttcgct taggccctca atcctcagct ctagctcccg 5400 caggggcgac taaaacgagc ttcaactcct gccggagttc aggccctggc cacctcccac 5460 cctctcatgc aagagtctcc accgcccaaa tcttcccgac tcttgcttga gcaatctaac 5520 cccagctccc acccccgcag gggtgtatat atttgagcct tgactcccgc ggagtcaatc 5580 cagccctgcc ctcaccccgc ctaggctcta gcagtctact ttgctctgct cccktagaag 5640 cttctcttcc taagctatca acgctatatc cagcagccgg atatggcttt gttacacctg 5700 ctttttgggg ggctcttcaa tacgcggctg ctgtcccgag ctattcaaag gcatttttgg 5760 ggagttctcg agatctacct gagctcaaac tcccctcccg ctttgcaacg ggagggagcc 5820 ctgggctcga ggatctcatg agctcagggc tctctcccgg gacagcatgc caaacaagct 5880 tttataatta atcatcagct aagtgtgaac tcttgaagtg aagtttatgt ataaacaaat 5940 ttcgagagga tcacgtgctt atgattgcta gcagctgacc cgcattatcc aattcactac 6000 gatccaatca gatgactcct aaactactat aaataccctg gggtttattc cattgctatc 6060 ttcgttttga agagtccccc cttcctcccc tacgcctcct tcttagacgg gtgacacggt 6120 ggcccagtgc ctagcactgt cgcctcacag caagaatgtc tctggttcct ggctttacca 6180 aaactagcag acatttctgt gcggagtttt gcacttctcc ccgtgctcac gtgggtttcc 6240 cccgggttcc ccggtttcct cccaccgcct aaaaacatgc aattaagtca attgaacaak 6300 ccaaattgtc aacatagaca cgctcctagt tagtaattag tttcaagagc attcacttgc 6360 tacagcaggg gagttctcga gatctacctg agctcaaact cccctcccgc tttgcaacgg 6420 gagggagccc tgggctcgag gatctcatga gctcagggct ctctcccggg acagcatgcc 6480 aaacaagctt ttataattaa tcatcagcta agtgtgaact cttgaa 6526 // ID DNA-3-5_DR repbase; DNA; ZEB; 1074 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 3-bp; KW DNA-3-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1074 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1228-1228 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1074 BP; 289 A; 241 C; 234 G; 310 T; 0 other; ggctacgttc acactgcaag gcttagtgct caaatctgat ttttttctca gatctgattt 60 ttttgcatgg ctgttcacac tgctcttata aatgtggcca atatcagatt tgcagtgtga 120 acagatcatg gtcctaaact gacccgcatg cgcaaaagtg cattacggag agcacaaaat 180 gtctaattat ccgcacacgt gtgtgctgca tgagtaagca cgttgtcaga tcatgcttat 240 atgattatta cccttttcac agacaaccgt ggtgtatttc atgaactgta aatgattgtt 300 ctgctactac taatgtgtat ttcggcattt gagacctaaa ataagtctcc tgtgattaaa 360 aaaaaaaaaa aacactgatc gtttgtgatt tatgacaacc gcggtgcgtg gcgctctgac 420 caccggtgta gtgaactcat caagggttaa tgagcagccg cagactgaac tgtgaaggcg 480 gagttggttt atctccgttt gtcggagtta tttaatttta cttcgttata aacagcagac 540 acgtgcagaa ggggggtttg ggtgctcgag cacctgccct ttttttctct cggagagaaa 600 gttatcttcc tcctttgcac acggatcccc tatccgcaac acccacgacg cttttcggct 660 tcgcgatcaa cccgtgcctc aatcgttaac cagattagtt aacaagcacc agttatgcct 720 tcaaaatctt tttcaacatg aacaaccaac tttctgtggt actgtaatcg ccctgtgtgc 780 cattctactg tgctgctgag gggtagatga tgtttgcagc acagaatgat gacgcatgtc 840 gctcaaagat tatgtaaaag tcacatgaaa tccgacataa ccgttcacac tgcggtcgca 900 ttgcaaaaca tcagatctgt gtctgattta agaccacata tgaaagtggc acagatctga 960 cctgaaaaga tcagattcca tgcggtttgg gctgttcaca ctgtcatcac ctgagtcaca 1020 tgagggggaa aaaatcagat tcgtgccaca tttgcctgca gtgtgaacgt agcc 1074 // ID CR1-13_DR repbase; DNA; ZEB; 1874 BP. XX AC . XX DT 31-OCT-2008 (Rel. 13.1, Created) DT 31-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-13_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1874 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1208-1208 (2008). XX DR [1] (Consensus) XX CC The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 342..1697 FT /product="CR1-13_DR_1p" FT /translation="HRFTLLSNYRPISNLPFVAKVLERVVVSQLHNHLHAN FT TLFEKFQSGFRAGHSTETALVRVVNDLLLTADSGACGILVLLDLTAAFDTI FT CHSILLDRLRKWIGLSGTVISWFVSYLADRSQFVCLGAHRSRTVPLHQGVP FT QGSVLGPLLFSIYMLPLGQIMSKYGLGYHCYADDTQIYVSARPDLNTATSV FT LSACLLEIKSWMSQNFLKLNSSKTEVLLIGTPTKVSQYKDFTLSVDNVQLS FT PSTQVRNLGVLFDAQLSFNSHFKHLTKTAFFHLRNIARIRPFLSRPDAERL FT IHAFITSRLDFCNSLFGGLPXSSLRRLQYIQNSAARVLTHTSSRHHITPVL FT QQLHWLPXKSRIDFKTLILTFKAVHGLAPSYICDLVTLATPARSLRSASGL FT TLHQPRCKLKTMGGRAFSHSAPKLWNALPISIRNAASLDCFKKLLKTHLFS FT IAFNLG*" XX SQ Sequence 1874 BP; 460 A; 429 C; 341 G; 636 T; 8 other; gcggaaatat gttaaaactg ggctcacagt tcataaatca ctgtttgatc agcatcaaac 60 tttatcaaat gtttacagca taagaaggaa ttctactcaa gagctattag agtaatagtt 120 ctattcagag ttttattcca agttacagtt ttcaaccctc tctaactccc tccctcccac 180 atagagtgat gtaatgaatt ttttcacctg tacacttgat ccattcccya ccactatttt 240 gaagcaktct gtatctgtct ttgagcccca catcactgcc attgttaatg catccctcag 300 tgatgggatt gttccactag cattgaagat ggctgctata acaccggttt acactgttgt 360 caaactatcg acccatctca aatttaccat ttgtggctaa agtccttgaa cgtgtggtgg 420 tgagccaact ccataatcat cttcatgcaa acactctatt tgagaaattt cagtctggtt 480 tcagagcwgg tcacagtact gaaactgccc tagttagggt cgtaaaygac cttcttctaa 540 ctgcagatag tggggcctgt ggcattctcg tcttattaga tctcactgct gcttttgaca 600 ccatatgtca cagtatactg ctggacaggc tcaggaagtg gattggtttg tctggtacag 660 taattagctg gtttgtctct tatctcgctg accgctctca gtttgtctgt ttgggtgctc 720 atagatctcg gactgttcca ttgcatcagg gtgttcctca ggggtctgtc cttggtccac 780 tgctgtttag tatttatatg ttgcctcttg gccagattat gagtaaatat ggcttggggt 840 accactgcta cgctgatgac actcaaatct atgttagtgc tcgccccgat cttaacactg 900 ctacgtcagt tttatctgcc tgtcttttag aaattaaatc ttggatgagt caaaacttcc 960 ttaaattaaa cagttccaaa actgaggtct tattaattgg cacccctact aaggttagcc 1020 agtacaagga ttttacactt tcggttgata atgttcaact gtctccttct acccaggtgc 1080 gtaatcttgg ggtgctcttt gatgcacagc tcagttttaa ctcccacttc aaacacttaa 1140 caaaaactgc cttctttcat ctccgcaata ttgcacggat cagaccattt ctctcacgac 1200 ctgatgcaga gaggcttatc catgccttya tcacatcacg tcttgatttt tgtaattcac 1260 tttttggggg tttacctrca agttctctca gaagactaca atatatacaa aattccgctg 1320 cacgggttct racacacaca tcatcacgcc accatattac tcctgtgcta caacaattac 1380 actggcttcc trttaaatca cggattgatt ttaagacctt gattttaact ttcaaagcag 1440 tgcatgggtt agcacccagt tacatctgcg acctggtgac cctcgccacc ccagcccgca 1500 gtcttcgttc tgcatcggga cttaccttgc atcagccacg gtgtaaactc aagaccatgg 1560 gtgggagggc tttttcacac agtgcgccta agctttggaa tgcactcccc atcagcatta 1620 ggaatgctgc ctctctggac tgttttaaga aacttcttaa gactcacctt tttagcattg 1680 cttttaactt aggatgatca ttttgtttaa ctttttattc ctattttatg gttcatattt 1740 ttattgtttt attgcatctt ttattgttgt gctgttttgc ttgccttgtg ttctagtatt 1800 ttgcctgtag cgctttgagt ataagaaaag cgcattacaa ataaaatgta ttattattat 1860 tattattatt atta 1874 // ID Gypsy95-LTR_DR repbase; DNA; ZEB; 161 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy95-I_DR; KW Gypsy95-LTR_DR; Gypsy95_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-161 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1502-1502 (2008). XX DR Genome; chr19; Positions 14945611 14945451. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 161 BP; 50 A; 29 C; 38 G; 44 T; 0 other; tgtaacatat gcatttggat gtgacctgtg aagtcacgtg agttgacgtc acatgcaggt 60 ggcgcatgta aggaagttag tgtgcaagtg aatataaaga gctatgtttc ctccagcatg 120 cgagtctatt gtcatcatca cagaaataca taaacatcac a 161 // ID CR1-20_DR repbase; DNA; ZEB; 3287 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-20_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-20_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3287 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1694-1694 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(12..1286,1214..1468,1667..3079) FT /product="CR1-20_DR_1p" FT /translation="MMAPYLWQPVAAALDVFNFVYLLYFLNIFLQWLRTLY FT LHTFQFESIRKHWIIFIFILAFLIGKSDAVXSPAXADAVIYSREQLLALRH FT TSTFSVPTMEVPTELKRRRRGCRAGVKRRLKKKRYKPSLPVIIMGNVRSLG FT NKTEELTALVRTQREYRECSFMCFTETWLNDKTPDSVVALDRFHLVRADRC FT AKQTGKNKGGGLALFVNERWCRPAHITTKEVVCNKDIELLAVGMRPYYLPR FT EFSYVIAITVYIPPSANASNACEHINTVTASIQLKYPHSLIVISGDFNHVS FT LQSSLHTFTQYVDCGTRDCKTLDMLYANVKDAYASTPLPPLGRSDHNLIYI FT RSVYVPAVRKQPPTIRTVQVWSDSVYEALMDCFDSTDWEVLCKSHDEDIDS FT LTDCITDYITFVLRTQYPQKRLNATQTISHGLYNFCVENTVPTKKVKCYSN FT NKPWVTPELKALLNEKKRAFLAGDREEIKRVQKELKIKIKEGKVIINLKWN FT INYNKIMLEMYGVIVCTKLEVTIEQVRNGLRRIKKKKSAGPDGISAGVLKA FT CADQICWVVHYIFNLSLRLERVPILWKTSCVVPVPKKPNPTEFSHYRPIAL FT TAQLMKVFERLVLNYIKKLLIASEDRLQFAYKTGVGVDDAIIYLLHKSLSH FT LETSGNTVRITFFDFSSAFNTIQPALLKVKLENAGLQKGMVSWLTDYITNR FT PQYVRMHNCVSQVIMSSTGVPQGTVLAPYLFTFYTSDFQYSSKNCHLQKFS FT DDSAIVGYIRNHDESEYRLLLKSFADWCDMNCLNLNTSKTKEIIVDFSRHP FT SQAVPVIIQESEIEIVSSYRYLGVYLNNKLDWSENTIQVYKKGQSRIHLLR FT RLRSFGVCQTLLKTFYDSVVSSVILYAVTCWGGGLLEKEKNKLNKLIKRAG FT SVVGGALPTIEEIAQDRMLDKFVSIMNHDSHPLYDTVQACASSFSTRLIQP FT RCYRERYRKSFLPTVIRLYNQSHTR*" XX SQ Sequence 3287 BP; 1002 A; 552 C; 728 G; 1001 T; 4 other; gttgacgtgg catgatggcg ccgtacttgt ggcagcctgt tgcagcagct ctggatgttt 60 ttaactttgt gtacttgttg tactttttaa atattttttt gcaatggtta aggactttgt 120 acttgcacac ttttcagttt gagagcatca gaaagcactg gatcatattt atttttatac 180 tggcgttttt aattggaaag tcggatgccg tgamtagtcc ggcaawtgca gacgctgtca 240 tttattcaag agagcaactc ctggcacttc ggcatacatc gacgttttcg gtacctacta 300 tggaagttcc tacggaactt aagagacgac gaagagggtg tagagctggt gtgaaacgcc 360 gattaaagaa aaagcgctac aagccatccc tgcctgttat tattatgggt aatgttcgat 420 cgcttgggaa caagacggag gagctgacgg cgcttgtaag aacacagcgg gaatatcgtg 480 agtgcagttt tatgtgtttt accgagacat ggctgaatga taaaacacct gactcagtgg 540 ttgcattgga ccgcttccac ttggtgcgtg cggacaggtg cgcgaagcag actggtaaga 600 ataaaggtgg tggactggca ttgtttgtca acgaacgctg gtgccgccca gcacacatta 660 caacaaaaga agtagtgtgt aacaaggata ttgaactatt ggctgtggga atgaggccct 720 attatttgcc gagagaattc tcttatgtga tcgctataac ggtgtatatc cctccctcgg 780 ctaatgcgag taacgcatgt gagcatatta acaccgttac agccagtata caattaaaat 840 atcctcattc acttattgtt atttctggag attttaatca cgtatctcta caaagttctc 900 tgcacacttt cacccagtat gtagactgtg gcacaaggga ctgtaaaact ctggacatgc 960 tatatgcaaa tgtgaaagat gcctatgctt ctacaccgct ccctcctttg gggcgatcgg 1020 accataatct tatttacatc agatctgtct atgttccggc tgtaaggaag cagccaccaa 1080 cgattagaac agttcaagta tggtctgatt ctgtgtatga ggcattgatg gattgttttg 1140 attcaacaga ctgggaggtt ttgtgcaaat cccatgatga ggacattgac agtctaacag 1200 actgtataac tgattatata acttttgtgt tgagaacaca gtacccacaa aaaaggttaa 1260 atgctactca aacaataagc catgggtgac ccctgaatta aaggcattat taaatgaaaa 1320 aaagagggca ttcttagcag gtgatagaga ggaaattaag cgggtccaga aagagctaaa 1380 aattaaaatt aaggagggga aggtaattat aaatctaaaa tggaacataa actacaacaa 1440 aataatgtta gagatgtatg gagtgatcta aataggatgt ctggatctgg taatagtcta 1500 aagagatgta ctgcctctgt tggaaagaac tgggtggatg aaattaatat attttttaat 1560 agatttgatg cagcaccagt gttttagaaa attctcatcc gagtacacat cagagttcat 1620 ttttagattc accgtcagtt ttaacttcta atttttctac ttgtgagtct gtactaaact 1680 agaagtaacc atagaacaag taagaaatgg gttgaggaga attaaaaaga agaaatccgc 1740 tggaccagat ggcatcagtg ctggagttct taaagcctgt gcggatcaaa tttgttgggt 1800 ggttcattac atttttaatt tgagtttgcg tttggaacga gttcccattt tatggaagac 1860 atcatgcgta gtaccggtcc ctaagaaacc taaccctact gagttctctc attatagacc 1920 aatagcccta actgcccagt tgatgaaagt tttcgaaaga ttggttttaa attatattaa 1980 gaaactgttg attgcttctg aggacagact gcagtttgca tataaaactg gtgtaggggt 2040 agatgaygca attatttatc ttctacacaa atctctaagt catctggaaa catcgggtaa 2100 tactgttagg atcacatttt ttgatttctc aagtgcatty aacactatcc agcctgcact 2160 attaaaggta aaattagaaa atgcaggttt acaaaaaggt atggtaagtt ggttaactga 2220 ctatatcact aatagaccac agtatgttag gatgcataac tgtgtgtcac aagttataat 2280 gagcagtact ggtgttcctc aaggaacagt tttagctcca tatttattta cattttatac 2340 atctgatttt caatacagct ccaaaaattg tcatttacag aagttttctg atgactctgc 2400 aattgtaggt tatataagga atcatgatga gagtgagtat aggttgcttt taaaaagttt 2460 tgctgactgg tgtgatatga attgtcttaa tttaaacact agtaagacca aagaaattat 2520 tgttgatttt agtagacatc catcacaagc ggtgcccgta attatacaag agtcagaaat 2580 tgagatcgtt tcgtcatata ggtacctggg tgtatatttg aacaacaagc tcgattggtc 2640 tgaaaacaca atccaggtgt ataagaaggg tcagagccgc attcatttat taagacgact 2700 gaggtccttt ggtgtgtgtc agacactgtt aaaaactttt tatgactctg tggtgtcttc 2760 tgtgattctg tatgctgtaa cctgctgggg agggggactg ttggagaagg agaaaaacaa 2820 attaaataag ctcataaaaa gggcaggttc tgttgtgggg ggtgcactac ctaccataga 2880 agagattgca caggatcgaa tgctggacaa atttgttagt ataatgaacc acgatagtca 2940 tcccctttat gatacagtcc aagcgtgtgc aagctctttt agtacaagat taattcaacc 3000 ccggtgctac agagaaagat acaggaagtc ttttttacca acagtgatca gattgtataa 3060 ccaaagtcat accagatgaa ttgtgctgaa cttgaattag atattggata tgtacaagtg 3120 tttactcgtt gtttttggaa ccaaggtgtg cagtatgaaa tgtaatctta ttattgtttt 3180 tatatgtatg taacttattt atctattttt agtatggaga gctctgctgt gacatgtgaa 3240 tttccctccg gggattaata aagtcaaact aaactaaact aaactaa 3287 // ID DNA-X-7_DR repbase; DNA; ZEB; 646 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE A nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-646 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1733-1733 (2008). XX DR [1] (Consensus) XX CC TSD is unknown. Copies are ~90% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 646 BP; 188 A; 137 C; 136 G; 185 T; 0 other; gctgtgtctc atttcagagg ctgcatcctc cgaaggatgc agacttcaaa ggtgagtcct 60 tcgaagtcca cacaggctga accgagcgtt ttgaaatgag acgatctagc ctacagagga 120 cagatctcgt caccaggcaa ccgtaacagc tgctgttaat aagcgctgat aaaatttgta 180 atgacttttt ttcaaagtga ttttaaaact tattttaagc gatctgaagg caaaacaagg 240 tttacagaag ctggaaatat atttcctttg atccatacat gtacacataa acatgtaaac 300 tgtctataaa atcactcttt agtaagacat gtcatactct ttttgttttt taacatgtgt 360 ttagatatcc aagtaaaata aacctaattc atacatttaa accggagttt tcttttaaaa 420 acgtcctgcc gttatcagcg cgcaatgaat cttgggacgt tcgaggccgc gaaggatcca 480 ccggttgtgt ccttcattac ctgggaaacg aaggacgcat tttgaggctg cgtttgaagg 540 agccttcgaa ttttttttaa atgagacagc cttcgtcgcg ccgcgatgac gcagcgcact 600 tcgaatgcat ccttcggagg atgcagcctc tgaaatgaga cacagc 646 // ID Gypsy138-I_DR repbase; DNA; ZEB; 11410 BP. XX AC chr1; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy138-I_DR; KW Gypsy138-LTR_DR; Gypsy138_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-11410 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1537-1537 (2008). XX DR Genome; chr1; Positions 32331102 32319693. XX CC Positions [3823-4278] - Reverse transcriptase CC Positions [5293-5772] - Integrase core CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(3571..5109,5113..6408) FT /product="Gypsy138-I_DR_1p" FT /translation="MDLSEVFSKSQATSLPPHRPYDCSIELVPGASPPRGR FT LYSLSIPERTAMEKYLNEALDSGFIRPSTSPAGAGFFFVSKKDVSLRPCID FT YRGLNHITIKNRYPLPLMNTAFEILQNATIFTKLDLRNAYHLVRIKEGDEW FT KTAFNTPTGHYEYQVMPFGLVNAPAIFQAFINDVLREMLNRFAFAYLDDIL FT IFSSSYEEHVQHVRQVLSQLLRHRLYVKLEKSEFYVSKVSFLGFIVFKCSL FT QMDPGKIKAVLYWPQPCSVKEVQRFLGFANFYRRFIRGFSSIAEPLTALTK FT KTAKSFVWTEMANKAFNRLKSLFTSAPILALPDPELPFVVEVDASDIGIGA FT VLSQRSKTDNKLHPCAYLSHRLTPAQRNYDIGNRELLAVKVALEEWRHWLE FT GAKHPFLIWTDHKNLTYIREAKRLNSRQARWALFFNRFDFTLSYRPGSKNS FT KPDALSRQFGSLDSETTPELIIPTSRVVGAIQWGIKETVRRAQPQTVPGNC FT PPGCLFVPNHLRSEVLQAHASLQSGHPGTTRTHKLIQRKFWWPRLQKDVRN FT FVAACSVCAQNKEPRTRPQGLLHPLSIPKHPWTHISLDFTTGLPESKGNTV FT ILVVVDRFSKACHLLPLPKLPTASQTAELLMQNVFRIHGFPQDMVSDRGPQ FT FMSRFWKAFGRLIGSSISLSSGFHPQYNGQTERVNQEIERSLRCLVSDNQT FT TWSSRLVWAEFAHNTLYHSSLGMSPFQCQYGFPHLMFPGQEPEVSVPAATQ FT LVRRCRRSWQRARVALLKATEQQRRQANKRRRPGPTLRPGQRVWLSTRDLP FT LRVESRKLAPHYIGPFKILRKINPVSYRLLLHRSMRIHTTFHVSKLKPVVC FT SSFSPPSKPIPRPWIIEGRPAYTVRRLLDSRLVRGKVQYLVDWEGYGPEER FT SWVPAKDILDPVLITDFNHAQRSSRGTSGAVLRGEAPVRTH" XX SQ Sequence 11410 BP; 3220 A; 2710 C; 2288 G; 3192 T; 0 other; tgtcgtcctg caatgacggg ttcataccag taatcataaa atcacaaaga aaagatacat 60 tgtaaataat tcaaacaaaa agatcaacaa aaatatacac tttacaggag cactcactca 120 ccacgaccac tcccttagct ggcgcgaata ataatgacct gtccagaacc agttaacaac 180 atgccaccag ccaatgaaca gagagaaggg tcatttagaa ttaaagggcc atgtacataa 240 aagaaaagca agtggaagca cttttataca agtggtgcta gctgcagagc catatacagg 300 ataaagaaga gaaacaaata aacaaccaaa tttgtgttca aaactccatg atattgtcag 360 ttgagttcag tgtagacact atttcccaca ggaattaaag tgaaaagagg gtggccagtc 420 ccctaaacac ctatagacaa acatgcccat tacatgttag ctaaaaaatg acaaatcatt 480 aatttgtaca caagttctcc agagcagaga ctgagatcat gtctcaattc aacatattgt 540 atcatatttc aaaaatatga aacacccaag cttaaaacaa aagagaaaca ttacagtaac 600 catatattaa gacatactgt aaacagtgct ttatataaga aagcacaaaa aaattgtgtt 660 gctatgtaaa gccccagccg cctattagtt accctggtgc attatctcaa taaatcttgt 720 aactggttaa aacagtctac cagctctgga acttacacga cttagtgatc tttcaatggg 780 attattatgc acagtaatat tagacagcga actgacctcc ataagttgag caccattagt 840 cattggtgaa atcgaggggt caggtgtcga tatttcaatc tgctgtgcag ttgatttagg 900 cacactacta cttgtaccaa caacattctc ataaacctca gagcccagct gggttgtcga 960 aaaatcatta cacatattat gttccaagtc atttggatga acagtatcca ctgctagtga 1020 agattcaaga acatctgaat catcttccac atgcgtgtct gtttcattca tgccatgaga 1080 aggcagctgt gacacccact cttgtgtcct gtaattggaa gcatcctcca atacattttc 1140 aattactgaa tcaacactct ctgagtcact tgagttgttg tttgctgtat cagcttccaa 1200 gtcatcttca ggaagtggga gaaagttgat tataaggtta agatgtactg tctttataac 1260 accagtttga ctgtgctgaa ttctgtaggt atgactctcc atgtttactc caacaacagt 1320 ataaactgca tcctcccata gatcagcaag cttccgtttt ccacgtgctc tcttgttagc 1380 cagcaaaacc cggtcaccag catcaattgg gccacccttg actcttttgt tatacaagtc 1440 agcatgcctt ttaagctgtt ttgtggcaga cgtctgtgca atacacattg cctccttcat 1500 atgtaagaac tcattaagct gcattccaaa caggatccaa ttttcatgct tcctcctgaa 1560 cagtggatgg caccacgttc tcaccagctg ctgtgtgcca ccactcaagc ttgatatgca 1620 acacctgtgt cttgccctat ataagtgtgt gtgttacaag ctactgtgtt cagtcttgag 1680 tcagtaccac ccggtgttct ccagattgtc tagagaagct aagtgacatt ctactgttgt 1740 gttttttgac ccttgtgtct aatttgtttt gcctgtttac tagaagctct gcttcagtta 1800 tttctgttgt acttttgaat tctgtttttc tagtaaagat catctgttta ttttttctga 1860 aaccaaatcc tcaagttccc ttattattct tctgctcgtg ggttcatcta gtgaggagtt 1920 agctcaactg ggaaaagccc cagggttcaa tcctcaccta aaccaacacc ttacagcaag 1980 actgaaccat ggaagtgaac ccagcaggag aagcctcacc accttccaat gttgatcgta 2040 ttctgtctgc tttgtctgat caagcaacca ctatccagaa ccatgagcag attcttggtg 2100 agatcctcca gttactaaat cctcaacatt taacttctca aggaagccaa ccagtagcca 2160 tccctccaac agtttttgtt gcttcatctg agcccaagtt accagcacca gagagatttg 2220 atggcgaccc agagaggtgt tgagggtttg tcacccagtg cacgctggcc tttcaattgc 2280 aacccagcag ctttcccaca gagaacagta aggtagcata tatcatcact ctacttactg 2340 gcaaagcctt gaattgggcc acatctttat gggaccagaa atctcccctt accagcaaca 2400 gtaccagttt tattgcggag atgaagaagg tcttccaaca tccagccagt aaaggagatg 2460 tgagttatcg cctacttcaa ctgtctcagg gtgagaggag tgtggcagaa ttttctatcg 2520 agttcagaac cctggcctca gagagtggat gggaccagcg agcactaaag gccacttttc 2580 accgtgcctt gtcctctgaa ttaaaggatg aactggcctt tagagatcct gcaccagatc 2640 ttgagtctct cattgatgtt gccatctgtg ttgaccatcg cctaagagaa tgacaactgg 2700 aacgaggtag agagaccagg ttatttgacc ctgatgtaca ttctagcacc atcactcagc 2760 cccttcaatc caatgatctt gatgaaccaa tgcaattggg gagaactaga ttgacccaga 2820 ccgaacagga tcggcggatg aaagaccgat gctgcctcta ctgtggaaag ccaggccact 2880 tccgctccac ttgccctgaa ctaaagggaa aagccagctc tcgtccaggg agagggggtc 2940 ccgggtgaga gtagatactt ccccctgtgc tatcacttct ggagtatccg ttcaagccac 3000 tatcacccaa ggcaaccaac gtcaccatct tagtgccttc attgactctg gagctgctgg 3060 taattttatg gatatcaatg tggccagtga gctgggtatt agaaccataa tcctccagag 3120 accccttacc attacagccc tagatggcag acctcttggt tcaggacagg ttagtcactg 3180 cactacatct ctcaagttaa aaataggtgc acaccgagaa gaatacagtt ctttttgatc 3240 cattcaccag aacacccctt agggcttggt ttcccttggc tattccacca caaccctttt 3300 atagactggt ccagatgtgt ggtaaagagt tggggtacta actgctgtgt ttctgatttt 3360 tctttttgcc atcctttgga gccactggag acatttcttg gacctcagcc tgatccctcc 3420 agatcaccct caacccacac tatatccata cctaccacag aagtgcgggt aatacccaag 3480 ctcaagccat taactttcga gtctccagcc cccttgccag agccacctaa ggttactcct 3540 ccatgtttgt ctcgagttcc ccaagaatac atggatctca gtgaagtctt tagtaagtct 3600 caggctactt cattaccccc gcatcgtcca tatgattgtt ctattgaact tgtacctgga 3660 gcaagtccac ctcgaggtag attgtactcc ctctccattc ccgaaagaac tgccatggag 3720 aaatacttga acgaggctct agacagtgga ttcattcgcc cttcaacctc acctgcagga 3780 gcaggcttct tttttgtttc caagaaggat gtgagtcttc ggccctgtat tgattataga 3840 ggtttgaatc acatcaccat taagaatcgt tacccattac ccctaatgaa cactgctttt 3900 gagatactgc agaatgccac tatctttaca aagttggacc ttagaaatgc ttaccatctg 3960 gtcaggatca aagaaggaga tgagtggaag accgctttta acacccctac aggccactat 4020 gaatatcagg taatgccctt tggccttgta aatgccccag ccatcttcca ggcattcatt 4080 aatgatgtcc tcagggagat gttgaacaga tttgcttttg cttacttgga tgacatactt 4140 attttttcaa gtagctatga agagcatgta caacatgtcc ggcaggttct ctcccagttg 4200 ttgagacaca gactgtacgt taagctggag aagagcgaat tctatgtctc caaagtgtcg 4260 tttcttgggt tcatagtatt caaatgtagc cttcagatgg atcctggcaa aattaaggca 4320 gtactgtatt ggccacaacc atgttcagtt aaagaggtac aacgattttt ggggtttgcc 4380 aatttttatc ggagatttat tagaggcttc agctccattg cagaacccct cactgccctc 4440 accaagaaga ctgccaagtc atttgtgtgg actgagatgg ccaacaaggc cttcaacaga 4500 ctgaagagtc tcttcacatc tgcccccatt ttagctcttc cagatccaga gcttcctttt 4560 gtagtggagg ttgatgcttc tgacataggc ataggagcgg tcctttctca gagaagtaag 4620 actgataaca agcttcaccc atgtgcttat ctttctcaca ggcttacacc agcacagagg 4680 aactatgaca ttgggaaccg tgagcttttg gcagtcaagg tggcattaga agaatggaga 4740 cactggttgg agggggctaa acatccattt ctcatctgga ccgaccacaa gaacctcacc 4800 tacattagag aggcaaagag gttaaactcc aggcaagctc gctgggccct tttctttaat 4860 cgttttgatt ttactctctc ttatcgccca ggctcaaaga actcaaagcc agatgcacta 4920 tcaaggcagt ttgggtctct agacagtgag accacaccgg aactaataat acctacctcg 4980 agagtggttg gcgcaatcca gtggggaatt aaagagacag tcaggagagc acagccacag 5040 acagtgccag gtaattgtcc tccagggtgt ctctttgtac ctaatcattt acgctctgaa 5100 gtgttgcagt gagctcatgc ttccctccaa tcggggcacc ctgggactac tagaacccat 5160 aagctcatac agaggaaatt ttggtggcca agactgcaga aggatgttcg caactttgtt 5220 gctgcttgct ctgtttgtgc acaaaacaag gaaccaagaa cccgacctca aggcctgctg 5280 cacccactgt ctatacctaa acatccatgg acgcatatat cgttggactt cactacaggc 5340 ctaccagagt ccaaaggtaa tacagtcatt ctggtggtgg tggacagatt ttctaaggca 5400 tgtcatctac tccctttgcc caaactccca acagctagtc agacagcaga gcttctcatg 5460 cagaatgtgt ttaggatcca tggctttccg caggacatgg tctccgatcg ggggccacaa 5520 ttcatgtcca ggttctggaa ggcatttggc cggctcattg gatcctccat tagtctctca 5580 tctggctttc acccccaata caatggtcag acagagaggg taaaccaaga gattgaaagg 5640 tctttgagat gcttggtttc agacaatcaa accacttgga gctcccggtt ggtctgggct 5700 gagtttgcac ataacaccct ctaccactct tccctaggaa tgtctccttt tcagtgccag 5760 tatggtttcc cacaccttat gtttccagga caggaaccag aagtcagtgt ccctgctgct 5820 acacaactgg tccgccgatg cagacgatca tggcagagag ctcgagtagc cttactaaaa 5880 gctaccgaac aacagcggag acaggcaaat aagcgtcgaa ggccagggcc cacacttcgc 5940 ccaggtcaga gagtgtggct ttcaaccagg gatcttcctc ttcgggtaga gtctcgcaaa 6000 ctagccccac actatattgg tccatttaag atcctccgaa agataaatcc tgtatcctac 6060 cggctactcc tgcacaggtc catgaggatt cacaccacct ttcatgtgtc caaattaaaa 6120 cctgttgtat gttcctcttt ctcaccaccc agcaagccca taccaagacc ctggatcatt 6180 gagggtagac cggcttacac agtacgcagg cttctggatt ctcgactggt ccgtgggaag 6240 gtgcagtacc ttgtcgactg ggagggctat ggcccagagg agcggtcatg ggtgccggct 6300 aaggatatac tggacccagt gcttatcact gacttcaatc atgcccaaag aagctccaga 6360 ggaacgtcag gagccgttct tagaggagag gctcctgtaa gaactcatta agctgcattt 6420 ctcctcctga ccaaaaggaa gtgctcctgt ctccttgcac aacaggatcc aattttcatg 6480 cttcctcctg aacagtggat ggcaccacgc tctcaccatc tgctgtgtgc caccactcaa 6540 gcttgatttg caacacctgt gtcttgccct atataagtgt gtgtgttaca agctactttg 6600 ttcagtcttg agccagtacc aaccggtgtt ctccagattg tctagagaag ctaagtgaca 6660 ttctactgtt gtgttttttg acccttgtgt ctaatttgtt ttgcctgttt actagaagct 6720 cggcttcagt tatttctgtt gtacttttga attctatttt tctagtaaag ttaaagtgta 6780 aaaatgtttc agtaatttca ggacagccat tggtttatta atcctttctc tcctattagg 6840 tctccttcct ctctccacaa aggaaatcac tctgcctaaa atattgtctt catgttgttt 6900 ggacattaga cattcatggg acatcacttc aacatctgga gattcagatg gcataaccgc 6960 ttgtggaaga tgaggcagca acaaggcatg tgtcttgacg gttgtatcac tatgcccaca 7020 agagtgcagg acagcttcta cttcttgtgt ggagactgta ccagataagg aggttttagc 7080 tgttaaacag gccactgcca cacattcagc atcaggagta ctgtcaaagg ggtgagtaga 7140 ccagcgaaaa acatcttgaa catggtcagt gtgcacatca gcagcctctg ccaataactt 7200 actgtaagga actcttgtga gacgatggag tacgctaggg tgctcaaagg gctcttggct 7260 cagggcgtca gcaactacat tttttggacc aggaacatac ttaatatcaa actggtatgg 7320 tgctagtttg gcaacccatc tatgttcaca agcatccagt tttgctttgg acagaatata 7380 tgttaatggg ttgttatcgg tccaaactgt gaatttctga cccctgagcc aatgatggaa 7440 tttatcacag atagcccatt ttacagcgaa aaactccagt ctatgtgcag ggtattgtga 7500 ttgtgcatat gtcagtgact tgcgaaagca acgggcctag ccattttacc atcagctggc 7560 aactgggata aaactgcccc caacccgtta cttgaagcat ctaacaagac ctgttcaagt 7620 aaagcctgtt tcagtttgtg aaatgcttga ctgcattcct cagtccagtc tgaagcagtc 7680 aactctctct gacatgtccg acgcttccac tttccacgtg gtgccttcct tcctgttgta 7740 agattgaaca atggcttggc gatagatgag cagtcttcaa taaattgctg gtaaaacact 7800 accatgccca gaaacgattc acagaactcc ttaaaagatg gcacttcttc ggagcaagct 7860 tgaggttatg tttctttaac cgtttaaaca ccatttccag acgctctaat gctagctctt 7920 ctgtgggtgc aaacaccagg acatcatcta gataacacaa tagactggtg aaattctggt 7980 ccccaaatat tgtcaacatc atccgcatga aagtggccgg actattacac aggccctggg 8040 gtaatctatt atattcatat aacccaaagg gcgatgtaaa tgctgtgtat ttcaggtcat 8100 cttcatgaac ttcgacatta taataaccaa atgtgaggtc catggttgaa aaataggcat 8160 ttccacccaa tgcagcaaga gcgtctgctt gatgtggaag tggatgtgcg tccttgattg 8220 tgaaagcatt aatccaacga aaatctgtgc atattctcaa gtcttcttac tttttccaaa 8280 ctaggacaag aggtgatgca aattcactac tggatttccg tatgatttca cggtcttcca 8340 tttcatccaa agagacgaag ccgatcatac tgattaggag caagacgtcg atatggcaat 8400 cgaaagggct tgtcatcact caagcgtatg cgatgaacaa atcctgctgc ctttccacag 8460 tccaacttgt gtctggaaaa gatagaatgg cactgagaaa taatccggac aagctttgcc 8520 ttacagtcat catacacgtc acaagattcc acatccaaat cagataatcc caaatcactc 8580 aagacattct tactctcagc agcacatgag tcgcccagtc caaacttggt cggtgagctg 8640 ataacatctg ttacgctcat ctcattctgc ctattagcta acacgtcaac tgtactttgg 8700 acgtgttgtt caagatcaca tgagcaacat gcaggaaatt ctgaacagtc aaaatcctgc 8760 aacgccatgc aggggaaagc atctgcaagt gcaaaatttc ttttcaaagt aattattttc 8820 tcagaaggat ttatcactga gtggaatgag tggaacccat ccatctttac gtatcaaaga 8880 tacggttttt ccaaccagca cggacctcgg tcttgatttt agtctgctgg gctctaaaat 8940 gacagcactg ccaactgaga cgtcattggg gttttgcaat ttgccccaaa ctaagttttc 9000 agtcataggt tgaagtgaca ctgccttttt gatcttcact gttccaatta catcagggac 9060 agtaccattc ttccattttt ttcacattgg ctagaaggtt aagcagttta ttttgttctg 9120 agtcatcaaa cttatgttcg gacacatgtt ctagtaattc catagaatgt ctcagctggt 9180 gaatcaaatg ttttaggaga ttacttccaa gtataaagtc atcacactgc cccttaacta 9240 ctaaagttgg aacagtggca tggcatccat acacattcat ctccagttca atcattccaa 9300 ggggttttgt cctagtgcct ccacatccga tcaaaaccac atctgttgga gcaagtgaag 9360 ggctagccaa cacgccagta ttcagtaatt gtggtaagat tttagagctc atagtacaag 9420 ccattgaccc agtgtcgagc attgcttttg cttcaacttg cccaccaatt atcactttag 9480 tatagaatag attatcatac ggctcaagtt tctgtgtaag acaactgtac attcagaact 9540 atacttctca catatacttt tatatacaga ttccagatca tcattcaata atattggggg 9600 gtcattattg ggcattacac tcaccccctc cacatgcagg cctattcgtt tcccgggcgt 9660 ggaacagcat ctgactttgt aacaaagaaa gcacagatga tttgctctgc agtgaaaatg 9720 agtggtatga tcaatatctc tgcatacttc acatgataga gaaggtctta tactgtttct 9780 gtgtgactta ggcagatgcc tttctacact agcctgattt cttaattgtc cattctccag 9840 gacacgttct aatattgcca agattcgctc catgggttca acaggctcct tagattgagg 9900 ctggtttgga atagatccag attgaatggc tgtcttaact tttgcatgag aaggctgaac 9960 agtcacttct tgtttcataa gagtaatggc tgaaggcact ctaggcactg agcgacatca 10020 tttctctcta ctatactctt ccaaccgtgc gtggatatca gccgcagtcc attgctccac 10080 aggtttacac ttgaatatga gagaaagctc agggtctgga caatgtctaa taaacattac 10140 agtgagttca cgggccatgt caacagtttt cttattctgt ctctttaaac agtcctttgt 10200 catatccatg gctctattaa gcctcagcca ataatcaaaa gggtcctcac tagcatgtgg 10260 aagtgtgcca taaaaatctg ctagtggcat gttagaaaaa gctgtgtcac tgaaatgttg 10320 tttcagcaaa tcaaaaatgg ggcttggacc cttatgcaaa ttaatagagg ggttactacg 10380 caagcccacc ctcacaactt cctgtgccct tcccataagt cgactcatga ctttatcagc 10440 ttgttcctca atcggctttt gcattttgcg cagatatgac agcattatag attcccactc 10500 atgtattgtg tacgtttcag atggatcacc gctaaaacaa acaggttctt taatttcaga 10560 ttttagaaca ctgttcagga cagatgagtc agcactacta gaaccgcaca ctctggattc 10620 caaacaggaa gctatgcttt ccccaataga tcgacccatc tgttctacca tccccactaa 10680 aaattccact gactcactgc ttttctctaa acctggttgt ttcagttcac tttctttcag 10740 ttaagtaggc agaggactta aattatcatt tacattccgt acagtttcta agtcaaacat 10800 gggggtagac gtgacaggcg tataattcaa tctacctcta cccaccatga gtgaaaacat 10860 aaaaagtctt ctacctcttc caaaagtatt ctctgactcc attttttaaa agaaaatagc 10920 aaaaatgaaa agaaaatcac ttaactaaaa aaatattagt attatcattt ttaaatttaa 10980 atgattaaat gacaccaaaa ctattagaaa aaaaaacaaa cacttccgga aagagagaga 11040 gaaaaaaaat tagtatctaa gcttaaaagc tcaaagttca ataaaaatgt attttgttca 11100 caaatatgta atgtcagcaa caacacttat caataactct gtcataaaaa aaaaccatat 11160 actttaaagt tcaatgtcaa actgttatta gtgatcaaaa agtcaaaatc ctactcaatt 11220 tgactccaca aaccagttct gttcctgtca gaagcgaaga gcggtccttt actcctcctt 11280 gcggccaaaa gcggtactac tctaaatagc tcctcaaacg agcgatcata atccgcggta 11340 cgaatgcctc cggaaacctc cgatgaatct tccctgcacg cggcttccaa agtcacgatc 11400 acggcaccat 11410 // ID DNA-8-11_DR repbase; DNA; ZEB; 961 BP. XX AC . XX DT 14-NOV-2008 (Rel. 13.11, Created) DT 14-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-961 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1726-1726 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 961 BP; 274 A; 205 C; 207 G; 275 T; 0 other; tagtgatggg tcgttcatga acgattcgtt cattttgaac gaatcttcaa tatgactcgg 60 gaactacgag tcctctcagg gagtgattcg ttcatccgcg cgtgcgcaca tttgtgcagg 120 tagtatcgtt aatttcaagt cttcatcaca ttttgagtcg ttcatcgcgg aaaggcagaa 180 gccaatcata tgcgtttaga gccggaaaaa gaattgatcc gctcacctct cgagtcctct 240 atcgggtctg agtcactcgt tcatcacggg ccaatcatat gcgtttagag ccggaaaaag 300 aattgatccg ttcacctctc gagtcctcta tcgggtctga gtcactcgtt catcacgggc 360 caatcatatg cgtttagagc cggaaaaaag aattgatccg ttcatctctc gagtcctcta 420 tcgggtctga gtcactcgtt cctcacgggc caatcatacg cgtttagagc cggaaaaaga 480 attgaaccgt tcatctctcg agtcctcggg tttgagtcat tctgtcacgt gatgaacgaa 540 cgactcaaga accagaagac tcgaaaggtg aactaattat catggctcct atcggcttag 600 actgtgtaca ttgattaaga ttatatgtga ctgtcagtgt aacgtgaatg aacccctgac 660 atttgaagac atgaagaggt gagctgagca aagagacaac aataccttca tagacaaaag 720 agcaggtaaa cattgaatta ttattttctc cttcttatag tattctattt atgacttatt 780 tgtcgtgcga tcaacctttt gggctagttg tagatgtgtt tggaagcaat tcgtaacatt 840 ttaataatat tttggcaaat tgaaccaaat gaacgaaatg actcgaaaaa agattcgttc 900 atctcgatga acgagactca aagatccgag tcagtaaaat gatccgaact tcccatcact 960 a 961 // ID TDR6 repbase; DNA; ZEB; 250 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR6. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-250 RA Jurka J. and Drazkiewicz A.; RT "TDR6: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 34-34 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. XX SQ Sequence 250 BP; 88 A; 39 C; 36 G; 87 T; 0 other; tatgaagagt tcagatgcaa aaacctctaa gtgccatctg aaattttctt ctaaaatgag 60 catttttctc ttttcagccc tcctgtgttt atgttaagta atttcacttt taaggcaata 120 aaaataactt attttcattg ccattaaagt gaaataactg aacataaaca aaggagcctg 180 aaaaaaatgc taattttaga agaaaatttt cagatggcat ttagaggttt ttgcatctga 240 actcttcata 250 // ID BEL19-LTR_DR repbase; DNA; ZEB; 1148 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL19-I_DR; KW BEL19-LTR_DR; BEL19_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1148 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1386-1386 (2008). XX DR Genome; chr13; Positions 40001555 40000408. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1148 BP; 340 A; 195 C; 204 G; 409 T; 0 other; tgtacaaaag attgtttaat tattttgttt tctaactgta catttagtct ttacctgttt 60 tgttattatg tgggcgtggt taaaaggtcc gctcgcgagt ttaatgaata aagtgaaagt 120 gttcgacaca gtaatgaaag aggaacagag agaaaggcag gtgcgcatcg tgatgctcag 180 tcgccagttt cgagagttaa agccaacaaa ctgttaggat tgcaatcaca gcatcctcag 240 agaaggcagt ttgactgaat cctatcagat gacaggaaga tgtggtgtga ttacttcaga 300 gtttgacaga actttatgat gtgcgtcatt gaagaaacgt gagtgttcat agttcattat 360 taaaaacttt aattatcatt cgatgtatat atacatgcat ctatacgcat atttatagtt 420 tgcaactgta ttagaacgac acgtttatta tattatgtgt tcgtgatctt tcacgtgttt 480 attcacgtgt agtaataatg acgcagcatt tttccgtgag tagtttgagg actaattaaa 540 cctttttttt ctttattata gaaccacaca catacataca aaatcacacg ctactgcatg 600 aacatttaca gcaacattta acttaatctg atcacttgct tttacacaat cactcattat 660 acattataca ttcattgcaa atgaagatcc cggcaaatgt tgcacactgg ctgatgacat 720 cactactatt gatcgactgc tgaagtgatc gctggacctt ctgtttatga ggtggatttt 780 tctgtgaaac tattgggact tatttcacca tattacaagt gatcgctgga cctttctgtt 840 tataagtcct ctgaaactat tgggatttat ttccccacta tattacaacc ttacagtact 900 gtaattattg tcatcttata tgataattgt tttcatttac gtgggcctac gctattttat 960 ttgatttgta ttgtgatgat tatgtttaca cactaatcat aaagaaattg atacattgtt 1020 tgaacaaaat aatttgtgta tgacctcttt actcttctcc tattctttgg acatcaattg 1080 gataatatct cctttaaggt ggtatagaac cgtatagtaa tcagtgcttt catacaggct 1140 tcaataca 1148 // ID Gypsy57-I_DR repbase; DNA; ZEB; 5789 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy57-I_DR; KW Gypsy57-LTR_DR; Gypsy57_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5789 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1429-1429 (2008). XX DR Genome; chr16; Positions 14958369 14952581. XX CC Positions [2489-3028] - Reverse transcriptase CC Positions [4344-4724] - Integrase core CC 'CTAA' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1382..3130 FT /product="Gypsy57-I_DR_1p" FT /translation="MRGKPEGSRCSTLTKERFLERAVGKCPEVDIHIGGVP FT VRCLLDTGSNVSTLTESFFRENLHGEDKDLHCTSKWLRITAANKLPLPYLG FT YVELDIKVMGLIIPECGFLIVRDGDAADLTPPGIIGMNITQRCREFALTEF FT DTTLGGKLDSVWREAFSRVQEVESAKATFVARVSGRGKTHIPASSAATVSA FT RVNKKVTHTNAWLLLEPGNAPLPGGLILMPTLVSPRSQVFPVQVINFSQED FT VWLPPKVQLGILTHCQCAESDICDVKFQRISADHEEVRIDQKEEPLSDSDL FT SNLLSRLHIGGTPEQQAELGILLMRHADVFAVHDEDLGYTDRVKHEIHLTD FT DTPIAQPYRRIPPTQFEEVKQHISGLLRKGVIQESSSPYASPIVLVRKSDG FT SLRLCVDYRRLNSKTRRDAFPLPRIDESLDALSGAQVFSTIDLASGYHQVA FT VDEKDRPKTAFTTPFGLFEYRRMPFGLCNAPATFQRLMQATMSDLVFQIVL FT IYLDDLLVYSSTFDDHLVRLETVFRRLRETGLKIKIEKCHFLQPEVKFLGH FT QVSAQGVSTDPDKISAVREWPVPSTLKELRSFLGVL" FT CDS join(4113..4724,4728..5741) FT /product="Gypsy57-I_DR_2p" FT /translation="MGECHQLLLPTCLRDQVLKNVHDQMGHQGIERTLSLL FT RQRCFWGGMYEDVEEWVKQCQRCMLAKLPQPKIKAPWASFLATRPLEVVAV FT DFTTLEPATDGRENVLVVTDVFTKFSQAFPTRDQKADTTAKILLREWFLKY FT GVPQRLHSDQGRNFESAVIAELCKLYGVRKTRTTPHHPQGNPQCERFNRTL FT HDLLKSLPPEKKRRPDHLAELVHAYNVTPHSSTGYSPYYLLFGVQPHLPVD FT ALLGQEPVTDDNLDWLKVHQERLRDAHVRAKDYAERKAAERAKQHEDKVYC FT PAIKVGQYVYLRHRPPGRNKIQDAWSSTVYKVLDVQGTTHTVEPVEGGTVK FT RVHRSNLRPCSHPVPVPMLRRLKQPVKDVPTSVLDSEMSSPGAECVLVEEV FT QCPGQNLTRSENPNLPVSELGGRFNDQMESGGSEDCLGADSHGIPNDALEV FT TPVCVELEKERINLPVENPATRPVPTPRKKKEERVNSEVPFLEVRKTQRTT FT AGIHSNPNQLPKSACNAVSFSPDVLSQVLAGMVLYTTGQLQGSLDE" XX SQ Sequence 5789 BP; 1666 A; 1214 C; 1464 G; 1445 T; 0 other; gagatttttg gtaccaggag tggggttcca ttgctaaatt ttggtaacat aaacaacggt 60 agtagtgaag aggaggcaag tgttgctgcc aacgaggtcc acagcacgga cggctacgta 120 cctagagaga ctacagagga gcaaggaatc agcttctgac tacaaaggag ggtgtttgcc 180 taagtgctga gatcattggt ggaggaccag gaaaattcat ctcttcaggt tggcctgcag 240 tgactttgac ttttctgtga ctctttgaca ttcaagttag agacttaaaa agggcgaaag 300 cagaaacatg gaagagaacc aggatgtctc agctgctcgt gctggaccca gccatgatgt 360 ggacagtggt gaggctgcag gtcatgttga aggtaatcct gtagcacaac tgcagtcaca 420 aattcatgaa ctaggtcaaa ggcataatga agtgatggct acccttgtta atatgagtaa 480 tgctaatact aggttgtatg tctatattcc aagggaacgt cagattgtac ccttcagtgg 540 taattatgta aaagatggtc agtctgttga tgaatttatt gatgaggttg aatttattaa 600 gagggtcagc tttagatgaa gtcaaactgc gtatgggggg agaaactaca caacccagtg 660 atctgttctc atatttgcgt ggggctttta gggagaaacg aacaactcct caattgttgc 720 atgcttttta tgcccgcaga cagtcagagg gtgaagattt tcgtgacttt tcccattctc 780 tctctcagct tcttagctcc gccatccaac aatcccctaa tgctgtgtct caaccccagc 840 tcactttaag agatcagttt gtcgaaggta tccgtgattc acatctccgt agagagttgc 900 gcaaactcat cagagaaaag ccccagtcca ccttgtttga agtccgggaa gaagcattgg 960 tgtgggtttc agaggacaga ccccgtggtg ctagtgtggc tagaagtcat aatattgtga 1020 gtacatgctc tgaggtgcca gaatgctcta cttcaggtgg tgcagctcaa actgatatta 1080 cagttgcctt acaggaggta gtaaaaataa tcacccagca gggaaaagcg attggagaac 1140 tcactaatgc agtccaggaa ctcacagtac agaaagccag ttctagtgtg agccagagtg 1200 acaggcccaa agtcaagcct aaatatacag ctgatggcca gcccatttgt ttgaggtgcg 1260 agggtgtggg acatatagct agacaatgtc ctgtaccacg tagttccagg aaccagtcca 1320 ctactgcccc aagctctgcg gtgcagggaa acggagctcc tccattgctt taagtcgagc 1380 aatgcgagga aaaccagaag gctcaagatg cagcacatta acaaaagaac gtttcttgga 1440 acgtgcagtt ggcaagtgtc cagaggtgga tatccacatt ggaggtgtcc cagttagatg 1500 tctgctagac acaggaagca atgtaagtac cctcactgaa agtttcttta gagaaaatct 1560 tcatggagaa gacaaggacc tacactgcac ttccaagtgg ctaagaatta ctgctgcaaa 1620 caagctacca ctgccatact taggctatgt tgaattggat atcaaagtga tggggctaat 1680 tatacctgag tgtggttttc tgatagtgag agatggggac gctgctgatt taacaccacc 1740 tggtatcata ggcatgaata tcacccagag atgcagggag tttgcactga ctgagtttga 1800 cacaacttta ggaggaaaat tggactctgt gtggagggaa gctttcagtc gtgtgcagga 1860 agttgagtca gcaaaagcaa cgttcgtagc aagagttagc ggtaggggca aaacccatat 1920 accagcatcc tctgcagcca cagtctctgc cagagtaaac aagaaggtga ctcacaccaa 1980 tgcatggctg ctgttggaac caggaaatgc accattacca ggtggattga ttcttatgcc 2040 caccttagtg tcgcctcgaa gtcaggtgtt tccagttcaa gtgatcaact tttcccagga 2100 agatgtttgg ctgcctccta aagttcaact tggtatactt acccactgtc aatgtgctga 2160 aagtgacatt tgtgatgtga aattccagcg catctctgct gaccatgaag aagtgagaat 2220 tgaccagaaa gaagagccac tgtctgattc tgatctgagt aacctgttaa gtaggctgca 2280 tataggaggg actcctgaac agcaagcaga gctgggtata cttcttatga gacatgcaga 2340 tgtgtttgcg gttcatgatg aagacctggg atatacagat cgggtaaagc atgagatcca 2400 cttgacagat gatactccca ttgctcagcc atatcgcagg atacctccaa ctcagtttga 2460 agaggtcaaa cagcacatct caggtttgct tcgtaaaggg gttatacagg aaagctcaag 2520 tccatatgcc tcacccatcg ttctggtgcg caagtctgat ggaagtctaa ggctgtgtgt 2580 cgattacagg aggttgaact ccaagactag acgtgatgct tttccattgc cgcgcatcga 2640 tgagagcttg gatgcgttga gtggagcaca agtattctcg actatagacc tggctagtgg 2700 ttaccatcaa gtagcagttg atgaaaaaga cagacctaaa acggccttta ctacaccatt 2760 tggccttttt gagtaccgtc gcatgccctt cgggctgtgt aatgcgcccg caacatttca 2820 acggctaatg caagccacca tgagtgactt ggtgtttcag attgtgctta tttacttgga 2880 tgacttgctg gtgtactcct ccacctttga tgaccacttg gtgcgtctgg aaacagtgtt 2940 tagaaggtta agagaaactg ggctgaaaat aaaaatagaa aagtgtcatt ttctgcagcc 3000 agaagtgaaa tttcttggcc atcaagtttc tgcccaaggt gtgagtacag atcctgacaa 3060 aataagtgca gttcgggagt ggcctgtccc tagcactctc aaggaactca gatcatttct 3120 tggggttctg tagctattac aggagattta ttgaagggtt ctctcagata gctggaccgt 3180 tgcatgatgt tgtgaacgtg tgcctcagag agacaagttc cattaaggca gctcaggtgt 3240 ataggagctc ttggacttcc cagtgtcaat tagcatttgc aaatcttaaa gataaactca 3300 caagtgcacc aacactaggg tatgctgatt tcacacttcc ctttgtgata gaaacagacg 3360 caagtaatct ggggctagga gcagtcttgt accaacatca aaatggcaag aagactgtaa 3420 tcgcatatgc tagcaggaga ctgaggggtg ctgagaagaa cgaccaaaat tatagcagta 3480 tgaagctgga gctgttagca cttaagtggg cagtcaccga aaagttcaga agttatctgc 3540 tggggtccaa attaaccatc ataacagaca ataaccctct ttgtcacctc accactgcca 3600 aattaggagc tattgaacag cgttgagcgg cacaacttgc agtgttcgat tttgaagtca 3660 agtaccgtcc tggacgatgc aatactgctg ctgatgctct ttctaggaga ccagcgttcg 3720 gtgaacctga atatgaaagt gaggatgcag agtatgatgg ctctgtggca atctgtaacc 3780 tgctcaggac aggaacagct ctggagccag agttgttagc agcaggtgtg gaagtcagta 3840 aagtcaggca gttacaggct tcttttgctg gagaagatga aaatgtggat gagaatacac 3900 ccactttgcc agggtactca aaggcagaac ttcatcagtt tcaggagtca gatccaacac 3960 taagtgtgtt caagaagttt tggagtcgtc aaagaaaacc cacccagcag gagcgactgg 4020 gattgtctaa atcagtacgt tctctgttga aacaatggcc aaagatcaaa gaaaagaatg 4080 gtctattata ttgtgtaatc gatgatgtca acatgggtga atgtcaccag ctcttgttac 4140 ccacctgctt gagggatcag gtcctgaaaa atgtccatga tcagatgggc catcaaggca 4200 ttgagagaac tttaagcctg ttgagacaga gatgtttctg gggaggaatg tatgaggatg 4260 tggaggagtg ggtgaagcaa tgtcagcgct gcatgttggc caagttgccg caaccaaaga 4320 ttaaggctcc ttgggcttca ttcttggcca cccgaccatt ggaggtagta gctgtcgatt 4380 ttacaacctt ggaaccagcc acagatgggc gcgaaaatgt tcttgtagta actgatgtgt 4440 ttacaaagtt cagtcaagca tttcccaccc gtgaccaaaa ggcagacact actgccaaga 4500 tccttctcag agagtggttc ttgaaatatg gagttcctca acgactacac tccgatcaag 4560 gcaggaactt cgaaagtgct gtgatagcag aactctgcaa gttgtatgga gttaggaaaa 4620 ctcgcacgac accacatcac ccacagggca atcctcaatg tgaaagattt aacagaaccc 4680 ttcatgacct tttaaagtct ctcccacccg agaagaagcg caggtgacca gaccatttag 4740 ctgagctggt tcatgcatat aatgttactc cacactcatc cacaggatac tcaccatact 4800 atctgctgtt tggggtccaa ccgcatttgc cagttgatgc tctgctgggt caggagcctg 4860 tcacagatga taaccttgac tggttaaaag tacaccaaga acgtcttagg gatgcacatg 4920 ttcgagcaaa ggactatgca gaaaggaaag ctgcagaaag agccaagcaa catgaagaca 4980 aggtatattg cccagctata aaagttggac agtatgtgta cctgcgacac aggccacctg 5040 gaagaaacaa aattcaggat gcatggtctt ccaccgtgta caaagtacta gatgtgcagg 5100 gtaccacaca cactgtggag cctgttgaag gaggtacagt aaaaagagtg cataggtcca 5160 acctcaggcc ctgcagtcat ccagtcccag tgcctatgct aagacgtcta aaacaacctg 5220 taaaagatgt acctacctct gttttggatt cagagatgtc ctcacctggg gcagagtgcg 5280 ttctggtcga ggaagttcag tgtccagggc agaacctgac tcgatctgaa aatccaaact 5340 taccagtttc tgagcttgga ggaaggttca atgatcagat ggagtctgga gggagtgaag 5400 actgtttagg ggcagacagt catggaatac ctaatgacgc attggaagtg acaccagtat 5460 gtgttgagct agaaaaagag agaatcaact tacctgttga gaatccagca acgagacctg 5520 tgcccacacc tagaaagaaa aaagaagaga gagttaattc tgaagtgcct tttcttgagg 5580 tgcgtaaaac ccaaaggaca actgcaggaa tacatagtaa cccaaaccaa ttaccaaagt 5640 ctgcttgcaa tgcggtgtct ttcagtcctg atgtgctgtc acaagtgctg gctggaatgg 5700 tcctctatac gacaggacag cttcaaggaa gtcttgatga gtagtgttca tgttggtcat 5760 cgaggacgtt gactttcagc aggggagag 5789 // ID Gypsy163-LTR_DR repbase; DNA; ZEB; 209 BP. XX AC chr7; XX DT 13-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy163_DR; KW Gypsy163-I_DR; Gypsy163-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-209 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1605-1605 (2008). XX DR Genome; chr7; Positions 69939032 69939240. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 209 BP; 47 A; 43 C; 51 G; 68 T; 0 other; tgttgtgtac tgtggcttca ggtgtaacgc attttgagcc ttatgatgtt ccatatggga 60 cgctcagcga gatcctgcgt ctgcgcagta tgacattgta gcatgtctcc tgtatatccg 120 caatattcag taaagagttc tgttgcataa gttttgtgtc cgtgtcatcg cttaagtgca 180 gcgcacttac atccatgaga gatgtaaca 209 // ID Mariner-N2_DR repbase; DNA; ZEB; 3966 BP. XX AC . XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 02-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative Mariner-type non-autonomous DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3966 RA Bao W. and Jurka J.; RT "Mariner-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1612-1612 (2008). XX DR [1] (Consensus) XX CC 2-bp TSD, mostly TA, but other dinucleotides also appear. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3966 BP; 1397 A; 746 C; 650 G; 1172 T; 1 other; cactctcaga aaaaaaggta caaaattgta ccttttaggg tacaaatggc ccgtcactgg 60 ggtggtaccc ttaagggtac aaatttgtac ctctatttaa aggtacaaaa ttgtaccttc 120 tacatttgta ccctttaagg gacaattttg taccctagac actgaaggta caaaaatgta 180 cctttttata ttatatagta ttttcagggt actgacccag gcattttgat cctttggtga 240 ctgcatgcac aaataaataa ataaaatatt ttggttcaca gatgtccatt tttatttatt 300 aatgtagaaa tatttcatta caactcacaa ctgaatgtgt taagaaataa aattacacag 360 caagtataaa gcagacatgt aaaacagtaa aaagaacaat ttagataacg aataaaaact 420 aaataatcta cacttaatac aaagaccaca agaatactgt ttaagattta taaacacatt 480 acacaggcta catcaagtac tgcactttat aaagtccaaa gtatttgtct ttagtcagaa 540 tactcggtca taataaataa tttgtcatca acattttctc tgaatttcag aatgaaggca 600 atgtttctct ataaattgct tcaacagtgc agaaaaycaa atacatcagt agcctcttcc 660 tcaacatcag tagattactc ctcatcatcc tcaagtggag ttgtaatttg ttgtaacaca 720 atatgtgttt atgcttttga taaacatgac atgtaaagca ctcatatctg gaaaatgttc 780 tttgatagtc attaggacca caagttattt aaaaatttgc tttatgcatg aatgatagca 840 gtgtgctgga ttaacttgat gattgtgatt atcccaagag catgtagaga aacaaaaaac 900 aaggttgatc actctggtaa ctggagtaga tcgaggcttg tcaccgtcca ccgttcattc 960 aaaacattcc actgcagaaa catcaaggtg ttcttctaat aagatggata tgtcaaaaat 1020 gtatattcgt ccacatcatt agcccggcca atcgcaatcc catcaactct gaagactgcc 1080 gtgactcgtc ttgttaacac agttgtccag tcggtggcct cctcctgtac ttgaactgtt 1140 ggatgtggtg aagatggttc actgtaaatg acaaagcaat aattacatta atatcagtca 1200 ctctatcact gatgaagaga aaatcaacat gcatagaaag taaaataatt aatattcact 1260 gattaaaaaa aaaaaaaaag tatcacagat taaaagacta atataaattt cattcattca 1320 ttcaatttct tttcggctta gtccctttat taatctgggg ttgccacagt ggaatgaacc 1380 accaacttat ccagcatatg ttttatgcag cagatgccct tctagctgca acccagtatt 1440 acgaaacatc cagacacact tattcacaca cacatttata cactacagac aaataagctt 1500 accaaattca gctatagcgc atgtctttat acttgtgggg aaaccggagc acttggagga 1560 aaccttcacg aacactggaa gaatatgcaa actcctcaca caaatgccaa ctgacccagc 1620 cgaggctcga accagcgacc ttcttgctgt aaggtgacag cactacccac tgcgtcatcc 1680 taatataaag aatatcaagc attataaaat gcatttatca cagcatatac taaacagaaa 1740 acaacatacg caccggagca caacgaaatg ttgaagtgtc tcgcttaaca aggctggcaa 1800 catgagaagt gctgctgtaa aatcaattcc tggaacataa gatgtggcat acaaacaaat 1860 atgaatgaat ccatccaaag ctagaattta ataatattaa tcaagtggag aaagtttaga 1920 aagagtacct tagaagtcag gtccatagca ttcttcagtt gatctctcat ctttcgatga 1980 aaacctagaa aataatttta aacatgtggg tgtacattta ttgttgtttg taagtaaaaa 2040 cctacaaatt taaaatagta tgtaagccat taaaaccatg gtatagataa tgagacatac 2100 ttacatcttg taactgagcc atgctttcat atgacatgcc ttcattggtc tgatgctgtt 2160 aggctgcatt ctgttgttca atcttgtcct gaaagacaat tggaaacttc aaaaacttct 2220 ggaaagcaat aagataaagc aataactatt tacttatttt gagatattaa taaaataggt 2280 aaaatagctc aaggtaccag tttcagaaca gatgagacaa attattacat taaagaacaa 2340 agtataactt aaaacgtcga tttatacaac atggtatgtt aataattcta cattaacata 2400 agtataacgt taaacgatag taaagcacca acaattctgg acatgagagt ttgtggacac 2460 tcctcaacat taatgtaact tagacactga aacgaagata tgtacaagca tacacacgcc 2520 acgtttgtaa ctagccacgt ttacataatc agtcgctttg tcggtgacaa aattgtgatt 2580 tttcttcaca aaatggcggt gtaacttata taaccgtcta aaattacatt tcagcggtca 2640 accgtggcta gaaaaatggc ctcaaaatat aaaacattat cgataattac aaatttccag 2700 tattaataac aaaataacaa aaggttttat atcaaggtat atgaagactt tagtcgcgcg 2760 agctccgggc cacaccgcac attactcacg ggtcacggcc cgattccggt gcacggatac 2820 ggcagcgtcg gctcgcttct gccgccgcaa ctggcccgag tgtattttgc tatttgggtt 2880 gcgattatat tccgccgtgg caacacttat taataaagca gtaacgtaac gtaagatata 2940 aggcgcgaat acgtaacgta ggcgataaag ggaactttta caccgagaaa actaacagca 3000 tatagtactc accgtctata acgttatgtc cctcggtgac ttcgatatca cattgtatta 3060 acttctttaa taagctgcag acatcttagc aaactccatc ctgcaggcca gcccggagca 3120 cttaaagcgg gggaaagccg agacccgaga gtcagacaaa atcaacagaa cacggcttca 3180 catgtaaaaa tgagactgaa aatatatctt taatacattc ttacctgaaa agtgtggact 3240 ccaaaataga cttttctttg aaaagcgtgc gaagttctgt gtaaacagcc agcatactcg 3300 acggaccgcc tcagtaacaa acactactat aacgtctcaa caagccaaat tcttaaagag 3360 acagcaacat caccacctgt atgaagatgc tgtcactctg catggttctt attctacatc 3420 ccataaacta tagggtctta aacagtaagc aaagaccagt atttacgttt ttaatgatta 3480 aaattacatt caattataaa ttaaattatt aacatttttg agaaaaaaat gtaatttaaa 3540 tacatgaaaa aaattttatt attaaaatat ttaattcaat caatttattt atttattata 3600 tttttaaatt atgtatttta gtattaatgt ttagtattaa tacatttaaa ttagtgttat 3660 atagtgtctt gatcttatca tgagctgagg gtacaatttt gttcctatag ggaaacaaaa 3720 tatataattg tacctttaaa ggtgccaaat tggtccttta cggtacaatt ttgtatccaa 3780 atgtgctata aaaggtacaa aaatgtacct ttcaaaccta aatctggaca tttgtacctt 3840 tatagcccat ttgggtacaa aattgtaccc taaggaccaa tatgggacct ttaaaggtac 3900 aattttatat tttgttccct ctaggaacaa aattgtaccc tcattgtacc tttttttctg 3960 agagtg 3966 // ID BEL9-LTR_DR repbase; DNA; ZEB; 654 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL9-I_DR; KW BEL9-LTR_DR; BEL9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-654 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1368-1368 (2008). XX DR Genome; chr18; Positions 26978947 26979600. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 654 BP; 191 A; 110 C; 119 G; 234 T; 0 other; tgttacgaat tctccatgta tgcctgtcag agagtatgag catgcagtac accaagtacg 60 cctcaaggtg gcactgtttg atcacatatg gacatttagt gaactgatta caggcatcgt 120 gctttagtgt tgcatcagcc attccgtctg gaaagttcgc tcgattgcac gcattattgg 180 gacaatgtat gagtgacaga caaagtaagt ttaatgttat ttgcttatat actgttattt 240 gaagattgaa tcaacagcaa acactgttta gcagtttaaa tgtgtttatt atgcacaatc 300 tgtttactat cgcgatctgt tcattgcccg tgaattgtgc tctgtaatat gagacgctga 360 tcattgaaag tgtaaattga tgatcattat aatttgtatt aacaaaggaa taagtgtaca 420 cttgatactt tgtatactaa gatatatttc agtttatttg ttatttgtat aagaatgttt 480 cagtaattaa tttgtaattc ttttgttctt tattgttctc agaaaccttg catttgtcca 540 tctcagcaat cattaaagct acatatatca tttaatactg actccaagtt ctctaatcgg 600 aaccctgtgg cggttagcaa acatcggtcc agtaattaat ccctaattgt aaca 654 // ID ERV1-3-I_DR repbase; DNA; ZEB; 7464 BP. XX AC AL591144; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 10-DEC-2008 (Rel. 14.01, Last updated, Version 2) XX DE Retrovirus Danio rerio 1. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Retrovirus; KW leukemia virus; VIRDR1; ERV1-3-I_DR. XX NM VIRDR1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 2197-6103 RA Jekosch K.; RT "VIRDR1: Danio rerio retrovirus."; RL Repbase Reports 2(2), 38-38 (2002). XX RN [2] RP 1-7464 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Direct Submission to Repbase Update (10-DEC-2008). XX DR [2] (Consensus) XX CC Putative novel retrovirus similar to leukemia viruses. Original CC name of a partial copy was VIRDR1. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 7464 BP; 2315 A; 1287 C; 1873 G; 1973 T; 16 other; atttggtgtc agaatagtgg gatgtcttgg aggaactttt ccggaccctg cggacggtga 60 ctgatcatcc tgggacttcg tgtcacagcc ctgccccgct cgattgaggg ttgaggggcc 120 gaccgccggg ctcggagggg aaccccactg acatctggga aaaagaactc aagtggcatc 180 tgaactctct ggtaagarac aaattctttg tttattgtta aaaaagaaat tkaacaagtt 240 ttgggcattt gattaggcaa tttaaaaggg tttttcactg yaatttgggc catcaccatg 300 ctggtagatc tgtctctgtg ggatagcagc gtgattaatg caagtgagat ctgacatcag 360 gcggggatcg agacctttgt tacaagtgag atctggcgtt tagctgagat caagacctta 420 cacaagtgag atctggcgcc tgactaagat caagaccttt ttaatacgtt tggctagcta 480 actgcgattt actcccggga gggggttgaa atcagggggc tgtgcgattt acttccaaaa 540 aagaggttga aatcyagggc tgtgcgattt acttccagaa ggaggttgaa atcaggggcc 600 aagcgattta cttctaaaag ggggttgaaa ttgggggcca caaaattaga gctggaaagc 660 ttaacttttc gattttaggt gaaaaatcga aatctgttgg agacgtccga cagattaata 720 ggtaccgaca aagtccaggg gaaagattgt tgccagcatg gggaagtctc aaagcaaaat 780 aaaaagatat gatacaccgg tgttttggaa tatgggaaaa aaaattatat ggaaaaagaa 840 aaaaawagaa aaaagcatgc aagatatagg aaaactaata aagtatcaca agtttctaag 900 ggaaagggaa tgttgagtgt tactaggtta aacawaatga aaaagttgat tgagaaggca 960 gaaggaaaag caggttgttt ttgtgaaagg catgtgtgtg tgtcgtgtgg agacactgtt 1020 gattgttggc tcagggaggc agataagagg caaagaggaa aatgctgcag aggcagctgg 1080 catgtaggga agagaatgaa gaagtggcat gtagggaaaa gaatgaagga gataaagtga 1140 aggaaaaagc aagtgaaaaa tgtgaaagtg tgaatgaaag attaattagc actactaatc 1200 ctttctatac atctgtttcc atacatgtac cgccgtacca tcactatccc gtggctgagc 1260 tgcgagccct gagaactgac ccagacctca actgctactg tcccagaaga cctccgaccg 1320 caccaacgaa attcccacct caagcgagtg gaggacatct gtttgaactg gagaaatgga 1380 catttaacgg acattctgct gtgtgctcaa catgctgaag acaagctgac agtgaagcaa 1440 gagcaagaca agaagaaagc agaaagggag ctgtgcatgg ctctggtaca agtggccaac 1500 caaagatcca accaagacct gacaaccaaa gatatcacaa gttcaaaggc gacaaaggtg 1560 tcaagggtga gaaagatacc agaagatgga gaaactggaa tgtgacggat ggcgatgaaa 1620 tctgcagagc ctgtggtgya cgtgacgaga aactgtacca aatgtctgcg cttgccaccg 1680 agatggactt ttggtcaagg attgacccga aaggcgcaaa ccaacaacga ggagctgatg 1740 tgcggagagg gaagagcaga atggagaggg tctgggcaag cagccaacag ctacagtaaa 1800 gggaagtgaa catattttta ctaacatgtt taaatgcaga cacattttta tacacacata 1860 ttcacataca ctgacatact tttggctctc tgcaatgaag aaaaagcttg catttcccac 1920 tgtaattttg atcacagtaa tgttgatcat ttttctgatg aaatggcatt gttttttatt 1980 tagtgaagca tttttttaaa tatgccaaaa tattctatgt tgattgagaa tacattggtt 2040 gagtttctag ctgatagtgg agctagcaga tcttgcatac gaccatgtga cctgcagtgt 2100 gaaatacctt taacaaacaa atttcacgag tctctgtctg catctggtca tacagtggtt 2160 gagcgattta cagctccgct gacctgtgag acagaggggg gtaaaatttt cagacatgca 2220 tttgtttgtt caccaaaatg cccygttcct atcctcggaa gggacatttt gtgtaaacta 2280 aacttgattt taacagcaga ttcctctggt gtgagagtga ttgaaggaag gaggaatttt 2340 gttgtttaca aattgaatca caagcaccga agtgggctta tgaatggctg attgaaaaca 2400 atgartgggc taattcaatt tgcaaattgg ctaaagaacg tgtaaaacct tttgacagtg 2460 atataatgtc tcctggtgaa ctgcattgta cgtcacatgt cgtaactgat agatgaggaa 2520 tttgagaaag cttggtttga gacaaatgtg aatgaaacgt ggatgcagag tttgaaaagg 2580 cttggtttga ggcaaatgtg aatgaaactt tgattttaga gaaaatgtac tggaaggata 2640 gtctgtgtgc agtttcagtg tctctgtcag aaaagcagcg tttcattcta tctcatgtca 2700 gcacaggctg tgcctcacat atcagtttgc aagggtaaac atcagtcttg ggctgattta 2760 ggcccgtttg tgaagcagtg tcttgaagtc aaagattgga tcagcagaga graaggggta 2820 gaatggagcg agakctcgca rgcatttagg gtggacagtg aaactgaaac agctgtttca 2880 agaactgtga ctgctattga caaactttgc gtaaagaatt cttgcatggt tgactttaac 2940 gctgcagaca tacacccagc tttagcagaa ataccaagtg aattgtgggc taaaagcaag 3000 tatgatgttg gtttaatcaa aggttgtgat ccagtgacaa tcactgcaaa atctgattac 3060 agaccttgcc aacaacaata tcctttgaaa agggaagcca ttgagggcat taccccagta 3120 tttgaggctt tactggaaca gggtgttatt gtaccatgca acaattctga ggttcgcacc 3180 cctatttttc ctgttaaaaa gataagggat aatgggatgc ctacagaatg gcgttttgta 3240 caggatttac aagcagtaaa tgcagctgtc aaacaaaggg ctccattagt tccaaatccg 3300 tacacgattt tgtcacaaat tcctgaaaaa tcacaattct attcagtggt tgatttggca 3360 aatgcatttt tcagtgtgcc agtggacaaa gacagccagt tttggtttgc atttaatttc 3420 aatggcaaag gctacacctt tacacgtctg tgtcagggtt ttacagcatc tccaacttta 3480 tacaatgaag cgttgttaag aagtttggaa cctttgaccc tgacagctgg aactgctttg 3540 ttacagtatg ttgatgacct gttgatatgt gctgagaatg aagagacgtg tgtgaaagac 3600 actgtgactc tccttaagca tttggctaag gagggccaca aagtcagttt gacaaaattg 3660 cagtttgtta aacaaaaggt aacatttttg gggcacgtca ttacaccaca cagcaaatct 3720 ctgtctgaaa aaagggtgag tggtataaaa aatgtaccaa aaccactaac gaaaaaaaca 3780 aatgttgtct tttttgggta tgtgctcata ttgtcgcaca tttattccaa attatgcaat 3840 tttggaacaa cccctgagag ccctaacatt agggaagggg atgaaatcca ctgacaaact 3900 agagtggacg acagaggcag agcaggcatt tgtaaacatg aaattacaaa tggctgaggc 3960 ccctgcattg ggtttacctg taccaacaaa accatttgtt cagatggtag atgagagaag 4020 tgggttcatg acgtcattgc tcctacagga tcatggaggt agactgcgac ctgtggccta 4080 cttttcgagc aaacttgacc ctgtagcagc aggcctgcca cgttgcttaa gagcagtggc 4140 ggctgcggaa aaagctgtta tggcttcaag agattttgtt ggttattctg atctgatact 4200 gatggtgcca cattctgtgt ccatgatact ccaagaacaa aaaacatcgc atctgtcaac 4260 agcccgctgg ctgagatatc acactatctt gttagatatg ccaaatgtga ctgttaaacg 4320 atgtactggt ttgaatgcag ctactcttct tcctactgag gaagatgggg aagagcatca 4380 ttgttgttta acagcacttg aacaggtgtg tacaccacga cctgaccttt ctgacgaacc 4440 acttgaaaat tgtgacaatg tcctctttgt ggatggttca gcatttaaag atccacaaac 4500 aggccagaat aaagttggtt acgctgtaac aactgaattt gatgtggtga cctctgggaa 4560 attgccaggg cactattctg cacaggccgc agagctcgtg gcgttgacag aggcatgtaa 4620 attgatggca gaaaaagagg ctacaattta cactgactca agatatgcat ttggggtagc 4680 tcatgatttt ggggctctgt ggaaacacag aaaatttcta aagtctgatg gtcgaccaat 4740 actcaatgct cctttagtgg cagcgctgct ggatgcgatt ttactacctg acaaactggc 4800 catttgtaaa tgcgcagcgc acactaacaa taaagattct gtttctgtag gtaactccag 4860 ggcagatgga gcagcaaaag tcgcggcgtc ccaagacaag gacaactctg aatgttcttt 4920 gctatctgtt grtgataaca atgacgtgtg ttcttctttg caggacatgc agaccttcgc 4980 gacggggctg gagaaaaaac aagtggagac agtctggctg tgtgatgaaa gataatgtgt 5040 ggaagtgtgc tgagggcaag gcgtgtttac caaaacattt tttccaacat tatgcraaat 5100 tacttcacgg taaagatcat gtgtcaaaaa cagcaatggt tgcgcaaatg aacgaactgt 5160 ggttcacaaa ggggttcact acatttgcgg acaatttctg tagacgatgt gtaatttgca 5220 acacacataa tgtggccaga gcgataaaag ttccacaatc atctcatcca cctccaacag 5280 ggccatttga atatttgatg atggatttca ttgaattgtc cccatgcaat gggaaaaggt 5340 actgtttggt gatggttgac atgtggtcaa aatgggttga agtttttcca acctcaaaac 5400 aagattcggc tgcggtagca aaagcgttac tgactgaaat tgtgccgaga tggggaatac 5460 cacgaaaaat aagttcagat aatggtactc attttgttaa tgaagcaatc aaacaagtgg 5520 gccaatattt ggaaattgat ttgagaacac attgcagtta ccatccagct tcaggaggag 5580 ctgttgaaag agaaaatggt acmctgaaaa acaaattggc aaaatgttgt gaagacacag 5640 ggcttacatg ggttcaagct ttacccattg tcctgatgta catgagaatg agaaaaagat 5700 caaaaatgaa tttgagcccg tttgaaattc tctttggtaa accaccgcgt gtaggtgtga 5760 atgggggaaa acagcagctg ccctcaacag atgtgtgtga gaatgacatg ttgaattact 5820 gtaaagaaat gtctcatgtg ttgtccgatg tttgtgtgca ggtaaaggcc gcacagggga 5880 aggctgctga gagaccactg cacagtctga ggcctggcga tttcgtggtg atcagggacc 5940 tgaggagaaa gagctggaga gcgaaacgct gcgctggctg ggtccatttc aagtgctact 6000 gaccactgag acagcggtga aggtcgcaga gcgggcgacg tgggtgcatg ctgggcactg 6060 caggaaaatt ccatcwcctg aggaggattc cacgagggag taggagagaa agaggggctc 6120 tgaagctgtc tgaaggaaaa cagcaggctg ctatctgagt gaagcaaggt cacagctggg 6180 gagaaacaac aacaactcaa gaggtgtgtg tgaaacagta acttctgaat acaacaccaa 6240 gcagcaaaca gacggaggag aaagcgacca aaagagaggt gaataagctg tgcttgacca 6300 tcgcattaaa gatctgaaga tggacgaaaa gattcgctgg catccattcc gtgtgcaagg 6360 aatgtgtggt ctgaggagaa caacactggc attgcagcta gtgtcaccga atgttctgtt 6420 gttttaatta acctgattac agcaaaacat tactgacatt gtagcccaca tgagaatggc 6480 cgtaaaagag cctaagaacg ttaacaactg atggctagat tggctgtcat ttaagtgggg 6540 aaactggatc tactgggcct ttactgtggt gttacctgta ttgggggtgg gtctagtgat 6600 attatgttgt ctgccatgta tattcagatt tgtatcatgg tcagtgggca gactgattac 6660 atctacaaca tcccatcaga tggtaaagat ggccgtgaat gaagagggta tgaatattga 6720 catgggtgat gattttgatt atgactttga tgatagcagc agctacatgg atatggataa 6780 agaaaaaccg taaacaaatt gagaacaaaa ttttaggtga cggctgttac ttaattggga 6840 aaatattaac ggacctgact ttttgaaatt tgagataact atctgcttgc ttatatgtat 6900 tttgggtcta tccaaacaaa aacagcaaac tgcatgagtt ggtatgttct gtaacaagtc 6960 ttaaaagagt gatcatttgc atgaaaatcg agagaagagt gtaaatgtga tgtatgttta 7020 agagtacaga aacaacaatg tgttcctatg tttatgagtg ttattgatga attgatgtag 7080 agtgttagta atttttaact gattgaagaa atgtttccag aattttgttg tgtgaacaaa 7140 tcaataatgt gaaaataaca ggcaatttct gtgatttgaa cctctgaggg gtgactgagg 7200 gtctgatttc ggtcaggagc gcaataacag tgattattcg tttagcccgg ctcccaggga 7260 agagggatgt tttgaaaggt aaatcttaat ggggaaacta tattaagaag agtgtttaaa 7320 catctctagt gtatttgatg tgtgcagaaa cctgttaaat gatttttgta accagtgtga 7380 ttattataat cagtgtgatt ttagaaatca attgaaaaat gaatttaata atgagttatc 7440 aatgaatgat aaaaagagag gaat 7464 // ID DNA-8-15_DR repbase; DNA; ZEB; 2078 BP. XX AC . XX DT 26-NOV-2008 (Rel. 13.11, Created) DT 26-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-15_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2078 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1730-1730 (2008). XX DR [1] (Consensus) XX CC TSD is 8-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2078 BP; 688 A; 373 C; 394 G; 623 T; 0 other; tagggatgtc ccgatacaac attttaactt ccgatacgat accgatattg cagccatcag 60 tccgatacga taccgatata aatccgatat cagcacaaat tatgaatact ttatttttac 120 ctattttgtt gagtcgaatg tatgaacggc tagatcaaac tgagaacaag tcagcaacag 180 taagtatgga aaaaacaacc agtttattaa ctattggttg cataaatgtg aaccttttac 240 ataagaatat ataaaaagaa ttaaagaata aatattaaga ccctgaaaaa tatcttaatt 300 gaataaacaa aaatacattt ttaaagtgca aaatgctaat taaaggtcac ttcagttatt 360 cattttttac catcagcaaa gagtagaaac tgctgagagc aggaacataa gaaaaaaata 420 ttgttaattg aacaactgct aaaggttcag tgtccaaaat ttcaatttca cacacagctt 480 tttgatgaat ttttgatcca gttgagctga aaaatagctt tcacactggg cagcagaaat 540 aatttattgt taagttaata aaaatgtgca gtatggttag tacaaacatt tacatgatca 600 gagctatttc atctgtgttt gtatcataag aagattatgg ctgtatcaat gttcccaaac 660 attattaatt ttataataat atatattgca catattatat ttaacataaa aatacttgca 720 gaggctagat tgaaacacac cttttctcca cagcacttga aataatgtga cttttaagaa 780 taattttatt tatcaaaatt cattatattt gcatattata ttgcacagat tccaccctac 840 tgcccacatg ggaaaattag acaatgtaaa tacagacaga aagagaactg caaactgatg 900 tgcaaagctg acactgttta tcagagcact ttatctttct gggcttttta aagaagagtt 960 aaacatggga atgtctctaa taaaattttc cttctgcccc aaaagtaata ccattctttc 1020 ttttaaactg taatggccct cagcaacaga atgatttctt attcttattt aacagacaat 1080 gaaaacaaca aaaatgttta gcattttaca tgacactgtt cctttcagag tgttcaagta 1140 aaaaaaaatg aaataaagaa taataaatcc actatttcac tcctactatc catgtgcacc 1200 actgacaggt caaatctgct ccctctctag tcttggtgct gctgagagat tgagattgaa 1260 agtttgcgta taaaataatg tagaaacgca gctaaagaaa gcagaatccg ggacatgttc 1320 ggcgatttag aaatcttggc tgaatgcaat ttttaagtcc caaatagagg acatgtccgg 1380 gaaaaagata atgtattgtc accctaaata atgagctgga tcagtctatt tagtaaaagg 1440 cttttttccc tgcgtgtcct cgcgctcttc tcaaagctaa agtcagagca ggtgtccagc 1500 gcgtctctgt gggtctgctg cagcagcaga gcccgcgagt ctgtggcgga gacttttctc 1560 atgcacacag aaccgggccg ctgaggtaaa gtcttatgtg tgggaagcgc tggtgatgaa 1620 ttaggtatca cactggatga accacgatat gccatgaccg aattgaggct cgcacgatgt 1680 gtgctacgtc gccattcggt gccttatgtt gcttagcagc cttatgcaaa acaatgtaaa 1740 gatcgcttga cgtgaagtaa ataaaccagt aacgttagag atgtgccgtt gttgctagcg 1800 tcagcgtgtg ctaccgtcac aatagatttc tatcgtaata caccataact tagtgtgaag 1860 agcatgttaa atcttagaga ctttgtttgc tagcaagcag tgttgtgatc acgtgggctt 1920 tgcacgctgg ctctggacgc tgctggatag aatagctgta acggagttta gagagtatcg 1980 gactgcttat atcggagatt tttaacgcag tccgatatag ttttttggac tgatatcgga 2040 cgaatttccg atatcaatat cggatcggga catcccta 2078 // ID CATCH2LTR_DR repbase; DNA; ZEB; 825 BP. XX AC AL645822; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE Long terminal repeat of the CATCH2_DR endogenous retrovirus. XX KW BEL; LTR Retrotransposon; Transposable Element; KW BELL/CATCH superfamily; CATCH2I_DR; CATCH2LTR_DR; CATCH2_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-825 RA Kapitonov V.V. and Jurka J.; RT "CATCH2, an endogenous zebrafish retrovirus from the BEL/CATCH RT superfamily."; RL Repbase Reports 2(5), 3-3 (2002). XX DR Genbank; AL645822; Positions 128347 129171. XX CC CATCH2LTR_DR is a long terminal repeat of the CATCH2_DR CC endogenous CC retrovirus. See comments for CATCH2I_DR. XX SQ Sequence 825 BP; 226 A; 169 C; 156 G; 274 T; 0 other; tgtaaaagag gacggtgggc tgagcccaga gctggacacg agctgtgctc tctcctcacg 60 cgtgcatgct gtgacatcac aaacagactc ctatagagct cgtacttgat catgcgctga 120 tcacgagcac aggcagccaa tcaaaacgca gcttaggtct gctgtacatg gagactgtta 180 actggtttca ttctgatcag acagagagaa aggttgcatg ctttgctttc cgcctaattc 240 cagtcagttt tgtgtgcgtc aatgatgaga agtaagtgtt tatgcatctt ttacatacct 300 gttaccactg ttttatcatt atagtattgt attgagatta ttgatttatg cgcgagctcg 360 taaagaatcg cggactgccg cggctagtag tagcgatctt ttatgtaaat ataattgctt 420 taaatgataa tacatacatc gcctattaaa tccatgcttt ttgttatatt tccgcctaag 480 ttaaaagcct gttaatagtg agaattatta ttttatatac gtttatctgc cttgcttcat 540 ctgctttgct catgcatgtc agttataaat gtatattgta gtttagtgtt gtgtataagt 600 atttccactt gttactgtta tccttattat ctaaatttat ttattaatgt atattttctt 660 tcttttgttt agaccacaca cacacacaca cacgcgcaca cacacacaca tacacaaagt 720 gtgatggaca ttaaagagct ttaaatgggg ttcaagtctc cgtgttctta ccggcactcc 780 atggcgacga tcaaattcac ctaaaaacca gctaactccg ttaca 825 // ID GYPSYDR2 repbase; DNA; ZEB; 4212 BP. XX AC AL603743; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Gypsy Danio rerio 2. XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYPSY/TY-3; KW GYPSYDR2; LTR; retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4212 RA Jekosch K.; RT "Gypsy-like element from D. rerio (variant 2)."; RL Repbase Reports 2(2), 12-12 (2002). XX DR [1] (Consensus) XX CC Putative novel GYPSY/TY-3-like retrotransposon with nearly CC perfect CC 135 bp LTRs (GYPSYDR2_LTR), a reading frame (pos. 155-4175) with CC 3 frame shifts and 1 stop similar to polyproteins from Fugu CC rubripes, CC fungi and plants. XX SQ Sequence 4212 BP; 1124 A; 1164 C; 839 G; 1085 T; 0 other; ctgttcatca tcagtcttga tacccatact actactgtgg atgtgtgtgt gcactccggt 60 gcatattgtg tgtttgagtg aagttgtgat taataaatac tgcataatgg atccctccgt 120 gtcagtctcc ttgttacaga agactttgcc caacatggat cccacagcca tacaggtttt 180 gtctcacgag atcacaacgc aagctcaggt attgaccaca catcagcaac agttaacaca 240 cctaactcag ttgacagatg aactggtgaa atcgctgcaa aacctgcatg ctgctcccac 300 cgcgcaactc accgccagtt actctcccgc tcaaccctct gtgacacaga cccaggttgc 360 ttctggtgtt cgtctggcac tacccgacaa gttctcaggt aacccagcca aatgcaaggg 420 ttttttacta cagtgtaaac tatttatcgc ccagcaacct catttgttta aagatgagaa 480 cgggaagatt gcctttgtat gttcactgct aacaggaaaa gctctagact gggctactgc 540 tgtatggcca gacagcaccc cgatattccc ctcattcaat gactttctca aacgattttg 600 cactgtgttt gaccatcccg agggtggtcg taatgctggt gaggagcttc tgtgtttaca 660 acagggagat cgacccgcag ccgaattcgc tgtacagtct cgcacactgg cagcgcaagc 720 aggctgggct gatgatccac ttactacacg ttaccgaaga gctctgcgac ccgaactgca 780 aaaagagatg gcttgtcgtg atgaggggaa aacattggat caactcattg aactctcaat 840 caggttagac acgttacttc gtacacgtaa ccctctatgc tctcccgttt cctgtcctgt 900 gtcccctgga cctgtcactg aacccctgca attggggaga acccgcctaa cgcccgagga 960 gcgtgaacga cggaggagaa accatctatg cctatattgc ggaaattcag gacacacgaa 1020 gatcctgtgt cccaacaaac caccccctaa ggctcttccg gtgagtgcta ccacagtatt 1080 cacgactacc aatgatatta tgagtttacc tgtctgtcta aaatatggag atgctgaaat 1140 ttcaacggca gccatggtcg actcaggagc cgctggcaac tttattgact acacgtttgc 1200 cttaacacac tccattcctc taacctcttg tgattctcct ttagccatca ctgctataga 1260 cgggcgcccc ctgggggaag gacacataaa attctaaact caacccataa ctcttcaaac 1320 aggcatgtag aaaagatgtc cctcctcgca attaactctc cacgacatac aatcattctc 1380 ggactcccct ggttaaaact ccatgaccct caagtttcct ggaaaactgg tgagattact 1440 agatggagta attattgtgt tacccaatgt ctgcattctg tttcccctgt ccagatcaac 1500 actataacca gtatcgatga ctctgagatt tgtcaaattc ctgaggctta tcatgatcta 1560 ctcgaagctt ttaacaagca aaaagccact aaacttccgc ctcatcgtga gtatgattgt 1620 gccattgaac tattaccggg taccacgccc cctcgtggta gaatctttcc tctctctcaa 1680 ccagagaccg aagctatgaa cacctacatt tcagaggtac tagaaaaggg tttcatacga 1740 ccatcgacgt cacctgcttc agctggattt ttcttcatca agaaaaagga tggcagttta 1800 cgtccatgca ttgattacag agaactcaat gaaatcacag tgaagtatcg atatccctta 1860 ccactagttc cagcagcact tgaacaactc tgctcagccc agtatttcac taagttggat 1920 ctccgcagtg catacaacct catccgtatt cgtcagggag acgagtggaa aaccgggttt 1980 tccactatca atggccacta taaataccgg gttatgcctt tcggcctagc taacagtcct 2040 tccgtgttcc aggctttcat caatgagata tttagagaca tgctcaacca gtgggtcatc 2100 gtttacattg acgatatcct tatttactcc aactccctac cggaacatat ccagcacgtc 2160 agagcagtat taaaacgcct tatccagaac cagctatatg ccaagatttc taaatgtgag 2220 ttccatcagt catgcatttc attcctgggc tatattatca gtcccgaggg agtcgccatg 2280 gaccagaaga aagttgactc cgtcacacaa tggccacaac ctgaaactat cagacattta 2340 caacgtttcc tgggtttcgc taacttctat cgaagattca tccggaactt aagtacagta 2400 gcagctcccc ttacaaccat ggtaaaagca aacaacgctc gcttaaagtg gaactcagag 2460 gccattcaag cattcaacca gctcaaagcc cgcttttcaa gtgctcccat cctgagtcat 2520 ccggacccca atcaaccttt tgtggtagag attgacgcat ctaactctgg tatcggagcc 2580 attctgtctc agaggtcgtt aaccaccaac aaactccatc cctgtgcatt ctactcccgg 2640 aaactcaatc ctgctgagag aaattatgac gtgggtaacc gcaaactcct cgctatgaag 2700 gctgctctag aagagtggag acattggcta gagggagcta aacacccatt tactgtcatc 2760 accgatcata agaatctgga gtacattcgt tcttgcaaac gcttaaatcc cagacaagca 2820 agatgggcac ttttcttcac tcgcttcgac ttccaagtca cctatatccc cggatcaaag 2880 aatgtcaagg cggatgctct ctctcgactt tcggacgaag agactctgac agaggatggc 2940 gaacctattc tgaaggaccc tcttatcatt gccccgatac aatgggatat tgaccctgaa 3000 attatccagg cttctgaaca acactccacg ccgcaggcat gtcccgagaa caaaattctc 3060 gggactcctt gctccgtgat aaactcattg gtgaagtgca tactcatccc agttccggtc 3120 atccaggcag tacagcaaca gtacaactga tccaatctcg ctattggtgg ccatccatca 3180 acaaagacgt aatcaacttc cttaacaagt gtgctccttg tcaaatggcc aaacattcac 3240 gtcatcgtcc agctggtctc cttcaaccct tagagattcc acgtcgccca tggtcacaca 3300 tagccatcga cttcatcact gatttacctc agtcacaggg taataccacc atactcactg 3360 ttgtggatcg attttcaaaa gcctgtcgac tcatcgccat acccaagttg cctacagctc 3420 tggagacagc ggaactatta tgtgaatatg tatttcgtta ctatggtctt cctgaagaca 3480 tcgtttctga ccgagggtcc cagttcacgt cccgtttatg gtctgccttc tttaagaacc 3540 tccaagtaaa tgtcagtttg acatcaggct atcaccctca gtccaatgga caaaccgaac 3600 gtctcaatca agagatcgga catttcctcc gcacctattg tcatgctaat caaactgaat 3660 ggaacaaatt tctcatgtgg gctgagtatg cccagaactc attaataaag gcatctacgg 3720 gtctaactcc cttccagtgc gtacttggct ttcaacctcc tctattcccc tggtcaggta 3780 agaccacaga actcccagca gtcgacacct agtttaagaa gtgtgaggag gtatggaacg 3840 cagctcacac ccacctagca cacgccatca gaagggtcaa agaacaagcc gaccgtcacc 3900 gtcgtcctgg tcccacgtat tccccaggac agtgggtttg gttatcagct cctatgcctt 3960 aaacaccctg caagaagctc agtcccaggt acgtgggtcc atttcaaata gagaaacaaa 4020 taactcctgt ttcctttcga ctgacactcc ctaaccatta tcgtatttgc cccactttcc 4080 atgtctctct gctcaagcct gctgttggtc cagccgaggc ggatagggag gtggcagcca 4140 gtgaacaggg tcccccgcct ctcatgatcg actaagagga tcccccggcg tttcttactt 4200 tgcttcgcct tc 4212 // ID Gypsy64-I_DR repbase; DNA; ZEB; 4268 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy64-I_DR; KW Gypsy64-LTR_DR; Gypsy64_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4268 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1443-1443 (2008). XX DR Genome; chr14; Positions 47822710 47826977. XX CC 'GTTTT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 17..928 FT /product="Gypsy64-I_DR_2p" FT /translation="MDTAGPDPVRAAVAQQGVLLGQHETRLSNTSREVEVL FT SNQVAELTNRLQELQYETTRQGFRLEPEPRCNNPPPYDGDPNSCRAFLAQC FT SVVFALQPRMYDAEETKVAFVLTLLTGKAREWGTSVWETRAPCCASFNDLR FT LEMMRLFDRSARGQEAADQLARLHQSGRSVTEYAILFKTLAASCDWNEGAC FT RSIFRAGLEEDIQDELATHELPHNFDDLINLALRIEVRLNRRHQRKLTRSS FT GRVEGTLSSSTFPAAHDPEPMQVGRLQLTPQEKQLRLARGLCLYCGKPGHF FT AAVCPAKAKAHQ" FT CDS 1066..4269 FT /product="Gypsy64-I_DR_1p" FT /translation="MDSSVASQWGIPAIPLPDPIPARSLDGTLIATISHAT FT PSVSLVISGNHREVTTLYLLDSPNAPIILGHPWLVQHGPHVDWSGNSVLSW FT SQSCLESCLGPAPFPVSVCPVFQVNAADLTGVPAEYRDLGLVFSKSRATSL FT PPHRPYDCAIELLPGTSPPKGHLYSLSSPEREAMDKYIDESLKAGLIRPSS FT SPAGAGFFFVKKKDGSLRPCIDYRGLNDITIKNRYPLPLMSSAFELLQGAQ FT VFTKLDLRNAYHLIRIREGDEWKTAFNTPTGHFEYRVLPFGLTNAPAVFQA FT LVNDVLRDMVNQFVFVYLDDILIFSPSMQVHTQHVRQVLQRLLENQLFVKA FT EKCVFHAKSVSFLGFVISAGEIKADPSKVRAVAEWPTPDSRKALQRFLGFA FT NFYRRFIRNFSQIAAPLTVLTSPKVPFIWGSKAQEAFDNLKSRFISAPVLS FT IPDPKRQFIVEVDASDVGVGAVLSQRSQRDEKVHPCAFFSHRLSPTERNYD FT IGNRELLAVRLALGEWRHWLEGAVQPFVVWTDHKNLEYISTAKRLSSRQAR FT WSLYFSRFNFTLLYRPGSKNTKPDSLSRMFSAPEREVSTDTILPRGVVVAS FT LSWDIEREVLGAVRECQIPRGTPAGKLFVPQSLRSKVIQWGHSSRLACHPG FT VRRSLATIRQRFWWPSMAKDVRRFVFACPICAQNKSANSPPVGLLRPLPIP FT SRPWSHLALDFVTGLPESKGNTVILTVVDRFSKAAHFIPLPKLPSAKETAQ FT VVVDHVFRIHGLPVDVVSDRGPQFVSRFWKEFCRQIGASTSLSSGFHPQTN FT GQSERANQDLERTLRCLTSNNPSSWCQQLSWVEYAHNTLPVSATGMSPFQC FT CLGYQPPLFPSQEPEAAVPSALAFVQRCRRTWRKAKEALAQARRRTKAAAD FT RHRTPAPHYVCGQRVWLSTKDLPLRVASRKLAPRFIGPYRITKVLSPVAVQ FT LKLPHTLGRVHPVFHVSRVKPVFSSPLVPSIPTPPPPRLVDGSPVYTVRRL FT LEVRRRGRGVQYLVDWEGYGPEERSWVPARDILDRTLIADFRRRRGEPPQG FT TPGGDHGGGGT" XX SQ Sequence 4268 BP; 881 A; 1200 C; 1078 G; 1109 T; 0 other; gaacgatctg accactatgg acacagcagg tccagatcct gtcagagcag cagttgccca 60 gcagggggtg cttttgggcc aacatgagac ccgactcagc aacaccagca gagaggtgga 120 agtgctttcc aatcaggtgg ctgaacttac caatcgcttg caggaactcc agtatgaaac 180 cacacggcaa ggttttcgcc ttgagcctga accacgttgt aataatccac caccatatga 240 tggggaccct aactcatgtc gggccttctt ggcacaatgt tcggtggtat ttgcccttca 300 accacgcatg tatgatgctg aagagactaa ggtggccttt gtgttgaccc tccttactgg 360 taaggcccgt gaatggggaa catccgtttg ggaaacaaga gccccctgtt gcgcctcttt 420 taatgatctc cgacttgaga tgatgagact atttgaccgg tctgccagag gacaggaggc 480 tgcggatcag ctcgctcgcc tccatcagag tgggcgatcg gtcacggagt atgccatctt 540 gttcaaaacc ctagcagcat cctgtgactg gaatgagggg gcatgtcgat ccattttccg 600 tgcaggcctt gaggaggaca tccaggatga gctcgccact cacgaactgc cacacaactt 660 tgacgatctc attaatctgg ctctacgcat cgaggttcgc ctcaatcgcc gacaccaacg 720 taagctgacc cgctcctctg gcagggtgga gggcaccctt tcatcttcca catttcctgc 780 tgcacatgac cctgaaccca tgcaagtggg tcgtttgcaa ttaacacccc aagagaaaca 840 actacgtctg gcccggggtt tgtgccttta ttgtggcaaa ccaggacact ttgccgcagt 900 ctgcccagca aaagccaaag cccaccagtg aaacggggga tcctggtggg cgctattccc 960 cttcactccc cctcctcccg tactctgcta cccgcactga ttcagtatag aggttctacc 1020 aatgcctgtt ctgctctcct cgactcggga gcagaaggaa acttcatgga cagctctgtg 1080 gccagccagt ggggtattcc agccattcct ctccctgatc ctatccctgc ccgttccctt 1140 gatggtaccc tgattgctac tatctcccat gctactccct ccgtaagtct tgttatatct 1200 ggaaatcacc gtgaggttac cacactgtac cttcttgact ctcccaatgc tcccataatt 1260 ttgggacatc cctggttggt gcagcatggt cctcatgtgg attggtctgg aaattcagtt 1320 ttgtcctgga gtcagtcttg tcttgagtct tgtcttggtc ctgccccgtt tcctgtttct 1380 gtgtgtcctg tttttcaggt gaatgcagcg gatctgacag gggttccggc ggagtaccgc 1440 gatctgggtc tggtgttcag caagtcccgg gccacatcct tgcctcctca ccgaccctat 1500 gactgtgcca tagaactcct cccaggcact tctccgccta agggtcactt gtattccctt 1560 tctagtcctg aaagagaggc catggacaaa tatatagatg aatcccttaa agctggcctc 1620 atccgccctt cctcgtctcc tgcaggcgct gggttcttct tcgttaagaa gaaggacggc 1680 tcccttcgtc catgtattga ttatcgaggg ctgaatgaca tcaccataaa gaataggtat 1740 cccttacctc tcatgtcttc agcctttgaa ctcttgcagg gagcccaggt cttcactaag 1800 cttgacctcc ggaatgccta ccaccttatc cgtattcgag agggggatga gtggaagacc 1860 gcctttaaca cccctacggg acactttgaa tatcgggtcc ttccgtttgg tctgacaaat 1920 gccccggctg tcttccaggc cctggttaat gacgtgctga gagacatggt aaatcaattt 1980 gtctttgtgt accttgatga tatcctgata ttttctccct ccatgcaggt acacactcag 2040 catgttcgtc aggtgctgca acgattactg gagaatcaat tgtttgttaa ggcggagaag 2100 tgcgtcttcc atgctaagtc agtttcattc ttggggtttg tgatttcggc gggtgagatc 2160 aaagctgacc cctctaaggt aagagctgtt gccgagtggc caactccaga ctctcgtaag 2220 gcactgcagc ggttcctggg atttgccaac ttttatcggc gattcatccg gaacttcagc 2280 cagattgctg cacccttaac agtgctcacc tctcctaagg taccgttcat ttggggtagc 2340 aaggcacagg aggcctttga taatttaaag tcccgtttta tctctgctcc tgtcttgtct 2400 attccagatc cgaaacgaca gtttattgtt gaggtggatg cttctgatgt cggagtaggc 2460 gcagttctgt cacaacggtc tcaacgggat gagaaggtgc atccctgtgc cttcttctcc 2520 caccgcctga gccccacaga acggaattat gacattggta atcgggaatt gttggcagtc 2580 agactggccc tgggtgagtg gcgtcactgg ttagagggtg cagtgcaacc ttttgtggtc 2640 tggacagacc acaagaattt agaatatatt agcacggcca aaaggcttag ttctagacag 2700 gcccgctggt ccctttattt tagtcgattc aactttacac tcttgtacag accagggtca 2760 aagaacacca agcccgactc tctctctcgc atgttcagtg ccccggagag ggaggtttca 2820 actgacacca tcctgcccag gggcgtggtg gtggcttccc tctcttggga catcgagaga 2880 gaggttctgg gtgctgtacg agagtgtcag atacctagag ggactccagc gggcaaacta 2940 tttgttcccc agtcactgcg ctccaaagtt atccagtggg gtcactcatc caggctggcc 3000 tgtcacccag gagtcaggag atctctggct actatccgtc aaagattttg gtggccctcc 3060 atggccaagg atgtcaggcg gtttgtgttt gcctgcccta tttgtgccca aaacaagagt 3120 gccaattccc ctcctgttgg tctgctccgc cccttaccca tcccttcccg tccctggtca 3180 cacttagccc ttgactttgt cactggccta cctgaatcga agggcaacac cgtcattctc 3240 acggtggtgg accgcttctc taaagcggct catttcattc ctctcccaaa actcccctct 3300 gccaaggaga ctgctcaggt ggtggttgat cacgtgttcc ggattcatgg tcttccggta 3360 gatgtggtct ccgatagggg cccccaattc gtctccaggt tttggaagga attttgccga 3420 cagatcgggg cctctacgag tctgtcttca ggattccatc cccagaccaa cgggcagtct 3480 gagcgggcaa accaggattt ggaacgtact ctccgctgtc tgacatccaa taatcctagc 3540 tcctggtgcc aacaactatc ctgggtcgag tatgcccata atactctacc agtttctgct 3600 acgggtatgt cgccatttca atgctgtctt ggctatcaac ctcctctgtt cccctcacag 3660 gaacccgaag ctgcggtccc gtctgccttg gcttttgttc aacggtgtcg tcgcacctgg 3720 aggaaagcta aggaagcctt ggcccaggct aggagacgga ccaaagcagc ggctgatcgt 3780 catcggactc cagctcccca ctatgtttgt ggtcaacggg tatggctttc caccaaggac 3840 ctgcctctca gggtagcctc acgcaagctg gcacccaggt tcattgggcc ataccggatc 3900 actaaagtgc tgagtcctgt ggctgttcag ctcaagctac ctcatacgct tggtcgggta 3960 cacccagttt ttcatgtctc tagggttaaa cctgtgtttt cctcccccct tgttccctct 4020 attcctaccc ctccccctcc ccgtctagtg gatggttctc cagtttacac tgtcaggagg 4080 ttactggagg tcagacgaag gggtaggggc gttcaatacc tggtggactg ggagggctat 4140 ggaccagagg agaggagttg ggtcccagcc cgggacattc tggaccggac gctgattgcg 4200 gacttccgtc ggagacgagg tgagcccccc cagggtacgc ccggtggcga ccatggaggg 4260 gggggtac 4268 // ID Gypsy120-LTR_DR repbase; DNA; ZEB; 1036 BP. XX AC chr19; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy120-I_DR; KW Gypsy120-LTR_DR; Gypsy120_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1036 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1277-1277 (2008). XX DR Genome; chr19; Positions 7004858 7003823. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1036 BP; 270 A; 207 C; 190 G; 369 T; 0 other; tgtgatgtcc tgtcatatgg ccacacaaga ctgagggtaa caaacacacg cacacattcg 60 ctcagacaca cacacacaca cacacacaca cacaaacaca cacgcacatt cgctcaaacg 120 cacacactcg ctcagacaca cacacacaca cacacgcgct tacacataca cgcacattcg 180 ctgaaacgcg cacattcgct gagacccaaa caatactctc ctgctccttc actagctgac 240 aagcgcgcac cttgctcaac tacacgcata caaacaggca taacatccat aagaacttta 300 ggtaattaga gactgacaac ttctctttat ttcatatctt tatttataaa ataagttcca 360 aatattacat gtcacaattt ctttcagttt cggtacattg tttactggtt tggagtgcac 420 acattctagt ttacattagt actgtttctg ttatttctgt ttacttatgg ttatttctgt 480 tgagttaaag ttaccggagg gtgcagtgga ggatgatgct gtggtccgct ggatcggcca 540 gtcatgttta ctggcaaagg gctggttcct gtgtgtgcgt caaagattta tgtccgggtg 600 atcccgcgtg gcggaaccaa tgtgccgaaa tgataagggg gggaaagttt actttgaatg 660 tttttctata ttacttgttt gtttaattta aataatttgt ttattttgag taattatgtc 720 ttgtattttg taaatgtctt tttgtttaga attagtttga ttagtaaata ttatatttat 780 aattgttatg ttttattgtt tcacctatgt tgtttaagtg gatccaccgt ttgtgtataa 840 atactctgta gttgtttcat ttggtaggtc agtttggttg tgttcagact tcctgaattt 900 ctgttatatt tttgttgacc tcaatttgtt ttccaactag tttttatgtt ggattattgg 960 tttgtttaag ttaaataaat attttttggc cttaattttc ctgtgtcctc gtgccttacc 1020 gctgggctat ctcaca 1036 // ID Gypsy112-LTR_DR repbase; DNA; ZEB; 173 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy112-I_DR; KW Gypsy112-LTR_DR; Gypsy112_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-173 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1517-1517 (2008). XX DR Genome; chr20; Positions 45321620 45321448. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 173 BP; 29 A; 46 C; 38 G; 60 T; 0 other; tgggcacgcc gagggagcaa cattagtgat attgctgctt tttacggacc gctgtctttt 60 tacggaccgc tgtcttttta cggaccgctg ctttttacgg accgctgtct ttttacggac 120 cgctgtcttt ttacagaccg ctgctttaca gaccactgct ttctttttaa cca 173 // ID Gypsy72-I_DR repbase; DNA; ZEB; 4297 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy72-I_DR; KW Gypsy72-LTR_DR; Gypsy72_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4297 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1459-1459 (2008). XX DR Genome; chr14; Positions 54397434 54393138. XX CC Positions [1716-2171] - Reverse transcriptase CC Positions [3183-3662] - Integrase core CC 'GGTAA' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1608..4298 FT /product="Gypsy72-I_DR_1p" FT /translation="MDKYLTESLAAGIIRHSSSPAGAGFFFVKKKDGSLRP FT CINYRGLNDITIKNRYPLPLMSSAFDLLQGARFFTKLDLRNAYHLVRMREG FT DEWKTAFNTPTGHFEYLVLPFGLTNAPAVFQALVNDVLRDMINQFVFVYLD FT DILIFSSAMQEHVQHVRRVLQRLLENQLYVKAEKCKFHVQSVSFLGHIISV FT EGLRMDPAKVRAVSDWPPPDFRKALQRFLGFANFYRRFIRNFGRVAAPLTA FT LTSTRIRFGWSVAAQTAFDHLKSLFTSAPILITPDPARQFVVEVDASEVGV FT GAVLSQTAQDNKLHPCAYFSHCLSPTERNYDVGNRELLAVRLALGEWRHWL FT EGAAVPFLVWTDHRNLQYIQTAKRLNARQARWALFFGRFNFTLSYRPGSKN FT SKPDALSRCFGSPADPSPPDTIIPSRCVVGAISWGIEERVKRAQAGVEVSS FT GCPAGMLFVPDAVRTAVLRWGHCSKISCHPGVRRSLAVIRQRFWWPAMAGD FT VRRFVGACSVCAQNKSSNSPSIGLLQPLPIPSRPWSHIAMDFVVGLPPSNG FT NTVVLTVVDRFSKAVHFIPLPKLPSARETAREVVDPVFRIHGLPEDVVSDR FT GPQFVSHFWREFCRQIGSSTSLSSGFHPQTNGQTERANQDLGRMLRCLASH FT NPSSWSQQLTWAEYAHNSLPVSSTGLSPFMCCLGYQPPLFPSQGSEAAVPS FT VQAYIDRCRRTWTRAREALLRARGRTKAAADRHRRQAPRYVCGQRVWLSTK FT DLPLKVSAGKLAPKFIGPYPISKIMSPVTVRLRLPLHLCRVHPVFHVSKIK FT PVLCSPHSPHVSSPVPPPPLVIYNIPAFRVKRILQARRRGRGFQYLVDWEG FT YGPEERSWVPARDILDPSLIEEFHLRQSLSISGTPSGVPGGRGT" XX SQ Sequence 4297 BP; 840 A; 1181 C; 1110 G; 1166 T; 0 other; ttacacacta gccatcatgg atccagcaga tgtcacgatg cgctctgcag tggagcttca 60 aggagctatg ttgggccgtc atgaggagga gttaatcaac gcccgacagt cagtggagac 120 tttaaatgcc caggtagcag agcttacgga gcgacttcac catctcagtc acgcggcgct 180 cgaagctgat cgaccagcgt ccgcgatggc cccgcccatt gcggagccac gaatcaataa 240 tccgcctgtc tatgccggtg agtccaccca gtgtcgttct ttcattatcc agtgtgaggt 300 ggtattttca cttcagccca gaacatactc ttctgactgt gcaaaggtgg catttgtcat 360 ctcccttctg actggtcggg cacgtgactg gggtgtggca gtatgggaat cccaagcctg 420 gtgctgttca gatttctcag catttaagga agagatgatt aaggtttttg atcgatctgt 480 ttttggcaag gaggcttctc ggctgctggc atcccttaaa caaggaaagc gttctgtggc 540 agattacgct gttgaattta ggactctggc tgccacgtgt aattggaata ctgaagccct 600 tactgcccgt tttctggaga gcctcatgga ggaggtgaag gatgagattt acgctcgtta 660 ccgccggact gcttggatga actaatcagc ttggccattc gactggacgc caggatggag 720 ctacggcgtc gtgtgcgggg cctgttagtc cagcctaggt ctgaaacccc ggttcccatt 780 tctcttccta cggccatgga caaggtagat gacgctgagc ccatggtttt gggtcgctac 840 cgcctgaccg ccaaggagag acagcgacgc ctgggggcag gcctctgcct ttactgtggt 900 gggcagggcc atcaagcagc ttcctgtcca ttaaaagcca tcgctcgtcg gtagccgggg 960 gagtcctggc gagcggtacc tccacctccc cagctcaggg ttcacgtact caattgtttg 1020 tgtctgtcat tgttaacaat gtctctttta aaggtacggc tttgatcgac tcgggggcag 1080 aggggaattt tattgatgag aactgggctc gctccagagg tatcaacatc ttacccctca 1140 aatcccctgt tactgctctc gctcttgatg gccgacccat cgccaccatc acctccgtta 1200 ccaactgggt gagtgtggtc acttctggca accacagaga ggagcttagt tttttggttt 1260 ataaatcacc caccactccc atagtttttg gccatccctg gttatcccgt catggtccct 1320 gcattaactg ggcagataat tctgtgtcgt catggagcct gtattgccat gcacattgtc 1380 tgttgtctgc tccttctccc gtttctcttt gtctgtcttc acaggaggag gagactgatt 1440 tatcctgtgt tccggacact taccatgatc tgcggggggt cttcagtagg tcccgtgctg 1500 tatctcttcc tcctcacaga ccatacgatt gtgccattga cctcctccaa ggcacttccc 1560 cgcctcgggg tcgtcttttt tccttgtctg ctccagaacg agcagccatg gataaatatt 1620 taacagagtc cctggcagct ggtatcattc gtcactcctc ctctccggct ggagcggggt 1680 tcttctttgt gaagaaaaag gatggttctt tgcgcccctg cataaactat cgaggtttga 1740 atgacataac cattaaaaat cggtaccccc taccgcttat gtcatcggcc ttcgatctct 1800 tgcagggggc aagatttttc acaaagttgg acctacgcaa tgcttaccat cttgtccgta 1860 tgagggaggg tgacgaatgg aagaccgcat tcaacacgcc cactgggcat tttgaatatc 1920 ttgtcctgcc atttgggctc accaatgctc cagcagtctt tcaggcactc gtcaacgatg 1980 tactccgaga catgattaat caatttgttt ttgtttattt ggatgacatt cttatttttt 2040 cttccgccat gcaggaacat gtccaacacg ttcggcgagt gctacaacgg ctgctggaga 2100 atcagctgta tgtaaaggcg gagaaatgca aattccatgt ccagtcagtt tcctttttgg 2160 gacacataat ttcagttgag ggacttcgca tggaccctgc aaaggttagg gctgtctccg 2220 actggccgcc acccgacttc cgaaaggcgt tacagcggtt cctgggcttc gccaattttt 2280 accggcgatt catcaggaat ttcgggcggg tggcggctcc cctgactgca ctgacctcca 2340 ccaggatcag gttcggttgg tctgtggctg cccaaactgc ctttgaccat ttaaaatctc 2400 tgttcacctc tgcaccgatt cttattactc ctgatcctgc ccgacaattt gttgtggagg 2460 tcgatgcctc agaggtaggt gtcggagcgg tcctctctca aactgctcaa gacaacaaat 2520 tacatccttg tgcatatttt tcacactgtt tgtctcccac agaacgaaac tatgatgtcg 2580 gcaatcggga gctgttggca gtccggttgg ctttggggga gtggcgccac tggttggagg 2640 gggcggccgt accctttctt gtgtggactg atcacaggaa cctgcagtac atacagacgg 2700 ccaaacgtct gaatgccagg caggcccgct gggctctatt ttttgggcgc ttcaacttca 2760 ccttgtcgta tagaccgggc tctaaaaatt ccaaacctga tgccttgtct cgctgttttg 2820 gttctccggc agatccttcc cctcccgata ccatcatacc ttccagatgt gtggtggggg 2880 cgatctcctg gggaattgag gagcgtgtca aacgagccca ggctggggtt gaggtgtcca 2940 gtggatgccc agcgggtatg ctctttgttc ctgatgcggt tcgcactgcc gttctccggt 3000 ggggccactg ttctaaaatc tcctgccacc caggggtgag gaggtcgctg gctgtcatcc 3060 gccaacggtt ttggtggcct gccatggctg gggatgttcg tcggttcgtg ggggcatgct 3120 cggtctgtgc tcaaaacaag tcctccaatt ctccatccat tggtctgtta caaccccttc 3180 ccattccttc tcgcccctgg tcacatattg ccatggactt tgtggtaggc ctccctccat 3240 ctaatggcaa tacagtggtt ctcactgtgg tggatcgttt ttccaaggcg gtccacttta 3300 ttcctcttcc aaaacttcct tcggccagag agactgctcg ggaggtcgtg gaccccgtct 3360 ttcggataca tggtcttccg gaggacgtgg tctctgacag gggtcctcag tttgtttccc 3420 atttctggag ggagttctgc cggcagattg gatcctccac cagtctgtca tcaggcttcc 3480 acccacagac caatggccag accgagcgtg ccaatcagga tctcggccga atgctccgct 3540 gtctggcgtc ccacaacccc tcatcctgga gtcaacagct cacatgggca gaatatgccc 3600 ataactcatt gccggtatct tcgactggtc tttccccttt catgtgttgt ctgggatatc 3660 aaccccctct gtttccttcc cagggctccg aagcagctgt cccctcggtc caagcataca 3720 ttgaccgttg cagacgcacc tggactaggg ccagggaagc attgcttcgg gccaggggac 3780 gcacgaaggc tgcagcggac cgtcaccgta ggcaggctcc caggtatgtc tgtgggcaac 3840 gggtgtggtt atcaactaaa gatctacctc ttaaggtttc agcaggaaaa ttggcaccta 3900 aatttattgg gccctacccc attagtaaga tcatgagccc ggtgactgtc cgccttcggc 3960 tgcccttaca cctctgtcgt gtacaccctg tctttcatgt ttccaagatc aaacctgttt 4020 tatgttcccc ccattcccct catgtgtctt cccctgtccc tccccctccc cttgttattt 4080 acaatatccc tgcctttagg gttaaaagga tcctccaggc cagacgccgt ggacggggat 4140 ttcaatatct agttgattgg gaggggtatg gtcctgagga gaggagttgg gtcccggctc 4200 gggacatcct ggacccctcg ctgattgagg agttccatct tcgccagtct ctctctatct 4260 cgggcacgcc aagtggcgtg cctgggggga ggggtac 4297 // ID hAT-N64_DR repbase; DNA; ZEB; 1589 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N64_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1589 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1333-1333 (2008). XX DR [1] (Consensus) XX CC Members are ~90% identical to the consensus. TIR is 13 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1589 BP; 511 A; 245 C; 284 G; 548 T; 1 other; cagtgttggg gagtaactag ttacatgtaa cggcgttacg taatttaatt acaaaataaa 60 agtaacagta atttgttaca gttactgaga aaaaaatgtg taattaaatt acagttactt 120 atgaaaatgt taaagattac aaatggggtt acatctaaaa ataagaaata tatgttgaat 180 aatattaatc tgttgctttg cctgtttttg atatgcacga tgccttctgc taaggccatt 240 tattaaagca gtgctattct gatgcttttg attggttttc cggtttgatt gcggcgcacc 300 acgtctacca gttacgttaa tcctctctgc cgctgaaagc attaggctac tacaaccagt 360 cttttatcac attaacagtg ttgggcagta gcgacgctac aagtagtgac gctactagct 420 taactacatt tctcagtagc gtggcggtag cgtcgctact ttataaatca aaaagctttt 480 cagtagcgaa gctattttat tagtcaagta gcgcagtagc gtcaacacaa gctacattta 540 ccgatcgcgg atcaataagt gacggcacca acattcacag agctgggagc tttgaggccg 600 gtgtctagcc gatgaaagta aatccatatg atgkggagaa tgacgatttc ttcttcttcc 660 tgtttttcgg tggtcgacaa caaacttttt ggtgcgttac tgccacctct cgctctggtc 720 ggtgacgtcg ccaaccaatt aggctaacac gagaaatttg cttgatggaa agatgactga 780 gggagaggag gtctatatat atagtttact gtagtaaagg ctaaagtata ctatggtaat 840 tactattagt tcatgatagt tactaatgat actacattat gtagtgtaat tcattaatga 900 agagttgtaa atatatatac aatataatgc ttattcctaa atatttccct ttactataaa 960 ttactatagt attttaaatg tgtaacacct ggttttattg gatttttggt tttgctgtta 1020 tatacttata atttgtttct gtttggtaca gcatattatt tacagtaaat aactgaactg 1080 aacaattatg cctcatatgt ttcattgaca gttttattga tcactaagat atgattgact 1140 tggatttaag ttgcttcgat ttaaaaatga aaattgcaaa aatagcttag atgttttttt 1200 ttattgcaaa tgccacaaaa atcaccctgc acaactaact aaaattttga ttttgtaaca 1260 acagtaaaca ttttttatgg ttataaggtc tggttttgtg gtggctgtac tttaaaatta 1320 aattaatata atcttaattc aagtgatgaa agattttgaa attctaacag taaacagagc 1380 tactgtaagg tcacacctgc ttggtataaa tgcttgaaaa tgatttagtt gaaaatatcc 1440 attagaaact taaaagtaat caaaaagtat gtaatcagtt actttacttt tataaagtaa 1500 ttgaaaaggt acactactta ttacatttta aatagggtaa tttgtaatct gtaatctatt 1560 acatttccaa agtaacctcc ccaacactg 1589 // ID BELPADRE_I repbase; DNA; ZEB; 6326 BP. XX AC . XX DT 18-OCT-2005 (Rel. 10.1, Created) DT 26-OCT-2005 (Rel. 10.1, Last updated, Version 1) XX DE BEL-type LTR retrotransposon from Danio rerio - a consensus DE (internal portion). XX KW BEL; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Suzu-like; Pao/BEL; BELPADRE_LTR; KW internal portion; BELPADRE_I. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Copeland C.S., Mann V.H., Morales M.E., Kalinna B.H. RA and Brindley P.J.; RT "The Sinbad retrotransposon from the genome of the human blood RT fluke, Schistosoma mansoni, and the distribution of related RT Pao-like elements."; RL BMC Evol Biol 5(1), 20-38 (2005). XX RN [2] RP 1-6326 RA Jurka J.; RT "BELPADRE: BEL-type LTR retrotransposon family from zebrafish (a RT consensus sequence)."; RL Repbase Reports 5(10), 295-295 (2005). XX DR [2] (Consensus) XX CC A fragment of this element was described in Ref. 1 (Genbank Acc. CC No. BK005574). This entry reports a full consensus sequence of CC this relatively young family. The closest relative in Repbase is CC CATCH1 from Takifugu rubripes. XX FH Key Location/Qualifiers FT CDS 130..6222 FT /product="BELPADRE_I_1p" FT /translation="MATQAVGKSLEQLKAKRTVEKRAFTRLANTIFKTCKN FT MTEMDLQDSLNKLTIQAEKVMEVNDDLEAGMIAEIEANLDTDQSAELTEQQ FT KDDLEKTASDCELKLDEVKVLLHETLWTRYGDVELSTALQVAEAESDRVEA FT VDPTGNHEAYDFMLSHLQNLIKNAKEMYNKWQRCIPSGDERDFSQRQRGLE FT ANITRLVARKADFMQARQAEEARNIAHVPVISTYPIATIKLKPATLPKFSG FT SKRDFHRWKKDWEALQKQGEPTGSKEVKKVQLLDSLEEKIVRELRLTTYNS FT ADDIFRVLENRYGNKTAIAIEIVEDLQKIPPIRSHQPRKIVELIQEVEKAL FT GDLSDLGNTGAINNPLVTKSIEGKLPESLKKEWLIHVSAQQSAVTPDNRFD FT YLLDFLKKQENIYEQLEQLRLEEPNRKDVRNEPKWAKTKSTKSNTDQAGCV FT VCGDAKHKWKLYFCKQFRASKLVDKKAAVSALGACKKCLEVHEQYSFCKRN FT YLCKSQDCEDEGVPEHHYYLCPNAKREEKRADQKKSKLCPKTEKCVKDYTR FT DQQDFLSRLSPELAQQCRNVFSNTTARVFNVTKEKPGLLAQSGLLELPVIM FT MLLNVTANAGQKIGTLIDLASDTNYITHSAADRLNLKGEKITLVIHGVGGM FT KVRMKTKRYLLKIRVRTSQGTFKSHQLACYGLESIAEINNTVQPNQLKKLF FT PDIPLAELARPKVVDLLISHREGQLAPQRIRIVGDLVLWDGPLGKTIGGTH FT PELFEDVTVSAYRSKTHFARSMRLAAVKYEEVTEYSPPNKPLARLHESSVS FT TSTRDFLEWWKWDSIGAACEPKCGGCRCGNCQPGGKEMSLAEERELELVKE FT GLTYVMEDKHSNEPHWHAKYPWIQDPTSLPDNRRAVEATFLRTEKQLAKDP FT QWKAAYSAQVKDMLDRGAAVKLPESSIANWAGPVWYVSHLIAPNPHSVTTP FT VRLVWNSSQRFRGVSMNDLLIKGPDVLNQIRAVLLRFRSGVYAALGDIKKM FT YNSVWLEDQEVHLHRFLWRDTENEELGEYAITRVNIGDKPAGCIAQLAMRE FT TANLPSFAHLEEERRVIQHDSYVDDILTSHNDLDQLQSIVANTELILKAGG FT FHLKPWVFSGQSGREKSDDKCCKIKEKVMVLPNQLHDDDNKALGLGYSVED FT DKLYVMTSINFSKRKKKIRLGQNLTQEQVRAQTPNPLTRRELLSQVSGVYD FT PVGLVTPAKQKGAILVRRAFQEAKNTRLQVNETWDVALSDDLREDAIKLFE FT EYIQLGQIKFTRAITPPEFEGHPWAITFSDGSENTYGAVMYLRWDSKQGPV FT IRLVESKAKLTPLDQKGDAIKAEVCGAVFASRLRKYFEQHSRIQVERWFHL FT VDSQTVLGAIQRESYGYKTFFANRIGEIQGITKAQEWWWIPGPQNIADVIT FT RGASPQNLDESSEWQNGPRFLSLPVDEWPIKSAKDLAITARESINELQKKA FT FAAVLTRSKTKQKEPTMESKPIESPNLVRAEQKRPPAGLAVLNLCDIKRFS FT DLSRLVKTIAWVWRAAKKFLGKKRTVNKPKWEAVLSLGTISVREREDALRD FT IFLAEQDGVTFPNTTKERLVVFKDPDSGLLVCGGRVQVFQDDQLSVPILPS FT NSWISTLLAQESHKESHGGLAETLLRMRRRAWVLKGRRIAQKVVDSCVQCR FT KNKAKKCQQVMGDLPLERTQPAAPFEFTAVDLFGPYHVKDDVKKRVLLKVW FT GVVFCCMSSRAIHTELVNSQSTEGFLLAFQRFAAIRGYPRKIFSDPGTNFI FT GARPVLQDLYQFLEGIDKSTLEETAVKHGTEWIWKIHPADSPHRNGAAEAA FT VRIIKRALQNCGGESTLTYSEMHTALQIAANLANERPIDARAQTQEGCIQY FT VTPNSLLLGRASQSGDIKTFDFSSYPFKRLQAMQSLVNKFWERWSQLAGPN FT LFIRSKWHTTHRNVAIGDIVWIADQNALRGQFKLGRVISVNPDSKGVVRDV FT NIRTFPSYPVLITKPSKARANSSESKIGREKIPSTVLHRDVRRLVVLLPAE FT EQKSK" XX SQ Sequence 6326 BP; 2119 A; 1256 C; 1500 G; 1451 T; 0 other; gtgaaaaacc gggaccagtg aaaaactgga cagtgaaaaa acgggaaacc aggaaaactg 60 gaaacctgtt tgactcaaaa aaggcaagtg agaagataat atattcaatt tgaagaataa 120 ctgtgaagaa tggccacaca ggcggtagga aagtcactcg aacaactaaa agcaaaaagg 180 acagtggaaa aaagggcatt cacacgtcta gccaacacta tattcaaaac ttgtaaaaac 240 atgacagaga tggatcttca agacagttta aataaactca caatacaagc agagaaagtc 300 atggaagtaa atgatgatct tgaagcaggc atgattgcag aaatagaggc aaatctggat 360 acagaccagt cagctgaatt aacagaacag caaaaagacg acctcgagaa gactgcaagc 420 gactgtgagt taaaactgga tgaagtgaaa gttcttcttc atgaaacgtt atggactaga 480 tatggggatg ttgaactgtc cacggcactg caggttgctg aagctgagag tgatagagtc 540 gaagcagtag atccaactgg aaatcatgaa gcatacgatt tcatgcttag tcaccttcaa 600 aatctgataa aaaatgcaaa ggagatgtac aacaaatggc aacggtgcat cccatcaggt 660 gatgagagag atttcagcca gagacaaaga ggactcgaag caaacattac caggttagtg 720 gcaaggaaag ctgatttcat gcaagcaaga caagctgagg aggctaggaa tatagcacat 780 gtcccagtga tttctactta ccccatagca acaattaagt tgaaacctgc aacactacca 840 aagttctctg gtagcaagag agattttcac agatggaaga aggattggga ggcactccaa 900 aagcagggtg agcccactgg ctctaaggag gtcaaaaagg ttcagttact agacagtctt 960 gaagaaaaga ttgtaagaga actccgcctt accacttata attctgctga tgacattttt 1020 cgtgtcttgg aaaatcgcta tggcaacaag acagcaattg ctatcgaaat agtggaagac 1080 ttgcaaaaaa ttcctcccat tagaagtcac caaccacgaa aaatagttga actaattcaa 1140 gaagtagaga aggcacttgg agatttgagt gatcttggaa atactggcgc cataaacaac 1200 ccactggtaa caaagtcgat agaaggaaaa cttcctgaaa gtttaaaaaa ggaatggctt 1260 attcacgtat ctgcccaaca gagtgcagtc acaccagata atcgatttga ctaccttttg 1320 gatttcctca aaaagcaaga gaacatctac gagcaattgg aacaattaag acttgaagaa 1380 cctaacagaa aggatgtgcg aaacgaacca aaatgggcaa aaacaaagtc tacaaaatca 1440 aacactgacc aggctggttg tgtagtctgt ggtgatgcta agcataaatg gaaactctac 1500 ttctgtaagc agttccgggc atcgaaatta gtagataaaa aggctgcagt aagtgcactc 1560 ggagcttgta aaaaatgcct tgaggtccat gagcagtatt cattctgcaa acgaaactac 1620 ttgtgcaaaa gccaagactg tgaagatgag ggtgtccctg agcatcacta ctatttatgc 1680 cctaatgcta aaagagagga gaaacgtgct gaccagaaaa agagcaaact ttgtccgaaa 1740 acggagaaat gtgtaaaaga ctacactaga gatcagcaag attttctcag cagactttca 1800 ccagaattag cacagcagtg tagaaatgtg ttctcaaaca ctacagcaag agtgttcaat 1860 gtgacaaagg aaaagccagg tttactagca caaagtggac tgctagaatt gcctgtaatc 1920 atgatgcttc taaatgtgac agccaatgct ggtcaaaaga ttgggacatt aattgatttg 1980 gcatcagaca ctaattacat aacgcacagc gctgctgatc gtctgaacct aaaaggagaa 2040 aagataactc ttgtaatcca tggagttgga ggcatgaagg ttcggatgaa gactaaacgt 2100 tatcttctga aaataagggt gagaacctcc cagggaacct tcaagtccca ccaattggcg 2160 tgctatgggt tggaaagcat cgctgaaatt aataacacag tgcaaccgaa ccaactgaaa 2220 aaattgtttc cagacatacc tcttgccgaa cttgcaaggc caaaagtagt cgacctgctt 2280 ataagccata gggaaggaca gttggctcca cagaggatca gaattgttgg agacctggta 2340 ttgtgggatg gtccacttgg aaagacgatt ggaggaactc atcctgagct tttcgaagat 2400 gtgactgtat ctgcctaccg gtccaagact cattttgcac gatcaatgag attggctgct 2460 gttaaatatg aggaagtaac tgagtacagt ccaccaaaca aaccactagc cagactccat 2520 gaatcaagcg tctcaaccag tactcgagac tttcttgaat ggtggaagtg ggattccatt 2580 ggtgcagcat gtgagccaaa gtgtgggggt tgtcgctgcg ggaattgtca acctggtgga 2640 aaagaaatgt ctcttgctga ggagagggag ctggagttag taaaagaagg tctcacttat 2700 gtcatggagg acaaacacag caatgagcca cattggcatg ctaaatatcc ttggatacag 2760 gacccaactt ccttaccaga caacaggcgt gcagttgagg ctacatttct gcgtacagag 2820 aagcagttgg ccaaagatcc tcagtggaaa gcagcttata gtgcccaagt gaaagacatg 2880 cttgacagag gggctgcagt taagctacct gaaagttcaa ttgctaattg ggctggacct 2940 gtatggtacg taagtcatct catcgctcct aaccctcact cagtcacaac cccagtgaga 3000 ctcgtttgga atagcagcca aagatttaga ggtgtcagca tgaatgacct gttaattaaa 3060 ggaccagacg ttctaaacca gatacgtgct gttctgctga gattcagaag tggagtgtat 3120 gctgcattgg gggatattaa gaaaatgtac aactccgttt ggttggagga ccaagaagta 3180 catttgcaca gattcctttg gcgagacact gagaatgagg agcttggaga atacgcaatc 3240 acaagagtca atattggaga caaaccagca gggtgcattg cgcaactggc aatgcgtgaa 3300 actgctaatt tgccatcctt tgctcacctt gaagaggaac gacgggtaat ccaacatgac 3360 agctatgttg atgatattct cacatctcac aatgaccttg accagctaca gtccattgtg 3420 gcaaacacag agttgatctt gaaagctgga ggatttcatc tgaaaccatg ggtcttctca 3480 ggtcaaagtg ggagggaaaa gtctgatgat aagtgctgta agataaagga aaaggttatg 3540 gttctgccaa accaattgca tgatgatgac aacaaagcac ttggcctagg ctattctgtg 3600 gaagacgaca agttgtatgt gatgacatcc atcaattttt caaaaagaaa aaagaaaatt 3660 agacttggac aaaacctaac acaggagcaa gttagagccc aaacaccaaa tccactgaca 3720 agaagagagc tcttgagtca ggtatcaggg gtatatgatc cagtaggctt agtgactcca 3780 gccaaacaaa agggagcaat tctggttcgc agagcttttc aagaagcaaa gaatacaaga 3840 cttcaagtca atgaaacttg ggatgttgct ctctcagatg accttagaga agatgctatc 3900 aaactctttg aggaatacat ccaactaggc caaattaaat tcacaagagc cataacacct 3960 ccagaatttg aaggacaccc ctgggcgata accttttctg acggaagtga aaacacttat 4020 ggtgctgtaa tgtacttgag atgggactca aaacagggcc cagtaatcag gcttgtggag 4080 tccaaagcta agctgactcc tttagaccaa aaaggagatg ctatcaaggc agaggtgtgc 4140 ggagctgtgt tcgcctcacg tttgcgaaag tactttgagc agcacagccg aattcaagtg 4200 gagagatggt tccatttagt tgatagccaa acagtacttg gtgctataca acgtgaaagc 4260 tatggctaca aaacattctt cgccaatcgg atcggggaga tacaaggaat cacaaaagcg 4320 caagagtggt ggtggatacc aggcccacaa aacattgctg atgtgataac tagaggagct 4380 agccctcaaa atctcgatga aagttcagag tggcaaaatg ggccaagatt cctgagctta 4440 ccagtagatg aatggccaat caagtcagct aaagacttag ctatcactgc ccgagaaagc 4500 attaacgaac ttcaaaagaa agcgtttgca gctgtattga caaggtcaaa aacaaagcaa 4560 aaggagccga ctatggagtc gaaaccaatt gagagcccca acctagttcg agcggaacaa 4620 aaaagaccac cagcaggctt agctgtcctg aacttgtgtg acataaagcg gttcagtgac 4680 ctgtcacgac tggttaaaac aatagcatgg gtttggagag ctgcaaagaa gtttctcggc 4740 aaaaaacgga ctgtaaataa accaaagtgg gaggcagtct tgtcattagg aaccatttca 4800 gtaagagaaa gagaagatgc tctacgagac atttttcttg ctgagcagga tggggtgacc 4860 tttccaaaca ccactaaaga gaggttagtt gtctttaaag atccagactc tggattgttg 4920 gtctgcggtg gcagggtgca ggtctttcaa gatgatcagc tgagtgttcc aattcttcct 4980 tccaattcat ggatttccac attactagct caagaatctc acaaggagag ccacggaggc 5040 ttggctgaga ctctgcttag aatgcgaagg agagcctggg tattaaaagg cagaagaata 5100 gcacaaaaag ttgtggacag ctgtgtacaa tgtaggaaga acaaagcaaa aaagtgtcag 5160 caagtaatgg gtgatcttcc tttagaaaga acccaaccag ctgcaccttt cgagttcaca 5220 gcagtggatc ttttcggacc ttaccatgtg aaagatgatg tcaagaaaag agtcttgctg 5280 aaggtctggg gtgttgtttt ttgttgtatg tcgagcaggg caatccatac agaactggtt 5340 aactctcagt caaccgaagg tttcttactg gcctttcaaa gatttgctgc aattagaggc 5400 tatcccagaa agattttttc agatcctggt accaatttca ttggagcaag accagttttg 5460 caagatctgt accaattcct ggagggaatt gacaaatcta ccctggagga aacagcagta 5520 aaacacggaa ctgagtggat ctggaagatt cacccagctg attccccaca tagaaatgga 5580 gctgctgaag ctgcagtgcg cattatcaag agggcactcc aaaactgcgg aggggagtcc 5640 accctcacat acagtgaaat gcatacagcc cttcaaatag cagctaatct tgctaatgag 5700 cgaccaatcg atgccagggc acaaactcag gaaggctgca ttcaatatgt aacgccaaat 5760 tcgctattgc tcggacgagc gtctcaaagt ggtgacatta aaacttttga cttttccagc 5820 tatcccttca aaagactcca agcaatgcag tcattggtaa ataagttttg ggaacgttgg 5880 agccagcttg caggtcctaa tttgttcata aggagcaagt ggcacaccac gcacagaaat 5940 gtcgcaattg gagatattgt ctggatcgca gatcagaatg cactcagagg acaattcaag 6000 cttggcagag taatcagtgt caatccagac agtaaaggtg tcgtaagaga tgtaaatata 6060 cgaacattcc caagttaccc tgttctgatc acaaaaccat caaaagcaag ggcaaattct 6120 tcggagtcca aaatcggtcg agagaagatc ccgagtactg tccttcacag agatgtcaga 6180 cggctagttg ttctacttcc tgcagaggaa caaaagagta aataattcag agaacatcct 6240 tttgaaagct aaaagtaatc gaactaaaca gtttgccttc aaagagatgc gatttcctct 6300 ttatcccaag aaaatcgagt gggagg 6326 // ID CR1-30_DR repbase; DNA; ZEB; 1857 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-30_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1857 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2127-2127 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(87..281,268..1746) FT /product="CR1-30_DR_1p" FT /translation="KKKLSGKLKAKAVYKGKLKMGNVGSAWDGMHLITGQD FT SKKTNKICLNEFTSDTVFHKILIIFFKIFFSRFDSHDFKQEIERMKNECRI FT DREYDCIFFDVKDVIMSFKQSKIGKAPGPDKIGGRLLKECAEQLGPIFYEI FT FVLSFQQQKVPALWKQSTVIPVPKKQKPQKIEDFRPVALTSLVMKSFEXIV FT KREITQKTQSVLDPFQFAYRPXRGVEDAVGTLLHLVLKHLDKPKTLAKLLF FT IDFSSAFNTIQPHILVEKLINEFELDRMLVGWILDFLVQRTQRVKVNNCFS FT EWLVSSTGSPQGCVLSPVLFILYTNSCRSIFSNRYILKFADDSVIVSLLSE FT EDSTHGPVIDYFSSWCQEAFLQVNVLKTKEMTIDFNRKRRLIKGKQTSING FT VXVESVDQYRYLGTVLDNKLSFSXNTNMLCKKGQQRLYCLQKLRSFXIDKS FT LMCMFYSSYIQSLLSFSLLCWFNSLSVKNKSCLEKIVNLGGKITGRKQMTM FT AQLYHRQVLSKAQSILSDDSHPMHHEFYLLPSGIRYRTPISKTNRFKLSFI FT PSAVKFLNLG*" XX SQ Sequence 1857 BP; 594 A; 255 C; 395 G; 601 T; 12 other; aataacaaac atacaaaacc acttaattat gtcaaaacgt aatagggcct ttaaagaggg 60 agatattgtt gagtgtcata gactaaaaaa agaagttaag tgggaaatta aaggctaagg 120 cagtatataa gggaaaattg aaaatgggaa atgtgggttc tgcctgggat ggcatgcatt 180 taattacagg acaggatagt aagaaaacaa ataaaatttg tttaaatgaa ttcacgtcag 240 acacagtttt tcacaaaatt ttaataattt ttttcaagat ttgatagtca tgattttaaa 300 caagaaatag agaggatgaa aaatgaatgc agaatagaca gagagtacga ttgcatcttt 360 tttgatgtga aggatgtgat tatgagtttt aaacagagta aaatagggaa ggcacctggg 420 ccggataara taggaggtag attgcttaag gaatgtgctg agcagttggg acccatattt 480 tatgaaattt ttgttttatc ctttcaacaa caaaaggtac ctgcactgtg gaarcagtct 540 acagttattc ctgtgccaaa aaaacagaag ccacagaaga tagaggactt tagacctgtg 600 gcacttacat ctttagttat gaaaagtttt gaaargatag ttaagaggga aataacgcaa 660 aagactcaga gtgttctaga tccctttcag tttgcgtata ggcctawgag aggggtggag 720 gatgcagtag ggactctgct gcacttagtt cttaagcatc ttgataagcc taaaacattg 780 gccaagttac tttttattga cttttcgtct gcattcaata ccattcagcc acatatttta 840 gttgagaaac ttattaatga atttgagttg gatcgaatgc ttgtaggttg gattttagac 900 tttcttgttc agaggacaca gagggttaag gtaaataatt gtttctctga atggttggta 960 tcatctacrg ggtcacctca aggttgtgtc ctttctcctg tactatttat tttgtacaca 1020 aatagttgta ggagtatttt tagtaatagg tacattttaa aatttgctga cgattctgta 1080 attgttagtt tgttaagtga ggaagactcc acgcacggtc ctgtaattga ttatttttcc 1140 agctggtgtc aggaggcttt cttacaagtg aatgttctta agactaaaga gatgaccatt 1200 gattttaatc gtaagaggag actgattaag ggtaagcaaa ctagtattaa tggagtggrt 1260 gttgagagtg ttgatcagta taggtatytg ggaactgttt tagataataa attgtccttt 1320 tctgytaata ctaatatgtt gtgtaagaag gggcaacaaa gaytatactg tttacagaaa 1380 ttgagatctt ttrrtataga taaatcactc atgtgcatgt tttacagttc atacattcag 1440 tcactgttat ctttttcttt gctttgttgg ttcaattctc taagtgtgaa aaataagagt 1500 tgtttggaga aaattgtgaa cctaggagga aaaataacag ggagaaaaca gatgaccatg 1560 gctcagttat accatagaca agttttgagt aaagctcaga gtattctgtc agatgattcc 1620 catcctatgc accacgagtt ttatcttctc ccttctggta ttcgatacag aactcctatt 1680 agtaaaacaa atcggtttaa actttcattc atyccctcag ctgtaaagtt tttaaattta 1740 gggtagaatt tgtattgtat aatgtacttt tgtgtattat gtgtgacatg cttgtatgct 1800 gcaaccaaat tgccttcggg aaatgaataa agactgactg actgactgac tctctga 1857 // ID Harbinger-N9_DR repbase; DNA; ZEB; 643 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Harbinger-N9_DR is a non-autonomous DNA transposon that belongs DE to the Harbinger superfamily. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger superfamily; Harbinger-N9_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-643 RA Kapitonov V.V. and Jurka J.; RT "Harbinger-N9_DR, a nonautonomous DNA transposon from RT zebrafish."; RL Repbase Reports 4(10), 281-281 (2004). XX DR [1] (Consensus) XX CC Harbinger-N9_DR is a nonautonomous DNA transposon that is a CC member of the Harbinger superfamily. Its is characterized by CC 54-bp terminal inverted repeats (12 mismatches) and the TWA CC target site duplications. XX SQ Sequence 643 BP; 166 A; 150 C; 157 G; 169 T; 1 other; aggctcctac acaccgggac gcttttcgtt tgcgtttatc gtcagcgttt tacgagacgt 60 ttttccgtat tcaaacccaa gcgattttca ctggcgtcaa gcagaagagt atgcaaaatc 120 actccttgac gttagatggc gctacacaac tttaagcttc tgacacccgc ttgtaacaca 180 gaagaagagg aagagaggaa gttcacacgc ttgttgttaa agttactggt gactcgaaca 240 agcatggatg gttcaagcag cagctccagc tctagcgatg aagaaatgat tattgttcac 300 cagagcaata agcagctcca cgttcatatc gcctctgggc atcttcttca atgtgcgctc 360 gcattgacag ttttgcgctt gagcgcctcc aagtgctgtt tactgtaact tcagcagctc 420 cgtgcacgtg aacaaaagtg ccaatctgat tggtggagaa ggttttgacg cggcgcgtca 480 aaacaaaaaa acgagcatga ggcgtttttt ttaaaatgac gcttttgcgc ctgscgtttt 540 gcgcgttggt gtgcacgatc acattgacgc cctttattta gtcacgaggc gttaaacgtt 600 gacggaaaac gcgagcaaaa aacgtctcgg tgtgcacggg cct 643 // ID Gypsy102-LTR_DR repbase; DNA; ZEB; 837 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy102-I_DR; KW Gypsy102-LTR_DR; Gypsy102_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-837 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1514-1514 (2008). XX DR Genome; chr19; Positions 23305692 23304856. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 837 BP; 229 A; 127 C; 165 G; 316 T; 0 other; tgtaacagag ttattttggt aatattattt tcactcaaaa tccttttttc tattgagtat 60 aactttaatt tacttcattt gatattatta tatattattt tatcgttttg agttcatgct 120 tgtatgtgac tttcaggacc tgttcaagtg tactgtcact ttaagagtgt atactcgttc 180 gtgtgcgcgc tcatcgctct ctcagttcac ttccttctga gctgagagaa ggttgtctcg 240 tggttaatgg agacattaaa cggtaatatt attgacggtt gattgtatat tttaaatcta 300 aaatgttggt tgcttcgcta aacattctct gtgatgtgga aatatgaaac tgtggttcta 360 aatagttgtg attgagatcg tttgcatttg tgagtgagat tagcacatat tagcatgctg 420 taattactcc tcgtgtgcca acaggttgtt taatgaagat cataataaca gaatgcatgt 480 tttcaacaca ttttccattt gtgcaggtat gtttttgtat tgttggtagt ttttgtgatt 540 tcatgtcatt aatattaatt tagtcttatg ttaagcagat tagtttaata catgttcttt 600 caagaaatcg ctatctcagt tcacttcctt ctgagctgag agaaggttgt ctcgtggtta 660 atggagacat taaacggttg tttaatgaag atcataatca cagaatgcat gttttcaaca 720 cattttccat ttgtgcagaa aataaagaca gtgccaagcg gtaagaggtt tggagtcctt 780 tattgaaaaa caacaacaca acgcaggacg gattccactc gtgtgggacc cgttaca 837 // ID LTR3_DR repbase; DNA; ZEB; 784 BP. XX AC . XX DT 31-MAY-2005 (Rel. 10.05, Created) DT 31-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE Long terminal repeat from some endogenous retrovirus - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW Interspersed repeat; endogenous retrovirus; LTR3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-784 RA Kapitonov V.V. and Jurka J.; RT "LTR3_DR, a family of zebrafish long terminal repeats."; RL Repbase Reports 5(5), 121-121 (2005). XX DR [1] (Consensus) XX CC LTR3_DR is a long terminal repeat of an unidentified endogenous CC retrovirus. LTR3_DR solo-copies are flanked by 5-bp target site CC duplications. LTR3_DR copies are over 90% identical with the CC consensus sequence. XX SQ Sequence 784 BP; 336 A; 138 C; 138 G; 171 T; 1 other; tgttacgtgt tttatgatgt taaagtggga tgctgtaaca tttaagtgag acaggtgtaa 60 tgtgtgcgac agatcaataa tgaggcaagg ttcagttact cttatttaaa gcttttactc 120 aataattgaa ttgaaacatg taacaataat aaatggggga gaaaaaaaac aaataaacat 180 ttataattat tatctttaaa caaaaacaac gagtgtaagc caaatcaaag aaattaacaa 240 aacaaaacaa tccttaaatc taatgtctaa actaggcgtc aaaaagaaac acaaataaaa 300 accgagatct tcagtgtata cgacacccga actaaactaa ttaaactaca aaactcaaaa 360 atacacgggt ggcaaaacac tcaaactaat ctaacaaaac atcaatctaa actaaaacaa 420 aacaaaccgg ataagtccgy taaatcttag tttagaatca cacaaccacc atcatattat 480 gttacttaaa gccaacgaag ttcaacaagc aaactcgaat tatcacagta actgagataa 540 atcacaaaat agacgaagca tttgcacaaa caaattgcgt ggagcacaaa gggtacgccg 600 aggcagggcg cacaccacac actggtctga cagggcttta aatccttgcg gctcgtcctg 660 ggattggtgg aaccggagag cgtcataatg atgatggaca gaataataaa ccaataagaa 720 acctaagggt tgagcaaacg gagaggaggg tgaaaaaaaa ggaaaaaaaa cataaatcat 780 aaca 784 // ID Gypsy85-LTR_DR repbase; DNA; ZEB; 1015 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy85-I_DR; KW Gypsy85-LTR_DR; Gypsy85_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1015 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1482-1482 (2008). XX DR Genome; chr1; Positions 25812545 25811531. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1015 BP; 288 A; 143 C; 193 G; 391 T; 0 other; tgtggcgtgt gtgtgttgtt gcactagatt attggttcct tttttttatt atcagaaggg 60 tattttattt tgaaaggcgg tttaaaggaa atagacatga caaacaatta attactttcc 120 ttttacttca atattttgcg ttcttcttat ttcactgttt aaagtttcta tagtattctc 180 tttacatcca tgtgaaaaat aatataggta tttataagcg agtatgaccg cgtaggcgcg 240 agattatagc tgacgtagtt tccggtgcgc gctcactttg tgagatggca tccaacagac 300 aggtgcgttt ttcggctcct cttataaatg cctatattgt tacattaata tggatgttta 360 gattattcag tatcttctgt aaggttaact attgtatcat tattgtgagg gaaacagatt 420 tggatcatat ttgagtgtga agcgttttat ataagttaag tttaatgagc acaatggcta 480 gtccaaatta aattctgtgt atgtttatga ttttttttct ttttctttca ttacatataa 540 attatatatg taaaaggtga attagggctt taaatttaat cgggttgaca ctttacattt 600 atttctgaaa gggtttgtgt agcatatctg cattttccta tgaaaatgag aacaaatgag 660 tttaaaaatt ttgtactgat caacagacag acattgccgt gttgtgaaga tttgttggac 720 tgatgtctgt tatattcttt atgatacacg tatgttacaa atcacaatct gatgtatttt 780 atgtattgtt tttgtttatt tttaacatgg ttattgtgct catgcactca ctttgtgaga 840 tggcatccaa cagacagaca gacattgccg tgttgtgaag atttgttgga ctgatgtctg 900 ttatattctt tatgatacag acagagagac aagaataaac tctggttcga tcacattgct 960 gtatggatcc tttcatttaa accatcaaac acaacgcaga aggacatcca ttaca 1015 // ID ENSPM-2N_DR repbase; DNA; ZEB; 1538 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 27-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW ENSPM-2N_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1538 RA Jurka J.; RT "EnSpm-type families from zebrafish."; RL Repbase Reports 8(9), 930-930 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1538 BP; 464 A; 276 C; 292 G; 506 T; 0 other; aagccctggt cagatcacac gaatttgtca cgatttcggc acgatttcct tgacgtaggg 60 tgtcgtgggg attcgtaaac gacaaatggg catcgtgaca caagattcaa tcgtttcttc 120 ttgtgtaatg tggcatagtt cacgacaacc gataccatgc ctgcgacgcc tcacgacacc 180 atcgcagaaa gtctagcatg tttaattttt ttctcgttct tcacgacgag ttccgtcacg 240 tgggtgacac tgtgcaatag gagcacataa tttctcggtt atatcagcgg gtgctgccgt 300 taatgcgctg gtcaggtaat caaaaaatag gctgcatatt tacttatggg tgggtcacgg 360 gtggataaga tcaatcattg gttatgcaaa ctttaactac cttttctaaa atattaaggt 420 gttttaagca attttatctg acagactggc acaaatatgc agacatctcc attctcacaa 480 cgagtttctg agttctctca tccttccaac agagtttgtt cttccgaggt aatctgggaa 540 catgttcagt gctgcgaata cattgcaaaa gcatttcctg cttattcaaa tttgtcatta 600 atagtgaaaa cagtttaata taggcttctc tgaaactgtg aatatttcac tttaatttta 660 tttaggctag attattattt tatttaacaa accaacacaa tctcacggca attcgtgttt 720 tggcgcatat agcggacttg ttttgaagcg cttcacctca aatgttattc gttattcttg 780 tttatcggat aactgaccaa caaaagatac attaaaatta agataaaaat tatagcaaac 840 gaaattcgtt tcaaaaagat atttttttcc aagaaaaata tacgaattat atttttgttt 900 gtgcccctcc actactcaac tatacgggtg gatgatttgt ttcgctctga ggaaaataag 960 gatttaatga atgcttcact gtgattgttg tatcacaaat ctcagttata tatttgctaa 1020 tttttttaat atgtcattat tttaaagatt atgtcttgag catctgattt ttccttgctg 1080 ctgcttatgt gctttcattt tctccgtctc actcgcggca agttcaggcg agggagtgtt 1140 taataagcaa cccatggcag gaaaatatac atttagattt caaagaaaat atcttaatat 1200 taaaaaggaa tcataagttt gatcctatcc atgtcgtaac gaatgccctt aaataatgta 1260 gttattagtt gtcagcaagc atcagtgttt attgagattg aaataagatt aaaaaaaatt 1320 gaaattgaga taagaaatat tgttatagca acacttgtga ctgctttggg aaataacgct 1380 cgtcgtagac gtcatgggat gactgatcgt gaaggaacgt gtagtctggc acatttgtta 1440 cccacgacaa gtcaagattt tatcaagaaa aaaatccgta atctgacata gtgactttcg 1500 taaccgacaa taaaaatcgt gtgatctgac cggggctt 1538 // ID DNA-1-11_DR repbase; DNA; ZEB; 724 BP. XX AC . XX DT 23-AUG-2010 (Rel. 15.11, Created) DT 23-AUG-2010 (Rel. 15.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 1-bp TSD; KW DNA-1-11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-724 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 10(11), 1866-1866 (2010). XX DR [1] (Consensus) XX CC TSD is most likely 1-bp long. target site is CANTG. TIR is ~50-bp CC long. XX SQ Sequence 724 BP; 200 A; 137 C; 144 G; 243 T; 0 other; tgactacgtt tacatggaca tcagtaatca aattatttgc cttaatctga ataagacaat 60 aatatgatta aggtgtttac atgagttgct ttttgaatgt tcctttcatg atcccgtttt 120 acatgttata gcacataatt cgattaacgt cattgcgtca ccacgctatc cacatttcct 180 ccggagtttc atgtaatttc gggtgtttca tttttaattt gtcgacttta actgcagttt 240 ggcactttca ctttcattca ggaacatttc atgcatgccc ccgtgacaaa cgagatattg 300 gatgcgagta tgaactgctg gaagagtgtt gttttaatgg aatttgatac cgcacgccgt 360 atggggaaaa aaaaaacctc cgcatttcgc gatgcaggtg tctgtggtcc ttcactgact 420 cggtaggtgc agagaatagt gtcaaacagc cgtgtgtgtg gactatcctg tcgcaaaatg 480 cggcgaaaag tcctacacga cggtaatagt ttgattaagg tgtttacatg tctgtactgc 540 acttcaataa tgcgactaaa atcggcatac tccacatgtc ttaattcgat ttctgtttag 600 ttcgattatg accttaatcg gattaaatta atcaaaaatc gctgtttaca tggtagactc 660 ttaatcagag tattgtctta atcgtattaa aatcggatta ttggtgtcca tgtaaacgta 720 ctca 724 // ID EnSpm-N5_DR repbase; DNA; ZEB; 290 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE EnSpm-N5_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW hairpin structure; EnSpm-N5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-290 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(10), 1265-1265 (2008). XX DR [1] (Consensus) XX CC TSD is 2bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 290 BP; 97 A; 47 C; 44 G; 102 T; 0 other; cactgtaaaa aaaaatctgt aattttacgg tttatttccg gcagctgggg tgccggaaaa 60 aaaacgtaaa ataacggccg ttaaattaca gaaatttacc gtaaaataac ggacattaaa 120 ttacagaaat ttccttaaat tttaatttaa atttctggta aatttctgta attcaacctc 180 tgttatttta cagtaaattt ctgtaattta acggccgtta ttttacgttt ttttttcggc 240 accccagctg ccggaaataa accgtaaaat tacggatttt ttttacagtg 290 // ID DNA-8-36_DR repbase; DNA; ZEB; 588 BP. XX AC . XX DT 07-APR-2009 (Rel. 14.04, Created) DT 07-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE A non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-36_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-588 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(4), 858-858 (2009). XX DR [1] (Consensus) XX SQ Sequence 588 BP; 190 A; 79 C; 68 G; 251 T; 0 other; caggggcgga tttagtgatt ttggggccct aagcaattcc agccatgggg cccaagtcct 60 gaaatgcacc ctttctactt ttatttaatt tttatttata ttgctttttt tatatcactc 120 aatcttcttt tcgacatttt aaattaatgc aatctctaca ttttaaatga ttagttttct 180 taaaatgaat ttacaactaa aaataataaa tatatatata tatatatata tatatatata 240 tatatatata tatatatata attattatta ttattattat ttttatttat atataaattt 300 atttgactgc tggactgctc catgatggat attttatttt gtaatattat tattattaca 360 taacattaca ttattattaa taatattata tttgattgta tattaaataa tatttaatta 420 cgttaaaatt gtatttgttt aactctcacc atacaaaata tggtagaaaa caatgttata 480 tatagtttat aggtataatt tatacttcta gatatttatt gggggccccc agatttcctg 540 gggccctaag cggccgctta cctcgcttat tggttaaatc cgcccctg 588 // ID CR1-28_DR repbase; DNA; ZEB; 3343 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 02-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-28_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3343 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2125-2125 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 4..3090 FT /product="CR1-28_DR_1p" FT /translation="MALLCWRRGKGCLLYFTCLSLLYLSDMLVFSFLFIYW FT TLKLLRGGRDGNCWSYRSDSSELINKKLLVVSWLILTSCVAQGVLDKIQRY FT SRTELLDLRSCGGLSPTRFEMDLPEELLRQPRRVSGRVRKRGGRGGVRQRV FT RRASKLPLPPVILCNPRSLRNKLDELRTQAGACFEYRESGLMVFTETWLCN FT EVPESLIQVDGFSHIRLDRDENSGKTRGGGVCVYINDRWCRNFAVRESICN FT PDLELLCVTLRPFYLPREFTNIFICAVYIPPSGNAIRAANQIADCVHKHLQ FT NKPDAPFLILGDFNQCGLEKSLPGFFQYVKCGTRKKNILDKCYGNIKDAYV FT AKARPPLGNSDHNVIHLLPTYRSVFKTSKPTTKVMKTWSNESIEELKGCFL FT CTDWDMFYQDANIDTVTETITAYISFCVDSIIPQRTIKCYPNSKPYITGGI FT KSCIKRKKLAFRAGDTVGLRAAQKDLNRELRTARIKYKEQTEQHLLKSNTK FT ELWNSIRRMTNMTPKRKALFACNESVKANELNDFYMRFETDNLKKCEEILG FT EVNCNMAEKRRLIHQGDVTKVFKSLCSSKAMGPDGITCFLLKTFAEELTPA FT WHRLFQLSVDTHTVPKLWKKSVIIPIPKKPCPQDNNDYRPVALTSNVMKSF FT EKIIIKELRKEVEPYLDQYQFAYKSNRGTIDAISTLTHLVLKHLESPVAYA FT RLLFIDFSSAFNSIQPHKLLEKLVKMQVNPFVIKWFYSFLTDRTQQVKFNS FT GLSDVATSSTGAPQGCVSSPFLFTLYTNDCVRQQTNQYILKFSDDTVLLHL FT FTKDASIMEYRAAVEWFVSWCDDHNLEINVKKTEEVIIDPRSVSDRSSIVV FT YDKDIKQVDSYKYLGVNIDXELNWHSQVSSLCARIHQRLHFLRRLRLFGVS FT SSIMMTFYKATIESILRYGIIVWFGNLTVKMKTQVNNLMRVAGKIMGIKTL FT GSLKDIFELCTIQQANKILSDSSHVLSVEYELMNSGRRYRVPLCKHNRYKH FT SFVPFSIKLINEQXRKDGQRLR*" XX SQ Sequence 3343 BP; 1048 A; 571 C; 737 G; 982 T; 5 other; aacatggcgc tgctctgctg gcgacgtgga aaggggtgct tgctgtactt tacttgtttg 60 tctttgttgt acctgtcaga tatgttggtt tttagttttt tattcatata ttggacactt 120 aaactcctgc gcggaggcag agatggaaac tgctggtcgt ataggagtga ttcgagtgaa 180 ttgataaaca aaaaactgct tgtcgtgtca tggctaattc tgactagctg tgtggcgcaa 240 ggagttttgg acaaaattca gcgatatagc aggacggagc ttttggattt gcggagttgc 300 ggaggactgt cacccaccag atttgagatg gatttacctg aggaactact gaggcagccg 360 aggcgcgttt cgggtcgggt caggaagcga ggcgggagag gcggcgtcag acagcgggtc 420 cggagagctt ctaaacttcc tttgccgcca gtgattcttt gtaatccacg ctcactgagg 480 aacaaactgg atgaacttcg cacgcaagcg ggagcttgct tcgagtaccg tgagtctggt 540 ctcatggttt tcactgaaac atggttgtgt aatgaagtac cggaaagctt aatacaagtc 600 gacggttttt cccatattcg cttggacaga gacgagaact ccggtaagac aagaggggga 660 ggggtctgtg tctatatcaa cgatcgttgg tgtcgaaatt ttgcggtaag agaaagcatc 720 tgcaaccctg acctggaact attgtgcgtc acattacgtc cgttctattt gccaagggaa 780 tttactaaca tatttatttg tgcagtttat atcccaccta gtgggaatgc tatcagagct 840 gcaaaccaaa tagctgactg tgttcataag cacttacaaa acaaacctga tgcacccttt 900 ttgattcttg gtgattttaa tcagtgtgga ctggaraaat ccttaccagg tttttttcaa 960 tatgttaagt gtggaacaag gaaaaagaat attcttgaca agtgttatgg caatattaag 1020 gatgcttatg ttgccaaagc cagacccccc cttggtaact cggaccataa cgttatccat 1080 ctccttccta cttatcggtc tgtgttcaaa acaagtaaac caacgactaa ggtcatgaag 1140 acatggtcaa atgaaagtat tgaggagcta aagggatgtt tcctctgcac agactgggac 1200 atgttttatc aggatgcaaa tatagatact gttacagaaa ctatcacagc atatatttct 1260 ttctgtgtgg attctatcat tccacagaga accattaagt gttatccgaa cagcaaacca 1320 tacataacag ggggaattaa aagctgtatt aaaaggaaga aattggcctt tagggcaggt 1380 gataccgtgg ggcttagagc agcacaaaaa gaccttaatc gggaactaag gactgcaagg 1440 ataaaatata aagaacagac tgaacaacat ttgttaaagt ccaatacaaa agaattatgg 1500 aactcaattc ggagaatgac aaatatgaca ccaaaaagaa aggcattatt cgcctgtaac 1560 gaatctgtta aggcaaatga attgaatgat ttctatatga gatttgagac agataatttg 1620 aaaaaatgtg aagaaatttt aggggaagta aattgcaaca tggctgaaaa aaggaggtta 1680 atacatcaag gagacgtgac taaggttttt aaatctttat gttctagtaa ggcaatggga 1740 cctgatggta ttacatgctt tcttctaaaa acatttgctg aggagcttac accagcctgg 1800 catcgcctgt ttcaactctc agtggataca catacagtac caaagctctg gaaaaaatct 1860 gtgataattc caatcccaaa aaagccttgt ccacaagaca ataatgatta taggccagtg 1920 gccctaacat caaatgtcat gaaatcattt gagaagatta taataaaaga attgcgcaag 1980 gaagttgagc cgtacttaga ccagtatcaa tttgcataca aaagtaaccg tggcactatt 2040 gatgcaattt ccacattaac acatcttgtt ttaaaacatc tagagagccc agttgcttat 2100 gctaggttac tttttatcga ttttagttca gcatttaatt caattcaacc acataaactt 2160 ttggaaaagc tggtaaaaat gcaagtgaac ccttttgtta ttaaatggtt ttattctttt 2220 ttaacagaca ggacacagca ggtaaaattt aattcagggt tatctgatgt agcaacaagt 2280 agtacagggg cgccacaagg gtgtgtaagt tcaccctttt tgttcaccct ctatacaaac 2340 gactgcgtta ggcagcagac aaaccaatat attcttaaat tttccgatga cactgtacta 2400 ttgcaccttt tcacaaaaga tgccagcatc atggaataca gagcggctgt agaatggttt 2460 gtcagctggt gtgatgacca caatttggaa attaatgtta agaaaacgga ggaggtcatc 2520 atagacccca ggtcagtaag cgataggagc tctattgtgg tttatgataa agacattaag 2580 caggttgatt catataaata tttaggagtg aatattgatm gtgaacttaa ttggcactct 2640 caggtgtcaa gtctttgtgc aagaattcat cagcgattac actttttacg cagactaaga 2700 ttatttggag tgagtagcag cataatgatg actttttata aagcaacaat agagtctatt 2760 ttaagatatg ggattattgt ttggtttggg aatctgacag taaaaatgaa aacccaagta 2820 aataacctga tgagagtggc aggaaaaatc atgggaatta agacacttgg ttcactaaaa 2880 gacatctttg agttgtgcac aatccaacaa gccaacaaga ttttatctga ttcttcacat 2940 gtgctgtctg tggaatatga gctgatgaat tctggcagaa gatatagggt cccgttgtgc 3000 aagcataata gatataaaca ttcttttgtt cctttttcaa ttaagttgat caacgagcaa 3060 gyaaggaagg atggtcaaag actaagataa caggattggt ttgcacaaca gagggcagtg 3120 gggcatatgg tactttaatt attttatttt tttttataaa aattttatta aaattactga 3180 ttttatggaa tcrtatttat attaggaggt attttaggat tagattttaa tgtgtatgta 3240 tgttacatgt tgtcttgttg gtgtckatgt acagcagaac cttttttcta aaacaaattt 3300 ctcctacggg agacaataaa ttttaccttg accttgacct tga 3343 // ID BEL10-LTR_DR repbase; DNA; ZEB; 327 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL10-I_DR; KW BEL10-LTR_DR; BEL10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-327 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1370-1370 (2008). XX DR Genome; chr18; Positions 45434931 45435257. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 327 BP; 86 A; 83 C; 58 G; 100 T; 0 other; tgtagaaaat ctttgacgtg atccctgaag ccctgcccac tgttggtaca aacctctctg 60 atacacgtac gtactgaagg cctaccacat ggttcacagc tgcaccctat tacccaaatc 120 atcagtccct ttaaatagtg tggcttcagt gtcacagtat gctccagtct ttcactgctc 180 tggaccagtc ttataccatg tggagagaat gtactggcat tggtaaataa aactccaaga 240 aaattgatca tgcttcaagt gttttgactg acactccatt gtgatactcc tcctgcgagt 300 cagcattacc tttctaattt ttttaca 327 // ID HERO-2_DR repbase; DNA; ZEB; 3639 BP. XX AC . XX DT 26-MAY-2009 (Rel. 14.06, Created) DT 26-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE HERO-2_DR is a family of HERO non-LTR retrotransposons - a DE consensus. XX KW Hero; Non-LTR Retrotransposon; Transposable Element; HERO-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bouneau L., Fischer C., Ozouf-Costaz C., Froschauer A., RA Jaillon O., Coutanceau J.P., Korting C., Weissenbach J. et al.; RT "An active non-LTR retrotransposon with tandem structure in the RT compact genome of the pufferfish Tetraodon nigroviridis."; RL Genome Res 13(7), 1686-1695 (2003). XX RN [2] RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol Biol Evol 21(2), 207-217 (2004). XX RN [3] RP 1-3639 RA Kapitonov V.V. and Jurka J.; RT "HERO non-LTR retrotransposons from the lancelet and zebrafish RT genomes."; RL Repbase Reports 9(6), 1170-1170 (2009). XX DR [3] (Consensus) XX CC This is a young family of HERO non-LTR retrotransposons. The CC consensus sequence was built based on multiple alignment of CC several copies less than 2% divergent from each other. XX FH Key Location/Qualifiers FT CDS 252..3512 FT /product="HERO-2_DR_1p" FT /note="Contains the RT and REL domains." FT /translation="MTHANEQTTNKIYVTCICGKLCKNHWGLKIHQARMKC FT LEQESKVQRTGPEPGETQEEPGPEATHRAKSLHVPEPQTPSEVVQQRIKWP FT PASKGSEWLQFDEDVSNIIQAIAKGDADSRLKTMTTIIFSYALERFGCIEK FT GKTKPTTPYTMNRRATQIHHLRQELRSLKKLYKKATDEEKQPLAELKNILR FT KKLMILRRAEWHRRRGRERARKRAAFITNPFGFTKQLLGDKRSGRLECSIE FT EVNRFIEETVSDPLREQELEPNKALISPTPPAREFSLRGPSLKEVKEIIKA FT SRSASTPGPSGIPYLVYKRCPGLLLHLWKILKVIWQRGRVAEQWRCAEGVW FT IPKEENSKNINQFRIISLLSVEGKVFFSIVSRRLTEFLLENNYIDPSVQKG FT GIPGAPGCLEHTGVVTQLIREAHENRGDLVVLWLDLANAYGSIPHKLVELA FT LHRHHVPSKIKDLILDYYNNFKMRVTSGSETSSWHRIGKGIITGCTISVIL FT FALAMNMVVKSAEVECRGPLTKSGVRQPPIRAYMDDLTITTTTVPGSRWIL FT QGLERLIAWARMSFKPSKSRSMVLKKGKVVDKFHFSISGSVIPTITEQPVK FT SLGKLFDSSLKDSAAIQKSKKELGAWLAKVDKSGLPGRFKAWIYQHSILPR FT VLWPLLIYAVPMSTVESLERKISGFLRKWLGLPRSLTSAALYGTSNTLQLP FT FSGLTEEFMVVRTREALQYRDSRDGKVSSACIEVRTGRKWNAGKAVEVAES FT RLQQKALVGTVATGRAGLGYFPKTLVSQVKGKERHHLLQGEVRASVEEERV FT SRVVGLRQQGAWTRWNTLQRRITWANILQADFQRVRFLVQAVYDVLPSPSN FT LHVWGKNETPSCLLCSGRGSLEHLLSSCPKALADGRYRWRHDQVLKAIAAS FT LASAINTSKNHRAPRKAVHFIKAGEKPRALPQLTTGLLHKASDWQLEVDLG FT KQLRFPHHIAATRLRPDIIAISEASRQLIILELTVPWEERIEEANERKRAK FT YQELVEECRERGWRTYYEPIEIGCRGFAGRSLCKVLSRLGITGVAKKRAIR FT SASEAAEKATRWLWIKRADPWTAVGTQVGT" XX SQ Sequence 3639 BP; 1029 A; 872 C; 992 G; 746 T; 0 other; ttcaagcctg gcgcagccag tgactcctag gaatagacta ggtggcaacc aagaatagtt 60 tggtcgacta ctggagagac agttgacggc acggaaagac ggcacttggg acagtatggg 120 ttagcacccc agcctgtgtc tttcgtgaga gagaacccaa acaagctacg gaaagcccca 180 cagagatata cccccaggag atcccgagag ggggggagga tgagatctcc aatcggacgg 240 atcaaaggtt aatgacccat gcaaacgaac agacgacgaa caaaatatat gtgacatgca 300 tttgcggaaa gctgtgtaag aaccattggg gcctaaaaat ccatcaggcc agaatgaaat 360 gtttggagca ggagagtaag gtgcaacgca caggtcctga acctggtgag acgcaggagg 420 agcccggccc ggaggcaacc cacagagcca agtccctcca tgtaccagag cctcaaactc 480 caagcgaagt agttcaacag cggattaaat ggcccccagc cagcaaagga agtgagtggc 540 tgcagttcga tgaagatgtg tccaacatca ttcaagccat agccaaagga gatgcagata 600 gccgactcaa aacgatgact accatcatct tcagctatgc tctagaaaga ttcggttgca 660 tagagaaagg aaagaccaag cccaccaccc cctacactat gaaccgtagg gctacccaga 720 tacatcacct gcgtcaggag cttcgctccc tcaagaaact gtataagaaa gctacggatg 780 aggagaagca accattagcg gagttgaaaa acattttgcg gaagaagctg atgatcctac 840 gcagggcaga gtggcatcgg agacgagggc gagagagagc caggaagcga gctgccttca 900 tcaccaatcc ctttggcttc acaaaacagc tgctcgggga caagcggagc ggtcgacttg 960 aatgctcaat agaggaagtg aatcgcttca ttgaggaaac agtgagtgat ccactgagag 1020 agcaggagct ggagcccaac aaagctctta tcagccccac ccctccagca agagagttca 1080 gtttgagggg gccaagtctg aaggaggtca aggaaatcat taaggcatct cgctcagcat 1140 ctactccagg ccctagtggc ataccttacc ttgtctataa gcgctgccca gggcttctcc 1200 tgcatctgtg gaagatcttg aaggtgattt ggcaacgagg aagagttgct gagcagtgga 1260 ggtgtgccga gggagtgtgg attcctaaag aggaaaactc gaaaaacatc aaccagtttc 1320 gaatcatctc tctattgagt gttgaaggga aggtgttttt cagcatcgtc tcacgaagac 1380 tgacagagtt cctcctcgag aacaattata ttgacccttc agtgcagaag ggagggattc 1440 ctggagctcc cggctgcttg gaacacactg gagtagttac acaactcatc agagaggccc 1500 atgagaacag aggggacttg gttgtcttgt ggttggactt ggcaaatgcc tatgggtcca 1560 taccccacaa gctggttgag ctcgctctac accgccacca cgttcctagt aagattaagg 1620 acctaattct ggattactac aataatttca agatgcgggt cacatctggg tcagaaacat 1680 caagctggca tcgcatcggg aaaggaataa taacaggctg caccatctca gttattcttt 1740 tcgctctcgc catgaacatg gtggtcaagt cagccgaagt ggaatgcaga gggcccttaa 1800 ctaagtcagg tgtgcgacag ccccctatta gagcatatat ggatgacctt accatcacaa 1860 caacaacggt cccagggagc aggtggatct tacaaggact tgagagactc atcgcctggg 1920 ctagaatgag ttttaagccc tccaagtcta ggtccatggt gctgaagaag gggaaagtgg 1980 ttgacaagtt ccatttttcc atctcaggaa gtgtcatccc aaccatcacg gagcaacctg 2040 tcaagagttt ggggaagctc tttgactcca gcctaaaaga ctctgcagcc atccagaagt 2100 ccaaaaaaga acttggagct tggctggcga aggttgacaa atccggcctg cctggtagat 2160 tcaaagcctg gatctatcag cattcaattc tgccccgagt tttgtggcct ctgctgatct 2220 atgcagtccc aatgtcaaca gttgagtccc tagaaaggaa gatcagtggc tttcttcgaa 2280 aatggttggg cctcccacgc agtcttacca gtgctgcact atacgggaca agtaacacct 2340 tgcagctacc attcagtggc ctcacagagg aattcatggt tgtacgcacc agagaagccc 2400 tacagtacag ggactctaga gatggcaagg tgtcatcagc ctgcatcgag gtgaggacag 2460 gcaggaaatg gaatgcaggg aaagcagtgg aggtggcaga gtcacgcctg caacaaaagg 2520 ctctggtggg cactgtagcg acaggcagag cgggcttggg ctattttcca aagaccttag 2580 taagccaggt caaaggcaag gaaagacacc acctactcca gggagaggtt cgagcaagtg 2640 tggaggaaga gagagtcagt agggtggtag gactccggca gcagggagca tggactaggt 2700 ggaatacact gcaacgtagg atcacctggg cgaacatctt gcaggcggat ttccaacgtg 2760 tccgtttcct agtacaagct gtctacgatg tactgccaag cccatcaaac ctccacgttt 2820 ggggaaagaa tgagacacct tcctgccttc tttgctctgg aagaggctct ctagaacatc 2880 tcctcagcag ttgccccaag gctctggctg atggtcgcta tcgttggcgc catgaccagg 2940 tgcttaaggc aattgctgcg agcttagctt cagccattaa cacgagcaag aaccatcgtg 3000 ctccaaggaa ggcagtccac ttcatcaaag ctggagaaaa accccgggcc ctcccacaat 3060 taacaacagg cctccttcac aaagcctcgg actggcagct ggaggtcgac ctgggaaaac 3120 agctgaggtt tcctcatcac atcgctgcaa cacgtctccg tccagacatt atagctatct 3180 cagaagcttc aagacagcta attattctgg agcttacagt gccgtgggaa gagcgtattg 3240 aagaagcaaa tgagaggaag cgcgctaagt accaggaatt agtggaggag tgcagggaga 3300 gaggctggag aacttactat gagcccatag aaattggatg cagaggcttt gcagggcgtt 3360 cactttgcaa agtcctcagt cgtttgggca ttacaggcgt ggcgaagaaa agggccattc 3420 gatccgcaag cgaagccgca gagaaggcca caaggtggct gtggattaag agggcagatc 3480 cgtggactgc tgttgggaca caagtcggga cttgatcaac cccggctggg tcacctgggt 3540 gagagtgtat gatgttgaga gacccgaaac actcaatgat cccaggatac atcactgatg 3600 atgtgtccca aatgcatcca tgagatgttt cttgcataa 3639 // ID hAT-N38_DR repbase; DNA; ZEB; 1029 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N38_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1029 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1308-1308 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1029 BP; 307 A; 195 C; 182 G; 345 T; 0 other; catacctgtc aaccctcccg tttttcccgg gattctcccg tattttacag ttctatcccg 60 ctatcatccc gtaaaggtat tttcccgtat ttctcccgta ttttcagtct ttctctgaag 120 ggtggcaaat aaacattaaa gagccgagcc tccctatacg caacccatac cgccgaacca 180 ccaggggccg ccccttgctc ttaaatgtga gtctgttctg tgctttcgct ttgtttaggc 240 atgaaaacac tttgaaataa atataaaaac ggcgcgattc cctttccttt tcattacagg 300 tgccgtctcc ccttcatatg caatcctcaa aacagtcata taacggtgca tgactgtcag 360 ctgacgcgct ccatagtgat aaatagacgc tgtttcccaa acggattctg tacacgcgat 420 ttgaagcgga gcgaaagtct gggttttggg tgcgtgttta aacagacata tacacaatta 480 ataataataa taataataac atctaaagat gtcgatcttg gtggggtttt ttttttttca 540 aacacaacta atttcgtcct aaatgagcat aaaaagttag gaaagcagtc gaatgctttc 600 attataatgt tagatgtgtg catttttaaa gtgacacctc tgttatttaa tgtaatcaaa 660 cgaaaaaaaa aatctaaata aataagagat tcattcgttg ctctttaatc agaatgtgta 720 agttatacag tgaagaaacg gctaatgaga tttgatagct tgattctatt tcattataat 780 agctattaat tttaataagc tgaactgttt gctaatgtat ttgctgctaa tgtatactat 840 ttattttttc ttttgtaaat aataataata aaacatatta ctgaccattt aactgtttat 900 ttacaatgaa acagctccaa atgaacatat ttagtaattt ggggattttt tttagaaagg 960 gggggggggg gaatccctta ttatggtggg catatccctt attttcacat cccaatgttg 1020 acaggtatg 1029 // ID Gypsy11-LTR_DR repbase; DNA; ZEB; 1748 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy11_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy11-I_DR; Gypsy11-LTR_DR; Gypsy11_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1748 RA Kapitonov V.V. and Jurka J.; RT "Gypsy11_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 316-316 (2004). XX DR [1] (Consensus) XX CC Gypsy11-LTR_DR is a long terminal repeat of the Gypsy11_DR LTR CC retrotransposon. Its internal portion is deposited in Repbase CC as Gypsy11-I_DR. XX SQ Sequence 1748 BP; 489 A; 257 C; 403 G; 599 T; 0 other; tgtaacccag tagaatgaag ttctacaagt taagttttaa agtaatctta aggttattta 60 tttatttatt catattggtt ctagaacaga attaaatcta aaatgtgttt tgcattcatt 120 tgtgaatata catcctaatg ttgatgtgta atatatgagc gtctatctct gagaatatga 180 taagatagca agtttgatca accttcactg aaatacatgg ggttaacaag tttggtaggt 240 gttcctacat gattggttta tatgcgtgta gtcccgccct ctggtgtgga gggtgaaatg 300 tgattggctg ttgggaagag agaagggaga gaggagacac agccggcagt tttggaaact 360 gaaggggaga ctcgcgtcgc ggctgttatg attgcgtaac attcacggag aataaaaaac 420 aagagatagt aatagatatt gcgtgtgaac aacatgtttt tgattacttt gagtttgttt 480 tgtgtttgtt ttaagttcgc ggtgtgtttt catcatataa gtttccagtg tcagttattg 540 gtcagttcag agtgagtgag cattttttaa atgcctagat tgtcaactga cagttacgga 600 ggactaaagc tgaactgaga gagagacggt taaactgaaa ataaggagga gaaaaaagag 660 actgttagat tagtatgaaa caaggatgaa agagtcaaag aggagcaaat agcaccacaa 720 ataacctgaa tactacgtga gatttgcctc tcatcagctg gggaggaaaa gtgaaagtga 780 ggatttgaaa gacccgcggc agaagttcgc cgatcgatcg cggctcgttg gcccgttgtt 840 cacaagtgtt gtcagtttgg agaactgtca tatattgctg tcatctggat cgctgtcatt 900 tggattgctg tcatctggat cgctgtcaac tggatcgcgg tcatctggat cgctgtcaac 960 tggatcgctg tcatctggat cgctggcatc tggtttgctg tcatctggat caaggtcatc 1020 ttcatctgag gtatcttgtt taattttatt tgtgctcagg agagtgaaga ggccattttc 1080 attttcccgg ccacaacgta cactagcagc gatacaggcc tgcaacgggg ggtgtatgaa 1140 tgtgtgagtg tgtgggccca cagagtattg agttgtttat ggagatgcag tgagcataaa 1200 atactgctgc gtgttttacg gtatagattt atactaatag tttatatagg tttaattcag 1260 tgtgtaattt agtgacacaa ttgtagccac ttccaaagag aaacatattc agtcactctt 1320 agtcacttgt tgatttataa gtttaattgt tagagtgtag aaaggtggta tttttctata 1380 ttggtttaat taatattctt cattgtactg taaattgctg ggtttgactt tgggtcattt 1440 tactgtcttt tagctgtgca taaattctat atttgtattt tgttttacat ctggggtttt 1500 tcccaatatt tacatttcta ttctaattaa attatttata ttttctgatt tatctaagag 1560 ttttattatt gaaagagttt atttctagta atttattaaa acacagcaaa tatttgtatt 1620 ctcatctgac ttatttacat tacgtcattg aaatatatca gaggggccta cttcaacata 1680 tttgtaggcg tttagtgtat tgaacctgga gcgctatctt aaacttaata cacctaggtg 1740 gcgctaca 1748 // ID DNA-5-2_DR repbase; DNA; ZEB; 964 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 5-bp; KW DNA-5-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-964 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1234-1234 (2008). XX DR [1] (Consensus) XX CC The consensus of this element is characterized by 35-bp TIR and CC 5-bp TSD. The members are ~86% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 964 BP; 339 A; 149 C; 148 G; 327 T; 1 other; caggcccgta gccagcctgg tgaaaggggt ggttcttttt ttctctcaaa aagtggacct 60 tttttgcagt tattcgcctc attttctatt taattatgag atttaaatac tgcattttag 120 tgacatttta agcactattt ttagctggat tagcttgtcg gatggtcatc ataaccacac 180 tctttttgat gtaccaaaaa tatttcctaa agatttagaa taaagaaata tgaaaaaata 240 cttattttta aaatacaaat ttattacatc atatatcatt agaaaaaata ggaatctgtt 300 aaagttttaa attaaaaact gtagcctatt gtcatttatt aggcaaataa caaactggcc 360 agcacattca actttgtcct cagtcagaat ttggccattt gaataaaaaa aataaataaa 420 taataatata tatatatata tatatatata tatatatata tatatataaa acataataat 480 aataataatg taaaacttmt acatttgaaa attcatggat aacaacaata gccaaaatag 540 tattcaaatt aatttcaata tgattaatat gggccgcttt tggtgcttca caaccttttt 600 attggtcatt ttttgtttca agaataaggt ccaagattta gaataaaagt tacttgtaaa 660 atgttttctc tatttaaaaa caatacagta gcgaaagatg acttcactta cgtcagattg 720 taaaatgaat attttttagc attggccaaa actgattagc tgaagtcaca ccgaagcgac 780 agccaacaga ggattccgct tggtcttaaa gcctttttcg atgggttatt ttcacaattt 840 atatttatga tggcatagca gtcaaaatag gacaaatgag ctcatgtatg ggacaaattc 900 tggacctttg tcagtgaggg gtgggtcttt cgaaccaccc gaaccccccc tggctacagg 960 cctg 964 // ID CR1-21_DR repbase; DNA; ZEB; 2769 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-21_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-21_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2769 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1695-1695 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(2..910,914..2533) FT /product="CR1-21_DR_1p" FT /translation="GTFGSRLYVFWPLLRVNRMPGFCLSTMACFASQTEEH FT GNLRAVYVNKRYCTAVTVRERICTPDVELLSVSLRPFYLPREFPQLFVTIV FT YIHPKANAMSACGTIFNVVQRLQMISPDAPNFVLGDFNHVSLNKTLTTFYQ FT YVSCPTRRDRCLDLCYGSVKDAYKSLPLPPLGSADHNCVHLIPTYKTVLKR FT QKTQTMDVMDWSDESALCLQECYDCTDWEMFKQSCGDNFDMFVDTVCSYVA FT FCRDMIIPCRRVKIYSNNKPWVTKSVKSCIQKKKLFFKQGSNSDLYLANKE FT LKREILLAKQNKTELENKMATNKLGSAWDSLKTITGLHNLKSSNQINLVGY FT STDFELANALNHFYNRFNHLDFTSEIEELNSKLTGGQHFIIQQTAVHKAFL FT SVRVNKSCGPDNICGRTLKICARELSPIFEYIFNRSLLEQQVPKVWKDAVV FT VPVPKSKSPQTFNEFRPIALTSIVMKIFEKFVRSEMLRRTEYALDPMQFAY FT RPHRGVEDATVTLFNLLFKHLEKNGSHARLLFVDFSSAFNTIQPHILIKRL FT LEQFDLSNNLVGWILNFLTNRTQSVRVNGCLSDKVCSSTGSPQGCVLSPLL FT YILYTNMCQSRWENRTIIKYADDTVIVSQLHDDETGHGPVLTDFVRWCEES FT YLQLNVLKTKDMIVDFRKAAHSQVSTSIKNQTVESVQSYKYLGTIIDSKLS FT FDENCAAVCKKGHQRLFCLRKLNYFHIDKQMMNLFYHAFLESVLSFALVSW FT FGNLSVKNKNSLNQIVKWSSKLTGVSLANLEAVYTRQLKRITNSILNDTAH FT PLHSEFQLLPSGRRFLVLSCKTKRYKNSFIPAAIRHLNKT*" XX SQ Sequence 2769 BP; 836 A; 509 C; 552 G; 867 T; 5 other; gggaacgttc ggttcaagac tttatgtgtt ctggccttta ctgagggtta acagaatgcc 60 aggattttgc ctgtccacga tggcttgttt cgcttcacag accgaggaac acggaaatct 120 cagggcggtt tatgtcaaca aaagatactg cacggctgta acggtaaggg aacgcatctg 180 cacaccagac gttgaacttt tatccgtgtc gctacggcct ttctatctcc ctcgtgaatt 240 cccacaactc ttcgttacca tagtgtatat tcatccaaaa gcaaatgcga tgtctgcctg 300 tggcactatt tttaatgtag tgcagagact gcagatgata tcacctgacg caccaaactt 360 tgttttgggt gattttaatc atgtctcttt aaataagaca cttacaactt tttatcaata 420 cgtgtcctgt ccgaccaggc gggacaggtg tttggacctt tgttatggat ctgtaaagga 480 cgcatataaa tcactgccac taccaccact cggctcagcc gatcataatt gtgttcatct 540 aatccccaca tataaaactg ttttaaaaag gcagaaaaca cagacaatgg atgtcatgga 600 ctggtctgat gaatccgcgc tctgtttgca ggagtgctat gactgtacag attgggagat 660 gtttaaacag tcctgtggtg ataattttga tatgtttgtc gatactgtat gctcctatgt 720 ggctttttgc agagacatga ttattccatg taggcgggta aaaatttact ctaacaacaa 780 accgtgggtt acaaaatcag tcaaatcctg catacagaaa aagaaattgt tttttaaaca 840 aggatccaat tctgatttgt atttggcaaa taaagagctt aaaagagaga ttctcctagc 900 taaacaaaat tagaaaactg aattagaaaa caagatggca acaaataagc ttggttcagc 960 ctgggatagt ttgaaaacta taacaggcct ccataatcta aagagtagta accagattaa 1020 tttagtgggc tatagcactg attttgagtt agccaatgct cttaaccatt tttataatcg 1080 ttttaatcat ttggatttta ctagtgaaat tgaggaatta aatagtaaac ttacaggtgg 1140 acagcatttt attatacaac aaacagctgt gcataaggct ttcctctcag taagagtgaa 1200 caagagctgt ggaccagaca atatttgtgg tcgcacactt aaaatctgtg caagagaatt 1260 aagtccaatt tttgaataca tttttaatag atctttacta gaacagcagg tacctaaggt 1320 ttggaaggat gctgtggtag tccctgtacc caaatccaaa tcaccccaaa cttttaatga 1380 attcagacct atagctttaa catcaattgt gatgaaaatt tttgagaagt ttgtgaggtc 1440 tgaaatgctg agaagaacag agtacgcact tgatcccatg cagtttgcct ataggcctca 1500 tcggggagta gaggacgcca cagttacttt gtttaaccta ctttttaaac atttagaaaa 1560 aaatggttcc catgccagac ttctgtttgt cgacttttct tctgctttta acacaattca 1620 acctcacatt ttaattaaga gattgctgga acagtttgac ctcagtaata acttggtggg 1680 ctggatcctt aattttttaa ctaatcgaac acaaagtgtt agagtgaatg gatgtctatc 1740 tgataaagtt tgctcctcaa ccgggtcccc acaaggatgt gttctttcac ctttgttata 1800 catcctttac acaaacatgt gtcagagcag gtgggaaaat agaacgatta tcaagtatgc 1860 agatgacact gtcattgtca gtcaactcca tgatgatgaa acaggtcatg gtccagtact 1920 cactgacttt gtcagatggt gtgaggagtc atatcttcag ttaaatgttt taaagacaaa 1980 agatatgatc gtggatttta ggaaggctgc tcactcacag gtgagcactt ccattaaaaa 2040 tcagactgtg gaaagtgtgc aatcatataa ataccttggc acaattattg actctaaact 2100 ttcctttgat gagaattgtg cagcagtgtg taaaaagggg caccagcgac tattttgctt 2160 gagaaagtta aattatttcc atattgataa acagatgatg aatctatttt atcacgcttt 2220 tcttgaatcc gttttatcct ttgctytggt gtcatggttt gggaatcttt ctgtcaagaa 2280 taaaaaytca ctaaatcaaa tagttaagtg gtccagtaag ctgaccgggg tgtcactagc 2340 caatctagaa gctgtgtaca ctaggcagct aaagagaatt acaaattcaa ttttaaatga 2400 cactgctcat cccctacayt cagagtttca gctccttccc tccggtcgta ggtttctagt 2460 attaagctgt aaaacgaaac gttacaaaaa tagtttyata ccagcagcca ttagacattt 2520 aaataagact taagcagaga tgtttcagtg acctgtatat tttaratcat tttaaatatt 2580 tacttggggg aggggggtta cacttaaagt gattatttat ttattcttta tgtttatgtt 2640 ttatgtccta atgtgcattt gttgttgtta ctgtgtgagt tgctttgtgt ggcactgttg 2700 tgagatgagg actcttgact gcaaatcaaa tctaccttcg ggtataaata aagtaaccta 2760 acctaccta 2769 // ID Gypsy48_LTR_DR repbase; DNA; ZEB; 967 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy48_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy48-I_DR; KW Gypsy48-LTR_DR; Gypsy48_DR; endogenous retrovirus; KW Gypsy48_LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-967 RA Dib M.R. and Naveira H.F.; RT "Gypsy48_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 826-826 (2007). XX DR [1] (Consensus) XX CC Gypsy48-LTR_DR is a long terminal repeat of the Gypsy48_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy48-I_DR. XX SQ Sequence 967 BP; 282 A; 170 C; 219 G; 296 T; 0 other; tgtgacaaag gagattgtca ctgacctgcg gcagaactac atataaactt ttttccttca 60 cacgcatcca ttaaaacact gtattccata ctcaccatca gccgatcaca gggtgcacac 120 tggagtcgcg ctgctgccgg aacgcataca tacagacaaa cacgcacaca tttctagtta 180 atgatcccga tatacatccg atccgttatc acaaagcata cataacactt acatgaagca 240 tattcactta cctgttgttc aattgtgttt gtgtttgtct cgtctcgtct cgctgatctt 300 ccgggctgcg ccatgatggc tatggaatgc caaaagggcc aaaaagccta acgcgttcat 360 tttatgcaca agcgtgcact gcgcatgggc aaaagggacg gccggccgga gatcctattt 420 gttgtatgta tgggggagaa agattgtccg ggacaataaa acaaaagtgg tttataaaaa 480 ctggaaaata taaaaataat tagttatggg gtaaaactgt atagtatgca aatgtagtgt 540 atacatataa ttaaaatgtt gatcaatgaa atgtttatag atatagttat ttgatactta 600 cctgctgtga tgatatgagc cattccatac cagtctcccc tgctatataa agggcagagt 660 gagcacagta tggtgtgggg tgtaggagta aggttgtaga gtggtgtgga gaagtaagtt 720 gattattgga tttctttatt tgcagaaaga aaacgatagc caagtgctac ttgtattgta 780 aatatatatt ttttttgttt tgttttgttt tgttttgctg gttattattt ttgggggggg 840 ggggggattt ttttctactg agcactgtaa ataaagacac tttgcaccat agtgactatt 900 tcgactctgc caatttttga ggatttgaaa ggtctgggac ctgaagagaa aaatcgtctc 960 cgtaaca 967 // ID BEL29-LTR_DR repbase; DNA; ZEB; 321 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL29-I_DR; KW BEL29-LTR_DR; BEL29_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-321 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1402-1402 (2008). XX DR Genome; chr20; Positions 49691566 49691246. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 321 BP; 102 A; 49 C; 78 G; 92 T; 0 other; tgtaggagcc tatatgttga cttaaatgta atttgggctg tcaccacttg ggggcaatat 60 gtgcacatag ggttttaaag tcggtgtttc ttgcatgaga gaagaagact tgaggaaggg 120 aacgcatgca gtttttacca caccaaactc acacaccttt gaggtcgcta aagggttcaa 180 aatatttaat tgcaagtatg tatttataat aagtttaata aagtgaattt aaacttgaag 240 aggaagcaag actggacatt ggttgtgtgc gtaaaaacgt gcgttgagat gctaagcctt 300 cgacaataag ctgctgcaac a 321 // ID Gypsy-15-I_DR repbase; DNA; ZEB; 6739 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portion of the Gypsy-15_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-15-I_DR; Gypsy-15-LTR_DR; Gypsy-15_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy-15-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6739 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-15_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 3-3 (2005). XX DR [1] (Consensus) XX CC Gypsy-15-I_DR is an internal portion of the Gypsy-15_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy-15-LTR_DR. Gypsy-15_DR is characterized by CC 4-bp target site duplications. The internal portion encodes CC two proteins: the 628-aa gag Gypsy-15_DR1p (pos. 112-1995) and CC 1554-aa Gypsy-15_DR2p polyprotein (pos. 2022-6683) composed CC of the protease, reverse transcriptase, and integrase CC domains. PBS is complementary to Arg-tRNA. The internal CC portion is flanked by 99% identical LTRs. XX FH Key Location/Qualifiers FT CDS 112..1995 FT /product="Gypsy-15_DR1p" FT /note="Gag-protein" FT /translation="MDVVRRENITVANSLIVSGLTLTELDNELEAYLLRYG FT SIRRNVIIDDPASDFHKLLIVEFNENSAFQTLHPHLPLTLGSLSDPSITFR FT VRALAAVCDPPVVSTATEGYLEQLKAIAKESGRPFQTVLQEELEKLKETHS FT VDQTLAESQKIEVADTQSRDNTLISESPNVAEIDPQDSESKNPTKNRIIYV FT SPPLPETTADDNLTTVFPSSSTNIMGDPQVQRMVVEHVVKTSDAMMSQQTS FT IRLRVFSGKSPRPPNEPDYDTWRASVDYLLNDPSISDLHRTRKILDSLLPP FT AADVVKHVRPPALPAVYLELLESVYGSVEDGDELLAKLMGTFQNQNEKSSD FT YLHRLQVLLSAVIRRGGIKESERDRYLLKQFCRGCWDSRLIVDLQLEKKEG FT QLLTFAELTILIRTQEDKNASKEERMRKHFGMTKPANVYPKTRAISNQVSA FT CACDVSSTYSSEAGSLKKQVAEIQAQVATLKQSPDKKSIKGQSERAELVAL FT KRTVEDLCVQVAAVKASVAEGLKGNNPEQSEIARLQRQVAELQAQGIVQKA FT YQAPHMQRSPGTEIGRALKKEPLRTNRPRPGYCFRCGDDGHLAVNCENPAN FT PPKVEEKRLKLREQQHQWDILHGRPAQFLN" FT CDS 2022..6683 FT /product="Gypsy-15_DR2p" FT /note="Polyprotein" FT /translation="MEAKLYNSRPGCYKQIPELSQTTVSFKHLPKGLVGTK FT CTAQVTIGGMEVNCLLDTGSQVTTIPHSFYKAHLSDFPLEPLKNLLEVEGA FT NGQAVPYLGYIELTLKFPKEFIGAEVEVPTLALVVPDLTSFSQILVGTNSL FT DVLYGKCAQDCAADVKSSFPGYQAVLKVLEARWRQASSETLGYVKFKGNSP FT EIVPAGGMVVLEGQAHFNGPHTEKLVTLEPPSVPLPNGLLIASCLHTSPNK FT RLSKLSVLLRNTTQTDIAVPPKVMLAEIHAIQSVLNQHHQSSDAKAEESIP FT TCANLTFDFGDSLPTTWKERITKLLNSMPEVFSLHDLDFGHTKKVKHQIKL FT NDETPFKQRARPIHPQDIDAVRRHLQELLVTGVIRESDSPFASPIVVVRKK FT DGSVRLCVDFRKLNAQTIKDAYALPNLEEAFSTLTGSKWFSVLDLKSGFYQ FT IEMEEVDKAKTAFVCPLGFWEFNRMPQGITNAPSTFQRIMERCMGDLNRKQ FT VLVFIDDLIVFSDTLEEHESRLLQVLNRLKEYGLKLSPEKCRFFQTSVKYL FT GHIVSHNGVETDPAKVEALKTWPRPRNLKELRSFLGFAGYYRRFVRDFSKI FT VKPLTDLTAGYPPLRKSCNTKQKDCEYFNPKAEFDTRWTTDCQDAFDSIID FT NLTSAPILGFANPKHPYVLHTDASTTGLGAALYQEQEGQLRVIAYASRGLT FT KGESRYPAHKLEFLALKWAVTSKFNDYLYGAEFTVVTDSNPLTYILTSAKL FT DATSYRWLSSLSTYNFKLQYRAGSQNQDADGLSRRPHGELVDDLTSLKERE FT RIRQFTLHHLMESEDESPVVMAEVVKAICEKHQVVGSPQGLHCIPSVTLVE FT SLTHCVDVLPYEFQHEDEHGLPSLPHLSQAALAELQRKDPELKIVIERVES FT GVKPCKLRELSSAVSLWLKEWKRLELRSNVLYRKRQEHGASSYQLALPTSL FT RNTVLQSLHDDMGHLGIERTLDLVRTRFFWPKMSHAVVQKVKTCERCVRRK FT TPPEKAAPLVNIQTSRPLELVCIDFLSLEPDQSNTKNILVITDHFTKYAMA FT IPTRNQTAQTVAKSLWDHFLIHYGFPEKLHSDQGADFESRTVKELCKVAGI FT HKVRTTPYHPRGNPVERFNRTLLQMLGTLENERKSRWKEYVKPLVHAYNCT FT RHDTTGYTPYELMFGRQPRLPVDLAFGLPVDTPNKSHSQYVENLKNRLRES FT YEMATKNAGKIAERNKQRFDKHVVALTLEEGDRVLVRNVRLRGKHKLADKW FT EQNVHVVVKKAHNLPVYTVKPEGKDGPLRTLHRDLLLPCGFLQSNKLVEPP FT KQKPARKPLTRFSLNNEMQESDLISENSESEEEHIVSNVPEGTLSFETQII FT VGPEYIPVGESGVSLTVLDPAVEDVSVPESVVSNPEEPAKKHLPGVEPVEK FT ETNELIEVEKNSNALESSNTVPFVLTEKNSELEQSSELWSESPDQTAKNVL FT DSFEWETEQNLIPSNVGHTEILQNEQPTCNEPDDILLRRSQRERRPPKKFE FT YPQLGNPLTLVIQSLLQGLDTALCSSLEKSVVAPVRHL" XX SQ Sequence 6739 BP; 2013 A; 1449 C; 1549 G; 1728 T; 0 other; gtaaagttgg cgagccagcc aggagtctaa ttattgcagc aagggtgtca aacgacaaga 60 aaaagggaat tgtatcagta gcaaaccgtt ttaaaaattt gagctgtcat aatggatgtc 120 gtaagacgag aaaatataac tgtagcgaac tctctcatag taagcggtct aacgttaact 180 gagttagata atgagctgga agcatatttg ctgagatacg gctctatccg tcgcaacgtg 240 ataattgatg acccagcatc agactttcac aagctgctga ttgtggagtt taatgaaaac 300 tctgcgtttc aaactttgca tccccatttg cccctgactt tgggaagtct ttctgatcca 360 agcattacct ttcgggtacg cgctttagcc gctgtgtgtg acccacctgt cgttagcact 420 gccactgaag ggtaccttga gcaattgaag gccatagcca aagagagtgg aaggcctttc 480 caaaccgtgt tgcaggagga gttagagaaa cttaaagaaa ctcattcagt agaccaaact 540 ctagcagaat ctcaaaagat agaggttgct gacacacagt ctagagataa cactctgatt 600 tctgagtcac ctaatgtagc agagatagac ccccaagatt ccgaatccaa aaacccaacc 660 aagaatagaa tcatttatgt gtccccgcct ttacccgaaa ctaccgctga tgataacctt 720 acaacagttt ttccttcctc ttcgacaaac ataatgggcg atccccaggt tcaaagaatg 780 gtagttgagc atgtagtaaa aacgagtgat gccatgatgt ctcagcagac atccattcgt 840 cttagggtct tttcagggaa gagtccccgc ccccctaacg aacctgatta tgacacctgg 900 cgtgccagtg ttgactattt gcttaatgac ccatctattt cagacctgca tagaacacgt 960 aaaatcctgg acagtctctt acctccagcc gcagatgtag ttaaacatgt gcgtccccca 1020 gcccttcctg ctgtctatct tgagttgctg gagtccgttt atggttctgt tgaagatgga 1080 gacgaactgt tagcaaaact aatgggtact tttcagaatc aaaatgaaaa atcatccgac 1140 tatctccatc gccttcaagt cctgttaagc gcagtaatca ggcgaggtgg tataaaagag 1200 agtgaacgtg accgttatct tttaaaacag ttctgtagag ggtgctggga tagccgcctc 1260 attgttgatc ttcagcttga gaaaaaagag ggccagttac ttaccttcgc tgaattaacc 1320 atactaattc gaactcagga agacaaaaat gcttctaagg aggagcgtat gaggaaacac 1380 tttgggatga caaagccagc aaatgtttac ccaaagacac gagctatctc aaaccaagtg 1440 tcagcttgtg catgcgacgt gtctagcact tatagttccg aagcaggatc tttaaagaaa 1500 caagtcgcag aaattcaagc tcaagtcgcc actttaaaac agtctcctga taaaaagagt 1560 attaaaggtc aatcagaaag ggctgagtta gttgctttaa agaggactgt tgaagacctt 1620 tgcgttcagg tggctgctgt aaaagcatct gttgctgagg gactaaaagg gaacaatcca 1680 gagcaatcag aaattgccag attgcagcga caggtagcag agctacaagc acaaggtatt 1740 gtacagaaag catatcaagc tcctcatatg cagagatccc ctggaactga aattggcaga 1800 gctctcaaga aagagccttt aagaactaac agacctaggc cagggtactg ttttcgatgc 1860 ggagatgatg ggcatttggc agtcaactgt gaaaaccctg caaatcctcc aaaagttgag 1920 gaaaagcggc tcaagctgag agagcagcag catcagtggg atatcctaca tggaagaccc 1980 gcccagtttt taaactaggt gaggtctcta tagcggggca tatggaggcc aaactttata 2040 atagccgccc tggatgttat aaacagatac ccgagctcag tcaaaccact gtgtctttca 2100 aacatttacc caaaggtctg gtaggaacca aatgtacagc ccaagtcacc attgggggga 2160 tggaggtaaa ctgcctttta gacacggggt cgcaggtcac cacaataccc cattcgtttt 2220 acaaagcaca tttatctgat ttccctttgg agcccttgaa aaatctactg gaggtagaag 2280 gagctaatgg acaggctgtg ccatatttag ggtacatcga acttacctta aaattcccca 2340 aagaattcat cggggcagag gttgaggttc ctacattagc tttagttgtt ccagatttaa 2400 ccagtttttc ccaaatttta gttggaacaa actcgttaga tgtgctttat ggtaaatgtg 2460 ctcaagattg tgcagctgat gtcaagtcta gttttcctgg ctatcaagct gtgcttaaag 2520 tgttggaagc tagatggagg caggccagca gtgaaaccct tggttatgta aaattcaagg 2580 gaaactcccc tgagatagta cctgcaggag gaatggtggt gttagagggt caagcccatt 2640 ttaatggtcc ccacacagaa aaactggtaa cactcgaacc accctccgtt cctttgccca 2700 atggtcttct tattgctagt tgcttgcaca catcaccgaa taaacgtctt tccaagctgt 2760 cagttctgtt aagaaatacc acgcaaactg acatagcagt tcctcctaaa gtcatgttag 2820 ccgagattca tgctattcaa agtgtcctga accagcatca tcagagttca gatgctaaag 2880 ctgaagagtc aatacccacc tgtgccaact taacatttga ctttggcgac tctctgccca 2940 cgacctggaa agaaaggata acaaaactgt taaactctat gccggaagtt ttctccctgc 3000 atgatttgga ttttggtcac acaaagaagg tcaagcacca aataaagtta aacgacgaga 3060 caccattcaa acaaagggcc aggcccatac atccccaaga catagacgct gtgaggaggc 3120 acctccaaga gttgctagtt actggtgtta tccgggagtc tgattctcca tttgcttcac 3180 ccatagttgt tgtccggaaa aaggatggct cagtgcggct atgtgttgac ttccggaagc 3240 taaatgcaca gacaataaaa gacgcctatg cgttaccaaa tttagaggag gctttctcca 3300 cactgacggg ctcaaaatgg ttttctgtgc tcgatctgaa gtctgggttt tatcaaatag 3360 agatggagga agttgataag gccaagactg catttgtctg tccacttggc ttctgggagt 3420 tcaaccgtat gccgcaggga attacaaatg cccctagcac ctttcaaagg attatggaac 3480 ggtgcatggg agatctaaat cggaagcaag tccttgtctt tattgatgac ctcattgttt 3540 tttctgatac tttagaggaa catgagtccc ggttgttgca agtcctaaac cgacttaagg 3600 agtatggatt gaaattgtca cctgagaagt gccggttctt ccaaacctca gtgaagtacc 3660 ttggccacat tgtttctcac aatggggtgg aaacagaccc tgcaaaggta gaagctttaa 3720 agacctggcc aaggccaaga aacctaaaag agctaaggtc ctttttaggc ttcgctggat 3780 attacaggag gtttgtgcgt gacttttcaa agatagttaa accgttaact gaccttactg 3840 caggatatcc tcctcttaga aagagttgta acacgaagca gaaagactgt gaatatttca 3900 atcccaaagc ggaatttgac actcgatgga ctacagactg tcaggatgca tttgactcca 3960 taatcgacaa tctcacatct gcacctatat tgggctttgc aaaccccaaa catccctatg 4020 tgctacacac cgatgcaagt accaccgggc tcggtgcagc tttgtaccaa gaacaagagg 4080 ggcagctgcg agtcatagct tatgctagta gagggttgac taaaggtgag agcaggtacc 4140 ctgcacataa acttgaattt ttagcgctaa aatgggctgt aacttccaag tttaatgact 4200 acctttacgg tgcagaattt actgttgtga cagatagcaa ccctctaaca tatatattaa 4260 cttcggcaaa acttgacgct accagttacc gctggttgtc cagtctatcg acttataatt 4320 ttaagctgca gtacagggca gggagtcaaa accaagatgc agatggtctc tctcgaaggc 4380 cacatggtga gcttgtggat gacctaacct cactaaaaga gagggaaagg attaggcaat 4440 tcactttgca ccatcttatg gagtcagaag atgagtcacc tgttgtgatg gcagaagtag 4500 tgaaagcgat ctgcgaaaag catcaagtag ttgggtcacc ccaaggactc cattgtatcc 4560 cttcggttac tttggttgag tctcttaccc actgtgtgga tgtccttcca tacgagttcc 4620 agcatgagga tgaacatggt ctcccaagtc tccctcatct ctcacaagct gctttggcag 4680 agttgcagag aaaggatcca gagttgaaaa ttgtcattga aagagtggaa agtggggtta 4740 agccttgtaa gttaagggaa ctatcttctg ctgtgagctt atggttaaag gaatggaagc 4800 gtcttgagtt gaggagtaat gttctgtaca gaaagaggca ggaacacgga gcttcatcat 4860 accagttggc tttacctacc tcacttagaa acaccgtatt acagagtctc catgatgaca 4920 tgggtcatct tggtattgaa cgaacactgg atcttgtgag gacaagattc ttttggccga 4980 aaatgtctca tgcagtggta cagaaggtaa aaacctgtga acgctgtgtt cggcggaaaa 5040 cacctcctga aaaagcagct cctttggtta atattcaaac aagtaggccc cttgagttgg 5100 tgtgcattga tttcctatcc ttagagccgg accaaagcaa cactaagaac attctggtca 5160 tcactgacca ttttacaaag tatgctatgg ccatacctac tcgaaaccaa actgcccaaa 5220 cagttgcaaa aagtctttgg gaccacttct taatacacta tgggtttccc gagaagctgc 5280 atagtgacca aggagccgac tttgagtcac gtactgtcaa ggagctgtgt aaggttgcag 5340 gaatacacaa ggtcagaaca accccatacc atcctagggg gaatccagtg gaacgattta 5400 atcgtacact gctccaaatg cttggaacac tggaaaatga gaggaaatct aggtggaagg 5460 agtatgtaaa acccctagtg catgcctata attgcaccag gcatgacaca actggatata 5520 ctccctacga gctcatgttt gggcgacaac ctcgtcttcc tgttgacttg gcattcgggt 5580 tgccagtgga cactcccaac aagtctcact cacagtatgt ggaaaacttg aagaatcgtt 5640 tacgtgaaag ttacgagatg gctaccaaaa atgctggaaa gattgcagaa cgtaacaagc 5700 aaaggtttga caagcatgta gttgccttaa ctctggaaga aggtgaccga gttctagtga 5760 ggaatgtgcg tttgcgaggc aaacataaat tagctgacaa atgggagcaa aatgttcatg 5820 ttgttgtcaa gaaagcacat aacctaccgg tgtatactgt caaaccagaa ggaaaggatg 5880 gtccgttacg aactttacac cgtgacctct tgttaccctg tggatttttg caatcaaata 5940 agcttgtaga accaccaaaa cagaaaccag ccaggaagcc tctaaccaga ttttccctta 6000 acaatgagat gcaggaatca gacttaattt ccgaaaactc agaatctgag gaggaacaca 6060 tcgtcagtaa tgtgcctgaa ggaacattaa gttttgaaac tcaaattatt gttggtcctg 6120 agtacatacc agttggggag tctggtgtta gcttaacagt cctcgatcct gctgtggaag 6180 acgtgtctgt tccggaatct gtagtaagta atccagaaga acctgcaaag aaacacttac 6240 ctggtgtgga acctgtggaa aaagaaacaa atgagttaat tgaagtagaa aaaaattcca 6300 atgcacttga gtcaagtaac actgtgccat ttgtcctgac tgagaaaaac tcagagttgg 6360 agcaatcctc agagctttgg agtgagtccc ctgatcaaac agccaagaat gtgctagata 6420 gttttgagtg ggaaactgaa caaaatctga tacccagcaa tgtgggacat actgaaattt 6480 tgcagaatga gcaacccact tgtaatgaac cagatgatat tttactcaga cgatcacaga 6540 gggagcgacg gcctcccaag aagtttgaat atccccagtt agggaatcca cttaccttag 6600 ttatacagtc tttgttacaa ggccttgata cagctctctg ttcttcttta gagaagtcag 6660 ttgttgctcc agtgcgtcat ctgtgaatac tgtttgctgt gcaatgcaaa gggacttgca 6720 tgtattcgag aggggaggg 6739 // ID L1-Tx1-1_DR repbase; DNA; ZEB; 5220 BP. XX AC AL713862; XX DT 11-FEB-2003 (Rel. 8.01, Created) DT 11-FEB-2003 (Rel. 8.01, Last updated, Version 1) XX DE L1-Tx1-1_DR is a non-LTR retrotransposon from the L1 clade. XX KW L1; Non-LTR Retrotransposon; Transposable Element; 15-bp TSD; KW L1 clade; L1-Tx1-1_DR; TX1; endonuclease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5220 RA Kapitonov V.V. and Jurka J.; RT "L1-Tx1-1_DR, a family of non-LTR retrotransposons from RT zebrafish."; RL Repbase Reports 3(1), 5-5 (2003). XX DR Genbank; AL713862; Positions 8174 2955. XX CC L1-Tx1-1_DR forms a separate family of L1-like non-LTR CC retrotransposons. CC This element is less than 70% identical to members of other CC L1-like CC families identified in the zebrafish genome. CC L1-Tx1-1_DR belongs to the Tx1 subclade of L1-like elements. CC L1-Tx-1_DR was active recently (ORF1 and ORF2 proteins are CC damaged CC by a few stop codons and indels only). XX SQ Sequence 5220 BP; 1778 A; 627 C; 1163 G; 1652 T; 0 other; tgagagaatg gcgtcgctcg tggatgagcg aacgccatct ctttcacttc gccatggtgt 60 caagtgtatg acagagcaag gagttaaggt ggaagacgtg ctaatagctg tgggagaaca 120 aatcggctgt gaaaatatag tttcggcgtc taggatgaac aaagcagttg ttgtttttcc 180 taaggaagaa ctttttgtct cgcagcttat tgagagtggt gtaataatta gtgggagttt 240 ttgtcccgta ctaccactag ttgcgttaac gtctaaggta attatctcga atgtgccgcc 300 gttcatcccc gatgaagcca ttgagcgaga gctcgttagg tttggaaaaa tagcgagtcc 360 gataaaagtt atatcgctag gatgtaaaag tccagagctg aagcatgtga tgtcgttttg 420 acgacaagtc ttcatgtttc taaatgaacc aacgttggac atatctttta gggtaatgaa 480 tgaaggtaag tcgtttatga tttatgcttc tactgctgga ttgaagtgtt atgaatgcgg 540 agattttggt catagacagt atttttgtcc ccataaagtg caaggaaata caaatgatga 600 caatggtgat gtaaatgagt tggagggaga gaagaatcaa gtgaaataaa cagaaacgga 660 aacgacaatt gagaatgatg ttgttggtaa acagcagaat gaagggaatg cgattcagag 720 aaatgtaatg aatgatgaaa atcaggcaac agtggtacaa gaggttgtag atgaaaatga 780 tttgagttct gaaactgtga ctgcgagtgc tagtggttta gtgaagctta atgagaaaat 840 ggaaggacct gtagaaggcc tattagaaga cgaacataaa caggatatga tggatgatga 900 cgacactttc tctcagactt cagatttttc cgaaatgggt tctcaaatgg aagacagtaa 960 tttttattct ctacaagaga taaatgaatt tttggatgat actttcggaa gaacagttga 1020 agttgaagat tttttcccag atactgataa atttgtgaaa acggttctta gactacggcg 1080 tactgttgga ttggatgaat taagtgataa aaagagattt agactgggga aatttctgac 1140 aagaataagg aagcataaag gctcgttttc aagaaaatag gtaaatgagt ctgaaacact 1200 gggtgttctt tctttcctat ttttttctga tccttgtcta tttctcttct attattatgg 1260 ggcttatgaa aataggttct tttaatataa atggagcgag agatcaaaat aaaagggcta 1320 tattatcaga atttttaaga atgaattaaa ttgatattat tatgttacaa gaaatccatt 1380 ctgataaggc aaatgagata gattggggtt tgtggtggga agggaagtac gtactcagtc 1440 atggttcaaa tttaagtgct ggtgttgggg ttcttttttc aaaatttttg aatgttgaca 1500 tttttaaaac tgaagaagtg gtagctggaa gggtattatt tgttcacgca caagttgaag 1560 gaatggaatt tttattcatt aatgtttatt caccaaatgt tggttctgaa agaattgaat 1620 tattcttaaa tataaagagt cagatttcaa aatatgataa taatgtttgt ataattatgg 1680 ctggcgattg gaactgtact gaaaatgtta aagttgatag aaacggtgaa gaacctcatt 1740 ttcagtcagg ttatgtactt tctaaagtga taaaagagca caatttaatt gacgtttgga 1800 gaatcagaaa taaaggtata aagcaatata catgggttaa agtaaataat actgaaatta 1860 agggggctag attagataga ttttatatat gtaagtcttt taatagtaga gtggtaaatg 1920 ctgctattgt ccctagtggt ttctcagacc accatatggt aactatagat ataaataaga 1980 aaataacagc tagaacacaa tattattggc attttaataa aaaattatta tataataagt 2040 tattctgtga gagttatgga gattagagaa aaataattat gaaactaaat gccagtggtg 2100 ggaagttgaa aaggcgcata taaaagtact ttgtcagagt tatgaatctc aaatctcttt 2160 taaagttaga gatgcggtaa aaaaattgca atgtgaaatt ggtgaaatag aaaaaattat 2220 gttagacaat gaagaaggaa ctgataaaag tgaaaaaatc cataagaaga agctagagtt 2280 gaaacaaatt ttacaagaac aagtcaaaaa tgctttaata agatcacgtt tttgcacggt 2340 taaggatatg gatgcgccta gttcctattt ttttaaacta gaaagaaagg tagttcaaca 2400 taatcctatg cttcatttaa gttatccaaa tggaaaaatt acggacaatc caaatgaaat 2460 gaaaaaaatt gcagttgatt tttattccga actatttagt gctgagcagt gtgatgaaga 2520 atgtatgatg gacctttttg aagatcttgc taagttgcaa actaaacaaa gagagaactt 2580 ggatagtgat atcagtttgc aagagcttac agatgcgtta cagcaacttt cattaggatg 2640 ttcgcccaga attgatggaa taactacaga attttatcat cacttttgga acattttggg 2700 acaagatttt tatgaagtgg caaaagagtg ttttgaaaat ggacttcttc cgagtagctg 2760 ccgaagagca gttttgtctc ttttaccaaa aaaaggagat ttggctttct taaagaactg 2820 gagacctgtt tcactgttgt gtgattataa aattttggca aaatgtttag caaataggtt 2880 aaaatgttat ttggagataa ttattcatgg tgatcagaca tattgtgtac ctgaaagatc 2940 tattatggac aatctttttc ttttgagaga tgtgattcat ttgagtatta aaaaaaatat 3000 taatatgggg attttagcat tagatcaaga aaaagctttt gaacgggctg accatagtta 3060 cctttttagt gttttaagtt gttttggtga aaagttcatt tcttggataa aactgttata 3120 tcatgatgtt tctacgttgg tgaaagtggg aggagggttg agtgcctcaa ttcctgtgag 3180 aaggggaatc aggcaaggat gtccactctc tggccagtta tacagtattg ttattgaacc 3240 acttttaaga agaataagaa aaaaatctta atggtttttc aatacctaat tatataaatt 3300 gtgataagat agtgttatct gcctatgctg atgatataac agtttttata catggagaag 3360 atgatgttaa taatcttaaa aatatatata gagtagtatg aaaaggcttc tactgcaaaa 3420 gttaattgga ataaaagtga agcatatgca gtaggtcgat ggattaataa aagacttcca 3480 gatctaccag gaggattaaa atggggaagg gaaggtctta aaattttagg tgtgttttta 3540 ggttcttcaa atttcatgga aaaaaattgg gagggactgc tggagaaagt ggtggcccga 3600 ttgtctagat ggaaatggct attaccacaa atgtcttata aagggagagt tttggtaata 3660 aataatctgg tagcgtctac actatggcat aggatgatgg tgcttgaacc accacaggag 3720 ttggtggtgt ctatacaaaa gaaactagta gattttttct ggactggtca acattggatt 3780 cgtgctgcac aactgtattt gccaaattgt gaaggagggc aaggattaat agacatcaaa 3840 agtaaaattc aagctttcag actgatttca gcctaaaaac ttttgtgtaa aagcaagttg 3900 tcatggactc atactgcttg tgcaatcttg agaacagtaa aggggctgaa ttatgatcta 3960 catatttttt ggatacaaac tgatgagatg gatttaacag ggattacatc gttttataaa 4020 tcagtgctac aagcatggaa taaaaccttc aagattcaaa gaatatttga catccctaaa 4080 tattgggtgt tggagaaacc cctttttcat aatccagtgg ttcatggtac attgatatct 4140 tcgaggagtg tacagaacaa tatgattaaa aggaactgta caaagttaaa acatctggtc 4200 aaggtggatg gatggaagtc gatagaggac ataatggttt ttacaggact aagatcccat 4260 cgtttagcta cacgattctt ggaggaaata cactcttttt tgcctggaca gtatagagcg 4320 attttacaag aagaagactc gagtgaagaa gttaaaattc tccctctggt ggtatcttca 4380 agttgtgaat ttggtgggtt cgaggaagag actggaattt ctaaaacatg gacggttgac 4440 ggaatgtcta aagaggtttt atacatctta tgtgttaaga atgttcacaa ggacatacta 4500 aaaggacagt tattgaaatg gactaaaaag tttggaccta atttcccctt aagagatcag 4560 tggaggtccc tatataaatc tcctcttgag aaaagaatcg gggacttgca gtggagactt 4620 gttcatgggg caatagctac caacagtcat gtgtctcatc ttaatccaat agttaggagg 4680 gagtgtgcct tttgtggtaa ggaagagaat gtagaacatt tgtttttaag ttgtacaaga 4740 ttgaaagacc tttttaatgt tttaaaatac tggtttagag gttttggtga aaccttttct 4800 gaatgtattt ttataggggg agtgaaatat agaatgtcta aaaagatcct ttttactctt 4860 ttaaattatg ttattggcac agcaaaattg tctatttgga aaactagaaa aaacaaaata 4920 tcaggtgttg gtggaataga tccagtgaat atgttaaaag gtttaatttc aaatagattg 4980 aaaattgaat ttactttcta tgaattggta agagatctaa taacatttga agaaaagtgg 5040 tgtctaggaa gcattttatg tagtgtttat gaagaacagc ttgtacttaa tttataaaaa 5100 caataataag gtagttgggt ttcatttttt aattttggtt ttttaatcaa ctcaaagatg 5160 aatggaatat gatgtgaatg aacatttaat ggaatgagaa tgttttgtta aaaaaaaaaa 5220 // ID L1-9_DR repbase; DNA; ZEB; 1131 BP. XX AC . XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 04-AUG-2002 (Rel. 7.07, Last updated, Version 1) XX DE L1-9_DR is a non-LTR retrotransposon from the L1 clade - a DE partial consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1 clade; KW L1-9_DR; ORF2; endonuclease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1131 RA Kapitonov V.V. and Jurka J.; RT "L1-9_DR, an ancient family of non-LTR L1-like retrotransposons RT from zebrafish."; RL Repbase Reports 2(7), 29-29 (2002). XX DR [1] (Consensus) XX CC L1-9_DR is an ancient family of L1-like non-LTR retrotransposon. CC There are ~500 copies of L1-9_DR in the zebrafish genome. They CC are CC ~7% divergent from the consensus sequence. The consensus CC sequence is incomplete and represents the 3' end of L1-9_DR. CC It encodes the C-end of the ORF2 protein (L1-9_DR2p). XX FH Key Location/Qualifiers FT CDS 3..782 FT /product="L1-9_DR2p" FT /translation="TPICYNHAFKPSLTDKVFEQWQDKPVFSVTDLYINNI FT FATFSQLSEKFNLPPSNCFRYLQVRNYVRLNTPDFEVLSLEGELFDLFLNS FT SNARGLTSLFVNAFKNKANCSSLHLKVALEEDSGISTSETDWDQCLLSVYS FT CSRHHLIQYKVLHRLHYSKTKLDKFYPSVSPTCDKCKAAEGTLHLFRSCVQ FT IQNFWLEIFQFFAKVYDCVLLPDPMIAIFGWSDFLETLNRNVRLPVQYGMI FT IAKKVILCFWKKNVRPLFF" XX SQ Sequence 1131 BP; 306 A; 206 C; 180 G; 439 T; 0 other; tcactcctat ttgttacaat cacgcattta aaccctctct tacagataaa gtttttgagc 60 aatggcaaga taaaccagtt ttcagtgtta cagatttata cattaataat atttttgcca 120 ccttttcaca actctctgag aaattcaacc tccccccttc taattgtttt aggtatcttc 180 aggtcagaaa ctatgtccgg ctaaatactc cagattttga agttttatct ttagaaggcg 240 agttatttga cttgttttta aactcttcaa atgctagggg cctaacttct ttatttgtta 300 atgcttttaa aaacaaagca aattgttctt ctcttcattt gaaagtcgct ttggaagaag 360 actcagggat ttcaacatct gagactgatt gggatcaatg ccttttatct gtttattcat 420 gttctagaca ccatcttatt cagtacaaag tattacacag acttcactat tctaaaacaa 480 agcttgataa attctatccc tcggtttccc ccacatgtga caagtgtaaa gctgctgagg 540 gtaccttaca tctttttcgg tcctgtgttc aaattcagaa tttttggtta gaaatatttc 600 agttctttgc taaggtttat gattgtgttt tactccctga cccaatgata gcaatttttg 660 gctggtcaga ttttctggag actctcaacc gaaatgtacg gttacctgtt cagtatggca 720 tgatcatagc taaaaaagtt atcctttgtt tttggaagaa aaatgttagg cctctttttt 780 tctgactggc ttctcggact cacctccact ttacatttgg aaagaataag atataccacc 840 tctgggaact ctgcgaaatt tgataaaatg tgggatccag tctttttttt ctgaaacacc 900 ataaagacta agagtgttgt ttaatttctg aagatttgta agtttttttc tctcttttta 960 catccattgc tgtgtctttt tgtaggtcat atatgttttg tatgtcgcca ttttctcttg 1020 tttcttgctt tattttacat tttgcacaga ttattgtttc atgatgggtt tgtctgtctg 1080 ttctttgttt ttctgtctgc aaaaattata ataaaaaaaa taattaaaaa a 1131 // ID HATN15_DR repbase; DNA; ZEB; 1080 BP. XX AC . XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 04-AUG-2002 (Rel. 7.07, Last updated, Version 1) XX DE HATN15_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HATN15_DR; nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1080 RA Kapitonov V.V. and Jurka J.; RT "HATN15_DR, an ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(7), 19-19 (2002). XX DR [1] (Consensus) XX CC HATN15_DR is a family of nonautonomous DNA transposons that CC belongs CC to the hAT superfamily [2]. CC Several hundred copies of HATN15_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~9% divergent from the consensus sequence. CC This element is characterized by 8-bp target site duplications CC and CC 11-bp terminal inverted repeats. XX SQ Sequence 1080 BP; 372 A; 174 C; 167 G; 335 T; 32 other; tagggatgtg cacatcgatg ctcgagtatc gatatatcga tacthagaag httctaattt 60 ggtatcaata ttttttaaag tatcgatatt ttaactatat tttaataatt tctgcataac 120 taaatgtttt agcgctgtgt amaggattst cagtgacgct ttgtatacaa gacgttacac 180 caatcaatct gcggtggaac gtcacgtgag cgatcawgtg cctttcagat aacttacata 240 ctcaacacac gcaagtcagt cacacacttc tgcattatgg aatgacacga tggcagaggg 300 gtaacgcagc gtgttgtcgt gctcattagc tdaaaaagcg ttgcacdagc aatatttghg 360 ataagaaagt catacacagc aacaatacaa gcagccttta taaacactta cagacaacac 420 avcctgaaac ataaagaagt ggaacaaaaa caagatgccc agtgttcctc tgctcagtct 480 ccctctgtct gacaaaagac attataggag agtttaaaag gagrcaaacm tctctctctc 540 tctctctctc tctctctctt tcaattcagt tccattyaat aattgcktta wtggcatgac 600 aaatgttwtg ccaaagcatt tataaarttt acattaaaam rmacacacac rcacrcacat 660 atatatataa gcwtggtaaa cacagcaaac aatagtatag tattataata tatagtataa 720 taatanataa atayaataaa taaaaataaa taaataaaaa aaattaaatc agctctgtaa 780 atctttctct cattaccatt aggtgaataa raataagtat aagngtgtgt gtatctgtgc 840 agtgtgtata tacagtatgt atgtgtatgc agtgtgtata tgcaatatgg atatgtgtgt 900 gcrtgtatgt atatttatat ntnnacatgt attgattttt ttaattaaca ttattttgaa 960 aaattacatt aacattttaa ataataagtg gtatcgatat cggtatcgrt aacaraattt 1020 ttttaacaaa aagtatcggt attgtatcga attaaaaaat tgtggtatcg cccatcccta 1080 // ID DNA-2-9_DR repbase; DNA; ZEB; 638 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-638 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1222-1222 (2008). XX DR [1] (Consensus) XX CC TSD is 2-bp long, TIR is 37-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 638 BP; 190 A; 107 C; 112 G; 229 T; 0 other; cactgtaaaa aatggccgtg atttcaacgg taaaaaactg taaaaatgct acagtacaaa 60 tccgtaacct ggttaacggt atgtttcctt aatatatacg gcgaataacc gtaaattgac 120 tttcccagaa ttccctgcat ggcacataac tttttatgca tttggttgac attaatatgg 180 atctttttag ttgtttcccc atcagttatg tacattagag atttatgtta catctaatgt 240 tgataaacaa tgtttatttc atgactttaa ttttatgcgt gttaccatac tggtgtttag 300 tagctgtgtg aatgacactg agcaccttct atatgctgat actcttttct gcttgtggga 360 aaagttgatt gtgatgagca ttggctcatt atgtaacttc ctcatcacca cctgcatgtg 420 ggtgtgctaa cgtgtctaac tgtgtaacaa aagatgtgta gatgttgatc attcaaaata 480 cagacatatt acctctataa attaataaaa tacggtaatt taccgtaaaa atgaaaaatt 540 acttttacgg tttgtaccgt gtttttaacg gtaaagtact ggcaaccaca gctgccgttt 600 ttttaccgta aattttacgg aatttatttt ttacagtg 638 // ID Gypsy163-I_DR repbase; DNA; ZEB; 4431 BP. XX AC chr7; XX DT 13-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy163_DR; KW Gypsy163-LTR_DR; Gypsy163-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4431 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1604-1604 (2008). XX DR Genome; chr7; Positions 69939241 69943671. XX CC Positions [3250-3705] - Integrase core CC 'GAGCG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 234..2621 FT /product="Gypsy163-I_DR_2p" FT /translation="MAGLVGQIGSYDETVEQWSSYTERFDYFVQANGIKDE FT VLVPTFLSVMGSKTYNLLRSLVQPDKPGDRSFKDIVALLQEHYAPKPLIIA FT ERFRFHKRNQLEGESIAQFVAVLKRLSEHCEFGTTLNDTIRDRFVCGLRSE FT TIQKRLLCESNLTLERAMEISVSMEMAAKEAQELSTSSHVNKVYSEKMKQE FT PKKACYRCGKSRHSAHECWFKDKDCRNCGKKGHIERACQSKKLIATGKVNS FT SKLTEPRKDYIRKTTKKNVNQVENTGKEEYESDENEFSHIMSVSSNSDGYW FT VTPLLEGQAIPMQVDTGAAVSLVAESTYREKWPHLRPKETNLTLKSYTGEM FT VPLSGVVDITVELNKQKVKLPLYIVKGSSHTALMGRAWLEKIKLNWQEVHL FT IAKESTSLQNILRKHAEVFKEELGNMKDITVKLVIKHDSNPKSLKARPVPY FT AIKPKVETELARLEETGVLKKVSVSEWATPIVLVLKKDGTVRICGDFKVTV FT NPVLTVEQYPLPLIEDLFAGLSGGQKFSKIDLCQAYLQMQVDPVSQELLTI FT VTHKGLYRYQRLPFGITSAPALFQRAMDQILSGLNGVQCYLDDLLITGKDD FT EEHLRNLNETLQRLKEYGLRVKKDKCEFFQPTIEYLGHVIDSTGLHTAPSK FT VKAIVDAPSPRNISQLRSFLGLLTYYAKFMPNLASTLRPLHELLNKSKQWV FT WSNRCEKAFKDVKSVLTQFKVLTHYNPSLPIQLACDASPYGVGAVLSHVMA FT SGEERPIAFASRTLNSAESNYAQIEREALAIVFGVKKVPPVLVWT" FT CDS 2707..3795 FT /product="Gypsy163-I_DR_1p" FT /translation="MQRWALLLSAHQYDIKYRRSEHHCNADGLSRLPLPDT FT PPEGSQADIFYFKEVQSAPVTAAQVKRFTRTDPVLSEVLTWISHGKRGEMT FT ENCKPFLSRHDELTVQSGCILWGFRVIIPPPLRKLLLKELHAGHAGVVRMK FT EIARSYFWWPGIDAEIEEEARGCTDCQNVRNMPQLAPLHPWEFPEDPWQRI FT HIDFAGPIKNHMFLVMVDAHSKWPEVAIMNGTSSEKTIEELRSVFNRFGIP FT QQLVSDNGPQFVSEEFQSFLQMNGIRHIKSAPYHPSTNGLAERFVQTLKKA FT LKASQDKGSLNQRLHTFFLAYRNTPHAVTKVSPATALLKRQLRTRLDLLRP FT SETKQIVSSQQQRQVERRVK" XX SQ Sequence 4431 BP; 1427 A; 938 C; 1009 G; 1057 T; 0 other; aaactggcga cgaggataaa cgatcttccg cgtcactcaa agacgcctgt tatttagcct 60 atgttagcaa cacgcgattg ctaacggagg gaaaatcatt cacgaacaaa acgaaaataa 120 aaaagtacaa cggccgcaca tcacgacgac gacacacatc ggcgcgggaa ggaaagaaag 180 caacaacaac aaaaagcagc gtgagtaaaa gaaataaaag aagactaaag gaaatggctg 240 gtctagtagg gcaaatcggc tcttatgatg aaactgtaga gcaatggagc tcatatactg 300 agcgatttga ttatttcgtt caggctaacg gaataaaaga tgaggtttta gtaccaactt 360 tcctcagtgt aatgggcagt aagacgtata atttattacg cagtctagtc cagccagata 420 agcctggtga tcgctcattt aaggatattg tagcattgct gcaagaacat tatgcaccga 480 aacccctcat cattgctgaa agattccgct ttcataaacg taatcagtta gagggagaat 540 caattgcaca atttgttgct gtattgaagc gattatcaga gcactgtgaa tttggtacaa 600 cactgaatga cacaataaga gacagatttg tttgtggcct acgtagtgaa acaatacaaa 660 aaagacttct ttgtgaaagt aatctcacat tggaacgtgc aatggaaatc agtgtatcca 720 tggaaatggc tgctaaagaa gcacaagagt taagcacatc cagtcatgta aacaaagtgt 780 acagtgagaa aatgaaacag gagcctaaga aagcatgcta ccgctgtgga aaaagtcgtc 840 attcagcaca tgaatgctgg tttaaagata aagactgtag aaattgtgga aaaaaagggc 900 acatcgagcg tgcatgccaa agtaaaaagc tgattgcaac agggaaagta aactcatcca 960 aacttactga acctaggaaa gattatatca ggaaaacaac gaagaaaaat gtaaatcaag 1020 tggagaatac aggaaaagaa gaatatgaat ccgatgaaaa tgagttctca catattatgt 1080 ctgtgtcttc aaactctgat gggtactggg taacaccact gctggaggga caggctattc 1140 ccatgcaagt ggacacaggc gcagcggtat cactggtggc agagtcgacg tacagagaaa 1200 aatggccaca cctccgacct aaggaaacaa acctaaccct gaaatcatac actggagaga 1260 tggtacctct ttcaggagtt gtggacataa ctgtggagct aaataaacag aaggtaaaat 1320 taccattata tattgtgaaa ggaagcagcc acacagcatt aatgggccga gcatggctag 1380 agaaaataaa gctaaactgg caagaagtgc acttaatcgc aaaagagtct accagtttgc 1440 aaaacattct gagaaagcat gcagaggtgt tcaaagagga actaggaaac atgaaagaca 1500 ttacagtaaa gctggtcatt aaacatgaca gtaacccaaa gtccttgaaa gcgcgccctg 1560 ttccttacgc aataaagccg aaggtggaaa cggaattggc ccgcttggag gaaaccggag 1620 tacttaagaa agtgagtgtg agtgagtggg ccacgcccat tgttctggtg ttaaaaaaag 1680 atggaactgt ccgaatttgt ggtgatttta aagtcacggt caatcctgtg ctaactgtag 1740 aacagtaccc cttgccatta atcgaagatc tttttgctgg attatcaggg ggtcagaaat 1800 tcagcaaaat agatttatgt caagcttacc ttcaaatgca ggtagatcct gtttcacaag 1860 agctgctgac cattgttact cacaagggcc tctaccgata tcagcggcta ccattcggga 1920 taacttcggc tccagcactg ttccaacggg ccatggacca gatcctgagt ggtttgaatg 1980 gagtacaatg ttatcttgat gacctgttaa ttacaggaaa agatgatgag gagcatctca 2040 ggaacttaaa tgagaccttg cagcggttaa aggagtatgg tcttcgggtc aagaaagata 2100 aatgcgaatt ctttcaacct accatcgaat atctgggaca tgttatcgac agcactggtc 2160 tccacacagc accatcaaag gtaaaggcta ttgtggatgc accatctcca aggaacataa 2220 gccagctgag gtctttccta gggctcctaa catactatgc aaagttcatg ccaaacctgg 2280 ccagtacgct tcgtcctcta catgaattac tgaacaagtc taaacaatgg gtatggtcta 2340 acagatgcga aaaggcattc aaagatgtga aaagtgtatt gacacaattc aaggtcctca 2400 cacattacaa cccttcttta cccatacagt tggcatgtga tgcgtctcca tatggcgttg 2460 gcgccgtact ttcacatgtg atggcatccg gcgaagaaag accaatcgct tttgcatcaa 2520 ggactctaaa ctctgctgag agtaattacg cacaaatcga acgagaagct ttggcaattg 2580 tatttggtgt aaaaaaagtt ccaccagtac ttgtttggac gtaaatttac tctgctcact 2640 gaccacaggc cacttacatc tatctttggc gctcaaacag gcattccctc attggcggca 2700 aaccgcatgc agcgctgggc tctgctattg tcagcgcatc agtatgacat caaatacagg 2760 agatctgaac atcactgtaa tgcagatgga ctgtccaggt tgcctctgcc agacacgcca 2820 cctgagggca gtcaagctga catcttctac ttcaaggaag tgcaaagtgc tccagtcacg 2880 gcggctcagg ttaaacgatt cactagaact gacccagttt tgtcagaagt tttgacctgg 2940 atctctcatg gtaaaagagg agagatgaca gaaaactgca aaccctttct gagtcggcat 3000 gatgagctca cagtacagtc tggctgtata ctgtggggtt ttcgtgtcat tataccccca 3060 cctctaagga aactcctgct gaaagaactt catgcaggcc acgcaggagt agtcaggatg 3120 aaagagatag cccgtagcta cttttggtgg cctggcatag atgcagagat agaggaggaa 3180 gcaaggggct gtacagattg ccaaaatgtt agaaacatgc cacaattagc tcccctgcat 3240 ccatgggaat tcccagagga cccatggcaa agaatacata tcgactttgc aggaccaata 3300 aagaaccaca tgtttttggt gatggttgat gctcacagca aatggccaga ggtagccatt 3360 atgaatggca catcttccga gaaaaccatc gaagagctca gatcggtatt caatcgtttt 3420 ggcataccac agcagcttgt aagtgacaat ggcccccaat tcgtgtcaga agagttccag 3480 tccttcctac aaatgaatgg tatacgacac atcaaatctg caccatacca tccatctaca 3540 aatggtctgg ctgaacgttt cgtacagact ctaaaaaagg ctctgaaagc atcgcaggat 3600 aagggttcct taaatcaaag attgcatact ttcttcttgg cttataggaa tacaccacac 3660 gcagtcacaa aagtctctcc tgctacagca ttgttgaaac gtcaactacg aaccagactg 3720 gacctgctga gaccatctga aaccaagcag attgtctcct cacaacagca aaggcaggtg 3780 gagcggcggg taaagtaaaa cacagaagtt tcaacagggg tgattgggtt cttgctcgta 3840 attatgggaa aggatcaagg tgggtacatg caaaaattct tgcacaaact ggtcctgtct 3900 catacattgt tgaaactaga gacaagcaga catggaggag acatgtagat cagatattgt 3960 cttcagcaat gcctagcgat gactctggtg agttggagaa ccaacagaaa ttctgccctc 4020 acattgaccc agtgaaggat gtggtacatg aaaattttcc cacaccgctt gctagggagt 4080 cacaggtaat tgcaccctct gaggatacac ctctttcacc tgctacatca gcagctatta 4140 cagacacttt agtaaacctg ccatctggtt cttttcatcg ttacccaact agagtgcgga 4200 aaccaccaca gagattggat ctctagttag aggctaaccc acgtcgatgg ggcagaataa 4260 tccccagggt ttagccaggg aaatgaggca gtctaccctc tttcccttct gttgggactt 4320 attgtgttat attgttatgt tccctacatc tattaaaaat atatataata atcaaccgaa 4380 catatatagt gttaacgtac ccctgagtgt taaagtaaag ggggaggaat g 4431 // ID DNA-TTAA0-1_DR repbase; DNA; ZEB; 1033 BP. XX AC . XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 0-bp; KW target TTAA; DNA-TTAA0-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1033 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1247-1247 (2008). XX DR [1] (Consensus) XX CC This transposon is inserted into TTAA site (between TT and AA), CC but it does not produce any TSDs (confirmed by comparing several CC pairs of pre-and-post insertion sequences). The TIR is 86 -bp CC long. The classification of this family is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1033 BP; 325 A; 168 C; 174 G; 366 T; 0 other; acagtttttc tcagtcgctt tggtgcattt ctcacaacac tatttacatt tgcacaacag 60 ttaatgcatt tctcaaaaca attagtcatt tgtgcacatc ctagtagcag tttctcattc 120 ttttcaacaa attgcgaatg cttttggaca tgcatcaatt gctttcatac aactctctgc 180 tgtttataac attatcattt gcttatgtca tgtcagtcaa aatgaactaa acttgtaaat 240 gctgaatagt cattccatat aaaactaata gtcctcattt cattacttga gtcattacat 300 acaaaaatgt tgaactagtt gtcaaaatct gtcaagcaaa ttttataaaa caaatttaaa 360 tctttatttt cctgaaaatg tctttaaaat tgaacaattt gatgaatgat ttacagacct 420 gtgtatgttg taggctcagt tccaatatgt actgcaatat tgtttacagt tgtgcacagc 480 tctacctaaa gggagtttat gtttgttgta aataagggtg tgtttattct tttgcacaat 540 gctgtgaata aattaggatt gcaaagagca aatgcagtaa aaatgtgaaa catacatatt 600 cagtgtcttg tactcagata cctcactaaa tgcagtgatg tctaacttta ctgtagtttt 660 caaatggctg tgcaaatagt atatagtgct gtcttgagca ttttcaggaa gtgtcaccaa 720 aatctgactt tttttgcatt gaaaaaattg tacagagtaa actgtcataa tgaaaacatg 780 actaagccat ttgactatct tgttcataaa caatggtgtc aggacttttc attttgatga 840 tactgacact ttgattgaca tgaatacttg cttttgagga atgaactatc cattttgagc 900 atgtgacgcg cttttgcagg ttatcagcta ggttttgcag tttgcactaa ttgttttgag 960 aaatgcatta actgttgtgc aaaaatgcac atagtgttgt gagaaatgca ccaaagcgac 1020 tgagaaaaac tgt 1033 // ID TDR9 repbase; DNA; ZEB; 208 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR9; TC1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-208 RA Jurka J. and Drazkiewicz A.; RT "TDR9: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 37-37 (2002). XX DR [1] (Consensus) XX CC Distantly related to Tc1. Several hundred copies. XX SQ Sequence 208 BP; 71 A; 29 C; 28 G; 70 T; 10 other; ggttcattcy gctgtggcga ccccagatta ataaagggac taagccgaaa agaaaatgaa 60 tgaatgaatt aattaataaa attattataa ttatatttta aaaattaaat ttanntntna 120 ttttattaat cattaaatca nnaattcatt cattttcttt tcggcttagt cccttkattd 180 atctggggtc gccacagcrg aatgaacc 208 // ID hAT-N46_DR repbase; DNA; ZEB; 804 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N46_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-804 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1315-1315 (2008). XX DR [1] (Consensus) XX CC TIR= 14 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 804 BP; 241 A; 172 C; 143 G; 248 T; 0 other; caggcccgga ttaagaattc agaggcccct gggcacaatg tcttgtaggc cccctacctg 60 gccgcacccc tatagttgaa ttaaaaaata aattaatata atatttttta atctataatg 120 tattgcccgc atttttaaat aaatgatata cagtacatat atttctagtc tacaaagatt 180 taacttattt ttaaagcatt taaagcacac tttgaaaaaa aaatatacta atacaactag 240 tgaaaattaa tggattttaa tatacttgtt caagttcact gaagcagtac taaaatttat 300 atttaagggt atttttaatg tataacttct acaaacttat aatgcatatg atttggatat 360 aacacctctc caatccagtt cagttctatg aatctgagtc ttccataata cacacactta 420 ttaatgattc ctacttgtct tcctcgtgat gaagagtagg caaaatctga tatgtagtgg 480 gggaaaaaga aaaaacgtgt tcatcagacg ctggattcga accgggttca tgatcgatcg 540 cgtcaaaaca tgttgccatg cgctttacga gctacgccac tcacgctgct gtttgttgcg 600 ctctttagtt atgttgtctc tgtcaatcaa acggtgatgg gcgggaatca ctcaactagc 660 ttattccaac gaggtgcgct atttgcactt agcctaaata gacactccaa aatctgcagt 720 ttttttcccc ccctcgcagg ggccccaagt gcgcacgggc ccctgggcct gtgcccataa 780 tgcccattgg ttaatccggc cctg 804 // ID DNA-8-23_DR repbase; DNA; ZEB; 811 BP. XX AC . XX DT 09-DEC-2008 (Rel. 13.12, Created) DT 09-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-23_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-811 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2146-2146 (2008). XX DR [1] (Consensus) XX CC Copies are ~85% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 811 BP; 213 A; 163 C; 226 G; 207 T; 2 other; tagaggtctg cattcccgcg gctgtacygc gggagctgtg ggacccgacc agatttcttg 60 cggcgcggga ataaatttcc gaataaaaca cggtagcggt cggtaactct ggtgtaattt 120 gaacgggagc gggcggtcta acaaaatata gcattcccga gagatagaga gagcgagtga 180 gagagagagc aagcgcatcc gccrtggtgc tgtactgtgt gtgtgtgtgt gtgtgaatga 240 gagagagcgt ggtgctgtgt gtcgcttgtt ataggctgtc tggactctgt agcatccagc 300 tgggtgtttt ttgggaggca tatttaatcg ctatacattt tacattcttc cttggcccaa 360 tatgtacctg ctatgtgtat agaaataatg ctgatatgat atatgaaaca caatcggtag 420 ttggcaaata tattgagtgg atatgtttgt gtttacaaat ctgactttga aagagtcagt 480 gcctcaccag ccacaaacct caccgcacgt cacgcggcac tcatgtctca tcacaaaggg 540 gcaaacaaaa ctgacattta gcctacaatc cttgcacaac ctacactttt aatctgttat 600 ctctctcttt tggtagtacc caattttaac gtgagatttt ggaaaccaga tctaaattta 660 gcgggaacgg tcgggttggg tagaaaacgg ggcgggtcgg gcagcgggtg aacaaagact 720 gaatatacag cgggagcggt cgggtgcgga ataaaacctg gcgggagcgg gcaggagcag 780 gactaaaaat tcagtcccgc gcagatctct a 811 // ID BEL4-LTR_DR repbase; DNA; ZEB; 904 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL4-I_DR; KW BEL4-LTR_DR; BEL4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-904 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1360-1360 (2008). XX DR Genome; chr16; Positions 11383332 11384235. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 904 BP; 282 A; 141 C; 203 G; 278 T; 0 other; tgtaaagtca aatcgcgaat ttggacttca tatgaatcaa tgcatgaagg aattaggcgg 60 aactaacgaa actgaaagtg tttggcggaa gcaattggct gacggagctt gccgaggaga 120 tacggagctt gccgaggaga tacgggaagt gccgaggaga tacgggaagt gccgagtgaa 180 tgcggaaaga aaaaaaactc acatggcaag tgcatgcatg cagacgcgct gtgaaagctg 240 aagaacgcga tcgctgcatt aacaaacttt cacgattgca tgcagaaagt tttaagctgc 300 agtgagtaag gaagggaagg cacatcagca tttggtcatc tcagggacac tgatggtatg 360 agaattgact ttttctaatt acaaggactg ccacatgcat tgccttattc ataatatatg 420 tgcaatataa aagacagctg ctattttgtt gcactgtagt tcattatttt gtgcattata 480 atactttgtg cacaatatta agttaatcat gctgttcatt tatgtaaata gtttatgtaa 540 agtgaaatgc ttgcattaat attaagaagt attattttaa tatgcaaatt aataatgctg 600 agacaaaatt tgtgaaggaa ctgtctttat ttatattaca ttgtgtttgt ttaagtcagt 660 attgcgaata atgtatattt tatatatttt gttccttttc taaggttatc aacctattgc 720 tatcgctgct actggtatcg ctactactgt tcaagcgcta cagtattgta ttattgggca 780 aactgaaagt taaaatacaa aagctgaaaa gccttggagt tgttccctgt gtcccctgtt 840 attggctgaa gctctttcga gttagaagca gatactggtg tgtgtaaccc ataagaactc 900 gaca 904 // ID DIRS-1_DR repbase; DNA; ZEB; 5654 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.1, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-1_DR. XX NM DIRS-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5654 RA Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(10), 1267-1267 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5654 BP; 1066 A; 1734 C; 1551 G; 1295 T; 8 other; ggcacgagta ttgcgtcaca tgcttggggg tccaacatgt tgatgcggtg ctcgcggaca 60 gttcttgccc gcattgcgat ggcttgactg ttgcgcaact gaggtcgcgg tgagctcttt 120 tattggggca agtcacccct gctgctcccc gccaagcggt tagcactcgg gcagacctga 180 gggttacagt gggggctaat ccgtcgcctg cggaccctcg ctcctctcca gtgcgctctg 240 tttctgcttc gagtgcgagc gctggtccat cctccgggct ggccgcgctt gcattagatg 300 acaccgggga cgtgacgtcc atcgctgcat cggagggcgg gttgacatgc tccgaggaag 360 atccggaccc cctgccaccc tccggggttg tcagtatggt tactggaatc ggacatgtta 420 gccgtgttat cccgggctgc ttcggccgtg gggctggaga tggcttgtcc cccagaaccg 480 cggccggacc gcttagacgg gtgctacgtg ggggtgaaga aaggcgaagc cttctagacc 540 cccccgtccc cttcttcccg gaggtacaca gagagctcac gcaggcatgg agggcacctt 600 tttctgcccg gtcctcgtgc gctcccgctc tcacctccct ccgtgatgga gcggccaggg 660 gatatgaggc gattccccgg gkggagcgtg ctagccgytc cactcgtgga gattcgccca 720 aactcccttc caaggcgtgc aggttatctg agtctctcat ggccagagcc tacactgctg 780 cgggtcaggc cgcctccgct ctgcacgcta cggccaccta tcaacgctac caagccgaga 840 agctggccga gatgcaggag ggaggttctc ccccgggctt gctgcatgag ctccgcacgg 900 caaccgatta tgctctcatg acccccaaat cggctgctca tgccctgggg aagacgatgg 960 ccacattagt ggtccagaaa cgccaccttt ggctgagctt ggctgatatg agtgacgttg 1020 acaaagctcg ctttcttaac gctcccatat cccaggccgg cctgttcggc gacaccgttg 1080 ggggttcacc caggaattct ccgcggtgaa agagcagtcc aaggcgatgg gtgaagtcat 1140 ctatcagcgg gctcagaaaa ctgctcctcc cgctgcccct tccgcttggc gctcctcgcc 1200 gagggcgtcc acccgcggct tcaacattcc gcttccgctg cccgctccac cggccaagcg 1260 gcaacgccga gcctcccacg ggctgcgacg ccaccgcccc agggcgccgt taagtcggta 1320 aacggacccc gaagcgttcc tgggacgggc cattcggaga agaggggacc tgctctttcc 1380 tcggtggggg gccgaggatt attaatcagt ccacaacgac tctaaactca tcgctgccgg 1440 cctccgggcc agcggcaacc aaattttcaa aagagcagtt tcctctttct ccggatgcgc 1500 aaacccgagc actgccagtc tgggacgctc cgccttccag ctcgcagcgc cgggacccct 1560 cgcctcaggc cctcagagtg cagcagaacg gactcctttc tctcactctg gcctcaccgc 1620 gggatccagg gaggaaggta agagaaattc tcttattttc agctcttcct cgggacgcwc 1680 tgcttcccgg gatgagcact cccatcccga gctgcccctc cgctggcacg tcagcgatcg 1740 ctccgatggt gccattagcg cgcgctctgc tggcttggtt agcgccgctc agcgcgttgc 1800 ggtggctcat acggacggtc agactcggct atgcgattca gttcgctatt cgccctccca 1860 agttcacggg tgtccttttc acgagggtga tccccgagag cgcccctgtc ttgcgagagg 1920 agattgctgt cctcctggcg aaggatgcaa tcgagcaggt ccctccagcc gagatgaggt 1980 ccgggtttta cagcccgtat ttcatcgtgc ccaaaaarag cggggggtta aggccaatcc 2040 taaatctatg cctttcagaa tgctcacgca gaaacgcttg attcagagcg tccgtccaaa 2100 ggattggttt gcagccatag acctgaagga cgcatgtttt cacgtctcct ttcttccacg 2160 ccaccgccct tttctccggt tcgcgttcga aggacgagcg tggcaataca aagtcctccc 2220 cttcgggctc tctctgtctc cacgggtttt caccaagctc gcggagggtg ccctagcgcc 2280 cctgcgcctt gcgggcatcc gcatactcag ctatcgtgac gattggctca ttctagcctc 2340 gttccgcgat cagctgatta tgcacagaga caaagtgctt cggcacctcg accagttggg 2400 gtttcaggtc aaccgagaga agagcaaact ttgccctgtg cagaggatct cttatctcgg 2460 gctggagctg gattcggtcg ctatggttgc gcgcctctcc gaggagcgcg ccaggctgat 2520 gctttcctgt gtaaacgagc tccacaggaa gatagtggtc ccactgaaat gttttcagag 2580 gctcctgggg catatggcat ccgcagccgc ggtcacgccg ctcggtttgc ttcatatgag 2640 accacttcgg cgttggcttt gcgatcgagt ccccagacgg gcatggcgcg cgggcacgca 2700 ccgggtgcgc gtcactccgc tgtgtctccg caccctcagc ccctggacgg atctggtttt 2760 tctacgggcc ggagtgcccc tagggctagt atccaggcat gttgtcgtaa cgacagatgc 2820 ctccagcatg ggctgggggg ccgtgtacaa cgagcatgca gccgcgggtt cgtggtccgg 2880 accccgcctg cattggcata tcaactgcct ggagctgttg gcagtgtatc tagctctccg 2940 ccgcttttta ccggtgctgg agcggaaaca catgctggtc aggacggaca gcatggcgac 3000 ggtggcctat atcaaccgta tggggggtat acgctctcgc cgcatgtctc agctcgctcg 3060 ccgtctgctc ctttggagtc acacgcggct gaaatcgctg cgtgccatcc acattccggg 3120 cgagctcaac cgtgcagcca gtgcgctctc acggcagcta gtgtcccgag gggagtggag 3180 actccacccc gattcggtcc agctgatatg ggcgcgcttc ggggaagccc agatcgatct 3240 gtttgcttcc accgagaacg cacattgcca gctgttttat tccctgaccg aggcccccct 3300 cggcacggat gcactggctc acagctggcc atcgggcacg cgcaaatatg cgtttccccc 3360 agtgagccta atcgcacaaa ctttgtgcaa agtcagggag gacgaggagc aggtcttgtt 3420 tgttgcgccc ctctggccca accggacctg ggtttcggag ctcacactcc tcgtggcggc 3480 ccctccttgg cgcattcccc taagggagga cctcctctct cagggacggg gcaccatctg 3540 gcacccacgc ccagatctct ggaacctcca cgtgtggtcc atagacggag cgcggaagac 3600 ttaggtgact taccgcccgc ggtacttaac accatcactc aggctagagc accctctacg 3660 aggcatgcct acgccctgaa gtggagtcta ttctctgagt ggtgcgcttc tcgccgagaa 3720 gacccccgaa cttgccagat tagcattgtg ttatccttcc ttcaggataa gctggagcgc 3780 gggctgtcac cctccacact gaaggtttac gtggctgcga tctccgctca tcatgacgcg 3840 gtagatggca acacgctcgg gaagcatgat ctaatcatcc gattcctcag aggcgcgcgg 3900 cggttaaatc cgtcccgccc ccctctcatg ccctcttggg atctctctct agtcctagcg 3960 ggtctgcaga gagatccgtt cgagccactc gagtcagtat ctcttaaaat tctgtcatta 4020 aagacagctc tgctgatcgc attggcgtca ttcaagagag ttggggatct ggaggcattt 4080 tcggtcagcg aatcgtgcct tgaattcggg cccggttact ctcacgttgt cctgagaccc 4140 cggcctggct atgtgcccaa ggttcctacc accccattta gagatcaggt ggtgaacctg 4200 caagcgctgc cttcggagga ggcaggctca acccactcac tgctttgtcc ckttcgcgct 4260 ttgcgccttt acgtggaacg aacgcaaaat gtaagatcat gtgaacagct ctttatctgt 4320 tacggtggtc grcagaaggg aagtgccgta tcaaaacaga ggttggccca ctggttagtt 4380 gatgccatcg ccctcgctta tcaatgccag ggcgagccgc gccctcctaa cgtgagagcg 4440 cactctacwa gagstgtcgc ttcctcatgg gcgttatcac gcggcgcctc tctcacagat 4500 atctgcagag ctgcgggttg ggcgacacct aacacattcg cgaggttcta taatctttga 4560 gtggagccag ttttctccca gttattggta accccaatca atcgggggga attaagctcg 4620 gtgtcacaaa cgcttgctgc gccatgctcc ctaacccgga gatgcgtgcg ctttattcta 4680 ctctgctagt aagtttccct tctcaggcga accctagttc ctccgaggcc cccatcatcg 4740 actcagcgga ggagtcgaat gcatggctca gtgtgcggtt ggtacgccca tttggtctac 4800 acgcatattg aggatcagct atgtgcatcc ccacttggtg atgccatatg cattattacc 4860 acggtgtgtt cccccttatc aggcggtccc gtgtcttccc taaccgctaa ccagcttatc 4920 atatgtagca ctccccctca ttagggctag tccatatgcc tccttaccat caggtctccc 4980 cttctgggta gaaggtggtc tccaccgcgt cctccccctg cggggactga cgcttcccaa 5040 cgtactgtcg tatttccaaa aatccctagt ctatattagc taggtaaaag cacacttaca 5100 cccccgatta acttaaacat ttattcccat gtaatggtaa tatgttgggc cgaggggacg 5160 ttggaaggtt gcgctcttgg tgatgtcagt gcgctcacgc tttgcttggc aaactacaca 5220 tcaggagcgt gacgggcttg gttgccgtgg cgctttccat acagtctccc aaagtctgtt 5280 tatacagaca cacgtcgaag ttcccatatg aaggggaacg tccaggttac gtacgtaacc 5340 cacgttcccc gaatagggaa cggagacgtg tgtctccact gccacagcac ctgagtctcc 5400 agctgggagc tgagcgatcg gctcttcagc aggcaaaatt ctgacgagct aactcacgac 5460 taatatagcc ctaattggct catcgtttca gctgtggagc taagctccgc caattcaatt 5520 ggcatttcat tggcccgttt ttatatcttc agaaaagatt ggtcgtctaa agcactccca 5580 aagtctgttt atacagacac acgtctccgt tccctattcg gggaacgtgg gttacgtacg 5640 taacctggac gttc 5654 // ID hAT-N35_DR repbase; DNA; ZEB; 1433 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N35_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1433 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1304-1304 (2008). XX DR [1] (Consensus) XX CC TSD is 8 bp long. Most elements are inserted into TA satellite. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1433 BP; 501 A; 305 C; 223 G; 404 T; 0 other; tagtgctgtc aatcgattaa aaaaatttaa ctaattaatc gcaccttttt aaaaaaatta 60 atcgcgatta atcgcattta aaatactgaa acttgtaatt ttggctattt aaatgtaaaa 120 ttaatgtaaa cgcaagacaa aaaatattta aattcaaaat ataattgttt aatagaattt 180 ttgtttaact tgtaacacag atttcttcat gtaaacaaca tacccacaat aaaccatcaa 240 gatcctggct tgacagccat atttattaca gaaattaaaa cacaggcatg ttaatgacat 300 ttaaatttca aaacaatcaa tgccaataaa gaaaacattg atttccatgt tggattctaa 360 gtggactgca aaaaaatgcc aaaatacagg aattgcagat atggaaaatg aaaaattata 420 ataataaact atagaataca aactgttgca ctcattgtaa agttttattg ctgtgagttg 480 agaacattaa ttatagagta gaaacaattt tgcttaatcc tcctttacat tcatccagtt 540 gctaagacta ggagtatccc gcaagtgcac agagactccc aaatgcccgc aaatgaatga 600 taatttcttg caaaccggtc gttaaataca ttgcacaccc ctctcgcccc tactcagata 660 cgtcgctcac tccacccctg acacccctca acgggcctga cacagacaca cacacagaca 720 gacacacaca caacgcgctg cagacgtttt ggccccaacg gccttccata ctggagcgac 780 tcccctcaga ttcaacacgc atgatcacgt tcatcaaatt tgctaagcgt cctaaactaa 840 gcgtcctaaa gtattactgt atttcacaag gattctgaca caacaacgcg aagcatacgc 900 tttcgttgcg tttaatgagg aaacacgcta aacttggagg gctcatgctg aaaggcagag 960 taatgccctg tctttgacag ctcgcgactt caccagacgt tctgagtaca tttacataag 1020 acaccaatac tccgatttta atatgattaa gataatactc ttattaaaat tctaccgtgt 1080 agccttaaaa ggaaccttac cttaaaggaa caccgagtca cgaaatgcgg aggattttct 1140 ttctgttcgc catgcggtat caacttacaa cagaagtcga aagttaaaaa tgaaacaccc 1200 gaaattgcat gaaactctgg ataacctgtg atgcaactaa ttgttttata ctggaacttg 1260 ccttcaaaaa gtgcttcctt ggtcttatcc acatttattg catgttgaac acacgtccac 1320 cacaacagga agtaaaaaaa acctgcaact gcgttaattg cgttaatttt ttttaacgcg 1380 ttaattattt aaaattaatc gcatgcgtta acgcgttaat tttgacagcg cta 1433 // ID CR1-11_DR repbase; DNA; ZEB; 1977 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-11_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1977 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1206-1206 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(2..589,552..1766) FT /product="CR1-11_DR_1p" FT /translation="KDCISAAKSTYYSSLISSNEGNSKALFSLFTNITKPS FT ISLPPNLYTVDFCNNLASFFSSKIEHIYQQILSTSVLNPSVQVIFTPSHSF FT SEFVLPSVTEICSLILKSKPSTCMLDPLPTTLVKAILPSLSPLITDIIHSS FT LTTGIVPSSLKVAAVTPILKKLGLDPNNLITFIHFQPSIHPKFLKKLCCST FT XHPSFLKNCAAQLXTHLSCNNLYEQFQSXFRPLHSTETALLKITNDLLLAA FT DSGSLSILLLLDLSAAFDTISHNILLDRLASIGIGNTPLSWFHSYLSGRTQ FT FIQLKSFTSSSVPVTTGVPQGSVLGPLLFIIYLLPLGYIFRKHQIQFHCYA FT DDTQLYLSTKPNHKLPPSSLTNCFADIKSWFTTNFLKLNDNKTELLLIGTR FT STLNKSHSFSISTDDSVITPSPQVKSLGVIFDSTLSFTAHVNNVTRSAYFH FT LRNINRLRPSLTPHSAAILVHSLVTSRLDYCNSLLFGLPKKXFINYNWYKT FT LQPVLSQKPPLSVTSHPFYNSFTGFPSLSELIIKYFFSPLKPSTIXLLHTL FT LILFISPHLLEHSGLLLPFTSLFPLSALSPWGADXSATLLLSFGIHXLLIS FT VI*" XX SQ Sequence 1977 BP; 509 A; 562 C; 228 G; 667 T; 11 other; taaagactgt atttctgcgg ctaaatccac atactattcc agtttaatca gttccaatga 60 aggcaattct aaagctctat tttctctctt tactaacata acaaaaccct ccatctcact 120 tcctccaaac ctatatactg tggatttctg taacaactta gcttcatttt tctcttcaaa 180 aattgaacac atttatcagc aaattctgtc cacttctgtt ttgaatccat ctgtccaagt 240 catattcacc ccttcacatt cattctcaga atttgttcta ccctcagtaa ctgaaatatg 300 tagtcttatt cttaaatcca aaccatctac ctgcatgtta gaccccctac ctactactct 360 tgtaaaggcc atactccctt ccctgtctcc tctcatcact gacataattc actcctctct 420 caccactggt attgttcctt cctctctcaa agttgctgct gttactccaa ttttaaagaa 480 acttggttta gatcccaata acctaataac tttcatccat ttccaacctt ccattcatcc 540 aaaattcttg aaaaaactgt gctgctcaac ttyacaccca tctttcctgt aataacttat 600 acgaacaatt ccagtctrgt tttcgcccac tccatagcac tgaaacagct ctcctcaaaa 660 tcactaatga cctcctccta gcagctgact ctggttcact atccattctt cttctgcttg 720 atctaagtgc rgctttcgac acaatctcac ataacattct tctggacagg ctcgcttcaa 780 ttggcattgg taacacacct ctcagctggt ttcattcata cctctcaggt cgcactcagt 840 tcattcaatt aaagtctttc acmtcgtcat cagtccccgt cactacaggt gtgccccagg 900 gttcagtttt gggtcccctg ctatttatca tctatcttct tccacttggt tatattttcc 960 gcaaacacca aatccaattt cattgctacg cagatgacac ccagctctac ctatcaacca 1020 aacccaacca caaactccca ccttcatccc ttactaattg ctttgcagac atcaaatcct 1080 ggttcaccac aaacttcytg aaattaaatg acaataaaac tgaactttta cttataggta 1140 cacgatccac tttaaataaa tcacatagtt tttccatttc cacagatgat tctgtcatta 1200 ccccctcccc tcaggttaag agtctggggg tcatcttcga cagcaccctt tcattcactg 1260 cacatgttaa taatgtcacc cggtccgcct atttccacct tcgaaatata aatcgtttac 1320 gtccttccct cacacctcat tcagctgcca ttcttgttca tagtttagtc acatcccgtt 1380 tagattactg taattccctt ctgtttggcc tccctaaaaa amcttttata aattacaact 1440 ggtacaaaac gctgcagccc gtattatcac aaaaaccccc tctgtctgtc acatcacacc 1500 cattctacaa cagcttcact ggcttcccat ctctttcaga attaattata aaatacttct 1560 tctcaccttt aaagccatcc acaatctygc tcctccatac cttgctaatc ttgttcatat 1620 cgccacacct tctcgaacac tcaggtcttc ttcttccatt cacctcactg ttccctctgt 1680 ccgccttgtc accatgggga gcagatcytt cagccactct gctcctcagc tttggaattc 1740 actycctcct gatctccgta atttagactt tcacaattta agtccagact caaaacacat 1800 ctgtttagaa cagcctattc tctttgaccc gattgcattt ttaaagtttg tattgtattg 1860 tctwttgcta ttktttttgt gtttttattt gaattgattg ttttagctgt cctgtacggt 1920 gaccttgagt gtcatgaaag gcgcctttaa ataaaatgca ttattattat tattatt 1977 // ID ANGEL repbase; DNA; ZEB; 318 BP. XX AC . XX DT 20-JUN-2000 (Rel. 5.05, Created) DT 27-FEB-2002 (Rel. 7.01, Last updated, Version 2) XX DE Non-autonomous DNA transposon - ANGEL. XX KW hAT; DNA transposon; Transposable Element; ANGEL; DNA element; KW Miniature Inverted Repeat Element (MITE); Zebrafish MITE; KW consensus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Izsvak Z., Ivics Z., Shimoda N., Mohn D., Okamoto H. RA and Hackett B.P.; RT "Short inverted-repeat transposable elements in teleost fish and RT implications for a mechanism of their amplification."; RL J Mol Evol 48, 13-21 (1999). XX RN [2] RA Izsvak Z., Ivics Z. and Hackett B.P.; RT "Repetitive elements and their genetic applications in RT zebrafish."; RL Biochem Cell Biol 75(5), 507-523 (1997). XX RN [3] RA Ivics Z., Izsvak Z. and Hackett B.P.; RT "Genetic applications of transposons and other repetitive RT elements in zebrafish."; RL Methods Cell Biol 60, 99-131 (1999). XX RN [4] RA Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [4] (Consensus) XX SQ Sequence 318 BP; 111 A; 52 C; 49 G; 104 T; 2 other; ttaaagggat agttcaccca aaaatgaaaa ttctgtcact atttactcac cctttacttg 60 ttccaaacct ttatgagttt cctttcttct gttgaacaca aaagaagata ttttgaagaa 120 tgctgaaaac ctgtaaccat tgacttccat agtagtaaaa acaaatacta tggaagtcaa 180 tggttacagg ttttcarcat tcttcaaaat atcttctttt gtgttcaaca gaagaaagaa 240 actcataaag gtttggaaca agtnaagggt gagtaaataa tgacagaatt ttcatttttg 300 ggtgaactat ccctttaa 318 // ID Tc1-8_DR repbase; DNA; ZEB; 1142 BP. XX AC . XX DT 09-JAN-2009 (Rel. 14.01, Created) DT 09-JAN-2009 (Rel. 14.01, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc1-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1142 RA Jurka J.; RT "Mariner-type families from zebrafish."; RL Repbase Reports 9(1), 22-22 (2009). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 489..935 FT /product="Tc1-8_DR_1p" FT /translation="MVWGCFSARGTGRLHIIEGRMNGEMYRDILDKNLLPS FT TRKLKMKPGWTFQQDNDPKHKAKETLXWFQKKKIKLLEWPSQSPDLNPIEN FT LWRELKIRVHRRDPQNLQDFKTVCVEEWAKITPEQCIRLVSPYKRRLEAVI FT TNKGFSTKY*" XX SQ Sequence 1142 BP; 394 A; 210 C; 233 G; 304 T; 1 other; cagcggggaa aataagtatt tgacacatca gcatttttat cagtaagggg atttctaagt 60 gggctactga cacaaaattc ctaccagatg tagccatcaa gccaaatatt gaattcatac 120 aaagaaatca gaacatttaa gtatacaagt tgagtcataa taaataaagt gaaatgacac 180 agggaataag tattgaacac atgaagataa caaggtgcaa aatggcatag aaagtcagga 240 gatctgtcag tattgagaga aaaaccctgc tccctatcag tactaattga tatcagctgc 300 tttagtccta atagtttgcg aaacagcatt tggagaagcc ttggattatg ggaaactata 360 gtatggtcag atgaaagcaa aattgaactt tttggagtca ttctacacat catgtttgga 420 gaagaaatgg cactgcccac caccccaaga acactatacc aacagttaag tttgggggtg 480 gaagcatcat ggtttggggc tgcttttcag caaggggtac tggcagactt catattattg 540 aaggtaggat gaatggagaa atgtaccggg acattctgga taaaaatctg ctgccatcta 600 ccagaaagct gaagatgaaa ccagggtgga catttcagca agacaatgat cccaaacaca 660 aggccaagga aacactcaak tggtttcaaa aaaagaaaat aaagctgcta gaatggccca 720 gccaatcacc tgacctaaat ccaatagaaa atctatggag agaactaaag atcagagttc 780 atagaagaga cccacagaac cttcaagatt ttaagaccgt ttgtgtggaa gaatgggcca 840 aaatcacacc tgagcaatgc atacgactgg tttctccata caagaggcgt cttgaagctg 900 tgatcaccaa caaaggcttt tctacaaagt attaagtaaa gcgtgttcaa tacttattcc 960 ctgtgtcatt tcactttatt tattatgact caacttgtat acttaaatgt tttgatttct 1020 ttgtatgaat tcaatatttg gcttgatggc tacatctggt aggaatttcg tgtcaacagc 1080 tcccttagaa atccccttac tgataaaaat gctgatgtgt caaatactta ttttccccgc 1140 tg 1142 // ID hAT-N53_DR repbase; DNA; ZEB; 5819 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N53_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5819 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1322-1322 (2008). XX DR [1] (Consensus) XX CC This is a young composite nonautonomous transposon; copies are CC ~97 identical to the consensus. It contains insertions of other CC transposons, such as TDR18, DNA8-1_DR, etc. The TIR is 53-bp CC long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5819 BP; 1915 A; 1065 C; 1092 G; 1747 T; 0 other; cagaggtggg accaagtcat tgtttggcaa gtcacaagta agtctcaatt cattgccctc 60 aagtcccgag tcaagtcccg agtcaagaca ggcaagtccc gagtcaagtc ccgagtcaaa 120 ggcaacaagt ctcaagtcaa gtccaaagtc ctgcagtttg attttcgagt cttttcgagt 180 ctttttaaca gaaaaataaa atatatacag attaggtatg gttgtaagat ctgtattaat 240 taaacaaacc tttttttatt aaagaacatt gcttagatat ataaaaatac aaatgcaatt 300 ttctgaaaaa gtactgaaca gtgctgcgtc tcgtctccac agcataatgc acaagaacga 360 gttccaccaa aaaaagactc cattttgcct cacatcacac agaaattgca ggtctactgc 420 tgtctaattc attaagttta gttgtgcccg tatgttatgt cttcggtgat caacttgtga 480 ttaaccaaaa ttacataacc tcaatgtggt agcagtagac caactcctgt atatgctgca 540 aagttaaatt gagtgaaatt ggaattttgt caattgaatc tcgtgtgtgc taaagagatg 600 gagttgcaaa tctgtctagc agcaatgtaa aatggtagca catattgtta atgaagtagg 660 ctacttcata caaacaataa agttgttttc ttcacataac gttgaagagc tttgctctct 720 accttgtgtg atgctcgtgc acggatttcc tctgtttgtg gacaaaactg attgcgagtt 780 ctcaaatttt ctctcgctct caaatatgca ctgctcacac acaaattttc ttgagcgctc 840 tcagagatta tatgcaaact cacaatttgt gtcgtgttcc ctcttccttt catgcttttt 900 gatatgattc atattgctgc actgtttaat aacaataacc aaaatactaa aatagactta 960 catattaaat gtttaatcta ataaaccata acatttttgg ttgttttata tgatgagata 1020 ttttcagttt caaacactta aagacagaga atagaccagt ggtgcccccc tgaaaatttg 1080 cttaaggggg gccagaagag gccagctacg tcagtggcac caatcaatcc acaataattt 1140 agtggctgct atttatttat tgaaatattg cattgtattg catttatgta agtagattta 1200 aataaataat gtcaacagca aaatcatatc gggttggcca caggggtggc tagagtttac 1260 acaggggtaa caccccttgt aaaggggttt acaccactat aaggcacctc tggaatagat 1320 gttggatgta aagaatacat tttattttaa aagcatttaa acaaaaacta atgcaagcag 1380 aaatgtaact gtgataaaaa taaggaaaca cttgataagg tattatagcc tactgtacta 1440 ttcactcttc aaataaatct cactatcaat ctcattttat catggcagct aaaataaagt 1500 aaactagtgt ctgctggctt ccactttgct gatgagattg tggaggacat tttccatgtc 1560 atcctcttca ttttgaggaa agctgtgcca cagggcccct gaacacgtat attggcatgt 1620 actgtgtttt gaacagtagc ctactacagt aaagaacttc aattgatcag ttgtggtaat 1680 actatagttg ctatggtaac acaagtgtaa tataaacaaa ttacccagtg ctgtattatt 1740 ttacaatata gggtatttta tactacaaat cactacactt tactgtagtt aaaactacac 1800 tttgtattat tttatctgtt cactattggt aatactactg tattatgaag cattaattaa 1860 caagtcgtaa ataatactag cctaaaacat aagtataata caagtataca acaagtataa 1920 tactcaaaaa cactagaatt tattaaagaa tttaccttaa ccttttagtt ttttgtgtat 1980 tgttaataaa tagtaaagca agaaagaaac catatcaaca ttcttggaat aacccttcgc 2040 taagcctcgc cctccttagt tactgttgct actacgcctg tcaagctttc atgcctggca 2100 catctattac agtgtgtatg cgctggtgtc agacattccc aaggatttaa ttagtcattt 2160 ttggcaaaca ggtgagatat ctgagaaagc cagacaaaag aggactgcag tatacattac 2220 aggggtatat tcacaacata atagcaactg attagaaaat aattttatca aaattgaagc 2280 aaggttagcc tagccacctc atacgctgaa cattcaacag catagttcat gcacagcagg 2340 agaggtgaaa ttataaaata atctaataac ctaagaaact gatggatttg tgccgaatat 2400 tagcatcatt gacccataac agtgtacagt acctattact gtcagcatgt ttgtgtgctc 2460 tttatccagt ttttattcaa atttgcagtt aagtggaaga aaaaaaattg aaatgcacat 2520 taaagtataa ataacagaag tgaacaaata gatttttcct accagcaaat attagtgtat 2580 gtatttatat gcgttacaat agcgagggtt taaactaagc agtgctcaca gggttcaacg 2640 ctaaggattt tttctactgg cccgatcggg ccagtggttc tgatttttac ttgcccagcc 2700 caaattttca ctggccccac caaaaaaaaa aaaaaagaag ttaatcgcta tttgaatgtg 2760 cgtgtgtgtg tgtgtataca cacacacaca cacacacaca cacacacaca cacacacaca 2820 cagttgattt cagaattatt agtcccccta aattattagc acccctgttt atttttttcc 2880 ccaatttcta tttaacagag aacagatttt ctcagcacat ttctaaacat aatagtttta 2940 ataactcatc tctaataact gatttatttt atctttgtca tgatgactgt aaataatatt 3000 ttactagata ttcttcaaga cacttctata cagcttaaag tgacatttaa aggcttaact 3060 agggtaatta ggttaactag gcaggttagg gtaattaggc tagttattgt ataatgatgg 3120 tttgttctgt agacagtctg aaaaacaatt agcttaagag gctaataaat tttaccttaa 3180 aatggttaaa aaaatgtttt tattctagcc aaaataaaac aaataagatt ttgttcagag 3240 gaaaaaatat tatcagacaa aatatcaaca tttctttgct ttgttaaaca tcatttagga 3300 aatattttaa aaagaaaaaa caaaaaacaa aagggggcta ataaatctgt atatatattt 3360 acggtaattt ttaaaatgtt taaggaaaac tattcagttc atcaaggcta tatttataaa 3420 tgtaaaaaag gtgtaattgt taaatattta ttaaaatttt aaataactgc tttgttattg 3480 tatttagttt caaataaaat aattactcca gacattattg attttataac tattaccatt 3540 aataaatata ataataatta atattagtga tttctgaagg atcatgtgac tggagtaatg 3600 aagctgaaat ctcatcttta aaatcactgg aataaatgaa taaattaaat gataaactac 3660 ttttgaacag ttattttata gtgcaacatt tcacaatttg tatttatgat gaaataattg 3720 cagccctggt gagcagaaga agcttgtttt aacatattta atcacactga ccccaaactt 3780 ttgtgtattg tgtatagaat attttataag tgaaaatgtg cactttaatt ttaacaaccc 3840 acagacatcg tcaccaaata caaatcgagg tcagactttc tcatatatac agagcctcat 3900 ttcaattgtc aggttttgta gaaatcgatc aattgaatta atcaatctat taaagaaacg 3960 ttgactataa ttctgtattt taggcttaaa ttatagagag acatagcaga tgaaccgaga 4020 ctgcatcaga tcaatgtaaa tctttctttc gagctgtcag tgcggaaatg aacaaaacaa 4080 caggcctaat acaacatagg cctattatac gattaaaata tggaaaaatt aacagatagt 4140 aatttcggtc aattttcctg acggataaac cgagatcagg ctggataaac agctctcggt 4200 aatatttcag catgctttca ctttcgcatt taccggtaag aatgacatct cgccctgctc 4260 atcattccct cttcatatag ccgtatatat ggctattaca cgatcatgat acactgtgat 4320 atagcctggg tcgttagttc gttgcattgt tttgttttct cgctacaatc gatacgctca 4380 ataggacaga gaccgcgtct cattcaggcg atcttggact gattgatttg gcgcggaagc 4440 gagcgcaact gctggatata tgctaaaaaa aacgcgctaa cgggtgaaac gagacagatt 4500 ccgaaacaaa cgtctatgtg tgaaagcacc gtaagattgt atttgcacac aattgacaga 4560 cagtcgcagc acgcagctct ggctcgatct ggcaggcaaa aaggcagcag ctggcccgat 4620 cgagccagta acattttgta cactggcccg agtgtctcgc acgctggccc cgggccatcg 4680 ggcagtcctt attgtcgagc cctggctcat aatatcgagt ggaaaaaaag aaaaagaccg 4740 ctgggaagca taacctgctt tgtttttttt tttttttttt ttttataaga aacacagttt 4800 agatctgttt aggataagag gagtgtgagt tattgccacc tatcattgaa tgtgaatatc 4860 gaaaacaaaa ccaacttagc ttaactcctt taccgcccct cacacagctt gatccacgct 4920 ggcatttctc ctcttgggct ttggttttga aaagggaaaa aatcccacca cccagtccaa 4980 acttttagga tagcgggtgt cagattttca caatccaatt gcgcaacgat ttggcttgtt 5040 tccctcgctc gaaagcttga cagagcctct gcccgtagct acgattggta agccgcgatt 5100 ggtgggtggc ggtttttggg cgtggcttag cgaagggtca gtttaacagg aggatcccgt 5160 gagagcaaaa gcactaaagg tatgagacgc catcgtaaac aaaagtcaag cgaacgcaaa 5220 gagactgaat gcacgagaga gagagagaga gagagagaga gagagagaga gagagagaga 5280 gagagagagg tgtatgttgt atgtgagaga gagagagaga gagagagaga gagagagaga 5340 gagagtattg gcacgagaga gagagagaga gagagcgagt attggctcga tctcaagttc 5400 aaattcaaat tctctctgcg ttcgcttgac ttttgtccac tccggcgcct catagtagtc 5460 acgttggata ctgtcgaatc aaaacaatct accgtgcttc gattggacgt catgccgaaa 5520 gcgcgcatgc tcacacacac gcgcgcgcgc atgctgtcac acacacgcac acataaacgg 5580 ggagagagag agagcggtcc atgtcaacat acttttttat cgttgggctt ttcatgggaa 5640 gcagcaagtc tttacaagtc aataggggca agtcctagtc aaagtccaag tcatttatgt 5700 tcaagtccaa gtcgagttgc aagtcttttt atattttgtc aagtcgagtc taaagtcatc 5760 aaattcatga ctcgagtctg actcgagtcc aagtcacatg actcgagtcc acacctctg 5819 // ID DNA11TA1_DR repbase; DNA; ZEB; 204 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE DNA11TA1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA11TA1_DR; KW Dr000067; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bao Z.; RT "Dr000067, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-204 RA Kapitonov V.V. and Jurka J.; RT "DNA11TA1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(6), 1-1 (2002). XX DR [2] (Consensus) XX CC This element is characterized by 11-bp terminal inverted repeats CC (two mismatches) and the TA targets site duplications. CC Its classification is not certain yet, although it CC is expected to be a member of the Mariner/Tc1 superfamily. CC There are several thousand copies of DNA11TA1_DR present in the CC zebrafish genome; they are ~85% identical with the consensus CC sequence. XX SQ Sequence 204 BP; 67 A; 29 C; 32 G; 76 T; 0 other; cactcaaaaa ataatttttg ctgcttgttc aaactactta tttaaaatga gctgaaacaa 60 cacaattctt gagatttttt tggggacaac ttaattgttt tatgttcaat ccacttaaat 120 ttgtaaaaac tgttaagtta acttaatcaa tttgtgttgg gacaacatga aggaattgtg 180 tggaaccctg cattttttac agtg 204 // ID Gypsy91-LTR_DR repbase; DNA; ZEB; 294 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy91-I_DR; KW Gypsy91-LTR_DR; Gypsy91_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-294 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1494-1494 (2008). XX DR Genome; chr18; Positions 25145235 25145528. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 294 BP; 89 A; 67 C; 80 G; 58 T; 0 other; tgtggtgggg aaggcgtgat cgaacaccca gaaagggggg agagcaggcg gagacaccgg 60 cggctggtca gtggtctaat cataaataat tattaacacc tatgttttct agtgattggg 120 ccggagagac accttcataa ggggaggaga aggagaagcc gggagagagg agagtctgtc 180 gaccttcaga ctcgacacgc acacacacac acacagaaaa ttataaataa aaattgtgct 240 tacctgccat cgccgaccct gccttcttct tcccacgaaa cgaacattat tgca 294 // ID Mariner-1_DR repbase; DNA; ZEB; 1508 BP. XX AC . XX DT 02-FEB-2010 (Rel. 15.11, Created) DT 02-FEB-2010 (Rel. 15.11, Last updated, Version 2) XX DE Mariner-type DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1508 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 10(11), 1868-1868 (2010). XX DR [1] (Consensus) XX CC ~83% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1508 BP; 452 A; 321 C; 339 G; 385 T; 11 other; caggtgctgg tcgtataatt agaatatctt caaaaagttt atttatttca ttcaagaagt 60 gaaacttgtg taatgtatac attcattnca cacagactga natatttcaa gcgtttattt 120 gttttaattt tgatgacgac taaaaacgaa aaattcaaaa tagaatatcg tgaaaaggtt 180 cagcattgaa gacacccggt gccacactct aatcagctaa ttaactcaaa acacctgcaa 240 aggcctttaa atggtctcag tctggctctg taggctncac aatcatgggg aagacggctg 300 acttgacggt tgtccagaag accaccattg gaaccctgca caaggagggc aagacacaaa 360 aggtcattgc aaaagaggct ggctgttcac agagcgctgt gtccaagcac attaatagag 420 aggtgaaggg aaggaaaaga tgtggcngaa aaaagtgtac aagcaatagg gataaccgca 480 ccctggagag aattgtgaaa caaaacccat tcaaaaatgt gggggagatt cacaaagagc 540 ggactgcggc tggagtcagt gcttcaagaa ccactacgca cagacgtntg caagacatgg 600 gtttcagctg tcgcattcct tgtgtcaagc cactcttgaa caacagacag cgtcagaagc 660 gtctcgcctg ggctaaagac tggactgctg ctgagtgctc caaagttatg ttctctgatg 720 aaagtaaatt ttgcatttcc tttggaaatc agggtcccag agtctggagg aagagcggag 780 aggcacacaa tccacatgtt gcttgaggtc cagtgtaaag tttccacagt cagtgatggt 840 ttggggtgcc atgtcatntg ctggtgttgc tccactgtgt tttatnaggt ccaaggtcaa 900 cgcagctcta taccaggacg ttttagagca cttcatgctt cctgctgccg accagcttta 960 tggagatgca gatttcattt tccagcagga cttggcacct gcacacagtg ccaaagctac 1020 cagtacctgg tttaaggacc atggtatccc tgttcttaat tggccagcaa actcgcctga 1080 ccttaacccc attgaaaatc tatgggctat tgtgaagagg aagatgcgat atgccagacc 1140 caacaatgca gaagagctga aggccgctat cagaagcaac ccgggctctc ataacacctg 1200 agcagtgcca cagactgatc gactccgtgc cacgccgcag taattcgngc aaaaggagcc 1260 ccaactaagt attgagtgct gtgcatgctc acacttttca tcttcatact tttcaattgg 1320 caaagatttc taaagttaca ttttccgttt gttcattanc cgtaatcgaa attaaaacaa 1380 ataaacgctt gaaatatntc agtntgtgtg gaatgaacgt gatacattat acaagtttca 1440 cttcttgaat ggaattagtg aaataaacga acttttcgac gatattctaa ttatatgacc 1500 agcacctg 1508 // ID DNA-TTAA0-2_DR repbase; DNA; ZEB; 1359 BP. XX AC . XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD = 0 bp; KW TTAA target; DNA-TTAA0-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1359 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1248-1248 (2008). XX DR [1] (Consensus) XX CC This transposon, like DNA-TTAA0-1_DR, is inserted specifically CC between 5'-TT and AA-3' (no TSD). The TIR is 57-bp long. The CC classification of this element is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1359 BP; 405 A; 223 C; 281 G; 448 T; 2 other; acagtttttc tccattggtt tggctcattt cttgaaacag aaattacatt ctcaaaacaa 60 catggacaaa cctccaaacc acttggcaat tgttcacaac agaattgaat ttctcattca 120 tttcatcaaa ttgcaaatgt ctaagtacat ctttgcaaat gattaagtac agatagccta 180 catttagcac aatttccaag tgaatagatc ttgttgatct aaactgattg ttaattctca 240 gtctaatggt tgttctctcc aaaatgtgtc agcgtcattt cattgtataw gtcattacat 300 gcacaatagt ctgttcaatt gtcaaaatta gtcaagacca taataccatg aataccatac 360 agtaggtgaa tccttggaac atttgatcat tgtaggggat ggttgcgaca ttcacgccgt 420 ttcttccctc gttgcatcgc aagggagaat acgctgtgat gtggacgaga atctgtggcc 480 acacagacta cagcatgtgg atggtcagga gggtgaggac ggcagacagt gaagaactgt 540 atgaaactcc agctttattt atagcactct asttttttgt ttgtgtgttt atattacagt 600 ttattatgct gtatacagtt tcttttgctc tgatatatgg aagttacttt gtgtgtggtt 660 gatcattaca ataaaaaaaa ggagttacca tttccttggc agtatgagtg caatgtgtga 720 cgtcaaacct tggaggctac tcttacccta aaatcctatt tctttttttc agttttatac 780 acaattgtat tctgttagct agacaaaaac agtacagtaa ccattgtcaa tgaaggactg 840 ggctaagact gtaaggtgga catgagggat attttagtgg tcctttgcca taatgaccaa 900 aacatctaaa cattttgact tgcagtgctt acacaatgcc gaagggacga agcattttgg 960 gggcactgac tgtttaaatg agaaggaaat ttagttttga cacatgagtg aactgttttg 1020 ggaaaggtat gagcttttgc aagtgagccg tagtgttgtg ccgaaccgta cacactgttt 1080 tgccaaatga tcttagagtt ttgagaatgt catttctgtt tcaagaaatg agtcaaacca 1140 atggagaaaa atatttgcca tttttgtggc actgattttt atatgggaaa ggaatttagt 1200 tttgatgcat gagtgaactg tttttggaga gatatgagct tttgcaagtg agctatagtg 1260 ttgtgctaaa ctgtaacact gttttgccaa atgatcttag aattttgaga atgtaatttc 1320 tgtttcaaga aatgagccaa accaatggag aaaaactgt 1359 // ID Gypsy-21-LTR_DR repbase; DNA; ZEB; 1470 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-21_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-21-I_DR; Gypsy-21-LTR_DR; Gypsy-21_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1470 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-21_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 16-16 (2005). XX DR [1] (Consensus) XX CC Gypsy-21-LTR_DR is a long terminal repeat of the Gypsy-21_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-21-I_DR. XX SQ Sequence 1470 BP; 476 A; 211 C; 267 G; 513 T; 3 other; tgtaatccat agtattacaa atagaaataa aagaaaaata tatgcaaata taaactgata 60 tacattaagc ttaattaata aattaaaaac ccaattaata gattaattta aaagtacttc 120 ataaatagaa ttcaagttta aatcttaaaa ttgcctaaag gtattttttg ttttcctgtt 180 tgtttattgt tgatgattta gttttatttt ctgaattttt gtgaattgtg aagatgtata 240 ttgttgcttt gcccttaaga ggttaaattt gagctcacca ctgacctatc aggtgaaagg 300 aaatgcgtgc agcggaacga cgtaaacaac gtcagtgacg tcagaggaag aaacggaaga 360 gaagcgagcg catggaagct gagatcggaa agctaatcca gcgagaaacg agtgaaatcc 420 tgtcttttaa acttgacaga acgtctagaa agatattgat agtgattcat cggacctgat 480 tgtttgttga tcttgcgata ttaagttgat attgacgcgg atgcagcgag gacacggatc 540 gcggttgtca tcgcgtgcaa tgagatggcg ggccacccaa cggatctcga ttgaacacat 600 tcacggtgaa attgcagcaa ttgcacctgt tgtttaatgt ttcatcttca tgacagaact 660 gtgctggatg acagagtaag ctggcatcac actttcttcc tttccacctg tactgtggta 720 tgttctgggc agcaggacgg taggactcaa tttatggatt gaaatcgaaa caatactttt 780 cctcgtgatt tcagagactg atatataaag tctacaacaa agactgttgg aattagcctt 840 cactgttttt catttgcact ttattagtgt gaagacaaat tgttatttta tttcataatt 900 gtgatttatt cattgtcatt tggggtattt atcttttgtt ttccaagctg agttgaaaat 960 ttgagtattt tgcttaaact aacataaaat tagctgatac tacatgtagt ttacttctaa 1020 tagaataaag taatataatt aggacacgtc aattagctat aaataaaaat aggtcaaacg 1080 ttgtacttaa aataagctaa aatatatata taaaaccata aaattgaaat aaataaacca 1140 gtttgtcaga gagaaagaaa agattaaata agaggttttc aagaaaggaa aaaaagagac 1200 aawtyattct tttgtttatt tggttttcat ttttttatya tcattaagag ggtttattga 1260 taaccaagtt ttttctttga aagaagttgt tcttctttat tgttgttttt ttttcctctg 1320 gaaagagaaa ttaagacaat tattattttt atttctttct ttattatata aagttcatta 1380 tagttgaaac taatctctgt tgtctgttct ctttagtcgg tctctcattt ctgctcctac 1440 gaacctacag tactggtcca cgttttccct 1470 // ID DANA repbase; DNA; ZEB; 394 BP. XX AC L42295; XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 06-APR-2011 (Rel. 1.11, Last updated, Version 2) XX DE Danio rerio (clone DANA-m1) DANA retroposon. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; retroposon; KW Repetitive element; microsatellite; DANA. XX NM DANA. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-394 RA Izsvak Z., Ivics Z., Garcia-Estefania D., Fahrenkrug C.S. RA and Hackett B.P.; RT "DANA elements: a family of composite, tRNA-derived short RT interspersed DNA elements associated with mutational activities RT in zebrafish (Danio rerio)."; RL Proc. Natl. Acad. Sci. U.S.A 93(3), 1077-1081 (1996). XX DR GenBank; L42295; Positions 1 394. XX CC misc_signal 1..64 /note="tRNA-related region" CC misc_signal 11..22 /note="Pol III promoter A box; putative" CC misc_feature 55..64 /note="Pol III promoter B box; putative" CC repeat_region 231..250 /note="TG dinucleotide" CC /rpt_family="microsatellite" polyA_signal 368..373 CC /note="putative" CC DNA is the first SINE isolated from zebrafish (Danio rerio) CC exhibiting all the hallmarks of these tRNA-derived elements. DANA CC is unique in its clearly defined substructure of distinct CC cassettes. In contrast to generic SINE elements, DANA appears to CC have been assembled by insertions of short sequences into a CC progenitor, tRNA-derived element. Once associated with each CC other, these subunits were amplified as a new transposable CC element with such a remarkable success that DANA-related CC sequences comprise approximately 10% of the modern zebrafish CC genome. At least some of the sequences comprised by the CC full-length element were capable of movement, forming a new group CC of mobile, composite transposons, one of which caused an CC insertional mutation in the zebrafish no tail gene. Being present CC only in the genus Danio, and estimated to be as old as the genus CC itself, DANA may have played a role in Danio speciation by CC massive amplification and genome-wide dispersion. There are CC extensive DNA polymorphisms between zebrafish populations and CC strains detected by PCR amplification using primers specific to CC DANA, suggesting that the DANA element will be useful as a CC molecular tool for genetic and phylogenetic analyses. XX SQ Sequence 394 BP; 90 A; 81 C; 119 G; 104 T; 0 other; ggcgacgcag tggcgcagtg ggaagtgctg tcgcctcaca gcaagaagct cgctggttcg 60 agcctcggtt aacaaggttt gaacgagcct cgctcagttg gcgtttctgt gtggagtttg 120 catgttctcc ctgcgttcca tgggtttcct ccgggtgctc cgttccccca cagtcctaag 180 gcatgtggta caggtgaatt gggtaggcta aattgtccgt agtgtatgag tgtgtgtgaa 240 tgtgtgtgtg gatgtttccc agcgatgggt tgcggctgga aggcgatccg ctaaaactaa 300 aaaaaaaact tgattaaaaa cttgctggat aagttggcgg ttcattccgc tgtggtgacc 360 ctggattaat aaagggacta agccaaaaag aaaa 394 // ID BHIKHARI-2-I_DR repbase; DNA; ZEB; 5912 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Internal portion of retrotransposon bhikhari from Danio rerio - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW BHIKHARI-2-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5912 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1791-1791 (2008). XX DR [1] (Consensus) XX CC TSD of this LTR-retrotransposon is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 2883..4664 FT /product="BHIKHARI-2-I_DR_1p" FT /translation="MARPKDPVGHWKDLEAWLSVATDSLLPKAAETLQHLT FT QDQLDENLNSLMKQDPSKSYNHKDLAKITGTLSHVLIATLKLSDRHSAQLQ FT HKLTHVQSRINQLELEAQERPEQPDETDQAAKEEIDKLQETLAATTQEMEQ FT AKAXHADVANKLEYVEQLLEEVNADSKDKDSRIKALETHLSEARHEVRRLT FT QQLDYIKEESNSIKDELKHAYELTHNEKAEGSLPKPSPSPSVPDNHVSPCG FT LDLRDLDKLVKNLGKFTPNLPGSQDVHAYLQDIDFHLEMRPNVTDKDRLYL FT LRATSSPEVRSFLDRQPAHTKTDYLLLREALIREFDDIESEQGLVAALETK FT QGRHESSQAYYSRLRRAYFGTRNEPEMEEDLNFKILFLRNLHPGVSHHLGV FT LACPHTMNTQQLRDLAHKAYVKQKTASEKSAKIPAVYDFNTQSQDLALEGA FT QGPDNARLPPSEWNVSSYNRQRDCHADTRPKQWNRRWKGLSGRQHSPERHR FT ERPWNRSTSFVNQRGTSSWESSGTSKVKQHHLDATSPRGQRKNSQRFHADQ FT APTESPQGTTSPCFDPQELMKMILKEFFQRREADRKWERKGKPDSA*" XX SQ Sequence 5912 BP; 1692 A; 1395 C; 1126 G; 1671 T; 28 other; tattaatggt ggagaatttt taggcacagc acgatctgac ttttaagcac aaaaacattg 60 tttatctttt ctattttgtc cggaagaata gatcagagac agagaaacaa agttgttgtg 120 ttactgtatg atttattgtg gtacccagtg tgttgaaacg cacgaatgcg caactcgagt 180 gcttttgaga aagcgaactg cagtaaatag ttatttgtcc acgagacaaa aacgatacgt 240 cgctgtaaag cgcacggatt gagaaacaat aagcgtgttc cttattcttt ctcctataag 300 gccttcacag ttgatttaaa ggtgcgtgtc ccagctttaa agactaactg tgcatttctc 360 cctggtcaaa ttaagcacag agtagctaac tgtctttttg ccatttaatt cactaacaac 420 agagttagtg aactcagcgc gccgcgctcg agagagcggc agagcgtatt ctcacgggag 480 aacttatctt gatcaagtaa agtgtgcgtg attgaatcca cgcgtgtctt tgactgtgaa 540 tagtcttgac acagagcgct aggatccgcc gagatcttaa tagtgctata ttatgttgag 600 cagtgtgaaa tgcgccgcya tttcaygtra tgcatgtawg tttgtgkgca taaccgcata 660 ttggttcatt ctgtgtacgt cgtcatagta acagcctccc gaaagtcctg gaagactttt 720 cagagtactc tcaagtgcgc atgcgcagtc ctccgcgctt gcatcggctg tagacattag 780 cacaargcta agctaactga agctcacgct gttgcattga agtgtgtggt atggctatgc 840 cattatagca ttaaaggcta gtgtagtagc gctcttcttr tggtcttgaa tgcagtgtgt 900 taagttgaag tgacaaccta attcacgcta gcragatttt gaatcttgat ttctcactct 960 ttataggcta tttgccttat ttctcatttt tctccctttc ttggaaaagg ctttgactaa 1020 tgcagattta gtaaacccca ttgtacgcga tctcatcgct ggtttaatca caattgtcaa 1080 atgcttttcc tgyttaacct ggcctgtttt attaattcaa atgctcataa gttgttggtt 1140 tattagcaat gaattatagt ttatgactat aacataatgc ttttgattgc tcaatcaaat 1200 taatgcttaa tgcttaatta atgattaatg cttttctctt ctgaaactta tcaacattat 1260 aggaacaacc attttctatg actgttgata tttgaagttg atttataagc acagtgttta 1320 cggattaatt aaattataac atttattgtc ctrggacttt acaaacacaa tcactattct 1380 gaaattttaa tttcatttaa tagttcattg atgcttggct gccctatctc aataacagtt 1440 attgattaat tttaaataaa cacttttgat caaatttaat gacatcagga ataccacgca 1500 ttccctatca tcattgattt tgaaattgat ttgttctcat aactgatttg caggaatgct 1560 tgctgattaa ttaaatcata attgtgttta tgactttaat gacaacctca ctgagtgtta 1620 gtctcaacca acattattac tatttggaaa ttttaatttc attaatagtt aattggtgtt 1680 tggtttccct tttctattac aattattgat ttatttaatt atagcaactc ttaatcaagg 1740 ttattgataa aaatcatatt tctgaatcat ttcatgtcat tactaatttg aattagtttt 1800 aaattaattt gcaatcaagt ttttttttta ttatccttya ggtattaaat trcctcacag 1860 gacagtgctc cccctggtgt ttggttgatg caatgagaca gtgtagcctg taggtggcta 1920 caactgaact cwagtcatcg ttaaacctag cataccagag cagtagcctt cgattagcca 1980 aaccccgaag gaaatatagg attttcctgt ctacttaggc taattagttr tcttgataac 2040 acaataaacc aacccctttt gatgtattga ctcaacagtg ccaccacttt gtcaactttg 2100 taatttgaaa acttgcttca ctttctattg tttcttttaa ctgtcttctt ccgagttgac 2160 tacgtctgcc cttttgagat tgtacctgtg tgaatcatga aagcagagaa ygttagaaac 2220 atttaacaga tgttgtgaaa cctaaacatt tcgtgttaaa atgatctgac tctctaccct 2280 ttctcgccat aggacgctac ctrattaagg acgaaacgag tcaaccagaa gacaggtgtt 2340 ttcaaatttg attttagatc cacatccaaa agtggtctac ttggtgtatg gtgtcaatcc 2400 taatgagtaa gaaaccaata ctgtcattgt tattgtactt tctttcacag cggaggaaat 2460 ttaataacaa tcacagtcct ggtgtttaga gctccccatt tccaacttcc agccgagaag 2520 catcctttca cctmatgctt gacaaaagct aaaggaatyg ttctctctag gaacctggaa 2580 gcagccaatt atcactctac tccttrtcag tcttcttctc ttcctctgtt ctctaactgt 2640 gtgtgtttac tgactgtccc cacagagact acaagttaga tgcacgcccc ctagaggatc 2700 acagtcacag agaacaacac ccacttgttt attccacact caacagtagc tcgcacctga 2760 cattacaccc tgacactttt tgtttgccat ttgtttttgt tttcttttgt tcttttctct 2820 ctctctctcc cctaggtctg tgtcagtgtc ggttataggt cttccccctc accgccgcaa 2880 acatggctcg ccctaaagac cctgttggcc actggaagga cctagaagca tggctaagtg 2940 ttgcaacgga cagcctcctc cctaaggccg ccgaaacgct acagcatctg acacaggacc 3000 agctggatga gaacctgaac agccttatga agcaagaccc gagcaaaagc tacaatcaca 3060 aagacctagc caagatcaca ggtactttga gtcacgtcct catcgccacc cttaagctga 3120 gtgacaggca ttccgcccag ctccagcaca agctgacaca tgtgcagtcc cgcatcaacc 3180 agctagaact ggaggctcag gaacgaccgg aacagccaga tgagactgat caagctgcca 3240 aagaggagat tgataaactt caagagactc tagccgccac cacccaagaa atggagcaag 3300 ccaaagccga mcatgctgac gtcgctaaca agcttgagta tgtcgaacag ctactggaag 3360 argtaaatgc tgactctaaa gacaaggaca gcagaatcaa agccctcgaa actcacctga 3420 gtgaagctag acatgaggtc agacgactaa cgcagcagct ggactacatc aaagaagagt 3480 ccaacagcat taaagatgaa ctcaagcatg catatgaact gacacacaat gagaaagctg 3540 aggggtcact gccaaaaccc tcaccatctc cttctgttcc agacaaccac gtatcaccat 3600 gtggcctgga cctcagagac cttgacaagc tagtcaagaa cctgggcaag ttcacgccga 3660 atttgccagg tagccaagat gttcacgcct atctgcaaga cattgacttt cacctggaaa 3720 tgagacccaa tgtcactgac aaagatagac tgtatttgct ccgagcaacc tccagccctg 3780 aagtgcgcag cttcctggac cgtcagccgg ctcacacaaa gactgactac ctcctgctcc 3840 gagaagccct catcagagag tttgatgaca tcgagtctga acaaggactt gtagctgccc 3900 tggagacgaa acaaggtcgc catgagtctt ctcaagccta ctacagcaga ctcagacgag 3960 cctacttcgg cacccgcaat gaacctgaga tggaagagga cctgaacttc aaaatcctct 4020 tcctgagaaa cctccatcct ggggtaagcc accatcttgg ggttcttgca tgcccccaca 4080 ccatgaacac tcagcaatta cgagatttgg cacataaagc ctacgtcaaa caaaagacgg 4140 cttcagagaa gagtgccaaa atccctgctg tttatgactt caacactcaa agtcaagatc 4200 tggcccttga gggtgcccaa ggcccagaca atgctagact acctcccagt gagtggaatg 4260 tgtcgtctta caacagacaa cgagactgtc atgctgatac cagacccaaa cagtggaacc 4320 gtcgctggaa aggactgtct ggacgacaac actcacctga acgtcaccgg gagagaccgt 4380 ggaacaggtc tacctcattt gtaaaccaaa gggggacaag ctcgtgggaa tcaagtggta 4440 cttccaaggt aaagcaacac caccttgatg caaccagccc aagaggtcaa cgaaagaact 4500 cacaaagatt ccacgctgat caagctccaa ctgaatcccc acaagggaca acatcaccat 4560 gttttgaccc tcaagaactg atgaaaatga tcctgaaaga gttcttccaa cgaagggaag 4620 cggatcggaa gtgggaaagg aaaggaaaac ctgattctgc ctgacaagca acgacctcac 4680 taaacagcac ctcttcacct cacttacaga gaactgtaag caccataact tcaaagactc 4740 aatagactgt acctgaacac cagagttaga cgctaactca acacctgatg cctcaggtca 4800 ccactgcatc acacaaacac ttgtggatga aacycttcca attcctaaga gtgctgtctt 4860 gatcgtttgc cacgcagctg aactcagaat caatccaaat cctacctgtg ccatgccaac 4920 tccagtccct caactccaag tgaccagaaa gggggaggra ccaacctgct gacacagacc 4980 cactatccta aactcttatt ttaacgctaa cctgtttttg tcatttgtgc cattttagga 5040 agacacctag ccctagctat gtttcctaaa cccaatacat gcactctccc ctccactgca 5100 ctactaggac caatctctag tagagaggag ctattacata tgtgacaytt gtttgtygtg 5160 ttaacctcac tcaatgtcct tatgtctcag tgttcttttc tttctctttc tctctgtctt 5220 ttaattaatt tctcttgcct catcagggaa cccccacaga tgatgtttaa gtccagagct 5280 cagaattagg attgactcct ttcaccaagc ctcacggatc aatgccttaa aggtgtgatc 5340 gctgtattgg taatttaacg ttgtagcacc tgtctgacga cacgttttag gcgaaaactc 5400 aaaagatttg ttttccacat ttcttcatct tcaatccaga accgaaaaag ggattgaggg 5460 taaagaatra aatttattgg tttgactctc gtcaaaatag cattgtaggg tatacgttga 5520 ctttgacact tgccctcttc ctccacgagc gagtaaaggc catccaaaag ttgttcaaat 5580 gtgcacgccc cacatcttgc gcatgcttag atgcgataac gctgaccaaa aatttaattc 5640 tttactgatt gagataatgc ctatacaact tcgttgaacc ttcaatacct cggttagtct 5700 taaaagccta aattctgatc atactaaact atgctcttct acaccttgag cattgctaac 5760 ccgagcattc caaagtkcct aaaccctgca ccaagattca gttagtgatg atgttgtttg 5820 tgtcttgtgt ttgcctgttc tctctgcttc tgcctgtttg ttccagtgcc tgccaatgtt 5880 cacaccggga acatcaccag wcaaaggggg ac 5912 // ID Gypsy137-LTR_DR repbase; DNA; ZEB; 727 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy137-I_DR; KW Gypsy137-LTR_DR; Gypsy137_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-727 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1536-1536 (2008). XX DR Genome; chr3; Positions 1040146 1039420. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 727 BP; 196 A; 104 C; 161 G; 266 T; 0 other; tgtaacgaaa taaatatata ccatgttaat agttcagtat taaatcctgc tttaatggga 60 gtgcagttcc tgctagttcg ccggtggcgc tattgcgcac agagtctgtg tatagactgc 120 agcagagata gggagaactg ctgtgggtga gtcgggtttg tgttagctcc acactaaaat 180 ataaaaatat catttaaatg attgattgga tgcagcttta aaagtctatc acactttata 240 tacatttatt gatcgacaga gtgggttttc ttgtcatatt tgtttgattt tggtactttt 300 aatctcatct tgatattttc ccccatgtgc ttatacagca agtgctaatg gagttatatg 360 tgttttagat gtggatgcta aattaaaccg tgttgatctc actgttactg ctctgtgtgc 420 aggtaatttt ctattatatt ttgtgttaag gtattaaatg tgtttattca gtgatattag 480 ttacgcattt tctgtatttt tatatttctg tctttgtttt atataataat aatataagct 540 ttcagtatag actgcagcag agatagggag aactgctgtg gatgtggatg ctaaattaaa 600 ccgtgttgat ctcactgtta ctgctctgtg tgcagggcaa aataaaatcc tcctcatcca 660 agtccagtgt gtgtgtattg ctgaagggtg aatgaagtgg gttcgggagt gaagtataaa 720 ggtcaca 727 // ID ERV1-N8-LTR_DR repbase; DNA; ZEB; 577 BP. XX AC . XX DT 04-DEC-2008 (Rel. 13.12, Created) DT 04-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, long terminal DE repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR; ERV1-N8-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-577 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2178-2178 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 577 BP; 200 A; 68 C; 152 G; 155 T; 2 other; tgaggtggat tttgtgctct ttggagtgtg aagttaatct atatattcag gttatatgtg 60 tataatctgc ttgtagactt gcataccatt aaaacaattg gaycatgtaa ggggaaatgg 120 cctgaggaaa caactagggg tcataaaaga agtggatgaa aagagagtgg taaaagagcc 180 tgagactaag gtgtctcagg aaatggataa gaatgtgtaa acatagatgg ataagttata 240 tagttaccag gtttgggtca ggggttaacc taggtggaaw ttgtgtgaca tatgtagaaa 300 taagggaaaa acagatgtca gaggtggaat ttggggtcag gggttaaact aggagggatc 360 ttgtgtaata tatggagaac aaaggaaaac aactgatgta aaagggtatt taaacaaggc 420 cagaacatga gtgaggcaga atttttcaga cagagatgct actgggggtc tcctgaaagt 480 tctcctttgt tgtcgacaat aaagtatatt cttctgatat tcataacctc cagactgagt 540 ttgatttagt aagagaaaaa ccaagaaaac cactaca 577 // ID Gypsy39-I_DR repbase; DNA; ZEB; 5049 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy39_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW gag; integrase; reverse transcriptase; Gypsy39_DR; Gypsy39-LTR_DR; KW Gypsy39-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5049 RA Dib M.R. and Naveira H.F.; RT "Gypsy39_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 808-808 (2007). XX DR [1] (Consensus) XX CC Gypsy39-I_DR is an internal portion of the Gypsy39_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy39-LTR_DR. CC Gypsy39_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1585-aa polyprotein CC Gypsy39_DR1p (pos. 181-4935) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC nine independent insertions bearing at least 85% Homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 181..4935 FT /product="Gypsy39-I_DR_1p" FT /note="ORF." FT /translation="MALSNVEAFLNNPSLEQFDKCRKDELFKIAEHFQISI FT SRQFLKKTIKKILYDHLVEIGVFQTTSENVTELDPNYSAERPCSPESAGHG FT SNVTDVEAKVEAKVALPPFDPSTPVSTDSKTDGRLRLRLAKLQYETQEKAR FT THKAELDLKLEIRRLEIEADKQVRLRQLELDAAAKAAMTSPVVSTDVLMPG FT QVIKPDVKSSFNQIDVSKQISLVPNFREDEVDSYFTAFERIATTLNWPKDI FT WTVLLQCKLSGKAQEVCATLSVEESLKYDIVKSAILRAYELVPEAYRQRFR FT KHKKTPQQTFVEFAREKSILFDKWCSASKTADYSTLRELILLEEFKNCLPE FT RVVLYLNEQKVTSLSEAAVHADEFVLTHKPVFPSARPESVTVTPPACSSSS FT RTLAKSEIPKETRVCFYCRKIGHLINDCRALKNKNQKPKSVGFVQSAPENK FT EQLENTFDESYKPFVTKGSISITGKPEDQHIINILRDTGASHSIIAADALP FT FDEQTSAGSSILMQGIQMGIVRAPVHKIHLQCELVNDFVHVAVRPALPVKG FT ISLILGNDIAGGKVMPVCEVLDKSPMPSFCDKLSDAFPETFLACVVTRAQQ FT KQTDNEITLSDSFMTVDDATVSNKVTNVEKQPLTEKCAFDNQVNLNLNLTR FT EQIILAQQKDQTLSKCFKLVVPREQLKQKQVGYFVENALLMRKWSSAMEQD FT DEWSTVYQVVVPSVFRQQVLVLAHDHVLSGHLGITKTYHRILKHFFWPGLK FT QDVARYCRTCQSCQFSGKPNQVIPPAPLNPIPIMMEPFERVIVDCVGPLPK FT TKAGNQFLLTIMCAATRFPEAIPLRKITAPVIIKALTKFFATFGLPKVVQS FT DQGTNFMSNVFNEVLRSLSIQHCVSSAYHPESQGALERFHQTLKSMLRTYC FT LDTGNDWDEGVALMLFAVRETVQESLGFSPAELVFGHTVRGPLKVLKEQLL FT DSESSSKTNVTQYVSRFRERLRNACSMAQQTLKSVQGKMKKRFDVKTVVRE FT FKPGDQVLVLLPIVGSALSARFSGPYVIDHKISNTDYVLRTPDRRRKTRVC FT HVNMLKAYLVRQDQIEKKEVTKPIALMNMDFDISTEADEDGLVLRNAPQQC FT ALLQNSKILKNLSSHLTHLTETQKKDVKNLIEQFPELFHDTPTQTSVIQHD FT IVIHNSNPIKQHPYRVNARKRQIMKQEVEYLCENGLAAPSCSPWSSPCILV FT PKSDGSNRFCTDYRKVNAVTVPDCYPLPRMEDCVDNLGSARFVTKLDLLKG FT YWQVPLTPRASDISAFVTPDHFMQYKVMAFGLRNAPATFQRLITTVLAGVA FT GCNAYLDDVVIYSSEWPEHMSQLKTVFQRLANANLTLNLAKCEFAQATITY FT LGKRVGQGQVRPVEAKVTAIAEFPTPNTRRQLRRFLGMAGYYRGFCKNFSS FT VVTPLTNLLSPSRQFKWSSECQHAFESIKALLCEAPVLMAPNFEKPFKMEV FT DASAVGAGAVLLQEDKEGIDHPICYFSRKFNKHQLNYSTIEKEALALLLAL FT QYFEVYVGSSSTPVVIYTDHNPLVFLSRMYNQNQRLMRWSLIIQSYNLEVR FT HKKGKENIVADTLSRPDT" XX SQ Sequence 5049 BP; 1463 A; 1033 C; 1086 G; 1467 T; 0 other; tttttggggg ctcgtccggg ataattaacg tcccaacatt tatttctgag gattttggtt 60 ctacagatac tcaattcttt ttgtacccaa agtaattttg tggattatac tagaggtagt 120 attagaggag tgagttactg aatttgtcta tttgcttctg actagcttgt gattaaaaaa 180 atggcattgt ctaatgtgga agcttttctt aataatcctt cactcgagca gtttgataaa 240 tgccgcaagg acgaactttt caaaattgca gagcattttc aaatttcgat cagtcgccaa 300 tttctgaaaa aaactattaa gaaaatactt tacgatcatt tggttgaaat aggagtattt 360 cagaccacta gtgagaatgt cactgaactt gatcctaact attcagctga aaggccgtgc 420 agtcctgaaa gtgcagggca tggtagtaat gtaactgacg ttgaagcgaa ggttgaggct 480 aaagttgcgt tgccgccctt tgatccatca acccctgtct ctactgattc aaagacagat 540 ggcagactta ggctacgtct ggccaaactc cagtatgaga ctcaggagaa agcacggaca 600 cataaggctg aactggatct taagcttgag atacggcgtc tcgagattga agctgacaag 660 caagttaggt tgcgccagtt ggagctggac gctgctgcta aggctgctat gacttctcct 720 gtggtcagta ctgatgtact catgcctggt caggtgatca agccagatgt taaatcttct 780 ttcaaccaaa ttgacgtaag taaacaaatt tcacttgttc ctaatttccg tgaagatgag 840 gttgactctt actttactgc gttcgaacgt attgcaacta cacttaattg gcctaaagac 900 atctggactg tgcttctgca gtgtaagctg agtggcaaag cccaagaagt ctgtgcaaca 960 ctttctgttg aagagagttt gaaatatgat attgtaaaat ccgcaatttt acgcgcttac 1020 gagctagtac cagaagcata tcgacaaaga ttccgcaaac acaagaaaac tccacagcaa 1080 acatttgtag aatttgcacg ggaaaaaagc attttatttg acaaatggtg tagtgcttct 1140 aaaacagccg attatagtac tttacgagag ttaattctgc ttgaagaatt taaaaattgt 1200 ctccctgagc gtgtggttct ctacttaaat gaacagaaag taacgtcttt gtctgaagct 1260 gctgttcatg ctgacgaatt tgttcttaca cacaaacccg tgtttccctc tgctcgtcct 1320 gagtctgtga ctgtcactcc gcctgcttgt tcctctagtt ctcgtacact ggcaaagtcc 1380 gaaattccaa aagagactag agtctgtttc tattgccgaa aaattggaca tttgataaac 1440 gattgtcgtg ctttgaaaaa caaaaatcaa aaacctaaaa gtgttggttt tgtccaatct 1500 gcacctgaga ataaggaaca acttgagaat acatttgatg aaagttacaa gccattcgta 1560 actaaaggat caatttcgat aactggtaaa cctgaagacc agcacataat aaacattctt 1620 cgtgatactg gtgccagtca ttccattata gctgctgatg cgttaccgtt tgacgaacaa 1680 acctcagctg gatccagcat ccttatgcaa ggtatccaga tgggaattgt gagagcacct 1740 gtgcacaaaa tccacttaca gtgcgaactg gtcaatgatt ttgtgcatgt ggcagttcga 1800 cctgcattac ccgtaaaagg catatcactt atccttggaa acgatatagc tggtggtaag 1860 gtgatgcctg tttgcgaggt tcttgataaa tctcccatgc cttcattttg tgacaagttg 1920 tcagatgcat tccctgagac ttttctcgca tgcgtcgtga cccgagctca gcagaagcaa 1980 acggacaatg agattacttt atctgattca tttatgactg tagatgatgc aactgtgagt 2040 aacaaagtga caaatgttga aaaacaacct ctgactgaaa agtgtgcatt tgataatcag 2100 gtcaatctga acttaaattt gactcgtgag caaatcatcc ttgcccaaca aaaagatcaa 2160 actttatcca aatgtttcaa attagtcgtt cctcgagaac agttgaaaca gaaacaggtg 2220 ggttattttg ttgagaatgc actactgatg cgtaagtgga gttctgcaat ggagcaagat 2280 gatgaatgga gtactgtata tcaggtggtt gttccttctg ttttccgaca gcaagtatta 2340 gttcttgccc atgatcatgt tttgtctggt catctaggaa tcacgaaaac ctatcacagg 2400 atcctgaagc atttcttctg gccagggttg aagcaagatg tcgccagata ctgccgcacc 2460 tgtcagtcct gccagttcag tggtaaacct aatcaggtaa tcccacctgc tcctcttaat 2520 cctataccaa tcatgatgga accttttgaa agggtaattg ttgattgtgt aggtccgttg 2580 ccaaagacga aagccggtaa ccaatttttg ttgacaataa tgtgtgctgc tacacgtttt 2640 cctgaagcca taccattacg taaaatcact gcacccgtaa tcattaaagc tctcaccaaa 2700 ttctttgcta catttggcct acccaaagtc gtacagagtg atcaagggac aaattttatg 2760 tcaaatgtgt ttaacgaggt gttaaggtca ctttctattc aacattgtgt ttccagtgct 2820 taccatccag agagtcaggg agcactagag cggttccacc agacactgaa atccatgttg 2880 cgaacgtact gtctagacac tggcaatgac tgggatgaag gagtggccct gatgctgttt 2940 gcggtacgtg aaacagtaca ggagtccctc ggattcagcc cagctgaact tgtgtttgga 3000 cacactgtcc gtggcccact taaagtgttg aaagaacaac ttttagattc ggaatctagt 3060 agcaaaacga atgtgaccca atacgtgagt agatttcggg agcgattacg taatgcttgt 3120 tccatggcac aacagacact caaatctgtg caaggtaaaa tgaagaaaag atttgacgtt 3180 aaaactgtgg ttcgtgaatt taaaccagga gaccaagtac tggtcctttt gccaattgtt 3240 ggttctgcat tgtctgcacg attttcaggg ccatatgtga ttgaccacaa aatcagtaat 3300 actgattatg tcctgcgcac acctgaccgg agacgaaaaa cccgtgtttg tcacgttaac 3360 atgctcaaag catatttggt acgacaagat caaattgaga aaaaggaagt gacaaaaccc 3420 attgcactaa tgaacatgga ttttgacatt tccacagagg cagatgagga tggattagtg 3480 ctgcgcaatg ctccacagca atgtgctttg ttacagaact ctaagattct gaaaaacttg 3540 tcctcacact tgactcattt gacagaaacc caaaagaagg atgttaaaaa tctgattgag 3600 caattcccag aactgtttca tgacactccg acgcaaacct ctgttataca acacgatatt 3660 gttattcata actccaatcc aatcaaacag catccttacc gtgttaatgc ccgaaagaga 3720 caaattatga agcaggaagt ggaatatcta tgtgaaaatg gactagcagc tccaagttgt 3780 agtccatgga gctccccttg cattctcgtt cccaaatctg atgggtcaaa tcgtttttgc 3840 actgattacc gcaaggtcaa tgcagtgaca gttcctgatt gctatccatt gccgcgcatg 3900 gaggactgtg ttgataactt gggttctgct cgttttgtga ctaaactaga cttgttaaag 3960 ggttactggc aagtgccatt aacacctcgt gcatctgaca tatcggcctt tgttacccct 4020 gatcacttta tgcagtacaa ggttatggct ttcggacttc gcaatgcacc tgctacgttt 4080 caacgtttga taactactgt actggcagga gttgcaggat gtaatgcata tttggatgac 4140 gtagtgatct attcttctga atggccagag cacatgtctc agttgaaaac tgtgttccaa 4200 cgactagcca atgctaatct aaccctgaat ttggcaaaat gtgagtttgc tcaggcaacc 4260 atcacctatc ttggtaaacg ggttggtcaa ggacaggtcc gcccagtaga agcaaaagta 4320 acagccattg cagaatttcc taccccaaat actcgtcgtc aattgcggag gtttttagga 4380 atggcgggat actaccgtgg cttctgcaag aacttttcaa gtgtggtaac gcctcttaca 4440 aatttactga gcccttctag gcaatttaag tggtcatcag aatgccagca tgcctttgaa 4500 agtattaaag cacttctgtg tgaagctcca gtgttaatgg ctccaaattt cgaaaagcct 4560 tttaaaatgg aggtagatgc aagtgctgta ggagctggcg ctgtcctgct gcaggaagat 4620 aaggaaggaa ttgaccatcc gatttgctac ttctcccgta agttcaacaa acaccagttg 4680 aattattcaa cgattgagaa ggaagctctt gctctattgt tggctctaca gtattttgaa 4740 gtgtatgtgg gttccagctc aactccagta gtgatttaca cagaccataa cccacttgtc 4800 tttctgtcac gcatgtataa tcagaaccag agactgatgc gctggtcact gattatacaa 4860 agctataacc ttgaggtccg acacaagaag gggaaagaga atattgtggc tgataccctg 4920 tctcgacctg acacgtaagt tttaaatttg acaattcagt tattgtggat ttgtttattt 4980 atgtttgaat gttcgttgtt aaaaatattt cttttacaaa cattatatgt ttgttcttta 5040 gtggggagg 5049 // ID ERV1-N3-I_DR repbase; DNA; ZEB; 4591 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, internal DE portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N3-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4591 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1600-1600 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 4591 BP; 1656 A; 901 C; 725 G; 1300 T; 9 other; aaatggtgac ccatctccgt gatctcccaa ctcaggcgag ttacgtcttt ccagcacaaa 60 atggatttat atcaaccaag aggtttggat cttctattct atcttcagct gctgtggtaa 120 gtcaattctt ctttgcctat tagaaatgtt gagggaaagc tggacgcacc ctatataaac 180 acattgatag aatgagtcgg aaatgctcaa aaatgttaga ccggaatgcg ttatccggtg 240 gggttgagaa ttagacaaaa ccactttgat tatcagggac atgatagcag cagtatagat 300 aaatgccatc ccttgctctg aaatctcttt gtctgctagg cagcaagagt tgaagcttaa 360 aattggaata acatcatcca aattatattc tgggactcag tggagagtca taaattagat 420 cctaaaagtt taaatcaaaa actatccatt gttttaaatc tactgtttta taaaaaagag 480 aaaaatggaa aaaaggaaaa caggaaaaaa tggtttaaga tttaaaaaca attgtaaact 540 tttaaattgg attaataaca acagtctgtt taaaggcatt tgcatgtgaa aaataaaggt 600 gagcagaaca gaatgatttg attgcaaaaa agacaaataa gctctaatga aaactctgcc 660 tttacaaatc agaaatttgc catttaaatt taatggtgat aaattcaaat atgatgctga 720 tcaactacac tgtaatcccg aataagttga caaaactcaa aaaaatttga ggcaatcagt 780 tatgtcaaat tttttaagtt atgaaattat ctattttaag tgaacagaaa cattaagttt 840 tgagyttgtt atactcatgc aagacacttc aactaactta aaatacccaa gttactcaac 900 ttatacattt taattgcaat taacttaatt attttaattt ttccaaactt taaatactta 960 gtcacccagc tcatacattt taataccaat taacttaatt gctttaattt ctttcaactt 1020 taaatactta aatcacccag ctgatagcat tttttaaaga ttacattttt tcaaactcaa 1080 aaataagaaa aagttctgaa tactcattaa ggttaattaa gataaacgaa ttttaaggat 1140 ttaagttaat agagctcaat tcaattaatt agttacagca ttagggttta cagtgtaaaa 1200 tatccactat actttaattt aatctcaaaa cagactaaat ataaaagaaa ttctaagatt 1260 ttaactttaa atcagccctt agacctcctg atggctcaaa ccttttcata ttcaaatgtc 1320 ctgacagaga gcagagaaat ctcacctcag tgataataag atgctcagaa tgtgtttgct 1380 gtatttgcat aaaacaatta acattcttcc atttggccac aagcctttgc tgaccctttt 1440 gttctccatg caacttacaa gcaaacacar ccctgatcac aagataaaag cattaagcta 1500 aaatgcactg ttgttctaaa aacactctcc agagaccaga tataattaaa ttttaatatt 1560 tactatttta atataatata attacattag gaataaacaa taatgaacat tatattataa 1620 tcgccaatca tggcactaaa gaccaagggt gaaacatctt tcaccttgag gaggttygct 1680 ttctttatgc ttaatgaaaa cactgacaga actactatag cataactgat tgtaaaccaa 1740 taaaatcctc gctgaaaaaa caccacctac aattaaaatt tgatcagaga rttaaactca 1800 ccagaaaaaa cataatgaaa gggggatgga tatgataaca ttcagagtta ccaaagtttt 1860 tatgggtcaa aagatctaag aatctgtgag tcacaccata aaatgacaac cgttaagcag 1920 agaacaccaa aagggtgaaa tgatttctgc caacagatct gactgttaat gagagcaaaa 1980 aatgaactat gggtatcggc ccattagatt gtgaactgaa attaatgaaa aatcattgat 2040 tgattcagaa aaatgtcccc agcggaactg ggagataatg ctcagaatac agcagactgt 2100 gtaaacaggt gttacacact cacaatatgc tgaaatcaac aacagcaatt cagatctgac 2160 cagcaacatc atgtggcttc cagaattaat attgcaagat gatgcattaa cattaattcc 2220 acatgattga atttgaaact gatgaaaatt ctaaaaatgg ttctctctgc ttttgttctc 2280 atcttaggcg gaccaataac aaagaacact ccagatccgg tccacatcaa acaaatacaa 2340 atgagggcaa caaaaggtta ggccataacc tgaggccaaa agtctcccat cttgcatctc 2400 aaagcaagaa cccctgtggt gctaacacct cacctgcagt gagtcctctg aaaaaagtct 2460 ttgatcagtc atcagaataa caattatcca cagctgatct tatcaatgga ttcagcattc 2520 taggaggcta tgtgatttca caaaacatct twcttcaacw aagatgtaaa ttattayatg 2580 ttaaatgaaa cagagacaya raaagatctc atagttaaaa tcttcccaaa ttgcttaata 2640 gcaatgggac taaaaccctg gttacctaga ggcgataaag aactttactg cattgttcct 2700 gcaaagacca atagcaacag actctattgt cacgaatcct ggtggcattt ttaaatttgt 2760 ttttatacta tacacatgtt aaaccacatg tcctgtatga aaatggaatt aactatgttg 2820 aaattcatga tatttgtttc tttgatctcc aaaagtagaa aactctgcta tcccctgtat 2880 agctgcagtt tgaactacaa acaggagtat agagtgtgat aattaaatta accatgaaaa 2940 tgcatttttt acaggatcca gctttgaaca ttgcacctta aaatgctccg taaaagatat 3000 tatatggaat caatttgagc cggaacacac ctcatgttca cattcttatg ctggctccct 3060 gttaaacagg actaaaattt tgcaacacaa ttataaaagt tgattaccat tgttcggagc 3120 attacactga cacatagcca caggtaaacc caggcacata gataacacct tagtcttcaa 3180 agacccagga gtcaaatctc tttagttaag atcttttgtc catgtttata cgaacaaaac 3240 acattagcct cacacacata gaacagaata attaatctga tgacagacga gacaacaaac 3300 accctctgac accctctgtc agctgaaaag aaattacaca tttgcatggc tttgacaaca 3360 aattgatggt ataaaacttc aatagaggcg aacttgccaa taaatttaag gatagataca 3420 ctgactgttg tggggatggt tgacaaacca gagggcccct ggttttccaa tctatcacca 3480 ataacaggta taagtgctaa agtgattaaa attaaaccga agtgaggaca cttctaataa 3540 tcactaggac atttaaattt gacaaatacc atcaattgtg ttcaagcccc cccacacgca 3600 cataaacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 3660 aacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 3720 cacacacaca cacacaccac acaacataca cacacaaaca cacacattat tatagatata 3780 ccttcataat tatggttaaa taatgtgatt actatcatgt atgacaaata tcttcacgtt 3840 tttttttttc attttgaaaa tgaatgtcat ttatttgaat gtattgtcat gatgatatgt 3900 tttatgatgc atactgcctg ttatactgtt ttaataagtt gcactgacct tttgatagct 3960 ttaccttgag taattcaaac caagaatatt taatattctc cctatcatta taccctatct 4020 cagtggtgac aaagacttat ctgtttttga ttggaccatt taaacagatt gaaagagacc 4080 tgagaccagt tgtacgactc gcacttgctg acttttatgt tttgatcata tattttatgc 4140 ttaacctcat gactttctcc aatgcagaat gacacaagtt cataactgtt caatcgctga 4200 gccatcaagt tgctgtttag gtgtccgaga gaaggcagtg tatacacact gtagaccccc 4260 acatgccgat gtctgcatat ctggaggtga ccagattcat cagaagaagc agactatgag 4320 aagcccacac gacgtggaca cttttgtcag gggaaggaca cagtggaagc tatgaaggcc 4380 tgtatggact gaatgagcag tgcattcggg tcaggctcat caaaagaggt gaaaacgagc 4440 ctggacctgt ttagctacct gctgaacacg acttacaact gataatgctg cagagactgt 4500 cttgaattgc ccctgaacga cctcaacaag gatacatagt gaccccatgg acacatttat 4560 atctgaccca cgggatcaaa tgggggttat c 4591 // ID BEL25-LTR_DR repbase; DNA; ZEB; 881 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL25-I_DR; KW BEL25-LTR_DR; BEL25_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-881 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1398-1398 (2008). XX DR Genome; chr20; Positions 46213342 46212462. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 881 BP; 246 A; 161 C; 164 G; 310 T; 0 other; tgttgcaaag ggctctttat tcacccttct taaggactgt ggactgttgc tgcccccttg 60 tggaaagact gagtatgttt agaatgttcc agaaagttta ccatacgtgt gtcagtcggt 120 aatagtctgc agagcatgga gcacatgaga aacgtcctcc ttttctagcg gtgttattaa 180 ttgtgtgcgt gcatctaaat gtaagtttct tcgttcctta atcatattta tcatgttgtt 240 taagggttat caccgtgaga tgtttagttt gattatttaa ttactttgtc ctgtaaagtc 300 accctaagag ccagttttat actacctaaa tgcatgatta acatctgtat atagtttatt 360 cttattgttt acatcaagtg gaagcatatt tgcctttcat tgagtttagt ttaaggttat 420 tagacgattt aacgaggttg ccgtgtaggc taagaaatgc atagtcatgc taacgcacgt 480 gatatactgc gtgacgtcag aattgcaagc ggtcgtgggt tcgatccccg gtcggaccaa 540 agttgagaaa gtgctataat aagctatata aagtgaagaa tagtacttct aaatattatc 600 taatgtcttt ctacctattg ttactgctca tttatttatt gtttaaacta atgtttacca 660 aatgtttatt gtatttgtat actgtggttg aatgttaatt attctttgtt tctacttgtc 720 tgtagaacca cacacacaca caccttcacc ccaactatat gtctgcttat actaatggaa 780 ataaacccaa gtgagacaat aagcatcttc gtcgcgtttt gatcagacgt accacagtga 840 ttaattcaac ttaaccacac tacactatca gctacagtac a 881 // ID TE-X-3_DR repbase; DNA; ZEB; 1023 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.11, Created) DT 17-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous transposable element from zebrafish - a consensus. XX KW Transposable Element; Nonautonomous; TE-X-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1023 RA Bao W. and Jurka J.; RT "Transposable elements from zebrafish."; RL Repbase Reports 8(11), 1811-1811 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1023 BP; 306 A; 199 C; 218 G; 299 T; 1 other; tatagacgaa ttaccggttt gtaaacattc aggatgcgcg cgcagcgagg ttgcagaaaa 60 aaaacgggag cgctagcatt tacagcgagg gaatgacatc cgatgcggta cagagtttgt 120 tgttgtatta gagataagtt atattgatta catattatat atattattta cataatgctt 180 tcagcgtatt ataacagctt gtcacccagt gataagcaca gttattctgc aaaattaacg 240 ttaaagtgag cggcgttcgt ctgacaggtc catttgcgat agcaccctcc caatggatat 300 tggataagac gaaatggcca agcatccagc cccaaatata caactatctc attgaaactc 360 caagtgattt tacaagagag aagttaaagg cctacaagtc tttggatgcc aacaattttg 420 ttctgtgtgg tcatgtgcaa gaaataatgt tccatgatta ccagattcaa aactatgtag 480 tgctaaaaac cgaggttctg cctagccaaa gacaaggaaa gaagacggag ctgtacaagg 540 cctgggtaat catcaacaag caaaacaact gcattctgac agcgaactgc acctgtacgg 600 cagggtatgt gaacttacat ttttacattt cattctaagc attcagtgtc cgcagtttaa 660 ttcaccatat ttaactgttt ttgctgaaat cattgctgca atcaaaaacg agtaatgtgt 720 atgattcagc atagcgtgaa cttacctgat ataaaatgag aactgcagag tccagcattt 780 ttaattaggt cctcactcca ttcagctctt mtgatggctg ttagccaaag acgtctgcgg 840 ttggcattaa aagcagtgtg gtgtacggta aaatttaagt tttctatttt tggactttcg 900 attttggcat cctaccgcac aacacgacat gtttgctatt gcaaccggtc tttttttgca 960 accggtatgc gcggttgaac ccgtgtgacg tcatgtgtga cgtaggttgg taattcccct 1020 ata 1023 // ID KibiDr1 repbase; DNA; ZEB; 5541 BP. XX AC . XX DT 19-FEB-2010 (Rel. 15.02, Created) DT 19-FEB-2010 (Rel. 15.02, Last updated, Version 2) XX DE Danio rerio retrotransposon KibiDr1 DNA, complete sequence. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; KibiDr1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5541 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol. Biol. Evol 21(2), 207-217 (2004). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 203..1504 FT /product="KibiDr1_1p" FT /translation="AGESLLSPVESMAAPGSGSFNELTRRHGVKIDSTASI FT EECGLAVGEEVGHDNILSASRMNGAVVVFVKTVDLANLLVQNGIVINGIFT FT PVLPLSTPSKKVTLSNVPPFIPNEVLTNLLKRYGKIVSPIKMIPIGTKSPL FT LKHVVSFRRYVYMVLQEHLDELDLSLNFRHEDFNYVIFATTNNVKCFNCGG FT YGHLIRGCPNKVDNKSNEATLPVATDESDARQENQTAAAQVVEEEAPGPSS FT GQTAAVLTEPGNIEERESVDRAVDEEDMIVTGDEMNEVENVSTAAEAVVSE FT LFDQREGAIAELDDNLFKTPQKRRREQCPLNERAKRGAGDEPGQTDTESER FT DFSECSVSFSLPLSGYSSKNYTAHEIKTFLKDTKNARNIRIEQYFPDVRQF FT IDKVRIYRNEGCFNKQEVFRLRKFIGKINEEMESSASNEN" FT CDS 1470..5456 FT /product="KibiDr1_2p" FT /note="apurinic endonuclease and reverse FT transcriptase." FT /translation="MKRWSPVLVMRTSRHLYAVFCLWNLLCFWLFLAMGEL FT NIASLNINGAREGKKRFYLFEMIKEKKIDVLFVQETHSDALNACDWAREYN FT GLPILSHLSSVSGGVAVLFSKNFIPCSYQVEEIINGRLLKIRAQFENSVFV FT FISVYVPTRAIERMCFLETLSNVLANCNTEDVLILGGDFNCTESSLDRNHV FT EPHMPSQRRLIQLIHFNDITDIWRNFNSYQRQYTWLHTYDNLLSLARLDRF FT YGYRHQLNLFKECVISPVGFSDHSLVKCSVTMGQFRPKSAYWHLNTNLLSD FT CTFQDIFKEFWSNFKSTKSTFCSIQQWWDFAKTQVKQLCQQYTINVTKDRK FT STIEVLENEIKAMQEKADASGDLGFIRNFKEKKNSLNNLLDLTAQGALVRS FT RFQNIESMDAPSKFFFNLERKNGQKRCIYNLLSETGILLSESSEIRKRAVN FT FYQDLYKSEIISDESDTSDFFNNLPQVPQDFNVELKKAVTLEELQKALQSM FT ECNKVPGIDGLPAEFYKTFWKEVGLDLLQVINESLAYGQLPLSCRRAVLTL FT LPKKGDLNDIKNWRPVSLLCTDYKLLSKVLANRLSDVLSHIIHPDQSYCVP FT GRRIFDNISFIRDILDIGKIFNLNFGLLSLDQEKAFDRVEHKYLWNTLTAF FT GFNNEFVSMLKALYNDVESVLKFNGNLCAPFKVFRGIRQGCALSGMLYTLA FT IEPLLIKIRAKLSGLSLPSCTNCFKVSAYADDIVVLINGQIDVTLLLELMN FT DFKKISATKINWSKSEALLVGNWHNAEPKLPEGLTWTKQGFKYLGVFLGDE FT LIMQKNFEDIIEKVRGRLERWKFLLSKISYKGRVLIINNLIASALWHRLIC FT VDPPSDLLIKLQSLLIDFFWDKMHWVQKAILYLPKEEGGHHLMHIQSRVAA FT FRIQFIQRLLTGPVDSSWKIAACAILRGFRNLGLEKSLFWTDPQKMDWSKL FT PIFYRNMFKVWSLFKVQRHGNASSLFWLLQEPLIFGSRMDLSNQGILPSVN FT EILLNAGVVTVGHLFKLAGPAFRNVEPVANRLGLRSKRIVAKLLEKWKSFF FT TPEEANMLENYADGLIIPNCNDPFPDLYLSTDFKECEGVFLQNNVLFLMGT FT NSEFGKNVYNSCVKLLNRKSLNDKVDTPWRSVLHLKENAKPEWRALYKPPL FT PKKVGDLQWRILHGAIAVNAFVSIINPGNSDECPFCSQRETLYHAFMLCNR FT LSSFFQVLSNLFVCFGETFSMETFICGYKYTRKRRFFCQLLNFLLGQAKMA FT IYDTRKIQIEQNSSGCLKTFFFNLVKSRILIDFQYYKTMGDLMCFEKIWCY FT KGVLCQICDENLEFMVH" XX SQ Sequence 5541 BP; 1699 A; 817 C; 1187 G; 1838 T; 0 other; gtgcgcatgc gctggttgag tggactgagt gtgtgggcga gagcgtgtgg ctgccgtatg 60 aggtacgagt ggtctatact tctctatctt tacctgctgc tttccttttt ttttgttagc 120 gttaattata gtttttgttt ccttacattg ttagtaagta ttattttatg tttttggatt 180 agagaagtga gtgttggttt gagccgggga gtccctgctc tcacctgttg agagtatggc 240 tgccccaggt agtggaagtt ttaatgagtt gacaagacgt cacggagtga aaatagactc 300 aactgctagt attgaagaat gtggtttagc agtgggggaa gaagtcgggc atgataatat 360 tttatcagct tcccgaatga atggcgctgt tgttgttttt gtgaaaactg ttgatctagc 420 taatttgtta gttcaaaatg gaatagtgat taacggtatt tttacaccgg tgcttccttt 480 gtctacaccc tcaaaaaaag taacgttgtc gaatgtgcca ccttttatcc ctaacgaagt 540 gttgacaaac ttacttaagc gttatgggaa aatagtttca cctattaaaa tgatcccgat 600 cggaactaaa tcgccgctcc taaaacatgt tgtgtcattt agacggtacg tgtacatggt 660 tttgcaagaa catcttgatg aacttgactt gtccctaaat ttccgacatg aggactttaa 720 ttatgttatt ttcgccacaa caaataacgt taagtgtttt aactgtggcg gttacgggca 780 tttaattcgc ggatgtccca ataaggtgga caataaatca aatgaggcaa ctttaccggt 840 tgcgactgat gagagtgacg cgcgtcagga aaatcagacc gctgcggcgc aggttgtaga 900 agaggaagca cctggaccaa gctcaggtca gacagctgct gttttgactg agcctgggaa 960 tattgaagaa agggaatcag tggacagagc tgtggacgaa gaagatatga ttgtcacggg 1020 tgatgaaatg aatgaagttg agaacgtttc aaccgctgcg gaagctgtgg tgagtgaatt 1080 gtttgatcag cgggaaggtg cgatagcaga gttggatgac aatttattta aaacacctca 1140 gaaacgtagg cgggaacagt gtcctttaaa tgagcgagca aagagaggtg ctggtgatga 1200 accgggtcaa acagacacgg aaagcgaaag ggatttctct gaatgtagcg tcagtttcag 1260 tttaccattg agcggctatt ctagcaagaa ctatactgca cacgaaatta aaacttttct 1320 taaagacacc aagaatgcga gaaatattcg tattgaacag tattttcctg acgtgagaca 1380 attcattgac aaagtgagga tttatagaaa cgaaggatgc ttcaataagc aggaagtatt 1440 tcgtcttagg aaatttattg gcaagataaa tgaagagatg gagtccagtg ctagtaatga 1500 gaactagtcg gcacctttat gcagtatttt gtttgtggaa cttgctttgc ttttggcttt 1560 tcttagctat gggggaatta aacattgctt ccttaaatat aaatggagct agggagggaa 1620 aaaagcgttt ttatcttttt gaaatgatta aggagaaaaa aattgatgtc ttatttgttc 1680 aagaaaccca cagtgatgcc ctgaatgctt gtgattgggc cagagaatat aatggactac 1740 ctatactaag tcatttaagc tctgttagtg gtggtgttgc tgttttattt tctaagaatt 1800 ttattccatg ctcttatcaa gtggaagaaa ttataaatgg tagattatta aaaataagag 1860 ctcaatttga gaattctgtt tttgttttta tttctgtata tgttcctacc agagccatag 1920 aaagaatgtg ttttttagaa actttaagta atgttttggc aaattgcaat actgaagatg 1980 ttttaatttt aggaggggat tttaactgca ctgagagttc tttagataga aatcatgtag 2040 aaccccatat gccatcacaa agaaggctta tacagttgat acattttaat gatattactg 2100 atatatggag gaattttaat tcttatcaaa gacagtatac ttggctacac acttatgata 2160 atttattatc tttagccagg ttagacagat tttatggata taggcaccaa cttaatttat 2220 ttaaagaatg tgttatttct cctgttggct tctccgatca tagcttagta aaatgttctg 2280 ttactatggg ccagtttagg ccaaaaagtg cttactggca tttaaatact aatttactaa 2340 gtgactgtac atttcaagat attttcaagg agttttggag taactttaag tctacaaaat 2400 caaccttttg ctccatacaa cagtggtggg attttgcaaa aacacaagtc aaacagttgt 2460 gccagcaata caccattaat gttacaaagg acaggaaaag tacaattgaa gtattagaga 2520 atgaaattaa agccatgcaa gaaaaagcag atgcttcagg agacttgggt tttataagga 2580 attttaaaga aaaaaagaat tctctaaaca atttacttga cctgactgct caaggagctc 2640 ttgtcaggtc aagatttcag aatatcgagt cgatggatgc accatccaaa tttttcttta 2700 atctagaaag aaagaatggg caaaaaagat gtatatataa cctgctctct gagaccggaa 2760 ttttgctgtc agaatcttct gaaataagaa aaagggcagt taatttttat caagaccttt 2820 ataagagtga gatcatttct gatgaatcag atacaagtga tttttttaac aatttacccc 2880 aagttcctca agattttaat gttgagttaa agaaggctgt cacactagaa gaattacaaa 2940 aagcacttca aagtatggaa tgtaataaag tccctggcat tgacggctta ccagccgaat 3000 tttataagac tttttggaaa gaagtcggtt tggacttatt gcaagttata aatgaaagtt 3060 tggcatatgg acaattacct ttaagttgta gaagggcagt gcttacttta ttacccaaga 3120 aaggtgatct taatgatatt aaaaattgga ggcctgttag cctgttatgt actgactata 3180 aattactctc taaagtttta gctaacaggc taagtgatgt ccttagccat atcattcatc 3240 cggaccagtc atattgtgtt ccaggaaggc gaatatttga taatatctct tttatcagag 3300 acattttaga tattggaaaa atttttaatt tgaattttgg ccttctttct cttgatcaag 3360 aaaaagcttt cgatcgagtt gagcacaaat atttgtggaa cactttaaca gcctttggtt 3420 ttaacaatga gtttgtatct atgttaaaag ctttatataa cgatgtggaa agtgtcttga 3480 aatttaacgg gaatttatgt gcccctttta aagtttttag gggcattaga caaggatgtg 3540 ccttatcggg tatgttgtat actttagcta tagagcctct gctaattaaa ataagagcga 3600 aattaagtgg gttatctcta ccaagttgta caaattgttt taaagtgtct gcatatgctg 3660 atgatattgt tgttttaatt aatgggcaga ttgatgttac cttgttactg gaattaatga 3720 atgattttaa aaagatatct gccacaaaaa tcaattggtc aaaaagcgaa gccttattag 3780 ttggaaattg gcacaacgca gagccaaaac ttccagaagg actaacttgg actaagcaag 3840 gttttaagta tttgggagtg tttttgggag atgagctgat catgcagaaa aattttgaag 3900 acattattga aaaagtcaga ggtcgacttg aaagatggaa atttttactt tctaaaattt 3960 cctataaagg acgagtgttg atcataaaca atctaatcgc ttctgctctg tggcatcgat 4020 taatttgtgt ggaccctcca agtgacttat taataaaatt gcagtcactt ttgatagact 4080 ttttttggga caagatgcat tgggtacaaa aagctatttt gtacttaccg aaagaagaag 4140 gaggacatca tttaatgcat atccagagca gagttgcagc tttcaggatt cagttcattc 4200 aaagacttct gactggtccg gtggactcta gttggaaaat tgctgcatgt gctattttgc 4260 gagggtttag aaatttgggc cttgagaaat ctttattttg gactgaccca caaaagatgg 4320 actggtctaa attgcccatt ttttatcgaa acatgtttaa agtttggtct ctatttaaag 4380 tacagaggca tggcaatgca tcatctttat tttggttgct ccaagaacct ttgatttttg 4440 gatctcgaat ggacttatcg aatcagggga ttctcccttc tgttaacgag attcttctga 4500 atgcgggagt tgtaactgtt ggacacttgt ttaaactggc tggccctgct tttaggaatg 4560 ttgaacctgt tgcgaatcgt ttgggtttaa ggtccaaacg aatagtagca aagttacttg 4620 aaaagtggaa atcctttttt acccctgaag aagctaacat gttggaaaat tatgctgatg 4680 gtttaatcat tccaaattgc aatgatcctt ttcctgattt atatttatca actgatttta 4740 aggagtgtga aggtgttttt ttgcaaaata atgtgttgtt tttaatggga accaattctg 4800 agtttggaaa aaatgtttat aatagttgtg tgaaattgct aaataggaag tcactgaatg 4860 ataaggttga tactccatgg cgctctgttc tccatttaaa agagaatgct aaaccagagt 4920 ggagagctct gtataaaccc cccttaccaa aaaaagtggg ggatttgcaa tggagaatct 4980 tgcatggtgc catagctgtg aatgcgtttg tttcaattat aaatccaggt aatagtgatg 5040 aatgtccatt ttgttctcaa cgtgaaaccc tatatcatgc ttttatgtta tgtaacaggc 5100 ttagttcctt ttttcaggtc ttatctaact tgtttgtttg ttttggcgag acattttcta 5160 tggaaacgtt tatttgtggt tataagtata cacgtaaacg acgttttttt tgtcagttat 5220 taaattttct tttgggtcaa gcaaaaatgg caatttatga cactagaaaa atacaaattg 5280 agcaaaactc aagtggctgt ttaaaaacat ttttttttaa cttggtgaaa tccagaattt 5340 taattgattt ccagtattac aaaacaatgg gtgatcttat gtgttttgaa aaaatttggt 5400 gttacaaagg agttttatgt caaatttgtg atgaaaattt agagtttatg gttcattagg 5460 ctgtttctaa tttttatgtg taacatttga ttgtataacg tctttaaaat gtttattaaa 5520 ggcgtgttga aaattcaaaa t 5541 // ID BEL31-I_DR repbase; DNA; ZEB; 6749 BP. XX AC chr22; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL31-I_DR; KW BEL31-LTR_DR; BEL31_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6749 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1405-1405 (2008). XX DR Genome; chr22; Positions 3521005 3527753. XX CC Positions [5754-6314] - Integrase core CC 'TTAGT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 630..6701 FT /product="BEL31-I_DR_1p" FT /translation="MADQEDCDIDAVELQLQVSPEDENCETSAQEPSTNEA FT AKEESLRKSQRVRKFTEKGQALHDEKVNKFQTHFKACYDEWKSAAKQAKKA FT LENPVSVTLHDHIAHVQYASSEVKQTYDKLRSYVTPDGDTRRRVDTCDAVS FT QKLIKLAEDWLVESGSVKDVAKGSLRWSEAGSVFISAASQKSKSAGSTYSS FT SSRLSVKKQEAAAELAATEAALKVMEEMDRERNELENLEAENRQRLAIQEA FT ENAEKLKALEQKRRELERLETVKKMNAAKARLKVYEEEINSDEEISDLLHD FT NVKQRKAAPKPGSLPEPLLTPSKPTVKLHQFSALPQTTRCQLPNNATQEDS FT TTALARALAESINISRLPVPEPAVFNGDPLKYKDWKMSFQTLIDRKNIPVN FT EKIYYLRKYVGGPARRAVESYFLLGTDEAYYAAWDTLEERYGGSFVIAKAF FT RDKLNSWPKIGTKDSSELREFSDFLRGCEAAMLQIKNLDVLNDCSENQKIL FT SKLPDWLTARWNRKVIEIEEQHGSFPTFSNFVDFITREAKIACNPVTSLHA FT LKGSDLEKVKPTKTRSIGTKVLASAAEEKPNIQKCVFCDRLYHSIHTCRQF FT MDKSIMERVKFVQTKGLCFGCLNPGHHSKKCGKRSVCDTCKGKHPTCLHEE FT RDKQIRKNKEGIKDTKENKQSKEIPCETISHRVVQNNSSNLTSTISPVWLS FT TTNNPEHEILVYALLDSQSDTTFILQEKADGLDADKKHVQLKLSTLSTRDS FT VIQSERLSGLQVRGFYSSKRILLPVTYTREFIPANLNHISTPKTARAWPHL FT EHLAEEIAPPIECDVGLLIGYNCSQALLPREVVSGKDNEPFAQRTDLGWTI FT VGGADPCVDYGDAMGSSHRIIVKEVTPKLSSDQLPNEVHYICRTQVKEIIT FT PPDVIKVLESDFSERKVKESHFSQEDLRFLSIMEEGVRITADGHCELPLPF FT KEDRPCLANNKSCAEHRLQCLKKRLERDKQYCNDYTTFMNETIERGDAKRV FT PLEELDKSSTWYLPHHGVYHPQKPGKIRVVFDCSAKFKGVSLNDHLLTGPE FT LTNTLIGVLCRFRKGPVAVMCDIERMFHQFRVRDEDQDYLRFLWWDNGDFH FT LSPSVYRMRVHLFGAASSPACANFGLKYIAAQGRGQFSEATIRFIERNFYV FT DDGLISFHSEEEAIRLVHEAKKLCSIGKLRLHKFVSNSPQVLESLPKEDCV FT ETVRNQDLALGEQLIERALGVKWSVASDHFQFRVVVSERPFSRRGVLSTVA FT SIYDPLGFVAPFVLLGKQILQQMCRDKSDWDTPLSNEHQSRWESWLTDLKD FT LAHVKIDRCYLPADFQEVQRYELHHFSDASVAGYGVCTYLRAVSRSGRIHC FT SLVIAKSRVAPTKVTTVPRLELSAAVVAVRISDMLKGELELENVQEFFWTD FT SQVVLGYINNEARRFHVFVANRIQRIKESTKPTQWKYVASEDNPADHASRG FT LKSKELIASNWFSGPSFLWEDSLPTGEIKVGDLDTDDPEVRKVLVHHTLTT FT ENSLAERFSKFSSWTKLVKAIARLIRFVKELKGSIKRTNKATSLEERQEAE FT HVVIAIEQGLVFSKEIKELHSKKELITKDRASRLYRLNPFLDHKGILRVGG FT RLEHATLHPNVKHPFILPKTSHITKLLIDHYHQKVKHQGRGMTTNELRSNG FT IWILGCSQAVSSFIYNCVKCRKFRRQTEEQKMANLPSERMETTPPFTYCGM FT DCFGPFYIKEGRRELKRYGLLFTCLCSRAVHIELLDDLSTDAFLNSLRAFI FT ALRGNVRQLQSDQGTNFVGARREFLEAVKEMDQECLKQLGCEFVMNPPSAS FT HMGGAWERQIRTIRSVLNSILDQSSRTLDSSSLRTYLYEVMAIVNSRPITT FT QLLNDPTGPQPLTPNLLLTMKSSIILPPPGDFVKEDLYLRKRWRRVQYLAN FT EFWQRWKKEYLLSLQQRQKWHKTNRNAKVNDIVIIQDDTTQRNDWKLAKVT FT AVYPSQDGCIRKVQLLISDSTLDNHGRKLCKPVHLVRPIHKTVTLLEAD" XX SQ Sequence 6749 BP; 2272 A; 1309 C; 1505 G; 1663 T; 0 other; gtcagaactc agcctgtttc gcgctggaag agaggagagg aggagaggag agcagagcgg 60 tgccggtcgc agcagcggag ctgtaaacga gcagctcgtt aggcggtgac gtcgttctca 120 agcatcaaag gaaaaaagga ctgcaagcgc gcaaatccca catcaacggc tgtcggtgta 180 taagagtgca gcgtggttaa gctaagaaga agtaacacat ttaaaacaag ttaaatgtga 240 agcagattaa ccttgttaat ttaaagggac tttattaagt gtttacaggg gtaaagtgca 300 gtggtaaaaa atttgagact aagctgaagt tattgtttgt ttaatataag gtgactgtta 360 gctacaagta ttttattatt catttaagga tacattaaga tatatttaaa gggaagtcat 420 tgctttttca ttttttatat tttatatttt gtaacaagtg agaaaaattt aaaatttata 480 atttggctta ttaggtgttt atgacacaca tttaagttta tctttaaata gttgagaaat 540 ttactgctat aagcacatta aagacagtga tatcacttag ttgagatata gttataagta 600 acaagtttaa agcacactat aatagtggta tggcagatca agaagactgt gatattgatg 660 ctgtggagtt gcagttacaa gtcagtcctg aagatgaaaa ttgtgaaaca agcgcacaag 720 agccaagcac aaatgaagct gctaaagaag agagcttgag gaaaagccaa agagttagga 780 aattcactga aaagggtcaa gcacttcacg atgaaaaggt gaataagttt caaactcact 840 ttaaggcatg ctatgatgaa tggaaatctg ctgctaaaca agctaaaaag gcacttgaaa 900 accctgttag tgtaacattg catgatcaca ttgcccacgt ccaatatgcc tcatcagaag 960 taaaacaaac ctatgataag ctgcgcagtt atgtaactcc tgatggagac acccgtcgaa 1020 gagtggatac atgcgatgct gtttctcaaa agcttataaa actagctgaa gattggcttg 1080 tagaaagtgg aagtgtaaag gatgtagcaa aggggtcact acgctggagt gaagctggtt 1140 cagtgttcat ctcagcagcg tctcaaaaat ctaagagtgc aggaagcaca tattcaagtt 1200 caagcagatt atctgtaaag aaacaggaag cagcagctga attagctgca actgaagctg 1260 ctcttaaagt tatggaggaa atggatcgtg aaagaaacga acttgaaaat ctagaagcag 1320 aaaaccggca aaggttagca atacaagaag cagaaaatgc tgaaaaacta aaagcattag 1380 aacagaagcg aagagagcta gaacgtctag aaactgtaaa gaaaatgaat gctgccaagg 1440 cacgactgaa agtgtacgag gaagaaatta attcagatga agaaatctca gacctcctac 1500 atgacaatgt taaacaaagg aaagctgctc ctaagccagg aagtttgcct gaacctctgc 1560 taacaccatc aaaacctact gtaaaactgc atcagttttc cgctttacca cagaccacaa 1620 ggtgtcaact tccgaacaat gctacacaag aagatagcac tactgctctg gctagagcat 1680 tagcagaatc cattaatata agccgtctac cagtaccaga gcctgctgta ttcaatggag 1740 acccactcaa gtacaaagac tggaaaatgt cttttcagac actgatagat cggaaaaaca 1800 tcccagtaaa tgaaaagatt tactatctcc gcaagtatgt gggtggacca gccagaagag 1860 ctgttgagag ttactttcta ctgggcacgg atgaagcata ttatgcggca tgggacactc 1920 ttgaagaaag atatggaggt tcatttgtga tcgccaaagc cttcagagat aagctaaatt 1980 catggccaaa aatcggaaca aaagacagtt ctgagttacg agagttttct gattttctca 2040 gaggctgtga agctgctatg ctacaaatca agaatcttga tgtcttaaat gattgcagtg 2100 aaaatcagaa aatactaagt aagcttccag attggttaac agcaagatgg aatagaaaag 2160 tgattgagat tgaagagcag cacggttcct ttccaacatt tagtaacttt gtggacttta 2220 tcacaagaga agccaagatt gcgtgcaacc cagtaacttc actacatgca ctcaagggaa 2280 gtgacttaga aaaagttaag cctacaaaga ctcgaagtat cggaactaag gtgctggcaa 2340 gtgctgcaga ggagaaacca aatatacaga aatgtgtttt ctgtgacaga ttgtaccata 2400 gcattcacac atgtcgacag tttatggaca agtcaataat ggagcgagtc aagttcgtgc 2460 aaacaaaggg attatgtttt gggtgcctta accctggaca ccattcaaag aaatgtggaa 2520 agagaagtgt ttgtgacaca tgcaaaggga aacacccaac ctgtttacac gaggaacggg 2580 ataaacaaat cagaaagaat aaagaaggca taaaagacac aaaggaaaac aaacaaagca 2640 aagaaatacc atgtgagaca atatcacacc gtgtagttca aaataatagt agtaacctca 2700 catctacaat cagtccagtt tggttatcta caacaaacaa cccagaacat gaaatcctgg 2760 tgtatgctct tctagatagt cagagtgaca caacattcat tttacaagaa aaggcagatg 2820 gtctggatgc tgacaagaag catgtacagt taaagctttc tacactgtca accagagatt 2880 cagttatcca aagtgaaagg ctgtctgggc tgcaagtcag aggcttctat tcctcaaaga 2940 gaattcttct tcccgtaaca tacacaagag aatttattcc tgcaaatcta aatcacatat 3000 ccactcctaa gacagcaaga gcatggcccc acttagagca ccttgccgaa gagatagccc 3060 ctcctataga atgcgatgtt gggttactca taggttacaa ttgctcacaa gcgcttttac 3120 ctagagaagt tgtgtcagga aaagacaatg agccattcgc tcaaagaaca gacttgggtt 3180 ggacaatagt cggtggagct gacccttgtg ttgactatgg tgatgctatg ggaagcagcc 3240 acaggattat tgttaaggaa gtgacaccca agttgtcatc agatcagttg ccaaatgaag 3300 tccattacat ttgtcgcaca caagtcaaag aaataatcac accaccagac gtgattaaag 3360 ttcttgaatc tgatttcagt gaaagaaagg tgaaagagtc tcacttctcc caagaggatc 3420 tacgattttt gtctataatg gaagaaggag taaggataac agcagacgga cattgtgagc 3480 taccattgcc atttaaagag gacaggccgt gtctggcaaa caataaaagc tgtgcagaac 3540 accgactcca atgtcttaag aaaagacttg aaagagacaa acaatattgc aacgactata 3600 caacattcat gaatgaaact atagagcgtg gagatgcaaa aagagttcca cttgaagaac 3660 ttgacaaaag ctctacctgg taccttccgc atcatggggt gtaccacccc caaaaaccag 3720 ggaaaataag agtagtgttt gattgttcgg ccaaatttaa aggtgtgtca ttaaatgatc 3780 atctgctcac agggccagag ctaaccaata ccttgatagg tgtcttgtgt cggtttcgca 3840 agggtccagt ggcagtgatg tgtgacatag aacgtatgtt tcaccagttt agagtcaggg 3900 acgaagatca agactacctc cgtttcctgt ggtgggacaa tggagatttt cacttaagtc 3960 catcagtcta ccgaatgaga gttcatttgt tcggagctgc ttcgtcaccg gcttgtgcaa 4020 actttggtct caagtacatt gcagcacaag gtcgaggcca gttcagtgaa gcaactataa 4080 gattcattga aaggaacttt tatgtggacg atggccttat cagtttccac tccgaagaag 4140 aggctattcg tttagtccat gaagcgaaaa aactctgcag tattggaaaa ttgcgactgc 4200 ataagtttgt ttctaacagc ccacaagttc ttgaatctct ccctaaagaa gattgtgtag 4260 agacagtaag aaaccaagat ctagcactgg gcgagcaact gattgaaaga gctctagggg 4320 ttaaatggag tgttgcttca gatcatttcc agtttcgagt agtagtgagt gagcgtccat 4380 tctccagacg aggagtcttg tcaactgtag cttccatcta cgacccactt ggatttgtgg 4440 cgcctttcgt tctgcttggc aagcaaattc tacagcaaat gtgtcgtgac aagtcagact 4500 gggacacacc actctcaaat gagcatcagt cacggtggga gtcctggctt acagatctaa 4560 aggaccttgc gcatgtcaag attgatcgct gctatctacc agcagatttt caagaagttc 4620 agagatatga actccaccat ttttcagacg caagtgttgc aggatatggt gtctgtacgt 4680 atctaagagc agtcagtaga tcaggacgaa tccactgttc acttgtcatt gcaaagtcca 4740 gagtagcacc caccaaggta actactgtac caagattgga gctgtccgca gcagtagtag 4800 cggtgcgcat aagtgatatg ctcaaaggag aactagagtt ggaaaatgtt caagagttct 4860 tctggacaga ctctcaggtt gtcctgggat atataaacaa tgaagcgagg cgttttcatg 4920 ttttcgtggc aaaccgtatc caacgaatca aggaaagtac taaaccaaca cagtggaagt 4980 atgttgcatc agaagataac cctgctgacc atgcgtctcg aggtctcaaa tctaaagaac 5040 tcattgcttc caactggttt agtggcccaa gttttctttg ggaagattca cttcctactg 5100 gagagattaa ggtgggagat cttgacacag atgacccaga ggtacgcaaa gttcttgtac 5160 accacacctt gacaacagag aattctctgg cagaacgttt cagcaaattc tccagctgga 5220 caaaattagt taaagctatt gcaagactaa tcagatttgt aaaggagttg aagggctcaa 5280 taaagcgaac caataaagct acaagtctcg aagaaaggca agaagctgaa catgtagtta 5340 ttgccattga acaaggatta gtcttttcca aagaaatcaa ggaactccac tccaagaagg 5400 agttaattac taaagacaga gcaagcagat tgtatcgact caatcccttc ttggaccaca 5460 agggcatcct cagagtggga ggaagattag agcatgcaac tttacaccca aatgtcaagc 5520 atccatttat cttaccaaaa accagtcata tcaccaagtt actgattgat cactatcatc 5580 agaaagtgaa acaccaaggg cgtggcatga ccacaaacga actgagatct aatggcatat 5640 ggattctcgg ctgtagtcaa gcagtatcat catttatcta caactgtgtc aaatgcagaa 5700 aattcaggcg gcaaacagaa gaacaaaaga tggcaaacct gcctagcgaa agaatggaga 5760 caactcctcc cttcacatat tgtgggatgg attgtttcgg tccattttac atcaaggagg 5820 gaagaaggga actaaagcgc tatgggctgc tgttcacatg tttgtgctct cgtgcagtac 5880 atatagagtt acttgatgat ttatcaactg acgcattcct taactcactt cgggcattca 5940 ttgctttacg tggaaacgtt cgtcagctgc aatcagacca aggcaccaat tttgttggag 6000 caaggcgtga gtttttggaa gcagtgaaag aaatggacca ggaatgtctg aaacaactag 6060 gctgtgagtt tgtcatgaac ccaccctctg ccagccatat gggtggagcc tgggaacggc 6120 aaatccgaac aatcagaagt gtgttgaatt ccatcctgga tcaatcatct agaactctgg 6180 acagttcatc cttaagaaca tatctatatg aagtgatggc aatcgtaaat agccgaccca 6240 taacaacaca attactcaac gatcctactg gaccacaacc acttacaccc aaccttcttt 6300 tgacaatgaa atcgtcaata atactgcctc caccaggaga ttttgtaaag gaagaccttt 6360 accttcgtaa aagatggcgt cgagtgcagt atctggctaa cgaattttgg caaagatgga 6420 aaaaggaata tttgttgagc ctacaacaaa gacaaaagtg gcacaaaacg aacagaaatg 6480 caaaagtcaa tgacatcgtc ataatacaag atgatacaac gcagagaaat gattggaagc 6540 tggcaaaagt tactgctgtg tacccaagtc aagatggatg catacgaaaa gtgcagctac 6600 taatcagtga ctccacatta gataaccatg gaaggaaact gtgtaaacct gtccatctag 6660 taaggccaat ccacaagact gtcacattac tggaagctga ttagcacagt tcacatttca 6720 ttctggaaat cacaagtgat tggtgggag 6749 // ID BEL31-LTR_DR repbase; DNA; ZEB; 711 BP. XX AC chr22; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL31-I_DR; KW BEL31-LTR_DR; BEL31_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-711 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1406-1406 (2008). XX DR Genome; chr22; Positions 3520294 3521004. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 711 BP; 228 A; 92 C; 142 G; 249 T; 0 other; tgtagctgcc gtcagaagca atgcttatat attacactct ataaacataa actttacatt 60 aaaaagtcaa taaatagata aaatgttcaa attaatcatt agaataaata agtaatgtat 120 atttgttgtt aatgcatgtt aattcattta aagttattta actaaaacta taagggggag 180 gggggaacct tttcttgtta tgtttagctt gggaattatt tcattgttgc aatgaagata 240 ttcttccgat ccactacaag tagttcagcc acaaggtgct gtgctaaaac aagtttttaa 300 ggtttaagaa gtgcaaagaa gtttgttaaa gagaaactgc aacccttcaa gtttattcct 360 cttgagttag agaggccaat gaggtatgtg ttgtatggtg ttaacgtatg tgaacgtgta 420 tgattatgtg tccgtatgta atgtataatt agccaattaa gtgtatttcg ttacatgttg 480 aatgtgcact ttattttttg tttgtatgta tgttgtttat tacgtgcaac atcaccttta 540 aagggttaat tgaagtttat gtatttcttt tcgtagttct gacggcaata ttgggacata 600 aaggagtgca aggctagaat gtacaagaca tactacagac tcgtgtccga ataaatgtca 660 gttaaaataa agaacgactt gagcctttgt ttcatctcga ggggagtaac a 711 // ID Gypsy42-LTR_DR repbase; DNA; ZEB; 888 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy42_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy42-I_DR; KW Gypsy42-LTR_DR; Gypsy42_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-888 RA Dib M.R. and Naveira H.F.; RT "Gypsy42_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 815-815 (2007). XX DR [1] (Consensus) XX CC Gypsy42-LTR_DR is a long terminal repeat of the Gypsy42_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy42-I_DR. XX SQ Sequence 888 BP; 180 A; 167 C; 170 G; 371 T; 0 other; tgttacgttt cacgtatgtt ttgtagtatt ttttgttgtt ttctgctctg tgcgctcttc 60 tctctctctc tctctctctc tctctctctc tcttgttgtg gtcctatcct gtttcaggtg 120 tgctaccatt ggtcgggtga gctgtcattc accttcgtca ctaaatcggc gcgccaataa 180 ggatcgggtc cgcgcagtat aaaaccggtg cgcttgcgct gttctggagg agcgcttggc 240 ctgccaatct cctcacttgt gtcggccgga ctgtttgttt gattgatccg cttgatttgt 300 ttgcgagtgc tattttctag ttttctacct gtttattctg atttatcagt taaacttgtg 360 ctgctcttgt gaatgtttag tgagattttg aacgatctgt atgagaattg gattacctga 420 gatcttgttt tgctaatggc taacagtgtg tactttacga gaataatagc atttcgagca 480 gcttagtaaa attttgtatt agttagtttt tggaggcgtt gccacatgtg tacttgtgtt 540 tggaagttaa gaagtttagt taagctacgt gctgaagatt acggttttgt ttctacattt 600 ttctttgttc agtcagttta gtgggttggg gtttagttag ttcgttttgt tatatttatt 660 tctttgtttt ggcaacactc ctatttttct ttagttgatt caattattta caaaattaca 720 tttatcttct atcttacatt cattgtttga aattatccat taataaatac ttactttttg 780 tttcacttat caagcatatg cgtcttctcc tcattcaccc agcatcaata gactcactga 840 ttgttacgta tcatcccttt tattatttaa catcataaac acgtaaca 888 // ID Gypsy-31-I_DR repbase; DNA; ZEB; 4522 BP. XX AC . XX DT 26-APR-2007 (Rel. 12.05, Created) DT 17-MAY-2007 (Rel. 12.05, Last updated, Version 1) XX DE An internal portion of the Gypsy-31_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-31-LTR_DR; KW Gypsy-31_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase; Gypsy-31-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4522 RA Dib M.R. and Naveira H.F.; RT "Gypsy-31_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(5), 154-154 (2007). XX DR [1] (Consensus) XX CC Gypsy-31-I_DR is an internal portion of the Gypsy-31_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-31-LTR_DR. CC Gypsy-31_DR is characterized by 4-bp target site duplications. CC The internal portion encodes one polyprotein the 1074-aa gag CC Gypsy31_DR1p (pos. 673-3894) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC nine independent insertions bearing at least 85% Homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 673..3894 FT /product="Gypsy-31-I_DR_1p" FT /note="ORF." FT /translation="MDVDESLNYDELKSAVLRKYDISRETYRQRFRSLEVE FT HLESPKELYVRLKELYGKWMQPKSKSIEEVGEVIILEQYLRMLSPELQVWV FT REHDPQSASQAATLADVFVAARQRNQSWWWKSSRDDRRANPPQQSMRNFPG FT AGKPPGGGPAFVNAPKNYRKMPICYLCNQEGHTKPMCPRNVIKNTHLCYVP FT RHLAKPAPKAERSPASTTVEINGKEMIALVDTGSDQTLVHMKCISPALLRY FT TNLKAVRCVHGDEKLLPTAEVYLKVRGQTYFLEVGVADNLPFPVVLGHDLP FT VLWDLLQPPSTCNMVVTRAQSHKRDESKELLATLPFFDAEIDGSTKPRKSR FT RQKRFEKIQYNASVMPVDKFAEFSPKFKLPVNIVQMQQEDASLVPYLERAK FT QEDGKTVDGETKERFCMYQGLLYRQMGQVTQLVVPQCVREVILTLGHSIPW FT AGHLGKRKTIARIRKHFYWPGLNSEVAQYCKTCPECQKVSSDCPRRVPLQP FT LPLIGTPFERLGMDIVGPVEKSRSGNRFMLVVTDYATRYPEVFPLRSVKAK FT YVATCLVQLFSRVGFPSEILTDQGTNFMSNLLKQVYRLLGIKSLRTTPYHP FT QTDGLTERFNQTLKQMLRKFVGESGTDWDQWLPYLLFAYREVPQASTGFSP FT FELLYGHEVKGPLALLREVWEGNPQRNDETNVASYVIQMRERLQKMSALAQ FT SHLAEAQTRQKTWYDQSARERQFKPGQKVLVMLPSHESKLLAKWQGPYEIK FT RKLGPTTYEVALSDHAHSTRTLHINLLKEWFPRPPESSHSLMIRQVKEEDD FT SEVFLPQSALGDVDLSHLSPQQQHEVREICLPEVFSEYPGFTTLIEHNIEL FT KPDAVVRRMSYRVPERLQEALKEEVDLMLRLGIIEPSKSEWCHPVVLVPKK FT DGSIRFCIDFRYLNSVTKFDAYPTPRISDLTDRLGTSKFLTTIDLSKGYWQ FT IPLTPQSRELTAFKTPWGLFHFKVLAFGLHGAPASFQRLMDQVLRGLPFTA FT AYLDDIVVYSDTWQQHLQHLQEVLQRLQAAGLTVNPQKCTIAKAETEYLGF FT VIGKGVLRPQVEKVRAI" XX SQ Sequence 4522 BP; 1237 A; 997 C; 1136 G; 1152 T; 0 other; agtggtgtca gaagtgggat tttgccagta gggggcatag tttttttttt ttttgctttg 60 gtgaactttc tgggccatga accgaggagg acgaagagga gccccaaagg ctgtttcagt 120 ggagcaactg gatgaagttg cacaagagct gggggatgaa gaattacagg aggtgagaga 180 atttgatgtg gagactgaat gcactgaacc aacaatagct gacttgacag gcttgttgag 240 agctcacatg gcaaaaaatg gaggctcagg aagcacaaag gatggcagag ctggcccagc 300 aggaacgccg attcaaagct ctccatcatc agtttggcct gttgcagctg gaggtgcagg 360 ctcgcacaac tcctgttcca aatactttgg taacatctca agatgatttg gatcacccag 420 atgagggtgc tggtccaagt aggccattat ctcagtcaag tacctgtcgg agagagggag 480 ctgaagcccg tgatacaggt cagtgccaac ttaaagagcc taaattggaa aagttgtctg 540 atggtgatga tgttgagcat ttcctaatta cttttgagag aatggcagca gtttgccgat 600 ggccaaaaga ggagtgggta tttcatctga ttccccattc tgacgggtaa ggcccgggct 660 gcttatgttc acatggatgt tgatgaatct ttgaattatg atgaacttaa atctgcagta 720 ttaaggaaat atgacatcag tcgagaaacc tacaggcaga ggtttcggtc tcttgaagta 780 gaacaccttg aaagccctaa agaactatat gtgaggttaa aagaactgta tgggaaatgg 840 atgcagccaa aaagtaagtc tattgaagaa gttggtgaag tgatcattct tgaacaatac 900 ttgagaatgt tgtctccaga gctacaggtg tgggtacgag aacatgatcc acagtcggct 960 tcacaagcgg caacactggc agatgtgttt gtggcagcac gtcagagaaa tcagtcctgg 1020 tggtggaaat ccagccgaga tgaccggaga gcaaatcctc cccagcagtc tatgagaaac 1080 ttccctggtg ctggtaagcc tcctggagga gggcctgcat ttgtcaatgc tcctaagaat 1140 tataggaaaa tgccaatatg ttacctatgt aatcaagaag gtcataccaa gcccatgtgc 1200 cctagaaatg taatcaagaa tactcacctg tgttatgtgc ccaggcatct tgcaaagcca 1260 gcccctaaag ctgaacgctc accagcaagc accactgttg agattaatgg aaaggaaatg 1320 attgctttgg ttgataccgg gagtgatcag actttggtac acatgaagtg catttctcca 1380 gccctccttc gctacaccaa cctgaaggct gtccgctgtg tgcatggtga cgaaaagctg 1440 ctacccaccg cagaggtcta tctgaaggtg agaggtcaga cttacttctt ggaggtcggt 1500 gtagctgaca acctaccttt ccctgtagtc ttggggcatg acctgccagt actgtgggac 1560 ttattgcagc caccatccac ctgtaacatg gttgttactc gagcccagag ccataagaga 1620 gacgaaagta aagagttgct tgctacgcta ccattctttg atgcagaaat cgatggctca 1680 actaaaccta ggaaatccag acgacagaaa cggtttgaga aaatccaata taatgcttcc 1740 gtaatgcctg ttgataagtt tgctgaattc tcacccaaat ttaagctgcc ggtcaacatt 1800 gtgcagatgc aacaagagga tgccagtctt gttccttatc tggagagagc aaagcaggag 1860 gatggaaaaa ctgttgatgg agagacaaag gaaagatttt gtatgtacca gggtctacta 1920 tatcgtcaga tgggacaagt gacacaactg gtggtaccac agtgtgtccg tgaagttatc 1980 cttactctgg gtcactcgat tccttgggct ggccacctgg ggaaacgtaa aaccatcgcc 2040 cggatcagaa aacacttcta ttggcctggt ctgaactcag aagtagccca gtactgtaaa 2100 acctgccctg aatgtcaaaa agtgtcctct gactgcccaa ggcgggtccc tctccaacca 2160 ctgccactca taggcactcc attcgaaaga ctgggaatgg acatagttgg accagtggag 2220 aaaagtcgtt cgggtaaccg atttatgctt gtagtgactg actatgccac taggtatcct 2280 gaagtgtttc ccttaaggtc agttaaggcc aagtacgtgg ctacctgtct agtgcaactg 2340 ttctccagag ttggctttcc aagtgaaatt ctaaccgatc aagggactaa ttttatgtcg 2400 aatcttctga agcaggtcta taggttgctg ggcatcaaga gtttgagaac cactccttat 2460 catccccaga cagatgggtt aaccgaaaga tttaaccaga cgttaaagca gatgctacgc 2520 aagtttgttg gtgagtctgg cactgattgg gatcaatggc tcccatatct tctgtttgcc 2580 tacagggaag tgccacaagc ctccactgga ttttccccgt ttgagttgct ttatggtcat 2640 gaagtcaaag gtcccttagc attactccga gaagtttggg agggaaaccc acagagaaat 2700 gatgaaacta atgtggcatc ctatgtgatt cagatgaggg aacgtttgca aaagatgtct 2760 gcactagctc agtctcattt ggcagaagct caaacacgcc agaaaacctg gtacgatcaa 2820 tccgctcggg agcggcagtt taagcccggt cagaaggtac tcgtgatgtt gcccagtcat 2880 gaaagtaagc tgcttgcaaa atggcagggg ccgtatgaaa ttaaaaggaa actaggtcct 2940 accacctatg aagttgctct atctgaccat gctcattcta cacgtactct gcatataaat 3000 ttgcttaaag aatggtttcc tcgtcctcct gaatctagcc acagcttgat gatccggcaa 3060 gtcaaagaag aggatgattc tgaagtattt ctgcctcagt cagctctagg agatgtggat 3120 ctgagtcacc tgtccccaca gcaacagcat gaggtgagag aaatttgttt accagaagtc 3180 ttctcagaat accctggctt taccaccttg attgaacata atattgagtt gaaacccgat 3240 gctgtagttc gaagaatgag ttacagggtc cctgaacgtc tccaagaagc gctgaaagag 3300 gaggtggatc tgatgctgag gctgggaatt atagaaccct ctaaaagcga atggtgccac 3360 cctgtggtcc ttgttccgaa aaaggatggg tctataaggt tttgtataga ctttcgttac 3420 ctcaactccg tgaccaagtt tgatgcttac cctactccac gaatcagtga tctcactgat 3480 cgactgggca cctcaaagtt tctaaccacc attgatcttt caaaaggtta ttggcagatt 3540 ccgttgactc cacagtccag agaactcact gcgtttaaaa caccctgggg cctattccat 3600 tttaaggttc tagcctttgg actgcatggg gcaccggcca gcttccaaag actaatggac 3660 caggtactgc gaggactacc atttacagct gcttatctgg atgatattgt ggtgtacagt 3720 gacacatggc agcaacatct gcagcatctc caggaagttc tgcaacgcct tcaggcagcc 3780 ggcctgacag ttaaccccca aaagtgtacc attgcaaagg cagagacaga gtacttgggg 3840 tttgttatcg gtaaaggggt gctccgaccc caggtggaga aggtacgggc catttgaaaa 3900 gtgtccacag ccacaaactc gtaaggagtt aaggtccttt ttgggtatgg ctgggtttta 3960 tcacagattt attcccaact tttctgggcg ggctgcagta ttgactgaca tggttggttc 4020 aaggagtccc aatcagctga agtggaccca agaaactgtt gcagctttcc aagatctcag 4080 aaaggccttg agtaaagatt ctgtcttgca taaccctgat ttccaccaac ccttttgttt 4140 tgcaaactga tgcttcagat cgaggcctgg gagctgtatt gctacagggc agccctgaca 4200 ctcgacgccc agtggccttc ttgagtcgaa agctcttccc cagggaagtt cgctattcca 4260 ttgtggaaaa ggaatgcttg gctgtgaagt gggcattgga ctctttaaag tattatctgc 4320 taggccggga gtttatactg gagactgatc ataaagctct acaatggctt caagaaatga 4380 aagataccaa cagtcgtatt acccgctggt acctggccat gcagcccttc cggtttatta 4440 tcagacatgt ccctggcaaa gaaaacctta cggctgacta cttgtcccga tgtgccagtg 4500 acattcccga agggagggag ta 4522 // ID Gypsy-115-I_DR repbase; DNA; ZEB; 4321 BP. XX AC . XX DT 16-NOV-2008 (Rel. 13.11, Created) DT 16-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-115-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4321 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1785-1785 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 18..962 FT /product="Gypsy-115-I_DR_1p" FT /translation="MDPASISGLAEVVAHATTRMNRHDDQMVTISRAVQAL FT VTQISDLTTRLQRLHAVGLQQPVAANLSPSMPVPERTAGFSEPRLPPPAFY FT SGEPEFCRAFLAKCSLYFSLQPSSFSTEESKVAFVITLLTGRAAQWGTAVW FT ERKLPCCSSFDTFSTELKKVFDRGLAGREAARALTELQQGQRSVADYSIEF FT RTLAAESEWNAQAQWDVFLHGLADQIKDKIFTLDLPKTLDELIDLAIRVDS FT RLQLRERNTRRTHALVRSPFPAVTVVPGQEATPLADPEGMQMGRSRLSLEE FT KRRRRSEDLVFHSGGVDHIAIH*" FT CDS 1057..4134 FT /product="Gypsy-115-I_DR_2p" FT /translation="MLEWSSGNFSTQAIIDSGAEGNFIDSALVKKLRLPVI FT SLSQPISVHALNGSSLPSITHSTGPIRLITSGNHSEIIHFFLTEAPVTPVV FT LGHPWLVIHNPHINWRQESVISWSESCHATCLLSACSSVSRSVFQEEHMDL FT SNVPKEYLDLKRVFSKSRAASLPPHRPYDCAIELLPGTSPPKGKLYSLSVP FT EREAMEKYISDSLAAKIIRPSSSPAGAGFFFVKKKDGSLRPCIDYRGLNAI FT TVKNTYPLPLMSSAFERLQGASFFTKLDLRNAYHLVRIREGDEWKTAFNTP FT RGHFEYCVLPFGLSNAPAVFQALVNDVLRDMLDQFIYVYLDDILIFSHSLQ FT EHVQHVRRVLQRLLENGLYVKAEKCVFHAQSVPFLGHIVSVEGMRMDPEKV FT QAVVDWPTPDSRKALQRFLGFANFYRRFIRNFSQLAAPLTALTSLKTPFRW FT SNAAQVAFDRLKSCFVSAPILIAPDPSRQFVVEVDASEVGVGAILSQRSSS FT DGKMHPCAYFSHRLNNAEQNYDIGNRELLAVKLALEEWRHWLEGSGVPFIV FT WTDHKNLEYIQSAKRLNSRQARWALFFGRFDFSISYRPGSKNVKPDALSRI FT FDHSERASSPETIVPRRLFISAVTWEIESRVRMALEGVTPPPGCPPSRLFV FT PEELRSDVIRWGHSSKLACHPGVSRTLYLIKQRFWWPVMARDIRNFVLACS FT VCAVSKTSNRPPAGLLQPLSVPSRPWSHIALDFVTGLPPSNGNTVILTVVD FT RFSKATHFIPLPKLPSARETAAAVIDHVFRIHGLPTDVVSDRGPQFISKFW FT REFCHLMGATVSLSSGFHPQSNGQTERANQDLERMLRCLVSQNPSSWSQQL FT SWVEYAHNSLPVSATGLSPFECSLGYQPPAFPSLESEVAVPSAHAFVQRCR FT RTWNRARQTLLQVGLRTKAKADRHRSKPPVYVVGQKVWLSSKNIPLRTVCN FT KLAPKFIGPFSVTKIISPVAVRLKLPPAYRRIHPVFHVSKLKPVFHTAINP FT PTPVPPPAASRRRGDGLFG*" XX SQ Sequence 4321 BP; 938 A; 1116 C; 1073 G; 1194 T; 0 other; gaacggactg atcaaacatg gatccagcga gtatctccgg cctggctgaa gttgttgcac 60 acgctaccac acgcatgaac agacatgatg atcagatggt caccatcagt cgcgctgtcc 120 aggcgctggt cactcagatc tctgatctca caacacggtt gcaacgtctt catgctgtcg 180 gtcttcagca gcctgttgca gctaacctta gtccatctat gccggttcct gagcgaacag 240 ctggattttc cgagcctcgc cttccacctc cagcgttcta ctccggagag cctgagtttt 300 gtcgagcctt tttagcgaag tgctcattgt acttttccct gcagccttcc tcgttttcca 360 cggaggaatc gaaagtggcg tttgtcatca cgcttctcac cgggagagcg gcgcagtggg 420 gaactgctgt gtgggagagg aaactcccat gctgctcttc atttgacacc ttttcgacag 480 agcttaaaaa ggtattcgat cgaggtttgg cgggcagaga ggcggcacgg gcgctaacag 540 agttgcagca ggggcaacgg agtgtcgctg actactccat tgagttccgc accctggcgg 600 ccgagagcga atggaacgcg caggcgcagt gggacgtctt tctccatggg ctggcagacc 660 agataaagga taaaatattc accctggatc ttcctaaaac cctcgacgaa ctcattgatc 720 tggccattcg tgtggattct cgtcttcagc tcagggagag aaacacgcga cgcacgcacg 780 ctctggtgag atcacctttt ccggctgtca cggtagttcc ggggcaggag gccacgcccc 840 tcgcggatcc cgaaggcatg cagatgggcc gctctaggct gtcgctggag gaaaagcgca 900 gacgccgcag cgaggacctg gtgtttcaca gcggtggtgt tgatcacatc gcgattcatt 960 gagggtaaaa gccagagccc gtcagtagaa aagagtttac tgacgggtgg agtatctttg 1020 aacaaatctt cccttgcagc tactcttttg aatgttatgt tggagtggtc atctggcaat 1080 tttagtactc aagcaattat tgactctggg gcggagggaa attttattga ctctgcttta 1140 gtaaaaaaac tcagacttcc tgtgatttct ctctcccaac ctatatctgt tcatgctctt 1200 aatggttctt ctcttccgtc cattactcat tcaactggtc caataagact gatcacctct 1260 ggtaaccact ctgaaattat tcattttttt ttaacagaag ctcctgtcac tccagtagtc 1320 cttgggcatc cttggctggt aatacacaat ccacatatta actggagaca ggaatctgta 1380 atatcttgga gtgagagctg tcatgctacg tgtttgttgt ctgcctgttc ttctgtgtct 1440 cgttctgttt ttcaggaaga acacatggat ctgtcaaacg tgcccaagga gtacctcgac 1500 ctgaagagag tgttcagtaa gtctcgagct gcttctctgc ctccacatcg tccctatgac 1560 tgtgctatag agttattgcc aggtacgtct ccgcctaaag gcaaacttta ttcgctctct 1620 gttccagaga gggaggccat ggagaaatat atttctgatt ctctagcagc caagatcatc 1680 cgcccttctt catcaccagc aggggcgggg tttttttttg tgaaaaagaa agatggctcc 1740 ctccgtcctt gtatagacta ccgagggctg aacgctatca cggtaaagaa tacatatcct 1800 ttgccgctga tgtcttctgc cttcgaacgt ctgcaggggg catccttttt cacgaaatta 1860 gacctccgca acgcttacca tttggtccgc attcgagaag gcgatgaatg gaagactgcc 1920 tttaataccc ccagggggca ctttgaatat tgtgttcttc ctttcggtct gtccaacgcc 1980 cctgcagtct tccaagcact cgtgaatgac gtgttgagag atatgctaga tcagtttatt 2040 tatgtttacc tggatgacat actgatattt tctcattctc tccaggaaca cgttcaacac 2100 gtcaggcgag tgcttcagag gctgctagag aatgggcttt atgtcaaggc ggagaaatgc 2160 gtttttcatg cacagtctgt tccttttctg ggacatatcg tgtcagtcga ggggatgcgc 2220 atggatccag agaaagttca ggctgtggta gattggccaa ccccagattc ccgcaaggcc 2280 ctgcagaggt ttcttggttt tgccaatttt taccggcgtt tcattcgcaa cttcagccag 2340 ctcgccgcac ctctgactgc cttaacctcc ctcaagactc cgttcaggtg gtccaatgca 2400 gcgcaagttg cgtttgatcg tctgaaaagc tgctttgttt cagcacctat tcttattgcc 2460 cctgatccat cacggcagtt cgtggtggag gttgacgcat cagaggtggg ggtcggtgct 2520 atcctgtccc agcgctcttc ctcggacggc aagatgcatc cctgcgcgta tttttctcat 2580 cgtctcaaca atgctgaaca aaattatgat attggtaatc gagagctgtt ggccgtcaag 2640 cttgcattgg aagagtggcg tcattggcta gagggttcgg gggttccctt tatcgtttgg 2700 accgatcata aaaatctgga gtacatccaa tccgccaaaa gattgaactc taggcaggcg 2760 cggtgggctt tatttttcgg acgtttcgat ttttccatct cgtaccgacc aggttccaag 2820 aatgttaaac ctgatgcact ttctcgaatt tttgatcact ctgagcgcgc atcatctcct 2880 gagaccatcg tgccacgcag actttttatc tctgctgtca cctgggagat cgagtcgagg 2940 gtccgcatgg ccttagaagg ggtaacgcct ccgcctggat gcccaccgag tcgtttattt 3000 gtgccagagg aattacggtc tgacgtcatc cgttggggtc actcttccaa gttggcctgt 3060 catccagggg tgagtcgtac cttgtatcta atcaaacaac gattctggtg gccagttatg 3120 gctcgtgaca taaggaattt tgtgttggct tgctcagttt gtgctgtttc taagacttcc 3180 aatcgccccc ccgctggact tcttcagccg ctgtcagtgc catcgagacc ctggtcccat 3240 attgcgctag atttcgttac gggtcttccc ccctccaacg ggaatacggt gattttgacc 3300 gtagtggacc ggttttcgaa agccactcac tttattcctc tgcccaaatt accatcagcc 3360 agagagacag cggctgctgt cattgaccat gtctttcgta ttcatggcct cccgacggat 3420 gtggtttctg acagggggcc tcagttcatc tctaaatttt ggagagagtt ttgtcattta 3480 atgggggcga ctgttagtct ttcttctggt ttccatcctc agagtaacgg tcaaacggag 3540 agagccaacc aagatcttga acgcatgtta cgatgtttag tctctcagaa tccatcctcc 3600 tggagccagc agctctcatg ggttgagtac gcacataact cattaccagt gtctgccacg 3660 ggcctctctc catttgagtg tagtttaggg taccagccac ctgcttttcc cagtctggaa 3720 tccgaagtcg cggttccctc tgcccacgcc tttgtccaga ggtgtcgacg tacttggaat 3780 agggccagac agaccctcct tcaagtaggt ttgcgcacca aggctaaagc cgatcgccac 3840 cggtctaagc ctcccgttta cgttgtcggt caaaaagtgt ggctttcatc taaaaatatt 3900 cctctccgca ccgtatgtaa caaacttgca cctaaattca ttggcccgtt ttctgtcacc 3960 aagatcatta gtccggtggc agtccgcctc aaacttcctc cagcgtacag gagaattcat 4020 cccgtttttc atgtctctaa attgaagccc gtttttcata cggccattaa tccgcccact 4080 ccggttcccc cccccgccgc gtctcgtaga aggggagacg gtttattcgg ttaagcgcat 4140 tctggactcg agacggaggg gacgaggatt ccagtacttg gtggactggg agggttacgg 4200 tccagaggag aggagttggg ttcctgcaag ggacatattg gatcattctc tcatcgatga 4260 ttacaatcgc caggtaaact cttctgggag cgccaagagg cgctcttagg agagagggta 4320 c 4321 // ID Gypsy150-LTR_DR repbase; DNA; ZEB; 247 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy150-I_DR; KW Gypsy150-LTR_DR; LTR67_DR; LTR_Retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-247 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1562-1562 (2008). XX DR Genome; chr4; Positions 40545621 40545867. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 247 BP; 60 A; 40 C; 31 G; 116 T; 0 other; tggtcaaaca tcttatgtca tttgttatct gtgaaattca atgtcccttt gtaaatatct 60 ttctatatat attttttttt tggcattcat tattatcaat atttttgtaa ataaaatcac 120 gtattgcatt tagctggtct tggtttttgt ccctcattta ttgatgttgt gtggttgctt 180 atgtgtttca cccaaaagtt aatgttactc ctcttcccct ttatttaaca tcttttaaat 240 cgtaaca 247 // ID EnSpm-N6_DR repbase; DNA; ZEB; 346 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE EnSpm-N6_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-346 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(10), 1266-1266 (2008). XX DR [1] (Consensus) XX CC Copies are ~94% identical to the consensus, which is a CC palindrome. The TSD is 2-bp long. The 11-bp terminus of CC EnSpm-N6_DR is 80% identical to the EnSpm-6_DR terminus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 346 BP; 110 A; 59 C; 62 G; 115 T; 0 other; cccagcaagc attttttttg tttttaaaag atgtctaata gatgtctaat agacgtctaa 60 acatagtcgt cttggctaaa acaaggctaa atttgggctg tcagtgaaaa tctaatagac 120 gtctaagaat aggccaaaac tagactagtc atcaaataaa cagaaatgaa tgactacaca 180 tataaagtct gtctaatctg tctatttgac gactagtcta gttttgggct attcttagat 240 gtctattaga ttttcactga cagcccaagt ttagccttgt tttagccaag ctgtctacgt 300 ttagatgtct attagacgtc tattaaacac aaaattgttt gctggg 346 // ID HATN16_DR repbase; DNA; ZEB; 944 BP. XX AC . XX DT 16-JUN-2003 (Rel. 8.05, Created) DT 16-JUN-2003 (Rel. 8.05, Last updated, Version 1) XX DE HATN16_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HATN16_DR; nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-944 RA Kapitonov V.V. and Jurka J.; RT "HATN16_DR, a family of nonautonomous hAT-like DNA transposons RT from zebrafish."; RL Repbase Reports 3(5), 90-90 (2003). XX DR [1] (Consensus) XX CC HATN16_DR is a family of nonautonomous DNA transposons that CC belongs CC to the hAT superfamily. CC Several hundred copies of HATN16_DR are expected to populate the CC zebrafish genome. They are ~8% divergent from the consensus. CC This element is characterized by 8-bp target site duplications CC and CC 16-bp terminal inverted repeats. CC The 45-bp 3' terminus of HATN16_DR is 89% identical to that CC of HATN6_AG from A. gambiae. XX SQ Sequence 944 BP; 336 A; 184 C; 174 G; 248 T; 2 other; caggggtgtc caaactacgg cccgcgggcc atctgcggcc cgcaatcagt tttgtggcgg 60 cccgcgaggc tttttataaa tattaataga atctggcccg ctatacaaaa atgaacgtaa 120 ttcaataaat aaccaccggg tgtcgctatt acatgccttc aattaggcag cagttcttgt 180 tatgaagtaa aacgaaccga acaaactgaa agaaacataa ttcataatca cgttttggcg 240 agccatggca caagcaagaa aaaggaaaat caacagtgag tgcaggaaat ttcagtcacg 300 atggggcaaa gaatatttct tcacagaggt cagtggaata tgtgtctgtt taatttgcca 360 ggaatcagtt gcagtaatga aggaatataa tattaaaaga cattataaaa taaaacatca 420 ggccttcagc tyttacactg gtgccgaacg agatcaaaaa gtaaaacaaa tagcagctgg 480 cttatcagct caacaacagc agtttttttt tgtgctaata aagtgcaaga aaattctacg 540 ctgrctagtt atgatgtagc ccagctaatc gcacagcaca ggaaaccttc actgaaggag 600 aatttataag ggaggagtgt tgcaagtgta acttgcccag agaagattca agatttaaga 660 aagtctttct agaaacacag ttggacgatg aattaaaata accaattaaa ataattctat 720 aacacacaca cacacaggcc acacacacac acacacacac acacacacac acacacacac 780 acatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 840 atacacgtgt ctgtgtaatg tatgtaaaat ttggcccggg acaacgtttt tttttttttt 900 ttttttgcat ctggccctcg gcccaaaaag tttggacacc cctg 944 // ID Gypsy81-I_Dr repbase; DNA; ZEB; 7082 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.05, Created) DT 17-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy81_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; endogenous retrovirus; Gypsy superfamily; KW gag; integrase; reverse transcriptase; Gypsy81-I_DR; KW Gypsy-81-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7082 RA Dib M.R. and Naveira H.F.; RT "Gypsy81_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 956-956 (2009). XX DR [1] (Consensus) XX CC Gypsy81-I_DR is an internal portion of the Gypsy81_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy81-LTR_DR. CC Gypsy81_DR is characterized by 4-bp target site duplications. The CC internal portion encodes two proteins: the 450-aa gag CC Gypsy81_DR1p (pos. 441-1790) and 1553-aa polyprotein (pos. CC 1757-6415, conceptual translation)composed of the protease, CC reverse transcriptase, and integrase domains.Some insertions CC fairly recent, CC according to the hight identity between their flanking LTRs. CC Consensus obtained after the alignment of at least three CC independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 441..1790 FT /product="Gypsy81-I_Dr_1p" FT /note="Gag-protein." FT /translation="MMHSNKDTGSLGEYHLGRGRGVLPFSSLCIPGEMFNT FT DSDVNRVQFSTPVSHDIPPLEMSIAEPTVTLSNEQVRQLSADISQSIKASL FT FPSNQLPTSNSCLDCQKCSHVNQCGTTIIDASKMNLVLKSGKEPPYFRGDS FT ADKCSIFEWEDLMRAYLEKGNYSKQECIDELINKLMGRARDVIQIWLRNSA FT SVSRTVEVDMVFRILKQHFSGVISTGLPLADFYATKPFHNEGPYDYWIRLN FT KAADLAEQHLKSEGRCLQDRDMEIAVMFIRNCPDKGLSMMFLSKPQREWTA FT VEVQDRLDEFVRDRKVYECQFSQQAASAVNTDVYGTPTCKPQTVRPSGSAF FT EPDQAVCENNTMDKMLNMLEKALSCNTQSSRATGHHKYNVRRPQVCKVCQS FT NNHSTVAHCRLHRLCFSCYLPGHTRDMCEMSASQPVPNSSTEAGDGHRQKP FT SQGNC" FT CDS 1757..6415 FT /product="Gypsy81-I_Dr_2p" FT /note="polyprotein." FT /translation="RSQAETLSGKLLSLHLEEGDGRHINSTSVDDDDIESV FT YAFVCENAPKSSKVILQNTMRTAQSESLFYTNVLVQDKIELKGMLDTGSMA FT TTISADTIPMLKQAGIMTDDMLASADIVLVGCGGKQSSPLGLCDLKFEVYG FT YKFIVPVLFVEGQIDPIILGTNVLKPLIRHFKSNDGFWRVMGKPDAACQSD FT SSQFLRLLSNLERWRGESIPDKVGTLKLKNAVTLQPMTEHLVWGRLPTGTV FT LSAGSTVLIEPCTSPCVNKQIMVGRIVSPLWGDGWLPIKIINPTSTEITLR FT RNSKVADVAPCIALEDFDQSANSFTVQQNVGKMQSSSSCCSLTARSLCSDP FT SDVNEKLEMLGLSSIPIGDCEVSAQCKEKLVDLIETYESVFSRHHLDCGEA FT QDFCHRIRLTDDRPFRLPYRRISPGHYQKLKETLDEMEAKDIIRKSCSEYA FT SPLVLVWKKNGDLRICTDFRWLNARTVKDAHPLPHQSDVLAALGGNMFFST FT MDLTAGYYNIPLHELDRKYTAFSSPLGLHEYNRLPQGLCNSPATFMRMMLT FT IFGDQNFLSLLCYLDDLLVFAKSEEESLTRLEMVFQRLKVHNLKLSPSKCQ FT FLRRSVKFLGHVISQHGISRDPDKVESIVKVTEADLMELDGVSPSPSKIRS FT FLGMVVYYQHFIENCSVLAKPLFSLIGGPKRPRKLKGRKRPVLYRKLGPDD FT WTVECKKAFEDLKAALVEEVLLAHPDFNKPFILSVDASTAGLGAVLSQIHN FT NSDKAQPIAFASKSLNHAQAKYPAHRLEFFAMKWAICNKFSHWLRGHRFTV FT WTDNNPLKYILTKPRLDACEQRWVAQLAPFNFDIQYIPGTKNVVADALSRE FT PFVQPKILHRLTRAPYSTLLKQAANFKNDDVQNMFRLSAELSELRDEHGQE FT YGDAVLKASCTAASVYVHSEEISAVMQKHQHWEDAGTVRAISHVQYLESLA FT AMSQNPVPAFNHDELLAEQANDAVVSRVRFFIDRGRLPSRREKLHEARGTL FT KILKQWGKLTVRLGIVYRVSKHPVTKQKMYQYVVPESLRMTVLKGIHDDVG FT HQGQQRTLGLARQRFYWETMEKDVKEYVHNCKRCILSKAPEPEARAPLVSI FT TTSSPLELVCIDFWSAEDQNNKSIDVLVITDHFTKLACAYPCPNQTAKTVA FT RVLWNNFFCIYGFPVSIHSDQGANFESLLIAELCQLAGIDKTHTTPYHPMG FT NGQVERMNRTLGNMIRALPPRSKAKWPQLLNTLTFAYNCTVHETTGFSPFF FT LMYGRTPRLPVDLMFESVLLNEETVNIDKYVQSLRRDLCEAIKLAQSHSEK FT QQRKQADMYNKRCKGQSVDIGQRVLLANKGERGKKKLADRWESTVYVVVGK FT NSMLNTYKLRNPSGVVKTVHRNLIMPVDFLPLPDTDAVEEQLSNVSDDELE FT DSSIENVAEDRTTQWVADLSTSTGGDGSVSHELDERSVMCEEIDQASDRTQ FT DEGDFTAIAADRSNVADVDSEVGVRSVHEGSGSGASRLVHPSASEQTESLL FT NRPSQSEGFRTRHGRLVKPLNRLIQTMSNQFVKNLFG" XX SQ Sequence 7082 BP; 1962 A; 1351 C; 1693 G; 2076 T; 0 other; ctggtgacgt gacccgcgcg attgttcagc ggctgccgaa cacccgagat tgctgcacgg 60 accgcgagac gtcggatttg tgcttcaccg accgcgagac atcggacttc tcgtgcttca 120 cggaccgcga ggcgtcggac ttctcgtgct ctttccgctt caacgagggg actcttggac 180 taaagttgcg gtgattagac gacgtcagac tttgctactt gatcgcgtcg tcgagggaac 240 gcacgtttca acgtatcctt ataccagctg gactaaatgt gttatatgga gtataattag 300 gtgaattctc gaattttaag tgattgattt attgtgtacc agactaattt aagtttgccc 360 tgattatata gctgtgggct tattgaaaga aaaaaagtgt atttcactat tttcttttat 420 tattttatgc attataatta atgatgcact ctaacaaaga tactggcagt ttgggtgaat 480 atcatttggg gaggggcaga ggggttttgc ccttcagtag tttgtgtatt cctggggaaa 540 tgtttaacac agattctgat gtaaatagag tacaattttc taccccagtg tctcatgata 600 tcccacctct tgagatgagt atagctgaac ctactgtgac tttgtccaat gaacaagtga 660 gacaactgag cgctgatatt agtcagtcga ttaaagctag tttattccct agcaatcagc 720 tacctacatc aaactcgtgt ttagactgtc aaaagtgctc acatgtgaat caatgtggta 780 ccacaatcat tgatgcttca aagatgaatc ttgtcctgaa gagtggaaaa gaaccaccat 840 actttcgtgg tgacagtgca gataaatgtt cgatctttga atgggaagat ttaatgagag 900 cttaccttga aaaagggaat tattccaaac aagaatgcat tgatgagctc ataaacaaat 960 taatgggcag agcgagggat gttattcaga tttggttgcg caacagtgcc tcagtttcaa 1020 gaactgttga agtagatatg gtgtttcgta tcctcaaaca gcactttagt ggtgtaatat 1080 ccacaggtct ccccctggct gatttttacg ccaccaagcc attccacaat gagggtcctt 1140 atgattattg gatcagattg aataaagctg ctgatttagc tgaacagcat ctgaaaagtg 1200 agggtagatg tttgcaggat cgggacatgg agatcgctgt aatgttcatt cgtaattgtc 1260 ctgataaagg tctatcaatg atgtttttga gcaaaccaca gcgtgaatgg acagcagttg 1320 aggtacagga ccgattagat gaatttgtga gagaccgtaa agtttatgag tgtcagttct 1380 cacaacaagc agctagtgct gtcaatactg atgtgtatgg cacaccgact tgtaagcctc 1440 aaactgttag accatcagga agtgcgttcg aacctgacca ggcagtatgc gaaaataaca 1500 caatggacaa aatgttaaat atgctagaga aggcattgtc atgtaacaca cagtccagca 1560 gagctacagg gcatcacaaa tacaatgtta gaagaccaca agtgtgcaaa gtatgtcaga 1620 gtaataacca cagcacagtt gctcactgca gactgcacag gctgtgcttt agttgttacc 1680 tacctggaca tacacgtgat atgtgcgaga tgtctgcatc acagcctgtg ccaaacagtt 1740 caactgaagc tggtgacggt cacaggcaga aaccctctca gggaaactgt tgagcctcca 1800 tctggaggag ggcgatggta ggcacataaa ttcaacctcc gttgatgatg atgatattga 1860 atctgtatac gctttcgtgt gtgaaaacgc tccgaagtcg agcaaagtaa ttttgcaaaa 1920 tacaatgaga actgctcaaa gtgaaagttt attctacact aatgttttag tgcaggataa 1980 aattgagtta aaaggaatgc ttgatactgg atcgatggcg actaccatca gtgcagacac 2040 aatacccatg ttaaaacagg ctggcataat gactgatgac atgctggcgt ctgcggatat 2100 tgttttagtg ggttgtgggg gaaagcaatc cagtccatta ggactttgtg atttgaaatt 2160 cgaggtttat gggtacaagt ttatagtccc agtcttattt gttgaaggac agattgatcc 2220 cattattttg ggtacaaatg tgcttaaacc actgatcaga catttcaagt cgaatgacgg 2280 attttggcgt gttatgggta aacctgatgc agcttgccag agtgacagta gtcagtttct 2340 tcgccttttg tctaacctgg agagatggcg aggggagtcc atcccagata aggttggcac 2400 attgaagtta aagaacgctg taactcttca gcctatgact gagcatttgg tgtggggtag 2460 gttgcctacg ggcacagtgt tatcagcagg cagtacggta ctcattgagc catgtacgtc 2520 gccttgtgta aataaacaga taatggttgg ccggatagtt tctcctttgt ggggtgatgg 2580 ctggttaccc ataaagatta tcaaccccac ttccactgaa ataactttgc gtagaaactc 2640 aaaagttgca gatgtcgccc cctgcattgc tttagaggat tttgaccagt cagctaactc 2700 tttcactgtt cagcagaatg tgggaaaaat gcaaagcagt agttcctgct gtagtcttac 2760 tgccagaagt ctttgcagtg accccagtga tgtgaatgag aagctggaaa tgctagggct 2820 aagctctatt ccgattggtg actgtgaagt ttctgcacag tgtaaagaga agcttgttga 2880 tctgattgag acttatgagt ctgtcttttc aagacatcac ctggattgtg gtgaggcgca 2940 ggatttctgt catcgcatca ggttaacaga tgaccgccca ttcaggttgc catatcgtag 3000 aatatcacct ggccattatc agaaattaaa agaaacgtta gacgagatgg aagcaaaaga 3060 cataatccgc aagtcctgta gcgagtatgc ttcccccttg gttttggtgt ggaagaagaa 3120 tggcgatctg cgaatttgca cagatttcag atggcttaac gccagaactg ttaaagacgc 3180 ccatcctctt ccacatcaaa gcgatgtcct tgctgcattg gggggaaaca tgttttttag 3240 caccatggat ctgacggctg gctattacaa tattccctta catgaacttg acagaaagta 3300 taccgcattc tcatcaccac tgggattgca tgaatacaat cgtctcccac aagggctatg 3360 taatagccca gccaccttta tgcgcatgat gctgactata ttcggtgacc agaacttttt 3420 aagtcttctg tgttatcttg atgacctgtt ggtgtttgcg aagtctgaag aagagagttt 3480 gacacgattg gagatggtat ttcaacgcct taaagtccat aatttgaaat tgtcaccctc 3540 taaatgtcag tttttgagac gttcagtcaa atttctgggt catgttattt cccagcatgg 3600 aatttcaagg gatccggata aagtggagtc catagttaaa gtgacagagg ctgatttgat 3660 ggagttagat ggtgtctctc cttcgccatc taaaatacgt tcctttttgg ggatggtagt 3720 ctactatcag cacttcattg aaaactgttc agtgcttgct aaacctttat tttctctgat 3780 tggtggacca aaaaggcctc gaaaattgaa aggcagaaag agaccggtgt tgtaccgcaa 3840 acttggcccg gatgattgga ctgtggagtg taaaaaggcg ttcgaggacc tgaaggcagc 3900 tctcgttgag gaagtacttt tagctcaccc agactttaac aaaccattca tattgtctgt 3960 tgatgcatct acagctggac ttggcgcagt attgtcccaa attcataaca acagtgacaa 4020 agcacagcca atagcttttg caagcaaatc cctaaatcat gcacaggcaa aataccctgc 4080 gcatcgcctt gaattttttg cgatgaaatg ggcgatttgt aataaattta gccattggtt 4140 acggggtcac aggttcactg tgtggacaga caacaacccc ctgaagtaca tactgaccaa 4200 gcccaggtta gatgcctgtg agcagcgctg ggtagcgcaa ttggcaccgt tcaactttga 4260 catccagtat atccctggca ctaaaaatgt ggtggctgac gccctcagta gggagccttt 4320 tgtgcagcct aaaatattgc atagattaac tagggctccg tattccacac ttctcaagca 4380 ggcagcaaat ttcaaaaatg atgatgttca gaacatgttt cggctttcag ctgagctttc 4440 agagttgcgg gatgaacatg gtcaggagta tggggatgct gtgttgaagg ctagttgtac 4500 tgcagcaagt gtctatgtgc atagcgagga gatttctgct gtcatgcaaa aacatcagca 4560 ttgggaagat gcgggtacgg tgagagccat ttcacatgtg caatatttgg agtctcttgc 4620 ggctatgagc cagaaccctg tgccagcttt taaccatgat gaacttctag ctgagcaggc 4680 caacgatgct gttgtcagtc gagtcaggtt tttcattgac cgtggtcgtc ttccttcaag 4740 acgtgaaaaa cttcatgagg cacgtggcac attgaagata ctgaaacagt ggggtaaact 4800 aactgtgcgt ctagggattg tttatcgtgt gtccaaacac ccagtgacaa aacaaaaaat 4860 gtatcaatat gttgttcctg aatctctgcg aatgacagtg ttgaaaggta ttcatgatga 4920 tgtggggcat cagggccagc agcgcacatt aggtttagcc aggcaaaggt tttattggga 4980 aaccatggag aaggatgtaa aggaatatgt ccataattgc aaaagatgca ttttgagcaa 5040 ggctcctgag ccagaggcta gggcaccact cgtttccatt accacttcgt cgcctctcga 5100 gttggtctgt atagactttt ggtcggctga ggaccaaaac aacaagtcta ttgatgtttt 5160 ggtcatcact gaccacttca caaagttggc gtgtgcttat ccctgtccta atcaaactgc 5220 aaaaacagtt gcgagggttc tttggaacaa tttcttttgc atttatggct ttcctgtgag 5280 cattcattcc gaccagggtg ccaattttga gagtttgttg attgcggagt tgtgtcagtt 5340 ggctggtatt gacaaaacgc acactacacc gtatcaccca atgggcaatg gccaagtgga 5400 aaggatgaac cgcactcttg gtaacatgat tcgtgcgctg cctcccagat ctaaagccaa 5460 atggccacaa ctgctaaata ccttgacatt tgcttataac tgcacagttc atgaaacaac 5520 aggtttttca cctttctttc tgatgtatgg ccgtaccccg cgactacctg tggatcttat 5580 gtttgaaagt gtacttctga atgaagaaac cgttaacatt gacaagtatg tgcagtcact 5640 tagacgggac ctctgtgagg cgattaaact tgcacagtca cattcagaaa agcagcaacg 5700 taaacaagca gacatgtaca acaagaggtg taagggacaa tcagttgaca ttggtcagag 5760 ggttttgttg gcaaacaagg gtgaaagagg gaaaaagaaa cttgcggacc gttgggagag 5820 tacagtgtat gttgtggttg gcaaaaactc aatgttaaac acatacaaac tcagaaaccc 5880 ttcgggtgtg gttaaaactg tgcatcggaa tctgattatg ccagttgatt ttctgccatt 5940 gcctgataca gatgctgttg aggaacagtt atccaatgtg agtgatgatg aacttgagga 6000 ctccagcatt gaaaatgttg cggaggatag aaccactcaa tgggtagctg atctttctac 6060 cagcactgga ggcgatggtt cagtcagtca tgaattagat gaacggagtg tgatgtgtga 6120 ggagattgac caggcaagtg atcgtacgca agatgaaggg gatttcactg caattgccgc 6180 agacaggtct aatgttgctg atgtggactc tgaagttggg gttcgatcag tacatgaggg 6240 ctctggtagt ggggcttctc gtcttgtaca cccatcagct tcggaacaga ccgagtcgct 6300 cttgaacaga cctagtcaga gtgagggttt tcgcactaga catggcagac ttgtaaagcc 6360 gttaaatagg cttattcaga ccatgtccaa tcaatttgtt aagaacttat ttgggtgaaa 6420 caacattaga aacatgtttg ctgtgaaagc attgaaaata cgtttttttt ttcttattat 6480 tttttttttc tctcgctctt cattattgtt tccttatgaa ggttaacatt gaagtaaagt 6540 cttcaaagac ttatgccgtt aaatgggtat gttggaaatg tctaatatgt ttgagtttcc 6600 ttgggaaaaa aaaagaagga aacaatgttt aagctattgt gtatatatat atatattttt 6660 ttatttatgt attcaatctt atctcgaaat ggagtaatac acgtttgtgt gtagggtatt 6720 gagataaata atcatgcaat cttggatgtg atctattgaa aagatgtgta tatggcatgt 6780 cttcctatgg ttgggtaaag gagtagctaa cctttgtctt tcctagtcct tcaatggaca 6840 gcaatactgt tggtagactt tcttcagtct gtagtttaat gatcacactt gattgctggg 6900 taacttgact gtagagttga gtatttttgt ctatattttt ggatattttt ctcttgatgc 6960 tatttgtttt atgttctgtt gttttaattt ttattttctt taacttctct ctgagtatgt 7020 gttactcttc tatatagtgt ttcggatgtt atgtaaaatt tagtgaaatc tggtggggtg 7080 ag 7082 // ID DNA8-6_DR repbase; DNA; ZEB; 1064 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE DNA8-6_DR is a non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-6_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1064 RA Kapitonov V.V. and Jurka J.; RT "DNA8-6_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 256-256 (2004). XX DR [1] (Consensus) XX CC DNA8-6_DR is a nonautonomous DNA transposon. Its is CC characterized by 12-bp terminal inverted repeats and 8-bp CC target site duplications. It is expected to be a member of the CC hAT or P superfamilies. XX SQ Sequence 1064 BP; 334 A; 180 C; 210 G; 340 T; 0 other; tagggttgta acggtatgaa tttttcacgg tatgataatc gtctaaaaca ataccacggt 60 ttgacggttt cgcggtatac ggtatgttac aaatgttaca aaataataga acagtgaagc 120 aaatttgact ttttccaaat aatatatttt tagttactat aaacaacacc acttacaatg 180 aacaaataaa aataagaaaa taataaataa tgtgtcaaag tccaaataaa gtccaaataa 240 acatggtgca aatcctcagt aaaaaataga tataaatatt tactatacta taaatagtta 300 cataacgaaa ctagattcaa tatggaacat ccttaggttt tatgtgccag catggatatg 360 gttgtctatg gatatggatt tggatatggt tgtctgcaat gcagacaaac agctgcatct 420 tcatttgcgt cttttgaaaa cagcaagagc agcagttcgt ctttgctgtg tcactgtttt 480 gtcatgtttc tgtgctgctg tgcatgcaaa agtacttagt atgagaatta tttcatgcaa 540 aacttttttt tttgcgtttt atttagccgc gcataaatgt atgcctatct ctcattccgt 600 gttgttcaaa aagcttggtc cacaaacaaa agcgaaacct atgcttattg gttgtgatat 660 agcgagtttg aaccaatctg ggcatggagg agggacaatg catcaatgta tcatgtctga 720 tttgtccgga gacacagtga cgagcgtttc tttgtcaaat cagcgttgtc aaatgttgat 780 gacgtaaccg cactgattcc ggagcctctg aaagtccgcg aatgttatgt gatacagcac 840 tggaaagctg agattctctt ctttatgcca atctttgaat tgtatgaatc ggatcagcgg 900 atcaaaagtt attaaacatt taagagcaat acttattttt agccgcgggc ggctgtctcg 960 gtctttaagg gttaaaaccg ttgatatgca attgttcatg gtatgataat cgtgcacgtt 1020 caaatcgtgg taaaccgtca taccggtata ttgttacaac ccta 1064 // ID BEL27-I_DR repbase; DNA; ZEB; 6122 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL27-I_DR; KW BEL27-LTR_DR; BEL27_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6122 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1399-1399 (2008). XX DR Genome; chr20; Positions 22705015 22711136. XX CC Positions [5005-5574] - Integrase core CC 'TGCAT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 40..5961 FT /product="BEL27-I_DR_1p" FT /translation="MMTDHGSNLCTPAKVNEQIDNVQNKEDDEEIEVQSEE FT LDTNVRRSTREQTQTDRMLAYQREESHKAERKLMHAYEKWKAEARKARSQL FT KLDISESELASLIDSLEKEKDSVMNAYIRVRSNVTPPTDMRRKIDACDAVT FT KDIVKIAYERISGVDGGFDNETVKGHLRELLERDYARSVYGSTVSRISSKS FT STPISQPSLNSILMAKRIEAAAELAAKEAEYATVIEEREQREKLRLLEEKQ FT RKELEAQKGEFERLQAMKEVRAARARLEVYDKEETVDTVNQDEGLQQPVSP FT PTHKPVYLSSVNPTPNISLQRPNADVSQLAQAVQDSITLNRLPMPEPTVFS FT GDPIHFIEWKASFQSLIDKRHISSGDKLYYLKKYVTGPALKVLDGIFYRND FT EEAYKDAWKRLLDRYGQPFIIQRAFREKLACWPKIQSKDSVGLRNFSDFLN FT SCKDAMPHVKGLEILNDCEENRKLVSKLPDWAAARWNRQTTQTLSETQDFP FT TFQEFAHFMSVEAEVACNPVTSFHALHVSEPNKEKIYFKVSKPKANVFHTK FT TVTQHDNSKPTGKVNKPCLFCQNGEHQIHECSKFSARSLEERRQFVKDTRL FT CYGCLRLGHSAKDCRSRHCCNTCKGRHPTCLHDDSFNRKVRSSSAQSPENA FT HEGVATMSLSVESGCTPVNTSMIVPVWLSTHKEPVSEKLVYALLDTQSDSV FT FIECAVCKSLKVDSCPVTLKLTTLVGKDSLMSSERISGLRVRGFNSSLIID FT LPPAYTKECIPVDRAHIPTMETASHWKHLATLADKIPPLQNCEVGLLIGYN FT CSRALAPREVILGTENEPYAVRTDLGWSIIGPSLTHFEPQSSAAMCHRVSI FT KEIPAVTPTDVIKVLESDFKDTEGHTKVTSQEDIMFLRKLQENIRLNKDSH FT LEMPQPFRKRPYLPDNKPLAVIRLQHLKRRLMRDQEYREHYVTFMEEVIEK FT GNAEQVFEEGREGERWYIPHHGVYHSKKPGKLRIVFDCSARYKGTSLNDHL FT LTGPDLRNSLTGILLRFRQYPVALMCDVEKMFHQFHVDHADRDYLRFLWWR FT NGDFNSQPQTFRMTVHLFGASSSPGCANYGLKHLAREGERLYPLGSQFIMQ FT DFYMDDGVSSIESTEKAIKLAEEARQLCALGSLRLHKFVSNDKEVLKTIPP FT SECAVDVTAVDLALTDQPLERALGIYWSLEQDNFKFHITVKDQPATRRGIL FT SIVASLFDPLGFLAPFVLKGKTILQEMCRSGMGWDDNLPADLQSAWEHWKA FT DLVNLEKIEVPRCIMPSGFGRIIRREIHHFSDASMSGYGQCSYLRLENEQG FT DISCSLLMAKSRVAPLKITTIPRLELAAAVVSVAVNDMLKEEMNLADAEAF FT FWTDSQVVLGYINNEARRFNTFVANRVQRIHRTTTPQQWRYICSDENPADY FT ASRGLSVNNLVTSNWFRGPKVLWEKQIPPPMEISKQLPIGDPEVKKVQSLN FT TQTVQYSCLSDRLTKLSSWSKAIQAVARLIRRVRKDKSHNHSTLAERNDAQ FT CIIIKDLQKQTYAEEITLLCKGKRLPRSNRLYNLDTFVDQDGLLKVGGRLC FT EASIPNAVKYPVILPKEHQLTKLLIADCHEKMAHQGKGMTINEIRSRGFWI FT TGVNRTVASFVRQCVRCRKLRGPTEEQKMANLPSERIEPSPPFTYSGMDVF FT GPFITCKGRKSNKRYGLLFTCFCCRAIHIEMLDDMSTDAFINGLRCFIAIR FT GAVHQIRCDQGSNFIGAKNELTKAMEEIDTNRLVTFLAEKQCDFVFNAPHS FT SHTGGVWERQIRTVRSVLRSTLSQSSGRLDDSSLRTFFYEAMSIVNSRPLT FT VDSLTDPSSPEPLTPNHLLALKPTQALPPPGKFVREDVYARKRWRHVQYLA FT EQFWGRWHKEYVSNITTRQCWHTPRRNMQVGDIVLEKAVDLPRNEWRLARI FT IEAVTDKDGLVRRVKIQFGDRNLGKDGKRPHKPSVVERPVQKLVLLMEAA" XX SQ Sequence 6122 BP; 1895 A; 1276 C; 1417 G; 1534 T; 0 other; gtaagagctt gcacccactg acattatttg ctgatgtaaa tgatgacgga tcacggttcc 60 aatttgtgca ctcctgcaaa ggtaaacgaa caaattgata atgttcaaaa caaagaagat 120 gatgaagaaa tagaagttca atctgaagaa ttagacacaa atgtacgtcg ctctacccgt 180 gaacaaactc agacagacag gatgcttgca tatcaaaggg aggagtctca taaagcagaa 240 agaaagctta tgcatgcata tgaaaaatgg aaggctgagg ctcggaaagc aagaagtcag 300 ttgaaattag acatttctga gagtgaatta gcatcactca tagactcatt ggaaaaggaa 360 aaggacagtg tgatgaatgc atacataagg gttagaagta atgtaactcc acccactgat 420 atgagaagga aaattgatgc ttgtgatgct gttactaagg atatagtaaa aattgcatac 480 gagaggatct caggggtaga tggaggcttt gacaatgaaa ccgtaaaggg gcatttacgt 540 gagctgcttg aacgagacta cgctcgctct gtttatggtt ctactgtttc acgcatcagc 600 tccaaatcca gtacacctat aagtcaaccc tctctgaatt ctatactaat ggctaaacgc 660 atagaagcag cagcagagct agcagctaag gaagcagaat atgctactgt aatagaagaa 720 agggagcaaa gagaaaaatt acgacttcta gaagagaagc aaagaaaaga acttgaggct 780 caaaaaggcg agtttgaaag gctccaagcg atgaaggagg taagagcagc tagagcaaga 840 ctggaggtgt atgacaagga agaaactgtt gacactgtta atcaagatga agggctgcag 900 caacctgtaa gccctcccac gcataaacca gtatacttat cctccgttaa tccaacgcct 960 aacatttcat tacagagacc caacgctgat gtgtctcaat tagcgcaggc tgtccaagat 1020 agcataacac taaacagact tcctatgcca gagcccacag ttttcagtgg tgaccccatt 1080 cacttcatag agtggaaggc ttcatttcaa tcactcattg ataaaaggca tatctcttca 1140 ggagacaagt tatactacct gaagaagtac gttactgggc ctgctctgaa agtgctagat 1200 ggtatctttt acagaaatga tgaggaggct tacaaggacg cgtggaagag gcttctagat 1260 cgctacggac agccattcat catacaacga gcattcagag agaaacttgc atgctggccc 1320 aaaattcaat ccaaggactc ggtaggactt cgaaatttct ctgatttctt gaactcatgt 1380 aaggatgcaa tgccacatgt aaagggacta gaaatattga atgattgcga agagaatagg 1440 aaacttgtaa gcaaactccc tgattgggcg gctgcccgct ggaatcgtca aaccacacaa 1500 acgttgagtg aaacacaaga ttttccaact tttcaagaat tcgcccattt catgtctgtt 1560 gaagctgaag ttgcctgtaa tccagtcaca tcctttcatg cccttcatgt ttcagaacct 1620 aacaaagaga aaatttattt caaggtcagt aaacctaaag ctaatgtctt ccatacaaag 1680 actgtcacac agcatgataa ttcaaagccc actggaaagg tcaataagcc atgtctgttc 1740 tgtcaaaatg gtgaacatca aattcatgag tgttctaaat tctctgcaag gtctcttgaa 1800 gaacgtagac agttcgtaaa ggatacaaga ttgtgctatg gatgtttgag gctgggtcac 1860 agtgccaagg actgccgctc ccgacattgc tgcaacacct gtaagggaag acaccctacc 1920 tgcctacatg acgacagctt caatagaaag gtgaggtctt cttctgctca gagcccagaa 1980 aatgcccatg aaggagttgc gacaatgtca ttgagtgtag aatctggatg cacgcctgtt 2040 aacacctcta tgattgtgcc agtgtggttg tccacacata aagaaccagt ttctgagaag 2100 cttgtctatg ctctgttaga cacacaaagc gattcagtct ttattgaatg tgcagtatgc 2160 aaaagtctta aagttgactc ctgtcccgtg acgcttaaac tcactacttt ggttggaaag 2220 gactccttga tgtcaagtga aaggatttct ggtcttcgag ttagaggttt caattcctca 2280 ctgattatag atctaccccc tgcatacacc aaagaatgca ttcctgtgga tcgcgcacac 2340 atccctacaa tggagactgc aagtcattgg aaacatctag ccactctagc agacaaaatt 2400 cccccacttc aaaattgtga ggttggacta ttgataggat acaattgttc cagagcgcta 2460 gcgccccgag aagtaattct tggaacggag aatgaaccat atgctgttcg cacggacctc 2520 ggatggagca ttataggtcc ttccttaaca cattttgagc cccaaagcag tgctgccatg 2580 tgtcatagag tgtctatcaa ggaaatacct gcagtaactc ccacagatgt gataaaagtg 2640 ctggaatctg atttcaagga cacagaaggg cacacaaaag tgacgtctca ggaggacatc 2700 atgtttctga gaaagctgca agagaatatc agattgaaca aggatagtca tctggaaatg 2760 ccccagccat tcaggaaaag accatatctt ccagataaca aacctcttgc tgtcataaga 2820 cttcaacatt tgaagaggag attaatgagg gatcaggagt acagagagca ttatgtaaca 2880 ttcatggagg aagtaataga gaagggtaat gcagaacagg tgtttgagga aggacgagaa 2940 ggagaaagat ggtacatacc acatcacgga gtgtaccact cgaaaaagcc agggaaattg 3000 cgtatagttt ttgactgttc agccagatac aagggaacta gcttgaacga ccatcttcta 3060 actggcccag atctgaggaa cagcttaact ggcatccttt tgaggttcag acagtaccct 3120 gtagccctaa tgtgtgacgt ggagaaaatg ttccaccaat ttcatgtaga ccatgcagat 3180 cgtgattatt tacgatttct atggtggaga aatggagatt tcaattcaca gcctcaaacc 3240 ttccgcatga cagtgcattt gtttggagcc tcatcgtccc ctggatgcgc taactatggg 3300 ctgaagcatc ttgcaagaga aggtgaacgt ctgtatcctc tgggctcgca atttattatg 3360 caagatttct acatggatga tggagtttcc agcattgaaa gcacagagaa ggccatcaaa 3420 ttggctgaag aagctcgtca gctttgtgca ctgggaagct taaggcttca caagtttgtg 3480 tctaatgaca aggaagtctt gaagacaata ccaccctcag agtgtgcggt agatgttaca 3540 gctgtcgatc ttgctctcac tgatcagcct ttggaaagag ctttgggcat ttactggagt 3600 ctggaacaag acaatttcaa attccatatc actgttaagg accaaccagc aactcgtaga 3660 gggatactgt ctatagtggc ctcattgttt gatcccttag gctttcttgc cccctttgta 3720 ctcaaaggaa agaccattct gcaagaaatg tgccgaagtg gtatgggttg ggatgataac 3780 ttacccgctg atctacaatc agcatgggaa cactggaagg cagatctagt taacctagaa 3840 aagattgaag tgcctcgttg tattatgcct tctggctttg ggagaatcat aaggagagag 3900 attcaccact tctcagatgc cagcatgagt ggatatggtc agtgttcata tctcagactc 3960 gagaacgagc aaggtgacat cagttgttca ttgctcatgg caaaatctag agtggcccca 4020 ctcaagatca caacaattcc tcggctagaa ttggctgccg cagtggtgtc agttgcagtg 4080 aatgacatgt tgaaggagga aatgaacctg gcagatgcag aagcgttttt ctggactgac 4140 tcacaagtgg tgttaggcta cataaacaat gaagcccgcc gtttcaacac gtttgtggca 4200 aatagagtac aaaggattca ccgcaccaca actcctcaac agtggcggta catttgctca 4260 gatgaaaatc cagccgatta cgcatcgcgt ggtctaagtg ttaacaatct tgtcacttcc 4320 aactggttta gaggacctaa agttttatgg gaaaagcaaa taccaccacc tatggaaatc 4380 agcaagcagc ttccaattgg cgaccctgaa gtcaagaagg ttcagtcact caacacgcaa 4440 actgtacagt attcatgttt gtcagaccgt ctcaccaagt tgtcctcatg gtccaaagct 4500 atccaagctg ttgcacgttt aatacgtcgt gtcaggaaag acaagtcaca taatcacagt 4560 acattggcgg aacggaacga tgcacagtgt atcataatca aggacttaca gaagcagaca 4620 tatgcagagg agataacttt actctgtaag ggcaaacgac tacctcgcag caacagacta 4680 tacaatcttg acacctttgt cgaccaagat ggattgctga aggtgggagg gagactttgt 4740 gaggcatcta tccctaatgc tgtcaagtat ccagtgatac ttccgaagga gcaccaactt 4800 acaaaacttc tgattgctga ttgtcatgag aagatggctc atcaaggaaa gggaatgact 4860 ataaatgaaa tcagatcaag aggattctgg attacgggag ttaacaggac tgtagcttcc 4920 tttgtacgac aatgtgtgag atgtcgcaag ttacgtggac ctacggaaga gcaaaaaatg 4980 gctaacttac cctcagagcg catagagcca tcccctccat tcacatacag cgggatggat 5040 gttttcggtc cattcatcac ctgcaaaggt cgcaagtcaa acaagaggta tggacttctc 5100 tttacctgtt tctgttgcag agccatccat attgagatgc tggatgacat gtccacagat 5160 gcctttatca atggcctgcg ttgtttcatc gctatcagag gagcagtcca tcaaataagg 5220 tgtgatcaag gcagtaattt cattggagcc aagaatgagc tcaccaaagc tatggaggag 5280 attgacacca accgtctggt aacattctta gcggaaaaac agtgtgactt tgtttttaat 5340 gcacctcatt caagccacac tggtggagtt tgggaaagac agattagaac tgtcagaagt 5400 gttcttcgct ccaccctgtc acagtcatct ggaaggctcg atgactcttc tctgcgaacg 5460 ttcttttacg aggccatgtc cattgtaaac agtcgtccgc tcacggttga tagtctaact 5520 gacccaagta gccctgaacc tctaacccct aatcacctcc tcgctctaaa acctactcaa 5580 gccctaccac ctcctggcaa atttgtcagg gaagatgtgt atgcccgtaa gagatggcgg 5640 catgtccaat acttagcgga acaattctgg ggacgttggc ataaggagta tgtgtctaac 5700 atcacaacaa gacagtgctg gcatacacca agaaggaaca tgcaagtagg agatattgtc 5760 ttggagaagg cagtggatct gcccaggaat gagtggcgct tggcaaggat tattgaggca 5820 gtcactgaca aggacggatt ggtgagaaga gtgaaaatac agttcggaga cagaaatctg 5880 gggaaggatg gtaaacgtcc acataaacca tctgtggtgg aacgtccagt gcagaagtta 5940 gtcctgctga tggaggcagc ctgaacatac tgacatatcc tttgagtaaa cagttcatag 6000 taatatatgt tttagtgttg tttacatagt cttaaatggt tatagtagta gtaattataa 6060 ttcatttaaa atcattcgtg attaaaaggt taaagttcat aatcactcat gattggtggg 6120 ag 6122 // ID GYPSY3I_DR repbase; DNA; ZEB; 4321 BP. XX AC AL591172; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE GYPSY3I_DR is an internal portion of the GYPSY3_DR LTR DE retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYPSY3I_DR; KW GYPSY3LTR_DR; endogenous retrovirus; gag; pol; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4321 RA Kapitonov V.V. and Jurka J.; RT "The GYPSY3_DR LTR retrotransposon from zebrafish."; RL Repbase Reports 2(5), 16-16 (2002). XX DR Genbank; AL591172; Positions 61126 65446. XX CC GYPSY3I_DR is an internal portion of the GYPSY3_DR LTR CC retrotransposon. CC GYPSY3_DR was active recently, it is flanked by 99% identical CC GYPSY3LTR_DR. Gag, called GYPSY3_DR1p, and pol, called CC GYPSY3_DR2p, CC are encoded by ORF1 and ORF2, which are slightly damaged by two CC false CC frame shifts. A conceptual translation of these proteins is shown CC here. CC GYPSY3_DR1p (300-aa, positions 18-1005): CC MNPETPGSQQLLATVAQHESTIQRHENAFAQQQTLMVQHSQLLADLMASVRQLFDKVHVDSTPAPAAASI CC PLPESRVISPMMEPRLPPPQRFSGDPSACDGFLXQCSLTFELQPSSFPSDRARIAYVITLLSGKALSWAT CC AVWKAKAPFCSNYKAFEQEFKRVFDHPISGRQASKRLLMLRQDTGSVAEYAIKFRTMAAGSGWNNEALMV CC CFQNGLSETIQDELATREPSLDLENLMEQAIRLDNRLRERYFNHPKPSFGVPTPAPTPTSQVSQFSAEPM CC QLGRTRLTPTERDRRMREQLCLYCGLSAHFRSTCPQLSGNAQSRTGQEGL CC GYPSY3_DR2p (1084-aa, positions 1047-4299): CC LFLPITLSWADQQSQFQAFVDSGAAGNFLDLTLAKTLNIPSEPLPDPLTVTALDGRPLAPGEVTHLTSQL CC CLSIYQHQERICFHLIHSPDFPVILGHPWLLQHNPYIDWPTSTVLGWSPTCQTTCLSQKSLQPSFESQEP CC LDLSRVPAQYHHLKAVFSKRRATVLPPHRPYDCAIDLLPGSCPPRGGIFSLSLPERTAMDLYIEESLAAG CC IIRPSTSPAGAGFFFVGKKDGGLRPCIDYRGLNKITIRNRYPLPLMSTAFEMLQGASIFTKLDLRNAYHL CC VRIRQGDEWKTAFNTPTGHYEYLVMPFVLTNAPAVFQALINDVLRDMLNKFVFVYLDDILIFSNSFQEHV CC QHVHKVLRHLLDNHLYIKPEKCQFHVSQVKFLVFVIEPEQIQMDPQKIQAVVDWPSPSSVKEVQRFLGFA CC NFYRKFILNFSTVAAPLFALTKGNMIRFRWGPEAEEAFKILKKQFTSAPVLLIPNADEPFTVEVDASEVG CC VGAVLSQRGEDKRLHPCAFLSHRLTPTERNNHVGDRELLAVKLALEEWRHWLEGAKHPFQVLTDHKNLEY CC IQQAKRLNPRQAHWSLFFNRFHFTLTYRPGSKNLKPDALSRVYVKSPQEDFIRPIVPSSKIVAPIRWELE CC GLIRKAQTKDPEPGGCPPNCLYVPRVVRSKVLMWGHSSQLTCHPGSARTLEFLQRRFWWPTIKEDVMAFV CC KACPTCNQNKSSHQPPQGFLHPLSIPHRPWSHISMDFITGLPPSQGNTTILVIVDRFSKAARFIPLSKLP CC TAKETAELVMNHVFRVFGIPQDIVSDRGPQFSSRFWQAFCQSLGTTTSLSSGFHPESNGQTERLNQDLET CC TPRCMTANNPTTLSQFIMWAEYAHNSLCSSATGMSPFECQFGYSPPLFPEQEVEVAVPSALQFVRRCRRT CC WKKARLKLLKVSQQYQHQANRRRRPAPNLRPGQRVWRNIPLRVDSRKLSQKFIGPFKIARKVNPVTYKLY CC FPKSLKINPTFHVSLLKPVLSSPFFVTGKPSPPRFVGGQAVYTVRRILDTRKVRKTRQYLVDWEGYGPEE CC RSWVLAKDILDPRLIRDFHALKMKSLGKNVRSRS. XX SQ Sequence 4321 BP; 1080 A; 1227 C; 909 G; 1105 T; 0 other; cgaataaccg agccaacatg aacccagaga ctccaggttc ccagcagctt ctagccactg 60 tggctcagca tgagtccaca attcaacgcc atgagaatgc ctttgcccaa caacaaactt 120 taatggttca acactctcag ttacttgcag atcttatggc ttctgttcgt caactctttg 180 acaaggtcca tgttgattca accccagctc ctgctgctgc ttcgatacct ctgccagagt 240 cccgagtaat ctcccctatg atggaacctc gcctaccacc accacaacgt ttctcaggtg 300 accccagtgc ttgtgatggc tttctaacag tgttctctaa cttttgaact gcagccttca 360 tctttcccct cagaccgtgc ccgtattgca tatgtcatca ccctgttatc tggtaaggcc 420 ctctcctggg ccactgcggt ctggaaggcg aaggcaccct tctgttcaaa ttacaaggcg 480 tttgaacaag aattcaaacg ggtgtttgat caccccatca gtggtcggca agcctcaaaa 540 agactcctaa tgctacgaca agatactggc agtgtggcag aatatgccat aaaatttaga 600 acaatggcag ctggaagtgg ttggaataat gaagccctca tggtctgttt tcaaaatggg 660 ttatctgaga caatccagga tgaactggct acccgggaac catctctgga tctcgagaac 720 ctcatggagc aagccatccg tttggacaat cgactgaggg agagatattt taaccacccc 780 aagccctcct tcggggtgcc cactccagca cctaccccca catcacaggt atctcaattc 840 agtgcagaac ctatgcaatt aggcagaact cggcttaccc ccacggagag agaccgccgc 900 atgagggagc agctttgtct atactgcggt ttatctgccc attttcgttc cacctgccct 960 cagctttcgg gaaacgccca gtcccgtaca ggccaggaag gactgtaacg ggagttacat 1020 gcaataactc tccttcctca agtggtcttt ttctccccat tacgctctcc tgggcagacc 1080 aacagtccca atttcaagcc tttgtagact ctggtgctgc tggaaatttt ttagatctca 1140 ccttagcaaa aaccctcaat atccccagtg aacctcttcc tgacccatta acagttactg 1200 cactagatgg tcgaccactt gctccgggag aggtgacaca ccttacgtcc caactctgtc 1260 tctccatcta tcaacatcaa gagaggatct gttttcacct catccattca ccagattttc 1320 cagttatcct tggtcacccc tggctccttc agcataaccc atatattgat tggcccacta 1380 gcacagtttt gggttggagt cccacatgcc aaaccacctg tctgagtcag aaatcccttc 1440 aaccttcttt cgagtctcaa gaacccctag atctgtcccg agtcccagcc cagtaccacc 1500 atcttaaagc agttttcagc aagaggagag ctactgtcct acccccgcac cgtccttacg 1560 actgtgccat tgatcttctg ccaggttcct gtccccctag aggtggaatc ttttccttgt 1620 cccttcctga gcggactgcc atggatctct acattgagga gtccctcgca gctggaatca 1680 ttcgaccatc cacctcccct gcgggcgctg ggtttttctt tgtggggaag aaggatggtg 1740 ggttacgccc ctgcatagat tacagaggtc tcaataagat cacgatccgc aatcgttacc 1800 ctttaccact gatgtccaca gcttttgaaa tgctgcaagg ggcttccatc ttcaccaaac 1860 ttgatttacg caatgcttac cacctagtgc gtatcagaca aggcgatgaa tggaagaccg 1920 cctttaacac gcccacaggg cactatgaat acctggttat gccctttgta ctgaccaacg 1980 ctcctgcagt ttttcaagcc ctgattaatg atgtgctccg tgacatgcta aataagtttg 2040 tgtttgtcta cttggatgac attcttattt tctcaaactc cttccaagag catgtgcagc 2100 atgtacacaa ggtccttcgt catctgctag ataaccatct atacatcaaa ccagagaagt 2160 gtcagtttca tgtctcccag gtgaaattcc tggtttttgt tattgaacca gaacaaatcc 2220 agatggaccc tcaaaagatc caggcagtgg tggattggcc ctcgccttcc tcggtcaagg 2280 aggtacagcg gtttttgggg tttgcaaatt tttatcgaaa attcattttg aacttcagca 2340 ctgtggcagc ccctttattt gccctcacca aggggaacat gatcaggttt cggtggggac 2400 ctgaagctga agaggcattc aagattctta agaagcagtt cacttcagca cctgtccttc 2460 ttatccccaa tgctgatgaa cccttcactg tagaggttga tgcctctgag gtgggcgttg 2520 gagcagtgct atcgcaaagg ggtgaagata aaaggcttca tccttgtgct tttctctctc 2580 accgtctcac ccctactgag agaaataacc atgtgggaga tagggagctg ttagccgtga 2640 aactggctct cgaggaatgg agacactggc tcgagggggc aaaacaccca ttccaagttc 2700 ttactgacca caagaattta gaatatatcc aacaggccaa acgacttaat ccccgccaag 2760 cacattggtc tctattcttt aaccgattcc acttcacact cacctatcgt ccaggatcta 2820 agaatctgaa acctgatgca ctgtcccgag tttatgtgaa gtccccacaa gaggacttta 2880 tcagaccgat tgtccccagc tctaagatag tagcccccat caggtgggag ctggagggtt 2940 tgataagaaa ggcccaaact aaggacccag agccaggcgg ttgtcctccc aactgtttgt 3000 atgtccctag agttgttcga tcgaaagtcc tcatgtgggg acactcgtca cagcttactt 3060 gccatccagg ctctgcccgt acccttgaat tcttgcaaag acgattttgg tggccaacca 3120 tcaaggaaga tgtgatggca tttgttaagg cctgccccac ttgcaaccaa aacaaaagtt 3180 cccatcagcc acctcaagga tttctccacc cactctccat acctcacaga ccatggtctc 3240 atatatctat ggattttatc actggacttc ccccatccca aggaaacacc actattttag 3300 tcatagtgga ccgattctca aaggccgccc gcttcattcc cctgtctaaa cttcccacgg 3360 ccaaagagac ggcagaacta gtcatgaacc atgtgttcag agtctttggc attccccagg 3420 acatcgtctc agatcgggga cctcagtttt cctcccgatt ctggcaagca ttctgtcaat 3480 ccctagggac cactacaagc ttatcctccg gtttccaccc agaatcaaat ggacaaacag 3540 aaagactcaa ccaagacctc gaaaccaccc ctagatgcat gacagcaaac aatcccacca 3600 ctttgtctca gttcattatg tgggctgaat atgcacataa ctccctgtgt tcatcagcta 3660 caggcatgtc cccttttgag tgccagtttg gctattcccc cccactattt ccagaacagg 3720 aggtggaagt agcagtcccc tcggccctcc agtttgttag acgctgtcgc cggacttgga 3780 agaaggcacg tctcaaactt ctcaaggtat cccaacagta ccaacaccag gctaaccgca 3840 ggcgcagacc ggctcccaat ctgcgtccgg gccaaagagt ctggcaggaa cattccccta 3900 cgggttgact ccaggaaact atcccagaag tttattggtc cattcaagat cgccaggaaa 3960 gtaaacccag tcacctacaa attatacttc ccaaaatctt taaagataaa ccctaccttc 4020 cacgtgtctt tactaaaacc agttttgtct tctcctttct ttgtgacagg taaaccttcc 4080 cctcctcgtt ttgttggagg ccaagcagtc tatacagtcc gccggatact ggacacccgc 4140 aaagtacgca aaactcgaca gtatctcgtg gactgggaag gctatggccc tgaggagcgt 4200 tcctgggtcc ttgccaagga cattttagat ccgagactca ttcgagactt tcacgctctt 4260 aagatgaaat cactcggcaa gaacgtcagg agccgttcct agaggggggg ggggggggtc 4320 c 4321 // ID Gypsy-22-LTR_DR repbase; DNA; ZEB; 1143 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-22_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-22-I_DR; Gypsy-22-LTR_DR; Gypsy-22_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1143 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-22_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 18-18 (2005). XX DR [1] (Consensus) XX CC Gypsy-22-LTR_DR is a long terminal repeat of the Gypsy-22_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-22-I_DR. XX SQ Sequence 1143 BP; 270 A; 205 C; 234 G; 434 T; 0 other; tgtgacaata cagaaacttt ctgtactgcc cctattttgt gtttacttcc tttgtcactg 60 ccttttataa tgagttcaac cccattggat gctacaacca attacaatga tgggaaatta 120 tataaaaggg tttttttctt tcttctgaga gagagagttc tgacctccag tgtgggagtg 180 gtgtttgcgt caccaagttg cgttgcgggt gtggctctgc ctgggtctga ggaagctgag 240 gaagagatgc ggcgatgctc aagttttcac tgacttgtcc tttatagcca ccgtaacacc 300 aagtttcctg gtaattgctg acgagtgcca gtcttacgtt accttcgtgt tgaaaaatta 360 ccggtgcgat gtgttaggac acctccgaaa acggctgcct gtttatcatt tctgtccgtt 420 ctgccaagtc acctgccagc cagtgaactg ctgacgtgta acctcctgac gtcaccactg 480 caagcctctt gggtcagttt cactcataca cttacataca tacattctcc acccaacctc 540 cctttatttt catttaatta gttatttgag aaatttgaaa agttttgttt atttatttct 600 tggtgtattt gttgttgttt ttagagagga atcttgtttt gggttactta aaatgattgt 660 aggatattga ctatataaat atttaatttt ggtattgttg ttacttgtag ttgtatttgc 720 catttaaatt aattcattaa atctttctaa gtggaattcg attttcttgt tcttatttcg 780 tatggttcta tgttactgta tgtaacggag ggtcttgtat aagttttgtt ttcgctatcc 840 tgggtaatta attatagtta gatttatttg tggcaaggtt taaaacttgt ctggcacccg 900 aaatcatgca taaaatttgt taatatatgt atgttatatt ataatatatt ttattttttt 960 ttttgcccga gggtgactgc gttagaggga gatattacca gacgcaccgc ctcaacagcg 1020 ccttccgtgg ttcagtggta gaaatctcgc ctgccatgcg ggagacccgg gttcgaattt 1080 aatctaagag ttattaatta aatttttttt tttttttttt gtgtgtgtgt gaccaccgtt 1140 aca 1143 // ID Gypsy-25-I_DR repbase; DNA; ZEB; 4574 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-25_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-25-I_DR; Gypsy-25-LTR_DR; Gypsy-25_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4574 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-25_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 23-23 (2005). XX DR [1] (Consensus) XX CC Gypsy-25-I_DR is an internal portion of the Gypsy-25_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-25-LTR_DR. Gypsy-25_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1503-aa Gypsy-25_DR1p (pos. 64-4572) composed CC of the gag, protease, reverse transcriptase, and integrase CC domains. The consensus sequence was built from 3 copies less CC than 0.5% diverged from the consensus sequence. XX FH Key Location/Qualifiers FT CDS 64..4572 FT /product="Gypsy-25_DR1p" FT /translation="MDTAYTTVDEVEITEPQINPVIHTVDLSEGNETDYTG FT EQFTTRRNPTEQLTSLLSSLYISDHDQTADVVSDEVGNQGIHDEVFQLKED FT LLALSDKVQNMEHDFNQAFESTVNRETNFRQALDDRLNSMQENFEDSLKKL FT EKAVIDCFLRRDEKWESKIKQIYSTSTPFPSRRQTRQTNVTHSDLFSAAQS FT PITQPSFNATYHVSAPSSSGHQTQSDANVLPGSSSFFARPPVRLEFPSFGE FT TCETADVFNFIEKCEDYLEVRPLSNIELLGTLSTVLQGPALSWWKAEKGKV FT KDWESFKRALMDAFLPADYMSEVEEKLRNMVQQPNQRLRDFAYNYRALCLK FT WKPDASEEELVNRILNNINPRVAGCLRGTVNTVSQLVKVGALVEKDCMGTK FT EYWKKVESSNERDKNKKSSEAGSRKHLAGVTLTQHHPTSFLLGVTIKVNEQ FT EITAILDTGSTFTLIQESVWKRLRCKRVEASKTLSSQKFIMADGTIHHSKD FT HQLMSVEWHGGKFEIGAFIMKDSHLAFPVIIGLDFLKLARVVVDLEQGRYG FT MRRGKDCEYFPFMSVLPPCSQEMPVSPSCSVRHAALHLYCAVPPDYQPKVT FT YPSDTSNICFPEEVRDLITAWPITTSEKLGHTSIYQHKITLTDHTPVTSRA FT YRVSPLKKRIIEEEVDRMLENNIIEPSFSPWSSPVVLVPKEDGTYRFCVDY FT RRLNKKTIFDAYPMPLIQDILESLEGATWFTSLDLRSGYWQVEMAEESKEL FT TAFITTKGLFHFKSMPYGLKNSAATFQRLMERVLGELRGKICFVYIDDIII FT YSKTPEEHKKHLHQVFERLTQANLTLNMKKCQFFTRQLKFLGHLITERGIE FT MDKEKIQAVVDFPTPSDLKSLQRFLGLAGWYHRYIPHFADITAPLNQLKKK FT GVRWDWTEECQSSVNALKQSLQHAPILAQPDTTKPFQIHTDASDVGLGAVL FT TQTCDNQEKVIAYASRTLSPAEKNYSTSEKECLAVVWAVEKWRHYLEGMPF FT DVFTDHSALTWAFNCPKTSSRLTRWTLRLQQFDFRVHHRKGCVNLVPDALS FT RAGDSSVPNPCLSITSTTASELPISLDEIQKAQQDDKALLQLTSTPSRCVA FT KDQAITFEKIQGIWYRKVPLKGEGNKYQLVVPEEFTQNFLHYFHDNPLAGH FT LGQLKTLLKILEVAWWPSVRKEVWSYVKSCKLCQQYKPSNSKPSGLLQSNL FT ITEPGHTLGTDLMGPFPMSKKRNAYILVIVDYFTKWTELFPLRDSKTQKIA FT KILKEEIFTRWGVPKYLVSDRGPQFTSSILSDVCKSWGCIQKLTTAYHPQS FT NLTERVNRTLKTMIASYVGQQHQNWDQWLPELRYAINTAQQETTGCTPAEL FT MVGRQIHGPLERLIHLPPSPEQASYTVLDRQHYLQQEVVQRMRMNQAKQAK FT YYNLKRKDVQFKVGDLVWLKTHPLSSAVSKFTSKLAPKWEGPGVITKRKGP FT INYSVSWGDPPRTDCFNVVNLKRFYGRSSSEMPAGGGI" XX SQ Sequence 4574 BP; 1425 A; 939 C; 982 G; 1228 T; 0 other; tagtggtggc cagtacgggg aaattcaata tttagcacta gttctctgtt tcattttttt 60 atcatggata ctgcatatac aactgtggat gaagttgaaa ttacagaacc acaaattaac 120 cctgtaatac acactgttga tttaagtgaa ggaaatgaaa ctgattatac aggtgaacaa 180 tttacaacta gacgtaatcc aactgaacaa ttaacatcac tgcttagttc attatatata 240 tcggatcatg atcaaactgc tgatgttgtt tctgatgaag tgggtaatca aggaattcat 300 gatgaagttt ttcagttaaa ggaggattta ttggcccttt ctgataaagt tcaaaatatg 360 gaacatgact ttaaccaagc ttttgagagt acagtgaatc gtgaaacaaa ctttagacaa 420 gctttagatg atcgtttgaa cagtatgcaa gaaaactttg aggattcatt gaagaaatta 480 gaaaaagctg tcattgattg ttttctaagg agagatgaaa aatgggaaag taaaattaaa 540 caaatttact ccactagcac cccttttcct tctagaaggc aaactagaca gacaaatgtt 600 acccattctg atcttttttc tgctgcacag tcacctataa cccaaccatc ttttaatgct 660 acttaccatg tttctgctcc tagttcctca ggtcaccaga cacagtctga cgctaacgtt 720 ttacctggtt catcctcctt tttcgctcga cctccggtac gtctggagtt cccctctttt 780 ggtgaaacct gtgaaacagc tgatgtattc aacttcatag agaaatgtga agactatcta 840 gaggtgagac cattgtccaa tatagagcta ttaggtacac ttagcactgt tctgcaaggt 900 ccagctttaa gttggtggaa ggctgaaaag ggaaaggtga aagactggga atcctttaaa 960 agagcattaa tggatgcatt cctaccagct gactatatgt ctgaagtaga agaaaaactt 1020 agaaatatgg tacagcagcc caatcaacgg ctcagggatt ttgcatacaa ttatcgtgct 1080 ttatgcctga aatggaaacc agatgcttca gaagaagagc tggtgaatag gattctgaat 1140 aacatcaacc ccagagttgc aggatgcttg agggggacag ttaatactgt tagtcaactg 1200 gtgaaagtgg gagcactggt ggaaaaggac tgcatgggga caaaggagta ctggaagaaa 1260 gtggaaagta gtaatgaaag agacaagaac aagaaatcat ctgaagctgg ttcacgtaaa 1320 catctggctg gtgtgacttt aacccaacat catccaacat ccttcttgtt gggagtaacc 1380 atcaaagtga atgaacagga gataacagcc atcctggata caggtagtac ctttacttta 1440 attcaagaga gtgtgtggaa gagactaaga tgtaagagag ttgaagcatc aaagactcta 1500 tcctctcaaa aattcatcat ggctgatgga accattcacc actctaagga tcatcagttg 1560 atgagtgttg agtggcatgg aggaaagttc gagattggag ccttcattat gaaagattct 1620 catttggctt tccctgttat cattggactt gactttctga agctcgccag agtggtggtg 1680 gatttagaac aagggagata tggtatgaga agaggaaaag actgtgagta cttccccttc 1740 atgtctgttt tacctccttg ttcccaggaa atgcctgtaa gtccatcctg ctcagtgaga 1800 catgctgcac tccacctgta ctgtgctgtt ccacctgact atcaaccaaa agttacttat 1860 ccatctgaca catcaaacat ctgttttcca gaagaagtaa gagacttgat cactgcatgg 1920 ccaattacca cctcagaaaa acttgggcat acatcgattt atcaacacaa aatcaccctc 1980 acagaccaca ctccagtaac atcaagagct tatagagtat ctccattgaa gaaacgcatt 2040 attgaagaag aggtagatcg aatgctggag aacaacatca ttgagccttc tttctcacct 2100 tggtcttctc cggtggtcct ggtccctaag gaagatggaa cttatcgttt ctgtgtagac 2160 tacagacgtc ttaacaaaaa gaccatcttc gatgcttatc ccatgcctct aattcaggac 2220 atcttggagt cattggaggg agctacttgg tttacatcct tggatttaag atctggatat 2280 tggcaagtcg agatggcaga ggaaagcaag gagttgactg cttttatcac caccaaaggc 2340 ctattccatt ttaaatcgat gccatatgga ctgaaaaact ctgctgcaac attccaacga 2400 ctaatggaga gagtgttggg agagttgagg ggtaagatat gctttgtcta catcgatgat 2460 atcatcattt attcaaaaac cccagaagaa cataagaaac acttacacca agtattcgag 2520 agacttactc aggccaacct aaccctgaac atgaagaaat gtcagttttt caccagacaa 2580 ctgaaattcc ttggacacct gatcactgaa agagggatag aaatggacaa agaaaagatt 2640 caagcagtgg tggattttcc tacaccatct gacctgaagt ccctccagcg gtttttgggg 2700 ttagcaggtt ggtatcaccg ctatattcct cattttgctg atatcactgc ccccctaaat 2760 cagttaaaga aaaaaggagt gagatgggat tggactgagg aatgtcaatc aagtgtgaat 2820 gctctgaagc aatccctaca acatgcacct atacttgcac aaccagatac cactaagcct 2880 ttccagatac acacagatgc tagtgatgtg ggactaggag ctgtacttac tcaaacttgt 2940 gacaaccaag agaaagtaat cgcctatgct tcaaggactc tgtctccagc agagaagaac 3000 tatagtacat cagagaaaga gtgtttggcg gtggtctggg cagtagaaaa gtggagacat 3060 tacttggaag gaatgccttt tgatgttttc acagatcatt ctgcattgac ttgggcattt 3120 aattgcccaa agacctcttc tcgtctaaca cgctggaccc tgaggctgca gcaatttgac 3180 tttagagtgc atcaccgtaa agggtgcgta aacttagtac ctgacgccct gtcaagagct 3240 ggtgattcat cagtacctaa cccttgtctt tccatcactt caacaactgc atcagagtta 3300 cccatcagtc tggatgaaat tcaaaaagct caacaagatg ataaagctct cctccagtta 3360 acatccactc catcacgctg tgttgccaaa gatcaagcca ttacctttga gaaaatccaa 3420 ggtatttggt acagaaaagt tcccctaaaa ggtgaaggga ataagtatca gcttgtggta 3480 ccagaagaat tcactcagaa ctttcttcac tacttccatg ataatccact agctggtcac 3540 ttgggtcaac tcaaaactct tctgaagatc ttggaagtgg catggtggcc atctgtaaga 3600 aaagaggtct ggagttatgt taagagttgt aagctctgtc agcagtacaa gcctagtaat 3660 agtaaaccct ctggactttt gcagagtaat ttaattactg aaccaggaca cactctaggg 3720 actgacctga tgggaccatt ccctatgagc aagaagcgga atgcttatat tcttgtgatt 3780 gtggactact tcacaaagtg gactgaatta tttcccttga gagacagtaa gacccaaaag 3840 attgcaaaga ttctcaaaga ggaaatattt acccgatggg gtgtcccaaa gtatttggtc 3900 tcagacagag gtccacaatt tacttcttcc atcttgtccg atgtgtgtaa atcctgggga 3960 tgcatacaaa agctaactac tgcttaccat ccacaaagca atcttactga gagggtaaat 4020 aggactctta aaaccatgat cgcttcctat gtaggccaac aacaccagaa ctgggaccag 4080 tggttgcccg agttaaggta tgccatcaac accgcacagc aggagacaac tggttgtacc 4140 ccagcggagc ttatggtagg acgtcagatc catggacctc tggaaagact catacatttg 4200 cctccatcac ctgaacaagc atcatacact gtactggatc gacaacacta ccttcaacag 4260 gaggtcgtgc aaagaatgag gatgaaccag gctaaacaag ctaagtacta caaccttaaa 4320 aggaaagatg tacaatttaa ggtgggtgac ttagtttggc ttaaaactca ccctttatcc 4380 agtgcagtca gtaaattcac ctccaaatta gctccgaagt gggaaggacc tggggtgatt 4440 actaagagaa aagggccaat aaattactct gtatcatggg gtgatccacc aagaactgat 4500 tgttttaatg tggttaattt aaaacgcttt tatggtcgtt cttcctccga gatgccggct 4560 ggggggggga tcta 4574 // ID Gypsy63-I_DR repbase; DNA; ZEB; 6337 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy63-I_DR; KW Gypsy63-LTR_DR; Gypsy63_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6337 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1441-1441 (2008). XX DR Genome; chr14; Positions 13344949 13338613. XX CC Positions [4571-5047] - Integrase core CC 'TGTG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1796..6133 FT /product="Gypsy63-I_DR_2p" FT /translation="MPLIPIENLEIWGLSMQKYPYDGCLSLRLEFPGSVAG FT VAESIDALVLVCPDPVMKGDVNILVGTNTSVVKKLVEACKEKVGDDFLHTL FT TVHPEIRDAYERAIAVEKSHCDDSRGTVWYLNPKPLKLPPGESARVYGVLK FT CPSAFSGKTLLVDKPEESVFPDELLVTAEVQANTVVQSRKIGVTIRNVSSR FT TVNLKRGMPIAHVFPVEVVSTSSKPQECSGQTKLDASSFNFGDSSVPNEWK FT QRLVKKMLERTDVFSCDEFEVGCSKSTQHRIRLTENKPFRERSRRLAPADL FT EDVRLHLEKLKGAGIITESRSPYASPIVVVRKKNGQIRMCVDYRTLNRRTI FT PDQYTVPRIEDALTCLNGSKWFSVLDLRSGYYQIPMSEADQEKTAFICPVG FT FYQFQRMPQGICGAPSTFQRVMEKTVGDMNLLEVLVYLDDLIVFGRTLEEH FT EERLLKVLDRLQSEGLKLSLDKCTFCQTSVSYVGHIVSQDGVSTDPSKIDA FT VKSWPRPQNVSELRSFLGFCGYYRRFVKDYSKVSYPLNQLLQGCLPAASLK FT KIKAGLRPEQLAYKATDPFGSRWTDACETAFKELKVRLTEAPVLVFANPQL FT PYVLHVDACCEGLGGVLYQDQGEGLKPVAFVSRSLTPSEKNYPVHKLEFLA FT LKWAVVDKLHDYLYGVKFEVQTDNNPLTYVLTTAKLDAVGHRWLAALSAYD FT FSLKYRPGRQNIDADALSRRSHQKEKEWTSIPSSGVRSLCKLPGQDKPNCF FT ASVSRIVTSLGGSPKSIPKTYCNLATLDTLKIPKLSSVDLSCAQQEDPCMG FT EIWRALKRGDLSTVNKEAHKDIRLIMRDWDKLVLDNGVMFRCTQVPERPCH FT KQLCLPKKFHKVVLKSLHDDSGHMGFDKTYGFVRNRFYWPRMKVDVEKYCQ FT TCARCVQRKTLPRKAAPLSHLSSNGPMDLVCMDFLSIEPDSKNICNVLVIT FT DHYTRYAQAFPTKDQKATTVVKVLWEKYFIHYGLPKRMHSDQGRDFESKLI FT HELFDLLGVEKSRTTPYHPQGDPQPERFNRTLLDMLGTLDSRQKQQWSRHI FT GHLVHVYNCSMNDATGYSPYFLMFGREARLPIDLSFGVSSDGTSVRSYQRF FT VKKMQNELKSAYQLAEENAEKKNAGNKRRYDQRLHYSHLVPGDKVLIRNLG FT LQGKHKLADRWNSELYVVQSQMPNLPVFRLKPQSGNGPVKILHRNHILPIR FT QQLRQRVSDKIPKAQERLLRQRKNVVESQSTEQGTNDASVVQEIEVRADQT FT VEESCESEDEEYGVWVPIPKDSQLPFSGNESLQSDSVVSENEVRDTVKPVD FT VKDKTPQGVRTDDSVETDSRLLVENDSENLQNLSQVSDLCDTVTSIVQDLE FT SVDQSNENKTISDSLPEIKGRSQRMRTVPKRLTYDTLGMPTVEPVVVQAKV FT VNPVKYNSSVYLWLHVPSRKISTCAVTLKKRQ" XX SQ Sequence 6337 BP; 1905 A; 1161 C; 1535 G; 1736 T; 0 other; tttgtggggg ctcgtccggg atcaatcaga ttactcagat tgttaattca gaaagaaatt 60 tagaagtgat ttatttgcgt gtatacttat tctggggttt attttgaata tatatttttc 120 tctttcattc agtttttttt tacaaacaca ttttacattc tttttttttt gtaagttaaa 180 aaaaatggag gttagtgaga ttattgcttg gtgcagagac aaagatgtgg gtcttgacaa 240 agttattatt cttagcaatg tgcctacaga ctgtgaggac agagtgatct ataatgtact 300 agatactgct ctgggagttg gtaaatgcaa agtgcttggt tgtcgcttag ataaaaataa 360 ggaatttcaa tttgttttga ttgaatttga tgatgacata tccaagacat ctatccctgc 420 agagattgga ggacctgaag taggtctctg gtacacacag gttgtgtgtg ttgcagctga 480 agctcctaaa aagagcaagg aagatgaatt tcagactaag ttgaatttgt tccttcaaaa 540 cgaaggaaaa tcattagctg acatttcaag tttggcatct cctgctcctg acctcaacac 600 taagctggtc gaggccataa actcccttgt tcaaacatgt catgtggctt catctgagga 660 gagagggata tatagaaagc tgcgcccttt ttcagggttt ataccaactc ctcttgggga 720 agatgagttt gagatttggg tggagcagac tacacacatt ttggaagagt ggcagtgctc 780 tgataatgtt aagaagcaga aactagtaga gtgcctcaga ggtcctgcag cagatattgt 840 aagatttgaa aagactggca actcagccgc tacttatagt gactatctta gtgtactaga 900 gtcagcattt ggcagtactg aggatgctgc agatttaatg cttaaatttc gaagcacata 960 ccagaatgaa ggagagaagc tgtcagccta catattgagg ttggacaggc tgctccatag 1020 catgctgcgg aaaaaaagga attgagtatt ctgctcttaa tcatcttaga atgcagcaaa 1080 tagtcagagg tgccctagca ggtgacatgg tggctatgcg tttgaggatg acccacaagc 1140 tttgtgagcc accttctttc aacgagttaa tgaaggaggt tagagaagaa gaaaatatgg 1200 tcagcagtag gcgcacctta ccgtctaatg ttgccatgtc tgtagtgccc actaccaaga 1260 aggattcgtc tgctaacact ggtactgtcg actcagatgt agaaaagtta aagaaggaga 1320 ttagggggct gaaaaatgag gtgtctcggt tgtcagctgc agccaaagaa cctgtagccc 1380 atgatcgtcc agccatgcac agtctggctg ctgttgctga ctgtaaaact ccaacacgta 1440 cagtaaccaa agccaacata ttttgctaca gatgcgggga agatgggcat cttaagcggg 1500 actgtaaaaa cgatgaaaat ctcaggaaag taaaccagcg actcatcaga atgagacaat 1560 cttcgggaaa cttctccggg gctcagtaaa ggaacggcct gaagctccgg gaaagacacg 1620 ttccacaaag tgcaaaggag tccaagctga gccagcagtt cttccgagcg ggttagtagg 1680 acccagttct gtccttcctg ttcaagtgga aggaatctat gcaaaggctt tgttagatag 1740 tggatcgcaa gttaccctgc tatacagatc cttctatgac aaatacttga gacatatgcc 1800 attgatacca atagaaaatt tggagatatg gggattgagt atgcaaaagt acccatatga 1860 tggctgtttg tctttgaggc tggagtttcc aggatctgtt gcaggagtgg cggaatccat 1920 agatgcactt gtgctggtct gtcccgatcc tgtcatgaaa ggggatgtaa acattctggt 1980 gggaacaaat acatctgtag tgaaaaagct ggtagaagca tgcaaagaaa aggtgggaga 2040 tgattttcta cataccttga ctgttcatcc tgagatcaga gatgcttatg aacgggccat 2100 agcagttgaa aagagtcact gtgacgacag cagaggaact gtctggtatc ttaaccctaa 2160 gccattaaag ttgcctcccg gagagagtgc tagggtttat ggagtgttga aatgcccttc 2220 tgctttttct ggaaagactc tgttagtcga taagccagag gaatctgtgt ttccagatga 2280 actgcttgta accgccgaag tacaggccaa tactgttgtt cagtcaagaa agattggagt 2340 tacgatcaga aatgtctcct ctcgcactgt aaatcttaaa agaggaatgc ccattgctca 2400 tgtgtttcca gtggaggttg tgtctaccag ctcaaaacca caagagtgta gtggacagac 2460 taaactagat gctagctcat ttaactttgg tgattcctct gtgcccaacg aatggaaaca 2520 acgactggtt aaaaagatgt tggaaaggac agatgtgttc tcctgtgatg aatttgaagt 2580 tggatgttca aagagtaccc agcatcggat tcggctgaca gagaacaaac cttttcgtga 2640 gcgttcaaga cgtcttgctc cagctgatct ggaagatgtc aggctacatc tagagaagtt 2700 aaaaggagca gggattatca cagaatccag gagtccatat gcttctccga ttgttgtagt 2760 acggaaaaag aatggccaga tcagaatgtg cgtggattat cgaacgctaa accgacgtac 2820 tattccagat caatacactg ttccacggat agaggatgca ctaacttgtc tgaatgggag 2880 caagtggttt agtgtattgg atctcaggag tggatattac cagattccta tgagtgaagc 2940 agatcaagaa aagacagcat tcatctgccc agtggggttc tatcagttcc agagaatgcc 3000 acaggggatt tgtggagctc catcgacttt tcaaagagtt atggaaaaga ccgttgggga 3060 catgaacttg ttggaagtat tagtgtatct ggacgacctc atcgtatttg gcaggacatt 3120 ggaggagcat gaagagaggt tgctaaaagt cttggacaga ttacagagtg aaggattgaa 3180 gttgtctttg gataaatgca ccttctgtca gacctctgtc agctacgttg gacacattgt 3240 ctcccaagac ggtgtctcaa cagacccctc caagattgat gctgtaaaat cttggcctcg 3300 acctcagaac gtgtcagaac tgcgttcatt tttaggtttc tgtggatact atagaagatt 3360 tgtgaaagac tattcaaaag tatcctatcc actgaatcag ctcttacaag gttgtctgcc 3420 tgcagcaagt ttgaaaaaga tcaaagcagg acttcgccca gaacaattag cgtacaaagc 3480 tactgaccct tttgggagta gatggactga tgcatgcgaa actgccttta aagagttaaa 3540 ggtgagatta actgaagctc cagtcttggt ctttgcaaat cctcagctcc cctatgtatt 3600 acatgtcgac gcctgttgtg aaggacttgg tggcgtgttg taccaggatc aaggagaggg 3660 gttaaaacca gtggcatttg tcagcagaag tttgacgcct tctgaaaaga actaccccgt 3720 ccacaaactt gagttcttgg cattaaaatg ggcggttgtt gacaagctac atgactacct 3780 ctatggggtt aaatttgagg tccagacaga caataacccg ctaacttatg ttctgacaac 3840 agccaagctg gatgctgtag gacatcgttg gctggcagcc ctctcagcct atgacttcag 3900 tctcaagtac cggcctggga gacagaatat cgatgctgac gcattatctc gtcgatctca 3960 tcaaaaagaa aaggaatgga caagtattcc atcttctggt gtaagatctt tgtgtaagct 4020 gccaggccag gacaagccaa actgctttgc aagtgtcagc aggattgtga cgtcattagg 4080 tggatctcct aagtcaatcc ccaaaacgta ctgtaattta gctacattgg atactctaaa 4140 aattcctaaa ttgagttctg tggatctgtc ctgtgctcag caagaagacc catgtatggg 4200 tgagatatgg agagctttga aaaggggaga tttgtccaca gttaataaag aagctcacaa 4260 agacattcga ctgatcatga gagactggga caaactagtt ttggataatg gagtaatgtt 4320 tcggtgcact caagtaccag aaagaccttg tcataaacag ctatgtctgc ccaaaaagtt 4380 ccataaagtg gttttaaaat cacttcatga tgattcaggt cacatggggt ttgacaagac 4440 ctatggtttt gtgagaaata gattttactg gccccgaatg aaggttgatg ttgagaaata 4500 ttgccagaca tgtgccaggt gtgtccaaag aaagactttg cctagaaaag ctgcaccctt 4560 gtcacatttg tctagtaatg gaccgatgga tcttgtatgt atggattttc tatccataga 4620 gccagattcc aaaaacatct gtaatgtcct tgtcatcacg gaccactaca caaggtatgc 4680 acaggccttt ccaactaaag atcagaaggc aaccacagtt gtgaaagttc tgtgggaaaa 4740 atacttcatt cactacggac tgcctaaaag aatgcactca gatcaaggtc gggatttcga 4800 gagcaagctc atccatgagt tgtttgacct attaggagta gagaaatccc gaacaacacc 4860 gtatcatccg cagggtgatc cgcagcctga acggttcaac cgaacgcttt tagatatgct 4920 ggggaccctg gattcaaggc agaaacaaca gtggagtcga cacattggac atcttgtgca 4980 tgtgtacaat tgcagcatga atgacgccac agggtattcc ccttactttc ttatgtttgg 5040 cagggaagcc aggctgccca ttgacttgag ttttggtgtg tcaagtgatg gtacttctgt 5100 aaggtcttac caacggttcg taaagaagat gcagaacgaa ctgaaatctg cttatcagct 5160 agcagaggaa aatgctgaga agaaaaatgc aggaaataag agaagatatg atcagagatt 5220 gcattattct cacctggttc ctggggacaa agtcttgata cgaaatcttg gactccaagg 5280 aaagcacaaa ttggcagaca gatggaattc tgagctttat gttgtccaga gccaaatgcc 5340 taacctacct gtgtttcgac ttaagccaca gagtggaaat ggaccagtca aaattctgca 5400 caggaatcat attcttccta ttcgacaaca gttgagacag agagtttcag acaaaatacc 5460 aaaagctcaa gaaagacttc tgagacagag gaaaaatgtt gtagaaagtc agagtacaga 5520 acaaggaact aatgatgcct ctgtagtaca agaaattgaa gtcagagctg atcagactgt 5580 tgaagaaagt tgtgagagtg aggatgaaga atatggggta tgggttccaa ttcctaaaga 5640 cagtcaattg ccttttagtg ggaatgaatc tttacagagt gattcagttg tttctgagaa 5700 tgaagtcaga gatactgtaa aacctgtaga tgtgaaagat aagactcctc aaggagttag 5760 aacagatgat tctgtagaga ctgattcaag acttctggtt gagaatgatt cagaaaattt 5820 acaaaatttg agtcaagtgt cagatctttg tgatactgtt acaagtatag tgcaggatct 5880 agagagtgtg gatcagagta atgaaaacaa aactatctct gatagtttac ctgaaattaa 5940 agggaggtct cagagaatga gaactgtgcc caagaggtta acatatgata cacttgggat 6000 gccaacagtt gaacctgttg ttgtgcaggc caaagtagta aatccagtca aatacaattc 6060 ttctgtatat ttgtggttgc atgtgccatc aagaaaaatt agtacatgtg ctgttacttt 6120 gaagaaaaga cagtaattag agttattgct gggtaattga tcattgtgtc atatttgccg 6180 ttattttgtg tttttatttt gagagtactc tgcatatgtc attgattcag ataatgagta 6240 actgtgaaag aaaacaagtg taattttgtg atacttcatt catttagatg tatttttcct 6300 cgttatgagg gcataacgat tcttggtggg gggagaa 6337 // ID Gypsy76-I_DR repbase; DNA; ZEB; 4336 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy76-I_DR; KW Gypsy76-LTR_DR; Gypsy76_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4336 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1467-1467 (2008). XX DR Genome; chr13; Positions 37932842 37937177. XX CC Positions [1760-2215] - Reverse transcriptase CC Positions [3230-3709] - Integrase core CC 'AATTT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 18..1031 FT /product="Gypsy76-I_DR_2p" FT /translation="MNPAEASAQEMVSIVAQHDATVKRHEAALIQQEALME FT KHSQLLSELMTSMRQLSDSLRQRPTTSPEEASTPTPVIHTPQPQFEPRLPP FT PQRFSGDPETCEGFITQCDLTFKLQPSTFPSDQAKIAYILTLLSGKALAWA FT TAIWNAKSPCCTNYSAFVKELRRVFNHPLSGREASKHLLTLCQGGKSAAEY FT AIQFRTIAAGAGWNDEALIVCFQNGLSEALKDELATRDPAKTLESLIDQAI FT LLDNRLRERRLTPCYVPSRSPNLPNTPYATESPQFSEPDSPEPMQLGRARL FT TLQERTRRMKSRLCLYCGGSGHFRSDCPELKENSPSQAVRRELSRE" FT CDS 1148..4321 FT /product="Gypsy76-I_DR_1p" FT /translation="MDQTLATNLKIPLKLLDNPLEVTALDRRPIAPGAVTH FT LTVPLHVSVFHHKEEISFYLIESPDLPLIFGYPWLTLHNPHIDWSNGEILH FT WGPLCHKVCLDPPSNNPESQDPVDLSRVPPQYHSFHSVFSKQKASVLPPHR FT PYDCAIELMPGQCPPRGRVFSLSPPEQTAMETYIMEGLKAGIIRSSTSPAG FT AGFFFVGKKDGGLRPCIDYRALNKVTIRNRYPLPLMATAFELLQKATIFSK FT LDLRNAYHLVRIKQGDEWKTAFNTPTGHYEYLVMPFGLTNAPAVFQALIND FT VLREMLNKFVFVYLDDILIFSSSLQEHESHVRKVLRRLQENHLFVKPEKCE FT FHTTEVLFLGFIIKPGQVQMDPKKVQAVLDWPAPTSVKEVQRFIGFANFYR FT KFVQNFSSVVAPLTALTKVGSARISWNPEAEAAFRELKRRFTSAPILTIPN FT PELPFVVEVDASDVGVGAVLSQRGKDNCLHPCAFLSHRLSSCERNYHVGDR FT ELLAVKLALEEWRHWLEGAKHPFQVLTDHKNLEYIQQAKRLTPRQARWSLF FT FNRFDFVLSYRSGSKNLKPDALSRIYSNTPRERNPEAIIPQSKILAPLRWG FT FESAIRKAQIQDPDPGGGPTNRLYVPKAVRSQVLQWAHSSRLTCHPGISRT FT LDFLQRRFWWPTAKKDVISLVRACPVCNRGKTSHLPPQGLLHPLPIPHRPW FT SHLSMDFITGLPPSHGNTVILVIVDRFSKAARFIPLPKLPSAKETADLITS FT HVFRVFGIPQDIVSDRGPQFSSRFRRSFCQSLGASISLSSGFHPESNGQTE FT RINQSLETTLRCMVGHNPSSWSSYLLWAEYAHNTLRSSSTGLSPFECQFGF FT SPALFPEQEIQVAVPSVQQHINRCRRTWRVVRRKLIQTSNQYQRQANLRRR FT SAPSLRVGQRVWLSTRNLPLRIESPKLNQRYIGPFKIIRRVNPVSYRLEIP FT RSFKINPTFHVSLLKPVLCSSFVPSDRSPPPPRNIGGKPAYTVRRILDVRR FT VQRSRQYLVDWEGYGPEERSWIPAKDILDPKLIRDFQARSSGGLGGNVRSR FT S" XX SQ Sequence 4336 BP; 1029 A; 1215 C; 946 G; 1146 T; 0 other; gaaaaaccaa gtcaaacatg aaccctgctg aggcgtctgc ccaagaaatg gtctctattg 60 ttgctcagca tgacgctaca gtcaagcgcc atgaggcggc cttgattcag caagaagcac 120 ttatggagaa gcattcccag ttactatcag aactcatgac ttccatgcga caactctctg 180 acagtctgcg tcaacgtcca actacatccc cagaagaagc ttcaacccca acacctgtca 240 ttcatacccc tcaacctcag tttgaaccac gtcttccccc tccgcaacgc ttttcagggg 300 acccggaaac ctgtgagggg tttataactc agtgtgacct cactttcaag cttcaaccct 360 ccacattccc ttcagatcag gctaagatcg cttatatact caccctatta tctggcaaag 420 cacttgcttg ggcaacagca atctggaatg ctaagtctcc ctgctgtact aattactcag 480 cttttgtgaa ggaattacgg cgtgtattta accatcccct ttcaggacga gaggcttcca 540 agcacctact caccttgtgt caaggaggta aaagtgccgc agaatatgcc attcaattcc 600 gaacaattgc tgcaggagca ggctggaatg atgaggccct catcgtttgt tttcagaatg 660 ggttgtctga ggccctcaaa gatgagttgg ctacccgaga tccagctaaa acccttgagt 720 ccttaataga ccaagctatc ctcctggaca accgcctgag agaaagaaga ctaaccccct 780 gctatgttcc ttcacgttct cccaacctcc ccaatacccc atatgctact gagtcacctc 840 agttttcaga accagattcc ccagagccta tgcaactagg ccgtgcccgg ctaactcttc 900 aggaacgcac tcgccgtatg aagagccgtc tctgcctcta ttgtggtggg tctggtcatt 960 tccgttctga ttgtcctgag ttgaaggaaa actcaccatc ccaggcagta aggagagagc 1020 tgtctcggga gtgacccaat tcgtctctcc tagcaattcc agtctgctct tacccatcac 1080 tttaacattt gagtctaagg aacatcaact tcaagcactt gttgattctg gagcagcggg 1140 gaacttcatg gatcagacac tagcaactaa tctcaagatt ccccttaaac tgttagataa 1200 ccctctggag gtgaccgctt tggataggag accgatagct cccggagctg ttacccatct 1260 tactgttcct ctgcatgtct ctgtgttcca ccacaaggag gagatatcct tttatttgat 1320 tgagtcccct gacctgccct tgatctttgg ctatccctgg cttactcttc ataaccccca 1380 cattgattgg tctaatggcg aaattctcca ctggggcccc ctgtgtcaca aagtttgctt 1440 agatccacct agtaataatc ccgagtctca agaccctgtt gacctttctc gagtcccacc 1500 ccagtatcac tcctttcact cggtcttcag caaacagaaa gcttctgtct tacccccaca 1560 tagaccatat gattgtgcca tagagctaat gcccggccag tgcccccctc gtggtcgggt 1620 gttttcattg tctcctcctg agcaaacggc catggagaca tatatcatgg agggccttaa 1680 agcagggatc atccgttctt ccacatctcc ggcaggtgca ggttttttct ttgtggggaa 1740 gaaggatgga ggtctccggc cctgcataga ttacagagct ctcaataagg taacaatccg 1800 gaatcgttat ccccttcccc ttatggccac tgccttcgag ttactccaga aagccaccat 1860 attttctaaa ctggacctgc gcaatgccta ccacttggta cggatcaagc agggcgatga 1920 gtggaagacg gcatttaata cccctacagg ccactatgag tatttggtca tgcccttcgg 1980 ccttactaac gctcctgcag tttttcaggc tctcataaat gatgtcctaa gggagatgct 2040 gaacaagttt gtttttgtct acttagatga catcctgatc ttctcgagtt ctctccaaga 2100 gcatgagagt cacgtcagaa aggtattaag acgtcttcag gagaaccatc tatttgttaa 2160 acccgagaaa tgtgagtttc acaccactga ggtcttgttc ttgggcttca tcattaaacc 2220 agggcaggta cagatggatc ccaagaaggt ccaggcagtg ttggactggc cagctcctac 2280 ctctgtaaag gaggttcaga ggttcatagg atttgccaat ttctatagga aatttgttca 2340 gaatttcagc tccgtggtgg ctcccctgac agcccttaca aaagtgggtt cagccagaat 2400 ttcctggaat ccggaggctg aggcggcttt cagggaactc aaacgcagat ttacctctgc 2460 acccatcctt actatcccga atccagaatt accctttgtg gtcgaagttg atgcctcaga 2520 tgtgggagtg ggagctgtgc tatcccaaag ggggaaagat aactgtcttc atccatgtgc 2580 atttctttcc catcgtctca gttcctgtga acgtaactat cacgttggag accgagaact 2640 actggctgta aagttggcat tggaggaatg gaggcactgg ctggagggtg ccaagcatcc 2700 gtttcaagta ctgactgacc acaagaactt ggagtatatc cagcaggcta aacgccttac 2760 tccccgtcaa gccaggtggt ccctcttctt taaccggttt gactttgttc tatcctaccg 2820 ctctggatcc aagaacctaa aaccagatgc tctgtcccga atatacagta acactccccg 2880 ggagaggaac ccagaagcta tcattcccca atccaagatt cttgcacccc tcagatgggg 2940 gtttgagtca gccatccgta aagcccagat tcaagatcca gacccaggtg gcggccctac 3000 taaccgattg tatgttccaa aggctgttcg ttctcaagtt cttcaatggg cacactcttc 3060 ccggctgacc tgccatccag gtatttcccg caccttggac tttttgcaaa gacgcttctg 3120 gtggcctaca gctaaaaagg acgtaatatc attggttagg gcctgccctg tttgcaaccg 3180 gggaaaaacc tcacatcttc ccccccaagg tcttctgcac cctctgccta ttcctcaccg 3240 accatggtcc cacctctcaa tggacttcat tactggcctc ccaccctccc atggaaacac 3300 cgttattctg gtcattgtcg accggttctc caaggctgcc cgattcattc ctttaccaaa 3360 actcccatct gctaaggaga cagcagacct cattacttcc catgtttttc gagtctttgg 3420 aattccacag gacattgtgt cggaccgggg accccagttc tcatcccggt tcaggagatc 3480 tttttgtcag tcattgggtg catccattag tttgtcttct ggttttcatc ccgagtctaa 3540 tggacagaca gaaaggatta atcagagtct cgagaccacg ttaaggtgca tggtgggtca 3600 caacccttca tcatggtcct catacctcct atgggctgaa tatgctcaca acactctgcg 3660 ctcatcctcc actgggttat ccccctttga gtgccagttt ggattctccc ctgccctctt 3720 ccctgagcaa gagattcaag tggctgttcc ctctgtacag cagcatatta atcgatgccg 3780 tcggacctgg agagtagtcc gccgaaaact catccaaacc tctaaccagt accagaggca 3840 ggccaacctt agacgcagat ctgctccctc ccttcgagtt ggtcaacggg tttggctctc 3900 cactcggaac cttccccttc ggatagagtc ccccaagctt aaccaacgct acatcggacc 3960 atttaaaatt atcaggagag taaacccagt ttcttatagg ctagaaatac ctaggtcatt 4020 taagattaat cccacctttc atgtatctct cctaaaacct gtgttgtgtt cttcttttgt 4080 tccctctgac agatcccctc ctcctcctcg caacattggg ggtaagcctg cctacacagt 4140 ccgcaggata ctggatgttc ggagggtgca acggtctcga cagtatcttg tggactggga 4200 ggggtatggt cccgaggagc gctcctggat tcccgctaag gacatccttg accccaagtt 4260 gatccgggat tttcaagctc gcagttcggg tggtcttggt gggaacgtca ggagccgttc 4320 ctagaggggg gggtcc 4336 // ID ERV1-1-LTR_DR repbase; DNA; ZEB; 578 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 03-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1-1_DR; KW ERV1-1-I_DR; ERV1-1-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-578 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1595-1595 (2008). XX DR Genome; chr20; Positions 8885997 8886574. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 578 BP; 208 A; 94 C; 118 G; 158 T; 0 other; tgacggagaa attgtaatct tgtctccaat agtgtgcagt gaacatatat ttgtatgtga 60 aggatttgat cttctctcca atagtgtgca gtgaacatat atttggtaaa gtgtgcagtg 120 aatatgcagt aaatatctcc aattgtgtac aagtgtgcaa gtaaacatat attgtttaac 180 atatgcaatg aacatgttat tctgtataat catattaata tgtatacaaa taagtgtgaa 240 gtgaatgtat aacagtaaga gcaagcctag tagttagctg agtacatgta aactttaaac 300 cagcacctaa agggcaaaca gccaagaata taactgagtt caaaaggcac acagccaaaa 360 atatagcagg ttcacagcac aagaccatat ctgtgggaaa agtctgcaac actacatgta 420 taaaagcaag acagggacac agagaggcag aggttcatgg agagacattt gtttgtctgt 480 tagcctctcg tttttattgt ctggcaataa atccaacttt tgaaatcaag acccactgac 540 tcgattgctg attgaaagac aaggaacaga acacgaca 578 // ID I-4_DR repbase; DNA; ZEB; 1414 BP. XX AC . XX DT 23-OCT-2008 (Rel. 13.1, Created) DT 23-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Non-LTR retrotransposon from zebrafish - consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; I-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1414 RA Bao W. and Jurka J.; RT "I-type retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1344-1344 (2008). XX DR [1] (Consensus) XX CC 5'-portion is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(56..556,490..669,645..1163) FT /product="I-4_DR_1p" FT /translation="SLYIEAKEPSLERRRLKXALQYAVKLKSKRGNPAYPS FT VFHPEYLELYEDRPNYISPFGIRMKPHLKNMGVNMEKLSLPDLSHIPPWCN FT SKIRILWNLTKQKKSLTHPNEYKQEFKQITDSLPHHEPIYTDGSKTDKGVT FT AAVVFKKMMFLVLGFQEKVQFTQQKLMQKDDVFGTRIPGESSIYTAEAYAI FT NMALEQMERTQYRNFLICTDSKSCLQALEASISVITPSFYKCDHPIISMIV FT KKMTTLENKKFNIIFCWIPGHMGLHGNELADEAAKQAISLKVTECLIPPED FT LKPTINLYINETWQREWDNCINNKLHEINPKINERHLANYNFETRHDQTVY FT TRCRIGHSRLTHLYLLNNEEPPKCNYCQAALTIKHILLECGSLNTIRQNFX FT LRRVL*" XX SQ Sequence 1414 BP; 525 A; 240 C; 226 G; 417 T; 6 other; ayatcacaca ggactacgaa tctcattggg agcattcaaa acatcccctg tttaaagttt 60 atatattgaa gcaaaagaac catcccttga aagacgacgt ttaaaattwg cactccaata 120 tgctgtcaaa ctgaaatcaa aaagaggtaa cccagcttat ccttcagtyt tccaccctga 180 atatttagaa ttgtatgaag acagacccaa ctatatcagt ccttttggaa ttcgtatgaa 240 accccacctg aagaatatgg gtgttaatat ggaaaagctt tcactacctg atttgagcca 300 tattcctcct tggtgcaact caaaaataag aatcttatgg aacttaacca aacagaagaa 360 gtccttgact cacccaaatg aatacaaaca agaattcaaa caaataacag acagtctccc 420 tcatcatgaa cccatatata cagatggatc gaaaacagac aaaggagtaa cagcagcagt 480 agtgtttaaa aagatgatgt ttttggtact cggattccag gagaaagttc aatttacaca 540 gcagaagctt atgcaataaa tatggcttta gaacaaatgg aaaggacaca gtaccgtaac 600 tttttaatct gcacagattc caaatcctgt cttcaagcac ttgaagcttc tataagtgtg 660 atcaccccat aatttccatg attgtaaaga aaatgacaac actagaaaat aagaagttca 720 acattatctt ctgttggatt ccaggtcata tgggactaca tggaaatgaa ctggcagatg 780 aagcagcaaa acaggctatc tccttgaaag tgacagaatg tttaattcct cctgaggact 840 tgaaacccac aatyaatttg tacataaacg aaacatggca aagggaatgg gacaactgca 900 taaataataa attacatgaa attaacccga aaataaatga aagacattta gcaaattata 960 attttgagac caggcatgat cagactgtct atactagatg tcgaattgga cattctaggt 1020 tgacacattt atatttactg aataacgaag aaccacctaa atgtaattac tgccaggctg 1080 ctcttaccat caaacatatt ttgttggaat gtggaagttt gaataccatt agacaaaatt 1140 tttwtttaag gagagtactt tgatggacat ttttaaaaag gtaccaccag gaagaatttt 1200 aaattattta tcaaaaattg aactgaaaaa tcatatttaa ctttgaatat atttttgtgt 1260 gaattttatt tatttatatt ttatatattt gtctaagtaa ctttttattg taatacgaat 1320 ttgtttttat catgtaatay ttttatgtat atccattttg ccacgaaata gccataagtt 1380 gctgatgtgg caataaataa ataataaata aaat 1414 // ID TDR19 repbase; DNA; ZEB; 392 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR19. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-392 RA Jurka J. and Drazkiewicz A.; RT "TDR19: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 29-29 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. CC Several hundred copies in the genome. XX SQ Sequence 392 BP; 96 A; 95 C; 93 G; 97 T; 11 other; taagagcttg attagctggt tcaggtgtgt ttgattaggg ttggagctaa actntgcagn 60 ncnncggccc tccaggaccg agtttggaca cctncagggg tgtcaaactc agttcctgga 120 gggccgcagc nctgcacagt ttagctccaa ccctaatcaa atcacacctg atcaacaact 180 tataaaagnc ctgaagacct tgattagttt gatcaggtgt gtttgattag ggttggarct 240 aaactgtgca gagctgcggc cctccaggaa ctgagtttga cacccctgct ctaaactgta 300 ggtgtcnaaa cttrgtcctg gagggccggt gtcctgcaga gtttagctcc aaccctaatc 360 aaacacacct gaaccagcta atcaagctct ta 392 // ID BEL7-I_DR repbase; DNA; ZEB; 6067 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL7-I_DR; KW BEL7-LTR_DR; BEL7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6067 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1363-1363 (2008). XX DR Genome; chr18; Positions 45451720 45445654. XX CC Positions [5098-5658] - Integrase core CC 'CATGT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 37..6018 FT /product="BEL7-I_DR_1p" FT /translation="MSADGNPTSPSHARPLRQCQAPRYLEDYVLDSHHRPA FT LTSSTVHSVMEEPKGAAAAVYSSQADVTSSQGRHLVTPSPSQADLLSMDSL FT TKLMETINEKDKEENAEMHSLLRKLEQLKKRKQRREEMMEHITSFIREEQG FT NENNEPVQSSPAPSNPSSPPPLLSWDTVPQSEHQVAAGLSMGPEKSSHQMS FT AIVSRPTVGSHNAAYESVNQSRPQTTDLLFSPLSSGMASPVPHPPQSQLYE FT PMSSQAPRPLPRQEQVSTDHLERSFHPLEPQVISPTPVYPRSMAPYGYITT FT PYSQMQLVNAPNSHNLAPPYQAMMSSTLLPQNMFSGPMPSVSTAAVAPQLV FT TAQVAAPPNGSIPQRALYSPPKPKIPDFTNDSERDFANMKLALDNLLEPYP FT ELTEKYKYHVLLEHLKLPEAQMIGQSCRHHPYPYSAAMQALQLQYGQPHQL FT AQSEIAAILTSPDVKPNDGHSFQSFALRVHLLVSMLLSLEGPRGMELNCCS FT HVDRLLSKLPKYLRDGFIEFLQQQGKLNALSLNTYNLQDFAGWLQVKAQQQ FT RLSNRLMQRYQHERPSSIGKEKTQARTKGTSLVLYQGAAPNETALPTHPKE FT NKQRKPLKVQCLFCNSKEHYITRCNNIKEQSVAELHKWISDGRRCWKCARF FT HAPEMCNLKKPCSDCGRIHLQVLHRIAPNDQPNTSGTPESRIYLTPASVTC FT RVLLKVVPVLLHGKFKSVETYAILDDGAQRTIVLPMAAQMLQLKGEPEVLT FT LRTVRADVTHLSGSKINLEISPKGEPNKRFQILDAFTASGLDLVEQSYPVQ FT RLQKLYAHLRGLPVQSFHNVRPLVLIGSDHVHLITADKPVHQGVKGGPVAI FT HTALGWALQGAEKSTPIHNSVQQCLFTSASCTNDLLYRSVEKLWQLDILPY FT RNEKLVVRSREDQEAIKLLECKTQTINVEGVQRYATPLLRKPGAPTLLSTT FT RSVMPTLRSIERKLQRDPEKATIYSREIEKLIKAGYVTKIRPDEISQSKEA FT WYIPHHLVCHNDKPRLVFNCSFRSQGQSLNDQLLPGPALGPSLLGVLLRFR FT QYNVAVSADIRGMFHQVRLLPEDRPLLRFIWRDLQYENCPDVYEWQVLPFG FT TTSSPCCAIFAVQQHARNNQESYPSALQTIQQSFYVDNYLASFPTTSEARV FT SVDQLRSLLATGGFDLRPWASNQPAVISHLPTEARSSAAEQWLAQNRTDPM FT EPTLGLRWNCAADTLGYHYRPIEHATLTMRTAYQILATQYDPLGFIVPFTT FT RAKVLIQQLWSKKRDWDDPNLPQDLSRAWETWENELKHLSDITIPRCYSSI FT PHDAQQQYELHVFCDASERAYGAVAYLVEYADVIHSTFVMPRSRIAPKRQQ FT SIPRLELCAALAGAQLAKLLKEEITLKIQQIVLWTDSTTVLEWLQSDSCRF FT KVFVGTRVSEIQDLTEHSTWRYVDTQQNPADDITRGKPLQSFTIPGRWSQG FT PSFLNLGPEHWPKRPEPSRSEGLPDLKGTNICCLMTVVPDHTFPDATHFHT FT WKELVEATRQADGKATGSQPDTQLMDHRTAELELIKGSQAQSFPEETAALK FT MHKPVPNHSRLRCLAPEWDPVMEVIRVGGRLRRLESLNAEEIHPIVLDPQH FT PTTKLLIKEFDERLLHPGTERVYAKLRRQYWILRGRQAVKHHQLKCSSCQR FT WRAQPKVPKMADLPPERLRILCPPFYSTGVDCFGPYQVKIGRRVEKRLGVI FT FKCLTTRAVHIELLNSMDVDAFLLALRRFIARRGRPKELRSDCGTNFRGAD FT RELREAFAAMESPLKERLADHQITFKFNPPHAPHFGGTWEREVRSIKTALR FT AAVGGQSVSEDVLTTVLVEVEGILNSKPLGYASTDIADPDPITPNLLLMGR FT RDASLPQVTYASGEMGRRRWRHCQNLVDQFWTQFTRNYLPTLQTRQKWQTS FT SDNLKVGSVVLIVDPQLPRAQWPIGKVAETVISDDGCVRSANINVQGKVYT FT RPVARLIQLPAFQDDSKDS" XX SQ Sequence 6067 BP; 1765 A; 1510 C; 1343 G; 1449 T; 0 other; aatggtcctt cgagccggag cactgactgc ttcagaatgt cagcagatgg caatccaaca 60 tctccttctc atgccagacc attacgacaa tgtcaagccc ccaggtatct agaggattat 120 gtccttgact cccatcaccg acctgccctc acctctagta cggttcacag tgtaatggag 180 gagccaaaag gcgcagcagc cgcagtgtat tcgagtcaag cagatgtaac ctccagtcaa 240 ggtagacatc tggtcacacc cagtccaagt caagcagact tattgagtat ggactcatta 300 actaaattaa tggaaaccat aaacgagaag gacaaagaag aaaatgctga aatgcacagt 360 cttctcagga aattagagca gctcaagaaa cggaaacagc gcagagagga gatgatggag 420 cacattacgt ctttcataag ggaggaacag ggaaatgaga acaatgaacc agtacaatct 480 tctcctgcac cttctaatcc atcaagtcct ccacccttac tgtcatggga cacagtacca 540 caatcagagc accaagttgc tgcaggctta tcaatgggac cagagaagtc gtcacatcag 600 atgtcagcta ttgtttctag accaacggtg gggagccata atgctgctta tgaatcagtc 660 aatcagagtc gtccacaaac tacagattta ttgttttccc ctctctcatc tggcatggca 720 tctcctgttc cccatccgcc acagtcacag ctatatgagc ccatgtcctc tcaggctccc 780 agacctctcc ctaggcagga acaagttagc actgatcatt tagaacgatc attccacccc 840 cttgaaccac aggtgatttc gccaacacca gtctatccac gttctatggc accttatggc 900 tatattacaa caccatactc acaaatgcag ttggtgaatg cacccaattc acacaatctt 960 gcccctccgt atcaagcaat gatgtcatca acattacttc cccaaaatat gttctcaggc 1020 ccaatgcctt cagtgagcac ggctgcagtc gcaccacagt tggtcacagc tcaggtagca 1080 gcaccaccca atggtagtat acctcaacgt gccctataca gccccccaaa accaaaaatt 1140 ccagacttca caaacgacag tgaacgagat tttgctaata tgaagctagc attggacaat 1200 ttgcttgaac cttaccctga actaacagaa aagtacaaat accatgtact attggagcat 1260 cttaaacttc ctgaagctca gatgataggg cagtcatgta ggcatcatcc atatccctat 1320 tcagcagcta tgcaagcact tcaattacag tatggtcagc cccatcaact ggctcaaagt 1380 gaaatagcag ccatcctcac atcaccagat gtaaagccca atgatggtca tagctttcag 1440 agtttcgctt tgagagttca tcttctggtc agtatgctgc tgtcactaga gggacccaga 1500 gggatggagc ttaactgttg ctcgcatgtg gaccgcttgc ttagcaagct accaaaatac 1560 cttcgagatg gcttcattga gttcctacaa cagcagggaa aacttaatgc cttaagtttg 1620 aacacttata atctgcaaga cttcgctggg tggctacaag tcaaagcaca gcagcagcgg 1680 ctgtctaatc gactaatgca acgctatcaa catgaaaggc cttcaagtat tggaaaggag 1740 aaaactcaag ccagaacaaa ggggacaagt ttagtacttt atcagggtgc ggccccaaat 1800 gagactgctc tccctaccca tcctaaagag aacaaacaaa ggaagccact taaggtacaa 1860 tgtctgttct gcaatagtaa agaacattac attacacgct gcaacaatat caaagagcag 1920 tcagtcgcag agctccataa gtggatttca gatggaaggc gatgctggaa atgtgcaaga 1980 tttcatgcac cagagatgtg caacctcaaa aagccatgca gtgactgtgg tagaatacat 2040 ctacaggtgc tccatcgtat cgcacctaat gatcaaccaa acacatcagg tacgccagag 2100 agtcgtattt acctcacccc tgccagtgtg acctgtagag tgcttcttaa agtggttcca 2160 gtgttgctac atggcaagtt caaatcagta gaaacctatg ctatactaga cgatggagcg 2220 caacgaacca ttgtcttacc aatggctgca cagatgcttc agttaaaagg tgaacctgaa 2280 gtactgactt tacgcacagt gcgagcagat gttacccatc tttctggatc caagattaat 2340 ctggaaattt ccccaaaagg cgaacccaat aaacgtttcc agatacttga cgccttcaca 2400 gcttccggtt tggatctagt tgagcaatca tatccagtac agagacttca aaagctgtat 2460 gctcacctta gagggctccc tgtacagtcc ttccacaacg tacgcccact agtactgatc 2520 ggatcagatc atgttcatct tatcacagcc gataagccag tccaccaagg agtcaaaggt 2580 ggcccagttg caatacacac agccctcgga tgggcattac aaggagcaga gaagagcaca 2640 ccaatccaca attcagtaca gcagtgtctt tttacatcgg cctcctgtac caatgacttg 2700 ctttatcgca gtgttgaaaa gttatggcaa ctggacatct tgccttatcg aaatgagaaa 2760 ctggtggttc gatccaggga agaccaagaa gcaatcaaac ttcttgagtg taaaacacag 2820 acaattaatg tggagggtgt ccagcggtat gccacccctc tcctgcgcaa acctggggct 2880 cctacactat tgagtacgac tcgatccgtc atgcctactt tgagaagtat tgagaggaaa 2940 ctccagagag accctgagaa agcaaccatc tactctcgtg agattgagaa gctcattaaa 3000 gcaggctatg tcactaaaat tcgtccagat gaaatcagcc agtccaaaga ggcctggtac 3060 ataccccacc acctggtgtg tcacaatgat aaaccaagac tggtcttcaa ttgctcattc 3120 aggtctcaag gtcagtctct gaacgatcaa ctgctcccag ggcctgcact tggtccatca 3180 ttattgggcg ttcttttaag atttcgtcaa tacaatgtgg ctgtaagtgc agatatccgg 3240 ggaatgttcc atcaggttcg cttgttacct gaggacagac ctcttctccg cttcatttgg 3300 cgagatctgc aatatgagaa ttgtccagat gtatatgaat ggcaggtttt gccatttggt 3360 acaaccagta gcccatgttg tgccattttt gctgtgcagc agcatgctcg caataatcaa 3420 gagagctacc caagtgcttt acagacaatc cagcaaagct tctatgtgga caattatcta 3480 gcgagttttc ccaccacatc ggaagccagg gtgtccgttg atcaactgcg cagcctacta 3540 gctacaggtg gctttgacct tagaccatgg gctagcaacc agcccgcagt gatctcacat 3600 ctacccactg aggcaagatc ttcagctgca gaacagtggc ttgcacaaaa tagaactgat 3660 ccaatggaac caactctggg cctcaggtgg aactgtgctg ctgatacact gggataccac 3720 tatcgtccaa tagaacatgc aaccctgaca atgagaacag cgtatcaaat actagccaca 3780 cagtatgatc cactgggctt cattgtgccc tttaccacac gagccaaggt gctcattcaa 3840 cagttgtggt ctaaaaagcg agattgggac gacccaaatt taccacaaga tctcagcagg 3900 gcctgggaaa catgggagaa tgagttaaaa catctcagcg atataaccat acctcgttgc 3960 tactcatcta ttccccacga tgcacaacaa cagtacgaac tacatgtgtt ctgtgacgca 4020 tccgaacggg cttatggggc tgtggcttac ttggtggagt atgcagatgt tattcattca 4080 acctttgtca tgcccagatc aagaattgca cctaagaggc aacaatccat acctcgactg 4140 gagttatgtg ctgcattagc aggtgcacag ctggcaaagc tcctgaagga ggagatcacc 4200 cttaaaatcc agcagatagt tttatggact gattctacaa ctgtgctaga gtggctccag 4260 tccgattcat gtcgatttaa agtttttgtt ggaacacggg tctcagaaat ccaggatctg 4320 actgagcata gtacttggcg ctatgttgac acacagcaaa acccagcaga tgatatcacc 4380 aggggtaaac cactccaaag ttttaccatt cctgggcgct ggagtcaggg accctcattc 4440 ctaaacctgg gtcctgagca ctggccaaaa agacctgaac catcaaggtc agaaggatta 4500 cctgatttaa agggtaccaa tatctgttgt ttaatgactg tggtgccaga tcatactttc 4560 cccgatgcca ctcatttcca tacatggaaa gagttagtgg aagcgactcg acaggccgat 4620 ggaaaggcga caggtagtca accagatact caactcatgg accatcgcac agcagaattg 4680 gaattaataa aagggtctca ggctcaaagc tttcccgagg agaccgcagc cctcaaaatg 4740 cacaaacctg taccaaatca cagtcgatta agatgtcttg ctccagagtg ggatccggtg 4800 atggaggtaa tcagagtcgg aggacgatta cgaaggctgg aaagcctaaa tgctgaggaa 4860 attcacccta tagtattaga cccacagcat ccaaccacaa agcttctcat caaagagttt 4920 gatgagcgcc tgctccaccc aggtacagaa agagtgtacg ccaaacttcg gagacaatat 4980 tggatcttga gggggcggca agctgtcaaa catcaccagc ttaaatgttc atcatgccaa 5040 cgatggagag ctcagcctaa ggtaccaaaa atggcagacc tgccaccaga gcggctcagg 5100 attctttgcc caccattcta ctctactgga gtagactgtt ttggtccata tcaggtaaaa 5160 ataggcagac gagtggagaa acgtttggga gtcattttta aatgtctcac aacaagagcg 5220 gttcacattg agctcctcaa ctccatggat gtggatgcct tcttgcttgc tttgcgacga 5280 ttcatagcca gacgtggcag accaaaggag cttaggtcag attgtggtac aaacttccgt 5340 ggggctgatc gagagctaag agaagccttt gctgccatgg aatccccatt aaaagagcgg 5400 ctagctgatc atcagatcac atttaagttc aaccctcctc atgcaccgca tttcggtggc 5460 acttgggaac gagaggttcg ttccataaag actgctctac gggcagcagt tggtggccag 5520 tctgtctcag aagatgtcct cactactgta ttagtagagg tagagggcat tttgaactca 5580 aaaccgcttg gatatgcctc cacggatatt gcagatccag atcctattac accaaacctt 5640 ctcctcatgg ggcggcggga tgcgtcactc ccacaagtga cttatgcttc aggggaaatg 5700 ggacggcgca gatggcgcca ttgtcaaaat cttgtggatc agttctggac tcagtttaca 5760 aggaattact tgcccacact acaaacccgc caaaagtggc aaacttcatc cgacaatcta 5820 aaagtgggtt cagtggtttt gattgttgac ccacaattgc ccagggctca gtggcccatt 5880 ggcaaagtgg ccgagactgt tatcagtgat gatggttgtg taagatctgc aaacataaat 5940 gttcaaggta aagtgtacac taggccagta gcccggctta ttcagttgcc agcctttcaa 6000 gatgacagta aagactccta aagaattctc ttgacttgca catttgctgt tcaaatgtgg 6060 gggcggc 6067 // ID CR1-15_DR repbase; DNA; ZEB; 1664 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-15_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-15_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1664 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1689-1689 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1664 BP; 541 A; 262 C; 319 G; 541 T; 1 other; gttaataatt tagatatagt aggaactaag actcagtcac agaccatgat catacaggaa 60 agagaggtta ttgcagtgct tcataatgtt aaatcacaca aggcatctgg cccagatggc 120 ctgaagggga aagttttaaa agagtgtgcg gagcaattgg gttgtgtgca ccagactgtt 180 tcaagtattt ctagacaatg gttttgtacc attagcctgg aaaaattcaa tagttattcc 240 agtacctaaa atacctcatg taaagtcctt aaatgatttt cgcccaatca cgcttacgtc 300 tgttctttgt aaatgtatgg aacgaatagt gtgtaaggaa ttgtcctctc aagttgagaa 360 gacaatggat cccctgcagt ttgcttatag ggccaacagg agcactgtgg atgcctctct 420 tactcttttg aacaaagttc aacatcatct agataggcca aatacgtatg ttagaatact 480 ttttatggat ttctcagctg cttttaatac tgttcaacct aatcttcttt tagaacgtct 540 ttataattta ggagtaagta actggttgat tatatggatc agagagtttt taaaaaaccg 600 accacaacgt gtctgtgtta atcatgctat gtctgactgc tctgttttaa atataggggt 660 acctcaggga tgtgtactat cacctctttt attttctata tatattaatg aactgcagtg 720 caatattgat gacaatctca cactcataaa atatgctgat gatctggcat tggtttcctg 780 tcaggaagaa atgaatagct ctttatattt tagatatatt gatagtatta tagattggtt 840 tggcaaaagt tcattacacc taaatattgg gaaaacaaaa gagttatgtc tggggagtca 900 aacaagaaga gtagatactg gactctttaa accagtcacc ataaacggac aaatagtaga 960 gcaggtttca aatttcaagt atttaggcac aattattgac aataaattga attttaatag 1020 taatgtagag gcggtttata agaaagcaag tcagcgcctc tatctcctcc gcaaattgag 1080 aagttttaat gttagtacac agactttaaa cttggcatat agatcactta tagagagtgt 1140 tctcacatat aatattgttt catggtttgg taacacaaca cttaaacaga aaaagaaatt 1200 agcgcaaatt atcaatcagg caaataagat aactggtcac aaacagcatt cattgcagat 1260 tttatttgac tcttttatgg aaaaaagagc agttgtgatt tttaaagata atacacatcc 1320 ccttcactca gcatttgaat tgctaccatc aggacgcagg tttagaattc ctaaagcacg 1380 aagaaatgcc tataaaaagt catttatccc aatggcagtt gttgttctaa accggatcaa 1440 tgtttgtagt gaatgaggtg gttaccaagt atgatttgca gtattgtgya tggggctaat 1500 ttggttttta aacttttaac ttatgttgtt gtgttgataa gtgttattgt gtcgttgttg 1560 tatgtttgta attgttgtcc taaatccagt gttgaagaaa aatttccttt ctgtgtcaag 1620 cagaacggac aataaaagtt taaactaaac taaactaaac taaa 1664 // ID DNA-8-2_DR repbase; DNA; ZEB; 824 BP. XX AC . XX DT 09-SEP-2003 (Rel. 8.08, Created) DT 05-OCT-2008 (Rel. 13.11, Last updated, Version 2) XX DE DNA-8-2_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; 2-bp TSD; DNA-8-2_DR. XX NM DNA-8-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-824 RA Kapitonov V.V. and Jurka J.; RT "DNA-8-2_DR, an ancient nonautonomous DNA transposon from RT zebrafish."; RL Repbase Reports 3(8), 151-151 (2003). XX RN [2] RP 1-824 RA Jurka J.; RT "Classified as hAT transposon based on revised size of TSD (8 RT bp)."; RL Direct Submission to RR (05-OCT-2008). XX DR [1] (Consensus) XX CC This element is characterized by 8-bp terminal inverted repeats CC and 8-bp target site duplications. There are >2000 copies of this CC element present in the genome. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 824 BP; 277 A; 143 C; 128 G; 272 T; 4 other; tagggttgtc gcgataccat taattcatct tacgatacta taccagctga agtatcayga 60 taccaagtag tattgcgata ctgtaattcc ataactcaaa ttataaagaa aattgtcaga 120 aatactatat tttatgttat aataggccta cttgaattta attcataatt cccttattat 180 taaaaagtat tatttgcacg tcacttcacc taaaatgtat tcattctttt tgtttatttt 240 gtagataact gaccaacaaa aatacattaa aataaagata caaattatac caaatgaaat 300 ttgttacaaa agagcatttt tccaacaaaa ttaggctata tgaagtggtc aaaaattgtt 360 tcaatgtttc ctgttgctat tttgggtttt tcytaatcta ttttttttca gtcttatcag 420 gtaacaatcc aaagtaattc aaaatttcac tttgtgagtc gttcttttta aggagattgt 480 cgcggttgtt gtagctacta aatcatattg acaggaacag aaatgaacyg attcagatgt 540 gcgttcgaac tgaagcgtca gaaaacgtgc aataggctac cataaaaggn gcgaattcta 600 cacagttttg caaccttaaa gggctccaca gactattcaa ataaaatggt ctattgttgc 660 acaaacgtgt gagcatttta gtgacttttc cacatgcaac attatctatt gtagatagac 720 cattaatgac gatactaccg tttacaaact acagtggcac cgccagtatt ttggagccat 780 agtatcacga tactaccata gtaccggtaa accgtgcaac ccta 824 // ID BHIKHARI-3-LTR_DR repbase; DNA; ZEB; 1876 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR of retrotransposon bhikhari from Danio rerio - a consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; LTR; KW BHIKHARI-2-LTR_DR; BHIKHARI-3-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1876 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1775-1775 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1876 BP; 592 A; 380 C; 321 G; 583 T; 0 other; tgtagtgatc agcggtaaag ccaggaactt gtggaaatta atttccaata taaacaaatg 60 ctgcagagtt aacactaaac tctgaccctt ttcttaagct catcagaaag tggcaattga 120 gacagtgacc atctgtcttt tgtttatctg aaagacaaag gtgttaagag cccagaactt 180 gcttcttgat ccaatcaact tttgacatcg tctcaaactc agagatacac atggatcagt 240 aaacttgcat ttcacaaaag agacacaaat ggaaaatttc acttcatata cctgtatatt 300 caaatgatca tgtatatatt taatatgcac tgcttgaaat acttcactgt atgaaacttt 360 gcttattaag gtttatatat gaatgcttgg gtaaagatca acattacata gcgatgttta 420 gactctatct attgcaaata atgccttggg taaattagat gtggtttgga aagtgaatta 480 tgcataggaa aatatttaag actcttaaat attataatag ttaaccagtg gactatgact 540 atgcaaatgc aggccacatt atggatgaca cctcccagtg accaactcac aggatcatcc 600 aatcacaatg ctggatttga aaggtgacat ccttcaatcc cgccttccag agaaagtatt 660 aaagacattg tcttaaacaa agaaggctgt taaaaagtct gttaagtgtg ggtcctaaag 720 agtgtgtcca gaagagtgtg tctgttaaga aaggtggtcg aagagtggtc aaagtgtggt 780 tcatggtggt ttgcggggtt ttgtcctaga catcagaact tttgctgcgt gttccagcaa 840 ggatttggta attggattca cagtgctttt tccaaggcgt ctgaaaccat cgcatcaaga 900 ccaccacctc caaaaatctg cgtgtcccag atttccagga actctacctt caaagaaaag 960 gcatcggcgt gttcccgaca tcgaaccacc acctccagaa atctccaatt tctgcgtgtt 1020 ccagaagata cagatttcta cgctcaagga taaagacgtc ggcgtgtccc cgacatcaga 1080 ccaccacctc cagaaatctc caatttctgc gtgttccaga agatacagat ttctacgctc 1140 aaggataaag acgtcggcgt gttcccgaca tcgaaccacc acttctagaa atctccagtc 1200 tgcgtgttcc agaaactaca gatttcttcc actcaaagct caagagagcc aagtctgctc 1260 gtttcatgct tcaagatctg caggaagatc caacttcttc catccactgc atcaaaggca 1320 gaaacgtaaa tattccactt tccaaccctt taaattagac cttggcatag tgtgtttcag 1380 tataaaatat tctaattaat tagatgttta taactgatat tcattaataa caaaaacttc 1440 ctcagttctt tgttattgta aaataaggtt taccttacct taacttaatc accttcatca 1500 ctcgcctatg cacttcattt attgtacatt tagtaattgt agttaccctt gattactatt 1560 ttgttaataa atacacattt attacaattg tgtcttcctt gtgttgataa tacagaaaaa 1620 ggctctacaa gtcaaacgat ctcattatcc ttcagatttg gccagataat aaactcctca 1680 tttgtataat ttaactaaca atattttatt gttaattatt ttgacagtgg aaagctgtct 1740 ggtgccccgt ttaaaaggag ttatttatct gttatttcta cattaattgg tgcccccacg 1800 tgaggctaaa atatcaatat taatatcaat acctaaatat tgattttaat atttgataaa 1860 tccaattatc actaca 1876 // ID Gypsy-150-LTRb_DR repbase; DNA; ZEB; 960 BP. XX AC . XX DT 09-APR-2009 (Rel. 14.04, Created) DT 09-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-150-LTRb_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-960 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(4), 864-864 (2009). XX DR [1] (Consensus) XX SQ Sequence 960 BP; 191 A; 212 C; 174 G; 383 T; 0 other; tgttacattc ctgtacattt ctgtgttgcg taattattat ttatatatat tttttttctc 60 gctagtctct ttctttctct ctctctctct ctctctctct ctctctctct ctccgtgctc 120 gcttgctctc tacctctctc ccgcttccac ctatacagtg gctccgattg gctaccgctg 180 cccagctgtt tgccgtgccc cccttgacgt catccaaggt ctccaatcca gccgctgcca 240 ccgtggcata aaagtggtga caggacgcta ctctggaggc cctgaatttc tgtgttgtgc 300 tgtgtgtctg tgttttttga tatatgtcta ctgtgtgtta caatttggta tctgtccgtt 360 atttgttctc catttgtcta ccgtcgatgt catgcgcatc tgaggttgga gagtgggaca 420 ggtaagaatg tcgtgcgaca tacattgtgc acaagtagtt tgaatatctc tgtattagac 480 acactaggta aggggcgtgc tccacccact gtactttttg ttttcccatc agagatcagc 540 gtaggttagg tagtgcgtat tagacgcgaa gatttctttt caataacttc tgcattttta 600 ttttggatag atagggtaga gattgggcca aatagattgc ttttgtttta tttattttta 660 ttgtttggat gccgcccgcg tctcatttct ccaactctcc catatctggt caaacatctt 720 atgtcatttg ttatctgtga aattcaatgt ctctttgtaa atattcaata atctttctat 780 atacattttt tttggcattc attattatca ctatttttgt aaataaaatc actgtattgc 840 atttagctgg tcttggtttt tgtccctcat ttattgatgt tgtgtggttg cttatgtgtt 900 tcacccaaaa attaatgtta ctcctcttcc cctttattta acatctttta aatcgtaaca 960 // ID DNA-8-12_DR repbase; DNA; ZEB; 731 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.11, Created) DT 17-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-731 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1727-1727 (2008). XX DR [1] (Consensus) XX CC TIR is 328-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 731 BP; 263 A; 106 C; 105 G; 257 T; 0 other; ggtaacactt tacaataaca gtacatgaat aatgatgtat taatatgtaa actaaacatt 60 actttattat gaattaatga tgagttaagg tatgcactaa tcatgaatta actttaagta 120 caacatgagt cacatgagtt catgtgtaaa taacgactac cttaattact tgttagtaca 180 tgttgttaat taatgtatta attaacattt aagtttaggc taaatgtaac caacatgaac 240 tcatgagctg ttaatgtata gttatgtcat gactttactt ggaggggcac atcactatta 300 actcatcctt aactacttat gaactcctct tcatgttgat gttgacagaa cacttttcta 360 ttgttattca gtgtaaactc actagttgga cgtgcataag gagttcatga gtagttaaga 420 atcggttaat gatgatgtgc ccctccaagt aaagtcacaa cataattata cattaacata 480 tcatgagttc atcatggtta aattaagcat aaactaatgt ggtaattaat acattaatta 540 acaaacaatc atgaactaac aagtaattaa ggtagccatt attcacacat gaactcatgt 600 gaatcatgtt gtagttaaag ttaattcata attagtgcat accttaactc atcattaatt 660 cataataaag taatgtttag tttacatatt aatacatcat tattcatgta ctgttattgt 720 aaagtgttac c 731 // ID SINE3-1a repbase; DNA; ZEB; 570 BP. XX AC . XX DT 22-OCT-2004 (Rel. 9.09, Created) DT 05-FEB-2007 (Rel. 12.03, Last updated, Version 2) XX DE SINE3-1a is a SINE retrotransposon - a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; CR1; 5S rRNA; KW polIII; SINE3-1; SINE3-1a; conserved; CNE. XX NM SINE3-1a. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Kapitonov V.V. and Jurka J.; RT "SINE3-1, a novel class of SINE elements that utilize 5S rRNA."; RL Repbase Reports 2(6), 22-22 (2002). XX RN [2] RA Kapitonov V.V. and Jurka J.; RT "A novel class of SINE elements derived from 5S rRNA."; RL Mol. Biol. Evol 20(5), 694-702 (2003). XX RN [3] RP 1-570 RA Jurka J.; RT "Direct submission."; RL Direct submission (September 2004). XX DR [3] (Consensus) XX CC SINE3-1a is a consensus sequence of a SINE3-1 subfamily. The CC SINE3-1a and SINE3-1 consensus sequences are 84% identical to CC each other. XX SQ Sequence 570 BP; 124 A; 149 C; 155 G; 141 T; 1 other; cagctagctc tctgcaactc tcacatggtc gcccactgaa gctaagcagg gctgcgcccg 60 gtcagtacct ggatgggaga ccacatggga aagctaggtt gctgccggaa gtggtgttag 120 tgaggccagc agggggcgcc caacctgcgg tctgtgtggg tcctaatgcc ccagtatagt 180 gacggggacn ctatactgct cagtgagcgc cgtctttcgg atgagacgtt aaaccgaggt 240 cctgactctc tgtggtcgtt aaaaatccca ggatgtcctt cgaaaagagt aggggtttaa 300 ccccggcatc ctggccaaat ctgcccactg gcctctgtcc atcatggcct cctaaccatc 360 cccatatcta attggcttca tcactgtctc ctctccacca atcagctggt gtgtggtgtg 420 cggtctggcg caaaatggct gccgtcgcgt catccaggtg gatgctgcac actggtggtg 480 gatgaggaga ttccccccaa tgtgtaaagc gctttgagtg cccagaaaag cgctatataa 540 atgtaaggaa ttattattat tattattatt 570 // ID TDR5 repbase; DNA; ZEB; 531 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 20-FEB-2009 (Rel. 14.03, Last updated, Version 2) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; TDR5. XX NM TDR5. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-531 RA Jurka J. and Drazkiewicz A.; RT "TDR5: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 33-33 (2002). XX DR [1] (Consensus) XX CC Contains 13 bp TIRs. The 5'-end is simiar to that of EnSpm-14_HM, CC therefore this element is classified as EnSpm element. XX SQ Sequence 531 BP; 155 A; 109 C; 113 G; 150 T; 4 other; cccagcaggc acacaacatc ataagacgtt aatattaggt tagatttagg ttgtgatgtc 60 aggtgaccaa aattcaatgt ctagccagca tctaaggaca atgttatttt gacgtccaat 120 aacaacgtca aatgacgttg atatttggtt gattttaggt tgtgttggaa agtgaccaaa 180 atccaacgtc gagccaacat cttaaaccaa cgtcatattg acgtcaaata ctgacattta 240 ttcgtcaggt atggcaacca aaatccaacg tctgatagac gtcatagtgg taatgtccac 300 acaacgtcaa gctgtaacat cattagacgt tgatatttgg ttgattttag gttgtgttgg 360 aaagtgacca aaatgcaang tcngtccgac gttggacatt gacgtcagcc tgatgttggg 420 ttctgacgtc aacccgattt tcatttccaa acaaaatgca acgtcccacg acnttggggt 480 aatgtccaca acgtcaatct gacgtcatgt tgacntcctg tgcctgctgg g 531 // ID DIRS-N3_DR repbase; DNA; ZEB; 6237 BP. XX AC . XX DT 26-FEB-2009 (Rel. 14.02, Created) DT 26-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; phage integrase; DIRS-N3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6237 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotranspsons in zebrafish."; RL Repbase Reports 9(2), 542-542 (2009). XX DR [1] (Consensus) XX SQ Sequence 6237 BP; 1624 A; 1798 C; 1246 G; 1566 T; 3 other; gtgatgtttg taaacaaatt tcgggaggag catgcgcaga agtttcagta tcgaatacgt 60 cattgacaat tatcccatca tcaaatcagt ttctgagcaa cgcctatata taccaaagct 120 gtcttacctg cagcatctca cgacttcagc attggcacat ccaatcgctg tagcctcgtc 180 caaacagcaa atttccatga ggatgacacg gctcctcttg caaactccaa gccgacgtcc 240 agcgcatcaa cactcggcag agcatatcaa cagagtccgc ggtcttaact taggcccgcc 300 acgccgccgc tctttcagac accggcaaca aacctgagcc gctccccagt tggaaaccac 360 cggcccagat caagcgagtg ccaggccacc gctctacagg cgaagcgcca ccatcggcca 420 attcgccatg acagcgtgag cacgtctccg ctacgggctc atccaccccc cacgcacgga 480 acatttccac ttagtagcgc ccgcaatcgc tgctctcggc gccccaaatt taggaaaaac 540 aagccagtcc gctcgtgagt gtgacgtcac gcgaagcggc ttccgggtcc aagcgctcta 600 ttcaactgca tggggagacc catgaaatgg taattataac gtttacaaag cgatttatta 660 ctttcgaaaa tcatgatcac aatatatatg tctatgcctc atacatgatg caagtgatta 720 attttataaa ttgttaaaaa tgtttggtat ttgttatgca gcaagcccaa aaatggttgt 780 gtacaacatg actctgtata aaactaactt atcgtgtaat atgtttaaaa ttgatcataa 840 acaaataatt tcttaactta ctgcatgggg agactcatga aatgttaatt ataatgttta 900 caaagcaatt tattactttc gaaatcatta tcacaacggg gtatatgcca aagcatgcta 960 ataggactct taatttccca gtaacctggg ctaagcactt cgggcgaatt gaatgccaat 1020 gcaaagccag tgcgtttaag gtctgttttc tttaaaaatg agcgatctcc actagctgag 1080 tgatatcgcc atatctttat aatatgagca tttatttgac cccagtatgt ttaatggagc 1140 ttatgcgtta gcctgccgat tctgacgacc gcgtgcgagt aaacagcttc tgtctaattt 1200 tacgcttgag caaccaaata aatgctaaag ttaatttaac tgaatgtatt ttaatgacat 1260 tacaacatcg atgctgtata aggaactgta caaaacggaa aaagttcaca ataacaggtc 1320 actggaagtg tatcagcatg ggaaaagcag tagctcagca tataccctat acatacgtcc 1380 acaccttata tctgacgggc agaaagtgat taatttttat aaattgctta ttgttttttg 1440 gtaattgtta tgcagcaagc aatgcttgtg tacactgatt ttgtataaaa ttaactttat 1500 tgtataaaat taactttatt gtgtgatgag aatttaaagt gatcataaac gaagaaattt 1560 ctcaactcaa atgagtggcg gcttggaccc ggaaacagta ttacttgcgt caccaatagg 1620 tcaccactta acaagcggag tcttccttca tgccgcttgc agattaacca ggtttgttca 1680 ggccctgatg caagttccag cgtgaggctg cctccgctga tgtcaccaac gccggcagcc 1740 ccttctctct tctcttttcc ttccacctgt gatcctaggg ccatcagtag cgcgagcacg 1800 cctccgcaat cggctctagc ccccaacttt cttcaggccc cgacgcaagt tccagcgtga 1860 ggccgcctcc gctgatgtca ctagcgctgg tagctctcct ctcttctctt ttccttccac 1920 ccgtgatcct agggtcatca gtagcgcgag catgcctccg caaacggccc tagtccccaa 1980 ctttctgtca ggccccgacg taagtgccag cgcgaggccg cctccgctga cgtcacttac 2040 gccggcagcc cctcttctct tttcctccca cctgtgattc tagggtcatc agtagcgtga 2100 gcacgcctcc gcaaacggct ctaatcccca actttctgtc gggccccgac gtaagtgcca 2160 gcgtgaggct gcctccgctg acgtcactta cgccggcagt ccctcttctc tttccctccc 2220 acctgtgatt ctagggtcat cagtagcgtg agcacgcctc cgcgaatggc tctagtccct 2280 aaatttctgt cagaccccga cgtaagtgcc agcgtgaggc cgcctccgct gacgtcactt 2340 acgccggcag cccctcttct cttttcctcc cacctgtgat tctagggtca tcagtagcgt 2400 gagcacgcct ccgcaaacag ctctaatccc caactttctg tcgggccccg acgtaagtgc 2460 cagcgtgagg ctgcctccgc tgacgtcact tacgccggta gtccctcttc tctttccctc 2520 ccacctgtga ttctagggtc atcagtagcg tgagcatgcc tccgcaaatg gctctagtcc 2580 ctaattttct gtcagacccc gacgtaagtg ccagcgtgag gctgcctccg ctgacgtcac 2640 ttacgccggc agcccctctt ctcttttcct cccacctgtg attctaggtt catcagtagc 2700 gtgagcaygc ctccgcaaac ggctctagtc cctgattttt ctgtcaggcc ccgacgtaag 2760 tgccagcgtg aggctgcctc cgctgacgtc actaacgccg gcagcccctc ttctcttttc 2820 ctcccacctg tgatcctggg ccacgtatag actatattcc ttagattcat cgtctaggta 2880 cttaacatgt ataaacttaa gtacgttatt aacgtttaca tagacaacac gcgtctataa 2940 aataaattgc ttaattaaat aaatttgata agtaattctg gagtaataat aacaccaagc 3000 tgataagcaa cttcagctca gaaatcatcc agaatcccat tactagctta ccccatctat 3060 tttcactgcg atttcaagtc gcaattgcac tcagtataaa ccatggaact ccattcaaaa 3120 cttcaggtac tgctcaatgc tccttgcact accttaagtt tcctttaatc ctatcaatta 3180 agttttccac agaaattaaa cctaaaagtc agtggaaatg cgctatcgtt gccccctttt 3240 ttaccatttc aacctctaaa cctttcctaa ccctggtcat taaccccttc aaggagttct 3300 attataaggt tgctgctgca cagcgaccgc tctcattggg gatgcacacc ctattacgta 3360 ctactgcaag cacagtggct gcttccactc tagacacacg ccttattatg ctccactgca 3420 agcgcagtga ctgctcctat ggtagacgta cacatcatta ttatgcacca ctgctaacgc 3480 agtgaccgct ctcactggag acgcacacct tatttatcat gcaccactgc aagcgcagtg 3540 actgctccta tgggagaagc acaccttatt tatcatgtgc caccgcaagt gcagtgacag 3600 ctcccactag agacgtacac ctcatcaagc atcactgcta gtccagcgac cgctcccacc 3660 ggggaagccc accttattat tatgcaccac tacaagtgta gtaaccgctc ctactggaga 3720 cgtacacttt attatgcacc actgcaagtg cagtgaccgc tcctacggga gacacacact 3780 tgattayaca ctactgctag tgcagtgacc actcctacgg gagacgcata tcttattatg 3840 caccaccgct aatgcagtga ccgctcctat gggagacgta caccttataa ttatgcatca 3900 ccgcaagtgc agtgaccgct cctatgggag acgtgcaagt gcagtgactg ctcctacggg 3960 agacgcacac cttataatta tgcaccactg caagcgcagt gactgctcct atgggagatg 4020 tacaccttat tacacactgc cgctagggca gcgaccactc ataccccata caggagacac 4080 acaccttatt atcatgcacc actgttagta cagtgaccgc tcctacggga gacgtacacc 4140 tcataattat gcaccactgc aagtgcagtg accgctctta cgggagacgc acatcttata 4200 attatgcacc actgcaagcg cagtgactgc tcctacggga gacgtacacc ttattacaaa 4260 ctgcctctag ggcagtgacc actcatacct cattcaggag acacacacct tactattatg 4320 caccactgtt agtacagtaa ccgctcctac gggagacgta caccttataa ttatgcacca 4380 ctgcaagcgc agtgaccgct cctacgggag acgtacacct cataactatg caccactgca 4440 agtgcagtga ctgctcctac gggagacgta caccttataa ttatgcacca ctgcaagcgc 4500 agtgactgct cctacgggag atgtacacct tattacacac tgcctctagg gcagtgacca 4560 ctcatacccc atacaggaga cacacacctt attatcatgc accactgtta gtacagtgac 4620 cgctcctacg ggagacgtac accttataat tatgyaccac tgttagtaca gtgaccgctc 4680 ctacgggaga cgtacacctc ataattatgc accactgcaa atgcagtgac tgctcctacg 4740 ggagacgcac atcttataat tatgcaccac tgcaagcgca gtgactgctc ctacaggaga 4800 catacacctt attacacact gcctctaggg cagtaaccac tcatacctca tacaggagac 4860 acacacctta ctattatgca ccactgttag tacagtgacc gctcctacgg gagacgtaca 4920 ccttataatt atgcaccact gcaagcgcag tgaccgctcc tacgggagac gtacacctca 4980 taactatgca ccactgcaag tgcagtgact gctcctacgg gagacgtaca ccttataatt 5040 atgcaccact gcaagcgcag tgaccgctcc tacgggagac gtacacctca taactataca 5100 ccactgcaag tgcagtgact gctcctacgg gagacgtaca ccttataatt atgcaccact 5160 gcaagcgcag tgaccgctcc tacgggagac gtacacctca taactatgca ccactgcaag 5220 tgcagtgact gctcctacgg gagacgtaca ccttataatt atgcaccact gcaagtgcag 5280 tgaccgctcc tacgggagac gtacacctta taattatgca ccactgcaag cgcagtgact 5340 gctcctacgg gagacgtaca ccttatgcac cactgcaagc gcagtgacca ctcctatgag 5400 agactcatac ctcatcatac accactgtca gcacagtgac agctattacg gggatgcata 5460 ctttaacaca caacattgct aatgcagtga ccactcccac ggaagacgcc cgtctcctgt 5520 gggagcagac actcttgcta gagacattca tcatgcacca ctatcaacac agcaactgct 5580 tcctcccgaa acacatacct tataactcaa cagcacccct ccaaattttt aagatcagca 5640 aattttgggg ggcttgcaca ttgtctggct gctgtcctgt aacaaatagc gttttttggg 5700 gagcgctctg gggccgggct agatacttcg ctcgaatccc aactcctctt tgtttcctta 5760 taaggggaat aactcgagtt ggggtgtctc ctcgagctca gagccctccc cccggacagc 5820 acgccaaata cgctttatac ttaaacgaat gcaagtgtga actcgtgaag tgatgtttgt 5880 aaacaaattt cgggaggagc atgcgcagaa gtttcagtat cgaatacgtc attgacaatt 5940 atcccatcat caaatcagtt tctgagcaac gcctatatat accaaagctg tcttacctgc 6000 agcatctcac gacttcagca tccctccacc accccgtcac ctcacctctt gcatttcatt 6060 cccgggggga gcgctctggg gccgggctag atacttcgct cgaatcccaa ctcctctttg 6120 tttccttata aggggaataa ctcgagttgg ggtgtctcct cgagctcaga gccctccccc 6180 cggacagcac gccaaatacg ctttatactt aaacgaatgc aagtgtgaac tcgtgaa 6237 // ID Sat-5_DR repbase; DNA; ZEB; 196 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Satellite-like DNA - consensus. XX KW Satellite; Simple Repeat; Sat-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-120 RA Jurka J.; RT "Satellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2027-2027 (2010). XX DR [1] (Consensus) XX SQ Sequence 196 BP; 88 A; 32 C; 42 G; 34 T; 0 other; tgaagtgtgc acgcaaaaca tgcgacgcaa gctaaacatg tgtcgcaaaa atagaaacgg 60 tgtctaaaat agaaatgtca cgaaaaaaat cgaaaatgtg aagtgtgcac gcaaaacatg 120 cgacgcaagc taaacatgtg tcgcaaaaat agaaacggtg tctaaaatag aaatgtcacg 180 aaaaaaatcg aaaatg 196 // ID DNA15TA1_DR repbase; DNA; ZEB; 442 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE DNA15TA1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA15TA1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-442 RA Kapitonov V.V. and Jurka J.; RT "DNA15TA1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(5), 7-7 (2002). XX DR [1] (Consensus) XX CC This element is characterized by 15-bp terminal inverted repeats CC and CC the TA targets site duplications. CC Its classification is not certain yet, although it CC is expected to be a member of the Mariner/Tc1 superfamily. CC There are ~3000 copies of DNA15TA1_DR present in the zebrafish CC genome; they are ~91% identical with the consensus sequence. XX SQ Sequence 442 BP; 127 A; 65 C; 76 G; 174 T; 0 other; cactgtaaaa cccaacagtc aactttatca aatgaaatga gtgtagttaa ctcaaaattt 60 actgaaagtt aattctactc atttgaaaag agttttgaac tcagtgttga aggtaatgag 120 ttaattaaat acctcattac ttcaacttaa atggagtaag ttcacagtac tcatatagat 180 tagttttttt tttaactcaa atggtttgta gcaatcggtt tcctcaaacg gtttgagttg 240 ccttaactta ttgggtttta cagtactcag ttggtttgag ttctcttcat ttattgggtt 300 ttactgtgct caaattgctt catttactca aatggattaa gttcacagta ctcattagga 360 ttagtttttg aacttaaatg gtttgttgca atcggtttcc tcaaatggtt tgagttacct 420 taactttttg ggttttacag tg 442 // ID EnSpm-N8_DR repbase; DNA; ZEB; 662 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 04-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-N8_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-662 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 9(2), 539-539 (2009). XX DR [1] (Consensus) XX SQ Sequence 662 BP; 253 A; 89 C; 108 G; 211 T; 1 other; cactgttaga ctttgctgca attttacagt tatttactgc aaaaacgccc agtaaaatac 60 cacatacatt ctttacagtt cactactgta atatgcactg cattgtggaa tttttaaaaa 120 cttttacagt aacttactgc atattaggaa tttgcggtat tctactgtaa tcaaagtaat 180 caagtactgc tactgtagtt gaagacaaaa atgaaaatta attaacatct tgtatcacaa 240 aatgtattta aaatagattg atgaatgaaa attacttttg taaaccattt aaaatggata 300 taataacaat gaataataaa aatactgttg tacatgatct cgtcacttgt ttggaattta 360 ttaaattcaa tagtaaatac taaaaataaa ggttttgtac tgctagtgta aaatgtaaat 420 aaattacagt aaaagtactg taaaaattgt catactgtaa aattaaaatg tggtaaaggg 480 cattttacca taaaaagaag ttgcggtaag gctattttac tgtaaaatga aattgcggta 540 gggccctgct actgtaaaac acatttacag gtaagttact gtaaaggggc agttgcggta 600 aattgctggc aacagtgctg ccagtaagtt acygcaaaaa tacaataaaa tgtctaacag 660 tg 662 // ID BEL14-LTR_DR repbase; DNA; ZEB; 617 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL14-I_DR; KW BEL14-LTR_DR; BEL14_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-617 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1378-1378 (2008). XX DR Genome; chr14; Positions 9438001 9437385. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 617 BP; 172 A; 90 C; 127 G; 228 T; 0 other; tgttatgtgt gatactagtt taagggtgct ggctccttta agagtcactt gctggtgatg 60 tcatcagtat aagaccagca tcagtctgtt gcatgcagga aagcagcctg ctacatttct 120 tgttttacgg aagcatcgtt ggtatgtaat cttatgttaa agtataattc acagttattt 180 gatagtattt tgtgttaagt acagactttt agatgaatct ttattcagac atgcgtgttg 240 tgatatgtca attgtgttat gctctgatga gtgaaatagg ttgctgttac atacagttta 300 gcatttattt aatccaacat gatacagttt agtcagtata cagtttaaaa taagagatat 360 taggtgttac agtattttgt ttgcataaat gtatacaccc tattattcat gtgtgcttat 420 cacaatagga atgcatatgt gaaatatttc ctttctttag tgtgtctaag ttcaattttg 480 ctctttttta gttttacacc tacttcctgt caccaaataa actaaataaa gtgagcagag 540 tatccgcttc agcgtggtga ttggtagagg agttatctgt ctgctaacat atgcaaggag 600 cattgggtgg cagaaca 617 // ID DNA-8-21_DR repbase; DNA; ZEB; 1284 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 02-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-21_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1284 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2144-2144 (2008). XX DR [1] (Consensus) XX CC Copies are ~90% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1284 BP; 443 A; 245 C; 212 G; 383 T; 1 other; tagggatggg taccgaaact ctgtacttta tcggtatcgg tgctacatta taaaagaccg 60 aagtatcgat aagctctgac gttaacggtt ctgctatcgg tactggagaa ttattgctga 120 ataaacatta cttacagtag cataatttaa ctgaccaacc ttttctaatt tgcgatattt 180 gatattcgtc tgcacagttg gcaatctcac atgagaaatg cttgtgtttc cggcgagcgc 240 gtcaagtata aaagccttca cagttctcgg ctgtgcgcgc gcactcagcg gaggctcgcg 300 ctaagcgcac acacatagct cgcgacacag actcgcgcac acacagctcg taagcttgcg 360 gagtgaacca aacgggttgt atttcccgag tggacagcga caatgcccgt tgcaacaaac 420 gcaacaagtt gttttcttgt ataagcggaa acgcaagcaa tctggctaag tgaaagtatc 480 tttcaaaagt gcattaactg caatgaaaaa gtaatgaaaa gtttttgact gcctagctac 540 taacggtaac gttacatcat ccacatcgtt atgccagcag tcaatcacct aaattagtat 600 aatattaata gttaaataaa cacacacatt cggttacaat gaaaacagta ttttttctgc 660 agtagcctaa ataaatctta aacattaaaa atgcttttac atcagctgta tttacaaata 720 tagcctatga atagcctaat aataaactat gcttattaag aaattattta aaaaactgct 780 tatttttaca ttgcaagatg ctccaaaact gtattaaaaa cacagttaac tcccaaaaca 840 ctgttaactc atttatgtga atgcgttaat gaaaaatcat aaacattatc tcatttaact 900 gcattttgca actcaaagag ggtattttca actgcaggga gcacaataaa ccaagaaaga 960 tcccgacttc tgccagaaaa ggcaaacatg ttgatttttc tccagaagaa tggttaaaaa 1020 ctgagtcatt ttccttcaaa ccattgtaca ttcttttaca ggaaaaaaat atttatttgt 1080 tttacaatta ttggttttgt tttattttac attaaaaacr taataataaa ataataaaag 1140 tgttgttaat tctgttcaat ttgttttttt ttttcataaa acactacaaa aagtaccgat 1200 aagagaaccg ttaaagtacc gaaccgataa gtggtatcga taaaagtagt aataccgtta 1260 aaaccttaac gatacccatc ccta 1284 // ID Gypsy118-I_DR repbase; DNA; ZEB; 5179 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy118-I_DR; KW Gypsy118-LTR_DR; Gypsy118_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5179 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1523-1523 (2008). XX DR Genome; chr20; Positions 25869212 25874390. XX CC Positions [4001-4477] - Integrase core CC 'TGTGTG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 48..1019 FT /product="Gypsy118-I_DR_2p" FT /translation="MSTTQQLDQSPSTSGEQQQGADSSPNIAQNQAMLMFP FT LMMGTPWCQKFSGEKTAQENNFQDWHNMQKSMFDMYPFSEAQKVSALISNL FT DGEAKREVLALPIAERVTSGQILTFLKNNYGDTVPLATLRSQFFTRKQRTD FT ESVRQFALVLQELSNRLESRGDHTTGGYVLRDQFILGLFDAGLRRELQGMV FT RASSDIAFVDIKREAILQADAFGEVRQAVAATVQMKPTLDTTKLVEEIKQE FT VFLQLKGEIKEIVAGMMSEVREGVQNMWPKSEGVRPRYELMGGRRSSDQFD FT HQGRPLCRRCKQVGHIERRCRTVEGPQGPLNL" FT CDS 1223..5137 FT /product="Gypsy118-I_DR_1p" FT /translation="MYPPIQKFNLVAANGLVIPYIGYIEVDLNICGKVIPK FT RGVLIVKDSHGSSYPGLIGMNVIKECREFLMTKTGDDYLGSESEVQVKVWR FT QMFSVCEDLLHFADESGQISSIYWPQGRKKIIPPQSEMVIEACARVRPDKR FT DYVGLVEPCPDVLPGQLLVARSLSTVSKGRVMVRILNLSPVAVILSQRCRI FT GQLSDVELCSSGLQGNVELRSLDDDTVEVSVRQNRVQDDEQVQMPWPVAVP FT LDPQQFSLKQQEMISGLLAKHQKVFSEHEEDYGFTDRVLHVIPTGDAAPIR FT ERYNRIPPKLYQEVRGLLNNMLENGIIRESYSPWAAPVVLVRKKDGTLRFC FT VDYRCLNGVTHKDSYPLPRIEESLASLKKSTVFSTLDLAHGYWQVGVHPAD FT KEKTAFVTPMGLYEFNRMPMGLCDAPGTFQRLVESCLGDQNFETVLLYLDD FT IIIFSSSFEAHIDHLDLVFSRLGDFNLKVKPTKCSLFQSKVHYLGHVISEG FT GVSPDPGKLQAVETWAAPQNITQLRAFLGLVGYYRRFIKDFAKIAAPLNSM FT LWGTSKRRTQPAAGKAKRWAWGEMQESAFKKLKESLLQAPILAYADFQLPF FT VLYTDACQTGLGAVLAQVQEGKERVIAYASRSLRPTERNDKNYSSFKLELL FT ALKWAIAEKFRDYLAVSPFTVFTDNNPLAHLNTATLGAVEQRWMAQLAGFQ FT FDVRYRPGRTNGNADALSRWPGKETDGQSIEDEAVPCGEPGYVPSEVVHAC FT VHAFVPVVKDQEKEPSKEREPQLTQEECVGPSLLDDWVRAQRADPVQSRLW FT MYMDRGRGPGPRERAAEDRPTQVVLRQWGRLGTDRGLLCREVCDPKTFDKL FT KQVLVPQSLQNQVLEWVHDRAGHLGVEKTLGLARRRFFWPGMTQGVEKWCL FT KCVRCNLHKTPANKVSAPLVSIDSDYPLQIVSVDFLSLEPARSGRCNILVM FT MDHFTRYAVAVPTMDQTAVTTARALWEHFIQQFGGFDQLHSDQGPNFESRV FT IKELCGLYDIRKSHTTPYHPAGNGQCERFNRTLLSMLGTLRDDQKENWDQH FT VAEMVQAYNNTPHPSTGYAPYFLMFGRYAKLPMDIMLGRGDGFSGTVGSWV FT HHHHKRLVTAYSQARKQVQKAQLHQKKGFDRRVRGEPLLIGQRVMVLNKRA FT RGKGKLEDRWERNTYVVISQPNLDIPVYVVKKEGGNCDERVLHRNMLSPCK FT FEVTSHSENEVGEQAPALQVDPLNSNWCMYPWVGGCLTDLPAPVRENERTV FT MDNAETTAQTIQREGLSSPPDVSVEGLRRSARATKGLLPQRYRI" XX SQ Sequence 5179 BP; 1499 A; 938 C; 1403 G; 1339 T; 0 other; acttggcgta gttttggcag tctggacttt gggcaataat caggaaaatg tcaaccacac 60 aacagttaga ccagtcacct tctacatctg gtgagcagca gcagggggct gatagctcac 120 ctaatatagc ccagaatcaa gccatgctta tgtttccttt gatgatgggg acaccatggt 180 gtcaaaaatt tagtggggag aaaacggcac aggagaataa ttttcaggat tggcataata 240 tgcaaaaatc aatgtttgat atgtaccctt tttcagaggc tcaaaaagtt tcagctttaa 300 taagtaattt agatggggaa gcgaagaggg aagtgttagc tttgccgata gcagagcgtg 360 taacatcagg acagatattg acatttttga aaaataatta tggggataca gtccctcttg 420 caacccttcg ttcccaattt tttactcgta aacaaaggac cgatgagagt gtgagacagt 480 ttgcattagt acttcaggaa ttaagtaatc ggttggagag taggggagat cacactactg 540 gaggctatgt gttacgggat cagtttattt tgggactttt tgatgccggg ctgcggaggg 600 aacttcaggg tatggtaaga gcatcgtcag atattgcttt tgttgatatt aaacgagaag 660 ccatcctaca agcagatgcg tttggagagg taaggcaagc agtagctgca accgtacaaa 720 tgaaaccaac tctagataca accaaattgg tagaggaaat taaacaggag gtatttttgc 780 agctgaaagg agaaataaaa gaaatagtag ctggaatgat gagtgaagtc cgtgaaggag 840 tacagaatat gtggcctaag agtgagggtg ttagaccgag gtatgagttg atggggggtc 900 gaagaagttc tgatcaattc gaccatcagg gaagacctct ttgtcggcgt tgtaaacaag 960 tgggtcatat tgaaagacgt tgtaggactg tagaaggacc acaagggcct ttaaacttat 1020 agagcccact gtacaggccc aaacggtggg tacatcaagt agtggccaat tagaaaagaa 1080 agatcaccta ataggacaat gtccagtcgt gcaggtagaa atacaaggag ttaaaacatc 1140 ttgcttaatt gatactggct cgcaggtaac cacattgact gaaagttatt ttaagaaaca 1200 tttcgagacc gaagggccaa cgatgtatcc acccattcaa aaatttaacc tagtggctgc 1260 taatgggtta gtcatcccat atattggcta cattgaggtg gatttaaata tttgtggcaa 1320 agtcatacca aaaaggggcg tgcttattgt caaagattca cacggatcat catatccagg 1380 gttgattggc atgaatgtaa ttaaggaatg tagagaattt ttaatgacaa aaactggaga 1440 tgattaccta ggctcagaaa gtgaagtaca agttaaagta tggcgccaga tgttttctgt 1500 gtgtgaagat ttattacatt ttgctgacga gtcggggcag ataagttcaa tttactggcc 1560 ccagggaagg aaaaaaataa taccaccaca gtcagagatg gtgatcgagg catgtgcacg 1620 ggttcggcct gataagaggg attatgtggg tttggtagaa ccatgtccag atgtattgcc 1680 agggcagtta ttggtagccc gatcgttgag tacagtttca aagggaaggg ttatggttag 1740 aattcttaat ctgtccccag tggctgtgat cctctcccaa cgctgccgga ttggacagct 1800 ttcagatgtg gagttgtgtt cttctggcct gcaaggaaat gtggaattaa gatcattaga 1860 tgatgacact gtggaggtaa gcgtgagaca aaatagagtg caggatgacg aacaagtgca 1920 gatgccatgg cctgttgctg taccgttaga tccacagcag tttagtctaa aacaacaaga 1980 gatgatttct ggtttgctgg ctaaacatca gaaggttttc tcagagcatg aggaagatta 2040 tggatttaca gatagagtct tgcatgtaat accaacaggt gatgcagccc caattagaga 2100 acgttacaat aggattccac caaaattgta tcaggaagtt aggggacttt taaataacat 2160 gcttgagaat ggtattatta gggagagcta tagcccctgg gcagcacctg tagtgcttgt 2220 cagaaagaag gacggcaccc ttcgtttctg cgtagactac aggtgtctga atggtgttac 2280 acataaagat tcatatcctt tgcctagaat tgaggaatca ctagcctcat tgaaaaaatc 2340 tacagtgttt tcaactttag atttagcaca tgggtattgg caagtggggg tgcacccagc 2400 tgacaaagag aagaccgctt ttgttacccc aatggggcta tatgaattta atcggatgcc 2460 aatggggtta tgcgatgctc caggaacttt tcaaaggctg gttgaaagct gtcttgggga 2520 tcaaaatttt gaaactgtgt tgctttatct ggatgatata attattttct cctcctcttt 2580 tgaggctcac attgaccatc tggacttggt ttttagtcgt cttggggatt ttaacctgaa 2640 agtaaaacca actaagtgtt ctctgtttca gagcaaagtg cattatttgg gccatgtgat 2700 ttcagaagga ggggtgtcac cagacccagg gaagttgcaa gccgtggaga catgggcggc 2760 tcctcagaac attacccagc ttcgggcctt tttgggccta gtgggttatt atagacgttt 2820 tataaaagac tttgcaaaga ttgcagcacc actgaattca atgttatggg ggacatctaa 2880 acgacgaact caaccagctg caggtaaagc aaagcgttgg gcatgggggg agatgcaaga 2940 aagcgccttc aagaaattaa aggagagcct cctgcaagca cccatattag catatgctga 3000 tttccagctg ccatttgtgc tgtatacgga tgcctgtcag acgggtttgg gagcagtact 3060 tgctcaggtt caagaaggga aggagagagt aattgcttat gcaagcagaa gcctacgccc 3120 tacggagagg aacgataaaa actatagctc atttaaatta gaactgctgg ctttaaaatg 3180 ggctatagcg gaaaagttca gggactatct tgctgtgtct ccctttactg tgttcacaga 3240 taataatccg ctggctcacc tgaatacggc tacgctgggg gcagtggagc aacgatggat 3300 ggcccaattg gctggatttc agtttgatgt gcgctataga ccaggacgga caaatggaaa 3360 cgctgatgcc ctctcgagat ggcctgggaa agagacagat ggccagtcta tagaggatga 3420 agcagtgcca tgtggtgagc cgggatatgt accatctgaa gtggttcatg cttgtgtcca 3480 tgcctttgtt ccagtagtta aggaccagga gaaggagcca agcaaagaac gagagccaca 3540 gttgacccag gaggagtgtg tcggacctag tttgttggat gactgggtga gggcacagcg 3600 ggcggatcca gtgcagtcac gattgtggat gtatatggac agaggaagag gccctggacc 3660 aagagaacga gcagcagagg atcgacctac tcaagtggtg ctgcgacagt ggggtcgtct 3720 gggcaccgat cgaggattgt tgtgccgtga ggtatgtgat ccaaagacat ttgacaaact 3780 gaaacaagtt ttggtgcctc aaagtttgca gaaccaagtc ctagaatggg tgcacgatcg 3840 tgccggacac cttggagtgg aaaaaactct cggcttggcc cgtcgccgtt tcttttggcc 3900 gggaatgaca caaggagtgg agaagtggtg tcttaaatgt gttcgatgta atctgcataa 3960 gaccccagct aataaagtga gtgccccgct tgtatccata gattcagatt atccattaca 4020 gattgtgtct gtggacttcc tgtcactgga gcctgcgcgt agtgggaggt gtaacatttt 4080 agttatgatg gaccatttta cccgttatgc tgtggcagtg ccaaccatgg atcaaacagc 4140 ggtgacaact gcaagggcgt tatgggaaca tttcattcag cagtttgggg gatttgatca 4200 gttgcactct gatcagggtc ccaattttga gtctagagtc ataaaggaac tgtgtggatt 4260 atatgacatc agaaagtctc atacaacacc ctatcaccct gcagggaatg gacaatgtga 4320 gcgctttaat cggacattgt tgagcatgct gggtacatta agggatgacc agaaggaaaa 4380 ttgggaccag catgtagcag agatggtaca agcctacaat aatacccccc atccatctac 4440 agggtatgcc ccttactttt tgatgtttgg acgatatgct aagctgccca tggacattat 4500 gttgggtcgg ggagatgggt tttcaggaac agttggcagc tgggtgcatc atcaccataa 4560 aagactggta actgcctata gtcaagcgag aaaacaggtc cagaaggctc aacttcatca 4620 aaagaagggg tttgatcgta gggtaagagg ggaacctttg ctaataggac agagggtaat 4680 ggttcttaat aagagggcca gaggaaaagg taaattggag gataggtggg agaggaatac 4740 atatgtggtt atttctcaac ctaatcttga cataccagtt tatgttgtaa agaaggaagg 4800 gggtaattgt gatgagaggg tgttgcaccg caatatgtta agtccctgca aatttgaggt 4860 aacctcacat tcagagaatg aagtagggga gcaagctcct gctcttcagg ttgatccact 4920 gaacagtaat tggtgtatgt atccttgggt cgggggctgt ctaacagatt tgcctgcacc 4980 tgtaagggaa aatgaaagga ctgttatgga caacgctgaa acaactgctc agactataca 5040 gagagaagga ctctctagcc cacctgatgt atcagtggag ggtttgaggc gatctgcacg 5100 ggccaccaag ggcctgttgc ctcagcgtta cagaatttaa gtggaaatgt gtggacacat 5160 tttgagtaga gggggtgga 5179 // ID CR1-32_DR repbase; DNA; ZEB; 2560 BP. XX AC . XX DT 12-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-32_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2560 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2129-2129 (2008). XX DR [1] (Consensus) XX CC The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 61..2439 FT /product="CR1-32_DR_1p" FT /translation="YSRAVRTDLLLINNLLDSFGLSQCVVGPTHEHGHTLD FT LVLTSGFSVCNLQIADTVFSDHKAVLFNSYFTFNCQADDSSVAVETRVLSA FT STASNFSSAFDMGSSRSLMEFPSSDMGPEELVIQFNSICSNILNSVAPLKI FT KKKKVKPQPWVNSNIRALRQECRRAERKWLKDKLQVSYESLRDCLTTFQRA FT VKAARSNYLSDIISKNCHNSKVLFSTINTILQPVVSPSISSSVDTCEKFLH FT FFADKIVGIRSQISPVVSVSTISVAPLKNLASFIPVSLSQLNDIVAHMKPS FT GSSCDVIPATLFKEAMHSIGPSVLSIVNSSLSTGIVPSSFKHAVIQPLLKK FT ANLDVSDMKNFRPISKLPFISKILEKVVSVQLNEFLANNNIFDKFQSGFRA FT RHSTESALLRVSNDILLEVDSGSPVVLVLLDLTAAFDTVDHDILTDRLREH FT AGIQGLALDWFSSYLKDRTISVSLGNSSSSVMQLMCGVPQGSILGPVLFSL FT YLLPLGKFFEKHGINYHLYADDSQIYFPIKSRDQSSLQALLNCLADVKCWL FT ANNFLQLNEDKCEIIVFGQKGSDSNGNLSLLPGKIHSSVKSLGVIFDSELK FT FDRQINAVVKNSFFYFRYIAKLKSILSFSDLEKVVHAFISSRLDYCNALYL FT GVSQASLSRLQLVQNAAARLLTGTKKREHITPVLVSLHWLPIQFRIQYKVL FT LYVFKSLHGLAPEYISELIIMHQPVRSLRSMDRLYLQVPRSRLKCRGDRAF FT SVAAPRLWNDLPLSLRTAQTFESFQRSLKTHLFSLAFNHVM*" XX SQ Sequence 2560 BP; 653 A; 499 C; 482 G; 924 T; 2 other; ttattgtatt tttcactttt atcttctatt tttctctgat attttaattt tggaagataa 60 tattcacgtg ctgtccgaac agaccttttg ttaattaata atttattaga ctcttttggt 120 ctatcacaat gtgttgtggg cccaacacat gagcatggtc atacycttga cttagtgcta 180 acttctggtt tctctgtctg taatttgcaa attgctgaca ctgttttctc tgaccacaag 240 gctgttctat ttaattctta ttttactttt aattgtcaag ctgatgatag ttctgttgct 300 gttgagactc gtgtgctctc cgcctccact gctagtaatt tttcatctgc ttttgatatg 360 ggttcatcaa gatcacttat ggagtttccg tcttctgata tgggacctga ggagttagtt 420 atccagttta actccatatg ctctaatatt ctgaattctg tagctccatt gaaaattaag 480 aaaaagaaag ttaagcccca gccttgggtg aacagcaata ttagggccct taggcaggag 540 tgtagaagag ctgagcgcaa atggcttaaa gacaaactac aagtttcata tgaatcattg 600 agagattgct tgacaacatt tcaaagggca gtcaaagcag ctagatcaaa ttatttatct 660 gacataattt caaagaactg tcataattca aaagtactgt tctctactat taacaccata 720 cttcagcctg ttgtttctcc atcaatttct tcatctgttg acacctgtga aaaatttctg 780 catttttttg ctgataagat tgttggaatt cgttcacaga tttcacctgt agtcagtgtt 840 tctacaattt ctgttgcccc cttaaaaaat ctagccagct ttattcctgt gtctctgtca 900 caacttaatg acattgttgc acatatgaag ccttctggtt cgtcctgtga tgtcattcct 960 gccaccttat ttaaggaggc gatgcactca attggtccaa gtgtgctttc aattgtaaat 1020 agtagtctct ctactggtat cgttcctagt agttttaaac atgctgttat tcagcccctg 1080 ctaaagaaag cgaatcttga tgtctcagac atgaagaatt ttcgccctat ttctaaattg 1140 ccttttatct caaagatttt agagaaggtt gtgtctgtac aactaaatga gtttctggct 1200 aataataata tttttgataa gtttcagtct ggttttagag ctcgccatag cacagagtcc 1260 gctctattga gggtttcwaa tgatatcttg ttggaggttg attcaggtag ccctgttgtt 1320 cttgtactgt tggatctcac tgctgctttt gatacagtgg atcatgacat tctcactgat 1380 cggcttagag aacacgctgg tatccagggt ctggccttag attggttctc ctcatatctg 1440 aaggatagaa ctatatcagt cagtttgggc aactcctctt catctgtcat gcaattaatg 1500 tgtggggtac cgcaaggctc aatactgggt cctgttttgt tctcccttta cttacttccg 1560 ttaggaaaat tttttgaaaa gcatggcata aattaccatc tatatgctga tgactctcaa 1620 atctacttcc ctataaaatc tagggatcaa tcctcccttc aggctttgct taattgtttg 1680 gctgatgtga agtgttggtt ggcaaataat tttctccaac tgaatgaaga taagtgcgaa 1740 attattgttt ttggccaaaa aggttcagac tctaatggca atctgtcatt gcttcctggg 1800 aaaatacatt cctctgtaaa gagtcttggt gtcatctttg actctgagct aaaatttgat 1860 cggcaaatta atgcggttgt caaaaatagt tttttctatt ttagatatat tgccaagcta 1920 aagtctatcc tctctttcag tgatttggag aaagttgttc atgcttttat ctcttctcgt 1980 ctggactatt gcaacgcatt gtatctgggt gtgagtcagg cctctctctc gcgactgcag 2040 ctagtccaga acgcagctgc caggctttta acaggcacaa aaaaacgaga gcacattaca 2100 cctgtacttg tttccctaca ctggctgcca atccagttta gaattcagta caaagttctg 2160 ttgtatgttt ttaaatccct ccatggcctg gcccctgagt atatttctga gctaattatt 2220 atgcaccaac ctgtgagatc tcttcgctct atggatcgtc tctatttgca agttcctaga 2280 tcacgcttaa aatgtcgagg tgatagagct ttctcggtag cagctccaag actctggaac 2340 gatctcccac tctctttaag aacagctcaa acttttgagt cttttcagag gtcactgaaa 2400 acacacttat tttctttagc ttttaatcat gtgatgtgag tttatgtatt ttggttttat 2460 gtatatttta catgcttttt tatttgtttt tatgtacagc actttggtaa tccattgtga 2520 tttgtgaaag tgctatagaa ataaagtgag ttgagttgag 2560 // ID L1-15_DR repbase; DNA; ZEB; 5524 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 03-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE L1-15_DR is a non-LTR retrotransposon from the L1 clad. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-15_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5524 RA Bao W. and Jurka J.; RT "L1 non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 553-553 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 210..1469 FT /product="L1-15_DR_1p" FT /translation="MASSGDAVCESLTRRHGIKIMCKASVEDCILAVGEVV FT GCSSVVSASRMNNAVVLFLNTIEKTNEIVQKGVVINDELIPVLPLSSPSRK FT IIVSNIPPFVSDACIAKELSSFGKLVSPIRKIALGCKSPMLKHVMSFRRQV FT YMILKDNNDELNLSFNVKVDGFNYTIYATSESIMKCFGCGKIGHLIRSCTS FT GGETGQGDVQGESSAAASVAVDAAAPAADAAVSEERGGGGAVAPTMAEVVA FT GSSAVQTSINSKDDTSSQKATFSTAGTAKESTGEGIVEMEIEPEQIFKVPN FT KRKKVSENKMIKQAKKDKSKADYIISDNDADSSDSSSSVYSFSSQNESESQ FT TQRYTAEGIKKFLRETKFLPNVEAEDYFEDVSAFVKDVKILTREGAFIDTE FT VYRLRKLVGKVKKKNLLNDEKPDLV*" FT CDS 1444..5436 FT /product="L1-15_DR_2p" FT /translation="MMKNQIWCNLFLVKVAVFLIVSFIFKTSFTMRVFNIG FT TLNLNGARDSQKRMCLFEFIKMKRLDVTFIQESHSDNLNEINWKKDWDGQI FT FMSHLAYNSGGVAVLFSKTVSPLSVETEEIVKGRFLKVKVKFEHATMVFLN FT IYAPNKGAERIIFLKKFCDVINNVDSEDLLFLGGDFNCTENDKMDRNHLEP FT HNASQKEMKNLVQVCDLCDVWRNVHLNDRQYTWVHSKDNQISMARLDRFYM FT FKFQINYVKDCRIIPVGFSDHCAVICKIFVNSVKVKSAYWHFNTSLLNDEN FT FKEFFSVFWENFKKQKQMYSSLQQWWDCGKIQIKQLCKEYTFNVTKETMRA FT MKELEKSITELLNTTASTNNSTDFEESKTKKDLLADLLNVKAKGALIRSRF FT QNISEMDAPSKFFFNLEKKNGKSRLIHSLKSETGEDITDSSKIRKYAVNFY FT SSLYKCEVLQRDVENSVFFENLSKVDDESNAMLELPISQDELYTVMMSMEN FT GKSPGIDGIPVDFYKVLWPVIGEDLFLVLNDSLNKGSLPLSCRRAVITLLP FT KKGDLQKIGNWRPVSLLCSDYKILSKALAVRLSKVLDQIIQPNQNYSIPNR FT SIFDNIFLIRDALEVAKLFGINCGLISLDQEKAFDRVEHEYLKKVLKVFGF FT STHFIKMIEVMYSNIQSVLKINGGLSAPFDVQRGVRQGCAMSGMLYSIAIE FT PLLHRIRAELKGFSIPHCNAQVQLSAYADDVIVFITNTDDVRRLECIINNF FT KKISSAKVNWEKSEALLIGDWKRDPPLLPGGLKWKRGFLKYLGVYLGESDF FT VQQNWNGLLEKIQSRLQKWKWLLPQLSYRGRTLIINNLVSSSMWHKLACVD FT PPAGLLPKIQGELVNFFWDNLHWTPQSVLFLPRDEGGQGLVNLVSRGATYR FT LQFVQKLLYGPENLVWRPLAQCILRGINGLNLHASLFLTDNVHLDLTGIPS FT FYKSIFKVWGLFKHSQADSTTSLFWLLEEPVLLGSRLDVTKDDTPGLKELL FT ISNKMTKLRDIVDKAGFGLRDAENVALYMGIRSVRFVTKFLNVLKSFLNEK FT EKILIEGYSQGIIFPNDQDCFPDIRLTPKIENSGINSPLLCFKENGNVDFS FT TVKGKVLYKNIVKLNNKCTLKERSDTVWRDKLCLVEEEKPEWRILYKPPLN FT KRSGDLQWRILHGAIAVNAFVTKINSSVSDQCPFCLERETIFHCFMYCKRL FT APLFDLLNVLVSLLGFLFTKQCFILGFKYKRQNKYKCQLLNFIFGQAKSAI FT YITRKNEIEKRNGKNIVVVFKNLVKSRIIVDFNYYKMMKAINVFKIEWCCD FT DGLCSVINEELFFINELI*" XX SQ Sequence 5524 BP; 1827 A; 748 C; 1145 G; 1803 T; 1 other; tctctctctc tctctctctc tctctctctg tgcgtgtgcg catgtgcctg agcgagtgtg 60 tggtgagcga gtgttgggaa agtgcggtga gtttgaatgg ttctctttct aacttttctt 120 tctttgtttg ttgtttgtat acgtgtgctg ttggttagtg tttgtgagtg tagtgagttt 180 actcccggcg tgttttgttg ccgggaggta tggcgtcgtc aggagacgca gtttgtgaaa 240 gtttgactcg gcgtcatgga attaaaatca tgtgtaaggc cagtgtggaa gattgtattc 300 tagcggttgg cgaagtagtg ggatgttcat cggtggtttc agcttctagg atgaacaatg 360 cggttgtttt gtttttgaac acgatagaaa aaactaatga gattgtgcaa aaaggagtgg 420 tgataaatga tgaactcata cctgtacttc ctcttagtag tccatccagg aaaattattg 480 tgtcaaacat acctccgttt gtttccgatg catgcattgc aaaagaatta tcttcttttg 540 gcaaattagt ttcaccgatc aggaaaatwg ctcttgggtg taaatcaccc atgcttaaac 600 atgtgatgtc ttttagaaga caagtttata tgattctgaa agacaacaat gacgaattaa 660 acctatcgtt taatgtcaaa gtagatgggt tcaattacac catttacgca acatccgaga 720 gcataatgaa gtgcttcggg tgcggaaaaa ttgggcattt aattcgatcg tgtacaagtg 780 gaggtgagac aggtcaggga gacgtacaag gcgagtcttc tgcggctgcg tctgtcgcgg 840 tagatgccgc tgcacctgcg gctgatgctg cggtctctga ggagagaggc gggggaggtg 900 cggtggctcc tactatggca gaggtagtag caggctcgtc tgctgttcaa acctcaatta 960 attctaagga tgacacaagt agccaaaagg caactttttc cacagctgga acagctaagg 1020 aaagtacagg tgaagggata gtggaaatgg aaattgaacc agagcaaatc tttaaagttc 1080 ctaataaaag aaaaaaagtc agtgaaaaca aaatgatcaa acaagcaaaa aaagacaaga 1140 gtaaggctga ttatataata tcagataacg atgctgattc ttctgattcc agttcttcag 1200 tttattcctt ttcatcgcaa aatgaaagtg aaagtcaaac acaaagatat acggctgagg 1260 gaatcaagaa atttctgagg gaaacaaaat ttttaccaaa cgtggaggct gaggattatt 1320 ttgaagatgt ttctgctttc gtgaaagatg taaagattct gactagagaa ggtgcattca 1380 tagacacaga ggtttatcgt ctaaggaaat tagtgggtaa agtaaagaaa aaaaacctcc 1440 ttaatgatga aaaaccagat ttggtgtaat ttgttcttgg ttaaggttgc agtatttctg 1500 attgtttctt ttatttttaa aacatcattt actatgaggg tgtttaacat tggaactctc 1560 aatttaaatg gtgctagaga cagtcagaaa agaatgtgtt tatttgaatt cataaaaatg 1620 aaacgcctag atgtcacgtt tattcaagaa tcacatagtg acaatcttaa tgaaataaac 1680 tggaagaaag attgggatgg tcagattttt atgagccatt tggcttacaa cagtggagga 1740 gtagctgttt tgttttctaa gacagtgtct ccactgtctg ttgaaacaga agaaatagtt 1800 aaaggccgtt ttttgaaagt caaagtcaag tttgaacatg caacaatggt gtttttaaat 1860 atttatgctc ctaataaagg agctgaaaga ataatttttt taaagaagtt ttgtgatgtt 1920 atcaataatg tagattcaga agatttattg tttttaggtg gagattttaa ttgcacagaa 1980 aatgataaaa tggatagaaa tcatttagaa ccacataatg cttcacagaa agaaatgaag 2040 aatttagtac aagtttgtga tttatgtgat gtgtggagaa atgtacattt aaatgataga 2100 caatatactt gggtacatag caaagataac caaatttcta tggcaagact tgacaggttc 2160 tatatgttta agtttcaaat aaattatgtc aaagattgtc gtataattcc tgttgggttt 2220 tctgaccact gtgcggttat ttgtaagatt tttgtcaatt cggtgaaagt aaaaagtgca 2280 tactggcact ttaatacatc acttttaaat gatgaaaatt ttaaagaatt tttttctgtt 2340 ttttgggaaa attttaaaaa acaaaaacaa atgtattctt cactgcaaca atggtgggat 2400 tgcgggaaaa tacaaattaa acagttatgt aaagagtaca cttttaatgt tactaaagaa 2460 acaatgcgag caatgaaaga gttagaaaaa agtataacag aacttttaaa tacaacagct 2520 tctacaaata atagcactga ttttgaagaa agtaaaacta aaaaagattt acttgcagat 2580 ttattgaatg taaaagcaaa gggtgcttta attcgttctc ggtttcaaaa tatttcagaa 2640 atggatgcac cttcaaaatt tttcttcaat ttggaaaaaa agaatggaaa aagccgatta 2700 atacattcac taaagtctga aacaggagaa gatattacag attctagtaa gattcgcaaa 2760 tatgcagtta atttttattc atctttatat aaatgtgaag tattacagag ggatgttgaa 2820 aactcagttt tttttgaaaa tctctctaaa gttgatgatg aatccaatgc aatgcttgag 2880 ctgcctatat cacaagatga attatataca gtaatgatga gcatggaaaa tggcaaatcg 2940 cctggcattg atggaattcc tgttgatttt tataaagttt tatggccagt gattggtgaa 3000 gatttgtttt tagttttaaa tgacagttta aataaaggat cattgccttt gagttgtaga 3060 agagcagtta ttactctttt acccaaaaaa ggagatcttc aaaaaatagg taactggaga 3120 cctgtgtcac ttttgtgttc agattataaa attttatcaa aggctttagc tgtaagacta 3180 agtaaagtac tggaccaaat tatacagcct aatcaaaatt actctatacc aaatagatca 3240 atttttgata atattttttt aattagggat gcgttagaag ttgctaaatt atttggtatt 3300 aattgtggat tgatctcatt agatcaagag aaagcttttg atcgtgttga acatgagtat 3360 ttgaaaaaag ttttaaaggt ttttggtttt agtactcatt ttataaaaat gattgaggta 3420 atgtatagta acattcaaag tgtactcaaa atcaatggtg gtttaagcgc tccttttgat 3480 gtacagagag gtgttagaca gggttgtgct atgtcaggaa tgttatattc cattgccatt 3540 gaacctcttc tacacagaat aagagcggaa ttaaaaggtt tttctatacc ccactgtaat 3600 gctcaagtgc aactttcagc atatgcagat gatgttattg tttttataac taacactgat 3660 gatgtaagaa gactggaatg tattattaac aattttaaaa agatttcttc tgctaaagtt 3720 aattgggaaa aaagtgaagc acttctcata ggtgactgga aaagagaccc tcccttatta 3780 cctggaggat taaaatggaa gaggggtttt ttaaaatacc ttggagttta cctaggtgaa 3840 agtgactttg ttcaacaaaa ctggaacgga ttgcttgaaa aaattcaaag tcgtttacag 3900 aagtggaaat ggttgctccc acaactttct tacagaggac gaacactcat tattaacaat 3960 ttggtgagtt cctctatgtg gcataaactt gcttgtgttg atccacctgc tggactctta 4020 ccaaaaattc aaggagaact ggtcaacttc ttttgggaca atttgcactg gacacctcaa 4080 agtgttctgt ttctgccaag agatgaaggt ggccaagggt tggtgaatct tgtgagcagg 4140 ggagccacat atagactaca attcgttcag aagttgttgt atgggcctga aaatctcgtg 4200 tggaggcctt tagctcaatg catcctacgt ggaataaatg gcttgaatct tcatgcgtca 4260 ctttttttga cagataatgt gcatctagat ctgactggaa taccttcttt ttacaaaagt 4320 atttttaaag tatggggact ttttaaacat tcgcaggcag attcaacaac atctcttttc 4380 tggctgcttg aagaaccagt acttcttggc agtcgccttg atgtaacaaa agatgatacc 4440 ccaggactaa aagaattgct tatttccaac aaaatgacca aactaagaga cattgttgac 4500 aaggcgggtt ttggactaag agatgcagaa aatgttgctt tatatatggg aattagatcg 4560 gtgaggtttg ttacaaaatt tttgaatgtt ttgaaaagtt ttttaaatga aaaagaaaaa 4620 attttaattg aaggttacag tcaaggtatt attttcccta atgatcaaga ttgtttccca 4680 gacattagac ttacacctaa gattgaaaat tctgggatta atagtccatt gttatgtttt 4740 aaggaaaatg ggaatgttga ttttagtaca gttaaaggga aagtgcttta taaaaatatt 4800 gtgaaactaa ataataagtg tactttaaaa gagagatctg acactgtgtg gagagataaa 4860 ctgtgtttag tagaagaaga aaaaccagaa tggagaatat tgtataaacc accattgaat 4920 aaaagatcgg gggatttgca gtggagaatt ttacacggtg ctatagctgt taatgcattt 4980 gtgactaaaa taaattctag tgtgagtgat caatgtccgt tttgtttaga aagggaaaca 5040 atttttcact gttttatgta ttgtaaaaga cttgctcctc tgtttgatct tttaaatgtt 5100 ttagtttctt tacttggttt tttatttacg aaacaatgtt ttatattagg ttttaaatat 5160 aaaagacaaa acaaatataa atgtcaatta ttgaatttta tttttggtca agcaaaatca 5220 gctatttata ttacaagaaa aaatgaaata gaaaagagga atggtaaaaa cattgtagta 5280 gtttttaaaa acctagtaaa atcaagaatt attgttgatt ttaattacta taaaatgatg 5340 aaagcaatta atgtgtttaa aattgaatgg tgttgcgatg atggcttgtg ttctgtcatt 5400 aatgaagagc tgttttttat aaatgaattg atttgagctg taaatcaatt gttaaataat 5460 tgcttgttaa taaaaggatt tgtaaaatca aaaaaatctc tctctctctc tctctctctc 5520 tctc 5524 // ID TDR17 repbase; DNA; ZEB; 248 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR17. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-248 RA Jurka J. and Drazkiewicz A.; RT "TDR17: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 27-27 (2002). XX DR [1] (Consensus) XX CC Contains ~30 bp TIRs. XX SQ Sequence 248 BP; 86 A; 36 C; 33 G; 93 T; 0 other; cgtcacctta gaattataag gttctatctg acatttttgt caaaattgag ttattcacat 60 attcttatta aatgacaact tatttacatt atgggatgtt tttttataag ttgtttacat 120 tttaagactt tttatttaaa aaagaaatgt tacacacata ctgtttgcta tggaatgcaa 180 aaactttgaa gctcaatatc tcaaaatcat tcagaacgca gatagaacct tataattcca 240 aggtgacg 248 // ID DNA-8-5_DR repbase; DNA; ZEB; 2140 BP. XX AC . XX DT 10-NOV-2008 (Rel. 13.11, Created) DT 10-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE DNA-8-5_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2140 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1720-1720 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2140 BP; 641 A; 340 C; 454 G; 705 T; 0 other; cagagatgta tagtaacgaa gtagaactac ttcactactg tacttaagta ctaaaaggca 60 gtatctgtac tttactggag tattattttt ttctcctact tccactttta cttaagtaca 120 tattttccat gagtttaata cttttactcc gatagatttt ttatgtgctg catcgttact 180 cgttactagg ggtgtcaaaa tgattgatat cagttcagta ataggtcata accggttatt 240 acgtgatgac gtcatttatc tcctatgcgc gatgtcgcaa tactatggcg aaggacggag 300 gcgcgagggc gaggcggcac agcataccag ccgctagttt actattaggg agaaatgttg 360 taaaacttca cctctgccta tctctcgctc tctcactttg gacatttgac ccgctggcca 420 gtttgtaagg taacttttcc tctcttctta gctgttaaag ctgtggatcc agacatgagc 480 gtgcctcatg tgcaattaag ttttctccac tgtgtctatt acctgtgata atcgcgtatt 540 cttcccgcca tgggtgaaca gcgcttatat ttctaaagcc ctttctgttg aggacaataa 600 ccaatcatag ccgtgtaaga cccgcctgtc agcgctcaaa aagcaggcgg gataatattg 660 acatgagttt attagctgta aatgctcgtg ctgtcttctg tgcatgagtt ttgtttgata 720 catccagaaa gagtgttttc tttgagcagg tcaaattaag ggcacagttc gtatatctga 780 ctgctgtatt aaaggacaaa attttattac aaaaaaccct ttaactgtca caccaagaca 840 aaagcaatag aatttttttt gattgcattt cttaactcct aatgtcgcta gttaacacaa 900 tcagtgcatt tttattcaac acatcacatt atttctaaaa tatcagcctc taccaaatgg 960 ttgagatgtt ggtttatggt aaaagtacca gaattaataa atatactaca gaaaatatga 1020 actgtaagac caatttcatt aaaaacatga taaaaataaa taacaattat tactaaatca 1080 tagccataag ccataaaata tttcagattt tcatttaaca cagtaatata cacattttcc 1140 tgtttttttt tttttacaat aaaatcaaaa tcaagtgaat ttgtgaagca ggatttagtt 1200 tgcttgttta gtttagtttc acgagtgctt ccagaagaca gaactcctgt ttattgtggt 1260 attaaccaat caaacttcaa aactcccaaa gtgtgagtgt gtctttgtgt gtgtgtgtga 1320 gtgagagtga ataatgtgtg agtaagagaa aacaacagtg tgattgtacc tgtgtgtgcg 1380 tctgtgtgtg tgtgtgtgtg tgtgtgtgta agagagagtg taaaaacaat tgcaacctat 1440 gtaatgtccc cacaattcac aaaaatgtgt gtgtgagaga gagagagaga gagagagaga 1500 gagagagaga gagagataga gagagaactg tgtgtgtgag agtgtatgaa tgtgtgagag 1560 tggggagaga gtgtacattt aggtgtgtgc agggcagtgt gtgtgtgtgt gtgtgtgtgt 1620 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgagt gagagagaga 1680 gagagagaga gagagagaga gagagtatgt atgtgagtga acatgctgaa ttatctcatc 1740 agtgaaggca ttttctggta aaagggcctg acgatgtgag ctcttgtacc atagaaacta 1800 tttacagaga aaaacagtat cataagccat gtttcttttc gttctgctta atcaagtacc 1860 ccaaaaaaga tatttaaaat aaacaaaaca attagtgtat tttgactgct ttctttaata 1920 actacattac acaatacttg tacttttact ttcagtactt gagtagtaaa ttttgaaata 1980 aactacttgc aatacttaag tacaaaaaat gttgaatact tttgtacttc cacttaagta 2040 tggtgcttaa agagcacttc tacttctact caagtcaatt tttgataaag cacttgtact 2100 tttacttaag tatgggtctc tagtacttta tacatctctg 2140 // ID HATN7_DR repbase; DNA; ZEB; 595 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN7_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000300; HATN7_DR; nonautonomous DNA transposon; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 6-592 RA Bao Z.; RT "Dr000300, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-595 RA Kapitonov V.V. and Jurka J.; RT "HATN7_DR, an ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 15-15 (2002). XX DR [2] (Consensus) XX CC ~1000 copies of HATN7_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~10% divergent from the consensus sequence. CC This element is characterized by 140-bp terminal inverted CC repeats and the 7-bp target site duplications. CC There is a 89% identity between internal regions of the CC HATN7_DR (positions 330-454) and HAT1N_DR (positions 447-561) CC consensus sequence. CC Putative classification [2]: nonautonomous DNA transposon from CC the CC hAT superfamily. Presumably, some of hAT transposons may generate CC 7-bp TSD instead of the canonical 8-bp size. Given the structure CC of TIR and similarity to HAT1N_DR, it is unlikely that HATN7_DR CC belongs to the PROTOP family that is characterized by 7-bp target CC site duplications. XX SQ Sequence 595 BP; 181 A; 107 C; 132 G; 175 T; 0 other; cacccgcttc gaatgttatc actcgattga atgggcatta tcagtggtta tcagctgata 60 gatatgggcc agtaggggcc ttctgcgcct actggtaggc tcagaaggct gttatcatcc 120 atccacatgg ttaatcagct atactaatag agaagactcc agatccagat ctgtttttac 180 attactgtga cggttgggtt tagggttggg gtaggggtag acattaataa aatacaatta 240 atgggaaatt taataaataa tataaataat tatcatggtt aatcagctat actaatagag 300 aagactccag atccagatct gtttttacat tactgtgacg gttgggttta gggttggggt 360 agaggttgac gttaataaaa tacaattaat gggaaattta ataaataata taaataattc 420 ttgttaactt ccggccacag ccgtatgtga tctatagctg attaaccatg tggatggatg 480 ataacagcct tctgagccta ccagtaggcg cagaaggccc ctactggccc atatctatca 540 gctgataacc actgataacg cccattcaat cgagtgataa cattcgaatc ggctg 595 // ID DNA-1-7_DR repbase; DNA; ZEB; 1438 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 22-OCT-2008 (Rel. 13.09, Last updated, Version 5) XX DE nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 1-bp; KW DNA-1-7_DR. XX NM ENSPM-1N_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1438 RA Jurka J.; RT "EnSpm-type families from zebrafish."; RL Repbase Reports 8(9), 929-929 (2008). XX RN [2] RP 1-1438 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Direct Submission to RR (22-OCT-2008). XX DR [2] (Consensus) XX CC This element produce one 1-bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1438 BP; 480 A; 235 C; 203 G; 517 T; 3 other; ggggtgcgtt tcccaaaacc atcgttagcc aactaaggtc gcaagttccg tcgttacaaa 60 catagtttgt tgatttgccg tttcccaaat ccgtcgctcc aacgaacatt cgcaaactgc 120 gtcgcaaact tgagcgctcg caactacacc tctggagctg tagttagaaa catagttcct 180 ggctgtgttc tattcccact tatcccccct atgccctatt catttagaac attctaacat 240 tcaaagttgg aattattaaa aataaaaaag cattaaggtc atctctctta ggtgtaattt 300 gctttcaaac tatttttaca gttcagtttt agcgatcttc atgtttacaa ttgtgctccc 360 ttcgcagtgc actttgaaaa cattgatgtc attttgaaca cagcctcatg gacgaaagct 420 ataggtgacc tattatttaa aagcggaatt tatgttacgt ctccaaagct tgtgaaaaca 480 aaaatatagc atacgttaat tcttttaatt tatagtaggt tatttattaa gtatctgtac 540 tgtatatgac atgggcctgc tggttagaac tttctgcagt ggtttacatg tgtcaaattg 600 taaaagtaga cttttctaaa aaaaaaaaaa taataataaa taaataaaca atatcctact 660 tamttcttat tantactaat aataataata ataataataa taataataat aataataata 720 ataataataa taataataat aataatcatc atcatcatta tcattattaa tattattatt 780 aaattatata tttttttatt tctaatacat aataataaaa ataataatta ttatcatcat 840 catcatcatc atcatcctca tgatcatcat tagtattatt aatattatta ttaaattagg 900 atttttttca tttcctaata tataataaat attacatttc attttataat aaatagcctc 960 attatttatt tatttatatg atttatatat gatattagaa tacgtgttag cttttgtaag 1020 tgattttatg ttttagaata gaatatgttc atsttctcaa taatatttgt aaaggaaaca 1080 caggccctag gctctatatg tgccatttac atatatttca gttaatatgg aaagcgagtg 1140 catgttttta ccaaaactaa tattggattt tatttttaaa tgcgtgtgcg tgtaaaacaa 1200 tataatttgc acaaagaaat gatggggttc ttctctaaag aagtagttac tccaccccgt 1260 ttagagcgtc attatgggcg ttttacgtta taactaacat ggttcaagcg atggatctgc 1320 gacagagaaa ctacgcgttt tgggaaacac tcgtcactac atcgttcttt tcccaaacga 1380 tgcatcgtac tatgatagtt cagccgtgag ttacgtcgtt gtttgggaaa cgcacccc 1438 // ID Gypsy-33-LTR_DR repbase; DNA; ZEB; 1080 BP. XX AC . XX DT 01-JUN-2007 (Rel. 12.06, Created) DT 05-JUN-2007 (Rel. 12.06, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-33_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy-33-I_DR; KW Gypsy-33_DR; endogenous retrovirus; Gypsy-33-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1080 RA Dib M.R. and Naveira H.F.; RT "Gypsy-33_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(6), 182-182 (2007). XX DR [1] (Consensus) XX CC Gypsy-33-LTR_DR is a long terminal repeat of the Gypsy-33_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-33-I_DR. XX SQ Sequence 1080 BP; 268 A; 215 C; 218 G; 379 T; 0 other; tgtgataaag tcacattaac tgtcaccgct gggaaagcag cactggcatt cacacacaca 60 cattctcgta cgcacacgca cattctcacg cacacacata cttgcaaaca cacacacaca 120 cagacgcgct gacacgcaga catactcaca ggcccactta cctcattcat tcatcataat 180 gttcctccct gccgttgccg ttatcggagg agcgggtgca cacgcgatct cgatagcaaa 240 tggtcatatt tccataaatc cactccccgc aattcataag tcatccatag gcatttcata 300 cacctccact tcacctgttg tgttgtcttc tctgtttggt gtttgtctgt gtgggcgtag 360 tcaggcttcc ggtcgcgatc ggtcaaaccg agcacatagg aagtgctggt ttaagaatga 420 ttgggattgt gtgcgcatgc gcactgcgat cggaagctga cgaaaccatt atgtctgtta 480 aataggtgtt ttgtgtttat gaatgggtgt gtattatgtg tttatttata ttaagaaaat 540 gtatgttttg ttgttaatta tatattaagg attttttctt tatgtgttta atgtgaagtg 600 tggcttgaat gtcaagttat ttaatgtttg tgtatttgta cgttttaata cttacctgtt 660 gaatggttta gcccacaaac gggcgggagt ttgggctata taagccagag cccaactttc 720 ttacaacact tgttaaccaa gtcgatgcaa gagtgctact gtggtgcctc aagcaagtat 780 attgtggata ttgtgaatac ctgatattat cgtatattat tattattatt attgtagata 840 taattggagt atatttttgc agattttatt ttcctgtttg ttgtttcttt ggtacttttt 900 attactattt ttttttgtta gcacttgtaa atatcttcac tgtggatgga ctttttggca 960 ctgtaaataa atatcacctt gcacgacagt gctttgcgag taagccatca tgtttttttt 1020 ttacgaactc acatttcttc ccccccttga agacgagttg gtgggctcgt tttcatcaca 1080 // ID Gypsy94-I_DR repbase; DNA; ZEB; 6881 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy94-I_DR; KW Gypsy94-LTR_DR; Gypsy95_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6881 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1499-1499 (2008). XX DR Genome; chr19; Positions 22469761 22462881. XX CC Positions [3205-3744] - Reverse transcriptase CC Positions [5199-5675] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 611..1990 FT /product="Gypsy94-I_DR_2p" FT /translation="MSQEGEKTDLSASNGFELGFARGRIFGELEHTPLRRA FT VNFGSPNFCSTRTSDRVHNTEQCVNAESHDNKCSNTDLSNLITQLAQQIGQ FT SISDQLKKGSEKNETIDNQPQSIRTSQVLTESPSLNLTGMKLVMKPDVKAP FT PCFRGDGSDKLTVHEWEELMNVYLRKKGIPLPEQADEIMSNLTGRAKDIIK FT ITLRSNPLLQPQKDPRVVTDILKQHFSDLTYSSMPLADFYGTLPVPGETAM FT EYWIRLNKAVDIADECLRRQGRSIEDPNREVTMMFVKYCPDHSLSAILKFK FT TADKWTAAEIQERLDEFQAEKRAQMKAQSNRSAFMKPVATYAQAPTLDVEL FT SGNEPDVSAGQNVSPLPPSVQSSDSCTQSLINLLDRVLKQNAQTPSVPEDY FT RHANARRMFCKVCRATNHSTVAHCRREGLCLSCFEYGHRKRECPNWSSRHS FT PDVNYSSQGTGPLNH" FT CDS 2041..4890 FT /product="Gypsy94-I_DR_1p" FT /translation="MDLERQYENACNAVSDGTHVIYTQNIQKIRPFDDLFY FT TSVIVNDHFHLNGMLDTGSMSCTLSEAAEQKLLSENAVLDKRALPENTVLV FT GVGGRTAQPKCLYEVNMKVYGICCRVPILVVPGQHDDLILGTNLIKHIVSK FT MKNTDEYWDFISQSPTQLSQEGEHFLDVMSNITRWRHDEIPHKIGTVKLQR FT AVTLAARQEHLVWGRLPSKTPLSLGSTILVEPTTSKTMPRNIMVARVITSM FT WGDRWVPMKITNLSDLPVTLRKNRRLADVSPCLAVEDFPILQGTGKIQMVN FT SEIQANTSCTSNSKLNDVGLDEIDIDLCKVTDASKEKLVQLLIKYNDVFSK FT HSLDCGEAKEFEHCIRLTDERPFRLPYRRIPPAHYQKLRQALTEMEEQGII FT RKSVSEYASPLVLVWKKDGSLRICTDFRWLNARTLKDAHQLPHQSDCLAAL FT GGNNYFSTMDLTSGFYNLPMREQDKKYTAFVTPMGLHEYNRMPQGLCNSPA FT SFMRMMLSIFGDMNFSSLLCYLDDLLVFASTEEEALNRLEIVFHRLRMHRL FT KLSPKKCQFLRDSVKFLGHIISGSGISVDSAKVDVISKMSKSQLMEEDGCT FT PSPKRIKSFLGMIFYYQHFIPNCSAIAKPLFALTAGQKRRGKVRSGKHVGT FT FRKLNPSDWVDECDVAFSMLKEKLLKCAVLAHPDFSKPLILSIDASLDGLG FT AVLSQVPAGEEIARPIAFASKTLSVSQKKYSAHRLEFLALKWSVCEKFSHW FT LKGHTFTVWTDNNPLTYIMSKPKLDACEQRWVSKLAPYSFDIKHIAGTKNV FT VADTLSRDPFARTVSHRLMNESYNHLLTEADAVKEDGIQDAFRLKVECHKC FT NDRVECLSRYEQPCDSVVIKLLCQVHDEWETATFTRAVQLAQSVCQVAAAN FT QISLLGFSHDELQRSQELDSTISKVLPLVIRKRLLDEKETSWISEL" FT CDS 5004..6572 FT /product="Gypsy94-I_DR_3p" FT /translation="MKEKALSGLHDLAGHQGQARTIHLARQRFFWPKMDHE FT IKEYVKCCRRCILAKTPEPAARAPLESIKTSAPMELVCLDFWSAEDNKQRS FT VDVLVLTDHFTKLAHAFPCINQTAKQVARKLWDHVFCVYGFPERIHTDQGT FT NFESELIAELLALSGVSKSRTTAYHPMGNGITERFNRTLGTMIRSLPLRSK FT DKWPQQIQTLTFAYNSTVHETTGYAPFQLMFGRIPRLPVDVMFKQVLNDPV FT TVDYSNYAAALMSHLSEAAEIAQKHATQEQNKQAEMYNRRIKGVHLHCGDR FT VLIANKSERGKRKLADKWEPTVYTIVDSDPKTHIYKVEDDKGLTKVVHRNM FT LLDISFLPVTLPEETSDESATEECSDDSECESLLSNSTDCLEDETSRNKTY FT SGALNREHSPNSLSVDEPIQQDVDSSDTYHGDSNSGSQIQILSDEASSPCD FT SNSDSPSANSVMSNPDVGDSVIVDVLSSDHEQGKITTRAGRVVKRVNRLIE FT TMAQKPFKMPNLSNSFLKKSQSVLSLF" XX SQ Sequence 6881 BP; 2055 A; 1341 C; 1552 G; 1933 T; 0 other; ggtatgtgtt tacatatgtg tttatgcttc tgttatgttt ttgaatttat gtataatgtt 60 tctttttttt ggtgtatgtt aaacacactt caactttatt tatgtgagtt cgcggcaaac 120 gcgattgagg agaggttggt gagctcctaa ggaaggcaca tatactcagc tttgccattt 180 gtttgcttta ttcatacaga ataaaagcct gttaattccc atatgactgg ttcctttcgc 240 ctaatcacaa cacaacgcag agtaaagcga ggaccggtca cattggtgcc gtgactttca 300 actgatgtgt catcatggat tctgcttgat caacgccagt ttggtcgccg gggtgtactg 360 catcgttgga catctgattt ctgctggtat ctgtgtcaaa agggaagagg ttggatatta 420 acccgtgcat acagaaaaaa agaagctgaa agtttgaaat taatcttacc aaagacagta 480 gaaaaaatat aaaggacaaa aaaaaaaaaa aaaaaaaaaa caattacttg tgaattaatt 540 ttacgttaat gatctgacta ttttttttgt tttatcaact gtttattttt ggtgatcatt 600 attaattaag atgtcacaag agggtgagaa gacagatctt tcagctagta atgggtttga 660 attgggtttt gctagggggc gtatctttgg tgaacttgaa cataccccac taagaagagc 720 tgttaatttt ggtagtccca atttttgctc tacccgtact tctgataggg tgcacaatac 780 tgaacagtgt gtaaatgctg agtcacatga taataagtgc agtaacactg acctgagtaa 840 tttgatcaca cagctggctc aacaaatagg tcagtcaata tctgatcagc tgaaaaaagg 900 tagcgaaaag aatgaaacta ttgataatca accacagagc ataagaacaa gccaggttct 960 cactgagtca ccgtctctca atttaactgg tatgaagctg gttatgaagc ctgatgttaa 1020 ggcacccccc tgcttcagag gggatggatc tgacaaattg acagtacatg agtgggaaga 1080 actcatgaac gtttatctaa gaaagaaagg catccctcta cctgaacaag ctgatgaaat 1140 tatgtcgaac ttaacaggca gagccaaaga tataataaag attactcttc gtagcaaccc 1200 attattgcag ccacaaaaag acccaagagt tgtcacagac atactaaagc aacatttcag 1260 tgacttgaca tattcttcga tgccgctagc tgatttttat ggtaccttac ctgttccagg 1320 agaaactgct atggagtatt ggattcggct taataaagct gttgatatag cagatgagtg 1380 tttgaggaga cagggacgaa gtattgaaga tcccaaccgt gaagtgacga tgatgtttgt 1440 taaatactgc cctgatcatt ctctttctgc catcttaaaa tttaagactg ctgataaatg 1500 gactgcagct gaaatacaag aacgccttga tgaatttcaa gctgagaaaa gagcgcagat 1560 gaaggcacag tctaatcgtt ctgcctttat gaagcctgtt gccacttatg ctcaagcacc 1620 gactttggat gttgaattgt ctgggaatga accagatgta tcagcaggtc aaaatgtgtc 1680 accattacca ccatcagttc agtcaagcga cagttgcaca cagtctctta taaatctgct 1740 tgatcgtgtg ctgaaacaaa atgcacagac tccatctgta cctgaagatt atagacatgc 1800 gaatgcgaga cggatgtttt gcaaagtgtg cagagctact aatcattcta cagtagcgca 1860 ttgcagaaga gaaggtttgt gcctgtcttg ctttgaatat ggacacagaa agagagagtg 1920 cccgaactgg agtagtcgac acagtccaga cgtaaactac tcctcacaag gaactggtcc 1980 gttaaaccat tagacccaca ttcggtgagg ggatgtgtgg gtgatgatga aaccctcata 2040 atggatctgg aacgacagta tgagaatgca tgtaatgcag tttctgatgg tactcatgtt 2100 atatacacac aaaatataca gaaaataagg ccttttgacg atttattcta tacctctgtg 2160 attgtgaatg atcacttcca cttgaacgga atgcttgaca ccggctctat gtcctgtaca 2220 ctaagtgaag cagccgaaca aaagttgctg tctgaaaacg ctgttttgga taagagagca 2280 ctcccggaga atactgttct tgtgggtgtt ggaggtagga cggcgcaacc aaaatgtttg 2340 tacgaagtta acatgaaagt ttatggcatt tgctgtcgag ttccaattct tgtggtccca 2400 ggtcagcacg atgatttgat tcttggtact aacttgataa aacacatcgt gagcaaaatg 2460 aagaacactg atgagtactg ggatttcata tcacagagtc ccacacagct atcacaggaa 2520 ggggaacact ttttggatgt gatgtcaaat atcacacgct ggcgacatga tgagatccct 2580 cacaagattg gaacagtaaa actccagcgt gctgtcactc ttgctgctag acaagaacat 2640 ttagtgtggg gtcgactgcc tagtaagact ccattgtccc tagggagcac tattttggtt 2700 gaaccaacaa cttccaagac tatgccaaga aacatcatgg ttgcacgggt gataacatct 2760 atgtggggtg ataggtgggt tcctatgaag ataaccaact tatctgacct acctgtcaca 2820 ctgagaaaaa atcgtagact ggcagatgtt tcaccttgtc tggctgtcga ggatttcccc 2880 atcttgcagg gcactggaaa gatacagatg gttaattcag agatacaagc gaatacaagt 2940 tgtacttcaa attcaaaact aaatgatgtt ggattggatg aaattgacat cgatctctgc 3000 aaagtcacgg atgcatctaa ggagaaactt gttcagttat tgatcaagta caatgatgtc 3060 ttttctaaac attcgttaga ttgtggcgaa gccaaggaat ttgaacactg tatacgactt 3120 acagatgaga ggcctttccg attaccatat cgtcggattc ctcctgccca ttaccaaaaa 3180 ctgcggcaag cgctaactga aatggaggag caagggatta ttagaaaatc tgtcagcgaa 3240 tatgcatcac cgttagtgct tgtatggaaa aaagatggaa gtcttcgcat ttgtacagac 3300 tttcggtggc ttaatgcaag aacgctcaaa gatgctcacc aactccccca tcagtcggac 3360 tgcttggctg ctttaggagg taataattac ttcagtacga tggacctaac ctcaggattt 3420 tataatttgc ctatgaggga acaggacaag aagtacaccg ctttcgttac tcctatgggg 3480 ctacatgagt acaaccgcat gccccagggt ttgtgcaaca gcccagcttc attcatgcgg 3540 atgatgctta gcatttttgg agacatgaac tttagcagct tactctgcta cttggatgat 3600 ttacttgtat ttgcatctac tgaggaagaa gctttgaaca ggctggaaat agttttccac 3660 cggcttagga tgcatcgcct aaagttgagt cccaagaagt gccagttctt gcgtgattca 3720 gtaaaatttt taggccatat tatttctgga agcggcattt ctgtggattc tgctaaggta 3780 gatgttatat ctaaaatgtc caagtcacag ctcatggagg aggatggttg cactccatca 3840 ccaaagagaa ttaagtcgtt tttaggaatg attttttact atcaacattt cattcctaat 3900 tgttctgcga ttgctaagcc cctctttgct ctaactgcag gtcaaaagag acgaggtaaa 3960 gtgagatctg ggaagcatgt tggcacattt cggaagctga atcctagtga ctgggtagat 4020 gagtgcgatg ttgctttcag catgcttaag gagaagttgt tgaaatgtgc tgttctagca 4080 caccctgatt tctcaaaacc acttattctc tctattgatg catctttgga tgggcttggc 4140 gcagtgcttt cccaagttcc agctggggaa gaaatagcca gacctattgc tttcgccagc 4200 aagaccttga gtgtgtcaca gaaaaagtat tctgctcaca ggcttgagtt tttggctttg 4260 aagtggagcg tgtgcgaaaa gttcagccat tggcttaaag ggcacacctt tacagtttgg 4320 actgacaaca atccgcttac ttacataatg tcaaagccaa aacttgatgc ctgtgaacaa 4380 aggtgggtct caaagctagc cccttactct tttgacatta agcatatcgc aggaacaaag 4440 aatgtggtgg ctgacacttt aagtagagac ccgtttgcga gaacagttag tcacagactc 4500 atgaatgaga gctacaacca tctcctgact gaagctgatg ccgttaagga agatggtata 4560 caagatgcgt ttcggttaaa ggttgagtgc cacaagtgta atgatcgtgt ggaatgtcta 4620 tccagatatg agcaaccatg tgactcagta gtcatcaagt tactttgcca agtccatgat 4680 gagtgggaaa ctgctacatt tactcgtgca gtccagttgg cacaatctgt ctgtcaggtt 4740 gcagctgcta accaaatctc tttacttgga ttttctcatg atgagttgca aaggagtcaa 4800 gagcttgaca gtactatttc gaaagttctc cctttagtaa ttcggaaaag acttctcgac 4860 gagaaagaga ccagttggat atcagagctc taacactcat aaagcagtgg gatcgtctga 4920 aaatccagga cgggatcttg tatagagtga caaaggaccc agtaagcaga cagaaacgcc 4980 accagtatgt tctccctcaa tgtatgaagg aaaaagcttt gagtggtctc catgatcttg 5040 ctggtcacca aggacaggct agaacaattc atttagcaag gcagcgattt ttctggccca 5100 agatggacca tgagattaag gagtatgtca agtgttgcag gaggtgtatt ctggctaaaa 5160 cacctgagcc agctgctaga gccccactgg aaagcataaa aacttctgcg ccaatggagt 5220 tggtttgtct tgacttttgg agtgcagagg ataataagca acgttcagtg gatgtgttgg 5280 tcttgacaga tcatttcact aagctcgcac atgcttttcc ttgcattaac caaacagcta 5340 agcaagtcgc tcggaagtta tgggatcatg tgttctgtgt ctatggtttc cctgagcgaa 5400 tccatacaga tcaaggcaca aatttcgaga gcgagcttat cgctgaactt ctggcgcttt 5460 ctggtgtctc caaatctcgc acgactgcct atcatcctat gggcaacggc atcacagagc 5520 ggtttaaccg aacattgggg actatgatac gttcattgcc cttgagatct aaggacaagt 5580 ggccacaaca gatacagact ctgacgttcg cctataattc aacagtacat gaaacaacag 5640 gttatgcccc ttttcagctt atgtttgggc gcataccacg gttgcctgtg gatgtaatgt 5700 ttaaacaagt tttaaatgat cctgtaactg ttgattatag caactatgct gctgcattga 5760 tgtctcacct ttcagaagca gctgaaattg cccaaaaaca tgctacacaa gaacaaaaca 5820 aacaggctga gatgtataac cgcaggataa aaggggtaca tctacactgt ggagacagag 5880 tgctgattgc caacaaaagc gaaagaggca aacggaagtt agcagataaa tgggaaccta 5940 ccgtttatac catcgtggat agtgacccta aaacacacat atacaaggtg gaggatgata 6000 aaggactgac caaagtggtt catcggaaca tgctcttgga tattagtttc ttacctgtaa 6060 cacttccaga ggagacatca gacgaatcag caactgagga gtgctcagat gactcagagt 6120 gtgagagcct attatccaat tccactgatt gtctggagga tgagacttct agaaacaaaa 6180 catattcagg ggcattaaac agagaacatt caccaaattc actgtctgtt gacgaaccaa 6240 tacaacaaga tgttgatagt agtgacactt accatggtga ttccaattca gggagtcaaa 6300 tccaaatcct ttctgatgag gcttcaagtc catgtgattc taacagtgac agtccatctg 6360 cgaattctgt tatgtccaat ccagacgttg gtgattcagt gattgtagat gtgttgtcgt 6420 cagatcacga acaaggcaaa attacaacac gagctggtag agtcgtaaag agagtcaacc 6480 gtcttataga gaccatggcc caaaagccat tcaaaatgcc aaacttgtcg aacagttttc 6540 taaaaaagtc tcagtcagtt ttaagtttgt tttgatgaca ctatattttg acaagtctac 6600 tttttttttt ttggtcattt tggcagataa tcgatgctgt taaattgcat ataagacact 6660 ttatgtgagt gaggatttca cacaaatgat tacatgtggt atggtgtatt tggcaacata 6720 ctttactaag aggttttgag gcctgatcaa cctcaaattc ctctgtatct gtcaggacag 6780 atagctttca gtgctgtaat tatgatgttc atgtaatggg atttcttgaa aactcctctt 6840 catagatgtg cagaaatttt gatgaaatta ggggaggtga a 6881 // ID DNA-8-17_DR repbase; DNA; ZEB; 1420 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-17_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1420 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2140-2140 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1420 BP; 508 A; 230 C; 230 G; 450 T; 2 other; gagtaagggg aagctgcagg cccggagacc caactgcacg ctcagacctg cgcgcacaga 60 cctgcacttc cagaccagac agtgccccct gctgttgggt tgggtgaagc ggcaggtacg 120 gagacccaac tgcacactca gaccagcact ttctagcaag tagcaacatc agaaacgcca 180 gtaaggttat tgattattaa agcgattcct gaaaattacc aagttttatg gcataaatca 240 atgtggtgta aaatattaca ctaacaaaca attatttcag tgtttgattt gatcaaaatt 300 agtgtcaaaa gagcattatt ttatgatatt tacagcttgc tcttatttta tctattattg 360 taccattctg ggacaatttt ctagcaatgt tttctggcaa aataaaaaaa aaaactttaa 420 agtaaaaatg ttaaaataaa atgcagccta taatatatag cgcaattatt atttgtaaca 480 tattttagtg ctgtataggt ctactgtaca tattactgta ttatctatct gaaagtgcta 540 ccaaagacaa taacagtaac acaataacag tttatcaatt atttataagt tctgtaatga 600 ttacaaacac gttttttgga aatcaaaata tcagttaaat ctgcatagaa tttacaatat 660 gagctctctt aggtgaaatc aaaaataatt tgcttgtggt gtgtattttt tcccagattg 720 atattacaat ttaaaaagtg ggtctctgcc aacactaata cagtgtacca aagactgcag 780 atatttccag ttttttttgc tctagaaatg aaaaatctcc ccaataagaa tgtaaagcat 840 ttattttatt tgatgttatt ttctacgtaa aaaaataatt attactccaa ttattatttt 900 tcaccatttt tcyatgcatt gtttacaact tgacttgtag gacctcatga aaaagtacac 960 aaaaataaag tttgttgtga gaaaattact tagagtacta taattgtatg tgaagaaatt 1020 tattgtatat gaagataata ttcattccct caactcaata cgaaacagtg ataaatcctt 1080 aatgataaaa aaacaacaac aacaaatgtc tgaaaatatt gaagcaactg aacaatatct 1140 atttacaaat acaattaata aaaatacaac agaattawtg cattaaatca aaaatctagt 1200 atgtttaata tttcatataa taatctattc aatcatagcc taatatttta aatagtagaa 1260 aaatagttta aattacaatt tagaagcagc tgaaatcagg ttagtcgatc ttttaaacag 1320 caggtggcac tgtgaaggtc tggaagtgca ggtctagccg tctgagcgag caggtctgag 1380 cgtgcggttg ggtctccggg cctgcagctt catactattg 1420 // ID DNA-8-25_DR repbase; DNA; ZEB; 1557 BP. XX AC . XX DT 10-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-25_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1557 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2148-2148 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1557 BP; 479 A; 392 C; 273 G; 413 T; 0 other; taagggtgta acgatacgcg tattcgtatt gaaccgttcg gtacgagact ttcggttcgg 60 tacgcattac aaaccgaacg attcgattca gactaacatc taaaaagata aaagagaata 120 agtacaaaat ataatgtttt cagtgcaaca acgctcaaca tggcatccca aatgacgtgg 180 cagacaacat gctgccttcc ccgctgtcat gttaaacagt agaccatcgc ttttgaactc 240 agaatatcac tttattaaag catgcactca ttagcaatct tgctgccttc tgtgtcagcc 300 tggctgctcg tgctggctcc gcccccgttg ccccagcaac cctggacccc cgacacaaaa 360 cagccaatct caataacctg ttcacatatc accacacccg cccacctcca aacactactc 420 cagtcaggca cgcgctacac tagctcgcaa aatggctagt aacgttaatg taatcccaga 480 catcccaagt ctcccggaag tttcaggagt ctcccgcaaa tgcatagaga ctcccggatg 540 cccgcaaacg aatgataatc ttccggaaat cgcgcgtctc ccccccggtc cttaaatacg 600 tcgcacaccc ctctccaccg catccctccc agctcttcag gtacgttggg cacaactcac 660 ccccgccgct caggtacatc gcacgcacac gcctccaccg ctcttcagat acgtcgcacg 720 cacaccaccc ccaccactct tcagatacgt cgcgcaccat caccgccacc aagagagtgg 780 tggaaaaaaa caatgacatc tgctatgaca gtaagctatt ttagcggtaa tatttgagca 840 tatcaactca tttacactga catcacggaa tcagtaactg gaactggaac agcattcaga 900 caggcaattc ctacagattc gaacagggca aaacaaatca cagaggcgat cggtacattt 960 atagctgtgg atatgagaga tcttaatcgg tagtggaaat ttcgggtttt aagaaaatgt 1020 taaatgtaat agagccccgt tacattattc cttcataagc ccatttcagt cagataattc 1080 ctgctttcag atccactaaa atggcaggtg acagtcaaat caagcagatc tgtgctgtct 1140 gaagaaaagg tggacactca tctttctgaa aataaaaaaa tacttaaaaa tagattgaaa 1200 ctgagttcca tataaaaact aaattgttct tttcttttta tttgtttaac tactacaata 1260 ttatatttat attttttttt ataggaagct ttttggtttt gagtataaag aaggatttgt 1320 tttatttgct gctaagcaga aaccccggaa gtatagctct atcactgttt aaagtaaaag 1380 aaaaaaaaaa acaagatcat acttttatga tgttagtttt aatacattaa aaaatttaaa 1440 tcagagaaat cctctggctt ttttcttttt ttttgctgta tcgaaaacat accgaaccgt 1500 gacaccactg tgtcgtatcg aaccgaaccg tgatttttgt gaaccgttac accccta 1557 // ID Gypsy122-I_DR repbase; DNA; ZEB; 4524 BP. XX AC chr16; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy122-I_DR; KW Gypsy122-LTR_DR; Gypsy122_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4524 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1280-1280 (2008). XX DR Genome; chr16; Positions 51319382 51314859. XX CC Positions [2171-2647] - Integrase core CC 'TTTAT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 2..4522 FT /product="Gypsy122-I_DR_1p" FT /translation="MVSEVGCRVWFRTQASLFAPRGSFFFPPPPPPLFIMS FT RVTKKKNPSLHQGEDNDEEEVQVVVEEAVGGKPAEVTEVSVAELASLLRAH FT MARTECREMDRQREYVDQERRFKALQHQFSLLQMEVQARTSPTTHLLDDDQ FT EASGWSDANPLNADGLPVTQVRQNNSDQTVDNIVLPPSLVPRLEKLNETDD FT VEHFLVTFERIAVACRWNRSDWVWHLIPLLTGKARAAYVNMEVTESADYDK FT VKSAILKKYDINVETYRQRFRSLAVNPSESPKELYSRLKELFTKWIQPKGK FT TVEEVSEAIILEQYLRMLSPELQVWIRERDPGSASEAASWADVFVAARGRS FT KPWTFKSGGENRSISMGQGRQKTEGQGKPYFGKAPGNLSLTSRRPPICHMC FT GQEGHIKPNCPKNSVQSMHLCFVPKRNPIPADSTALRMTVVETDGKLLSAL FT IDTGSDQTLMDRRFVPPALIDQRNQLPIRCVHGDERLLPTASISMKVKGQA FT YVLKVGVSDSLPFPIILGRDLPVLFELLHPAQECNMVVTRSMAKPNEEHVH FT TLSTLPFFETEIETGVAKRRKTRREKIREKVKHNAFKPSVSCHLPVNFQLP FT TNIIQMQQSDISLKECLSRSVDAEEEKPEDDKTVTFVRIKGILYRQIGPRR FT QLVVPQCVREVVLHLSHSVPWAGHLGKNKTIARIKRYFYWPGLEGDVAQFC FT KSCPVCQKVSLQRPRKAPLQPLPIITTPFERLGMDVVGPLEKSRSGNRFML FT VITDYATRYPEVFPLRSVKAKSVATSLVQLFSRVGFPMEILTDQGTNFMST FT LLKQVYRLLGIKSIRTTPYHPQTDGLTERFNQTLKQMLRKFVTETGQDWDQ FT WLPYLLFAYREVPQSSTGFSPFELLYGRDVRGPLALLKEVWKDSPDQKEPN FT NVVSYVLQMRERFEKMASLAQTNLTKAKEQQRLWYDPAARERNLQVGQKVL FT VMLPNEESKLLAKWQGPFDVKRKLSPTTYEIFTPGQARASRVLHVNLLKEW FT TPRAKGELRSLMVCRVGEEESDEQYLPEPNVGDLELDHLSEAQQSQVRAVC FT TSDVFSEFPGFTKLIQHDIVLKPEAVVRRRSYRVPERLQEKLKEEVEMMLH FT LGIIEPSHSEWCHPVVLVPKKDGSIRFCIDFRYLNSVSKFDSYPTPRISDL FT IDRLGQAQYITTMDLSKGYWQISLTLSSRPLTAFRTPWGLFHFKVLPFGLH FT GAGATFQRLMDKVLHGLTFAAAYIDDIVVYSNTWDEHVQNLQEVMRRLQSA FT GLTVNPKKCAVAKRETEYLGYVIGQGMVRPQIGKIQALEKCPVPQTRKDLR FT SFLGMAGFYHRFIPNFSSRAATLTDMVGTRCPNQLQWTEDKLKAFRDIQGA FT LTTNAVLYNPDFNLPFVVQTDASERGLGAVLLQGMPEARRPVAFISRKLFP FT REMRYSTIEKECLAVKWALDSLRYYLLGGEFVLETDHKALQWLEKMKDTNG FT RITRWYLAMQPFSFKVQHVPGKTNVTADYLSRCSSEVLEEGE" XX SQ Sequence 4524 BP; 1235 A; 921 C; 1173 G; 1195 T; 0 other; gatggtgtca gaagtgggat gtagagtttg gtttagaaca caagccagtc tatttgcccc 60 tcgtggcagt tttttttttc cccccccccc ccccccctta ttcattatga gtcgtgtaac 120 caagaaaaag aacccatctc tgcatcaagg agaagataat gatgaagagg aagtccaagt 180 ggtggttgaa gaggctgttg gaggaaagcc agcagaggta actgaagttt cagtggctga 240 acttgcaagc ttactaagag cacacatggc caggacagag tgtcgtgaga tggacagaca 300 gagggagtat gtggatcagg agcgacgctt taaagctctc cagcaccaat ttagtttgct 360 acagatggag gttcaagccc gcacttctcc aaccactcat ctgcttgatg atgatcaaga 420 ggcctcagga tggtctgatg caaacccgct aaatgcagat ggactgccag tgacccaggt 480 acgtcagaat aattctgatc aaactgtgga taacatagtc ctgcccccct ctctagtgcc 540 taggctggaa aagttaaatg agactgacga tgtggagcac tttctagtca cctttgaacg 600 gattgctgtt gcttgtagat ggaacaggag tgactgggtc tggcacctga ttccactgtt 660 gactggtaag gccagagcag cctatgttaa tatggaagtg acagagtcag ctgattatga 720 taaagttaag tctgctattc tgaagaaata tgacatcaat gttgaaacat atcgacaaag 780 gtttcgttct ctggctgtta atccatccga aagtccaaag gagctttata gccgtcttaa 840 agagctgttt actaaatgga ttcaacctaa ggggaagact gtagaagagg tgagtgaagc 900 tattatactt gaacagtacc tcagaatgct ctcccctgaa ctgcaggtgt ggataaggga 960 acgtgaccct ggatctgctt cggaggcagc gtcatgggct gatgtgtttg tggcagctag 1020 aggaaggagt aagccttgga catttaagtc tggtggtgag aatcgctcta ttagtatggg 1080 gcagggccga cagaagacag agggacaagg taaaccttac tttggtaaag cacctggcaa 1140 tttgagtctg acgtcaagac ggccgccaat atgccatatg tgtgggcaag aagggcacat 1200 caaaccaaat tgtccaaaga attctgtaca gtctatgcat ctgtgttttg tccccaagag 1260 aaaccctatc ccagctgatt caaccgcttt aaggatgact gttgtggaga cggatggtaa 1320 actattgtct gctttaatag atactggcag tgatcagact ctgatggaca ggagatttgt 1380 gccaccggct ttgatcgacc agagaaatca gctgcccatt cgttgtgtgc atggagatga 1440 gaggttgctg cccacagcca gtatcagtat gaaggtaaaa gggcaagcat atgtgctgaa 1500 ggtgggcgtg tctgacagct tacctttccc aataattctg ggtcgtgatc tacctgtatt 1560 atttgaattg ttgcatccag cacaggagtg taatatggta gtcacccgat caatggcgaa 1620 accaaatgaa gaacatgtgc atacattgag tacattacct ttctttgaga ccgaaataga 1680 gactggcgtg gcaaagagga gaaagacaag gagggagaag atcagagaaa aggtaaaaca 1740 caatgccttt aaaccttctg tctcatgtca tttgcctgtt aacttccagt tgccaacaaa 1800 catcattcag atgcaacaga gtgatatcag cctgaaggag tgtttgagta gatctgtaga 1860 tgctgaggag gagaaacctg aggatgacaa gactgtgacg tttgtgagga tcaaggggat 1920 actttatcgc cagattgggc cacggagaca gttggtggtt ccgcagtgtg ttcgagaggt 1980 tgtattacat ctgagccact ctgttccttg ggctggccac ctggggaaga ataagaccat 2040 tgctcgaatc aaaaggtatt tctattggcc tggcttggaa ggagatgtag cccagttctg 2100 caaaagttgt cctgtctgcc aaaaagtctc actccagcga cctaggaaag caccgttaca 2160 accccttcct attatcacca ccccatttga aaggcttggg atggatgttg tcggacctct 2220 tgaaaagagt cgatcaggta accggttcat gttggtgata actgattatg ccacaagata 2280 cccagaggta tttcccttaa ggtctgtgaa agcgaaatct gttgctactt cactggtcca 2340 gttattttcc agagttggtt tccccatgga aatcttaact gaccagggta ccaattttat 2400 gtcaacactg ctaaaacagg tttaccggct tctgggcatt aagagcatac gaactacccc 2460 ttatcatcct caaactgatg ggttgaccga acgattcaat cagacgttga aacagatgct 2520 caggaaattt gtaactgaga ccggccaaga ttgggaccag tggctgccct atcttctatt 2580 tgcttataga gaagttcccc aatcttctac aggcttctct ccctttgagt tgctatatgg 2640 tcgtgatgtt cggggcccac tggctctgct taaagaggta tggaaagaca gtcctgatca 2700 gaaagagccc aataatgttg tctcctatgt tttacagatg agggaacgct ttgagaaaat 2760 ggccagtttg gcccaaacta acctgactaa agctaaagaa cagcaacgtc tttggtatga 2820 ccctgcggct cgagagagaa atttacaagt tggtcagaag gtacttgtga tgttgcctaa 2880 tgaggaaagt aagctgttgg ctaaatggca aggccctttt gacgttaaga ggaaactcag 2940 tcccacaact tatgagatct ttacaccggg tcaagcacga gccagtcgag ttctgcatgt 3000 taacctgtta aaagagtgga cccctcgagc taaaggagag ttaaggtctc taatggtctg 3060 tcgagtcgga gaggaggaat cagatgagca gtatttgcct gagccaaatg tgggagatct 3120 tgagctggat catctgtctg aggcacaaca gtcgcaagtg agagcagttt gcacttctga 3180 tgtgttttct gaattcccag gctttaccaa actaatccag catgatatag tcttgaaacc 3240 tgaggctgtt gtgagacgtc gaagttacag agtgcctgag cgactgcagg agaagttgaa 3300 ggaagaagtg gaaatgatgt tacatctggg aattattgag ccatcccaca gtgagtggtg 3360 tcatccggtt gtcctcgttc ccaagaaaga cgggagtatt cgattttgta ttgacttcag 3420 gtatctgaat tcagtttcta aatttgattc ttatcctact cctcgtatta gtgatctgat 3480 tgaccgattg ggacaagccc aatatataac gacaatggat ctttctaaag gctattggca 3540 gatttccctg accttgtctt ctaggccgct cactgccttc agaacaccat ggggcctatt 3600 tcattttaag gtgttaccct ttggattgca tggagctggt gcgacgttcc agagattaat 3660 ggataaggta ttgcatggat tgacttttgc agctgcatat attgatgata ttgttgtgta 3720 cagcaacacc tgggatgaac acgtgcagaa tcttcaagag gtgatgaggc gtcttcagag 3780 tgcaggcctg actgttaatc cgaagaaatg tgctgtggcc aagagagaaa ctgaatattt 3840 gggctatgtc ataggtcaag ggatggtccg accgcaaata gggaagatcc aagctctgga 3900 gaagtgtcct gtgccacaaa cacgcaagga tttgcgatct ttcttgggta tggcagggtt 3960 ttaccatcga ttcataccca atttttccag tagagctgcc acgttgaccg atatggtggg 4020 gacacgttgc ccgaatcagt tgcagtggac tgaggacaaa ttgaaagctt tccgagacat 4080 ccagggagct ttaacgacca atgctgtact ttataatcct gactttaacc ttccatttgt 4140 tgtacagact gatgcctcag agaggggcct aggtgccgtt ctcttgcagg gtatgcctga 4200 agccagacga cccgttgcct ttatcagccg aaagttgttc ccaagggaga tgcgttactc 4260 taccattgaa aaagagtgtt tggccgtgaa gtgggctttg gattctttga ggtactatct 4320 gttgggtgga gagtttgtct tggagaccga tcacaaggca ttgcagtggt tagagaaaat 4380 gaaggacaca aatgggcgga taacgagatg gtatcttgca atgcagccat tcagtttcaa 4440 agtacaacat gtgccgggaa aaactaatgt tactgctgac tacctgtccc ggtgctctag 4500 tgaagttctg gaagaggggg agag 4524 // ID DNA-8-7_DR repbase; DNA; ZEB; 1049 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE DNA-8-7_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1049 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1722-1722 (2008). XX DR [1] (Consensus) XX CC Copies are ~91% identical to the consensus. TSD is 8-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1049 BP; 337 A; 184 C; 166 G; 360 T; 2 other; catgggcgtt gctaggccta ttttaggggg gctgtagccc ccctatattt ctactcagcc 60 cccctaaaac ttttgcgatt agctcataaa ctgtacacta aattatcgcc tcagtcccct 120 ttaaatttga tatgaaaacg gcggcagaat tcggctcctc cccgtctttt gtttttcatt 180 tgatcagcaa accacagccg tggacagcga gagaacgagg tgtgtgttta ttgataaatt 240 gttataatat ctgcttattt gcagttcttt gatatagata accttgtatc acctgtatcc 300 caatgtcacg gaggagcagt cgggttttct cggcgttcac ctcatcagca cagctgtttc 360 ctccagygaa aaatgtaact cttaacgaac atttcgctat ttttatttca aaatgaacta 420 ccaatattaa acactttaca gttcgtgtgg cattaactat accatacagt cttgcacaga 480 aaggattaaa cacagtgaca gaagcactta gagaatgtac tcgttttaac tgtcactgag 540 tcactgacag agatagaaac atcaagtgtt gtctagtatt aaaaaaaaaa atacttctta 600 tttattatta tcattcagat cgtgaaatat gtgaattagc ctgtaagttt aaaaatatat 660 atttatatgt atgtcattta attagaaaag tggcagttta tttatttaat acatggaaac 720 aaaaaattgc acttgaatat agaaagtggt ttttactaca gtaaacaggt gggttgagcc 780 tatttggctt attcagaact caaatggctg tgcctgtcta tgtaatacag tggaatayac 840 atatgtatac aaccctacca tattaaacaa acatttttaa gtgtattact attaaattac 900 tacttataaa tgtaatatta ggcttatata tatatatata tatatatata tatatatata 960 tatatatata tatatatatt tttttttttt tttttttacc ttataaggag ctgagccccc 1020 ctaaaatgaa aatcctaaaa tcgcccctg 1049 // ID HATN3_DR repbase; DNA; ZEB; 571 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE HATN3_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HATN3_DR; nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-571 RA Kapitonov V.V.; RT "HATN3_DR, ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(5), 23-23 (2002). XX DR [1] (Consensus) XX CC Several thousand copies of HATN3_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~8% divergent from the consensus sequence. The genome CC is populated by multiple subfamilies related to HATN3_DR. CC This element is characterized by imperfect 14-bp terminal CC inverted CC repeats and the 8-bp target site duplications. XX SQ Sequence 571 BP; 136 A; 120 C; 133 G; 182 T; 0 other; caggcccgga ttggctaatc gggaggaccg ggagaattcc cggtgggccg gtccgttttt 60 tggccgcgag ggccggtgtc cctagctcca gaatctgttg ctctcagcag tcacactttt 120 taaatttaat ttatttattt acttgaccac agccttctta ttcattattt taccgcagct 180 ctgctctttt tatctatttt ctcgcagcct cgtgagcaag atgcagcctg caggttaatg 240 atgatataac tcagatgagt cacctttcaa tgtacagctg ttgtggtcca gtggttagca 300 cgttaggtta cgacgccgcc gacccgggtt cgatcctcgt ctgagtaatt tttttttttc 360 atttttattg ttaagacata taatactgtt agggttgttg aacatttgaa gttctaaagc 420 agctgttttc tcaaaaaaaa aaaagacgtg atagtgtaat tagaaactga attggaaatg 480 accttatttt aatatagtca gtcgtgaact gaggtgggcc ggtctgaggc ttgaaactcc 540 agggctgaaa aggagtccca ctccggccct g 571 // ID piggyBac-N2_DR repbase; DNA; ZEB; 536 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE piggyBac-N2_DR is a putative piggyBac nonautonomous DNA DE transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW TSD TTAA; piggyBac-N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-536 RA Bao W. and Jurka J.; RT "piggyBac-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1617-1617 (2008). XX DR [1] (Consensus) XX CC The elements is characterized by 13-bp TIR and TTAA specific TSD. CC It is putatively classified as piggyBac transposon. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 536 BP; 197 A; 82 C; 65 G; 192 T; 0 other; ccctttaact accccaccaa gaaaaaaata aataaatttg gattgcattt cttaaactcc 60 taatgtcgct agttaacaca ctcaatgcat tttttttttc aacacatccc ataatttcta 120 aaatatcaac ctctaccaaa tggttgatat gtcggtttat ggtaaaagta cagtaccgga 180 attaatacat atactatgaa aaatctgaaa atatgaagtg taagaccaat ttatttaaaa 240 acataatcaa aataaaacat ttttgaatac taaatcatct ttattttttg aagtatgcca 300 taaaatattt cagattctca tttaacatgt tttcttgttt tttttttttt acaataaaac 360 caaaatcaag tgtagtagtg caacaggatt atgtttgttt gatttttttt gtaaaccaac 420 aatgctgtgt gtgtaaacca ttatttaaaa cagtcattct ttaaatgact ttaacagtca 480 ttatttaaaa cagtcaaaat atggtcaacc atttggtaga gcaggcagtt aaaggg 536 // ID ERVN1-LTR_DR repbase; DNA; ZEB; 345 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE A long terminal repeat of the ERVN1_DR nonautonomous endogenous - DE a consensus sequence. XX KW Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERVN1-I_DR; ERVN1-LTR_DR; ERVN1_DR; KW nonautonomous LTR retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-345 RA Kapitonov V.V. and Jurka J.; RT "ERVN1_DR, a miniature nonautonomous endogenous retrovirus."; RL Repbase Reports 4(12), 312-312 (2004). XX DR [1] (Consensus) XX CC ERVN1-LTR_DR is a long terminal repeat of the ERVN1_DR CC miniature nonautonomous endogenous retrovirus. Internal CC portion of ERVN1_DR is deposited in Repbase as ERVN1-I_DR. CC Solo copies of ERVN1-LTR_DR are flanked by 5-bp target site CC duplications. Some copies of ERVN1-LTR_DR were multiplied not CC during retrotransposition of ERVN1_DR but due to CC transpositions of the hAT-2N1_DR nonautonomous transposon. XX SQ Sequence 345 BP; 71 A; 87 C; 109 G; 78 T; 0 other; tgtggcgagg gggcgtggcc gagagccgtg ggaacggagt gaggccaccg cgtaagtgga 60 tacacctgcg gtgcgcacct gcctcggatc ccacggaagg agctctggac cataaaagga 120 ggaacgatgg cagaggaagc cgagagagga ccgggcccgg acatatttta cttttgcttt 180 cagtttgacg gcagtctccg tgaggagctg ccgtccgttt gttttggttt agacggcagt 240 ctccgtgagg agctgccgtc actgttatta ataaatcatt ttaaaactgc gtcggttctc 300 cgcctccttc ttcccggcgg ccaaagggcc gtgaacttca ttaca 345 // ID DIRS-13_DR repbase; DNA; ZEB; 5732 BP. XX AC . XX DT 07-JUN-2010 (Rel. 15.11, Created) DT 07-JUN-2010 (Rel. 15.11, Last updated, Version 2) XX DE A family of DIRS-type LTR retrotransposons - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5732 RA Kojima K.K. and Jurka J.; RT "DIRS-type LTR retrotransposons from zebrafish."; RL Repbase Reports 10(11), 1841-1841 (2010). XX DR [1] (Consensus) XX CC This consensus is generated from 9 sequences with >91% identity. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 386..1924 FT /product="DIRS-13_DR_1p" FT /translation="MSKVTVEKGKETSSRSCPSACGFMISGRDSHPMCIAC FT MGVKHAQASLANTESCVHCQAMPMRILERRLRVAASSKDDPVLSAAPSSAN FT DAKSSPPRDLASWGDIMEVESPEYEPLFDQQLLAGGDEMEGDEEEDDETLA FT RLLRDEPDDEEEDVIRPSSHASRPTSAQSGEVASTVVDCDLTEVCKRAAAK FT LGVAWPVTPGHPGVKRDVYDGKRLAPRAPPAKQLLPALPDCITEMKRSWDK FT PFTNRVPVKGYSSLDVSEMEGLGLSNPPVVEQTVALHLHPNRRAAVSSATP FT SLPGKMERFTASMYQKIYKSSALTVRALNVTSLLTAYQAELLEELGTQLDA FT GNPNPAVWEEICNITDLNLRASRGAVQSSGRTMALAVAGERSLWLNLSSIG FT DREKLDYLDAPVDSSGLFGQSVASMRQRCDLKKKDEETFDLCLPRKRTSRP FT PVPATRPTPPTQGRKFVSASRPPKPHSTEQTARPPAAPNAKPWGKQSFAAA FT AAKSRSANPSAKKKRDS" FT CDS 1934..4291 FT /product="DIRS-13_DR_3p" FT /note="tyrosine recombinase. Truncated at FT C-terminus." FT /translation="MFGEERRAAPRVFCDVFSTVRRVCFPTRKNAEEEEFG FT RRNGSELCSGQFSFPSTHNVPWIADGSSTSSCAEHSATMRSAEFRHTFASV FT LSQSTSFPVFMANKKSPVASRLNAEGAESQKQNKNTEKSESSKPRDGPVSE FT PNRSWPNFSLATRGRQRSTSGAGSAAASVSSPGGLDRVFSAPVGPRNCNQG FT LQASVCWKTPSFQRGYSICGKRGFGSGAGGRDILPHGEGSYKTSSSGGNSD FT GFLLTLLSNPKERWESAPNTGPACFKQALKEVQIQDAHSRSSHSFDPPGRL FT VRLDRSQGCLLSYKHLSSTQEISQVFLSERSVRICYSSIRAQSSSAGVQQM FT HGSSAVSAETQGSAHIRLSGRLSDMRSHTRARGAGCRDSFVSSHKAGIQNQ FT QRQKPADSVPGDRVSGPAPRLGNLPRHAFGEKDYGVRSMPDPFSQRKPGLV FT QNLSPSDGHDGFISVCDPTGVIKDERFSALDYREAPVPAQAPRAQGTDIHG FT VCDGSPPLEKPLHIPIRGSAGERVSEKSRHYGRIPNGMGSRISGQIGKRSL FT DTPTARASHKHAGADGSFSGFKTFSSILEGFSCFSQDGQHNGGGVHQPPRG FT NAIVAAAQSSAQADCLERRSLELTARDARPGSQKCGGGSSVEGQPDVRRMG FT SPPAGGESNLGDVRQGCRRSLRLAGKRKMSAVFLSRGRGCAVGCGCAGAPV FT AKRAAVCIPTVKSNFSHPGQSKRKRVVSPSNSPPLAGRLWLAEIAELLQGE FT PWPLPLRRDLLSQAGAQIFHPHPEQINLWVWPVKG" FT CDS 4588..5322 FT /product="DIRS-13_DR_4p" FT /translation="MKGVKRVRPAVKRLVPSWDLSLVLDALTRAPFEPLED FT IPIKLISLKTALLLALVSAKRVSELHALSVHQACMDFSRDDDRVTLLPNPA FT FVPKVSDSAYNCSAIELHAFHPPPHLSAEERKLHTLCPVRALRLYVNRTRS FT FRRSDQLFVSWATRRKGEAVSSQRLSHWIVEAIELAYTSRGLRSPEGLRAH FT SSRGIATSWALFRGISVRDICKAASWSSPHTFIRFYRLDVTAPSLAHAVLN FT VCPL" FT CDS 1977..4283 FT /product="DIRS-13_DR_2p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MSSPQSAAFVSPPAKMQRRRNLGGEMVQSCVQASSAS FT PQHTMFRGLPTAVPQVHVQSIRPPCGLPSSGTRSQVCCHKALHSLCSWQIK FT NPLSHPGSMPRAQKAKNKIKTQRKAKAQSRETGQSLSQTDPGQISLWPQEG FT ASAPPVGPVRQQPLSLHPEAWIECSLHPWVLATVTRGYRLQFAGKPPPFNG FT VIASVANEDLAQVLEAEISSLMEKGAIRRVPVEETQTGFYSRYFLIPKKDG FT SLRPILDLRALNKHLRKYKFKMLTVGALTRSIRRGDWFASIDLKDAYFHIS FT IYPAHRKYLRFSFQNEVFEFVTLPFGLSLAPRVFSRCMEAALSPLRHKGLR FT ISAYLDDYLICARTRERAERDAETLSSHLTRLGFRINNAKSQLIPSQEIEY FT LGLRLDSVTYRAMLSERRITAFGLCLTHFRKGNLVSFRTCLRLMGMMASSL FT SVIPLGLLKMRDFQRWITAKRLCPRRHLARKVRISTECVMALHHWKNPCIF FT RSGAPLGSVSLRKVVTTDASLTGWGAVFQGRSVNGRWTHRLRELHINMLEL FT MAVFLALKHFLPFLKGFHVLVRTDNTTVVAYINRQGGTRSLQLHNLARKLI FT VWSAAHLSSLRATHVPGVRNVGADLLSRGNPTYGEWVLHPQVVNQIWEMYG FT KAAVDLFASRANAKCPLYFSLEDEDAPLGVDALAHPWPNVLLYAFPPLSLI FT SPTLDRVRESGLSLLLIAPRWRAGCGWQRSPNFCKESPGRSRCDGTCCHKR FT GHRYFTLTQNKSTFGSGP" XX SQ Sequence 5732 BP; 1278 A; 1532 C; 1577 G; 1342 T; 3 other; gttctctttc aaacattcgc tcggtatctc actatgggaa acgcctcagg cgtgacagac 60 tgtggaagca ccaattacac cacgtccgtc tgaggacgga caaatcacag ctcgcgatta 120 ggagcccacc ctctccttta taatccgctg tgattccccg aatcggcatt ctagcactct 180 tcctcgtgaa gactttcctt aagcacccag gctgcgctgc tctgttcaca aacgtaccgt 240 caaggctaac gttgccgaac gcagctcagt catctgaatc tgtcgtgggt tctttgctcg 300 ttacggagta ttttttactg cttttactag cagcaactga tatagctaac gcgttaaggc 360 gagctaactc tgattttgac tcaccatgtc taaagtcacg gtggagaagg ggaaggagac 420 ctcttcgcgc tcctgcccgt cagcgtgcgg atttatgatc tcgggccggg attcacatcc 480 gatgtgcatc gcatgtatgg gtgtgaagca cgcccaggcc tcgttggcga acaccgaaag 540 ctgcgtgcac tgccaggcaa tgccgatgag gattttggag agaaggcttc gagtggcggc 600 atcttccaag gacgatccgg ttctctctgc cgctccgtcg tcagctaatg acgccaaatc 660 ctctcctccg cgggatcttg cttcgtgggg ggacatcatg gaggtagaat cccccgagta 720 tgagccgctg ttcgaccagc agctactggc gggcggggat gagatggaag gggatgagga 780 ggaggatgat gagacactag cccgtctcct ccgagatgag ccggacgatg aggaggagga 840 tgtcatccgc ccttcttccc acgcatccag gccgactagc gcgcaaagcg gtgaggtggc 900 gtcgaccgtg gtggattgcg acctcacaga ggtttgcaaa cgtgctgccg ccaagctcgg 960 cgtggcgtgg cctgtgactc ctggtcatcc aggagttaaa cgggatgtgt atgacgggaa 1020 aaggctcgct ccccgcgctc ccccagcaaa acaactactt cccgctctcc cagattgcat 1080 cacggagatg aagcggtcat gggataaacc ctttaccaac cgcgtgcccg tcaagggtta 1140 ctcgtcgctc gacgtgagcg agatggaggg gctagggctc tctaaccctc cagtggtcga 1200 gcagacggtg gctcttcacc tgcatccaaa caggcgggct gctgtttcmt cggccacacc 1260 gtccctccct ggtaaaatgg aacgcttcac tgcttccatg tatcagaaga tatataaatc 1320 ttcggcgcta acagtgaggg ctttgaacgt cacttcattg ctgacggcct atcaagccga 1380 attattggag gagctkggca cgcagctgga cgccggcaat ccaaacccgg cagtctggga 1440 agagatctgc aacattacag acctcaatct gcgcgcctca cgcggggcgg tgcaaagctc 1500 cggccgtacc atggctctcg ccgttgcggg cgagcgatcg ctgtggctga atctctcgag 1560 tatcggggac agggaaaaat tggattatct cgacgctccc gtagactcga gtgggttatt 1620 cggtcagtcg gtcgcgtcga tgaggcagag atgcgacctt aaaaagaagg acgaggagac 1680 ttttgacctc tgcttgcctc gtaaaaggac atcacgtcct cctgttcctg ctactcggcc 1740 cactcctccg acgcagggca gaaaatttgt tagtgcgtcc agaccgccta aaccccacag 1800 cacagagcag acggctcgcc cccccgctgc tccaaacgct aagccatggg gaaagcagtc 1860 attcgcggca gcggctgcaa agtccagatc cgctaacccc tccgccaaaa agaagaggga 1920 ctcctaggac gctatgtttg gagaggagag acgggcagca cctagggttt tctgcgatgt 1980 cttctccaca gtccgccgcg tttgtttccc cacccgcaaa aatgcagagg aggaggaatt 2040 tgggcggaga aatggttcag agttgtgttc aggccagttc agcttcccct caacacacaa 2100 tgttccgtgg attgccgacg gcagttccac aagttcatgt gcagagcatt cggccaccat 2160 gcggtctgcc gagttcaggc acacgttcgc aagtgtgttg tcacaaagca cttcattccc 2220 tgtgttcatg gcaaataaaa aatcccctgt cgcatccagg ctcaatgccg agggcgcaga 2280 aagccaaaaa caaaataaaa acacagagaa aagcgaaagc tcaaagccga gagacgggcc 2340 agtctctgag ccaaacagat cctggccaaa tttctctctg gccacaagag ggcgccagcg 2400 ctccaccagt ggggccggtt cggcagcagc ctctgtctct tcacccggag gcctggatag 2460 agtgttctct gcacccgtgg gtcctcgcaa ctgtaaccag gggttacagg cttcagtttg 2520 ctggaaaacc ccctcctttc aacggggtta tagcatctgt ggcaaacgag gatttggctc 2580 aggtgctgga ggcagagata tcctccctca tggagaaggg agctataaga cgagttccag 2640 tggaggaaac tcagacgggt ttttactcac gttactttct aatcccaaag aaagatggga 2700 gtctgcgccc aatactggac ctgcgtgctt taaacaagca cttaaggaag tacaaattca 2760 agatgctcac agtaggagct ctcactcgtt cgatccgccg gggcgattgg ttcgcctcga 2820 tcgatctcaa ggatgcttac tttcatataa gcatctatcc agcacacagg aaatatctca 2880 ggttttcctt tcagaacgaa gtgttcgaat ttgttactct tccattcggg ctcagtctag 2940 ctccgcgggt gttcagcaga tgcatggaag cagcgctgtc tccgctgaga cacaagggtc 3000 tgcgcatatc cgcttatctg gacgattatc tgatatgcgc tcgcacacgc gagcgcgcgg 3060 agcgggatgc agagactctt tcgtctcatc tcacaaggct gggattcaga atcaacaacg 3120 ccaaaagcca gctgattccg tcccaggaga tagagtatct gggcctgcgc ctagactcgg 3180 taacttaccg cgccatgctt tcggagagaa ggattacggc gttcggtcta tgcctgaccc 3240 attttcgcaa aggaaacctg gtctcgttca gaacctgtct ccgtctgatg ggcatgatgg 3300 cttcatctct gtctgtgatc ccactggggt tattaaagat gagagatttt cagcgttgga 3360 ttaccgcgaa gcgcctgtgc ccgcgcaggc acctcgcgcg caaggtacgg atatccacgg 3420 agtgtgtgat ggctctccac cactggaaaa acccctgcat attccgatca ggggctccgc 3480 tggggagcgt gtctctgaga aaagtcgtca ctacggacgc atccctaacg ggatggggag 3540 ccgtatttca gggcagatcg gtaaacggtc gttggacaca ccgactgcgc gagcttcaca 3600 taaacatgct ggagctgatg gcagtttttc tggctttaaa acattttctt ccattcttga 3660 agggttttca tgttttagtc aggacggaca acacaacggt ggtggcgtac atcaaccgcc 3720 aagggggaac gcgatcgttg cagctgcaca atctagcgcg caagctgatt gtctggagcg 3780 ccgctcactt gagctcactg cgcgcgacgc acgtcccggg agtcagaaat gtgggggcgg 3840 atcttctgtc gaggggcaac ccgacgtacg gagaatgggt tctccacccg caggtggtga 3900 atcaaatttg ggagatgtac ggcaaggctg ccgtagatct cttcgcctcg cgggcaaacg 3960 caaaatgtcc gctgtatttc tctctcgagg acgaggatgc gccgttgggt gtggatgcgc 4020 tggcgcaccc gtggccaaac gtgctgctgt atgcattccc accgttaagt ctaatttctc 4080 ccaccctgga cagagtaaga gaaagcgggt tgtctctcct tctaatagcc ccccgttggc 4140 gggcaggctg tggttggcag agatcgccga acttctgcaa ggagagccct ggccgctccc 4200 gctgcgacgg gacctgctgt cacaagcggg ggcacagata tttcaccctc acccagaaca 4260 aatcaacctt tgggtctggc ccgtgaaggg ttaaatctaa atgctctcgg gcttccacgg 4320 gaggtcataa gcactatcca gagcgctagg gctccttcaa cgcgctcact gtacgacctg 4380 aaatggcgtg tgtttgagga ctggtgcact agtgaagggc ttatcccctt ccagtgctcg 4440 gtgagggata ttctctgctt tttacaaagc ttactggata atgggagagc gttctctacc 4500 attaaagttt acctggccgc tatttccgct tgccatgtag ggttcgaagg ggtttcagtg 4560 ggacggcacc cccttatatg ccgttttatg aaaggagtga agcgcgtgag accagcggtg 4620 aaacggctgg tcccgtcatg ggatctgtct ctcgttctgg acgctttaac tcgggcccca 4680 ttcgaacctc tggaggatat tcctattaag ctcatatcac tgaagacggc cttactactc 4740 gctctagtct cagctaagcg agtgagcgag ctgcatgctt tgtcggtgca tcaagcgtgc 4800 atggacttta gccgggatga tgacagggtt acactgctac caaacccggc attcgttccc 4860 aaggtcagtg attckgctta taactgttct gctattgagt tgcatgcctt tcatccccct 4920 ccgcatttgt cagcggagga gagaaagtta cacactttgt gtcccgtacg ggcgttacgc 4980 ctctacgtga acaggacgcg atcgtttagg agaagcgatc agttatttgt gtcatgggcc 5040 actcgccgta agggtgaggc tgtttcatct caacggctct cgcactggat agtggaggcg 5100 atcgaactgg cctatacatc taggggtctt cgttccccag agggtttaag agctcattct 5160 tcaagaggta tcgctacctc atgggctttg ttcagaggga tttcggtaag agatatttgt 5220 aaagcggcaa gttggtcttc cccgcataca tttatcaggt tttaccgcct tgacgtcacc 5280 gcgccttcat tggctcacgc ggttcttaat gtgtgtccct tataagcttg aaaggttgtc 5340 tttctaccga ttctgtccgg gggcttcgga aagcatgcgt tacgggagtg gtatatatca 5400 ttcccatagt gagataccga gcgaatgttt gaaagagaac gctaggttac ttgcgtaacc 5460 ccggttctct gagaacagag cgaggtatct caccagactt cccttcacgc attacactcg 5520 ggaagagcgt gcttcgaatg ccgattcggg gaatcacagc ggattataaa ggagagggtg 5580 ggctcctaat cgcgagctgt gatttgtccg tcctcagacg gacgtggtgt aattggtgct 5640 tccacagtct gtcacgcctg aggcgtttcc catagtgaga tacctcgctc tgttctcaga 5700 gaaccggggt tacgcaagta acctagcgtt tt 5732 // ID RTE-1_DR repbase; DNA; ZEB; 4083 BP. XX AC . XX DT 29-APR-2005 (Rel. 10.04, Created) DT 15-SEP-2009 (Rel. 14.1, Last updated, Version 2) XX DE RTE-1_DR is a non-LTR retrotransposon - a consensus sequence. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; reverse transcriptase; endonuclease; KW RTE-1_DR. XX NM RTE-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4083 RA Kapitonov V.V. and Jurka J.; RT "RTE-1_DR non-LTR retrotransposon from zebrafish genome."; RL Repbase Reports 5(4), 94-94 (2005). XX DR [1] (Consensus) XX CC RTE-1_DR is a non-LTR retrotransposon that belongs to the RTE CC clade. Copies of RTE-1_DR are ~1% divergent form the consensus CC sequence. RTE-1_DR encodes a 1049-aa protein composed of CC endonuclease and reverse transcriptase domains. XX FH Key Location/Qualifiers FT CDS 891..4037 FT /product="RTE-1_DR" FT /translation="MLTGLSEDLRGIDDSRKTAVINNELLRLNVDIAALQE FT TRLADSGTLKEKDYTFYWQGRAPDEPRQHGVAFAVKNNLLSMVEPGRNGTE FT RLLTLRLNTTTGPLTLVSVYAPTLNATLETKDKFYGNLTSVINNIPDKEQL FT VLLGDFNARVGANHESWPSCLGKFGIGKMNENGQRLLELCAFHNLCIANSY FT FQTKPQHKVSWRHPRSKHWHQLDLILVRRSAINCVLHVRSYHSADCDTDHS FT LVCCKIRLNPKRFHRLRKQGNPRINVSRMSQPDLTQKFAETLVKELDTAQT FT GDSALEMWESLRNTMQRTALATFGKRTTKTHDWFEAKASTMIPCIEAKRAA FT LTEYKRSPTQKNLQILRSTRSKAQHIARYCANEYWQELSNDIQKAAIAGNI FT RGMYDGIKKALGPTQCKTAPLKSSTGEIISDKQQQMERWVEHYSDLYSRQN FT TVSSAALDVIKCLPTMEELDEEPTAEELRKAIDKLASGKAPGSDGIPPDLL FT KQCKDSLLHPLHKALCQCWKEGAVPQDMRDAKIITLYKNKGERSDCNNYRG FT ISLLSIVGKVFAKVILARLQKLAERVYPESQCGFRAERSTVDMIFSLRQLQ FT EKCREQQMPLYISFIDLTKAFDLVSRDGLFKILPKIGCPPKLQSLIESFHT FT DMKGTIQFNGSCSEPFSISSGVKQGCVLAPTLFGIFFALLLRHAFGSATEG FT IYLRTRSDGKLFNLSRLKAKTKVRETLIRDMLFADDAAVTTHTQEELQSLM FT TRFSMACKDFGLTISLKKTNVLSQDTATPPTITVDDYQLDVVHQFTYLGST FT ITDNLSLDAELDRRIGKAASTLARLTTRVWTNHRLTTATKMAVYNACVIST FT LLYGSETWTTYARQERRLNTFHLRGLRRILGITWQDKVSNVKVLTRAGLPS FT MYTMLRQRRLRWLGHVCRMEDGRIPKDILYGELSSGKRTTGRPFLRFKDVV FT KRDMKALDINTKSWEDLAADRLKWRCTLTKQLKSGEKNMMRASEDKRVHRK FT VQSVSQGATYQCDCCGKKCLSRIGLFSHQRRCLRQSVMPQYKS" XX SQ Sequence 4083 BP; 1141 A; 1112 C; 993 G; 837 T; 0 other; ggttccaagg ttgtcatagc cgggggcggt aatggggata agctcccact atctataaag 60 tacccctata cggcgtgcgt ctcaaatagc ctctgacaac caagtccagc tcctggcctt 120 caagtgtggt ttagctacca aacccggcgg aactgttttc actgacagga gaaggggcgc 180 aggcgggtca ctggcgcctt acaaccagtt gcttcgggga gatgatattc gttagcctgg 240 gaaggcagat catctagggg aaggcaaccc tgttttcaaa cctccgctgc cttgcggcta 300 tatccattca tggaaaaggc ttcaggagta aacctcgagg aaaaatccgg agtcggagtc 360 cctaaggcag tttaacgctg tttgcagcct cactctggca actcctgcga cggcgccgat 420 accaaactgt agcagccctg ctgttccttt ggatttgtcg acaacgtgga gaggggggac 480 ccgctacatg ggcaacagcc tgtcctccat aatacattgc cctggctagt atccgatctc 540 gcacaccctg gagaggacac tccagcctcg ctagcactct ggcgtggata caacacgggg 600 agcagtagtt taccggttat aagccacagc tcggttggcg tagagcaagg cgccagggac 660 tgctcccgac ggtgggaggg atcttcgggt cccactggac agttaccgcc cgcctcaagc 720 tgggcagccc ccagtcaatt aggtactgcc ccgccacagt ctgctttcct cattgggtgc 780 atggggaata ggagcatttc gaacagcaga ctgcaaccat cgcaccagac aataaaataa 840 cacagagaaa gagaccagct ctaaaactgg gatgctggaa tgtccgtaca atgcttacgg 900 gcctctccga ggacttacgg ggcattgacg actcacgaaa aacagctgtc attaacaacg 960 aactactgag gctaaacgta gacatcgctg ctctacagga aacacgacta gcagactcag 1020 gaactctaaa agaaaaagac tacaccttct actggcaggg aagggcccca gacgagccca 1080 gacagcatgg cgtggctttt gctgtgaaga acaacttact gagcatggta gaaccaggca 1140 gaaatggtac agaacgactt cttaccctcc gcctcaacac caccacaggc cctctcactc 1200 ttgtcagcgt gtacgctcca actctgaacg caacactgga aactaaagat aagttttatg 1260 ggaacctaac atctgtcatt aacaacatcc ctgataagga acaactcgta cttctgggcg 1320 atttcaatgc cagagtgggt gcaaaccacg aatcatggcc ctcgtgccta ggcaaatttg 1380 gcattggaaa aatgaacgag aacggccaac gcctgctcga gctttgcgct tttcacaacc 1440 tgtgcatcgc caactcatac ttccagacca agccccagca taaagtctcc tggcggcatc 1500 cgcggtcaaa acactggcac cagctggacc tcatcttagt tcgccgctca gctatcaact 1560 gcgtcctgca cgtacgctct tatcacagtg ctgattgcga cacagaccac tccttagtgt 1620 gctgcaagat caggttaaac ccaaaaaggt ttcaccgttt aaggaaacaa gggaatcctc 1680 gcatcaacgt cagcaggatg tcgcagcctg atctgacgca gaaatttgca gaaacccttg 1740 tgaaagaact tgacaccgca cagacaggtg attctgccct ggaaatgtgg gaatcactac 1800 gaaacacaat gcaacgcact gccctggcaa cttttggaaa gaggaccaca aagacgcatg 1860 actggtttga agcaaaggcc tctacgatga tcccatgcat agaagccaag cgtgcggccc 1920 tgacagaata caagcggtca ccaactcaga agaaccttca aattctcaga tcaactagga 1980 gcaaggctca acacattgcc agatattgcg caaacgagta ttggcaagag ctcagcaacg 2040 acatccagaa agcagccata gcggggaaca taagaggcat gtacgacggc attaagaaag 2100 cgctaggccc cacccagtgc aaaacggcac cccttaagtc atctactggg gaaataatct 2160 ccgacaagca acaacagatg gagagatggg tggaacacta ctccgacctc tactctagac 2220 agaacacggt gtcctccgca gcactagacg tcattaaatg cctgccaacc atggaagaac 2280 ttgacgagga gccaacagca gaagagctca gaaaggctat cgataaactg gcctcaggca 2340 aagcccctgg cagcgacggg attcctccag acttgctgaa acagtgcaag gattccctac 2400 tgcaccctct tcacaaagcc ctctgtcagt gttggaaaga aggggccgta ccgcaggata 2460 tgagggatgc taagatcatc accctctaca aaaataaggg tgagagaagt gattgcaaca 2520 actacagagg catctccctt cttagcatcg ttggaaaagt atttgctaag gtcatcttgg 2580 cccgactgca gaagctggct gaacgtgttt acccggagtc acagtgtggt tttcgcgccg 2640 aacggtcaac ggtagacatg attttctccc tcagacaact gcaggagaag tgtagagaac 2700 agcagatgcc cctatacatc tcctttattg acctcaccaa agcctttgac ctggtcagta 2760 gagacggact ttttaaaatc ctccccaaga ttggctgccc accaaaactg cagagtttga 2820 ttgaatcttt ccacacagat atgaagggaa caatccagtt caacggcagt tgctctgagc 2880 ctttcagtat aagcagtggc gtcaagcaag gctgcgttct tgcccccaca ctgttcggaa 2940 ttttctttgc cctgctccta aggcatgcct ttggttcagc aacggaagga atctacctcc 3000 gcaccaggtc agatggcaag ctatttaatc tttctcgcct gaaagccaag acaaaggtac 3060 gcgagacact gattagagac atgctttttg ctgacgacgc tgcagtcacc acacacaccc 3120 aggaagaact acagtcgctg atgacccgtt tttccatggc ctgcaaagac tttgggctga 3180 ccatcagttt gaaaaaaaca aatgtcttga gccaggacac tgccactcca ccaaccatca 3240 cagtagatga ttaccagctc gatgtcgtcc accagttcac gtacttgggc tccaccatca 3300 ccgataacct ctccctggat gctgaacttg acaggaggat cgggaaggca gcctctactc 3360 tagcccgcct gacaacccga gtgtggacaa accataggct gacaactgca acaaagatgg 3420 cagtgtacaa tgcttgcgtc atcagcactc tgctgtatgg gagtgagaca tggaccacct 3480 atgcaagaca ggagaggaga ctgaacacct tccacctaag aggtctgcgt cgcattctgg 3540 gcattacctg gcaggacaaa gtctccaacg tcaaagtctt gactcgagcc ggccttccca 3600 gcatgtatac catgctccga caacgtcgcc tgcgctggct tggccatgtg tgccgtatgg 3660 aggatgggag aatcccaaag gatatccttt acggagaact ctcatctggg aagagaacaa 3720 caggacgccc atttctgaga tttaaagatg ttgtgaagag ggacatgaag gcccttgaca 3780 taaacaccaa gtcctgggaa gacctcgcag cagaccgcct gaaatggagg tgcaccctga 3840 ccaaacagct caagtcaggt gagaagaaca tgatgcgtgc gtcagaggac aagcgagttc 3900 accgaaaggt gcagagcgtc agccagggag ccacctatca atgcgactgc tgtggtaaaa 3960 aatgcctctc ccgtattggt ctcttcagcc accaacgacg ctgtcttaga caatcagtca 4020 tgcctcaata taagagctag gatacgtcat ccatggtcaa cactgaccga cggaggccta 4080 cta 4083 // ID hAT-N78_DR repbase; DNA; ZEB; 319 BP. XX AC . XX DT 07-JAN-2009 (Rel. 13.11, Created) DT 15-JAN-2009 (Rel. 14.01, Last updated, Version 2) XX DE Nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N78_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-319 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(1), 20-20 (2009). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 319 BP; 88 A; 56 C; 68 G; 107 T; 0 other; cagtggttct caaactgtgg tacgtgtacc actagtggta cgcaggcttc cttctagtgg 60 tacgcggagg aatgaaatat gtcatgtaca tgctacacac atttcaaaat ttatcaaaaa 120 tgatgtatat aatatgccat atatgacata tagcctatat ttctgaggta atctgccacg 180 ttttttaact gtgcagagtt gtagctgctt tactgggcct actacgctac tgtatttcaa 240 tactgctcat tttggtggta cttggagaga caattttttt ctgaggtggt acttgatgaa 300 aaaagtttga gaaccactg 319 // ID hAT-N37_DR repbase; DNA; ZEB; 433 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N37_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-433 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1307-1307 (2008). XX DR [1] (Consensus) XX CC Members of this family are ~97% similar to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 433 BP; 146 A; 82 C; 79 G; 126 T; 0 other; caggggtcac caacatggtg cccgcgggca ccaggtagcc cgcgaggatc acatgtgttg 60 cccgcaggcc tgttctaaaa atagctcacc atagcgccac ttaccagtaa gcttcatcta 120 atatagaagt aatcatttaa aaatgtaaat atttgcagag atatataaaa ataaagtgtt 180 gcacattgat acattaaata cagggtattt cctaccctgt taaatcattg ttgataactt 240 ttgtgagaat cattaacatg atcagtgtct tcacatggat gaatatcatt aattattacc 300 aaaaacataa agtataatta aaagtaaatt gagcaaattt gttatttgag aagtgtttat 360 caaactggta gccctccaca ttaatcggta cccaagaagt agctctcagt ttcataaagg 420 ttggtgaccc ctg 433 // ID Gypsy-16-LTR_DR repbase; DNA; ZEB; 1890 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-16_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-16-I_DR; Gypsy-16-LTR_DR; Gypsy-16_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1890 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-16_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 6-6 (2005). XX DR [1] (Consensus) XX CC Gypsy-16-LTR_DR is a long terminal repeat of the Gypsy-16_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-16-I_DR. XX SQ Sequence 1890 BP; 602 A; 237 C; 336 G; 715 T; 0 other; tgtaacccag gttactagtg ggtagtatac agcaatagag caaatagaaa tatgaggaaa 60 aaacattggc agaatactcg attaaataat atacattaaa attatttact tatataaata 120 atttataata gagtattata cagcattcaa gggaatagtg ctttagaatg acatcatcgt 180 gccttacact atagcagcag atcagctttc caaacaataa caaatctcgc ttgtctgtta 240 gcttgtgttt accgttgcta tggtgattca gtgttcgcaa gggaaagcag agatgtaagt 300 tgttatatta ataacagtaa tataactatt agatttacat cttttaatgt tttaatgcgt 360 attaactgtg aaaattgacc attaataaga tgctgacatg atttcaatca tatattttta 420 atgaaggatt gagagagaaa aaaaaaacaa gatttgtctc tgagagtgaa atctgcacac 480 gagtgccacc tggagttttc ctgtttgttt cgttttgatt ttgaatgatt tccttggtga 540 gttcgtgttt ttattattta ttattgtttt cattataaca tgatttaaac tagtgtcgaa 600 aaatgtgttt tattaattaa ccatttgttt ggtctagtaa aaacaaagta ttatacagag 660 tattatacaa gtgattgtgt aaagtttaag tgagaaagtt taagtaaaat gctgtcattt 720 gtaaaaagaa aatgccaaat aggagagaga aataggaaaa acacagaaat tcagcagaga 780 gaagcatata catggttatt tactgttgct gtttatttag aaatgtgtta agctgagtta 840 atgcatacac tgtaaaatgc taatggtttc tttggccttg tgttttcaag gccaggtttt 900 tttttttgtt tggttttgtt cgtttgtgca tcttcttctg gtttttcatc ggtgagatgt 960 tctattgacg tcagttgctg gatgtgatct gctctggatt cgaattaatt gaagtttgaa 1020 ctcttttctg gatttggatt gtctcaactt ccatcggcgt gtctaattgt ggagttggac 1080 tggcgagcct cccattgagt ggattctttg gatacacctg ttgaaaagat acaaagtaaa 1140 acctattcag cgtcatggta ggagtgtaaa tttacaacac actcacaagt gcaacatcct 1200 ggatctctgg aactgaattt gcaacctagt tgttttgcaa aagaatcttt ttgaaagttt 1260 atcaccttta ctggactttt ggaacatttt gaaatgtttt aagtgaatgt tttattttca 1320 tttttatttt atttgtacat tgttccttta gagtgttcat tcgttgaaat ttttgaatct 1380 ttatagagag aggttaactg atttaagaag ggaaatcttg attatatgtt aaataataat 1440 acaattttaa tattgtatta ttaacttaca cttgcaaacc tttaatatag tactgttatt 1500 agcatattaa tattatatta ttgttataat agtaaccaaa atataatatt agccttaaca 1560 atagaaaaac agaaaaaaca acaatattaa cttaccttat tagtgatata gcctgtgaag 1620 aaagagaagt gttattagag agtgtaaata agattgaggt cattagataa gtgtgaaaaa 1680 atctagaatt tgttttattt atttattttt atttattttt ttattttata tttcagtgaa 1740 cattttacaa tacaatcttt tattattctt acctctgtgt ccgagtcctt cactgttgtt 1800 gtatcttctc tctctttgga ggaatttaga cttctgagca tctggtccat tgattaaatc 1860 ttataataca tttgtgtacc gagggttaca 1890 // ID hAT-N73_DR repbase; DNA; ZEB; 439 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N73_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-439 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1740-1740 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 439 BP; 159 A; 66 C; 70 G; 144 T; 0 other; cagtggcgca acaagggggt atgcagtata tgcggcgcat aggggcgcca cacatggggg 60 gggcgccgtt gtgccaaaac tatttttaaa attatcctta ttaaaagttt caaataataa 120 aaataaataa atatgttttg ttaaaaaaac atcattattt tgcattttaa acgaatataa 180 tattgttatt gtttaattca aaatcattta acacaataac atgcagtccc acagccgaaa 240 gtgttcatgt agatgtaact gtttagattt gtttttattt ttatcaaaat aaactgtcaa 300 atcatacata atgtcaaaag ttataagatt aatctgaaaa tttgtcttca tttattatta 360 attaattaat ataaaataag agggatcaca caaaatttga acccgcatac ccctcagtaa 420 atgtgtagtt gcgcccctg 439 // ID L1-10_DR repbase; DNA; ZEB; 5708 BP. XX AC AL772154; XX DT 11-FEB-2003 (Rel. 8.01, Created) DT 11-FEB-2003 (Rel. 8.01, Last updated, Version 1) XX DE L1-10_DR is a non-LTR retrotransposon from the L1 clade. XX KW L1; Non-LTR Retrotransposon; Transposable Element; 15-bp TSD; KW L1 clade; L1-10_DR; endonuclease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5708 RA Kapitonov V.V. and Jurka J.; RT "L1-10_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 3(1), 4-4 (2003). XX DR Genbank; AL772154; Positions 89080 94787. XX CC L1-10_DR forms a separate family of L1-like non-LTR CC retrotransposons. CC This element is less than 70% identical to members of other CC L1-like CC families identified in the zebrafish genome. CC This family was active recently (no stop codons in ORF2). CC L1-10_DR encodes two L1-like proteins: 281-aa L1-10_DR1p CC (positions CC 136-978) and the 1260-aa L1-10_DR2p (positions 1588-5367). CC L1-10_DR1p is a putative RNA/DNA binding protein, and L1-10_DR2p CC is CC composed of the AP endonuclease and reverse transcriptase CC domains. XX FH Key Location/Qualifiers FT CDS 1588..5367 FT /product="L1-10_DR2p" FT /translation="MINNSEIKIGSWNVRGLNNLVKLKQVLGRLKQMKANI FT IFLQETHLLKDDVSRVTKRWPGQVFNASFTSRARGVMILIHKSIPFQLKQQ FT YIDPLGRYIILNGTILTSHINLISIYAPNVDDPSFFQNFFFTISAYAGNYV FT IGGDFNCVLNPINDRSSGVDTHQQSKKTILKYMTDLNLVEIWRYLNPNKKE FT YSCFSNTYKTFSRIDYFLISNGLVTKVGKSWYDSILLSDHAPVLFTIQFDN FT LVLPPPRFRFQASWLLDHDFVKFLDNKIDLYFLVNTNQTSASVKWEAFKAY FT IRGEILSYTRYRSKIYQAQLEDLGKQIKVIERHLFSNFDPSKQKELLVLKA FT KYNEITSSKISKNLMWLKQSYYDQGEKSGKLLAWRIKKIQTDRAINSIFLE FT NGEKTVDPLEINNIFKLYYENLYKSEIANNLEGQNNFLDKITFPKLSEETR FT SNLEKNLSIEELSEALQDMCNGKAPGPDGLPMEIYKTFAGKILPHLLEMFN FT ESLGKGILPPSLRSALITLILKPGKPPNEKSSYRPISLMSCDTKILCKALS FT KRIEKFIPNLIMNDQNGFVLGRQAFHNIRRVLNIVFKKQKAKDHAILSLDA FT EKAFDRIEWRYLFEVLERFGLGDGYIRWIKLLYNDPQAEIITNNQISKPFN FT LSRGTRQGCPLSPLLLLFAIEPLAMAIRSSPEIKGIIIGEREHRLSLFADD FT IVVFLSSLELSIQALNALLKVFGEFSGYKVNNNKSALLLLNKDEQRNYTNN FT RQFFNSQEELTYLGIKIVTNIKEIISINYDPLLKKVMDSLERWNAMPISMI FT GRINIIKMSILPKFLYLFQSIPLPLPATFFVTLKKMFTRFIWNNKRPRLRL FT SLLYLPYERGGLKLPNIKLYYWAAQLCSAMYYFIETDPPAWIDIEKNEITT FT PLQMYLYSSPVKVLKKYTNNPFLRNSICVWYEAHDFLGETIKLSSLSPIWG FT NAMFVPGRNDGGFKNWMYKGAKQVKDLYKDGTMMSFQQLMKEYDIPQKHFF FT KYLQIRSFIHSEIKTYVEPSLSTIEEHTVKHLRDKGNLSFFYNTLLEGSKE FT SSMSYLTAWRNDLQEDISKDEWMDSCLFAQTYSVNTRCRLLQYKWLMRTYI FT TPVKLHKFNPNIPDSCLKCKQDIGTLYHCMWECTEIQTFWKSILVMIGKLT FT EENVPCDPKLCLFHIYPVNFVVSASKRKLIDFSLLQAKRAIALKWKEMQGP FT SSILWIKEMTNNLAMEKLTYAVKGKLKDFYNIWTPFLCYCNQEDLTGMDD" FT CDS 136..978 FT /product="L1-10_DR1p" FT /translation="MPPKTIKLPQHSKPSGQMEEEGKVLASKKLTEPQSED FT NAATNDDILRAVQSFRDDCSKQFTDTMEAINGIKTDLLSQAQRIGAAEERI FT SQAEEDVTALQHKVNKLEETTEFLRNKVQDLEDRGRRSNLRLIGLPEKTEG FT SNMCTFIENFFPTILRDEFGSPPAIERAHRVGQVNPNRPSAPRAIVIKFLN FT YQDKEKALRAARKMKELRYEGQRISLFQDLSAETRQRQRQFDGVKAQLRGM FT EIRYGMLYPAHLIVTHVGQRHVFKTVAEAEDFVRSVRTNI" XX SQ Sequence 5708 BP; 1912 A; 972 C; 1100 G; 1724 T; 0 other; gacttccggt tatggcgtgc agctgataag acgcgtgaat cgggctcgct actgaatgtt 60 cgtttaaatt tccaaaaagg ctagaactcg tataataagc gtttggaaaa ccacttaaaa 120 gttcgacgga caaaaatgcc acccaaaacc atcaaacttc cacaacattc taagccatct 180 ggacagatgg aagaggaagg caaggtgtta gctagcaaga agctaaccga gccgcaaagt 240 gaagacaacg cggccacaaa cgacgatatc ctacgtgcag ttcaatcctt tagagacgat 300 tgttcaaaac aattcactga cacaatggaa gccataaacg gcataaaaac ggacttactg 360 tcacaagcgc aacgaattgg agcggctgag gagaggatct cacaggctga agaggacgta 420 actgcccttc aacacaaagt aaataaactc gaagaaacaa ctgaatttct acgaaacaag 480 gtgcaggatt tggaggacag aggaaggcgt tctaacctga gactaattgg cctgccagaa 540 aaaacagaag ggtccaatat gtgcaccttc attgagaatt ttttccctac gattttgagg 600 gatgagtttg gatcgccacc tgccattgaa cgagcgcacc gcgtaggaca agttaacccg 660 aaccgcccgt cagctccaag agcgatcgtg ataaagttcc tgaattacca ggacaaggag 720 aaggctttaa gagcggctag gaaaatgaag gagttaagat acgaagggca acgaatcagc 780 ttgtttcaag atttatcagc agagactcgt cagagacagc gacagttcga cggagtaaaa 840 gcacaacttc gaggcatgga aattcggtat gggatgcttt acccggcgca cctgatagtc 900 acgcacgtcg gccagcgtca tgtctttaaa accgtagcgg aggcagagga ttttgttcgg 960 tcagtacgaa ccaatattta aactttgcga cacaagtaag cgctgactca tcaggacttg 1020 aacactgtgt ctagttaata aggatacatt ttttcattcc ttttcaagtg gttaagatgt 1080 ttacatttac gtggaattaa acgtgattat tttctaccag aactggtaaa tatcggcaaa 1140 ttacattacc agttatgtag tttaaataga tttttccgtt ctctactatg ttcaccagta 1200 tgagtaaaca agttcatgta tagaggcata cgagtcagtc caatctcgac tgatgttcta 1260 tagagtggaa tttactctac ataattcaga gcccagctat gttcctcttt agcgagccca 1320 ggacccatag taatgagagt aggcttgcaa aaaattgctc catattacat ggatctaaca 1380 tgtttcataa ccaaggttca ttgtaagggg tattattatg ctttccagtt gggggattgc 1440 aggtgggggg tggcgggggt tgtttatgta tgcgtttgtg ttaacctttt tttttttttt 1500 tttttttttt ctctcactct ttttgtttta ctttctttat ggtacgacca ctcacagttg 1560 tatttacatg gttatataag taggtaaatg attaataatt cggagataaa aattggcagc 1620 tggaatgtaa gaggactaaa taatttggta aaacttaaac aagtactagg tcgattaaaa 1680 caaatgaaag caaacataat ctttttacaa gaaacccatt tattaaaaga tgatgttagt 1740 agagtcacta agagatggcc agggcaagtt ttcaatgcat cgttcacctc tcgagccagg 1800 ggagtcatga ttcttataca caaatccatt ccatttcagt taaaacagca atatatagat 1860 ccgttaggga gatacataat tcttaatggt acaattttga catcacatat taatttaata 1920 agcatttacg ctcctaatgt ggatgatcct tcgttttttc agaatttctt tttcactatc 1980 tcagcctatg ccggaaatta tgtaataggg ggagatttta attgtgtact aaaccccata 2040 aatgatcgct cttcaggagt agatacccat caacagtcta aaaaaactat cttgaagtat 2100 atgactgact taaatttagt tgagatatgg aggtatctca atccaaacaa gaaagaatat 2160 tcatgctttt cgaatacata taaaacattt tcaagaatag attatttttt aatttccaat 2220 ggtttagtga cgaaggtagg taaaagttgg tatgatagta ttttactttc cgaccatgcc 2280 cctgttttgt ttacaatcca atttgacaat cttgtactcc cacctcccag atttcgattt 2340 caagcaagtt ggcttcttga tcatgatttt gtgaaattct tggataataa aattgatcta 2400 tattttttag ttaatacaaa tcagacaagc gcctcggtta aatgggaggc cttcaaggca 2460 tatattagag gggagatatt aagttataca agatacagat ccaaaattta tcaggcacaa 2520 ttagaagacc tggggaaaca aattaaagta atagaacggc atctcttcag taattttgac 2580 ccttcaaagc agaaagaact tcttgtactg aaagccaaat ataatgaaat aacgagtagt 2640 aaaatatcta agaatttaat gtggctaaag cagtcctatt atgatcaggg ggaaaaatca 2700 ggaaaacttc tagcatggag aatcaagaaa atacaaacag atagagccat caattcgatt 2760 ttcttagaga atggggagaa aacggtagac ccattggaaa taaataatat ttttaaactg 2820 tactatgaaa atttgtataa atcggaaatt gccaataatt tagaggggca aaataatttt 2880 cttgataaaa ttacatttcc aaaattatca gaagagacga gaagtaattt agaaaagaat 2940 ttaagtatag aagagctctc agaggctcta caggacatgt gcaacggcaa ggctcctgga 3000 ccggatggcc tcccgatgga aatctacaaa acatttgcag gaaaaatttt accacactta 3060 cttgaaatgt tcaatgaatc tttgggaaaa ggaatattac ccccctccct gagatcagct 3120 ctaattactc ttatattaaa gccaggtaaa cctcccaatg aaaaatcctc ctataggccc 3180 atatcgttaa tgtcttgcga tactaagatt ctctgtaagg ctctatctaa aagaattgaa 3240 aagttcatac caaaccttat catgaatgat caaaatgggt ttgtgctggg aaggcaagcc 3300 tttcataaca tacgaagggt cttaaatata gtttttaaaa aacaaaaggc caaggaccat 3360 gcaattctgt cgttggacgc agaaaaagcc ttcgacagga ttgaatggag gtatcttttc 3420 gaagtgcttg agagatttgg gttgggtgat gggtatatta gatggatcaa attactttac 3480 aatgatccac aagcagaaat aataacaaat aatcaaatat caaagccatt taacttaagt 3540 aggggaaccc gtcaagggtg cccactatcc cctttgttgt tgttattcgc aatagaacct 3600 cttgcaatgg caattcgcag tagtcctgaa atcaaaggca taataatagg tgagagagaa 3660 caccgtttat ctctgtttgc agatgatatt gttgtttttc ttagtagtct ggaactttca 3720 attcaagctc ttaacgcttt attaaaagta ttcggagagt tttcaggtta taaagttaac 3780 aataataaaa gtgccttgct attactaaac aaggatgaac agagaaacta tacaaacaat 3840 aggcaatttt tcaattcaca agaagagctt acgtatttag gaattaagat agtcacaaat 3900 ataaaagaga ttatttcaat taattatgac ccattgttaa aaaaagtcat ggactcgctc 3960 gagagatgga atgcaatgcc aatctcaatg attggacgta taaatattat aaagatgtca 4020 attctaccta aattcttgta tctctttcag tcaattcctc ttccacttcc agctacattc 4080 tttgtaacac tcaagaaaat gttcacaagg tttatatgga acaacaagcg tcctcggtta 4140 cgtttgtccc tactttattt accatatgag cggggagggc ttaaactacc taatataaaa 4200 ctctattatt gggcagctca attatgttcc gctatgtatt acttcataga aacagaccct 4260 ccagcatgga tagatataga aaaaaatgaa ataacaactc cattacagat gtatctttat 4320 tcttctccag ttaaagtact aaaaaaatat acaaataatc ctttccttag aaactctatt 4380 tgtgtatggt atgaagctca tgacttttta ggtgaaacaa ttaaactctc aagtctgtcg 4440 ccaatttggg gtaatgcaat gtttgtgcct gggagaaatg atgggggatt taaaaattgg 4500 atgtataagg gtgcaaaaca agtaaaagat ttatataagg atggtacaat gatgtcgttc 4560 caacaactga tgaaagaata tgatattcct cagaagcact tcttcaaata tctacagata 4620 agaagtttta tacactctga aattaaaact tatgtagagc cctctttgtc aactatagaa 4680 gaacatactg tgaaacattt aagggataaa ggcaatctgt ctttttttta taatacactt 4740 ttagagggtt caaaagaaag ttcaatgtcc tatttaaccg catggagaaa tgaccttcag 4800 gaagatattt ctaaagatga atggatggat tcttgtttgt ttgcgcaaac atatagtgta 4860 aatactagat gcagattgct tcaatataaa tggctcatgc ggacatacat taccccggtt 4920 aaattacata agttcaaccc aaatattcct gacagctgtt taaaatgtaa acaagatatt 4980 ggaacccttt atcattgtat gtgggagtgt actgagattc aaacattttg gaagagtatt 5040 ctggttatga ttggtaaact gactgaagaa aatgtacctt gtgatcccaa actctgtcta 5100 tttcatatat atccagtaaa ttttgtggtg agcgccagta aacgcaaact aattgacttt 5160 agtttgcttc aggccaaacg ggctattgct ttaaaatgga aggagatgca aggtccttcc 5220 tcaatactct ggattaaaga aatgactaac aacctcgcca tggaaaaact gacttatgca 5280 gttaaaggga aacttaaaga cttttacaat atttggactc catttttatg ctattgtaac 5340 caagaggact tgacagggat ggacgactaa tactcaacat tcacttcttc ttcgtattgt 5400 gtatctccat tacataatgt gatctcctaa aattactaaa tacgcagtga aggtaaaagg 5460 ttaaactgta aattggatgg ctgtgaatgc tggtgtacgg aaactagcga gagttttttt 5520 tttttttttt atttgttttt ttattcattt attattattt ttttaattat tatttattta 5580 ttattattat tttggggggg ggggggttaa agttctgcaa ggttctctta tgtgtgtttt 5640 ttctctttta tttaaatgta ttatgtgttc aaaacaaatg caaataaata tattttgaaa 5700 aaaaaaaa 5708 // ID Gypsy76-LTR_DR repbase; DNA; ZEB; 460 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy76-I_DR; KW Gypsy76-LTR_DR; Gypsy76_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-460 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1468-1468 (2008). XX DR Genome; chr13; Positions 37932382 37932841. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 460 BP; 81 A; 99 C; 89 G; 191 T; 0 other; tgtcatattc tgagctatta tttgttttct ccacaagatg tcgccatttt ccccagtgtt 60 tctgtttgtg gttcatttgt gttaatttgg ttcagctgtg tcttgttaag actagcctat 120 atattctacc ctcttgtctt cacttgtcac ttggtttttg agtttgatgt gctaccgctg 180 ttgccgtaag tcctcttgtt attacctttt gtattttttt ggctatagta ccagttgctc 240 tacggagaac tttatgtttg ttaactgctt agcttttgga gttcctatgt ttttgcctgg 300 cggcttttta ttgatgattt tggatttcct taataaacac ctttgtttga actagacttt 360 gtgccttagc aattgggttc accaactctc cctgtggctc atcgttaaac cactggtcta 420 ctacaccagc gacccgggtt cgagccttgg gctcatcaca 460 // ID Gypsy159-LTR_DR repbase; DNA; ZEB; 458 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy159-I_DR; KW Gypsy159-LTR_DR; Gypsy159_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-458 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1580-1580 (2008). XX DR Genome; chr3; Positions 34990544 34991001. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 458 BP; 94 A; 129 C; 87 G; 148 T; 0 other; tgttgtgaaa tccactcatg aacgttcctc gtgtgaccac cagagggagc catcgcctga 60 gttctaatct ccccaaggct cgactgggac tgattacata cagctgaaac ccatttcaga 120 ctctctatat aaacgcacac atccttcact cggtgtgaag ttttgattag ccccggccat 180 catctctgag cgttaccttg ttgtttattg cctatctgtg tatcgacctg gactgtctta 240 ccgtttctga agctatctgc ctgccttgac cactgcatag ttatattgat tccttctgcc 300 agccgcctgc ctagaatctc tgcctgtctc tcgttaacga tcactctctc ctagccccga 360 cccacgcctg ctcttcgact attctactgg aaagttcaat ttctatctga tgtttgtgag 420 tttccggctg tgatatctta ctgttattaa agtgtgca 458 // ID DIRS-16_DR repbase; DNA; ZEB; 6695 BP. XX AC . XX DT 07-JUN-2010 (Rel. 15.11, Created) DT 07-JUN-2010 (Rel. 15.11, Last updated, Version 2) XX DE A family of DIRS-type LTR retrotransposons - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-16_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6695 RA Kojima K.K. and Jurka J.; RT "DIRS-type LTR retrotransposons from zebrafish."; RL Repbase Reports 10(11), 1844-1844 (2010). XX DR [1] (Consensus) XX CC This consensus is generated from 7 sequences with >91% identity. CC The consensus is similar to DIRS-N3_DR but encodes proteins. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 176..1111 FT /product="DIRS-16_DR_1p" FT /translation="MSEINLFAEDDAFLAEDPLRSPAGTQSTPAPQTTSAQ FT PAAPPRGRRSARTTAARSRRGSPSPSPRRQSSPASSFASARSSVPATGINT FT ASDLRQALASAGVTVPRRTTKAELLAMYNSLQSGAPLPSTTPPSKATKKTS FT QVRHNPYQRPDPSPSSRPGRSSRPSASLGRAPGPDDAGTQSPEVGINAGSR FT HAPSSRQHSVSLPPLAEPEVHPWPAAPPSNSSAGSLQEAQAPTHNPPAHPF FT QVPTSSASVRPPPLPAQTPASLTPPFLPQVLPGPDASSSVRPPPLMSLATV FT APPPHSSLFLPPAILEPAVA" FT CDS 1117..3735 FT /product="DIRS-16_DR_2p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MPPQPALASYFPPGTDASSSVRPPPLMSLSSAAPPLS FT SVLPTHDPNASCSASMPPQTAPAPNLPQSLSHTKSFSLSTATPLPVPPNAL FT ALDPPPVSNTIRNQILSGLDVDLFSLLSPIPPTSADRQIDCGQFSVTLKNT FT NNIQSRILTFPEFTVAFSRYTEVICSVYPHRRRELNDYLAIVAELALSYGG FT SHFYTYHRLFSAKCAIRISQWNQSPFWGALDTELHNKVFSGCRNISCAVCR FT STTHPTESCPFINPIPSPLPEPAKPRSTSYVPSNPNIPSLLSGHRPSVTAS FT ASPCTEFNSGRCWRQKCRFLHVCNFCGGAHARSVCPVKRSVNKNAKNYLST FT PVNVSRLRSELLQHPDPKFTEYLLTGLSSGFHPGVISLPSRSLICPNLQSA FT LAEPDVVDLLIKKEVDENFMIGPFSAPPFSVYRVSPIGVATRKFSDKKRLI FT IDLSSPHNSPFPSINSLIPPDEFAMNYHDIDQAISLIKIAGRGAWMAKVDI FT TSAFKVMPIHPDFWHLFGILWRNNYYFSVRLTFGCRSSPKIFDTLSEAICW FT ILSNNYXIPHLIHLLDDFLIISPPDAIPAAHLLTVQKVFAELGIPIAEEKN FT AGPSTSIEFLGIKLDSNKFQASLPKEKIDRTILVASSLLDSPTCSKRELLS FT ILGHLNFAMRIIPQGRSFISHLLALASSAHSLEDQISIDESSLNELRLWIL FT FLKQWNGLSFFYSDLISSPADIKLFTDAAPSIGFGGYYQGRWFASTWPPQL FT LDLPQSLKSSALFELYPLVVAASLWGKEWSSSSIIIHCDNEATVHCINKGR FT SNSIALIPLLRRLTWISACEQFILTAKHIPGSKNQIADSLSRFAFQKFRNL FT APEADQFPVQVPPYSELIFP" FT CDS join(3636..4268,4115..4714) FT /product="DIRS-16_DR_3p" FT /note="tyrosine recombinase." FT /translation="LSLSLCFPEIQKFGTRGGSVSSPSPSLFGADIPIDHP FT LKNLLGASLEYILQAVAPRTLQSYLTAWKSFKTFHACYNILFPDFSLLTIS FT SYITYLTITKKLQINSIKGYLSGIQFFHKLIXGSPAPHLANSQTALLLKGI FT QRTHPTRQDTRQPITLKTLTKCISTLRKGYESIHTARTLDAMFILAFFGFL FT RCSELTITLGSTQTFIPPYPIMHIHSSQRIRINPHSTYTRCXVYFSVLRIF FT EVFRTHYYSGFNPNIHPTVSDLVTLDNETISFFIKQSKTDQAKKGHYIYIF FT NLQSPICPYQTLLAFSQXRKSQASSLADPLFTDDSNRPVTRFWFQKHLKCV FT LIKSGFPADNFSSHSFRIGAATTAAQNGLSEQQIQTLGRWSSQAFKCYIRS FT DRSHIRRAQQTLIKKPL" XX SQ Sequence 6695 BP; 1727 A; 2150 C; 1231 G; 1572 T; 15 other; agtgatgttt tgtaaacaaa tttcgggagg agcatgcgca gaagtttcag tatcgaacac 60 gtcattgaca atcawcccat taccaaatca gtkcttgaac aaacccctat atataccaaa 120 gctgtcttac ctgcagcatc tcacgacttt agcatccaac cttcgacgct gaawcatgtc 180 agagattaat ctcttcgcag aggacgacgc cttccttgcc gaggatcctc tgagatctcc 240 cgccggcacc cagagcaccc cagccccgca gactacatca gcccagcccg cggctccacc 300 ccgaggccgc cgttcagccc gaaccacggc cgctcgatca cgtaggggct caccttctcc 360 atctcccaga aggcagtctt ctccagcctc atccttcgcc tcagccagat cctccgtccc 420 cgccacgggg ataaacacgg cctccgacct ccgccaagca ttggccagcg caggcgtcac 480 cgttccgcgc cgaaccacca aagcggaact cctggcgatg tataactccc tgcaatccgg 540 agcaccactt ccatccacca ctccaccttc caaagccacg aagaaaacca gccaggttcg 600 acacaatcca taccagcgac cagatccaag cccatcctcc agacccggac gcagcagcag 660 gccttcagcg agtctggggc gtgccccggg tcccgatgac gcaggaaccc aatccccgga 720 agtcgggatc aacgccggct ctagacacgc tccctcgagc cgccagcata gcgtgagctt 780 gcctccgcta gcggaaccag aagtccatcc ttggcccgct gctcccccgt ccaactcaag 840 cgcagggtcg ctacaggaag cacaagcccc aactcataac ccgcccgccc accccttcca 900 agttcccacg agcagcgcga gcgtgaggcc gcctccgctc ccagctcaaa ccccagcttc 960 tctaacccct ccttttcttc cccaagttct tccaggcccc gacgcaagtt ccagcgtgag 1020 gccgcctccg ctgatgtcac tggcgacggt agcccctccc ccccactctt ccctttttct 1080 tccacctgcg atcctagagc cagcagtagc gtgagcatgc ctccgcaacc ggctctagct 1140 tcctacttcc ctccaggcac cgacgcaagt tccagcgtga ggccgcctcc gctgatgtca 1200 ctgtcgtcgg ctgcccctcc tctctcctct gtgcttccca ctcatgatcc caacgccagc 1260 tgtagcgcga gcatgcctcc gcaaacggct ccagcaccta acctcccaca gtcactttct 1320 cacactaaat ctttttcttt gtccacagcc actcctctac ctgttccacc caacgccctc 1380 gccctggatc ctccacctgt ctccaacacc atcaggaacc aaatcctgtc aggtttggat 1440 gtcgatcttt tttccctcct ctcacccatt ccacccacat cggcagaccg ccagatagat 1500 tgcggccaat tctcagtaac ccttaaaaac acaaataaca ttcagtcccg tatccttaca 1560 ttccccgaat tcaccgtagc cttttcacgc tacacggaag taatttgttc cgtttacccc 1620 cacaggaggc gcgagctcaa tgactatctc gcgatagtcg ctgagctcgc gctttcatat 1680 ggaggctccc atttttatac ttaccaccgc cttttttccg caaaatgcgc cattcgcatc 1740 tcccaatgga accagtcccc attctgggga gctttagaca cagaacttca caacaaagta 1800 ttttcaggtt gccgcaatat atcttgcgcg gtctgccgat ccaccaccca ccccactgag 1860 tcctgcccat tcataaatcc catcccttct cccctccccg aaccagccaa gcccagatcc 1920 accagttacg tcccttcaaa cccaaatatc ccgtccctcc tttctggtca ccgcccttct 1980 gtcacagctt ctgccagccc ctgcactgag ttcaacagcg gcaggtgctg gagacagaaa 2040 tgtcgtttct tgcatgtctg caatttctgt ggcggcgctc atgcccgctc ggtatgtcca 2100 gtcaaaagat ctgtgaataa aaatgcaaaa aattatctat cgactcctgt gaatgtttct 2160 cgactgagat cggaattgtt acaacatccc gatcctaaat tcactgaata tctattaacg 2220 ggtctgtctt caggttttca tccaggcgtm attagcctcc cctctcgtag tcttatttgt 2280 ccaaatcttc agtccgctct cgccgaaccc gacgtcgtcg atcttttaat taaaaaagaa 2340 gtagacgaaa atttcatgat tggccctttc tccgcacctc cgtttagcgt ttaccgcgtc 2400 agtcctatcg gggtagccac taggaaattc tccgataaaa aacgcctaat aatcgacctt 2460 tcctccccac ataactctcc gtttcctagt atcaacagtc tcatccctcc cgatgagttc 2520 gctatgaatt atcatgacat cgaccaagct atttcactaa tcaaaatagc gggtcgcggc 2580 gcttggatgg ctaaagtcga cataacatcc gcctttaaag tcatgcccat ccatcctgat 2640 ttctggcacc ttttcggcat cctctggcgg aataattatt atttttccgt ccgcctaacc 2700 ttcggttgca gaagtagccc aaaaattttc gacacgctgt cggaagcaat atgctggatc 2760 ctttccaata actacgmgat cccgcactta atccatctat tagacgattt tctaatcatc 2820 tcaccacccg acgctatccc agccgcgcac ctcctgacag ttcaaaaggt tttcgccgaa 2880 ctcgggattc ccatagcaga agaaaaaaac gccggtccaa gcacttcaat cgaattcttg 2940 ggcattaaat tagactccaa caaattccaa gcgtccctac caaaagagaa aatcgatcga 3000 acgatcctgg tagcttcctc cctattagac agccccacct gttctaagcg cgagctatta 3060 tcaatcctcg gacacttgaa tttcgcaatg cgcataattc cccaaggccg atcgttcatt 3120 tctcacctct tagctctcgc gtcttcagct cacagtctag aggatcaaat atccatcgat 3180 gaaagtagcc taaatgaact ccgtttatgg atattattcc tcaaacaatg gaatggtctc 3240 tccttctttt acagcgacct aatatcctcc ccagcagaca tcaaattatt caccgatgct 3300 gccccgtcaa tcggattcgg aggttattac caaggacgct ggttcgcgtc cacctggccg 3360 ccccagctgt tagacctgcc gcaatcactt aaatcatcgg cgctattcga actctatcca 3420 ctagtcgtcg ctgcatccct atgggggaaa gaatggtcct cttccagcat tataatccac 3480 tgtgacaatg aagcaaccgt tcactgtata aataaaggcc gttcaaattc tatagcatta 3540 attcctctac ttagacgact cacttggata tcagcatgtg aacaatttat tttaactgca 3600 aagcacattc ctggatccaa aaatcaaatt gctgactctc tctctcgctt tgctttccag 3660 aaattcagaa atttggcacc agaggcggat cagtttccag tccaagtccc tccttattcg 3720 gagctgatat tcccatagat catcctttaa aaaacctgct tggagcctcc ctcgaatata 3780 tccttcaggc ggtggcgcct agaaccctgc agtcatattt aacagcatgg aaaagtttta 3840 aaacattcca cgcatgttat aacatactgt ttcctgattt ttccctactc accatctcat 3900 cttacataac ctacctgacc attaccaaaa aactccagat caattccatc aaaggctatt 3960 tgagtggaat tcaatttttc cacaaactaa tttwtggttc ccccgcacct cacttagcta 4020 attctcaaac tgctctttta ttgaaaggca tacaaagaac ccaccccaca cgtcaagaca 4080 cacgacaacc cataacactg aaaacactca ctaaatgcat atccactctt cgcaaaggat 4140 acgaatcaat ccacacagca cgtacactcg atgckatgtt tattttagcg ttcttcggat 4200 ttttgaggtg ttcagaactc actattactc tgggttcaac ccaaacattc atcccaccgt 4260 atccgattta gtaacattag acaacgaaac catttccttt tttattaagc aaagcaaaac 4320 cgatcaagct aagaaaggcc attacatcta catcttcaac cttcaatcac ctatttgtcc 4380 ataccaaaca ctgctagcat tttcacaaat scggaaatct caagcctctt ccctagcaga 4440 ccctctattc acagacgata gcaatcgccc agtcacccgt ttttggtttc aaaagcatct 4500 taaatgcgtc cttattaagt ccggcttccc agccgacaac ttctccagcc actctttcag 4560 aataggcgca gctaccaccg cagcacaaaa tggtctatca gagcagcaga tccaaacact 4620 cggacgctgg tcctctcagg cctttaagtg ctacattcga tctgaccgct ctcacattag 4680 aagagcccaa caaaccctca tcaaaaaacc cctctaagat aaattcgaca cccactagag 4740 cmcatcacaa cacatccgct acacttcgtc atcgacaatt cacaaggttc tttctcmaaa 4800 caccctkcgc atactatctc gataatgctc gcagcgtcgc tccaaccgga gtcgcacaac 4860 ttattwcgca cacctcatca ttaatatcaa gcatcacagc tagcgcaacg accgccccta 4920 agggagatgc acactttatt acagaacacc gcaagcgcaa ccaacaatta tacaggagac 4980 ccacacatta tcacgatgca ccaccgctaa wgcagtggcc gctcctacgg gagacgtaca 5040 cctcaccatc acgcactatt tcaagagcag tkaccactca tacaggagac acacactttt 5100 acgcaccacc gctagtgcag tgaccgctcc tacgggagac gtacacctca caatwgtgca 5160 ccaccgcaag cgcagtgaca gctcctacgg gagacgtaca ccctatgatt atgcatcacc 5220 gcaagcgcag tgactgctcc tacgggagac gtacacctca caattatgca ccactgcaag 5280 cgcagtgacc gctcctacgg gagacgtaca ccttacaatt atgcaccact gcaagtgcag 5340 tgactgctcc tacgggagac gtacacctta caattacgca ccactgcaag cgcagtgact 5400 gctcctatgg gagacgtaca ccttacaatc atgcaccact gcaagcgcag tgactgctcc 5460 cacgggagac gtacacctca caattatgca ccactgcaag cgcagtgact gctcctacgg 5520 gagacgtaca ccttacaatt atgcaccact gcaagcgcag tgactgctcc tacgggagac 5580 gtacacctta caatcatgca ccactgcaag tgcagtgact gctcctacgg gagacgtaca 5640 cattacaatc acgcaccact gcaagcgcag tgactgctcc catgggagac gtacacctca 5700 caattatgca ccactgcaag tgcagtgact gctcctacgg gagacgtaca ccttacaatc 5760 atgcaccact gcaagtgcag tgactgctcc tacgggagac gtacacctta caatcatgca 5820 ccactgcaag cgcagtgact gctcccatgg gagacgtaca cctcacaatt atgcaccact 5880 gcaagcgcag tgaccgcccc tacgggagac gtacacctta tcattatgca ccactgcaag 5940 cgcagtgact gctcctacgg gagacgtaca ccttacaatt atgcaccact gcaagtgcag 6000 tgaccgctcc cacgggagac gtacacttta caactatgca ccactgcaag cgcagtgact 6060 gctcctacgg gagacgtaca ccttgtaatt ctacaaccgc ctcctcccga aatacatacc 6120 ttacaactca acagcacccc tccaaatttt tagatcagca attttggggg gcttgcacac 6180 ttgtatggct gctgtcctgt aaccaaatag cattttttgg ggagcgttct ggggccgggc 6240 tagacacttc gctcgaatac cgattcctct ttgtttcctg ataaggggag taactcgagt 6300 tggggtgtct cctcgagctc agagccctcc ccccggacag cacgccaaat atgctttata 6360 cttaaacgaa tgcaagtgtg aactcgtgaa gtgatgtttt gtaaacaaat ttcgggagga 6420 gcatgcgcag aagtttcagt atcgaacacg tcattgacaa tcaacccatt accaaatcag 6480 tgcttgaaca aacccctata tataccaaag ctgtcttacc tgcagcatct cacgacttta 6540 gcatccctcc accaccccgt cacctcacct cttgcatcga attcccggag ggagcgttct 6600 ggggccgggc tagacacttc gctcgaatac caattcctct ttgttccctg ataaggggag 6660 taactcgagt tggggtgtct cctcgagctc agagc 6695 // ID HATN6_DR repbase; DNA; ZEB; 1011 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN6_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000001; Dr000081; HATN6_DR; nonautonomous DNA transposon; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 4-746 RA Bao Z.; RT "Dr000001, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 364-1011 RA Bao Z.; RT "Dr000081."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [3] RP 1-1011 RA Kapitonov V.V. and Jurka J.; RT "HATN6_DR, a family of ancient nonautonomous hAT-like DNA RT transposons from zebrafish."; RL Repbase Reports 2(6), 14-14 (2002). XX DR [3] (Consensus) XX CC Two portions of HATN6_DR were identified as unclassified CC repeats Dr000001 [1] and Dr000081 [2]. CC About 1000 copies of HATN6_DR are expected to populate the CC zebrafish genome. They are ~85% divergent from the consensus CC sequence. CC This element is characterized by 19-bp terminal inverted CC repeats and the 8-bp target site duplications. XX SQ Sequence 1011 BP; 337 A; 168 C; 178 G; 328 T; 0 other; taggggtgta atggatcacg gttgatcagt gattcatacg gatcacaacc cacggttcga 60 aacacacgtg acccgcggat taatactttt tttttttttt cttgtagatt aatcctaaat 120 ttgtaacaat cgcagagaga tcgcctctcg cgtcattcaa atcacatgta tgaaagcatt 180 taggctttcc tgtaaaatat aatgatgacg gaagaagttg tggtaggtta ttcgggatgt 240 ggtgggtttc ttaggttatt aaaggtgttt tctgtcacta cttgtttagc ctgcattaat 300 gcattcagtg taagctgcac aattaatcat taaaagatcg ggatctcaac acccacgcaa 360 cataaatgat aaatgattga tgtttattaa tccttcgaat cgctgcattc aaatctgaaa 420 atacaaaaat caagaaagag attcagtctt tagtcttttg agttagttat gaagttacaa 480 acagtcaaat aacacagaag tactgggata acagtttata gtttagatat aaccactgat 540 gtttatggta aagattaaaa tcaccgtcat atgcatttaa cgacgctcaa aaatgaaagt 600 tgtaggcagc aattacatta tttaaccaat aggtggcgac aaccagccat caaaaatatg 660 ccactgaata attcttcaaa aatgacacat ctagcaatga aacatgaagt tttatgagtg 720 aaaacagcaa cagtagtagt aactgaattt ttcacagtac tgaatatttt acaatttatt 780 tttattcagc tgattatttc ttaattttat tagaattttt aataaaatta caggttctaa 840 gttctgtttg ttaaaaccaa aagtgtcttg cagtactgtt tttgtaataa atggaaagca 900 aattacattt gtctcctccc ctttttggct gatccgaaaa atggtccgat ccgtgactaa 960 aaaaccataa tgtgatccga accgtgagat ttgtgatccg ttacaccact a 1011 // ID CR1-29_DR repbase; DNA; ZEB; 2528 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-29_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2528 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2126-2126 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 2..2296 FT /product="CR1-29_DR_1p" FT /translation="VVYIPPQADAGWALSRLHDVLSGYINKHPDAALIIAG FT DFNKANLRQVMPNFHQHVSCPTRGLNTLDHCYTQFKNAYKARSLPAFGRSD FT HAAIFLXPEYKQRLVQDSPVQRVVTRWSSHSETMLQAALDDVDWDMFRVSS FT LDVSEFTDVALSFINTLIEQVTETITIRSFPNQKPWVDKTIRAAVNTRTAA FT YNEALSSGNMSKYKASCYALRRAVRAAKHKYKERIESHFQLNDSRRMWQGL FT RTICTFGNKSSAEVRADPLLADELNIFYGRFDCGGVILPTSASGNVRQGCD FT DPVITVSEDEVRRALKRVNVRKAAGPDGIPGRVLRSCAGQLAGLFTSIFNE FT SLATSVVPTSFKKSIIIPVPKTNKPSCLNDYRPVALTSLVMKVFERLVKSY FT VCSSLPVTLDPFQFAFRLNRSTDDAISHVLHSSLTHLDSKXGNYVRLLFID FT YSSAFNTIVPIKLASKLIDLGLNSSLCNWIQDFLTGRSQVVRVGQFTSNSI FT TLNIGAPQGCVLSPLLYSLYTHDCVSSHSSTSIIKFADDTVVLGLISNNDE FT TAYLDEVQRLTSWCQDNFLFLNVNKTKELIMDFRRRQQQPYTPLMISGTPV FT ERVSSFKYLXVIISENLTWTTHIERQVNRARQRLYHLRQLRKFRVSPAILK FT TFYSGTVESVLTQGISVWYGNSSNQDRKALQRVVRLAERISGSAFPSLQDI FT YIKRCKNRAVKICKDLNHPDNYLFTLLPSRKRFRSLMAKTERLRRSFFPQA FT IRLLNSNPVLXIT*" XX SQ Sequence 2528 BP; 650 A; 593 C; 541 G; 737 T; 7 other; tgttgtatac attccaccac aggcagatgc tggctgggct ttgtctagac ttcacgatgt 60 gctcagcggc tatattaaca aacaccctga cgctgccctt atcatcgcgg gagactttaa 120 caaagctaat ctcaggcagg tcatgccgaa tttccatcaa catgtatcct gtccaactag 180 aggacttaac acactggatc actgctacac tcagttcaag aatgcctata aagcccgttc 240 attaccggct tttggcagat cggatcatgc cgctattttc ctarcaccgg aatataaaca 300 aaggctcgtc caggattctc cagtgcagag agtagtgacg cgctggtcct cccattcaga 360 gactatgttg caggcggcgc ttgatgacgt cgactgggac atgttccggg tgagttctct 420 tgatgtcagt gagtttacgg atgtagctct gagttttata aacacgctaa ttgagcaagt 480 tacagaaaca ataacgataa ggtcattccc caatcaaaaa ccgtgggtgg acaaaacaat 540 ccgcgctgcg gttaacacac gcactgccgc ttacaatgag gctctttcgt cggggaatat 600 gagcaagtac aaagcatcgt gctatgctct tcgacgcgca gtaagagccg ctaaacataa 660 atataaagag agaattgagt ctcattttca gctcaatgac tcccgacgga tgtggcaggg 720 actgaggact atctgtacct tcgggaacaa atcctctgca gaggtgaggg cagacccgct 780 gctggccgat gagttaaaca ttttctatgg tcgttttgat tgcggcggcg tgattttgcc 840 gaccagtgcg tcaggaaacg ttaggcaggg ttgcgatgat cctgtaatca ccgtgtcgga 900 agatgaggtt cggagggcac tgaaaagagt gaatgtcagg aaggcagctg gacctgatgg 960 gatacctggc cgtgtcctga ggtcctgcgc cggtcagctt gctggtttgt ttacttccat 1020 ctttaacgag tcccttgcta cctccgtggt ccccacctca ttcaaaaaat caataattat 1080 ccctgtgcct aagactaata aaccctcttg cctgaacgac tatcgcccag ttgctctcac 1140 atctttagtt atgaaggttt ttgagagact tgtgaagagc tatgtctgct cctccctacc 1200 tgttactctg gacccctttc agtttgcttt tcgtctaaat agatccacgg acgatgccat 1260 ttctcatgtt cttcactctt ctctcacaca cttggacagc aaaartggga attatgtaag 1320 gctgcttttt attgattata gctctgcttt taacactatt gttcccatta agctggcatc 1380 caaacttatt gaccttggtc taaattcctc actttgtaac tggattcarg actttctcac 1440 cggtagatct caagtggtga gagttggcca gttcacatca aattccatca ctctgaacat 1500 aggagcccca cagggctgtg tcctgagtcc cctactttac tctctctaca cgcatgactg 1560 cgtttcctcc catagctcca catccatcat caaatttgcc gatgatactg tggttttggg 1620 cctcatctcc aataacgatg agaccgcgta cttggatgag gttcagaggc tyacatcatg 1680 gtgccaggac aatttccttt ttctgaatgt gaacaaaact aaggagttga ttatggactt 1740 taggaggaga caacagcagc cctacactcc tcttatgatc agtgggaccc ccgtggagag 1800 ggtgagtagt ttcaagtatc tckgtgtaat tatctctgag aacctgactt ggactacaca 1860 cattgaaagg caagtaaata gagccaggca aagactgtac catctgcgac agctgaggaa 1920 attcagggtc tctccagcaa tcctgaaaac tttctattca gggaccgtag aaagtgttct 1980 gactcaaggt atctcagtgt ggtatggaaa cagctcaaac caagaccgca aagcacttca 2040 aagagttgtg cgtttagctg agcgcatctc agggtctgct ttcccctcct tgcaggacat 2100 ctatattaaa cgctgtaaga atagagctgt taaaatctgt aaggacttaa accatcctga 2160 caattatcta ttcactttgc ttccatctag aaaacgattt cgtagcttga tggcaaaaac 2220 tgagagactt aggaggagtt tctttcccca ggccatcagg ctactaaatt ctaacccggt 2280 gttayacatt acgtgatttt ttcgtattta tcattgtagt agtcatattc ataagtacat 2340 catattgctg ctattcttat taccttgtat aattttttgc acgctttttt ttttcttctt 2400 tttccttaat attwaatttt ttgtttatct atacttgcac tatctctgga gcggacctga 2460 cctacatttc actgctggtc atactttgta taactatgta tgtgacaaat aaaacttgac 2520 ttgacttg 2528 // ID R2Dr repbase; DNA; ZEB; 3582 BP. XX AC NM_001030097; XX DT 29-MAY-2009 (Rel. 14.06, Created) DT 29-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE R2 non-LTR retrotransposon in the Danio rerio genome - a single DE copy. XX KW R2; Non-LTR Retrotransposon; Transposable Element; R2Dr. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3582 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Direct Submission to Repbase Update (29-MAY-2009). XX DR EMBL/GenBank/DDBJ; NM_001030097; Positions 1 3582. XX FH Key Location/Qualifiers FT CDS 1..3579 FT /product="R2Dr_1p" FT /translation="MESTAKGKSYWMARRPVEGATEGSLGRVPFVTRDPKR FT KPEAKRTLTHGLGLRECSVVLTRLIEGRRGRDHTPSGWNAQRGMPNDESSV FT EEPNGPIPSNPIPTGTQALPEPMADGEQGEHPGVVVTLPLRDLNCPLCGGS FT ASTAVKVQRHLAFRHGTVPVRFSCESCGKTSPGCHSVLCHIPKCRGPTGEP FT PEKVVKCEGCSRTFGTRRACSIHEMHVHSEIRNRKRIAQDRQEKGTSTDGE FT GRAGVERADAGEGPSGEGIPPKRPRRARTPREPSEPPANPPILSPQPDLPP FT GGLRDLLREVASGWVRAARDGGTVIDSVLAAWLDGNDRLPELVDAATQRTL FT QGLPAGRLARRPATFVAPNRRRGRWGRRLKLLAKRRAYHDCQIRFRKDPAR FT LAANILDGKSETSCPINEQAIHEHFRNKWANPSPFGGLGRFGTENRANNAH FT LLGPISKSEVQTSLRNASNASTPGPDGVGKRDISNWDPECETLTQLFNMWW FT FTGVIPSRLKKSRTVLLPKSSDPGAEMEIGNWRPITIGSMVLRLFTRVINT FT RLTEACPLHPRQRGFRRSPGCSENLEVLECLLRHSKEKRSQLAVVFVDFAQ FT AFDTVSHEHMLSVLEQMNVDPHMVNLIREIYTNSCTSVELGRKEGPDIPVR FT VGVKQGDPLSPLLFNLALDPLIQSLERTGKGCEAEGHKVTALAFADDLALV FT AGSWEGMAHNLALVDEFCLTTGLTVQPKKCHSFMVRPCRGAFTVNDCPPWV FT LGGKALQLTNIENSIKYLGVKVNPWAGIEKPDLTVALDRWCKRIGKSLLKP FT SQKVYILNQFAIPRLFYLADHGGAGDVMLQNLDGTIRKAVKKWLHLPPSTC FT NGLLYARNCNGGLGICKLTRHIPSMQARRMFRLANSSDPLMKAMMRGSRVE FT QKFKKAWMRAGGEESALPRVFGANQYQEGEEVANDLVPRCPMPSDWRLEEF FT QHWMGLPIQGVGIAGFFRNRVANGWLRKPAGFKERHYIAALQLRACVYPTL FT EFQQRGRSKAGAACRRCSSRLESSSHILGKCPAVQGARIRRHNKICDLLKA FT EAETRGWEVRREWAFRTPAGELRRLDLVLILGDEALVIDVTVRYEFAPDTL FT QNAGKDKVSYYGPHKEAIARELGVRRVDIHGFPLGARGLWLASNSKVLELM FT GLSRERVKVFSRLLSRRVLLYSIDIMRTFYATLQ" XX SQ Sequence 3582 BP; 895 A; 961 C; 1069 G; 656 T; 1 other; atggaaagta cagcaaaagg aaagtcatac tggatggccc gtcgcccagt agaaggtgcc 60 acggagggat ctttgggtcg ggtccctttc gtaacgcgag atcctaagcg caaaccagag 120 gctaaacgaa cacttacgca tggcttagga ctacgagaat gctcggttgt cttgacacgc 180 ctcatcgagg ggcgtcgagg tcgcgatcac acaccatcag gatggaacgc acagcgcggc 240 atgccaaacg acgaaagctc ggtcgaggag cccaatgggc cgataccatc taaccccata 300 ccaacgggca cccaagccct gcctgaacct atggcggacg gggagcaggg ggagcacccg 360 ggagtggtgg tgaccctgcc gctcagggac ttaaactgcc ccctatgtgg cgggtcggcg 420 agcaccgcgg tgaaagtgca aagacacttg gcatttcgcc acggaacagt gccggttaga 480 ttcagctgtg aatcatgtgg aaaaacttct ccgggttgcc attccgtcct ctgtcacatt 540 ccgaaatgtc gcggaccgac aggcgagccg cctgagaaag tggttaagtg cgagggatgc 600 agtaggacgt ttggcacaag gagagcgtgt agtatacatg agatgcacgt tcactcagaa 660 atccgcaata ggaaaagaat tgctcaagac aggcaagaaa aagggacctc gacagatgga 720 gaggggagag ctggagtcga aagggctgac gctggggaag gtccctctgg ggaagggatc 780 ccccctaaac gtcccagacg tgcgagaacg cccagagaac cgtctgagcc ccccgcgaat 840 ccgccgattc tctcgccaca acccgatctg cccccaggag gcctccggga cctactccgg 900 gaggtggcca gtgggtgggt aagggcagcg agagacggag gtacggtgat tgacagcgtg 960 ctcgcagcat ggttggatgg caacgatcgg ctccctgagc tggttgacgc ggcgacgcaa 1020 aggacactgc agggcttacc tgcagggagg ttggcccgaa gacccgcaac ttttgttgcg 1080 cctaaccgga ggagaggcag gtgggggcgc cggctcaaac tgctcgctaa gcgccgcgcc 1140 taccacgatt gccaaattcg gttccgaaaa gacccagccc gcctagccgc gaacatccta 1200 gacggcaaaa gcgaaacaag ttgcccaatc aatgagcaag cgattcatga gcactttcga 1260 aacaaatggg caaatccaag tccatttggt gggctgggac gatttgggac ggaaaacagg 1320 gccaacaacg cccacctcct cgggccaatc tccaaaagcg aggtccaaac tagcctccga 1380 aatgcatcga acgcctccac accaggccca gacggcgttg ggaaaaggga catttccaac 1440 tgggatcctg agtgtgagac cctcactcag ctgtttaaca tgtggtggtt cacaggtgtc 1500 atcccctctc gcttgaagaa aagtcgtacg gtgcttctgc ccaagtcctc agacccagga 1560 gcggagatgg agatcggcaa ctggagacca atcaccatcg ggtcgatggt cttgcggctt 1620 ttcacaaggg tgatcaatac gagattaacg gaagcctgtc cgttgcaccc aagacagaga 1680 gggtttcgac gaagccccgg gtgttcggag aacctngaag tactcgaatg tctcctccga 1740 cactccaaag aaaagcgcag ccaactggca gtggtattcg tcgattttgc acaagcgttt 1800 gacaccgtct ctcatgaaca catgctgtca gtccttgagc agatgaacgt ggatccccac 1860 atggtaaatc tgatccggga gatttacaca aacagctgca caagtgtcga gctaggccgg 1920 aaagagggac cagacatccc agtgagggtt ggtgttaagc aaggggatcc tctgtccccg 1980 ctgcttttca acctggcttt ggatcctctc atccaaagtc tcgaacgcac aggcaaaggg 2040 tgtgaggccg aaggtcacaa agtgacagct ttagcgttcg cggatgacct ggcactggtt 2100 gcgggctcgt gggagggaat ggcacacaac cttgcgcttg tagacgaatt ctgcctaacc 2160 accggcctca cagtccaacc caaaaagtgc cacagtttca tggtcaggcc ctgcagaggt 2220 gccttcacag tgaacgactg ccccccatgg gttctggggg gcaaggccct gcagctaaca 2280 aacatcgaaa actccatcaa atatctggga gtaaaagtca atccttgggc ggggattgaa 2340 aagcctgacc ttacagtggc actagaccga tggtgcaagc gcattgggaa gtcactgctc 2400 aaaccctcac agaaggtata cattctcaat cagtttgcca tcccgcgact cttctacctg 2460 gctgatcacg gtggggccgg cgacgtcatg ctccagaacc tggatgggac aatcaggaag 2520 gcggtgaaga aatggctgca tcttccaccg tcaacctgca acgggctgtt gtatgccagg 2580 aactgtaatg gtggcctcgg tatatgcaag ctcactcggc acatcccatc aatgcaggcg 2640 agacgaatgt tccgcttggc caactcatcg gacccgttga tgaaggccat gatgcgcggc 2700 tcccgagtcg aacagaaatt caaaaaggcc tggatgcggg ccgggggaga ggagagtgcg 2760 ctcccacggg tgttcggggc gaatcagtac caggaagggg aggaggtcgc taacgatctg 2820 gtacctcgct gcccaatgcc gagcgattgg agactggaag aattccaaca ctggatgggc 2880 ctgccgatcc agggtgtggg tatagccggc ttcttcagaa acagggtggc taacggatgg 2940 ctcaggaagc cggcagggtt caaagagcgg cactacatcg ccgctctaca actgcgagca 3000 tgtgtatacc ccaccctcga attccagcaa aggggcagga gcaaagcggg tgcggcctgc 3060 aggcggtgct catcccggtt ggaatccagc tctcacatcc tcggcaaatg tccggcggtg 3120 cagggagcca gaatcaggcg tcataacaaa atatgcgacc tcctgaaggc cgaagccgaa 3180 acccggggtt gggaggtacg ccgggaatgg gccttcagaa ctccggctgg ggaactgaga 3240 aggctcgacc tggtactcat cctcggggat gaggcattgg tcattgacgt cacagtaagg 3300 tacgagttcg ctccggatac cctccagaat gccggaaagg acaaggtcag ctactacggc 3360 ccgcacaaag aagcgatcgc tcgggagctg ggcgtaagaa gggtcgacat acatgggttt 3420 ccgttgggtg cacgcggact ttggctcgcc agcaactcca aagtgctgga actgatggga 3480 ttgagcaggg aaagagtgaa ggtcttctcc agactcttga gtcggagagt gctcctgtac 3540 tctatcgaca tcatgaggac attttacgca accctgcaat ga 3582 // ID DNA-5-6_DR repbase; DNA; ZEB; 1229 BP. XX AC . XX DT 22-OCT-2008 (Rel. 13.1, Created) DT 22-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 5-bp; KW DNA-5-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1229 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1238-1238 (2008). XX DR [1] (Consensus) XX CC TSD is 5 bp long, TIR is 24 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1229 BP; 368 A; 256 C; 251 G; 353 T; 1 other; cagggctcga aattaacctt tttgcttggt agcaccggtg ctcctaactt taaaaattta 60 ggcgcatcag ccaaaattta gtcgcaccca ccaattatga gcaccgttac tacaagtttt 120 atacaaacat atttattgta tttgaaatca acactaatca aactaacaag caaaaaaata 180 tatatatata tatatgcaag cgtgaggaaa atatgtctca acgaaatccc gttatcacaa 240 gaaaatacat ttttataaca gaactgagtg accaaaagat acacggagcg ctcaccggcc 300 ctgcacgcac tgcttgaatg aatctgttaa cggtataact gtgctgttct cacaagtgct 360 gtctgatatt gcactgatgc ttttgacaca tactgtacct tattatatgc atacgtcatg 420 ttaatagcaa taccggtcta tttttatgag tagcgatatt agtttactca gtgcaagccc 480 gtttgcgatg gtgtacgtgg tgttaaattt gacatatagc tgccccttgc acggcggaca 540 gcatctggcg attatatgaa ggattaaaca gaagctctcg tgctagatgt ggcaaacagt 600 tacctgaaaa ctccatccca cagacttttc atagcggact tgaagccaat ggaagtcttc 660 tactcttgga aaagtgtaga tggtggatta acattcagca catgatcact agtaagatgt 720 gtcattattt ataactttag atgycatggt ttctaaaaca aaatctgtca gtgttatctt 780 cattctcatc ttcagcgctt tacaattttt cgcggtagta gctctctctg tcatcccaag 840 ccagaaggcg ctgactgagt gattgacagc tgatattaac caatcattcg cgttcagtgc 900 tagagcagtg gtgggccaat aagaagagcg tgaaggcggg gcaagcatta cggacttgga 960 tttgtttact gcagcaagtt gacatgacaa ctgttttaaa ccattcctga gcgctgcgtt 1020 tcgcgtttca agtgcagaca aagagcttag agatgcattc tgcgtgaatg tctcccgaat 1080 ccgcgcatgt ggtgaacatt ttgcagtaaa aactggtcgc acacaacaat tttaagcact 1140 cgcagaaatg ctcccaaata tattttaagg tcgcatagat aaaatttcgg gcgcatatgc 1200 gaccaaaatg gtcgcaattt cgagccctg 1229 // ID Harbinger-N10_DR repbase; DNA; ZEB; 328 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Harbinger-N10_DR is a non-autonomous DNA transposon that belongs DE to the Harbinger superfamily. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Harbinger superfamily; Harbinger-N10_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-328 RA Kapitonov V.V. and Jurka J.; RT "Harbinger-N10_DR, a nonautonomous DNA transposon from RT zebrafish."; RL Repbase Reports 4(10), 280-280 (2004). XX DR [1] (Consensus) XX CC Harbinger-N10_DR is a nonautonomous DNA transposon that is a CC member of the Harbinger superfamily. Its is characterized by CC 15-bp terminal inverted repeats and the TWA target site CC duplications. XX SQ Sequence 328 BP; 83 A; 90 C; 69 G; 86 T; 0 other; agctgcggtc acactggact tttctcccca tagacttcca ttcatacgca cgcgaatgcg 60 tcagaccgga aacgcaaggt cgtgcgtcaa gtttcgcagt tcgctgcgtt gcaaagttca 120 agcttggtga actctgacct gcgaaatcgc atcacttgac tgcgtgagac caatcgagga 180 tcaaaacatg acctctctgg acagaaattt aaaatatgga ccaatcgctc gcttttttaa 240 atgtctaatc atcttgttta atcccgcccc ttttcgcagc gccgtacgac agaatttcgc 300 atgctcaaac tctagtgtga ccgcagct 328 // ID DNA-2-17_DR repbase; DNA; ZEB; 885 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-17_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-885 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1708-1708 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 885 BP; 275 A; 129 C; 144 G; 337 T; 0 other; cacactctaa aaaacgttga gttatttatt taacccaatg gttgagttaa acatatttgg 60 tgctttgttg ggttattttt aacctttatt aggttattta ttaataactc aaccagttga 120 gttaaatatt tggtgctttg ttgggttatt tttaaccttt atttggttat ttatttaata 180 actcaaccag ttgggttaaa aatttagaat ttcaacagtc aactgattta actgacacaa 240 aacagctgta ttaatatgcg tgcgtaatgt tgtttttgcg ttataaagtt tatttaagct 300 ataacgcaat aatattgttg ttatttttta tcaaattacc tctccgtgtc gtgtttttat 360 atccgtttca ccatccaaga ccatctttta gaagcgtttg gatgtggtta aaatgtattg 420 tacattgtgg acactgagtg aagcctgctg cagctgctgg acagtctcta cagcacacag 480 agctccatat tctgctccat caacacatga ttttctgctt gcacagttta taactgtctg 540 cactttgttc tgtttcttca ttgtttgtga ttcttgattt attttaacaa aagtgtctgg 600 aactaaaatg taatggaaat aaagcatttt caatatattt tttgaaaact gtatgttatt 660 tatttacaca gccataatta taaaattaat agtagtacta gtaatagtag caatagtaat 720 accagaatgg aaaaaataac atttaatcaa aataacccaa tgcattgggt taaattatat 780 aacccaatgc attgggttaa aataacccat agttgggaag gtgctataat aacctatagt 840 tgggttatat gttggggtat ttttaaccca actattttaa cagag 885 // ID GYPSY5I_DR repbase; DNA; ZEB; 4212 BP. XX AC AL672044; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE GYPSY5I_DR is an internal portion of the GYPSY5_DR LTR DE retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYPSY5I_DR; KW GYPSY5LTR_DR; endogenous retrovirus; gag; pol; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4212 RA Kapitonov V.V. and Jurka J.; RT "The GYPSY5_DR LTR retrotransposon from zebrafish."; RL Repbase Reports 2(5), 18-18 (2002). XX DR Genbank; AL672044; Positions 28401 24109. XX CC GYPSY5I_DR is an internal portion of the GYPSY5_DR LTR CC retrotransposon. CC GYPSY5_DR was active recently, it is flanked by identical CC GYPSY5LTR_DRs. CC Remnants of the ~300-aa gag and ~1080-aa pol proteins are encoded CC by one ORF (1358-aa, positions 137-4141). XX FH Key Location/Qualifiers FT CDS 137..4210 FT /product="GYPSY5_DR1p" FT /translation="MQQLRHPNPAVTPPATMTASQPTHVSNAPATVSPRLA FT FPEKFDGSPSRCKGFLLQCKLFVNQQPALYPTDSSRVAFVCNLLTGKALEW FT ATAIWKEEECNYPTFETFLKQFREVFEHPADGRSPGEQLLSLTQGRNTAAE FT YALSFRTLAAQTNWVEDTLKTLFRRGLSTNLQSELACRDEGKKLNELIELT FT IRLDHLIRSRRNPRTQLHVSSDTPPDEPMQLGFTRLTHEERMHRLQNHLCL FT YCGQSGHRKSTCSVRPQTPVQSVSPSHVPALFSVLIPVIIKFDKRMINTTA FT LVDSGSAGNFISKKFALCHELSLSSYDSCLAVEALDGRPVGEGRIRMITNK FT LQLQVGVLHIEEMQFYIIDSVNHPLVLGLPWLRRHDPHISWRDGQILQWSE FT SCMKQCIQPIHKIPLRTTNVLPELVDELIPIEYHSSSSPPTGRIFPLSQPE FT HKAMSEYIDEELAKGFIRPSTSPASAGFFFVKKKDGSLRPCIDYRGLNKIT FT VKFRYPLPLVPPALEQLRKARYYTKLDLRSAYNLVRIRAGDEWKTAFSTTR FT GHYEYTVMPFGLSNCPSVFQSFMNDVFRDMLDRWFIIYIDDILIYSNTMKE FT HVEHVRMVLQRMIQHRLYAKLEKCEFHQTQIAFLGYVISAEGITMDDTKVQ FT AVQRWPLPQNLKELQRFLGFANFYRRFIRGFSSIAAPLTAMTKRNSHKLSW FT SSEARQAFSDLKTQFTTAPILRHPNPDLPFIVEVDASNTGVGAVLSQRQGQ FT PSKMYPCAFFSRKLTSAERNYDVGNRELLAMKLALEEWRHWLEGASQQFTI FT LTDHKNLEYLRSAKRLNPRQARWALFFTRFDFIVTYRPGSKNSKADALSRL FT SEDKTPISDETIIPTNLLVAPVQWDILTEIQEANRENNPPVDCPNHLIFVP FT NNIRTRLLTHIHDTPSSGHPGITATLELVKARFWWPSLTKDVINYVQKCKI FT CQSTKASHQTPAGLLQPLPVPQRPWSHIAVDFITDLPVSQNNTVILTVVDR FT FSKACRLIPLPKLPSAFETAEALCNHVFRMFGLPDDIVSDRGSQFTSRVWS FT ALCKNLNINVSLTSGYHPQSNGQTERMNQEVIRFLRSYCHQQQHDWSRYLM FT WAEYAQNSLIKPATGITPFKCILGYQPPLFPWSGEPTELPAVTDWLQRSED FT VWNQAHRHLMRAVRRREIQANRHRRQGPTYQPGQWVWLSTRDLRLRLPCRK FT LSPRYVGPFKIVRQITPVSFRLALPNHYRISPTFHMSLLKPAGGPREEAAE FT EESETRTPPPVMIEGEEAYLVREVLDSRRRGGALQYLVDWEGYGPEERSWV FT NARDILDPTLTEEFHQRFPEKPAPRQRGRPRRQVPSRFRSRSQEGG" XX SQ Sequence 4212 BP; 1128 A; 1099 C; 921 G; 1064 T; 0 other; gaagacttcg ccactacaga atccagcagc tttagcccaa gtctcgactg aactttccgc 60 tcaagccact cagttagcca cccatcatca acaactcacc catctaactt ctctcacgga 120 ggaactggtg agagctatgc agcaactccg tcatccgaat cctgcagtta ccccgcctgc 180 taccatgaca gcttctcaac caacgcatgt ctccaatgct ccggctaccg taagtccccg 240 attagctttt cctgaaaagt ttgatgggtc accttctcgc tgcaaaggct tcttattgca 300 atgcaaacta tttgttaacc aacaacctgc attatatccc actgattcta gtcgggtggc 360 ttttgtatgc aacctgttaa cggggaaagc gctggaatgg gctacagcga tatggaagga 420 ggaggaatgt aactatccca catttgaaac ttttctcaag caatttcggg aagtgttcga 480 gcacccagca gatgggagaa gccccggtga gcagttactt tctcttacac aggggcgtaa 540 cacagctgca gaatacgccc tgtcatttcg cactctcgcc gctcaaacta actgggttga 600 ggatacactg aagacactat tcagacgggg tctgagtacc aatctacaat ctgaattggc 660 ttgccgtgac gaaggaaaga aattgaacga actcattgaa cttaccatcc gtcttgatca 720 tctgattcgc agccgtagaa atcctcgcac acagctccat gtttcaagtg acacgccccc 780 tgatgaaccc atgcaactgg gtttcacccg tctcacccat gaagaaagaa tgcatcgatt 840 gcaaaatcat ctatgcctgt actgtggtca atcagggcac aggaaatcca cctgcagcgt 900 acgacctcag acaccagttc aatcggtgag tccttcccat gttcctgcac ttttcagtgt 960 actgattcct gtgattatta aatttgataa aagaatgatt aatactacgg ctcttgtcga 1020 ctctggatct gccggaaact ttatatctaa gaaatttgct ttatgtcacg aactatctct 1080 cagctcatat gattcctgtc ttgcagtgga agcactagac ggtcgtcctg tgggtgaagg 1140 acgcatccgt atgattacga ataagctaca gttacaagtt ggtgttttac acattgaaga 1200 aatgcaattc tacattattg attcagttaa tcatccactg gtgttgggac tgccctggtt 1260 aaggcgacat gatcctcata tctcctggag ggacggacag atattgcagt ggagtgagtc 1320 atgcatgaaa cagtgcatcc agcccattca caaaattcca ttacgaacca ccaatgtgtt 1380 acctgaactc gttgatgaac taatccccat tgagtaccat tcaagttcaa gtcctccaac 1440 gggtcgaata ttccctctgt cacaacctga gcacaaggca atgtctgaat atatcgacga 1500 ggaattggcc aagggtttca tccgaccttc tacatctcct gcttcagctg gattcttctt 1560 tgtaaagaag aaagatggca gtttacgtcc ctgtattgat taccgagggc tcaataaaat 1620 caccgtaaaa ttccgctatc ctttacccct cgtacctcct gccctggaac aattaagaaa 1680 ggctagatat tataccaagc ttgatcttcg cagtgcatat aaccttgtca gaatccgtgc 1740 tggtgacgag tggaagaccg cgttttccac cactaggggg cactatgagt acacggtaat 1800 gcccttcggc ctgtcaaact gtccatccgt gttccaatct tttatgaacg atgtctttcg 1860 ggatatgctg gatcgctggt tcatcattta catcgacgac atcctcatat attcaaacac 1920 gatgaaggaa catgttgaac atgtacgtat ggtgttgcaa cgcatgattc aacatcgtct 1980 gtatgccaaa ttagagaagt gtgaatttca ccagacacag atcgcctttt tgggatacgt 2040 catcagtgca gaagggatta ccatggatga tacgaaggta caggcagttc aacgatggcc 2100 gttaccccag aatctcaagg agttgcagcg atttctaggt tttgccaact tctatagacg 2160 gtttatcaga ggctttagtt caatagccgc accattaaca gccatgacca aacgaaattc 2220 ccacaaacta tcctggtcct ctgaagcacg ccaagcattt agcgatctga agactcaatt 2280 caccacagct cctattctcc gtcatcccaa cccagaccta cctttcattg tggaagtgga 2340 tgcatccaac acaggagtcg gtgctgtgtt atctcaacgt cagggtcaac catccaaaat 2400 gtatccatgt gccttcttct cccgtaaatt gaccagcgca gaacgaaact atgatgtggg 2460 aaatcgcgaa ttactggcaa tgaaattagc cctggaagaa tggagacact ggctggaggg 2520 ggctagtcag caattcacca tattaactga ccacaagaat ctggaatatc ttcgctcagc 2580 caaacgtcta aatcccagac aagctcgttg ggctttgttc ttcaccagat tcgacttcat 2640 tgtcacctat agaccaggga gtaaaaacag taaggcagat gcactatcca ggctatcaga 2700 ggataaaacc ccaattagtg atgaaactat tattcctacg aacttactag tggctccagt 2760 acagtgggac atacttaccg agatccaaga ggccaaccga gaaaacaacc ctccagtaga 2820 ctgtccgaat cacctgattt ttgtacccaa taacatacgt accagattac tgacccatat 2880 tcatgacact cccagttcgg ggcatccagg catcactgca actttggagt tagtcaaagc 2940 ccgtttctgg tggccgtccc tcacgaagga tgtaattaat tatgtccaga aatgtaaaat 3000 ttgccaaagc acgaaggcat ctcaccaaac ccctgctggc ctgttacaac cgctaccggt 3060 tccgcaacgg ccctggtccc atatagcggt ggacttcatt acggatctcc ctgtttccca 3120 gaacaacaca gtcattctca cggtggttga tcgtttctcc aaggcttgtc gcctaatacc 3180 gttgcccaaa ttgccctctg cctttgaaac cgcagaagcc ctgtgcaatc atgtatttcg 3240 aatgtttggc ttacctgatg atatcgtctc agacagggga tcccaattca cttcccgggt 3300 atggtccgct ctctgtaaaa atctaaatat taatgttagt ctcacctcag ggtaccatcc 3360 tcaatccaac ggccaaactg aacgtatgaa tcaagaagtc attcgatttc tacgttccta 3420 ctgtcatcag caacagcacg actggagtcg ctatctgatg tgggcagaat atgcacaaaa 3480 ttcattaata aagccagcca cgggtattac gccattcaaa tgtattcttg gctaccaacc 3540 acccctgttt ccatggtccg gagaacccac tgagttacct gctgtcacgg attggctgca 3600 gcggagtgag gatgtctgga accaggctca cagacatctc atgagagcag tcaggaggag 3660 agagattcag gcgaaccgtc accgacgcca gggacccacg tatcaaccgg ggcagtgggt 3720 gtggttgtcg acacgagacc ttcgtctgag attgccctgc cgcaaactca gtcccaggta 3780 tgtagggcca ttcaaaattg tacgccagat tactccagtg tcttttcgct tagccttgcc 3840 taatcattat cgtatttctc ctactttcca catgtcatta ctcaaacccg ctggtggtcc 3900 acgagaggag gcggctgagg aggaatcgga aaccaggacc cctccacctg tcatgatcga 3960 gggcgaggaa gcttacctcg ttcgagaagt gctggactct cgacgtcggg gtggcgccct 4020 ccaatatctg gtggactggg aagggtacgg tccagaggaa cggtcctggg tgaatgctag 4080 agatattcta gacccaactc tcacggaaga gtttcatcaa aggttcccag agaaaccagc 4140 cccccgacag cgcggtagac cccggcgtca ggtgccttct cgcttcagga gccgctcgca 4200 ggaggggggt tc 4212 // ID hAT-N20_DR repbase; DNA; ZEB; 1773 BP. XX AC . XX DT 18-OCT-2006 (Rel. 11.1, Created) DT 18-OCT-2006 (Rel. 11.1, Last updated, Version 1) XX DE hAT-N20_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; hAT superfamily; hAT-N20_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1773 RA Kapitonov V.V. and Jurka J.; RT "hAT-N20_DR, a family of nonautonomous hAT DNA transposons from RT zebrafish."; RL Repbase Reports 6(10), 493-493 (2006). XX DR [1] (Consensus) XX CC hAT-N20_DR is a family of nonautonomous DNA transposons that CC belongs to the hAT superfamily. The genome harbors several CC hundred copies of hAT-N20_DR that are ~97% identical to their CC consensus sequence. This transposable element is characterized by CC 8-bp target site duplications and 12-bp terminal inverted CC repeats. XX SQ Sequence 1773 BP; 566 A; 342 C; 309 G; 556 T; 0 other; tagggatgct catttcggtt aattttgcta accgacaacc gccgctcatt aatcggttat 60 taacggttaa ctggtcagat tactattaat tttatattaa acaaattgac gtgtctattt 120 tgtgtctgac acattaaata ttgcataaaa tactgattta tttttatatg ctagcatatt 180 aataatagaa caatacaaca gagcattttc acgcacctgc agtgtttagc acagaagaac 240 agaataaact aaataaaatg aataaaataa ataggactgc cctaaattat tttccactta 300 aataattaat ttatattaca aatcgaaaac actgactgat tatttttaat aaccggcata 360 ggctgttttt ttccaatcat atttttgtct tccgtcaaat aaaatagcag acttcctcaa 420 atcgctcgct ccacggaaaa tatgcattta tttaatgccc cgcttgtatt attataatca 480 caaaaccttt ttacattttt aatagaaatc attattttaa agattatgtc ttgacgtggt 540 ttcctttctc tccatcgcgg ctcaagtgcg cgcgctgcgt gatgaaaaga caacaacacc 600 gcgcgcatat gttgactttt tgtaaacagt tttgttgttt aaatatgata ttgcattaat 660 gtgcatacat gctttataag ctgtaaaata aaaggtaaag cctatttgtt gcgtttatga 720 tgcagatcca ggtcaacatc agcgcttttt ttggcatcaa cacgtctgtt tcaaacagca 780 atattcttct tccattttgc ttctttggca aatgtgtctg taggcacggg ttatacgtta 840 tcgtcccgca agttaagtat gtcttgcagt tgaacaagtg gccgaccaca gctaacgtgc 900 ttttttcatt ccaaaaacgc gaggcgcacc tcactgcctt tatatgttga caaggaaaaa 960 agaagagaag aaagcgcggt cctcttatga aacgactgaa tcagctgggt atcgattgtg 1020 caaaagtgtt gctgtatatg ttgataaaat tgtaatattt aataatatta tgatattcaa 1080 gatttgtaca ttcatacagc tctggataac ttaaaacagc gattagacct tcagagcggc 1140 aataatgcgt cctgaagtaa agcgaaacgg ctataaactc cagcaagata gagtgattat 1200 atgcagagtc atactttatc tataaataaa gtataactat agaaactgtg ttcatcttaa 1260 ctgaaagctt ctgcgtgttt gctggcctca cgcatcgcgc acctgtcagt cagtcagtca 1320 gtcagcacgt aaccccaaag gattaaacaa tagcgcacag cactatacgg ttacaaaaaa 1380 gtttgcgctg ttataatcca cttacctttt aatacgtttt gatgcgatta taatccgcta 1440 ttaaaaacaa caacaacaat aactgaatgt tttgaatggg taatgtagcc gtggcgggat 1500 gcattttgtt ccccgccatg gaagaatgaa tgtagcggaa accaagctct ttaaaagttc 1560 tctgtagttg tcttgacaac acaaactgct gctctgggat gagccgcgtg caaaagatgc 1620 ccctcttaac gcaaaggcgc attcaatcgg ccccttatgc agccaaacta aaaatatata 1680 aaataataat tttaatcgtt taactgatag cattaatcgg tcacaaacgc accctttcgg 1740 ttaacggtta atcgattatt ttgagcatcc cta 1773 // ID Gypsy97-LTR_DR repbase; DNA; ZEB; 385 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy97-I_DR; KW Gypsy97-LTR_DR; Gypsy97_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-385 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1506-1506 (2008). XX DR Genome; chr19; Positions 23619582 23619966. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 385 BP; 63 A; 81 C; 91 G; 150 T; 0 other; tgtcacggtt gcaggtttgt gcgcttctcc ggagtgtatg tttgatcacg tgggtttgtt 60 ttgttttggt tgtccatgtg gatttgttag gatcacgtgt tatgacgaca cccagttggt 120 ctgattagtt cgccagctga agctcatcat ggtgcctata tctgtgcaac gcttttgtgt 180 gtctgtgtca gttcgttacc atcacttatg tttgtgctgt ctgtgctcag gccctgagga 240 gtttgtctgg atctggtttc cccgttctgt tcctgtcctg tgttctacgt tcatcttgcc 300 tggcactaat tgttttattg tcagattata ataaacattg acacttgcat ttggatcctc 360 tctgactctc attcaaaacg ttaca 385 // ID ERV4_DR-I repbase; DNA; ZEB; 6893 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE Internal portion of ERV1 Endogenous Retrovirus from Danio rerio. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW ERV4_DR-I. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6893 RA Smit A.F.; RT "ERV4_DR-I - ERV1 Endogenous Retrovirus from Danio rerio."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC ORFs: gag 252-1766, pol 1767-5357, env 5382-6854. XX SQ Sequence 6893 BP; 1993 A; 1520 C; 1527 G; 1853 T; 0 other; acttggtgac ccccgacgtg atgaagctga tcaggtaaga catgaatttg gttttgacta 60 cctgattagt ttcatcactg aactgagtgt tcaggaagct ggcaagacat gaatttggtt 120 tgactgcctg ataagcttca ttactgaact gagtaatcct tggctgggat tttccctgaa 180 aagccaagaa gagcgcgcta tacaaaagta aaacgatcat tctgtgtgtt ttctagtaag 240 tatttactaa catgggtggt tcatacccca agcctgagat aggggaaggc cctaaagatt 300 ggatgtatcg atgtgggtgg ggatattaca ctaatccatg gttagataat ctagcagggt 360 ggactgaggc acaagcgatg cgtccctacc ctagagatgg gtcatttgac ccccatgtgc 420 taaaacgggc tcacaaatta atttatgaaa gtgaaggaga tcccgaatgg gaccatgatt 480 atatgaaaaa gggttatgaa atctggatga agatgatgag tgtgtggaat gataaacaaa 540 atattaacct accaaaagag gtaaagaaat taattccgaa gcctcttaaa ggatcagcat 600 caaagattga ggtaaaaaga aaggagacac ccaaagtgcc tccagcccct gttgttactg 660 ttccctcagt tacagttact gttcctactc ctaccctctc ctcccccatt cctgatgagg 720 ggcaaggaaa ggcacatcag aggatgcctc tgtatcctgt acaagaatta gagaggcttc 780 gcacagagaa accagaccaa ggggataatc aggctttcct atgggtctca aagaaagggg 840 gacatgccga agtaacccct ccctccgcaa cacagctcaa ggatgtcatt ttactgcttc 900 ctgatatgag caggccagta gaatttgtag aaactatgat acgtgtctct agaaatgcac 960 agcttacagg agctgattat aaatttatat tgatgaacaa aatggggtct caatatgatg 1020 agacagccct cattgatgct gttccttgct tgtgctctaa aaatgatgag gttaaactac 1080 aaccccagca aatgatacgt agagtaaaaa ttaaagaagg ggattatgag gaacaagaat 1140 acacaaaaga tgtaccgtac ttttactggc gggatgaaca gggtgctata gataccctga 1200 agcgacagct tactgcttac cttgtggaca ctcgttgtgc tcatagagat ttgtcacacg 1260 tgaccagctg taagcaggaa aagaaagaga cagtttctgc cttcctccag cgtttttcca 1320 atgcttggat acatctggga ggatttgatc tccctgaaag gcagccacaa ccactgttta 1380 tttccacttt tattagtaac tgtactccag atgttcagaa gatacttaaa tttcaattat 1440 ctgatcgatc tcaaatgact ccagaacagc tcaagcaaag gcttcgaacg atgactggcg 1500 atggtctgtt cgatcaaact ttgaatgtcg ccctcctaca aatgggaggt gcgcagcagg 1560 gtgggttcca aggctctagg ccggcaccta atgggggggc acgaccccga agggagggaa 1620 agtgtcatta ctgtggaaaa cagggtcatt gggcccgaga atgtagaaag aagcaatctg 1680 ataagcagaa aggagcctcc ccttaccaac agtgtaattt ttccccaacc caggtgccca 1740 atcaagggac acagggtcca caataggggt ccccacagag tcctgaaccc ccaaacgcaa 1800 caatgctctc tacttctatt atctctgtga agggccaaga accagccacc cttccggttg 1860 aggtgcaggg caaaatgtat acatttcttt tggactctgg ggcaactcgt tcgtcgtttg 1920 ggggacagta tgagggtcct atttgtgctt caactgtaga ttctgtggga atagatgggg 1980 tcccttttaa aacttctctg acccctcccc tgcttgtaaa ggttggaaac acaaggacta 2040 tggtacagtt tgcatttatg ccttcgtgtt catataactt aatgggacgt gaccttatgt 2100 ccgcattagg tgtcactatg acctttaaac atgaccagct ggtggtacac acaacagata 2160 gtgctgagcc ctgtctgggc ccactcaatg gcctgccctt ctttctgcat gacaacttaa 2220 ctgttacaga ccaagaagca ctttctagtt tacctcctca tctatgggca acccataagg 2280 atgacactgg tctggtggta tgtcctccat acgaagccac tttaaaacat aatgtccccg 2340 tcagcataaa acagtatcca ctaggtgagg aaaaggcttc ggccattgat tcaataatcg 2400 cctccttttt gaaacaaggg gtcctcagac cctgtcaaag cccttacaat acccctgtta 2460 accccgttcc gaaaccaaat ggttcttggc ggttcacaca agatttacga aaaatcaatg 2520 aggccgtaat accaataacg cctgttgtgc ccgatgtacc aaccatcatg tctagtatca 2580 attgtataca tgatactttt tctgttgttg acctctgttc tgcctttttc agcattcctg 2640 tagaagagca gacccaacct ctttttgctt tcacccacag aggaatccaa tacacctgga 2700 ccaggttacc ccaggggtat gtcgacagtc cagcctactt ttctgcagta gtacatgatt 2760 gtttacaatc attaaatatg cctgaagggg ctgctgtcca ccaatatgct gatgacctct 2820 tagtcacagc acaaaactat gatacgtgca agcatgcaac catgcttctg ctaaatcatc 2880 tggcacactg tgggtttcga gtttccaaag agaagctaca attttgccaa gacaaggtaa 2940 tatatttagg tcatcatttg tcaaaaggcc agagatcatt gacaacagac cgtattacat 3000 ctattgccac acaccctatc cccaaaacaa gacagcaact actgtcattt ctgggtttgg 3060 tcaattattg taggcaatgg atccctgatt gttcgcatta tgatcacatt ctccgctcat 3120 gtgtagagca caaagaacca ctaacctcgc ccgtcatttg gacagatgag gctcgggcag 3180 cttttcatgc tctaaagaag gctatttgtt ctgtcccagc tttaggctta ccaaattatg 3240 accttccttt ccatttgtat gtccataatg acgacagaac cattgcagga gtgttggccc 3300 agcaacacgg gggtggcatg agacctgtcg cctacctgtc aaagacttta gatatagtgg 3360 ctcggggcct ccccagatgc ttacaagcta ttgctgcctg tgcagtcatg gtacaggatg 3420 cagagaaaat tgttctctcg catcctctga ttttgcattc accacatcag gtcacgcatg 3480 tattgaataa cctcaataca cagcatttat cggcccagag aagggctggt tatgaacacg 3540 ttttgacttc tactgccaat ctgactgtga aaagcctgac tggccatagt catattgccg 3600 ctgccctcca tcacatgtta tcaccatttg atgatgaact gttggttgaa caacatgact 3660 gtttggagga aatcaatgct gtctgctcca taagatctga tttggctgaa agaccactac 3720 aacatggaga agtcattttt gtagatggct cttgttcaaa gccacaagat ggcgtctacc 3780 tctgtggtta tgctgtgtgt gcactgcctg atcaagttat agaagcccgc cctctacctt 3840 ttaagtccgc acaagcggct gaactgtatg cccttactcg agcctgcata ttatatgctg 3900 acaaaccagt tacaatctac acagactcta gatatgcctt tggtgtggca catgattttg 3960 gcgccatctg gcaatctagg ggatttattg ccgctgacgg tagaccagtg tcccatcata 4020 ccctcataac tgatttaata acggcctgcc atcttccctc gcaactggcc attgttaaaa 4080 ctcagggcca ttcgaaggaa cacactgatg aagccaaagg aaacgctctg gctgaccgac 4140 atgctaaatt agcagcacaa cagcctttgc ctgaggactc tatgcacgtt tccttgttgt 4200 ctacaacctc tgcttcgggg atgctccctg acattgattt ggctatgtta caagcatcag 4260 ccactgacga ggataagtcc ttctgggacg ccttttcttc cacgtttgat gaaaagatgg 4320 gtctatggca cgacaacact ggtcgcctat gtctcccaca tgtggcactg tcattcctcg 4380 tccacgaatt tcatggtgtc actcatcgtg gacgaagagg ggtacatgaa acaatgcgat 4440 ctttgttttg tgtcccaaaa ttagaaacca ccattaataa catacttgac aaatgcctta 4500 tatgcgctca atataatgta tcaaagcctg tggcacagca ccaacattta ccaagaccga 4560 tcacaccttt ttctgaatgg caagtagatt ttacgcatat gcccaagagg ggaccgttta 4620 agtatctact ggtttgcgtg gataaatttt caaaatgggt agaagctttt ccttgctcaa 4680 gggaaaatgc aaaggtggta acacagaaat tgatctctga aattatacct agatatggga 4740 tcccacaaag tatcgattca gataaaggaa caccttttac atcaaaggta acgcaacaat 4800 tggcaactga attaaatata aattggcgtt tccacatacc ctatcatcca caatcatctg 4860 gatttgtgga gcgaactaac agaaccttga aaggaaagat aaagaaggcc ttaagacaac 4920 acggtcataa agattgggta aaaattttac caatagtact agcagatttg agaatgacac 4980 cacaagcaaa tttggattct ttatcaccat atgaggtggt catgggaaga cctttcccta 5040 tcccatggca agggggaatt atgggagtgg gtattgagct ttctaaccac atatcagaat 5100 tctcagcggc cctaattcag aaacttaatc agtattggga aagggtaaca aagaaacacc 5160 ctgaaattcc acttgaggca gcacaccctt tccagatagg ggacaaggtg ctaataaaag 5220 atataaaacc ccagaaagat ttaactgacc ccaaatacga agggccagct gaggttctgg 5280 ctgttacacg cactgctgta ttaactgatc tttttccaca gtggatccat ggcaccagag 5340 tgaaaccatt gaagtaaatg tctatgatat gatattaata gtttttgttt tgcagactac 5400 ttctactgat aaccctactg ggactgacac aggcccacac tgaagtacac aacttattct 5460 ggcagtatgc taactggact gcacgacaga caaccaatga gtcatgtgtt gtctgccaag 5520 aagtgttctc ctctgctgtg actatatcct tgaaaccttt gcctttcaca tgtattaaac 5580 ttgctttttg tacaaatgct ttgggagata tttatgagtg gcgccccaca tgtgtttcta 5640 acaagaaaac atgggagctt ctttgcaaat caactagcag aaagaatcat tctgttgtaa 5700 tacctacacc tagaactatg ggtttgaata ttttgttgac gaaaccccag gggttcactt 5760 tttctttttg ctttaatggg acaaatggag gtaatgtatc caccttaaga gctaaagatt 5820 gtaatatagt aattggcgaa agggatcgat tttcaggggt atatttacca tggcgaaaga 5880 ataaaaaata tagttctcta ctaagtgttt gtccacgtta catcccaggc acctgtgaac 5940 tgaacaaact taaatgtgct aatatcacaa atgagacagc tgtcacggca tgttacaacc 6000 tgaacaacac atactgtcaa cgcagcaatg taacttgtgt ccccacttgg ccgagacagg 6060 atgcggtcct ggccgatgac tggtactggt attgtggggg agaaaccatc ttcaacactt 6120 accccaccaa ttggactggt ctttgtactg tgatccaatt acaacacatg gtcactgtca 6180 tgcatcttgc cagacaattc ggaagggaac ggaggtcagt tccccaggac gacgtacccg 6240 acgaacaccg tatgagatca cactggacta gattctggga ggctatgatt cccagttttg 6300 gcgtagcaga tgccttaaaa cagcaagaga tcatgcatta tagactggcc tcttttataa 6360 acagtaccac ggatgccatt tctggactaa gagaggagat gagagctctg agactaatga 6420 ctatgcaaaa cagactggct ctggacatgc tcttagcgga aaggggcggg gtctgctccc 6480 tggtcggaga gagccaatgc tgcacttacg ttcccgcgga tgatgaggat cttggacgag 6540 taggacaagc tgtaaaggcg atgaaaagga tatcatcaca ggtgtatgag gatgagatga 6600 aggaaagaaa ttttaattgg ggatggggat ttttagaaag cctgtttgga tcattggctc 6660 catatgtctc aatggtagta ccggtactta ttattttctt atgtgtatgt atttttggtc 6720 catgtcttct gagatgtttt atggaaagaa tgtataggat ggttaatgcc ttaggtaaag 6780 ggtatgaaca tttagctatg aatggaaatg agggtaacca tttaacaaat agagtttatg 6840 agaatatggt atgatccagg ctcttgtaag aaactgagcc tggaagaggg gat 6893 // ID BEL14-I_DR repbase; DNA; ZEB; 6159 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL14-I_DR; KW BEL14-LTR_DR; BEL14_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6159 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1377-1377 (2008). XX DR Genome; chr14; Positions 9444160 9438002. XX CC Positions [5163-5651] - Integrase core CC 'GATTT' target site duplication CC LTRs are 92% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 114..1310 FT /product="BEL14-I_DR_2p" FT /translation="MENSEIPNQSSLVTLSKQSMSSKCSKSSAANSAVAIA FT LAEAEEAKAKVKFAEKEMQIKMQRARLDAERACLEAQLAVLDVEREAAVAV FT AKAEALTTALSSDKLSSHKSSNQLTSGSYTTTQRTNEYVQEQARLQADKQL FT DIPAQTEYIPPPPLTEPLHASISQLPPYKSEEHYLADVEHCQDVSGIVPDK FT ETHNWCKGPEEAYKFTPYVDEYDHRVSQYVKLSPSVKHRQDPYFTHVRPPD FT NPSHLTAHPCSHLTRQSQDKNRNTNSYPQNKSYASTPPHAQFHTNTPHTNT FT GVNKDMSDLVRFMARREIISTGLLQFDDLPESYRSWKASFFTATRDLNLSA FT KEEMDLLVKWLGADSREQAKRIRAVHVKNTDRGLSMIWERLDDCFGAPEVV FT ESSLL" FT CDS 1667..3049 FT /product="BEL14-I_DR_1p" FT /translation="MSITAYKTDISAQKTTTGETEVDLAKFCPMHKKPHPF FT DKCCGFREKSIEDRKAFLKQNAICFRCVLSTKHIARNCDKLVKCIECGSKA FT HHSLLHPKPVYPLVKAQTLTKDYGGEKLDHSENDVSTKCTEVCGSGLSSKT FT CSKICLINPAIRVYAIIDDQSNQSLVRSKFFYVFNVDCDSSSYYLKTCAGV FT SQKTGRRAHGFQVASIDGGVIHSLPTLVECDDIPDDRSEIPIPQVALHHPH FT LKDIARHIPEIEPDAPIMMLLGRDIIKVHKVMKHINGPRDAPYALKLDLGW FT VIIGDVCLGNVHKPDRVNVLFTNTLDSGRQSLFKPCPNIYHIKENVCGQTL FT SVQTTAISAKPGLEEGHHTEGDVFRQTKDDEKLALSVEDQTFLHIIKECFY FT KDERNSWVAPLSFKPERRCLPNNKSQALVRFRSLERSFSKKPLMKEHFLCI FT HGKDNGERACRDCPTT" FT CDS join(3267..4601,4605..5651) FT /product="BEL14-I_DR_3p" FT /translation="MFHCFLVRPEDRNFLRFLWYEDNDPEKNVIEYRMKVH FT IFGNSPSPAVAIYGLRQAAKEAQQEFGADVGRFVERDFYVDDGLRSVPSAA FT AAIDLLKSTQAALARSNLKLHKIASNSKEVMDQFPMDDQASTLMDMDIGNG FT AMPVQRTLGVSWNLHTDTFTFQLSCDTKPFTRRGVLSTVNGVYYPFGFAAP FT VVIHGKALIRELTTDSCDWDAPLPAEKRETWQQWKNSLQELQHLQIPRHYT FT KDFSSEVSHRELCIFSDASVSAIAALAYLKSSSPDGTCKISFILGKAKLAP FT RPELTIPRLELCSAGLAVPMSNIILSEMDTEFDSVNFFSDSKVVLGYIHNE FT KWRFHVFVNNRVLRIRKSTHPQQWHFVPSEHNPADHATRSVPAACLKDTTW FT LTGPSFLSSWNRTYSCSGLFDLVDPATDVEVRAEISAFITLMKDSQLSSNR FT WIWKDLIRAIACLIHIVRTYKKELVKDGKDCKGWHHCMVSVDELSQAKNVI FT IRAVQQEAFSEEFKCIKDNKNIPKSSPLFTPNLTIDENGLMRVGGRMPRVN FT TGIDENNPIIIPRLHHIAILIVRHYHEQSQHQGRHVTEGVVRMAGFWILGA FT KRCICSLIFGCVICRKLRGKSETQMMANLPMDRVNTEPPFTYAGTDVFGSW FT TISARCTRGGLVYNKRWAVLFTCLTIRAVHIEVMESLDTSCFINALRHFIA FT IRGPVKQIRSDRGTNFVGACRELDSPSNLDELKVTKFLADQGCSWIFNPPH FT ASHMGGVWERMIGVTQRILDSMMLQLGSSKISHEVLVTFMAEL" XX SQ Sequence 6159 BP; 1875 A; 1318 C; 1351 G; 1615 T; 0 other; gtaaaacaac ggctacttca cagacagatc tacagcagtg aagaattgag cgtgtctttc 60 agtgtgatta aggaacagag ccatacgtct gtgagactgg cctctactac attatggaaa 120 attcagagat tccaaatcaa agcagtcttg tgactttgtc caaacagtca atgtcctcca 180 aatgttcaaa aagctcagct gctaattctg ctgtcgctat tgcccttgca gaagctgaag 240 aagctaaagc caaagttaaa tttgctgaaa aggagatgca gataaagatg caaagggcac 300 gtttggatgc agagagagca tgtttagagg cacagttagc agtattggac gttgaaaggg 360 aagctgctgt ggcagtagca aaagctgaag cacttacgac agcactaagc agtgacaaat 420 tatcaagtca caaatccagc aatcaactta cctcaggctc ttataccact actcaaagaa 480 ctaacgaata tgtgcaagaa caagctagac tgcaagctga taaacaactt gatatacctg 540 cacaaacaga atacatacca ccacccccat taacggagcc tctgcatgca agtatatcac 600 aactgccccc atataaatca gaggaacact atttagctga tgttgaacat tgccaggatg 660 tttcagggat tgtcccagac aaagaaacac acaattggtg caaaggaccc gaagaagctt 720 acaaatttac tccttatgtt gacgagtatg atcatagggt gtcgcagtat gtaaaactct 780 ctccatctgt caaacacaga caagatcctt acttcactca tgtgagacca cccgataatc 840 ccagccattt gacagctcat ccatgctcgc atcttaccag gcagtcacag gataagaata 900 ggaataccaa cagctacccc cagaacaaat catatgcttc aaccccacct catgctcaat 960 ttcatacaaa cacacctcac acaaacacgg gtgttaataa ggacatgtct gacctagtta 1020 ggttcatggc aagaagagaa ataatttcga caggcctatt acaatttgat gatcttcctg 1080 aaagctatag atcttggaaa gcatcatttt tcactgccac cagggatctt aatctatcag 1140 ctaaggagga gatggatctc ttggtcaaat ggttaggagc agattctagg gagcaagcta 1200 aacgcatcag agcagtgcat gtgaaaaaca cagacagagg tctctcaatg atttgggaac 1260 gtttagatga ctgttttggt gcaccagagg ttgtagaaag ctcactactc tgaagactgg 1320 agagctttcc aataatttca aacaaagaat ggaagaaact ccgggagtta ggagatctcc 1380 ttatggagct tctggtagca aaatctgaag gaatgcttta gggtcttgca tgcttagaca 1440 cagctcgtgg catagccagc ataatacaaa agctaccttt taacctacaa gataaatgga 1500 tgactttggg ctatactttc aaacaacaac acagagtacc tttccctcca ttttctgtgc 1560 tggttaactt tgctgtagag caagccaaaa tgcggaatga tcctagcttt gctcttccct 1620 ttcagtctga tcattcaagg ccagaacact ccagggccac caaaaaatgt ccataacagc 1680 ttacaaaaca gacatctccg ctcaaaagac aacgactggt gaaactgagg tagacttagc 1740 aaagttttgt cctatgcata agaaaccaca tccttttgac aaatgttgtg ggtttagaga 1800 gaaatctatc gaagatcgca aggccttttt gaagcagaat gcgatttgtt tcagatgcgt 1860 cttatcgaca aaacacattg caaggaattg tgacaaactt gtcaaatgta ttgaatgtgg 1920 cagtaaagcg caccattccc tgctacatcc aaaacctgtt tatccgcttg tgaaagctca 1980 aacactcacc aaagactatg gtggggagaa gttagatcat tctgaaaatg atgtgtctac 2040 caaatgtaca gaagtctgtg ggagtggttt aagcagcaaa acatgctcca aaatctgcct 2100 cataaaccct gccatcaggg tttatgccat catcgatgac caaagtaatc aatctttggt 2160 gagatcaaaa ttcttttatg tttttaatgt ggattgtgac agctcttcct attatcttaa 2220 aacttgtgct ggggtctctc aaaagacagg gaggagagca catgggtttc aagttgcatc 2280 aattgatggt ggtgttattc actctttacc taccttagta gaatgtgatg acatcccaga 2340 tgatcgttct gagataccca tccctcaagt tgctcttcat catccacatt tgaaggatat 2400 tgctagacat atcccagaaa tcgagccaga tgctccaatc atgatgttac ttggaaggga 2460 tatcataaag gtgcataaag tgatgaaaca cattaatgga ccccgtgatg ccccttatgc 2520 tctgaagtta gaccttggct gggtcataat aggtgacgtt tgtctaggaa atgtgcacaa 2580 gcctgataga gtcaatgttc tcttcaccaa cacactggat agtggtcgcc aatcactttt 2640 caagccatgc cccaacattt atcacatcaa agaaaatgtt tgtggtcaaa ctttaagtgt 2700 tcaaactacg gctatttctg caaaacctgg tttggaagag gggcatcaca ctgaagggga 2760 tgtcttcaga caaacaaagg atgatgagaa gctagctctg tctgtcgaag atcagacttt 2820 tctccatatc ataaaggaat gcttctacaa agatgagagg aatagctggg ttgctccact 2880 ttcatttaag cctgagagac gatgcctacc aaacaacaaa tcacaggctt tagttcggtt 2940 cagatctctg gagcggtctt tttccaaaaa acctctcatg aaagaacatt ttctttgcat 3000 tcatggaaaa gataatggcg aaagggcatg cagagattgc cccaccactt aagcttcagg 3060 atgagtgctg gtacttgcct ctttttggtg tttatcacct gaagaagccg gattagatca 3120 gggtggtctt cgattcaagt tgtcagttca acggtgtgtc actgaatgac gtacttttga 3180 agggcccaga tctaaataat gggcttcttg gggtgctttt gcgctttaga aaggaagctg 3240 tagcaatcac tacagatatt caacaaatgt tccattgctt ccttgtcagg ccagaggata 3300 ggaactttct taggttcctc tggtatgaag ataatgaccc agagaagaat gtcattgagt 3360 acaggatgaa agttcatatc tttgggaaca gtccttctcc tgctgtggcg atttatgggc 3420 ttagacaagc agcaaaagag gctcaacaag agtttggagc agacgttgga aggtttgtag 3480 aaagggattt ctacgtggat gatggtttaa gatcggtacc atctgcagca gctgcaattg 3540 atctcctcaa gagcactcaa gcagccttag caaggtccaa tttgaaacta cataaaattg 3600 cttccaacag caaagaagtt atggatcaat tccctatgga tgatcaagct agtactctaa 3660 tggacatgga cataggtaac ggtgcaatgc cagtacagcg tactcttgga gtaagttgga 3720 atctccatac agacacattc acgtttcagc tgtcatgtga tactaaacca ttcacacgtc 3780 gtggtgtcct gtccactgtg aacggcgtat attacccctt tgggtttgcg gcaccagttg 3840 tcatccatgg gaaggctctg ataagagaac tcaccactga ttcatgcgac tgggatgctc 3900 ctttgcctgc agaaaaaagg gaaacttggc aacaatggaa aaattctctt caagaacttc 3960 aacacttgca aataccaaga cattatacaa aggatttctc ttctgaagtt tcacacagag 4020 agctttgcat cttttcagat gcttccgtct ctgctattgc tgcactagct tatcttaaaa 4080 gttcaagtcc tgatggaacc tgcaaaataa gcttcatctt gggaaaagcc aagctagccc 4140 cccgaccaga gctaactatt cctcgactgg agctttgcag tgcagggtta gctgtgccga 4200 tgtctaacat catactatct gagatggaca cagagtttga cagtgtgaac ttcttttctg 4260 acagcaaggt tgtacttgga tatattcata acgagaagtg gaggttccac gtttttgtga 4320 ataaccgagt tctaagaatc aggaaaagca cgcacccaca gcaatggcat ttcgttccta 4380 gtgagcataa tccagcagac catgcaacaa gatccgtacc tgcagcctgt ctgaaggaca 4440 caacatggct tactggtcca tcattcttgt ctagctggaa ccggacatac tcctgctcag 4500 gcctgtttga tcttgttgat cctgcgactg atgtcgaagt ccgagcagaa atatcggcat 4560 ttatcacctt aatgaaggac agtcagttaa gctcaaatcg ctgatggatc tggaaggatc 4620 tcattcgtgc tattgcatgc ctaatacaca tcgtacggac ctataaaaaa gagctggtca 4680 aagatgggaa ggactgcaaa gggtggcatc attgtatggt cagcgtagat gagttaagcc 4740 aagcaaaaaa tgtcatcata cgtgcggtgc aacaggaagc tttctcagag gaattcaaat 4800 gcatcaaaga taacaaaaat atcccgaaaa gcagtcccct cttcacgcct aacctcacca 4860 tagatgagaa tggtcttatg agagtaggtg gtcgtatgcc tcgagtgaat acaggaattg 4920 atgaaaacaa tcccatcatc attccacgac tacaccacat agcaattttg atagtgagac 4980 attaccatga acaatcacaa caccaaggga ggcatgtcac ggaaggtgtt gttcgtatgg 5040 caggcttttg gatcttgggt gctaaaagat gcatatgttc cctaatcttt ggatgtgtga 5100 tttgtcgaaa actgcgtggg aaaagtgaaa cacagatgat ggccaatctg ccaatggaca 5160 gagtcaacac cgaacctcca tttacttacg caggaactga tgtgtttggt tcttggacca 5220 tttctgcccg ctgcactagg ggaggactcg tctacaacaa gagatgggca gtgttgttca 5280 cctgtttaac catcagagct gtccatatcg aagtaatgga atctctggac acatcttgtt 5340 tcataaacgc cttgagacac ttcattgcaa ttcgtggtcc agttaagcaa atacggtcgg 5400 atcgaggaac taattttgta ggagcatgca gagaactgga cagtccttcc aacttggatg 5460 aacttaaggt cacaaagttt cttgcagatc aaggatgttc atggatcttc aaccctcccc 5520 atgcttctca tatgggagga gtgtgggaga gaatgattgg agtaactcag aggattttgg 5580 actcaatgat gctgcagctg ggatcatcaa aaatctcaca cgaggttctt gttacattca 5640 tggcggaact ctagaccctt ggtaccggtt tctatggacc cagaagactc actcatactt 5700 accccctcaa cattactcac tcacaagtac agtccttgtc caccaggaga gttcgatcat 5760 gctgaccttt acagaaagca gtggaaacag gttcaaagtt tggcatcaac cttctgggac 5820 aggtggagaa agcaatacct cttcacattg caacctagaa agaaatggca aactaaaaaa 5880 caagacatca aggagggcag tgtggttctt atgaaagact aactgtctaa acgtaatcag 5940 tggcctctgg ggcgcattgt acgagtgttc ccaagtgaag atggaagggt gcgaaaagtg 6000 aagatcaagg ttatggacaa agaagtgtca aaaatgttca ttagacccat aactgaagtg 6060 gtgatgctaa ttccctccga ctaattctag gactggttta tccttctcat agcttagctt 6120 gtttagcttg tttggtatat tgtgatacca gacggggag 6159 // ID DIRS-9_DR repbase; DNA; ZEB; 6431 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.01, Created) DT 09-MAR-2009 (Rel. 14.01, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; DIRS-9_DR. XX NM DIRS-9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6431 RA Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 9(1), 15-15 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 481..2838 FT /product="DIRS-9_DR_1p" FT /translation="MEPPPISNSARNQILSGADIDLISLLSPVAPPAAERQ FT VDCGEFSVTLKPSANTQSRTLTLAEFSIAFSRFTDIICSVFPHRRRELNDY FT MAIIAELALSYGGTHFYTYHKLFSAKCAXRVTQWNQCPYWGALDTELHNRV FT FLGCRNLTCAVCRSSLHPTTSCPXIIPPXDPTQXPSKSTSYVPXPPSRNIP FT SLLSTXSKPSLPNRDICQNFDXGRCHGXPCRYLHLCSYCGGAHAKIXCPIL FT RANNKKSKNYLSTPVNISNLARELNSHPDTNFSDFLISGLTHGFHPGVSAL FT PSHNLICPNLQSATSEPETVDYLIKKEIDNKFVIGPFKAPPFNISRISPIG FT VATRKFSGKKRLIXDLSXPHNSSFPSINSTIPLEEYTLNYHDIDQAISLIK FT IAGHNAWLAKVDISSAFKIMPIHPDFWHLFGIYWRSKFYFAVRLTFGCKSS FT PKIFDMMSEALCWILSNNYGIPYLIHLLDDFLIISPPSSPPAKHLAITQQV FT FADLGIPLAEEKTSGPSTSIEFLGINLDSHKFQASLPKEKIDRIISLSQIF FT LEKQMCTKRELLSILGHLNFAMRIIPQGRPFISHLLQLSTTVQGLEEIIIL FT SKPSRDELCLWISFLKQWNGCSFFYSDLTTSPIDINLYTDAAPSIGFGGYY FT KGHWFASTWPPQMFNSIPKDQCSSALFELYPIVAAAILWGDEWSTFSILIH FT CDNEATVHCINKXRSHSQALMPFLRRLIWISAKKQFIMIAEHVPGCKNQIA FT DSLSRFSLQKFRQLAPEADPHPTPVPPYSEMILP*" XX SQ Sequence 6431 BP; 1491 A; 2220 C; 915 G; 1756 T; 49 other; aaaacaaaaa cacgcascaa gccacwcccc cgcaagaccg cacccccagg aaacccagcc 60 gccaccgatg acgtcacacc tctcccacca ggactccaat caaacgaact caatccccaa 120 tctgtccctc ctcaacycta ttctacctct ttctcctggc ctccagcccc ctctagctct 180 cccattagcc ctcatctccc ttctacttca gctactcagt tcctcccttc taactctgct 240 cctcctgctc ttcctcaaca tcaatctttt cccccccctc tccaaccctc cgcttcatgc 300 tccatcccct tccaacctgc ttctcatccc tcctccttcc cctcttcttc actccccctc 360 gctcacacta acactaaccc tacccgsymc ttttctatcc ccacaccagt ttcttctaca 420 cgcccccctt tcactctgtc ttctgccacg cccctccctc cgccgaataa cgctctagct 480 atggaacctc cccccatctc taattcagca cgcaaccaaa tcctctcagg tgcggatatt 540 gacctcatct cactcctctc acccgtmgca ccccccgcgg cagaacgcca ggttgattgc 600 ggcgaatttt cagtaaccct caaaccgtca gctaacactc agtcacgcac cctaacctta 660 gccgaattta gcatagcctt ctcacgattc accgacataa tttgttccgt attcccccat 720 aggagacgcg agctaaatga ttacatggcc attattgccg agctcgcgct ctcctatggg 780 ggcacccact tttacactta ccacaarcta ttctccgcta aatgcgcamt gcgagttacc 840 cagtggaatc agtgtcccta ctggggggct ttggacactg agctccacaa cagggtmttt 900 ttaggttgcc gcaatctaac ctgygcggtc tgccgctcca gtctgcaccc cactacctcc 960 tgtcctttma ttatccctcc tyctgatcca actcagmcac cttctaaatc taccagctac 1020 gttcctcrcc cccccagtcg taacattcct tctcttcttt ctaccycttc taaaccctcc 1080 cttcctaacc gtgacatctg ccaaaacttc gacatkggca gatgtcacgg aawgccatgc 1140 agataccttc ayctgtgctc ctactgtggc ggcgcccacg ccaaaatart ctgcccaatc 1200 ytaagagcaa acaataaaaa atcaaaaaat tacttgtcga ctcctgtgaa tatttctaac 1260 cttgctcgtg aaytaaattc tcaccctgat actaactttt ctgattttct catttcaggt 1320 ctaacycacg gattccaccc aggtgtttca gctctccctt cwcataatct aatctgtcct 1380 aacctgcagt ctgcgacctc cgaacccgaa acagtcgatt atcttattaa aaaagaaatc 1440 gacaacaaat tcgtgatcgg accttttaag gctcctccat tcaatatttc acgcattagc 1500 cccattggcg tcgcaactcg aaaattttcc ggcaaaaaac gcctcatart ygayctttcg 1560 kccccacata attcctcttt ccctagcatt aacagcacga ttccactaga agaatatacg 1620 ctcaactatc acgacatcga tcaagcaatc tctcttatca aaatagccgg ccacaacgcc 1680 tggctagcca aagtagacat ctcttctgcc tttaaaatca tgccaatcca cccagacttc 1740 tggcaccttt ttggcattta ttggcgatca aaattctatt ttgcagtccg actaaccttc 1800 ggatgcaaaa gcagcccaaa aatatttgac atgatgtcag aagcattatg ctggattcta 1860 tccaataatt acggaattcc atacctcatc caccttctag acgattttct cattatttct 1920 cccccgtcat ctcctccagc caaacaccta gcgatcaccc aacaagtttt cgctgatctc 1980 ggaattcctc tagcagagga aaaaacttca ggtcccagta cttcaatcga atttctgggc 2040 attaatctag actcgcacaa attccaagca tccctcccca aagagaagat cgatcggatc 2100 atttctctat cccaaatctt cctcgaaaaa cagatgtgca caaaacgaga actcctatca 2160 attctcggcc atctaaattt cgctatgcgc atcattccac agggccgccc ctttatttca 2220 cacctccttc aactatccac cacagttcaa ggtttagaag aaataattat tctctctaaa 2280 ccaagtcgcg atgaactctg cttatggatc tctttcctta agcaatggaa cggctgttcc 2340 tttttctata gcgacttaac aacatccccc atcgacatta acctatacac agacgctgcc 2400 ccctctattg gtttcggcgg ctactacaaa ggacactggt ttgcctcmac atggccaccc 2460 caaatgttca attccattcc aaaagaccaa tgttcttcag ccctattcga actctacccc 2520 attgtcgcag cagccatctt gtggggggac gaatggtcta cttttagcat tctcattcac 2580 tgcgataatg aagccacagt gcattgcatc aacaaarggc gctcccactc ccaagcactt 2640 atgccatttt taagacgcct tatctggata tctgctaaaa aacaatttat catgattgct 2700 gaacatgtac ctggttgcaa aaaccaaatt gctgactctc tctctcgctt ctctttacag 2760 aaattccggc aattggcccc ggaagcggac cctcacccaa cgcctgtacc tccgtattca 2820 gaaatgatat tgccataaac cacccwcttc ataatctyca ccaaacttct ctatctctca 2880 tcctgcaagc aatagctcct agaaccctcc attcatacct cacagcatgg aattcrttca 2940 aacaattcca taytctacac caacttcctt tccctgattt ttctctcctc tctatcacyt 3000 ccttcgtatc ccaccttcac acygcaaatc acctacaagc cagttcaata aaaagctacc 3060 ttagcgggat ccagtttttt cacaaattaa ttcatgggtc tccttccgay gccatcacaa 3120 attcgcaaac ctccctcctt atcaagggta ttcagaaaaa ccaccctcac cagccctgat 3180 gccagacaac ccatcacact caaaatcctt acctcatgca tccacaccct tcgcaaaggg 3240 tatatttcca cccatacagc ccgcacccta gatgccatgt ttaayctagc attttttggs 3300 tttcttagat gttccgaatt aacagttaca tctaaattta acccatctac tccaccccac 3360 catctcagat ctagctttgc aagataagga aaccatctct ttccttatca aacaaagcaa 3420 aacagatcaa atccagaaag gacactctat ctacattttc gacatacctt cccccactcg 3480 cccattccaa accctcctag cmtatctata atctaagaaa atctcaagaa gctaaccctc 3540 tggccccgct ttttactgac gacgctaacc gtccagtaac tcgattctgg ttccaaaaac 3600 accttaaaga aattcttcgc ctatcaggtt tttccccaga gcctttttcc agccactcat 3660 tcaggattgg cgcagccact acagcagcct ctaacgggct ctcccacaat cagatccara 3720 cccttggtcg ctggtcttct gaagctttca aatcytacat acgcctcagt aaataccacc 3780 tcaaagaagc acaacaggct ctaaccagac ccccaccatc ctaattacag caactacccg 3840 caaggctcca actcacaaag gtacctaata gagccccacc tgctcgctag agccctcatc 3900 ctagccaata gccccaagct accagtagca accttatagg tcttcacttc cttcttgcat 3960 cgagtttctc cgcacctccc ttctctcctt tctagcgttg agttcctccg cacctccttc 4020 tcttccttca agcgttgaac gcttccgctc ttctctccat cccctccctc tttgaacctc 4080 cccccaacat ccttactccc tacccccttc cagcgtcgag tttctccgct acttttctct 4140 tttcagcgtc gagttgctcc gctactatcc cttccttcta gctccaagyt cttccgctac 4200 tcttatctct tcagcctaag taacacatct aaagcccgac tcccccggag tcaattaccc 4260 cctcaattac atccctccac aatcgacatc tttcttcttt tctagcgttg agttcctccg 4320 catctcctat cctaagttct atagcgttga gttyctccgc atctcctttc cttccttcaa 4380 gcgtcgaatg cctccgctct ttcctccatc ccctccctct ctctcccccc cccaacatct 4440 tcactcccta cccctttcca gcgtcgagtt tctccgcttc ttttcttttt tcagcgtcga 4500 gttgctccgc tactatcact tccttctagc gttgagtatt ccgctactct cttcagccta 4560 agctttgact cccccggagt cctgctcagc ccacctccta caggagtctc cacctccccc 4620 tccctatcta gactcctgta ggagcataat ctttcagctc taactcccgc agaggtcgct 4680 ccaagagcta cgactcccac ggagtcctcc ccctccccct ctctcctgcc ccggccaaat 4740 acgcatcttt cctaccttct agcgtcgagt tcctccgcat aggctcccgt cctatattcc 4800 ctagcgtcga gttcctccgc atctcctttt cttccttcca gcgtagaatg cttccgctct 4860 tttctctctc ccctcttata acccctctcc tctcccctac ccttaactcc caatccccct 4920 tccagcgttg agttccttcg ctacttttct ttcttcagcg tcgagtttct ccgctacctt 4980 actacattct agcgtcgagt tcctccgcta ctctttctat atttcaaagc atcactcacc 5040 caagctcaga ctcccccgga gtccccgccc agcccaccac ctacaggagt ctccacctcc 5100 ccctcccttc ccagactcct gtagtagccc aactctacag ctctaactcc cacggagttg 5160 accacagagc tccgactccc atagatttct tttctttacc tttccagcgt cgagttcctc 5220 cgcatctcat gctctctcct tcctagcgtc gagttcctcc gcatcctctt tacttccttc 5280 cagcgtcgaa tgcttccgct cttctctcca ccccctctac aacccctctc cccttcccac 5340 tccacacccc ttttcccagc gttgagttcc tccgctactt ttcttctttc agcgttgaat 5400 tactccgcta cttcctattt ctagcatcaa atctctctgc tactcttctt ccttccgtta 5460 ggcacccttc ccaaccctgc ctccccaaac ccgactccca cggagtcccc gaccactcct 5520 agaatcagtt acaacttctc ccacagtaac ctctctcact ttaacttata ttccagcagc 5580 cggatatagc actgaatctc ctgccttttg gggggttttt tcttcgaata cgcggctgct 5640 gtcccgagcg raaaacattt gcatttttgg ggagttctcg agatctacct gagctcaaac 5700 tcccctctcg ccctgctaac gggagggagc cccgggctcg aggatctcat gagctcgggg 5760 ctctctcccg ggacagcatg ccaaataagc tttattaatc atcagctaag tgtgaactct 5820 tgaagtgaag tttattcata aactaatttc gagaggatca cgtgcttatg attwatcacg 5880 gccggccccg tattagctaa tccgtaatca gcccaatcag atgattccta aagcactata 5940 aataacccga gtttttcact tcagtttatc ttcgtcttga agaamccccc cttccacccc 6000 ttcatcctcc tcctttacct gaaattcggg cggcacggtg gcccagtggc tagcactgtt 6060 gacctcacag caagaatacc gccggtccta cttcgatcgg accggtgagt gtttctgtgt 6120 ggagtttgca tgttctcccc gtgttcgcgt gggttttccc cgggttctcc ggtttcctcc 6180 caccatccaa agaacatwaa acatacccaa attgactaaa tcaaattatc acctaataca 6240 acctcagctt acacttctca cggckacaac ggcaggggag ttctcgagat ctacctgagc 6300 tcaaactccc ctctcgccct gccgacggga gggagccccg ggctcgagga tctcatgagc 6360 tcggggctct ctcccgggac agcatgccaa ataagcttta taaatcatca gctaagtgtg 6420 aactcttgaa a 6431 // ID DNA-1-6_DR repbase; DNA; ZEB; 625 BP. XX AC . XX DT 20-OCT-2008 (Rel. 13.1, Created) DT 20-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-625 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1215-1215 (2008). XX DR [1] (Consensus) XX CC The copies of this family are ~90% identical to the consensus. CC The insertion site shows TWA pattern, and the TSD is likely only CC one base pair (the central W). The family is characterized by CC 25-bp TIR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 625 BP; 183 A; 149 C; 131 G; 161 T; 1 other; aaggccggct cacactgtgc gatttttttt ataatcctga acgattgtag catgtcagac 60 tgcacgaaca tggctcccat gtcacactgt aagatctcag ctgtcataaa gtcagactga 120 acgacaatga agatgcgcca ccaaacacga aagaaaactc ccctggagtt tgacgtcatc 180 cacccgtgac actttcacta tcccgcgttc tgaaatgatt cctcacagca aacgtaaaca 240 atctgtagcg gcaattttgc acacggcgtg tctcaataat aaaacmacag gaagaaaaaa 300 actgttttga catttccccg cggtcatttg aattgtcgca tggattgcgt catcaaattc 360 ggagctccta ttggttcttg gattgacgct catcgcagct gttgtcacac tgcaggaaag 420 tgtctgaaat cttctgacac tgccagaact tcatcggagg aaaaatctga tcgcaacggg 480 cactaagcgg ctgtctgtga acatgtcaaa ccaacgatca aagatcacag attttagcct 540 aggatttcag gaatctttta ggatttccca aatttgtctc agatgacaaa atcgtggctg 600 aaatctcaca gtgtgagcag ggctt 625 // ID hAT-6_DR repbase; DNA; ZEB; 3529 BP. XX AC . XX DT 22-MAR-2010 (Rel. 15.09, Created) DT 22-MAR-2010 (Rel. 15.09, Last updated, Version 1) XX DE DrAc1 has 16 bp. TIRs with 8 bp. TSDs. XX KW hAT; DNA transposon; Transposable Element; hAT-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3529 RA Arensburger P. and Atkinson P. W.; RT "haT-6_DR."; RL Repbase Reports 10(9), 1227-1227 (2010). XX DR [1] (Consensus) XX CC At least two complete copies are present in the current D. rerio CC assembly. This element is more similar to Ac elements of the hAT CC tree. XX FH Key Location/Qualifiers FT CDS join(225..1247,2078..2944) FT /product="hAT-6_DR_1p" FT /translation="MASGAPCDLVSKKNTTAAVWEYFGFRPNEKGEVVNTE FT DAICKLCYKKVIARDGNTSNLRSHLRIHHPLTAARMDQSVSATVSASTGAG FT ATTSRSTAITQPTIMGAFSKTIQYKRDSARWKTCTDAVTKYLAKEMVSFYT FT VEKKSFKDMVKVLDAQYELPGRKFFSQTAVPQLYSKVRDDVQVLLSAADSY FT SLTTDMWSSVNMTPYMSLTVHIITPDWKLESKCLQTTYFPESHTADNLAAT FT LTAALQEWQLDEKKLSAITTDNAANIHAAIRSLHWPWLNCFGHNLNLAVTN FT GLHDQKQKTERTLGLCRNIVGAFSHSWQRKHELHKKQVDLGLPKHSLVTDC FT ATRWGSKLAMVERILEQAQAIRHVLSDDRRSSLSLTWQDMDVLKAIHEALK FT PVGDFTDILSGENYVTSSCILPILQLCRDNVLAASENDLQLTKSIKTGILT FT KLEAKYESSSVRKILRKCTFLDPRYRGGYEADDNALAETKAELQAEIVSLE FT AAGPVAIRVEEGEAQLEPLRKKMTLGSMLQKKPDAMAGAVGSGTVEDRVGA FT EITAYCLEPVTQGDEDPLLWWKSAAGRFPQMSRVARKYLCVCATSTPSERV FT FSTAGKVVGPQRSLLKPDKVNMLVFLAKNID*" XX SQ Sequence 3529 BP; 1031 A; 660 C; 715 G; 1123 T; 0 other; taggcttggg cgatattacg gtaatatcgt ataccgcggg atctaaaaat agcaacggtg 60 tcagtttcaa taccgttata ccgtcataaa aaaatttttt taaaaggcac ttttatagga 120 gtcaaggatt aaatattatg aaaatataat ttatttaata attaaaatgc gtatgtgtcg 180 ttgtcattac gggacagcgt ggagaaagag agaggtgagg aaacatggcg agtggcgcac 240 catgtgacct ggtttctaaa aaaaacacaa cggcagcagt ttgggagtat tttgggtttc 300 gaccgaacga gaagggagag gtggtgaata cggaggatgc gatctgcaaa ttatgctaca 360 aaaaagtgat cgcgagggat ggaaatacct cgaacctaag gtcgcatttg cgaattcacc 420 atccactcac ggctgcaaga atggatcaaa gtgtcagtgc aaccgtttca gcatctaccg 480 gtgcaggagc aacaacatcg aggtccaccg ccataacgca gccgacgata atgggggcct 540 tcagcaaaac aatacagtac aaaagggaca gtgcccgttg gaaaacctgt actgatgcag 600 taactaaata cttggcaaag gaaatggttt ctttctacac tgtagaaaaa aaatcattta 660 aggacatggt aaaggtccta gatgcccagt acgagctgcc tggacggaaa tttttctcac 720 aaacagccgt cccacaatta tatagtaaag ttagagacga tgttcaagtg ctgctatcag 780 cggcagacag ttattcttta acaactgaca tgtggtcgtc agttaacatg acaccatata 840 tgtctctgac tgtccatatt attacacctg actggaaact tgaatccaaa tgtctccaaa 900 ctacgtattt tccagagagt catacggcgg acaatctagc tgcgacgctt acagctgcac 960 ttcaagaatg gcaacttgac gagaaaaaac tctcagccat aaccactgac aacgctgcca 1020 acattcatgc cgcaatcagg tccctgcatt ggccgtggct taattgcttt ggtcacaatc 1080 taaacttggc tgtcacaaat ggattgcacg accagaagca aaaaaccgag cgcaccctcg 1140 gactatgccg taatattgtc ggggcctttt cacacagctg gcaacgtaag catgaactcc 1200 acaagaagca agtggacttg ggtctcccaa aacatagtct cgtaacggta agcataaaac 1260 gagagttcct caggggaaag gactcaaatc tataaaataa aataaattat taaataaata 1320 aaataaaaaa acatgttttt aattgttttt atttctggga cttatttagc ctgagaaatg 1380 tgcatgcaat tttaactaaa tagaataatg tacattaatg gacgtattta tttattcgtg 1440 tatttaggcc tatttattca tttatttatc cgttaaacta ataaataggc cttaataagt 1500 aaatttcgtt tattttaggc ctatttattc attcattcgt ttatgtaggc ctattcattc 1560 attcgttcgt ttattaaggc ctatttattc gtttattttt ttatttaagc ctatttattc 1620 ttttatttaa gcctattcat ttgtttattt aagcctattt attcgtttat ttaagctcat 1680 ttattaattt attcgtttat ttaagcctat ctattcgttt atttaagcct atttattcgt 1740 ttatttaagc ctatttatta atttatttgt ttatttaagc ttatttatta agttattcgt 1800 tcatttaagc ttatttattc attcatacgt ttatttaagc ctatttattc gtttatttaa 1860 gcctatttat tcatttatct aagcctattt attgtttatt taggctaatt catttatttt 1920 gttaaacgga taagtggaaa aatacgcctt tgtttattta gacttactca ttaattattt 1980 cgtttattta tgccaattta ttcattcgtt tatttaggcc aattcattaa tttcatcgtt 2040 tattcattca ttaattcatc catccatccg tcggcaggac tgcgcaactc gctggggctc 2100 caaactggca atggtggagc gcatcctcga gcaggcccaa gcgatcagac acgtcctgtc 2160 tgatgacaga aggagcagcc tgtccctgac ctggcaggac atggacgtct tgaaagctat 2220 tcacgaagca ctgaaaccag tcggtgactt cactgatatt ttgtcaggag aaaattatgt 2280 gaccagttcc tgtatcctcc ccatactaca gctctgcagg gacaatgttt tggctgcgtc 2340 agaaaatgac cttcagctaa caaagtcaat caaaactgga attctaacaa agctggaggc 2400 caagtatgaa tccagttcag tgcgcaaaat attgcgaaaa tgcactttcc tcgacccgcg 2460 ttaccgcgga ggatatgagg cagatgacaa cgcattggct gagacaaagg ctgaattaca 2520 ggctgagatc gtcagcttag aggcagcagg tccagtggcc ataagagtgg aagagggaga 2580 ggcgcaacta gaacccttac gaaaaaaaat gactctggga agtatgctgc aaaaaaaacc 2640 tgatgcaatg gcaggtgccg tcggcagcgg caccgtagaa gaccgagtcg gagctgaaat 2700 aacagcttac tgtttggagc cagttactca aggagacgag gacccacttc tttggtggaa 2760 atctgctgcc gggcgttttc cccagatgtc acgagtggca cgaaaatacc tgtgcgtctg 2820 tgccacaagc acaccatcgg aacgggtttt cagcaccgca ggtaaagttg tcggtccaca 2880 acgttccctg ttaaagcccg acaaagtaaa catgctggtt tttctagcga aaaatattga 2940 ttagattgat gccaattgca atttgtttct attcgcattg tttaattttg tttaattaat 3000 tcaggtattt ttatttgctt tacgttagga attatattag aagactggaa ataagttttt 3060 gtttaggacc attgctctgc aatacttttt ataataagga tgatgtttat gttagcctaa 3120 ttaatctctg gttgactgtt gtgttttttt gcatatttta taattttata actgctcttc 3180 tttgttatta aaagttctgg cggtgtgatt taattgattt gtgcgcgcgt ttctgcgcga 3240 aaacggttct atatgtttta tgtttattta attctgtttg actgtcgtat ttgcactttg 3300 ttgctaataa aagatgttta ctttgttgtt aatattgttg tacatgttat tttgttattg 3360 ttttattgtt tggtattcct ttctaaacgg cttaggcaca agccaacagt ttggtgacgc 3420 tgtctgagct tacgtcataa tttttaaatt attcacggta ataccggata ccgaggtaaa 3480 atagggaggc ggtttgacgg tgtcaaaatt tggataccgc ccaagccta 3529 // ID TDR3 repbase; DNA; ZEB; 205 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; ANGEL; HP_FR1; TDR19; TDR3. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-205 RA Jurka J. and Drazkiewicz A.; RT "TDR3: non-autonomous DNA transposon."; RL Repbase Reports 2(2), 31-31 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. Similar to ANGEL, HP_FR1 and TDR19. CC Several hundred copies in the genome. CC Present in Barbel (Barbus barbus). XX SQ Sequence 205 BP; 52 A; 55 C; 51 G; 47 T; 0 other; tagagcaggg gtgtccaaac tcgttcctgg agggccgctg tcctgcagag tttagttcca 60 accctaatca aacacacctg aaccagctaa tcaagctctt actaggtata cttgaaactt 120 tcaggcaggt gtgttgaagc agggttggag ctaaactctg caggacaccg gccctccagg 180 aacgagtttg gacacccctg ctcta 205 // ID DNA-2-21_DR repbase; DNA; ZEB; 1411 BP. XX AC . XX DT 25-NOV-2008 (Rel. 13.11, Created) DT 25-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-21_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1411 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1712-1712 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1411 BP; 479 A; 248 C; 230 G; 453 T; 1 other; tgaagagttt cattgcagaa cgagatatat ccatttaaaa aaatgttggt tattttgaca 60 ttatttttca rgacatcaac agtcaagttc attcacaatt tttgtacatt aaactataac 120 ttgagctcag tgaaggccag gaacacaata ttttcaaatc caggatattt tttatttaat 180 tttatgtcta taattaagta aatcaaaaaa catgccaaaa tgctacatat taatccaacc 240 cacattgtca aacatcttaa attggttaaa aaaaaactat acatcataaa tatatggaat 300 aatatatgca aagattgatt aataataata agattctaag ttagatttgg ccagaatgta 360 cacaacataa cataacataa ctttatttta tttttttttt tgcaaaacgc tacatccacc 420 ttaggctata ttatactaca ttaataccat taccaaaagc tagtgaatca atgttttgtt 480 tgataagtaa gattaagtta gttagataaa tgacctttag cttggtacag aaagagtgga 540 gggcggctcc ttccacaccg ctgcagaaaa ccaaacctgt atgattagag cgggcggagc 600 tgcggtgtca cagagtccgt gacacaaact tgcctcgcta tgtcatattt aatgaattat 660 ttcactgtaa taataataat aataatagct tatttatttg ttgttgtgaa aaagctaaac 720 cgtttatttt tctccataac aacatgtgcc ctttaagttc ctttataata aatcaacaac 780 aacaagaatt aatgtgaggt tttaggctat aaataatagg tgcaaccgca taaataaatc 840 ttcagatagg ctactcagct acatgtaaga aattatttaa aaaataaact tataaaggac 900 acttagaaat gtcgccttgc agacaggtat cttgtgatgg ggagtaggaa aaaaggcgct 960 cctaggcctt cctcttgaca tcagcggaga ctcgttccct gctgcatttg tgtattgttg 1020 acagacgcag aagtatctta ttctggtcca cttttgtggc ctttttataa atttttgtga 1080 agttgaagac ttcatcttct atcgctaaat caatgagcaa actctgagca cttcaccggc 1140 gatgactctc ttcccagggg atgctgagct cgcgccgcat cctgaacgtt ctctaactgt 1200 tatatcgcac tttgtaaaga aactggtctg acggatattg cgttttgtga aaaaaaatac 1260 gttttgtact tggcctaaaa tgtgcattct taaatgttat taatggcagc aattcacaca 1320 tagattaagg tacatctgaa cagtgatttc caataataaa atccaaatgt caaaaaatgc 1380 cgtttatcgc attttgcaac caaactcttc a 1411 // ID DNA-2-30_DR repbase; DNA; ZEB; 388 BP. XX AC . XX DT 13-JAN-2009 (Rel. 14.01, Created) DT 13-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-30_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-388 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 8-8 (2009). XX DR [1] (Consensus) XX CC TSD is 2-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 388 BP; 148 A; 49 C; 51 G; 139 T; 1 other; cactgcaaaa aatgcttttc ttagagtttt tgtcttgttt ctagtccaaa tatctaaaaa 60 ttcttaaatc aagaagcatt ttctagacaa gcaaaaaaat attgtcttgt tttcagaaaa 120 aatttttaag tgagtttttc cttaaaacaa gcaaaataat ctgccaattg gggtaagcga 180 aataatctta tttcaaaatg aaaacaagat ttattttttt acttacccca ttggcagatt 240 attttgcttg ttttaagaac atcattttga caaaaattat gaaacaatat ttttaattat 300 caagaaaatg cttcttgatt taagaatwtt gtagatattt ggactagaaa caagacaaaa 360 cttaagaaag aaaagcattt ttgcagtg 388 // ID TE-X-5_DR repbase; DNA; ZEB; 24954 BP. XX AC . XX DT 27-FEB-2009 (Rel. 14.02, Created) DT 27-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous transposable element from zebrafish - a consensus. XX KW Transposable Element; Nonautonomous; GT minisatellite; KW CA minisatellite; TE-X-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-24954 RA Bao W. and Jurka J.; RT "transposable elements from zebrafish."; RL Repbase Reports 9(2), 567-567 (2009). XX DR [1] (Consensus) XX CC Like TE-X-4_DR, this element are also flanked by GT/CA CC minisatellite, which is not sure exist before insertion or CC generated during insertion. The TIR does not exist, and TSD is CC not determined. The 5'- ~150-bp portion is ~75% similar to the CC corresponding region of FUGUREP2, which is also flanked by GT/CA CC minisatellite. The insertion sequence corresponding to HATN-5_DR CC is masked out. XX SQ Sequence 24954 BP; 7956 A; 4148 C; 4734 G; 8039 T; 77 other; gtgtgtgtgt gtgtgtgtgt gtgtacttgc ttatgtccaa aatgaggact ttcgccagag 60 tacacaccag taaaatgagg accaccttga caaatgagga ctttttcttg gtcctcattt 120 gtctgaccat atagcagtat tcagcatgct tcaaaatggt gtcctggctt aaaatctcac 180 ttgtcctcaa aatgcacttt tacatgattt gtgtatctaa gtgtgtatcc ccccacccca 240 aaagtgtgta taacacagcg ccctctagga gaactgcaat gacgcaatga tacacacttt 300 tgggatctgg ccaatcagaa agatacaagg aggatgtcac gccccctatt gatctgacga 360 atcagaggcg aagtgcgtaa aaataaagat ggcggcagca agttactatg gttacttcac 420 agaataaaag agacagtatg aataaaattg cactttaatt agtatttaaa tgaaaatgac 480 caaactgcgt caagaaagcc ttatatttgg tgtccttaag cacggtactt tattctaggt 540 ttgaatcgta ctgtttctgc gcagatcgcg ttgtatcccc gagggaacgg ataacacgga 600 agtaagttac agtatgtaac ttagtttagg cggtacagta gcagatccga gaaatactgt 660 ggcataatag cgcgaaacgg cccgcgagag cgacacgcaa cagaatcaaa ygttttcctg 720 gtttgtgtgt tctttgaact gtctgtctgt cctacactgt cagcgaccgc accgcgcacc 780 tcccccgccc gcagctacgc tccctccggc tacaaagagc tcagctgccc gggaacgaaa 840 ggaggagagg agctccgagt ccaaccggga ggaaattagg cggaatataa tatataattt 900 gatgtatatc cgtgtataat aaatggtatc atcacttgga aactttttgt atacatgttt 960 aacttttttt ccagagttct gtttatctaa ataacatgtt aataagtatt tgtttaagtt 1020 aagaaagcta aaatgccagg tttaaatcat ttttctaaag aatgaaaagt gtattattta 1080 tatttccttt gcactgagat ttacatatat tataaatagt atgagtgaat aaatttctta 1140 ttgagtaatt taattacatt ttagataaag ataaatataa tcagtgatgg gagtatagtc 1200 caaaataatc agttaactta ctgtaacgaa atcacaaagt gaatatatat tttttctaaa 1260 acttaaaaaa tgtaaaaaaa ataaaataaa ataagaaata cttcaagcct gtttatttca 1320 ttcaaatgta aacaatgaat taagacattt gccagttctg tgtataatat atagttaaat 1380 cacttggaaa ctttttatat gtttaacttt ttccagagtt ctgtttgtta tctaaataac 1440 atgttgataa taaagtattt gtttaagtta ggaaagctaa aaggccaggt ttaaattata 1500 tttaaagaat aaaaagtgtt tgatttatat ttcctatgca ctgagattta tagagattct 1560 gatagtattg agtgaagtga ttacatttct tcttgagtaa tttaattaca ttttagatat 1620 ggtagaaaag caatttacag ccttttttaa atgctcagac actaaacgtg aaaactgaca 1680 cagactcact gaaactcttc actcttacct tattaacacc aattctacaa aactacaagt 1740 ccattgctgt agtgtatttg tggatatttg tttgtgtgtt cagttactag tttcaagaaa 1800 tattgataag tattctggaa gagctgtaaa tacaacttta atgttgtagt aattcattac 1860 tttttttaaa tcgtttaccc aacactgata ataataatgt attaatacta catgcatttg 1920 atatttttca gccaaattct taatggaata aagccatggt ctttctgtaa taacttaagt 1980 tcaggcagtt tcagacactc tttacacaaa agtctaaaca gatgtgtgtc cactataaca 2040 gatttgcact gtgatgctct tgttacaaaa caccamacaa aaagagtgac agttaaaaac 2100 aaataaaaca cttggaatga tacgatagct gtatgaatgt gtgccagttc tgtattattt 2160 ttttcacagc tgcatgtgca aaatatgagc cagaatagac acaggtgtta caaatttaga 2220 tatgaacaat ggcgacaacc taaaaagagg gaaggaaaag aaggaggaga aggaggatga 2280 ggaggatgaa agagcctgat cagcctggtc agagggtagg cctcagactg agctctgtat 2340 gaagacagtc agtctctcat tgacagaagt tttagttgcc ttgctttgca ttcactgttt 2400 ctattaaact tgtgcttttt gcataatgag agtgctgaat acacagatat tgtgatttca 2460 cctagtggaa catgttcttg gatcaacatc tcacatgtgt gaaactcttc ctggaaggct 2520 acaggtcaca gttcagcttt acttcaatga aacacccctg atcaaactaa ttgagtcctc 2580 caggcttgtt tgaaacagac attataatag atagctagat gccccattaa caagccttga 2640 tatcactttt atgaacttga aaagtakaac tttattataa ctaattcatt gtttggtaca 2700 acaactatac cgttttacca tgaagagttt ttactgtggg acagctattt agagccattt 2760 atacactata tgcattacaa aaaactaaac tcataacaaa attaattatg ggatgttgat 2820 ctctgctctg tccacttttt tgatgttgaa aattcaactc tacagtttaa caaagtgact 2880 tttattgaca tttcagtatt tgaatgataa agtgtataag aaataatatc tagagaaata 2940 cgtctgtaaa ataacaggaa atgtactggc agcttattac aagctagttt aaattggata 3000 tacaacacca acccagacaa tagatcactt taaaccaggg gtgtccaaac ttggttatgg 3060 agggccataa tcctgctgaa tttagctcca acagacctgc caggaagttt tttttttttt 3120 ttttttgaca ccataaaatt ataatcagtt aagaaaatat actgttctta tattttatat 3180 caggcaaaac agcagctcct ctgttcagtt gcctctgttt atagcgcagt cataaccgga 3240 ttaacctcgt tgacttatcg gaacaacgga ctgaagcgga ccatgaggcg tttagctttc 3300 tattataatg tctatggttt gaaacctaaa ggtttgttga agcaggctgg gaactaaact 3360 gtgcagggct gtggccctcc aggaactgag tttgacaccc ttgacctgtt gaacctggaa 3420 caacatttct attagtcaat cagattgatg aataagttca cattttatgt taaggttgat 3480 ttacattgag ggttacatgc ttctacatca ttgttgtacg actaatcatc ctctctgata 3540 ttgagaataa tttacagtta tggctaggtt tatgccgaca agaagttgtc taacagcaag 3600 aaggtcgctg gttcgagtgt catctgggcc agttggcatt tctgtgtgga gtttgcatgt 3660 tctccccgtg tcggcgtagg tttcctccac agtccaaacc catgcgctat agtggtactg 3720 atcaactaca ttagcagtag tgtattagtg tgagtgaaaa tgatgggccc actgtatgaa 3780 acatatgcca gagtagttac aggttcattc tgctgtggcg acccctgata aattagtaac 3840 tatgctgaag gatagtgagt gagtgaatga gtgagtgagt gaggtttagg ggtagggatt 3900 aagcagtggc gtagcggatg ggcccgctgt atgaaacatg ccagagtagt tagcggttca 3960 ttccgctgtg gcgacccctg ataaatcagt aactatgctg aaggacagtg agtgagtgag 4020 tgagtgagtg agtgagtgag tgagtgagtg agtgagtgag tgaggtttag gggtagggat 4080 taagcagtgg cgtagcggat gggcccgctg tatgaaacat atgccagagt agttagcggt 4140 tcattccgct gtggcgaccc ctgataaatc agtaactatg ctgaaggaca gtgagtgagt 4200 gagtgagtga gtgagtgagt gagtgagtga ggtttagggg tagggattat gcagtggcgt 4260 agcggatggg cctgctgtag gaaacatatg tcagagtagt tagcagttca ttccactgtg 4320 gcgacccctg ataaatcagt aactatgctg aaggatagtg agtgagtgag tgagtgagtg 4380 agtgagtgag gtttaggggt agggattatg cagtggcgta gcaaatgggc ccgctgtatg 4440 aaacatgcca gagtagttag cggttcattc cgctgtggcg acccctgatg aataagtaac 4500 tatgctgaag gatagtgagt gagtgtgtga gttaggttta ggggtaggga ttaggcagtg 4560 gcgtagcgga tgggcccgct gtatgaaaca tatgccagag tagttagagg ttcattctgc 4620 tgtggcgacc cctgataaat cagtaactat gctgaaggac agtgagtgag tgagtgagtg 4680 agtgagtgag tgagtgaggt ttaggggtag ggattaagca gtggcgtagc aaatgggccc 4740 gctgtataaa acatgtcaga gtagttagtg gttcattccg ctgtggcgac ccctgataaa 4800 tcagtaacta tgctgaargw sagtgagtga gtgagtgagg tttaggggta gggattaggc 4860 agtgacgtag cggatgggcc cgctgtgtga aacatatgcc agagtagtta gcggttcatt 4920 ctgctgtggt gacccctgat aaatcagtaa cttatctgaa ggatagtgag tgagtgagtg 4980 agtgagtgag gtttaggggt agggattagg cagtggcgta gcaaatgggc ccgctgtata 5040 aaacatgtca gagtagttag tggttcattc cgctgtggcg acccctgata aatcagtagc 5100 tatgctgaag gacagtgagt gagtgaggtt taggggtagg gattaggcag tggcgtagcg 5160 gaagggccaa ctgtatgaaa catgtgccag aatagttagc ggttcatttc gctgtggcga 5220 cccctgataa attagtaact atgctgaagg ayagtgagtg agtgagtgag tgaggtaggt 5280 ttaggggtag ggattaggca gtggcgtagc ggatgggccc gctgtaggaa aaatatgtca 5340 tagtagttag cggttcattc tgctgtggcg acccctgata aattagtaac tatgctgaag 5400 gacagtgagt gagtgagtsa gtgagtgagg tttaggggta gggattaggc agtggtgtag 5460 cagatgggcc tgctgtagga aacatgccag agtagttagc agttcattcc gctgtggcga 5520 cccctgataa atcagtaact atgctgaagg atactgagtg agtgagtgaa tgagtgagtg 5580 agtgagtgag tgagtgagtg agtgagttac tagagattga ttttatcact ggtgatactg 5640 tgtcaaacca gttctggaag ctcaaattag tacgatcaga tgtgtttaga ggttaaattg 5700 agggtctgct gtcccctcac ttttatttac ttaaaaaacc tataataaaa atgtgtatgt 5760 gatctatcaa gcatgtagtt tgtatttcat ttttttacta tataagttgg cagttcattc 5820 cgctgtggcg accccatgaa tgaatgaatg aatgaatata tatagtaaaa aaaaagaaat 5880 acaaactaca tgcttgatag atcaggaatc cctgttttta tatcagttgt ttattattaa 5940 taaataaata gataataaat aaagaaatgt attaatttgt ttggagatta aatttaagtg 6000 cattgttatc aaagaaatcg ggttttctaa aaaaaaagtc aaatgaaaag agcatgttta 6060 atgtgaaaca aatatgtgaa aacttcaaca cgtggatgca tacatttagc aaactccaag 6120 ttgaagtgtg ttacctgctg cagttcccct cccccacccc tctgttcgtg tgtatgtgtg 6180 tgtgtgtgtt tgtgttttaa gcacccatcc cccactgtca actcacagaa gcctcaagag 6240 agctgccctg gggtagacta cgactataac aataatatta aattagtttg taaatgtaat 6300 tcaagtttac aacacctatc aggtttcccc tcacttttat tttacttaaa aaaaaaaata 6360 aataaatata tatatatagt aataaaaaca tgtttgatgg atcaggaagc atgttgaaaa 6420 catgccttaa gattgtgtta aaacgctaar aatctctgtt tttatatcaa ttgtttttat 6480 taattaataa attatatata tatatatata atttatttat ttgtttggag attaaagtgc 6540 accatcaaag aaatcggatt ttctaaaaaa aaaagtcaaa tgaaaagagc atgtttaatg 6600 tgaaacaaat atgtgaaaac ttcaacacgt ggatgcatac atttagcaaa ctccaagttg 6660 aagtgtgtta cctgctgcag ctcccctccc ccacccctct gttcgtgtgt atgtgtgtgt 6720 gtgtgtcttg agcacccatc ccccactgtc aactcacaga cgcctcgtga gagctgccct 6780 ggggtagaga ctataggtgc gttctgaagc tcagctgcar ccggtgatca acgagattag 6840 ccgagcgcag gcggggcgga gttgaaaacg gagccgtaag acggacagct rgacacttag 6900 ggactcatgt gcagtcatga tgaccgtggc gtcatcatat tttyytgtta tttktattta 6960 taacaacaaa acatttcaaa trarwcagaa tacaggytct tcaarctgta gttccttttt 7020 aaacaataag gaattatgaa taaaatatat accaamtgca tgagagaaat aagaggragt 7080 gagaggttaa tataaacata aaaacacagg ttcgttttca aattctagga gttaaaatca 7140 atatytakgc taatacatcg tgtttgaatc tgctaaawga gggtttacat atatgtacat 7200 ttatagatat aaaaattwcc yatgtaaaaa gcaaaacaag gtgttttgat taggtcttaa 7260 taratgacat tattttatga tgaatgartt agcattttaa tccaaaaagg ctgtggtgaa 7320 aagaacaaaa ggtaaaataa gtgaactatg tagaagtttc ccagaaagag caggtagcct 7380 ttaaaytgct agatagamaa taaaatatat atttatatta ttataataaa gccakttyat 7440 gattatttaa taaaatagtt ctttacttta atttgttwag agagctttar gtgggcagga 7500 tcaatgatga tgattttctt atttcaatty aattcaattc aattcagctt tattgtatag 7560 cgttttacaa tgtagattgt gtcaaagcat cttcacatma atgtcatagt aattggaaca 7620 tgtggttcag gttttagtgt ttaagttcar ttcagttcag tttmgctcag ttcgtgtgay 7680 ttaatcatka ctgagagttc atgaattcat caatgcgagc tctaccaatc ccccagtggc 7740 gacggaaaaa aaacttcaca aaaaaagcct tgagagarcc agactcagtt gggcacgacc 7800 attwtaattt ctcaaaagtc tgtaagtgta atacaagtct gyaagactta tttgagttca 7860 tatttaggtt attcactaaa atgtacacta aaatatatca tttaaatcgt tcatggagat 7920 gaatgcagta aatagtctgt ataaaaaaag ttaaaaaatt gtgcaaacaa gctaaataac 7980 cagatttttt aacaaaagat gataytgcag taaaatattt gtatgcaaga tgtktacagg 8040 ataggagggt tgaaaagtgc attgtttgtt ttgaaactga atcggaattt gtccaattaw 8100 aaacccaaaa kacacgtgyw tgttgtcatg tggtgaactc caatcgtgta atatgtgtcg 8160 tcatcaaagc tcctgcagtc gctcttyaga agctgcacgg tytgagtttg ccrtctgatc 8220 yyggagcgct gtcgcrgyac tttgatgcat gtgcaygtgg cgtcggcgca gcatcaatcc 8280 cccaccgtya agacagactc ctcgtgagyg ctgcctggag tcgacactat gactatgatg 8340 acgacgacaa tataaaatta gtttgttaac gtaattctag tctacaacac ctatcaggtt 8400 tcccctcact ttttttttac tttaaaaata tatatatata tttttatttt ataattattt 8460 tttattatat atatatatat atatatatat atatatatat atatatatat atatatatat 8520 atatttattt atttacttat ttggagatta aattaaagtg caccataatc aaagaaaatc 8580 aggttttcta aaaaaaaaaa aagtcaaatg aaaagagcat gttcaacata aaacaaatat 8640 gtgaacactt caacacgtgg atgcatacat tttgcaaact ccaagttgaa cctgctgcag 8700 tttccctccc ccacccctct gttcgtgtgt atgtgtgtgt gtgtgttaag cacacatccc 8760 tcacagcgcc tcattcaact catgaacgcg cggcatttgc aaacacgtgc taatgacgca 8820 atcaattgga gacgcctttt cccgcgaact catctcatct ctcctgggag agcaggtaaa 8880 tcaatttatg tatttaaatg cataattact gcttacacga caaaacattg cctttgtaag 8940 atgtatgttt agtgcttata ttcctttgtc gattatattt gttgaaatgt atttaatagc 9000 aatgttagct gtatactgtt taacacacta catttctgtt gatcaaaata gtgcttgcgt 9060 gtgcttgtgt ggttgcccca aaaataaaat ctttattttt tgtttaatat ggtgattatt 9120 ttaaattact ttaagtaaat aagcctttta atgatgttaa ttagcttgcc caaacgcaca 9180 gaacattata tatatatata tatatatata tatatatata tatatatata tatatatata 9240 tatatatata tatatatata taacgtcttt attatatcaa agtatatata attttgaggt 9300 gtgtaaagtt acaaaaacat gaaaatggtt catgtgggtc tagccctgtt ttcagtctaa 9360 ataacaatgt atttatagct ttgtttaacg aatataggaa aattaatctt ggatgtgatt 9420 tacatcctac ctattctgtt tttaattaca ttatgattta atttctgatt taaattttta 9480 gatagaagta cattactaga tctatgtgaa atcatttggg ttcagcagaa caatagacaa 9540 tatataaaat gtattatgct attattatta tgttatttat tagcataata taatttatta 9600 tatttatttt ttcgtaaagt ctttatagga aacatgccct ttgtcttcat gctgggatta 9660 agaaaaaaca gaaagcgtaa aatacctcaa gaggatgcag aacattacat ttgttgtaaa 9720 agtgacaaac ctgcaagtgc aattcatcaa caattacaaa ggtaagttat actcacaaac 9780 actctatctc tctgtctata tctctctatg cagtggttca atatagaatg ataatgaaca 9840 tgtacaatat tgacatcatg ggaactcaaa atacattgaa taattattta ttatgtaaac 9900 ttttacagtt ctttttagta aaatccatgt tgtgttgcag tggtattgtt aacagtgcaa 9960 cattctaaga gacttaattg tttttcaata atattttcaa gttcatctgg cctactacag 10020 gttgaaatat taactgaaac tgttactaaa atttaaaata tttataggtc agttgtactg 10080 tacattaact ctgtgcaaga aggcttcaat agtaaacatt agttaattaa aagtattggt 10140 aaagtttaat gttaatctta ggtgatgatm atttcttaat gctgatgtat gtgttaaaat 10200 tatagtaatc tgattacaat cgttttacta aagttagtcg ttttactaaa tggggataat 10260 aactaataat attcagttgt attattttat cctatattag cacagtaact gtaatgaatc 10320 tggctttttt tctcaccgtt gactttaatc cattaatttt attttaaagt gttacctgtt 10380 gtactttatc atacttttgc aattaaattg ggttgaaaca tttattaact ctgaacatgt 10440 gtatatacaa caaagcacta cattaattat ttttgctatt ataattgtta aagtaaccaa 10500 tgcaaaccat acacacactt aaattagttt ttaatgtatc tgtaatgttg taaaatattt 10560 tacagattac cagactttta attttcagcc aggcactcca atggcttcca atgacatctt 10620 tccatttcaa aattgtcacg tttaagatct gatttcttta aaaatcagta atcttcacta 10680 cctgttatat ttatgatatg aagtgggttt cagatcggac aagaagcact gcattaaagt 10740 ccatttcccc ccgccatgtc tttaaaatat gacagttttt ttaccctgta ttttggttgt 10800 tgctagatca gatatggttg cagccgaaag ttaacgaaaa ctatttatat tatttcttaa 10860 atttcacatt tgttgtattt tattaacgtc tatccccacc ccaaccctaa atccaactgt 10920 cacagtaacg taaaaacagt agttgtattg agtattattt atgctgctat ctattaaaat 10980 acccaataca ttttattttt taaagcctac cttcacccca accttaaacc caaccgtcac 11040 agtactgtaa aaatattcat tattgttata cattcattca ttcattttct tttcagctta 11100 gtccttttat taatctgggg tcgccacagc agaatgaacc gccaactatt ccagcatatg 11160 ttttacacgg cggatgccct tccagctgca acccatcact aagaaaccca tacacattca 11220 ttcacactca tacacttcag acaattttta tcttacccag ttcacctgta ccacatgtct 11280 ttggactgtg ggggaaaccg gagcaccccg aggaaaccca cgcgaataca gggagaacat 11340 gcaaactcca cacagaaaca ccaactgacc cagctgaggc ttgaactagc gaccttcttg 11400 ttgtgaggcg acagcactac ctactgtgcc accgcgtcgc ctattataca gtgtcataaa 11460 aatgctgcta tatggatgtg catagcacat accagccggc cacgtattcg atttagactt 11520 tacccagtat tttcaatgga atttctgcgc tagccttttg ctatactgac ggcgctagtg 11580 gttataccat cgttcatcta attttacgct tgaacaacta aatgaatgct ttagtctgtt 11640 taactgaatg tattgtaatw acattacaac attgatgatg tacaaacaac tttgtgaaat 11700 tgaaaatatc cacattaagc tatcaccgga agtttatcgt tggctttaaa actctagctg 11760 tgcagagagc ccattgtgca taaacaattt tgttccaaat ccccgttaat atctaatctt 11820 tgtgctaaca aacaatagcg aacacagttg ggcaagttac ttccaaaatg waatgcatta 11880 cagtatatat atatatatat atatatatat atatatatat atatatatat atatatatat 11940 atatatatta aaagttactg ccatttcttt ttttgtaatt ggttacatga caatattaat 12000 atctcagaaa tgcattacac tacttttgta ttacttttag gttactttaa ccaaaatacc 12060 cacagaagta cgactctaat gaattaaagt attaattttc aatagtatgt ctatgttttc 12120 aaatacattt agctatatat tgtaattatt cagagaaatt tttgtttgca caaggagtct 12180 gacaacagcc agtgctccac acagagatct gatcatttat ttgacatatt ttcttttaaa 12240 tatatattca tagtgcttca tcttttacac atttatatat atatgactgt gtaaccttaa 12300 tcacttctgg ggatggggac tttttacagt aaaaggtagc aattttattt tatccaataa 12360 atgcattgat aatatccaca gatttttaaa tagtcaatca aatgaactta aactattaaa 12420 attgcaacaa acatttgttt tggcccaaac tacaaaaaca aacaaattta aaactagttt 12480 aaacacactg gtttaatcat ctatacatgt cattataact atattgtata aatattattt 12540 caaagaccta catcatgcta acaattaaaa gctgaatgac atgacataac aggcgattgc 12600 cctctgagtg cacctgaaaa gacatgaatg acacttctca taaagcttga gaagcataca 12660 ggacaagtag aacaattagt tcaaataaaa acacaaaagt ctgtagaatt taatgcaaac 12720 tcaaaaaatg caattatgaa ccaatgataa ttaattttca acaaactgca gctaatgaac 12780 gtcttttgtt taggaggatg agttttcacc actgaagctt tattcagagg ggattttgtt 12840 cttgagtaca agtttgggtg gtgagtatga ttagccgaaa ggttattaaa tatgtttaga 12900 tgtggttaaa taaatgagca aataagcaaa cataacactg ccgtgcccct atgccaggga 12960 tcaccaaact tgttcttgga gggccggtgt tctgctgatt ttagctctaa cccaaatcaa 13020 acacacctga acaagctaaa caaggtctta ctaggtatac ttgaaacacc caggcaggtg 13080 tgttaaggca agttggagct aaaccctgca gggcactgga cctctagaaa taagattggt 13140 gacccctggc ctatgctata gtatcatgat aggtgatctc aaaggtggat gatatgaaaa 13200 ttgtgtatca cccaacaata atgtccaagc actgcacaac tctgaacaag ttcgacccac 13260 gcatgtgcaa ggcaggcaat tctgtaatct atcagcttaa atatatcggc catattttga 13320 tatcagactg atgaccataa cattaaaaaa atagcattta tcggctgata ttgatatggc 13380 cgataatata ttatgcgtct caaaatttaa gtatttctga ttctgatcat tctgattaca 13440 atcagttccc atatttgatt gcaatttcat aaaaaatgtt cctcacagac ataaaaaata 13500 ttttacagga attgagaagt gaaatgtaaa ttaacacttc taatcaaatc tgtagttttt 13560 agtgaaataa ttagcaagtt ttaaaataga attaattctc gatactattg ttaacacctc 13620 tttgaagatg gtcacagaaa aaaaaatagt acatcttaaa tgtataaact ttaaaaattg 13680 tttgtatttt atgcttagtg attgttttat gttttacagc atggatgcat cgaaagaaga 13740 tgggtcttta gggagacttg taaatgatga ccacaaagct cctaactgca aaatgaaaat 13800 catatctttg ctgtgtgcaa catacttcca ggagaagata taacctacaa ttatggagat 13860 tctggttatg gtgagtaggg gtcagtagta tgagtctaca gaaatcagta ggttcactta 13920 ccaattgtct atattatgaa acataaaatt caaccataat tatacggatg ctttattctg 13980 taatattgac agtcaaaata gcaatactga ctgacatgta taagtagaaa gtctggtcag 14040 aaggaatata cagggtttcc gcggggtctt aaaaattcta aaattaaaaa aatcgaaatt 14100 taaggccttc aaaagtccta aattcactgt tctaagtcta aaatattttt gcacaggtct 14160 tatttttgcg atgtccatgt aacgctacat ctaaaactca tttaaattct ttttttgttg 14220 ttgttgctag gttttcaggg tgttgtagtt ctttatttca ctagtccaat tgtaatttgc 14280 gttattacaa ctacaaataa gaccgacatg ccctataccg acaattgcca atcagctttc 14340 tgttattgaa ctcagtgcgc gcggtaaatg tgatgtcatc actgtgttta tggaggtctg 14400 ccagccgcga catgattttg gctggcagag ctcacaaaat gttttttcag ttcgtgcggt 14460 gtcaaatttg atttgaattg aatatgaaac actttgaaga gattttgtct gaaacaggta 14520 tgactgtaca cacatagggc tggacaataa ttcgatatca atatatatcg cgatagattt 14580 ttttcaataa cagtgatgat ttttaaaccc atttccggta tttcgatata catttgcata 14640 catctgtttt attatagata agatcttata ccttgtatca tttgtcactg aatgactttt 14700 tcgtgctcag aatgagcgtg atgtcacttg cgtgatgacg tacatcacag acacgctgcc 14760 actgccagcg ctcagaagtt agttaagctg ctccgtcgca gaatgctaca aaatgctaca 14820 aaatgtgtgc cctgaatact gatgtgaatg aacaagcttc cacaactaag aatgcagaca 14880 aattagttga tgagagagga aagacgaatg cagtggtgtg gaagtgattc agctttttaa 14940 gtttcgataa acgtttagtt tcggtgctct gcaaaatgtg cggtagagtg gtgccgagtc 15000 tgccgactaa cagcggaaac acatcaaatc tgttccaccc gatagaacat actgagagtc 15060 agaagcaatg acagaagatg aggcatcatt taagtttaag tcaacctgtc gcctcaccat 15120 ccaccttccc gacacagatg gctatgaggc accgtgtagg atctaaataa aactttgctt 15180 atcaacacga gcatataaat tactcgcata taaacctata ctatgttcaa aacatgtatg 15240 aaacttgcgc atatataaac ttgatgcatg cccattgata cacccattga tacacccatt 15300 agggctggac aatgattcga tatcaatata tatcgctata gaattgtttt cagtaacgtg 15360 atatgagttt taaactcatt tccggtattt cgatataaat ttgcatatat atatatatat 15420 atatatatat atatatataa aataattata tttttaagat cttatagatt gcatgattgc 15480 atcatttgtc actgagtgac ggtcagtgct cagccatcag aaagagcatg atgtacgtca 15540 tcacgcatgt gacatcacag acacgctgcc agcgcttagc acagcagttg gttagctacg 15600 tcgcagaaag tttgccctga agtgtgccct gaatacagat gtgaatgaac aagctatata 15660 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 15720 tatatatatt tgcacagggc ttgacattaa cttttttgat caccagccac tgtggctagt 15780 agatttccaa ctttactagc cactctgcat tttcactatc cacaattttg ttgttggaaa 15840 aatatatttt atatgcataa gtttgacgtt gacatgctaa aattacttga tttaaatttt 15900 gtgttatgtc cacacgtgtt ctcaatcatt tcactttttg tgtgtcgagt atgagcttgc 15960 tcaatgagca tgagcaaagg agtcatggtt tcatagtgtt tgcacgacta gcatgacttt 16020 ttttagagct caaaattaac ggtttaccaa atttatctct ataaccatac aaaacaataa 16080 ttatttctta gacacaaatg tttaatgtgt caaaacaagc aaaatatagc tgtatactat 16140 actttttatt taataaaaca tttcttaaaa aagaaagaaa agcaattgac ttttaaaaaa 16200 ataactattg tcatgtatca aaaatattta cagtaatctg tcgtggctaa tgatcatcct 16260 ctcatttggt attcacttgc tttcttctgc tcacgtgatc acaattattt tacttctcga 16320 ttctaagatc acatttgcac gcatattggg tcatccttat attgtcgaac cctgctttta 16380 ggaaaactac aaaattgaaa aaaaattatt ttcaaccagg caaagtggct tgtgggagtg 16440 actgtctaac acgccaaagc tgaaatctac ctgaatttgg cgtgttgccg ggtgttaatg 16500 taaagccctg tgtgtgtgtg tgtgtgtgtg tgtatatata tatatatata tatatatata 16560 tatatatata tatatatatt atgtcatata tatatatata tgtctgtttg taaaggctaa 16620 atacaaaaaa gagagtgaac atttgaaaaa gttaacaagt aaaatttttt taaacctttt 16680 tattaaaaga aaagttataa gagcctggag gtattataga ctttgcaaat tcagtttgca 16740 gatatttgta ggtacagaca tctattgtgt gcgttcttgg cagttttttt atgtcttttt 16800 aatattgtcc atatcgatat cggaattata tcatatcgac cgaaattaag aagtatatcg 16860 tgatataaat ttttgtcata tcatccagcc ctatacacac atcttcttta tgatcagtcc 16920 atgcgtgatc atagggagaa gtagaaccag atgaccaata ctttttggct agaaattgca 16980 acagcagtgg gaaaggagga cagtttttgt aaaagtttgg aaaaacctga gggataagtt 17040 tgtttaaaaa aagaggccat gcaaaagtgt agacccagtt ggttttaata ttacagaata 17100 tgtttttcca tcatgcatag gttttacact aatgtataga cctatattac aatttcgaat 17160 ttaaaacaat ttacaagtta caaactaaaa ttaagtaaca aaattttgcc taattaccct 17220 aacctgccaa gttcacctaa ttaacctaat taggctttta aatgtcactt taaggtgtat 17280 agaagtgtct tcaaaaatat ctagtcaaat attatttact gtcatcatgr caaacagaaa 17340 ataaatcwgt tattagagat gagttattaa aactattatg tttgaaatgt gttgaaaaaa 17400 aaatctgctc tccattaaac gggaattggg gaaaaataaa caggggggct aataattctg 17460 acttcaactg gaaaggaatr tatraatata tgtatcagtt tacagtatat tgatgcccaa 17520 tttctaggct ttattggtra taactaaact agccagaaag tgatgtgtga attgttgagt 17580 gggcaaaata aaaaataaaa aacctcaata tttttagtga gtgtacttgt tttctgtttt 17640 tatttctgcc ttaataaaaa atataattgt agagcaaccc aagatctgtt gccattaata 17700 tttaacttta atgggtagaa ctgtctgaga tatgaagaaa agcatgtgat gcatcattta 17760 gataaaaaat gtatttaaaa aatcttgaat ggttataaaa atctttataa ttaattataa 17820 aataattaaa aacgattatc ctgaatgata ttaatgatat gacctagttc cagaagcttt 17880 ttacttctct gtttatttgt cagattttac tgtcggtttc ttcttcttct gatctcttta 17940 tgtgtatgcg tgtttttgat attgcgatat aggtcctaaa tttaatactt aatggtctta 18000 aaaaggtctt aataagtctt aaatttgaca ttatgatatc tgcagaaacc ctgaatatag 18060 aaaaagaaga atatacttgt tctgtcttga caaatatttc tgatttcgat attctctgtt 18120 ttttttctga catccagtcc cataatgctt ctgtctcctc acctgctttg gacaacgtga 18180 agatgagtga tcctcacaaa caaactgagc cacaacagtc agacaaagca agaagatcag 18240 ggaataagac ggtaaccctt tcattcaggc accaggtagc ctatctagtg atatttgctg 18300 ctaagtgaac agggcagagt aaccagaaca aggtaagaca gaccctttgg caggtttatt 18360 taaaatgtaa tcttgttcta actatataac atcatacaat tgtgtcttaa agcaatttat 18420 ttaaagatga aaggatagct caattatgta ttttcacatg ccagatattc tggagttaat 18480 tcatacagta tgaccaatag tacacataca gattttacta taaaagattg gttaattact 18540 ggcaagtgat agttggactg catagttaat cgtaaaaaga tggcgatctc gattcgaacc 18600 cctgaacgtc gatgcgtgaa cgacatttaa atgaccatat tgcagtttgg agttatgttt 18660 actacaagat agggttgtgc cgattgacac tagtgttgtg tatcgatgat agggagagat 18720 atcacagaga gctgaggcct gactatttta aagatgatgt ttggttcgtt tcacatttat 18780 gtagacctat tacatgtgtt aaattcattt ttaaataaaa gacttattat taatattatc 18840 ttgttgttga tgttattctt atcagaatta ataatcaatt aactgtaatt tgtccagctt 18900 tagctgtgtc cgtcagcatt gcctaaccga sactttcaga ctgcagcagc aatcatttaa 18960 attgaaaatt gcctgctcaa aggaaaaaaa atacaatgat tttaaaattg caaggcgagt 19020 attacctata ttcacaattt ttaccaaaat cagcatgtca aaaagaattt atttcccatt 19080 ctttgcatat caagttaata tgcaacttca ctgttactta aaaataatag gctaaataac 19140 aaaacaggat ttaattaaca aacaagttat aggtgcagta ggtgatctgc caaaatrcta 19200 accggttggc ataatatctt tgaaacacag tccctcccct gccatccaaa gccacgcctc 19260 ctaaaacacg aatgctcatc gtaaagatgg cagtggacaa cccactagat catgtcattc 19320 accatttata aatgtagcta gtgtttggcc ggtggtgtgc aggaattaca cttattacta 19380 aaccatattt gcatgctatt tcacatgaat attcagagta gtggtgaaca atataaagag 19440 cgcatcacag gctgtcattg ttcagaaatg acccaatata aatacaaaag catcttcagc 19500 tcagtaaagt aggctaggcc gaatagctct atttactgat gttttgttgt taaactaaat 19560 ataaatctac tttatcaayg ctgtaaaagg accttatgaa actgaaaata gtcgcatcaa 19620 tctttcacgg ggagatttca gtggctgaac aacacttctg tgcagataac ccattcataa 19680 caacacaatc tacatcagct ctgtgtgact aaaagagttt aacacagaac atgacctgcc 19740 taacagaaat acttcagtca tggtattatt cttcttccag cgtgcaaaac tcactccaat 19800 attgattcag gagtttaaca ggtttttatt cagcatttga ttcgcaactg tgactgctcg 19860 tctgcacata aacgcgcggc gcacatgcac gcatccacac gcaaatggtg aatctgcatg 19920 aagagatgcc cagtggttca aatctacatt tgctgacagt ttgaactact tataagaatt 19980 atgggaattg tcagcccgac aatatttaat tggatgaaca tgttttagtt ttatgcctta 20040 cccagaatat aaaaatacat ataaacacat ttagatcatt ttctttaatc tttacttttg 20100 gactgtgaag agactttcga ccagcacaac aacacatgat tctgaagatg atcacctact 20160 gcagctttaa atagacttaa aaaaataaaa taaaaccaaa aatgaaaact ttagaactga 20220 aagtggctca gatgttctgg aataaatggt gttgcagtaa tgcacaattg ttggtcacct 20280 gagactcctt ttataacagc actctcacaa gattatctct tgctttcaca aaaagttttt 20340 tatcttttat ttatatttgt tactaatagc ctattttagt tttgcctacg ggtaaaggaa 20400 gcctatggac tttttaaatt tgtttttagt gcatatattt actgatttgt caaataccca 20460 gaatatacag ggtgtccgcg ggggttgtga aaggtagtaa atgaaattag tcaaaattaa 20520 gggcattaaa aagtattaaa aagtagttaa tgtcatctga cgaggtatta cattttcgag 20580 ctcaagataa atttttagat ttttagatac attttcagtt tgtgttgagt acaggccttt 20640 attctaaaat ttgcgtggat ttatttaaat gttgagagta ggaccggagc gatatcgtga 20700 ggtggtatgg taacttggca acaaagccgc aacgtcaaca caccagggct cgaaattgtg 20760 actattctgg ttgcatatgc gcccaaaaat taagctatgc aacctcataa tatatttggg 20820 agcatgtgtg cgactgcata taatggttgt agtgcgacct gcttagattt tttctaaaga 20880 tgtgctgaat cgcttttccc ggccattaaa atggttcata ttagctgtca atcacttaag 20940 gctttccgct gtcagatgac agggagcttt tgtgactgca ggaattgcaa acgcctgaag 21000 agtgaaaagt acacaggttt gcaaatccca cctaaagtta caaataatgg cacagatgac 21060 cactatcatg tggtgaatgt tgatccacca accgagatcg agtgttttcg gagaaggtgg 21120 ccgaatctgg atgcgttgca ttcactgcgt gttcagcrca aatgtccgct aaatgtaaaa 21180 tctaatactg tacacatcat cgccaaagaa gcaggccttc taagtttaca ctgaaactgc 21240 agctcataac aaagaaccgc attgcgctgt tggtcatcgc gagaatcctg ctctgcctgc 21300 tcatttattg ggtcggctga ctcgcctgct ttattccaca aacacagaaa aatgtaaatc 21360 agctcataac tagtgtgagc atttacaatg acacaaacca aaatcaawac tttcaaagag 21420 tggcagaagt gagacttttt ttttgagatg tatattcttt ttctatttaa ttactgatga 21480 ctgtttgcag cttttagcct tgaattgaat gatttattat aatcttttgt ttgttttgta 21540 gtagaaatat tattcattaa attaactcgc atataaaaaa acatattttt gtttgatgta 21600 aactatacat ttattcttca tgaagtaaca gtaggacttt atatagcaga taacccacat 21660 tcaaggcagc atgataatca caaatgtaat tctttgttag tattattgtc atcattgatt 21720 cagcacgatt gtctaaccgt gctgagaacg gtktgtaatt gtgccgcact gatccaggct 21780 cagtggagaa acaaccataa ccgtactgaa ctgttcttag aagagcggct attgatatta 21840 atgacatgca taacattgat taaaaactat gaacaaaggc ccattgtgac atctagatta 21900 atccctctgc tttacttaaa tgtatatgtt atctgcaata tgttattgca tttagaaaga 21960 tttaatttac gccactacag ttgcagctct aagtcgtgct ggtattacat tttttttgaa 22020 ggcattaaaa aaggtagtaa aaggtattaa attcaactta agaagttctg tatataccct 22080 gaaataaata tgtttgtttt atttgtgttt aattcgatgg aatgataaac atggacatgg 22140 gtgttattca aaacatgacc tatttatttt taaatatatt atatttatat atttaaaatg 22200 ttttatttaa ttgtttaaac catattacat aaatttacaa ctactatttt accatttgtt 22260 ttggctttag ttattaaaca gattattaat aaataatcaa agaacatttt actgagctca 22320 gtcaatagtt ttgtctgtag gtctagtcaa aagtctaata caattagagc catggctgca 22380 acaagtcggc aacactgtgt ctaaaatgaa actctcttat cagaaaacaa taaacagaat 22440 tattccaagc acaacattgt gttagatttt agcaaataca aatgtaasct tgttgattat 22500 cacttatgtg actatgatca gacctgccgc actcaactga acaaaacatg gacaaataaa 22560 actctatcac agagaaatag tcatccgcgg gggtccttct ctcatctaaa caggacatta 22620 atgcatcttt gtttcttatg tgtttatcat tgtaggtgtc cccaaaggtg tgtgtgtgta 22680 tcaccagagt cctcaaagtg tagctaaagc agtgtgtgtg tatcaccagt gtccccaaag 22740 gtgtgtgtgt atgtcaccag tgtccccaaa ggtgtgtgtg tgtgtatcac cagagtcctc 22800 aaagtgtagc taaagcagtg tgtgtgtatc accagtgtcc ccaaaggtgt gtgtgtgtat 22860 tacccgagtc ctcaaagtgt agctaaagca gtgtgtgtgt atcaccagtg tccccaaagg 22920 tgtgtgtgtg tattacccga gtcctcaaag tgtagctaaa gcagtgtgtg tgtatcacca 22980 gtgtccccaa aggtgtgtgt gtgtgtatca ccagagtcct caaagtgtag ctaaagcagt 23040 gtgtgtgtat caccagtgtc cccaaaggtg tgtgtgtgtg tatcaccaga gtcctcaaag 23100 tgtagctaaa gcagtgtgtg tgtatcacca aggtccccaa aggtgtatgt gtgtattacc 23160 cgagtcctca aagtgtagct aaagcagtgt gtgtatcacc aaggtcccca aatgtatgtg 23220 tgtatcacca gagtcctcaa agtgtagcta aagcagtgtg tgtgtatcac cagtgtcccc 23280 aaaggtgtgt gtgtatgtca ccagagtccc caaagtgtat gtgtgtatca ccagagtcct 23340 caaagtgtag ctaaagcagt gtgtgtgtat caccaaggtc cccaaaggtg tgtgtgtgta 23400 ttacccgagt cctcaaagtg tagctaaagc agtgtgtgtg tatcaccaag gtccccaaag 23460 gtgtgtgtgt attacccgag tcctcaaagt gtagctaaag cagtgtgtgt gtatcaccag 23520 tgtccccaaa ggtgtgtatg tgtgtatcac cagagtcctc aaagtgtagc taaagcagtg 23580 tgtgtgtatc accaaggtcc ccaaaggtgt atgtgtgtat tacccgagtc ctcaaagtgt 23640 agctaaagca gtgtgtgtat caccaaggtc cccaaatgta tgtgtgtatc accagagtcc 23700 tcaaagtgta gctaaagcag tgtgtgtatc accaaggtcc ccaaaggtgt gtgtgtatca 23760 acagagtccc caaaggtatg tgtgtatcac cagagtcctc aaagtgtagc taaagcagtg 23820 tgtgtgtatc accaaggtcc ccaaaggtgt gtgtgtatca actgagtccc caaaggtatg 23880 tgtgtatcac cagagtcctc aaagtgtagc taaagcagtg tgtgtgtatc accaaggtcc 23940 ccaaaggtgt gtgtgtatca actgagtccc caaaggtatg tgtgtatcaa ctgagtcccc 24000 aaagtgtagc taaagcagtg tgtgtgtgtg tatcaacaga gtcctcaaat gtgtgtattt 24060 ccactgtagt ctaaactaaa ttaattatag acactcctct ttcaactaat catttgtata 24120 aagactttcm cagtaattgt tttcactgag aaacgtgtct tatggcgcat gtaagtagkt 24180 agtgtatatg aatttgaact tcacaaccct tgttttatac agtatgggtt ttagtgtgaa 24240 tggatttgag rtgtactata ttaaccatca tatgacttat cagtagagtt tcatcacttc 24300 tcatatcatt ggatattaat gtgtccattg agcttcataa actcactgta aggtcatcca 24360 tttacttctc ccttactgtt ttttaaaatt agtttgtatg tattattctg ctaacatgct 24420 gtctgtcaca tactatttat gagagaaata tgtcatttca tgaacaatac ttgacaccaa 24480 atgaccttct gacaccagat tgagttattt gtgaaggaaa aaatgtaatt gtctattttc 24540 agtgatctta tttactttat catatgtctc tctgcactaa cattattata attattaata 24600 atattatatt agaaatattc agataaccga gtcctgaaag tatactttaa cctgagtttg 24660 tgtataactg cggtcctcaa agtgtgacta taccggagtt tgtgtacatg tccccaaagg 24720 tgagttataa aatactgtcc tcattttgaa ggtaatttgt caggtcttta tagaagtatt 24780 tttttgcaaa tatcaactga tattcataat aagcacattt ttagtagtgc agtcatgtct 24840 ttcttttgtc tctagaccct tcagaaaggg tagtccccaa aggtagggtg ttgagtcatg 24900 tgtcctcatt ggtttgttag gtgggtatgc gtgtgtgtgt gtgtgtgtgt gtgt 24954 // ID ERV1-N8-I_DR repbase; DNA; ZEB; 6801 BP. XX AC . XX DT 04-DEC-2008 (Rel. 13.12, Created) DT 04-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, internal DE portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N8-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6801 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2177-2177 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 6801 BP; 2688 A; 761 C; 1454 G; 1895 T; 3 other; tttggcgtcc ctctgggtgg gctggaaagg agcaggactg gtgttgctgt gcatgctgga 60 ctggagagaa aacacaaata tgtggccaca ataaaaaacg gtaagcgatt ttatgcttat 120 tagtttgtgt tttttctgcc aggacctgct tgtacaggta aatacacttg aactgtttct 180 ccagctttaa agaattaaaa gtaaatgtaa aaaagatatt aaagaaaaaa ggggttttga 240 ataacagaag aaaaagagta atttgcatgc aaatgatctg tgttttccta agagggcctt 300 gatggatagg ataaacatta actagtaact gttactcaga tttgggaaat tattaatgat 360 ataagatatg catagagcat ttataggttt taagagagag tctttaatgt gtggtgacag 420 ttaagagatt aaagcagagt aaggtatgca aatgatctgt gttttcctaa gtgggccttg 480 atggatagga taaacaatca ctagtaactg ttacccagat ttgggaaatt aataatgatg 540 taagaaatgc atagagcatt tataagtttt aagagagagt ccttaatgtg tggtaacagt 600 taagagatta aagcagatta aagtagagag atgcgcgcta agacctacga taccgctttg 660 ttaaatgaga taaacaaagg gctgcatggg taggcaagaa agcctgtggt taccgctcta 720 aggttagagg gctgctcgga caggtcactc aagaggagtg tagtgtagag tatttattta 780 attgaagtgt ttgtgtattt gctgcttcag attgtggcaa aatagtccac attggtgggc 840 agagaagact gccttcctcc tccttgaatg gaatgaggat ggggggatgc agctcctgtt 900 acactgagtg gaggagagga gtgaatgaga agaggaaaag taataagatt agaagtgtga 960 atgtgtttaa aagtatgttt tcaataagga gtaggttaaa ataatgatta tagtatatta 1020 taatataaaa ggagggctgc atatgtgtgg tgtgatttaa agaacaatag gaatagtgat 1080 tgatgcagtt taagagaaga gattatgata taaagagagg agtctcctaa attaaaacta 1140 atggaacatg ttgcatttga aagacacaga gtaaaaagca gaagaatata atagttaaac 1200 atttttctga aattatcttt gtgttttaag tttgaatatg ttaaaaaaag aacaataggc 1260 agagtgaact taaattcctg acatagcaat aggtgaaaga cagtctgctc caggatgggg 1320 gtaagagaga gatttatggt gtccatgtgt gaacaataaa tagattgcct atttacagca 1380 ctagggaaac agagaagagg agagttaatg agaggagatc aaactataga taaaataatg 1440 acatagactt taacataatt tggatagaat ataagttatt aatgaataga taaattataa 1500 agtagtatag taagataaat gaaatgagga aatactaata atagagttat attatataga 1560 aaagagtttg aaactttact aatgaaaaag aataaatgaa gccatagtga tcaatatata 1620 tagagataaa ggaagtatgg agtaaaagaa tgtattaata aacaaataga tttactaaat 1680 aagtatgaat aaataacata taagttagtt ctttgattga gaataagaaa tagaaaaaga 1740 aaacttgaaa aatgaatatt gattaattga agatttaatt ataaaatgtg taagcttcaa 1800 aataagcttt aagaaatgta taagaaaacc taagagctaa aaactcaaat tacrgaaagg 1860 aaaaagaata aaatagaaat aggaaaaaat aagtccactg tttcatagat aaaattagat 1920 gtttaaaata tgcttacaat ataggcacat tgaacagata agataaaaag taaaaaagca 1980 gtaacaattt aaagaaataa ttgactttag agaatttgat tggatttaag ggaaataaaa 2040 aaatgttttg tcatctttct caaagtcagc tatgtcattt gtttgtagta aagtacagtg 2100 attcagatga aaatcataga agctaggtaa atgttaaaga atgcattaga gtgagttaaa 2160 aaataaattt taaagtttat gcagtaagaa gctaaattga taagtaaaca ttgcaattca 2220 aaaattgaaa aatgctttta gtaaataaat atgtgagttt aattaataat atgaaccctg 2280 agctatagca gtaatgattt tgaacagtta taccagatat tgggccacca tgtagtattg 2340 ttgaattaaa tcaggtatgg aaaaacagat atggaaataa aaaatgcaat actaaaatat 2400 gtgacagaga aaataaaaga taagataatg acaaatggca aacagaagaa acatttgatg 2460 taagcagtga aaaaatagaa actgagtata gaaaatgtta gggaaattaa aattcaatgt 2520 aaaaatgagt ggttagtaaa ggaacactag catagatgag gagaaatgta aagtaaagca 2580 taatctgaag ctaaagaaat tatttagaag aacactaaag aaatagaaaa ttaaacatat 2640 ttacatatga taaaatgatt gtgaattaga gaaaaataaa ctgttaaaac cgtaaagagg 2700 agaaaataga tgagaggagt ctccatttga tggaaaaaaa aatgggaaag aaggaaagac 2760 ttttataggt catgagctaa attccatata ttaattaaga ggacaaaggg atggactttg 2820 ggggactgtg gtctcttacc tactagacat ctctgagtgc actggcaaat ggaaaaggtg 2880 tttgaaaaac acaatggaca aactgttggc tgtggacatt tgtgaaagca gtgaggcaga 2940 cactgcatgc ccaggaatta ctgaggactg gaaaggctac ttatgttcag agacactaaa 3000 aagaggcaaa caagctaatg atgaactgat caaagagaaa aagcgctcca ccagaacttt 3060 acaacagctc aatgaaagaa gccagcacta agaaagaagg gggaaacact gtgccctgat 3120 atggacacca cagaaaagac tacaccaatc tatgaaccag ttgtcttctc agggtgtgag 3180 gagggggagt ctggcagtcc atctcaaagc agtcgaccta tagtccaaat tgagtacaat 3240 gtgaccactc agagggcaaa aataagagca aaagtgtagc tacccattct tgtatcaaat 3300 cagactccta tcaatttgat aaaaagaaaa gtatcatgta aactgtaaga aaagtcttag 3360 atgataatgt acgatgaaac tttaaatgtg taatgttgaa atgataaatt atgaagaatg 3420 ttaaattatt ttggttcatt ttcataggtt aactcataga gattattgaa cataggagga 3480 taaattataa tgcatggtgg tgattaaaaa aagataataa aagacagttt ttgataacta 3540 aaaaaaaata ataatttggg ttagaagttg atggagttaa gtatttaagt taaaattagg 3600 gaaaacttat tttgagtaaa ctaaaccaag atttgagaga aaaattatgt cagagattaa 3660 gacagaaaag tgatttatac ttgaagagaa tagaaacaca aaaaaagtga aatgaactca 3720 cagtactgaa taggtcaaaa agtaacaatt ctaaatgtac taaaagaata tttgaaagag 3780 aaaagtgttt aaaaacagaa aataggttgt tttacatatg tgaagaaata attaaaagca 3840 aaaaagaatt aaggactaaa gatgaataat cagatcagat agagaggaac ctataaagaa 3900 atccagtgat ttttcagtca acttgtagaa aattacaaca aaagttgtat gaataaccag 3960 tttgatgtga gagcaggaag gaattaggag ctgaaaatgc tgaaaagtga aaaatggtaa 4020 ttaatataaa atgcaaaata agaaatagct cagaaataag aaattactta ggtgtcaaat 4080 tttaataatg attaacagtt caggacatta agggaaaaca aaattagcca gcatagggag 4140 gattaattaa aaattaaacc agatgcaggt ctgttatatg ctggaatagg cattattgaa 4200 acttttgaga acatagcaat aatagaaaca cttagatatg ttatggcatt tctaattaaa 4260 gtcaatttca attcggttgt gattgtgtag gtgacgaata aaatggattc tatttgaatt 4320 tgatttgaat ttaaattttg attatggcat tagtcaaaac aggcaaaagt agatgttgta 4380 aagtttatga tttaagaaca gtcagtgata gagaatttct tagcagaggt tcctaacacg 4440 cactgtactt aaatttgtta gtttggatgc caaatatgtt aaagtaatta attgatttct 4500 gttcaggtaa ttatcagagg agggtagaca cttggttaca cttaaagtaa ataggctgat 4560 ttacatagtt acatacatta gaatgcataa aaggatttag aatatttcgt gatggatatt 4620 gtaacgtgta ggatttagtt atctgcaggt acatctagct caagctaagt aaaagacata 4680 aataaaactt tttagggatg acgggtttag tgttaacaag attctgtatc tattctagat 4740 gcaataataa ctgttaagat gataaataag aaaattgatt ttacattaaa gaaaataggg 4800 tagcaggtct aacaatagta tagagtcaca accttaatta gattatagta caaattaaac 4860 aacgtactag tctttatgaa caatatgtgt agcatcaaag ggaagcaaaa aagataggta 4920 tgatttttgt aataggtaca aagtacagat tgttgaggca acagcatttc tagatattgt 4980 cattggcgat gtacatggtt tgaatcaagg gtgaagtggt aaggaaaatt aagcagcaag 5040 gataatgatc tgaaaaaatt atcctttttg aatgcagtga ttagtgaatt gttgtccaca 5100 tgtgacattt gtgcaaagta gaatgtgagt aaaagatcag caataggcct aaggggatat 5160 tcaagaactt gattgtggat gttgtgaaat gataaaatta gcttagggca agagatacaa 5220 gcttttgggc atgacagttg tggtagatga gtgaaggtaa ttccatcagc agagcaggga 5280 agaagraaca gttattacat ttaagacaag agaagtctca gatttggaat aacttctgaa 5340 aaaagttcaa ataatgtgtc tgcatttaaa aataaattag agtaattaaa gtcaattaaa 5400 gtgaataaag gtaattaagt agataaataa gtggatatgt gattcagtga atagataagt 5460 aaatgtattg ataaagtatt ggaaatcgga aagaaaattg agcaattaca cataattgta 5520 aggaatgatt gaaagagctg aagatgatyc atcaaggcct agtttaacaa gatgtgttaa 5580 aataatatac cttttgcatt aataagtgat tgcatgaaaa ctgacagaaa caagcaacta 5640 acattgcatg aagtactatt gggtcgaccc ttgcatgtaa tctcgtattt aaccttgtgg 5700 aatgtttgga gaacatttct aatgcattgg aaagaagttt ggtcttttat gaggaaacta 5760 actgccatgt gcaaagcaat atctgtacag gaaaccagaa aggaccagaa gcaggagatt 5820 aatccggtga tcgagtatat ctgaggtgtt ctggagaaag tggagtgaaa tcaggagaaa 5880 tggaccttat gccaaggtta ccaaagggga tgccatcaac tggataccat cgggaccaca 5940 gcacaagaca gccaaaggaa aaaactccag caagagaaag ttaacgggaa acagaggagg 6000 cagaaaatcc caaagctcaa accaacgaat tgcaaagtgt ggcagaagca gttcatcaga 6060 acagttaaaa gagtaattct gattatatac ctgagtgaga aatacagaag gtgccacaaa 6120 ttcaagtgat gataatcacc acaacttcag gttatagact ctcaacataa agtctttagc 6180 aacacttttt aaggctagaa cgaaacaaag aatgctcgca agtatgtgcc agatttaggg 6240 tcaaaaagat gaactagtgc taaaaatttt tatagtttag ataaagaaga ggatgatgaa 6300 gaggatccat gccttgggtg ttaagtgcta gtaacctcta ccccaaaacg gcggctctct 6360 acgatacgca aagtatctgg gctgaagatc aacgtactct acggattatg ttttgtgggt 6420 gttagattag aatattgtgt tgtgacccta gattgggaga ttgtgcatgt ttattttttg 6480 ttttagtttt attctttact gtatgttaaa taatcatgac ccccagaagc aaatagataa 6540 ctaattgtat gcttaaaata ctactttttt gatgatatca acaaaataga atcaagtctt 6600 ttactgtgtc ctcgtcagaa caggacacag aggtttggcg tgattctatt ccaaagtgca 6660 attacatgat ttggtcattt gtaataatac tgtgacatat atttgtcatt aggtaatatg 6720 gctgagggct agatcatgga gttgcactat gggccaaaag caagaatatt aagacttaaa 6780 gtcttaagag gagggattat a 6801 // ID Gypsy59-LTR_DR repbase; DNA; ZEB; 1428 BP. XX AC chr15; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy59-I_DR; KW Gypsy59-LTR_DR; Gypsy59_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1428 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1434-1434 (2008). XX DR Genome; chr15; Positions 13180626 13179199. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1428 BP; 280 A; 264 C; 336 G; 548 T; 0 other; tgtggcagga gaggtaatta gttgccaagt gtcattaatt tacgggtcct tttctctgcc 60 ctatcacaat acggcatacg ggctataaaa agagggaagc atttgctacc tgggatgcgt 120 cacttgatta gttgcggcgc cgtgagcgcg cgtcgtcact tcccacacct gttgagggcg 180 gtgaggggtt ttgatcgggc tggaagttgc ttcgcctgaa ttgtttggct gcaatcgcca 240 acgctagaac gcgtgtaagg actggaaaag ctgcgagaac cagatgcggg tgcgcgactg 300 tttgttggct gtgccggctg ttgactgaaa tgtacgtgct atctgcctct taagtggctg 360 attgaacccc gattcccttc ggtgagcagt aggctgggtg gcactctgcg aggggagctc 420 cgtcgggcgc tccgcatccg tgtgcggcta cagtgctgtc ctgtttcccg tgtttttttc 480 tgttttctta gattttgtct ttgtttcctt aattgtttat ctttttgagt ttttttggtt 540 tacacaagga ttatttaatt tattttcttg ttggtacatt tatttatctt cgtttgtttg 600 ggttttcatt ttctttattg gataaatcta tatagtgcgg ctggatttat ttgtttattt 660 ttatgatttt gacattcatc cagtatgttt tgatttttcc ttacattttt gtgctttttg 720 tgtaaatctt tactgttgta ttttcatcat tgtttcattt atattctctt taattaattt 780 ggtttaagat ttaattgttt cctccctttt aaggtggaat ttagttgctc tttctttgtt 840 ttcatttatc aacttcatta catttgtatt ttgtttattt aatcatttac ttgtgtaaac 900 tgatttattc ttttgtttaa ttgcagggca gtggtccgac ctttcggctg gtgacagagc 960 cagagctccc tggggtggga ggctaatttt attggacatt ttagaatttc tcacagtgtg 1020 tagtgatcac tgtgtgctgg tttgtgcgtg tgggtgctct tatttttaca cagtggattg 1080 tagtgggttt cacttgcctg aagtatagcc tggccacttt ccctaacgct ggcgagctct 1140 ggccctggag caccttcata aaatttacaa actgtcagtg tcaatgtcaa catattggtt 1200 ttaatgctgt ttttatttga gatttgaaaa taaagaaatc ctatttttgt accgtgccaa 1260 tgtctctgac tgcaatttgg tggaacgaac ttgtgcgtgc cttttatatg ggtggtgcgt 1320 gtatttaaat tctttgggtg aaattcccaa ggtggcgtag tcgggtaaat cattaatatt 1380 gtcagattgg aggtctagct ccatcagccc acctgccccc ccgccaca 1428 // ID DNA-2-6_DR repbase; DNA; ZEB; 565 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-565 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1219-1219 (2008). XX DR [1] (Consensus) XX CC Members are 99% identical to the consensus. TSD is 2-bp long, and CC the TIR is 35-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 565 BP; 145 A; 141 C; 123 G; 156 T; 0 other; cccacatagc aaaatttctc tggcccagct ctggcccaca caatcaggtt ttgcttggcc 60 cacatgccgc agtgaattac ggtacatgac tggaccaaat ctggcttcca gacaagggcc 120 aaacacagac catatctggg ccaagtctca gccaagttaa taactcataa ctgggcctga 180 actgggccag ataggttggt gtgtcacgat tgcaatgaaa ttgataaacc catggagtga 240 tgcgctttag gcacactatg ggcacgcttt ttctcaaagt gacctgattg gtagaatttt 300 ttttctttac tcaaaaatga ttttagtgta ttttaaatgt gggtcaagac tggccaaact 360 cacatggccc acttatcaaa tttttaaatc tgggccaaat actacgtttt tcatctggcc 420 caaatcttgt gtgccgcctt aaaaacggtg ccacctctgc caaacccggg ccatgtttgg 480 cccacatgct gtatgccagt gccggatgaa tgcctgctgt gccagctttt tgccaaatct 540 gggccagaat tctttgctac taggg 565 // ID DIRS-4N1_DR repbase; DNA; ZEB; 4462 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE nonautonomous DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; phage integrase; DIRS-4_DR; KW DIRS-4N1_DR. XX NM DIRS-4N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4462 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(10), 1269-1269 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 4462 BP; 1232 A; 1229 C; 763 G; 1237 T; 1 other; aagtgaagtt tcataaacta atttcgagag gagcacgtga tatgattgtg caccgctggc 60 cactcatccg taatcagtaa taatccaatc agaatgatcc tagcttagta taaatggatc 120 actttctccc cattgcacta tcttcatttt ggaagaatgc ttcccgctac aaacgctcca 180 gcatttaaac tacgcttcag catcattcaa ccttctggca tggaagaaca aacaatcaac 240 aacaacaaca acaacttctc caaccagaac cccgctctcc cctaaactcc agcagctgcc 300 tcgccagcaa tcgagccaga atccccgcaa gaagccggca tatcaactct tccatcttct 360 acttcatctg cccttcacta cacaagccac aacttcgtct aaacttatga cgccggcgat 420 tcaagccacc ggagagaagc cgaactcaca gatcacctaa cgttaacggt ccactttcgg 480 taacgtgcat gctttaaagg gaagtctggc gagatcagcg tgaatagaac gctttaaatg 540 aagaactcac ctctcagcct cccgctggtc agcagatcca taacattctg acagctgaaa 600 tagttcgaag gcaaatgacc aggcaatcta atccataaac attattccat aacaccttca 660 ttcgtcgata tacatccatg aaaaggtgca ctgatatcca aaccagctgc tggtgagagt 720 ggagccatac tttcacgcat aaacattgta gcgatctgat ctacaaaatg gccgccggcm 780 tttgcactct gaactcttga ccgtgactcc aaggagccaa tagcttaaag gggaagtatc 840 accatccaat gagctttcca aaactggaga cggtcccgcc ttctctcccg acaaattcat 900 gaatggactg cgtgcagaga ctttgattcc ggccaaataa tcatgttgag attacagaca 960 tgttacttat catggatgga gtaaagccat gataaacatt acattcacac agttcctatt 1020 aagtggcaac atcgtttttt aagtttgatg taggcagttg atatatttac attttattaa 1080 taattatata tacatatatt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagtaaataa 1140 aattctgaaa tgagtattac ccagcaagtg atagacaacc tacacatcaa gcagtaaccc 1200 ttttgtatca cgccaattgt gactctcata tcaatacacc tgaatgtcag tcagtttata 1260 atgacagtaa gaaaacaatt tcctgctcga ctgcaacaaa caagaggagg acatttctgg 1320 ccatggagta gcttttcata cttatacttt gtgttaaata cattcgagtt ttaaaaactt 1380 ctatatcagt aaggggttta aactccactc catcatcatc tcaccagctc tccaagaggt 1440 ttcctggact tccagcatag aaacgcccca tacgctcagt tctccttcta attaccaccc 1500 tgcaggagga caaatgccca cttctacaac cctggcacat tgtgacatta agtattggaa 1560 catgcattaa caaaggcgct tccattcaca atcatccata cacttatgaa gactcctccc 1620 acgaatatca gccaatacaa ttaatcatgt ctgctgtaca tgtgccatgt tgctaagatt 1680 tcatttctaa ttctctcttg ttttccattt tagagttcaa gactgccagc tcctgaaact 1740 gccctcattc aacccatcac actcttcatc cattcctaat gcatttacac tcttcgcaaa 1800 gatactttct gcctatacag cccacaccct tgataccata ttcacatata cccccccccc 1860 cccccattta ctcttcatat cactattgac aatatttagc actcataacg ctccaatgct 1920 gactatcatt tgtacttccg ctatagcaga gtcgctccgc cgagcctcat tctccttctg 1980 caaccccccc cccctcttct tcccatttat agatgacaca accgccctgt cacacgattc 2040 tgatttaaga gatgcttata gcagttcttt gctccacagg tatcccccag atcaactccc 2100 agtcactctt acacagggtg tctccggagt atcagaaggt gttgggggct gataagtcaa 2160 tatagaggag tttgagggct taaagtatta gaaagtctta gattccttta caaggtaata 2220 catttattgt tgaacaaggt atcatcgtat gctaaagttt gactgtattc aagccgtgaa 2280 tatcaggata ctaggtagtt acgtaatcag taaaattcta ctaaggtttg acagcttgcc 2340 gcttgtttac tgcaacacta tcagtctgca gctctatagg tgacaacctg tcgaatcggc 2400 tagattttaa tatacatgtg tctgtattcg aatgtttagt tggattcatt tgttacattt 2460 taatatttag taaatattcc gtaagataaa tctcgccagc ttttttgatt gaaaatggtg 2520 ataaggtctt taaatgtgtg ggaaaagtat taaaggtgtt gaattacttc tcttattcct 2580 gtatactatg tttcagaata tgagcagcca tcacagaagt acaaagggca gcaacaaatc 2640 cagatcctag aaaggtggtc acctgatgcc ttcaagacat gcgtcctagc cactggcccc 2700 ccaaagaagc ccagatggct cttgtcagcc ataatcccac atccactcaa gggcgagggt 2760 gtgacccagc caccttacct tcttcttttt cttccttcta gcctgagtaa cactcagttt 2820 cctcccccag ccacataggt aattggagtt tcatccaagc cccccgtcac cccgccaccc 2880 tggccgtttc cgctggagtc ttcacagccc cctcccattt cccgactcct gccggacccc 2940 gcccccccta aagctctgac ttccgcagaa gtgttacccc gagctacgac ccccgcaggg 3000 gtcgtctcac cgtcccttcc aggccttagc aatctttatt tatatatata tatatatata 3060 tatatatcta tatacacctg tacatagata tatatttata atagcgctgt cactcccccg 3120 ctccatctcc taacggagtg ttcctcgagc acctaactta tcccgtcacc ctctaacagg 3180 agtcttcact gtccaaatcc cctttcccag actcctgcta gagtaggcca gcttgccctg 3240 ttccccgggc gccgaccccc gcaggggtca gtcactgccc cccccaggcc actgaaactc 3300 attatatatg tatatctatg gactttcgtt tatagatata tatttatata gagcgctgtc 3360 actccccgct ccatctccag tcggagtgtt ccacgagcat cgactccagc aagagtctgg 3420 ccaaacttgc cactcaccct ctagcagaaa tctccaccgc ccaaatcaca cttcacgatt 3480 tctgctagag atggcaaaat aaaaaattgc tgcacccaac tcccgcagcg cccattctga 3540 ctcacagaag tctcctgatc accccctcca ggccttagat tatcccattt tatatatata 3600 tatatatatt tatatactct ctcatatata catatatatt tatatatagc gctgccactt 3660 ccctgctcta tctctgtttg gagtgttcct cgagcatttt tgactcttaa tgagccaacc 3720 ccgcccaccc cttatggccc cccttcacta gtctccaccc aaccccctcc cccgctctgg 3780 cttccacagg agtcagtttc aaactttgct ccaactggag ccccctactc tttcttcatt 3840 ccttaattac tatatccagc agccggatat agtaaaaact ttctagcttt ttgggggaaa 3900 ttctttgaaa tactcggctg ctgtcccgag ctagaggcat tttttgggga gcgatcgaga 3960 cctacctgat ctcggttctc ctgatatgct tctagaccgg gcgggagccc tgggctcaaa 4020 tatctccgag ctcagggttc tctcccggga cagcatgcca aacctgctat aagtgccaag 4080 catatctaag tgggaactct tgaagtgaag tttcataaac taatttcgag aggagcacgt 4140 gatatgattg tgcaccgctg gccactcatc cgtaatcagt aataatccaa tcagaatgat 4200 cctagcttag tataaatgga tcactttctc cccattgcac tatcttcatt ttggaagaat 4260 ccccccttcc accccatctc ctcctttttc tccccttcta aagggggagc gatcgagacc 4320 tacctgatct cggttctcct gatatgcttc tagaccgggc gggagccctg ggctcaaata 4380 tctccgagct cagggttctc tcccgggaca gcatgccaaa cctgctataa gtgccaagca 4440 tatctaagtg ggaactcttg aa 4462 // ID Polinton-2N1_DR repbase; DNA; ZEB; 11092 BP. XX AC . XX DT 15-MAR-2006 (Rel. 11.02, Created) DT 15-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of nonautonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Polinton-2_DR; Maverick; Tlr; KW Polinton-2N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-11092 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This nonautonomous transposon is characterized by ~4.7-kb-bp CC terminal inverted repeats and 6-bp target site duplications. The CC consensus sequence was built based on multiple alignment of CC several copies that are >90% identical to each other. This CC nonautonomous family was derived from the autonomous CC Polinton-2_DR. XX SQ Sequence 11092 BP; 3766 A; 1799 C; 1803 G; 3724 T; 0 other; agtagtctta taatacacac agtgtcaaga tgatttattg ccttatgtca ataatttatg 60 agtatgtttt ggacggtatg gtaataattt atggcatggt ttgtccaatg tgtttcatga 120 atgtaagact ggtgaccaat gacaggctgc tttccccagg tcacagggct tagggtccct 180 gcttagtggt aggatttatg acggtttgtc ccaaattgac agttgaacca agtgtcgggc 240 tgattttggc aaagatttac agtatagtta gatttgcggc aatgtgtatg gtttgtcatt 300 tttacaacct atgcagctct tctggcattc ctgctccttt caatctgtga gtgtaagcag 360 tggggtttca ggttgggggt gtaatgggta agcaaattta gaatgaaaga atagcagtaa 420 atagtttttt gtaatttggt ccatacacat gcaaagctgg ctatccggtg tgtttgcgat 480 gtttttacat ttttttttca tttcctaaaa agtcgtttga cttttttttt gttttgtttt 540 gtttaaccat gattctcaag cctctgtcct atctgtggcc attttctgtg ggggagaaga 600 aaaggtgtca tgttttgtgt gcccttctca gaacatcagc atgtgtttat ctcagtacta 660 tcagtacaac acgatctaga ggctaatgta aaaactaaaa cattaagcag aaacaaaaag 720 tattcaaaca agtatttcca ttgttgcctt gtccaacctc agtgtgtttg agccacattt 780 ttgtagatgt acatcaagac cttttaacca attctttgag tttacatata catttagcat 840 gtcaatgtac tgaaagtatg ttgtgtttac atttttattt atttttttgc tgtttaacaa 900 atgttttggg tatctgcttt aggagtcaga tttaatatgc ttgttccaaa actgacagtt 960 gaaccgggtc tcagtcggat ttgggctaga taaacttcta gttagaattg tggcaatgta 1020 tggtttgatg ttgcatctca ttcttttttt tatgacggtg ctcctttcct gactatctgg 1080 attcttccag tctgtgagtg taagcaattg ggattctggt tgggggtgta atgggtaagc 1140 aaatttgtaa tgaaagaatg ccagtaaata gttttttgtt tctttttttt tttttttttg 1200 taaacaggcc tatacacatg caaagccggc tatccagcgt gtctgcaata ttcttaattt 1260 ttttcttaaa aaaagatagt ttgattttaa aacatttagg gttaaacgct gttacccagg 1320 cctcatctat cctatctgcc atcctactgt gcctggatgg tgactatgag tccaaattct 1380 gtgggagaga agaaaaagtg tcatgttttg tgtgtttttt ctcaaaacct tcagcagttt 1440 ttatccctgt actatcagta caacacacaa tagaggctat gtataaacaa gaaaacgaca 1500 acaataacaa caaaaacaac aacaaaaaaa aaagtcaaaa gtattcaaac aagtatatcg 1560 ctcttacata tattgtctgt ttacatgggg atcaagtgaa attagcagtg tggtagtatt 1620 aatgattttt acgatggcaa ttttgtgtat atgtgcatca acactttatt accaatactt 1680 tgagtacaac gtagagcaaa ctgaactact aaaaacaagt tgtgtgtaca tttgtgttat 1740 tgtttaacaa atgctttcat tctgatctgg ccccacagat agtttgtaaa ttataaaagg 1800 aacaaacata aaccacattt cgctatgtca gaatataaat ataaaaggaa actgtgattt 1860 ttaagcatac ctcaacatat cgcacgtcac tatattaagg cgcttttgtc aatccgcttc 1920 aaaaaaactg ttttacttta gattgttgtt caaaacgaaa ttcagcacca ttttcttcaa 1980 gtgcttttct cacttcagta gaaacatcat gcccagtttt tatgacaaac tttaatttca 2040 ggaaaaaatc tattaattta ttactgaaca ttttcctcaa aaatgtgtta aaaatgcttt 2100 atgcgtcacg atcaactgaa ctgtaggtaa ttggtacaat agtgttaggt ttttttcacc 2160 tacttctgta tgtgtaaata tttgggctaa aaacctcaca tgggatttta catgcctaca 2220 aatatatgcc acacccatgt tctatacaca cttaaaccca tatctaacgg cgatattctt 2280 gtccacaaaa cactgatgta aattttaaag atgtgtaatt tacagtaaaa atctttttgt 2340 attcatttga aaatactttt gtaaaatctt aattttagaa actatgttta cagcactaga 2400 ttaattctag atttgtatca agatcatgga caacactgtg cacgtttttg tttgacagtg 2460 tttttaggtg aagactcgtt tctgatagct gtcagcattg tatttacaga aaagatacaa 2520 gaataatttc aatatatttg tgaaaaatta acaagtttca tcatgaaaat gttttttttt 2580 ttttttaaca tgtatgaacg gaatgcttat tttagagtaa cttgtttcaa attatttact 2640 tgcatttttg ttttagaatt aaacaggtat agctcatgcg accacggcac aatcatggtg 2700 tactattaca tttctgaggg tacactcaga acatcaaagc tttgtttgag gaattttaaa 2760 aaaaaattgt tcacaccaaa acatagaaaa gaacagctgt aagatatgtc tgtttcgaac 2820 agtgagcaca cttcaacata cgctatacaa aacctttaat caagaactaa tcttttcaac 2880 atctaccata aagtgaaaga gctgataaaa gatgttttat ctttactacc aacttctaga 2940 gtgaacgacc acaaaacatt tttttttttt ttcccaattg aacgcagcac aaataaatgt 3000 gttacatttt taatgtctta ccacctctaa ctagtattgt ttgtgatgcg aaaatccatg 3060 tgtggttttt taaactgcaa acaaatttat gctatgcctt agtaaaacat tttcattcta 3120 aaatctccca tgttatgtat aaaacataca tatctaacta aaatatagtc atgcacaaaa 3180 tcacttatgt aaattagaaa aatgtgcaat ttactatgaa aacccccccc ccccccccaa 3240 aaaaaaacat ttgtaattat tttacaatgt gctgatttgt aaatgatgtt ttcttctctt 3300 aagatacaat cttgatatcc atcaagatca tggacaacat tgcacacatg gttttgttgg 3360 gcgatatgtt cgtaggtgag gacaatgaaa cgaaacgaaa cacaactaaa ttctaaatcc 3420 aactaaattt agaatgtttt cattatcgaa aaaacttttc aatgtgtaac attattcttt 3480 agagtaacat gtttcaataa tttaccagca gggactacat tttgttttat acgtaagcaa 3540 caggtgtaca cacgactacg actatagtac gacgctatga tacgattatt gtgtttctaa 3600 gggaatcact cagaaaacaa aacatttgtt ttaggaatcc ggatacgtca aacatgcatc 3660 tatcatgcta aacaagaaag aaatacataa atgtatgtga atattcacag aacaaactga 3720 ttatacttta aaatgtatta taaaaaatca ttaaccagca tttaattttt caacaactag 3780 cacaaattaa aagagccagg gatgactcaa ctacttttct gcaaacatct acactgtaag 3840 aaatgcaggg ttccacacaa ttcatacatg ttgtcccaac aaaaattgat taagttaact 3900 taacactctc aacaaaattt atgtgcgttg aacataaaaa aattaaggtt gtcccaatga 3960 aatctcaaga attgtttttg agtgtaaagt gaaagtccaa caaacagtta acagatttta 4020 gcatatgaaa ggggtagtgc taaaaaaaat aaaaaatgtt tatctcttac catcttgtta 4080 gcagtgttaa agcacaccat tacaatacaa gcagattcaa caactatgat gttagaaggg 4140 cttgcaaaag tgtgcaaaaa ccgtgtgaaa ctcaggaggc cggatctgag attctaaaac 4200 aggaaaaata aagtaatgtt taggatgcaa gtacaggtcc taaatggtct agtgtttgac 4260 tcccggtatg ggaagcccac cccttattgc cctcctgaac aataacttgg taggacaaga 4320 cactattaat cctaaaataa aaaaataaaa aaaaacaaac aaacattttt caccactaga 4380 ttagttatct gcatcagaga ataattcaca gttgttattg tttggcttca tatgttttta 4440 acagtgttga cattgtcaac aaaaccatgt taaaaccaaa agtgtacaat gaacaccttt 4500 aagaaattca actttagact ttttttttaa aacatgtatt tcaaatcatt cagcatcagg 4560 tgtttaacaa aattttcttt cttttttttc accctcaaat ccatattctc catctcaaat 4620 aaagcaaaag ctctggtgca cataaggttc tacaactcgt gtcacatcag aacgataatc 4680 ataatttttt atgtcaaaca catatttgaa tacacattga aataatgact aaaagatgaa 4740 aaagaaaata atcgcaaatt aacagcatac cattaataat gatataacag tctaaagaat 4800 ataaaaacta tacacaaaac aatgattata tgttctgatg taatccattt tttcttcact 4860 attttaatgt ataaatgtac attctacaat atttgtgttt taaaataaaa cttttaaaca 4920 cacattattc ttttgtgata aatggttcaa aacaaaatta aaagcaaatc taatctaaga 4980 acaaaattaa ttacagagat aggtaaaact aacaagtgct tactgttctt ttttatgttt 5040 tatttattta ttaattgtat tttttaaagc acaactacct ataaaactga tgataccatt 5100 acatacgtca atgtctaaac ggtactaaat atttcgaaga tccatttgct gcgttatgtt 5160 ttgacatgta ttacctaaaa tatttaattg aatacgtagt taaaacgagt acatttaaag 5220 gaatactgtc gtacgtctga gatattttct gaaaaagtaa agataaaaaa gtaaatgggg 5280 tatgccgtgt cagaacagtg tgaccaaaat tacagaaata ttaaaacact gctgtgcaaa 5340 agaagagcaa aggatcaaat ctgcaactat gaaatgcttt atacatacat ccacaatggt 5400 gtatgcttac tgtgataaac agtaaccaat attgtttggt atatagatta tagcatatgg 5460 catagaccat ctaagagaat tctaatagct ttttctaatg tgttacctct tttaggcata 5520 gtcaaatgtc ctttaatcat ggtgaaatgt caaaaaaata ataatcacac ttctttaaag 5580 cacattgctg tttgacaact tggtaaaacc aatatttatg caaaataaaa agtagaacaa 5640 tgtagatatg ttacaacatg ttattgttct gttaactaca tgccttaaaa tgatcgacta 5700 catgattttt attttattca agtagacaaa agcactacag cgagcggtta tacccttttc 5760 ctggatcaga catgtaggtc agaacgatgt ctttattatg ttttaatgtg ttgggtctgg 5820 atgaaaatat atactttttc aattaattga ctgctttttt actatagaaa aataaatgta 5880 tgacattttt tgactaagac ataaatgtgt agctgtaagt gtagtatata gatatggttt 5940 aagtgtaaaa gtttatgtta tgcatgagaa acagtgagat ttacttttca tgttgactac 6000 ttttcatcct atgtcttgta accatagagt agacgggcca aaaagataaa ctaaaaagaa 6060 aaatgatact tttagtttta aagaatctta tctaggtggg ttttaatttg tagaaatttc 6120 tttgaacaca cacttcttgt ttgactgcaa taccgttcac caaacatggt gttttgcaat 6180 ccaacctacc attacagttc agaagagtgg attttcaaaa aagacatgtt aattatattc 6240 ttttctaata cacagcacac acctttatta aaaaatacat ctttgtgaaa taactttttt 6300 ttttttatta cttactcatg cataattgtt aacatttttg ctttaaaaaa gcattgaccc 6360 atgtagtttt tccaaacgtg tttgcatcag tagtataaca taatcacact gacagtggta 6420 aaaaagactt tgacacgagt tgtagaacct tatgtgcacc agagcttttg ctttatttga 6480 gatggagaat atggatttga gggtgaaaaa aaagaaagaa aattttgtta aacacctgat 6540 gctgaatgat ttgaaataca tgttttaaaa aaaaagtcta aagttgaatt tcttaaaggt 6600 gttcattgta cacttttggt tttaacatgg ttttgttgac aatgtcaaca ctgttaaaaa 6660 catatgaagc caaacaataa caactgtgaa ttattctctg atgcagataa ctaatctagt 6720 ggtgaaaaat gtttgtttgt ttttttttat ttttttattt taggattaat agtgtcttgt 6780 cctaccaagt tattgttcag gagggcaata aggggtgggc ttcccatacc gggagtcaaa 6840 cactagacca tttaggacct gtacttgcat cctaaacatt actttatttt tcctgtttta 6900 gaatctcaga tccggcctcc tgagtttcac acggtttttg cacacttttg caagcccttc 6960 taacatcata gttgttgaat ctgcttgtat tgtaatggtg tgctttaaca ctgctaacaa 7020 gatggtaaga gataaacatt ttttattttt tttagcacta cccctttcat atgctaaaat 7080 ctgttaactg tttgttggac tttcacttta cactcaaaaa caattcttga gatttcattg 7140 ggacaacctt aattttttta tgttcaacgc acataaattt tgttgagagt gttaagttaa 7200 cttaatcaat ttttgttggg acaacatgta tgaattgtgt ggaaccctgc atttcttaca 7260 gtgtagatgt ttgcagaaaa gtagttgagt catccctggc tcttttaatt tgtgctagtt 7320 gttgaaaaat taaatgctgg ttaatgattt tttataatac attttaaagt ataatcagtt 7380 tgtgctgtga atattcacat acatttatgt atttctttct tgtttagcat gatagatgca 7440 tgtttgacgt atccggattc ctaaaacaaa tgttttgttt tctgagtgat tcccttagaa 7500 acacaataat cgtatcatag cgtcgtacta tagtcgtagt cgtgtgtaca cctgttgctt 7560 acgtataaaa caaaatgtag tccctgctgg taaattattg aaacatgtta ctctaaagaa 7620 taatgttaca cattgaaaag ttttttcgat aatgaaaaca ttctaaattt agttggattt 7680 agaatttagt tgtgtttcgt ttcgtttcat tgtcctcacc tacgaacata tcgcccaaca 7740 aaaccatgtg tgcaatgttg tccatgatct tgatggatat caagattgta tcttaagaga 7800 agaaaacatc atttacaaat cagcacattg taaaataatt acaaatgttt ttttgggggg 7860 gggggggggg gggtttcata gtaaattgca catttttcta atttacataa gtgattttgt 7920 gcatgactat attttagtta gatatgtatg ttttatacat aacatgggag attttagaat 7980 gaaaatgttt tactaaggca tagcataaat ttgtttgcag tttaaaaaac cacacatgga 8040 ttttcgcatc acaaacaata ctagttagag gtggtaagac attaaaaatg taacacattt 8100 atttgtgctg cgttcaattg ggaaaaaaat aaaaaatgtt ttgtggtcgt tcactctaga 8160 agttggtagt aaagataaaa catcttttat cagctctttc actttatggt agatgttgaa 8220 aagattagtt cttgattaaa ggttttgtat agcgtatgtt gaagtgtgct cactgttcga 8280 aacagacata tcttacagct gttcttttct atgttttggt gtgaacaatt tttttttaaa 8340 attcctcaaa caaagctttg atgttctgag tgtaccctca gaaatgtaat agtacaccat 8400 gattgtgccg tggtcgcatg agctatacct gtttaattct aaaacaaaaa tgcaagtaaa 8460 taatttgaaa caagttactc taaaataagc attccgttca tacatgttaa aaaaaaaaaa 8520 aaacattttc atgatgaaac ttgttaattt ttcacaaata tattgaaatt attcttgtat 8580 cttttctgta aatacaatgc tgacagctat cagaaacgag tcttcaccta aaaacactgt 8640 caaacaaaaa cgtgcacagt gttgtccatg atcttgatac aaatctagaa ttaatctagt 8700 gctgtaaaca tagtttctaa aattaagatt ttacaaaagt attttcaaat gaatacaaaa 8760 agatttttac tgtaaattac acatctttaa aatttacatc agtgttttgt ggacaagaat 8820 atcgccgtta gatatgggtt taagtgtgta tagaacatgg gtgtggcata tatttgtagg 8880 catgtaaaat cccatgtgag gtttttagcc caaatattta cacatacaga agtaggtgaa 8940 aaaacctaac actattgtac caattaccta cagttcagtt gatcgtgacg cataaagcat 9000 ttttaacaca tttttgagga aaatgttcag taataaatta atagattttt tcctgaaatt 9060 aaagtttgtc ataaaaactg ggcatgatgt ttctactgaa gtgagaaaag cacttgaaga 9120 aaatggtgct gaatttcgtt ttgaacaaca atctaaagta aaacagtttt tttgaagcgg 9180 attgacaaaa gcgccttaat attgtgacgt gcgatatgtt gaggtatgct taaaaatcac 9240 agtttccttt tatatttata ttctgacata gcgaaatgtg gtttatgttt gttcctttta 9300 taatttacaa actatctgtg gggccagatc agaatgaaag catttgttaa acaataacac 9360 aaatgtacac acaacttgtt tttagtagtt cagtttgctc tacgttgtac tcaaagtatt 9420 ggtaataaag tgttgatgca catatacaca aaattgccat cgtaaaaatc attaatacta 9480 ccacactgct aatttcactt gatccccatg taaacagaca atatatgtaa gagcgatata 9540 cttgtttgaa tacttttgac tttttttttt gttgttgttt ttgttgttat tgttgtcgtt 9600 ttcttgttta tacatagcct ctattgtgtg ttgtactgat agtacaggga taaaaactgc 9660 tgaaggtttt gagaaaaaac acacaaaaca tgacactttt tcttctctcc cacagaattt 9720 ggactcatag tcaccatcca ggcacagtag gatggcagat aggatagatg aggcctgggt 9780 aacagcgttt aaccctaaat gttttaaaat caaactatct ttttttaaga aaaaaattaa 9840 gaatattgca gacacgctgg atagccggct ttgcatgtgt ataggcctgt ttacaaaaaa 9900 aaaaaaaaaa gaaacaaaaa actatttact ggcattcttt cattacaaat ttgcttaccc 9960 attacacccc caaccagaat cccaattgct tacactcaca gactggaaga atccagatag 10020 tcaggaaagg agcaccgtca taaaaaaaag aatgagatgc aacatcaaac catacattgc 10080 cacaattcta actagaagtt tatctagccc aaatccgact gagacccggt tcaactgtca 10140 gttttggaac aagcatatta aatctgactc ccaaagcaga tacccaaaac atttgttaaa 10200 cagcaaaaaa ataaataaaa atgtaaacac aacatacttt cagtacattg acatgctaaa 10260 tgtatatgta aactcaaaga attggttaaa aggtcttgat gtacatctac aaaaatgtgg 10320 ctcaaacaca ctgaggttgg acaaggcaac aatggaaata cttgtttgaa tactttttgt 10380 ttctgcttaa tgttttagtt tttacattag cctctagatc gtgttgtact gatagtactg 10440 agataaacac atgctgatgt tctgagaagg gcacacaaaa catgacacct tttcttctcc 10500 cccacagaaa atggccacag ataggacaga ggcttgagaa tcatggttaa acaaaacaaa 10560 acaaaaaaaa agtcaaacga ctttttagga aatgaaaaaa aaatgtaaaa acatcgcaaa 10620 cacaccggat agccagcttt gcatgtgtat ggaccaaatt acaaaaaact atttactgct 10680 attctttcat tctaaatttg cttacccatt acacccccaa cctgaaaccc cactgcttac 10740 actcacagat tgaaaggagc aggaatgcca gaagagctgc ataggttgta aaaatgacaa 10800 accatacaca ttgccgcaaa tctaactata ctgtaaatct ttgccaaaat cagcccgaca 10860 cttggttcaa ctgtcaattt gggacaaacc gtcataaatc ctaccactaa gcagggaccc 10920 taagccctgt gacctgggga aagcagcctg tcattggtca ccagtcttac attcatgaaa 10980 cacattggac aaaccatgcc ataaattatt accataccgt ccaaaacata ctcataaatt 11040 attgacataa ggcaataaat catcttgaca ctgtgtgtat tataagacta ct 11092 // ID BEL25-I_DR repbase; DNA; ZEB; 5826 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL25-I_DR; KW BEL25-LTR_DR; BEL25_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5826 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1397-1397 (2008). XX DR Genome; chr20; Positions 46219168 46213343. XX CC Positions [4857-5417] - Integrase core CC 'AATTT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 45..5792 FT /product="BEL25-I_DR_1p" FT /translation="MFPEQRIREGQRSDVRPKRRIQHPAWLEDYEVSLPQY FT NQQSPAAHTLPHQKPQEPYTERVAEMTPLIYQPQSDYALRADQRVINHSLH FT MRPRYESTPVSQPVTQEDVSEILRTVQELRRENQQLQFVMQDMQQKMSLNR FT APLLQQRAEPLPYDRTQSKASPAPLHVNDEWPLPPPPVVDDGFLPLAGDIP FT PPLPRHMSNFVDELTSRLRNLKTKDHHLSCPSTPEYCEPESFSMALPPQQR FT TREYPMSKYPQTAPISSKPFCMDHELHPYYPQQERMYRGPKPTIPDFTKGD FT PREFARLKVSLDNLLPEDSTERFKYQILLEHLKFEDALLIADSYINSSRPY FT SDTMASLAEQYGQPHQLALRRIADLMEDPTIRSQDASGFKRFALKVRALVG FT MLNQLGDSGRVELQCGSHVTRLLSKLPHDLRADFKRYVYPLNVRIPTLLHF FT ADWLEYELKIQASGFEFLGGDRRERPDQRRDRSKDFKSTKTVAVFHSTDPA FT LNTTAPGESSLSAAKTQDKPRIFCPYCTNTQHYLNQCQNFSQLTKEQKTNW FT VKTNKRCWRCGRAHQAAQCNLKTSCKVCKGKHLDALHELNERPVSENTCLV FT NNANEVLYLDRPLGCSQVLLKVTKVILRNGEHTLETYAILDDGSERTILLQ FT AAAQELKLQGKPENLALRTVRQDMKIIHGTTVSFTISPACQPHKVFKIHRA FT FTADHLGLAEHTCPATTLQHKYRHLRGLPIPSLNSVQPLLLIGSDYPYLIT FT PIEPVRLGPPGGPAAVKTRLGWTVQGPTKLVRHQLLTQQCLNISICSPSDE FT LFRHIEKLWQLDVLPYRSEKLATRSKQDQEAVDLLEAKTTRIDVNGTQRYA FT TPLLRVKDMPNLQATKQAVMPNLRSTEARLAKDPIKSEAYRAEIQRLEEAG FT YVVEVPDEKLSKKEEESWFIPHHMVSHNAKNRVVFNCSFSYQGKNLNNLLL FT PGPALGSSLLGVLLRFREHAVAFSSDIKGMFHQVRLLPEDKPLLRFLWRDL FT KREELPRVYEWQVLPFGTTCSPCCATYALQRHIVDHSQQGDNVRHSLERCF FT YVDNWLQSVATPDEAKELLDTGMNLLAEGGFELRQWACNFPNVIEHLSKEA FT RSESSERWLNQTESDPQELALGLRWMCQSDTLGYQTRLDNCSTPTMRNIYK FT VVASQYDPLGFLVPYITRAKILIQSLWAKQREWDDPLLPTEILKTWHDWQG FT ELQHLSQLTLPRCYVSSQLDSLNCKRQVHIFSDASERAYGSVAYLRSEDLK FT GNVEVAFLTARSRVAPKRQLSVPRLELCAALTGAQLASLLAKELTLPIESF FT VLWTDSTTVLTWLQSESCRYKVFVGTRIAEIQELTSAHAWRYVESNNNPAD FT YITHGKTLKELATDKTWRQGPEYLWNSPCYWPVIPDINAKENPAELKKPTF FT CSHVTVDVQPLPDPQQFSCFKALVEATALLLHGAAGKDGDLSAVDYQSAER FT NILRKVQIDCFPDDINALVAGKPVPSSSRLITLAPEYDSTVELIRVGGRLR FT RSVQLDADAVHPIVLDPSHKITQLLIQNIDKELHHPGAERVFAEIRRKYWI FT LRGREAVKRLQHLCPDCQRWRAKPTVPQMADLPQARLQIFKPPFFSTGMDC FT FGPFTVKLGRRSEKRWGILFKCLTTRAVHIDLLSSLDTDSFLMSLRRFISR FT RGKPSELLSDQGTNFKGGERELQQSFTSMCPTLQKELAKQQIKFRFNPPGS FT PHFGGVWEREIRSVKSALYATVQRQSMTEEVLNTVLTEIEGILNSKPLGYV FT STDIADPNPVTPNILLMGRLDPSLPQTVYHETELLSRRRWRHSQVLADQFW FT MHFTKHYLPSLQIRGKWQRERDQWQPDMIVMIVDPQLPRALWPVGKIINVF FT PSADGRIRTAEVRIRDRTYVRPVARLIKLPALPDMGTVSSPSE" XX SQ Sequence 5826 BP; 1742 A; 1436 C; 1279 G; 1369 T; 0 other; ttggtccttc gagccggagt cagtaattgc tgtggattta gaagatgttt ccagaacaga 60 ggatcagaga aggacaaagg agtgatgtgc gtcctaaacg tcgaatacaa catccagcct 120 ggttagagga ttatgaggtt tctcttcctc aatataacca acaatctcca gctgctcata 180 cattgcctca tcaaaaacca caggagcctt acactgaaag ggtagccgag atgacccctc 240 ttatttatca gccacaaagc gattatgctt taagggcaga ccagagagtc attaaccact 300 cattacatat gaggccaaga tatgagagca cacctgtatc tcagccagtc acacaggaag 360 atgtatcgga gattttgaga acagttcagg agcttagacg agaaaatcaa cagcttcagt 420 ttgtcatgca agatatgcaa cagaaaatga gcttaaatag agcccctctc cttcaacaaa 480 gagcagagcc cttgccatat gacagaaccc agtcaaaggc aagtccagcc ccattacatg 540 ttaatgatga atggccacta ccacctccac ctgtagtaga tgatgggttt ctccctttgg 600 ctggggatat accaccccca cttccgagac acatgtctaa ttttgtggat gagttaacat 660 ctcgactcag gaacttaaaa acaaaagatc atcatctctc atgcccatca accccagagt 720 actgtgaacc tgagagtttc tccatggcat tacctccaca acagaggaca cgtgaatacc 780 caatgtctaa gtaccctcaa acagcaccta tcagcagtaa gccattctgt atggaccatg 840 aactgcaccc ttattaccct caacaagaaa ggatgtacag aggcccaaag cccacaatcc 900 ctgactttac taaaggagac ccacgggaat ttgcacgact gaaagtctca ttggacaatc 960 ttttacccga ggattctacg gaaaggttta aatatcaaat actcctagaa cacttgaagt 1020 ttgaggatgc tcttctcatt gcagactcct acatcaattc cagtagaccg tattcagata 1080 ccatggcctc tctcgcagaa caatatgggc agccccacca gctagctttg aggcgaattg 1140 ctgacttaat ggaagatcct accatacgta gtcaagatgc tagtggattc aagaggtttg 1200 cattgaaagt aagagctcta gtaggcatgc tgaatcaatt gggagatagt ggcagagttg 1260 agcttcagtg tggctcacat gtaacacggc tgttgtccaa gttgccccat gacttacgag 1320 cagatttcaa acggtatgtg taccccctga atgttcgcat ccccactctg ttacactttg 1380 cagactggct tgaatatgag ctaaaaatac aggcgagtgg atttgagttt cttggtggag 1440 accgaaggga acgtcctgac cagaggagag acaggagtaa agactttaag tccacaaaga 1500 cagtagctgt atttcatagc acagaccctg ctttaaatac tacagctcca ggagagtcca 1560 gtttgagtgc tgctaaaact caggacaaac caagaatatt ttgcccctac tgcacaaata 1620 cccagcatta tctgaatcag tgtcaaaact tcagccagct caccaaagaa caaaaaacaa 1680 actgggtgaa aaccaataag agatgctggc gttgtggtcg agctcatcag gcagcacagt 1740 gcaatttaaa gacaagctgt aaggtctgta aaggcaaaca tctcgatgca cttcatgagc 1800 tgaatgaaag gccagtttca gagaacactt gtcttgtcaa caacgccaat gaagttctat 1860 acctggacag accattaggc tgcagtcaag tactgctaaa ggttacaaag gtaattcttc 1920 ggaatgggga acataccctt gagacatatg ccattttgga tgatggctca gaacgcacaa 1980 tccttctgca agcggccgcc caggaactga agttacaagg caaaccagaa aaccttgctc 2040 tgagaactgt cagacaggac atgaaaatca tacatggtac tacagtctcc tttaccatct 2100 cccctgcctg ccagcctcac aaagtcttta agatacatag agcatttact gcagaccatt 2160 tggggctcgc tgaacacaca tgcccagcta ccaccttaca gcataaatac agacacctca 2220 ggggactgcc aatcccatcc cttaacagtg ttcagcccct ccttctaatt ggttctgact 2280 atccctacct aattactcca atcgagcctg tacgcctggg gcctccgggc ggacccgcag 2340 cagtgaaaac acgactaggc tggacagtcc aggggcctac taagcttgtg cggcatcagc 2400 ttttgaccca gcagtgtctc aatatatcca tctgtagccc ttctgatgag ctgttcagac 2460 acattgagaa actctggcaa ctagatgtac tgccctacag aagtgagaaa cttgcgacca 2520 gatctaagca agatcaagaa gctgtcgacc tgttagaggc aaagacaact agaattgatg 2580 tcaatgggac acaacgatat gccacacctc tcttaagggt taaagacatg ccaaatttac 2640 aagctactaa acaagctgta atgccgaatc tgcgcagcac cgaagcccgc ctagccaaag 2700 atccaataaa gtcagaggca tacagagctg aaattcaaag actggaagaa gcaggttatg 2760 tggttgaagt acccgacgag aaactatcga agaaagagga agagtcatgg ttcattcccc 2820 accatatggt gagccataat gccaaaaaca gagttgtatt caactgctca ttctcatatc 2880 agggtaagaa cctaaacaac ctgctgctgc cgggaccagc tctggggtcc tcattacttg 2940 gagtgttgct gcgcttcagg gaacatgctg tagcctttag tagtgatata aaggggatgt 3000 ttcatcaagt aagattactg cccgaggata agcccctact caggttctta tggcgtgacc 3060 taaagagaga agaactcccc agagtctatg agtggcaagt gctcccattc gggactacct 3120 gcagtccgtg ttgtgcgaca tacgcactac aaaggcacat tgttgaccac agtcagcaag 3180 gagacaatgt ccgccactca ctggaaaggt gtttttacgt agataattgg ctacagagtg 3240 ttgcaacacc tgacgaagct aaagagctgt tagacacagg gatgaatctg ttggctgaag 3300 gaggatttga attgcgtcag tgggcatgca attttccgaa tgtcattgaa catctgtcaa 3360 aggaagccag atcagaaagc agtgaacgct ggcttaacca aactgaatcc gatcctcaag 3420 aactcgcttt aggattacga tggatgtgcc agtctgatac tctcggatac caaacccgcc 3480 ttgacaactg ttccaccccc acaatgagaa acatatacaa ggtagtagca agccagtatg 3540 accctcttgg attccttgtc ccctacatca cccgagccaa gatccttata cagagtcttt 3600 gggctaagca acgagaatgg gatgaccccc tactgccaac tgaaatcctc aaaacatggc 3660 atgattggca aggggaactt cagcatcttt cccaactgac actgccacgc tgctatgtga 3720 gttcccagtt ggattccctg aactgtaagc gacaagtgca tatcttcagt gatgcctctg 3780 agcgggcata cggatctgta gcatacttgc gctcagagga tctaaaggga aatgttgagg 3840 tagcgttcct cacagccagg tcaagagtag cccctaaacg ccagttatct gtgcctcgac 3900 tggaactttg tgcagctctc acaggtgccc aattggcttc actattagct aaagagctca 3960 ctctacccat cgaatccttt gtactctgga ctgattctac cactgtccta acatggcttc 4020 agtcagaatc ctgccgttac aaagtatttg taggcaccag aattgccgaa atacaggaac 4080 ttacaagtgc acatgcctgg cgttacgtgg aatccaacaa taatccagct gattatatca 4140 ctcacgggaa aactctgaag gagctagcta cagacaaaac ctggagacaa ggaccggaat 4200 atttgtggaa ctcaccttgc tactggcctg ttatccctga cataaatgca aaggaaaatc 4260 ctgccgaatt aaagaagccc actttctgta gtcacgtaac agtcgatgta cagccattgc 4320 ctgatccgca acagtttagc tgctttaaag ccttagttga agcgacagca ctacttctgc 4380 acggggcggc aggtaaagat ggagatcttt cagccgtgga ttatcagtca gccgaaagaa 4440 atatcttacg aaaggttcaa attgactgct tccctgatga catcaatgcc ctagtcgccg 4500 gtaaaccagt accatcctct agccgcctca taacattagc acctgaatat gacagcacag 4560 ttgagctcat ccgagtgggt gggagactac gccgcagtgt tcaattagat gcagatgctg 4620 ttcacccaat tgttcttgat ccatctcaca aaattaccca acttctcatt cagaacatag 4680 acaaagagct tcatcaccct ggggcggaac gggtctttgc agagatccgc cgtaaatatt 4740 ggattttacg cggtagagaa gcagttaaac ggctacaaca cttatgtcct gattgtcaaa 4800 ggtggcgagc taaaccaaca gtgcctcaga tggcagatct cccacaagcc agacttcaga 4860 ttttcaaacc accattcttc tctacaggaa tggactgttt tggccccttt actgttaaac 4920 taggcagaag aagtgaaaaa cgttggggca tcttgttcaa atgtctgaca acccgtgctg 4980 ttcacatcga cctactaagt agtcttgata ctgattcatt tctgatgtca ctaaggcgct 5040 tcatatcccg acgaggaaaa ccttcagagc tattatcaga tcaaggcaca aactttaaag 5100 gaggagaaag ggagctccaa cagtcattca catcaatgtg tcccacactg cagaaagagc 5160 ttgctaaaca acaaattaaa tttcgattca accctcccgg tagtccccat tttggaggag 5220 tatgggaacg cgagattaga tcggtcaagt ctgccctgta tgccactgta caaagacaaa 5280 gcatgactga agaagtcttg aacactgttc tcactgaaat agaaggaata ctgaattcaa 5340 aacctctggg ttatgtgtcc acggacattg ctgaccccaa cccggtgact ccaaacattc 5400 ttctgatggg gcggctagac ccttcactcc cgcaaacagt ttaccatgaa acagagctct 5460 taagccgccg aaggtggaga cactcgcaag tgctcgcgga tcagttctgg atgcacttca 5520 caaagcacta cctacccagc ctacaaatcc gtggaaaatg gcagagagaa agagaccaat 5580 ggcagccaga tatgatcgtc atgattgtgg acccacagtt acctagagcc ctctggcctg 5640 taggcaagat aattaatgtt ttccccagtg cggatggacg aatcagaact gctgaggtca 5700 gaattaggga cagaacatat gtcagacctg tagcacgcct cattaagcta ccagcactcc 5760 cagacatggg cacagtttcg agcccttctg aataagcaaa tttgactgtg caaatttggg 5820 ggcggc 5826 // ID TDR23 repbase; DNA; ZEB; 1147 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish putative non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR23. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1147 RA Jurka J. and Drazkiewicz A.; RT "TDR23: a putative non-autonomous DNA transposon from Danio RT rerio."; RL Repbase Reports 2(3), 16-16 (2002). XX DR [1] (Consensus) XX CC Approx. 25bp imperfect TIRs. CC Several hundred copies in the genome. XX SQ Sequence 1147 BP; 355 A; 213 C; 211 G; 352 T; 16 other; ttaaaggtgc agtangtaag tttgacaccc agtggttgaa ctaggtattg caytcctggw 60 tcaaaacama cgcaagcgca ggttgccaga ttgatgacac caacagcagt gagcctgact 120 attgagccta aaggctgatt taagtcgtgt tctaaataaa agcaacggca cgcgatagaa 180 ggaatatttt catattaaaa ggaggttttt gttctaacca acacctgraa ttnatatatt 240 agaaatggct tctatttctt ncagctgaac aacagraatc taacactgat cacctcaggt 300 acacctcatg tgctttattc agtgttaaat gctaataatg tgagtttgaa tgccatttta 360 catgacattt attgccatac tactgaaagc agcagcagat agttyacctc agatcttaaa 420 aaataaaata aaccatttga aattgaactt tagaactgtg actcagtgca aaccaacaca 480 tatcagtgat tcagcatgta catttaataa tgttaaagag gtttttaata tgtattaatt 540 agattataaa ccttaccatt tcgttggagt gcagtgagtg catattctat scttctgtgc 600 ttctgaatgg ctgtatttaa atttctgtcg tgtttcgtct ggtgcaaaca gccaaattgc 660 ttatcactgc aaatctcgtc acgtagcrtg ttgttaggac acggggttac aatgtaacct 720 gctcacctaa tgtttacgtt cgtaatattt atattatttg ctaattaata accaactcat 780 gtggaactct gaatctgcgt ctcatttcgg agtctgctac tgtccaccgg aggtcgcatt 840 tcagtcacgg acgcatnctt tgagagcctt yctgactgaa tgaatgaaat atgtggtttt 900 ccaccaaggc aacccagggt gctgctgaaa tataattggc taaactggca ttgggcgggt 960 taaaggaacc aaaacaaaga cagcattctg gcacgkaacr cacattttca aagcagaata 1020 tctgacttca gcattgtttt tcagataaac aagaatgttc acttgncatg tttcttaaat 1080 atctgcaaac atattatggt atttttatgc tttagaaaag tcaaaaactt acatacagca 1140 cctttaa 1147 // ID Gypsy83-LTR_DR repbase; DNA; ZEB; 665 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy83-I_DR; KW Gypsy83-LTR_DR; Gypsy83_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-665 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1478-1478 (2008). XX DR Genome; chr1; Positions 46851391 46852055. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 665 BP; 192 A; 97 C; 146 G; 230 T; 0 other; tgtaacaggg ttaaaataat gtttaattgt ctttcaccaa aatgggttat tttgtttatt 60 ttgtggctgt tgagagacct cttgtggcaa cagtgtgtaa ttgtttcttt cacctttcac 120 ctgcaaagcc ccgcctacag cagaaaatgg ttgctgaatg cagaaaggcg gtaggtaaaa 180 tgtgctccag ctaaatgtga caaattttgt attttaacac atattaataa cacatacaag 240 tccaatatga acattaagtg atgttttaga tcaaatctgt caagtgattg ccgtgtgtga 300 caaactttgt gtggtttaat ttgctcatgc tatgtgtagc tggaatgatg acattaatat 360 gtgcaggggt gatttcttac tatttgtttg gtttttttag atgcatgttt aaatgcatac 420 tgttgccaga ggatgtatat ttctgttaca ggtatgttga acttttgtaa ataaataatg 480 ttttgttaaa atgactttat gtggcaacac agcagaaaat ggttgctgaa tgcagaaagg 540 cgatgcatgt ttaaatgcat actgttgcca gaggatgtat atttctgtta cagaataaac 600 aacgcagaac gcagttctct gtgtccgcct gagtgatttg cgtcttcagg aagagaagcg 660 ttaca 665 // ID CR1-23_DR repbase; DNA; ZEB; 2250 BP. XX AC . XX DT 18-NOV-2008 (Rel. 13.11, Created) DT 18-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-23_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2250 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1697-1697 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1..2106 FT /product="CR1-23_DR_1p" FT /translation="LDLCYGSIKGAMKAVAGPPVGFSDHNVVHLLPVYKSV FT LRREKVRKREIKVWSEDNSLALQGCFDCTDWSVFKETSESIDELTEVVCDY FT ISFCTDMIIPHKTVIDFPNNKPWLSNDTKSLLQSKHKYFKQGDMSSVKQLK FT KAVRVQINKDKIRYKDKIEKQLSSNNLRSAWQGVKAMAGCKEDRGGSEINL FT NGFSSAREFAEELNTFYLRFDEDDFKSENDNLKQHLSCSNSVNISRIQVKG FT LLQRTNTKKSPGPDNIGGRVLHTCAEQLSEIFHYIFSQSLMLQRVPEIWKK FT SVIVPIAKTKFPRELNDFRPVALTSLVMKCFERFIKGEILLQSKDKFDPLQ FT FAFRQSRGVEDANLTMLNYLFNHLEKPKTHARLLFIDFSSAFNTMKPHLLV FT EKLISLFNLDLNICGWILDFLVGRQQCVRVNGALSDFQYCSTGSPQGCCLS FT PLLFIMYTNDCRSIHRDCHIIKYADDSVIVSLLHDQDLGHGPAVQEFTSWC FT DRFSLQLNASKTKDMVIDYRKASPTPNITNIKGLDIEIVDTHKYLGVVIDN FT MLTFQPNTQAVYKKVQQRLFFLRKLRSFQVCTSMMTLFYKQFIESVLSFCI FT VAWYGNLSVSSKNRLSSLVKVAGKIIGFNQTGLMVIYHNYVLXRAQGILCT FT PDHPLYSVFQLLPSGRRFRVPICRTNRTRNSFIMVAIRSLNESRGEGDVSI FT X*" XX SQ Sequence 2250 BP; 656 A; 365 C; 494 G; 726 T; 9 other; ttggacctct gttatggatc gattaaaggg gcaatgaaag ctgtagcggg tccaccagtc 60 ggcttctctg atcataacgt ggtacacctc ctcccagtat ataaatctgt gctgaggagg 120 gagaaggtgc ggaagaggga aattaaagta tggtcagagg ataattcact ggcccttcag 180 ggttgtttcg attgtacaga ctggtccgtt tttaaagaga catctgaaag tattgatgaa 240 cttacagaag tagtgtgtga ttacatatct ttttgcacag atatgattat ccctcacaaa 300 actgttattg actttccaaa taacaaacct tggctgtcaa atgacacaaa atctctcctg 360 caaagtaaac ataaatattt taaacagggt gatatgagct cagttaagca attaaaaaag 420 gcagtgagag tgcaaataaa caaggataaa attaggtata aggacaaaat tgagaaacag 480 ctgagcagta ataatttaag atcagcctgg caaggggtga aggcaatggc tgggtgtaaa 540 gaagatagag ggggcagtga aatcaattta aatggtttta gttctgcgag agagtttgct 600 gaggaactaa atacatttta cttaaggttt gatgaggatg attttaagtc agaaaatgac 660 aatttaaaac aacatctgtc atgtagtaac tccgtaaaca ttagtagaat tcaggtcaaa 720 ggcctccttc agagaactaa tactaaaaaa agcccgggtc ctgacaatat tggtggtagg 780 gtgctacata cgtgtgcgga acagctttct gaaatatttc attacatttt ttcacagtct 840 cttatgctac aaagggtccc tgaaatctgg aagaaatctg ttattgttcc tatagctaaa 900 accaagtttc cccgggagtt aaatgatttt agacctgttg cycttacttc attagtyatg 960 aagtgttttg agagatttat taaaggtgag attttattgc agtcaaagga taaatttgat 1020 ccattgcagt ttgcttttag acagagcagg ggagttgaag atgccaatct tacaatgtta 1080 aattacctgt ttaatcacct ggaaaagcct aagactcatg ccagattgtt gtttattgat 1140 ttctcttctg cttttaatac tatgaagcca catttgttgg tggagaaatt aatttctctt 1200 tttaacttgg atcttaatat ttgtggatgg attcttgatt ttttagttgg acgtcagcag 1260 tgtgtcagag ttaatggtgc tctttctgac ttccagtatt gttccactgg gtcccctcaa 1320 ggttgttgcc tctcacctct tttgttcatc atgtatacta atgattgtcg tagtattcac 1380 agggattgtc atatcattaa atatgcagat gattctgtga ttgtaagtct cctccacgat 1440 caggatttgg ggcatggtcc tgctgtgcag gagtttactt cctggtgtga tagattttct 1500 ctccagttga atgcaagtaa aacaaaagac atggtgatag attacaggaa ggcttctcca 1560 acccccaaca taactaatat taaaggcttg gacattgaaa ttgttgacac acataaatat 1620 ctaggggttg ttattgataa tatgttgaca tttcaaccta acacccaggc tgtmtataaa 1680 aaagtccaac aaagactgtt ttttctgagg aaacttagat catttcaggt ttgtacctct 1740 atgatgacgc tgttttataa acaatttatc gaatctgttt tatctttttg cattgtagct 1800 tggtacggca atctgagtgt gtcaagcaag aacaggttga gcagcctggt gaaggtggca 1860 ggcaagatca ttggttttaa ycagactggt ctaatggtaa tctaccataa ttatgttcta 1920 argagagctc agggtatcct atgcaccccg gatcacccat tgtactctgt atttcagctg 1980 ttgccatcag gacgtcgttt tagggttcct atttgcagga caaatagaac cagaaattct 2040 tttataatgg tggccattag gtcattaaat gagtccaggg gggaggggga tgtwtctata 2100 tawtgatgta tggatgttgt gtagtagtat ttttttattt ttwtttttta atgtgtacat 2160 ggaacttcgc ctgtgttatg gyaccttgtt gcaattttag tttccctaac gggataataa 2220 agctgagtct gagtctgagt ctgagtctag 2250 // ID Gypsy160-I_DR repbase; DNA; ZEB; 4160 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy160-I_DR; KW Gypsy160-LTR_DR; Gypsy160_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4160 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1581-1581 (2008). XX DR Genome; chr4; Positions 35828846 35833005. XX CC Positions [3094-3429] - Integrase core CC 'CAGTG' target site duplication CC LTRs are 95% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 506..1828 FT /product="Gypsy160-I_DR_2p" FT /translation="MIRDKIVFSISDQRLKERLLRETNLTLEKTVDICRAA FT EAAKTQIQAMGEQSKTVHALKKTKDVKQNKIFERKSFQQQKTVTFTCKKCG FT KSHLPRQCPAYGATCHACGKSNHFASVCMSQQKDTKPKHNRNVMVDSLFIG FT SVKLNQTSGSVQNAWHTDVNIGNVTVKFKLDSGAEANIIPLNIFQSLHKTA FT LLQPTSTMLVAYGGTKLKPEGVANLQCVTPKVQACLPFYITRHSSIPILGK FT EACERMQLLKRVETVVIKHTTSKEELIAQHPTFFEGLGQFPGEHHIHVDPE FT AIPVIHGCRKIPLTVLGKLKDTLDQLLQADVIAPVTQPTPWVNSLVITEKK FT RMGHYECAWILVTLIKLSFVNISLFQLLKMFCVGFQERQFSPFLMKRTATG FT KSHLIKNLHYYARLTRHGVDIDSNDCRLVSNLLVKFSSSTTMKCLET" FT CDS 1858..3852 FT /product="Gypsy160-I_DR_1p" FT /translation="MIVAAATEQEQDIIVAEIMQRAEHHNVKFNPDKIQYN FT VHFMGHVITPQGVKADEGKIQAVVSMPAPEDRQALQRLLGMIRYLAPFIRG FT EATLTATLRQLLRKDIVFQWQPEHDEALSALKTALTNTPVLRYYDPNTAVP FT IQTDASKDGLGSCLLQEGQPIAYASRALTDTEKNYAQIEKELLAIVFSTKK FT FHQYVYGRSVNVQSDHKTLEAIFKKPLSKATARLQRMLLQLQKYDLIVQYT FT PGKDMLIADTLSRAFTEGQQTSTDELSDERVVYSLEATEALSEEMLKQLTD FT ATAKDNTLQLLVKTQKTGWPTHRKKLDPSMYQFWPVRHTVAMQDGILLVSG FT RILIPESMRTEMLQKLHVSHQGMQGTKAHARKCLYWPGMTKHIEQMVEMCP FT TCQQFQPRNQKEPLISHEIPELPWLKVAADIFEIRGQSFLLIVDYMSKFPE FT VMNINDKTARTVIEKMKTVYARHGIPKELVCDHVPFASYEMKKFAAEWGIK FT VTHSSPAYPQSNGLAERTIKTVKHVLKKAEQSGVDHHFALLSLRNTPITGT FT SYSPAQVLMGRVLRSTLPVSSEVLRPATPKGVHQELQTLQKQQACHYNVGA FT KSMSKLNAGNTVHIETNRGWQPGLIVSKRNEPRSYNVVNEAGQQFRRNRRH FT LRKTKSQVQRNIGSCG" XX SQ Sequence 4160 BP; 1391 A; 889 C; 913 G; 967 T; 0 other; tggtgtcaga agagagttgt tttacggact caagcctgca gttgacagca aattgatgaa 60 gaacatgaac tcatcctgac taaagtaagt ctgccatagt gaaaaacaaa catggatcag 120 ttcagattac cacctccatt agtcctgtct ggcaatacag gtgaaaactg gagaagatgg 180 atacagcggt tcaacattta tatgactgct acaggatcag actcaaaaac tgaaaaggta 240 aagattgcca ttctgcttca tgctttagga gaggaggcat tagaagtata taactcactg 300 tctgttgagc tagaagggga aaatgaaaca atgcaagata tagtaactgc aatggagaag 360 tattgcttac caagaaagaa cattgttttc gaaaggcacc agttctgggc ttaccccatg 420 cttgatgcga ttaacatcga taagtatgtc acagaactaa aacaaaaaga gtaaggactg 480 tgagtttggc tcaacagaat ctgacatgat aagagacaag attgtgttca gtatcagtga 540 tcagcgttta aaggagaggc tactaagaga aacgaacttg acccttgaaa aaactgtgga 600 catatgcaga gcggcagagg cagcaaaaac acagattcaa gcaatggggg agcagagtaa 660 aactgttcac gcccttaaga agacaaaaga tgtaaaacag aacaaaatat ttgaaagaaa 720 gtcatttcaa cagcaaaaaa cagtaacatt cacttgcaaa aagtgtggaa aatctcacct 780 tccacgacag tgtccggcct atggcgcaac atgtcatgct tgtggcaaat ccaaccactt 840 tgcctcagtg tgcatgtctc aacaaaaaga cacaaagcct aaacacaata gaaatgtcat 900 ggttgactca ttattcattg gatcagtgaa actaaatcag acaagtggat cagtacagaa 960 cgcatggcac acagatgtaa acattggaaa tgtgacagtg aagttcaaac tggactctgg 1020 ggctgaagcc aatataatac ctctgaacat ttttcaatct ctgcacaaaa cagcactgtt 1080 gcaaccaact tcaactatgc ttgtggccta tggtggaaca aaattaaagc cggaaggtgt 1140 ggccaacctt cagtgtgtca cacccaaagt gcaagcatgc ttaccctttt acatcaccag 1200 acactcatcc attccaatac ttggtaagga agcatgtgaa aggatgcaac tgctgaaaag 1260 agttgagact gttgtaatta agcatactac ctccaaagag gagctgattg cacaacatcc 1320 aacattcttt gaagggcttg gtcagtttcc gggtgaacac cacattcatg ttgatccgga 1380 agcaatccca gtcatacatg gctgtagaaa aattccctta acagtgctgg gcaaactaaa 1440 ggatacacta gatcagctgt tgcaagcaga tgttatagca ccagtcaccc agcccactcc 1500 ttgggtcaac agccttgtta taactgaaaa aaaaagaatg ggtcattacg agtgtgcctg 1560 gatcctcgtg actttaataa agctatcctt cgtcaacatt tctctattcc aactactgaa 1620 gatgttctgt gtaggctttc aggaaagaca attttctcca tttttgatga aaaggacggc 1680 tactggcaag tcacacttga taaagaatct tcactactat gcacgtttaa cacgccatgg 1740 ggtagatata gattcaaacg actgccgttt ggtgtcaaat ctgctagtga agttttccag 1800 cagtacaaca atgaagtgtt tggagacata gaaggtgtac acattgtagc agatgatatg 1860 attgttgctg ctgccacaga acaagaacag gatatcattg ttgctgaaat tatgcagaga 1920 gcagaacacc acaacgtgaa attcaacccc gacaagatcc agtacaatgt acatttcatg 1980 ggccacgtga taactccaca aggtgtaaaa gcagatgaag gcaagataca ggcagtagta 2040 agcatgccag cacctgagga cagacaagca ctgcagcgtc ttttgggcat gattcgatat 2100 ctcgcaccat ttatcagagg tgaggcaaca ctgacggcca ctcttagaca gctgttaaga 2160 aaggacattg tattccagtg gcaaccagaa catgacgagg ccctgtctgc actcaagaca 2220 gcactaacca acacaccagt tctcaggtac tatgacccca acacagcagt ccccatacag 2280 acagacgcat ccaaagatgg cttaggatca tgtctgctgc aggaggggca gccaatagca 2340 tatgcgtctc gggcattaac agatacagaa aagaactatg cccagattga aaaggagctg 2400 ctggcgatag ttttctcgac caaaaagttc catcagtatg tatatggacg cagtgtgaat 2460 gttcaatctg accataaaac attagaggct attttcaaga agccactgag caaagcaaca 2520 gcgagactcc agcgtatgct tttacagcta caaaaatatg acctcatagt ccaatatacg 2580 cctggtaagg acatgctaat tgccgataca ttgtcacgtg ctttcactga gggccaacaa 2640 acaagcacag atgagctcag tgatgaacgg gtcgtatact cgcttgaagc aacagaggct 2700 ctcagtgagg aaatgctgaa gcagcttaca gatgctacag caaaggacaa cacactccag 2760 ctgctagtga aaacacaaaa gacaggatgg ccgacacaca gaaagaaact tgatccctct 2820 atgtatcagt tttggcctgt cagacacact gttgccatgc aagatggcat attactggta 2880 tcaggtagaa tactgattcc cgagtcgatg agaacagaga tgttgcagaa actacatgtc 2940 tcacaccaag gaatgcaggg cacaaaggca catgccagaa aatgtctgta ttggcccgga 3000 atgaccaaac atatagagca gatggttgag atgtgtccaa cgtgccaaca gtttcagcca 3060 aggaatcaga aagagccact catctctcat gaaatcccag agcttccctg gttaaaagta 3120 gctgctgaca tcttcgaaat cagaggtcag tcatttcttt tgattgttga ctacatgtca 3180 aagtttcctg aggtgatgaa catcaatgac aagacagcac gcactgtaat tgaaaaaatg 3240 aaaacagtat atgcaagaca tggcattcca aaagagctgg tgtgcgatca cgtacccttc 3300 gccagttatg aaatgaagaa attcgctgca gagtggggca ttaaggtgac acactcaagt 3360 ccagcctatc ctcaatccaa cgggttggcg gagagaacaa ttaaaactgt caaacacgtc 3420 ctgaaaaaag cagaacaatc aggagtggat catcactttg ctcttctctc gttgagaaat 3480 actcctatca ctggcacatc ctattcacca gctcaagttc tgatgggaag agtcttaagg 3540 agcaccttac cagtgtctag tgaagtcctg cgaccagcta ctcccaaagg tgttcaccag 3600 gagctccaaa ccctgcaaaa gcagcaagcg tgccactaca atgtaggagc aaaatccatg 3660 tcaaaattga atgccggaaa tactgtacac attgaaacaa acagagggtg gcagccaggt 3720 ctcattgttt caaaaagaaa tgagccaagg tcatacaatg ttgtcaatga agcaggacaa 3780 cagtttcgcc gcaacagacg ccacctgagg aaaacaaaat cacaagttca gagaaacatt 3840 ggatcctgtg gatgagaccc acaatgatac acatgctcac atcccagggg cacaagaagt 3900 gacaaattcg tctgtagagt gtctgccaga tcgaccttta catgacagca gtaccacaac 3960 caccaggagt ggtcgggttg tgaaagttcc cgtgaggttt caggactact gtatgaactg 4020 aatacatatg tttgcctagt aatagagtaa actttgctat aaataagaaa aatgtccttg 4080 ttatatggac tgtaagttta gcattcagtg tttatatatc ttgtttaaag cttgcatagt 4140 ttactgtgaa aaaaggggga 4160 // ID Gypsy84-LTR_DR repbase; DNA; ZEB; 640 BP. XX AC chr11; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy84-I_DR; KW Gypsy84-LTR_DR; Gypsy84_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-640 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1480-1480 (2008). XX DR Genome; chr11; Positions 42647936 42647297. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 640 BP; 136 A; 94 C; 141 G; 269 T; 0 other; tgcgaactta tttaaagagt gctattacgg gatcattgct tgctgattgt tgcctgagtg 60 tcagcatggg aaattaagga tcttagaggg atgcacggtt gccgtttact ttagacttac 120 aataatattt ttgttattga atctttgttg ttgacttttg ttgtatgcga atttgggttg 180 tttatgatgt tgctaggttg acaacaattt gtatttttcc tttgttaaac ttttctttct 240 tttgtaggtt ctgttttgga tttgttttac catcagtggc cactgggttc cctgcactgt 300 gtggagaaaa ggtggttggg attgtgtggt gatttggtca ctgttgattt ttgtgttttt 360 taggttgcag tgtgattgct gtgagctgag tgtggtgaaa acactgctgc ttgggtacct 420 catctgtcct cactttctgg gtaaccggtg gccattagtg gtttgttttt ttcttttctt 480 ttatttagtt tatttacttt ttttcctcct tatttgcaat aaagatggag acaatttgta 540 tattacaact tgtggaataa aatataattg tcctcctaat tgtaagtcac ttcatttacc 600 ctctgctaaa tcattaacac tgacaaaagt catctggtca 640 // ID DNA-8-24_DR repbase; DNA; ZEB; 1783 BP. XX AC . XX DT 10-DEC-2008 (Rel. 13.12, Created) DT 10-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-24_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1783 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2147-2147 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1783 BP; 574 A; 274 C; 315 G; 620 T; 0 other; caagggcgta ggtttgcgct tgacattggt ggggactggg gacgagtcac ccccgccaag 60 aatgaatatt gtttatatat atatatatat atatatatat atatatatat atatatatat 120 atatatatat atatatatat atattatata ttttatatat tgctatttat tatttaactg 180 ctattaaaat gtattattaa gtaatcctct cttcttacaa tttattaagt taaattaaat 240 agtcatgcgt gtagccaagg ggggtcaggt ggtttgaaag accaaaagtt ggattattat 300 tattattatg tttgaattta tcttattatt caaatggata aattttcact gaggaaggtt 360 gaatgcgccg gccagtttgt tatttgccta taggcttcag tttttaattt aaatcaagtt 420 ttaacagatt tcccaaagaa aattatgata aaaaaattgt attttaaaaa taagtatatt 480 tacatatttg aagagtttag atgcaaaaac ctctaaatgc catttaatat tttcttctaa 540 aattagcatt tttctccaac tcctgtgtgt ggactcagtg attttacttt tatagtgaca 600 aatacgttct tttcattgcc tttaaagtga aataactgaa cataaatata caaggctgag 660 aaaaatgctc attttacatg gcatttagag gttttggcat ctgtactgta cttcatttct 720 ttattctaag aatttaaagt cctaatttct agttcaaagt atttttatta tttataaaac 780 tgtaataaaa cttacagttt aaatgtgtaa ataaagcaat ttgacaaaat ggtaggaaat 840 agttttggta catcaaaaag tgtagttgta ataactatcc aacaagatta tccagcaaaa 900 attagttctt aatatgtcac taaaatgtca tatttatacc tcaattaaac agaaaatcag 960 gtgaataact gcaaaaaggt tcagtttttc agaaaacaaa taaccccttt caataagctg 1020 gctacaggcc tgaaagtgtc atgttaaaat aatgtttatt aactgataaa gagagagttt 1080 aataaagact ttcgttcatt tttcgtacaa attaaacatg tctcatagta ttattaaaag 1140 atggagcaca ggctgtagtt tttctgacaa attgtttcaa ccaaaattgg agatgttact 1200 ctaagagatt aggagaactg gcagggtcat atacgtttac atgtaggtgt ttattctgca 1260 gttagttttc agggaatatt gtttacagac tatatcagtc tgctttaacg ttagactgaa 1320 cgtaaaagcg gtgaaaacac tagcataacg gcattatcat gatggactca ttcagtagcg 1380 aacttgagtt tatgtgactt acacttttcg ttggaaaaag ctctttatgt ccacctcttc 1440 ttgctacctc catcctcact tgtgcgtgtc tgtcatacac atctcacaca tcgggaaatt 1500 ccccccgcgc agttgttttg gcgctgtttg tatctgaaac tatgatctga aagcagccac 1560 tcacatgaca gcagggaaag gcacagccaa tcacaatgtt catatgtgtt tgggggcggt 1620 gcgactcgag gagagaacgt gatttaaccc tttcggcatg tctaggttta tactgacagc 1680 gcgatgtttt taagtgaatt aaattattgg tggggacatt tcggtagttt gtggatattg 1740 gtggggacac gtccccaccg tccaccctaa atctacgccc ctg 1783 // ID HARBINGERN4_DR repbase; DNA; ZEB; 860 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HARBINGERN4_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW 16-bp TIRs; 3-bp TSD; Harbinger superfamily; HARBINGERN4_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bao Z.; RT "Dr000061, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-860 RA Kapitonov V.V. and Jurka J.; RT "HARBINGERN4_DR, a nonautonomous DNA transposon from Danio RT rerio."; RL Repbase Reports 2(6), 5-5 (2002). XX DR [2] (Consensus) XX CC HARBINGERN4_DR copies are flanked by 3-bp target site CC duplications CC generated upon their integration in the genome. CC It has imperfect 17-bp terminal inverted repeats similar to other CC HARBINGERs. There are approximately 1000 copies of HARBINGERN4_DR CC harbored by the zebrafish genome; they are ~8% divergent from CC the consensus sequence. XX SQ Sequence 860 BP; 227 A; 213 C; 186 G; 234 T; 0 other; gggtgctttc acatctgtag ttcgcttcat ttggtccgga ccaagggcaa taaatgatac 60 attgttgcat tttctgccgt ctttgggtcg ttttcacacc acactgctgg ctttggtccg 120 aaccagttga aacgaaccaa aatgcagtca tctgacaaaa tccacatctc tcattggcca 180 gatgttgttg aacatatttc ctaaactgct tatcgattgg tcagaattca cgtgcgggaa 240 aatgccaatg aactcccgca agtaaacaaa ccggcagaca caaaatgtcg tttttactat 300 ggagggacga ctgcgctgac tgattgtatc cctgctttag acaaactata catttcgaga 360 atgaagcgcg gctgcggctg gaaaacaatg ttttaatgca tcgctcgtca cttcaggagg 420 aggcgtgaat taatttagct aaactgcgac atggtcatct tgcggaaata ttaccaacga 480 tccatcaggt aatgttttcc ccctctctct ctctctgttg gtaaagcgct gtcaacaaat 540 atttttcctc catatcccat aatgcacagc gcatcagcct acggcagctg gattagtcca 600 aaaggcgcag tactttttgc ggttggacct gttttagttc ggatcatatt ctcaccacaa 660 acgaaccgct ccagggtttg tttgaaagcg taccgagacc acctcttcaa gcaggtctcg 720 gtacgctttt ttggtccgct tttggtgcgc actcgagtac gattgctgca ttcacacctg 780 cccaaacgaa ccgcaccaag agggaaaacg aactctagtg cgattcaatc gaactaaata 840 aggcaggtgt gaaagcaccc 860 // ID Gypsy87-I_DR repbase; DNA; ZEB; 5873 BP. XX AC chr10; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy87-I_DR; KW Gypsy87-LTR_DR; Gypsy87_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5873 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1485-1485 (2008). XX DR Genome; chr10; Positions 24999936 25005808. XX CC 'ATTA' target site duplication CC LTRs are 88% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 181..1536 FT /product="Gypsy87-I_DR_1p" FT /translation="MADEELDAAAARGDRHLADAANTHLPFKIDFPHSFSG FT DGTEPFSSWIQRFEVALNVSAVPLDKAKLLPVKLIGPAFAYWQSLSSDVKA FT DYALAKASLTTVFGRTTFLATFQTFLNARPRKPQEPLEVYAAELTNLVTEA FT FPQYDAAARNCEIFRRFVTGLDPSLQLKIHEHGAVTLDNALKIATQCERAQ FT LAISVANPSAVMPVVLNQQPPTNTVTANHITELASVIAELRTDLHALRQSH FT QRQMDERVDQLSHRISSLQDKRTSSTHSSHQSCAAVDYDITTRHRQPDLDD FT HHRNCCCTSQHTHGCRQHPQDDCCPAQRRRDYFEHEQAPPTDSFSRQRRYS FT YSRSCESHPPYTTHYAEHTPHRPYVNARSGRSPSRERRSDYNPEYVTHYKR FT PTSHSPQYRHSSYRRHSSPSPPPQSHKSSRYSSSPTRHVHFTESDSEKAPF FT SPHRGNE" FT CDS join(1560..3473,3477..5609) FT /product="Gypsy87-I_DR_2p" FT /translation="MPASSSKVVHSDSVVYNTPSTPFIQIALAGTELHALV FT DTGSGLSLITDDCRKSIPALSTQSISRSFVLASSVTGQPLDILGSVTAPIH FT IGDVTFSHVFHVVRTATYPVIIGWDFLIEHNVTVNISLTRLELHNTTVPFS FT SPHSLIPVQNSAVTIAQVTIPPLSEVVVPISVKDKGVANLFIDTLGLTLGV FT ARTLTTVQKGRAIVRVVNPTNEQVSLDAGCPLGQIFSVTGNAHDEYALVSA FT VTSSEALRGHPMSDVHLEDGQLNADEQVELKSLLKEFSDIFSVHSHDYGKT FT NLITHSINTGNAAPIKLRPYRTSPATQAVLQQEVSKLLDHNIIEESQSPWS FT APVVLVRKKDGTHRFCVDYRRLNDVTIKDSHPLPRVDDTLGRLAGARVFST FT IDLTAGYWQIPLNPSDKEKTAFSTGAGLYQFCMMPMGISNAPPSFQRLMEL FT VLRGLHWSICLIYLDDIIIYSADFSQHLQHLREVFQRFRTAGLKLKPSKCQ FT FARSSVTFLGHRVSSAGIEPDPSNIDKVRTWPIPISATQVRAFLGLCSYYR FT RFISQFAKTAEPLYHLTHKGVPFTWSVEANEAFLVLKQALTSSPIIAFPNF FT SAPFLLYTDASLHSVGSVLSQKKEGKEHVIAYASHVLSAARKWSTFDRKLY FT AIVWSVRHFRHYLACHPFTIITDHKPLVGLKKLPLDHDPTGRRARWALELD FT LYDWHVIHREGAKHLNADAMSRRPDNNPVDSDSYIQGNMSTFSVATQTTLA FT PLRAGTASAHTQTLQMNKDVLLPTTISGNTTANLVLIQSDWNVIEKQKLDP FT DLAVVFTWMEKGHRPPLWRLREASPYLRKLWTQFSRLIIHNGVLCRHLRNS FT FRDDTFQVVIPISLIPEILSHVHGHPSAGHYGQAKTLDRAMRSFYWPYMSS FT DIAKHCSQCTACQSRRSPVPRPQAPLIPITPERPFQIVAADITELPVSTKG FT NRYALVMMDLFTKFVNLYPLKDQTAHSVAVFIFDHYIPQHGVPEALHSDQG FT RQFESDLIKHLCNLLSIKKLRTSPYHEQCDGAVERFNRTLKGELSKCLFDS FT GSEWDEYLPQVALAYNTTKHTSTGLTPFFLAHGREARVPIETLLQDNNASC FT SATAGTPAGYANKVRKRLACAYRSAIAFRDKAQEQQRLNYDRHLKYTPYNR FT GDLVLVNDPAHRHNKLHPRWVGPYEVLQSICPFGSSTPVNFEVHDVSKPHA FT KAKIIHYNRMKPYITDMQNTHTLTPSPSPAQPNTLNTLSGLLPPHITPAPP FT GLPTVPVPLELPQLQTQPQLHSPETLQSFGPMDNGQSLNSGLSPQTQRTPD FT TGLSTGVSESPDSPVATRNVLSRTRRLPYHLKDFALY" XX SQ Sequence 5873 BP; 1549 A; 1441 C; 1194 G; 1689 T; 0 other; ctggtggcag ctgtggttgg cttcttttgc tgggacacta attttctcat attaaacgag 60 gggagcgtcg ccactttgac cagcagagac gttgatacac tttctgtgta tttttttacg 120 tgctcttttt gtttgcaagt ttggattacg tgcgttattt tttttttctt tttcaacaaa 180 atggcggacg aagagcttga tgcagctgct gcacgagggg acaggcatct ggcggatgca 240 gcaaacacac acctgccatt taaaattgat tttcctcatt ctttttctgg tgatgggaca 300 gaaccattta gctcatggat acagaggttt gaagtagctc ttaatgtctc agctgtacca 360 ctggataagg ccaaacttct tcctgtcaag cttattggtc ctgcttttgc ttattggcag 420 tctctctcat ctgatgttaa agctgattat gctttagcta aagccagttt gaccactgtt 480 tttggccgta caacattttt ggctactttt cagacatttt tgaatgcacg tccacgtaaa 540 ccacaagaac ctcttgaagt ttatgctgct gagctcacaa atttagtgac tgaagcattt 600 ccacaatatg atgcagcagc acgaaattgt gaaatttttc ggcgttttgt gactggttta 660 gacccatcgc ttcaactcaa aattcatgag catggcgctg tgactttaga caatgcttta 720 aaaattgcca ctcagtgcga acgtgctcag ctggccatca gtgttgctaa cccttcagct 780 gttatgcccg tggtgttgaa ccagcaacca ccaactaaca cagtgaccgc aaaccatatc 840 actgaactgg catcagtgat agctgaactc cgaactgatc tgcatgcctt acgccagtct 900 catcaacgac aaatggatga acgtgtggac cagctttcac atcggatttc aagtctgcag 960 gacaagagga cgtcttccac acactcttca catcagtcct gtgctgctgt ggactatgac 1020 atcacaacac gacatcgcca gcctgacttg gatgatcatc atagaaactg ctgctgtact 1080 tcacaacaca cgcacggatg tcgacagcac cctcaagatg actgctgtcc tgcacaacgc 1140 cgccgtgact acttcgaaca tgaacaggca cccccgactg acagcttctc ccgccaaaga 1200 cgttatagct actcccgtag ttgtgagtca caccctcctt acaccacaca ctatgctgaa 1260 cacacacccc acaggcctta tgttaacgct aggtcaggtc gctcaccttc aagggagcgc 1320 cggtcggatt acaaccctga gtatgttacc cactataaac gtccaacttc acacagtcca 1380 cagtatagac actcgagtta tcgcagacat agctctccgt caccaccccc tcagtctcat 1440 aagtcatcaa gatattcttc ttcccccaca cgacacgtac acttcactga atctgacagt 1500 gagaaagctc ctttctcgcc acaccgggga aacgagtaga agctgtcatt ggggaccaaa 1560 tgccagcttc ctcctcaaaa gtggtccatt cagattcggt tgtttataac actccgtcca 1620 caccttttat tcaaattgca ctggctggca ctgaacttca tgcattagta gatacaggct 1680 cgggtctctc acttattaca gatgactgcc gtaaatctat tccagctttg tctactcagt 1740 ctatcagcag gtcgtttgtg ttagcgtcgt cagttactgg acagccattg gacatacttg 1800 gatctgtgac agctccaata catattggag atgtcacatt ttcccatgtt ttccatgtag 1860 tacgcactgc cacttaccct gtaataatag gatgggattt cttaattgag cacaatgtca 1920 ctgttaacat atcacttacg cgtttagaac tacataacac tacagtgccc ttttcgtcac 1980 ctcattcact cattccggtt caaaattcgg ctgtcacaat cgctcaagtg actatcccgc 2040 cattgtcaga agtggtagtc ccaatatcag taaaagataa gggtgtagcg aatcttttca 2100 ttgacacttt aggtttgact ttaggtgtag ctcgaacatt gaccactgtt caaaaaggga 2160 gggcaatagt tcgagttgtg aacccaacta acgaacaagt ctctcttgat gctggatgtc 2220 cattaggtca gattttctct gtcactggca atgcacatga tgagtatgca ttagtctcag 2280 ctgtgacttc tagtgaagct ctccgtggcc acccaatgtc agatgtgcat ttggaggatg 2340 gtcaactaaa cgctgatgag caagtagagc ttaagagcct actgaaagaa ttctctgaca 2400 tttttagtgt tcattctcac gactatggca agacaaactt gatcacacat agtattaata 2460 caggcaatgc agctccaata aaacttcgtc catatagaac atcaccagct actcaggctg 2520 tactgcaaca agaggtgtca aagctcttgg accacaacat cattgaggag tctcaaagtc 2580 catggtcagc cccggttgtt cttgttcgaa agaaggatgg tacacaccgg ttctgtgttg 2640 attatagacg cctcaatgat gttaccatta aagactcgca ccctcttcct cgcgtggatg 2700 acacccttgg caggcttgct ggagcccgag tgtttagcac aattgacttg actgccggct 2760 actggcaaat tccacttaac cctagcgata aagaaaagac agccttttcc actggtgctg 2820 ggttatatca attttgcatg atgccgatgg gtatctccaa tgcaccaccc agtttccagc 2880 ggttaatgga acttgtgctt cgaggtctgc actggagtat ttgtttgatc tacttagatg 2940 atattatcat ctacagtgct gatttttcac aacacctaca gcacttgcgt gaagttttcc 3000 agagatttcg tacagctggt ctaaaactta agccatccaa gtgccagttt gcacgctcct 3060 ctgttacttt tcttggccac cgtgtttcca gtgccgggat tgagccagac ccatcaaaca 3120 tagataaagt gagaacatgg ccaattccaa tatcagccac acaggtcaga gcttttcttg 3180 gtctgtgttc ctattatcgt cgattcatta gtcaattcgc taagacggct gaacccttgt 3240 accatctcac acacaaggga gtaccattca cttggtcagt ggaagcaaat gaagcttttc 3300 ttgtactaaa acaggcgctg acttcatcac ctattatagc gtttccaaat ttttcagctc 3360 ctttccttct ttacacagat gcttctcttc attctgtggg ctctgtcctt tcccaaaaga 3420 aagaaggaaa agagcatgta attgcatacg ccagtcatgt cttgtctgct gcgtaaagaa 3480 aatggtcaac atttgacaga aaattgtatg ccattgtttg gtcagttcgc cacttcaggc 3540 actatcttgc ttgtcaccca ttcactatca taacagatca caagccgctg gtcggtctta 3600 agaaacttcc tcttgatcat gatcctacgg gtaggcgagc tcgttgggcc ttagaattgg 3660 acttgtatga ttggcatgtc attcatcgag agggagctaa acaccttaat gcagatgcta 3720 tgtccaggcg gccagataat aatcctgtag actctgacag ttacattcag ggcaacatgt 3780 ccaccttttc ggtagcaact cagaccacat tggcgccctt gcgggcgggg acagcttcag 3840 ctcacaccca gactttgcag atgaataagg atgtgctttt gccgacaacc atttcaggta 3900 acacaacagc aaatctcgtt ctaattcaga gtgattggaa tgtgatagaa aagcaaaagt 3960 tggatccaga tttagctgtt gtttttactt ggatggaaaa agggcataga ccacccttgt 4020 ggagattacg tgaggcttcc ccatacctca gaaaactgtg gacacagttt agtcgtctta 4080 ttatccacaa tggtgtgtta tgcagacact tgcgtaactc ttttagagat gatacctttc 4140 aggtggtcat tcccatatca ttgattcctg aaatactcag tcatgtccat gggcatccat 4200 cagctggtca ttatggtcag gcaaaaacat tggacagagc tatgcggtcc ttttattggc 4260 cttatatgtc ttcagacata gccaaacatt gctcccaatg tacggcttgc caatcacggc 4320 gttctcctgt gcccaggccg caggcaccac tcattcctat tactcctgag cgtccattcc 4380 aaatagtggc ggcagacata acagagttgc ctgtttccac taagggtaat cgttatgccc 4440 tagtaatgat ggacctattt accaagtttg tcaatctcta cccactaaaa gatcagacag 4500 ctcactcagt ggcagttttc atttttgacc actatatacc acagcatggt gtccctgagg 4560 ccctacactc ggatcagggc aggcagtttg aatcagacct aataaagcac ctttgtaact 4620 tgctatcaat caaaaagctt cgtacatccc cttatcatga gcagtgtgat ggagctgtag 4680 agcgttttaa tcgcacgtta aaaggtgaac tttctaagtg tctatttgat tcaggttcag 4740 agtgggatga gtatctccca caggttgcat tggcttacaa caccaccaag cacactagca 4800 ctgggctaac acctttcttt ctagcccatg gcagagaggc ccgtgttcct atagaaacac 4860 tacttcagga caataatgct tcctgttcag caacagctgg tacacccgca ggatatgcta 4920 acaaagtgcg taaaagacta gcatgtgctt atcgttctgc gatagctttc cgagataagg 4980 ctcaagaaca acaacgcctg aactatgatc gtcatttgaa gtacactccg tataacaggg 5040 gtgacttggt tttagtgaat gaccctgcac accgccacaa caaactgcat ccacgttggg 5100 taggcccata tgaggtttta caatccatat gcccatttgg tagttctacc ccagttaatt 5160 ttgaagtcca tgatgtatct aaacctcatg caaaagcaaa aataattcat tacaatcgca 5220 tgaaaccgta cattactgac atgcagaaca ctcataccct taccccctct ccctcccctg 5280 ctcagccaaa cactcttaat acattgtctg ggctcttacc accccacatt acacctgcac 5340 caccaggact tcctacagtt ccggttcctt tggaacttcc tcagttacaa acacagcccc 5400 aattgcactc acctgaaacc ctacaatctt tcggacctat ggacaatggc cagagcttga 5460 actcaggact gtcacctcaa actcagagaa cacctgacac gggattgtca acaggtgttt 5520 ctgaatcacc ggactctcca gtggctacac ggaatgtact ttccaggaca cgtcgtttgc 5580 cttaccattt gaaagatttt gcactttact gagtttttgt cttttatctg tacttgtttg 5640 atgtgcagct ttatatgtct aatagttaag ggaaggtttt tcaaatactg atgccttgtg 5700 gcgctatgtc atttttgttg ctcatggtta gacatgcttt tgaaagtata agattttatt 5760 tttttatata tacagttttg ggtaacattg tttctggatg ttgagcacat acagttttgt 5820 gctaaattgt gttagagaaa tgaggacatt tcttgtaggg taagggagta ata 5873 // ID Gypsy136-LTR_DR repbase; DNA; ZEB; 825 BP. XX AC chr13; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy136-I_DR; KW Gypsy136-LTR_DR; Gypsy136_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-825 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1534-1534 (2008). XX DR Genome; chr13; Positions 43712480 43711656. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 825 BP; 248 A; 125 C; 169 G; 283 T; 0 other; tgtaacggag taaataagtg tgttaaataa tgcctaaatg gggacagcgc cctctagagt 60 gtgcgtgatg taataagtgc gtggagtagg gaccgcccac agcagagtta acgaagcatt 120 gctgtgggta agttgcacat ttctcccggg tggaaaatga ccattaataa gtgtttaaaa 180 cacataaata ttagctttaa tgtgacttaa tgtttcaatg aatatgttgg gacactgttg 240 agttgtgttt ggcgtcgagt gtgctgctca attttgagtt tataacgtgt ttatataacg 300 aggcctgatt gaaatcaatt tgtcttgtgt ttcatataga ggtggtcatt ttctttgatc 360 cagcaactat ttacacgcgc tgtctgtctc cattgtacag gtatgtgtta aatgttataa 420 attaattgtt tgtaattatt tttcttctta ctcaaaacga gttactaaaa gtctccacta 480 gatggggaag tttcattact tttggagttg aattgtattg ctctgttaca ctcagtgatt 540 gtacattaag aattgtgatt cattaatttg tcatgtaaaa ttcaaagtgt gttatgtttt 600 tcatattctg actgttaaaa tttacactat acgatcacaa agttaaaaat aattcattgg 660 catataaaaa atatatataa aactgtttta aataaaacag agcagagtta acgaagcatt 720 gctgtggagg tggtcatttt ctttgatcca gcaactattt acacgcgctg tctgtctcca 780 ttgtacagtg tacagtaaga agacagccca agcaagaccg tcaca 825 // ID Gypsy12-LTR_DR repbase; DNA; ZEB; 1476 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy12_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy12-I_DR; Gypsy12-LTR_DR; Gypsy12_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1476 RA Kapitonov V.V. and Jurka J.; RT "Gypsy12_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 318-318 (2004). XX DR [1] (Consensus) XX CC Gypsy12-LTR_DR is a long terminal repeat of the Gypsy12_DR LTR CC retrotransposon. Its internal portion is deposited in Repbase CC as Gypsy12-I_DR. XX SQ Sequence 1476 BP; 449 A; 209 C; 292 G; 526 T; 0 other; tgtagggacc cctaaaatat tctcagatta taaaatgttt tttgtttatt tctaactgca 60 tgtaagtagt taatccagca acatttgatt tttgtatttt tatgacacaa catattcaga 120 gagcactatt ttgctggtca gttactgttc ctttaagatg acatcataac gtaagctgcg 180 tgagagagcg agagaaaaaa aaagagagga gcctcgtgag aagcagcagc agatcatcct 240 atgtgttttc atagttggga ggcgatatat tggtaatatt aagtcttcaa agtgaaagta 300 aaagtgactg ggactcagct cgttttattt gcttttctct ctgtggagtt ttcgctttgc 360 gatcgtcgtg tatcacacga gtgtatgctc tcagctgagg acaccatgct cgtagcatag 420 ccgctaacgc taatgacatt tacctgagtg aacagtgact gtgctgattg agttggtggt 480 gatgctgctc tgaactggac agacctgttt cagcgctgga cgtggaatta tcgcgtggga 540 cagcggtaaa tcacagcttt gtgcaggagg gatttcatta tatgtttggc tttcatcgga 600 tggacgtaag tgacatcatc attgcatttt gagagatcgt tctgaaacaa tacaactgtg 660 aatgtgcacc aacgaactgg gccccaacgt taaagtatgc atactaatgt tttttttgac 720 tgagtgactt tgtgactgtt atgactgtga ctgatttata ataagtttgt gaactagact 780 gagaccattt aaattgtaaa gatttggagg ttcactttat tcattttttg tttacaattg 840 tgaaacgatc tgagttaata gtgtttaggt gtgttatttc agaagcgttt aactttagct 900 ttaatgtgat ttttatctca tctgttgatt tgaggtaaaa caaaatgtag ggaaacataa 960 gaaatataga acaaaggaag tcagtaaaga acaacactta gacttgtaag gctaatttac 1020 atattttatt ttgaagaatt aaggtgctaa attttgactg aaagagtaaa actttattta 1080 gtgggatttc cttttgaatt gtttattctt ctttattcta ttctattgtt gttgttttat 1140 tattttcttg tggtgtagat aaagggctga ttcttttatt ttatttattt tctttttact 1200 ttaaacaaaa ttgtaataaa ttgttattat ttttcttaaa atctaaattt gacatctcat 1260 tgtgtttttg ttatatattg aattcatata taaaacacaa aacaaacaaa ctgcaaacat 1320 atacacactg taatagttta aattcagcta aaactagttt ttttagagta caaaggagta 1380 aaaataccgg ctaattcacc attcaactgc cgggaaccca ggtatttctg aataggatag 1440 ttaaaaataa ttcttaaggt taagaaaagg gttaca 1476 // ID DNA-8-22_DR repbase; DNA; ZEB; 1434 BP. XX AC . XX DT 08-DEC-2008 (Rel. 13.12, Created) DT 08-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-22_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1434 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2145-2145 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1434 BP; 477 A; 282 C; 281 G; 393 T; 1 other; gggttgggta ccgaaacctg gtgccattat agcaccggta cctttgtaac cggtatgtac 60 cggaccraat cagcatatga atttcggtgc ctaatttcgg tgccactggt gctgcgacaa 120 ggacgtaaga agcatcaagg ggtgcatcaa aggcacacat aggtacgcgt tgtcacacca 180 tctctaaaca tttaattcta aacaactttg aggaatacgg acaggcgatg ttggcaggcg 240 tttgttcttg ttgcttaggg aacggacgca cgcaacgcac gcagtacagc gcattcggtg 300 agcgagagcg actctcttca gatcaacacg catgatcgcg ttcatcaaat ttgcttaaac 360 taagcgttct aaagcgtatt ttacaagcaa ggattctgac acagcaacgt gaagcagatg 420 ctttacaaaa gttgcgtgta agagggggaa acacgtcaaa cttaatgaca catttgaggg 480 ctcatggaat caacttaaag gcagaggaat gcactgtctc tgaaagcttg cgacatcatc 540 tggcactttc aatgagtacg tacatggaca ccaatgctcc gattttaata tgattaagac 600 aatactctga ttaagagtct agactaccat gtaaacagtg atttttgatt accttaaagg 660 accacagaca cctgcgtcgc gaaatgcgga ggttttttct ttccattcaa cgtgcggtat 720 caaattccat taaaagagca ctcttccacc agtccttact taatgctcgc atcaaatacc 780 tcagtttgtc atgggggcat gaatgaaatg ttcctgaatg aaagtgaaac tgcagttaaa 840 gtcgaaaaat taaaaatgaa acacctgaaa ttacatgaaa ctctggagga aacatggata 900 gcgtggtgac gcaatgacgt taatcgatct atgtactata acatgtaaga cgggatcatg 960 aaaggaacat tcaaaaagca actcatgtaa acaccttaat catattattg tcttattcag 1020 attaaggcaa atcattagat tactgataac catgtaaaca tagtcacaag ccccaaccct 1080 agaaaaaagg agttcagtat tttgatgaca gcctttccac aggttagtag taagctaatg 1140 taatttcata atgcaaatct taaattcatg ctaaccaaca aatgatcaaa ggaaatatat 1200 taatgtaatt caaatatata aaatatgaat tgatgtttac tttaaacttt aattatattg 1260 ttctattaaa attattttta aaaagatttt gtcaaataaa agatttttct agagtcatcc 1320 accataattt tgtggctcaa aagtatcggt tcaggcaccg ttttggcacc ggtaccgttt 1380 taaaagtatc gatttagcac cggtatcgaa ataaccccaa acgataccca accc 1434 // ID LTR-9_DR repbase; DNA; ZEB; 801 BP. XX AC . XX DT 25-NOV-2008 (Rel. 13.11, Created) DT 25-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Putative long terminal repeat (consensus). XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW solo-LTR; LTR-9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-801 RA Bao W. and Jurka J.; RT "Long terminal repeats from zebrafish."; RL Repbase Reports 8(11), 1772-1772 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 801 BP; 239 A; 130 C; 172 G; 260 T; 0 other; tgttggggaa ttcattgtgt cttgatgaat ttctgatgaa tgtattactc tcatttatat 60 gcttatttga gtgatttcag tattattatt attattatta ttatctattg tattattacc 120 catacctgtg gcattattat caggtatgtg tgctttggta aataaaaaag tcattctgtg 180 taaatgcagt cagagatgag aagataaatt agaagagcag cattaatgtc acaatgaggg 240 ttgatttatg gaagcaaggc cataaggaca cctgcgttct gttctgtaac tcatgataaa 300 gcacacctgc acgtgcgccc acactcattt gtactgacat ctaaaaaatt atgagtacct 360 atgaacttca gctgaactta cagttgacag ctcagtggac agtgaacaag atatattgtt 420 gagcattgtt ttttggctga gtcatcactt ttgaaacagg aaggaaagac cttaactgga 480 catgggtaag acccctgaaa gaccctgaga ttctattggt ggcgggcacc tgagagattt 540 aagatcccat tagcggagaa gctgatggag gtgtgtaatg tgtaaatttg ggtggagtat 600 taagacacag aaatgtattt aaactgtgtg caagcctatg ttcggcagac actcgaacga 660 cctgtctctg ttgttactga tgctgtaaca ataaactgct ttgcctaaag acttcggaga 720 tctcagactt tgtcattttt tgaaaagttt gacttagaga cttagaatat ttttgcagac 780 cagaatcttt tttccacaac a 801 // ID Gypsy53-I_DR repbase; DNA; ZEB; 4456 BP. XX AC chr17; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy53-I_DR; KW Gypsy53-LTR_DR; Gypsy53_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4456 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1421-1421 (2008). XX DR Genome; chr17; Positions 36952658 36957113. XX CC Positions [3239-3754] - Integrase core CC 'GTTTT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 134..4180 FT /product="Gypsy53-I_DR_1p" FT /translation="MAGEFRRPDPLVFNENIAENWRMFEQEFDIFIAAAHG FT DKSPRTQAFILLNLAGPEAIERERSFVYAQEVRVPGENDSIITIPAESRED FT PECLKRKFRELCAPHVNITMERHKFNTRSQRPGETFESYISDLKIKAKSCK FT FGALQDELIRDRLLCGIQSDILRKSLLRDPELTLNKAISACQIYEQTEQHT FT KDLATLQRTSATVHAVHQAFTGKNKRYKPKHTESNKGPTAIRNCNNCGGDH FT PAKRDKCPAFGQQCNSCKKWNHFKKCCRSTQSNTTKRRFTKTVHNVEPHQS FT SSEESESFFVDGVTDANVYSTANAAHKSELYSTVHVHGKSVELKVDTGAKC FT NVMDLHTFQHLRKQEKLSMSSTAKLIAYGGNELHTLGTVILPCSLASQSYD FT LLFYVIKNSAQPLLGLSDCIRMKLLTLNKEVHQLNTYSQDEFTDTILTKYD FT DLFKDEVGKLPVTYSIKLDPNVPPAVRPARRIPVAMQDKVKKELQRMTDLN FT VITPVTEPTDWVSSMVAAHKKDSNSIRICIDPRDLNMAIQRPHHPMRTVEE FT VATQMSNSVVFSVLDAKSSFWQISLDHKSSLLTTFSTPFGRYRFLRMPFGI FT NSASEVFQRSMEQIFAGYPCAVIVDDILVGGRTIQEHDDNLEKVLARARQV FT NLRLNPQKCKFRLKEVSYVGHIFSSDGLHADPSKIKAITEMPVPANATALM FT RFLGMVNYMGKFIPNLSDISAPLRQLTHKDTTWCWLQQHQDCFNTLKNCLT FT CPPVLSYYNVDNPVTLTCDSSQFGLGAACLQDNKPIAYASRVLTDTEQRYA FT QIEKELLAVVFACTKFNDYIYGKPVTIETDHQPLVTITKRPIHSTPARLQR FT MLLQLQRYNISLVYKKGKQMYIADTLSRAPQTSAHEHPEDKHTFEVMNICN FT ISSSRLDELKLHTTGDATLQLLANTIKHGWPNKLHSLPPVLWPFYPYRHEL FT ITDDSLVWKGHKVIIPETLRLQYTQIVHRGHPGAEATKQRARGIFFWPTMT FT KDIDAYTTSCAICNSTRPHQQKEPMKLHTVPSLPWSTVAADIFEWNNHPYL FT VLVDSYSGWYEIDQLRDMTSTCVIQKLKRHFSVHGSPQTLITDNGRQFVSS FT EFNDFALAWDFTHTTSSPNYPQSNGLAERAVRSAKQLMEKSKRDGTDVYLN FT LLNLRNIPRDSKLGSPAQRLMSRQTRSTLPLHSSLLKPSPLSSNDIHAQLK FT KRRMSQKRSYDKTSKRLNTLSKGQTVRMQTPHGYDRKGVVKEICCEPRSYI FT VLSGGKEYRRNRRHLLSVKEPEPHQHDDDEYKLPGDPTKSEVTLPKIPVAH FT TQTTTGQLTSQTHTQTRTCPYVTGSGRTIRPNSKYKD" XX SQ Sequence 4456 BP; 1480 A; 1080 C; 857 G; 1039 T; 0 other; tggtgtcaga agtacaactg cacgcgaaag tgagttttgt ctacgctgga aaagtttcca 60 cgcatttctc tcgactcgtg agtaactaac tagcggctaa gctacgtaaa cgaaacctaa 120 gcacaaagca acaatggcag gggaattccg aagaccagac ccactcgttt tcaatgaaaa 180 tattgctgaa aactggcgta tgttcgaaca agagtttgat atcttcattg cagcagctca 240 cggggacaaa tcacctcgta cacaagcttt cattttgctc aacctcgcgg gtcccgaagc 300 cattgaacgc gaacggtcct ttgtttatgc acaagaggtg agagtacccg gtgaaaatga 360 cagtattatt acaataccgg ctgagtccag ggaagaccct gagtgtttaa agagaaagtt 420 tcgtgagctt tgcgctcccc atgtaaacat cacaatggaa cgacacaagt tcaacacgcg 480 atcacaaaga cctggtgaga cttttgaatc ctatattagc gatttgaaaa ttaaagcgaa 540 aagctgcaaa tttggagcat tacaggacga gcttattcgt gacaggctgc tctgtggcat 600 acaaagtgat atcctgagaa aatcactgct gcgtgatcca gaactgactt taaataaagc 660 catatcagca tgtcagatat atgaacagac tgaacaacac actaaagacc tagcaacact 720 ccaaaggaca tctgccactg tccatgcagt gcaccaggca ttcacaggaa agaacaaacg 780 ttataaacca aagcacacag aaagcaacaa aggcccaaca gccataagaa actgtaacaa 840 ctgcggtggt gatcacccag ctaaaagaga taagtgtcca gcctttggcc aacagtgcaa 900 tagttgcaaa aagtggaacc atttcaaaaa atgctgcagg tccacacaat caaacacaac 960 aaagcgacgt ttcacaaaaa cagtccacaa tgttgaacca caccagtcaa gtagcgaaga 1020 aagtgaatca ttctttgttg atggagtgac tgatgcgaat gtttattcca cagctaatgc 1080 agcacataaa agtgaactgt acagcactgt gcacgttcac ggtaaatcag tggaactaaa 1140 agttgatacg ggagctaaat gtaatgtaat ggacttgcac acctttcagc acttacgaaa 1200 acaagaaaag ctcagcatgt caagcacagc aaaactgata gcatatggtg gcaacgagct 1260 acacacactt ggtacagtta tactaccatg cagtttagcc agccaatcct atgacttact 1320 gttttatgtc atcaaaaatt ctgcacagcc attactgggt ctgtctgact gcatacgcat 1380 gaaacttcta acactcaaca aagaagttca tcagctaaac acatacagtc aagatgaatt 1440 cacagacaca attctgacca agtatgatga tcttttcaag gatgaagtag gtaaactacc 1500 tgtcacatac tccataaagc tcgaccctaa tgtaccccca gctgttcgtc ccgcaagacg 1560 cattccagtc gcgatgcagg ataaggtcaa aaaagaactt caacgaatga cggatcttaa 1620 tgtcatcaca ccagttacag aaccaactga ctgggtgtcc tccatggtgg cggctcacaa 1680 aaaagactcc aacagcatcc gaatctgtat agatccaaga gatctgaata tggcaattca 1740 acgaccgcac cacccaatga gaacagtgga ggaagttgca acacagatgt caaattctgt 1800 tgttttctca gtacttgatg caaagagttc gttttggcag atatctttag atcacaaatc 1860 atcactgcta acaacattca gcaccccatt tggccgatat agattcctca gaatgccatt 1920 tggcattaac tcggctagtg aggttttcca gcgctcaatg gagcaaattt tcgctggata 1980 cccttgcgct gtgattgttg acgatatact agtcggcgga cgcacaatac aggaacacga 2040 tgacaacctg gagaaagtgc ttgcacgagc tagacaagtc aatctcagac tcaatccaca 2100 aaagtgcaaa ttcaggctga aggaggtaag ctacgtaggc catattttca gcagcgacgg 2160 tttacatgca gatccatcca aaatcaaagc catcactgaa atgccagtac ccgctaatgc 2220 cactgctctc atgcgatttc tcggcatggt taattacatg ggcaaattta tcccaaacct 2280 cagtgacata tctgcacccc tccgccagct gacacacaaa gacacaacat ggtgttggtt 2340 gcagcagcac caagattgtt ttaataccct aaagaactgc ctcacatgcc ctcctgtttt 2400 gtcatattac aacgttgaca accctgttac cctaacatgc gactcatcgc aatttggtct 2460 aggtgcagcg tgtctacaag acaacaagcc catagcttat gcttcacggg ttctcacaga 2520 tactgaacag cgctatgcgc aaattgaaaa ggaactcctc gctgtggtgt ttgcctgcac 2580 caagtttaat gattacatct atggcaaacc agtaaccatt gaaacagacc accaacctct 2640 ggtgacaatc accaaacgtc caatccacag cactcctgcc agactccaaa gaatgctgtt 2700 acagctccaa cggtacaaca tctcccttgt gtacaaaaaa ggaaagcaaa tgtacatagc 2760 ggacactcta tctagagccc cacaaacatc tgcacatgag catccagagg acaaacacac 2820 atttgaggta atgaacattt gtaacatctc atcatcacgc ctggatgaac ttaaactcca 2880 cacaacaggt gatgccaccc tacagttgct tgcaaacacg attaaacatg gatggccaaa 2940 caaactacac agtcttccac cagtactgtg gccgttttac ccatacagac atgaattgat 3000 cacggatgac tcactcgtgt ggaaaggaca caaagtgata atacctgaaa cactacgctt 3060 acaatacact caaatcgtgc ataggggtca ccctggagca gaagccacca aacaacgtgc 3120 acgtggcatt tttttctggc caactatgac aaaagacatt gatgcttaca ccacatcctg 3180 tgccatctgc aatagtacca gaccccacca gcaaaaagaa cctatgaaac tgcatactgt 3240 tccaagcctg ccatggtcaa ccgtagcagc cgatatattt gaatggaaca atcatccata 3300 cctggttctc gttgactcgt attcaggctg gtacgagatc gatcagttac gagacatgac 3360 ctcaacatgc gtcatacaaa agctgaagag acatttctct gtccatgggt cacctcaaac 3420 cctcataaca gataatggta gacagtttgt gagcagtgag ttcaatgatt ttgctttggc 3480 ctgggacttt actcatacaa ccagtagtcc aaactatcct caatcaaatg gcctagcaga 3540 acgcgcagtt cgtagtgcca aacagctcat ggaaaaatca aaaagagatg gtacggacgt 3600 ctatctcaac ctgttgaatc tgagaaatat tcctagagat tcaaaactag gctccccagc 3660 tcagagacta atgtccagac aaacccgttc taccctacca cttcacagtt ctttgttgaa 3720 accctctccc ctttcgtcca acgacattca tgcacaactg aaaaaaagaa ggatgagtca 3780 gaaacgcagc tatgacaaaa catctaaacg actaaacaca ctctcaaaag gacaaacagt 3840 gagaatgcaa acccctcatg gatatgatcg caaaggagtt gtgaaggaaa tctgctgtga 3900 accaagatct tacatagttc tgtctggagg aaaggaatat aggagaaacc gcagacatct 3960 cctatctgtg aaagagccag aaccacatca gcatgatgat gatgaataca aactacctgg 4020 agatccaacc aagagcgaag ttacactccc aaagatacct gtcgcacaca ctcaaaccac 4080 cacaggtcaa ctaacttctc aaacacacac acaaacgaga acctgtcctt atgtgacagg 4140 gtcaggacgc accatacgcc caaactcaaa atacaaagac tgaagctcta tacaccaaag 4200 tacacataaa ctgtacattc attatttttc attgttacat taatcaacaa tgtcagttta 4260 acgtgcaata tgtttcacag tttattaatc acaatgcaca aatgttcatt acagttattg 4320 ccaatgtttt aagtaggtat acttgttcga taaggtgcac attccattgc atagtgaatt 4380 tatgtcagtt tgaaaagaat tctgatgtaa atgcatttga aactcaaagt gttatatgta 4440 tgcttagaaa agggga 4456 // ID DNA8-4A_DR repbase; DNA; ZEB; 929 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 8 bp TSD; KW DNA8-4A_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-929 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Direct Submission to Repbase Update (11-NOV-2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 929 BP; 298 A; 144 C; 147 G; 339 T; 1 other; taaagcagtg gttctcaaac tgtggtacgt gtaccactag tggtacgcag gcttccttct 60 agtggtacgc ggaggaatcg ctgaataatt aaagtaaaaa aatgaacaca cttttaaccc 120 tttgacgcgt acgataacac cratgtgatc agtaaattga ttttaatagg ctaaaccttt 180 tattttattt tcaattttct aatgtttgtt atgtattcta tcatttgaag ctaatttcct 240 ccccatattt gtaacggttg ttgggggcgt atcccgtatt tttatatccc aatgttgaca 300 ggtatagttt aatcacttgc ttttctgaca cacaaagcct actctaacgc gcagtaagca 360 gatctaattt ctaatttaaa tatgtaaatc caattcatat atttaaaata caagttaatg 420 tttagatttc aaagatatac aaataagtca tatatacatg caacatatat ttcaaaattt 480 atcaaaaaga gtttatatat gaatatgatg acatatagcc tatatttatg aggtccaagg 540 aacatttcca ggttttgatg aataggctac tttatgttaa tactttacat ttaagtacag 600 cagttttctt tttacttttt aagaacagtg ccatttttaa ttaactttta aaagcacatt 660 ttaatttaaa cattatttat tttttacctg taagcacagt acagtgttaa tgttcaaact 720 atttataatg tttaaagtgg ctgacaataa taaatattca taataatagg aattaatctg 780 ccacgttttt gaactgtgca gagctgtagc tgcttaatta ggcctactac gctactgtat 840 ttcaataccg gtcattatgg tggtacttgg agagacaatt tttttctgag gtggtacttg 900 gtgaaaaaag tttgagaacc actgctcta 929 // ID HATN11_DR repbase; DNA; ZEB; 927 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN11_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000287; HATN11_DR; nonautonomous DNA transposon; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bao Z.; RT "Dr000287, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-927 RA Kapitonov V.V. and Jurka J.; RT "HATN11_DR, a nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 10-10 (2002). XX DR [2] (Consensus) XX CC HATN11_DR is a family of nonautonomous DNA transposons that CC belongs CC to the hAT superfamily [2]. CC Several hundred copies of HATN11_DR are expected to populate the CC zebrafish genome. The element was active relatively recently, its CC copies are ~3% divergent from the consensus sequence. CC This element is characterized by 8-bp target site duplications CC and CC imperfect 11-bp terminal inverted repeats (one mismatch). XX SQ Sequence 927 BP; 224 A; 310 C; 208 G; 185 T; 0 other; tagagatctg cgcgggacta aattttgaat cccgctcccg cccgcacccg ccaggtttta 60 gcccgaaccc gaccgctccc gcttatatta agaatttatt gtcccgctgc ccgacccgcc 120 ccgttttctg cccgccgcgc ccgatcccgc taaagagcgg gggagaacaa aaccgaaaat 180 cccccagcta tacagtccag acagccaagc tgtctgtata aacacacaca cacagcacca 240 agcacacaca cacagacaaa tctctctctc tcatacgcgc gcgcactaac acacacacaa 300 gcaccaagca gacacacagg cgtttgctgt tatatcaact caaaggcttg taataccatg 360 tatcaagtac caatgtaata ccaaaaggtt ttatataaag gtatataaat actgagtcgc 420 gccagctccg ggccgcagcg cacattactc tcgggccgat tccggcgcgg atgcggcagc 480 gtcggctcgc ttacggccgc cgcatctggc ccgattgatt cgggccggaa tcgggcagtg 540 agtccacaag catccggagt ccggcagccg gagccgggct gagtggtaag tctccggcag 600 gcgagaatct agccggaact ggcccgagtg tattttgcta tctgggaaag ctctctctct 660 ctcattcgcg cgcgcgcgca ctaacataca cacacacaag caccaagcac acacacaggc 720 aaagctctct ctctctctct ctctctctct ctctctctct cattcgcata gcaatatccg 780 cgatattgct agacagcccg ctcccgtccc aaattaaacc cgttaccgac cgctcccgcg 840 atttattcgg aaatttattc ccgcgccgca gaaatctggt cgggtcctgc ggcgcccgcg 900 ggacagccgc gggaatgcag acctcta 927 // ID hAT-N47_DR repbase; DNA; ZEB; 424 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N47_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-424 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1316-1316 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 424 BP; 130 A; 102 C; 78 G; 114 T; 0 other; caggggcgga ctgggacaaa aattcagccc tggcactgtt gccacaccag cccacattac 60 cacaccgaca cagccccacc cacggacacg cacattcact atttattttg gtgtaaagat 120 ggtgaaataa tatgacattc ttgccagatt ttgattatgt ccgggtaacc ttggattggg 180 tcggcccatc ttgaaaccag gcggcagttg ccgattgggc caaatgctta acatttatta 240 tttatattat tattattatt tttattatca tcatcataat atcacatcta gcaagagata 300 aaagctgtct gcacttaaaa acaatacata ttaaaaaaaa aaaaaaactg accggcccac 360 atttaaaaat tggtccggcc cttctggcat ttgccagaat tgccagatgg ccagtccgcc 420 cctg 424 // ID Gypsy108-I_Dr repbase; DNA; ZEB; 4301 BP. XX AC . XX DT 29-APR-2009 (Rel. 14.05, Created) DT 29-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy-108_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW gag; reverse transcriptase; integrase; Gypsy-108_DR; KW Gypsy-108-LTR_DR; Gypsy-108-I_DR; Gypsy108-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4301 RA Dib M.R. and Naveira H.F.; RT "Gypsy108_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 950-950 (2009). XX DR [1] (Consensus) XX CC Gypsy108-I_DR is an internal portion of the Gypsy108_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy108-LTR_DR. CC Gypsy108_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 327-aa gag CC Gypsy108_DR1p (pos. 17-997) and 1105-aa polyprotein (pos. CC 985-4299, conceptual translation) composed of the protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the hight identity between their CC flanking LTRs. Consensus obtained after the alignment of at CC least three independent insertions bearing at least 85% Homology CC over at least 1000bp. XX FH Key Location/Qualifiers FT CDS 17..997 FT /product="Gypsy108-I_Dr_1p" FT /note="Gag-protein." FT /translation="MDPAAMQVLSHEVTAQAQVLTTHQQQLSHLTQLTDEL FT VKSLQNLQAASTAQLTANYSPSQPFVTQTQTVSGARLAFPDKFSGNPAKCK FT GFLLQCKLFISQQPHLFKDENSKIAFVCSLLTGKALDWATAVWPDSTPIFP FT SFNDFLKRFCTVFDHPEGGRNAGEELLCVQQRSQPAAEFALHFRTLAAQSG FT WADDPLKTLYRKALNPELQKEMACRDDGKSLDQLIELSIRLDHLLRSRKPL FT CSVTPSPVSPESPTEPMQLGRTRLTPEEREQRRRNHLCLYCGLSGHMKILC FT PNKPPPKTLPVSATTVFTIANDILSVPVYLRCGVN" FT CDS 985..4299 FT /product="Gypsy108-I_Dr_2p" FT /note="Polyprotein." FT /translation="MWCKLRSRLSPWLTQEPLATIIDHSFATTHSIPLTSC FT DSSLAITAIDGRPLGEGHIKFRTLPISLQTGSLHKEELSFLAIDSPRHTII FT LGLPWLQLHDPQISWKTGEIIKWSNNCFNHCLQSVLPVQINTISSTEDPEL FT SQIPEVYQDLIEAFNKQKATKLPPHREHDCAIELLPGTTPPRGRIFPLSQP FT ETEAMNNYISEELEKGFIRPSTSPASAGFFFVKKKDGSLRPCIDYRGLNDI FT TVKFRYPLPLVPAALEQLRSAQYFTKLDLRSAYNLIRIRQGDEWKTGFSTA FT NGHYEYLVMPFGLANSPSVFQAFINEIFRDMLNQWVIVYIDDILIYSNSLP FT EHIQQVRAVLKRLIQNQLYAKASKCEFHQTCISFLGYIISPEGVAMDQQKV FT DSVTQWSKPETIRQLQRFLGFANFYRRFIRNFSTVAAPLTAMVKANNARLK FT WNPEAIRSFNQLKSRFTTAPILRHPDPNLPFVVEIDASNTGIGAVLSQRSQ FT TTNKLHPCAFYSRKLNPAERNYDVGNRELLAMKAALEEWRHWLEGAKHPFT FT VITDHKNLEYIRSCKRLNPRQARWALFFTRFDFKVTYIPGSKNVKADALSR FT LFNEEALADDVEPILMDSLVLAPIQWDIETEILQASEQNPTPQACPENRIF FT VSPLLREKLISEVHNHPSSGHPGSTATVQLIQSRYWWPSINKDVIKFINNC FT SPCQMAKHSRHRPAGLLQPLEVPRRPWSHIAIDFITDLPQSQGNTTILTVV FT DRFSKSCRLIAIPKLPTALETAELLCECVFRYYGLPEDIVSDRGPQFTSRL FT WSAFFKNLQVNISLTSGYHPQSNGQTECLNQEIGRFLRTYCHSNQAEWNKF FT LIWAEYAQNSLRKPSTGLTPFQCVLGFQPPLFPWSGEPSELPAIDTWFKKC FT EEVWNAAHTHLSHAIRRFKEQADRHRRPGPTYSPGQWVWLSTRDLRLRLPC FT KKLSPRYVGPFQIERQISPVSFRLTLPNHYRISPTFHVSLLKPAVGPAEVD FT REVAAGEQGPPPIMVDGEEAYRIHEILRSRRRGGQLQYLIDWEGYSPEERS FT WINRKDILDPTLLNEFHLQHPEMPAPRPRGRPRRRDSSHFRSRSLEGG" XX SQ Sequence 4301 BP; 1118 A; 1247 C; 874 G; 1062 T; 0 other; gaagactttg cccaacatgg atcctgcagc catgcaggtc ttgtcccacg aagtcacagc 60 tcaagctcag gtattaacta cacatcagca acagttgtct catttaaccc aacttacaga 120 tgaactggtg aaatcactgc aaaacctgca agctgcttcc acagcgcaac tcaccgccaa 180 ttactctcca agtcaaccct ttgttacaca gacccagact gtatccggag ctcgtttggc 240 attccccgac aaattttcag gtaacccagc taagtgcaaa ggctttttac tccagtgcaa 300 actgtttatc tctcaacagc cccatctgtt taaggatgaa aacagtaaaa ttgcttttgt 360 gtgttctctg ctcacgggaa aagcattaga ctgggctact gcagtttggc ctgacagcac 420 cccgatattt ccctcattta atgactttct caaacgtttt tgcactgtgt tcgatcatcc 480 tgagggtggt cgtaatgctg gtgaggagct cttgtgtgtt caacagagaa gtcaacctgc 540 agccgaattc gctctacatt tccgcacact ggctgcacaa tctggctggg ctgacgatcc 600 tctaaagacc ctatacagga aagctctaaa ccccgaactg cagaaagaga tggcatgtcg 660 tgatgatggg aaatcgttgg accaactcat tgaactctca atcaggttag accatttact 720 ccgctcccgt aaacccctgt gttctgtcac tcccagtcct gtatcccctg agagtcccac 780 tgaacctatg caactgggca gaacccgact aacccccgag gaacgtgaac aaagacggag 840 aaaccatctg tgcctgtatt gcggtctttc gggtcatatg aaaatcctgt gtcccaacaa 900 acctccgccc aagacccttc cggtgagtgc aaccaccgta ttcacgattg ccaatgacat 960 tctgagtgta cccgtttatt tacgatgtgg tgtaaattga gatctcgact ctcgccatgg 1020 ttgactcagg agccgctggc aactattata gatcactcgt ttgccacgac ccactccatt 1080 cctctaacct cctgtgattc ttccctagcc atcactgcta tagacgggcg ccccctgggg 1140 gaaggacaca taaaattccg aactctgcca atctctcttc aaacaggctc tctccataaa 1200 gaagaactct ccttcttagc aattgactct cctcgacaca caattatcct cgggttgccc 1260 tggctacaac ttcatgaccc ccaaatttcc tggaaaacgg gtgagatcat taaatggagc 1320 aataattgtt ttaaccattg cctgcagtct gtcctccctg tccagattaa taccatttcc 1380 agtactgaag accccgaatt aagtcaaatc cctgaagttt atcaagatct catcgaagcc 1440 tttaacaaac agaaagccac taagcttccg cctcatcgtg agcatgactg tgccattgag 1500 ttactgccag gtacaacgcc tcctcgtggc cggatttttc ccctctcaca acctgagacc 1560 gaagccatga ataattacat ctcggaggaa ctggaaaaag gctttatacg accttccacg 1620 tcacccgcct cagctgggtt tttcttcgtc aaaaagaagg acggtagcct acgcccatgc 1680 attgactaca gaggactgaa tgatatcaca gttaagtttc gctatccttt accactagtc 1740 ccagcagccc tcgaacaact acgctcagca cagtacttta cgaagttgga cctccgcagt 1800 gcttacaacc tcattcgtat ccgacagggg gacgaatgga aaaccgggtt ctccaccgct 1860 aatggccact atgaatattt ggttatgccc ttcggcctag caaacagtcc ttcagtgttc 1920 caggctttca taaatgagat attcagagac atgctcaatc agtgggtcat cgtgtacatc 1980 gacgacatcc tcatctactc caattcccta cctgaacaca ttcaacaggt cagagccgtc 2040 ttaaaacgcc taatccagaa ccagttgtac gccaaagcct ccaagtgtga gtttcaccaa 2100 acatgtatat catttctggg ttatatcatc agtcccgaag gcgtggccat ggatcagcag 2160 aaggtagatt ctgtcacgca gtggtccaaa cctgaaacca tccggcaact acaacgtttc 2220 ctggggttcg caaacttcta tagaaggttc atccggaact tcagtacagt agccgctcct 2280 ctcacagcca tggtaaaggc caataacgct cgcctgaaat ggaatccaga agcaattcga 2340 tcattcaacc agctcaagtc acgcttcaca accgcgccca tcctacgtca tcctgacccc 2400 aatctaccat tcgtggtcga aatagatgcc tccaacacgg gcattggagc cgttctatcc 2460 cagaggtccc aaacgactaa caaactccat ccttgtgcct tttactctcg caaactcaat 2520 ccagctgaga gaaactatga cgttggcaac cgggaactct tagctatgaa agcggcattg 2580 gaggagtgga gacactggct tgagggcgct aaacacccat tcaccgtcat aactgaccac 2640 aaaaatcttg agtacatccg gtcctgcaag agacttaacc caaggcaggc aaggtgggct 2700 ctattcttta ctcgctttga cttcaaggtc acttacattc ccggttcgaa aaatgtcaag 2760 gctgacgctc tatctcgcct ctttaatgaa gaagcattgg ctgatgatgt cgagccaatc 2820 ctaatggact ccctagtcct agcacccatt caatgggaca ttgagactga aattctccaa 2880 gcatctgagc aaaaccctac tccgcaggca tgtcccgaaa acagaatctt tgtttccccg 2940 ttgctccgag aaaaacttat ttctgaagtt cacaaccacc ccagttccgg tcatccaggt 3000 agcacagcaa ccgtccaact catccagtcc cgttattggt ggccatcaat caataaagat 3060 gtgattaaat tcataaacaa ctgctctccc tgtcaaatgg ccaaacactc ccgtcaccgt 3120 ccagccgggc tactccaacc cctagaagtt ccacgtcgcc cctggtcaca tatagctatc 3180 gacttcatca cagacctacc tcaatcccaa ggaaatacca ccatccttac cgttgttgac 3240 cgtttctcta agtcttgccg actcattgcc atacctaaac tgcctacagc tttggaaacg 3300 gcagagctac tttgcgaatg tgtcttccgc tactatggtc tacctgaaga cattgtttca 3360 gatcggggtc cccaatttac ctcccgtttg tggtccgcat tcttcaagaa cctgcaggtt 3420 aacatcagtc tcacttccgg ctatcaccca caatccaacg gccagactga atgcctcaat 3480 caggagattg gtcgatttct ccgcacctat tgtcactcca accaagctga atggaacaaa 3540 ttcctcatat gggctgaata cgctcagaac tccctgagaa aaccatctac aggtctgact 3600 cccttccagt gtgtactcgg ctttcaaccc cctctattcc cttggtctgg cgaaccttca 3660 gaacttccag ccattgacac ctggttcaag aaatgtgagg aggtatggaa cgcagctcac 3720 acccatctat cgcatgccat ccgaagattt aaagaacagg ctgatcgtca ccgtcgtcct 3780 ggtcccacgt attccccagg acagtgggtg tggctatcca ctcgagatct gcgcctgaga 3840 ctaccctgca agaaactcag ccctaggtac gtgggtcctt ttcagataga gagacaaatc 3900 tctcctgttt cttttcgact gacacttcct aatcattacc gtatttctcc tacattccat 3960 gtctctctgc tcaagcctgc tgttggtcca gccgaggtgg atagggaggt ggcagccggt 4020 gaacagggtc ccccacctat catggtcgac ggagaagagg cttatcggat ccacgagatc 4080 ctgagatcca gacgccgggg cggacaactt cagtatctca tcgactggga ggggtacagc 4140 ccggaggaaa gatcttggat caaccgtaag gacattctcg acccaactct gttgaatgag 4200 ttccacttgc aacatccgga aatgccggcc cctcgccccc gtggaagacc ccggcgtcgc 4260 gattcttctc acttcaggag ccgttcgttg gaggggggct c 4301 // ID hAT-N27_DR repbase; DNA; ZEB; 1305 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N27_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1305 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1297-1297 (2008). XX DR [1] (Consensus) XX CC putative hAT transposon, TSD=8 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1305 BP; 458 A; 182 C; 180 G; 484 T; 1 other; ggtaacactt tagaataact atccgttata actagttaat agwccattaa taaactgtta 60 gttaacgagt tataaatgac ttgttaaata aaagttaata gtttgttaat tatttataac 120 tgtgccttat agatagccaa tagataactt acaagctgtt agttaacaag ttataaatga 180 cttgttaact gtattttaat gatttataac tatacctaat aaattagtaa tagatcatga 240 acaagctgtt agtaaattac ttactaaaga cttgttaagt atcagttaat agtttatata 300 tgttattggg acgttattct aaagttgcaa ctattcttca tttattaact gttagtaaat 360 gaggaatagt tgcaacttta gaataacgtc ccaataacac gcataagcta ataactaaca 420 cttaactaat atattaactc acactttaca gatcagttgt tcatagtttg ttaaccatct 480 aataacacca gtgaaacttt tgtagaactt ttctaataaa ataaattttt atttattttg 540 ttttcatgtt tagcacaaaa agtgcccaaa cctcatgggt ttgtattctg ttgtatactg 600 ctgttctgcc atgtgatggc tcagtatgtg tgtttctata cattaaacac aatgttcaac 660 atttcatctg tggagtttct ttggtaaaat acagcacaca gaaaagccta tgagtggaac 720 aaggttaagg tacaaacatt ttatatttta aatatcacat caaatttctg tagcttgaac 780 ttgttccatc catttgctgt tctacaaagt ttcactggta ttagtagatg gttaacaaac 840 tatgaacaac tgatctgtaa agtgtgagtt aatatattag ttaagtgtta gttattagct 900 tatgcgtgtt attgggacgt tattctaaag ttgcaactat tcctcattta ctaacagtta 960 ataaatgaag aatagttgca actttagaat aacgtcccaa taacatatat aaactattaa 1020 ctgatactta acaagtcttt agtaagtaat ttactaacag cttgttcatg atctattact 1080 aatttattag gtatagttat aaatcattaa aatacagtta acaagtcatt tataacttgt 1140 taactaacag cttgtaagtt atccgttggc tatctataag gcacagttat aaataattaa 1200 caaactatta acttttattt aacaagtcat ttataactcg ttaactaaca gtttattaat 1260 ggtctattaa ctagttataa cggatagtta ttctaaagtg ttacc 1305 // ID BEL21-LTR_DR repbase; DNA; ZEB; 312 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL21-I_DR; KW BEL21-LTR_DR; BEL21_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-312 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1390-1390 (2008). XX DR Genome; chr12; Positions 22019062 22018751. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 312 BP; 89 A; 89 C; 61 G; 73 T; 0 other; tgttagagag gcgggacttg acgtacctgt ctatcatcct catgacgcag cacaaagcag 60 agctatgcag agaagcgccc gcagctatgg cgaatggcct catcctatgt gcctttgtgt 120 gttattgtgt gcgtctataa cgaactgacc acacacacac acgcctatac acacacctct 180 tcagtgttca ataaaccagt ttgattgacc ttaactgcct gatccactcc tgtgttctga 240 tcgacacccc aagggcaagg ctagagaaat tagaacctcc tcagaataca atcctaacct 300 aacaacccaa ca 312 // ID DNA-2-18_DR repbase; DNA; ZEB; 3225 BP. XX AC . XX DT 14-NOV-2008 (Rel. 13.11, Created) DT 14-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-18_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3225 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1709-1709 (2008). XX DR [1] (Consensus) XX CC Copies are ~97% identical to the consensus. TIR is 16-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3225 BP; 1159 A; 550 C; 520 G; 995 T; 1 other; cactgttaga ccttaccagg cttttttaca gtagaacact ggcaacactg ttgccagcta 60 ctcactgtaa aagtttggtt acagtaagta actggcaaca gtgttgccag ttaattactg 120 aaactgaacc attacagtga gtggctggca acagtgttgc cagttgttta ctgtaaccga 180 accattgcag tgagtagctg gcaacagtgt tgccagttaa ttactgtaat agagcagtag 240 tggtaactaa ctggaaactg tgtacagtta attactgtac tgtagtgtta caactcacag 300 cattatactg catacacaat agtatgtcaa ggtgttacta aaattaatta gtattcttac 360 ctgttattaa ttctttaaat taattaagtt gtaaattctg acaaatttat tttattgttt 420 tggcagcaaa caaatataaa acagaaaatg tataacaaaa ttaagaaatc agcagatata 480 aatatcatct tacatattac tgagagccac aggtctgcac agcaaggctg cagtcaaact 540 aacgtttaag catgcaaaat tctgtcatat ggctctgcga aaaggagtgg gattaaacaa 600 aatatttaga gattgaaaaa gcaagcaact ggtccataat ttttgttcag agaggtcatg 660 gtttgatctt tgattagtct caagcaatca catgatgtga tttcgcagtt cagagttctc 720 caagcttgaa ctttccaatg ctgcaaactg taaaatttgt cgcatgagca tgcattccag 780 gtctgcagca ttcacatgcg tatgaatgga agtctatagg gagaaaagtg cagtgtgacc 840 gggactttaa gctgtaactc taattaaaca cacctgataa aactaattaa ggctcgttag 900 aaatctacag gtaatgttga agcagggtgg gaactaaact ctgctgccct gcagttctcc 960 agtaacttgg agtttgaccc ccatggtata tagcaggggt gcttaatcct gttcctggag 1020 atctacgttc ctgcagattt cagttgctac ccatatcaaa cacacctgcc tgtaattatc 1080 acgtggtgtt caggtcctaa ttaattggtt taggtgtgtt tgatatggat agcaactgaa 1140 atctgcagga aggtagatct ccaggaacag gattaagcac ccctggtata tagcatactt 1200 ttaaagcaac tgctaacatt tatcacaatc acatatccaa taaacacaca cacacacaca 1260 cacacacaca cacacacaaa gtgctgcagt gaaatgtgtt actctctcct caatgctgac 1320 catgcaaaaa aaaaaaaaaa aaaaaaaaac tgaacagcaa aaaacacaaa ataattattt 1380 taaattttaa agttttaaat aattaaaagc tctgtcaaat ctcaaaaagt ttaaaataat 1440 tatagaaaac aaaaataaag ttggcaacac cgaaaaacca aaggatccaa tcttttattg 1500 ttataattat ctcatgacaa caaacttttt tattgtttca aatataaatt cataggtgga 1560 aacacwgctc tgaaaaatac ttagcaacaa actcacaacc tgggaccgga caggggagct 1620 tttgtgtgtg gatggctatg tgtatcatgt ttttgtgatt ccctgtgatg atgtgcacat 1680 gttaccttac tgagatgatg gaatttaatt ttggcttagt tgccagaaag atacatacaa 1740 ctgttagcat ctgtacatta tggcagtatg caatatccaa taaatacata tacttaaaca 1800 aaagtaatgt aagaagttat atgaggagag gctaactagc taacaatgta gctttcaagt 1860 acacagttca agataacaac aatataactt aaaacacagc cttattttat aaaccctcaa 1920 aaacatttga acacatttta cttactcatt ttagattctt atttaaagcc taaaacatat 1980 cagactcaac atctgaattg atagacagag aatagagcac actcagcagt aaacaaatca 2040 aacacctggc tctcttaaag ggccagcatt ttctgaaaac tctttctaac acctttgtgt 2100 ttagaataag acggacagcc tgtgggagtg tgtggtgatg cctaaatcat tgttttttac 2160 agttatttac tgcacaatct acagagtaac agtgcagtta tactaattaa acacacctga 2220 tcaaactaaa tctaaaggta gtgtgttgaa gcagggctgg aactaaattc tactaggctg 2280 tggccttcca ggagtttgac acccctggta catagcatat tttcaaagca actgcctaca 2340 tttatcacag ttatgcacat atcgtttaat aaaaacaata aaaatatact ataatataaa 2400 cttcatccat agaaaaaaaa ataaataaat aaataatctg attctcatgt cgttccaaat 2460 ctgtagtaaa aataataaaa aacacaacaa atatcatgat ttatacattt atttaggaaa 2520 gcctttatac tgttaatggc acttttactt gccagttttc cagacaaatt cattgagtct 2580 ctgcaaaaat gtatgtgcgt gttgtttaag atatatatat atatatatat atatatatat 2640 atatatatat atatatatat tatatatttt tttttttaaa taacacattt ttactagtta 2700 tttagcaaaa tactaatatt cagttgaaag tgcaatttaa aatcttaact ataggtcaaa 2760 ctaggcaatt aaattagatt aattatagaa aaagtaaaca aagttataaa aattattgtg 2820 acaatttctt tgctctgtgt gaatattttt gcctttaaat gcatttcgtc catatcttac 2880 acttaccttt taataagctc tgtggcattg gttgactcct gatactcaat gttaaaaaca 2940 taaaaggaac caaagaaaac acagagataa aataaatata aaatagcaac ataaagcatc 3000 agagtagatg gggcagtaat ggttgaacag taatttacca tttattatca cagtaaatat 3060 ctgtaatggt acatagagta aattactgta atttattctt tgcactacag aatttcatac 3120 aaatcctgct ccttttacag taaaatactg tttcttttca ttacagctaa acactggcta 3180 ttttacagtt atttactgca taatctacag taagggttaa cagtg 3225 // ID Gypsy102-I_DR repbase; DNA; ZEB; 6485 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy102-I_DR; KW Gypsy102-LTR_DR; Gypsy102_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6485 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1513-1513 (2008). XX DR Genome; chr19; Positions 23312177 23305693. XX CC Positions [2835-3374] - Reverse transcriptase CC Positions [4827-5303] - Integrase core CC 'TAAA' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 297..1604 FT /product="Gypsy102-I_DR_1p" FT /translation="MAEGVNSFAEEMKDNEVFDQSIQFGRGRGFFIIPSEP FT RMPGPTSRAFDTPVLTSTRYVDRAMPSPINQNPDLSSLISEIAEKLVLRSD FT NSTNRSNSEQPAEMTLSNVKLVMQSDVKEPPIFRGDGSDKLTVQEWENIMS FT LYLRKRAIPIEDQSQEILDRLMGKAGDVVKIKLRNNTSADCTSKPRLIFDI FT LKQHFSTISYSSMPLADFYNTLPMPSEDAMEYWIRLNKTMDVAKECLERQG FT RRVCDPSHEISMMFIKHCPDKSLSNVFKFKSAEKWSAGEIQERLDEHMQER FT KARTFTNLRSLNITEHRACTQSPDVVKTVASNSPLSIPPSPMVQSGSDSAC FT MKSLVSLLDRLVTQQTQFQVSPHHQATTFQTYCKPCRVCRSLDHSTVSHCR FT QDNRCLKCLTPGHWKKDCPQRADRNQHQLESGAGRPPNQLNY" FT CDS 2091..6197 FT /product="Gypsy102-I_DR_2p" FT /translation="MIIGTLLLDLTISLMKTFSSSCQCLRVLSVGKGDTVP FT DKIGTVKLTQAVVLLPKHEHLVWGRLPSNVPMSPGSTVIVEPTNSKAMPRD FT ILVGRLVVPLWGDRWVPMTVMNPTNKVITLKRNSKMADVFPCIASEDFEIF FT QGLHITLVENKSDNSCNSCAADHIDKNLLNLVLEEIDVNWSQISDDCKSQL FT TQLLADYQDIFSKHPLNCGEATGYVHRIRLTDNRPFRLPYRRVPPAHYEKL FT RQVLTDMEEKRIIRKSSSEYASPLVMVWKKDGGLRICTDFRWLNARTMKDA FT HPLPHQADCLAALGGNCFFSTMDLTSGFYNIPMSEEDKKYTAFTTPLGLHE FT YNRMPQGLCNSPASFMRMMLGIFGDLNFSKLLCYLDDLLVFAPSESEALSR FT LRTVFDRLRENNLKLSPKKCHLLQSQVRFLGHIIDSGGVSVDPAKVEVITR FT MTAHDLMDNDGCTPSVKRIKSFLGMVFYYQHFIPNCSSISKPLFGLTAGQK FT RRGKLSKERKFQGTYRKLTPADWTTECQVAFDKLKTALLECVMLAHPDFNE FT PFILSVDASLDGLGAVLSQVPKGELQARPIAFASKTLSTSQRRYPAHRLEF FT MALKWSVCEEFSHWLKGHKFTIWTDNNPLTYILTKPKLDAYEWRWVSKLAA FT YTFDLKHLPRKANVVADTLSHDPFVEPIGRRLLNEPYSYLMQEASGIGEDS FT VHDVFRVSSHIQPNQPMECLSQSVDYNVDEVKALCQSHCSWTDGAESRALH FT LVHHVQMISGCQETLPVLSAQELKDSQQEDLTISKVLSFLIRQRRPSRRER FT HGASAQVIRLLKQWSKLEVRDDVLYRVTKDPLTKRKRLQYVLPDSLRNKAL FT SGIHDLTGHQGQDRTISLARQRFYWPDMEKNIREYVQCCRRCVLAKSPEPA FT ARAPLENIKSTAPMELVCIDFWSAENSKQCSVDVLVVTDHFTKLAHAFPCA FT NQTAKQVAKKLWDNVFCIYGFPERIHSDQGTNFESNLIAELLRLAGVAKSH FT TTAYHPMGNGGTERFNRTLGNLLRSLPLKEKCQWPQQIQTLTFAYNATINE FT TTGYAPFFLMFGRIPRLPVDIMFNQVLDNPDIADYDSYVKSLLCCLKNAMD FT IAQKHSSVEQRHQANQYNKRVKGTRLSVGDRVLVANKSERGKRKLADKWED FT GVFTVVEVNPDIHIYRIRDASGRTKVVHRNLLLEVNFLPIVGIDKEQSTSE FT NNQPLIIVESDQNGADQDSSEDLSAVSFTESSIQSSSDYPYPLPVCESDRN FT PVVHVTLPVDLENSHDLDEVDTVDADIHQSPGAANELPTNEKHEQDSNEIA FT QYLPGSPDLLCSDPSLRGAENADLVKPVRTRVGRIVKSVNRLIESMVQMPV FT SKVQVV" XX SQ Sequence 6485 BP; 1883 A; 1291 C; 1470 G; 1841 T; 0 other; atggtgccgt gaccttttaa ggatccatct gaagaccgac gtctgctcgc tcgccgtgga 60 agctgcatgg actggtttta ggtacttttc aaagacaaaa ggactatttt atataaagtg 120 aggtggacta tatacatttt ttttctctca cagaagacat tttcgaactg aattcggact 180 tttgaaatat acatttgctg actggttact gttattgatt gccattgttc agtacaatat 240 ttcagttcat ttattgattt aaatttgaag ataattaatt ggtacaatac agcaatatgg 300 cagaaggcgt taatagcttt gctgaagaga tgaaggacaa tgaggtgttt gaccaaagta 360 tccagtttgg taggggaagg ggttttttca taataccttc tgaacctagg atgccagggc 420 ctactagcag agcatttgat acacctgttc tgacttctac aagatatgtt gatcgtgcca 480 tgccatctcc gattaaccaa aatccagatt tgagttcact catcagtgaa atagctgaaa 540 aattagttct gcgttctgat aacagtacaa atcgtagtaa ttctgaacag ccagctgaga 600 tgacactgtc taatgtcaaa ctagtgatgc agtcagatgt aaaagagcca ccaattttta 660 ggggcgatgg ttcagataaa ctaactgtac aagagtggga aaatattatg tccctgtatc 720 tgagaaaacg tgctattccg attgaagatc agtcacaaga aatattggat aggctaatgg 780 ggaaagcagg tgatgtagtg aaaattaaat tgcgtaataa cacatctgct gactgcacca 840 gtaaacctcg cttgatcttt gacattttaa aacagcattt cagcaccatc tcctattcta 900 gtatgcccct agccgatttt tacaatactt taccgatgcc atctgaagat gcaatggagt 960 attggatcag gctgaacaag acaatggatg ttgccaagga atgcctggag agacaaggtc 1020 ggcgcgtttg tgatccaagt catgaaatca gcatgatgtt tatcaagcat tgcccagata 1080 agtctctttc taatgtcttc aaattcaaat cagcagagaa atggtcagct ggtgagatac 1140 aggagagact tgatgaacat atgcaagaga ggaaagctag gactttcact aatttacgca 1200 gtttgaatat aactgaacat agagcttgca ctcagagtcc tgatgttgtt aagacagttg 1260 cttctaactc tcctttgtcc attccacctt ctcctatggt gcagtctggc agtgacagtg 1320 cttgcatgaa aagtctagtg agtctattgg accgtcttgt cactcaacag acccaattcc 1380 aagttagccc acatcatcaa gctacaacat ttcagactta ctgcaaacca tgcagggttt 1440 gtagatcttt ggatcattca acggtgtcac attgtagaca ggataacaga tgtctgaagt 1500 gtttaactcc tggacactgg aaaaaggact gccctcagcg agctgatcgt aaccagcatc 1560 aattggaaag tggggctggc agaccaccta accagttaaa ctattgaacc cgcattatgg 1620 aaggggtgat gtgggtatgg atactaaata ccttcatgaa gaatgcgatc ttgcccagct 1680 ctatgagaaa aactgtaata gtgcccctag agggaccaag atagttgtac aatctagtca 1740 gaaagtcgaa gccttcagta gccttttcta tgttcctgtt cgtgtcaatg gccacgttat 1800 tcttaacgga atgattgatt ccggttctat ggcatgcaca atcagtaaac atgctgtgga 1860 gaaaattagt tctgcaggca ttttgcctga aaaaaagcat tcagaagaaa agattgtctt 1920 gatcggctgt ggaggtcagc aaacccagcc tgatggtttc tatgatttgg agatccagtt 1980 ctttgacact cggtgtgttg caccttgctt ggtcgtgcct ggccagcgtg atgatcttat 2040 tctgggctct aatatcatta aacacttaat tcatgagctg aagaacaaca atgattattg 2100 ggacattgct tctagacctg acaataagtt tgatgaagac gttcagcagt tcttgtcaat 2160 gtttacgggt gttgagcgtt gggaaaggag acactgttcc tgacaaaata ggcactgtca 2220 aactcactca agcggtggta cttttaccaa agcatgaaca tttggtctgg ggccgtttgc 2280 catctaatgt gcccatgtcg ccaggcagca cagtgatcgt tgagccgacc aattcaaaag 2340 ctatgcctcg agacattcta gttggtcgtc ttgttgtacc actttggggt gacaggtggg 2400 tgcctatgac agttatgaat ccgacaaata aagtgatcac gttgaagagg aacagcaaaa 2460 tggctgatgt atttccatgt attgcatctg aagattttga gatttttcag ggactgcaca 2520 ttactctagt ggaaaacaag tcagataatt catgtaattc ctgtgcagct gaccacattg 2580 acaaaaatct gctgaacctg gttcttgagg aaattgacgt caactggagt cagataagtg 2640 atgattgtaa atctcaactt acccagttac tagctgatta ccaagacatc ttctccaagc 2700 atccattgaa ctgtggtgaa gccactggat atgtgcaccg aatacgactg acggataatc 2760 gaccctttcg attgccttat aggcgagtac ctcctgccca ctatgaaaaa ttaaggcaag 2820 ttctcactga catggaagaa aaacgtatca tcaggaagtc ttcaagtgag tacgcatcac 2880 cattagtgat ggtgtggaaa aaggatggtg gattgcgtat ttgcacagac ttcaggtggt 2940 taaatgccag aactatgaaa gacgcacacc ccttgcctca ccaggctgac tgcctggctg 3000 cgcttggagg aaactgcttc ttcagcacta tggacctcac gtcagggttt tataatattc 3060 caatgagtga ggaggacaaa aaatatacgg cttttactac accgctaggc cttcatgaat 3120 ataatcgaat gcctcagggg ctttgtaata gtccagcctc cttcatgaga atgatgcttg 3180 ggatcttcgg agacttgaac ttttcaaagc ttctatgtta ccttgatgat ctccttgttt 3240 ttgcaccatc tgaaagtgag gctctgtcca ggctgcgtac agtgtttgat agactgaggg 3300 agaacaatct gaaactttca cctaaaaaat gtcatctact tcaaagtcaa gtaagattcc 3360 ttggccatat cattgacagt ggaggtgtgt cagttgatcc agcaaaagtg gaggtcatca 3420 caaggatgac tgctcatgat ctgatggata atgatggctg cacaccgtct gtcaaaagga 3480 ttaagtcttt tcttggcatg gtattttatt accaacactt tattccaaac tgctcatcta 3540 tctctaaacc tcttttcggc cttacagctg gacagaaaag aagaggaaaa ttgtctaagg 3600 aaaggaagtt tcaaggcact taccgtaaac ttacccctgc agattggact acagagtgtc 3660 aggtcgcttt tgacaaactt aagactgctt tactggagtg tgtcatgctt gcacatcctg 3720 acttcaatga gcccttcatt ctttctgttg atgcttcatt ggacgggcta ggggctgtat 3780 tgtcccaagt gccaaaaggt gaactacaag ccagacccat tgcttttgca agcaagaccc 3840 tcagtacatc gcagcgcaga tatcctgctc atagactaga atttatggct ctaaagtgga 3900 gtgtatgtga ggagttcagc cattggctaa aaggccacaa atttaccatc tggacggata 3960 ataaccctct cacatatatt ctgactaagc caaaacttga tgcttatgag tggagatggg 4020 tttccaaact ggcggcatac acttttgacc tgaaacattt gcccagaaaa gctaatgttg 4080 tggcggacac cttgagtcac gacccttttg ttgagccaat tggacggaga ctgttgaatg 4140 agccgtattc ttacttgatg caggaggcca gtgggatagg cgaagattcc gttcacgatg 4200 ttttcagggt cagtagccat attcagccaa atcagcccat ggagtgtctt tcgcaaagtg 4260 ttgactacaa cgtggatgaa gtcaaagctc tttgtcagtc acactgcagt tggacagacg 4320 gagcagagtc aagagccctt catttagtcc atcatgttca gatgatatcg ggttgtcagg 4380 agactttacc agtgctctct gcccaggagc tcaaggacag tcaacaggaa gatctgacta 4440 tatccaaggt gttgtcattc ttgattagac agagacgtcc atcaagaaga gaaaggcatg 4500 gtgcaagtgc tcaagtaata agactgttga agcagtggag taagttggaa gttagagatg 4560 atgtgttata tagagtgacg aaagaccctt tgactaaaag gaaaagatta cagtatgtac 4620 tgcctgatag tctgaggaac aaagctctgt ctggtatcca tgatctcaca ggccatcaag 4680 ggcaggaccg tactatttca cttgcaagac aacgctttta ttggcctgat atggaaaaga 4740 atatcagaga atatgttcaa tgctgtcgaa ggtgtgtcct tgcaaagtca ccagaacctg 4800 ctgctcgtgc ccctttggag aatatcaaga gcactgcccc gatggagttg gtgtgcatag 4860 atttttggtc agcagaaaat agcaaacaat gttcggtaga tgtcctcgta gttacagacc 4920 acttcaccaa gcttgctcat gcattccctt gcgctaatca aacagcaaag caagttgcaa 4980 aaaagctttg ggacaatgtg ttttgtatct acgggtttcc agaacgcata cactctgacc 5040 agggcacaaa ttttgagagt aacctcatag cagagcttct tcgtctagct ggtgtcgcaa 5100 aatctcacac aacagcttac cacccaatgg gcaatggtgg cacggagcgc ttcaacagaa 5160 ccttgggtaa tttgcttcga tcgctccctt taaaagagaa atgtcaatgg ccacaacaga 5220 tccaaactct cacatttgct tacaatgcaa ctatcaatga aactacaggt tatgcacctt 5280 tctttttgat gtttggccgc atacccagac ttccggttga catcatgttc aatcaagtac 5340 tcgataaccc cgacattgct gattacgact cttatgtcaa gtctttactg tgctgcttga 5400 aaaatgctat ggatatcgct caaaagcatt cctctgtcga gcaaagacat caggctaatc 5460 agtacaacaa gcgtgttaaa ggaactcgct tgtcggtggg ggatagagta ttggtggcta 5520 ataaatcaga aagaggcaaa aggaagttgg ccgacaaatg ggaagacgga gtgtttactg 5580 ttgttgaagt gaatcctgac attcatattt ataggataag ggatgcatca ggccgcacta 5640 aagttgtgca taggaatctt ttactggaag tgaatttctt gccaattgtt ggaattgata 5700 aagagcaaag taccagtgaa aataatcagc cactgatcat tgtggagtct gatcagaatg 5760 gagcagatca ggactcaagc gaagacttgt ctgctgtttc gttcactgag agctcaattc 5820 aatcttcctc tgactatcct tacccacttc ctgtttgtga atcagaccgc aaccctgtgg 5880 ttcatgtgac tttacctgta gatttagaaa actctcatga tttggatgag gtagacacag 5940 tggatgcaga tatccatcaa tctcctggtg ctgccaatga gttgcctaca aatgagaaac 6000 atgaacaaga ttcaaatgaa attgcacaat acttacctgg ctctcctgac ttactgtgca 6060 gtgatccatc gttgagaggt gctgaaaatg ctgatcttgt caagcctgtt agaacgagag 6120 ttggaaggat agttaaatct gtaaacagat tgattgaatc tatggtgcag atgccagttt 6180 caaaagttca agttgtttaa attgagaatt cattttgaag tcttagtaat ttaatgtact 6240 gacattgttt gtgacaaaag tcaagtttaa tagagccaag tgctgtgtgg tgtattaggc 6300 atcacattaa cttataggga atttgagact tgatcaacct cttgttttcc ttaaccttga 6360 aattgaagta acttttatgt tggtttcgtg aattagattt gcgagtcact tgtgctagta 6420 tcgtgttcac tgggaatgtc tttttggtac attgaaagat tttgatgaaa ataaggaggg 6480 gtgaa 6485 // ID BEL23-LTR_DR repbase; DNA; ZEB; 1020 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL23-I_DR; KW BEL23-LTR_DR; BEL23_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1020 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1394-1394 (2008). XX DR Genome; chr20; Positions 46536868 46535849. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1020 BP; 278 A; 202 C; 219 G; 321 T; 0 other; tgtgcaaaag gctgttgagc aatgttgact ctcagagact tttattctga aatcgcgtca 60 caaaccgaaa ccggaatagg cgctactgaa tacagagtga actgttgctg cacgagactt 120 tgccgagcgc ggctcgttaa ctaaagtggc gctggaaacg tgtattgctg ctataatgag 180 tgtctgactg gatgtaattg tgcttctgtg cgtgttatga aagtgagtac agttccggtt 240 ataacaatta tttaggagtt atacttatta gtcccagtta tcacagagat acctgatgtg 300 tcatagtgct aattagcatg cgttagcatc gcgatcgcga tgctattcac tgtttgttca 360 gctgaagtga ttcaaatgta taatgtttgt tagtaaatgt ggatgttaga gtgtgattac 420 atccaattat gttcattttg tgagggaaga gcctataatg tatatgttag tgctgtcatg 480 tgtaagtttg ttaaagtaat gatgtatatt actgattgct gatgctggag tgttgttatt 540 tgtctgttac agagaaacca cgactttcat aagcacaact catcggctac acgctcattt 600 aatcaggaat tcaccaataa ccagctaagc aagtgcaatc ttcagtatat tgtgaggaaa 660 gggcagttat gcgtcaatgg ctaagtgata gactcatagc actctgaagg ctcattggac 720 cacggtattc atacacatac ttcggaccaa tcacaaggct gggcctaaac gtcacgtctg 780 tgtgcgacta ataattgtgc tttataaatc atcctctgaa cctgtgcacc ggaaccaatt 840 ggacacttca ttgtacatct gctaatattc acagttggct cttctctctc atgttaatac 900 ttacttgcca caataaatca gtttgatttc atccttccct ctgtcgtgtt ctaacagaca 960 cacctggttc actgctatac acaacgagtt agcctctgct atcaggtagc cttcgctaca 1020 // ID Gypsy71-LTR_DR repbase; DNA; ZEB; 943 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy71-I_DR; KW Gypsy71-LTR_DR; Gypsy71_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-943 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1458-1458 (2008). XX DR Genome; chr1; Positions 50985418 50984476. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 943 BP; 268 A; 133 C; 211 G; 331 T; 0 other; tgtaacaggt attaaattta tttatgtttt cgttataaat tcacgcaaaa tctttgtttt 60 gttggtcatt aatatgtaag aaataaagag aaaatgtttt atttttgttt tgacgcgtcg 120 gcgcgatgac gtcattgacg tgcgttgccg cgatgacgtc agtgacatag cgcgcgcttt 180 gcttcagtct gtcggaggac agagcagagc gaggtaggca tttactcctg tcccgtagaa 240 aagtgtgtat aaagtacaaa aatatgtaat ttagatgaag tgagtttgtt ataatatgtt 300 atataacatg tacatgagtt atatgagtgt gtttgggttc tttatatcgt gttttaacct 360 gccaaactgt cgtttcaacc aagagaagtg cagtgcacaa tattagcatg gaatttagcc 420 tgtgaaattt gtattttgat tgctgatttt cgatttgttt tgtgtatgaa tgtttagcag 480 tagcaggcgt aagaggagtt ttaagagaag actaactgca tgttctctgt ttgtgcaggt 540 attgtttttt gtgttttatg tgtctgatgt ttaatgtgtt atataaaata ctggtaaatg 600 ttatatatta tatattaagt ggagtttttg taaagcattg ggaagttaaa atattcacta 660 tattaattaa cagttttata atgacttatc ttttttcatt atgtaatatg ttatatttac 720 tgttttgttc ttgcttgagt ctgtcggagg acagagcaga gcgagcagta gcaggcgtaa 780 gaggagtttt aagagaagac taactgcatg ttctctgttt gtgcagcact taaataaatg 840 agaccgctga acctgcagac agactccagc tctttacttt acacaacaca acacaacaca 900 acgcagaggg gtagtcgagc cgttaagacc ggccccggct aca 943 // ID Gypsy88-LTR_DR repbase; DNA; ZEB; 521 BP. XX AC chr10; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy88-I_DR; KW Gypsy88-LTR_DR; Gypsy88_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-521 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1488-1488 (2008). XX DR Genome; chr10; Positions 24368863 24369383. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 521 BP; 101 A; 101 C; 91 G; 228 T; 0 other; tgtcagggtt ttgccctttg tcattttagc ttttcctgta atgctttttt ttgtccacta 60 gatgtcgtca tttagtagtt ttctatttgg tatccgtaat cattgtcatt gatttcacat 120 gtgccttgtt tagtttcttg tatttaagta gcctcagttt catctgtact ttgctcagtt 180 attgtggatt caagcctgtc aaccactgtg agttactttt tgtaaagcct cttaattata 240 actctaagct ctttaatgac ttatatcttt tgtgtttgtt gagttcaact gttcaggtgt 300 ttttgcctac tttttgagtt ttgtattttt ggatattgga ggactctgtt ttttgcctct 360 ctttactttt atcacttatt gaagattttt ttgtcaataa taaatttcca ctgagaagca 420 agttgccttt cttcgtctgg gttcttcatc tcccagtagc tcagtaggga agtcattcac 480 ctctcatgcc agagacccgg gttctatccc cgatcctgac a 521 // ID EnSpm-2_DR repbase; DNA; ZEB; 7988 BP. XX AC . XX DT 31-JUL-2008 (Rel. 13.07, Created) DT 31-JUL-2008 (Rel. 13.07, Last updated, Version 1) XX DE EnSpm-2_DR is an autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; EnSpm-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7988 RA Kapitonov V.V. and Jurka J.; RT "Zebrafish En/Spm DNA transposons."; RL Repbase Reports 8(7), 750-750 (2008). XX DR [1] (Consensus) XX CC EnSpm-2_DR is young family of autonomous En/Spm DNA transposons. CC The consensus sequence was derived based on multiple alignment of CC several copies of EnSpm-2_DR that are less then 5% divergent from CC each other. The consensus sequence codes for a 815-aa En/Spm CC transposase. Based on multiple alignment of diverse En/Spm CC transposases from plants, fungi and animals, we suggest that the CC En/Spm catalytic core is defined by a DDE triad. Its coordinates CC in EnSpm-2_DR transposase are defined by the 380-469-634 aa CC positions. Zebrafish EnSpm transposase contains also in its CC N-terminal portion a novel type of zinc finger motif, called CC ENS-finger: C-x2-C-x12-H-x4-H-x11-C-x4-C-x12-H-x4-H. CC EnSpm-2_DR transposons are characterized by 2-bp target-site CC duplications (TA - most frequent, CA, TG) and imperfect 30-bp CC terminal inverted repeats (6 mismatches). EnSpm-2_DR is a CC composite transposon: its 869-1079 portion is similar to the CC 172-388 portion of the TDR15 transposon. CC Contrary to plant En/Spm transposons with the canonical CACTA CC 5'-termini, zebrafish En/Spm transposons contain the CACAG or CC CACTG 5'-termini. Zebrafish En/Spm transposons differ also from CC their plant relatives by a target-site duplication length: it is CC 2-bp in zebrafish (3-bp in plants). CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 5074..7518 FT /product="EnSpm-2_DRp" FT /note="En/Spm transposase." FT /translation="MKCFVCETELNGSKLLVRHFRFVHGLVPGKSLRLKCV FT EAGCGSEFGTFSGFRKHLNIKHPEQCEQSCDPSGICETEEENSAAMFHQSD FT SAEDVATTSVMLKSNKNTLDMCASAVAQLKVAGLSQSAINGFVSSMEEVVF FT EIHSQAQDAALLCISSEDTVSKRKIEESFTNLENPFTLLNSEVKRQRLFTA FT KWGIVEPSEIVLGTRFDSRRNKTTGTFDQVVVTDKFSYIPILETLKTILKN FT PQLTDLFKPRHISKEGIYVDLSDAAHFKSSPLFSTEKDALQIQLFYDDFET FT ANPLGSKKGIHKLGAIYFSLRNFPPVFNSSLVNIHLCALFHAQDVKRYGFN FT SIIEPLVNDLKVLEIEGVKNPVSGRCIKGTIVQVTGDNLGLHSLFGFLESF FT GARYCCRFCLLEKHQFQSVFSEDSPEVELRTAEMHALHCETLQSDSTLPHV FT YGVKRACLLNSLKYFNTATNFAVDIMHDILEGVAQLEVKLVLQYIQHNFLS FT ANDLDGRVHAFDYGYNQQRNRPPKVKLLDGSNDLGLNAIQSWCLLRNLPLI FT FGDLIHKDDKHWHLLLLLLQIVNIVFSPVLTDGMTTYLKHLIIEHHRLFKM FT LFPQKNLLPKHHIMIHYPQCIRKIGPILHTWCMRYEAKHKFFKTQLKSFKN FT ITKTLAKKHQSCMAMYWESFSQYRLTLGPGKMVELREIRGGPEIASKLGVV FT ISEIVYAVKWIKHHGTEYRPDFIICTEVAHEMPVFCKIMTIAVKNENVLLC FT GTLMETVCFDEHYYAFTVRLHPDKVLTVVNVNELCYFKPFDLQRKYGTTDS FT VLHIVPYCHFMQT" XX SQ Sequence 7988 BP; 2354 A; 1348 C; 1543 G; 2743 T; 0 other; cacagtaaaa tccgctgtgt taattaagct gtgttaatct gcatttactc aaaatgagtt 60 aattttacaa gaaacggtga aagacacccc cttgtgttgg tgaaaatgat cagagttgta 120 ttctacgtcg ttaactttgc tctgttaagc tccgcccctc aatctacccc ctgcagagaa 180 cgtgttggag ttttgtcggc gccattttct gctccctcgt gcgtataggt aagctctctt 240 gtttaacgtc ttaaaacatc attttatcga tttatatgag taaatgtgct aatacaacgg 300 tatatgttgt actttgatga gttttgtttg ttaaaacatt gattatgagt cactttgtga 360 ccgtgtaatt cgatcgattt tatcagtgtg gtcaccttaa cgttaaagat gaagcttaaa 420 cgatgccaat gatcttactt aaattgtttt tgcgatttct aatcttactt ttgtattcat 480 ttagttttgt actgtgaact ctctgaagaa gacgttaacg ttaggctaca tttagcctcg 540 aagttttgct ttcaagagca actctgcaat gttccaggta actgttaggt cagtattttg 600 ttgtttcaac atattaagat ggtaacttac agagaaagtt atatgatata aacatatgta 660 gtgtaaaatt aactgtctct gtatggctgt ccagctcact gctggttaaa tattttcagt 720 caatttcaaa ttaagttact tcctaattaa cgtttctctg ttaagaggtt gcattttaac 780 agcttgttag tgttgtcatg atactgtttt tcggtagcaa ttaacgttac tgaaattcgc 840 tgttgagcgc ggtcctaaac cccgccgttt ggccgtgaag gtcatcagtt caccgcactt 900 aacgttaacc gctgtttacc gagtgcaaac accgatacag agacagcgga tcattttaaa 960 actgtgaatc atcagcggat ctcttaaatg tcagcccata gacacgccag ctcatcttcg 1020 cggcttttaa aacgatcaag tggtgtgcat ctgtgcttgt gcccggtaaa cagtggtgac 1080 atgtagtgat gggtcgttca taaacgattc gttcattttg aatgaatctt ttgtatgact 1140 cgaaaacgtt gagtcctttc ggagtgattc gtttaattgc acaattgtgc tggtggagca 1200 gaagattcgt tccttttcga gtcttctaag ttatgtgctt tgagtcgttc gttcatcgcg 1260 tgaaagacag aagccagtca tgcatttaga gccggaaaaa gttttgattc gttcatttat 1320 cgagtcttcg gatttgattc atctctcttc acgtgctgcc acatgataaa cgaacgactc 1380 aaaaaccaga agactcgaga ggtaaactaa aaatcatggc tccttttggc agcttaatta 1440 atgtgggtct gtgtgacgtg aacgaaccac tgacgtttga agacctgaag ctgataatcg 1500 tagacaaaag accaggtaaa caataaatga tgatttactc ttacagcatt ccaagttatg 1560 acttgtttgt gtaatgtggc aacgtctgtg ctagttgtag atgagtttgg aaacaactta 1620 ttacatttta ataatatttt ggctactcaa aaacacttaa atattagcgg aaatggatgc 1680 ttttaattcc agtaccaatt ggtaccaaaa ttcagtatca tgacaaccct tcagcacaga 1740 cattatttca aaactaagtg gattagaatc atttttttct atggtgaatg tatgttatac 1800 tttgacataa caggaaaatg attgttgaca tattggtgtt ttcttttccc ccaagtttct 1860 gtgcatataa ggctctacag atcacattgt gagtaaaacc tttaagtact tgtgtaatta 1920 ctttactaac tgggatgacc aattcttaaa taataaatat atgttaatgg ttataaaatc 1980 ttgcaaaaat ttataaacct atgtgccatt tttgtgtcat ttattaattt atgtatgatt 2040 aaagtccccc cccattggca cctgtatgaa acattctcat ttttcttaca gagcatgctg 2100 gtaaaggtga aatttgggga tgcacaaaag tttgttaaaa tagcccatct gaacctggag 2160 gactttttgt ctgcaggtaa aaaacaatta cacttttgtt attcagattt cttatttggt 2220 atgtggtcaa tttaaatact gatgtatatg ctgctgtgtt atgtttccag atgatattca 2280 gagttaccag tatataactg atcagctttc acattttatt attattatgc aatatattat 2340 ttatgtattg tttagccttt ttgaagtttg gggttccaaa tgtacctgag aacgtgaagg 2400 ttgtggatga atctgggact gaggtggatg gtgatgtttt tgaggagtta atcaaagatc 2460 cctccgttgg cgttctcacc ataaaacatg gtgcaggttt gtattttatc acattatttg 2520 actgtaatgt ttcacatttt ttgtcataga gacttcctca cccccccctc cttatattgt 2580 ttatgcaaac agatttggaa ccggcctctc cacaatcatc ttctatgcat ttagatcagt 2640 ctctgtcttc atccattgac tcccttgact ctcaggatac agtcattatt gaagaaggcc 2700 ctgacagtaa gcgaatgaag ctggatgatg aagcaaaaaa ggtggacttt ttttttctac 2760 ataaaaatga tttaaattga aaaaatatat aatttacatt taatttaaat tcattttttg 2820 ctttttgtta ggtgtgtttg tttaatttgt ttataacctg ccttttcccc tcttatttta 2880 aagctggtgc aatctgttct cgtcaagaaa cctggaggcg agagtataat aaatgagtat 2940 aacagaacaa aggctttgaa ggatgaaacg aggagaaaaa tggttaatat cctggcagct 3000 gatatgaccg aaaaaaatgg gtaatctgca aagtactgat ttgtgtgtac acatagctac 3060 ttgatatagt tgtggattta tttattttat tattttttta taattaatca ttattttgca 3120 gagggtcccc accaaggctg gtcaaagaaa aatatgccag aggaatcgtg gctttgttcc 3180 cttacctcag tgacccctat tccaaaaatg gctttgtaag tcttgcacct ttttaaaatt 3240 ttacatttgt gttgtcacgt tttttttttt tttacacacc tatactacag atggtatatt 3300 ttgctcctgt gtcccccttt ttacccattt ttatctcttt aggagcatta ctatgatggt 3360 gagagtggca ccggatattt ggcatggaga atcaaaacta ttcagagaaa cctggctaaa 3420 gaacgacgag gatcatttga aggtacaaaa tgaacttcag aaatgtagca atctaatttt 3480 ctttgtgctt ggatgattat tttttttatt ttgtataaac aaacagttaa aattagtttt 3540 gttgtttgtt aggtcaagtg tcgtatgctg agcggggatc cggtggacca actgtagcaa 3600 ggcattcaaa gtttactcca gagattgtcc ttagtgagga tgaatgcaag gaagcaattg 3660 cattcatgaa ccattctgct gatgaggatg ccatcaagaa gaaaatgaaa ctgacatttg 3720 actatcgtcg taagatggtt cttgatccca tgcagtccag tgatctattg acaatttttc 3780 ctcgtttcaa ggacattaaa ggcttggtaa gtgtttataa tcatatttga taaatggttt 3840 gttgtataaa aatcaaatag tcagtggtga catgagttta aagatttgcc ctaagcatta 3900 caataattaa aatgttatgc ttattagatt gagcaagact ttgttctgat gtttggagaa 3960 ggagtgtcag gcaagctgct agagaagtgg acaacagcat tcaagaaaaa agtcattcag 4020 cagtgcaaaa gacttcctgc aaccagcgat ttacaggaac ttttgctagc agcagaatct 4080 cttcgtgatg acactgaaga agatgttcat attggtaaga attttgatta atgtttaggt 4140 aatgtttcat accatggtta ttataatatt taaaggaacc caagcatgaa tgaagttagc 4200 catagttaca cactgtcagt ttaattttta attgttcata ttcaattgga attcattcat 4260 atatatgtgt gtgtattatt gcaaatcata ttcatcaatt tttttatatt ttggtagttt 4320 ggatcaaagt tttatcagat ttacaaataa ttggattcaa actaagattg ttaatgtagc 4380 tttagtcact ttgtatatat ttttggcttt tttatgtagt taacttctag ttttccacat 4440 tgttttgctg cgagttaaca tttattatat taaccaattg tttgttacct gtttaaacag 4500 tttgggacag cgaccttgct tcagtactac tactgttgca tctgattcca ccatctgctc 4560 aaggccggaa gagaccaggg aaagtttctg cttctcaagc agaaaatcat cttgttgtct 4620 tcaagaaggt ttgtttgctg gttttattta atgattgttc tgtcatagtt tgttattatt 4680 taacttgtta gtgatgtttc ttttttactt gcttgctttt tttttttttt agagtggaac 4740 aaacatcgag gagcatctgc aagacatctc tgcgagcgct cagccctatc tcctggctgt 4800 aggtcctcag aaggattcaa ttcaccagtt cttcatcatc cttgatcagc atgccattcc 4860 atgcaagtcc acctcttctc ttggtgcctt tgatgaactg tttaaggcac attatgtttt 4920 tggcacctct tacaacacca tgctgcacaa catgtacact ttcattcaaa ccactgtgta 4980 caacatagat gttgggaaag tgaaagaaag tcctcgtgtt gctgaggtta gggcaaggct 5040 gcttaactag gtgcctaaag ttccagtgca ctcatgaagt gttttgtttg tgagacggaa 5100 ttaaatggtt caaaattgct tgttaggcac ttcagatttg ttcatggttt ggttcctgga 5160 aagagccttc gacttaaatg tgtagaggca ggatgcggtt ctgagtttgg taccttctca 5220 ggttttagaa aacatttgaa tataaaacat cctgaacaat gtgaacagag ttgtgaccct 5280 agtggcatct gtgagacaga ggaagaaaat agtgctgcaa tgttccacca gagtgacagt 5340 gctgaagatg tggccacaac atctgtaatg ttaaagtcta ataagaacac tttagatatg 5400 tgtgcgtctg ctgttgcaca acttaaagtt gctggtttaa gtcagtctgc aataaatggt 5460 tttgtttcat ctatggaaga agtggttttt gagattcata gtcaagccca ggatgcagct 5520 ttgctctgta tatcttctga ggacactgtt agtaaaagga aaatagagga atcttttaca 5580 aatttagaaa atccatttac tttgttaaat tcggaagtca agcgacagcg actgtttaca 5640 gcgaaatggg ggattgttga acctagtgaa atagtactcg gcacaagatt tgacagcaga 5700 agaaataaaa ctacaggaac atttgatcaa gttgttgtga ctgataaatt ttcttatatt 5760 cccattttag aaacactaaa gacaatttta aaaaatccac aacttaccga tttgtttaaa 5820 cccaggcaca tttcaaagga aggaatttat gtggacttaa gtgatgcagc acattttaaa 5880 agcagtcctt tattttctac agaaaaagat gccttacaaa ttcagttatt ttatgatgat 5940 tttgagaccg ctaatccact gggctcaaag aaaggtatac ataaattggg tgcaatctat 6000 ttttcattaa gaaatttccc accagttttt aactcctcac tggttaatat tcatttatgt 6060 gccctctttc atgcacagga tgttaagcgc tatggtttta actccataat tgagcccctg 6120 gtcaatgatc tgaaagtgct tgaaattgag ggagttaaga atccagtgtc tggaagatgc 6180 attaagggta ctatcgttca agtcacaggg gataacctag gtctgcacag cttgtttggc 6240 tttctggaat catttggggc tcgatattgt tgtcgttttt gtctgcttga gaagcatcag 6300 tttcagtcag tgttctctga agatagtcct gaggttgaac taagaactgc tgagatgcat 6360 gctctgcatt gtgaaacttt acagtctgat tctacactac ctcacgtgta tggtgtcaaa 6420 agggcatgct tattaaattc cctcaaatat ttcaacacag ccaccaattt tgctgttgat 6480 ataatgcacg acattctaga aggagttgcc cagctagagg taaaacttgt tctgcagtac 6540 attcagcata attttctgag tgccaatgat cttgatggta gagtacatgc ttttgactat 6600 ggttacaatc agcagagaaa ccgtcctcca aaagtcaaat tgcttgatgg aagcaatgat 6660 ttaggtttaa atgctattca atcttggtgc ttattgcgca atttgccttt aatatttgga 6720 gatttaatac ataaagatga taaacactgg catcttttgc ttttacttct gcagattgtt 6780 aacattgtct tttcacctgt cttaacagat ggcatgacca cttatcttaa acatttgata 6840 attgaacatc atcggctatt taaaatgtta tttccccaaa aaaatcttct gccaaaacat 6900 cacatcatga tacattaccc acaatgtata aggaaaattg gtccaatttt gcacacatgg 6960 tgtatgcgtt atgaagctaa gcacaaattc tttaaaacac aattaaaaag tttcaaaaac 7020 atcaccaaaa cactagccaa aaagcaccag agttgtatgg caatgtattg ggagtctttt 7080 agccagtaca gactgactct tggcccagga aagatggtgg aactcaggga aattagaggt 7140 ggcccagaga ttgcttcaaa attaggtgtg gtaatatctg aaatagtcta tgctgtaaag 7200 tggatcaaac atcatggcac agagtatcgt cctgacttta ttatctgcac agaagtagca 7260 catgagatgc cagtgttttg taagatcatg actatagctg tcaaaaatga aaatgtactg 7320 ttatgtggca cactaatgga aactgtatgt tttgatgagc attactatgc ttttacagtc 7380 agactgcatc cagacaaagt tttaacagtt gtcaatgtaa atgagctgtg ttatttcaaa 7440 ccatttgatc ttcaaagaaa atatggtaca accgattctg tacttcatat tgtaccatat 7500 tgccatttta tgcagacttg aaatttttgc actgtatttt tgtaagcctg aatgttttaa 7560 taaaagtttt aactgatatt gagtttcgtt ttctgtttta tttgttgttt aatgtttttt 7620 tagtatgttc agtagcatta aaaaattgca ccagagtgaa ataatgacag ttaaagcaac 7680 accatatggg ttgtaaatca actctttatt agtgatgtat taacatttga ccggagttga 7740 tttctgtttg gtgttaaatt aaggaacacc agacagagtt aatttctgtt cagagttaaa 7800 caccttaaca attttcagtg ttgaccagtg ttaaatttta actcaaaatt gagttaattt 7860 tactctgaaa atataacaca atgaaaagtg ttaatttaac accaattctg agtgggacca 7920 aatacactca cgtgcagtgt taaatttaac tctttaagag tttattcaac acagcaaaat 7980 ttactgtg 7988 // ID BEL3-LTR_DR repbase; DNA; ZEB; 644 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL3-I_DR; KW BEL3-LTR_DR; BEL3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-644 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1358-1358 (2008). XX DR Genome; chr1; Positions 36987960 36987317. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 644 BP; 176 A; 132 C; 123 G; 213 T; 0 other; tgtcagaaag cccactagga accgaaaata tttagcgagt ttcacacgga ctgttattag 60 cggtgcaact tcccatactc agatgccggg ccagatcaca atgcgccctt ttcttccgac 120 atgttgtgcg ttcatgcacc aaatgtgagt aataatgtga caatcactga gtgaacattt 180 actcctttca tataagagac tgtttgttat atatgacgac gatcgcgccg cgatcgcggg 240 aaaatatcgc tcgcgctagc cgcgagagaa tacagtgtgc aatatatgct ccgtcattcc 300 gcctattgcc ttttactgcc atttaactaa ttagtttgtt tgttataacg tatatatttg 360 ttcctttatt atcactgttt aatcattgta tatatggtgt gatttctcat tgaagtgaat 420 gtgtaaagat accttatatt tatactgtat tcctcatctg ttctacagac cacaacatac 480 ttggaaagtt tgatggcaat gactgcttaa taattgttat taaagaagtt tgaagaggat 540 tacttctttg ttgcctgatt ctctgaccgg aatcaatgtc catatttgtc cgccagtaga 600 actgtaaatt tactccgttg acagaggtat agtaccccgt caca 644 // ID Gypsy45-I_DR repbase; DNA; ZEB; 5192 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy45_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy45-I_DR; KW Gypsy45-LTR_DR; Gypsy45_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5192 RA Dib M.R. and Naveira H.F.; RT "Gypsy45_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 821-821 (2007). XX DR [1] (Consensus) XX CC Gypsy45-I_DR is an internal portion of the Gypsy45_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy45-LTR_DR. CC Gypsy45_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1585-aa polyprotein CC Gypsy45_DR1p (pos. 339-5093) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC three independent insertions bearing at least 85% homology over CC at least 1000bp. XX FH Key Location/Qualifiers FT CDS 339..5093 FT /product="Gypsy45-I_DR_1p" FT /note="ORF." FT /translation="MEFDLTTFSLAPTIEAFNKCRKKDLFLIADFFNISVQ FT RDVTKQTLKEELFGKLVEEGILPKDSDDSVEGQDEVVEQMSEMDVNHDPVI FT AQEAEMAEQITDLGVNYDHVMTQDPVIALKLKELDLLIKKQECEAEMIKLR FT VVEKQADRDIQLGKLELEAKRLALTPVPMPRSRPVSVSTPVTSANSTVTED FT FSRVPSRESFDISKYIRLVPPFREAEVDSYFVAFERIASKLSWPKDMWALL FT LQCSLTGKAQEVCSSLSIENSLDYDVVKAAVLRAYELVPEAYRQKFRSHSK FT TVKQTYVEFVREKRVLFEKWCFSSKISTLADLQELILLEEFKNCIPSNIAV FT HLNEQKVSSLSDAAVLADEFVLTHRNIFPNTTVKNSPVNVESSVVNRFTRA FT AKSEMSPKMTPKPAGADKRVCFFCLDPNHLIADCRAWKQKNTTIKPKNVAL FT VEKPHEGNSPVEESYRPFLFEGSVSLSPDSKFKTVTILRDTGSVQSFIAAD FT VLPFSAESFTGNDVLIRGIEMRCVNVPLHSVYLKSEVVSGPVNLAVREQLP FT VDGVGLILGNDLAGGIVFPRPVVSHTPNFMQKPDLAEKFPSAFPACAITRA FT QSKRFEDVVDVSDSFMVKEPDSLECVLSVTPDPDPDEICLDDTPTVPLEVS FT REHLAAAQKADPSLTKCVLAADGIKHAPDVGVVYFWENGLLMRKWKPHEED FT LSWQEVQQIVLPSGYRQQVLKLAHENVFSGHVGVTKTYNHIVKYFFWPGLK FT SEVAKFCKSCHTCQVDGKPNQKIPPAPLCPIPVITDPFERLIIDCVGPLPK FT AKSGHQYILTIMCAATRFPEAVPLRTLKAKAVVRELIKFCTTFGLPRIVQS FT DQGTNFTSKIFKQTLKELGISHQTSSAYHPESQGALERFHQTLKTMLRRYC FT TETGKDWVEGLPFLMFAVRESVQESLSFSPAELVFGHTLRGPLKLLSEQIL FT NPRSNPVPVDDYVSSIREKLQNAQAIAKRHLSVAQSKMKCHYDKKAVKRDF FT QPGDSVLVLLPTPGSILHSKFAGPYLIERKLNETNFLVATPDRKCKSRVCH FT INRLKAYVDRHHLGKAQDKSAVLPVSTTATAVISSNFSLEEDDLVSKDVSA FT SCTRLNNSVIMQDLQHFLSHLTAEQFKDITGLLHAFPDLFNDVPGRTAVCV FT HDIDVGDAVPVKQHPYRVNPRKREIMQAEVKYMLDHGLAEPSQSPWSSPCL FT LVPKPDNTFRFCTDYRKVNNLTKPDSFPLPRMEDCVDRVGLAKYVTKLDLL FT KGYWQVPLSQRASEISAFVTPDSFLQYKVLAFGMRNAPATFQRMMLQVLSD FT VTNCEVYLDDVVVYSDNWSDHVKTLEKVFKYLTSASLTLNLAKCEFAKGVI FT TYLGKQVGQGLVKPVDAKITAIIEFPVPCNKRELRRFLGMSGYYRSFCPNF FT SSLVSPLTDLLSTCKKFEWTNDCELAFNGAKDLLCQAPVLSAPNFTKPFSL FT QVDASATGAGAVLLQEDEAGIDHPVSYFSKKFSKTQQNYSVIEKEALALLL FT ALQHFEVYLGNSPQSIIVYTDHNPLVFLSRMSGSNQRLLRWALTIQEYNLD FT IQHKRGSENVMADALSRV" XX SQ Sequence 5192 BP; 1441 A; 1078 C; 1162 G; 1511 T; 0 other; aagtgggggg ctcgtccgat tggtaatcaa gtgggggctc gtccgggatc ggtaatcaag 60 tgagggctcg tccgtctgac tggttaacaa gtgggggctc attcgtccga ccggtaaaca 120 agtggggtct cgtccatccg gtaaaaatta ggggctcgtc caaccggtat caaattgaaa 180 gctcgtcaaa ctggtaacaa attggagggt taatttgacc agcaacaatt tggggactct 240 gctatctctc aagtggttgg taagtactct cttttctcaa acattttgac tgttttcgac 300 tgtaggtggg gaggacagtg cagagaaagc attgaaaaat ggagtttgac ttaaccactt 360 tttctctagc acctactatt gaggctttta ataagtgtag gaaaaaggat ctgtttttga 420 ttgcagactt ctttaacatt tctgtacaga gagatgttac caagcaaact ctaaaagagg 480 agttgtttgg aaaattagtg gaggaaggta ttttgcctaa agactctgat gatagtgttg 540 aagggcaaga tgaagtggta gaacagatgt ctgaaatgga tgttaaccat gaccctgtga 600 ttgcgcaaga agctgaaatg gcagaacaga taactgattt gggcgttaac tatgaccatg 660 tgatgactca agatccagta attgctttga aattaaagga attggatctt ttaattaaga 720 agcaagagtg tgaggcagaa atgataaagc ttcgggtagt tgagaaacaa gcagaccgtg 780 atatccaact aggtaaattg gagttggaag caaaacgttt agcattgaca ccagttccca 840 tgccacgttc taggcctgtg tcagtctcca cgcctgtaac atctgctaac agtactgtaa 900 cagaagactt ttcacgagta ccatctagag aatcttttga tatcagtaaa tacataagac 960 ttgtacctcc attcagggaa gctgaagttg actcctactt tgttgcattt gagcgcattg 1020 caagtaagtt aagttggcca aaagatatgt gggccttatt actgcagtgt agcctcactg 1080 gaaaggctca ggaagtttgt tcgtctctgt caattgaaaa ctctcttgac tatgatgtag 1140 ttaaggctgc tgtgttgcgg gcttatgagt tagtccctga ggcctacaga cagaagtttc 1200 gtagtcattc gaaaacagtc aaacagactt atgttgagtt tgttagagag aagagagtcc 1260 tttttgaaaa atggtgtttc tctagcaaaa tttcaaccct ggcagattta caggagctaa 1320 ttttgctaga ggagttcaaa aattgtattc cctctaatat tgctgttcat ttgaatgaac 1380 aaaaagtttc atcgctatca gatgctgccg tgttagctga tgagtttgtg ttgactcaca 1440 ggaatatttt tccaaacacg acagtcaaaa attcgcctgt aaatgttgaa agttcagtag 1500 ttaatcgctt tacacgtgct gctaaaagtg aaatgtcacc caaaatgact cctaaaccag 1560 cgggtgctga taaaagagtg tgtttctttt gtttagatcc aaatcatcta attgcagatt 1620 gtagagcctg gaaacagaag aacacaacca taaaacccaa aaatgtagct ctagtggaaa 1680 agccacatga gggtaattct cctgttgagg agagctatcg acctttcttg tttgaaggtt 1740 cagtttctct gtctccagac tctaaattta agactgtaac aatcttgcgt gacactggtt 1800 cggttcagtc ttttatagca gcagatgtat tgcccttttc tgctgaatct tttactggta 1860 acgatgtgtt aattcgtgga attgaaatgc gctgtgtgaa cgttcccctt cactcggttt 1920 atctaaagtc agaagttgtg agtggtccag tcaatttagc tgtacgtgaa cagttacctg 1980 ttgatggggt gggtttaatt ctaggaaatg accttgcggg tggcattgtt ttccctagac 2040 cagtggtgtc tcacacccca aattttatgc aaaaacctga cttggctgag aagtttcctt 2100 ctgcttttcc tgcatgtgcc ataacccgtg cccagtcaaa gagatttgag gatgtggtgg 2160 atgtatcaga ctcctttatg gttaaagaac ctgactctct tgaatgtgta ctctctgtaa 2220 ctccagaccc tgatcctgat gaaatctgcc ttgatgacac ccctactgtc ccattggaag 2280 taagcagaga acaccttgct gctgcccaaa aagcagaccc ttcccttact aagtgtgtac 2340 tggctgctga tggtatcaaa catgcacctg atgtaggtgt agtgtacttc tgggaaaatg 2400 gattgttgat gcgcaagtgg aaaccgcatg aggaagattt gtcctggcaa gaagtacaac 2460 aaattgtttt gccctctggt taccgacagc aggtattaaa acttgcccat gaaaatgtgt 2520 tttcaggaca tgttggtgtc acaaaaacgt acaatcacat tgtgaagtat ttcttttggc 2580 ctggtcttaa atcagaagtt gcaaaatttt gtaagtcatg tcacacatgc caggtggatg 2640 gtaaacccaa ccagaagatc ccacctgcac ctctatgccc tattcctgtt ataactgatc 2700 catttgagcg gttgattatt gactgtgtgg ggccattgcc caaagccaaa tctgggcatc 2760 agtacatttt gactattatg tgtgctgcca cccgttttcc agaggctgtt ccattgcgga 2820 ccctgaaagc aaaagctgtg gtaagagagc taattaaatt ctgcacgaca tttggcctac 2880 caagaatagt ccaaagtgac caaggtacaa atttcacttc caagatcttt aaacagactt 2940 taaaggaact aggtataagc catcagacat cctctgccta ccatcctgag agtcaaggtg 3000 ccttggagag atttcaccag accctgaaaa ccatgctacg ccgttactgt acagaaactg 3060 gtaaagactg ggttgaaggt ttaccttttc taatgtttgc agtaagggag tctgtgcagg 3120 agtccctcag ctttagccca gctgagcttg tgtttggcca tactctgaga ggtccactaa 3180 aactgttgag tgaacagata ttgaatccac gttccaatcc tgtacctgtt gatgactatg 3240 tgtcctctat ccgcgaaaag ctgcagaatg cccaagctat tgcaaaacgc catctctctg 3300 ttgcccaatc aaaaatgaag tgtcattatg acaagaaggc tgtaaaaaga gactttcagc 3360 ctggagattc ggtcctagtc ttacttccaa ctcctggttc catcttgcat tccaagtttg 3420 cagggcccta tttaattgag agaaagctaa atgagaccaa ctttcttgtt gccaccccag 3480 accgaaaatg caagagccgt gtttgtcata taaaccgact caaagcctat gtagatcgac 3540 atcaccttgg aaaagctcaa gataagtctg ctgtccttcc agtttccacc actgccactg 3600 ctgttataag ctcaaacttc tctttggagg aggatgatct tgtgagtaaa gatgtctctg 3660 cttcttgtac aagactgaac aattctgtta taatgcaaga tcttcagcac tttctttctc 3720 acctgactgc tgagcagttt aaagatatta ctggtttgct gcatgctttt cctgacttat 3780 tcaatgatgt gcctggccga actgcagtat gtgtgcatga cattgatgtt ggagatgctg 3840 taccagtgaa gcagcaccct tacagagtga atcccagaaa acgggaaatc atgcaagctg 3900 aagtgaagta tatgttggat catggcctgg ccgaacctag tcaaagtcct tggagttccc 3960 cctgcctcct tgtcccaaaa ccagataaca cttttagatt ttgcacagat tatagaaaag 4020 ttaacaatct tactaaacct gactccttcc cccttcccag aatggaagac tgtgtcgatc 4080 gagttggttt ggccaagtat gtcacaaagt tggatctact aaaaggttac tggcaagtgc 4140 cactcagtca acgtgcatct gagatttctg catttgtcac ccctgacagt tttctacagt 4200 acaaggtcct ggccttcggg atgcgaaatg cacctgctac ctttcagcgt atgatgctgc 4260 aggttctgtc agatgtaacc aactgtgaag tgtatctgga tgacgttgta gtatattcag 4320 acaattggtc tgatcatgtg aaaacccttg aaaaggtgtt taagtatttg acatcggcat 4380 cattgaccct aaaccttgct aaatgtgaat ttgcaaaagg tgtaatcaca tacttgggca 4440 agcaagttgg tcaaggatta gtgaaacctg ttgatgccaa aattacagcc ataattgaat 4500 tccctgtgcc ctgtaataag cgggagctga gacggttcct tggaatgagt ggatactaca 4560 gaagtttttg tcccaatttt tcttctctag tctctcccct cacagacctg ctcagcacat 4620 gcaaaaagtt tgaatggacc aacgattgtg aacttgcttt taatggtgca aaagacctcc 4680 tttgtcaagc ccctgttctc tcagctccaa actttaccaa acctttcagt ctgcaagtgg 4740 atgcaagtgc taccggagct ggggcagttc tgctgcaaga ggatgaggcc ggcattgacc 4800 atccagtgtc ctacttttct aaaaagttct ctaagaccca gcagaactac agcgtaattg 4860 aaaaggaagc tctagccctg ttactggcac ttcaacattt tgaagtttat ttgggaaaca 4920 gtcctcagtc cattattgtg tacacagacc ataacccttt ggtgttcctt tctcgcatgt 4980 ccggttctaa ccaacgctta ttacggtggg cccttaccat ccaagagtac aacctggaca 5040 ttcagcacaa gaggggaagt gaaaatgtta tggctgatgc tctttctaga gtgtaaatgt 5100 ggggttcatt tctgacccag gtaccttaag caatgtgata aggtaaaacc atcttgattt 5160 acaaaccttg atttgtattc ttagtggtgg gg 5192 // ID DNA9TA1_DR repbase; DNA; ZEB; 3333 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE DNA9TA1_DR is a composite nonautonomous DNA transposon - a DE consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA9TA1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3333 RA Kapitonov V.V. and Jurka J.; RT "DNA9TA1_DR, a composite nonautonomous DNA transposon from RT zebrafish."; RL Repbase Reports 2(4), 11-11 (2002). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed based on 7 copies CC present CC in sequenced DNA representing 1% of the genome. Approximately CC 700 copies of DNA9TA1_DR are expected to populate the complete CC zebrafish genome. The element was active recently, its copies are CC only 2% divergent from the consensus sequence. CC Given its length, 9-bp terminal inverted repeats, the TA targets CC site duplications, DNA9TA1_DR is a quite unusual nonautonomous CC DNA CC transposon. Its classification is not certain yet, although it CC is expected to be a member of the Mariner/Tc1 superfamily. CC DNA9TA1_TR carries also remnants of the HARBINGER2N_DR DNA CC transposon (positions 2186-2311). The genome harbors also copies CC of CC an ancient transposon which is ancestral to DNA9TA1_TR and does CC not CC have insertion of HARBINGER2N_DR. XX SQ Sequence 3333 BP; 1081 A; 617 C; 660 G; 975 T; 0 other; cacagcaaaa tatggggacc caaaacaact ctatgggtgt taatttcaac actttgaaag 60 tgtctcatgg ggtccactca aaacagagtt aaatcaacac tttcaaaagg gttggcacat 120 tgacaccgaa gtaagggtta attttaactc tgggcagagt taaaaaatgt aactatattt 180 aacaccgcct ggtgtaatat attgttattt tattcaactc tcttgagtgt aaatatggta 240 aaaaggtcaa atagactgta aattacaaaa gaaaaaacat tttatttgaa aacattcacc 300 acttcaacaa ttcattcagt ttttaaaatg caacaatgtc aaatatgctt taaatgtttt 360 attagtacag acagtatcaa caaaatatgt atgaccagtg ctcaaaaagg ctgtttcagt 420 cctttggtcc aaacttgatg tttcatcaga gcaatttgaa agttcaatat agcgtcactc 480 atagttttct tctgaacgca tagttttctt ctgaaattac attttaaaca acttggcgtg 540 caaatctttc acttctggtg tttccttggt cctccatatc aaacacagca gtttaaatga 600 aggtacaaat gctgtaaact tgtgtgaaaa caaactggag ctaggaaatc tcatcaagca 660 tgtcctgtga atgaagtgct tcccagccac aggatgacct ttttatccat gacgatgtag 720 tagctgctga tcgctcgtct ggtggttcct gatgcacgga tatagggtga tgctcatcta 780 agaactcttc aagactccag catgactaag aaaaatgttt gaaaacaaat tgcaatcaaa 840 ttctatgata tatttaaaag aacatttaaa gtaaataaaa cattcaagaa atctaaactc 900 accttttgaa aatcttcatg atgatccaca acttgactct cccagctggt tgaggtgaca 960 ggatatggaa aagtagaaaa aacacataca tcactgttgg atctccaaaa acaattaggt 1020 taaccactat gactagaact ggaaaaacag gcagctgtct gtccagaatc ctgaatttaa 1080 cacaacactt ggagcaaaat ttagaggagc ttaaaatctt ttatatcatt tagtgatgct 1140 gacttcccca aaatattggc cacagtgtaa aaaatattca taaacttaca gttttctgta 1200 agcgtgtgcg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgcgtgcgtg 1260 tttctttacc agattcaact tgttgggccc tttcaggtcc agtattgacg ccacctttcc 1320 tggagtctag cagatgtttc aggtccaaat ataatctaaa atattaggat aagcagaaga 1380 ggaaaagtgc aaagcaaaga gcaaaacatt atttaagtaa cagttcaccc acaatacaaa 1440 acttatcacc ttcatgtcat ttgttattct ctaatgctct aataccctct cagcttattg 1500 tgaaacaaat acagaacttt atcaaacata ccttcattat tatcaccaca tatgaccaag 1560 aaggtgtgct tggtcatttc accaactctg gaaatacacc atccgtctct aatcagccac 1620 cgtcacaggg tcttgtgtta ttgtatagag aactgtggct gttaaaaaac aacaaaacat 1680 tagtatctaa cacatatgca taaccttcag ataaaaaaga gagatgaaca tcacaaagta 1740 tgcctcacac tttttcagat atcttttgat tcagatgttg attattgata ataaatctac 1800 aaatcaaacc tgtatcatca ttaggctgct caaatatgaa tcagttgatt aattttacag 1860 acaggtggct gtttacaacg cactaaattg tctttaataa aacggaaatg ttgtgtacag 1920 tacatgctaa gtttagccta tagttttaag cacaatatca tgcgggagaa aaagcttgac 1980 taagatgttc ctgactacag aaatagccta acttactaac gtcagacatc ccgagttggg 2040 aaaagtcatt ttcgggggat acagagttga tttgcgggag gtagcgggag agatgagtac 2100 ctgaagagca atgggatgag ttgcgcgcga cgtacctaaa gagcggtggg ggtgacttgc 2160 gcgcgacgta cctaaagagc ggtgggggtg acttgcgcgc gacgtacctg aagagctggg 2220 agggatgcgg tggagagggg tgtgcaaagt gtttaatgac cgggcgggag acgcgcgatt 2280 tccgggagat tatcattcat ttgcgggcat ctacttgggc atctgggagt ctctgtgcat 2340 ttgcgggata acgttaacgt tagtcccgca acttccggga gacttctgta acgtcaaatg 2400 tctgagaaag tgcaaacgcg gtcatttaaa gacattaatg ttaacattcc gactttaatg 2460 gttgtcaaca cttaatataa ttcaagcgta cgttatcacg cgagtctatg gactaacggt 2520 atatggacgg ccacgaatga accagtttaa aagggccgcg gtgtttaaaa taaccaaatt 2580 aacgtacacg aataacaaac aatcatatgt tttagtcagg aagcctttta caagtaagca 2640 tatgttcatt tactcaccag atatgtttta gaaactctcc agagcttatg gaaaccgtcc 2700 tgtgttgccg ttagcatccg ggcagagtag gctaggctgc tccggggatt ccctcgctag 2760 tttgcttcga attaaaggag aagtgtcgat tttattttac agatgggtaa caacgcacaa 2820 aatggcatta agcgtgacag aaatgtgttg aacatgtaca tttgatgttt ttatgcacta 2880 tgtatgcgtg agcatatata aaaacattct tgaaaagaag tcaatagtaa tgcagttaag 2940 ctagatacac aaacgaccat gaatgaaccg cagaagttta aaattgtcac tccattctaa 3000 agttattaac ttaacaatat gtttacaact gtatttcctt aaagaaagtc agtaaaatac 3060 caacatttag gtagtttgac tcaccagttg ctgttctaaa cctcagatgg aaaatggcgt 3120 ctggaaaatt cttcagtgac tggggctgca tgaattcccg gatgttggaa ataacaccac 3180 caagtgttga aaagataact ccttgatttc gcactgaata aaatcaattc taactcttat 3240 tatattcatt tatcaaaatc taactcttta acgtcaacac tttactctga aatgcttcaa 3300 caccgagaat ttaactctga tgaatttgct gtg 3333 // ID Gypsy135-LTR_DR repbase; DNA; ZEB; 579 BP. XX AC chr14; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy135-I_DR; KW Gypsy135-LTR_DR; Gypsy135_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-579 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1532-1532 (2008). XX DR Genome; chr14; Positions 9614285 9613707. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 579 BP; 97 A; 149 C; 109 G; 224 T; 0 other; tgtcggggtt ttttgtgact actttgtctc cctcttgtgt attatcattt gtatgctgtt 60 tcacctgtta tcaatttgct tgttttcccg tccccattcc tctcacctgt tctccctgat 120 tattgtaact atttagtctc cccttctccc ttgttctgtg ctagtgaatt gattctgtgt 180 tcaaagagtt atgtgtgcta ctggttctcg ctgttctcaa gaaatctacg catttaaatc 240 cgtctcgtct cgtctagttc agtgttgttg atctttacca ctcatcatcc gtttatctgt 300 gctctggctc acctctgctc tgcccacagt ccacgatcgt gtgaggtctc atcttcccgg 360 ctccctgagg tctgttcccc acttttctac taacccaaga gggtctattt gagtttgatc 420 agtcctacta cgcccagagg agggtcgatt actgactgcc tgtgtttttg gagtgattgt 480 tttcctactg cccagaggag caaaagattg ttgatttatt tctgttcaat aaatcctttg 540 aacttctccc ttgcgcttgg gtctccatat ttcctgaca 579 // ID BEL30-LTR_DR repbase; DNA; ZEB; 640 BP. XX AC chr21; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL30-I_DR; KW BEL30-LTR_DR; BEL30_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-640 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1404-1404 (2008). XX DR Genome; chr21; Positions 25876106 25876745. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 640 BP; 169 A; 109 C; 124 G; 238 T; 0 other; tgtaaagggc cacaagaggc cgctataggt tgttaatttt tcactcacct tattcaggtt 60 aatggcttta aagcaaaccc cggaacctgt ataattcata ccatgccaca ctcatcaaaa 120 cgatctcatc ccagaagctt ggtgattaaa ttcccgtgag aaatgtaagt taatgcaagt 180 tctctaagca cgatcgtgtg cattataatg ttgttggttt gtttttgttg cattatctca 240 tgtttacatt cgtgtgttta gccagatgac catcacatta cttagaagct acgtgtttaa 300 gctagcatcc cttggtgagc ggagtgaggt gtgtaattta tgtttgtgtt tcaaattatg 360 atgttaatgt ctaaatggtt tacttttgga aacttttgtg ttttaaaatt gtcaactaag 420 tgaattattg tgtatatgtt gctttacata gctcctttgt tgttctgtgt gcagctgcat 480 gataatgcac tgtaaatttg tagtaattta ttgtaaattt gctttgcaat ttatgtgttt 540 acttccagtt ttcacggtgt ttatgctggt tccaaaataa atcactttac acccgtcaga 600 gactctctgc ttcgtttaaa gaaaatcaag ggctgctaca 640 // ID ERV1-LTR_DR repbase; DNA; ZEB; 530 BP. XX AC AL713862; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE ERV1-LTR_DR is a long terminal repeat of the ERV1_DR endogenous DE retrovirus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1-DR; KW ERV1-I_DR; ERV1-LTR_DR; LTR retrotransposon; class I ERV; env; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-530 RA Kapitonov V.V. and Jurka J.; RT "The ERV1_DR endogenous retrovirus from zebrafish."; RL Repbase Reports 2(5), 12-12 (2002). XX DR Genbank; AL713862; Positions 67385 66856. XX CC ERV1-LTR_DR is a long terminal repeat of the ERV1_DR endogenous CC retrovirus. XX SQ Sequence 530 BP; 144 A; 105 C; 97 G; 184 T; 0 other; tgtggaaata taaatttcta gcatctaatt ccaaatgaac aatactctac gtataactga 60 aaagttattt tctttggttt tttttaatta aatcattgca tggacgtgcc atataggagg 120 gacatttaac cttgccttgc tgactacaga cttcatttgt atgctaaagc gatgtaatta 180 atggtttcac agcatggctg tatctattgt attccctgtc tccagaacat catttgcaaa 240 ttgttgtaat gatgtactta tctattgatg agatcagcat ggcacccaga cttcatttgc 300 aaccatggca cccagacttc atttgcatgc tttgttttac tgtcccccat gactgacccc 360 acctacatgc tgtaagatgt ataaaatgta ttatgttttg ctgtggagtt gagagtgctt 420 tcgatgacgc cacagccacg ctgagtttcc tcttattaaa aaggattctg ctttgaagac 480 aaccgaaaga ttctccctgg ttttttgtgg gctcttagaa gtttacaaca 530 // ID LOOPERN6_DR repbase; DNA; ZEB; 1291 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE LOOPERN6_DR is a nonautonomous DNA transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW Dr000222; Dr000286; Dr000624; LOOPERN6_DR; KW nonautonomous DNA transposon; piggyBac superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 138-1 RA Bao Z.; RT "Dr000222, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 507-1057 RA Bao Z.; RT "Dr000286, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [3] RP 1037-1283 RA Bao Z.; RT "Dr000624, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [4] RP 1-1291 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN6_DR, a nonautonomous piggyBac-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 21-21 (2002). XX DR [4] (Consensus) XX CC Several hundred copies of LOOPERN6_DR are expected to populate CC the CC zebrafish genome. LOOPERN6_DR copies are ~15% divergent from the CC consensus sequence. CC It is characterized by imperfect 15-bp terminal inverted CC repeats and the TTAA targets site duplications. CC Presumably, this old elements is a member of the piggyBac/Looper CC superfamily. It could be a composite DNA transposon that was CC dependent CC on the piggyBac-like transposase. It has long ~400-bp inverted CC repeats (positions 156-551 and 870-1276). XX SQ Sequence 1291 BP; 425 A; 183 C; 206 G; 462 T; 15 other; cccttatgtg ctgttgggga tgttttcatc cactctgggg tgattttgag tcttaatttg 60 gccataactt ttctgtgttt cagctagcag aakgattttt ggtgacaatw ttttgacata 120 ttttgagaaa atgctttgaa ttaatacact ctgggcaaat ktactaccct tttgttatgt 180 tagggatgaa aacatccact aaattaaact gctgtaaaaa tgcatyagat caatattttt 240 taataatttt tttcataaat ccgttaatca acctcagtcc tgatcaaaac tactaaattg 300 tttagaaaat tactggattt taacanttta attgtcaaat tcataaatga tgtcactgat 360 ttrgtgaaaa aamacacaaa atkacgtatt ttcaataaaa aagtaattgt ggactgaatt 420 ttttttttac cttttatcam agtcttggac atgtgaaaca acattgcctt gtttttgatt 480 gattttcttt tttctcccta atttactgtt tgtggctgtt tttgccccat tgacttccat 540 tataaccaca tgttttgatt acaaagccat gacaccatat aatcatgcat ttttgattgt 600 tggtggtttt ccctgttggg aagaaataaa atttttaatt tttactgttg atcatcagtt 660 ggcaccatta accnttagat aggcttgtgc aaaaaaaaaa gcttagtttc tggattttat 720 atggagtata acagcaaatt aaagtgtgta tatgtgtgtg tgtgagagtg acctttgcgc 780 acttaccttg atrtgtytga gaaaancaaa atatgcatct cagctctcag aactacatgg 840 agtaaaattt gtatttatgc actwtcaaat gtggttataa tggaagtcaa tggggcaaaa 900 acagccacca acagtaaatt agggagaaaa aaaaagaaaa actaacaatg catcaaagcc 960 aatgttgtta ctaatcgttc acatgtccaa gattgataaa aggtataaaa aaatccagtc 1020 cactttttat attgaaaata cttcattttg tgtttttttt tcaccaaatc agtgacatca 1080 tttgtgaatt tagcaaatta aagagttaaa atcctgtaat tttctaaaca atttagtagt 1140 tttgatcagg actgagattg attaatagat ttatgcaaaa aaataaaata twtaaatatt 1200 tatctgatgc atttttacag cagtttaatt tagtggatgt tttcatcccg aacataacaa 1260 aagggtagta aatttgaaca gtgcacaaag g 1291 // ID Gypsy70-I_DR repbase; DNA; ZEB; 4295 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy70-I_DR; KW Gypsy70-LTR_DR; Gypsy70_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4295 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1455-1455 (2008). XX DR Genome; chr14; Positions 3111110 3106816. XX CC Positions [1711-2166] - Reverse transcriptase CC Positions [3181-3660] - Integrase core CC 'ACACT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 17..940 FT /product="Gypsy70-I_DR_2p" FT /translation="MDPAEDSSVLSAMEFQGTMLGRHEQELTSARRTVEML FT ASRVAELSDQLHSLQFQQGLGPSTTTRIPQLEPRVNNPPVYSGEPSNCRSF FT LTQCEVVFSLQPQTYASERTKVAFIISLLAGRARDWGAAVWEAEDVCCLDF FT HRFKEEMIKTFDRSVFGKEASRLLAALQQGRRSVADYAIEFRTLAATSEWN FT QAALSARFLDGLTDIIKDEIYARDPPERFDELVSLAIRLDHRFDLRRRVLK FT GSRPEATELPPAAQLKFGPTEPEPMQVGRLRLTAEEKQRRLSKGLCLYCGG FT QGHVASLCPVKASARQ" FT CDS 1012..4296 FT /product="Gypsy70-I_DR_1p" FT /translation="MVSIKNVLHSCSALVDSGAEGNFMDSTWAHLKGVHIQ FT PLVDPVSVSTLDGRELAVISHITVPVSLVTSGNHREELVFYLFESPATPIV FT LGHPWLSLHNPHINWAENTIITWKSSCHANCLVSAPLPSSVSSVSVLQEED FT VDLSRVPGVYHDLRMVFSRSRAASLPPHRPYDCAINLLPGTSPPRGRLFSL FT SAPERATMEKYLSDSLAAGIIRSSSSPAGAGFFFVKKKDGSLRPCIDYRGL FT NDITIKNRYPLPLMSTAFEILQGARVFTKLDLRNAYHLVRIREGDEWKTAF FT NTPTGHFEYLVLPFGLSNAPAVFQALVNDVLRDMINKFVFVYLDDILIFSP FT SLQVHIQHVRRVLQRLLENQLFVKAEKCLFHAQSVPFLGSIISVEGIRMDP FT EKVRAVSDWPVPGSRKALQQFLGFANFYRRFIRNYSQVAAPLTALTSTKSH FT FCWSIAAQAAFRELKSRFTTAPILVLPDPARQFVVEVDASEVGVGAVLSQI FT CPKDNKLHPCAYYSHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAEP FT FVVWTDHRNLEYIQTAKRLNSRQARWALFFGRFNFTLSYRPGSKNGKPDAL FT SRCFGTPYGNPHPDTILPRGHVVGAVIWGVEGLVRRALAKVQVPKGCPVGL FT LYVPVSARAAVLRWGHSSRFACNPGIRRSLAAVRQRFWWPAMVEDIRRFVG FT ACSVCAQNKSSNSRPVGLLHPLPIPSRPWSHIAMDFVTGLPPSDGNTVILT FT VVDRFSKAVHFIPLPKLPSAREMARVVVDHVFRIHGLPEDVVSDRGPQFIS FT HFWREFCRQIGSTASLSSGFHPQTNGQTERANQDLGRMLRCLASHNPSSWS FT QQLVWAEYAHNSLPVSSTGLSPFMCCLGYQPPVFPSQVADAAVPSVQAFIQ FT RCRRTWKRARQALLRTRERTKRTADRHRADAPKYVCGQKVWLSTKDLPLRV FT PARKLAPRFIGPCLITKVISPVAVRLRLPRSLRRVHPVFHVSCVKPVLRSP FT HSPLSSTPSPPPPLMVEGSPAFTVRKILNSRRRGRGFQYLVDWEGYGPEER FT CWVPARDILDHALIEEFHHRQLPQISGTPGGVPGERGT" XX SQ Sequence 4295 BP; 860 A; 1174 C; 1099 G; 1162 T; 0 other; gaacactctg accagtatgg acccagcgga ggattcgtct gtattgtctg caatggagtt 60 tcaaggaacc atgctaggcc gccatgagca ggagctgact tcagcacgtc gtacagtgga 120 aatgttggcc tcccgggtag ctgagctctc tgatcagctc catagtctcc agttccagca 180 ggggttaggg ccatccacaa ctactcgcat ccctcaactg gagcccaggg ttaacaaccc 240 accagtctac tcgggtgagc cttctaattg tcgatctttc ctcacccagt gtgaagttgt 300 gttttctctt caacctcaga cttatgcttc ggagagaacc aaagtggctt tcattatttc 360 tctccttgct ggtcgggccc gagactgggg ggcagcagtc tgggaggctg aagatgtgtg 420 ttgtcttgac tttcatcggt tcaaggagga gatgatcaaa acgtttgata ggtctgtgtt 480 cggaaaggaa gcgtctcgac tgctggcagc tctccaacag ggtagacgct ctgtggcaga 540 ctatgccatt gaattccgca cacttgcagc aaccagcgag tggaatcaag cggcattgtc 600 agctcgattc ttggatggtt tgacagacat aatcaaagat gaaatttatg ctcgtgatcc 660 cccagagcgt tttgatgaac tagtctccct ggccattcgg ctggatcatc gttttgacct 720 tagacgtcgt gttttaaagg gctcacgacc agaagccact gagttgcctc ctgctgccca 780 attgaagttt ggtccaactg agcctgaacc catgcaggtg ggcagacttc gccttactgc 840 agaggagaaa caacgacgat tgtctaaagg gctgtgtctc tactgtggcg ggcagggtca 900 tgttgcttca ctttgcccag taaaagccag tgctcgccag taatcaggag agttctggcg 960 agcggcactt tcatctcctc atccctctgc tcccgcaccc tactgcctgt catggtgtcc 1020 atcaagaatg tcctccattc atgctctgcg cttgtggatt caggtgccga gggcaacttc 1080 atggactcga catgggctca tcttaaaggg gttcacatcc agcctctagt ggatcccgtg 1140 tctgtctcca cactggatgg ccgtgagcta gctgtcatct cccacatcac tgtgccggtg 1200 agtctggtca cttctggcaa ccaccgggag gaattggtgt tttacctttt tgagtcacca 1260 gctaccccca tagtgttggg ccatccatgg ctatctttac ataatccaca cattaactgg 1320 gcagagaaca ctattataac ttggaaatct tcttgtcatg ctaattgtct tgtgtcagca 1380 cctttgcctt cttctgtttc ttctgtttct gttttgcagg aggaggatgt tgatctctcc 1440 cgggttccag gggtttacca cgatctgcgc atggttttca gtcgatcccg agctgcatct 1500 ctcccccccc accggccgta tgattgtgct attaacctcc tacctggtac ctcacctcct 1560 cgagggcggc tcttctcttt gtcagcccca gaacgagcga ccatggagaa gtatttgtct 1620 gattctctgg cagccggcat catccgctcc tcctcatctc cggccggagc agggttcttc 1680 tttgtgaaaa agaaagacgg ctccttgcgt ccctgcattg attatcgagg gctcaatgac 1740 ataaccatta aaaacaggta cccccttcct ctaatgtcaa cagccttcga aatcctgcag 1800 ggagcaaggg ttttcacaaa gttggacctg cgcaacgcgt accatttggt ccggattagg 1860 gagggcgatg agtggaagac cgcctttaac acccccactg ggcactttga atatttagtt 1920 cttccctttg ggctgtccaa tgccccagct gtcttccagg cactcgtcaa tgatgtgctg 1980 agagacatga taaacaaatt tgtttttgtg tacctggatg acattctcat cttttctccc 2040 tctctccagg tgcacatcca gcacgtcaga cgagtgttgc agaggctgct ggagaatcag 2100 ttatttgtca aagcagagaa atgccttttt catgcccagt cagttccgtt cttgggttcc 2160 ataatttcgg tggagggcat tcgaatggat ccagaaaagg taagagctgt ctccgactgg 2220 ccggttcctg gctcacgcaa agctctacaa caattcttgg gctttgcgaa tttctatagg 2280 cgtttcattc gcaactacag ccaggtagcg gcccctctga cagcattaac ctccaccaaa 2340 tctcattttt gctggtctat agctgctcag gcagctttta gagaattaaa gtctcgattt 2400 acaactgcgc caatccttgt cctacctgac ccagcacgac aatttgtggt ggaggttgat 2460 gcatcagagg tgggtgtcgg agcagttctt tctcagatat gccctaaaga caataaattg 2520 cacccttgtg cttactattc tcaccgtttg tctcctgcag agcggaacta cgacatcggt 2580 aaccgggaac tgctggcagt gcggctggct ttgggggagt ggcgtcattg gctggagggg 2640 gcagctgaac cttttgtggt atggaccgac cacaggaatc tggagtatat tcagacagcc 2700 aagaggctga actcccgaca ggctcgctgg gcattgttct ttgggcgctt taattttact 2760 ctttcttaca gacctggttc taagaatggt aaaccagatg ccttgtcacg atgctttggc 2820 actccgtatg gcaatcccca ccctgatacc attttgccca ggggccatgt ggtgggggct 2880 gttatctggg gtgttgaggg ccttgtcagg cgagccttag ccaaagtcca ggtgcccaag 2940 ggatgtccgg tgggtctgtt gtatgttcct gtgtcggccc gagcagctgt ccttcggtgg 3000 ggtcattcct ccaggttcgc ttgcaatcca ggcataagga ggtcgctggc tgccgttcgt 3060 cagcgctttt ggtggccagc catggtggag gacatccgcc ggtttgtggg tgcatgctct 3120 gtctgtgctc aaaataaatc ctctaattct cgtcccgttg gtctgcttca tcctctcccc 3180 attccctccc gcccctggtc acatattgcc atggattttg tcactggtct ccctccatcc 3240 gatggcaata ctgtgatcct cactgtggtg gaccgctttt ccaaggcggt ccacttcatt 3300 cccctcccca aacttccatc tgctagggag atggcccggg tggtcgtgga ccatgtcttc 3360 cggattcatg gtcttccgga ggatgtggtc tctgacaggg ggccccagtt catttcccac 3420 ttttggaggg agttctgccg acagattggt tctactgcta gtctgtcgtc aggattccat 3480 ccacagacca acggccagac tgagcgtgca aaccaggatc ttgggcggat gctccgatgc 3540 ctggcttccc ataacccctc ttcctggagt cagcagttgg tgtgggcaga atatgctcac 3600 aactcgttgc cggtgtcgtc gactgggctg tctcctttca tgtgttgtct tggttaccaa 3660 ccccctgtgt ttccttccca ggtggccgat gctgccgttc cctcggtcca agcatttatt 3720 caacgctgcc gccgcacctg gaaaagggcc aggcaggcac tgttgcggac cagggaacgc 3780 accaagagaa ctgctgaccg ccaccgagct gatgcaccca aatatgtctg tggtcaaaag 3840 gtgtggctct ccactaagga cctgccactt cgggtccctg ctcgtaaatt ggcaccccga 3900 ttcattggac catgcctcat taccaaggtg atcagtccgg tggcggtgcg gcttcggcta 3960 ccccggagtc tacgccgtgt ccaccctgtt ttccatgttt cttgtgtcaa acctgtcttg 4020 cgttctcccc actcccccct ttcttctaca ccctctcctc ctcctcccct gatggttgaa 4080 ggctctccag ccttcaccgt cagaaagatt cttaattcaa ggaggcgagg tagaggtttt 4140 caatatctag tagattggga ggggtacggt ccagaagaga gatgttgggt cccagctcgg 4200 gacattctgg accatgcgct cattgaggaa ttccaccatc ggcagttgcc tcaaatctcg 4260 ggtacgccgg gaggcgttcc tggggagagg ggtac 4295 // ID BEL7-LTR_DR repbase; DNA; ZEB; 769 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL7-I_DR; KW BEL7-LTR_DR; BEL7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-769 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1364-1364 (2008). XX DR Genome; chr18; Positions 45445653 45444885. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 769 BP; 204 A; 130 C; 152 G; 283 T; 0 other; tgttctgaat tctttatact atggctgtaa cacgatttac tgagtgttct gtaaagggct 60 tatattatga atgtaggatg taacttccgg aggatgcata tttgagtcca tgtatgagaa 120 gttgaatgta ctgactcatc tgagtgagac gcgattgcca tccgtgagat attgtgaagt 180 gcggggtaaa gttattgcca agtgagtttg ttccttcata tcatgtttgt aattaccctt 240 gtatgcttga tattaagtgc gtgatgttgt taatgacccg cgatcggccg cgagcattag 300 tttccgtgta aaccaaaagt gaaagtgctg ctaccatcag catcgcggct aatcccgatg 360 tcctcgtgga gatctcgttt gttttattat gattcagttt actatattct acatttgtta 420 tgccttatgc tgtcacatgc cctttatata atgcagcttt agtcattgat gtctatttat 480 attatgttta cattcagcat acattagtct attgactgca ctttaagaga gctgcaaata 540 ctgtgttgtt tggtttggga tatttattgt gtcagtacat atatatatat atattttcgt 600 gtttgatatt ttattattta tgtatactta ttgtttatta caggaaacta cctccctgct 660 cataaataaa agattgaaat tatctcctga ctcctactcc ttaaccagag tgctgaagtg 720 gtcagcaaca tttaaagaac tggcattcta aattgggcaa tacccaaca 769 // ID CATCH2I_DR repbase; DNA; ZEB; 5825 BP. XX AC AL645822; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE Internal sequence of the CATCH2_DR endogenous retrovirus. XX KW BEL; LTR Retrotransposon; Transposable Element; KW BELL/CATCH superfamily; CATCH2I_DR; CATCH2LTR_DR; CATCH2_DR; KW CATCH2_DRp polyprotein; endogenous retrovirus; endonuclease; gag; KW protease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5825 RA Kapitonov V.V. and Jurka J.; RT "CATCH2, an endogenous zebrafish retrovirus from the BEL/CATCH RT superfamily."; RL Repbase Reports 2(5), 2-2 (2002). XX DR Genbank; AL645822; Positions 129172 134996. XX CC CATCH2I_DR is an internal portion of the CATCH2_DR endogenous CC retrovirus, which belongs to the BEL/CATCH superfamily. CC It is flanked by 99% identical long terminal repeats, deposited CC in CC Repbase as CATCH2LTR_DR, and is characterized by 5-bp target CC site duplication. Usually, internal portions of retroviruses CC from the Bel/Catch superfamily carry one long ORF that encodes CC gag, reverse transcriptase and endonuclease domains. CC Presumably, a false frame shift splits an originally intact ORF CC in CC CATCH2I_DR into two ORFs. XX SQ Sequence 5825 BP; 1734 A; 1544 C; 1221 G; 1326 T; 0 other; aaattggtcc ttcgagccgg agctggtcgt taccatggca accccaaagt cagccccatc 60 ctcttctcga cgtccacaca gagtgatcca gagacctgcc tacttagatg actacatagt 120 caattatgct ggaaacccat ctagccatcc agttggacct aaagagaaag caataatgac 180 gtcaaccccc atcactacta gtcaatcata tggaactgct ggttcatcag aggactttat 240 gcagcagatg atggagatta caaaacagca aagtgagaca gcaaggcgcc aaacagagct 300 cttggagaaa gtcctgctac acagtagcct tcaagcatcc accccctcct cacgagcatc 360 atcacggcat cagacacctc ggacccgaat cctgcaatca gtggaaaagg gacagccgac 420 atctgaagta agccctgatg ttgttagaat acagtcctct ccctggccac tgcctaataa 480 tcccccaact gtggaagagc tttctcagtc gctgaggaaa acccacctca gagacaccta 540 tttccagtca cctccacctg taagcagaga aaccacctta acgtcacatc agtatgccac 600 accctatagc agctctccat tagaccaaat tagaacccat cagcacaacc aagcagagcc 660 atggactaaa ccactgcctc ctctccctgg acagcaaccc atccaaacca cccagccaca 720 tttctataac ataccacctg cagctgtcca cccagtacac agccggttcg tttcggtgac 780 cgagccaatg ccacaaacgt cataccaagc tacatctcaa tttacagtac cctatgcact 840 gtaccctatg cacctcctcc tccttcatac tggagtccaa cagtgaaagg gcccaccttt 900 ccagattttt acaaagagga cagagctcag tatgtggaac ttaggatggc tttggataat 960 ttgctccacc ctcaaatacc cgagaattat aagtactcca tcctcctgaa gcatgtgaag 1020 gtacctaatg ctcacagact agtgttagca catgcagaat cagtcactcc ttacacaaat 1080 gccctgcaag ccctcgacag gaggtatggc aggccctacc agtttgtgct gagagaaata 1140 gacattggag aacctcccag ctattcgtgc tggggatgag cgggcattcg atgagttttt 1200 tctcagagtc caagcaatag tcggcatgct taaagccctc aaggatgatg gtattgagga 1260 actgcacagt gagtcaaatg ttaaacgact gttaaactgt ctacctcgtt cacaacaaac 1320 tcaattcagg aggcatcatc tcagagaaac ccagataaga caaagttctc tctccttgag 1380 tttgcagatt agctgcagct ggaagctaac tgcttagaat ttgacccaac tgataccacc 1440 agaactctaa atcgagatca aagagcagtc actcgacaac aaaccaggca aaccagcata 1500 ctgcatggtg ttaatcaaag acggactaca gaacaggtaa ccaaacccac cataatacag 1560 cagaaagcac ctaacaaaca caaacctgtc tgtccatact gcaatgcgga acattacctc 1620 agccagtgta ccgccttcaa tacattaact aaggagcaga ttgtgggttg gattaaagag 1680 aataaacgat gctggaaatg tgcacgttta cacatggcta aagactgtga ccttcaaaag 1740 ccttgccacc tgtgtcaagc caaacatctc tctgtggtcc atgacataaa tcaaaggaaa 1800 gctgaagtta ccaatgaaag cagtagtttg tataattcaa ccacagagac tctgtaccta 1860 gaccagccac gctgtggggg caaagtcttt ctaaaggttg tcaaggttac acttcattac 1920 caacaaaagg ctctggacac atttgcaata cttgacgatg gctcagagcg caccattctg 1980 cttgccaccg ctgcaaatca ccttggttta aagggcaaac cagaacagtt gaacttacgg 2040 acagtacacc aagatatcag gacattacat ggcacatctg tatctttctg tatatccact 2100 ccaagtaacc ctggtaagag atatgccatt cacggggctt ttaccgcaga gcagtttgga 2160 ttagcagaat acacttatcc agtgagttct ttgcagaaaa ggtacagaca cctcaggggc 2220 gttcccatac cacccctgca caaagcttgt cccctgctgc tcattggttc agatcacact 2280 gagctcatca caccaactga accagtgtta ttgggtccac ctggaggtcc agcagcaatc 2340 aaaactcgcc ttggctgggc tttgcaagaa ccctccaaac atgtacaaga acaactccca 2400 atttcacagt gcctgttcac atctaattgt tctccatccg cagaacttta tcatcagatt 2460 gaaaagttat ggcagatgga catcctgccg taccgaagtg agaaaagtgt agtaaggtca 2520 caacaggacc aagaagcctt gaacatgctc gagacacaaa cagtgcaggt gaaagtggat 2580 ggaattgaca gatatgccac cccactttta agagtcaaaa caatgacttt actcaatgcc 2640 cccaaagagg ctgtaatgtg ccatctccgt agcactaaac gccgtctgac aagggaccca 2700 ccattagccc tcaaatatca agaggaaatc aaaaaactgg agaatgctgg ttatgccact 2760 aaaataccac ctcaagaagc agcccaatct acagagtcat ggtatctgcc ccaccacatt 2820 gtgacacaca atggaaagca cagagtggtg ttcaattgct cttacacata caatggagaa 2880 aatctgaaca agcagctgtt gccaggacca acattaggcc ccactctcct tggagtgtta 2940 attcgcttca gagaacatcg agtcgccatc agtggagaca ttaaagggat gtttcaccag 3000 gtccgtctgc tcaccaaaga taaacccctt ctgcggttct tgtggcggga cctaaaccct 3060 gaaaacccac ctgatgtgta tgagtggcaa gtccttccat tcggcacaac ctgtagcccc 3120 tgttgtgcaa cctttgccgt tcagcgacac gtttttagcc attgccaaca gggtgacaag 3180 ctacgacagt cagttgaaca ttccttctat gtcgacaact gcctacaaag cacttcatct 3240 gcagttgaag cccgtgaact aattgatggt ctaaggtcca tactagcctc agggggtttc 3300 gaaatccgtc agtgggcaag caatgaggcc acagtggtca gacatctccc taaagaggcc 3360 aaatcagaga atactgagct gtggatttct caaagtcagg cagaccccca tgaaatgaca 3420 ctgggactta gctggcattg tgtaccagac accctacatt accgtcgtcg cccgctgccc 3480 taccaggagg taacaatgag gaatatctat cgtgtccttg ccagccaata tgacccacta 3540 ggctacatca tacccttcac aacaagggcc aaaatcattg tatgtcaact gtgggcaaaa 3600 gaacgagaat gggatgaccc ccttctacca tcagaactgc tgcaggcctg gcagaattgg 3660 gaggccgaac tgaatcatct tccaaaaatt accatgcccc gttgttatgt gtcggccact 3720 ctagacgttc cagaggcaga acgagagctc catatcttct gcgatgccac tgcagaagct 3780 tacgggtcag tagcatacct tcgtactgag caccaaggcc agattgagct ggcattcatt 3840 catgcaagat ctagagtgtc acccaaaagg caacaatctg tacccaggct cgaactgtgc 3900 gccgccctca ctggagcaca gttagccaag ctgcttcata atgagctgtc actcaagatc 3960 aaagacacat tcttgtggag cgactcaaca accgtactta cctggatcaa atcagaatct 4020 tgcagattta aagtttttgt ggggactaga gtaactgaaa tacaagagct caccgaaggt 4080 cactattggc ggtttgtaga ttcaggaaac aatgttgcag atgacatcac atgtggcaaa 4140 tccttgcttg aactgtccca acccaacaga tggagtaaag ggccagattt cctctatcta 4200 ccacccgtta gttggccaga aagcctttct gtagaactag atgcagacct gactgagctc 4260 cgtaacacaa agttttgtgg tctcaccttg gatggttctg gatcatcact gccagatgcc 4320 aaccaattct ccacttttga agaacttctg gaggccactt gtagatcctt acatggggca 4380 gcaggagaaa ccactgaccc agaagccata cagtacagac aagctgaaac ggatttacta 4440 aggcgagccc aaacagacag tttcccctta gagtataatg ccctgacagc tggtaagcca 4500 atcccatcat caagtcgttt gctttcacta tcacctgagt tggataagac agctgcactc 4560 atccgtgttg gaggacgatt acgacaagcc cagcacttgt gttacagtac tgtacatcca 4620 atagtcctgg atccaaatca cccaatcacc cgactcctca taaaacagta tgatgccaaa 4680 ctcggtcacc ctggtccaga aagggtcttt gctgaaatgc gaaggtatta ctggatcctc 4740 agagggagag aagcaatacg gcgccatcaa cgttcttgtg tcgaatgcca aaggtggaga 4800 gcaaagccca atatcccaaa gatggcagaa ctccctccag ccagacttcg tcttatgaaa 4860 ccgcctttct tttccacggg agtagattgc tttggcccat tcctggtaaa acgaggaagg 4920 agcaatgaaa agaaatgggg tattattttc aagtgcatga caacacgctg tgtgcactta 4980 ggtcttctgg ccaacatgga cacagactcc tttctaatgg ctctcagacg catggttgct 5040 cgacgaggaa ctccctctga aatactggca gatcagggca ccaattttcg aggaggagat 5100 aaagaattac aaactgcatt tacagccatg agtcccgatc tccaagccca gttagcaaaa 5160 cagaaaatcc agttccatta taatcctcca aatgccccac attttggagg tatgtgggag 5220 cgagaaatcc gctcagtcaa agcagcctta cgcaccataa taggacccca aactctcaca 5280 gaggaagtac tcaaaacact ccttatagag gttgaggcta tactcaatgc aaagccactg 5340 ggttatgttt cttctgacct tgctgaccca gatcctgtga ccccaaatta cctgttgatg 5400 gggcggccag atgtgtcact accacaagtc atctacccag aatctgagat ccttagtcgg 5460 aaaagatggc ggcactcaca ggtcttggcc gaccagttct ggacgagttt cattaagaat 5520 tacttaccca ccctgcaaca acgtcagaaa tggatgacag acaccagcaa cctcacacct 5580 ggaacagtcg tgatgataat agatcaccaa cttcctcgag ccttgtggcc tgtcggcaaa 5640 gttgttacca catacctcgg attggatggt cgtgtttgtt cagcaacgat agaagtcaaa 5700 aacaaacaat atcatcgtcc tgtggctcgt cttatcactc tgccttctat tccagatgat 5760 aaccctacaa cggactaaaa tggcaaagga tgattgtaca tttgcatagc aaatgtaggg 5820 gcggc 5825 // ID Gypsy13-I_DR repbase; DNA; ZEB; 6488 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portions of the Gypsy13_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy13-I_DR; Gypsy13-LTR_DR; Gypsy13_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy13-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6488 RA Kapitonov V.V. and Jurka J.; RT "Gypsy13_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 319-319 (2004). XX DR [1] (Consensus) XX CC Gypsy13-I_DR is an internal portion of the Gypsy13_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy13-LTR_DR. Gypsy13_DR is characterized by 4-bp target CC site duplications. The internal portion encodes two proteins: CC the 556-aa gag Gypsy13_DR1p (pos. 95-1755) and 1577-aa CC polyprotein (pos. 1756-6486, conceptual translation) composed CC of the protease, reverse transcriptase, and integrase domains. CC PBS is identical to that in Gypsy9_DR. Some internal portions CC are flanked by 100% identical LTRs. PBS is complementary CC Arg-tRNA. XX FH Key Location/Qualifiers FT CDS 1756..6486 FT /product="Gypsy13-I_DR2p" FT /note="Polyprotein" FT /translation="FKLESVLVVGQTGAELESQCPNTNHVFEKSQVSKVQS FT TSLPKGLIGAMSIAEVTIANEKCSCLLDTGSQVTTVPKSFYEQHLSGYPIK FT SIDDILEVEGANGLSVPYEGYIEMGITFPEELLGVSVEIPTVALVVPDVKA FT HNQSMVLIGTNTLDVLYKKYLSADPPKFQPCSYGYKVVLKTLEIRWRQNTS FT GVLGHVRLRSRAPKVLMAGQTVVVGGSVSNPCRIDQTILVEHSPDSSLPGG FT VFVKRCLLNQPENQMNSIPVVLTNETNHNITIPPRCVIAELHAVDSLQSLS FT KTSNSNGEGDFTLNFGDSPLPQVWKERISKKLREIPEVFSHHDLDFGHTQK FT VKHSIKLHDETPFKQRARPIHPQDIEAVRRHLQDLLASGVIRESESPFSSP FT IVVVRKKNGDVRLCIDYRKLNIQTVRDAYALPNLEETFSALTGSKWFSVLD FT LKSGYYQIEVDEADKPKTAFVCPLGFWEFNRMPQGVTNAPSTFQRLMEKCM FT GDINLKEVLVFLDDLIVFSDTLEEHETRLLNVLFRLKEYGLKLSLEKCKFF FT QTSVRYLGHIVSEHGVETDPEKVQALKTWPVPKNLKELRSFLGFGGYYRRF FT IKDYSKIVKPLNDLTSGYPPLRKGAKKCNKGSQYHNPKESFGDRWTPSCEE FT AFRTLIEKLTSAPILGFADPKLPYFLHTDASTKGLGAALYQEQDGQMRAIA FT FASRGLSHSESRYPAHKLEFLALKWAVTEKFNDYLYGNHFTVITDSNPLTY FT ILTTAKLDATSYRWLSALSTFSFKLQYRAGKQNVDADSLSRRPQEPIPETA FT WSSKEQERIHQFVQHHHHDAADIVSAPNDVVHAICEKHLINQDAASGVALV FT GSLALRPDAIPDEYGEDCNLDGLPVMPYLPHEIGEKQRADSVLREVIFHLE FT LGEKPSPTVRKEIPTFPLFLKEWNRLELRDGILYRRRQENNLLTYQLVLPE FT ELRPLVMSSLHDDMGHLGIERTVDLIRSRFYWPKMAADVERKVKECSRCVR FT RKAQPQKAAPLVNFHATRPLQLVCMDFLSLEPDRSNTKDILVITDFFTKYA FT VAYPTPNQKAKTVAKCLWENFVTHYGFPERLHSDQGPDFESHVIRELCEVS FT GIKKSRTTPYHPQGNPVERFNRTLLGMLGTLEEKDKAHWKDFVKPLVHAYN FT CTKHEVTGFTPYELMFGRQPRLPVDLAFGLPHHGKSDVIPHSEYVKQLKSH FT LKESYLLASQGMLKTAEKNKTRFDKSVTHSSLEVGDRVLVRNVRLRGKHKL FT ADKWESEVYVVVKKAGDLPVYTVHPENSESPLRTLHRDLLLPCGYLPLSNS FT SNPSPPKMSRPRTRQNPGFQQPEEDCSFLSPEDEDYEWYGDNHQNVEPLQF FT STVYDVPQPVKGKSVSLADESAGQNLGKDGTQCKEGTQNDNLPVTPPVDNL FT PDDLPANDPPVYNAPVHSSVDNPSDDVLEVCETTETPGKDQPETDNSKDLP FT TQRESKVEEANDSENSVSCEREEQQSEQEIPSLRRSARERDKPERLTYFQL FT GNPLSHVVQSLFQGLSTALVNSLNSVEDLGDSPITSDIPANIVTTQPLRAC FT KGTCIVSRGE" FT CDS 95..1762 FT /product="Gypsy13-I_DR1p" FT /note="Gag-protein" FT /translation="MDIIEAEGIKIPNSVIISGLTQDKSDDELFDFLKQYG FT SFAKTVFISDKDSEFYQSVILEYTSGQALHSLEPQLPYTHQLSSDPSVTYH FT VRALSSVFTQYKGTSVTKSYLEGLKEVAKLSGTDFEIVLSQMLSQMTAELT FT PTSADTEADDEDLDEPQAQVCPEESFTPAPVKISQPDDSLSQHTSAKPNKP FT PLLTSSEVLNPPEVQKLVIEHVVRTGEVATQGLMQQRLRVFSGKCPRPGSE FT VDYDTWRSSVELMLKDSTLSDLNVSRKIVDSLLPPAADVIKHLSSEAPSSA FT YLQLLDSAFGVVEDGDELLAKFMNTLQDAGEKPSTYLYRLQTALRVTIKRG FT GVSPEEADRHLLKQFCRGCWDNDLITDLQLERRRNNPPSFGQLLLMLRTEE FT DKHTAKVTRMKQHLGSSKPRAVMHSQRTWVSSEVEQGEVSNMVSLAAETKE FT IKRQIAKLQSQLASLVPAHKTQKKASQQAVVNKQDKKKSDTANQLTRTPVS FT QRQKDRPRPWYCFTCGEDCHIASSCTSEPNPTLVNAKRKLLREKQLLWDSQ FT NANSNPDLN" XX SQ Sequence 6488 BP; 1949 A; 1372 C; 1449 G; 1718 T; 0 other; aagtggcgag ccagccagga ggtggcgctg ttgctgagta ttaattacat tctaagccat 60 tatttttgcc tacgttgacc agtaacaatt gacaatggat ataatagagg cagaagggat 120 taaaattccg aactcagtga tcattagcgg attaacacaa gacaaaagtg atgatgagct 180 gtttgatttc ctaaaacaat atggttcttt tgccaaaaca gtttttatta gtgacaagga 240 ttctgagttt tatcaaagtg ttatacttga atataccagt ggccaagctt tgcattctct 300 agagccacag ctaccttata ctcaccagct atcaagtgac ccaagtgtta cttaccatgt 360 gagagcatta tctagtgtgt ttacacagta taagggaacc agtgtcacaa agtcatacct 420 ggagggactg aaagaagtag caaaattgag tgggactgat tttgagattg tcttgagcca 480 gatgctgtca caaatgactg ctgagcttac tccaacgtcc gctgacacag aggctgatga 540 cgaggatctg gatgaaccgc aagctcaggt atgtcctgaa gagagtttta ccccagcccc 600 tgtcaaaatt agtcaaccag atgactcact gtcacagcat acctctgcta aaccaaataa 660 gccccccctc ttaacctctt cagaggtgtt aaatccacct gaggttcaaa aactcgtaat 720 tgaacatgtg gtgagaactg gtgaagtggc tactcaagga cttatgcagc aaaggcttag 780 ggtattttca ggaaaatgtc ccagacccgg aagtgaggtc gattacgaca cttggcgctc 840 cagtgtagag ctaatgctga aagattccac cttatctgat ttgaacgtat ccaggaaaat 900 agtagacagt cttttaccac ccgcagcaga tgttataaaa catcttagct ctgaagctcc 960 atcatcagct taccttcagt tgctggattc tgcttttgga gttgttgagg atggagatga 1020 actccttgca aagttcatga atactctgca ggacgctggt gaaaagccat ctacttactt 1080 gtacagattg cagacagctt tgagagtgac aataaaaaga ggtggtgtct cacctgaaga 1140 agcagatcgg catcttctca agcagttctg tagaggctgc tgggataatg atctaattac 1200 tgacctgcag ctagagcgaa ggcgtaataa tcctccttca tttgggcagc ttctacttat 1260 gttacgcact gaagaagaca aacacaccgc aaaagtcact cgtatgaagc agcatcttgg 1320 gtcttctaaa ccaagagcag taatgcactc gcaaaggacg tgggtttctt ctgaggtgga 1380 gcaaggagaa gtttcaaaca tggtatcact tgcagctgaa actaaggaga tcaagagaca 1440 gatagcaaaa ctacaaagtc agttggctag ccttgttcct gcgcataaaa cccagaagaa 1500 agcttcacag caagcagtag taaataaaca ggacaaaaag aagtcagata ctgctaacca 1560 gttaactagg actccagtta gccaaagaca aaaggacagg cctagaccat ggtactgctt 1620 tacctgtggt gaggattgtc atattgcatc ctcttgcact tctgagccaa accccacact 1680 tgttaatgcc aagcgtaaac ttttgagaga aaaacagcta ctatgggact ctcagaacgc 1740 caactccaac cctgatttaa actagaatca gtccttgttg tgggacagac aggggctgag 1800 ttggaatcac aatgtcccaa tactaatcat gtttttgaaa aaagtcaggt ttctaaagtg 1860 caaagcacta gcttgccaaa ggggttaata ggtgctatga gcattgctga agttactata 1920 gccaatgaaa aatgcagctg tttattggat acaggctctc aagtgacaac agttccaaag 1980 tccttctatg aacagcatct ctcaggatac ccaattaagt ccattgatga tatcctggaa 2040 gtagaaggcg caaatggtct atctgttcca tacgaaggct acatagaaat gggaattacc 2100 tttccagaag aattactagg agtgagcgtt gaaataccta cagtagcctt agtagttcct 2160 gatgtaaaag ctcacaatca gtcaatggtt cttattggga caaacacctt ggatgttctc 2220 tataagaagt atttaagtgc tgatccccca aaatttcaac cttgttcata tggttacaaa 2280 gtggtactta aaactcttga aataagatgg agacaaaaca caagtggtgt tcttggccat 2340 gtacgattga ggagtcgagc acccaaagtt ttaatggctg gtcagactgt agtggtggga 2400 ggttcagttt caaatccatg cagaatagat cagactatac ttgtggagca ctcacctgat 2460 tcgtctctac ctggaggggt gtttgtcaag cggtgtcttc ttaatcagcc tgagaaccag 2520 atgaatagta taccagttgt gcttaccaat gagacaaatc ataacattac tatcccaccc 2580 agatgtgtga tcgctgagct ccatgctgtg gattctttac agtctctttc aaaaacttcc 2640 aatagtaatg gagagggtga tttcactctg aacttcggtg attccccctt acctcaggta 2700 tggaaggagc gcatttctaa aaaactcaga gaaatacctg aagtgttcag tcatcatgac 2760 ctagattttg gccacaccca gaaagtgaag catagcatta aattgcacga tgaaactccc 2820 tttaaacaga gggcacgacc tattcatcca caagatattg aggcagttcg taggcatcta 2880 caagatcttc ttgcaagtgg agtcatccgg gaatcagagt cacctttctc ttccccaatt 2940 gtggtggtga ggaagaagaa tggggatgta cgtctatgta tagattatcg caagctgaat 3000 attcaaacag tgagagatgc ctatgcattg ccgaatcttg aggaaacatt ctcggctctg 3060 acaggatcta aatggttctc tgtcctcgat ttaaagtctg ggtactacca gatagaagtg 3120 gatgaggcag acaaacccaa aaccgccttt gtctgtccgc tgggattttg ggagtttaat 3180 cgcatgccac agggtgtgac gaatgcccca agtacattcc aaagattaat ggaaaaatgt 3240 atgggggaca ttaatctgaa ggaggtactt gttttcttgg acgacttaat agttttctca 3300 gacacattgg aagaacatga gactcgacta ttaaatgtcc tgtttcgtct aaaggaatat 3360 ggtctgaaac tctccttgga aaaatgtaag tttttccaga cttcagtccg ctatttggga 3420 catatcgtgt cagagcatgg agtggagact gaccctgaga aggtccaagc tttaaaaacc 3480 tggcctgtac ctaaaaacct aaaagaactt aggtctttct taggctttgg gggatattat 3540 cgtcgtttca tcaaggacta ctctaaaata gtgaagccac ttaatgatct gacctcagga 3600 tatcctccgc taagaaaggg tgccaagaag tgtaacaaag gaagccagta ccataaccca 3660 aaagagtcct ttggtgatcg atggacgcct tcttgtgaag aagcatttcg aacccttata 3720 gaaaaactca cttctgcacc tattctggga tttgctgacc ccaaactccc ttattttctc 3780 cacactgatg caagtacaaa gggactaggg gcagcacttt atcaagaaca ggatgggcag 3840 atgcgtgcaa tagcattcgc aagcagaggg ttgtctcaca gtgagtctag atatcctgct 3900 cacaaacttg aattccttgc cctaaaatgg gcagtgactg aaaagtttaa tgactactta 3960 tacggtaacc atttcaccgt tattacagat agcaaccctt tgacttatat cctcactaca 4020 gcaaaattgg atgctacaag ttatcgatgg ctgtcagctc tttccacctt ctctttcaaa 4080 ttgcaataca gagctggcaa gcagaatgta gatgcggata gtctctcaag gagaccccag 4140 gaacccattc ctgagactgc ttggtccagt aaagagcagg aaagaatcca tcaatttgta 4200 caacaccatc accatgatgc tgctgacatt gttagtgccc caaatgatgt agttcatgct 4260 atttgcgaaa agcaccttat caatcaagac gcagcttctg gggttgcact ggtgggatca 4320 cttgcactcc gtcctgatgc tatacctgat gagtatggag aggattgtaa tttagatgga 4380 ttacctgtta tgccttacct accacatgag attggtgaaa agcagagagc agactcagtc 4440 cttcgagaag tcattttcca cttggagttg ggggagaaac cttctcctac agtgcgaaaa 4500 gagattccaa cttttcctct ttttctaaag gagtggaatc gacttgagtt gcgagatgga 4560 atactctata gaagaaggca ggaaaataat ttactcactt accaactagt actccctgag 4620 gagttaagac ccttggtgat gagcagttta catgatgaca tgggtcacct agggattgag 4680 aggactgtag atctgattcg atctcgtttc tactggccaa aaatggctgc agatgtggaa 4740 cgaaaagtca aggagtgtag tcgctgtgtg cgtaggaagg cacaacccca aaaagcagct 4800 cctctggtca attttcatgc cactaggcca ttacagctgg tgtgtatgga tttcctttca 4860 ttggaacctg acaggagtaa taccaaagac atcttggtta ttacggactt ttttaccaag 4920 tatgcagtgg cttatcctac acctaatcag aaggctaaaa cagtagcaaa gtgtctttgg 4980 gagaactttg tcacacacta tgggtttcct gaacgcttgc atagtgacca gggccctgat 5040 tttgagtcac atgtcatcag agagctctgt gaagtgtcag gcataaagaa aagccgaaca 5100 actccctatc acccgcaggg aaacccggtt gaacgcttca ataggaccct gctgggcatg 5160 ttgggtactt tggaggagaa agataaagcc cattggaaag actttgtcaa gccacttgtc 5220 catgcgtaca attgtacgaa gcatgaagtc actgggttta ctccttatga attgatgttt 5280 ggcagacagc caagattgcc tgttgacctt gcttttggtc tgcctcatca tgggaaatca 5340 gatgtcattc ctcattcaga gtatgtgaaa cagctgaaat cacacttgaa ggaaagctat 5400 ttgcttgctt cacaaggtat gttgaaaact gctgaaaaga acaaaaccag gtttgacaaa 5460 tctgtcactc attcttcttt ggaggtcggg gatcgtgtcc tggtgcgaaa tgtcagactg 5520 cgtggtaagc acaagttagc ggacaagtgg gaatctgagg tgtatgtggt ggtaaagaag 5580 gctggcgact tacctgtcta caccgttcat cctgaaaaca gtgaaagtcc ccttcgtacc 5640 cttcataggg accttcttct tccttgtggt tacctgccat tgtcaaacag ttcaaacccc 5700 tccccaccca aaatgtcaag acctagaaca aggcagaatc caggtttcca acaacctgaa 5760 gaggattgtt ctttccttag tccagaagat gaggattatg agtggtatgg tgataaccat 5820 caaaatgtgg aaccattgca gttctctaca gtgtatgatg ttccccaacc agtaaaaggg 5880 aagagtgtgt ccttagctga tgaatcagct ggtcaaaacc tcggaaaaga tggaactcaa 5940 tgcaaggaag gaactcaaaa tgataaccta cctgtaactc ctccagttga caacttacct 6000 gatgatcttc cagccaatga tcctccagtt tataatgcac ctgtacactc ttcagttgat 6060 aacccatctg atgatgttct ggaagtctgt gaaacaactg agacacctgg aaaagaccaa 6120 cctgaaactg ataacagtaa agacttaccc actcagagag agagtaaagt tgaagaggca 6180 aatgattcag aaaactctgt ttcttgtgaa agagaagaac aacaatcaga acaagaaata 6240 ccttcactaa gacgttctgc aagggagaga gacaaacctg agagacttac ctactttcag 6300 cttggaaatc ctctgtctca tgttgttcag tctctgttcc aaggattaag cactgctctt 6360 gttaactctc tcaatagtgt tgaagatctg ggtgattctc ctataacttc tgacattcca 6420 gcaaacatag tgactaccca accgctcaga gcatgcaaag ggacttgcat tgtttcaaga 6480 ggggaagg 6488 // ID CR1-40_DR repbase; DNA; ZEB; 3349 BP. XX AC . XX DT 25-FEB-2009 (Rel. 14.02, Created) DT 18-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW Rex1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-40_DR. XX NM CR1-40_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3349 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 525-525 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(103..306,310..2955) FT /product="CR1-40_DR_1p" FT /translation="MYTLSLPTTDTHFGKLAPSLHIEDRTLSSLTTLCLQT FT QQQSPLSGSHNRSDEGNAAGKEEREPASSSDDAVDFDLRSPPFCWLTFSHW FT TTNFASYGRAFPSNGKQETAALSALQKPGYLRRYQTQLLNFRGSPCTARIE FT RKNSQGKAKEEVFVSLLTTHGVMRGTYTQLRSFCSPDLEYLMLRCRPYWLP FT REFTAVIITAVYIHPQANTEQALRELYGSISEQETAHPEAAFIVTGDFNNA FT NLRKIAPKYYQHITTNTRGDRILDHCYSPFRDAYKSLPRPPFGKSDHSSVL FT LLPAYRQKLKREPPTLRTFQSWTDQSDSILQDCFDHVDWDMFRAACDDDIE FT VYSDTVTCFIRKCIEDXVPTKTVRIYPNQKPWINGEVRTALSVRASAYKSR FT NAEEQKQANYNLRKTIKAAKRQYRDKVEGQFNTNNARSMWQGLNYITDFKS FT NKPATVNIAASLPDELNSFYARFEAQDSARTLRAPAAETETASTLSVSVAD FT VXRSFRRVNIRKATGPDGIPGRVLKACAHQLAGVFTDIFNLSLSQSVVPTC FT FKTATIVPIPKSAKTTCLNDWRPIALTPIFSKCFEKLIKKHICSVLPAHTD FT PLQFAYRNNRSTDDAIAFTLHTALSHLENKNTYVRMLFVDYSSAFNTIVPA FT KLVVKLQALGLHSSLCNWILDFLSSRRQVVRMNNITSSTLILNTGAPQGCV FT LSPLLYSLYTHDCTAKHSSNVIVKFADDTTVVGLITDNDETAYREEVHTLT FT QWCEENHLSLNISKTKELVVDFRREKREHTPITINGTPVERVSTFKFLGVH FT IAEDLTWTAHTDAVLRKAQQRLFFLRRLRRFGMSPHILRSFYTCTVESILS FT GCITTWYGNSTSSNRKGLHRIVRTAGRVVGGELPSLQDIYTRRCMRKAKRI FT ISDSSHPSHRLFSLLPSGRRFRSIRSRTSRLKESFFPQTIRLMNT*" XX SQ Sequence 3349 BP; 895 A; 911 C; 736 G; 803 T; 4 other; catcaagatg gcgccgagca tggccgccgt gttgcgagct cccagcaaac tttgttgtgt 60 tttgtgtgtt ttacttgtat ttttgtcgtt tttctgtgct ggatgtacac actctcatta 120 cctacgacag acacacactt cgggaaattg gctccctcgt tgcacatcga agaccggact 180 ttgagttctt taacgacgct ctgtttacaa acacagcaac agagcccttt gtctgggtca 240 cacaaccgaa gcgacgaagg aaacgcagcc ggaaaagagg aaagagagcc ggcgtcctcg 300 tcagactgag acgccgtgga tttcgacctc cgctccccac cattctgctg gctaacgttc 360 agtcactgga caacaaactt tgcgagctac gggcgcgcat ttccttccaa cgggaaacaa 420 gaaactgctg cattatctgc cttacagaaa cctggctatc tgcggaggta ccagacacag 480 ctgttgaact ttcggggttc tccgtgcacc gcgcggatag aacgaaagaa ctcacaggga 540 aaagcaaagg aggaggtgtt tgtttcttta ttaacaactc atggtgtgat gagaggaaca 600 tacacccagt taagatcatt ttgttctcct gatctggaat accttatgct tcggtgtcgg 660 ccatactggc taccaaggga gttcacagct gttatcatta cggctgtcta catccaccct 720 caagccaaca cagagcaggc gctcagggaa ctgtacggga gcataagcga gcaggaaacc 780 gcacacccgg aggcagcgtt tattgttaca ggggacttta acaacgccaa tctcaggaaa 840 atcgctccaa aatactatca acacatcacc acaaacacgc gtggtgaccg gattctggac 900 cattgctatt ctccgttccg ggacgcatac aaatccctcc cccgcccacc gtttggcaaa 960 tcagatcact cttctgttct gctcttgcct gcttacaggc agaaactgaa acgggaacca 1020 cccaccctca ggacgtttca gagctggacg gaccaatcgg attccatact tcaagactgt 1080 tttgatcacg tggactggga tatgttccgg gcagcgtgtg atgacgacat tgaagtgtac 1140 tcagacacag tcacatgctt catcaggaaa tgcatagaag acrtggtccc aacaaaaact 1200 gtccgtatct accccaacca aaaaccatgg atcaatggcg aagttcgaac agccctatca 1260 gtgcgagctt ccgcctataa atccagaaat gctgaggaac aaaaacaagc aaattacaac 1320 ctcaggaaaa ccatcaaagc agcaaaacgt caatacagag acaaggtaga gggtcaattt 1380 aacaccaata acgcaaggag catgtggcag ggacttaatt acatcacaga ctttaaaagt 1440 aacaaacccg ccactgtaaa cattgctgcg tctctcccgg acgagctcaa ctcgttctac 1500 gcccgctttg aagcccagga cagcgcgcgc acactgcgcg ctcccgcggc cgaaactgaa 1560 accgccagca cactctctgt ttctgtagcg gacgtaasga gatctttccg tcgcgtgaac 1620 atccggaaag ctacgggccc agatggcatc cctggacgtg tacttaaagc atgcgcacac 1680 cagctagcgg gggttttcac ggacatcttt aacctctcgc tctctcagtc tgtggttccc 1740 acatgcttta agacagctac tattgtgccc ataccaaaat cagctaaaac cacatgcttg 1800 aatgactggc gtccgattgc tctgacaccc atcttcagca agtgctttga gaagctgatt 1860 aaaaagcaca tctgctctgt actgcccgct cacactgacc ctctgcaatt tgcatacagg 1920 aataaccgct ccactgatga tgcaattgct ttcaccttgc acactgctct gtctcacctg 1980 gaaaacaaaa acacatatgt gagaatgctg tttgtggact acagctcagc attcaacacc 2040 atagtgcctg ccaagctggt ggtgaagctc caggctctgg gtctacacag ctctctgtgc 2100 aactggatcc tggacttcct gtcaagcaga cgccaggtgg tcagaatgaa caacattaca 2160 tcatccacac tgatcctcaa cactggtgct ccacagggct gtgttctcag cccactcctg 2220 tactccctgt acacacatga ctgtacagcc aaacacagct ccaacgtcat cgttaaattt 2280 gctgatgaca caacggtggt gggcctaatc acagataatg atgagacggc ctacagagag 2340 gaggtgcaca ctctgacgca gtggtgtgag gaaaaccacc tctcactcaa catcagcaaa 2400 accaaggagc tggtggtgga tttcaggaga gagaagagag aacacacccc catcaccatc 2460 aacgggacac cagtggagag agtcagcact ttcaagtttc ttggagtaca catcgctgag 2520 gatttgacat ggactgctca cacagacgca gtgctgagga aggcacaaca acgcctcttc 2580 ttcctcaggc gtctcaggag gtttggaatg agcccccaca tcctccgctc gttctacacc 2640 tgcactgtgg agagcatcct gtctggctgt atcaccacct ggtatggaaa tagcaccagc 2700 agcaatcgca aaggcctaca taggattgtg cgaactgctg gacgcgtagt aggaggtgag 2760 cttccctccc tccaggacat ctacaccagg cggtgcatga ggaaagccaa gagaattatc 2820 agcgactcca gccacccaag ccatagactt ttctctctgc taccctcagg cagacggttc 2880 cgcagcatcc ggtcacgcac cagccggctg aaggaaagct tcttccctca gactatcagg 2940 ctgatgaaca cttaacacac cccacacaga ctcttccata cccctcactg cacaccatca 3000 atatgtagca tgcactgcac tttaaccaat ccatacttga aacaatactg cctacaacta 3060 tgtggacacc tattcattgt acatatcgct gtcaatttta cattgtcctg tttttttatt 3120 tttttttttg rggagtactg tgttattttg cactgtcgtt gtatttgcac tgtctgtatt 3180 ttgcactgtt gttgtatttg cactgtcatt ttatttgcac tgtctgtatt ttgtactgtc 3240 tggagccagc acctaagctt ttcactcatc atagcacacg tgctgctgat gatgtgacaa 3300 taaaagtgat ttgatttgat tttatttgat ttaytgcatc taaatggat 3349 // ID Tc1-8B_DR repbase; DNA; ZEB; 1597 BP. XX AC . XX DT 26-FEB-2009 (Rel. 14.02, Created) DT 26-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc1-8B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1597 RA Bao W. and Jurka J.; RT "Mariner-type families from zebrafish."; RL Repbase Reports 9(2), 561-561 (2009). XX DR [1] (Consensus) XX CC This element is the full-length version of Tv1-8_DR. XX FH Key Location/Qualifiers FT CDS 365..1390 FT /product="Tc1-8B_DR_1p" FT /translation="MMGKNKELSQDLRSLIVEKHFDGNGYRRISRMLNVPV FT STVGAIIRKWKKHKFTINRPRSGAPRKIPVRGVQRIIRRVLQEPRTTRAEL FT QEDLASAGTIVSKKTISNALNHHGIHARSPRKTPLLNKKHVEARLKFAKQH FT LEKPVDYWETIVWSDESKIELFGSHSTHHVWRRNGTAHHPKNTIPTVKFGG FT GSIMVWGCFSARGTGRLHIIEGRMNGEMYRDILDKNLLPSTRKLKMKRGWT FT FQQDNDPKHKAKETMKWFQRKKIKLLEWPSQSPDLNPIENLWRELKIKVHK FT RGPRNLQDLKTVCVEEWARITPEQCRRLVSPYKRRLEAVITNKGFSTKY*" XX SQ Sequence 1597 BP; 537 A; 309 C; 346 G; 405 T; 0 other; cagcggggaa aataagtatt tgacacatca gcatttttat cagtaagggg atttctaagt 60 gggctactga cacaaaattc ctaccagatg tagccatcaa gccaaatatt gaattcatac 120 aaagaaatca gaacatttaa gtatacaagt tgagtcataa taaataaagt gaaatgacac 180 agggaataag tattgaacac atgaagataa caaggtgcaa aatggcatag aaagtcagga 240 gatctgtcag tattgagaga aaaaccctgc tccctatcag tactaattga tatcagctgc 300 tttagtccta attgatggcc tataaaggct tctcattact caggaggcac acaggaaaga 360 cttcatgatg ggtaaaaaca aagaactctc tcaagatctg cgtagtctta tcgttgaaaa 420 gcattttgat gggaatggtt ataggcgcat ttccagaatg ctgaatgttc ctgtgagcac 480 tgtgggggct attatccgga aatggaaaaa gcataagttc accataaacc ggccacgatc 540 aggtgctcca cgtaagatcc ctgtccgagg agtccaaaga ataatcagga gagttctcca 600 agagccaaga accactcggg cagaacttca ggaagacctc gcatcagcgg gtactattgt 660 ttcaaagaaa actataagca atgcactgaa ccaccatggc atccatgcac gctcaccacg 720 caagactcca ttgctgaaca aaaagcatgt tgaggcccgg ttaaagtttg cgaaacagca 780 tttggagaag cctgtggatt attgggaaac tatagtatgg tcagatgaaa gcaaaattga 840 actttttggc agtcattcta cacatcatgt ttggagaaga aatggcactg cccaccaccc 900 caagaacact ataccaacag ttaagtttgg gggtggaagc atcatggttt ggggctgctt 960 ttcagcaagg ggtactggca gacttcatat tattgaaggt aggatgaatg gagaaatgta 1020 ccgggacatt ctggataaaa atctgctgcc atctaccaga aagctgaaaa tgaaaagagg 1080 gtggacattt cagcaagaca atgatcccaa acacaaggcc aaggaaacaa tgaagtggtt 1140 tcaaagaaag aaaatcaagt tgcttgaatg gcccagtcaa tcacctgacc taaatcccat 1200 agaaaatcta tggagagaac tgaagatcaa agttcataaa agaggcccaa ggaaccttca 1260 agatttaaag accgtttgtg tggaagaatg ggccagaatc actcctgagc aatgcagacg 1320 actggtctct ccatacaaga ggcgtctaga agctgtgatc accaacaaag gcttttctac 1380 aaagtattaa gtaaagtgtg ttcaatactt attccctgtg tcatttcact ttatttatta 1440 tgactcaact tgtatactta aatgttctga tttctttgta tgaattcaat atttggcttg 1500 atggctacat ctggtaggaa ttttgtgtca gtagcccact tagaaatccc cttactgata 1560 aaaatgctga tgtgtcaaat acttattttc cccgctg 1597 // ID SINE2-1_DR repbase; DNA; ZEB; 198 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE A family of SINE retrotransposons - a consensus sequence. XX KW SINE2/tRNA; SINE; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; Interspersed repeat; SINE2-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-198 RA Bao W. and Jurka J.; RT "SINE retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1619-1619 (2008). XX DR [1] (Consensus) XX CC SINE-2-1_DR elements are ~95% identical to the consensus. The CC 5'-end is derived from a tRNA gene. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 198 BP; 53 A; 59 C; 35 G; 51 T; 0 other; gtcggcgcca atagcctagt ggttagtgcg tcgacacata gcaccgaggt gctcgcagcg 60 acccgagttc gattcccgtc tcgaggtcct ttgctgatcc ttcccctatc tctgctcccc 120 acactttcct gtctctatat ctccactgtc ctatcaataa aggtgaaaac ccctaaaaaa 180 taattataaa aaaaaaaa 198 // ID Gypsy12-I_DR repbase; DNA; ZEB; 6169 BP. XX AC chr8; XX DT 07-JAN-2005 (Rel. 10, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portions of the Gypsy12_DR LTR retrotransposon - a DE fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy12-I_DR; Gypsy12-LTR_DR; Gypsy12_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy12-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6169 RA Kapitonov V.V. and Jurka J.; RT "Gypsy12_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 317-317 (2004). XX DR Zebrafish.; chr8; Positions 22688544 22682376. XX CC Gypsy12-I_DR is an internal portion of the Gypsy12_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy12-LTR_DR. Gypsy12_DR is characterized by 4-bp target CC site duplications. The internal portion encodes two proteins: CC the 469-aa gag Gypsy12_DR1p (pos. 153-1559) and 1543-aa CC polyprotein (pos. 1475-6105, conceptual translation) composed CC of the protease, reverse transcriptase, and integrase domains. CC PBS is identical to that in Gypsy9_DR. The internal portion is CC flanked by 99% LTRs. XX FH Key Location/Qualifiers FT CDS 153..1559 FT /product="Gypsy12-I_DR1p" FT /note="Gag-protein" FT /translation="MDISQTAQWSTEENINSSRAIVLSNVPLNTSDETIEK FT VLNTVKVFGRTQIHGRRGDVTGKHLFVLVETRADLDPSTIPPEIGIESEAG FT PWPVHFVGRLQVQNPAPENDTFQSKLLTLMQQEGKSMDEVKAILMGEHSPK FT SDINVDLVDAIGKLVDRCNQASNDGPSYRKLRLFSGLKPVPPGEEEYEIWM FT EQAAQMISEWQCTEASKKQRIVESLRGPAADIVRFLKVSQPSATATEYLAA FT LETAYGTTECGPDLMAKFRHTYQDNGEKLSAFLYRLDKLLHRALLKGGIDV FT AGINKARMEQLIKGALTNDMVALRIRMTHTLQNPPSFTQLMKEIREEEHWV FT AARENVKASVATVISPQSDGPSELQSLKKEVKELSSQMSHLLNVATATCAS FT ECAPQKTSSKNSESVKRDKSQPTKLTQQPVPGIFCYKCGEDGHKKWECKGQ FT EDLRKVNQKLIKMHRLQGNWAGVQ" FT CDS join(1466..4891,4894..6105) FT /product="Gypsy12-I_DR2p" FT /note="Polyprotein" FT /translation="KVGMQGTRGPQESKSKADQNASFAGKLGRSSVKERHG FT APGTTRSNCDSFLLDASKPRLPEGLIGPVSEVPVQIEGVYAKALLDSGSQV FT TLLYRSFYDTYLKHLELQPVENLEIWGLSSHKYPYDGYLPLRLEFTESVAG FT VHQIIDTLAIVCPDPVKREGIAILLGTNTSLVKKLLESCRKQAGEEFLNVL FT TIHPVIREAFETIQLTDFSQDDSDTHGTVWFIKHNPVVLKPNQVRQLPGLL FT KFPGQSTESLVLVDRVADSDTSSEHLDVRPELHAASVVSSRQVTVTVRNMS FT NREIWVKRGTPLAHVLPVSLVPQLTAKQPPVQNPLSPASFDFGDSPMPEEA FT KQSLREKMMQRKDVFSLHEWDVGCSKSTTHEIRLNDSRPFRERSRRLAPAD FT LEDVRLHLQELQSSGIISESRSPYASPIVVVRKKSGKVRMCVDYRTLNQRT FT TPDQYTVPRIEDALHSLSGSKWFSVLDLRSGYYQIPMSDADKEKTAFICPL FT GFYQFERMPQGICGAPATFQRVMERTVGDMNFLEVLVYLDDLIVFGRTIDE FT HEERLLKVLDRLSDEGLKISLDKCQFGRTSVNYVGHIVSQDGISTDPSKIE FT AVVSWPKPQTVTELRSFLGFCGYYRRFVKDFSKFCRPLNELLKGYPSTRKN FT RDSLVCNTKPCYKSSEPFGSRWSAQCDTAFETLKKCLTQAPVLAFADVQKP FT YVLHVDASMDGLGGVLYQEHESGLRPVAFISRSLSPSERNYPAHKLEFLAL FT KWAVVDRLHDYLYGVPFEVRTDNNPLTYITKSAKLDAAGHRWLSALTTYNF FT SLKYRPGRRNVDADALSRRPHTYRSEDDEWQEIPAVGVRTFCQAVSLEKRA FT ENGFYTCVVKQAGAHMSAVPKAYSHVMEVAADHLPLFSSSDLQTAQRNDSL FT LGEVWKAVCDKKPASSIRSSHPSMKILKREWEKLVVNNGMLYRIVRQSNHK FT VKQQLVLPKQFHSSVLKSLHDDIGHLGFEKSYGLVRDRFYWPHMKPDVESY FT CKTCERCIKRKTLPQRAAPLSHMQSSGPLDLVCIDFLSIEADSRNVCNVLV FT VTDHYTRYAQAFPTRDQKASTVAKTLWEKYFIHYGLPTRIHSDQGRDFESQ FT LVSEMLTMLGIKKSRTSPYHPQGDPHLRDLTELCNMLGTLQPSQKSKWSQH FT IARLVHAYNCTVNEATGFSPYFLMFGREARLPVDVCFGVSADSSSSGSYSK FT YVSKMKQELQAAYQLAQVSSEKMNQSNKARYDQKVRYHSLSVGDRVLIRNL FT GLKGKQKLADRWSENPYVVESQLSGIPVYRLKPVDGNGPIKVMHRNHLLPL FT GQEVRLKPKVDLGPTSLPKNLRHRSVKDKRKTAQSENPPIAVDVFSREHDS FT SDSDSEYGCYVEDMAPISSESAQEIQGETPEQAVECLDYSNAHRVSEIPVI FT PCQSETSEFQLTDADRNTDVVDDVMTEVSESTSQTTVQTTDNETQPEIVPF FT EVRRSNRERKPSTRFTYDKLGVPYLHSVSSKCCGINALTTDVLNVYGSLNK FT SHAWWCNPNALCKTCKNRPVLVPCKQMVAM" XX SQ Sequence 6169 BP; 1783 A; 1200 C; 1458 G; 1728 T; 0 other; tttttggagg caccgctggg atcttgtttt tttccttttt accagatttt tttttctctt 60 tagcagaaaa aaatatatat atctgaaata gtgttaatta atattgatat atttttctgg 120 cttattttgc atagtaactc attagttgaa gtatggatat ctctcagact gctcaatgga 180 gtacagagga gaacataaat tcctcacgtg ccattgtgtt aagcaatgtt cctttgaaca 240 ctagtgatga gactattgag aaagtgttaa acacagtgaa ggtttttggt cgtactcaaa 300 ttcatggtcg acgtggtgat gtgactggaa aacatttgtt tgtgttagtg gagactagag 360 ctgatcttga tccaagcacc ataccacctg aaataggtat tgaaagtgaa gctggaccct 420 ggcctgtaca ctttgtaggt agactacaag ttcagaaccc tgctcctgaa aatgacacat 480 ttcagtccaa gttgttaaca ttaatgcagc aggagggcaa gtctatggat gaagtgaaag 540 ccattttgat gggggagcat tctcctaaat ctgatattaa tgtggattta gttgatgcca 600 taggtaaatt agtggacagg tgtaatcaag cgtctaatga tggacccagt tacagaaaac 660 taaggttgtt ttcaggtctg aaacctgttc ctccaggtga ggaagaatat gaaatctgga 720 tggagcaagc cgcacaaatg atcagcgaat ggcaatgcac tgaagcttct aagaaacaac 780 gcattgttga gagtttgcga ggtcctgctg ctgatatcgt taggtttcta aaagtgagcc 840 agccatctgc cactgcaact gagtacttgg ctgctcttga aactgcgtat ggaactactg 900 agtgtgggcc tgacttgatg gctaaatttc gtcacactta ccaggataat ggagaaaaac 960 tttcagcttt cttgtatcgc ttagataaac ttcttcacag agcgttgtta aagggtggga 1020 ttgatgtagc tggaataaac aaagctagaa tggagcagct aattaaggga gcacttacca 1080 atgatatggt tgctctgcga atcagaatga ctcacacttt gcagaatccc ccatctttta 1140 cacagttaat gaaggaaata cgtgaggagg aacactgggt ggctgcaagg gaaaatgtca 1200 aagcttcggt tgccactgtt atctctcctc agtcagatgg gccctctgag ttacaaagcc 1260 tgaagaagga ggtgaaggag ctatcttcac agatgagtca cctattgaat gtggctactg 1320 caacgtgtgc ttctgagtgt gctcctcaga aaacatctag taaaaactct gagagtgtga 1380 aacgagacaa atcacaacca actaaactca cccagcaacc agtgcctggg atcttttgct 1440 acaaatgtgg tgaggatgga cataaaaagt gggaatgcaa gggacaagag gacctcagga 1500 aagtaaatca aaagctgatc aaaatgcatc gtttgcaggg aaactgggca ggagttcagt 1560 gaaggaacgg cacggggctc ctgggacaac acgttccaat tgtgattctt ttttacttga 1620 tgccagtaaa ccaagattgc ctgaagggtt gataggacct gtttctgaag tgcctgtcca 1680 gatagaaggt gtttatgcaa aagcccttct tgacagtggc tcacaggtga ctctattata 1740 ccgcagtttt tatgacactt atctaaaaca cttggaactt cagcctgtgg aaaaccttga 1800 gatatggggt ttaagttcgc ataaataccc ctatgatggg tacttgcccc ttagacttga 1860 gtttacagag agtgtagctg gagtgcatca aataattgac acacttgcga ttgtatgccc 1920 tgaccctgta aagcgagaag gaatagccat tttgctcggg actaacacta gtctggtgaa 1980 gaagctactt gagtcttgtc gtaaacaagc tggcgaagaa ttccttaatg tgctgaccat 2040 acatcctgta atcagagaag catttgagac tattcaacta acagattttt ctcaagatga 2100 ctccgacacg catgggacag tttggttcat aaagcataac ccagttgtcc taaaaccaaa 2160 ccaagttagg cagcttcctg gtctattgaa atttcctggt caatcgactg agtcattagt 2220 attagttgac agagtagcag atagtgacac aagttctgag cacctagatg tgagacctga 2280 actgcatgca gcatctgttg tatccagtcg gcaagttaca gtgactgtga ggaacatgtc 2340 taatagagaa atatgggtga agagaggaac tccgcttgca cacgttcttc cggtgtcctt 2400 agtgccacaa ttgactgcta aacaaccacc agtacaaaat cctttgtcac ctgcttcttt 2460 tgattttgga gattccccaa tgcctgagga agcaaaacag agcttacggg agaaaatgat 2520 gcagagaaag gatgtgtttt ctctacacga gtgggacgtg ggctgttcaa aaagcaccac 2580 tcatgagata aggttgaatg attcgcgccc tttcagagag cgatctcgtc gtcttgcccc 2640 tgctgactta gaagatgtgc gactgcattt acaagaactg cagagtagtg gtattatttc 2700 tgagtctcgc agcccctacg cttcacctat tgttgttgtg cgtaaaaagt cagggaaggt 2760 tagaatgtgt gtcgactatc gaacacttaa tcaacgaact acaccagacc aatatactgt 2820 gccccgcatt gaagatgctc tccatagtct atcgggaagt aagtggttca gtgttctcga 2880 tttgaggagt gggtactacc agatacccat gagtgatgct gataaggaaa agactgcgtt 2940 catatgcccg ttagggttct atcaatttga acgtatgcct cagggtattt gtggagcacc 3000 cgccactttt caaagagtca tggaacgtac tgtaggggat atgaactttt tggaagtgct 3060 tgtatacctt gatgatttga ttgtctttgg gcgaaccatt gatgagcacg aagagcgtct 3120 tttgaaagtg ctcgataggc taagtgatga gggactaaag atctcccttg acaagtgtca 3180 gtttggtagg acttcagtga actatgtagg acatatagtg tcacaagatg gaatttcgac 3240 agatccgtcc aagatagagg ctgttgtatc ctggcctaag ccccagacag tgacagagct 3300 caggtctttt ctaggattct gtggatatta caggcgcttt gttaaggatt tctcgaagtt 3360 ttgccgccct cttaatgaat tgctgaaggg atatccatct accaggaaga acagagattc 3420 acttgtttgc aacactaagc cctgctataa gtcctctgaa ccgtttggtt ctcgatggtc 3480 ggctcagtgt gatacagctt ttgaaacgtt gaaaaagtgt ttgacacaag caccagtgtt 3540 agcctttgct gatgtacaga agccctatgt cttgcacgtg gacgcaagca tggatggact 3600 aggaggagtt ttgtatcaag aacatgagag cggattacgt ccagtagctt ttatcagtcg 3660 cagcttatcc ccttcagaga gaaactatcc agcccataaa ctggaattct tagcactgaa 3720 gtgggctgtt gtggatcgac tgcacgacta tctatatggt gttccgtttg aggttagaac 3780 cgacaataat cctctaacct atataacaaa atcagcaaaa ctggatgcgg caggtcaccg 3840 ttggctgtct gccttgacca cctacaattt cagcctaaaa tacagacctg gccgcagaaa 3900 tgttgatgct gatgcgttgt cgaggcgtcc gcatacttac cgcagtgagg acgatgagtg 3960 gcaggaaatt ccagctgtag gggtgaggac tttttgtcaa gctgtgtccc ttgaaaagag 4020 agcagagaac ggattttaca cttgtgtggt gaaacaagct ggagcacaca tgtctgctgt 4080 ccccaaagct tactctcatg ttatggaagt agctgctgat cacctaccct tgtttagttc 4140 aagtgacctt caaacagctc agaggaatga ctctttactt ggcgaagtgt ggaaagctgt 4200 gtgtgataag aaacctgcta gtagcattcg aagtagccat cctagtatga agattctgaa 4260 acgggagtgg gaaaaattgg tagtaaataa tgggatgctt tacagaattg tcagacagag 4320 taatcacaaa gtgaaacaac agcttgtact accaaaacaa tttcacagct cagtgctgaa 4380 gtctttgcat gacgacattg gacatttagg gtttgagaag tcctatggtc tagtccgaga 4440 tcgattttat tggccacaca tgaagcctga tgtggaatcc tactgtaaga cctgtgagcg 4500 ctgcatcaaa agaaagacgt tacctcaaag agcggctcct ttgtcacata tgcagagttc 4560 aggacctctg gaccttgtat gtattgattt tctttccatt gaagctgact ctcgaaatgt 4620 gtgcaatgtc ttagtggtta ctgaccatta cacacgctat gcccaagctt ttcccactag 4680 agatcaaaaa gcttcaacag tggcaaagac tttgtgggag aaatatttca tacattatgg 4740 tctccctact cgaattcact ctgaccaagg cagagatttt gagagccagt tagtgtctga 4800 aatgctgact atgctaggga ttaagaaatc tagaacatca ccatatcatc cccaaggtga 4860 tccccacctg agagatttaa cagaactctg ttgaatatgc tgggtacctt acagcctagt 4920 cagaaaagta aatggagcca gcacatagca cgtttagtac atgcttataa ttgtactgtc 4980 aatgaagcta caggtttctc tccttacttt ttgatgttcg gccgagaagc tagactgcct 5040 gttgatgttt gctttggtgt gtctgctgat agctcatcat ctggttccta ttcaaagtat 5100 gtgtccaaaa tgaagcagga attacaggca gcttatcagt tggctcaggt ttcctctgaa 5160 aagatgaatc aaagtaacaa agcaaggtat gatcagaaag ttcgctatca tagtttaagt 5220 gtgggggaca gagttctgat ccgaaacctt ggtctcaagg gaaaacaaaa acttgccgat 5280 aggtggagtg aaaatcctta tgtagtggaa agtcaattgt ctggtattcc agtttatcgt 5340 ttgaagcctg ttgatggtaa tggaccaatt aaagtcatgc accggaatca cctcttgcct 5400 ttaggacaag aagtaaggct aaagcccaag gtagatttag gtcctacttc tttacctaag 5460 aatttaaggc atagaagtgt gaaagacaag cgtaaaactg ctcagtcaga aaacccgcct 5520 attgcagttg atgttttctc aagagaacat gattcttcag attcagactc tgaatatggg 5580 tgttatgttg aggatatggc accgatttca tctgaaagtg ctcaggagat acagggtgaa 5640 actcctgaac aagctgtaga atgcttagac tatagcaatg ctcaccgtgt gtctgaaata 5700 ccagttatcc catgtcaaag tgaaacctct gaatttcagt taaccgatgc agataggaac 5760 actgatgtag tggatgatgt aatgactgag gtttctgaga gtacatcaca gactactgtt 5820 cagactactg ataatgaaac tcagccagag attgtacctt ttgaagtacg taggtctaat 5880 agggagagaa aaccttctac cagatttact tatgacaagc tgggtgtacc ataccttcat 5940 tctgtatcat ctaaatgctg tggtattaat gcacttacta cagatgtgct taatgtttat 6000 gggagtttaa ataaatcaca tgcttggtgg tgtaatccca atgctttgtg taaaacctgt 6060 aaaaaccgac ctgtacttgt gccctgtaaa cagatggttg ctatgtaatt gatctgcagc 6120 aatttagtaa ccagcatggg gacatactgg atatttggtg gggggagta 6169 // ID Gypsy154-LTR_DR repbase; DNA; ZEB; 125 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy154-I_DR; KW Gypsy154-LTR_DR; Gypsy154_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-125 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1570-1570 (2008). XX DR Genome; chr3; Positions 403683 403559. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 125 BP; 34 A; 26 C; 23 G; 42 T; 0 other; tgagtttgct tagaagaacc ggatccgcgt gattcatttg aattactgac agactgcctt 60 tgttgaacta ttaataaaca ttgattgaac tgcacaaacc tctcctttga ttctgttccg 120 tgaca 125 // ID DNA-8-28_DR repbase; DNA; ZEB; 582 BP. XX AC . XX DT 12-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-28_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-582 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2151-2151 (2008). XX DR [1] (Consensus) XX CC Copies are ~88% identical to the consensus. TIR is ~30-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 582 BP; 209 A; 109 C; 94 G; 170 T; 0 other; cagggtgcga attacagggg ggtttggggg ggttgacccc cctaattaac gcttgatccc 60 ccctgaagga catcaaaaca agatgtatgg ggggggggtc agccctttaa tagtaagaaa 120 tagctttctc tgatctctga tctatatctt aaatattaat aattaaaaat gtataatata 180 aatatacatt tgaccccccc tataacggtt tataccattg tgaatgcata atcgcgccaa 240 tcaatgctag gaaaaatatc attcaacagt ctgaaactgg cagttctaga gcaccgatag 300 acatcttaag tattcacaaa acacgtttct tgtaaaatag gtgtttatat ctgcatgtag 360 cctaaaaaaa aaaaaaatta gacataattt gtatagtttg acctacagta acctctaaaa 420 tagtcactaa atatccctca aaacactgaa aatgtataca ttttattaat aaattagatg 480 taatttaaat acattcataa atttgattca ctgaagcatg tattaccaaa ctactaccga 540 aaaagttgac cccccgatcg tcaatgtata attcgcacac tg 582 // ID Harbinger-N11_DR repbase; DNA; ZEB; 1211 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.09, Created) DT 03-OCT-2008 (Rel. 13.09, Last updated, Version 1) XX DE Harbinger-N11_DR is a non-autonomous DNA transposon that belongs DE to the Harbinger superfamily. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; Harbinger superfamily; KW Harbinger-N11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1211 RA Bao W. and Jurka J.; RT "Harbinger DNA transposons from zebrafish."; RL Direct Submission to Repbase Update (03-OCT-2008). XX DR [1] (Consensus) XX CC TSD is mostly TWA. TIR=60-bp. Its terminal sequence is similar to CC that of HARBINGER1-DR. Based on these data, it is classified as CC Harbinger elements. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1211 BP; 323 A; 287 C; 231 G; 369 T; 1 other; cgcctgtttc acaccgcaag cgtcagcggc gcgtgagcag agcgtgagca gcgcgtatgt 60 cgagcagtag cagcacgcag gcgtttgcct ttcacaccag ctgcgtctgc agcgcgcagc 120 tcttcggcag ctgctgctga gatcaaccta tctctgtcta ttctatctag ttacccatgt 180 ctctctatat acaaatacac tttttaaact tttcatctgc accaaggccc ctcctatgct 240 gtttcattaa cctgcaaatg tttattttac aaattttata gatcaaatag tactgtatgc 300 ttctcaagct aacttatatg agaagtgtct acagtcagaa gacaatcgcc tgtgatatca 360 tgtcatttgt tttttgatta tcaggttgct gtagacctag ttataatcat atttatacaa 420 tatagttata atgatattta tacatgatca aactagtgtt actttctccg cttcagtgat 480 gtccagcggc agaataacag ctcgttaatt catgctcgtg cagatgatga gacggacaaa 540 agtaattaat gcatgccatt cttttactta ttttcgtgtt gtcagattca cagtgcaaac 600 agctcccggt aggaataact cgagtgaaca taaaacaaag ccgattaaaa ctttcttsct 660 ctgctcagct gcacagaaca cagcgagctc acataatcca gcatgcgtgt cgcactacac 720 tacccacaat acacttcgct atggactaca aatcccgtaa atattctatt tcccctctcc 780 tacaacacta gtgtgcaact taagcaaaac tgatagtaaa attgttttac atttggtttt 840 gtagttcggg cctaaataaa tgtttgttgc cgtttttaat cgttttaagt tcatttgaat 900 gactatatat aaaccatttg acattattaa cgcatttatt gggtaaaatt gctacttttt 960 actgtcattc aatgtgtttt ggtcattatc aatgcactgt cactttaatt gagcgtgagc 1020 agcgcgtcaa aaatagaccc agcgccgaaa ctatcgctgc actgctgctt cggcgacgct 1080 cacgcctcgc tctgacgcgc tgctgctgcg tgcggtatga aagctctaat ctgttaacat 1140 ggacgccgaa aacacacgcg ctgctcacgc tctgctcacg cgccgctgac gcttgcggtg 1200 tgaaacaggc g 1211 // ID TDR11 repbase; DNA; ZEB; 242 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR11. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-242 RA Jurka J. and Drazkiewicz A.; RT "TDR11: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 21-21 (2002). XX DR [1] (Consensus) XX CC Contains 33 bp imperfect TIRs. XX SQ Sequence 242 BP; 85 A; 45 C; 35 G; 77 T; 0 other; ttttggtaac actttatttt gatggtccat ttgagtatta gtagactgtc tgcttaatat 60 ctgttgatac tgctccttca acagacattt aactgactat aagaaacttt gcaagtacat 120 gtcaacttac actaacccca acctaacagt ctacttataa tctaatgaag aattagttgg 180 catgtagatg caatgtaact taaattcaac aaacggacca tcaaaataaa gtgtgaccaa 240 aa 242 // ID Gypsy-29-I_DR repbase; DNA; ZEB; 5173 BP. XX AC NW_633549; XX DT 29-NOV-2006 (Rel. 11.11, Created) DT 13-DEC-2006 (Rel. 11.11, Last updated, Version 1) XX DE An internal portion of the Gypsy-29_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-29-I_DR; KW Gypsy-29-LTR_DR; Gypsy-29_DR; endogenous retrovirus; gag; KW integrase; protease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5173 RA Dib M.R. and Naveira H.F.; RT "Gypsy-29_DR, a family of LTR retrotransposons from zebrafish."; RL Direct Submission to Repbase Update (29-NOV-2006). XX DR EMBL/GenBank/DDBJ; NW_633549; Positions 1 5173. XX CC Gypsy-29-I_DR is an internal portion of the Gypsy-29_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-29-LTR_DR. CC Consensus obtained after the alignment of at least three CC independent insertions bearing at least 85% Homology over at CC least 1000bp. CC The internal portion contains two ORFs encoding the 412-aa CC Gypsy-29_DR1p gag (pos. 62-1297) and 1292-aa Gypsy-29_DR2p pol CC proteins (pos. 1297-5172) composed of the protease, reverse CC transcriptase, and integrase domains. Some insertions fairly CC recent, according to the hight identity between their flanking CC LTRs. XX FH Key Location/Qualifiers FT CDS 62..1297 FT /product="Gypsy-29-I_DR_1p" FT /note="ORF1 (gag-like)." FT /translation="MWKKTQGKIKSESKDQSQRFRGALDGSMSPPEHLIKQ FT EDRWESHRARVIDATRRMAADAGLSLPEPSAAESYRTVTRETNKHGGAHVT FT TRGTDENKHSSPQGLALSADVNIRRGTFGTPPGADKNGHDGAHGAALPDDV FT NKHGGAHGSALSADAYECSGGRGATAIVTVKTPKFSGKADWEAFHAQFELL FT ARAGGWSEDRKALELAMCLTDDALSCLLLLSPEHRHDYNALVGALKRRFGQ FT FEQSGLLRSELNNRRRLPGEQLRVLANDIENLTRRAYAHMPPSVQGELARD FT QFVQALAPVELRRQTQLAHPHTLQDALEMALEREIVCGTPKLEESHHNIVP FT VRSADVGEGLSERPAWVSEITELIRSVTLQAEQRPRTRPRVCWGCGQTGHL FT VRQCPNVAGNQGNGLGSA" FT CDS 1297..5172 FT /product="Gypsy-29-I_DR_2p" FT /note="ORF2 (pol-like)." FT /translation="MSGDVRTLNSHPDHETFPSGVKGVGQTKRKHVLLGEG FT KYGPKESVVVVGRTCVSDFCHVPILVEGVPCVALVDTGSTVTVLRPDLVPT FT NARLDPTTVLLRTVTGELAPMKGKGEVELTVGGKTLQHSVWVATVQDPCIL FT GLDFLRSAGCKLDLKNGMLCFEGGPEVVMSTGNDRDMSFTGLLEAKSSHGK FT PNGGEHIPHLQNHNKADNSPQGVKNSPTSLALAPPVVYPRVAPQFHDMQGQ FT GTSTLISPLGEETGLGDLRELWCKNCEGLDECQQKQFWDLLFKFKHCFAFR FT EGDVGQTHLVQHEIETGDATPVRLRPRRLPFARQDAADKALTEMAQAGIIE FT PSESAWAAPVVLVPKKGGQWRFCVDYRRLNEVTRKDSYPIPRVDESLDLVV FT GSSWFTSLDLRSGYWQVPLAPDARPKTAFCTSRGLWQFRVLCFGLCNAPAT FT FARLMDKVLSGIPRSECLVYLDDILVHGSSFQTSLESLSHVLVRVSESGLK FT LHPEKCKFFRKEVTFLGHKLGGQGIGTVEEKVQAVSNWPAPTTAQQLKSFL FT GLASYYRRFVEGFASIADPLFRLLKKDKSFRWTEECQDSFAALKLSLVTAP FT ILAPPDPALPFILDTDASNVGMGAVLSQVGASGERVIAYYSKAFSKAERRY FT CVTRRELVAMLFAVRHFKYYLCGLHFTVRTDHSALQWLMSFKEPEGQLARW FT IEELQAYNMKVVHRTGLSHANADALSRRPCAIAGCRYCDKKEARDRELGNV FT DAGENSADCKCGDTLVCAAMEVVGIEDWKRLQEQDVDLQPVIQWVQAQQRP FT PWEEVAIMSLSTKTLWAKFKGLRLRDGVLERGWKDPTTGEERWQVVVPKVL FT REGVIGASHGTVGSGHFGVSKTLNRVRQCFYWGRVRRDVEDFCRRCDACTA FT RKGPPGRSIAPLQQFPVGGPMERVGVDVMGPFPRSERGNRFVLTSMDYFTK FT WPEAYALPDQEAETVVDALVDGMFSRFGVPESIHSDQGRNFESRVFAVMCE FT KLGIHKTRTTPLHPQSDGLVERFNRTLAEQLAIVTETNQQDWDRHLPFILM FT AYRSAVQDSTLCTPALLMLGREIRTPGEMAYGRPPDAPAVPPGPEYASRLQ FT ASLETAHSFARQHLLKAGIRQKRNYDLKTKGRHFTAGERVWVYSPQRKKGR FT CPKLDAQWVGPCIVLERIGEVVYRVQMPPRGRKVAVHRDRLAPYRGCASAD FT TVLASPTVMPVLENEAGHGLGSSNTIDVLRETTPVPELQSSHSGGPVADRG FT SPRSQRVKRPPRRLQDYVCSLEVEELCEEGA" XX SQ Sequence 5173 BP; 1283 A; 1132 C; 1520 G; 1238 T; 0 other; ctggtgtcag aagtgaactg gatagccgca tttacttatt tttcgtcctg caaccaacgc 60 gatgtggaag aagacccaag gaaaaataaa gtccgaatct aaggaccaaa gtcaaaggtt 120 tcgtggggct ctggatggat caatgtcgcc tcctgagcac ttgattaagc aggaagatcg 180 atgggaatct cacagagctc gtgtgattga tgccacgagg cggatggcag cagatgccgg 240 gctttcgtta ccagagccga gtgccgccga gagttaccga acagtcacga gagaaacaaa 300 caaacatggc ggcgcccatg ttacaacgcg aggtactgat gaaaacaaac atagcagccc 360 ccaagggttg gcgctgtcgg ctgatgtaaa tatacgccgc ggcacctttg gaaccccgcc 420 gggtgcggat aaaaatggac atgacggcgc ccacggagct gcgctgcctg acgatgtaaa 480 taaacatggc ggcgcccacg gatccgcgct gtctgccgac gcatatgaat gcagtggcgg 540 ccgcggcgct acagctatag tgactgttaa aacacccaaa ttttcaggca aagctgattg 600 ggaagctttt catgctcaat ttgaattgct ggctcgagct ggaggctggt cagaggaccg 660 caaagctctt gaattggcta tgtgcctaac tgacgatgcc ctgtcctgct tgttacttct 720 aagccctgaa cacagacatg actataatgc acttgtcggt gctctgaaac gacgttttgg 780 ccaatttgaa cagtcaggcc tcctgcgttc tgaactgaat aatagacgca ggcttcctgg 840 agagcagctg agagttttag caaacgacat tgagaatctg actcgacgtg catatgccca 900 catgcctccc tctgttcaag gggagcttgc aagagaccag ttcgtccagg ccctggcccc 960 tgtggagctc cgcagacaaa ctcaactggc ccacccacat accctacagg atgctttgga 1020 aatggctttg gagagagaaa ttgtttgtgg aacccctaaa ttggaagaga gtcatcacaa 1080 tattgttcca gtaaggtctg ctgatgtggg ggaggggtta agtgaaagac cagcctgggt 1140 ctcagagata actgagttaa ttagatctgt gacacttcaa gccgagcaac gtcctcgcac 1200 gagaccaagg gtttgttggg gttgtggtca gactggacat ttagtcagac agtgtcctaa 1260 tgtagcgggg aaccagggaa acggcctggg gtccgcatga gcggggatgt gcggaccctc 1320 aactctcatc ccgaccatga gacttttcca tcaggggtca aaggagtggg gcaaacgaag 1380 agaaaacatg ttttgttggg ggaggggaaa tatgggccca aagagtctgt agttgtggtg 1440 ggtcgaacat gtgttagtga tttttgtcat gtaccaattt tagtagaagg agttccatgt 1500 gtggcattgg tggatacagg ctctactgta acggtcttga gaccagattt ggtaccaacc 1560 aatgccagac tggatcccac cactgttctg ttacgcacag tgacagggga acttgcacca 1620 atgaagggta aaggggaggt ggagctcact gtcggaggga agacattaca acattcagtg 1680 tgggtggcta ctgtccagga cccctgtatt ctcggtctag acttcctaag aagtgcaggc 1740 tgtaagttag acttgaaaaa tggtatgctt tgtttcgaag ggggtcctga ggttgtcatg 1800 tccactggaa atgacaggga tatgtctttt acaggccttc tagaggcaaa atcttctcat 1860 gggaagccta atggggggga acatatacca cacttgcaaa accacaacaa agctgacaac 1920 agtcctcaag gggtaaagaa ctctcctact tccttagccc ttgctccgcc tgtggtctat 1980 cccagggtgg ccccacaatt ccatgacatg caagggcagg gaaccagcac tctgatttcc 2040 cctcttgggg aggagacagg gttgggggat ttgagggagt tatggtgtaa gaattgtgag 2100 ggcttggatg agtgtcagca gaagcagttt tgggatttgc tgtttaaatt caaacactgt 2160 tttgcattta gggagggaga tgtggggcaa acacacctag tacaacatga aatagagact 2220 ggagatgcca cacctgttag gctgcggcca cggaggctcc cctttgcacg acaggatgct 2280 gctgataagg ccttgacaga aatggcacaa gcagggataa ttgagccctc cgagagtgca 2340 tgggctgcac cggtagtctt agtgcctaag aaggggggcc agtggagatt ttgtgttgac 2400 tacagacgcc ttaatgaagt gaccagaaag gactcctacc ccattccccg tgtcgacgaa 2460 tcactggacc tggtggtggg gtcctcatgg ttcacttcct tggaccttcg ttctggctat 2520 tggcaggtac cgcttgcgcc tgatgcaagg cctaaaacag cattttgcac cagtcgaggg 2580 ctgtggcagt tccgggtgct ctgtttcggt ctctgcaatg cacctgccac gtttgcaagg 2640 ttaatggaca aggtgttgtc gggtattcca aggtctgagt gtctggtgta cctagatgac 2700 atactggttc atggtagctc ttttcaaacc tcactggaat ccctcagtca tgtgctggtt 2760 cgtgtgagtg agtctggctt gaagctgcac ccagaaaaat gtaaattctt taggaaggag 2820 gtgacatttc tagggcacaa attgggagga cagggaattg gcacagtgga agagaaggtg 2880 caggctgtga gcaactggcc tgctcccaca acagcacagc agcttaaaag ttttcttggc 2940 ttagcgtcat attacaggag gtttgtggag ggttttgcca gcattgcaga cccacttttc 3000 cgtctgctga agaaggataa atcctttagg tggacggagg agtgtcagga ttcttttgct 3060 gccctgaaac tgtctctggt tacagcccca atcttggccc cccctgatcc ggcacttcct 3120 ttcattttag acacagatgc cagcaatgtg ggtatggggg cagttctctc tcaggtgggg 3180 gctagtgggg agagagtcat tgcctactat agcaaagcat tcagtaaggc tgaacgcagg 3240 tactgcgtta ccaggcggga actcgttgca atgctttttg ctgtgcgaca ttttaagtat 3300 tacctgtgtg ggttacattt cacagtacgc actgaccatt ctgctctcca gtggcttatg 3360 tcatttaaag agccagaagg ccaactggca cgctggattg aagaacttca ggcgtacaac 3420 atgaaggtag ttcaccgtac agggttgagc catgcaaacg cagatgctct gtctcgccgg 3480 ccgtgtgcga tagctgggtg tcgctactgc gacaagaagg aagccaggga ccgagagctg 3540 gggaatgttg atgcagggga gaattctgct gattgtaagt gtggggatac actggtgtgt 3600 gcagcaatgg aggttgtggg catagaggac tggaagcggc tgcaagagca agatgtagac 3660 ctgcagccag taattcagtg ggttcaagcc caacaacgac ctccgtggga ggaagtggct 3720 attatgtcac tgtcaacaaa aactttatgg gccaaattta agggcttgcg tctgcgggat 3780 ggtgtgttgg agagagggtg gaaagacccc accactggag aggaaaggtg gcaggtggtg 3840 gtaccaaaag tgttgcgtga gggggttatt ggggcttctc atggtacagt ggggtctgga 3900 cactttggtg tttcaaaaac acttaatcga gtgaggcaat gtttttactg ggggcgggta 3960 aggagagatg tagaggattt ttgtcggcgt tgtgatgctt gcactgcccg taagggtccc 4020 cctggtcggt ccattgctcc tcttcagcag ttccctgtag gagggccaat ggagagggtg 4080 ggggtagatg taatgggccc attcccacgc tcagagaggg gtaacaggtt tgtgttgacg 4140 tctatggatt attttacaaa atggccagag gcgtacgcac tgcctgacca agaagctgag 4200 acagtggtag atgcattagt ggatgggatg tttagccgtt ttggggtccc tgagtccatc 4260 cacagtgatc agggccggaa ctttgagtca cgtgtatttg cagttatgtg tgaaaaactg 4320 gggatccaca agacaaggac taccccttta cacccacaaa gtgacggtct tgtcgagaga 4380 tttaatagga ctctggcaga acaattagcc attgtaactg aaaccaatca gcaagattgg 4440 gacaggcact tgccttttat tttaatggca taccgatcag ctgtgcaaga ctcaacttta 4500 tgcactcccg cccttttaat gttgggcaga gaaatcagaa cgcccgggga aatggcttat 4560 ggaaggcctc ctgatgcacc tgctgtgcca ccgggccctg agtatgcaag taggcttcag 4620 gcctctttgg aaaccgctca tagttttgca aggcaacact tgctgaaagc ggggatccga 4680 cagaagcgga attatgactt gaaaacgaaa gggcgacact tcacagcggg agagagggtg 4740 tgggtgtaca gcccacagag gaagaagggc aggtgcccta aacttgatgc acagtgggtg 4800 gggccttgca ttgtactgga gagaattggg gaagttgttt accgtgttca gatgccacca 4860 agaggcagga aagtagcggt gcacagggat agactggctc cgtacagagg gtgtgcttca 4920 gcagatacag tgctggcctc tcctacagtt atgcctgttc tggagaatga ggcgggacac 4980 gggttgggta gttctaatac tatagatgtc ctccgtgaaa ctacaccagt tcctgagcta 5040 caaagttcac attctggtgg ccctgttgca gataggggct cacctcggtc tcagagagtc 5100 aagaggccac caagacgtct ccaggactat gtttgttccc tcgaggtcga ggaactttgt 5160 gaggaggggg cag 5173 // ID hAT-N57_DR repbase; DNA; ZEB; 1286 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N57_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1286 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1326-1326 (2008). XX DR [1] (Consensus) XX CC TIR is 15 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1286 BP; 385 A; 227 C; 252 G; 421 T; 1 other; tagggctggg cgataaaatc ggtatcggta tttattgacc gaacaccatt gtcaatatcg 60 ataaaaaaaa atttggtatg aagtttcggt atgaagcgct atagttttat taaaatgtgg 120 ctgctgagcg cgcgccttgg aagcaatgcc aataagctta gggtgtaagt ttgtcatttc 180 caatggaggc gcgcgacttg catgacgcgc atatgggagc gacgcgcaca tgttgtgcaa 240 gtggcgcaca cgtgacgcac ctagttgaaa aacatctgaa cttttccgaa tgccgcgagc 300 gcaccgcgat tcatgtaagt agaactaacc aatctgcttc acactttgta tggaatatac 360 acatttcagt aataacggta gactaattac agactaatta cctcagacaa acacagcgca 420 cccaaacact gcagtgcttt ttacttttct ccataaactt gcatcggagc tgcagcaatg 480 gtttgtccgt ttgtcatcct ctgagaaaaa tggttgatgg tcgcctagtt acgagagacg 540 ccaggggagc gcgagcgcaa agtccattga aaatggtgaa ggaaaccacg cgctcgcgct 600 tgtcacttca ggtcagaata acaacacatg cgaaaatgag tgccgtagct agcagcagtg 660 aggagattgt aaataaaaaa agatttaagg atgtcgccag agtggctttt tggtttcttt 720 tcttgtgcat cccagccact agctccccat ctgaaagatt tttttagcat agggggtaat 780 atagttttta gtcattcaac ttcacaattt gtaatgttgc agtatgtatt tgaataatga 840 tggttagttc ctttacttga aagctcagat ttgattatca taatttgttt tgtttacaga 900 gtttgaggtt tactgtatca ttattattca caccttacag atgttgatct accttaagtt 960 tgctttgatt gttcagcatt tacactttac cattgcacac actttgtaac attttattta 1020 tcacagttta agagtctctt taacacttat gtttatttta ttataaagag atcagatatt 1080 ttgtttaaat atttttgttt ttgactatta aaactatttg ccttagtaat gttggtaaat 1140 taaaataaag tggttaaaat aaataaaaaw cctaatattc ctaaatgtat tgttaaaaaa 1200 tattcaataa ttatcgatat cgaatgatat gaaacatgat atcgtgataa ttttttttta 1260 tttttttgca atatcgccca gcccta 1286 // ID Gypsy-168-I_DR repbase; DNA; ZEB; 4291 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-168-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4291 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 696-696 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 17..940 FT /product="Gypsy-168-I_DR_1p" FT /translation="MDPAALQVLSNELTTQAQVLATHQQQLAHLTQLTEEL FT VSTMRNLKVATPAQPTANYSTHQPPVTFTPHHTSPRLAFPDKFYGNPSKCK FT GFLLQCKLFVAQQPHMFTDENSKIAFVCSLLSGKALDWATAVWPDNTLTFP FT TFKEFLQRFCVVFDHPEGGRNAGEELLSIQQNYQPAAEFALHFRTLAAQTG FT WADDPLKTLYRKALNPELQTELACRDDGKSLDQLIELSIRLDNLLRTRKHS FT PVALSCPSQESTPAPSSSGEPMQLGRAHLSPEERERRIKNNLCLYCGLGGH FT TKLKCPNKPPSKLNR*" FT CDS join(976..3375,3299..4189) FT /product="Gypsy-168-I_DR_2p" FT /translation="MVVPIVVCCRNTNINTVALVDSGAAGNFMDEAFARNN FT NIPLTSCDFPLAVTAVDGRPLGDGKITYRTVPLSLQTGSLHKEELDLLITK FT SPQHPIILGLPWLQAHNPFISWKTGEITKWSDSCIKNCLCSIVPVQLNNMV FT TSPDTSEISMIPDEYKDLAEAFSKKKATTLPPHREYDCAIDLLPGTTPPRG FT RIFPLSQTETETMNAYIKEELEKGFIRPSTSPASAGFFFVEKKDGTLRPCI FT DYRGLNEITVKYRYPLPLVPAALEQLRSAKYFTKLDLRSAYNLIRIRRGDE FT WKTGFSTTTGHYEYRVMPFGLANSPSVFQAFINEVFRDMLNQWVIVYIDDI FT LVYSEMLEDHHNHVRAVLQRLIKNQLYAKLSKCEFHQTCTSFLGYIISADG FT VAMDESKVNAVLNWPRPKTVKEMQRFLGFANFYRRFIRNFSSVAAPLTSLV FT KSSNPKLKWNPDAERAFNLLKTRFSSAPILCHPDPKLPFTLEIDASNTGIG FT AVLSQKQATSNKLHPCAFFSRKLNSAERNYDVGNRELLAMKSAMEEWRHWL FT EGALHPFSVITDHKNLEYIHSAKRLNPRQARWALFFTRFHFTVTYIPGPKN FT IKADALSRLSEEDPKIEDDQPIIPDSMILAPISWDIEAEIAQSTAEHPPPP FT TCPADKVFVPPPLRXKLIEEVHTNPSHPGIAGTLDLILNRYWWPDVRKDVF FT VKNCAPCNMNKHSRHSPAGLLQPLEVPRRPWSHIAIDFITDLPLSMGHTTI FT LTVIDRFSKACRLIPIPKLPTALETAELLCTHFMQPSFXILWXTRRYCVKX FT LSSYALTLCNQVFRFYGLPEDIVSDRGPQFTSRVWSAFFKNLNINISLTSG FT YHPQSNGQVERLNQEIGRFLRTYCHQHQANWSRFLMWAEYAQNSLRKAATG FT LTPFQVILGFQPSLFPWSGEPSELPALDHWFQQCETTWNAAHTHLSHAVRR FT FKEQADRHRRPSPTYTPGQWVWLSTKDLRLKLPCKKLSPRYVGPFTIDRQI FT TPVSYRLNLPNHYRISPTFHVSLLKPAAGPAEVDREVAAGDQGPPPIIIDG FT EEAYQIREILRSRRRGGHLQYLIDWEGYGPEEQSWINRK*" XX SQ Sequence 4291 BP; 1127 A; 1224 C; 884 G; 1045 T; 11 other; gaagacttcg ccagacatgg atcccgcagc cctacaggtc ctttccaacg aactcacaac 60 ccaagctcaa gttctagcga cccaccagca acagcttgct catctcacgc agctaacgga 120 ggaacttgtg agtaccatgc gaaatttgaa agttgcgacc ccagcgcaac ccaccgccaa 180 ttactcaact catcaacctc cggtaacctt tactcctcat cataccagtc cccgcctggc 240 attccctgac aaattttatg gtaacccttc caaatgtaaa ggttttttgc ttcagtgcaa 300 gctgtttgtc gcacaacaac cccatatgtt tactgatgag aacagcaaga tcgcctttgt 360 ttgctcgttg ctctctggaa aagccctgga ttgggctact gcagtctggc cggataacac 420 cctcactttt ccaacattta aagagtttct ccagcgattt tgtgtggtct tcgatcatcc 480 cgagggaggt cgaaacgcag gcgaagaatt attatctatc cagcaaaatt atcaaccagc 540 ggcggagttc gctctccatt ttcgcacttt ggctgcacaa actggctggg ctgacgatcc 600 tttaaagact ctctaccgca aagcattgaa ccctgaatta cagactgaat tagcgtgcag 660 agatgatggc aaatcactcg accagctcat tgagctttca atcaggctgg ataatctgct 720 tcgcacccga aaacactcac ctgtcgctct ctcctgtccc agccaggaat ccactccagc 780 tccctccagt tcaggtgaac ccatgcaact gggtcgagcc catttgtccc cggaagagcg 840 tgagagacgc attaagaata acctctgcct gtattgcggt cttggtggtc atacgaagct 900 caaatgtccc aataagcctc cttccaaact gaatcggtga gtgcaacctc aattcttgcc 960 attaataaaa gcgctatggt tgtccccatt gtggtgtgtt gcaggaacac caatattaac 1020 actgtcgcct tagttgactc aggagccgct ggcaacttca tggatgaggc ttttgctcgc 1080 aataacaaca tcccactaac ttcttgtgat tttcccctag cagtcacggc agtagacgga 1140 cgccccctag gggatggaaa gatcacttac cgcactgttc ctctatcatt acaaactggc 1200 tctcttcata aggaagagtt ggatctgcta atcaccaaat ccccgcaaca tcccattatt 1260 cttggtttac cgtggttaca ggcacataac cccttcattt cctggaaaac cggtgagatc 1320 acaaaatgga gtgattcatg tattaaaaat tgtctatgtt ctatcgtccc tgttcagctc 1380 aataatatgg ttaccagtcc tgatacctct gaaatttcca tgattcccga tgagtacaag 1440 gatttagccg aagctttcag caagaagaaa gccaccacac tcccacctca ccgtgagtac 1500 gattgtgcaa tagatttatt acctggcaca acgccccctc gtgggagaat ctttcctttg 1560 tcccaaacag aaacsgaaac catgaatgct tacattaagg aggagctgga gaaagggttt 1620 attcgcccgt ccacatcacc cgcatccgct ggcttcttct ttgttgaaaa gaaagatggt 1680 actctccgcc catgcattga ctacagaggt ttgaatgaaa taacagtgaa gtacagatat 1740 cctttgcctc tagtccctgc tgccctggaa caattacgct cagccaagta ctttaccaag 1800 cttgatcttc gcagtgctta caacctcata cgcataaggc ggggtgacga gtggaaaacc 1860 gggttctcaa cgaccaccgg gcactatgaa taccgggtga tgcccttcgg cctggcaaac 1920 agtccttctg tgttccaagc attcattaac gaggtatttc gggacatgtt gaatcaatgg 1980 gtcattgtgt atattgatga cattcttgtc tactcagaga tgcttgaaga ccaccataac 2040 catgtacgag ctgtcctcca gaggctaatc aagaaccagc tgtatgccaa gttatccaaa 2100 tgcgagtttc accagacgtg cacttccttc ctggggtaca taattagcgc agatggagtc 2160 gctatggatg aaagcaaggt aaatgctgtg ttgaattggc caagaccgaa aactgtgaaa 2220 gagatgcaac gcttcctggg atttgcgaat ttctatagga ggttcattcg caacttcagt 2280 tcagtagctg caccactcac ctccctggtt aaatcaagca atcctaaact caagtggaat 2340 ccagatgccg aaagagcatt caacctcctg aaaacccgat tctcctcagc acccatcctc 2400 tgtcaccctg atcctaagct ccccttcacc cttgaaattg acgcctccaa tacaggcatt 2460 ggcgctgtat tatctcagaa acaagccaca tccaacaagc tccacccttg tgccttcttt 2520 tcccgaaagt taaactcagc cgaaagaaac tatgacgtag gcaatcgaga acttttggct 2580 atgaaatcag cgatggagga atggaggcac tggctagaag gagctttaca cccattctct 2640 gtcataacag accataaaaa cctcgaatac atccattcag caaagagact aaatccccga 2700 caggctagat gggctctatt cttcactcgt ttccatttca ctgtgacata cattcccgga 2760 cccaaaaaca tcaaagccga tgcactttcc cgtctgtctg aagaggatcc caaaattgaa 2820 gatgaccaac ccattatacc cgactccatg attctggcac ccatctcttg ggatatagag 2880 gccgaaatcg ctcaatccac agctgaacat cctcctcctc ccacctgtcc agctgataaa 2940 gtcttcgtcc caccaccact caggkgcaaa ytaattgaag aggttcatac taaccccagt 3000 catccaggca tcgcaggcac ccttgatctc attctcaacc gctactggtg gcccgacgtg 3060 agaaaggacg tctttgtgaa gaattgtgcc ccctgtaaca tgaacaagca ctcccgtcac 3120 agtccagccg gcctcctaca gcctcttgag gttccccgtc gcccttggtc acacatcgcc 3180 atygacttta tcaccgatct tcccctttct atgggtcaca ctaccatact caccgtcatc 3240 gaccgcttct ccaaggcttg caggctgatc cccataccaa agcttcccac agcattagaa 3300 acygctgagc tcctatgcac tcactttatg caaccaagtt ttycgattct atggrctacc 3360 agaagatatt gtgtctgaca gaggtcccca atttacytca agagtatggt ctgcattctt 3420 taaaaacctc aacatcaaya tcagcctaac ttccggttat catccycagt ccaatggyca 3480 ggtagagcgg ctgaatcagg aaattggaag attcctaaga acttactgtc atcagcacca 3540 agcaaattgg agtcgttttc tgatgtgggc cgaatacgcc caaaactcct tgagaaaagc 3600 agccactggc ctaacgccgt ttcaagtcat cctagggttt caaccctccc tgttcccatg 3660 gtctggcgaa ccatccgaac tcccagccct cgatcactgg ttccaacagt gtgagacaac 3720 ctggaacgca gcccatacac atctctccca tgcagtccga agatttaagg aacaggctga 3780 tcgtcatcgc cgtcccagtc ccacttacac tcctggacaa tgggtttggc tatccaccaa 3840 agaccttcgc ctcaaactcc catgcaaaaa gctcagtccc aggtacgttg gtcctttcac 3900 gattgatcgc caaattacac ctgtatccta cagacttaac ctgcctaatc actatcgcat 3960 ctctcctacg ttccatgtat cactgctcaa gccagctgct ggtccagccg aggtggatag 4020 ggaggtggca gccggtgatc agggtccccc accaatcatc atcgatggag aggaggcgta 4080 ccagatccgt gagatcctga ggtccagacg ccggggggga catctacaat acctgatcga 4140 ctgggagggg tacggtccgg aggaacaatc atggatcaac cgcaagtgag tttcatcgtc 4200 tacatccgga aatgccggcc cctcgtcccc gtggtagacc ccggcgtcgt ggcgcccctc 4260 acttcaggag ccgttcgcag gaggggggct c 4291 // ID hAT-N25_DR repbase; DNA; ZEB; 829 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N25_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-829 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1295-1295 (2008). XX DR [1] (Consensus) XX CC Putative hAT transposon, TSD is 8 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 829 BP; 271 A; 142 C; 139 G; 277 T; 0 other; tagggctggg cgattaatcg aaaagataat cgaaatcgac attcagaacc tataatcgat 60 caaatttttc caggtcaatt ttttcaatta ctttccctac cgcgtgtgga gtcacgtgac 120 cccgctctgt taaggctgat ttatacttct gcgtcgaacg cacgggtgtg gtccggcgca 180 gccttcgcgc ggtcgcatac ccgcgcacct ctcaaaaaaa tgtaactaca cgtcgcacgt 240 ttgcactaca ttgacgatga ttggaagctg cgccagagat gccttgagac ggcatatttt 300 ccattacaat aaaattatta tttacattaa aatatttgtt tacagaagat tcaagaaatg 360 cactgtttaa aatattattt acaggatata caagagttac atatttattt tatttagaag 420 agatactgct tattttctac ttttaatatg aaaaacatta aaaataatta attttgtttc 480 caaaagtgca agttatttat tttcactttt tttataagaa aaaagtgact gttggtttta 540 ggcaatggcg tgtgttttaa tttcagttgt tcaacgttga tgttcaataa ataatcatag 600 atagtagata gtgtgtgttt ccttcaatta ttttaaaatc aagtaatgca cccttcattc 660 aaaaaatctc tcacttgtaa tatgtgagca tatttactgt acaaaacctg tcagtgaact 720 atgagggcaa aaaaaaataa taaataaaat aatcgttcat taatcgtaat cgagttaaaa 780 tgttcaatta atcgagattt tgattttagg ccaaatcgcc cagacccta 829 // ID Gypsy-32-LTR_DR repbase; DNA; ZEB; 1160 BP. XX AC . XX DT 01-JUN-2007 (Rel. 12.06, Created) DT 05-JUN-2007 (Rel. 12.06, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-32_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy-32-I_DR; KW Gypsy-32_DR; endogenous retrovirus; Gypsy-32-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1160 RA Dib M.R. and Naveira H.F.; RT "Gypsy-32_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(6), 180-180 (2007). XX DR [1] (Consensus) XX CC Gypsy-32-LTR_DR is a long terminal repeat of the Gypsy-32_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-32-I_DR. XX SQ Sequence 1160 BP; 260 A; 252 C; 254 G; 394 T; 0 other; tgtgaagtag ccgggcttct tccctttatc tctggcgacc gcactggttc acacacaccc 60 acagacgctc acagacacgc acatacatac gcacatacat atgtctactc acctctcctt 120 ttcctctcag tatgctgcta gtgacaaagg gccctgtcat ctcgcacgca tccagtcatc 180 cattccatat ctccattttc ccgcaggtta acattaagca tacatttaca cacacctttg 240 tgttcgtgtc tttgtgtttt tgtttcgtca tttgtccggg aaagttggtc ttcagtgttg 300 tgccggccaa accgagcgag cgaatggcgc cggcttatga cgtcaccggc cgcgccatgg 360 cagcaccaat cacagaagcg ccaaccgtac caattgtttg tgtttgtagg gtacagccag 420 cgttgtagag gagcgattta tatgtcttta aacctctgat tgttggttgt tttatgtatg 480 tatgtatgtc agatattatt agtagctgtg tgctttttat atttagagat gccacattta 540 aatgtttaat ggtgtgatgt agaagtttac ctgaatttac ctgtttatgg tatgctccaa 600 aaagggggcg gagtctagcc tttaaaagga gacggcagac tcagaaacgg ggtcagttta 660 cctggctagc gagcgtgttg tggcagaacc gtaagcaatt ccctttatca atctattgtc 720 tgtgtgacat agatttcttt cgtatatata ttgtagatat attgttttgt gttgtatata 780 tcgtattttg tttttttgtt ttattgagtt gttactttgt tttttttttg tacatacctt 840 ttgggaaata ggtcactgta catttatctg cactggactg ttttgcgtta tccttgtaaa 900 taaaccatct tttttacact tctgggataa ggccatcatt tttgagaact ctttattttt 960 attattattt tattttttgc tggtgttggg catcccacac ctggtagtga acccattccg 1020 ggtccattac ataattactc acggcggcac gggaaccatt ctgacacgca ccactctgta 1080 gtgagcacta tcaccgctcg aagctcacag ggtgccacag caggtggcag tagtgagttt 1140 gtgggtcgcg ggccgtcgca 1160 // ID BEL-38-I_DR repbase; DNA; ZEB; 6551 BP. XX AC . XX DT 09-APR-2009 (Rel. 14.04, Created) DT 09-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-38-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6551 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(4), 862-862 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 348..6392 FT /product="BEL-38-I_DR_1p" FT /translation="MEMAEGSESETAAFEHKRAPKPTEKALEEKLHRIIGL FT RRAKLSALTSNIKTLERLMNDDENLQNVHDLMQLDFAQLLAEFNDLNVQVQ FT DLLSEDERIADQQNWFQPKMDSVNIFRNKTENWIAAVAEKNEKNDNVEMED FT DIMPHDSASQVSLNKNELKQRCDSVTSGSVCSSVSSAYAQEESKRAALLAR FT AASLKKKQQLVIEEMQIKAKMEELEIETALAESDAKLKVLKEYEHSEDGKS FT SVRSKKRKGIHHKQLELKTEQKESSPSHSTVAKDKAKSVSSPKIHPATELN FT IQVPGHSDYPQADNSIIQVMQKQNDLTELLVRQQRQIQLPSKDIQVFRGDP FT LTYRSFIRSFEHTIENKTDNEKDKLYYLEQYTAGEPQEIVRSCEYMPLSRG FT FKEAKRLLERHYGDELIIARAFIDKVQKWPQIKSDDGKALSAYALFLIGCR FT NTLEDGESMEEMDSPTNMRDVISKLPYKMKERWRSEAYDIKERQGVRARFT FT HLVDFIDRQAKVVTDPFFGDILDTRNAVTAKVNQKEKFPATRGIKKSSFAT FT NICMEERRPQDTSSKLASPAKMVSAFEKPCMFCSKSHTLESCTEIKEQPHS FT IRVDFLKSKGLCFGCLTRGHLSKMCRKRIECKECSQRHPDILHMKEDRKVT FT KQVETPDKEISCAQISLSHQSCNYTESGGDNCVLSIVPVKVKSNKSDRYVE FT TYAFLDAGSTATFCTEELRNKLKLKGKPTQILLSTMCQDKPGEQKLVSSFI FT LTDLEVCGLDDTKYIELPKVFTHSNIPVQNENIPKQEDIQKWPYLSEVNIP FT CIDANVGLLIGANNPKAMEPWHIINSQQDGPYAVKTALGWVVNGTIKKTNN FT AERSKLPHHTVNRLSVVEIEKLLIQQYNSDFPERLYEEKEEKSLEDKRFMQ FT LMQEKTKFENGHYCVKLPLRDETVDMPNNRCVAEIRAANLKRKLQKNSELL FT DDYTTFMKSIIEKGYAVRVPTEHLNRNDNKVWYIPHHGVYHPKKNKIRVVF FT DCTASFQGMSLNSQLLQGPNLTNTLIGVLTRFREEPIAVMADVESMFYQVK FT VPEEDTDLLRFLWWPDGNLNAPIEEYRMTVHLFGATSSPSCASYALRKTAE FT DRKHVASQKAVDTVLNNFYVDDCLKSVSNEQEAINLVKEIQDLCLEGGFRL FT TKWVSNNRKVLLSIPEDQRASGVKDLDLDQDSLPIERALGMQWCTDDDTFT FT YHIKVQEKPSSRRGILSVVNSIYDPLGFLAPLILPVKLLLRDLCKQGYGWD FT DEIDDKRADQWVKWLEDLSQISNVKFKRCLKPEKFGITAEAQLHHFSDASE FT SAYGTATYLVLTNEQNQKYCSLLMGKSRVSPLKQITIPRLELTAATIAVKM FT DKILRQELQMPLQPSIFWTDSTTVLSYIENESARFKTFVANRISLIRDATT FT PLQWKFVRSAQNPADQATRGIKAKDFLQAETWTKGPNFLLKSEEEWPQRPD FT QINQYQQKDPEIKIETKVNILNVNENDILCRLTDYYSSWFRLKRAVAWMLR FT LKETLLQLCRARKQFQASIETSEKDPEKQASLLQTEMQKFKSKIKKRSLSL FT EDLNQAEILLIQHSQKQQFLEEIEAIKQNVTIKRRSQLYRLDPVLQDGILR FT VGGRLNKAAMPNESKHPAILSKYSRISTLILSDIHQRYGHCGRNYTLSILR FT QKFWIPQANSAIRKLIHKCSVCRRLNGRVGEQKMASLPEDRLLPDKPPFTN FT VGVDYFGPFEVKRGRSTVKRYGVLFTCLTIRAVHIEVADSLDTDSCINALR FT RFISRRGQVAVMRSDNGTNFVGAERELREALKNLNHITIEQAMLQKNIKWI FT FNSPAASHQGGVWERQIRTVRRILNALLKEQSISDDSLHTIMCEVESIINN FT RPITTTSEDPNDLEPLTPNHLLLLKTQPNMPPGVFKKEDQYMKRRWRQVQY FT LADLFWSRWTHEYLPILQERSKWTKLKRNFEPGDVVLIVDSSAPRNSWIMG FT KVVQTLPDSSGAVRRVKLQTKTSILERPVNKLCLIQEAI*" XX SQ Sequence 6551 BP; 2276 A; 1219 C; 1446 G; 1610 T; 0 other; atttagtcaa aaaacttgct ctatggatat ggaaatgaag gagcgcacgt ggcacgttat 60 gacggacgga gattcatctc taaactatcg tctttgagtg atctgaaatg actcgcgatc 120 gtggattata aatagtggat gatatttctc cggtggattt aaagctgttt tacaggaagg 180 aacgtgactg acacgtatga aacgttgcat aacagtaagg taacggactc tgattcggct 240 atatgatgac tcaatgcata tttgaagaag ctgaaatgca tacggttgat gcagaaaagt 300 attgcgtgta cttttaatcg tctgcaaaca gtttgtgcat tgattgaatg gagatggctg 360 aaggaagcga aagtgaaact gcagctttcg aacacaagag ggcgcctaaa cctactgaaa 420 aggctttaga ggagaaattg cacagaatta ttggattgag aagagctaaa ttatctgctc 480 ttaccagtaa tataaaaacc ctagaaaggt tgatgaatga tgatgaaaat ctgcagaatg 540 tgcacgattt aatgcaactt gactttgcac aattgttagc agagtttaat gatttaaatg 600 ttcaagttca agatcttttg tctgaagatg aaaggattgc agaccaacaa aactggtttc 660 agcctaaaat ggactctgtg aatattttta ggaataaaac agaaaactgg atagctgctg 720 tagctgagaa aaatgagaaa aatgacaatg tggaaatgga ggatgatatt atgcctcatg 780 acagtgcctc acaggtatct ttgaacaaaa atgagcttaa gcagcggtgt gattcagtta 840 caagtggatc tgtttgctcc tctgtatcat ctgcatatgc acaagaagaa tcaaaacgtg 900 cagctttgct agcaagggca gcatcattaa agaaaaaaca acaacttgta attgaagaga 960 tgcagataaa ggcaaaaatg gaggaacttg aaatagaaac agcgcttgca gaaagtgatg 1020 ccaaattaaa ggttctgaaa gaatatgagc attcagagga tggaaaaagt agtgtgcggt 1080 caaagaaaag aaaaggaatc caccataaac agcttgaact gaaaacagag cagaaagaaa 1140 gctcaccttc tcattctact gtggcgaagg ataaagcaaa gtctgtctca agtccaaaga 1200 tacatccagc aacagaactg aacatacaag tacctggaca cagtgactat ccacaagcag 1260 acaacagcat tattcaagtt atgcagaaac aaaatgactt gactgagttg ttggtgagac 1320 aacaaagaca aattcagctt cctagtaaag acatacaagt gtttagaggg gatccattaa 1380 cctatagatc ctttatcagg tcttttgagc ataccattga aaataaaaca gacaatgaaa 1440 aagacaagtt gtattatctt gaacaataca ctgcaggaga gcctcaagag attgtcagaa 1500 gctgtgaata tatgccgtta agcagaggtt ttaaagaagc aaagcgactg ctcgaaaggc 1560 attatggaga cgagcttatt atagcaagag cttttattga taaagttcaa aagtggcctc 1620 aaatcaagtc tgatgatggc aaagccttga gtgcttatgc tttattccta attggatgtc 1680 gcaatacatt ggaagatggg gagtccatgg aggagatgga cagtcccacc aatatgagag 1740 atgtaatttc aaaattacca tacaaaatga aagaaaggtg gcgatcagaa gcatatgaca 1800 tcaaggaaag acaaggagta agagctagat ttactcattt ggtggatttt attgatcgtc 1860 aagcaaaagt ggtcacagat cctttctttg gagacatact ggacactcgc aatgctgtaa 1920 ctgcaaaggt gaatcagaag gagaaatttc ctgctactag gggaataaaa aagagcagct 1980 ttgcaaccaa catatgcatg gaggaaagac ggcctcaaga tacatcaagt aaacttgcca 2040 gtccagctaa gatggtaagc gcttttgaaa agccatgtat gttttgctca aaaagtcaca 2100 ctcttgagtc atgcactgaa atcaaagagc agccacatag tatacgtgtg gactttttga 2160 agtcaaaagg cttatgtttt ggatgcttaa ctcgaggtca tttgagtaag atgtgcagaa 2220 agagaattga atgtaaagaa tgctctcaaa ggcatccaga cattttacat atgaaggaag 2280 atcgcaaagt gacaaagcag gttgagacac cagacaaaga gatctcatgt gcccagatct 2340 ccctaagtca ccaatcatgc aactacacag agtctggggg agataactgt gtgctgtcaa 2400 tagtgccagt taaagtcaag tcaaataaaa gtgacagata tgtggaaaca tatgctttcc 2460 tcgatgctgg aagtacagca acattttgca cagaggagct gcgaaataaa ctgaaactga 2520 aaggaaagcc aactcagatt ctattgagca ctatgtgtca agacaagcca ggtgaacaga 2580 agttagtgag cagttttatc ctcaccgacc tggaagtatg tgggctagac gacaccaagt 2640 atattgaact acctaaagtc tttacacaca gcaatattcc tgttcaaaat gagaacattc 2700 ctaaacagga agacattcaa aaatggcctt acttaagtga agtgaacata ccctgtattg 2760 atgcaaatgt ggggcttctg attggtgcaa acaatccaaa ggcaatggaa ccatggcaca 2820 ttataaacag ccaacaagat ggaccatacg ctgttaaaac tgcactaggt tgggtagtaa 2880 atggcaccat taagaaaaca aacaatgcag aaaggtccaa actgccacat catacagtta 2940 atcgtctctc agtagtggaa atcgaaaaat tgttgattca gcaatataac tcagactttc 3000 cagaacgcct gtatgaggag aaagaggaaa agtcactgga ggacaaaaga ttcatgcagt 3060 taatgcagga aaagactaaa tttgagaatg gacattactg tgtgaagcta cccttgagag 3120 atgaaacagt tgacatgcca aacaatcgat gtgttgctga aatacgtgca gctaacctaa 3180 aaagaaagct tcaaaagaac tcagaattgc ttgatgacta cactacgttc atgaagagca 3240 taatagaaaa aggatatgct gtcagagttc ccactgaaca tcttaatcgc aacgacaaca 3300 aggtgtggta cattccacac catggggtgt atcacccgaa gaaaaacaaa attcgtgtgg 3360 tttttgattg tactgcttcc tttcaaggta tgtctctgaa cagccaattg cttcaaggtc 3420 caaacctcac aaacacactt attggtgtac tcaccagatt tagagaagaa ccgatcgctg 3480 tgatggcaga tgtggagtca atgttctatc aggtgaaagt tcctgaggaa gacacagatc 3540 tacttcgctt tctttggtgg ccagatggca atttgaatgc acctatagag gaatacagaa 3600 tgacggtgca cttatttggt gccacctcat ctcctagctg tgcctcctat gcactgagaa 3660 agacagcaga ggacagaaaa catgtagcat cgcaaaaggc tgtggacaca gttctgaaca 3720 atttctacgt ggatgactgt ttaaaatcag tgtccaatga gcaagaagca atcaaccttg 3780 tcaaggaaat tcaagatttg tgtctggaag gaggttttcg attgacgaag tgggtgagca 3840 ataataggaa agtattgttg tctatcccag aagatcaaag ggccagtgga gtaaaagacc 3900 ttgacttgga tcaggactct ttgcctatag agagagcact gggtatgcag tggtgcacag 3960 atgatgacac cttcacatac catatcaaag tccaagagaa gccgtcgagt agaagaggta 4020 tcctctcagt ggttaattca atctatgatc ctcttggctt tctggctcca ctcatactac 4080 cagttaagtt acttttaaga gatctgtgca aacaaggata tggctgggat gatgagattg 4140 acgacaagcg tgctgatcag tgggttaaat ggctagaaga tttaagtcag atctcaaatg 4200 tcaagttcaa aaggtgtcta aagccagaga agtttggcat cacagcagaa gctcaactgc 4260 atcatttctc tgatgccagt gagagtgctt atggcacagc aacttatctg gttcttacaa 4320 atgaacaaaa tcagaaatat tgctcattgt tgatggggaa gtcaagagta agcccactta 4380 aacaaatcac aattcctaga ctggaactga ccgccgcaac catagcagtc aaaatggaca 4440 agatactaag acaagaacta caaatgccac tgcagccatc tattttttgg acagacagca 4500 caacggtgct gagttacatt gaaaatgaga gcgcccgctt caaaaccttt gtagccaaca 4560 ggatttcact aatacgggat gccaccactc cgttacagtg gaagtttgtc aggtcagctc 4620 agaatccagc agatcaagct actagaggta ttaaggcaaa ggactttcta caagcagaaa 4680 catggacaaa gggcccaaac tttttgttga aatcagaaga agaatggcca caaagaccag 4740 atcagattaa tcaatatcaa caaaaagacc ctgaaattaa aattgaaacc aaagtcaaca 4800 ttcttaatgt caatgaaaac gacatcctgt gcaggttaac tgactactac tcgagctggt 4860 ttcgtctgaa aagggcagtg gcatggatgc taagactaaa agagacactc ctacagttat 4920 gtagggcaag gaaacagttt caggcgtcta ttgaaacatc tgagaaagac ccagaaaaac 4980 aagcatccct tctacaaact gaaatgcaaa aattcaaatc aaaaataaag aaaaggtcac 5040 tgagtctgga agatttaaac caagcagaaa ttcttcttat tcagcacagt cagaaacaac 5100 aattcctgga agaaattgag gctatcaaac aaaatgtgac tattaagaga agaagtcagc 5160 tgtacaggct tgaccctgtg ctccaggatg ggatcctaag agtgggtggt agacttaaca 5220 aagcagccat gccaaatgaa tccaagcatc ctgcaattct gtcaaaatac tccagaattt 5280 caacactcat tttaagtgac attcatcaaa gatacggaca ttgtggtcga aactacacat 5340 tgtccattct cagacaaaaa ttctggattc cccaagctaa ttcagccatt cgaaaattaa 5400 tacataaatg ctcagtttgt cgccggctta atggaagagt tggagaacag aaaatggcaa 5460 gtttgccaga ggaccgtctg ctacctgata aaccgccctt tacaaacgtg ggtgttgatt 5520 acttcggacc ctttgaagtg aaacggggcc ggagtactgt aaaaaggtac ggagtgttgt 5580 tcacttgctt aacgattaga gcagttcaca ttgaggtggc tgatagtttg gacactgact 5640 cttgcattaa cgctttaagg cgttttataa gtagaagagg acaagtcgct gtcatgcgct 5700 cagataacgg cacaaatttt gttggtgcag aaagagagtt gcgagaagca ctgaaaaatc 5760 tgaaccacat cacaatagaa caggccatgt tgcagaaaaa cattaaatgg atcttcaaca 5820 gcccagctgc ctcccatcag ggaggagttt gggagcgtca gatccgtaca gttagaagga 5880 tacttaatgc tctcctaaag gaacaatcta tcagcgatga cagtcttcat accatcatgt 5940 gtgaggtgga aagcattata aacaacaggc caataactac cacatccgag gatccaaatg 6000 accttgagcc tctgacacct aaccatttac tactgttaaa aactcaaccc aacatgccac 6060 caggcgtctt caaaaaagaa gaccagtata tgaaaagacg ttggagacaa gtccaatatc 6120 ttgctgattt gttttggagc agatggactc atgagtatct gcccattctt caagaacgca 6180 gtaaatggac aaaacttaaa aggaactttg agccaggcga tgtagtgcta attgtggaca 6240 gttctgctcc ccgcaattca tggataatgg gaaaagtagt tcaaaccttg cctgactcca 6300 gtggagctgt acgccgtgtg aagctacaaa ccaaaacaag cattctagag agacctgtaa 6360 acaaattgtg cttaattcaa gaagcaattt aagaagaata ggtgaaacaa agacaatgtg 6420 aaaggactac atttacagaa tgactgaatt tctatgcttc ttattgatgt ttaaatttga 6480 atggctctta gaagttaaat atgtatatga taattgtttg gtctcctgtc catacaatta 6540 ggggccggat a 6551 // ID hAT-N36B_DR repbase; DNA; ZEB; 1383 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N36_DR; hAT-N36B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1383 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1305-1305 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1383 BP; 442 A; 248 C; 262 G; 430 T; 1 other; taagggtgga gccgaacccg aatacggtat tcggaaaggc acgaatagcg tgtttttacg 60 aatacttgat tcgaacaaat acttgaaaaa ttatttgtat tcgggagcaa gaaaaacact 120 atatcaaaaa gcagcgtttc ctcatgagac cacagtgcat gcccgcgtga gtgagtgagt 180 gagtgagaga gagagagaga gagagagaga gagagagaga gagagagaga cacacgctca 240 gtgcaaaacg cgccaaagcc cgaaactgaa agcgagacgt gacttttaag ggactgtttc 300 atatggattt attaatcatt cttactgttc agtgatcgca aactgccgta gtttattaaa 360 gacgcaaacc tctcactgca cgtcagctgc gcgccttcag cagacctcct cattcctgca 420 gcacgagagc tttatgattg tttatgcgcg ccaaaagtgg cggatctgtc cggtaaaata 480 tctgactgcg tgtcaccgca tccctaagga ctgtttggcg aaatatttga ctgcatgtca 540 ctgcataaca aacgactgaa acgatataac tagagaaatc tccactgtgc cactgtgtga 600 gagcgtatgg caagccgttt tagcatttaa accttgttca gactatccat aaatatattt 660 agagatatct ctaattatat tttgacttgt cataattata attcgactag tcagattgga 720 aatatctgca aatatattat gactgactag tcatattcct ccattgactt ccattgaaaa 780 atatttgcag atatctctaa ataaacaaac aaacaaactt tgtgtctttt tttgtggctg 840 gcagatgata atagcagggc tgtatctttt agtattatat agaatacttt tgttctgcca 900 gatctcccag gcagggtttt atttagattt atttagttaa twttagtttt tggaattctg 960 tccattggaa aaaagttctt tcagaagaag aacagttttt tgaagtatta tttgttttta 1020 ttctgtctat tcagtgtcct tcaacaagaa cggtggtggt ggggttcata atattcacaa 1080 tggtatttta ttagttgttg gtttaaccat ggatgagata atggcttcta tgttattttc 1140 ttttcaatga catttcttat cacatgttca aaaacaacag ccaatattgc atatctataa 1200 tttatataat gggtataatt aaacaatact ttcactataa cgtaaattag aagtgtaata 1260 gaaaaaatat atatttttgc gccattttga attttactcg aatacaaata caaatacaaa 1320 tacttttccc ccctcaacaa atacaaatac aaatacaaat accggctgct ccgcacatcc 1380 cta 1383 // ID Gypsy128-I_DR repbase; DNA; ZEB; 4038 BP. XX AC chr23; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy128-I_DR; KW Gypsy128-LTR_DR; Gypsy128_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4038 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1288-1288 (2008). XX DR Genome; chr23; Positions 24367049 24363012. XX CC Positions [2707-3081] - Integrase core CC 'GTTG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 568..3828 FT /product="Gypsy128-I_DR_1p" FT /translation="MASKIETFVSAPSEDLLNDLTKDQLIELADHYKINLS FT SQDKRVKDNVKLLIKTELMDRGILAFELSESASDLIETTTMSPLTFEQQKQ FT LLLIQKEMKEKISAMQNRVEMSKLQFQQQQLDLERYRLDLIRDGKLLPTTS FT GERSSSSNSDIVANLRLIPKFDEKDVERFFLLFERVADARNWPDEDRTLML FT QSVFIGKAQEAYSSLGVEDAKNYSKVKNAVLRAYELVPEAYRQRFRSWKKT FT DKQTHVEFLHDISVFFSRWCSASDVKTLDDLRELMLLEQFNNSVTERTATY FT ISERKPNTASEAAVMADEFFLIHKTSFDNKKTEMSFNKEMWHYSDKCSKFR FT RTSPSRVRVDERERANMCNYCHLLGHWKNECPSLKSKEDAKIKSISSLVKP FT VALAETVLKVNNIHKSSSPRVSSFSPFVTDGSVRLVDSEQVVPVKILRDTG FT SAESFVLQSVLPFSNDSYTGNNVLIRGIGLNVISVPIHKLVLNSDLIQGEV FT KVAVRSCLPVEGVQVILGNDLAGDRVWRNVSPNLVVTSSELTSESGIEDLS FT CSSNVFPSCVTTRSMSKIQGGDRSEEKRVSTVLEIPSILTVSREDLIREQK FT ADSTLIELFDRVVPNDTIVDLSSGYYLDGGVLCRKWVPHEECVIVEPMFQV FT VVPQSLRQLVLQTTHDTSGHLGVKKTYRLLLKQFFWPKIKRDISKYMSCQT FT CQLTGKPNQSAKPAPLYPIPSISQPFEHLLIGCVGQLPRLKTVDEYLPAVM FT CQVTRYPAGYPLHSTTADVKALTWFISVFGIPGLIESDQGSNFTSNLFGEV FT LKSLHVQHNLSSAFYPQSQGALSRFLQALKSMLKSYCTQMDWKQTDPPKNV FT VSYASDFHLRMSEACQLVKGFWREAWDLMWRLLVNVLKCLLQSGDRVLALV FT AVVGSPFQAEFLDSSNAEQLPNSVLLDRDAGDLVVPDCIHGTLNNYLSHLS FT HVHCTDLINLIHEYVALFSDTPFCTNLIEHNVDVRDSIPIRRLLLYCKVLS FT NRKKQFEGEVDYVLKIDFGSTLEFMWNLLRDLWWILRATILSFFHPLQNPS FT QRSILRWWFLQGYLLDICHIKGSQNIRAGA" XX SQ Sequence 4038 BP; 1087 A; 661 C; 944 G; 1346 T; 0 other; aatatggggg ctcgtccgta agtttaattt ggtgataact ttgttgaatt aatgaatttc 60 gttaatctaa tgattaattc gttaagtata taattttttt ggattaatat tcgttttgtt 120 gccaatattt gtggaggatt atattagatg gtttggttta tttttgtttg acgggagcac 180 ggccatggtt ttgggtaagt ctgacttctt tgtgtatttg tggagagatt tgttttggaa 240 tatgcttttc cctgggcagg tttagtgagg tgtcctttgg atgcctcaaa atattttttg 300 ttttagtttg ttatttttgt acggtgaacg cggctggagc agttgtctcg ggtacacatc 360 cgtgttttgc agcggtgatt tgtcaattgg tcgctgttgc aaacgctggt ttatagtgta 420 taagaacccg ccgcaagtaa ctgtatgaag actagggttg agtttagtta gctaaatttc 480 agttagaggg aactgtttat tccagtcttg tttctcctca aatattgttg ggaatatttg 540 gtagtttgcc ttacatttgt cgtttcaatg gcttcgaaaa ttgaaacatt tgttagtgct 600 ccatcagagg atttgttaaa tgatttaact aaagatcagt tgatagaact tgccgatcac 660 tacaagatta atttgtcaag tcaagataaa cgtgtcaaag ataatgttaa attattgata 720 aagactgagt taatggatcg tggaatttta gcattcgaac tgtctgaaag tgcgtctgat 780 ttgattgaga ccactacaat gtctcctcta acatttgaac aacaaaaaca actgttgtta 840 attcaaaaag aaatgaaaga aaaaatatca gcaatgcaaa atcgcgtgga gatgtctaaa 900 ctgcagtttc agcaacaaca gcttgattta gagcgatata ggttagattt gattagagat 960 ggaaaattat taccaactac aagtggggaa cgtagttcgt cgtcaaattc agatatagta 1020 gcaaacttga ggcttattcc aaaatttgat gaaaaggatg ttgaacgttt ctttttattg 1080 tttgaacgag tagcagatgc acgaaattgg ccagatgaag atcgcacgct catgctgcag 1140 tctgtgttta taggtaaagc tcaagaggcg tattcttctc tcggggtaga ggatgcgaag 1200 aattattcaa aagtcaaaaa tgcagtcttg agggcttatg aactggtccc agaggcctat 1260 aggcagagat ttaggtcctg gaagaaaact gataaacaaa cccatgttga gtttttgcat 1320 gatatttctg ttttcttcag tcgctggtgt tcagcgtctg atgtaaaaac attggatgat 1380 ttaagagaac taatgttgct tgagcagttt aataactccg tgaccgaacg tacagcgaca 1440 tatatcagcg agcgtaaacc taatacagca agtgaggcgg ccgtaatggc tgatgaattt 1500 tttctcattc ataaaacctc ttttgacaac aaaaagactg aaatgagctt taacaaagaa 1560 atgtggcatt attctgataa atgttctaaa tttaggcgta cttcccctag tcgcgttcgg 1620 gttgatgaaa gggagagggc aaatatgtgc aattactgcc atcttttggg tcactggaaa 1680 aatgagtgtc cttcacttaa atcaaaagag gatgctaaaa taaagtctat ctcttctttg 1740 gttaaacctg ttgctttggc tgagaccgtt ttaaaagtga acaatattca taaaagtagc 1800 tctccacgtg tttcatcctt ttctccattc gttacagatg ggtctgtcag gttggttgat 1860 tcggagcaag tggtcccagt taaaattctg cgggacactg gatctgcaga aagttttgtt 1920 ttgcaatctg ttttgccttt ttcaaatgac tcatacactg gaaacaatgt cttgattaga 1980 ggtattgggt tgaatgttat atctgttccc atccataaac ttgttttaaa ttctgacttg 2040 atccagggag aagtgaaggt ggctgtgcgg tcctgtttac ctgtggaagg agtgcaagta 2100 atcttgggta acgatttggc tggtgatcga gtttggcgga atgtttcacc aaatttagtg 2160 gtgacatcat cagaattgac tagtgaatca ggaattgaag atctttcctg ttcctcaaat 2220 gtgtttccat cgtgtgtgac aacacgttcc atgagtaaaa tccagggagg tgatagatct 2280 gaagagaaaa gggtgagtac agtgttggaa atcccttcta tcttaacggt ttcacgagag 2340 gatttgatta gagagcagaa agctgactct actctcattg agttgtttga tcgagtggtt 2400 cctaatgaca ccattgttga tctctcgtct ggatattacc tagatggtgg tgtattgtgt 2460 agaaagtggg tgccccatga ggagtgtgtg attgttgagc ctatgtttca agttgtagta 2520 ccccagtctc ttcgtcaact tgttctgcaa actacacatg atacatctgg acatttagga 2580 gtgaaaaaaa cgtatagatt gcttctgaaa caattttttt ggccaaagat aaagcgtgac 2640 atatctaagt acatgtcatg tcagacgtgt cagttaactg gaaagccaaa tcaatccgct 2700 aaacctgctc ctttgtaccc aattccttct attagtcaac catttgaaca tcttttgatt 2760 ggttgtgtag ggcaattgcc taggttgaag acagtggatg agtacttgcc tgcagttatg 2820 tgtcaggtaa ctaggtatcc agctggatac ccattgcatt ctaccactgc agatgtaaag 2880 gccttgactt ggttcatttc tgtgtttgga atacctggac ttattgaatc ggatcagggg 2940 agtaatttta cttccaactt gtttggtgaa gttttgaaat cgctccatgt tcaacataat 3000 ctttccagtg ctttttatcc ccagagccag ggagccctgt caagatttct tcaagcactg 3060 aagtctatgc tgaagtcata ctgcacacaa atggattgga aacaaactga tccaccaaaa 3120 aatgttgtat cgtatgcgag tgattttcat ctcagaatgt ctgaagcatg tcaactggtg 3180 aagggctttt ggagggaagc gtgggatcta atgtggcggc ttttggtcaa cgtacttaag 3240 tgtttacttc agtctggtga tcgagttctt gctctggtgg ctgtagttgg ttcacccttt 3300 caggctgagt ttttggattc atccaatgca gagcagttac caaattcagt tttgttggac 3360 agggatgctg gagatttggt ggttccagat tgtatacatg gtactttaaa caattatctt 3420 agtcacttat cacatgtaca ttgcactgac ttgattaatt tgattcatga gtatgttgct 3480 ctgttttctg atactccgtt ttgcacaaat ctaattgagc ataatgtcga tgtgagagat 3540 tcaattccaa ttcgaaggct attattgtat tgtaaagttt tatcaaatag gaagaagcaa 3600 tttgagggtg aggttgacta cgtgttgaaa attgactttg gcagcacctt ggagtttatg 3660 tggaatcttc tgagggatct ctggtggatt ttacgggcca caatccttag ctttttccat 3720 ccactacaaa accccagtca gaggtcgata ttgaggtggt ggttcttgca gggttacctg 3780 ctggatattt gccacattaa aggctctcaa aacattagag caggtgctta aactgttatc 3840 gaaggttgta tattggtttt tctctctctt tgtctctgta tttctctctt aaattgcttc 3900 ctaggcgcca tggttgctgg gaggagttgg ggagccgagg atggagaacc agtttgacct 3960 tcatactgag gtttaggggg attgtggggt tcgatgtcat gttaaaagac tttggtcttt 4020 ttctttttag ggggggtg 4038 // ID hAT-N32_DR repbase; DNA; ZEB; 511 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N32_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-511 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1302-1302 (2008). XX DR [1] (Consensus) XX CC TSD is 8 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 511 BP; 150 A; 91 C; 117 G; 153 T; 0 other; tagagcagtg gttctcaaag tgggggtcgg gaccccccga ggggtcgcgg gacaatgaag 60 gggggtcgcc tggtgatttc caaaaatcta tttattttta ttaaaccata agaattacca 120 tattttatcc ataacctact gaagagaaaa aaaatagtcg tttatagtta ctatatacta 180 ccgtatatag ttactatagt agcttatagt tactagttct attggattgc gacccctggg 240 gtaattacat tatattaaag acacagcaat agcgtcagat gcagcagatt gattttataa 300 caccaggtta aactttctgg cccatttaca gcactgacat acattaaaaa aaattaaacg 360 aagaatagac tttgggtcta ttggtgtgtg tgcgccattg catgcaaggt ttctgtattt 420 ataaccacct cagaggatat tgggggtcgc gagtcactgg cattgttatt ttgggggtcg 480 cgggctgaaa agtttgggaa cccctgctct a 511 // ID DNA-2-7_DR repbase; DNA; ZEB; 1629 BP. XX AC . XX DT 22-OCT-2008 (Rel. 13.1, Created) DT 22-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1629 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1220-1220 (2008). XX DR [1] (Consensus) XX CC TSD is 2-bp long, and TIR is 17-bp long. The classification of CC this element is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1629 BP; 626 A; 223 C; 250 G; 526 T; 4 other; cactgtaaga aaaaaaaatc gtaaaaaaaa ggtcaaatga ctggcagcta cggctgccaa 60 acaaaaacca taaaattacg gtaaaatgtc tttgttgaaa taaagtgcaa aaaataataa 120 atctacagat attttcatta aaatgtttac agaaaaacac tgttatttta cagacttttc 180 ctagattatt acgatcaaat ccgttcaata aagttacaca ctaagagttt tacaaagaaa 240 cactgttatt atacagactt ttcctagatt attatgatca aacaccatca ataaagtgac 300 tgcactaaaa gttcaagaga aaacaatgtt atttaaaaga attttcccta aatcattacg 360 attaaacacc tttaataaaa tgccaagaca tgctcatgat cgtaatttaa cagttttatt 420 ttggatcaaa aaattttgga aaaaacaaca aacgagatac aactgattaa aattctcaca 480 actttaaaat tacagttgtt acattttatt gaatagtgtt tatgtaaaac atgtagaggt 540 ttaagttata tggaatgatt cagttacaaa tcttaaatga aatggaactg taacgtgaag 600 gggacgctga caaaggagtg cagatccaaa tgcaggttta ttacacagag atggtcaggc 660 aaacaacagt caacacaggg gcaaacagat gtaacattat gcagggaatc cagagtcatg 720 gtcaaataac aggcaaatgg tcagtcccgg cagcaaacaa cgtaaacaat taacaaacaa 780 agcaaggcaa aagaagagaa acgcgtcata atgttcacag taacagtata acaagactaa 840 gcaattggtg cgtgcgtctg tgctgctttt aaagtgcatg taatcagttc ataacgatcc 900 tccgactgtg tgtgtgcaat cagcggaatc tggaacaggt gtatgtgagc gctcttagtg 960 tgtaacttta ttaaatggat ttgatcgtaa taatctagga aaagtctgta aaataacagt 1020 gtttttctgt aaacatttta atgaaaatat ctgtagattt attatttttt gcactttatt 1080 tcaacaaaga aattttacca taattttatg gtttttgttt ggcagccgta gctgccagtc 1140 atttgacctt ttttttacga tttttttttt tttttttaca gagcaagtaa ttacaattta 1200 ttactttaat tttacgtaat tttaaatgta cttcataaaa aacaggaaaa aacactgtgt 1260 aaataatacg tcaattttat tgtataaaac agacaaattg ctgtaatttg tcattactta 1320 tatagtacat aaaataacag tcaagctgtt aatttacaga tattgtttgt aatttttacg 1380 rayttttgaa tataatttaa aataacagaa aaaatactgt ataaataata ctgtaatttc 1440 cctgtataaa aaacaaaaat gtcagtaatt tgtctttaac gataaagtac ctaaaatgaa 1500 agtcaaackg ttaatttaga gagattgttt gtcattttac taagtttcga atataatktt 1560 aaatgacaaa aaattactgt aaaaaattta gggatatttt ctgtaaaatt agtttttttt 1620 tttacagtg 1629 // ID EnSpm-3_DR repbase; DNA; ZEB; 8448 BP. XX AC . XX DT 31-JUL-2008 (Rel. 13.07, Created) DT 27-SEP-2008 (Rel. 13.07, Last updated, Version 2) XX DE EnSpm-3_DR is an autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; EnSpm-3_DR. XX NM EnSpm-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-8448 RA Kapitonov V.V. and Jurka J.; RT "Zebrafish En/Spm DNA transposons."; RL Repbase Reports 8(7), 751-751 (2008). XX DR [1] (Consensus) XX CC EnSpm-3_DR is young family of autonomous En/Spm DNA transposons. CC The consensus sequence was derived based on multiple alignment of CC several copies of EnSpm-3_DR that are less then 5% divergent from CC each other. EnSpm-3_DR transposons are characterized by 2-bp CC target-site duplications and imperfect 18-bp terminal inverted CC repeats (1 mismatch). See also commentary on EnSpm-2_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 2115..4736 FT /product="EnSpm-3_DRp" FT /note="En/Spm transposase." FT /translation="MQCKNCKFSTSSEDVLLKHYQLHHHRISNWPCLYTEC FT VCAFKTAGALRSHLSKSHHNTDRVSRDLSTFNCGLCEFHEICTEKRFFLHL FT QNHLKRKETIQCPFKGCGFKTNNRPTFSSHRSRNHRNQTLSDRQSQIVSDS FT LGVSEVLCTTQNISEFDEEDIIESVEDVNVHTLERKIASLLLCMQTVLHVS FT KSATQTIFEELKSILLFSKSHALHTIETIVTKHNINIESVVIKEIADSVCL FT TNPLFASISEKGTLSTDYKRNQYFKRNFFVIEPTEYLYEHSHKEVFVHVPV FT IRLLETLLNQDSFLNNIEFTHKHLPGQYSSFKDGKYYRESGFVTEDDVKLS FT LAFYVDEFEICNPLGTSRKIHKITAVYWVVLNLPANLRSTLSSIQLAVLGK FT SIDVKKFGFDKFLEPLIKELKSLEHEGLFVEALGHYIKPTVFCVCADNLGA FT HGLAGFQENFNVEKFCRFCLISRDQISTVKPCDFPLRTVDQHDLFVEQLKQ FT SAVQSVNGIKSECALSKHLRYFHPVTGFPPDILHDFFEGVIPVELSLCLRD FT LISKGFITLEGVNHSIRTFPYKYSDRVNKPKTILKTSLAKGSIGGNGHENW FT TLLRLLPLMIGDRIPEHEPSWDILMDLKEIVEIVLSNSLSDETLCYLSFKF FT SDHRLLLTSTFPDYALKPKHHFIEHYPELTKCFGPLVTLWTMRFESKHSFF FT KKVARDAHNMKNVLLTLSMKHQQMIAYHLDAQSLLKSDLHVEKLDVVSISL FT LDATLRHAVQTKFPQLHTVSLSRDVCLHGTRYAKNMIISAGHCNGQPEFFR FT IESMLIHSAKVFFITKRVSAWYLEHLRSYELVESHYTDMVIFDSGDLNGYH FT PLIPYSVGSKVFVTLRTYLQH" XX SQ Sequence 8448 BP; 2644 A; 1442 C; 1556 G; 2806 T; 0 other; cactgtaaaa aataaaattt caatttgttc agcacactgt actttttctg tcatatatac 60 taagaattgt gaataactac tgatctacat aaaaatatat gttatgtcaa taatattcgc 120 taaaatgtta tgtctacatg tttaccagag ttttctggac atacattttt gtgttgaaca 180 ggtatgttga tatttaaata aattgcgcat gcgtgacgca acgtttgaaa acggcgccaa 240 ccctccccca tattgaccat tcgtttcgct cctcatctcc attttgtctt ggtgccatat 300 cggacttcac gcatattttc gactcttcga ggaactaaac attaacttgt tcctgtaggt 360 gagtaacaac acttattaac actaattcta tgcatgcaat gaattttact atattcacaa 420 cctgacggtt tgtttttagc gacgagaatt attgcatatg ctgaagtttc aaatgcagtg 480 ggtaattaat gtggcctata gtttcagcta agtccaaact cataaatgtt tccgtgtatt 540 catgacataa gtggcggatt taggcagagt caatcgctca gggaggcatg gagaccccgc 600 gcgacatcct tcccgcccac gcccgcacct tcgagagggc ggaggaccgt gagttcggca 660 cggtcgtcgt ttgcctaacg ccagttaagc ttgtgccgct ccttaagaca ttacttggaa 720 aaaaacggaa actgcttttc tgtgccgttt aagtttaaaa tgtttcatgt taacactaaa 780 attacatcat tatgcttaac tgttaatcac tgataaaaga tggctaatat gggaccatag 840 tattaacatt tacgttaggg aattcagagg ttttgtttgt ttttaggtga aacgttttca 900 tttctctggc gagttttcac atttttctca ttttacgtgc ttctccgtgt gtgccgccta 960 gtgatgggtc gttcttgaac gattcgttca ttttgaacga atcttttgta tgactcgata 1020 acgacgagtc ctctctagga gtgatttgtc cactcgcgca tgcgcacatt tgtgcaggtg 1080 gaggaaaaga ttagttcatt tatcgagtcc tctatctggt ttaagtcatt cgttcatcac 1140 gttaaagaca taaggcaatg aaatgcaatc agagccggaa atttttttaa atcttctttc 1200 gagtcctcgg cgttgagtca tctctcttta catgctgtca cgtgatgaac gaacgagtca 1260 aaaaccagat gactcgaaag gtgaactaat ggctcctttc ggttcagact gtgctttggt 1320 taagcttata tgggtctgtc tgtgtgacgt agacgaaccc ctcaaatttg aagacctgtc 1380 agaatagctg aactcacatt atcatagaca aaagactagg taaacaaacg attattttct 1440 ttttcttata gcattcaagt tatgacttgt ttgtagtgtg atcaacgtct gggctagttg 1500 tagatgcgtt tggaataacc tgtatcattt taataatatt ttggcaaatt gatggaaatg 1560 acttgaaaaa aagatttgtt catctcaacg aatgagactc aaaggtccaa atcagtaata 1620 tgatccgaac ttcccatcac taagagcgag tgagagaccg taagttgtac ttataaattt 1680 gtgaagcata tataaagcac agatattcag attatattag ataaagtaat atgtaaatgt 1740 agtttatatt ttttatttat aaaatattgc cgaaaacgtt atttaatcct gtcaaattaa 1800 tataaatagt tcctattagg tttaaaatag tcttaatttc aaagcatata ttattactaa 1860 ctatattaca ttatttgtgg actcctctgt atacatcaat atcatatatt gatattgata 1920 tcattgtctg ctcccaaaca tgtttcatgt taaaatatgg gggcataaac ttgaaaaatg 1980 tttaaaaaat tttaaatgtg aagctttttc aaatatattt tttgtaagaa ttatatttta 2040 tttaaagttg tgatttaaca atacaaggtt ataactgatt atttctgttt tcaggatgat 2100 ctcctcaaga cttcatgcag tgtaaaaact gcaaattcag cacttcaagt gaagatgtcc 2160 ttctgaagca ctatcagcta catcatcata gaatttccaa ctggccctgt ctttacacgg 2220 agtgtgtttg tgcttttaaa actgcaggtg ctttacgatc ccacttatct aaatcacacc 2280 acaacactga tagagtcagt cgagatcttt caacttttaa ttgtggacta tgtgaatttc 2340 atgagatttg tactgaaaaa agattttttc ttcacctaca aaatcatctg aaacgtaaag 2400 aaacaattca gtgtcccttt aaaggatgtg gatttaaaac aaacaaccgc ccaaccttta 2460 gctcgcatag aagcagaaat catagaaatc aaaccttaag tgacagacag tctcagatag 2520 tttcagatag tctcggagtc tcagaagttc tttgtacaac acagaatata agtgaatttg 2580 atgaagagga tataatagag tctgttgaag atgtaaatgt tcatacactt gaacgcaaga 2640 tcgcctcact tttattgtgt atgcaaactg ttttgcatgt ctcaaaaagt gctactcaga 2700 caatttttga ggaacttaaa agtattttgt tgttctcaaa atctcatgct cttcatacaa 2760 tagaaacaat tgtaacaaag cacaatatta acattgaaag tgttgtaatc aaggaaattg 2820 cagattctgt ttgtctaaca aatccacttt ttgcatcaat ttctgaaaag ggcactttgt 2880 ctactgacta taaacgaaat caatatttca aaaggaactt ttttgtaatc gaacctactg 2940 aatatcttta tgagcattct cataaagaag tgtttgttca tgttccagtc attcggttgc 3000 ttgaaacctt gttaaatcaa gacagctttt taaataacat tgaatttaca cataaacatc 3060 tccctggaca atacagctca tttaaagatg gaaagtacta cagggaaagt ggatttgtta 3120 cagaagatga tgttaaacta agtttagcct tttatgtgga tgagtttgaa atttgcaacc 3180 ctcttggaac atctcgaaaa atccataaaa tcactgctgt gtactgggtg gtcttaaatt 3240 tacctgcaaa tttaagatct actttatcat caatccagtt agctgtttta ggaaaaagta 3300 ttgatgttaa aaaatttgga tttgacaaat ttcttgaacc tttgataaaa gagttaaaat 3360 ctctggagca tgaaggtttg tttgtggaag ctttaggaca ttatataaaa ccaactgtat 3420 tctgtgtgtg tgccgataat cttggagcac atggtcttgc tggttttcag gaaaatttta 3480 atgtagaaaa attctgtcga ttctgtttga ttagtcgtga tcagatttca actgtaaaac 3540 catgtgactt tcctttgaga actgtggatc aacatgattt atttgtagaa cagcttaagc 3600 agagtgctgt tcagagtgtt aatggtataa agagtgagtg tgcattgagc aaacacttaa 3660 gatactttca tcctgtaact ggatttcccc cggacatttt acatgatttc tttgaagggg 3720 tcatccctgt ggagttatct ttgtgcctca gagacttaat ttccaaaggt ttcattactc 3780 ttgaaggagt aaatcactcc attagaacat ttccttacaa gtactctgac agggtcaaca 3840 aaccaaagac aattctaaaa acaagtcttg ctaaaggatc aatcggagga aatggacacg 3900 aaaattggac gttattgcgc ttacttcctc tgatgattgg ggatcgtatc ccagagcatg 3960 agccatcatg ggacatatta atggacctaa aagaaatagt tgagattgtt ttgtcaaaca 4020 gtctctctga tgaaactctg tgttacttgt catttaaatt ttctgaccac cgcttgcttc 4080 ttacttccac ttttccggac tatgcattaa agcctaagca tcactttatc gaacactacc 4140 cagaactaac taaatgtttt ggacctttag tgactttgtg gaccatgcgc tttgagtcta 4200 agcactcttt cttcaagaag gttgcacgtg atgcccacaa catgaaaaat gtacttctca 4260 ctctttccat gaaacatcaa cagatgattg cataccattt ggatgcacaa agccttttaa 4320 agtcagactt gcatgttgaa aagttggatg tggttagcat atcattgttg gatgcaaccc 4380 tgaggcatgc tgtacaaaca aagttcccac agttgcacac tgtgtcactg tccagagacg 4440 tttgtcttca tggaactaga tatgccaaga acatgatcat atcggcagga cactgcaatg 4500 gacagcctga gttcttcaga atagaaagca tgttgatcca ttctgccaaa gtgttcttta 4560 taacaaaaag ggtttctgcc tggtatttag aacatttgag atcttatgaa cttgttgaaa 4620 gccactatac tgacatggtt atctttgact ctggtgacct aaatggctat cacccattaa 4680 ttccatacag tgtgggatca aaagtgttcg tgaccctgag gacctatttg cagcattaaa 4740 tgtcctttac aatggtatgt accacattaa ctaattaatt tattaattgc ttaaatttag 4800 taattaaaat tggcttcttg tttatttaca gcctttgcta ctacgagtca tcatttcctc 4860 cactgaagcc cggcgagtcc agcttcctga agtgcctgaa tcagtggaat ctctcatcac 4920 tattcttcaa gagaagctgc aattacaagg acagttttct cttaagtttg aggatgctga 4980 ttttggcaat gcactttgca acctgtctga catctcagaa ttgcccagtg gaaaagcagt 5040 cttgcatatt cagtggtgca agtcatcagc ttatgaaagc agtagccttc catcagtttc 5100 atcacttgat actgctagtc tcgactctga agaatccttg ccaagcactt caggctctgt 5160 gcaaaactat ttacgtactg cctcagaatg gccctcgcca ttccccatac cagcgctgtc 5220 atttgatgtg gagctaaaac taagacgagg aaacgaggca tttgaaaaaa caaaaatagg 5280 cattgatgtg actagagaca tgaaaataga gattcttgac aaaatagtgc agacagtttt 5340 tgacataaag gcataccctg acaatcagga aattgaatcc attgcatctg ctttggtttt 5400 aaaatatcct tgccttaagg agcctggcaa aggaaagggt tttgagggat ggttgatcag 5460 catcaaaaac aagctaaaca attatagggc aaagttgcga gaggcaggtt gcaatgaagt 5520 aattgttaac agaaagcgaa acgatgatgc cagtggtcgg aggagtttca ctctgaaaaa 5580 ggcaaagcgt ggagaagtca atcatgtacc ggaacatcca tgcaaccaca ctgacacttc 5640 acttgaagag caaagagttt ttttggtaga ggaaaccaaa aaggcaagaa gagacatggc 5700 agccataagt gaaaaaatgg aactaacatt ttcccttaga agaaaagaaa tggtccaaga 5760 gcagccaatg attgtagagg ttcaggagag gtggcctgca ctcttttttc aagaacaggt 5820 aaagataatt tttttcttac ttattaggct ttaagtttac agaatgcaag atgctctgtt 5880 tacagaaaaa attgattgct gtaaatctgg ttggtttgtt ttaaatagat ctgtgaggaa 5940 tttttccgca tcaccaacaa agacctacta ggagtcttca tggcagccac tgacaagtac 6000 acaccaaagc ttctgaaatt atacagagcc agaaaaggag cgtttggaca tgaaatggaa 6060 gagctcctgg aaagacttga tgaaagggta agttaataat tttcattatt tacatgtttt 6120 tgatctatgt agttgatgat atttcagttg ttattctgtg actttttggt tgtatctttt 6180 ttatgttttt gcagacaaca gaaattgtta atcacagaag gactgctgct ttggagggcc 6240 tgcctttgtt tcttcgagag aaacacacta accttttcaa gaaatgtaaa gtaggtatca 6300 aacactttaa atggtttaaa gcatgagtgt ccaaacttgc ttctgtattg ctggtgtctt 6360 gaaaagttga gtgccaatct atttggagct ttcatgaaaa cttttgatta gctgagttaa 6420 aatgtttaat ttaggtatca ttaacagaaa ttacaaatct tctgacctta aggtataaga 6480 agtaaatttc atgattcaca agtatcttgt tatttttaaa agattgatat caggttctat 6540 attagtacca gtttagacct gctcactgat ttatgcaggt agttaaggga aagaacatgt 6600 aaaagagatg caagagaaac tgttgctttg tctgttatta cataaaatta gagttgtgcc 6660 gagagtctga gacgttattc ttgtcaacca aaatggcagc tgttttattc agagtacaac 6720 aaatcagcat ttatgtattt attcatttat gacagtattt taaaataaat cacttaaaat 6780 atttaaagtc caattactgc attaatgttg catatgaata ctgatgaatc cttgttattt 6840 acatgttgtc tgtttaattg ttacattcgc ttttgcttaa tgatttgaat tataacacta 6900 tgtcttccta cagacaaact acacattaca tagcagtttt atgatcataa ctatttgatg 6960 taactggtca ttttgcttaa ctgcaattaa gtttgcatag tcattgtatt atgctggttt 7020 gttattttta cagaatgttc atatttctga aagataatgg tatatatttt taactgaata 7080 aataactttg ttacaattat ttaaatgtgt gtgttgttat agccctataa taaacaagta 7140 cttgcaaaat tttttaaatc tctataatca ccattaattg gaatatttta tttgacaatt 7200 aatcgtcacc taaatttcat aagcatgaca gctatatttg caatataaca tcttgcttgt 7260 ttgtagataa agtactataa tgactttcta tgctcaaata ttcttattgt gttttcacag 7320 gaaactgaag atggaacaaa gggcgtatca gttggcattc tctatgttac ggaagaggac 7380 tcccgggcag catctccagt gatccagaac attgctgttg tactggaaga ggttgttgtt 7440 ttggaagaca ttccagatac ctcaagtgct gtagcatacc tctttggcct tctttatgca 7500 ctcaactttt cctatcccaa agaactccgg tacacttttg acacaattca gaatgttttc 7560 atggagcttg ggactggatg tacgcaacgt gtgctttctc ttaaaaacaa gatgttaaac 7620 taatactgta aacagtgttg tgtcaatggt ctgatgttac aaagacctaa tgtacaattt 7680 gagtactaaa tctgtttcat gagagccact gtttaggcac tgtgttcagt taaatgttaa 7740 aggttttaaa attaaaaggt ttacattttg tcaagacacc caagcactta acttttgttt 7800 tggtttaaat gtttatggaa actggtgtcc aaatcaaccc ttttgtttat aatttgtgca 7860 taaaagcatt caaagcaaat ttagtttgtg tcaaatgcaa tgtttaaatt gttaatttct 7920 gtgctgttat tgcagctaaa atgttaaagc tagcatatgt gcctgcaaat tagcagaaaa 7980 gtttgtatac tgcaaagttt gtaatttgaa gcacttttca tcttaaaata tttacaaatg 8040 tttgtgctat ttcctattgt aaagcgtgtt atctgttgct ttcagtggaa ataaattgaa 8100 gaagtagtaa gtttgtgttt gtcatttgta agtagaatga ttaaaacaaa atgatatgct 8160 aactagtact aattacgtat tttgcaaatt tgaaaataaa catccactaa acagaaaaaa 8220 attaattgac tcaatagcag taactgttgc tttagttgat gtgacaagaa tatctttatt 8280 agtataatac tataatcaat attgcatgaa caaagaaaat tatgttagct caactaaata 8340 ttgttactga gaagaactaa aaaacagttg ttgagacaac tcaaagttct tactgcatca 8400 agttgcctta tttttttatg tttgctcaac attttttttt ttacagtg 8448 // ID Gypsy40-LTR_DR repbase; DNA; ZEB; 936 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy40_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy40-I_DR; KW Gypsy40-LTR_DR; Gypsy40_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-936 RA Dib M.R. and Naveira H.F.; RT "Gypsy40_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 811-811 (2007). XX DR [1] (Consensus) XX CC Gypsy40-LTR_DR is a long terminal repeat of the Gypsy40_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy40-I_DR. XX SQ Sequence 936 BP; 201 A; 192 C; 180 G; 363 T; 0 other; tgttacagtc atatgtatat gcctgtttgt tttgcttgtg tgtgtgtgtg tgcgctctct 60 ttctctccct ctctcgctcg ctcgcttgct ccctcctcct ctctctgtca gttctcaggt 120 tcgcccattc cagtatccca ttcgtccgtg aagacgtcaa tctcaattac accttgtgac 180 gtcatcacct tcacaaccaa tcagcacagg agctcaaagt ttaaaaggga tgcgagcggg 240 gaagctcgtt ctgcttttgc tctgccttgc ttgcggtctt tgcttgcctt tgtgtgtgtt 300 ttttggcttg ttgttatgtg ctgtgatgtt gacgtgaact tagttttgat tttggtcttg 360 ttcttgtgtt gatcgctgtt accgtcttgg atgtgtatat gtgacaatcc cctgacgtct 420 gtcttcagcc tttcccttgg attcgaggag tttagatgtc acgtgacata catattgcat 480 cacgcagata aaactccact gtctaaacac actagtttag aagtgagcgc cacccgctgt 540 aagttttctc cgtatatttg gttagaagag tagattagtt atagcgtgaa aacgctgaag 600 atgttttgtt tgtttatttt gtattagaca gtttagaggg aaattattag ttagaattgg 660 ggtttttgtt atatttcttt catttatttg gcgccactta actctcgttt cccccaccat 720 ctttaatcta tattgattga tttattgtat gttgtgagat cttgtaaata atactttatt 780 gctttttgac ttcctttcac tattgtaaat aaattcactt atttttgcag tattttggtt 840 gtcgtgttga tattttgcca ccaactgacc acaaaatatt atcaccttaa atgttacctt 900 ttaaacccta gataccaaaa cttaaaagtc gtaaca 936 // ID Gypsy44-I_DR repbase; DNA; ZEB; 5077 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy44_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy44-I_DR; KW Gypsy44-LTR_DR; Gypsy44_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5077 RA Dib M.R. and Naveira H.F.; RT "Gypsy44_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 819-819 (2007). XX DR [1] (Consensus) XX CC Gypsy44-I_DR is an internal portion of the Gypsy44_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy44-LTR_DR. CC Gypsy44_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1597-aa polyprotein CC Gypsy44_DR1p (pos. 198-4988) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 198..4988 FT /product="Gypsy44-I_DR_1p" FT /note="ORF." FT /translation="MEFDLIEFSLAPTVEVFNRCRKKDLLLIAEFFKISVS FT RDVTKAVLKDELYTELVNSGILPVESKTDGLEEQELVEIAETNVDAVPIGI FT EPIMDPLLEIKLKELELAIKQQEHETELVHLRVIEVQADRDIKLKTLSLEA FT EALRYKPVSAPSSRPSSPLTALPAQSSTVNLNKPVSAPSSRPSSPLTAVPA FT QSSTVNFGSPNDARVNFDVAKYIKLVPPFREAEVDAYFTAFERIAEKLGWP FT KDMWGLLLQCNFVGKAQEVCAALPIDQSLDYDVVKAAVLRAYELVPEAYRQ FT RFRGTTKTAKHTFVEFAREKRTLFEKWCLASKVNSLDQLQELILLEEFKNC FT LPENIVIHLNEQKINSLSEAAVLADEFALTHRSVFSSTRQFRHRGFANEQV FT KAISPVLSSEKSEKVKPMLDNSSRKRVCFYCLDSSHLISDCKAWKQKSAST FT KSKSVAFAQAGSVSDPSFQPLQDSGYKPFLITGSVSLSADSPGQTVSILRD FT TGAAQSFILAGVLPFSDKTYTGTDVLVRGIELSCIKVPLHTVFLKSDLVTG FT QAQIAVRSELPVEGVSIILGNDLAGGKVFPCPIVLDKPEVTVQPDVVAAHF FT PAVFPACAVTRAQSRKLDDVVNLSDSFMNPVSGMSECTLFINPEISSPCER FT SDSDSAATLLKVGKEDLAAAQKADPSLSLSLDASVSFEKAKNLSGPVTYYW FT EDGILMRRWKPQVNGDVLAVHQVVLPSAYRPQVLKLAHEHPLAGHLGITKT FT YKRILKYFFWPGLKNSVVSYCRACHDCQLSGKPNQTVPTAPLQPIPVINEP FT FERLILDCVGPLPKSKSGHQYILTLMCAATRFPEAFPLRSLRASVIVKEII FT KFCSTFGLPKVIQTDQGSNFTSKLFAQVLKELGVSHQMSSAYHPESQGALE FT RFHQTLKTMLRTFCVSTGKDWVEGLPLLLFAVRETVQESLGFSPSELVFGH FT TVRGPLKLLQEQLLSKETSAVNLLDYVSNFRERLHSACDIAKAHIVCVQSK FT MKSRFDKKSVKRSFQPGDQVLVLLPVPSSALHARFAGPYCIEKKLSETNYV FT ISTPDRRRKSRVCHINMLKSYVSENKTQDSAANEVEAARPVEVKTVVLSSI FT VDGSDEDEVIPAHFQNSTILSQLDNYLAYLPEDQKQSLAQLFNKYPMLFSD FT VPGRTSMITHDIDVGNSPPIKQHPYRVNPHKREIMKQEVEYLLQHGLATPS FT QSPWSSPCLLVPKQDSTFRFCTDYRKVNSVTKPDSFPLPRMEDCVDRIGSA FT HFITKLDLLKGYYQVPLSHRASEISAFVTPDNLLQYSVMAFGMRNAPATFQ FT RLMQKVLSGIPNCEAYLDDVVVYTHSWEEHLSILDEVFKQLTRASLTLNLK FT KCEFAKAVVTYLGKKVGQGQVKPVEAKVEAILKFPIPCNKRELRRFLGMVG FT YYRSFCQNFATIVTPLTDLLSTARKFVWSPECDSAFKAAKDLLSSAPVLSA FT PCFDLPFLLQVDASSSGAGAVLLQDVNGVEHPVSFFSKKFTSAQQKYSTIE FT KEALALVLALQHFEVYLGGSSCVTVYTDHNPLVFLHRMRNSNQRLMRWSLI FT IQEFNLTIQYRKGSENVVADALSRISHAVE" XX SQ Sequence 5077 BP; 1377 A; 1063 C; 1103 G; 1534 T; 0 other; gtgaaaattg ggggctcgtc cattggattt gaacaaagtt aaacgttacc ttttcaaatt 60 tgactttgag aattttgtat tctacaattt gaattttggg gaagttttgt aacttattag 120 tgatttttct tagtctaatt gtagttgtgg ctgttttctt ttttttggaa aggggcagat 180 aggttgtact ttgtaaaatg gagtttgatt tgattgagtt tagtcttgct cccactgtag 240 aagttttcaa tcgttgtcgt aagaaggatt tgctgttaat tgctgagttc tttaaaatct 300 ctgtgtctag ggatgttact aaagcagttt taaaagatga gctatatact gagttggtta 360 attctggtat cctaccagtt gagtcaaaaa ctgatgggct agaggagcag gaattagttg 420 aaatagctga aactaatgtt gacgctgttc ctattggtat tgagccaatt atggatcctt 480 tattggagat aaagctaaag gagcttgagt tagccattaa acaacaagag catgaaactg 540 aacttgttca tcttcgtgtg attgaagtac aagctgatag agacattaag ctgaagactt 600 tgtcacttga ggcagaagct ctacgctata agcctgtgtc agctccaagt tcaagacctt 660 cctcacccct gactgctctt cctgcacaga gtagcactgt taatttaaac aagcctgtgt 720 cagctccaag ttccagacct tcctcacccc tgactgctgt tcctgcacag agtagtactg 780 ttaattttgg ctctcccaat gatgctcggg ttaactttga tgtggcaaag tatattaaat 840 tagtaccccc ttttcgagaa gcagaggtag atgcatattt cactgcattt gagcggattg 900 ctgaaaaact tggttggccc aaggacatgt ggggtttact actccagtgt aactttgtgg 960 gtaaagctca ggaagtatgt gctgcacttc caatagatca gtccttggat tatgatgttg 1020 taaaagctgc tgttttacgt gcatatgaac ttgtgcctga agcatatcgc cagcgctttc 1080 gaggtacaac gaaaacagcc aaacatacct ttgttgagtt tgcacgggaa aagagaaccc 1140 tttttgagaa atggtgtctg gccagtaagg ttaactctct cgatcaactg caggaactca 1200 tcttgcttga agagtttaaa aattgtcttc ctgagaacat tgtgatacac ctaaatgagc 1260 aaaagataaa ttccctctct gaagctgcag tattagcaga tgagtttgca ttaacccata 1320 ggtcagtatt ttcaagtact cgtcaattca ggcatcgagg ttttgcaaat gaacaggtaa 1380 aggcaatctc acccgttctt tcatctgaga agtcagagaa agtaaaaccc atgctggata 1440 attcaagccg aaagcgtgtt tgtttttact gcctagattc tagtcaccta atatctgact 1500 gtaaagcatg gaaacagaag tctgcttcta caaagtcaaa gagtgtagct tttgctcaag 1560 ctggttctgt atctgacccc tcatttcaac cactccagga ttctggatat aaacccttcc 1620 tgataactgg ctctgtgtct ctttcagctg actcaccagg ccaaacagta tccatcctga 1680 gggatacggg agctgctcag tctttcattt tagctggagt gttacctttc tctgacaaaa 1740 catacactgg cacagatgta ctagtcagag gaatagaact gagctgtatt aaagtaccct 1800 tgcatactgt ctttttaaaa tcagatctgg tcacaggcca agctcaaatt gcagtacgga 1860 gtgaactgcc ggtggagggc gtaagcatca tccttggaaa tgacctggct ggaggtaaag 1920 tttttccttg tcctattgtg ttggataagc ctgaggtgac tgtacagcca gatgttgttg 1980 ctgctcattt ccctgctgtt tttcctgcct gtgctgttac tagagcacaa tctcgaaagt 2040 tggacgatgt tgtgaacctc agtgactcct ttatgaatcc tgtgtctggc atgtctgagt 2100 gtacactctt cattaaccct gagatttctt ccccttgtga aaggagtgat tctgattcag 2160 ctgcaactct tttaaaagtt ggaaaggagg accttgctgc tgctcagaaa gctgatcctt 2220 ccctgtcgtt gtctctggat gcttctgtca gctttgaaaa agctaaaaat ctgagtggtc 2280 cagttactta ctactgggag gatggtatac tcatgcgacg atggaaacca caagtaaatg 2340 gagatgtgtt agctgtacat caagttgtgt taccttctgc ttaccgtccc caggtcctga 2400 agctggcgca tgagcatcct ttggctggtc accttggtat cactaagact tacaagagaa 2460 tcttgaaata cttcttctgg cctggattga aaaacagtgt tgtgagttat tgcagagctt 2520 gccatgactg tcaactctct ggtaagccta atcagactgt tcccactgcc cctcttcaac 2580 ctataccagt tataaatgaa ccatttgaac gcttgatctt ggactgtgtt gggccccttc 2640 caaaatcaaa atctggtcat cagtacatac taactttaat gtgtgctgct acaagatttc 2700 cagaagcgtt tccactgcga tctctgcgag ctagtgtaat agtcaaagag attatcaaat 2760 tctgctccac atttggcctt ccaaaggtca tccaaactga tcaaggttct aactttacat 2820 caaagctgtt tgcacaggtt ttaaaggagt tgggagtgtc tcatcagatg tcaagcgcct 2880 accacccaga atcgcagggg gcgctagaac ggttccatca gactctcaag accatgctcc 2940 ggacattttg tgtgagtact ggaaaagact gggtggaagg tctgcctttg ctattgtttg 3000 ctgtccgaga aacggtccaa gaatcattag gattcagccc atctgaactt gtttttggcc 3060 acactgttcg tggaccccta aaactcctcc aggaacaact gctgtctaaa gaaacatctg 3120 ccgtcaacct gttggactat gtcagtaact tccgtgaacg actgcatagt gcctgtgaca 3180 ttgctaaggc ccatattgtg tgtgtccagt cgaagatgaa gtcaagattt gataagaaat 3240 cagttaaacg cagttttcag cctggtgatc aagtgcttgt tctactccct gttccatcct 3300 ctgcactcca tgctagattc gctggcccct attgcattga aaagaagttg agtgagacaa 3360 attatgtaat ttctactcct gaccgcagaa gaaaaagcag agtttgtcac ataaatatgt 3420 taaagtctta tgtgagtgaa aataagacac aagattcagc agccaatgaa gttgaggcag 3480 cccgacctgt tgaggtgaag actgttgttc tcagttctat tgtggatggt tctgatgaag 3540 atgaagtgat acctgcacac tttcaaaatt ctacgatatt aagtcagctt gataactacc 3600 ttgcatactt gcctgaagat cagaagcaat ctcttgccca attgtttaat aaatatccca 3660 tgcttttctc tgatgtccct ggcagaacct ctatgatcac acatgacata gatgtaggaa 3720 actctcctcc catcaaacaa cacccttacc gtgtaaatcc tcataagcga gagataatga 3780 agcaggaagt tgaatatctc ctccaacatg gtcttgccac accaagccag agcccttgga 3840 gctccccatg tttgttagtg cccaagcaag attcaacctt ccgtttctgc actgactacc 3900 gcaaagtaaa tagtgtgacc aagcctgatt cctttccttt gccccgaatg gaagattgtg 3960 tggacagaat aggtagtgcc catttcatta caaaacttga tcttctaaaa ggatattatc 4020 aagtgccact aagtcaccgt gcatctgaga tttcagcctt tgtgacccct gataacctgc 4080 tgcagtactc tgtaatggcc ttcgggatgc gcaatgctcc tgcaaccttc caaaggttaa 4140 tgcaaaaagt gttgtcagga atccctaact gtgaagctta cctagatgat gtagtggttt 4200 acactcacag ttgggaagaa catctgagta ttcttgatga agtcttcaaa caactgacta 4260 gggcttctct aacacttaac cttaagaagt gtgagtttgc caaagcagta gtaacttacc 4320 taggtaaaaa ggtaggtcag ggtcaagtga agcctgttga ggcgaaagtt gaagctattt 4380 tgaagtttcc cattccctgc aacaaaagag agctgagacg tttccttgga atggtgggtt 4440 actataggag cttctgtcaa aactttgcta ccatagtaac ccctttaaca gacttgctca 4500 gcactgcccg aaagtttgta tggtcccctg aatgtgactc tgcctttaaa gccgccaaag 4560 acctcttaag tagtgcacct gtactctctg ccccttgttt tgaccttcca ttcctactgc 4620 aagtagatgc cagttcctca ggtgctggag ctgtgttact gcaagatgtt aatggtgtag 4680 aacatccagt atctttcttt tcaaaaaaat ttaccagtgc ccagcaaaag tacagtacga 4740 ttgagaagga ggccttagcc ctggtgttgg ctttgcaaca ctttgaagtg taccttggag 4800 gtagtagttg tgtcactgtg tatacagacc acaatcccct tgtctttttg cacagaatga 4860 gaaattcaaa tcagcgtcta atgaggtggt cccttattat tcaggaattt aacttgacta 4920 tccagtaccg taaaggttct gaaaatgtcg tagctgatgc cctttcgaga atttcccatg 4980 ctgtagagta gaaaggaaaa gaaaaatgag aatttggttt gtatagtttt tttttttttt 5040 tcaagattgg ttaacttgaa aacttatggg tgggggt 5077 // ID hAT-2_DR repbase; DNA; ZEB; 2536 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE hAT-2_DR is an autonomous DNA transposon - a consensus sequence. XX KW hAT; DNA transposon; Transposable Element; hAT superfamily; KW hAT-2_DR; transposase. XX NM hAT-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2536 RA Kapitonov V.V. and Jurka J.; RT "hAT-2_DR autonomous DNA transposon from zebrafish."; RL Repbase Reports 4(11), 305-305 (2004). XX DR [1] (Consensus) XX CC hAT-2_DR is an autonomous DNA transposon. This transposon is CC characterized by the 8-bp target site duplications and 16-bp CC terminal inverted repeats. The consensus encodes the 611-aa CC hAT transposase (pos. 255-2087). The genome contains about CC 1000 copies of hAT-2_DR nonautonomous elements that are ~97% CC identical to the consensus. Nearly all these copies are CC nonautonomous elements. The transposase-encoding elements are CC present just in 2 copies. XX FH Key Location/Qualifiers FT CDS 255..2087 FT /product="hAT-2_DRp" FT /note="Transposase" FT /translation="MPANTMDRFLRPPDSKPSSSGLKPKRRRYDDQYLSLG FT FTWTGPADEPRPLCVVCQDILANDSMRPAKLRRHLETKHDEVAGKPPEFFK FT RKLQTLQGQKKIVEDFVKLNGKATEASYRVALRIAKAGKAHTTGETLILPA FT AKDICSVMLGEAAASKIDSVPLSDNTISRRISDMAQDVKEQVLDGVRHSPF FT YALQIDESTDVASCAQLLTYVRYVKNMDIHEEFLFSSPLPAHTTGEQIFNQ FT LNEFVRKNDVEWERCCGICSDGAKSMTGRYSGLMSRVKEVAPNAIWTHCTI FT HRQALAAKKMPNDLRSVLDEAVKIINLIKARPLNARLFHILCDELGAHYKQ FT LLLHTEVRWLSRGRVLSRLLDLREEVLLFLSNVQSTLVQHMSDLSWIARLA FT YLSDIFERLNALNLSLQGRDCNVFSAFEQVSSFRRKLDLWATRVEKGCLDM FT FPTLADFMQEAGSVVHIQPLVAEHLRGLCQQFTHYFSNETILDEWIRNPFK FT FKPAESDVLSIQDEEALIDLTSNHELQQMITHSSIEHFWLSVQNEFPELTQ FT KALRKLLPFVSTYLCEPEFSALTFIKNKYRSRLQVEDDLRLFLTSLQPRIS FT LLCAARKQLHTTH" XX SQ Sequence 2536 BP; 746 A; 506 C; 560 G; 724 T; 0 other; caggggtttt caaagtgtga ggcgcgcctc ccctgggggg cgccagagca tgtcagggga 60 ggcgcgggaa aaaatattat ataataaaaa tataattatt aagtttaatt attatatgta 120 ttttttatta tatttaaacg ttttaattaa acaaagctaa aaaaataata cgtcaaaaat 180 aagaaaacct tttttaccca gaaggccata gctgtgaatt cgcttctgtt tggcaagccc 240 gccaatacag gtatatgcct gctaatacaa tggatcggtt tctgagaccc cccgattcaa 300 agccttcaag ttcagggctt aaacccaaaa gacgacgata tgatgatcag tatttgagtt 360 taggatttac gtggacagga ccagctgatg aaccacgacc tttatgtgtg gtttgtcaag 420 atattttggc taatgacagc atgagacccg ctaaacttcg gcgacacctt gaaaccaagc 480 atgatgaggt agcaggaaaa cctccagaat ttttcaagag aaaacttcaa acccttcaag 540 gtcagaaaaa aattgtggaa gattttgtca aattaaatgg aaaggccact gaagcttcat 600 atcgcgttgc attgcgtatt gccaaggcag gcaaagcaca taccaccggg gagacgttaa 660 ttctgccggc agcaaaagac atttgttctg tgatgctagg agaggcagcg gcttctaaga 720 tcgattctgt cccactctct gacaacacaa taagtcggcg catctcagat atggcacagg 780 atgtgaagga acaagtttta gacggcgtca gacacagccc attttatgca ctccagatcg 840 acgaatccac agatgtggcc agctgcgctc agctgttaac atatgtgcgg tacgtgaaaa 900 acatggacat tcacgaagag tttctattta gtagtccttt gccagcccac acaacaggtg 960 aacaaatttt taaccagctg aacgaatttg tgagaaagaa cgatgtagag tgggaacgct 1020 gctgtggcat atgcagcgat ggggcaaaat caatgacggg ccgctacagc ggtctcatgt 1080 cgagagttaa agaggtagct ccgaatgcca tatggaccca ctgcactatt catagacaag 1140 ccttagctgc caagaagatg ccaaatgatc ttcggagtgt cctcgacgaa gctgtgaaaa 1200 ttattaacct cataaaagca cgacctctaa atgctcgtct tttccacatt ttatgcgatg 1260 aattgggagc gcattacaaa cagctgcttt tgcacaccga agtccgctgg ctgtctcggg 1320 gcagagttct atcacgactt ttggatttgc gtgaggaagt actacttttt ctgtcaaatg 1380 tgcaatccac tctggtgcag cacatgagtg atttgagctg gatcgcaagg ttggcttatt 1440 tgtcggacat attcgaacgc ctcaacgcgc ttaatttatc attgcagggc agagactgca 1500 atgtgttttc ggcatttgag caagtttcct cgttccggag aaagctggat ctatgggcca 1560 ctcgtgtgga gaaaggatgc ttagacatgt ttcccacgct ggctgacttt atgcaagagg 1620 cagggtcggt ggttcatatt caacctttgg tcgctgaaca cctaaggggg ctgtgtcagc 1680 aattcacaca ctacttttcc aacgagacaa tactggatga gtggattcgc aatccattca 1740 agttcaagcc agcagaaagt gacgtactgt ctatccaaga cgaagaggct ttgattgatc 1800 tgactagtaa tcatgaactg cagcaaatga ttacacactc ttccattgaa catttctggc 1860 tctccgttca aaatgaattt cctgaactta cacaaaaagc actgaggaaa cttttaccat 1920 tcgtttcaac gtatttgtgc gaaccagaat tttctgcttt gactttcatc aagaacaaat 1980 atcgttcacg tcttcaagtg gaggatgacc ttcgtctctt tctgacgtca ctacaaccac 2040 gaattagtct tctctgtgca gcaaggaaac aactgcatac tacccactaa ggtaggaata 2100 taattatgta agcatattaa tcaatagtag agatatttaa attttggcgt tgtttaaagt 2160 tctttcccca ctctgttaca gatcatctga ctgttttgac tgggatggac agttcttgga 2220 tctgttctat tcatattgaa ccatcatcct agttttaaaa acgttatatt aaaatacttg 2280 ttattactct gtaaataatt gcactttcat gcaaatgatg ataaaagtga gttaacagtc 2340 tgacatctgt ctgtgtcttt atatatactg tatatatata tatatatata tatatatata 2400 tatatatata tatatatata tatatgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgcgtg 2460 tttgggagga ggggggcgcc aatggataag ttgtgtcaaa agggaggccc actgtcttag 2520 actttgaaaa accctg 2536 // ID Gypsy74-LTR_DR repbase; DNA; ZEB; 1023 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy74-I_DR; KW Gypsy74-LTR_DR; Gypsy74_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1023 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1464-1464 (2008). XX DR Genome; chr13; Positions 21207022 21206000. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1023 BP; 259 A; 143 C; 214 G; 407 T; 0 other; tgtaacagtg tgaaaaatac tttcattttg acgttttatt tgtttattta tttttattta 60 attcatggaa atagaaactg aaagcaattt gtattatttt cctttggtgt cctttaactg 120 tttaccatta ctagtaatct ttttgtcaaa ctgaagtctc atatgttatg ttataatgtt 180 atatttatgg gttatgaact tcgcaatttt gttcctattg tttatagtta atcagtggtg 240 tttggatagc ggaacaattt ttggggtgat gtttgagggg ggagcaactt tagcgggaca 300 cttctggctt tctagagagt gcgcatgtgc tgaaatcgtc agtccgacgt tttgcactgc 360 tgacgagcgc atggctctcc gtctctcatc tatatacatg gtatgagcac aatattgata 420 aatatgagta tatcctgctt gacaacaatt atataagtgt gtataagtgt atctaacgat 480 ggattgttgt aaatgaagtt tgatgtgttg agtttaagct tgatttattc atgaggttta 540 tgctgctaac tgtttagcat tgtgctcggt tttgctagtg tgtgtgttgg agacagagag 600 agagagtcca gttgatgtcc agttgctcca ctgctgttta tttattcttt tcaggtatgt 660 taattaattt tgcttataaa gaaaaaggga aaggtttaca gttatatgta aatgaacata 720 ttttgtttaa ccagatgttt tatatatata ttttgtatat ttattttcct ctttgccctt 780 ttatatacat ttaatttaat attgtatttt gtgctgaaat cgtgagtccg acgttttgca 840 ctgctgacga gcgcatggct ctccgtctct catctatata catgtgtgtg tgttggagac 900 agagagagag agtctagttg atgtccagtt gctccactgc tgtttattta ttcttttcag 960 aataaacaag tttcatctct gaagtggtcg tgggtttgtg aattctaaac tgtccatgct 1020 aca 1023 // ID hAT-N61_DR repbase; DNA; ZEB; 525 BP. XX AC . XX DT 28-OCT-2008 (Rel. 13.1, Created) DT 28-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N61_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-525 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1330-1330 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 525 BP; 164 A; 113 C; 83 G; 162 T; 3 other; caagggcgta gaattggcat ggacggaggg gacgtgtccc caccaatatc caccaattac 60 tgaaatgtcc ctaccaataa tttaatcgac ttcaaaataa artaatgctc catcaactct 120 taacctacac gtgcaccaag tcaagttcgc tacgagttca ccgtaatact tattaaccaa 180 ggctgtgcaa ttaatcaaaa ttcagtttcg attttggcct ccatgattat gaaaaacaat 240 aatcaggata aaacagttaa attgcgtcac acacctctct aaatctctct aaatttctct 300 acattcatac ctcctcaaag cccgactgca gtaaaatcat gtaaattgac ttttgcagca 360 tgggacgkga ttgttatwtt ttatttgtat tattaagtgt ttattttata ttattaagta 420 ctttattcat gtttttaaaa gcgcgaaaca gaatttgtgc tgttcaatgc atgcgctcta 480 gggatgtatg tccccaccaa tgtcaagagc aaatctacgc ccttg 525 // ID Gypsy172-I_Dr repbase; DNA; ZEB; 4216 BP. XX AC . XX DT 23-JUL-2009 (Rel. 15.09, Created) DT 23-JUL-2009 (Rel. 15.09, Last updated, Version 1) XX DE An internal portion of a Gypsy-172_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; Gypsy superfamily; integrase; internal portion; KW Gypsy-172-I_DR; Gypsy-172-LTR_DR; Gypsy-172_DR; Gypsy172-I_Dr. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4216 RA Dib M.R. and Naveira H.F.; RT "Gypsy-172_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 10(9), 1210-1210 (2010). XX DR [1] (Consensus) XX CC Gypsy-172-I_DR is an internal portion of the Gypsy-172_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-172-LTR_DR. CC Gypsy-172_DR is characterized by 4-bp target site duplications. CC Gypsy-172-I_DR is an internal portion of the Gypsy-172_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-172-LTR_DR. CC Gypsy-172_DR is characterized by 4-bp target site CC duplications.The internal portion contains three ORFs encoding CC the 157-aa Gypsy-172_DR1p gag (pos. 125-595) and 458-aa CC Gypsy-172_DR2p pol proteins (pos. 709-2083) composed of the CC protease, reverse transcriptase, and the 736-aa Gypsy-172_DR3p CC pol proteins (pos.2004-4211) composed of the integrase domain. CC Some insertions fairly recent, according to the hight identity CC between their flanking LTRs. Consensus obtained after the CC alignment of at least three independent insertions bearing at CC least 85% homology over at least 1000bp. XX FH Key Location/Qualifiers FT CDS 125..595 FT /product="Gypsy172-I_Dr_1p" FT /note="ORF1(Gag_like)." FT /translation="MGKALQNLHISPPAATMASTTGPVIPVMETPSQVSPR FT LAFPEKFDGDPTKCKGFLLQCSLFVNQQPTLYTTDSGKIAFVFSLLTGKAL FT EWATAMWKEDGTAFPSFKDFSERFSKVFDHPREGKTADEQLLELTPGKMTA FT TEYALTFRVGGHHCLEGD" FT CDS 709..2082 FT /product="Gypsy172-I_Dr_2p" FT /note="ORF2(Pol-like)." FT /translation="MSVPVTTDESEPMQVNSYHLSSEERNRRLTQRLCLYC FT GQPNHLRVSCPSRPNPASQDRVSFPVHSLNPDMCVSIPITIEINGHEITTK FT ALLDSGAAGNFMSKKFAYRHKIPLTRCSSPLTVEAIDGRPLGSGQILHLTQ FT ELNMQTGILHTETIQFYILVTPRTPIILGLPWLRRHNPDINWRMGQIVKWK FT DTCFSECITPISPLSVRSVTVSDEVNCDINIPPEYSDLSLAFSKIEATKLP FT PHRSYDCAIDLLPGTMPPRGRIFPLSKPETEAMEKYINEELKKGFIRPSTS FT PASAGFFFVKKKDGGLRPCIDYRGLNEITVKFRYPLPLVPAALEQLRTAKY FT YTKLDLRSAYNLIRIRQGDEWKSGFSTSTGHYEYLVMPFSLVNSPSVFQSF FT VNDIFRDLLNRIVIVYIDDILVYSDTLEEHIHHVRTVLKWLIKHPPHFWAT FT SLVMKVWPWTNGR" FT CDS 2004..4211 FT /product="Gypsy172-I_Dr_3p" FT /note="ORF3(Pol_like)." FT /translation="QMAHQTSTSFLGYIISHEGVAMDKRKVNSILKWPQPS FT TLKELQRFLGFANFYRRFIRNFSSVAQPLTSMVKKGAHSLQWSLPATQAFQ FT QLKQRFVTAPILHHPNPELPFIVEVDASNTGVGAILSQRQGSPPKLYPCAF FT YSRKLNPAERNYDVGDRELLAMKSAFEEWRHWLEGSTHPFTVLTDHKNLEY FT LRTAKRLNPRQARWSLFFTLFKFVVTYCPGSKNTKADALSRQFEGDQIPQA FT PENILSPSLIVAPIQWDIITEIEQANAQSETPAECPTNKIFVPETLRTRVL FT ELVHSLPNSGHPGINATVQLLENRFWWPSLSKDTTTFIQHCITCNTQKSSR FT QLPAGLLQPLPVPQRPWSHIAIDFVTDLPMSNNCTTILTVIDRFSKACRLI FT PLPKLPTALQTAEHLCNLVFRFYGLPEDIVSDRGPQFTSRVWSAFFKMLNV FT NVSLTSGYHPQANGQTERLNQEITRFLRSYCSQHQNDWSRYLMWAEYAQNS FT IQKSSTGLTPFQCILGFQPPLFPWSGEPSNVPAVDDWLLRSEETWNLAHTH FT LQRAIRRQKNQADRHRRPNPEYIPGQWVWLSTRDLRLRLPCKKLSPRYLGP FT FQIDKQITPVSFRLILPNHYRISPTFHVSLLKPAAGPTEEGEEESADKGPL FT PTLVDGEEVYQVRELLDSRRRGRVLQYLVDWEGYGPEEHSWVNADDILDPN FT LIESFHRNHPDKPAPRPRGRPRRHLPPRARSRSQKG" XX SQ Sequence 4216 BP; 1164 A; 1146 C; 829 G; 1077 T; 0 other; gaagacttcg ccatcctacg atccagcagc tatcactcac ttgtcctcag cactatctac 60 acaggcaagt caactcgccg cacaccatca tcagctgaaa cacctgacgt cacttaccga 120 ggaaatgggg aaggcgctcc agaacctgca tatctcgccg ccggcagcta ccatggcatc 180 gacaactggt cccgttatac ccgtcatgga aacgccatct caagtgagtc ctcgcctcgc 240 ctttccggaa aagtttgatg gtgacccgac gaagtgtaaa ggatttttac ttcagtgctc 300 tctctttgtc aaccaacaac ccacgcttta tacaacggac tccggtaaga ttgctttcgt 360 tttctccctg ctaactggca aggcattaga atgggctacg gcaatgtgga aggaggatgg 420 caccgctttt ccttcattta aggatttctc agagcgtttt agtaaggtgt ttgatcatcc 480 cagagaaggg aaaaccgctg acgagcaact attggaatta acgccgggaa aaatgaccgc 540 aactgaatac gctctcacat ttcgtgtggg gggacatcat tgtttagaag gggattgaat 600 catgaactac aatccaagct ggcatgtcga gacgaaagaa atcatctcaa acagattgat 660 aaccttgtca gagcccgacg atgacatgtt cctcacagtt cttatcgtat gtccgtccct 720 gttaccactg atgaatccga acccatgcaa gttaactcct atcatctctc ctctgaagaa 780 cgaaatcgtc gcttaacaca acgactctgt ctatactgtg gtcaaccaaa ccatcttcga 840 gtttcctgtc cctcacgtcc aaatccagct tcccaagatc gggtgagttt tcctgtacat 900 tcattgaatc ctgatatgtg tgtcagtata ccaattacca ttgagataaa tggacacgag 960 ataacaacaa aagcattgtt agattcagga gcagcaggaa acttcatgtc taaaaaattt 1020 gcttaccgac ataaaatacc tttaactcgc tgttcctctc ctttgacagt ggaagcgata 1080 gatggacgcc cactaggatc gggacaaata ttacatctca cgcaagaact caatatgcaa 1140 acaggtatac tacacacaga aactatccaa ttttacatcc ttgtaacacc tcgcactccc 1200 ataatacttg gactaccctg gctgcgaaga cataacccag acatcaactg gaggatggga 1260 caaattgtta aatggaagga tacctgtttt tctgaatgca tcacccccat ctcacctctg 1320 tctgttagat ctgtcaccgt ctctgatgaa gtcaattgtg acattaacat cccacctgaa 1380 tattctgatc tgtctctggc ctttagcaaa attgaagcca ctaaacttcc tcctcaccgg 1440 agctatgatt gtgccattga tctgttacca gggacaatgc cccctagagg acgaatcttt 1500 cccctatcca aacctgaaac agaggccatg gaaaaatata tcaatgagga attaaagaaa 1560 ggctttatac gaccttcaac atcacctgcc tctgctgggt ttttttttgt gaagaaaaaa 1620 gatggtggac ttcgtccctg cattgattac cggggcctca atgagataac tgtcaagttc 1680 cggtacccac tacccctggt gcctgctgcc ttagaacaac ttagaaccgc taaatattac 1740 actaaacttg atctacggag tgcatacaac ctaatacgca ttcgccaggg agatgaatgg 1800 aaatctggtt tctccacaag cacagggcat tacgaatacc tggtcatgcc tttcagcctg 1860 gtaaacagtc catccgtgtt ccagtcattt gttaatgaca tcttcagaga tctcctaaat 1920 cgcatagtca tagtatatat agatgacatt ctggtatact ctgataccct agaagaacac 1980 atccatcatg tacgaacggt gctcaaatgg ctcatcaaac atccacctca tttctgggct 2040 acatcattag tcatgaaggt gtggccatgg acaaacggaa ggtgaactca attcttaaat 2100 ggccacagcc ttcaaccctg aaagaacttc aacgctttct tggtttcgcc aatttttaca 2160 gacgctttat tcggaacttt agctctgttg cacaacccct aacatcaatg gtaaagaaag 2220 gagcccacag tctacaatgg tcacttcctg ccacccaggc attccaacaa ctgaaacaac 2280 ggttcgtcac agctcctatt ctccaccatc ccaacccaga actccccttc attgttgagg 2340 tggatgcttc aaatactggc gtgggtgcaa tactgtctca gagacagggg tcacccccca 2400 aattatatcc ttgtgccttc tattccagga agctaaaccc agccgagagg aattatgatg 2460 ttggtgatag ggaacttctg gccatgaaat cagcctttga agaatggcgt cactggttgg 2520 aggggtctac ccatccattc accgttctga cagatcataa gaatctagaa tatctgagaa 2580 ctgccaagag actcaacccg aggcaagcac gctggtccct gttctttacc ctcttcaagt 2640 ttgtggtcac ctattgtcct ggatcaaaaa atacaaaagc agatgccctg tcacgacaat 2700 ttgaaggtga tcaaatccca caagctcctg agaatattct gtctcccagt ctcattgttg 2760 cacccatcca atgggacata attaccgaaa ttgaacaagc taacgcccag tcagagaccc 2820 cagctgaatg tccaaccaat aagatatttg tccccgaaac cttacggact cgagtattag 2880 aactagtaca ctcactgccc aactccggac accctggcat taacgctaca gtccagctac 2940 tggaaaatcg tttctggtgg ccatcgttgt ccaaggacac caccacattt attcaacact 3000 gcatcacctg caatacgcag aaatcatcac gacaactacc agccggttta ctccaacctc 3060 tccccgtacc acaacgcccc tggtcacaca ttgccattga tttcgtcacg gatctaccca 3120 tgtccaacaa ttgtaccacc atcctcacag tcatcgacag attttctaaa gcttgccgtc 3180 tcataccttt acccaagctc ccaactgctc tgcaaactgc agaacattta tgcaacttag 3240 tatttcgatt ttatggttta cctgaggata ttgtctcaga tagagggcct cagtttacat 3300 ctagagtctg gtcagcgttc tttaaaatgc taaacgtcaa cgtcagtcta acttctggtt 3360 atcaccccca agccaacggc caaactgaaa gacttaatca agaaattacc cgtttccttc 3420 gttcctactg cagccaacat cagaatgact ggagccgtta cctaatgtgg gctgaatacg 3480 cccaaaattc cattcaaaaa tcctcgactg gtctcacccc atttcaatgc attctgggat 3540 tccaaccccc gctgtttccc tggtctggag aaccctctaa tgttccagca gtagatgatt 3600 ggctactcag aagtgaggaa acttggaacc tagctcatac tcatctacag cgtgccattc 3660 gcagacagaa aaaccaagct gatcgccatc gtcgtcccaa ccccgaatac attcctggtc 3720 agtgggtttg gttatctacc agagatctgc gccttcgatt accctgcaag aaacttagtc 3780 ccaggtactt gggccctttc cagattgata aacaaattac acctgtttcc tttcgtctta 3840 tcctgcctaa tcattatcgc atctctccca ccttccatgt ttctttgctc aagcctgctg 3900 ctggtcccac cgaggagggg gaggaggagt ctgctgacaa gggtcccctg cccactctgg 3960 tggatggcga ggaggtgtac caagtacgag aactacttga ttccaggcgt cggggaagag 4020 ttcttcaata cctagttgat tgggaggggt acggtccaga ggaacactct tgggtaaatg 4080 cagacgacat tctcgaccca aaccttatag agtccttcca tcgcaatcac cctgataaac 4140 cggcccctcg accacgtggg agaccccgtc gccatctgcc tcctcgtgcc aggagtcgct 4200 cacagaaggg gggctc 4216 // ID Gypsy79-I_DR repbase; DNA; ZEB; 4240 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy79-I_DR; KW Gypsy79-LTR_DR; Gypsy79_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4240 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1473-1473 (2008). XX DR Genome; chr12; Positions 13086561 13082322. XX CC 'ACCGT' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 36..1268 FT /product="Gypsy79-I_DR_3p" FT /translation="MNSIPAPEGMRMSGDISKNWDIFRAEYEDYELATGLI FT EKPSEVRAAALRRLLGNECRHVYNHNIVLTEEQTKDPKAILDALGAYFKPA FT KNVIYERYMFGCCKQEVNEPIDSFLTKLRERASTCDYKELKDEMIRDRLVL FT GITNENTRRRLLRERELTLSQSVEICRLAEVTEQRVKTIDSSITDSVNVAV FT AQRMKGKDGENEQAFRNAFCKYCGGGHTKGKDLCPAYGKTCRSCGVLNHFA FT KVCRARQRAYSSGRVNVVTDDARDEQAPYNEGRLFTAEECVDTVKCPGPKW FT FVNLTINKQKQACQLDTGATCNVMSRIIKEKLDYKCPLQPSTTKLKLYSGA FT TMLSLGRFHTECTVKGTKHMLIFEIVEANQDPLLSGDTCQRLGLMKLTIPR FT SFTKLLIAQIYPSQSNS" FT CDS 1178..2506 FT /product="Gypsy79-I_DR_2p" FT /translation="MPAARAHETHHPEELHKIVDCPDIPLTKQQLICSYKD FT VFNDPVKSVPGEVRFVLDNSVSPVQCAPRNVPVAVKARVKEQLYKHIREGH FT ITSVTEPTPWISNMVVIAKPHKLRICIDPKQLNQALQRSHYHMPTLEDILY FT KLPKARLFMLVDVRDAFLHCRLDNNSSLMTTFWTPWGRMRWCKLPFGVSVA FT PEIYQRKQHELLMGLRGIEPIADDILVVGCGDSDAEAESDHDKNLRALMEH FT CRAVKLRLSEKKLQLKLKAVHFHGQILSAEGLRIDPEKTKAVLAMPTPQDV FT KAVQRFIGFVTYLAKFLPRLSDVCEPLRRLLDKGVAWHWLPKHDEAVQEII FT RMVSDTPVLKYYDNDKPVTIQSDASKNGLGCCLLQQGQPVAFASRALTSTE FT QNYAQIEKECLSIVFACQRFHHFLYGRESITTETDHKPLTTICKKKITL" FT CDS 2533..4083 FT /product="Gypsy79-I_DR_1p" FT /translation="MMLQLQNYNLNVVYKPGPEMYISDTLSRAALHKTVSN FT EPGLLSQTVNATDSYEAALTIVDQALHLNVTDASLRKIVKETKLDGTLQEL FT ANIVLPGWPERKEDVPLSVREYWALRDELNIQNGVLFRGQCVIIPKALRAE FT MLTRIHATHIGGEACYRLAKETLFWPNMRSEIKDYVTNCAACNEYAHRQQK FT ETMMSHEIPVRPWQIVSMDLYAYSGKDFLIIVDHYSDYWEIDQLPDLTADT FT VITRCKVQFARHGQPDKVITDNGPQFDCESFRCFARQWGFAHVTSSPQHPQ FT SNGKSESAVKIVKSICKRAKLDGSDPWLAILHWRNTTTEGLDSSPAQRLMS FT RRLRTGLPMANSLLIRNVVEGVSEKLRWKRRISKFQYDLRAKDLPELNVGE FT YIRMKPLPGDRMGRWRRGQCLGKVNPRSYVVDVEGTWYRRNRVDLRRAERF FT DQFNHQEAESKENSQGTTGDSEIIITRGENTSDIIDCEPEQAQDQRLIVTN FT KLSPRGAPIITRSGRQSQLLG" XX SQ Sequence 4240 BP; 1307 A; 953 C; 1068 G; 912 T; 0 other; tggtgtcaga agtgcaataa attctgaata catccatgaa cagcattcca gcacctgagg 60 gcatgcggat gtccggggat atatcaaaga attgggacat tttcagagcg gaatacgagg 120 actatgagct agcaactgga ttaattgaga aacccagcga ggtgagagca gcagcactac 180 gccgcctttt gggcaacgaa tgtcggcacg tctacaatca taatatcgtg ttgactgagg 240 aacaaactaa agatccgaag gccatactcg atgccctggg ggcctacttc aagcctgcaa 300 aaaacgtgat ctacgagaga tacatgttcg ggtgctgcaa acaagaagtg aatgagccta 360 ttgacagttt tctcacaaag ctgcgggagc gagcatcaac ctgcgactac aaagaactca 420 aagacgaaat gattcgtgat agacttgtct tgggtattac taatgagaac accaggcggc 480 gcctattgcg tgaacgtgag ctgaccttgt cacaaagcgt ggaaatatgc cgattagcag 540 aagtgacaga gcaacgtgtt aaaacaattg acagctcaat cactgacagt gtgaacgttg 600 cagtagcaca gagaatgaaa ggcaaggacg gtgaaaatga gcaagcattc cgtaatgcat 660 tttgtaagta ttgtggggga ggtcacacga aaggcaaaga cctgtgcccc gcgtatggga 720 agacctgtcg atcatgtggt gtattaaatc acttcgctaa agtgtgtagg gcacgacaga 780 gagcctacag ttcaggcaga gtcaacgtag taacggatga tgctcgagac gagcaagcgc 840 catacaatga gggtcggctt ttcactgcag aggaatgcgt tgacactgta aagtgtccag 900 gtccaaaatg gtttgtgaac ttaactatta ataaacaaaa gcaggcctgt cagctagata 960 caggcgctac ttgtaacgtt atgagcagaa taatcaaaga gaaactggac tataaatgcc 1020 ccttgcagcc aagcacgaca aagctgaaat tatattcggg cgcgacaatg ctttctctgg 1080 gacgattcca cacagaatgc acagtaaaag gcaccaaaca catgttgatt ttcgagatag 1140 tagaagccaa tcaggatcca ctactttcgg gtgacacatg ccagcggcta gggctcatga 1200 aactcaccat cccgaggagc ttcacaaaat tgttgattgc ccagatatac ccctcacaaa 1260 gcaacagtta atttgcagtt acaaggatgt gtttaatgac ccagtcaagt ctgtgccggg 1320 tgaggttcga ttcgtgctgg acaacagtgt ttcgccagtg caatgcgcgc cacggaacgt 1380 accggtggcc gtaaaagcca gagttaaaga gcagctttac aagcatatca gagagggaca 1440 catcacatcg gtcaccgaac ccactccatg gatcagcaac atggtagtta tagcgaagcc 1500 tcacaaattg agaatttgca tagatccaaa acagctgaac caagcactgc agagatctca 1560 ctaccacatg ccgacactcg aggacatcct gtacaaatta ccaaaagcac gtctgtttat 1620 gctggtggac gttcgggatg cctttttaca ctgcagattg gacaataaca gcagtttaat 1680 gacaactttc tggacgccat gggggcgtat gagatggtgc aaactgcctt tcggcgtctc 1740 ggtggcgccc gagatctatc aacgcaagca acatgaactg ctaatgggac tccgaggcat 1800 agagcccatt gcggatgaca tactggtagt tggctgcggt gactcggatg cagaagcaga 1860 gtcggatcat gacaaaaatc tccgcgcact aatggaacac tgcagagccg tgaagctccg 1920 actgagcgag aaaaaacttc aattgaaact gaaagcagtg catttccacg ggcaaatatt 1980 atcagccgaa ggcttacgca tcgacccgga aaaaacgaaa gcggtcctgg caatgccgac 2040 gcctcaggat gtgaaggctg tccaacggtt cattggcttt gtaacatatc tggccaaatt 2100 tctcccacgg ctctcagatg tgtgcgagcc tctgcgcaga ctgctggaca aaggcgtggc 2160 gtggcactgg ttacccaaac atgacgaggc tgtccaagag ataatacgta tggtttctga 2220 cacacccgtg ttaaagtact atgacaatga taagccagtg acaatccaaa gcgacgccag 2280 caaaaacggt ctgggctgtt gtctgctgca gcagggacaa cctgtggcat ttgcctcgcg 2340 ggcactcact agtacggagc agaactatgc ccaaatcgaa aaggaatgcc tgagcatagt 2400 attcgcatgt caaaggtttc atcacttcct ttatggcagg gagtcaatca caactgagac 2460 cgaccataag cccctcacga caatttgtaa aaaaaaaatc actctttaag tgcgcccaaa 2520 cgtttccaaa gcatgatgtt gcaactacaa aactataacc tgaatgtcgt gtacaagccc 2580 ggccctgaga tgtacataag tgacacgctg agcagagcgg cgcttcacaa aacggtgtcc 2640 aacgagcctg gactactgtc acagacggtg aacgccacgg acagctatga ggccgcgtta 2700 acaattgtag atcaagcgct tcacctcaac gtgactgacg ccagccttcg taaaattgtt 2760 aaagagacaa aattggacgg cacattgcag gaactggcga atatagtttt accaggctgg 2820 ccagagcgca aagaggatgt gccactctca gtacgcgagt actgggcatt gagagatgaa 2880 ctgaacattc agaatggcgt actgttcaga ggtcagtgcg ttattatacc taaagcactg 2940 agagccgaaa tgttgactcg cattcacgca actcatatcg gaggtgaggc ctgttacaga 3000 ctggcgaagg aaacactgtt ctggccgaat atgaggagcg aaataaagga ttatgtgact 3060 aactgcgctg catgtaacga gtacgcacac agacaacaaa aggaaacaat gatgtctcac 3120 gagatcccgg tccgcccctg gcaaattgtg agtatggacc tgtatgcgta cagtggtaaa 3180 gactttctta tcattgtgga ccactactcg gattactggg aaatcgatca gttaccagac 3240 ttgacagcag acacggtcat cacgcgttgc aaagttcaat ttgcacgtca cgggcagccg 3300 gataaagtaa tcactgataa cggaccgcaa tttgattgtg aatctttcag atgttttgca 3360 agacagtggg ggttcgctca tgtaacatcc tcacctcagc acccacagtc taatggaaag 3420 tctgagtccg cagtgaagat tgtaaaatcc atatgtaaga gagcaaaact cgatgggtca 3480 gacccatggc ttgcaattct acactggcga aacacaacga cagagggatt agacagcagt 3540 cccgcgcaac gactcatgtc ccgtagactt aggacgggac ttccgatggc aaacagtctg 3600 cttattcgaa acgtggtgga gggagtttca gagaaactga gatggaaacg gcgaatatct 3660 aaattccagt acgacttaag agccaaagac ctcccagagc ttaatgttgg cgaatacatc 3720 agaatgaaac ccctacctgg agaccgcatg ggccggtggc ggagaggcca atgtttagga 3780 aaggtgaatc cgcgatctta tgtagttgat gttgagggca catggtacag gcgcaaccgc 3840 gtggatttgc gaagagctga gcgttttgac cagtttaatc atcaagaggc agaatcaaag 3900 gagaacagcc agggcactac cggcgacagt gaaataataa tcacaagagg ggagaacaca 3960 tcagatatta tagactgtga accagagcaa gctcaggacc aaagactgat agtgaccaac 4020 aaactgtcac cacggggagc gccgatcatc actcgcagtg gccgtcagtc acagctatta 4080 ggctagcatc aaaggaaagt ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaacattta 4140 acgttattcg ttaggtggtg tttgataacc tattgtttct gtcaaatttg agtgtttggg 4200 gctctctgtg aagttaagcg cctctctaaa gaaagggaga 4240 // ID Gypsy61-I_DR repbase; DNA; ZEB; 6521 BP. XX AC chr15; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy61-I_DR; KW Gypsy61-LTR_DR; Gypsy61_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6521 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1437-1437 (2008). XX DR Genome; chr15; Positions 7140565 7134045. XX CC Positions [5097-5576] - Integrase core CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 5049..6056 FT /product="Gypsy61-I_DR_3p" FT /translation="MQCDVKSWCEQCERCCLRKTPTAGKRAPLVSIHTNAP FT MELLCIDFLVLEKSRGGFEIPTKDQKAVTVAKVLWKIFFCRFGFPARLHAD FT QGRNFESAVVKELCKLIGVTKTHTTPYHPQRNGTTERFNRTLMNMLGTLDP FT ENKPQWHEYVDAMTLAYNCTQHDSTGFSPYLLMYGRHPRLAVNLVFGLSTT FT KGQCESSEYVQNLRESLTFAYDQADKMSRHAKDLQKKHYDKKVKDYVFETG FT DRVMVKVCHVEGKQKLADRWEPHPYVVVKKQPGIPVYVVQSEDGDKERVVH FT RNLLTQCMFFPVEKTYQDQVESESEGQNEVESTDDVSDEDKVDL" FT CDS join(476..1384,1388..2293) FT /product="Gypsy61-I_DR_1p" FT /translation="MQGHYDSTPHRPPTCSQESFRQPFHVTPWLPIRQEKI FT KSHDMSTTVDMSNVGESHLDESPLQSVRPKSTQLATHREIVTPPVVSWLKP FT GTHTNASVQVSAHDQAPDNHVYPTSNHILPIATHPHVSCQTSMHDSVTPVQ FT APPDMAIGMQADMHTSPNIPYSTPATVILPHTNLTGHPQPIQAPHSSYYGV FT PNTMYSSQSDSVLPPAPPVFSGIMFPRKLDPIQAHQAPHDQKLTLHSNYAV FT ASSSSTPVTMPAYPASRIYAQQPSVVQQLQTAPSHISYQQGTNQHASVMTT FT SPPSFTQTYQVNVQVFSGGPECKILIEDWIRDMQYLLEAGGMPPNLCFATL FT VRHLNGEARRLVLNLPPSKQTTRRAFVELRAEYSNMQTSLDPLADFYERCQ FT RSGKTARSYAIALEATLRSVEETQYGGQTFPDRDGKLTRQFMRRLSDEEVY FT HRLAPIKPQLLSFRELQAELRNPARETKKFQMLPKSKRAYTQAQFMAASSN FT RSGESTGTEKRNKQNNDLTELTALVKNLLISQEDQVNRLTQLEARFAASPR FT MIFPQTQRPQGSSDTPARSVVCYRCNKPGHIARLCRTIMTDEVSQAENQTE FT KSQDLNA" XX SQ Sequence 6521 BP; 1992 A; 1337 C; 1513 G; 1679 T; 0 other; tctttggtgc catgacccgg atccttggat ttcacgtcaa ttaaagtagg ctttttcctt 60 aaaaaaaaaa aaaaaaagac aattgatctg aatttctgag ttttttgttt ttcctatctt 120 tttttttccc actgtttctc tgataagtag actaatatgt gaatttactg atgtaacatg 180 tttaaattga tgtgtttttg aaagtgtatc atttattgtg tgcactgaca agtttttggg 240 tgtatctgaa tttaaattga actaggttta ttttgttgaa aaaaaaaatt gctaattttt 300 ttgtttctgt ttattattgt acattttttc tctaacaata gaaagtttgc aaagaaacca 360 taacacacac acacacacac acacacaaaa acatagtcaa ttttaattgc atgcaccaca 420 catagtactg cacatagagt gttgcgtagg cagagtcgag aagtgaaata taataatgca 480 aggacactat gattccacgc ctcatcgacc acccacctgt tcccaagaat catttaggca 540 gcctttccac gtcactccat ggttgcccat aagacaggaa aaaattaaat cgcatgacat 600 gagcactaca gttgatatga gtaatgtggg tgaatctcac ttagatgaaa gccccctaca 660 gtctgtcagg ccaaaatcaa cacagctagc gacacacaga gagattgtga ctcccccagt 720 agtaagctgg ctaaagccag gcacccacac taatgcatct gtgcaagtat cagctcatga 780 ccaagcccca gataaccatg tctatccaac aagcaaccat attttgccta ttgccacaca 840 cccacatgtt tcatgccaaa cttccatgca tgacagtgtc acacctgttc aggctccacc 900 agacatggcg attggaatgc aagctgatat gcatacttca ccaaatatcc catactcaac 960 cccagcaaca gtcattttgc ctcacacaaa cctaaccggc catccacaac ccatccaggc 1020 tcctcattcc tcctattatg gtgttccaaa taccatgtac tcatcacagt ctgactcagt 1080 actgccccct gcaccgcctg tcttttctgg aataatgttt cctcgcaaat tggaccctat 1140 tcaagcacac caagctcctc atgatcagaa gctgactttg cattcaaact atgctgttgc 1200 ttcatcttct tccactcctg tgacaatgcc tgcataccct gcatcacgta tatatgcaca 1260 acaacccagt gtagtgcaac agcttcagac tgcaccttca catatctcat atcaacaagg 1320 caccaaccaa catgcctctg taatgaccac gtcaccacct agttttacac aaacatatca 1380 ggtatgaaat gttcaagtgt tcagtggtgg tcctgaatgc aagatactaa ttgaagactg 1440 gatacgggat atgcagtatc tccttgaagc aggaggaatg ccaccaaatt tatgctttgc 1500 aacgctagtg agacatttaa atggtgaagc caggaggtta gtcctaaatc tccctccctc 1560 taagcaaacc accagaagag cttttgttga actgcgagct gagtatagta atatgcagac 1620 atctcttgac ccattagctg atttctatga aagatgtcag cgctctggca aaacagctcg 1680 ttcctatgcc attgccctgg aggccacttt acgttcagta gaggaaacac agtatggagg 1740 acaaacattt ccagatcggg atggaaagct aactcgtcag ttcatgagaa gattgagtga 1800 tgaagaagtt tatcataggc tagcaccaat aaaacctcag ctcttaagtt ttcgtgaact 1860 gcaagcagag ttacgtaacc ctgctcggga gacgaaaaag ttccagatgc tgccaaagtc 1920 caaaagggca tatacccagg ctcagtttat ggcagctagt agtaacagga gtggagaatc 1980 cacaggaacg gaaaagcgaa ataaacagaa caatgatctg actgaactta cggccttggt 2040 aaaaaatttg ttgatcagtc aagaagatca agtcaatcgg ctgacacagc tagaagccag 2100 atttgctgca tcaccacgaa tgatatttcc gcagacccag aggcctcaag ggagcagtga 2160 tactccagca aggagcgttg tatgctatcg ctgtaacaaa ccaggacaca ttgcacgtct 2220 gtgtcggaca ataatgactg atgaggtcag ccaagctgaa aatcaaacag agaaaagcca 2280 agatttaaat gcgtaaggcc tgtggtcatc ggggcgacca caggcattca acagagtccc 2340 cagcaacata aggcgagaat gtccaaacta gctaaaactc cgctggtagg tccttgtaat 2400 gaaggggtgg ttgagttgaa gtagatgggg aggtgtgtaa ggctctaatt gattccggct 2460 ctcaagtaac tacaattaca gatgagtttt ggggcagaca cccaatctta tgtactaaga 2520 aagtgcagcc atcagaaatc ccaattgaat gtgccgctgg tcaacccgta ccttatgttg 2580 gggtactctg tatcaacctg aagtaaaaga gtaccgatcc agtgttccat tgctcattgg 2640 gactaatgtc atacgggctt tcagaaatga tctcgaagca acatatggta gcaagtacct 2700 ggccaatatg aagcagacta atcctgagtg gcactcagca ctggtgtcca taggtaaaag 2760 tgaaccttgt gggattaatg gcaaagtggg tcaagtacag tatgctggtg atgcaataca 2820 aataccagca ggaagagagc tggatgttgt gggcagagta atgggtggtc caaagaggac 2880 acaatacact gttttggtgg agagccaatg tttgaaaaaa gtaccagaag gtattgtcat 2940 tgccagactg ctagctaatg tgaaaaaggg ttatgtacct attcgtctct tgaatctttc 3000 ggaaaaagtt gtgactgtgg agccaaggac cctgctggca gatgctttct tggtcaatga 3060 ggtatttgaa gagggggatg agggcaaaat actaggccag aaacagatca tgaatcggaa 3120 tctgtgtgaa aaaaactcag tgactgcaac ttgtcaaagt taagaacagg ttaccagtgc 3180 caaaaacgag tttgatatgt catgtggagt tgaccttagt gccatcacag tggaagggca 3240 ggagcagttg gatttactgc aggatttatt gaggaaaaat tctgatgtct tttccaaaca 3300 ttccatggat tatggtcata ctactactgt gcaacacgag attcctctgg tagacccaaa 3360 gccatttcgc ctcccttatc ggaagatacc tccctcacag taccaggaag ttcggaaagc 3420 catatcgcag atggaagagg ctagatttat acgcccaagc aaaagccctt atgcatcccc 3480 tattgttgtg gtatcaaaaa aggatggttc catgaggatt tgtgtggact acagaaagct 3540 gaattcttgt agcaccagag acgcttttcc tttaccaaga atagaggacg cattagaagc 3600 attgggtcaa gcaagttact tctcggctct gggtattggc aagtggaggt ggccgaacat 3660 gataagtata aactgctttc agcacaccaa tggggttgta cgaggctaat aggatgccgt 3720 ttggtctaca aaatgctccc tcaacgttcc aaagattgat gacttgctgc tttggtgact 3780 taaactttga gagcctcctg atctaccttg atgacataat tatcttctct agaacattca 3840 atgagcacct tgaaagattg aaggaagtgt tcaaccgcct ctgcaaacat gggttaaaac 3900 taaaccctac aaagtgttgt ctgctgagga aagaggttca gtaccttggc catgtagtgt 3960 catcggaggg tatacggact gattctgaga agataagtaa agtcagggat tggaagcggc 4020 cgtcaactgc aaaggaagtg ctaaggtttc ctggcactgg cagctccctt gtatgctctg 4080 acatctggag accctaagtg caggaaaaaa gaggggtaaa gacattaaaa gagaagctga 4140 ctactgctcc tattttgggc taccccaact acagtttacc atttttgttg caaactgatg 4200 cttccagggg ttttgtgaca ttaaaagaga agctgactac tgctcccatt ttgggctacc 4260 ccaactacag tttaccattt ttgttgcaaa ctgatgcttc cagggatggt cttggggctg 4320 tccttgccca aattcaagat gatgcagagc gagtgattgc ttatgggagc cggggcttga 4380 gtccaccaga aacccgctac ccagcgcaca agttagagtt cttggctctg aagtgggctg 4440 taactaacaa attttatgat catctctatg gacatacatt ctctgttttg actgataaca 4500 acccactcag atatgtgatg agcacattta aattggatgc cacaggccaa cgttgagtgt 4560 cacagcttgc catatttgac tttgacttcc agtataggca aggaaagtgt aactctaatg 4620 ccgatgcact gtcaagaatg tctagccaag aattcgcaaa agccctccag tcctgtcctc 4680 aacagatcct tcctcagggg aataaagaaa atagtagcac tggccaaggg gagctggaag 4740 caggtgatgg tgtggaagat aaggtctagt cctgtatttt aaaacactga attgtaagtc 4800 acggcgaggg aagaggttgg aggaaggaaa acaaacacgg cttcttctaa aagagtggag 4860 gaagttagtg gtcagagatg gaattctgta cagacaagtt caaggtggcc agggaaggac 4920 tattgatcag ttggtcttgc cagaaaaaaa tgcgagaggt ggctaggaca agtcttcatg 4980 atgatgctgg tcatttaggt tttgagcgca ccctgatttt gttccgtgag tgattttatt 5040 gcccaaaaat gcaatgtgat gtgaagtcct ggtgtgagca gtgtgaaaga tgttgtttaa 5100 ggaagactcc aactgctgga aagcgagctc cgttggtaag cattcacact aatgcaccta 5160 tggaacttct atgcattgat ttcctggtcc tggaaaagtc taggggagga tttgagatcc 5220 ccacaaaaga tcagaaggct gtcacagtgg caaaggtgct gtggaagatc tttttctgtc 5280 gatttggttt cccagcaaga ctccatgcag accaggggcg caattttgag agtgcagtgg 5340 taaaggaact gtgcaagctg attggagtca caaagactca taccacccct taccacccac 5400 agaggaatgg gactacggag aggttcaaca gaactctcat gaatatgttg ggaactcttg 5460 atcctgagaa taaacctcag tggcatgaat atgttgatgc aatgactctt gcttacaact 5520 gtactcaaca tgattcaaca ggtttctccc catacttatt gatgtatggc agacatccca 5580 gactcgcagt taatcttgtt tttgggttgt ctaccaccaa agggcaatgt gagtccagtg 5640 aatatgtcca aaatctgcgt gagagcttga cttttgcata tgatcaggca gataagatgt 5700 caaggcatgc taaggatcta cagaagaagc actatgataa aaaagtgaag gattacgtgt 5760 tcgaaactgg tgacagagtg atggtcaaag tatgtcatgt agaagggaag cagaagctgg 5820 ctgataggtg ggagccccat ccttatgttg tggtaaagaa acagccagga attccagtgt 5880 atgttgttca atcagaagat ggtgacaagg aaagagtggt ccaccgcaat cttctaactc 5940 agtgcatgtt ctttccagtt gaaaagacct accaagatca agtagagtct gaaagtgaag 6000 gacagaatga agtggagtca actgacgatg tttcggatga ggataaggtg gatttgtaga 6060 aaatagacaa aagagtggat ggttctgtaa acttgtcttc aagcaaggaa cctgtggaca 6120 ttgtagaaga aggagaagag actgtgagta ttttgcagga aacaataagt gaaatggaaa 6180 gtggacacga aaagcataca ctgttgcaag ttctggaatt cagaaagcca gaacggagat 6240 atcctgagcg gaaaagacgc cctccaaata gactctcttt ggagctgcga gtcttcacaa 6300 gagaaagtga caatgaaaag attgaaagag gaaggaaagt gtgggaaaag gctaaagcga 6360 agaaggagta gatgtgcctt agcaacacta catatagaca tgcacacaat acacactcac 6420 tgaaaagaac ttcttaatgt gtttatttga ctttatttta caatttttgg aggggtgaat 6480 tgaattatga ctgtgacgcc atcaatcaaa gagggggtgg g 6521 // ID Gypsy-26-LTR_DR repbase; DNA; ZEB; 719 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-26_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-26-I_DR; Gypsy-26-LTR_DR; Gypsy-26_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-719 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-26_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 26-26 (2005). XX DR [1] (Consensus) XX CC Gypsy-26-LTR_DR is a long terminal repeat of the Gypsy-26_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-26-I_DR. XX SQ Sequence 719 BP; 187 A; 123 C; 140 G; 269 T; 0 other; tgtaacagtt ttcctgttac tatttgaaat atctacctgt agtagctatc tcacccatta 60 gagggcgatc taggtagtaa gtctccaatc tgcttaaaag gaggtagaga agaagcagta 120 ggcaggaagt atggtgccta gcacatggaa aggcttcttc actgcgctgt tgtttgttga 180 tgcccttatt tgaggaacat ttttgttctt atatctcaga tgtaagtagg cctgtttttt 240 tgtatatatt tcggttttgt acatatgttt tgtgtggtgt tggcaaggtt ttgattgtat 300 gtttgtttat atgcagaggg tttttggtat ccactttaag tcatcaaaac aggacctgga 360 caatccatcc tcatatccac agactttttc catccacttt cctactgatt ccgtggccag 420 ccattgatga actgtttgct gaacttgatc tttcatgtga ttcacttaag gactgttggt 480 atcttttttg gattgctata aatactggat tcaatgagaa agacacttat cgaatgctta 540 tttctgtcta atagtgatca ttaatgagtt gactgctgat catttcatat ccccagtgtg 600 taacaataat tattcatacc tgctgtcata tacatttcta aatacacatg aatctctgaa 660 agttgttttg tggtctctca atttctttaa tgtagcttca aaaaaaagta gttgttaca 719 // ID Gypsy-117-LTR_DR repbase; DNA; ZEB; 748 BP. XX AC . XX DT 25-NOV-2008 (Rel. 13.11, Created) DT 25-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-117-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-748 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1790-1790 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 748 BP; 153 A; 109 C; 157 G; 329 T; 0 other; tgttacaatc ctatgtttgt cttgtctcat ctgttttttt ttcttgttcc tgttttgctt 60 agaacacatc caattggcca cttccagctt gcttgatcct gattggccta cagtgtttta 120 aaaagccatg acaccacttc cttggcagcc atcttgtttt tggtcacttg cttgtgtttg 180 ttgaaatgtt tgattgtggt atagtgcaga gtttttggat gtgttaggac tgagaatatt 240 gagatttgtg agtactgttt gctttgtttg acttctgaga gaatattgag atttgtgagt 300 actgtttgtt ttgtttgact tctgagatta tattgagaat tgtgagtact gtttgtttta 360 gtgtttgctt tgtttatgat actttgttcc accggcttgt ttgacctttg ttgcctgctg 420 acattgattt tggattgtcc ttcaaatatg gttgcctgtt tttgtaaata gattgtaaat 480 agtgtatata ttgggatagt aaggagtttg acgccatttt ctttgtaaaa ccttttggcc 540 tctgtttttg tgttaggtag ttagggaagt gaattgtact ttcttttctt ttatttttga 600 tcaggtaagt tagagattta atataaagga atttttgttt gttattttgg ccttgtcagc 660 tcccgaattc aaaccccaat aaatctttga attcattttt gttgtcttgt ctttcatgtt 720 gtgcaccatc cctagacggt gcgtaaca 748 // ID Gypsy117-I_DR repbase; DNA; ZEB; 6564 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy117-I_DR; KW Gypsy117-LTR_DR; Gypsy117_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6564 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1521-1521 (2008). XX DR Genome; chr20; Positions 53925226 53918663. XX CC Positions [2862-3401] - Reverse transcriptase CC Positions [4857-5333] - Integrase core CC 'TATG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 274..1617 FT /product="Gypsy117-I_DR_1p" FT /translation="MAGVRKDSEFMTPYLGRGRGLFGVEIPTAGKPRALIY FT SPQSHNTPTNEPVGPTASTPNNADDNVAQQLRDLIGELGSQIGDSIVNRLL FT VSQTPVATSNPVTVNRVVPESTSAGHSLANVSLIVKPDIREPPMFRGDDSD FT KCTIQEWIELMELYLHKADCSNKDQVDEILSHLLGRAKNIVKVKLKCNPDE FT AISPETIYDVLRRYFSEHPGSCQPLADFYATQPKANECPVDYWVRLNTAAE FT LADSHLQKQGARIKNMSSEISMMFIRNCPDPGLSSVFRCKPISKWTITEIQ FT EAIDEHHRDVQSKTLPVTHVVKSVHVATAKVLSSPHTVPGGESLNAAAPAF FT VPDSPLKAAESPSSEAGILERVLNMLERVLERTSQPAPASQPRVTQWRNTS FT PCRVCGDRNHSTRSHCMREKRCLSCLETGHQQKQCHKVVSPAGPPASDSYV FT QGN" FT CDS 1518..6227 FT /product="Gypsy117-I_DR_2p" FT /translation="MLIMPRNWTSTKAVSQSCQPSWTSCFGFICSGKLMHS FT HFGGDYVSVQHHTLAMDDNFSVYDECRKSASSDIVFFQNITKLEKSDSLFY FT TPVLLNDKVSVQALLDSGSMACTINGETERKLLSEGMPAQVDQSYMDILLV FT GCGGVKVKPSGVYQLKMDVYGKQLSVPTLVVPDQKDQLILGTNVIKFLLTQ FT LKQDSNYWNVVNHPVLSGEPEIEQFLNMLSGINRWEGNKIPDIVGTAKLTQ FT AVTLLPKQEHLVWGKLPQTAPVSLGSAIMIEPTKTLTHKKSVMVGRVIASM FT SGDRWVPVKIINLLDKPITLKRNSKIADVFPVVAVEDLCVPNDHDDEMVSV FT HCQKTVGDIVDEEDRPVSKFHESMQKLNLGDLDIDSCEVSTYWKNELVQLI FT KRHEEVFSKHKLDCGKAKEFVHRIHLSDDRPFRLPYRRVPPAQYLKLRTVL FT SEMEEQEIIRKSYSEWASPLVLVWKKTGDLRVCVDYRWLNTRTVKDAHPLP FT HQADCLAALGGNAIFSAMDLTSGFYNIVVSEEDRKFTAFTTPMGLFEFNRL FT PQGLCNSPASFMRLMTNIFGDQNFLTLLCYLDDLLVYAPNEEEAIKRLELV FT FTRLKAHGLKLAPKKCHLLRRSVKFLGHIIDETGVATDPDKVSAISAVSEA FT DLMMSDGVTPSQKKIKSFLGMVMYYQKFIPNCSSVAKPLFNLTAAPKGRKT FT SQKGEINFKKLNPGDWTSEHSEAFQNLKSSLLHSVVLAHPDFSRPFILCTD FT ASLDGLGAVLSQVPEGENKARPIAFASKALTKAQTKYPAHRLEFLALKWSV FT CDKFSHWLKGHDFTVWTDNNPLTYILTKPKLDACEQRWVSKLAPYSFNIQY FT IPGSKNVVADALSRQPFVHQRVGQRLISEPYSVLLNEAEKVKEDTVQDAFR FT LSANICTVEHLACRPLEHCSLTCTEVSAVLDVHTQWETGASDRAIQWLSQG FT THQLLTPGPSPLPVFSLSELRQKQQDDKILSRVLFYVSRAKKPSRRERAGE FT TFEVLKTLKQWEKLKMLDGVLYRVTKDALTGKKRWQYVVPASLVSQALLGI FT HNEAGHQGQGRTLSLARQRFFWTSLEHDVREHVKCCKRCVVSKTPEPEGRA FT PLESIKTSCPLELVCIDFWSAEDCSGRSVDVLVITDHFTKMACAFPCKSQS FT SKQVARILWDKFFCIYGFPERIHSDQGANFESQLIRELLEVAGVKKSRTTA FT YHPMGNGHVERFNRTLGSMIRSLPPRNKQKWPQMLQTLTFAYNCTAHESTG FT YAPFYLMYGRIPKLPVDVMFSSIERDCAIADYDTYVKRLRDDLKEALSLAQ FT MNSEASQKRQADLYNKKTKGCRIEVGDQVLLANKGERGRRKLADKWDSTPY FT VVVDLNPQCHTYRIRNTHNGLEKTVHRNLLLQANFLPIETEGVEPSFSDDC FT ELDDCSSALSGALTPSSACSHADRTASWVAETLVPDDVPDSVPIDTESVLS FT LVVNPEASRTCLSIVQSDTLSCEDVMVHSEVTKTHSETDSLVSNRALTRGA FT NTSESVVLSSSRVTLPVRTRVGRLVKPVNRLIQNMTQNVRMTSSVSEFAKS FT LLS" XX SQ Sequence 6564 BP; 1839 A; 1353 C; 1549 G; 1823 T; 0 other; ttaaaatggt gccgtgacct actggatcaa cagagtcagc tgcagccgac caccggagtt 60 gttctacatg ttcgtggcgt tcgtgtgctg taagtggagg ctgaagattc cgtttcacca 120 ctgttccagt agctgcttcg taaaagagac tgaaaagaca aagactattc tgcggtttga 180 tactgatgaa cttttggttt ctcctcatct aatttatttc acattttctg aggttagtta 240 tcctttgtgc ttattttgct gtaaaatcat accatggctg gggtaagaaa agactctgag 300 tttatgacac cataccttgg gaggggtaga gggctgtttg gtgtggaaat ccctaccgct 360 ggtaaaccta gagcgttaat ttatagtcct caatcacaca atactccaac aaacgaacct 420 gttggtccga ctgcctctac tcctaataat gccgatgata atgttgcaca gcagttacgt 480 gaccttattg gtgaattagg cagtcaaata ggtgattcca ttgtcaaccg cctgttagtt 540 agccaaactc cagttgccac ttctaaccca gtcactgtta atcgtgttgt ccctgaatcc 600 actagtgcag gccatagtct cgctaatgtc agtctaattg ttaaaccaga catcagagaa 660 cctcctatgt ttagaggtga tgattcagac aaatgcacga ttcaggaatg gatagagctg 720 atggaattgt atctccacaa agcagattgc tctaacaaag atcaggtaga tgagatactg 780 agccatctgc taggcagggc taaaaatatt gttaaagtta aactgaagtg caaccctgat 840 gaagccatct cccctgaaac catctatgat gtgttaagac gttattttag tgaacatcct 900 ggatcctgtc aacccctagc tgacttttat gcgacacaac ccaaagcaaa tgagtgccct 960 gttgactact gggttaggtt aaatacagct gctgagttag cagacagtca cttgcagaaa 1020 caaggcgcga ggataaaaaa catgagttct gagatttcta tgatgttcat taggaattgc 1080 ccagacccag gcctttctag tgtctttagg tgtaaaccta taagtaaatg gactatcact 1140 gaaatccagg aagccataga tgagcaccac agagatgttc agtctaagac attgcctgtc 1200 actcatgtag ttaagtctgt tcatgttgct acagccaaag tattgtccag tccacacact 1260 gttcctggtg gtgagagctt aaatgcagct gctccagctt ttgtccctga tagtccactc 1320 aaggctgcag aatcgccatc ttcagaagct ggcattttgg agcgtgtttt aaacatgcta 1380 gaaagggtgc tggagcgtac tagccaacct gctcctgcct cacaaccaag agtaacgcag 1440 tggcgcaata catctccatg tagagtctgt ggtgacagaa atcattcaac ccgttcacac 1500 tgtatgagag aaaagagatg cttatcatgc ctagaaactg gacatcaaca aaagcagtgt 1560 cacaaagttg tcagcccagc tggacctcct gcttcggatt catatgttca gggaaactga 1620 tgcactcaca ttttggaggg gactatgtga gtgtacagca tcacaccctc gctatggatg 1680 ataatttctc tgtttatgat gagtgtcgca aatctgcctc cagtgatatt gttttttttc 1740 agaacatcac caagctagag aagtcagata gtctgttcta tacaccagtg ttattgaatg 1800 ataaagtttc agttcaagct ttgctagata gcggatccat ggcttgcacg attaatggag 1860 aaactgaacg aaagttgttg agtgaaggta tgcctgcaca agttgatcag tcatacatgg 1920 acatccttct cgttggctgt ggtggtgtta aagtcaaacc aagtggagtg taccagctaa 1980 agatggatgt gtatggcaaa cagctaagtg tgcccacgtt agttgtgcct gatcaaaaag 2040 accagttgat tctgggtact aatgttatta agttcttact cacccagctc aaacaagact 2100 ctaattattg gaatgttgtg aaccaccctg tgctgtctgg agaacctgaa attgagcagt 2160 tcctgaacat gctgtcgggc attaacagat gggagggtaa caaaatccct gacattgtgg 2220 gcactgctaa attgacccaa gctgtcactc ttttgcccaa acaggaacat cttgtgtggg 2280 gtaagctacc acagactgct ccagtttcct taggtagtgc tattatgatt gaacctacaa 2340 agacactgac ccacaaaaaa agtgtcatgg tgggaagagt tatagcgtcc atgtctggag 2400 atagatgggt cccagttaag atcataaatc ttcttgataa gcctattacg ctgaaaagga 2460 actccaaaat tgccgatgtg ttccctgttg tggcagtgga ggacctttgt gtgcctaatg 2520 atcatgatga tgaaatggtg agtgtgcact gccagaagac tgttggtgat attgttgatg 2580 aggaagatcg cccagtctcc aaatttcatg agagcatgca aaaacttaac cttggagacc 2640 tggatattga ctcgtgcgaa gtatcgactt attggaagaa tgagttagtg cagctcatta 2700 aaagacatga ggaagtgttt tcgaagcata aattggattg tgggaaagcc aaggagtttg 2760 tccaccgaat tcatttgtca gatgatcgtc ctttcagact tccataccgc cgtgtccctc 2820 ctgctcagta tctaaagctg agaactgtgc tctctgagat ggaagaacag gagataatac 2880 gcaagtcgta tagtgagtgg gcttcaccac ttgtgctagt atggaagaag actggagatc 2940 ttagggtgtg tgtggattac cgctggctga acacacgaac tgttaaggat gcccaccccc 3000 tgcctcatca ggcagactgc ttggcagctt tgggaggcaa tgccatcttc agtgcaatgg 3060 acttgacctc tgggttttat aacatcgttg tatccgagga agacagaaag ttcactgcct 3120 ttacaactcc catgggcctc tttgaattta atcgtttgcc acaagggctc tgtaacagtc 3180 ccgccagctt catgagactc atgaccaaca tcttcggaga tcagaatttt ctcacactgc 3240 tgtgctatct cgatgatctt ttggtgtatg ccccaaatga agaggaggcc attaaacggc 3300 ttgagcttgt gtttactagg ctgaaggcac acggactgaa attggcccct aagaagtgcc 3360 atttgttaag acgtagtgtg aagtttctag gtcatatcat tgatgagact ggagtggcaa 3420 cagatccgga taaggtcagt gccatatcgg ctgtgtctga agcagacctc atgatgtcag 3480 atggagtaac accctcacaa aaaaagatca agtcatttct aggaatggtg atgtattatc 3540 agaagtttat tcctaactgt tccagtgtag ctaagcccct gttcaacttg acagcagcac 3600 ctaaagggag gaaaacatct caaaagggtg aaataaattt taagaaatta aaccctggtg 3660 actggacaag tgaacatagt gaggcttttc agaatttaaa gtcatctttg ctacattctg 3720 ttgtgctggc acaccccgat ttcagcagac cattcatttt atgcaccgat gcgtccttgg 3780 atggtttagg ggctgtactt tcccaagtac cagaaggtga aaataaggca cgcccaatcg 3840 cctttgccag caaagccctc accaaggcac aaactaagta ccctgcccat cgtctggaat 3900 ttctagcttt aaagtggtcc gtctgtgaca agttcagtca ctggctgaaa ggacacgact 3960 ttactgtctg gacagacaac aatcctttaa cgtacatatt aacaaagccc aaactagacg 4020 catgcgagca gaggtgggtg tctaagctgg ccccctatag ctttaacatt caatacatac 4080 ctggaagcaa aaatgtggtc gctgatgcgt taagtagaca gcccttcgtc catcagaggg 4140 ttggtcaaag actgatatct gaaccataca gtgttctgtt aaatgaggca gaaaaagtta 4200 aagaagacac tgttcaggat gcctttaggt taagtgcgaa catctgcact gttgaacacc 4260 tagcctgtag gccactagaa cactgttctc ttacttgtac tgaagtatca gcagttcttg 4320 atgtccatac tcaatgggaa acgggagcat ctgatagggc gatccaatgg ctctctcaag 4380 gtactcacca gttgttgact ccaggtccca gtccactgcc tgtgttctca ctaagtgagt 4440 tgcgacagaa gcagcaagat gataagatcc tctcaagagt tttgttttac gtgagtagag 4500 caaagaagcc ttctaggcgt gagagagctg gtgagacatt tgaggttctc aaaaccctta 4560 aacagtggga gaagcttaaa atgcttgatg gagtgttgta cagggtgaca aaagatgccc 4620 tgacagggaa aaagaggtgg cagtatgtgg ttccagcttc ccttgtcagt caggctcttt 4680 tggggatcca caatgaggct ggacatcaag gtcagggtag gactttatcc cttgcgagac 4740 agaggttctt ctggacaagt ttggagcatg atgtgcgtga acatgttaaa tgttgtaaac 4800 ggtgtgtcgt gagcaaaaca cccgaacctg aaggaagagc ccctctggaa agcattaaga 4860 catcatgccc actggagtta gtatgcatag atttttggag tgctgaggat tgcagtggtc 4920 gcagtgtgga tgtcctggtc ataactgacc attttaccaa aatggcctgt gcttttccct 4980 gtaaaagtca atcttcaaag caagtagcga ggattctttg ggataagttt ttctgtattt 5040 atggcttccc tgagaggatc cattcggatc agggtgccaa cttcgagagc caactcatac 5100 gtgaactact ggaagtcgct ggtgtgaaaa agtctagaac aactgcttat caccccatgg 5160 gcaatggcca tgttgaacgt tttaatagga ccctgggtag catgattagg tcattacctc 5220 caaggaacaa gcaaaagtgg ccgcaaatgt tgcaaactct gacctttgcc tataactgca 5280 cagcccacga gtcaactggc tatgcgcctt tttatctaat gtatggcaga attcctaagc 5340 ttccagtgga tgtgatgttc agtagcatag aaagagactg tgccattgct gattatgata 5400 cttatgtgaa acgtctaagg gatgacttga aagaagcttt gtctcttgct caaatgaatt 5460 ctgaagccag tcaaaaacgg caagcggact tgtacaacaa gaagacaaaa ggatgcagga 5520 ttgaagttgg tgaccaggtt ttgttggcga acaaaggtga acggggacgg agaaaactag 5580 ctgacaagtg ggactccaca ccttatgtcg ttgttgatct taaccctcag tgtcacacat 5640 accgcatccg taacacccat aatgggttag agaagacagt tcatagaaac ctgcttcttc 5700 aagctaactt cttgccaatt gagactgaag gagttgaacc ctcattcagt gatgactgtg 5760 aactagatga ttgtagcagt gcattgagtg gcgctctaac accatcgtct gcttgtagtc 5820 atgcagatcg tactgctagt tgggttgcag agacattagt cccagatgat gtcccagaca 5880 gtgtacctat tgacacagag agtgttctgt cattggtagt gaaccccgaa gcttctagga 5940 cttgtctttc tatagtgcag tctgacacct taagttgtga agatgtcatg gttcattctg 6000 aagtgacaaa aactcattca gagacagact cactggtatc aaacagagct ttgactaggg 6060 gtgcaaatac ctctgaatct gttgttttgt cttccagtag agttacactt ccagttagga 6120 ctagggtcgg aagacttgtg aagcctgtaa ataggttaat tcaaaacatg actcagaatg 6180 tgagaatgac aagttcagtc agtgagtttg cgaaatcttt attgtcataa gtcctgataa 6240 tcataggata aaatcctttt tcttgtagaa tcttatcttt tgagtcaatt tctcacagtg 6300 tgtggaattc tgagttttca agtcactgtc ttgttgaggt caatatgatg gtgttttggg 6360 caccattata gcaagtgcgt cttagtaggc ctgatcaacc tgtttggacc acttaccctg 6420 ttaggtggga agcatatatg ctgctgtgac ttgcgtatta aatgacagtt gagtttctca 6480 gataaatact agttaaatta attttttttt cctaattgaa tgcttacttt gaggtgcaag 6540 ttttgtgaag ttcagagggg agta 6564 // ID Gypsy69-LTR_DR repbase; DNA; ZEB; 408 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy69-I_DR; KW Gypsy69-LTR_DR; Gypsy69_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-408 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1454-1454 (2008). XX DR Genome; chr14; Positions 6150186 6149779. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 408 BP; 82 A; 101 C; 75 G; 150 T; 0 other; tgtaacaaac agaactgaca accactcatc caacctgcag agggagccct caccggaatt 60 ctgactgtca gttcttcgtt ggttacttcc tgtttggtgg ccatattaac ctggccacac 120 caaacactca atgcgaagta ttgccagtat ctgctgcctt accaagcgta ttttccctgt 180 ttgatctgcc ttttgttatg acctttgagc ctgttctttt tgactactga ttttggaaga 240 ctgtttttgc cttgtttgcc ttggtggaca ctctgtttgt tactttgaac ctgcctgctt 300 tttgactacg atttagccta gtgttttgga tttgtttact atttactatt aaacttcttg 360 catttggatc ccatctgttt ccatgtcgtc accaccgcca ttgtcaca 408 // ID DIRS-1D-LTR_DR repbase; DNA; ZEB; 601 BP. XX AC . XX DT 21-JAN-2009 (Rel. 14.01, Created) DT 21-JAN-2009 (Rel. 14.01, Last updated, Version 3) XX DE A solo-LTR derived from DIRS retrotransposon - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR; MER6; DANA; SINE_DR2; DIRS-1-LTR_DR; DIRS-1B-LTR_DR; KW DIRS-1C-LTR_DR; DIRS-1D-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-601 RA Jurka J.; RT "DIRS LTR subfamilies from zebrafish."; RL Repbase Reports 9(1), 4-4 (2009). XX DR [1] (Consensus) XX CC See comments in DIRS-1-LTR_DR. >98% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 601 BP; 141 A; 163 C; 127 G; 170 T; 0 other; ttaagtgaag tttagtttta aactaatttc gagaggagca tgtgctcatg atttacccag 60 ctggtccaca ttaactaatt atgatcctcc aatcaaagga tcccaaatca ctatatatat 120 cctcatttcc tttctacaac tatcttcgtc tggaagaaac cccccctcct ccccttcttc 180 ctcctttatc cagaatgggc ggcacggtgg cccagtgact agcactgttg cctcacagca 240 agaataccaa tggtgttggg tcctcgctga gccatctggt atttctgtgt ggagtttgca 300 tgttctcccc gtgtccgcgt gggtttcccc cgggttcccc ggtttcctcc caccatccaa 360 atgtgctcta tattatagat aaaacaagcc taaacctttt ttataatgtc ttactctcag 420 gaagttcgcc ttggcctcag cagcggggga gtttgagata gacctgagct caatctccac 480 tcgccctgca aaaggggggg agccctgggc tcgaggatcc cttgagctca gggctctctc 540 ccgggacagc atgccaaaca agctatgtat aaatcatgag ctaagtgtga actcttgaaa 600 t 601 // ID CATCH1I_DR repbase; DNA; ZEB; 6170 BP. XX AC AL591492; XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE Internal sequence of the CATCH1_DR endogenous retrovirus. XX KW BEL; LTR Retrotransposon; Transposable Element; KW BELL/CATCH superfamily; CATCH1I_DR; CATCH1LTR_DR; CATCH1_DR; KW CATCH1_DRp polyprotein; endogenous retrovirus; endonuclease; gag; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6170 RA Kapitonov V.V. and Jurka J.; RT "CATCH1, an endogenous zebrafish retrovirus from the BEL/CATCH RT superfamily."; RL Repbase Reports 2(4), 4-4 (2002). XX DR Genbank; AL591492; Positions 62281 55361. XX CC CATCH1I_DR is an internal portion of the CATCH1_DR endogenous CC retrovirus, which belongs to the BEL/CATCH superfamily. CC It is flanked by identical long terminal repeats, deposited in CC Repbase as CATCH1LTR_DR, and is characterized by 5-bp target CC site duplication. It is possible that CATCH1_DR is still active CC in zebrafish. CC CATCH1I_DR encodes one polyprotein, 2029-aa CATCH1_DRp (positions CC 190-419). CC CATCH1_DRp includes gag-like, reverse transcriptase and CC endonuclease domains similar to other retroviruses from the CC Bell/Catch superfamily. CC N-terminal region of CATCH1_DRp (positions 20-250) is similar to CC coiled CC coil domains present in various myosins. Gag-like domain CC (including CC zinc fingers) is located at ~420-770. XX FH Key Location/Qualifiers FT CDS 30..6116 FT /product="CATCH1_DRp" FT /translation="MESEPPSEADPAVRPRSPRERHLPGHLADYDIQLPPS FT LHPEVPQTGASSQRPSRRSRHASTSISAGTMSSTSSRSHRSAHYRLPRHGL FT SDLQAAMLEEKLKTMDLEELQQQIEEEAIVDKECERLDTQAREAQYIQEQA FT TKARESIAKQVDTRRRLKKVQNELKAAKFVCELLKQESPDVSSNASQSSLA FT VDLRPEVSPQHDNAVNLNASVQLMRSHSTPCHVSFVPASSVMHPPVLPVTN FT SVVTVSEPQLICVTSTPNITQPNIVTNSLAALSTIPPTLMSNSDLVAPLTF FT ESSLVNKPQYCAANSATYSQAPSVRVHNVVPSPATVTVGSPLQVQCYPSMH FT SSPDGTSAPYTGPAPDTSPIRSGLYPSLAQFPGLDMLAASAYGVPKPAIPH FT FDTGRESDFALLKMALENVMNNQPHLSEHYKYQVLLSHLKLPSALQLAKAY FT MYDPKPYTAAMQALQDKYGQPRQLVQSELSIILNAPAVKFGDAEAFDSFSL FT SIQTLVGMLKTLEGPNGYELRCGSHVDRLLSKMPPSNRDGFVEYCFSRGIL FT QTGTDRTYTLPDLSEWLQMKSQAKRISNKAATMYQTENARFIKKEQPRSAP FT FRIKAKSTSILYSTDGSSAVQGHPKSKPPVKIQPFCPYCNMKDHFLHACPK FT FKMLTAEKIGEWLSTEKRCWKCGRTHPPDHCTLKKPCTKCKEIHLTILHEA FT ASQIQNNVLLVNVPSLQVYLDRPNRSQKVMLKIVKVLLHHKEKAMEAYAVL FT DDGSERSIVLSQVVDQLKLPTEPETLTLRTVHQDVVNLKGANVSFYVSPLH FT KPFRKYLISHAFTADNLRLSKHSYPVPALQRKYEHLKSLPLPSIDQAQPLL FT LIGSDMPHLLTPVQPVCTGSSHGPIAICTRLGWTLQGPTGLSQTSVSTPQC FT LHITTTTQNTELFRNVERLWQIDTLPYTNEKSATRSKHDQYALSLLQSDTT FT KVEVNGVMRYATPLLRRPNAELLKASKEAVMANLRSTERKLAKDHKKAEAY FT CLEMRRLQEAGYVAEISIEEAEQSRESWYIPHHMVTHNGKDRIVFNCSYSY FT QGQALNDILLPGPILGPSLLGVLLRFRENPVAISGDVKGMFHQVRLLPTDK FT SVVRFLWRDMQRSNQPKIYEWQVLPFGTTCSPCCAIYALQMIAQTHSDGDP FT VLESTVKGSFYVDNCLHSLQTVAEARALVDNLRQLLLTCGFELRQWASNKP FT EVIQHLPPDARSSNSELWLSQKSTDLLEGTLGLLWNCLSDSFSYKPSQIEC FT FEPTLRNVYKVLASQYDPIGYLIPFTTRAKVLVQDLWKTNLTWDDPITSES FT LLSRWQTWLQELQGLNHIAVPRCYSPSDWQADKLNRELHIFCDASERVYGS FT VAYMRTENDQGHVHISFVLARSRVAPRKQMTIPRLELSAALTGAQVANILQ FT EELTVDINRIILWSDSSTLLQWLKSDSCRYKVFVGTRVAEIQSLTNINSWR FT YVDSANNPADVITRGQTLNSLTPPCHWINGPPFLYQPDSCWPTLPADEPEP FT VTELKKSVICLHVLTCPDTTLPDVNQFSTWAELLKATVTSLHGAATPQTQP FT QDDSGSYILAEKLLLQQSQRDSFPEEFKALTTNRPLPSNSRLASLSPEYDE FT VSGLIRVGGRLRHAHHLELDTTHPVVLDPQHYLTKLLIKDYDTTLLHPGLE FT RVFAEVRRRYWILRGREAIKKHQYLCTDCQKWRAKPKMPKMADLPPTRLRL FT YKPPFYSTGMDCFGPFTVKIGRRREKRWGIVFKCLTTRCIHLDLVESLDTD FT AFLMALRRFISRRGKPFELLSDNGTNFVGGDKELRAAYESMIPQLREQLAE FT QQITFRFIPPGAPHFGGVWEREVKSVKQALKVVLKDLTVTETVLRTVLIEV FT EGILNAKPLGYVSSDVSDLDPITPNILLMGRHDSSLPQVFYDSSNVLGTRR FT WKHSQILADHFWSRFIHYYLPNLQERQKWRKDGGEIGLDQVVLIVDPQLPR FT ACWPVGKVTNTFPGLDGRTRSVEVQVRNRTYVRPVSRLVPLPKHTDDDPVN FT PDT" XX SQ Sequence 6170 BP; 1819 A; 1544 C; 1263 G; 1544 T; 0 other; tggtccttcg agccggattg tgtcacaaga tggaatctga acctcccagt gaagctgatc 60 cagccgtcag acctcggtct cccagagaga gacatctccc cggtcatcta gctgactacg 120 acatccagct acctcccagc ctgcatcctg aagtacctca aactggtgca tcttcgcaga 180 gaccatccag gcggtcaagg catgcatcca catccatttc agcaggcacc atgtccagca 240 catcatctag atctcataga agtgctcact acagacttcc tagacacgga ctttctgatc 300 tccaagctgc catgttggaa gaaaaactaa agactatgga cttggaagaa ctacaacagc 360 agatcgaaga ggaggccatt gtagataaag agtgtgagag actagataca caagctagag 420 aagcacagta tatacaggaa caagccacta aggctagaga gtcaatagct aagcaagtag 480 ataccagacg tcgtctaaaa aaggtacaaa acgagcttaa agctgctaaa tttgtgtgtg 540 aacttcttaa acaggaatcc ccagatgtca gtagtaatgc atcacagtcc agcctagctg 600 ttgatcttcg gcctgaggta agcccgcaac atgacaatgc tgtgaacctt aacgcatcag 660 tacaattaat gcgtagtcac tctactcctt gtcacgtttc ctttgtaccc gcgtcttcag 720 taatgcatcc ccctgtatta cctgtaacaa acagtgtggt tactgtatct gaacctcagt 780 tgatttgtgt cacctcaact cctaatataa ctcaacccaa catagttaca aattcattag 840 ctgcattaag taccataccg cccaccctaa tgtcaaattc agatcttgtt gctcccttga 900 catttgaatc atctttagtg aataagccgc aatattgtgc tgctaattct gctacctata 960 gtcaggcacc atctgtcagg gtccacaatg tagttccctc acccgctact gtaactgtgg 1020 gatctccact acaagtacaa tgctacccct ctatgcattc tagccctgat gggacatctg 1080 caccgtacac tggtcccgca cctgatacct ctcccattcg gtcagggttg tacccctctc 1140 tagctcaatt tcccggtctt gacatgttag ccgcttctgc ctacggggtg cctaagccag 1200 ctatcccaca ttttgacact ggtagagaga gtgattttgc cttattaaag atggccttag 1260 aaaatgtaat gaataatcaa cctcacctga gtgaacatta caaatatcag gtccttctaa 1320 gtcatttaaa actccccagc gcactgcagt tagcaaaggc ttatatgtac gaccctaagc 1380 catacaccgc agccatgcaa gcactgcaag ataaatatgg acagccacgg caactagtcc 1440 aatcagaact tagtattata ctgaacgccc ccgcagtcaa gttcggtgat gcagaagctt 1500 ttgactcctt ttctctgtcc atccaaactc tagtaggcat gctcaaaact ttggaaggcc 1560 caaatggata tgagttgcgc tgtggctctc atgttgaccg attgctaagc aagatgccac 1620 catcgaacag ggatggattc gtggagtact gcttcagccg aggtattcta cagactggta 1680 cggatcgcac ctatacctta cctgatctat cagaatggct acaaatgaag tcgcaagcaa 1740 agcgtatttc taataaagca gctacaatgt atcaaacaga aaatgctaga tttattaaga 1800 aagagcagcc aaggtcagca cctttccgca taaaggcgaa atccacatcc attctgtatt 1860 caaccgatgg tagttcagca gtgcaaggac acccaaaatc taaacctcct gttaagattc 1920 aacccttctg cccatactgt aacatgaagg accactttct tcacgcatgt cctaagttta 1980 aaatgctgac agctgagaaa ataggagaat ggctatccac agagaaacga tgttggaaat 2040 gtgggcgaac tcaccctcca gaccactgta ctctgaaaaa accttgtaca aagtgcaaag 2100 agatacatct gaccattttg catgaagcag cttcgcaaat acaaaataat gttctgttgg 2160 ttaacgtccc cagcctacag gtctatttag atcggcccaa taggtctcag aaggtaatgc 2220 ttaagatagt taaggtacta ctacaccata aagagaaagc catggaagca tacgccgtat 2280 tagatgatgg ttcggaacgt agcattgtct tatcccaagt agtggaccag ttaaaactcc 2340 ctaccgaacc tgaaacgctg acactacgca cagttcatca agatgtggtt aatctaaaag 2400 gagccaatgt atccttctat gtatcaccac tgcacaaacc attcaggaaa tacctgattt 2460 ctcatgcttt cacggcagat aatctgagac tgtccaaaca ttcctaccct gtacccgctc 2520 tccaacgcaa gtatgagcat cttaagtctc tgccattacc ttcaattgac caagcccaac 2580 ctctactact aattgggtct gacatgccac atctacttac gcctgtccag ccagtgtgta 2640 cgggatcctc acatggaccg atagccatct gtactagact tggctggacc ctgcaaggtc 2700 ccactgggct cagccagact tccgtttcca caccacaatg tctccacata accacaacca 2760 cacagaatac tgagctattc agaaatgtgg aacgcctttg gcaaattgat actctgccct 2820 atacgaatga aaaatcagcc actcgctcca aacatgacca gtatgctctc tcgcttcttc 2880 aatcagatac caccaaggta gaagttaatg gagtcatgcg ctatgccact cctttgctca 2940 gacgacccaa tgctgaactt cttaaagcct ccaaagaagc agtgatggct aatctccgaa 3000 gcactgagcg caagcttgct aaagatcata agaaagctga agcttactgt ttggaaatga 3060 gaagactgca agaagctggc tatgtggcag agatatcaat agaagaagca gagcaatccc 3120 gggaatcgtg gtacatccct caccatatgg tgacacataa cggcaaagac agaatagtat 3180 ttaactgttc ttattcctac caagggcaag ctctaaatga catactttta cctgggccca 3240 tccttggtcc atcattactt ggtgtcttac tgaggttccg cgaaaatcct gtagccatta 3300 gcggagatgt gaaaggaatg ttccaccagg ttcgcctctt gcctactgat aagtctgtag 3360 tacgcttcct gtggagagat atgcaaagga gcaaccaacc caagatctac gaatggcaag 3420 tcttaccgtt cggcaccacg tgcagcccgt gctgtgccat atatgcactc cagatgattg 3480 ctcagaccca ctccgatgga gacccagtac tggagagtac agttaaaggt tccttttatg 3540 tggataactg cctccatagc cttcaaactg ttgctgaagc tagagcatta gtagacaact 3600 tacgacaact tttgctcacc tgtggatttg agctccgtca atgggccagt aacaaaccag 3660 aggtcattca gcatctccct cctgatgcta gatcttccaa tagcgaactt tggctatcac 3720 aaaagagtac tgacctccta gaagggacac tagggctact atggaactgc ctctcagact 3780 cattcagtta caaacccagc cagatcgaat gctttgagcc cactttacgc aatgtctaca 3840 aagtgttggc ttctcaatat gatccaatcg gttatctaat tccatttaca actagagcga 3900 aagtactggt ccaggatttg tggaagacca acctaacatg ggatgaccca atcacatctg 3960 agagtttgtt aagccgatgg caaacctggc tacaagagct ccaaggttta aatcacattg 4020 cagtccccag gtgctacagc ccctctgact ggcaggcaga caaacttaat cgagaattgc 4080 atatcttctg cgatgcatcc gaaagggttt acggctctgt agcctacatg cgcactgaaa 4140 atgatcaagg acatgtgcac atttcatttg tgttagcccg ctcaagagta gcacctcgga 4200 aacaaatgac aatcccccga ttagaattga gtgcagcact cactggggca caagtagcta 4260 acatactcca agaagaatta actgttgaca tcaatcgcat cattctctgg tcagactcat 4320 ctacattgct acaatggctt aaatctgact cgtgcagata caaagtcttt gttggaacaa 4380 gagtggccga gatccagtca ctcaccaaca ttaattcgtg gagatatgtt gattcagcga 4440 acaaccctgc tgatgtcata acccgaggtc agaccttaaa cagtttgacg ccaccttgcc 4500 actggattaa tggaccacca tttctatatc aaccagattc atgctggccc actttaccag 4560 cagatgaacc agaaccagtg acagaattaa aaaagtcagt aatctgtctg catgttctta 4620 cttgtccaga caccaccctc cctgatgtca atcaattcag cacatgggct gaactattaa 4680 aagccactgt gacatccctt cacggggcgg ctacacctca aacccagccg caagatgact 4740 caggtagtta cattctagca gagaaacttc tcctacaaca gtctcaaaga gattcctttc 4800 cggaagagtt taaagctctt accactaata gaccattacc atccaacagc agattagcat 4860 ctttatcacc tgagtatgat gaggtcagtg gacttattag ggttggaggg aggttaagac 4920 atgctcatca tctggaacta gacacgacac accctgttgt tcttgatcca caacactacc 4980 tcacaaagct tctaatcaag gattatgata ccaccctact acatcctgga ctagagcgag 5040 tcttcgctga ggtaagaaga cgatactgga ttctgagagg aagggaagcc atcaagaagc 5100 atcaatactt gtgtacagac tgccagaaat ggcgtgctaa gccaaaaatg cctaagatgg 5160 ctgaccttcc accgacacgt cttcgtttgt acaaaccccc attctactcc actggaatgg 5220 attgctttgg cccattcaca gtgaagattg gcagacggag agagaaaagg tggggaatag 5280 tattcaaatg tttgaccact cgttgtattc atcttgacct agtagaaagt cttgatacag 5340 acgcattcct aatggctctc cgcagattca tatcacgaag agggaaacca tttgagctcc 5400 ttagtgataa tgggacaaac tttgttggag gagacaaaga attgagagca gcctacgaat 5460 ctatgattcc ccaactcaga gagcaattgg cagaacaaca aattacattt agatttatcc 5520 cacctggagc ccctcatttt gggggagttt gggaacggga ggttaagtca gtaaaacagg 5580 cccttaaagt agtattgaag gatctaacag tgactgagac tgtactgcga acagtgctca 5640 tcgaggtaga aggcatccta aatgccaagc ctctgggata tgtgtcttcc gatgtatcgg 5700 atctggaccc tattacaccc aacattcttt taatgggtcg ccatgattct tctctcccac 5760 aagtatttta cgactcaagt aatgttctgg gaaccagaag atggaaacat agtcaaatac 5820 ttgctgacca tttttggtct agattcatac actactattt accaaattta caagaaaggc 5880 agaagtggcg caaggatggt ggggaaattg ggctagacca agtggtactg atagtggacc 5940 cacaactccc acgagcatgt tggccagtag ggaaagtaac taataccttt cctggacttg 6000 atgggcgaac cagatcggtt gaggttcagg tacgcaaccg cacttatgtt agacccgtat 6060 ctcgtttagt ccctctaccg aaacacactg atgatgaccc tgttaatccc gatacttagt 6120 taggatatct cactatttca aatgtactca cttacatttg ggggcggctg 6170 // ID ERV1-2-I_DR repbase; DNA; ZEB; 10225 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 02-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1-2-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-10225 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2170-2170 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 157..2184 FT /product="ERV1-2-I_DR_1p" FT /translation="MTDASGIQIHLISNEELKTRKKENPLHVSPQSCTIGN FT CVYIGVNDQENNWLELFDAIDKATGTGEQVELILHNHKGPVQFPERFRGRI FT KVLKNNTTVSPEFYAESEDERAGSILGEQAHAAAPYRSEVTFLPXQRDKEP FT ETDRQIQWGPSTSTPYDETLMRKAKAKTKHSPLNPLKAATXRIHKMSPLRK FT DSSSDSSSEEDRXLQTRGRIRQKNARPSSKDRKTSAHVDGRFIKAGNRRVT FT IPEGFKIQLKKVEGTTRSEQKXNADMIYLATAXTADEILERTDINQKAKLD FT MVCELADALQALISGIIWSGMTEDENASSKYNPKLGTVHKIRKHFAKSXSQ FT ESRGRQXXAESRKRQDTPLPSDSSETESSEATXSSDEEFSVRKSIMLAGKE FT NKKTKNSQAVSELKERALYPLSQLSFPAEATIMDKAQEIVRMGELYKADDD FT LIEAMIVHHTRKRIAPATVRLQLGTILGKSPEDERNDRAEMIRAIGVYPYC FT LDRQGKVESVYALIRELQEVTDLVPGLAEAKLLGPETTWVELSNFAIRWER FT KRKLISEKQRVQPPRAAAQRAQRAPPARKGRQQRQRAEAPVTERHQKSMQE FT KPKSRNKFQKSDFKQKPAESDYVSKENWERMTPEQRQDLIDKRNKKKQSKS FT KNKTREVKRNERVVNSNRKESRNREE*" FT CDS 2215..5214 FT /product="ERV1-2-I_DR_2p" FT /translation="MSEQXXETADMNLNINXEGILFDLDTGADVTVVSQLP FT KEAKETETAFNVKDFRGNXETHKVYDWKNLQIIKGVNNLMTLKDASTILQS FT KQIGWLKQINLEQEIKRMAETSKADRTKVTEILRKYQTHAARSKNDCGKLS FT DKWQYEIKGGIPPPQRQYPINRXALPEIKITIEELKRKGIIKEVDNAPTNS FT PIQAVAKPDGGWRLVTNYKALNKQTVPDTRYLINARDATGAAPEGRLLSKI FT DLANGFWSVPLKPESRARTAFTFENKQYVYNVLPQGFKNSPNAFQAIMMEI FT LKGLPVVVYIDDILIVTNDEEEHLKILDETLSRLTKEGFRLNYKKMEIAKD FT DVEFLGFQITGNERFLSQTTENKLKELRGKEITSLKQLQSMLGTLNFVRDL FT IPGYAKHAKPLYNATRGGVLTWNEKLSEIKAKLIDEALNSGRLVRRDETTP FT LVVKIANTEDEIEIIIYNEXNEKHPVSFISSTKPANQRKLENMTGGDIXAT FT LAKSLITLKCLALDQPIXVKAKGEGINQIVREXKNLISENKRVHCHTWAKW FT SVILNDTQFQFKSDKIPKAEKTTKVPIDKVCYYTDGTTEQGITRWGWIRKK FT GRKVTESDKGSLQAGLSAQAAEVKAFRKALEKAKQLNDLKCFXVTDSDYIY FT QAVEEHLGTWKNNDFNTSKGKPLKQADDWRVIDELITEITPTVLHQTSHTS FT QNTTAAIGNKEVDSYVRRVRAVTKESQKDLLEKLHIELKHPTTKYLALYCH FT ALGLNIQNLKSEYYKIKVNCPNCRKAIDSRYNDYGSIKTDHENDEISMDFA FT GPIRPQTKKKNCYFLLITDNASEYTQIFPCKRATEEVVIESLQQWFKFRGI FT PNRVRADGAXNXSGRKVEQFIKDSGAKLIKSVKYQPXSNGIAERRIKEIKN FT WXCKNNXLPWDEQXIDCLKFVNYRVPIPEKEENPPRNRAKIKVGDIVYIQK FT RVRGHRFKENLGTEDKVKSIPSNTTVELEANGIWSLRDIIKKPE*" FT CDS 6550..8364 FT /product="ERV1-2-I_DR_4p" FT /translation="MKRVCLCSLLQESMQKDGPVRGVRESRKCTESRVQET FT VMSLMITSSLLLIIIGTIFSGLEIKTQSTEETRHVCKHVEITEDWVCPNQE FT ITVKNKTYNSSDFETMMKSYIRNIDDRFWVCNHSNPIALFIPSKYNGLMNN FT KVKFKWDSLIVINESHVRLIGLEILNKNQTVWTNESSLSVWNLELGVELSK FT SVGHCWQNLTNAKQWIMTDSISAPRNDKILRGLHGGRISKVGLLSETGNET FT SRNRRLTNWLMAGAKGHVTYWPDGLNSTHLILGVYKEKPVIPYEEIDWVTC FT KQHQTQGKYCDSEKACIHKLGMRPFSSNGAKFLIRKYGGLKPDSWASLFQH FT NMVKYCTDLTKAEYWYCYITGQSLYDQNLMDQYESEIFDKWKAYVRYMGIR FT EERVLNETCETIKCFIDSHWIDNRGIHPLMLDSSHGVPESVANKSGLTDFI FT KSIPDQQYRMALALYVYKWDMLTRAEYKTRPSPVHMGLVDSVHGLWHNKFY FT IKFDFSHKLMGAYRHQFNKLRFFHPNFRKTIYTRNLVDVSYENRTHDYWKG FT GATLNMGFSGFKWNLYENSGLNNSASLIGAELQECDPDIFNNWTYKYIPPY FT ARKEXR*" FT CDS 5214..6548 FT /product="ERV1-2-I_DR_3p" FT /translation="MADTNLTCDLKACEGLYKQKGFIRVIRGFSGHLTIEK FT WLKPEYDRALGRKSVSVXGTPAISYRIEDCAVCGKPNVAFKLAEGLYANRP FT NKAELLIQHGLGAHFPWWNELGDIDACALHDVTICGDCRESHQKVEIEQYE FT IKQDGCLCRNCQSCKQRFIALGRECRCVYDGEEFTSCERCKSLFSCKCLKL FT ISKKQIWMTRKSRKALGALVENRIMTMWDINPVSLMTNAXSKFMAQRLXEL FT SFCPEGLPGGQESLNGNESFRNGLIYLTHLGGWETAVKLQLHTQVSYKRGG FT LGACVWVLHDSEWFKCELSDAIIITAKXMTFKQCIKVCEPKDEPSYFTESN FT ISVYGSSQSFREGTAKLKIIRSTQPYHLMTREPSDEDKSRIEILDLTVLLN FT LTGSYLGIGECIQGVNVYLNRSKDKSDMATGNKLIQFKKWISKKFSRGTE* FT " XX SQ Sequence 10225 BP; 3659 A; 1665 C; 2234 G; 2541 T; 126 other; tggaggcccc aggcgaggaa cgttaatatt tttgcgaaaa gagaattaga ttaagagaga 60 aagtagggga cgcccgaaaa agaaatctct cgaacaagta tagaataaag tgcacataga 120 cctaaagaag gatctatatt ctatacgaac aaaaacatga ctgacgcttc gggtattcaa 180 atccatctta tctcaaatga ggagttgaaa acgcgtaaaa aggagaatcc tttacatgtt 240 tcgcctcaat cttgtacaat aggtaattgt gtttacatag gcgtaaacga tcaagaaaat 300 aactggctgg aattgttcga cgctattgat aaagctacgg gtacmggtga acaggtagaa 360 ttaattctac ataaccataa aggtcccgta caattcccag agcgattcag aggacggata 420 aaagtgttaa agaataacac aaccgttagc cctgaattct atgctgaatc agaagatgag 480 agagctggtt caattctggg cgagcaggct catgccgccg ctccatacag gtcggaagtt 540 acctttttgc cttmgcaaag agacaaagaa cctgagactg atcgacagat tcagtggggt 600 ccatctacta gtacaccgta tgatgaaacc ctaatgagaa aagcaaaagc taaaacaaaa 660 catagccctt taaatccttt aaaagctgca acaastagga ttcacaaaat gagtcccttr 720 agaaaagaca gctcgagcga ctcgtcctca gaagaagata garttttaca gactcgrggt 780 agaattcggc aaaagaatgc caggccgagc tcaaaagaca gaaaaacgtc agctcatgtt 840 gaygggcgmt ttattaaagc tggaaataga cgagtaacma ttccggaggg atttaaaata 900 cagctgaaaa aagttgargg caccactcgt tcagagcaga aagakaacgc tgacatgatt 960 tatctggcra cagcmwgtac cgcggatgag atccttgaac gcacagatat aaatcaaaaa 1020 gccaaattag acatggtatg cgagcttgcc gacgcgcttc aagctttaat cagyggrata 1080 atttggtccg ggatgaccga agatgaaaac gcttcgagca aatacaatcc taaactagga 1140 acagtgcata agataagaaa gcactttgct aaaagcgrct cacaagaaag cagaggcagg 1200 cagkcasgcg ctgaaagtag aaaaagacag gataccccac tsccgagtga ttcragcgag 1260 actgaaagct ctgaagcrac agrctcctcc gatgaggaat tttcagtaag aaaatccatc 1320 atgcttgcgg gaaaggaaaa taaaaagact aaaaattctc aggcagtaag cgaactaaaa 1380 gaaagagccc tctatccgct atctcagcta tcttttcctg cagaagcaac cattatggat 1440 aaagctcagg aaattgttag aatgggcgaa ctttayaaag cggatgacga tctaatagaa 1500 gctatgatcg tgcatcacac gcgcaaaaga attgcacccg ctactgttag gttgcagcta 1560 ggcacaatac tgggcaagag cccagaagat gaaagaaayg atcgtgctga aatgatcaga 1620 gcgattggag tttacccgta ttgtcttgat agacagggta aagtcgaatc agtgtatgca 1680 ctgataagag aactccaaga ggtgacagat ttagtccctg gcttagctga agcgaaactt 1740 ttagggccag aaaccacctg ggtagaattr tctaatttyg ccatcagrtg ggagagaaaa 1800 agaaagytaa tctcrgagaa gcagagagtt cagcctccga gagcagcagc tcagagagca 1860 caaagagccc cgcccgcacg aaaaggcagg cagcagagac agagagcaga agcccctgtc 1920 acagagagac atcagaagtc tatgcaggaa aaaccaaaga gcagaaataa attccaaaaa 1980 tcagatttca agcagaaacc tgcagaaagt gattatgtct caaaagagaa ctgggaaaga 2040 atgaccccag agcagagaca ggacttaata gataaaagaa ataaaaagaa gcagtcaaaa 2100 tcaaaaaata agactagaga agttaagaga aacgagmgag ttgtkaacag taaccgaaaa 2160 gaatcccgaa atcgagagga atagggttgc ctgactccag gaaaaaaggc cataatgtcc 2220 gaacaartta magaaacggc agatatgaat ttaaatataa attwcgargg aatattattt 2280 gatcttgata caggagcaga tgttacrgta gtaagtcagc tacctaaaga agctaaagaa 2340 acggaaaccg cgtttaatgt aaaagacttt agaggtaatr tagaaacgca taaagtttat 2400 gattggaaaa acttgcaaat tatcaaaggt gttaataatt taatgacact aaaagatgcy 2460 agtacaatat tacaaagcaa acaaataggc tggttaaaac aaataaacct ggagcaggaa 2520 ataaaaagaa tggccgaaac cagcaaagcc gaccgtacta aagtyacaga gatattaaga 2580 aaatatcaaa ctcacgccgc aagaagtaaa aatgattgcg gaaaattatc tgataaatgg 2640 cagtacgaaa taaaaggtgg tattccacca ccacaaaggc agtatccgat taatagaarc 2700 gcgttgcctg aaataaaaat cacaattgaa gaattaaaac gaaaagggat aattaaagaa 2760 gttgataatg cgccaacaaa ttcccctatt caagcggtag caaaacccga cggggggtgg 2820 aggttggtaa caaattataa agcattaaac aaacaaactg tacccgatac aagatatctg 2880 attaatgcaa gagatgctac gggcgcagcc ccagagggga gaytgttgtc caaaatagat 2940 ctcgcaaatg gattttggtc tgtgccgctt aagccagaga gcagagcgcg caccgcattt 3000 acgtttgaga ataaacaata tgtatataat gttctaccac agggatttaa raattctcca 3060 aatgcttttc aagcaataat gatggaaata cttaaaggcc tccctgttgt agtgtacata 3120 gatgacattt taatcgttac aaatgatgaa gaagaacatc ttaaaatatt agatgagacc 3180 ttaagccgac taactaaaga aggatttagg ttaaactata aaaagatgga aattgctaaa 3240 gatgatgttg aatttttagg atttcaaata actggtaatg aacgatttct gtcgcaaact 3300 acagaaaata aattaaagga attaagagga aaagaaataa ccagtttaaa acaattacag 3360 agtatgctgg gtacattaaa ctttgtaaga gatttaatcc ccggatatgc taaacatgcg 3420 aaaccccttt ataatgcaac aagaggggga gttttaactt ggaatgagaa actttccgaa 3480 ataaaggcta aactcataga tgaagcttta aatagtggtc gtcttgtgag aagagacgaa 3540 accactccgt tagttgtaaa aatagcyaac acggaggatg aaatwgaaat aattatttat 3600 aatgaryaaa atgaaaaaca tccagtaagy ttcatttcct ccactaaacc cgcaaaccaa 3660 agraagctgg agaacatgac tggaggcgay atyyttgcta cactcgctaa atctctgatt 3720 acactgaaat gccttgcrct ggaccagccg atcatrgtta aggccaaagg tgaaggcata 3780 aatcaaatag tgagagaakc aaaaaayctt ataagygaga acaaamgagt mcattgtcat 3840 acktgggcta aatggtcngt aattttaaat gatacgcart ttcaatttaa atctgacaaa 3900 attcctaaag ccgaaaagac taccaaagtt cccattgata aagtttgtta ttatactgac 3960 ggtacaaccg arcagggrat aactagatgg ggatggattc gaaagaaagg ccgaaaggtc 4020 accgagtcag acaaaggtag tttacargca ggrttaagcg cgcaagctgc tgaagtaaaa 4080 gcatttagga aagctttaga gaaagccaaa carctraatg accttaaatg ttttatrgtg 4140 acagatagtg attayatcta tcaagctgta gaggaacatt taggtacttg gaaaaataat 4200 gattttaata cttccaaagg gaagccatta aaacaagctg atgactggag ggtaatagat 4260 gagctgatca cagaaattac accaacagta ttgcaccaaa ctagycatac gtcacagaat 4320 acgacggctg ctatagggaa taaggaagta gacagttatg tacgtagagt acgggctgtt 4380 accaaggaaa gtcaaaaaga cttacttgaa aaattacata tagaattaaa gcaccctacc 4440 acaaaatatt tagcgctata ttgtcatgct ttaggcctaa atatacaaaa cttaaagtct 4500 gagtactata agataaaagt gaactgtcca aattgcagaa aagcaattga ttcacgttat 4560 aatgattayg gatctataaa aacagatcac gagaacgatg aaattagcat ggattttgct 4620 ggacctatta gaccgcaaac gaaaaagaaa aattgttatt ttcttttaat cacggacaat 4680 gcgagtgaat atacgcagat ttttccttgt aaacgagcta cggaagaagt agtaattgaa 4740 agtttacaac aatggttcaa atttagrgga atacctaatc gagtcagagc cgacggagck 4800 twtaacmttt caggtcgaaa ggtcgaacag tttataaaag attcaggggc taaattaatt 4860 aaatccgtga aatatcaacc ckckagtaay ggaatcgccg aaaggcgrat aaaagaaatt 4920 aaaaattggt wttgtaaaaa yaatcawtta ccttgggacg aacaamttat tgattgctta 4980 aaatttgtaa attacagagt ccctatacct gaaaaggagg aaaatccacc aagaaacagg 5040 gccaaaataa aggtaggcga catagtctac atacagaaaa gagtccgagg gcatagattt 5100 aaagaaaatc taggcacgga agataaagtt aaatcaatac catcyaatac taccgtagaa 5160 ttagaagcca acggtatttg gagtctcaga gacataataa araagccgga ataatggcag 5220 acacaaacyt gacctgtgac ctcaaagctt gtgaaggcct ttataagcaa aaaggcttca 5280 tcagagttat tagaggtttt tctggacact taaccattga aaaatggctt aarccagaat 5340 atgatagagc actgggaaga aaatctgtta gtgtacmagg aacaccagct atcagttata 5400 ggatcgaaga ctgcgccgta tgtggaaaac ctaatgtagc gtttaaacta gctgaaggct 5460 tatacgccaa taggccgaat aaagcagagt tgctaattca gcatggttta ggagctcatt 5520 tcccgtggtg gaatgagtta ggagatatcg atgcgtgtgc tttgcatgat gtcaccatyt 5580 gtggtgattg cagagagtca caccagaagg tcgaaatcga acaatatgag ataaagcagg 5640 atggatgctt atgccgaaac tgtcagtcat gtaaacaacg gtttattgcc ctaggaaggg 5700 aatgccgatg tgtgtatgat ggggaagaat ttacatcgtg tgaaagatgt aagagtctat 5760 tcagttgcaa gtgcttaaag ctaataagca aaaagcagat atggatgact cgtaaatcca 5820 gaaaggccct aggggcctta gttgaaaata gaataatgac aatgtgggac ataaacccag 5880 tgtccctaat gacaaatgca gawtcraaat tcatggccca ragactsart gaactgtcct 5940 tttgyccgga aggtctgccg ggggggcarg agtctytgaa tggaaacgag agttttcgga 6000 atggcctgat atatctcact cacttaggtg gatgggagac cgctgtcaag ttgcagttac 6060 acacgcaggt cagctacaag agaggtggct taggtgcatg tgtatgggtt ctgcatgaca 6120 gtgagtggtt taaatgtgaa ctttctgacg ctataataat tacagcaaaa gamatgacat 6180 ttaaacagtg catcaaagta tgcgaaccaa aagatgagcc cagttatttc actgaatcaa 6240 acatctcagt ctatggatct tcacaatcct ttagagaagg gacagcaaaa ctcaaaatta 6300 tcagatccac acaaccatat cacttaatga ctcgtgaacc aagcgacgag gataaatcta 6360 ggatagaaat tctggattta actgttctgt taaatttaac aggaagttac ctgggtattg 6420 gagaatgtat tcagggagta aatgtttatc tgaatagatc aaaagataaa tctgatatgg 6480 caacaggaaa taaattaata caatttaaaa agtggatctc caaaaaattc agtcgaggga 6540 cagaatagga tgaaacgagt ttgtctctgt tctctgttgc aggaaagcat gcagaaagac 6600 gggcctgtca gaggggtgag agagagccgg aagtgtacgg agtcacgggt ccaggagacc 6660 gtgatgagcc taatgataac tagctcttta ctcctaatta taataggcac gattttttca 6720 gggttagaga taaaaactca aagcacggaa gaaacccggc atgtctgtaa acatgtcgag 6780 ataacggaag attgggtatg tccaaatcag gaaataactg tgaaaaataa aacctacaat 6840 agtagtgatt ttgaaacaat gatgaagtcg tacattagaa atatagacga cagattttgg 6900 gtatgtaatc attcwaaccc aattgcttta tttataccat csaaatataa tggtctaatg 6960 aataataaag taaagtttaa atgggatagt cttatagtta taaatgaatc ccatgtaaga 7020 ctgattgggc ttgaaatcct aaataaaaat cagacagtat ggactaatga aagtagccta 7080 agcgtctgga atttggaatt aggagttgaa ttaagcaaat cagtaggaca ttgttggcag 7140 aatttgacaa atgctaaaca atggataatg accgactcca tttctgcacc cagaaatgat 7200 aaaatactta gaggcctaca tggcggccga atcagtaaag ttgggttatt atcggaaaca 7260 ggcaatgaga cgtctaggaa tcgcagatta acaaattggt taatggcggg agctaaagga 7320 catgtgactt attggcctga tggtttgaat agtactcacc taatattagg tgtttataaa 7380 gaaaaaccag taattcctta tgaggaaata gattgggtaa catgtaaaca acatcaaacc 7440 caaggtaaat attgtgatag cgaaaaggca tgtattcata aattgggtat gagacctttt 7500 agctctaacg gggctaaatt tctcatccgg aaatatggtg gattgaaacc agattcatgg 7560 gcatcccttt tccaacacaa tatggtaaaa tattgtacgg atttgactaa agcagaatac 7620 tggtattgct atataaccgg tcagtcactt tatgatcaaa atctaatgga tcaatatgaa 7680 agcgaaatat ttgacaagtg gaaagcttat gtcagatata tgggcattcg agaagaaagg 7740 gtcttaaatg aaacctgtga aacaataaaa tgttttatag acagtcattg gatcgacaat 7800 cgaggaatac atccgttaat gttagatagt agtcacggtg tacctgaatc agtggctaat 7860 aagagtggyt taacggactt tattaaatcg attccagatc aacaatatag aatggcccta 7920 gctttgtatg tatacaaatg ggatatgctc acgcgcgcag aatataaaac taggccaagc 7980 ccggttcata tgggactggt agatagcgtg cacggcctat ggcataataa gttctatatt 8040 aaattcgatt ttagccataa actcatggga gcttatcggc atcaatttaa taaattgagg 8100 ttttttcatc caaattttag gaaaaccatt tataccagaa acttagttga tgtttcctat 8160 gaaaatagaa ctcatgatta ctggaaaggt ggagctacct taaacatggg ttttagtggg 8220 tttaaatgga atttatatga aaacagtgga ttaaacaaca gtgcaagcct cataggagct 8280 gaattgcaag aatgtgatcc agatatattt aataattgga cttataaata tataccgcct 8340 tatgctcgta aggagaygag ataagattgt ttaagggaga gaatagtttc tgctcctggg 8400 aaaattattc catcataacg gaaaaaatta gggataggtg tgacaaaatt aattgctcca 8460 ttccttggca tgcaatcttt gtgtcagcac ttcccatagt tgcaaaatat gaaaaccagg 8520 ttaaaaaaca ctgtgcgatt atagggcttg aaaattttca tgttaaaaga aaattaacag 8580 aattagttga tgattacata gaggggacga gcttgggtcc aaattggcac tttatgcaca 8640 attggatctc agccttagac attgtcatag atgatgtccc aataattgtc aaaattagaa 8700 agaccttgtt aaatgcagga gtaataatta gcttgggtgc tggaacagca gttgaaatta 8760 ttaaaagaac cgtgatcgga trstggaagt ggatcaartt aattttgatc attatgggat 8820 gcatwgtggy tattgtgctt cttattagcr tatcccttaa gacagcacar ctatgtggtt 8880 tctgagcagt gtctgcgggg tcaggygtta tgctcgccta aggtcaaggg tcccagagta 8940 gagtaaaaaa caagtcatgg actgtattta aaggctgmag tcataacagt ctaaacgaac 9000 atttagtgga agagactgac ttgattgctc atctccaaaa cctctgaaat tggaagaaaa 9060 tggcatctat gaatggaaat tacgggaatg ctgataacat tgtgtcggga acaaaccttg 9120 ctcaggtaaa taaattctaa tcattttaaa ttgtgaagaa ataatttatt atgatgaccc 9180 agtaatatgg gaatagaatg agagaaataa cagttatgat tgttttaata gtcttgccac 9240 tgtttaatca atcataacaa gatgtcagac acatctgggt acgagataca gtgttatcag 9300 tagggtaatt tttgggtttt tgtggttttt tatgattttt gagaaggatt gaattttatt 9360 tgaacttttc aaaatgaatg tatattaaac caaggaccaa ctgttttcat ttgtgtaatc 9420 tataaattgc gtaacaatgt taaagcaatt gtgggtgaag aatgaaacaa cagatgtaat 9480 tgttatagtt atgacatttg ggtaattatg acatattgat ctttttagat ttcgagaggg 9540 ggaagagcaa gtgacgacaa cctggcctgt acccatagta caagaatgtg caaaatttat 9600 taccacaaca atggaaggga atttcagtgt tgtatgaact gttttgtgac cactgataaa 9660 aactttaaca cgattcatta tattctctgt tggaatgaag gccaattaca gccgttgtta 9720 tcaaaattaa tractgccag agggcaatta attttaccat gggctcccgt ggttacacat 9780 agaatgaatc ggcgtaaagt tctttgttcg gccttttggg taaatggaga aatccaaatg 9840 ggttagtagt tagttggcac ctacatatcc aacccacagg aacaagttct ggtgtggatg 9900 gatggccgtg ggagagtacc aagacggagg aaatcctatg gcgaacgggt gatacttcga 9960 ctaacgaacg cccagatctg gagtttacca gcgcggcctt tctcgaagac gaaaagaatt 10020 ggccggcgtc tttggccgca atggggcgaa gtaacttgct aaccgtccag ttattttacc 10080 gtaagagaga aaagttattc aatctttggt taattggcay ggatggatgg gccgttcaac 10140 tccaacagtg tatatataaa tckwgtgtry tagatgaatt ggataatgat tattttrtgt 10200 ctcaaytrmr matgkcaaga ggggr 10225 // ID Gypsy-17-LTR_DR repbase; DNA; ZEB; 1670 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-17_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-17-I_DR; Gypsy-17-LTR_DR; Gypsy-17_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1670 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-17_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 8-8 (2005). XX DR [1] (Consensus) XX CC Gypsy-17-LTR_DR is a long terminal repeat of the Gypsy-17_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-17-I_DR. XX SQ Sequence 1670 BP; 505 A; 237 C; 286 G; 637 T; 5 other; tgtagcacag gtgaattaag tcacacagaa cgagtattta aaatatataa mtaaatraak 60 atgagcaaaa cacacattac atagtatact gatrtaatta aataaatgtt atttttcttc 120 aaatcttata taaataagaa aaggatattt tctttttagt ggatagaatg ttttttatga 180 tttttcattt atctgagtca tttttggttc atttgtatta tgtatgtaaa aaaggggtac 240 tggcccttta aatcagtttt tcggagaaag tgcttctctg acgaggcaga gttgaatcct 300 ccattttgtg aggggtgacg gagcagatct gacgcagatt agtgcatcta aacatcgcta 360 tttgtgccga aatacaccaa taatcatctt atgaaaggga tttggtgagt gtttaacact 420 tttagcatgt ttttgccttg tttttgttga agtttgtcga acgctaatct gataacgatc 480 agtaaacaag tgcctacgca tttgtatgat aaaatccctt taacatatat gctttaagta 540 aaaccctgag tttaaattaa tgtgtttata cttttatagt gatttttgtc ctggtcttta 600 ttctgtgtaa acattttatt tgagtttagc ttgttagtct gttaattgtt ttataagaag 660 gttatgttta agcttgatta agttaattgc tttaatgaaa aggtactttc agcaaaacgc 720 agtgtcctaa ttgtattttt tatttacaat atacaaacta aatatggaca tttcctgtta 780 tctttggcct aaaatacagg tcaggttttt gattggtgga actgaatgtt ctggaaatca 840 gttgatggcg agcctcccac ggatctggac acttgattga ctgagttgat ctcccatcgc 900 tatactacgg ttttttcygc tttatgaggt gggatctaag cttacttatt ttttttattt 960 cttcatcatt ctcggttgga aatactgaga aattcctgga ctgtctcaca gacctatatt 1020 tttcatattt tttccttctg ccatccgcgt tacacggatt tgagttgaac tgttcttttc 1080 tttttataaa caaaaaggga atagactgat ttgaagttaa agagacttaa gaaagacttt 1140 tttttggatt ttgacaactt gggttttttt ttgtacaaag aaaaaaaaag aacaaagtct 1200 gttggactgt aatgattatt gattctttga gctttagtgg ttttaagaaa gccaactaat 1260 tcacagtgta actcttttct gtgagataca aaaaattcat gaatgggaaa attacatgct 1320 taaagagaca taagtgtgtt tagaaggtgc acctttagga ttgtatgtgc tgtacaatat 1380 tgtcaattgt atttttctga tacaattata ctgttttatt tttactgtta aactttagtt 1440 tatttttttt cttttttttt ccattggcca tttcatatag taaactacca gctgatttaa 1500 agaaaaaacc acgtggaata ttttcttttc tcaaatattt attcatacta tttacaggct 1560 ccatagtcga acctaactgg tccaaataac atctttgcac ttaggaaata acacataaaa 1620 ttagcacaac caatagctct taattattat aggtgtaaac aggtgttaca 1670 // ID CR1-36_DR repbase; DNA; ZEB; 1779 BP. XX AC . XX DT 19-FEB-2009 (Rel. 14.02, Created) DT 19-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-36_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1779 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 522-522 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(131..721,600..1631) FT /product="CR1-36_DR_1p" FT /translation="SFPXFCSXXLSXSVRAHITTIVNASLSTGFIETALKT FT AAVTPILKKPGSDSSDLTNYQPIFNLPFLAKVLERVVVGQLQEHLQANALF FT DAFQSGFRSGHSTETALVKVVNDLLLIADSGACGILVLLDLTAAFDTISHS FT ILLDRLHNWVGVDGIVLNWFKSYLSGRSQFIYVGGSQIAYCSFMSWCPPRV FT STWSTSLIGLNPIFLDVHSLSMWGGHRSRTVPLCHGVPQGSVLGPLLFSIY FT MLPLGQILQKFGIRYHCYADDTQIYINTSPNSSASMSSLNACLTEIKAWMQ FT HNYLKLNSSKTEILLVGTPSNIKRCSEFELTVDGSVLSPSPQVRNLGVLLD FT SQLSLKPHFKHVTKTAFYHLRNIARLRPFLSRPDAERLVHAFITSRLDFCN FT SLFGGLPANSLCKILQYIQNSAARVLTHTSSRSHITPVLQQLHWLPVKSRI FT DFKTLILTYKAVHGSAPGYICDLISISSASRCLRSASGPTLFQPRCKLSTM FT GGRAFSFRAPKLWNALPTNIRNAGSLDGFKKLLKTHLFRTVFNL*" XX SQ Sequence 1779 BP; 404 A; 378 C; 338 G; 645 T; 14 other; ttactttgcc ggtgatcaca tatgttgagc cctyaccctc tgttcttcct gcttctggct 60 tcttaacatt gctgattatg attttattgc aaaaactgyt gtcaytgaac tcwycatcyt 120 gtgwtcttga tcctttcccr acttctgttc tgragractg tctcwyagtg ttcgtgctca 180 cattaccact attgtgaatg cttcacttag cactggtttt attgaaactg ctctgaaaac 240 tgctgctgtt acaccgattt taaaaaagcc tggcagtgac tcctctgatc tgactaatta 300 tcagccgatt tttaatctac cttttctggc gaaggttctt gagcgggtgg tcgtggggca 360 actgcaagaa cacctgcaag caaacgctct gtttgatgca tttcagtcgg gttttagatc 420 aggacatagc acagaaactg ctttagttaa ggttgttaat gaccttctcc tgattgctga 480 ttcgggggct tgtggtattt tagtgctttt agacttaacc gcagcttttg atacaatttc 540 tcatagcata cttttggaca gacttcataa ctgggttggt gtggatggga ttgtgctaaa 600 ttggtttaaa tcctatcttt ctggacgttc acagtttatc tatgtggggg ggtcacagat 660 cgcgtactgt tcctttatgt catggtgtcc cccaagggtc agtacttggt ccacttctct 720 ttagtattta tatgttgcct cttggtcaaa tcctgcaaaa atttgggatt agatatcatt 780 gttatgccga tgacacacag atctatatta acacttctcc caattcatcg gcatcaatgt 840 catcgcttaa cgcttgctta actgaaatta aagcttggat gcaacataat tatcttaaac 900 tgaatagttc aaaaacagag attttattgg ttggcactcc ttctaacatc aagaggtgta 960 gtgaatttga gttaacrgtt gatgggtctg tcttatcacc atctccgcag gtgcgtaatc 1020 ttggtgttct cctggactct cagcttagtc tgaagcctca ttttaaacat gtaactaaaa 1080 cggcctttta tcatcttcga aatatagccc gcttacgtcc tttcctctcg aggcctgatg 1140 ctgaaagact tgtccatgct tttatcacgt ctcgcttgga tttttgcaat tctctgttcg 1200 gagggttacc agctaattct ctttgcaaga ttttacaata catccaaaat tctgctgcac 1260 gtgtgttgac acatacatcg tctcggtctc acatcactcc tgtactccaa cagcttcact 1320 ggctgcccgt caagtctcgt attgatttta agacattgat tttaacatat aaagcggttc 1380 atggttctgc tccaggttac atttgtgacc tgatatccat ttcctctgcc tctcgctgtc 1440 tacgctctgc ctcgggtcct acgctgtttc agcctcgctg caaactcagc accatgggag 1500 gtagggcgtt ctctttccgt gcacccaaac tgtggaatgc tcttcccact aacattagga 1560 atgctggttc attggacggt tttaagaagc tacttaaaac tcatcttttt aggactgttt 1620 ttaatttgtg atatctttta ttctttactg tgtttttttt actgtatttt ttattgtttt 1680 actgtgcttt gttgttttta ctttcttgta agcgctttgg gttttagaaa agcrcgttat 1740 aaataaaata tattattatt attattatta ttattatta 1779 // ID BEL34-LTR_DR repbase; DNA; ZEB; 588 BP. XX AC chr6; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL34-I_DR; KW BEL34-LTR_DR; BEL34_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-588 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1412-1412 (2008). XX DR Genome; chr6; Positions 17777661 17777074. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 588 BP; 162 A; 77 C; 124 G; 225 T; 0 other; tgtatggtgt tgattcgttt ggtaagctgt gcattgattt atttaagtgg tgtaattgtg 60 acatctagtg gcggaaaatg tcattgcgcc attctttaga acaaccggat atttaaaaag 120 tcatagaaga ggagactgtg atcgccatgc agagacgcgt tgttgtttgc tttcagccct 180 tgtgatagaa acgactgtaa gttactaatt ttattttgaa aacatctgca atacttactt 240 tgggttaaat tgattaagga gggtcagtgt tgaaatgttt aatgcttgat gttctgttta 300 aatgctaaca agcttgcata tagatgcacg tcattaaagt gcctggattg tttatttagc 360 tgtttgatat cagtttaaat atgtcagttt atgtttacaa atgtgaaaga tgtatattat 420 ttgtgatatt tcgtaatcac tttgtaattt ttaccttttg tattttcctt tttagtttta 480 caagcaaatc aagtaaaagt gcttcacatt cgcaagaaga aagtgtgatg gtgctttatt 540 tctgttaact atctgtcagc tttggttttg aacaccattg gcagggca 588 // ID DNA-4-5_DR repbase; DNA; ZEB; 927 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-4-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-927 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2136-2136 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long, and TIR is 13-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 927 BP; 284 A; 140 C; 169 G; 334 T; 0 other; cctcctgaga ccccgcccat tgacttgtgt cctctgtagt ggacattgcg ttttcatgaa 60 ttttctttga attttttgag ctactctgtc aagtcctgtt gtactgttca gaggacatcc 120 tgggctttct agtgatatgt catttgattg gctggcatgc taggaaccac ttcttacact 180 gcatccaaaa tggccgacat acaaaaaaag cacattttat gggaaggaga gaggtatgta 240 aaatttagct ttttaaatgt ttttttacta ttattattac atattatatt tgtttattaa 300 agtgtcagga acaatacatt tagctttcaa agctgttaac ttgtttgtgt ttcaaaatat 360 catccttttt taaatgtcca ctatagtgga caccaggacc atcttaatgt aattaatgac 420 tgcatgttgt aggaggcaga actgaagcag agaggattct ttataagata gttgagggag 480 actctgattt agagtctgac aatcagacaa ttagagaatt agagacaatt agagaatgac 540 aatcagtttt aaatcgcttt tatgttaaat ttgttttgga ttaacatcac ttcttttttt 600 tttttttttt ttttgagttc ggctctcttt cagactaaac tgtttcagga atttcaatac 660 aaaaggaaaa aaatggcttt tgttttgttt ttgttacatt aatattggat taatatccct 720 ttacagccat atcctgtaca gttttgttgt ctgagtttgg cggtcgtttc gaactaaaat 780 ggtcctgtgg tccactacag tggacatgct gtaaaattaa aaataaaaac aaaatgtaaa 840 aactctaaat tgtagatttt tttttaaccc aaaggatgta ggaaatcaca gaaaaaaaaa 900 aaaaaaaaat ctttgggtct caggagg 927 // ID DNA-TTAA0-6_DR repbase; DNA; ZEB; 1188 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.01, Created) DT 09-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 0-bp; KW target TTAA; DNA-TTAA0-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1188 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 13-13 (2009). XX DR [1] (Consensus) XX CC These transposons insert specifically between 5'-TT and AA-3', CC and do not produce TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1188 BP; 395 A; 196 C; 189 G; 408 T; 0 other; tcaattgctt tggctcaatt ctcgattgaa tttttatttt tcaaaacaat aagtttagat 60 ctctgaacaa ttagcttatt gttcacttca gagtggcatt tcttattgat ttaagcaaat 120 tgcaaatgct ttggcgcatg tgtgcaaaca gtaagtacaa ttgtctgtag tttggacaaa 180 aactaattgc atatggcttg ttgatcaaaa ctgatgagtc aattctcatt taaactgtca 240 aactcttcac aacttctcaa tcatttttca tcgtgtaagc catcacactc aaaactattt 300 attcaattat caaacatttg tatacgcacg tttacaccat aaatatctga tattgacatt 360 gtttgtaatg tctgctaaat tggtaaatga cctctagttg caatacaatt tgaatcatct 420 atttaaccaa tcaagtttgg gagcatatat acagtactgc agcaacattt gaggtggatc 480 aaaatgtttc atcaaagatg tctaaaaact attttcaata cccactcttg tcttatagta 540 agacaatttt gattcacttc aatactatat atgtagatac aatcacaatc actacacttt 600 tccctgaact gcagtaacat gcttgtggaa gagcaattgg aagaagtgta atctctggca 660 aggccaacaa gagcgacagg atgtccacca agaataattt tttcatggat gtatattttg 720 tagtttttgg tgtgtatttg tggtaacata agaaatatga aatatacaat acagtaattg 780 gacaagcaag attgcacttt gcatgtaata catttgtaca caaataaatg tactgtataa 840 atacaaatgt tcatattttt tctatgtact attcactctt ctcaacaaat atcagatttt 900 ttttaaaacc ttaatttcca tggttgactg tcatggtgaa aaaacaacta atcattttga 960 gcagtgtggc tcacacgatg acaaatacac tttatgtttt ggtggccttg gccaaattac 1020 tgacacaata actaggtttt gaagcatgaa ttaaaaatgt tttgggtgag taactacatt 1080 ttgcagacat gcaataaagt tctgaagttc cagcaaacag ttttgagatt tgcactcaca 1140 gtttagagaa tgttaagtct gtttcgaaaa atgtgccaaa gcaattga 1188 // ID DNA-8-6_DR repbase; DNA; ZEB; 683 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE DNA-8-6_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-683 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1721-1721 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 683 BP; 216 A; 115 C; 122 G; 230 T; 0 other; cagtgtttct caactggtgg gtcgcgaccc aaaagtgggt cgcgggaaca ttttcagtgg 60 gtcgcggagt gtgtggtcaa aaaaacaaca aaagtttaat tttttactta ttttgcttat 120 accggacttt tattttgaaa tgcgtgcacg acaaccctac cgtttgacat gtgaaatttc 180 atttaattat agcaacaaat atgtcgaagc gcaagtacga ccccgaatat gtaaagtatg 240 gatttacata tattgacgac aaaaaaagac ttaaagcctc agtgtgtcat atgtagtgag 300 gtgctctcac aagagagcat gaaaccttct aaattaaaac gacatttaga aaccagacaa 360 catgttttat taacagttca gtttagttca tggtaactga acctttcctt accctaacca 420 ctgtttacag tatttgtcgt ttttactttg taatatttct ataatttgat acattttgtt 480 aaatgacagc aataaaatat tgtttatttg taagtcttgg tgattttctt atgatttatc 540 tgtcactact tgtgtatgtt aacaaataaa tacaaattaa ttataattcc taaaattgta 600 ttatagttcc taaaaatttg ggtcgcggct tgatgaccat gtaaaaatgt gggtcccatg 660 gccaaaccag ttgagaaccc ctg 683 // ID HATN5_DR repbase; DNA; ZEB; 1156 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE HATN5_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HATN5_DR; nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1156 RA Kapitonov V.V. and Jurka J.; RT "HATN5_DR, a family of ancient nonautonomous hAT-like DNA RT transposons from zebrafish."; RL Repbase Reports 2(5), 26-26 (2002). XX DR [1] (Consensus) XX CC About 1000 copies of HATN5_DR are expected to populate the CC zebrafish genome. They are ~5% divergent from the consensus CC sequence. CC The genome harbors several subfamilies closely related to CC HATN5_DR. CC This element is characterized by 17-bp terminal inverted CC repeats and the 8-bp target site duplications. XX SQ Sequence 1156 BP; 304 A; 227 C; 282 G; 341 T; 2 other; tagggatgta acggtattgt aaataccgtc ataccgcaat attaattttt ttcgatatta 60 ccgaagtcgc atgactcggt aaaactatag gtcttctgag aaaatttgct caggcgaatg 120 aagcgaacgg gaggtagcgg aaactacaat tcccatcagc ccaggcgtgg ccatmatcct 180 ttgcggtctg ttgtcgctac agatccagta atgcggaaat ggagtgtgct gctagwagcg 240 gggatgaaaa agagctggaa atgatcgaac ctaaagcggg tgttgtcgcc gcgcgcgtac 300 tgaatagcgg tgttgtcgcg cgcgttctga tcagctgtgt tgtggcgcgc gtattgtaaa 360 gcgccgaggg ggggttgagc gcgcatactc aagagaggtg ttgtcgcgcg cctactgaag 420 ggctggactg gacggaggtt gtgtcgcgtc gcgggggcac ttttgatcgt tttggaaggg 480 cattttctat ccaagactaa aaagggcatg tgcactgcac aggttgagcc ctatgtgtgc 540 acgtgcctgc aagttgggga atacgactaa taacagtcat ggggactgca gaaacacagc 600 acactgttca gattgatgca gacattgact tgtaccgcaa agagacctct atctcactca 660 tggtttgtcc tctcaagtgg gggaaagaca atgcacaacg ttacccactg ctgtcaacct 720 gggccaagtc atatctctct tgtcccagaa acctcagtcc caaatgagag ggtttttttc 780 tgttgcaggg gacattgtaa atacccagag ataccagctt ttaccagatt atagttatat 840 gataattttc ctttaaaccc atctctatct aagtgagtga gtgattaaat gttgaatgtg 900 atgagttttc aacaatacta aattgaaact ttattttttt acatggttta ataatttttt 960 gttattaaaa ttgaagttcc tgtttcaaag cttacagata gatggctaat ttgtatgtca 1020 ttgacacttt tggcactttt ttggagtatt ttcataagtt ttgttttttc ctgtaaatga 1080 ttcaataaat accgtaccgt gacattcata ccgaggtatt accgtaccgt gaaattctga 1140 taccgttaca tcccta 1156 // ID DNA-8-34_DR repbase; DNA; ZEB; 1395 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 06-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-34_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1395 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 534-534 (2009). XX DR [1] (Consensus) XX SQ Sequence 1395 BP; 402 A; 287 C; 301 G; 397 T; 8 other; caggggcgga cttagtgatt tgggggccct aagcaattcc aggtatgggg ccctagcatt 60 tttaaaaaat aaaatatgtt yatattttat atattttaat atttaaagga acctttatct 120 tcctaatraa aattaaaagc aataarttca caaattatgt acaacagggg tgtccaagtc 180 ctgcagattt tagctccaac ttgtctcaac acacctgcac agatgtttct agaaagccta 240 gtaagagctt gattagctag cccaggtgtg tctgattggg gttggaacta aactttgcag 300 gacaccggcc ctccaggacc gagtttgggc accctgatgt acagttaatt agccattttt 360 gtgattagat tttaatagta ataataataa taataataat aaaaaaaaac atttaaacaa 420 atttaattca caatccatca aactaatctt ttcaccattt ttcatatgaa ttgartaatt 480 gctgtttgtg aatcggcggc cgtttcactg ggaaatgtta tttgaacagg ccctcgatct 540 cttacaagct cacagcgctc atcatctgac aggtgtttcr gccagacagc aggatcgcgg 600 aggaagctgc gtttcgttgt ttgttgtatt tctatttttt tcccaaatgt tctcaaaatt 660 ctcactcttg cacactctac ctgcctctac tctgatcgct ggcagactgg tgtcactaaa 720 atctctgtga ttgtgaacaa cagcaagaaa ataataaacg gtccatttaa attattacat 780 ttaaaacaga ggagaagcag atcggtttga ctgtaccagg gaaatacctc acgctcgcgg 840 tgctaaaaca tttgccgtcg gtatgcagag gaatgatctg ctctcaatgc tgcaggtgtg 900 agagacggct gtgtggcact gtgccgcctt tgagtgctct gaggcggggg agagaccctg 960 acggcatcac ccgcagctcg ttgcgaagag taacgttagg aacggtacag cacggacaca 1020 tgagagtctc taaaaatctc cagtaacacc taaaaacatt cgctagtcgc ttttttgaaa 1080 atttggcgct aggggggtct gaaaagtcrc taagttggca acactggctg gcagtgtctt 1140 tgactgaagc gctcacacct cgcgcatgtg cagtacagga aggaaygttc cattattcat 1200 gggggccccc ggttttgaaa agaagtaata acgttaaaat cttaacagtt atagacagat 1260 tttacaacat atgatagaaa atgtataaaa agagctatat gattaatata ggcttcttgg 1320 cattggtcgg gggcccctaa ttccccrggg ccctaagcgg ctgcttacct cgcttattgg 1380 ttaagtccac ccctg 1395 // ID L1-2_DR repbase; DNA; ZEB; 5746 BP. XX AC . XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 19-FEB-2010 (Rel. 15.03, Last updated, Version 2) XX DE L1-2_DR is a non-LTR retrotransposon from the Tx1 clade. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clade; L1-2_DR; MutsuDr1. XX NM L1-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5746 RA Kapitonov V.V. and Jurka J.; RT "L1-2_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 22-22 (2002). XX RN [2] RP 1-5746 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Molecular Biology and Evolution 21(2), 207-217 (2004). XX DR [2] (Consensus) XX CC L1-2_DR is a family of L1-like non-LTR retrotransposon. This CC family was active recently (ORF1 and ORF2 have two stop codons CC only). CC The element is incomplete (its ~100-bp 5' terminus is deleted). CC It encodes two proteins: CC 469-aa L1-2_DR1p (positions 1-1407) and 1280-aa L1-2_DR2p CC (positions CC 1513-5353). These proteins are most close to corresponding CC proteins CC encoded by other L1-like elements. CC This is the same as the 5S rDNA-specific non-LTR retrotransposon CC family MutsuDr1 in the Tx1 clade [2]. XX FH Key Location/Qualifiers FT CDS 227..1636 FT /product="L1-2_DR1p" FT /translation="WYIPGSVWLQLVREAHQIFFVYFCFCLKYIFVKLFFV FT CLVCPPSSVSCSGGVFFIYLFFFAMDGSTVTDLAMADGVEKANDNGLDRRQ FT QETNQSEKEARKRIYLKEATVTVDIGQATEVRAIDVIKAETERIGDGKILA FT VRPKHNKEYEVTLEREEDADELMDELTIKGINCAVKRLQNRDYVVSFMHLP FT VYVADKDILDKLDHWGVCPISKIKRRFYPGTNIEDGTRFVKTRFPKELASL FT PYSTRIETAEGPQYFRVMHSHQVKTCRLCLSPDHVVKDCPDFRCYKCEERG FT HFAKFCTAVKCPDCNKVLNKCECWIGEEEEEVEQQVGRQMYEGDNIQSEDK FT TTTQEKSTESESEKLQENETNGIDRVTEQEGTTWTQMDMTDSLKSVLEAAE FT LSDSNNKDLNQGQQEDTFWTQMDITDSFQKALDTEETKGQSNDEQADLEGH FT LQKDGNKETQGKSAKRRRSLKINLI" FT CDS 1730..5581 FT /product="L1-2_DR2p" FT /translation="MMFFMNVFMSFMNVFIFFMVLGIVSFNARGLLDIRKF FT EKVKEMCKREDVILLQETNWREECMKEIRKRWSGEMLYNNGDGRLGRGVAI FT LLKENSGVLCKTIYNDKEGKCMICEMEYVKKKVIMVNVHAPTEENKKKEYY FT NVLRDYLKKHERVIIMGDFNTVFSKLEMAEGMVFKTDKGRKELKILMEEMN FT LIDVWRERNEQTKEYSRRQIVGNFCCQTRIDFILCTRNVEGFINKIKYEET FT SLSDHKPLFMKLDWSNVKRGPGVWVLNTAVLKNEDYVLSVKEIIQKEKGNE FT IYNEDKRMWWENVKYLVKKFTIKYCRQLQNCKKYKEKELKEKLENELKNEN FT GKNIQKIKELQGRLNEMEEEKFEGARLRSKAKFTVEGEKCTKFFFDLEKRR FT GKSEMIREIRSKNGNVVEQHEEILEEIRSYYEKLFCTEGIKEKEKGELLNL FT IKSRVEEGEKRECDEEIREEEIKRAISGLNKKKSPGIDGLGSEFYIVFKDI FT LSSILKEVYDEIFENGEINKRMGMGLMKVIYKGKGDKVDLKNYRPITMLNT FT DLKILAKVLANRLKEVMPSIIKTNQAYSIKGRDIADTTMSIKDTIRYINDK FT QKDGFLISLDFEKAFDRVEHDFLFGVLKSFGFGENFIKWVQILYRGAVTRI FT KCNGFLTDCFKIRRSIRQGCPLSALLYALVAEPLGLAVKHEDRIKGIEVEG FT GVNKIFQYADDTTLILQDLASVKQAMETVQHFCKGSGAKINENKTGYLRFG FT RTEALSGHFTFKEMDEIKILGIVIGRDEKKAEVTMWEEILGGIERRLRFWK FT LMSLTLKGRVLILSVLMVSKLWNILYVSSMPLWMEKRLKQCFLDFLWEGKP FT PRIAYATLIGEVGKGGLGLIDVEQRKNSLRVKMVRKYLDEDNKAAWKRTME FT YFLSKSGNFNMGDNILYMRMKKFMTEGLPDFYKELIGAWGKFLTCVHFNIQ FT GRENILNQPLFLNSGILNQEKVVFFRKWWEVGITRVRDVLYEFKEGFLPVQ FT YVIDVMDEAKEDFNRQDLIKEYDIIKNAIPAEWLTRIENMEENKQSKDVIV FT RFGEKWWNLKDSTVKMIYGFFRDGVFKKPRANENWIRMFKDVNEDNIWANI FT KGKLVQSKVENLEYLIRNKAVFTDIILNKIGMEESVTCKVCQDADEGFLHL FT FLYCNELKDFNEKCKSIILTLKGERDDELEWEKVLMLGVNKECNNEKLINL FT LVMLRKSAIWERRVAAKKEKAVLDVWNVFKRKVEKYVECLFYYFKLEDMQE FT AFYDVFTQEVSKILNDTGMKMPF" XX SQ Sequence 5746 BP; 2186 A; 593 C; 1420 G; 1547 T; 0 other; caggtgctgt aagcttttca aagtccttca tttggcagct gatagacttc cccgtgtcac 60 agcttcgctg ccattatttt tctcgccttc aagggcattg tttctgtcga tgctgtgtcg 120 atgccagctg agctgcgtcc ttccctggtg ctgcgcctgt tttaaatagc caactcttga 180 cagcctcttt cgcttacggc cacaccaccc tgtgcatgtg tgatagtggt acattccagg 240 aagtgtttgg ctgcagttgg tgagagaggc tcaccagatt ttctttgttt atttttgttt 300 ttgtcttaaa tacatttttg ttaaactttt ttttgtttgt ttagtttgtc ctcccagcag 360 cgtgagctgt tcgggaggag tgttttttat ttatttattt ttttttgcaa tggacggatc 420 tacggtgaca gacctggcaa tggcggacgg agttgagaag gcaaatgaca atggactgga 480 cagacgacaa caagaaacaa accaatcgga gaaggaagca aggaaaagga tttatctaaa 540 agaagcaact gtaacagtgg acataggaca agcaacagag gtgagagcaa tagatgtaat 600 taaagcagag acggagagga ttggggatgg aaagattttg gccgtaagac caaaacacaa 660 caaggaatat gaagtaacac ttgaaagaga ggaagatgct gatgagttaa tggacgaatt 720 gactattaaa gggataaact gtgcagttaa gaggctacaa aaccgtgatt atgttgtctc 780 cttcatgcat ctgcctgtct atgttgctga taaagatatt ttagacaaat tggatcattg 840 gggagtttgt cccatttcaa aaattaaaag aaggttttat ccgggcacaa atattgaaga 900 tgggacgagg tttgtgaaaa ccagattccc caaagaactg gcgtccctcc cgtacagcac 960 aagaatagag acagcagagg gtccacaata ctttagggtg atgcacagtc atcaggtgaa 1020 aacatgcagg ctgtgcttga gcccagatca tgtggtaaaa gactgtcctg attttaggtg 1080 ctataagtgc gaggaaaggg ggcactttgc aaagttttgc actgctgtaa agtgcccgga 1140 ttgtaataag gttttgaata aatgtgaatg ttggattggg gaagaggagg aggaggtaga 1200 gcagcaggtg ggcaggcaga tgtatgaagg agacaatatc cagtcggagg acaaaacaac 1260 aacacaagaa aaaagtacag aatctgaaag tgaaaaacta caagagaatg agactaatgg 1320 aatagacaga gtcacggaac aggaagggac aacatggaca caaatggata tgactgacag 1380 tttaaagagt gttttggaag cagcagaatt gagcgattcg aataataaag acttgaatca 1440 aggacaacag gaagacacat tttggacaca aatggacatc acagacagtt ttcaaaaggc 1500 attggacaca gaggagacaa aaggccaaag taatgacgag caagccgatt tagagggaca 1560 tttacaaaag gatggaaaca aagagacaca ggggaaatca gcaaaaagaa gaagatcgtt 1620 aaagataaac ctaatttaga gactgtaaga aaaaaactgc taaaagatga agaaattgaa 1680 tgcgcaaata agtatgagtt gctaaagggc ttggaagaca tggactgaga tgatgttttt 1740 tatgaatgtt tttatgtctt ttatgaatgt ttttatattt tttatggttt taggaattgt 1800 gtcttttaat gcaagagggc ttttagacat caggaaattt gaaaaagtga aagaaatgtg 1860 taaacgagaa gatgtgattt tacttcaaga gacaaactgg agggaagaat gcatgaagga 1920 aataagaaaa aggtggagtg gggaaatgtt atacaataat ggggatggga ggctagggag 1980 aggagttgca attttattaa aagaaaacag tggggtttta tgtaaaacaa tctataatga 2040 caaagaggga aagtgtatga tatgtgaaat ggagtatgta aagaaaaaag taattatggt 2100 gaatgttcac gccccaacag aggagaacaa aaagaaagag tattataatg tacttagaga 2160 ttatttaaag aaacacgaaa gagttattat catgggtgat tttaacactg tttttagtaa 2220 attagaaatg gctgagggaa tggtttttaa aacggataag gggagaaaag aactaaaaat 2280 attgatggag gaaatgaatt taattgatgt gtggagagaa aggaatgaac agacaaaaga 2340 gtactcaaga agacagatag tggggaattt ttgttgtcaa acaagaattg attttatttt 2400 atgcacaaga aatgttgaag ggtttataaa caagattaaa tatgaagaaa caagtctgag 2460 tgaccataag ccacttttta tgaagctaga ctggagtaat gtgaaaagag ggccaggggt 2520 atgggtttta aacacagcgg ttttaaagaa tgaagactat gttttaagtg taaaggaaat 2580 tattcaaaag gaaaaaggga atgaaatcta taatgaggac aaaagaatgt ggtgggagaa 2640 tgtgaagtat ttagttaaaa agtttacgat aaaatattgt agacaattac aaaattgtaa 2700 aaaatataag gaaaaggagc tgaaagaaaa actagaaaac gaattgaaaa atgagaatgg 2760 aaaaaatata caaaagatta aagaactgca aggaagatta aatgaaatgg aggaggagaa 2820 atttgaaggt gcaagattaa gaagtaaagc aaaatttaca gtagaggggg aaaagtgcac 2880 taaatttttc tttgatctag agaagagaag agggaagtca gaaatgatta gagaaataag 2940 gagcaaaaat gggaacgtag tagaacaaca tgaggagatt ttggaagaaa taagatcata 3000 ttatgagaaa ttgttttgca cagagggaat aaaagaaaaa gaaaaagggg aattactaaa 3060 tctaataaaa tcaagagtag aagaagggga aaaaagagaa tgtgacgagg agataagaga 3120 agaagaaata aaaagagcaa ttagtggatt aaacaaaaag aaaagtccag gaatagatgg 3180 gttgggaagt gaattttata ttgtttttaa agatatttta tctagtattt taaaggaagt 3240 atatgatgag atttttgaga atggtgagat aaataaaaga atggggatgg gcttaatgaa 3300 ggtgatatac aaaggaaagg gggataaagt agatttaaaa aactatagac ctataacaat 3360 gcttaatact gatttgaaga ttttagccaa agttttggct aatagactaa aagaagtgat 3420 gccaagcata ataaaaacaa accaagcata tagtataaaa ggacgagaca ttgcggatac 3480 aactatgagt attaaagaca caattagata tataaatgat aagcagaaag atggtttttt 3540 aattagtctg gacttcgaga aagcttttga tagggtggag catgactttt tatttggagt 3600 gttaaagagt tttggttttg gggaaaattt tataaagtgg gttcagattt tatatagagg 3660 agcggtaaca aggataaaat gcaatgggtt tttaacagac tgttttaaga taagaaggtc 3720 aatcagacag ggttgtccgt tatctgcact tttatatgct ttagttgcag aaccactggg 3780 attagctgtg aagcacgagg acagaataaa aggaatagag gtagaggggg gagtgaataa 3840 aatatttcaa tatgctgacg ataccacatt aatactacaa gatctggcaa gtgtaaagca 3900 agcaatggaa acagtacagc atttttgcaa ggggtcaggg gctaaaataa atgaaaataa 3960 aacagggtat ttgagatttg ggagaactga ggctttatct ggacatttta cttttaagga 4020 aatggatgaa ataaaaattt taggcattgt aattgggagg gatgaaaaga aagcagaagt 4080 aaccatgtgg gaggaaattt taggagggat tgaacggagg ctgaggtttt ggaaattaat 4140 gtctttaact ttgaagggga gggttttaat tttgagtgtt ttaatggttt ctaaattatg 4200 gaatatttta tatgtgtcat caatgccact gtggatggaa aaaaggctga aacaatgttt 4260 tttagatttt ttatgggaag ggaaacctcc aagaatagca tatgcaacgt taattggaga 4320 agtaggcaaa gggggtctag gtttaataga cgtggagcaa agaaaaaaca gtttaagagt 4380 aaaaatggta aggaaatatt tggatgaaga caataaggca gcatggaaaa gaacaatgga 4440 atatttttta agtaaaagtg gcaattttaa tatgggagat aatattttat acatgaggat 4500 gaaaaaattc atgacagagg gtctaccaga tttttataaa gaattgattg gagcatgggg 4560 aaaattttta acttgtgtac attttaacat acaaggacgc gagaacattt taaatcagcc 4620 tttattctta aacagtggca ttcttaatca agagaaagtg gtgtttttta ggaaatggtg 4680 ggaggtggga ataacaagag taagggatgt tttatatgaa tttaaggaag gatttttacc 4740 agtacagtat gttattgacg taatggatga agcgaaggag gattttaaca gacaagactt 4800 aataaaggaa tacgacataa tcaaaaatgc catacctgca gaatggttaa caagaataga 4860 aaatatggaa gaaaataaac aaagtaaaga tgtgattgta agatttggag agaaatggtg 4920 gaacttgaaa gatagtactg tgaaaatgat ttatgggttt tttagagatg gggtttttaa 4980 gaaaccgcgt gcaaatgaga actggatacg gatgtttaaa gatgtaaatg aagacaacat 5040 atgggctaat ataaagggca aattagtaca gtcaaaagtg gagaatttgg aatatttgat 5100 cagaaataaa gcagttttta cagatataat tttaaacaaa atagggatgg aggaaagtgt 5160 cacatgtaaa gtatgtcaag atgcagatga aggattctta cacctgtttt tatattgtaa 5220 tgagttgaaa gattttaatg agaaatgcaa aagcattatt ttaactttga aaggagaaag 5280 agatgacgaa cttgagtggg aaaaggtgtt gatgttggga gtgaacaaag aatgtaataa 5340 tgaaaagctc ataaatttac tggtgatgtt aaggaaaagt gcaatatggg agagaagagt 5400 tgctgcaaaa aaggaaaaag ctgtattaga tgtgtggaat gtatttaaga ggaaggtgga 5460 gaaatatgtt gaatgtctgt tttattattt taagttggag gacatgcagg aggcttttta 5520 tgatgttttt actcaagaag tttcaaagat tttaaatgac acaggaatga aaatgccttt 5580 ttaaaaatgt gattataccc tttaaggagt tctacttgca acattttata tttaaaatgt 5640 tttattgtga agatatgatg taataaggac ctttttaatg ttttgtctga agtgaaatat 5700 gtataaataa gtgaattgta aaaaaaaaaa aaaaaaaaaa aaaaaa 5746 // ID DIRS-8_DR repbase; DNA; ZEB; 6621 BP. XX AC . XX DT 12-DEC-2008 (Rel. 13.12, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; DIRS-8_DR. XX NM DIRS-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6621 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(12), 2161-2161 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 6621 BP; 1850 A; 1868 C; 1054 G; 1845 T; 4 other; gtgaagttta ttcataaact aatttcgaga ggagcacgtg attatgattg aacacggctg 60 gtcctgcatt agcatgcttg atccaccaat caggccattc ctaaccacta taaagagcca 120 gggttttctc actacagtca tcttcgattt gaagaataca ctgctctgca tcagctgcta 180 ctgctactgc tactgctact gctactgcta ctgctactgc tactgctact gctactgcta 240 ctgctactgc tactgctact gcagctacat caacttctcc agctacatca acttctccag 300 ctacaacttc tccagcaaca acatcaactt ctccagctac atcaacttct ccaacaacaa 360 caacttctcc agctacatca acttctccaa caacaacaac ttctccagct acatcaactt 420 ctccaacaac aacaacttct ccagctacat caacttctcc aacaacaaca acttctccag 480 ctacatcaac ttctccaaca acaacaactt ctccagctac atcaacttct ccaacaacaa 540 caacttctcc agctacatca acttctccaa caacaacttc tccagctaca tcaacttctc 600 cagctacatc aacttctcca acaacttctc cagctacatc aacttctcca gctacaccaa 660 cttatccaac tacaacttct caaccttcta cttcaaattg cctcacacaa tgcaatctct 720 gtgtcttcaa cccaattctc caacaagatc acagtaaaaa ctcgatcacc atccatgaac 780 tttgcctaaa ctctcgatgc ctgcgattca agctgaactc tcttctaggg gtgcaccagc 840 ggtaacatac atgacacttt ttaaggggaa gcctcacaaa ctacaactaa gattaaacat 900 ccatctttaa aggatgattc tagcgtgaat agaatgactc acctctcaat caaggttgta 960 gacaatccaa tccttaaaac atcagtccac aacttatgca tacattgata aacatccatg 1020 ttaatcctcc aaactagtcg ctgtggtgca ttcaatagag tagcgtctcc aatcataaca 1080 ctagttctct ggcaaacaaa atggccgcca gcctttttct gcaactttga ttgacactcc 1140 ttcgagccaa tagctgaaag gagaagcgtc accatccaat gaactctcaa aaaatttgag 1200 atggtcccac cctctctcat gacaagctca tgaatgatta tcattacatt tacacaatta 1260 ctactaaatg gtgacattaa ttttaagttc agggtttgat gttggcagat gataaataaa 1320 taaatacata aattaataaa taaataataa agaaaataaa taattataaa atgtaaaaaa 1380 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaact gccacatctg atagcagttt 1440 aacttctaaa aagtttactc tgctactgaa aaacattcaa cgcattaggc atagaagtcc 1500 aatcgtaata tgccccagtg actcatgtcc atacacttat aagccataag cttataatgg 1560 caagttggtt agatgtgaaa acagcaaaga aacaaacttc ccgctttgac ttccacgaat 1620 gagaggagga catttctgac atgtctgatg tttcatcaat tgactctgcg ttagatacaa 1680 accaaggggt ttaaactcca ctataattcc acctaccctg tcttcaagag gtacatcagg 1740 atcactagac gagcagattt cccatttaat atgccatcct taggagggac aaatgcctac 1800 ctctactatc tttgctcgtt gcgacaatga ggcttcgtac catgcattaa caaaggcact 1860 tccacactta gttgcccatg ctcttattaa gacgccccgc atggttatca gctaaaataa 1920 ttcattgtgt ttgccataca tgtaccatgt tgtaatgtca tttctgattc tctcttgttt 1980 ttctcatttc agagatcatg tcagctggtt gcagaagctg accctcatcc aacctattac 2040 actccacatc catttctaaa tacatttaca ctctttgcaa atgacacttt tccactcaca 2100 tagcccacac cctcgatgcc atattcacac tagccttttt aatttaagtc ttctgaacta 2160 agtatgacat ttcaatccaa cgatacacct aaccatatct gatctatctt cgcaggatga 2220 agggatgcag gatgatatcc gttcccccac acacctgttc taaacaggcc cggattggct 2280 aatcgggagg accgggagaa ttcccggtgg gccggtctgt tttttttttt tttggccgcg 2340 agggccggtg tccctagctc cagaatctgt tgctctcagc agtgacactt ttttaattca 2400 tttacttgac cacagccttt ttattcatta ttttaccaca acttttctct tttgttgtgg 2460 tcgagtggtt ggcacgttag gttaagacgc tgcggacctg ggttcgatcc ttgctagagt 2520 aatttagtgt tttcattttt attgataaga catataatac tgttagggtt gtagaacatt 2580 tgaagttcta aagcagctgt tttctcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2640 aaaaaaaaaa aagacatgat agtgccatta gaaactgatt tggaaatatt tatttactta 2700 ttttaatata gtcagtcgtg aactgaggtg ggctggtctg aggcttgaaa ctccagagct 2760 gaaaaggtgt cccactccgg ccctggttct aaacctctta gcatttttac accacagaag 2820 agcaatacca cctcggggaa gcccagccga tcctctccag ccaacttcca ccatcatgca 2880 ttattgggta tgaagtatga ctctgtcatt tccttcctat ttcccttcct atttaacata 2940 ggagctcctt cactaccctt ccctgttttt ctagctttaa ctttttctct ccaaatttaa 3000 actatcctat acttctattc actttcattc accacagctc cgaccaattc aggggttgtc 3060 ttgagtttca gttgctgcaa caggaacgtt ctaaatgagc ttgcatacac ttacaaattt 3120 tctaaatacc actatcgttg cactccccgc actaatacag cagtaaatgc ttctgcggga 3180 gcacaagttg cctccatact ggcccactac accactgcag ccgcagaaat gctccgcaga 3240 gttgcacacc caaagcacca ctgctaccgc agtgaccctc tagcatatgc cttattcacc 3300 ttcatcttta catatcacta aaggcagtat ttaacgtccc tgcgggagca ctcaaaactt 3360 actaatatcg actgtttagg tgctccactt aactatacac cccattaaca tcactgcaac 3420 tgcagtgacg ctctagctga gccccttttt cactttcatc tctaaatatc actgctaata 3480 gtacttagta tttccgtgtg aggctctaat actgagtacc actgcacctc tgctmytgca 3540 gagacgcttt gccgagcctt attctccctc tgcaccactg ctgcaacagt taagctttgc 3600 ctgagctgca actgcagaga cggctgttta gccttaattc tctgcaccac tgattttatt 3660 gcacttctgt aactgcagag aagctctgct gagattattt tttcccctgc accaatgctg 3720 cagcagtgac gccctgcctg aacagtcttc caacaacaga tataaactct gtcattggtc 3780 ctctaacacc tctaaatcct caatttgcct cagccaatat ttcctcagag aagcccagct 3840 gattctcggt tgcagacttc aacctagttt ccaagcagca cgactctgtc tgttcttcct 3900 actttctgtc ttagttgcat aggactcatt tttactaaac tttccttttt attttattaa 3960 acttcccttc tcttyctctt cataattttt caatttcact tgcactcatt ctccgcagct 4020 cggactccca caggggtagc cccaagctcc cactgccaca gcagtgaagc tctttaarag 4080 ctcacatgta cacttttcaa aaaatttaac cattatcaca ctccgccgca ataatacagc 4140 cgtaaaggcg tagcgggaac tcgtattgct tccatactga ctacaattac acctctacag 4200 cagctccact taattacatt gcattgtagc atcaatttat tctccatcac cacactcccc 4260 gcaacaatac agccataaag gcttagcggg aactcgtatt gcttacatac tgaccaccat 4320 cacaccttta cagcagctcc acttaattac attgcattgt agtatcaatt tattttccat 4380 caccacactc cccgctataa tacagctgta aaggcttagc gggaacatgt attgctttca 4440 cacatataat gcaccagact ctgaatacct atgcacccct gcagccacag agtcgctcta 4500 ctgtgcctga ttcctatctg ctacacagct gttaatcagc tctccaaagc gcacttccct 4560 ttagaaactg actttcaccg cacctctgca gccgcagaga cgctcagccg agcatcactc 4620 ccctctacat cactgccgca gcagtgacgc ttagcgagcg catatacact tccacttagc 4680 aatccactac tgatacactc tccagcactg ccaaagcagt taaacgtctc tgcggaggcg 4740 tatttacctt ccagatactg acttccattg cacctctgca gcagcagaga cgctcagccg 4800 agcatcattt acatcccctg caccactgct gcagcagtga cgctctgctg gagctcatgc 4860 acacttccat tacacttata ccacttctgt cactcccagc actgctaaag cagttaaaac 4920 gctgctgcag aagcgtattc ttccccttag gactgactac caccgcacct ctgcagtcgc 4980 agagacgctc agccgagcat cactcccctc cacatcactg ccgcagcagt gacgctcagc 5040 gagcgcatat acactttcat ttagcaaccc aatactgata cactctccag cactgctaaa 5100 gcagttaaac gtccctgcgg aggcgtattt cccttacaaa tactgacttc cattgcacct 5160 ctgcagcagc agagacgctc agccgagcat catttacatc ccctgcacca ctgctgcagc 5220 agtgacgctc tgctggagct catgcacact tccattacac ttataccact tctgtcactc 5280 ccagcactgc taaagcagtt aaaacgctgc tgcagaagcg tattcttccc cttagtactg 5340 actaccatcg cacctctgca gtcgcagaga cgctcagccg agcttcatcc cccctgcacc 5400 actgctgcag cagtgtcgct ccaccggagc tcacgcacac taccatttat ccataccact 5460 actgacacac tccccattgg actgccaaag cggttaatca gttaaccgct tctcatctgc 5520 agctacagct ccgcagaacc ttacagcttt attttccatc actactatac taaaccttta 5580 cgaaactcac cacatatgtt ttaaaccttt acacttaccc cactcactct cccgctggtc 5640 cttacaatta acagaacccg ggagcacaca tagtcataca taagcacttt cagttaattt 5700 ttacacccac accagtctct gttgctcctc caagctattt ctgtatcact tttcagcagc 5760 cggatatggc attaatctcc tgtgcctttt ggggggttct tcaaatacgc ggctgctgtc 5820 ccgagcggag cattttgggg agttgtcgag atctacctga gctcgaggct cccctctctt 5880 cctccaaacg ggagggagcc cagggctcaa gaaccttcga gctcagggct ctctcccggg 5940 acagcatgcc aaacttgctt ataatcaatc atcagctaag tgtgaactct tgaagtgaag 6000 tttattcata aactaatttc gagaggagca cgtgattatg attgaacacg gctggtcctg 6060 cattagcatg cttgatccac caatcaggcc attcctaacc actataaaga gccagggttt 6120 tctcactaca gtcatcttcg atttgaagaa tccccccttc cacccctacc ttttcacctt 6180 tccctccata gggcagcacg gtggctcagt gactagcact gtcgcctcac agcaagaacg 6240 tcaccggttc tagttcctta acaggccggt ggtcgtttct gtgtgtagtt tgcatgttct 6300 tcccgtgctt gcgtgggttt tccccgggtt ctccggtttc ctcccacatt ccaaaaacat 6360 gtacaacaag ttaatcgtta aatctaaatt tcaatacagg taatctaata atgcagcata 6420 tcttttaata gccttcaatc ttaatcttta gctattataa aaaggggagt tgtcgagatc 6480 tacctgagct cgaggctccc ctctcttcct ccaaacggga gggagcccag ggctcaagaa 6540 ccttcgagct cagggctctc tcccgggaca gcatgccaaa cttgcttata atcaatcatc 6600 agctaagtgt gaactcttga a 6621 // ID Gypsy78-LTR_DR repbase; DNA; ZEB; 796 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy78-I_DR; KW Gypsy78-LTR_DR; Gypsy78_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-796 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1472-1472 (2008). XX DR Genome; chr12; Positions 39484177 39484972. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 796 BP; 219 A; 129 C; 178 G; 270 T; 0 other; tgtagccaag tgaaaaatag ctggtaatga cgcctccctg ccactagagg cttattggtg 60 cgtttatacg gagtcgttgt gacaggagaa gaagtgccct cctgagagtt tagctacggt 120 aatctgttgt cttttgagcc tgcaagatag gtactgtaac agttttattt agaggaaaaa 180 gaggaagtcc tcgctgcagt tttgagagca gctatttctc ttcatgtttg actggtgatc 240 ggtgggattt tgcatgcttt cgggtgcaga aatattcacg tggtgagact gtgtaatctc 300 cagcaacgaa gattgagcag gggagagcga gtacacgcga gtgcgcacgt gatcggaggc 360 tttgtggatc attcagagtg cggttcaaca ttcaacagag gatacataag ccactatcgt 420 tgtttattta ggattattta ggtttattta gatttattta gtgctattaa tctttattag 480 tgacacaagc aaaatcactt ttgttttatt tacttacctg tgtggacatt attactgtaa 540 attaaaagga ggagatattt aagtagtcat tttaaaacat tgtttggtct gagctaacct 600 attgtaaata ttgtaattct ttcattattg tccacagaac attttacatt tcttgttttt 660 gcccactgtt ggaaatctgc aactttaatg ccacattttt tctatttaaa cgtggtcaca 720 aataaagaac cctttgttta tagatgacta aggatgcttg atttatttaa ttaggaagag 780 cagtatcccg gccaca 796 // ID TDR8D repbase; DNA; ZEB; 679 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE A non-autonomous hAT DNA transposon from zebrafish - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; TDR8D. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-679 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1258-1258 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 679 BP; 203 A; 144 C; 136 G; 196 T; 0 other; ggctgtttct caattccaag aacgcagaga acggacttgt gttcttgtgg agaccggtct 60 tgccaggtgt cctcggaaga acgaactcag gagaccgcga gggcagagaa cgcgtccttt 120 gagaaatgag atgctgcgtt cttcctgatg gtcacatgac cttcacgtgt tttaaatggt 180 aaattattta aacattacag ccttcataca acgatttatt gtttttcccc ttttcaaaat 240 atatactttg cataaaaaca ttataaatat atgttgcaca atataaataa aacagatttt 300 aatacgaatt tcagcaaaca aacaccctta atgtgtttat tcctttatta agatgtccat 360 ggtaatgttt actttcaccg tttcatttag ggaaactcct gaggtaaata attcatatct 420 atgaacttta ataataaatc tagataaaat gcagcgcttc ccacctccag tcgcaatgac 480 ttctgggact tccagagcga gttcggtgct caagtctgca tcagtgcgtc ctcgatatca 540 agaacacatc cgggaagttt cacgcgtcct ccgtacttgc ggtcttgagt attggaactg 600 aacttaggca gctgatgatg acgtttcacg agaacacgag gacgcaagac cgctgaagaa 660 cgcatattga gaaacagcc 679 // ID hAT-2n1_DR repbase; DNA; ZEB; 1210 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE A nonautonomous hAT transposon - a consensus sequence. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW ERVN1-LTR_DR; hAT superfamily; hAT-2_DR; hAT-2n1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1210 RA Kapitonov V.V. and Jurka J.; RT "hAT-2n1_DR, a nonautonomous DNA transposon that contains a RT retroviral LTR."; RL Repbase Reports 4(12), 334-334 (2004). XX DR [1] (Consensus) XX CC hAT-2n1_DR is a composite nonautonomous hAT transposon. Its CC copies are ~97% identical to the consensus sequence. They are CC flanked by 8-bp target site duplications. hAT-2_DR was an CC autonomous element hAT-2n1_DR was derived from. The consensus CC sequences of these transposons are 97% identical. A portion of CC hAT-2n1_DR (pos. 323-649) that is not similar to hAT-2_DR is CC the ERVN1-LTR_DR long terminal repeat. XX SQ Sequence 1210 BP; 331 A; 239 C; 283 G; 357 T; 0 other; caggggtttt caaagtgtga ggcgcgcctc ccctgggggg cgccagagca tgtcagggga 60 ggcgcgggaa aaaatataat ataataaaaa tataattatt aagtttaatt attatatgta 120 ttttttatta tatttaaacg ttttaattaa acaaagcaaa aaaaataata cgtcaaatat 180 aagaaaacct tttttaccca gaaggccata gctgtgaatt cgcttctgtt tggcaagccc 240 gccaatacag gtatatgcct gctaatacaa tgggtcggtt tctgagaccc ccccgattca 300 aagctgtaat gaagttcacg gccctttggc cgccgggaag aaggaggcgg agaaccgacg 360 cagttttaaa atgatttatt aataacagtg acggcagctc ctcacggagg ctgccgtcta 420 aaccaaaaca aacggacggc agctcctcac ggagactgcc gtcaaactga aagcaaaagt 480 aaaatatgtc cgggcccggt cctctctcgg cttcctctgc cctcgttcct ccttttatga 540 tccagagctc cttccgtggg atccgaggca ggtgcgcacc gcaggtgtat ccacttacgc 600 ggtggcctca ctccgttccc acggctctcg gccacgcccc ctcgccacaa aagccttcaa 660 gttcagggct taaacccaaa agacgacgat atgatgatca gtatttgagt ttaggattta 720 cgtggacagg accagctgat gaaccacgac ctttatgtgt ggtttgtcaa aatattttgg 780 ctaatgacag catgagaccc gctaaacttc gactgttttg actgggatgg acagttcttg 840 gatctgttct attcatattg aaccatcatc ctagttttaa aaacgttata ttaaaatact 900 tgttattact ctgtaaataa ttgcactttc atgcaaatga tgataaaagt gagttaacag 960 tctgacatct gtctgtgtct ttatatatac tgtatatata tatatatata tatatatata 1020 tatatatata tatatatata tatatatata tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 1080 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgcgtgcgtg cgtgtttggg 1140 aggagggggg cgccaatgga taagttgtgt caaaagggag gcccactgtc ttagactttg 1200 aaaaaccctg 1210 // ID Gypsy140-I_DR repbase; DNA; ZEB; 6908 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy140-I_DR; KW Gypsy140-LTR_DR; Gypsy140_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6908 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1541-1541 (2008). XX DR Genome; chr4; Positions 36211359 36204452. XX CC Positions [3034-3573] - Reverse transcriptase CC Positions [5050-5526] - Integrase core CC 'CAAG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 335..1795 FT /product="Gypsy140-I_DR_3p" FT /translation="MDSYANHKLPKGRGVWLDYTADSRDVRHSKPIADDYT FT DTGLGHSPGCTPVSGFKRENLRTSTPATDVDAIQQLTDMVGQLGAQIGESI FT VEKLMSAGVVNIASDHQTTPTAQITHSEPTQHDSPHVTVHVRSDREFPTFK FT GDVTDKHSVQDWIDLTKTYLRKHEIPVYDQGEEIMSHLMGKARDVVKIALR FT SDPVIDLKQTPEKIYDVLHYFSEASSCLPLADFYATLPKHKENPVDYWIRL FT NKAADLALDGLRRQGKKTENINNEVALMFVKQCPDPELSYTLKCKPIHEWT FT SRDVQIRIDDYQRESTASSRAIGTTQLKNHVSAVTPEQPSISPVNSTMPSQ FT CHTPNLSLQAQHSVGPPTSPIPTCVPAQGKSIHSLSHSSTPVVAQNLQSQS FT DERLLAHMVDIFQDMMDKMQQRNTNHLSRGGRFRRTPRERRLNQPVCKVCN FT DSSHTTISHCMSERLCFACFGSGHTRLNCPVSSSPQSQSEGN" FT CDS join(2008..3996,4000..6570) FT /product="Gypsy140-I_DR_1p" FT /translation="MLDTGSMACSMSEEAEIKMKEAGVINDLHKVDVNVIL FT VGCGGLHVKPKCAFDVEMEVYGCKIVVPMLVVQGQRDELILGTNVIKHILH FT QSKKCESYWKTVSSPFPSKDYEVEQFLSMLSGLKPWRGDEAPQKIGTVRCN FT SAVCLESGCEYLIWGKLPKNTAISPGSTVMTEPTSSRSAPRGVLVSRVVTP FT LWGEKWVPLKLINASDRPVLLRRNAKLADVFSCMAIEDMDITEFPENHWAS FT FSHSLMPQPPASMQSTKERLMSVGLSNIDIDSCEVSEVCKRKMTDLVLQYE FT DIFSRHHLDCGEAKSFVHRIHLSDKRPFRLPYRRVPPGQYQKLRQAISEME FT DKEIIRKSTSEYASPLVLVWKKNGDLRICTDFRWLNKRTLKDAHPLPHQAD FT CLAALGGNSLFSTMDLTSGFYNMPLHEEDRKYSAFTTPMGLYEYNRLPQGL FT CNSPGSFMRMMMSIFGDQNYLSLLCYLDDLLVFAPDEESALLRLKMVFERL FT RSHNLKLAPKKCFFLRRSVRFLGHIVDEKGVSTDHSKVENITNMTSTDLMD FT PDGVTPSQKRIRSFLGMINYYQHFVPGYSAIAKPLFDLLKGEKKKRKGQKD FT RTTGRKLCAADWTPQREHAFDHLKASLANSMILAHPDFTRPFMLSTDASLE FT GIGAVLSQVQDGDTARPIAFASKSLTQAQKNYPAHRLEFLALKWSICDKFS FT HWLKGHKFTVWTDNNPLTHILTKPKLDCCEQRWVAKLASYDFNIKYVPGRQ FT NIVADALSRVPFVKESVGQRLLTEPYTNLLSAVTDVSCNSVKNAFMSSSGQ FT VKPVTVCGNVQAACNSPNVWTQSIGMEDVSAVLQSHIVWETGPRARAMQTL FT QFLPQLIPPGLNALPAYTEKDLRDKQLDDGTLSRVLYYVERRWRPSRRARA FT KESVSVTRYLKHWDKLTMSNGVLYRISKDPRTRAKRSQYVVPDSLKDEVLK FT GVHDEAGHQAQSRTLSLVRERFFWTNIDRDVRDYVRHCQRCIVSKTAEPEG FT RAPLESITTTRPLELVCIDFWSAENSCNKSIDVLVITDHFTRMAQAFPCKD FT QTAKQVAKVLWDKYFCVFGFPERIHSDQGTNFESRLISELLKMSGIRKSHT FT TPYHPMGNGSVERFNRTLGCMIRALSPEAKADWPRRLQTLTFMYNCTSHET FT TGYPPFYLMFGRVPRLPVDVLFHTVLHDSNVTSYDKYVASLTNDLKEALLI FT VQDHAQKEQKRHAHLYNRKVKGSSIEVGDRVLLANKTERGKKKLADKWEST FT IYTVVDMNPETHTYRICDTVTGREKVVHRNLLMLVNFLPVENGINVSDLPP FT SVSAVESSPLGTADVDDISETLFERESENDRGSDENYDVDSLDNLPACSSD FT GVSLLSDAEPGDSERRTIEWITQFSDPSLSQMDVTDVMCDTSRPQDVSTLP FT GNITNEQSVTCDSVPITDIAVDIRQSEAAPVNTTQTHNAPHALHTADQTSL FT QRSNFPNAHTQVRSRFGRLVKPVNRLIQSMSRQEVVQDNFSVKSVCKSIFQ FT SLVE" XX SQ Sequence 6908 BP; 2003 A; 1467 C; 1574 G; 1864 T; 0 other; ctggtgccgt gactccttgg atctccagat cagcttgatg ccggtcgccg tggatgctgt 60 gttgctgatc tagtctacac ttcaagatca aagtggggtc agagagagaa agaaaacaaa 120 acaaaacact aaaagagaaa aaaataataa taatttaatt atttgatttt tactgttgta 180 tatcttatga tgtttagttg aaaacttttg attacagttg ataatattct tttgttttca 240 aagcgattat tgttttgttt tattaagttt ttagtattag catacacaac gcaagtattt 300 tttttcactc aagaaaaaaa aaaaaataag aacaatggat agctatgcaa atcataaatt 360 accaaaggga aggggagttt ggttagatta cactgctgac agtagggatg tgagacattc 420 taagcccata gctgatgatt acacagatac agggttgggt catagcccag gttgtacacc 480 tgtcagtggg tttaaaagag aaaatcttcg cacatccaca ccagccactg atgttgacgc 540 tattcaacag ctcacagaca tggtagggca gttaggagct caaattggtg aatccattgt 600 tgaaaagcta atgtcagctg gtgtagtaaa catagccagt gaccatcaga ctactccgac 660 agcccaaatt actcacagtg aacctaccca acatgactca ccacatgtga cagtccatgt 720 tagatcagac agagaattcc caacatttaa aggtgacgtt actgacaaac attcagtcca 780 ggactggatt gatctgacca aaacttacct cagaaagcat gagattcctg tgtatgacca 840 gggtgaggaa attatgagtc atttaatggg caaagccaga gatgtcgtca agattgcctt 900 gcgtagtgat ccagtgattg atcttaaaca gacacctgag aaaatctatg atgttctcca 960 ttatttcagt gaagcttctt catgtctccc actcgctgac ttttatgcta cactgcccaa 1020 acacaaagaa aatccagttg attattggat aagactaaat aaggcagctg atttggctct 1080 tgatggctta cgcagacagg gaaagaaaac tgagaacata aataatgaag ttgcactcat 1140 gtttgtcaaa cagtgccctg atcctgagct ctcctacacc ctaaagtgta agcccattca 1200 tgaatggacc tcacgtgatg tacaaataag aattgatgat tatcagagag agtcaacagc 1260 tagtagcaga gccattggga ccacacagct caagaaccat gtcagtgcag tgactcctga 1320 gcagcccagt atttctccag tgaattcaac aatgccgtcg caatgccaca ctccaaatct 1380 ctctctccaa gctcagcact cagtgggccc tcccactagc cctattccta cctgtgtccc 1440 tgcacagggc aaatccattc acagtctgag ccactcttct acgccagttg tagcacaaaa 1500 tctccagtca caatcagatg agagacttct agcccatatg gttgatatct ttcaggatat 1560 gatggacaaa atgcagcagc gtaacaccaa ccatctatct aggggtggca gatttcgacg 1620 cacacctcgc gaaagacgcc tcaatcagcc agtttgtaag gtctgtaatg actcaagcca 1680 caccaccatt tcacactgca tgtcagaaag attgtgtttt gcctgttttg gatctggtca 1740 caccagactg aattgcccag tcagtagctc tcctcagtcc cagtctgagg gaaactagtt 1800 gacctgtatt tggagggggg cagtacaggt ctcagcaacc actcccacac tgacgacact 1860 tctgatgctg aaacagttta cacatcagca agagacttat cccatgattc tgaaattgtt 1920 gtttatcaga atattcacag agtcagcagt agtgagagcc ttttctatac acccgtgtta 1980 ctgggtggga cgttaaaaat tggtggcatg ttggacactg gctctatggc ttgcagtatg 2040 agtgaagagg ctgagataaa aatgaaagag gctggagtga taaatgacct ccacaaggtg 2100 gatgtgaatg tgatcctcgt tggatgtggt gggcttcatg ttaaacccaa gtgtgctttt 2160 gatgtggaga tggaggtgta tggctgtaaa attgtggttc caatgctcgt tgtccagggc 2220 cagcgggatg aactaatatt gggaaccaat gttattaaac atatcctgca tcagtccaag 2280 aagtgtgagt cttactggaa aacagtgtca agtcctttcc ctagtaaaga ttatgaggtg 2340 gaacagttct tgtctatgct ctcaggtctc aagccctgga gaggtgatga agctcctcag 2400 aagattggta ctgtcaggtg caactctgct gtctgccttg agtctggctg tgaatatctg 2460 atctggggaa aactgcctaa aaatactgct atatcaccag gcagcactgt gatgacagag 2520 cccacatcat ctcgctcagc tcccagaggt gttctggtgt caagagtcgt gactccattg 2580 tggggagaaa aatgggtccc actaaaactc atcaatgcgt ctgacaggcc tgtattgtta 2640 agacgcaatg caaagttagc agatgttttc tcctgcatgg caattgagga catggacata 2700 actgagtttc cagagaatca ttgggctagt ttttcacatt cactgatgcc tcaaccacct 2760 gctagtatgc agtccaccaa agaaaggctc atgtcagttg gtctcagtaa cattgacatt 2820 gattcgtgtg aggtgtcaga agtatgtaag aggaagatga ctgaccttgt tttgcagtac 2880 gaagatatat tttcacgcca tcatctcgac tgtggagaag cgaaaagttt cgtacaccgc 2940 attcatcttt cagataaaag gccgttcagg ctaccttata ggagagtgcc ccctggacaa 3000 tatcaaaaat tgcgtcaggc aataagtgaa atggaggaca aagaaatcat taggaaatca 3060 accagcgaat atgcatcacc actcgttctt gtgtggaaga agaatgggga tcttcgtata 3120 tgtacggatt ttcgctggct gaataaaaga actctaaagg atgctcaccc tcttccccat 3180 caggcagact gcctagctgc attaggaggc aacagtctat ttagcacaat ggatttaacg 3240 tctggatttt acaatatgcc acttcatgag gaagacagga agtactcagc ttttacaact 3300 cccatgggcc tttacgagta caaccgtctc ccccagggtc tatgtaatag tccaggcagt 3360 ttcatgcgaa tgatgatgag catttttgga gaccagaact acctgagttt gttgtgctac 3420 ttagacgacc tgttagtgtt cgccccagat gaggagagtg ccttgttacg cctgaagatg 3480 gtgtttgaga gactgcgtag ccataatcta aagttggctc caaaaaaatg tttctttctc 3540 aggaggtctg tgaggtttct gggccacata gttgatgaaa agggagtttc aacagaccac 3600 agtaaagtcg agaacatcac aaacatgact agtactgacc tcatggatcc tgatggtgtg 3660 accccatcac agaaaagaat aaggtccttt ctagggatga taaattatta tcagcacttt 3720 gtgcctggat actccgccat agctaagcca ctctttgacc tgttgaaggg tgagaagaaa 3780 aaaagaaaag gtcagaaaga caggactaca ggcagaaagt tgtgtgcagc ggactggaca 3840 ccacaacggg agcatgcctt tgaccatctt aaggcttccc tggctaactc tatgatcctt 3900 gcccatccag acttcactcg tccctttatg ttgtcaacag acgcatcctt ggaaggcata 3960 ggagctgttc tgtcccaagt acaggatggg gacacgtgag ccagaccgat tgcatttgcc 4020 agcaagtcat tgactcaagc ccagaagaac tacccagctc accggttgga gtttctggct 4080 ttgaaatggt cgatttgtga caagttcagt cactggctca aaggtcacaa attcacagtc 4140 tggactgaca ataacccgtt gactcatata cttacgaagc cgaagctgga ctgttgtgag 4200 caacgctggg tagccaagtt ggcaagttat gacttcaaca taaagtatgt cccaggtcgg 4260 cagaacatcg tagctgacgc tttgagccgt gtcccttttg tcaaagagag tgttggccaa 4320 agacttctta ctgagccata tacaaacctt ctgagtgctg ttacagatgt gtcatgtaat 4380 tctgtgaaaa atgctttcat gtcatctagt ggtcaagtga aacctgttac agtgtgtggt 4440 aatgtacagg cagcatgcaa ctcacctaat gtatggactc agtctattgg aatggaagat 4500 gtgtcagcag tactgcaatc acatatagta tgggaaactg gtcccagggc ccgtgcaatg 4560 caaacactgc agttccttcc tcagttgata ccacctggac ttaacgcctt acctgcatac 4620 actgaaaaag accttcgtga caagcagttg gatgatggca ctctctctcg tgtcctgtat 4680 tatgttgaga ggcgatggcg gccttctagg agagcaagag ctaaagagtc tgtctcggtc 4740 acaagatacc tgaaacactg ggacaaactt accatgagca atggtgtcct atacagaatc 4800 tcaaaggacc ctagaacaag agcaaaacgg tcccagtatg ttgttcctga ctctctcaaa 4860 gatgaagtcc tgaaaggagt acatgatgag gctggtcatc aggcccagtc caggaccctt 4920 agtttagtaa gagagaggtt cttttggaca aacattgaca gagatgtgag agactatgtt 4980 cgccattgcc agcgatgcat tgttagtaag acagctgaac ctgaagggag agctcctctg 5040 gagagtataa caacaactag accactggag ctcgtctgta ttgatttttg gtcagctgaa 5100 aactcctgta acaagtccat agatgtccta gtaatcacag accattttac aaggatggct 5160 caggcgtttc cctgtaaaga ccagacagcc aagcaggtag caaaagtact ctgggacaaa 5220 tatttctgtg tctttggatt cccagaaagg atccatagtg atcaagggac taattttgag 5280 agtcggctga taagtgagct cctcaagatg tcaggtatca gaaaatctca cactactccc 5340 taccatccaa tgggaaatgg tagtgtggag agatttaata gaactttggg ctgtatgatc 5400 cgcgcactgt ctcctgaagc taaggctgac tggccaagac gtttacaaac attgacattc 5460 atgtacaact gcacatccca tgaaacgacg ggctacccgc ccttttatct catgtttggc 5520 cgtgtccccc gcttacctgt tgacgttctg ttccatactg tcctgcatga ttcaaatgtg 5580 acgagctatg acaagtatgt ggcttctctt accaatgatc taaaggaggc attgttaatt 5640 gttcaggacc atgctcagaa ggagcagaaa agacatgctc atctgtacaa caggaaagta 5700 aaaggatcca gtattgaagt tggggataga gtgcttttgg ccaacaaaac agagagaggt 5760 aaaaagaaac tcgccgataa gtgggaatca accatctaca cagttgtgga catgaaccct 5820 gaaacacaca cgtacaggat ctgtgataca gtcactggcc gggagaaggt ggttcatagg 5880 aacttactca tgcttgtcaa cttccttcct gtggaaaatg gaattaatgt gtctgactta 5940 cctccatccg tgtctgccgt tgaatcttca cctctgggta ctgctgatgt ggacgacata 6000 tctgagactc tgtttgaaag agagagtgag aatgatcgtg gatctgatga aaactatgat 6060 gttgacagcc tagacaactt acctgcatgc tcgagtgatg gagtgagcct cttgtctgat 6120 gcagagcctg gggattctga aaggagaacc atagaatgga ttacacagtt ttctgatcca 6180 agtttatcac aaatggatgt tactgatgtg atgtgtgata cctctagacc acaagatgtt 6240 tctaccttac ctggaaacat cacgaatgaa cagtctgtga cctgtgactc tgttcctata 6300 actgacattg ctgttgacat aagacaatca gaagctgcac ctgtcaatac aacacaaaca 6360 cacaatgcac cacatgctct acacactgcg gaccagactt cacttcagcg cagtaacttt 6420 cctaatgcac acacacaagt cagatccaga tttggtagat tagtcaaacc tgttaataga 6480 ctgatacagt ccatgtccag acaggaggtt gttcaggata atttcagtgt taaatctgtt 6540 tgcaagtcta tttttcagtc tctagttgag taaattacag atcagttgcc ctagctttta 6600 cgggataaag tttattattt tttttggaag tggcattact gctttatata attaatttag 6660 aagttttcat ctctttaatg agcgtattct acactgaaac tttgggtctt gtggggtgta 6720 caaggcaccc tgttgccagt agctggattg ggggatagcg ctgctctcat accacttcat 6780 tcttttggaa tactgttgat gttggatttt tattttgatg accctttcat tgtgaatgtt 6840 gttgaagttg tgtcagatat gtacattgta gatgtagctc aaatttgata aaattcagtg 6900 gaggtgaa 6908 // ID BEL10-I_DR repbase; DNA; ZEB; 6313 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL10-I_DR; KW BEL10-LTR_DR; BEL10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1369-1369 (2008). XX DR Genome; chr18; Positions 45435258 45441570. XX CC 'GTATG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 725..1762 FT /product="BEL10-I_DR_1p" FT /translation="MHQQQLPQSHATRAAMQPPIPHWSFNPSLFGPVYPGF FT WPPVAGHMYPPQMPIPMNAYEYQHQRPPDFQDVFASSQCQPHPQQVLHNPG FT VGQEYFNSQLHFLPVPSASHISNPSSALITAQAGGLTNHVQQQSQVGIPRN FT VSTGTMCVQTTFHDVQFSSQLAMSSPSSPLTVRPTQQQSQISQVLQNNPLP FT HISGPSVPLPVSGFQVESEPRFHASLPPRSSRIKGPSFPYLCKEDPKQFIM FT LKMALENLLPSDETEQYKFHILLDHLKLDSARNLALAYSNHSLPFSTAWAA FT LQQKYGQPHQLVLREIAEIMSLPAIRTSDSQAFSALAVRIRALVGMLQSLG FT QCE" FT CDS join(2756..4159,4163..5974) FT /product="BEL10-I_DR_2p" FT /translation="MLLIGADHPHLLAPVEKVRLSPQGGPAVVHTRLGWAL FT HGPVPSDKSTELETRCLFLSGTSVSSQLQQDVERLWQVDVLPYRSEKAITR FT SKQDQYALDCLEQMRTRENVNGVSRYVTPLLRSKNAPLLHAPKEAVMSSLQ FT RTERQLAHDLEKALVYEGEIQKLVEAGYVKKLSPEEVSQSQESWFVPLHLV FT KHNEKYSLVLNCSYVFKGDSLNTHLLPGPTLGSSLLGVLLRFRENPVAISC FT DIKSMFHQVRLLPSDKPLVRFLWRNMRKEEPADVYEWHVLPFGTTCSPCCA FT TYALQKHVRDLSENCEDLKDSITKAFYVDNCLQSVSDIQKGKELVEKLRAH FT LKEGGFEIRQWASNVLEVVEVLPSDMRSAHTELWLSQDCVDPSEGMLGLTW FT HCAKDTLGYRYRPVEYKALTLRNVYRVLATQYDPIGFLIPFTTRAKVLLQQ FT LWVKERQWDDPVLPEGIQKAWAAENELPDITSIQLSRTYGPVELDSTSVKR FT ELHVFCDASEQAYGSVAYLRIVNDDKVLVSFVMARSRVAPRKSISIPRLEL FT CAALTGVQLANLLERELDLNIQRTILWSDSMTVLHWIRSDTCRYKVFVANR FT ITEILENSSPEQWQYVDTTCNPADDITRGKCLSDLAKPNRWSQGPDFLYSK FT PNQWQKFPDRIEDSDTTELRKTLFCGLAETREECLNLDVHQYKSWDDLIKA FT VAAQPGNSSLSKAQIELLLLQDVQRKDFAEEYGSLRAGKNMPMSSRLYNLV FT PEFDPIAKVLRVGGRLRHSPDLALETIHPIILDPKNIVTQFLIADVDSKLG FT HPGADRVYAHLRRYYWILKGRQAVRKYQRTKCLDCVKLKGKPIIPKMADLP FT PSKLNLFKPLFWSTGMDCFGPFTVKTGRKTGKRWGLLFKCQTTRCVHLELL FT TGLDTDSFLMALRRFVARRGQPYEVICDQGTNFHGGESELKNAFEHLSPVL FT KEKLWNHQIKFTYNPPYAPHFGGTWEREIRSIKSALRSTLGSQTVKEEVLA FT TVLVEVEEILNSKPLGYVSSSAEDLDSVTPNLLLMGRREAALPQVIYSDSN FT ILIQRKWRHSQVLAD" XX SQ Sequence 6313 BP; 1742 A; 1345 C; 1391 G; 1835 T; 0 other; tggtccttcg agccggagtt gagtgtttca atggatcaag atcaagttaa tagccaatgc 60 catgtaccaa ctcgttgaca gtcttcaaga cagaccaggc ctccagtcta tcttgaagac 120 tatgagagta ctattcccag tttgaagtct caaaccagtc attcagtgac ctcagcccca 180 gtccagagca gtgctgtaat ccatgcagag agtcatccag aaacttctga ttcaattttt 240 caagagatga agcagttgaa gcaatgtgta atggagtagg gacaaagtgt taagcagata 300 aacagtaaag ttaatcgcat gttttcttca gcgagttctt ggtcatctga agcaagtagc 360 caacacttat ctgcaccttc ctcacctaaa atggatactc ttgcaccttc tactgtgact 420 ggcacagttg gaacattagt tcaagagtta cgggatcacc tgcaggagaa acaaaatgag 480 caagctattt ctgagtcacg gtcacaatct tgtttgagca gacagttatc tttgcctctc 540 cctccaccct ctatgttacc ttctcctcag tctcactcaa ttaaatgctg ctttgactgt 600 tccactagtg aatctgcccc ctattgatgt gaactcccag ttgcctgccc aagatggaag 660 tttgcagcct ttagccaatc ggtttgtttc ccctatatcg catgcttcaa attacagggt 720 tgatatgcac cagcagcaat taccacaatc ccatgcaact agagctgcaa tgcaaccacc 780 aatacctcat tggagtttca acccttcatt gtttggccca gtttaccctg gattttggcc 840 accagtagca ggccatatgt atccacctca aatgccaatc ccaatgaatg cctatgaata 900 tcaacatcaa cgtccaccag atttccaaga tgtttttgct tcatcccaat gtcaaccaca 960 ccctcagcag gttttacata atccaggagt tggacaggaa tactttaata gtcagttaca 1020 ctttttacct gtaccttcag catctcacat aagtaatcca tcatctgcac tcattacagc 1080 acaagctgga ggtttgacta accatgttca gcagcaaagt caagtaggca taccaagaaa 1140 cgtttcaact ggtacaatgt gtgttcaaac tacctttcat gatgttcaat tttcatccca 1200 gttggctatg tcaagtccat cttccccttt gacagtgaga cctacccagc aacaaagcca 1260 aattagccaa gtgttacaga ataatccttt gcctcatatt tctgggcctt cagttccatt 1320 acctgtctca ggatttcaag tagaatctga gccaagattt catgctagcc ttcctccaag 1380 atcttctaga attaaagggc cttccttccc ctatctgtgt aaagaagatc ctaaacaatt 1440 cattatgttg aaaatggctt tggagaattt attaccaagt gatgagactg agcagtataa 1500 attccatatt ctcttagacc atttaaaatt ggattcagca cgcaacttgg ctttagccta 1560 ttctaatcat tctttgccat ttagtacagc atgggcagcc ctccagcaga aatatggtca 1620 accccatcaa cttgtcctga gggaaattgc tgagatcatg agcctgccag ccatcagaac 1680 cagtgattct caagctttca gtgctttggc agttcgtatt cgtgccctgg tgggtatgct 1740 acagtctttg ggtcaatgtg aatgagcagc agaattggct tgtgcttctc atgtgcaaca 1800 gcttttgagt aaactccccc ctgaacaagt tgctaattat atccggcatg gtagactcac 1860 tcgaccagga caatctcatg atttggtgga cttttctgca tggctccaag aggaagcaaa 1920 ttgtcaagca gtagtgggac aagttggtgc tgtccgtagt aagtctaatg cctaccacct 1980 ggaggagagg aaaccaataa ggtcatatca accaactgcc acttttctta atggagtaag 2040 cagtgttcca gtggtaagag ccgatgattt tttgtaaacc taccagatcc ttcactacca 2100 aagaagaaac tgagtttata aagttccatt gtccattttg taagagtgat gaccatcatc 2160 ttagtaaatg ccaagccttc agtctattag atttggagga tataaagaag tggattaaag 2220 ataatcatcg tttttggaga tgtggcagag atcactatgc tttccagtgt gatctaagaa 2280 gaggatgtcc taagtgtaaa ggtaaacatc tgggagttct acatgaggtg aatcagaaga 2340 agccagattc tggtgttttc ttgcttagtc gtccaagctg ttcccctaga gttctattaa 2400 aggttgttaa agtcctattg agccataatg gtaaatgtat ggaaacctat gccatattgg 2460 atgatggatc cgaaaggacg atgcttttgt atctggcagc acagcaactt ggattaaatg 2520 gcacagctga gtctttggtg cttcgaactg tgcggcaaga aactgaggtt ctcagaggga 2580 ctagtgtgag cttcaccatt tctgctccca actcccagat caggtatcag gttaaagggg 2640 cctttactgc acagcaattg gagcttcctg agcaatccta tcctattcca agcctcctaa 2700 acggtttagc catcttaagg ggatccaatt ggaaacgttt aacaaagtcc agcctatgct 2760 tctgattggt gctgatcatc cacatctctt ggctccagtg gagaaggttc gtctaagccc 2820 tcaaggaggc ccagctgtag ttcatactag attgggatgg gctcttcatg gtccagtacc 2880 atcagataag tcaactgaac ttgaaactcg gtgtctgttt ctttctggaa cttctgtctc 2940 ctctcagctt cagcaggatg tagaacggtt gtggcaagta gatgtgctcc catataggag 3000 tgaaaaggcc attacacggt ctaaacaaga tcaatatgcc cttgattgtc ttgaacagat 3060 gaggacaaga gaaaatgtga atggtgtctc tagatatgtt actcctttgc ttcgttctaa 3120 gaatgcaccc ttgctccatg ctcctaaaga ggcagttatg tcatctcttc aaagaactga 3180 gcgccagctt gctcatgatc tcgagaaagc ccttgtctat gaaggggaga tacagaaact 3240 agtggaggct ggttatgtga aaaaattgag ccctgaagaa gtctcccaga gccaagaatc 3300 ctggtttgta cctctacact tagttaaaca taatgaaaag tatagtttgg tattaaactg 3360 ttcgtatgtt ttcaaaggag actcactgaa tactcatctg ttacctggcc caaccttggg 3420 gtcatcctta ttaggtgtcc tccttcgctt tagggaaaac cctgttgcca ttagctgtga 3480 tattaagtcc atgtttcacc aagttcgttt actaccctca gataaacccc tcgtgcgttt 3540 tctatggagg aacatgagaa aagaggaacc agctgatgta tatgaatggc acgtcttacc 3600 ctttggaact acttgcagtc cttgctgtgc tacttatgcc ttacaaaagc atgtacgaga 3660 cctgtcggag aactgtgagg acctgaagga ttctatcacc aaggcctttt atgtggataa 3720 ctgcctgcaa agtgtctctg atatccaaaa aggtaaagag ctggttgaga agttacgagc 3780 ccatttgaaa gagggaggct ttgagattcg acagtgggcc agtaatgttc tggaggtggt 3840 ggaagtgcta ccaagtgata tgaggtctgc ccacactgag ctttggttga gccaagactg 3900 tgtagatccc agtgaaggta tgctgggact gacttggcac tgtgcaaaag acaccttggg 3960 ttacagatat cgtcctgttg aatataaagc ccttactttg cgcaatgtct atagggttct 4020 tgcaactcag tatgatccta taggatttct tatacctttt actactcgag ccaaagtcct 4080 gctgcaacag ttatgggtta aggagagaca atgggatgat ccagtattac cagaaggtat 4140 tcagaaagca tgggcagcat aggagaatga acttcctgac atcacttcta ttcaattgtc 4200 tagaacatac ggccctgttg aacttgacag cacatcagtt aagagggagt tgcatgtatt 4260 ttgtgatgct tcagaacaag cttatggatc tgtagcttat ttaaggattg tcaacgatga 4320 caaggttcta gtctcctttg tgatggcacg ttcaagagtt gctcctcgta agtccatttc 4380 catacctcgt cttgagttgt gtgctgccct tactggggtc cagttggcaa atctgttgga 4440 gagggagctt gacctgaaca ttcagcgaac catcttgtgg tctgattcaa tgactgtact 4500 ccactggatt agatccgata cttgtcgtta taaagtgttt gttgcaaacc gcattactga 4560 aatcttggag aactctagtc ctgaacaatg gcagtatgtt gacaccacct gtaatccggc 4620 tgatgatatc accagaggta agtgcctttc tgatctagca aagcctaacc gttggagtca 4680 aggtcccgat ttcctttatt ctaaacccaa tcagtggcaa aaattccctg acaggattga 4740 ggactctgat accactgaac tgagaaagac cttgttttgt ggcctggctg agactagaga 4800 agaatgtctg aatttagatg tacatcaata taaatcatgg gatgatctta tcaaagcagt 4860 ggcagctcaa cctggcaact cttctcttag taaagctcag attgaactgc tcttgttgca 4920 agacgttcaa agaaaggatt tcgcagaaga atatggttca ctgagagccg ggaagaacat 4980 gccaatgagt agtcgactgt acaatcttgt tccagagttt gatcctattg ccaaagtgct 5040 tcgagtagga ggacgactca gacatagtcc agatctggct ttagaaacca ttcatccaat 5100 tattcttgat cctaagaata ttgtcactca atttttgatt gctgatgtgg attctaagtt 5160 gggacatcct ggagctgatc gagtctatgc tcatcttaga cgttattatt ggattctgaa 5220 aggaagacaa gctgtgagga agtatcaacg aacaaagtgt ttggattgcg tcaaattgaa 5280 gggtaaacca ataattccta aaatggcaga tcttcctccg tctaagctca atctgttcaa 5340 gcctctcttc tggtccactg gaatggattg ttttgggcca tttacagtca aaactggacg 5400 caaaactggg aaacgatggg gattactttt taagtgtcaa accactcgct gtgtacatct 5460 agaattgttg acgggtcttg atactgacag ttttttaatg gcattacgta gatttgtggc 5520 tcgtcgagga cagccttatg aagtgatctg tgatcaaggg accaattttc atggaggtga 5580 gagcgagtta aaaaatgctt ttgaacattt aagtcctgtc ttaaaagaga aattgtggaa 5640 tcaccaaata aagttcacct acaatccccc ttatgcccca cattttggag ggacgtggga 5700 gagagagatt cgctctataa agtctgcact tcgctcaacc ttgggaagcc aaactgttaa 5760 agaagaagtt cttgcaactg ttttggtgga ggtggaggaa attttgaatt ctaaacccct 5820 aggctatgtc tcttctagtg ctgaggatct ggattcagta actcctaatt tgctccttat 5880 ggggcggcga gaagctgctc tacctcaagt tatttattct gattcgaata tcttgattca 5940 aaggaaatgg agacacagcc aggttctggc agattaattc tggactagct ttattcgcaa 6000 ttatctgact aatctacaag ttcgccaaaa atggcttcag gatcaggata atttgaaaaa 6060 ctctgttgtg gtaatggttt tggaccatcg acttccacgt gctttatggc ccattggtaa 6120 agtgctcaag actttctcgg gcaaggatgg tcgggtcaga gttgcagaga ttgaggtcaa 6180 tggaaaagct tatgtacgac cagttagccg cctcattcct ttacctcctg ttaaggattg 6240 acttggagca tggtggtttg atatgagatt ttctggaagt aaaaatatgt catgccatat 6300 tttggggcgg cct 6313 // ID TDR8C repbase; DNA; ZEB; 1122 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE A non-autonomous hAT DNA transposon from zebrafish - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; TDR8C. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1122 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1257-1257 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1122 BP; 388 A; 218 C; 226 G; 288 T; 2 other; ggctgtttct caataccaag tacgcaaagt tcggacttgc gtccttggaa gttcagactt 60 gccaagttca gacttggaag aacgaactcc cgaggacgcg aggacacaag tcgggtactt 120 cttcaaatgg aacggcagtg tactttataa cgtcacttac ctcaccatgg attcatcggt 180 ttcactgtac attaacaata aaatgattta attatataaa gcatggcccc gtaattaata 240 ttatattaat attataaatc tacatttttg atttaagaaa atgtaaacat aaatcactat 300 aaatgtgaag taaaatgrgt tattatacac agacacgtgt ctttgattat cagatttaat 360 aaactacaat ggtaattata caagtataaa gtacaagacr tacacaataa gaaataaaga 420 taacgtcaag caatttggta aacaaagaca aacagcgtac aacacggtaa cataattaaa 480 gacaattata aaaggtaaca aaataacact gccaaaataa tacacctgag aacaaataat 540 agatttataa gaccaaaaac tgtccgttag atatgcacaa gatgtattaa atatcatgtt 600 ttaactacaa tggagagatg agatccagcg gtacatccga gatggactgg ccgaagtaaa 660 gctgctctcg gcgggggagg tgatgacgga gctcccgctg gcaggctggg agtcagagtc 720 cgcataagct gaggcacatc gccaaagacg cgctatcttt gtgaataaac tgccgattcg 780 agtttaaaac aactacattc tcgcctgaaa aactcttaaa actttatttt gtgacacagt 840 aaaggagtat ttttaaaacc gtgcgggttt tctcctccgc cattagcttt cgctggttac 900 gctgcaatgc attctgggat acctgagctt cgcaagttcg cacaagtcac ctctcgatgc 960 atccttggca aaaggggcag agcaagtaca catccgggaa ttttatctgt acttggcaag 1020 atgtgaactt tgaattggaa cagtacttcg gcgacggttg atgacgtttc acgaggacac 1080 aagaacgcaa gtacagacaa gaacgcatat tgagaaacag cc 1122 // ID MSAT-5_DR repbase; DNA; ZEB; 161 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Minisatellite-like DNA - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-161 RA Jurka J.; RT "Minisatellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2012-2012 (2010). XX DR [1] (Consensus) XX SQ Sequence 161 BP; 40 A; 16 C; 80 G; 24 T; 1 other; ctgatgaagg ggaggagctg ctgatgaagg ggaggagctg ctgatgaagg ggaggagctg 60 ctgatgaagg ggaggagctg ctgatgaagg ggaggagctg ctgatgaagg ggaggagctg 120 ctgatgaagg ggaggagctg ctgatgaagg ggaggagctg n 161 // ID DNAX-10_DR repbase; DNA; ZEB; 1141 BP. XX AC . XX DT 08-MAR-2010 (Rel. 15.11, Created) DT 08-MAR-2010 (Rel. 15.11, Last updated, Version 2) XX DE nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNAX-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1141 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 10(11), 1867-1867 (2010). XX DR [1] (Consensus) XX CC Likely TSD 2 bp. >97% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1141 BP; 309 A; 270 C; 242 G; 320 T; 0 other; tacgcttatt tcacgcggcc gccattttaa agaatcaaag cgaggctgcg gtgggaagaa 60 acccggaagt ataggtccag cactgtaaac attgcagtaa catgccgtgt actataaacc 120 tccagctgtt gcagagattt caaaatgcag aaatggtgta accatcactt taatattatt 180 cagtaagttt attttaaaca attaaaagca atttagcatc aaacaagatc acaatctcag 240 tgatacgctt aagtgtcctc ctaggcttca gctcatggca ctagttaaac accgtgagga 300 cgctttcctg cttcagcctg tgtaacccgg tgagtgataa aacactgcag tcctctgcag 360 cacaccctcg tgttgtcttt aatagtgttg tcccggtact gaagttttaa acccggctat 420 ttcccgctaa catggaagcg ccgctgagcg cggatgactc gactgatctt cacggcttcg 480 cgcttcagtc tccgtgtatc tgtgtttggt ttgcgctcag tttaccgctc ttcacccagt 540 gcaagcacag atacacggac tggagcgcga agcagccgtc aagatcagtc gagtcatcaa 600 gcagatcgct cggctcgtct gtgtttgtgc tcagtataca gcggtgggtg aactgatgac 660 cttctcggcc aatcacaagc atttctgttg aacacgtgaa cacaatggca aatcagcgct 720 gtttcaagaa agccatcaac attgccttaa atgttagcgc gaaattgccg gtctttcttt 780 taattcagta tcgtgacaag tctaatatag tgaaagaatc agtttattaa ctcgagtttg 840 atgaatagca tctaaaacgt gtgtttggga aagaaaacgt tagctaacta gtgccatttc 900 ccttaactta gctgaaaatg agctctgata tcccttttta cattcaccat tcattcagaa 960 cggaccttcg ggtcacccgg aaggcggaga aatgataaat ttgtgtcact ttctcttcgc 1020 tgataagata tatagccagt tacacaacat tcctatgtaa ctcccaatga tacttccggg 1080 tttcttccca ccgcagcctc gatttcgctt ttgaaaatgg cggccgcgtg aaatcagtct 1140 a 1141 // ID DNA-2-34_DR repbase; DNA; ZEB; 1222 BP. XX AC . XX DT 13-MAR-2009 (Rel. 14.03, Created) DT 13-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-34_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1222 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(3), 685-685 (2009). XX DR [1] (Consensus) XX SQ Sequence 1222 BP; 427 A; 213 C; 199 G; 381 T; 2 other; cagttttttt tcaatcgcta acaagcgctt acccatactt cagatacttt ttctaaactc 60 ttaacacaga ctcacaccta caaaacacaa ttggccaaat ggataatttt cttctcaaaa 120 atacattttg ttaaatatat actaactctt catttcaaaa tagagcacat ctttctctgc 180 acacactaac tttaccaaaa cactggaaat ctgactcaaa atgaaattat tctgtcaaag 240 aataacactt gttttcactt cacaaggtac atgcagtcaa tcaaagtaca ccaggtttca 300 aaatactggc tattgttgac attacaaaaa ctgcagagac ttttatgttt cagttttaca 360 ggtatttgca tgcaaaacat gcaatccacc attttgacac taaatttctt ggttgggaac 420 tgtatgtcca catgtacagt aatgttcaca ttcaaaagca ttccagtaaa aagcaaatat 480 tttttttttc ctttactgtt tactacagga ggtacagtag aaacatggta tgatagaaca 540 gaaaaagttt tacagtattg cttacagtga acaaaatatc catgaaacac acaagagcat 600 tctgaaaaaa aaaacaaaaa aaaaaacagc aacaaaagcc cagataaaaa aggggtaaaa 660 aaaaaaaaac taatctctgc gatcttcagg gttaggccac atgttttcat caacatcact 720 cctcaaagtc ctccttttac tgtataartg taaawgaaaa aaaaacccct gccactgagt 780 ctaagccaga ctggaatcag ctgtggttgg cccagtttac ccaacataaa tcagctgtgt 840 gtcaattatt caattgtttg tttattatca gctgagatat atttttgata ttgatattgt 900 ttttgaactg atatcattgc agaagcagag gtttttatac tgtatctaag gtttggaata 960 ttgtttttgc tattgtggga tgttgtgtta acatttgtaa atactacaaa aacaatccat 1020 cattttgttg ggaggtatag cttgtctgtt aagaaaatgt aagcattgtg gaaatgtgtt 1080 cactgactgc atattgggtg aaaacgacat gaaatgtgtg aatggtatgg ccacaaaaga 1140 ccgatgctgt gctaattgtg tttagagttt tgaaaatgtg acaactggtt ggacaaatgc 1200 ttgttagcga ctgaaaaaac tg 1222 // ID Gypsy50-LTR_DR repbase; DNA; ZEB; 353 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy50_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy50-I_DR; KW Gypsy50-LTR_DR; Gypsy50_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-353 RA Dib M.R. and Naveira H.F.; RT "Gypsy50_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 830-830 (2007). XX DR [1] (Consensus) XX CC Gypsy50-LTR_DR is a long terminal repeat of the Gypsy50_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy50-I_DR. XX SQ Sequence 353 BP; 75 A; 60 C; 79 G; 139 T; 0 other; tgtgacgacc ccaggacttc agtatggatg tggcagatcc tgattgggtg cctggttgtc 60 accaattggc tcaaaccata taaaagggtt ttgagtgcta gcttctggag gacggtgtgg 120 ctacgggccg tcctcgtgtt cagatcatct cttcatttct gcaaccttat ggttttgttt 180 agcattattt atttgtgtag ttaaagtagt tattttcttt attttttccc ctagacgttt 240 gtttgttgat ttgttgttct atttagtttt atgtataata aataagttgc atatacattc 300 ttttgttgtc cgaactcttt tatgttgcga ctcaaaacga gggggtcgta aca 353 // ID Gypsy97-I_DR repbase; DNA; ZEB; 4318 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy97-I_DR; KW Gypsy97-LTR_DR; Gypsy97_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4318 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1505-1505 (2008). XX DR Genome; chr19; Positions 23619967 23624284. XX CC Positions [1744-2199] - Reverse transcriptase CC Positions [3214-3693] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 19..996 FT /product="Gypsy97-I_DR_1p" FT /translation="MDPASVSQISEFVSHSNARIDRQDEALATTAQAIQAL FT VGQVSLLTSQVQQLVTRAAQADTAPAPETAPTPEVLPAPAVLGRSVEPRLP FT PPTSYSGEPLLCRSFLSKCSLYISLQPSLFSSEQSKVAFVITLLSGKAAQW FT GTTAWEQKLPCCSTFDLFSKELKKVFDRAASGREAARVLAELRQGNRSVAE FT YSIEFRTLAAECGWNAEAQWDMFLHGLSDRLQDAIYSLDLPKTLDELVDLA FT IRVDTRLFRRETRLQQSRYSDSVPDFPALAATPEVVSADPEPMQLGRSRLS FT VQEKRRRRTKGLCLYCGGAGHRVASCTVKDNTHQ" FT CDS join(1384..2802,2806..4128) FT /product="Gypsy97-I_DR_2p" FT /translation="MSEKCHESCLSSARSAVSCSVFQEERMDLSNVPSEYL FT DLKRVFSKSRAASLPPHRPYDCAIDLLPGTSPPKGKLYSLSIPEREAMEKY FT ISDSLAAKIIRPSSSPAGAGFFFVKKKDGSLRPCIDYRGLNSITVKNTYPL FT PLMSSAFERLQGASFFTKLDLRNAYHLVRMKQGHEWKTAFLTPRGHFEYCV FT LHFGLSNAPAVFQALVNDVLRDMIDQFIYVYLDDILIFSRSLQEHVQHVRR FT VLQRLLENGLFVKAEKCVFHAQSVPFLGHIVSVEGVRMDPEKVQAVVNWPI FT PESRKALQRFLGFANFYRRFIRNFSQLAAPLTSLTSAKTPFRWSSAAQVAF FT TNLKSRFVSAPILVTPDPSRQFVVEVDASEVGVGAILSQRAASDDRIHPCA FT FFSHRLSPAERNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLE FT YIRSAKRLNSRQARWALFFGRFDFTISYPGSKNIKPDALSRLFDSSERNTP FT LEPIVPQRICISAVTWEIESRVRTALDGVTPPVGCPPSRLFVPEEIRSDVI FT RWGHSSKVACHPGVSRTLFLVKQRFWWPAMARDIREFVLACSVCAVSKTSN FT RPPAGLLQPLSVPSRPWSHISLDFVTGLPPSGGNTAVLTVVDRFSKATHFI FT PLPKLPSARETAEAVINHVFRIHGLPTDVVSDRGPQFVSKFWREFCHLFGA FT TVSLSSGLHPQNNGQTERANQDLERMLRCLVSQNPSSWSQQLPWVEYAHNS FT LPVSATGLSPFQCSLGYQPPAFPSLESEIAVPSAHAFVQRCRRTWSRARQT FT LLQVGARTKAKADRHRSKPPVYVVGQKVWLSTKNIPLRSVCNKLAPKFIGP FT FTVIKIISPVAVRLKLPPAYRRIHPVFHVSKLKPVFHTAINPPSPVPPPAA FT SRRWGDRLFG" XX SQ Sequence 4318 BP; 914 A; 1141 C; 1097 G; 1166 T; 0 other; gaacgaactg acctgttcat ggatccagcg agtgtttccc agatctccga gttcgtctct 60 catagcaacg ctcggattga tcgacaggat gaggccctgg ccaccacggc tcaggcgatc 120 caggccctgg ttggccaagt ttcattgctc acatcccagg tccaacaact ggttacccgt 180 gctgcacagg ctgatacggc accagccccc gagactgcac cgaccccgga agtgttacca 240 gcgcctgcgg tccttggtcg ctctgttgag ccgcgtcttc cacctccgac cagctactct 300 ggagagcctc tcttgtgccg atccttcttg tctaagtgtt ccctttacat ctcactgcag 360 ccgtcattat tttcctccga gcaatcaaag gtggcgttcg tcatcaccct cctttcgggc 420 aaggcggctc agtggggaac aacagcttgg gaacaaaagt taccatgctg ctctacgttt 480 gacctgttct caaaagaact taagaaggtc tttgatcgag ccgcttccgg gagagaggcc 540 gcccgagtcc tcgctgaact ccgccagggg aatcggagcg tggcggagta ctcgattgag 600 ttccgaacac tagccgctga atgcggctgg aacgcggagg cacaatggga catgtttctg 660 cacggattgt ctgatcgtct gcaagatgcg atctattcct tagatcttcc aaagactctg 720 gacgagttgg tggatttggc catccgtgtg gacaccagac tgtttcggcg tgagactcgc 780 ctgcagcaga gcaggtattc ggattccgtc cctgactttc cggcattggc tgcgacaccg 840 gaagtggtaa gcgctgaccc ggaacccatg cagttaggga gatcccgctt gtccgtgcaa 900 gaaaagcgcc gacggagaac gaagggactc tgcctctact gcggaggggc gggtcatcgg 960 gtggcttcct gtaccgtaaa agacaacacc catcagtagg taagaggtta ctgatgggtg 1020 aaattaattt ggacaaatcc tctacgactg tcactttact gcccgtcact gtatcatggg 1080 gttccggaag tcatgacacc catgccctcg tcgattccgg ggcggaggga aatttcattg 1140 actccagttt cgctttcagt tctaaacttc cggttatagc tctgtcacaa cccattgcag 1200 tacgcgccct tagtgggctt tcccttccca ccatcactca ctccaccaaa cccataaaac 1260 ttaacacgtc tggaaatcat gttgaacaca tttcattttt tttaacagac tgttccaact 1320 caccggtggt tttaggacat ccatggttaa ttcttcataa acctcacatt aactggggtc 1380 ttaatgagtg agaaatgtca tgagtcttgt ctttcgtctg ctcgttctgc tgtttcttgt 1440 tctgtgtttc aggaggagcg catggacctg tcaaacgtgc ccagtgagta cctcgacctg 1500 aagagagtgt ttagtaagtc tcgagctgct tctctgcctc ctcatcgtcc ctatgactgt 1560 gcaatagact tattgccagg tacctctccg cctaaaggca agttatactc tctttctatt 1620 cctgagaggg aggccatgga gaaatatatt tctgattctc tagcggccaa gatcatacgg 1680 ccctcttctt caccggcggg ggcggggttc tttttcgtga aaaagaagga tgggtctctt 1740 agaccttgca tagactatcg agggctgaac agcatcacgg tgaagaatac gtatcctttg 1800 ccgctgatgt cttcagcgtt cgagcgtctg caaggggctt cgtttttcac gaaattagat 1860 ctccgcaacg catatcattt ggttcgcatg aagcagggtc atgagtggaa aactgcgttt 1920 ttaaccccca gggggcattt tgaatattgt gttcttcatt ttgggctttc taatgctcct 1980 gcagttttcc aagcactcgt aaatgacgtg ttgcgagaca tgatagatca gtttatttat 2040 gtctacctgg atgacattct gattttttct cgttctctcc aggaacatgt gcagcacgtc 2100 aggcgagtgc ttcagcggct gctagaaaat gggctttttg tcaaggcgga gaaatgcgtg 2160 ttccatgcac agtctgttcc ctttctagga cacattgtgt cagtcgaggg ggtgcgcatg 2220 gacccagaga aagttcaggc tgtggtgaat tggccaatcc ccgagtctcg taaggccctg 2280 cagagatttc tgggcttcgc taatttttac cggcgtttta ttcgcaattt cagccagctc 2340 gccgcccctc tgacgtcctt aacctccgcc aagacgccct tcaggtggtc gagtgcagca 2400 caggttgcat tcactaatct aaagagccgc tttgtttcag ctcccattct ggtgacacct 2460 gatccctccc ggcagtttgt ggtggaggtt gacgcgtcag aggtgggggt gggcgcgatc 2520 ctgtcccagc gcgcagcctc ggatgacaga attcatccgt gcgcgttttt ctcacaccga 2580 ttatctcccg cagaacgaaa ttacgacatt ggtaataggg agttgttggc tgtcaaactc 2640 gctttggagg agtggcgtca ttggttggag ggttcggggg ttccctttat cgtttggacc 2700 gatcataaga acctcgaata catcaggtcc gccaaacgac ttaactccag gcaggctcgg 2760 tgggcattat ttttcggacg ttttgacttt accatctcat attgaccggg ttctaaaaac 2820 ataaaacccg acgcgttatc ccgtcttttt gattcttccg agcgcaatac gcctcttgag 2880 cccatcgttc cccagaggat ttgtatttct gcagtaacat gggagatcga gagcagggtc 2940 cgcacagccc tggatggggt aacgcccccg gttggatgcc cacccagtcg tttgtttgtg 3000 cctgaggaga ttcggtctga cgtcattcgg tggggacatt cctccaaagt ggcttgtcat 3060 cctggggtga gtcgtacctt atttttggtt aaacagcgat tttggtggcc agctatggca 3120 cgggacatac gcgagtttgt tttggcctgt tccgtttgtg ctgtttctaa gacttctaat 3180 cgcccccctg ctggactcct tcagccgctg tcagtgcctt cgagaccctg gtcgcacatt 3240 tcgctagatt tcgtcactgg tctcccgcca tctggtggca atacggctgt tttgaccgtt 3300 gtggaccggt tctcgaaggc tactcatttc attcctctgc ctaaattacc ctcagccaga 3360 gagacagcgg aggctgtcat taatcacgtc tttcgcattc atggcctccc gacggacgtg 3420 gtttctgaca ggggacccca gtttgtctct aaattttgga gagaattctg tcatttattt 3480 ggagccactg taagtctttc ttctggttta catcctcaga ataatggaca aacagagagg 3540 gccaatcaag atctcgagcg catgttacgt tgtttggttt cccaaaatcc ctcctcttgg 3600 agtcagcagc tcccatgggt ggagtacgca cacaattcat taccagtgtc ggctacgggc 3660 ctctctccgt ttcagtgtag tttaggatac cagccaccag cttttcccag tctggaatcc 3720 gaaatcgcgg tcccctctgc ccacgccttt gtccagaggt gccgacgcac ttggagcaga 3780 gccagacaga ccctcctcca agtgggtgcg cgcaccaagg ctaaagccga tcgccaccgg 3840 tcgaagcctc ccgtttacgt cgtcggtcaa aaagtgtggc tttcaactaa gaatattcct 3900 ttgcgttccg tatgtaataa acttgcacct aaatttattg gcccattcac tgtcatcaag 3960 atcattagtc cggtggcagt ccgcctcaaa cttcctccag cgtacaggag aatacatccc 4020 gtgtttcatg tttccaaatt aaagcccgtt tttcatacgg ccattaaccc gccctcaccg 4080 gttccccccc ccgccgcgtc tcgtagatgg ggagaccgtt tattcggtta agcgcattct 4140 ggactcgaga cggaggggac gaggatttca gtacttggtg gactgggaag gttatggtgc 4200 ggaggagagg agttgggtcc ctgcaaaaga catattggat cactctctta tcgatgatta 4260 caatcgccag gtaagctctt ctgggagcac caggaggtgc tcgtaggaga ggaggtaa 4318 // ID Gypsy155-I_DR repbase; DNA; ZEB; 3660 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy155-I_DR; KW Gypsy155-LTR_DR; LTR68_DR; LTR_Retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3660 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1571-1571 (2008). XX DR Genome; chr4; Positions 7943910 7947569. XX CC Positions [1123-1644] - Integrase core CC 'ATATAT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 148..3573 FT /product="Gypsy155-I_DR_1p" FT /translation="MAEFDLNSFVSAPSVEQLDTFRKDDLLKIAEHFQIKI FT SRQQLKREIKGVIVRHLKELGLLVLAESSPGADYVSADSAHIGAEEASETA FT VAEGYEAKAVLPPFEPFSPSEVESGGDVRLKTRIARLQKEERERESHAERE FT LRLEIRRLEIESETQIKLRELELNAARHAPVSPGQLAQNAALSSAVTSSVG FT TFDVSKHISLVPQFRETEVDSYFNVFERIACALKWSKEVWPLLLQCRLTGK FT AQEVCSALSLEDSLNYDVVKAAILRAYELVPEAYRQRFRMHKKHSNQTYVE FT FAREKGILFDKWCTANKVSDFQALRELILLEEFKNCIPERVVIYLNEQKVT FT SLAQASVSADEFALTHKNVFLLARTEKNSVPSVEKDQSRLKNKNAQIREVR FT ECFYCHKTTLPKVVQSDRGTNFMSNLFEQVLKTLKISHRTSSAYHPESQGA FT LERFHQTLKSMLRKYCFETGKDWDEGTPLVLFAVREAVQESLGYSPADLVY FT AHSIRGPIKMLKEEMLSQETSEKTTVLDYVSRFRERLHNACSAAKEALVSA FT QITMKSHFDRKTVVRDFKEGDKVLVLLPVVGSSLSSRFSGPYEVIKKLSCT FT DYVIGTPDRRKKTRVCHVNMLKTYVREASNNEAKLDQPLSDAPVTSLVCDV FT MSDHEDDGVKVRHTYEQCARFKNSEILADLDSSLFHLSDAQRCDIKALIEA FT FPLLFRDVPSCTTVLQHDIDVGNSAPVKQHPYRVNAVKRSVMQTEVKYLRE FT NGLAKPSCSPWSSPCLLVTKSDGSARFCTDYCKVNALTVPDCFPLPRMEDC FT VDALGTAKFVSKLDLLKGFWQIPLSDRASDISAFVTPDDFMQYCVMAFGMR FT NAPSTFQRLINTVLMGVRNCNAYLDDRVIYSTDWSEHVSTLREVFMRLEKA FT SLTLNLAKCDFGKATITYLGKEVGQGQVKPIGAKVSAISEFPVPTTRKELR FT RFLGMSGFYRNFCKKFSIVVSPLTSLLSPSRVFQWSDECQAAFESVKSLLC FT SDPVLAAPVYEKAFTLEVDASSVGAGAVLIQEDTHGIEHPICFFSRKFNKA FT QLNYSTIEKETLALLLALQYFDVYLSSSNLPITVFTDHNPVVFLSRMHNQN FT QRLMRWSLIVQGYNLMIKHKKGSENVLADALSRV" XX SQ Sequence 3660 BP; 1030 A; 687 C; 857 G; 1086 T; 0 other; tttaattggt gtcagcggtg agattatatt taaatttatc tctttatggg atgaggtgag 60 acaatgtgtg tttgacggcg gatcaatttt gagttttgag tgaacgaaga ttagtgattg 120 cgtggagttt tagagcatta tatcagtatg gctgagtttg atttaaacag tttcgttagt 180 gcaccatctg ttgaacaatt ggataccttt cgaaaagatg atttattaaa aattgctgag 240 catttccaaa ttaaaattag tagacaacag ttaaagcgtg aaataaaagg agtcattgtc 300 cgacatttaa aggagctggg tttattggtg ctggctgaaa gctcacctgg cgctgactat 360 gtgtctgctg actctgctca cataggtgcg gaggaagcga gcgaaacagc tgttgcggag 420 ggctacgaag ctaaagctgt tttgccgccc ttcgaacctt tttcaccctc agaggttgaa 480 tcgggaggtg atgtgcgcct caaaactcgc attgcgagac tccaaaaaga agagcgtgag 540 cgagagagtc atgctgagag agagctgcga cttgagatcc gcaggcttga aatagagtcg 600 gagacgcaaa tcaagttgcg agagcttgaa ctgaatgcag ccaggcatgc acctgtttct 660 cctggccaac tagcgcagaa tgcggctctt tcatctgccg tgacatcatc tgtcgggacg 720 tttgatgtga gtaaacatat ttctcttgta cctcaattta gagagactga ggttgactcc 780 tattttaatg tattcgaacg aattgcttgc gctttgaagt ggtctaagga agtttggcct 840 ttactgttgc agtgcaggtt aacaggaaaa gcacaagagg tatgttctgc gttgtcctta 900 gaggacagct taaactacga cgttgttaaa gcggcaattt tacgagcgta tgagctcgta 960 cctgaggcgt atagacagcg ttttagaatg cataagaaac attctaatca gacctatgtt 1020 gaatttgcga gagaaaaagg tatattgttt gacaagtggt gcacagcgaa taaagtctcc 1080 gattttcaag cgttacgtga attaattctc ttggaggagt tcaagaattg tattccggag 1140 cgtgttgtaa tttatcttaa tgaacagaag gtaacttctc tggctcaagc ctctgtttct 1200 gctgacgaat tcgcattaac ccacaaaaat gtttttctgt tagctcgtac tgagaaaaat 1260 tctgttccat ctgttgaaaa agatcagtct cgtttgaaaa ataaaaatgc acaaattcga 1320 gaggtaaggg agtgttttta ttgccacaaa actactctgc ccaaagttgt gcagtctgat 1380 agaggaacaa attttatgtc taatctattt gaacaagtgt tgaaaactct aaagatttcc 1440 cacagaactt caagtgcata ccatccagaa agtcagggag ccttggaacg atttcatcag 1500 actttaaaat caatgctcag aaaatattgc tttgaaactg gtaaagactg ggatgagggg 1560 acacctctag ttttatttgc tgtgagagaa gccgtgcagg aaagtttagg atatagcccg 1620 gcagatctgg tgtatgctca ttcaattcgg ggcccgatta agatgctgaa agaggagatg 1680 ttatctcagg agactagtga aaaaacaact gtgcttgatt acgtaagtag atttcgcgaa 1740 cgtcttcaca atgcttgctc tgctgcaaaa gaagccttag tgagtgcaca aataaccatg 1800 aaatctcact ttgatcggaa aactgttgtg cgcgatttta aagaaggcga taaagtgctc 1860 gttttattgc ctgttgttgg ctcttcacta tcttcacgtt tttcgggacc gtacgaggta 1920 attaaaaaat tgagttgcac agattatgta attggtactc cggatcgccg caagaaaact 1980 cgcgtgtgtc atgtaaacat gctgaaaact tatgttcgag aggcatctaa taatgaggcg 2040 aaattagacc agccgctctc tgatgcacct gtcacttcgc tggtctgtga tgtaatgtct 2100 gatcacgaag atgatggcgt gaaagttaga cacacgtacg aacaatgtgc acgttttaaa 2160 aactccgaaa tcctggctga tttagattct tctctgtttc atctatctga tgctcagaga 2220 tgtgacatta aagcgcttat agaggcattt cctttacttt ttcgtgacgt tccttcttgc 2280 actacagtat tgcaacacga tattgatgtg ggtaattcag cccctgtgaa acaacatccg 2340 tacagagtta atgccgtgaa acgttctgtg atgcaaactg aagtaaaata cttgcgtgag 2400 aatggtttag ccaaaccaag ctgcagtcct tggagttctc catgtctact ggtgactaaa 2460 agtgacggat cggcgcgatt ttgtacagac tattgtaagg tgaatgctct gactgtacct 2520 gattgttttc cgttaccacg catggaagat tgcgtagatg cacttggaac tgctaaattt 2580 gtcagtaagt tggatttact caaaggtttc tggcaaatac cgttatctga tcgcgcatcg 2640 gacatctctg cttttgtaac ccctgatgac ttcatgcagt actgcgtcat ggcattcgga 2700 atgcgtaacg caccgtcaac attccaacga ctcatcaaca ctgtacttat gggagtgcgt 2760 aattgtaatg cgtatttaga tgatcgtgtg atctattcca ctgactggtc ggaacacgtt 2820 tcaacgcttc gtgaagtatt tatgcgtctt gagaaagcat cgctaacttt gaatcttgct 2880 aagtgcgatt tcgggaaagc cacaataaca tatttgggta aagaagtggg gcaaggacaa 2940 gtgaagccta ttggagcgaa agtgagtgct atctctgagt ttccagtacc taccacacgc 3000 aaggagttgc gtagattttt gggaatgtct ggtttttacc ggaacttctg taaaaaattt 3060 tccattgtcg tttctcctct aacttccctc cttagccctt cacgtgtgtt ccagtggtct 3120 gatgaatgcc aggcagcctt tgaaagcgtc aaatctttac tatgcagtga tcctgtgttg 3180 gcagcaccag tttatgagaa ggcttttacg ctggaggtgg acgcaagctc cgtgggagcg 3240 ggcgcggtac tcattcagga agatacgcac ggtatagagc atccaatttg ttttttttca 3300 cgcaagttta ataaagcaca gttaaattat tccacaatcg aaaaagaaac gttagcgtta 3360 ctactggcat tgcagtactt tgatgtctat ttgagttcta gtaatctgcc cataactgtc 3420 ttcacagatc ataaccctgt ggtatttctg tcccgtatgc acaatcagaa ccaacggctt 3480 atgcgctggt ctctgatcgt tcaaggttac aacttgatga taaaacataa aaaaggatca 3540 gaaaatgtac tggccgatgc attatctaga gtgtgaagaa aaagtggttg aaagtaaaat 3600 ttgttaaaac tatattgttt tggaatttca tccattatag ttttatctta agcaggggag 3660 // ID EnSpm-5_DR repbase; DNA; ZEB; 7671 BP. XX AC . XX DT 31-JUL-2008 (Rel. 13.07, Created) DT 31-JUL-2008 (Rel. 13.07, Last updated, Version 1) XX DE EnSpm-5_DR is an autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; EnSpm-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7671 RA Kapitonov V.V. and Jurka J.; RT "Zebrafish En/Spm DNA transposons."; RL Repbase Reports 8(7), 753-753 (2008). XX DR [1] (Consensus) XX CC EnSpm-5_DR is a young family of autonomous En/Spm DNA CC transposons. The consensus sequence was derived based on multiple CC alignment of several copies of EnSpm-5_DR that are less then 2% CC divergent from each other. EnSpm-5_DR transposons are CC characterized by 2-bp target-site duplications and imperfect CC 12-bp terminal inverted repeats (1 mismatch). See also commentary CC on EnSpm-2_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 4629..7058 FT /product="EnSpm-5_DRp" FT /note="En/Spm transposase." FT /translation="MMFRCYICKSLFATTVKLIRHLKLVHSLYPGKKFELV FT CAQDGCFLQFKSYAGFKKHLTRFHFRGVASSSGNISIQNQHAPNPDEALVS FT DTEQMTDNDAQVVECSSLSRDSSREMCASIIAKLQANGVANNLILSVIESM FT EDFVNDFHTNLRESILNVVPVGSTTRSSVEEVFNNFHNSFSDFNTDTKWRK FT YFSKKWGVVEPTEIHLGVRYDSRRNTTSGMYEQTTVNDTFIYIPLLKTLEF FT IFRNEEVCKLIQEQRQSDMYREFCDGQYYKRHPLYSICNNALQIQIYYDDF FT ETANPLGSKQGIHKLGCIYFVLRNLPSHVNSSLMNIHLISLFHSEDGKKYG FT MDKILGPLIDDVKVLEQKGMKVSFSEEPIFGTIAQITGDNLGLNGILGYVE FT SFSAKHYCRLCLTDKVLAQEVFSEDDPRVILRNRNQNEKHYKYLADNPNEN FT SCYGIKRNSILNTLTYFNVSENFVLDIMHDILEGVAQYEVKLLFEYMSCGL FT IPCDSIPQRLYAFNYGFLDRNNRPTKVNLQHPGNSIGLNASQTLCLIRNIP FT LIFGDVVPEGDKHWHLMLLLLQIVDIVFSHCISEGMIIFLKHLIVEHHRLF FT KDLYPSRNLIPKHHLMIHYPECIRQIGPLIHVWTMRYEAKHRFFKKNLKNF FT KNLTKSLAKKHQLAIAYYWESCTSRGIESGPVSNELLSDLEKCDFISEQLN FT IDLSSEVTITPWVKCHGTEYRKNLVVCLDLVEDVPVFGKIVQILIKDGVYF FT LVSCMESEFVEHLHAFRVFAQEHSLVLKKTEQLFYYKPFDLQMTYGNDSSF FT YIVLDCYL" XX SQ Sequence 7671 BP; 2404 A; 1280 C; 1457 G; 2530 T; 0 other; cacagcaaat ttaccggtgt tgatttactg gtgttacaac aaagtgttaa aataaagtgt 60 tgcgtcaacg ctatacagat tgagttaaac accaaaagcg tcagtttcga ttaagagttg 120 gtgtcatttt atatcgttgc aaaggcaact ccttaagcgt taaccaggcc acacctccat 180 gaacgtgcag acgtcctttt gattcaacca gaccacatgc agacgccatt tcaaatgctg 240 caagtggaag aaagttggta agtctaatat ataaatctta ttatacagat tcattgtgta 300 aattaaataa tatgtttttg tatcttagtt taatcatttt ttttcataac gatcagtata 360 tcaagcgtta cacagccaaa taattcagac gtgattttta ttaagttagc ctatatcagc 420 aattttatga atcatgtaaa gccaccataa cgcttatatt aatattatta tattataatt 480 ctcgcacacg gcgattgctt tatttatttt atttaccgta tgcacatgtc tggacataag 540 cacagtgtag ttaggctcgc gcgctaagac cgccgtccat tttagcaggt aacgttattc 600 aaggtgaact atcgcgtgtg gatttcttgt agtgttcaac aagtaaatgt gcacatttca 660 aagacagcgg ctgaactact ataacttgtt ttaatccaag gtacatgctt gggttcctgc 720 tgaatggtgg ccgactgacc gtggatcggt gagagttttc gctgttaaaa agtaccgtgg 780 taagttttta atgctgcaac gaaagtcagt gattgctcac ttcggtgata gaacgcacgt 840 tgtggaatat aaacacacag ttcaagactt cagatgaatg aaacgactaa tctacattat 900 cgacaaaaac tgttgtcaga tattttaatg taagggcata atcttacgcg aatgtaagag 960 attgcaataa caaggtttag ccgtcaaggc gctatagctt aagacattac agctgccagg 1020 aaacaataga agaagacagc gcgcgtaacg gtagtcagaa tgagtccaaa acagacgata 1080 atacattcat aaaactccct ccaatatatc attgtcgtgt aagtgtgctg caaaacaacg 1140 aacgttaact ttaagttgtc tttatttata gaaaaattat ttattcttac aatttatagt 1200 ccaattatat tacataaacc aaatataatg tgtttatttc tctccacacc aagagcttca 1260 ccagaggtca agacaatgct tgtcaaagca caatatcgaa accaaaagaa gttcataaaa 1320 atcccagagg cttgctttga tttattttta gcagagggtg agtggtatta aaaataatat 1380 atatttatta aaaaaaagtt tctctatcat acgtatcatt aaacgtgaat aaagcattta 1440 acaataatat ttattttccc tcctttgcag tgaaagagaa gttttcatta ccacaggatc 1500 ttgtaattac agttacagat gaaactggta cagaggtgga cgaagatgtg tttcctgatc 1560 taatgagcac cagtggactg cttcttgtta taaatgcttt aaatgacagt ggtgagtgat 1620 aacagtggag ctattgaatg atgtaagatg tgtctaagtg aatgctgcca attttcagta 1680 ctttaccata tttttaatgt cggcagtgtt gatgtgggta gaaatacagt cccataatgt 1740 aaactagtat tgagtgttgt tttttttttt ttttttacaa gtacttagac ctgtcttttt 1800 attttgatcc agtattaaaa aacctgttgc ctgttaaccc cagttttaac attaacagga 1860 ccctccacac ctcaatcttc agccagcttt gatacagata cactctcttt aacttcaaaa 1920 agcagtgagg acagtgactg gtttagtcca aaacgcttta gaaaggatga ggatgaggct 1980 tcacaaaact cacaggccag agatgtgagt gcaagtacac aatttggggg gtttacttgc 2040 agtatcataa ccataagctt tacataatca gttccattaa tttatgtatt atgtgtatag 2100 tactgacctt ttccagatca ggattgaatt taaatggcat tttcaatatt cctgcatttc 2160 tacctttata ataggatgtt aatgctaatt tctttttata gttgatcaaa caaattctag 2220 agacaagacc aggtggagca aatgtgctga gggagtatga agatacgggg acaattactg 2280 atgagacaag aaaagttatg gtgaacattc tagtagccca catgatggag acagaagggt 2340 aattatgaaa aacaatgttg ttgttttttt atggttatag cagtatacaa tagtcaacat 2400 ttgaaatgaa tcaaaatctt ttatcaaagt attaaatcta ttgtactaca cccattcttg 2460 tcaaaggaca attttgaaaa acattttttg atacacatca aatgttgact actatataga 2520 ttataaattg tatagtgtga agttgagtgc tagaaatgta atagtgctaa tcttttaaac 2580 atttaattga acaaatttta ataacatttg ttaattctca tcaatgttaa gtaacattga 2640 gtactcctgt tcttcaggag agttccacac cgccttacca aacagaagta tggtttggga 2700 attatcacct tgtttccctc actcagagat ccccaaggga ggactggata tgtaagttct 2760 tgtctgtcgt cagaagatca ttatatatat atatatatat atatatatat atatatatat 2820 atatatatat atatatatat taatggagtg ttaaatgttg aatctgaatt tgtgtcattt 2880 tgaatttatt aatttttgaa tttattaatg tcaggaacat ttttatgatg gccagaaaaa 2940 ctctggattc ttgtcgtggc gactcaaaac cgtacaaagg gggacaaaac cttctggaaa 3000 taaagatgat cctaaaacag aggaaaaggg gggtcctttg cttgaccgac agctgtgtca 3060 ccctgaagat cagctggaca gtgatcaaag tctagaggcc atctctctga tgaatcatac 3120 aagcgaccgt gaggtcatca tgcagaagat gagggcaaca tttaaataca gacaacgtct 3180 tgtgcgtgat ccagagaaat ccaccactat tctctctgtt tttcctcgat ttctggatac 3240 gaaaggattg gtatgaattg tctttttaaa tatattttat ttataaatat aaaaaaattt 3300 aattgaaatg ttagtttcac agatgtggtg aaatagtgac catcacattt agcttgtaat 3360 attttatagt aatggaatgt tttcttctga tttctcagat tctccaggac tttacactcc 3420 tgttcgggtc agaaacggct tccagactct tggaaaggtg gccaacagtt ttcaagtcaa 3480 agattatcaa gctagcagaa accttgacat ccactccact gctgaaaaga ttgctgtcat 3540 ctgccaaaca gaacaaagaa agcagcgagg cagaggattt cccaggtaca gatgtaactg 3600 atgtaatggt gcagtgaaaa tcttttgaat gttattttac tgtctatgtt taacttttta 3660 cttgtctttt tcagagtggg acagtgacat atcatccttt cttctacttc tgcatttttt 3720 gtcgccccaa gtttctggaa gaaagaaaat ccagaagatg agcatctcac aggcaattga 3780 ccatctggtg gtttttcaaa aggtaggtaa ctggtgatga tacaagagct gtagacatta 3840 taccttcaat taatgcatta acttctccag ttgataatac actatgagat aagtgtaata 3900 tgaagtgctt gatttatcaa attttctaaa cctaaactgg aattaagctt gttggcgctt 3960 gtatcatttt ctgtgtcttt tgatcttagt gttcaattga gctaaacaga ctcaaactat 4020 gactgtaaga tgcacatatt gccagtaaag ccggttctat aatcagttct aaatctgaaa 4080 catcagggtt atatgttctg tttattaagt aacctctagg gatttcattt ctttgtttta 4140 tttggactta acctgtgaga gcctcagaga tttacaactg atcacagaat tgcttccctg 4200 acaaaatgac acatttaagt cacagctttt gcaatattgc aaaatcaatt taatgttgac 4260 tttgtagtca taatgttaac tgataatgct taatttgcct ttttttttcc tcaatcaggc 4320 atgccggagt attcaggaac acctcgagac agaagacaat cgtcaacctt acattcttgc 4380 cataggaagc agcaaggaag ccatcagtca gttcttcatt gtgcttgaca aaaagctcat 4440 cccatgtcaa gaatcctctt cactagcggc aatcgatgaa ctgttcaagg tccattttgt 4500 gttcagtatc agttatgatc ctccacttaa aggtttgttc acatttctcc agaccacagt 4560 gtacagaatt gatgttggca gcacaagtga aaccccaagg gtgaaggagc tcagagctaa 4620 attactgaat gatgtttaga tgctacatct gtaaatcact gtttgctaca actgtaaaat 4680 taattcggca ccttaaactt gttcatagtt tgtatcctgg caagaagttt gagttggttt 4740 gtgctcagga tggttgcttc ctacaattta aaagttatgc agggtttaaa aaacatttaa 4800 ccagatttca ttttaggggt gtggcatcaa gcagtggtaa catatctatt caaaatcagc 4860 atgctccaaa tcctgacgaa gcacttgtgt ctgacacaga gcaaatgaca gataatgatg 4920 ctcaagtggt tgaatgttct tctctttcaa gggatagttc aagggaaatg tgtgcatcca 4980 taattgccaa attgcaggct aatggtgttg caaacaattt aattttgtct gttattgaaa 5040 gcatggagga ttttgttaat gatttccata ctaacttaag ggagagcatt ttaaatgttg 5100 tgcctgttgg tagcacaact agaagttcag ttgaagaggt ttttaataat tttcataatt 5160 cattctcaga ttttaataca gacactaaat ggagaaaata tttcagcaag aaatggggag 5220 ttgttgaacc cactgaaatt cacttgggag tcaggtacga ttcaagaaga aatacgacct 5280 ctggaatgta tgaacaaaca acagttaatg acacctttat ttacatccct ctgctcaaaa 5340 ctctggagtt catttttaga aatgaggaag tatgtaaact gattcaagaa caaaggcaaa 5400 gtgacatgta tagagaattt tgtgatgggc agtattacaa aaggcatcca ttgtactcca 5460 tctgtaataa tgcacttcag atccaaattt actatgatga ttttgaaaca gctaatccat 5520 taggatctaa acaaggaatc cataaacttg gctgtatata ttttgtgctt cgtaacctac 5580 catcacatgt aaattcatcc ttgatgaaca tacatctaat ttctttattt cattctgaag 5640 atggaaaaaa gtatggcatg gacaaaatac ttggcccatt gattgatgat gttaaagttc 5700 ttgagcagaa gggaatgaag gtgtcttttt ctgaggagcc catttttggt acaattgctc 5760 aaattactgg cgataacctg ggtctaaacg gtattcttgg ttatgttgaa tctttttctg 5820 caaaacatta ctgcagattg tgccttactg acaaggtgtt agcacaggaa gtatttagcg 5880 aggatgatcc aagggtgatt ttgcgcaacc gaaaccaaaa tgagaagcat tacaagtatc 5940 ttgctgacaa ccctaatgaa aactcctgtt atggtattaa aagaaattct atactcaaca 6000 ccttgactta ctttaatgta tctgagaact ttgtgctgga cattatgcat gatattctgg 6060 agggagtggc acagtatgaa gtaaaactgc tttttgaata tatgagttgc ggtctaatac 6120 cttgtgattc cattcctcaa agactgtatg ccttcaacta tggttttttg gaccgaaata 6180 accgcccaac caaggtcaac cttcagcacc caggaaacag tattggactt aatgcaagtc 6240 agacattgtg tctcatcagg aacatccctc tcatatttgg agatgtagtt ccagagggag 6300 ataagcactg gcatttgatg ttgctgctct tgcaaattgt tgacattgtg ttttctcatt 6360 gcatttctga aggaatgatc atatttttga agcatctgat agtggagcac catcggttgt 6420 tcaaagactt gtatccttct agaaatctga ttccgaagca tcaccttatg attcactacc 6480 ctgaatgcat caggcaaatt ggcccattga ttcatgtatg gacgatgagg tacgaagcaa 6540 aacacaggtt ttttaagaaa aatttaaaaa acttcaaaaa cctcaccaaa tcgcttgcaa 6600 aaaaacacca gctggcaata gcctattact gggaatcctg tacatcgagg ggtattgaat 6660 ctggtcctgt ttcaaatgag ttgttgtctg atcttgaaaa gtgtgatttt atttcagagc 6720 aactcaacat tgatctctct agtgaagtaa ctatcactcc ttgggtaaag tgtcatggta 6780 cggagtatcg aaaaaatctt gttgtttgct tagatttggt tgaggatgta ccagtctttg 6840 gaaagattgt tcaaattttg atcaaggatg gagtttattt tttggtctca tgtatggaat 6900 cagagtttgt tgaacatctc catgctttca gggtttttgc acaagagcac agccttgttc 6960 tcaaaaaaac agaacaattg ttttactaca agccatttga tttgcaaatg acatatggca 7020 atgacagttc attttatatt gttttggact gttacttgta aagtaatttt attggtggct 7080 tggtgttcca tgttggaata aaattttgaa agagtaatgt agtaagatgt gttttcttca 7140 cagtgaattt attattttta aataaaaaaa tttagaagcc ttaatatagt tatccataaa 7200 aaagtaacat ttcttaaata gtactgtcaa catttaagta atgtaatgtt tgattcctat 7260 aagcttgaat agattgtgat ttatttatta tttatttttt tgaacagtgt tcactcctga 7320 acagtgtagg attttaactc tgtcagagtt tgcaacaaag ttcctcctag ctctgtagca 7380 gtgttgttgt tatctgcgtt ggtgttaata cttcactctt atagtgttaa tattttgaca 7440 ctgtgttggt gttaaaagag gcaacactgt agcagtgttg ttgttattat ctgcattggt 7500 gttaatactt cactcttata gtgttaataa tttgacactg ttttggtgtt aaagaggcaa 7560 cacttttgaa agtgttgaat taacactttg cgggtggttc ccatatatac acttttaaag 7620 tgttgaaatt aactctattg gtgttaattt ccggatttca aatttactgt g 7671 // ID Gypsy135-I_DR repbase; DNA; ZEB; 4290 BP. XX AC chr14; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy135-I_DR; KW Gypsy135-LTR_DR; Gypsy135_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4290 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1531-1531 (2008). XX DR Genome; chr14; Positions 9618575 9614286. XX CC Positions [3183-3662] - Integrase core CC 'GTATG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 19..939 FT /product="Gypsy135-I_DR_1p" FT /translation="MDPARESPICSAVEHQGAMLGRHAEELSTARHAVEDL FT AAQVADLSSQIHTLRHNPGFPQPTRLSSEPRVNNPSTYAGEPTQCRAFLTQ FT CEVVFTLQPYTYAEDRARVAYVISLLSGRAREWGTSLWEAESEICTHFDAF FT KEEMLKVFDRSVHGREASRLLSTLRQGHRPITDYAIEFRTLSTTCGWNEAA FT LSARFLDGLNALLKDELYAHDLPADLDSLIELSIRMDKRLAMRRRARSPHT FT DVRGVSPSQDSKPFERHASSSVPMEVGGVHLSPSERQRRIHLGLCLYCGGS FT GHRFSVCPVKANARQ" FT CDS join(954..2183,2187..4274) FT /product="Gypsy135-I_DR_2p" FT /translation="MASASTQSPNAPTRTLLPITLCWEGSSVSCSALLDSG FT AEESFIDESFAAKHKVPLQPLRDPPTVYALDGRVLSRVFHSTSPVSLHVSG FT NHRETLTFFVICSPATPIVLGHSWLVKHNPQIDWIKGSIVSWNLSCYANCL FT VSAVSPVSSPSVLQEEPVEISGVPEVYHDLRAVFSKSRAEALPPHRPYDCS FT IDLLPGTTPPRGRLFALSAPEREALDKYLSESLAAGTIVPSSSPAGAGFFF FT VKKKDGSLRPCIDYRGLNEITIKNKYPLPLMSTAFDILQGARIFTKLDLRN FT AYHLVRIKAGDEWKSAFNTPFGHFEYRVLPFGLVNAPAVFQALINDVLRDM FT LNIFVFVYLDDILIFSPDLPTHIQHVRRVLQRLLENRLFVKSEKCDFHTRS FT VPFLGYIISDKGVSMDRLDYYRLAIIDWPIPESRVALQHFLGFSNFYRRFI FT RNFSQIAAPLTALTSAKTRFEWSDSAQQAFDRLKRMFASAPILITPDTERQ FT FIVEVDASDVGVGAVLSQRSAEDNKVHPCAFFSHRLTPAERNYDVGNRELL FT AVRLALGEWRHWLEGASIPFVVWTDHRNLEYIRSVKRLNARQARWALFFNR FT FNFTISYRPGTKNIKPDALSRQFTVLEDKPLPEPIIPIDMVVGAVSWGIEN FT SVKRALTRTQAPRNCPSGKLFVTRPLRAAVLRWGHTSKLTAHPGIRRTLAV FT IRQRFWWPAMIQDVRRFVKSCHVCAQNKSSNSATTGLLRPLPIPSRPWSHI FT ALDFITGLPPSKGNTVILTVVDRFSKAAHFIPLPKLPSARETAQVIINNVF FT RIHGLPVDVLSDRGSQFTSQFWREFCRQLGASVSLSSGYHPQTNGQTERAN FT QDLGRILRCLAARNPSSWCEQLPWAEYAHNSLPSSSTGLSPFSCCLGYQPP FT LFPTQERETAVPSVQAFIQRCRRTWESTVSALRRSGERIRRSANRHRTRAP FT RYVCGQKVWLSTRNLPLKAPSRKLVTRFIGPFTIDKIINPAAIRLKLPSYL FT QRIHPVFHVSCIKPVFHSSYSSPPSPSPPIVVDSPPIYKVRRLLDVRRRGR FT GQQFLVDWEGYGPEERSWIPSRDILDRSLIEDFFRSRQSFSSGAPGGAS" XX SQ Sequence 4290 BP; 882 A; 1219 C; 991 G; 1198 T; 0 other; ggattcacta gccataccat ggacccagcg agggagtctc ctatctgctc tgcggtcgaa 60 catcaaggag cgatgttggg cagacacgcc gaagaactca gcactgcaag acacgctgta 120 gaggacctgg ccgcgcaagt agcggaccta tccagtcaaa ttcacacgtt acgtcataac 180 ccgggattcc cacaaccgac ccgcctttcc tcggagccga gggtcaataa tccatccacg 240 tacgcgggtg agccaacaca gtgccgtgct tttctcaccc aatgtgaggt tgtgtttaca 300 ctgcagccct acacgtatgc tgaggatcga gcccgtgtcg catacgtcat ctctctcctc 360 tcgggcagag cgagggaatg gggtacctct ttgtgggagg ccgagtcgga gatttgtaca 420 cactttgacg cctttaagga ggagatgctg aaggtcttcg atcgttcggt tcatggccgg 480 gaagcgtcac gtcttttatc cactctccgc cagggtcacc ggcccataac cgattatgcg 540 attgagtttc gcaccctctc cactacttgc gggtggaatg aagccgccct ttctgctcgc 600 tttctggatg ggttgaacgc cctgttaaag gatgaactct acgctcacga tcttcccgcc 660 gatctggact ctctgattga gctctccatt cggatggaca agcgcctggc gatgcgtaga 720 cgcgctcgct ctcctcacac tgacgtcaga ggggtttccc catcgcagga ctccaaacct 780 ttcgagaggc acgcatcctc ctcggtgccc atggaggtgg gaggggttca cctgtctccc 840 tccgagagac aacgacgtat tcatctgggt ttgtgcctct actgcggagg ttctgggcat 900 cgtttttccg tgtgcccagt aaaagccaac gctcgccagt gagtcgggac ctcatggcga 960 gcgcttccac acagtcccct aacgccccga ctaggacctt gttacccatc acactttgct 1020 gggaggggtc ctcagtttcc tgctccgcgc tgctggattc cggcgcggaa gaaagtttca 1080 ttgatgagtc ttttgccgct aaacataagg tgccactcca gccacttagg gatccgccca 1140 ctgtgtatgc cctagatggt cgtgtattgt ctagagtgtt tcactccacc tctccggtga 1200 gtttacatgt ttctggcaat catcgagaga ctttaacttt ttttgttatt tgttcccccg 1260 ctacacctat agttttgggc cactcttggc tagttaaaca taatcctcaa atagactgga 1320 ttaaggggtc tattgtatca tggaatttgt cctgttatgc aaattgtctt gtatctgctg 1380 tgtctcctgt ctcctctccc tctgtgttac aggaagagcc tgtagagata tcaggtgttc 1440 ctgaagttta ccacgatctg cgagcggtct tcagtaagtc ccgagccgag gcccttcctc 1500 ctcacagacc atatgactgc agtattgacc tcctcccagg taccactcca cctcgggggc 1560 gtctcttcgc cttgtccgca cccgaaagag aggcgctcga taaatattta tccgagtctc 1620 tagccgcggg tactatagtt ccgtcatcct ccccagccgg tgctggtttc ttttttgtca 1680 aaaagaaaga tgggtcttta cgcccatgca ttgactaccg cggacttaat gagatcacta 1740 ttaagaataa gtacccttta ccgcttatgt ctactgcctt cgatatcctt caaggggctc 1800 gcatttttac caaattagac ttgcgcaacg cttatcacct tgtgcgaatt aaggccgggg 1860 atgagtggaa atcggcgttt aatacaccct ttggtcactt tgaataccgg gttcttccct 1920 tcgggctggt aaacgctcct gctgtatttc aggcactcat aaatgacgta cttcgcgata 1980 tgctcaatat ttttgtcttc gtctatttag atgacatcct catcttttcg ccagatttac 2040 ctactcacat ccaacatgta cgccgtgtac tccaacgtct attagagaac cgtcttttcg 2100 tcaagtcaga aaagtgtgat tttcatactc gctctgtccc atttctcgga tatataatct 2160 cggataaggg cgttagcatg gactaacgat tggattatta tcgattggca attatcgatt 2220 ggcccattcc tgagtctcgc gtcgctctcc agcatttctt gggtttttcc aatttttatc 2280 gccgctttat tcgtaacttt agccagatcg ctgctcctct cacggcgctt acatcggcca 2340 agactcgttt cgaatggtct gattccgcgc aacaggcgtt cgaccgtctc aaaaggatgt 2400 tcgcatccgc ccctattctc atcactcccg ataccgagag acaattcata gtggaggttg 2460 acgcctccga tgtaggtgtt ggcgccgtac tttcccaacg ctccgcggaa gataacaagg 2520 ttcacccgtg cgctttcttt tcgcatcgtc tcactccggc tgagcgcaac tatgatgtcg 2580 gtaatcgcga acttctcgcc gtccgtctcg cgctcggtga atggcgtcac tggcttgagg 2640 gtgcctctat cccatttgtt gtatggacgg atcaccgcaa cctcgaatat attcgatctg 2700 tgaaaaggct taacgcacgt caagctcgtt gggcactatt tttcaatcgg ttcaatttca 2760 ccatttctta tcgcccgggc accaagaata tcaaaccgga tgcactctcc cgtcaattta 2820 cggttttaga ggataaacct ctccccgaac ccatcatacc tattgatatg gtagtaggag 2880 ccgtatcctg gggtatcgag aactctgtta aaagagcgct cactcgcacc caagctcctc 2940 gtaactgtcc gtcgggtaag ctctttgtca ccagaccctt acgcgctgct gtcctccgtt 3000 ggggacacac ttctaaatta acggctcatc caggcattag gaggacgtta gccgtcatcc 3060 gtcaacgttt ctggtggccc gctatgattc aagacgtccg acgttttgtc aaatcttgcc 3120 acgtttgcgc gcagaataaa tccagtaatt ccgctaccac cggattattg cgaccactcc 3180 ctatcccgtc tcgcccctgg tcgcatattg cgcttgactt cataacgggt cttcccccgt 3240 caaagggcaa tactgttatt ctcactgtcg ttgatcgctt ttctaaagcg gcccacttta 3300 tcccgttgcc caagctccct tccgcccgtg aaacagcgca agttattatt aacaacgtgt 3360 tcagaattca cgggttacct gttgatgtcc tatccgaccg cggttcgcaa ttcacttcac 3420 aattctggag agaattctgc agacagcttg gggcctccgt tagtctatct tcagggtatc 3480 atccccagac taatggccag accgaaagag ctaatcaaga tctaggtcga atacttcgtt 3540 gtcttgccgc tcgtaacccc tcttcgtggt gtgaacaatt gccgtgggct gagtacgcgc 3600 ataattcgct accgtcctca tctaccgggt tatcaccatt ttcttgttgt ctcggttatc 3660 aaccgcctct ttttcctact caggaaagag agacagctgt tccttcggtt caagcgttta 3720 tacaacgctg taggcgcact tgggagagta cggtgtccgc gctccgccgc tctggggaac 3780 gtattcggcg ctctgccaat cgtcaccgca ctcgtgcccc tcgttacgtc tgtgggcaaa 3840 aggtttggct ttccactcgt aatcttcccc ttaaagcccc gtctcgtaaa ttggtaactc 3900 gctttattgg tccgttcact attgataaga tcattaatcc agcagctatc cgtcttaaac 3960 taccttccta tttgcaacgc atccacccgg tgtttcatgt gtcctgtatc aaacccgttt 4020 ttcactcttc ctatagttcc cctccctccc cctctccccc cattgtcgtc gatagtcctc 4080 ctatctacaa ggtacggagg ttgcttgatg taaggcgccg gggtcgtggt cagcagtttc 4140 tggtcgattg ggagggatat ggtcctgagg agagaagctg gataccgtct cgggacatcc 4200 tggaccgctc gctcatcgag gatttctttc gctctcgcca gtctttctcc tctggtgcgc 4260 caggaggcgc ttcttgagag gggggaatac 4290 // ID DNA-8-26_DR repbase; DNA; ZEB; 2174 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-26_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2174 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2149-2149 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2174 BP; 679 A; 366 C; 384 G; 742 T; 3 other; cagagccggc ccaaggcata agcgaactaa gcggctgctt agggccccgt ggccaccagg 60 gggcccccaa gagtgcttga aattattgtg tgactttcgt tttagtttgt tgagttgtgg 120 attagtggtg ggacaaaata tatgcccatt tatagtgtta tttctgaccg cgatacatac 180 tcaggcgcca agatatttac ctaaatttac aaaagatctg atttaattaa tcagttttcm 240 acacygactg cagcagataa ccgctttagc tacagtacac ccagccgcca ccaggtggcg 300 cttcccgtaa tagcggtcgg cggatcatta gacatacacc agcgtcatgt aaaccactga 360 cagacagatg agccattgca atggtgaaaa tctcaggcat acagacactt tcttctgtcc 420 aagctgggcc aatgtatttg tgtgtttcag acatccagct cccccgaatt tgccactagt 480 ttctttctaa cagacgcgct ttcacttcgc tatatcgctc gcgcgtgccc gcggccagtg 540 caacttctta tgtgctcaaa ctttttaact ccgccaaaat atataaagat atcaatagga 600 catgaacatw taattagaag ggccttgcgt ttatttgact cttacagacc ttgtcaaagt 660 cttgtagatg gtattttggc aaaaaaatat taggggaatt aaaaaactta aatagatgag 720 taaataacta atgttctcca ctgctgaaaa catacgtgct cagaggagtc tgtgctcacc 780 tctcagagca ttctgatcag ttgaaaaaat agtgggaata catatttttt ttattaaaaa 840 taaaccaggg ctaagcataa aatgtttcca tgtttacata aaattagttg gtcaagttgt 900 aataagtgtt ttaagtttac agaaaattag aaagcaataa attactattt tggcagtttt 960 tttcttaaat gcatattcca ctaaattgag attctgctgc tgtgatagag atgtgctctc 1020 ttcagtcata aactgctctc tgcacatcag catttttaag cttgtttgac agatgtgcac 1080 ttgttttaca ttgttgcaat tctataaatg cgtgtttata aaagaaattc atcatacaat 1140 ttttgatgca ttttccccct tcatttacca aaattgtgat atattgtggc tggttttctt 1200 gtgatctatc aatatatcgc agaatcattg atatcgtgat tgaagcagca gtgcccggaa 1260 cagcagcaaa atgctttttc aatgaacttg gtgattaaaa aaaataagct tctgatttat 1320 acaattgaca gttggtaaat acactgcaaa aatgattttc tttcttatag tttttgtgtg 1380 atttttagtc caaatatcta aaaattctta aaccaataaa cattttctag aaagattttt 1440 agtcaaaatg atgtgaattt ttcattaaaa aaagctcaat aatctgccaa ttagggtaaa 1500 taaaaaatat tgttttaggt tttaaataag attattttac ttatctcatt ggcagattaa 1560 tttgttttac aaaaaaaaat tcacttaatt tagaattatt tctgaaaact gaacaatttt 1620 tgtacatatt tactaatttg attgatatgc gagtatgtct agaaaatgct tcctgattta 1680 gtttgatttt aatttaaata gatgttttaa gttatttgaa ctacaaaata caatatgaat 1740 tttttttcag tattaacatt tatatccaat ttaaaaaaat gtatctttat tgtcactatg 1800 tttgactgtt tggcagcgtc ttgtgcctga ataataataa aaataagtat gttattggct 1860 acttccagct gcttttgttt ctgccagccc atattaatgt tttgtgtcta attatttaga 1920 tttgtggcaa gtagtcatat aaaaagttgg atcaatagga agttgtttct gcagaataat 1980 atttagaatt gtcttaaaac atatatttta tgatgtaagc aacacagcta cagcgaataa 2040 aatcgataat gtgtgtttta aatttatagt gtgcgaatta tcaaggattg acgataattt 2100 gtcatattgt ttgcaccaag ggggccccaa ataaaatcct gcttagggcc ccctgaaggc 2160 ttgggccggc cctg 2174 // ID DNA-1-4_DR repbase; DNA; ZEB; 1825 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1825 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1213-1213 (2008). XX DR [1] (Consensus) XX CC The members of this family are ~97% identical to the consensus. CC It is characterized by 1-bp TSD and 39-bp TIR. The elements CC target to CAWTG site, and duplicate the central W. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1825 BP; 542 A; 394 C; 347 G; 540 T; 2 other; tagccgcatt tccactatcg ggccagtgcg agccagggct ttaatcgggc cgggccgggc 60 caatagcccg ggaggttgag aaatgaggcy gaaatcatgt cgcgtttcca ctgtcgggct 120 aatagctcac agcgcgtcac gcaaacaccg cccacagaac gtccccgaat caaacgtcac 180 acaacccgcc cacttcagcg ggaatgaggc agataaagca caccatagaa tcatcacgaa 240 acgaaaccat taaaatgaag acaaccgaaa caaacaaatg acacgttact gtacagcagt 300 acatgaaatg tctatacgca cagacctgtg tatttccatt cattacattt gtttactcgc 360 cgaccgactg ttggcattgt gcatcgactg tgctctcgcc actaacgcag ggacccatca 420 cagagagcgc acacatccat aatataaagc cacagaaatt ctcctttata actcgatatg 480 gcatgtattt tataacatcc tcgttttgat agaagtcgtg ttaagtgttt cagcacaaga 540 gcgagtagcc taataaaata tagccatatt tgttgcgctc ggcagctatt cactaaagct 600 cttcatttaa agtttcattt ataaatttaa ccacgtcata taaaaacata aacagttgtt 660 tgaggatata gaacacattt tgtgtttgga ctttccccca tatgcgttta aagcagcgag 720 cgctgcgcag gactgagtga cagaaaaaaa aggtataggc tagattctgc tttcactcac 780 ccaaataatg ctctgtttgc gcgatttgat taatatcatc gtatccaaat actttataaa 840 tatttactta tataaatatt tactttatat aaatatttca aaccttctcc ggtgttwatt 900 gtttttagaa aatatcgaaa atcttttttt ttttcagaca ggtctttgta aaatgaaagt 960 tataggctat atgtggcttt aaaacggttt gatttatgta ttgtatatag gctacctaca 1020 ttgttatagc tagcttttgt ttattttata ttggtttatt tatttaatgt gattttatat 1080 atatccgata tttgtaattc tttaaattat atttcaatat agcttaggct attttatcaa 1140 gatatgacac tatattgttt aaagtctacg aaagtgggca cgtattttgt taagttttat 1200 gtctttctgt tgtatttata ttgtgtatta tctccaaaat aaataaagaa aaaagccgct 1260 cgcggcataa cagagctgtg aaggagcgct tttgcccggt ctcacacaca catacaggca 1320 tctaaacgcg cacaaacaaa cattttaaac ttgacaaaaa ctctcgaaaa cctttactgt 1380 ctgctgtgtt tttaatatgg tgtaaagatt attatgcgtt attgaaacat aaaggaggac 1440 gcagcaaaga acgtcactat aaattaaaac tactttatta ttttatgcag cagccttaca 1500 tttaaaaggg aaacataagg gtgatcatac gcaaaaagac cccagactat aaggctagat 1560 gttttctttc ccttatttct ttatttgttt ggcgcatgta ctcatgtgcg atcagaaaac 1620 tgcccgcctc tcctttcact ccgccccgaa gccccagctg gccctccttg gcccaaggta 1680 ttcggcgggc cgaaaaaggc cggccgctgg ccccaagaaa gccccgcttt ggcccgatta 1740 ggccccggaa gtgacagtgg aaaccccact ggccttggct cgccctggct cgctcgcttt 1800 aggcgcgata gtggaaacgc ggcta 1825 // ID hAT-N81_DR repbase; DNA; ZEB; 828 BP. XX AC . XX DT 09-APR-2009 (Rel. 14.04, Created) DT 09-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N81_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-828 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(4), 861-861 (2009). XX DR [1] (Consensus) XX SQ Sequence 828 BP; 257 A; 177 C; 159 G; 235 T; 0 other; cagacctggg cattttacgg cccccgggcc acatgcggcc ctggacttct ttgatcggcc 60 cgcgtgaagt atttatttaa tttaaattca accgcaaaca gggccttgca atgcgcaggt 120 ctgatattgc gagtcaacaa aaggggactt tcacatatag cgtcttttgc gcgctcagtt 180 tcgttatttc aaatggagac gcgcgggttc agcgcgcaaa taggaaggga tgcacatgcg 240 gtatgtcacg ctcacgtttt ccaggcgccg cgagttgaaa aaaatctaaa catttcagag 300 tgccgcgagc gaaccacgag tcatgtgaca ggaaccaacc aggcagcttc acactttgta 360 tggaattaca cagtaataaa ggtagactaa ttacctcaga ttaacgattt aacaaatcca 420 gatacagatc ctctctcagt gatgatcatt acatcacata tccacctcac attcaacctg 480 acttcagtgc acttgttcag cccaacacag gctagaattc tctcactgaa cataaaaaaa 540 taatgaatga atgaatgaat gaataaataa ataaaattaa ctaaggtgaa gcaatcaaac 600 aaacaaccct gtggcattta tatgaagttg atattgtgat gctggtcttt aaaatacttt 660 ttgtgtaaaa ttgattcttt tttaaacttt agagttttta tgaatgttta ttgaatagtt 720 tcacagaatg tttttttttt taaaacaaat taactgtatt ggtctggccc tctaaaacca 780 ccccagagtc taatgtggcc ccttggaaaa attaattgcc cacccctg 828 // ID hAT-N60_DR repbase; DNA; ZEB; 1505 BP. XX AC . XX DT 27-OCT-2008 (Rel. 13.1, Created) DT 27-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N60_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1505 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1329-1329 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1505 BP; 468 A; 262 C; 254 G; 519 T; 2 other; cagtgttggg gaaagttact tttgaaagta gtgcattaca atattgagtt actcccccaa 60 aaaagtaact aattgcgtta cttagttact ttttatggaa agtaatgcgt tatattactt 120 ttgagttact tttgcgttac ttttccctta cctggctgag gtttgatctc tttcagaact 180 tgcaggtgtt ttttcacctt ttttatagag aagctctgca tttaacaacc acctatataa 240 cctacacctt cattttcctt taaaaaatgt aggataaaat tatattttga gaactttctg 300 agcccagagc tatacatgct gtatatacag attaatgaaa gcatgtaaag taatgtgttt 360 gctacttttg tatgttttgt gttattagtt aagtaaattc actgtccatt gagataaaga 420 ttgcaataaa gcctaaagag tactaacatg aaataattta tattaaggca aaaacagatt 480 ttaaaccttt aaaatgttta ataacaaaat gaacagtaag acgataaaaa acatccccaa 540 atcaaaataa atctttgagc tttatacatt gaataaattg cttcacgtgg ccttaacaaa 600 tgtgtaaatc gtttcccaaa agagaaaaaa atcttactat attacaaaac gtagtcgctt 660 ctacagtaac gttagacagt ggctgcatct cctcaacaat aaaagcaata accatttttt 720 ttgagtttag tcgaattaat tgttttacct ggagaacgtg aactgatgtc aatcatttgt 780 tgtttagctg gaggtggggt gcgggtgtag tgctttgcat ccagtttttt tttttttttt 840 ttgatgtcgt gtttcttgca gttgtcaaga gttttactta cacataatct acacttaaca 900 acaatccact tcttttcttc gatgagttca aaataacaag aatccattgc aaaaatgtaa 960 gtctccggcc tgccattgtt tctgtttctg actgtctgta gtgatgacgg gcgattcgcg 1020 aatgaatcgt ttgttttaaa cwgaatcttc taagtgaacc ggtcgaacca gttcattaaa 1080 ttgaactgag ttgtcgtgaa acggtttgcg tcttcagtaa gtaacttact aacataccgg 1140 ataccccctc tgactcgaaa taaaccaata tcctgagtta ttcagttact agaacagtac 1200 actgactcat ctgctgtgaa aaagagacct gatgatgatg atgagtgcga gccgactgtc 1260 tgttccctcg cagcacgctc tctcattgag tgtgtgattc aacctgacta aggtaatttt 1320 tattccgcaa tatattttta aagccaatta agcaaggtaa aaagtaactc gcgttacatt 1380 tttttwaaag taactcaaat aatattactt atttttaaaa gtaatgcgtt actttacttg 1440 ttactttgga aaagtaatat tattacgtaa cgcgcgttac ttataacgcg ttacccccaa 1500 cactg 1505 // ID BEL15-LTR_DR repbase; DNA; ZEB; 349 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL15-I_DR; KW BEL15-LTR_DR; BEL15_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-349 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1380-1380 (2008). XX DR Genome; chr14; Positions 49300966 49300618. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 349 BP; 104 A; 75 C; 72 G; 98 T; 0 other; tgtataaaat cccttttgaa ctctcatttt gataactgtg tgacgtcaca tgtgggtgga 60 gttacggacg tgcccagaca tgttcggttc tcagaatatt ccagtgctac catgagacag 120 atggtagttt gttgatgtgt gtgcgttcat gagaataaac cacacacaca cacaaatgca 180 cccaaatacg caaatctaaa tgcactccta aatgtgtaaa ctgtttgaag atgggattga 240 ataaatcaat gtgcagacca tactgtcgtc aatgttgcct tttcacctct gaccgggttt 300 aaggaatgga tgttgaccac ccagtgtaca cacgcacata catctaaca 349 // ID DNA-1-10_DR repbase; DNA; ZEB; 835 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-835 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2131-2131 (2008). XX DR [1] (Consensus) XX CC Copies are ~89% identical to the consensus. TIR is ~30-bp long. CC The target of this element is CWG, and TSD is 1-bp long (the CC central W). CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 835 BP; 280 A; 157 C; 154 G; 244 T; 0 other; gggtagggtt gcaccagctg atcgtaagtt ctttcttaaa ctggaacgta aagtccacac 60 taggggctta gtaactacta gctagtttgt aactaaattt gttctcactt cggttgcacc 120 acatgttctt aaggcaaacc gtagttagta ggtcgtaagc tctccgtaaa gtcatgcgta 180 gtcgcattga atgacgtaat atccaataaa aagcatttaa atcgttaaac tgctttaaaa 240 ttctgcaact aatgcatcta tatataactg tcctatgaaa atgaaatgat aaattacatt 300 tttatagcac attaaattac agtcagtcac taataacaga tgacgcacac acctgcatcg 360 cgagccgtga atgcacacga agttatcaaa acattaaact tgaatttttt ttatatccta 420 cacatgaaag agaaaagagc aggaagaaaa actcagtcat gaagaaattc tcgttttaat 480 tgatggatac aattaacaca aaacactcct tgaaagaaaa ttaaactctt ctttcacaaa 540 ccgaaaaaaa aatgcgagat ttgggaggaa aataaggctg tgagtagagg gagatcattc 600 ttcattatca tcgtgagctc ctcaatgtaa actaatctcg ctgtcatctc ttttctaccg 660 ttatacatgg ggggaggctg ccgtaaatat ttagttggaa cgtagcgtta cgtttaaact 720 aagttcagtg gtgcaacaca aaaatattta gtaatgtgta agttgtaact tagtgtccct 780 ttaagtcaca actaggctac gttttaactt acgcactgct ggtgcaaccg gcccc 835 // ID Gypsy100-I_DR repbase; DNA; ZEB; 6684 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy100-I_DR; KW Gypsy100-LTR_DR; Gypsy100_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6684 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1509-1509 (2008). XX DR Genome; chr19; Positions 11708610 11715293. XX CC Positions [3095-3598] - Reverse transcriptase CC Positions [5045-5518] - Integrase core CC 'ATGT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 129..1958 FT /product="Gypsy100-I_DR_2p" FT /translation="MEIVKHENVNIKRAVIVSGLTHSDPDQEVETCLVKYG FT SIKRNLIIDDPNSEYHQCAIVEFSCDSAMQHLEGLLPMTIKSKLDTNVAFH FT VRSLESVYMPATHSNVTEAYFENLQALAQDSGRTLREVLHSELQKISELMF FT PESESSRPEPELEQFVYDSSTREPKNPANDSTPAQSSCSARSPDASERETI FT SFNIPTSALNPPAIQRVVMEHVVKTSDPVPSSHVAFRLKIFSGKIPRPSHE FT PDFDTWRASVDFLLNDKSFSDLYKTGKILDSLLPPASDIVRHVSPNAPPSQ FT CLELLDSVYGSVEDGDELFAKFIGVLQNQGEKPSTYLHRLHVLLSTAIRRG FT GVAEVERNRCLLKQFCRGCWDNSLIAELQLEGKKDTPPPFAELVVLIRTAE FT DKQSLKEERMRKHLGFNKHAPVPLKLRPATHFQSVYCSDIPDMPNDESPSQ FT LNEKQKAVKTKNKVEKSEVESLKREIAKLQTQIATMKTEPVHKVKSSPNPS FT EISELRQQIAELQAHLIPRFQREHQERSLPSNTIPLKHRPKLTETVMKNDP FT KPTWQLNNRPRPGYCFHCGEDGHLAVNCENDPNPRKVEEKRYELRERQAKW FT DHHNASNPKHLN" FT CDS 1961..6637 FT /product="Gypsy100-I_DR_1p" FT /translation="MSVSVTGQTETGRREKCPKKHGLHSHQSPCYELPAGL FT VGTRCTASIKIEGREVNCLLDTGSQVTTIPKSYYEKHLYKHPIKSLENILE FT VEGANGQSVPYLGYVEVNLKFPKMFLGVEADVPTLALIVPDLTNMPQILIG FT TNSLDVLYVNCTQGNNITPKSSCYGFQAVIKVLERRKRQASLVTVGYVKLK FT GHQQEVVPAGCTVVLDGLVQVNSPPAEKWVSLEPPTTTPLPAGLLVASSLH FT SLPAMKRFAQLPVVVRNDTQSDLLISPKSVIAEVHAVQQVMERPSSKDNFE FT DQKIKENALNIPIDFGDSPLSPEWKKRITSLLNSMTDVFALHDLDYGHTDK FT VKHHIKLSDETPFKHRPRPIHPQDVDAVRRHLQELLEAGIIRESESPFSSP FT IVVVRKKDNSVRLCIDFRKLNSQTIKDAYALPNLEETFSVLTGSRWFSVLD FT LKSGYYQIEMEEADKQKTAFVCPLGFWEFNRMPQGITNAPSTFQRMMERCM FT GDLNRKEVLVFIDDLIVFSKSLEEHEARLVQVLERLRDFGLKLSPEKCRFC FT QTSVKYLGHIVSESGVKTDPQKVEALKTWPRPENLKELRSFLGFAGYYRRF FT VQDFSKIVKPLNDLTAGYPPLQKGHKKKTNGTTQYFNPKESFGERWSQECQ FT QAFDTVIAKLTSAPILGFADPKLPYVLHTDTSTIGLGAALYQEQDGQMRII FT AFASRGLTRSESKYPAHKLEFLALKWAVTTKFSDYLYGANFTVITDSNPLT FT YLLTSAKLDATSYRWLSSLSTFTFKIQYRAGSKNQDADGLSRRPQAEIPDD FT LETKKERERIRQFTYHHLTGPSSEVPSIIASAETVDAICERHQIKQAYHDP FT DVLHPSLTLVESLTVDVSALPQEFTQEDAYHFSGLPKLSEEDLRERQRADP FT EIRIVIQQLESNEKPCLKTIPSELILWFKEWNRLELKNGILFRRRKCDRDT FT TYQLALPADFRDVVLKELHDEMGHLGIERTLELVRTRFFWPKMSSAVEQKI FT KTCERCVRRKTLPVKAAPLVNIQTSRPLELLCMDFLTIEPDRSGTKDVLVM FT TDHYTKYAIAVPTSNQKAQTVAKCLWDHFLVHYGFPEKLHSDQGTDFESRT FT IKELCRIAGISKIRTTPYHPRGNPVERFNRTLLQMLGTLNNKQKSRWKDYV FT RPLVHAYNCTKSDVTGFSPYELMFGRQPRLPIDLAFGLPVGGQNESHSKYV FT QGLKNRLEESYRVARENSAKIADRNKKRYDKHVVSSTLEIGDRVLVKNVRL FT KGKHKLADKWEPDVYVITKKAGDLPVYTVKPEQKDGPLRTLHRDLLLPCGF FT LSASTTDEHVEREVNRKPRTRARCAVEPEESESTSEDSESDHYPVDHYSPQ FT DFLEIETRILSDSKLPPAQSQTAAQLPGDETMLDDRHAVSAESVDTDTVTL FT PRMANSKVQEDNPLTPDQPVESSLENLPEFGVDCGKEIAHVEEEPNVHGES FT PKKSRGENEESTNLTQRESVIIIESQEDQNETDEIGNLRRPRRYCGPPNRL FT NYPQLGNPLSLVVQSLFSSLSTAITSSLGDSNSSRNFSPSETI" XX SQ Sequence 6684 BP; 2136 A; 1407 C; 1464 G; 1677 T; 0 other; taaaacgtgg cgagccagcc aggagcaaaa tatacagctt agtgaaatag ttataaatca 60 gataaacaca ataaggataa aacaattacg taaaaattag tcccccttcc ctgtgcagat 120 attcaaccat ggaaatagtg aaacacgaga atgttaacat caagagagca gtcattgtga 180 gtggcttaac ccattctgat ccagaccaag aagtggaaac ctgcttagtc aaatatggat 240 ctataaagcg caacctaatt attgatgatc ccaactctga gtatcaccaa tgtgccattg 300 tggagttctc ttgtgattct gctatgcagc acttggaggg tttattgcct atgaccatca 360 aaagtaagtt agacactaat gtggcctttc atgtgcgcag cttggagagt gtgtacatgc 420 ctgcaacaca tagcaatgtt actgaggcat actttgagaa tctgcaagcc cttgctcaag 480 acagtggaag gactctgcgg gaagtacttc acagtgagct gcaaaaaatt tcagaattga 540 tgtttccaga aagtgaatcc tcaagacctg agcctgagtt ggaacagttt gtgtatgaca 600 gcagtacaag agaacctaaa aaccctgcca atgacagcac accagcacag tcatcatgtt 660 ctgctagaag tcctgatgcc agtgagagag agaccatttc ctttaacata cctaccagtg 720 ctttgaatcc acctgcaata caacgagtag tgatggagca tgtagtaaag accagtgatc 780 cagtgccatc ttctcatgtg gcttttcgtc taaaaatatt ctccggaaaa attcctcgcc 840 ccagccatga accagatttt gacacttggc gtgcaagtgt ggatttccta ttaaatgaca 900 agtcattttc tgatttatat aaaactggga agatccttga cagtttgcta cctccagctt 960 cagatattgt cagacatgtc agtccaaatg ccccaccgtc gcaatgcttg gagctgttgg 1020 attctgttta tggctccgta gaggatggag atgaactttt tgcaaagttc attggtgttc 1080 ttcaaaatca aggtgagaaa ccttccacct accttcacag actacatgtg ttgttaagta 1140 ctgccattag aagaggtggt gtggctgagg tcgaacggaa ccgttgtctc ctcaagcaat 1200 tttgcagagg ttgctgggac aacagcttga ttgctgagtt gcagctggaa ggaaagaaag 1260 acacacctcc tccttttgct gagcttgtgg ttctcattcg cactgcggag gacaaacagt 1320 cgctgaaaga agaaagaatg aggaaacacc tagggtttaa caaacatgcg ccagttccac 1380 ttaagttacg accagctact cactttcaat ctgtgtattg cagcgacata cctgatatgc 1440 ccaatgatga atcgccaagt cagttaaatg aaaaacagaa agctgtcaag actaaaaaca 1500 aagttgaaaa atctgaagtt gaaagcctga agagagaaat tgctaaactt caaacacaga 1560 ttgctacgat gaagactgag ccagttcaca aagtaaagag cagccctaat ccaagtgaaa 1620 tttctgaatt gagacagcag atagctgaac ttcaagctca tctcattcca agatttcaga 1680 gagagcatca agagaggtct cttccgtcta acacaattcc cttgaaacac cgacccaagc 1740 tgactgagac tgtgatgaaa aatgatccaa aacccacctg gcaacttaac aacagacccc 1800 gtcctgggta ttgctttcat tgtggagagg acggacatct tgcggttaac tgtgagaacg 1860 atccaaaccc tcgtaaggta gaagaaaagc gttatgaatt aagagaaaga caagctaaat 1920 gggaccatca taatgcctca aatcctaaac atttaaactg atgtccgtct ctgttacagg 1980 gcaaacagag actggaagaa gagaaaaatg cccaaagaaa catggacttc attctcatca 2040 gtctccatgt tatgaattgc ctgctggatt agtaggaaca aggtgtactg ccagcatcaa 2100 aattgaagga agagaagtaa actgccttct agatacaggt tcgcaggtaa caaccatccc 2160 caagtcctac tatgagaaac acttgtacaa gcacccaata aaatctttgg aaaacatatt 2220 agaagttgag ggagctaatg gacaatctgt accctacctg ggctatgtcg aagttaattt 2280 gaaatttccc aaaatgtttc taggtgtgga agcagacgtt cctactttag ctctcatagt 2340 tccagacttg accaatatgc cacagattct aattggaacc aattcacttg atgtgttata 2400 tgtcaactgc actcaaggaa acaacattac tcccaagtca tcttgctatg ggtttcaagc 2460 agtaataaaa gtgctggaga ggaggaaacg gcaagcttcc cttgtgactg tggggtatgt 2520 aaagctgaag gggcatcaac aagaggtggt accagcagga tgcactgtag tcctggatgg 2580 attggttcaa gttaacagtc ctccagctga gaagtgggtg tcgctggagc cgccaactac 2640 gactccatta cctgctggtc tattagtagc aagctctttg cacagtcttc ctgcaatgaa 2700 acgctttgca cagttaccag tggtagtaag gaacgatact cagtctgatc tgttaatttc 2760 cccaaaatct gtgattgctg aagtccatgc agttcaacag gtgatggaaa gaccatcatc 2820 taaagataat ttcgaagatc agaagattaa agaaaatgct ttaaacattc caattgactt 2880 tggagattct cctctttcac cagagtggaa aaaaaggata acatcccttt taaactccat 2940 gactgatgta tttgcattac acgatctgga ctatggtcac actgacaaag tgaagcatca 3000 cattaagctt agcgatgaaa ccccattcaa gcaccgacct cgaccaattc atccacaaga 3060 tgtagatgct gtcagaagac atcttcagga gcttcttgaa gcaggaatta tcagagaatc 3120 tgaatctcct ttctcatctc ctatagttgt ggtgcgtaaa aaggacaact ctgtgagatt 3180 gtgtattgat ttccgaaagt taaattccca aacaatcaaa gatgcctatg cccttcctaa 3240 tttggaagaa acattctcag tattgaccgg atcgaggtgg ttttcagtcc ttgacctcaa 3300 gtcaggctac taccaaattg aaatggaaga ggctgacaag cagaaaactg cctttgtatg 3360 tccgttaggc ttctgggaat ttaaccggat gccacaggga atcacgaacg caccaagtac 3420 gtttcagcgg atgatggaac gctgtatggg agacctcaat cggaaagaag tgcttgtatt 3480 cattgacgat ttgatagtct tctcaaaaag tctggaagaa catgaagctc ggttggtgca 3540 agtgttagag aggttaagag attttggact aaagctctca cctgagaaat gcaggttttg 3600 ccaaacatct gtcaaatacc tgggtcacat tgtatctgaa agtggagtta aaaccgaccc 3660 acaaaaggtt gaagccctga aaacctggcc gaggccggaa aatcttaagg agctcagatc 3720 tttcctagga tttgcgggtt attaccgcag gtttgtccaa gatttttcaa aaattgtaaa 3780 gcctcttaat gacctcaccg ctgggtaccc tcctctacaa aagggtcaca agaaaaaaac 3840 gaatggaaca actcagtatt tcaaccctaa ggagtcgttt ggagaacgat ggagtcaaga 3900 atgtcaacag gcatttgata ctgtaattgc aaaactcacc tctgctccaa ttttgggatt 3960 tgctgaccca aaactccctt acgtgttgca cacagacacc agtaccatcg ggttgggtgc 4020 agcactttac caggaacaag atggacagat gaggatcata gcatttgcaa gtaggggact 4080 caccaggagt gaatcaaagt atcctgccca caaattagag tttttagcat tgaaatgggc 4140 tgttacgacc aaattcagtg actacttgta tggagctaac tttacagtaa taactgatag 4200 taatccgcta acctacctct tgacatcagc aaaactagat gcaaccagct atagatggct 4260 gtcaagtttg tctacattca cctttaagat acagtacaga gcaggaagta agaaccagga 4320 tgcagatggt ctctctaggc ggccacaagc ggagattcca gatgacctag agacaaagaa 4380 agagagggaa agaattagac agtttactta ccatcatttg acaggaccat catctgaagt 4440 accatcgata attgcatcag cagaaactgt ggatgccatc tgtgaacgac atcaaattaa 4500 acaggcctat catgatcccg atgttttaca cccctcatta actttagtag agtccctaac 4560 tgtcgatgtg agtgctttgc ctcaagagtt cacacaggaa gatgcttatc acttttctgg 4620 tcttcctaag ttgtctgaag aagacctgcg tgaaaggcag agagctgatc cagaaattag 4680 gattgttatc caacaactgg agtcaaacga aaagccctgt ctcaaaacta ttccatctga 4740 attaattctg tggttcaaag aatggaaccg tctggagtta aaaaatggca tactattcag 4800 aaggagaaag tgcgatagag acacaacata ccaattggca ttgccagctg actttcgtga 4860 tgtggttttg aaagaactcc acgatgaaat gggacactta gggattgagc gaactcttga 4920 acttgttcga acaaggttct tctggcctaa gatgtcatca gctgtagagc agaagataaa 4980 gacctgtgaa cgctgtgttc gccgcaagac tttgccagta aaggctgctc cgctggtgaa 5040 tatacaaact agcagacctt tagagctcct ttgtatggat ttcctaacaa tagaaccaga 5100 ccgaagtggc accaaagatg tactggtcat gactgaccat tacaccaagt atgctattgc 5160 tgtgcctaca agcaatcaga aagcccagac agttgcaaag tgtctgtggg atcattttct 5220 tgttcattat ggattcccag aaaaacttca tagtgatcaa ggtaccgact tcgagtctcg 5280 cacaattaaa gaactttgca gaattgcagg aatatccaag attagaacca ctccctacca 5340 cccaagaggc aatccggtag agcgctttaa ccgaacgctt ttgcagatgt tgggaacctt 5400 aaacaacaaa caaaaatcaa gatggaagga ttatgtcaga ccgctagtgc atgcttacaa 5460 ctgcacaaaa agtgatgtga caggattctc tccctatgaa ttaatgtttg gcagacaacc 5520 gcgactccca attgatcttg catttggttt acctgtgggt ggacaaaatg aatcacattc 5580 caagtatgtt caaggcttaa agaatcggct ggaagaaagc tacagagttg caagggagaa 5640 ttctgcaaag atagcagacc gcaacaagaa gaggtatgac aagcacgttg tcagttccac 5700 tcttgaaatt ggtgatagag tacttgtaaa aaatgttcgc ctaaagggaa aacacaagct 5760 tgcggataag tgggagcctg atgtttatgt gataacaaag aaagccggtg acttacctgt 5820 ttacactgtt aagcctgaac agaaggatgg acctctgaga accctgcaca gggatctgct 5880 tctgccttgt ggattcctgt cagcaagcac aacagatgag catgttgaaa gagaagtgaa 5940 tcgaaaacct agaactagag ccagatgtgc tgtggaacct gaagagtctg agagcacgag 6000 tgaggattca gagtcagatc attatcctgt agaccattac agccctcaag atttcttaga 6060 gattgaaacc agaattctgt ctgactcaaa acttccacca gctcaaagcc aaacagcagc 6120 tcaactacct ggtgatgaaa ccatgttgga cgacagacat gctgtctctg cagaatctgt 6180 tgacacagat actgtaacct tacctagaat ggcaaattcc aaagtacagg aggataatcc 6240 tctcacacca gatcaacctg tagaaagcag cttagaaaac ctacctgagt ttggagttga 6300 ttgtggaaag gaaattgctc acgttgaaga ggaaccgaat gttcatggag aatcgcctaa 6360 aaaatcaaga ggtgagaatg aagaaagtac caatttgact caaagagaaa gtgtcattat 6420 tatagagtct caggaggacc agaatgaaac agatgaaatt ggaaatctca gacgtccgag 6480 aagatactgt ggacctccaa acagacttaa ctacccgcaa ctgggaaatc ctctcagctt 6540 agttgttcag tccttgtttt ccagtttgag tacagcaatt acaagctctt taggagattc 6600 aaattcctca agaaatttct ctcccagtga aaccatctga attccaggat gcagagggac 6660 ctgcatacat tcaagagggg agaa 6684 // ID Gypsy-171-LTR_DR repbase; DNA; ZEB; 824 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-171-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-824 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(4), 866-866 (2009). XX DR [1] (Consensus) XX SQ Sequence 824 BP; 186 A; 138 C; 185 G; 315 T; 0 other; tgttacgtta tagaacgtat taattatttg tttgctaatt ttatatatat aaatattata 60 tatatacaga tttgtgtttt gtttatgcat ttggagctga tgttgtgttg tgcgctgtgg 120 agcatgcgca gctttctctc tctctctccc cctccctcta gagctgcgct tgattgcggt 180 ccagccgtgt tgcgtgtggg agcgtatata agctggagga gaggaacaat ggggagatct 240 ttctgctcgg tgcaggtcgc aaatctccgt cgtcgtcgct gctgttgttc cgagcgtctc 300 ctatctcctg ccccaaattg ttggttgact tggtgtgaat agatttctgt tactgacaat 360 ttaattattt tcttaaacgg ctctctgtta aatgtgaatc tattggcttt tctgattaaa 420 tacctattta tatatattag ttttggagaa acagaagcgg taggttggga atcgccattt 480 actttaaaac gttttctcct ggtttgtggt agcaagggag aaagggaagc ttgctgtaat 540 tttatttggg gtacttttat ttgagcaggt aagttagaat tatttccccg caagttagct 600 gtctttttgt ttattatttt ggcctggatt cccttccgaa gagatgcttc tatctttgtt 660 ttctaagtaa ataaatattg ttactacttg atcagcgttg tgattattgt gagttgggac 720 aggagaaggg gaaaacccta aattgagtat ttgagtttct ctttatttat tttttttaca 780 cttgttattg ttcccaccaa ccctagactg gtcgggacgt aaca 824 // ID BEL-37-LTR_DR repbase; DNA; ZEB; 443 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; Nonautonomous; KW BEL-37-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-443 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2169-2169 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 443 BP; 100 A; 62 C; 98 G; 181 T; 2 other; tgtagctgac gctgatcggt cagcttgtat ttttgtttat tgtttttgta tttcattatt 60 tgaatgtgta ttggtttatg tttgttacaa atttctttgt ttccggttgg cttgagtggg 120 aggtcacatg attctccttg attgatttaa cctgcgagag tttgtatggt cacacagcat 180 agtgagatct caatgcaggc acctgttaaa tgctcctgct tattctacag ctggttatca 240 ggccaacacg gtaaactaaa cattgtttta tatattattt gtttattttg atgtcctttt 300 gattgtgtga aacatttggt tgtgtgaaac atttggtttg tttatttatc ttttagtttt 360 cacggtgttc aataaatgaa acgcatgrtg gacatcagta tttgggaagc gtggactgtt 420 ttaataaccg gcgggtagyt aca 443 // ID Gypsy-26-I_DR repbase; DNA; ZEB; 4616 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portion of the Gypsy-26_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-26-I_DR; Gypsy-26-LTR_DR; Gypsy-26_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy-26-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4616 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-26_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 25-25 (2005). XX DR [1] (Consensus) XX CC Gypsy-26-I_DR is an internal portion of the Gypsy-26_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-26-LTR_DR. Gypsy-26_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1503-aa Gypsy-26_DR1p (pos. 64-4572) composed CC of the gag, protease, reverse transcriptase, and integrase CC domains. The consensus sequence was built from 4 copies less CC than 1% diverged from the consensus sequence. XX FH Key Location/Qualifiers FT CDS 180..4589 FT /product="Gypsy-26_DR1p" FT /translation="MAALAKETEGDDSLLPDYSSEPFTTHREPIQDLIHSL FT ASLYLENEPEIGEQEEGDFDDSLLPPPPPTIEDDDSLLTMIMATRMHKVES FT RVEAFENSSDVCFKDIQHQLNEQSNRIAVIESQVQHMLKQYQDHPIDVQQL FT ENSLSTMLKKECTQVKDTLETKVQELGQAIMDCLKRRDGQLKSLIQPSGGA FT TSTPHFSHTILDHGSCRPVHFKTPIKLEFPKFGSLDGEDPITYLERCDEYL FT AVQPLNDSEIISMLPSVLTHTAKDWWVAEKKRVRTWTQFKSVFLQSFLADD FT HDVEVERRIRERKQGVDESIRTFAYQYRALCLRLKPSMTEREILQAALRNC FT NPRIASILRGTVTTVDELVRVGTLIEKDINEERSFWRQRHQEANAKSTEGN FT KFFKGRQSNPHIAVCSDSSERSPVTLTLPLTIKGHQYQAILDTGSTYSLIQ FT ESCWKRLKSNHEVLQSSRGQSFSLANGCVQSALGKIAWQATIHGHDYPIRA FT YVMKDCDLAFPVLLGLEFLKMSGITVDFRNSSYSLPEEYGVIHSFTSSSPS FT PIVSLHLALPLIPTPSTDLTIIKELVDRADVSKAHRRQLEGLMLDWPTVCT FT ETLGQTNLIHHQIHTIDEIPVRKKAYPVPVNKQKFIDEEIARMLDKGIIRP FT SVSPWASPVVLVPKKDGSTRFCVDYRALNSKTPLDGFPMPQIQDILESLYG FT ATIFSTLDLKSGYWQVKMDEDSIKKTAFVTKNAQYEFLRLPFGLRNAAATF FT QRLMNNVLRDYMGEFCFVYLDDIVVYSKTIQDHFQHLKLLFAKLQDSGLTL FT NLKKCSMLQRTITYLGHVVSEEGVRTEDTKIKAVQDFPVPKNLKEVQRFLG FT LASWYHRFISHFSERAAPLYALKGKNAIWNWTVECQSAFDDLKYALQRAPV FT LMPPDFTKVFRVQTDASDIGLGAVLTQDFDGAEHVIAYASRLLHGAEKSYS FT TAEKECLAVVWAVEKWRQYLEGRNFEVLTDHSALTWVFNNPKPSSRLTRWA FT LRLQCFSFLVKYRKGSCNVVPDALSRGIPGQEVVGHIAICQANKTDPNLPV FT SWDEIGKAQKLDSSLQALWEAAKQATTDSSRIAYCVQNDYLFRRVPNKDQG FT CVYQLVIPASLREQFLHFAHSNPLSGHLGRMKTLKRLLDSVYWPEIRKDVW FT SFCTQCKTCQIYKPRISKLSGLLQSTPVVEPGYMLGVDLMGPFPKSNRSNE FT FLLVFVDYCSKWVELFALRSAKTHLITNILTKEMFTRWGTPAYLVSDRGPQ FT FTAQLLNETCKRWGVVQKLSTAYHPQTNLTERINRTLKTMLSSYVHDNHRD FT WDKWIPEFRYAINSAWQESTGFTPAEVALGRKLKGPLDRLIQRPPNPDHLA FT YNTLERQKAFLEQVIAKTSQAQERQGKYYNQRRKPKSFEEGDLVWILTHPL FT SRAADSFMAKLAPKWQGPGKIVKKVNNVNYRVVMLDKPSQCDTYHVEKLKE FT FYGTV" XX SQ Sequence 4616 BP; 1402 A; 951 C; 986 G; 1277 T; 0 other; tatttggcgc ccgaacaaaa gacaaaattt gattgaattc ttgagagata cattctgctt 60 tctttgagtc atgtttttgt taatgtgtct atgaggccat aatttcaaaa gtcctaacat 120 tacactctat ttcagcattc taaagttgac attatttgaa gagctttagt agcttcaaaa 180 tggctgcctt ggctaaagag acagagggtg atgattcttt gttacctgat tattcttcag 240 agcctttcac tacacatcga gagcctattc aagaccttat tcattctctc gctagcttgt 300 acttggaaaa tgaacctgaa attggtgaac aagaagaagg ggattttgat gattctttgt 360 tacctcctcc acctcctaca attgaagatg atgatagctt gttaacaatg attatggcca 420 cacgaatgca taaagttgaa agtagagttg aagcatttga aaattcctct gatgtttgtt 480 tcaaagacat tcagcatcag ctgaatgaac aaagtaatag gattgctgtc attgagagtc 540 aggtccagca catgctaaaa cagtaccaag atcaccccat agatgtacaa cagctcgaga 600 attcgttatc taccatgtta aagaaggagt gtacccaagt gaaagatact ttagaaacaa 660 aggttcaaga gttggggcaa gcaattatgg attgcctgaa acggagagat ggacaattga 720 aatcattaat ccagccttct gggggtgcaa cgtctactcc acacttcagc cacaccatat 780 tagaccatgg atcttgtcgt cctgtacatt ttaagactcc tatcaagcta gagtttccaa 840 aatttgggag tttagatggg gaagacccta ttacatacct ggaacgatgt gatgaatatc 900 tggctgtaca acctttaaat gactctgaga tcatatccat gcttccttct gtattgacac 960 acacagctaa agactggtgg gtagccgaga agaagagggt aagaacatgg acacaattca 1020 agtctgtttt tcttcaatct ttcttagcag atgatcatga tgttgaagtg gaaagaagga 1080 tcagagaaag gaaacaaggg gttgatgaaa gtattcgaac atttgcctac cagtacagag 1140 cattatgtct gagactgaaa ccttccatga cagagcgtga gattctccaa gcagcactac 1200 ggaactgtaa cccaagaata gcaagcattt taagaggtac tgtaactact gtagatgagt 1260 tggtacgtgt aggaacactt atagaaaaag atatcaatga agaaagatca ttttggagac 1320 agaggcacca agaagctaat gcaaagtcca ctgagggtaa taaatttttt aagggccgac 1380 agtcgaaccc tcacatagct gtatgttctg atagcagtga aagatctcct gtcacattaa 1440 cattgccctt aaccattaaa ggtcatcaat atcaagcaat tctggatact gggagcactt 1500 actctttaat tcaagagtcc tgctggaaac gactaaagtc aaatcatgaa gttttgcaat 1560 cgagtagagg acagtccttt tctcttgcaa atggatgtgt acaatcagcc ttggggaaga 1620 tagcttggca ggctaccatc cacggacatg actatcctat cagagcatat gtaatgaaag 1680 attgtgactt ggcttttcct gttttgttgg gcctggaatt tttgaagatg tctggaatca 1740 cagtcgattt tagaaattct tcctattctt tacctgaaga atatggggta atccactcct 1800 tcacctcttc ctccccttca ccaatagtaa gtctgcacct tgcactacct ctaataccaa 1860 caccttctac tgatctaacc attataaagg agttagtgga tcgagctgat gtctctaaag 1920 cccatagacg tcaactagaa ggactgatgc ttgattggcc cactgtatgc actgaaactt 1980 taggtcaaac caacttgatt catcatcaaa tccatacaat tgatgaaatt cctgtgcgaa 2040 agaaggccta tcctgttcca gtcaacaaac agaagtttat agatgaggaa atagcaagaa 2100 tgcttgacaa aggcattata agaccttctg tatctccatg ggcatcacca gttgtacttg 2160 tgcctaaaaa agatggcagt acccgctttt gtgttgatta tagagctttg aactccaaga 2220 ctcctcttga tgggtttcca atgcctcaaa ttcaggatat ccttgagtcc ttgtatggag 2280 caaccatatt tagcacatta gacctcaaat ctggctactg gcaggtaaaa atggatgaag 2340 acagcatcaa aaagactgct tttgtcacca aaaatgccca atatgagttt cttcgtcttc 2400 cttttggcct gcgaaatgct gctgcaacct ttcagaggct catgaacaat gttctgagag 2460 actacatggg agagttttgc tttgtctatc ttgacgacat tgtggtttac tcaaaaacca 2520 tccaagatca ctttcaacat ctcaagctac tctttgcaaa attgcaagac tctggtttaa 2580 cactcaatct caagaaatgt tctatgttgc agaggaccat tacttaccta ggacatgttg 2640 tttctgagga aggagtacgg actgaagaca ctaaaatcaa agcagttcag gattttcctg 2700 tcccaaaaaa tctcaaagag gtacagagat ttttaggtct tgctagttgg taccatcggt 2760 tcatttctca cttctcagag cgagctgctc cattgtatgc actgaaaggt aagaatgcaa 2820 tctggaactg gacagttgaa tgtcaaagtg cctttgatga tctcaaatat gcactacaac 2880 gagcaccagt attaatgccc ccggatttca ccaaagtctt tagagtgcag actgacgcca 2940 gtgacatagg actaggagct gtattgacac aggattttga tggtgcagaa cacgtcattg 3000 cctatgcttc acgtctttta catggagcag aaaaatcata ctccactgca gagaaggaat 3060 gtcttgcggt cgtgtgggct gtagagaagt ggaggcagta tttggaagga cgaaactttg 3120 aagtactgac ggatcattct gcgctgactt gggttttcaa taaccctaaa ccatcttcac 3180 gcttaaccag atgggcatta cgactacaat gtttcagttt cctggtcaag taccgtaagg 3240 ggtcctgcaa tgtggtacct gatgccctat ccagaggtat accagggcaa gaggttgtag 3300 gtcatattgc catctgccag gctaacaaga ctgatcctaa tttgccagtc agctgggatg 3360 aaattgggaa agctcagaag cttgattctt ctttgcaagc tctatgggaa gcagctaagc 3420 aagccaccac agattccagc cgcattgctt attgtgtaca gaatgactac ctctttcgca 3480 gggtgcctaa caaagatcaa gggtgtgttt accaactagt catccctgca tcactgagag 3540 aacagttttt acacttcgcc cattcaaatc cactgagtgg acacttagga aggatgaaaa 3600 ccttgaagag attacttgac agtgtctatt ggcctgaaat ccgtaaggat gtttggagct 3660 tttgcaccca gtgtaagact tgtcaaatat ataaaccgag aatctcgaaa ctgtctggat 3720 tgttgcagtc aacacccgta gtagagcctg gctatatgct gggagtggat ctaatgggtc 3780 ctttcccaaa aagtaatcga tcgaatgaat tcttattagt ctttgtggat tattgcagca 3840 agtgggtgga actttttgct ttaagatctg caaaaactca cctcataact aacatcttga 3900 caaaagagat gttcactcga tggggaactc cagcatatct cgtctctgac cgcggccctc 3960 aatttacagc ccaattgctc aatgagacct gcaagcgatg gggagtagtt cagaagctca 4020 gtactgccta tcatccgcag accaacttaa ctgagaggat aaaccgaacc ctaaaaacaa 4080 tgttgtcctc ttatgtccat gacaatcacc gtgactggga taaatggatc cctgagttca 4140 ggtatgccat taactcagca tggcaggaaa gtacgggttt cactccagct gaagttgctt 4200 tgggacgtaa gttgaagggt cctctagaca ggctgatcca gagaccacct aatccagacc 4260 atctagcata taacaccctt gaaaggcaga aagctttcct tgagcaagtg attgcaaaga 4320 ccagccaagc acaagagaga caaggaaaat actacaatca acgtagaaaa ccaaagtcct 4380 ttgaggaagg agatttggtt tggatcctca cacaccctct gtctcgagct gcagattcct 4440 ttatggccaa attagctccg aaatggcaag gtcctggcaa aattgtaaaa aaggtaaata 4500 atgtcaacta cagagtggta atgcttgata aacccagtca atgtgatact taccatgtgg 4560 aaaagttgaa ggaattttat ggaactgtat aactcttttc tttaggggaa ggggtg 4616 // ID Nimb-2_DR repbase; DNA; ZEB; 5281 BP. XX AC . XX DT 23-MAR-2010 (Rel. 15.03, Created) DT 23-MAR-2010 (Rel. 15.03, Last updated, Version 2) XX DE A family of Nimb non-LTR retrotransposons - consensus. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; Nimb-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5281 RA Kojima K. and Jurka J.; RT "Non-LTR retrotransposons from vertebrates."; RL Repbase Reports 10(3), 487-487 (2010). XX DR [1] (Consensus) XX CC ~9-bp TSDs. Only 25 sequences in the zebrafish genome. >97% CC identical to consensus. The 3' termini are composed by (GAA)n CC microsatellite. XX FH Key Location/Qualifiers FT CDS 6..1244 FT /product="Nimb-2_DR_1p" FT /translation="MARRIDSKAVGNNWNVSGQENADGSDESGNEVGDMEV FT VCGNYEKWDMVKNNKRKKKRRNKSDESDSDRCSALEETVTVEYKVFAKPVQ FT EGDTFGGMNPIQLTKTLHKEIGIIKSAKILRNGSILIFCKDEKQQAKAIKV FT NKINGKKVECSKTNGKRYVKGVVTGIPVNVSADEVKASITNANVVEAKRLR FT TNRNGSACDSLSVMVTFDEETLPKKILIGYMCYDVRLYIPPPLRCYKCQRF FT GHIAAFCKGKQRCGKCSGEHEYGKCEEGAKLKCCNCGGDHSSAYRGCEVSK FT RQAEVQRVKAVQGISYAEALKKVPEIMTVSKQNENRNKNTETCQKCEQLKA FT ETLIVSKNDFVIFMAEIINCSAQTKSRNERIKIIIKSAEKYLDVKDLLWET FT VRDILNEDTQSSQPGGGTS" FT CDS 1248..5006 FT /product="Nimb-2_DR_2p" FT /note="includes endonuclease, reverse FT transcriptase, RNase H, and CCHC zinc-finger FT motif." FT /translation="MLNVLQWNARSLIANGQEFKKYIDNIIEKPNIICIQE FT TWLKPQLDFNIKGYNIVRNDRNHSRGGGIATFIKSGMKFRIEQINTKYESI FT LIKVWTDRGCIDIINYYNPCDKLNQNILEEVMGVRQDSVLWCGDFNSHNSL FT WGSNSNDANGILLEEFIDEKYLVCLNNGEGTRYNCFKNTESVLDLTFISSS FT LAAVSTWKVLKHNTIGSDHYPVVTKIGLKIMYEKEDRIPRWKLEKVNWKEF FT QELCDKRVMTIQIINQRDVNILNNKIVNEIIQAAEEIIPKSKGVGCTKNVP FT WWNNDCKAAIKARNKAFRHLKKHHSLEAMIMYKKAQTILRKTIKTQKRIFW FT REYCNSLGREVQLSEVWGMIRRMAGVRRNYELPVLQYGDIVAISNLEKAEL FT LVQHFRNVHSSDNLSEEARKCRNTTLTKHPDLLKKVKTTENPLDLQFNMFE FT LKRAIISARQTAPGKDEICYKMLSHMSEASLEIVLNLFNQIWDMGQLPIAW FT KQSIVVPILKPGKNPSDPSSYRPISLTSHLCKIMERMITERITYFLESKNL FT FSPYQSGFRRGRNTMDSVLCLESDIRKAQTNKEVVIAIFFDIEKAYDMLWK FT EGLLIKLKSLGVTGKTYNWVMDFLFGRKIQVRVGKEYSHEYTVENGTPQGS FT VCSPLLFNIMINDIFSQIEQSIGKSLYADDGALWIRGRNVSFVQKKIQNAI FT FEVEKWANKWGFKLSVAKTQVICFSRRHKTISLALKLYGQPLEQVKTVRFL FT GVWFDEKLTWKDHLNKITEKSKKVINVLRCLSGQEWGASRTSLQNIYWALM FT RSVFDYGCIAYMSAAESNLKKLDVLQAQALRICSGSFRTSPVSSMQVEMGE FT MPLSIRRMKLMMAYWVNIQGQIESHPTKRTLLECWEHEETNFISFGWIAEA FT KARIIGLDQLQYCKAVPIPYIPPWFFPLPKVDFNIQQELKDNNNILPTKYI FT VQNYLEKNYKESIFLFTDGSKDPQTGHTGAAVYIPVNQYHIKKRITNNISV FT YTTELIAILIALQWIEENDIYNVVIASDSFSSLESIRSGRSSYRMDILNNI FT LSKTYYIKMKGKSVYFIWVPAHVGVEGNEKADFLAKQALRISKVNLEVPLS FT KAEAKIMIRTYAQSIWQVHWDNIDTGRHLYNIQKQVGTGRKENRNRREGSI FT ITRMRIGHTGLNHTLHKIGKHPTGQCIHCNQQETIEHILFHCSXYNKERND FT LIQSVKKSNLQHFTLAGLLGNKSSEVYNDIIKFIKETQLEERI" XX SQ Sequence 5281 BP; 2054 A; 755 C; 1078 G; 1393 T; 1 other; ggaggatggc aaggcggata gactccaagg cggttggcaa caactggaat gtgagtggac 60 aggagaatgc agacggaagt gacgaatcag gaaatgaagt aggagatatg gaagtagttt 120 gcgggaacta tgaaaaatgg gacatggtta aaaacaacaa acggaaaaag aaacgaagaa 180 ataaatctga tgaaagtgat tcggatagat gctcagccct agaggaaacg gtgacagtgg 240 aatacaaggt atttgctaaa cctgtgcaag aaggggacac atttggaggt atgaatccga 300 tacaactaac caaaacactg cataaggaaa taggtataat aaagagtgca aaaatcttga 360 gaaacggatc gatactgatt ttctgtaaag atgaaaaaca gcaagctaaa gcaattaagg 420 tgaataaaat caatggaaaa aaagtagaat gttccaagac aaacgggaaa agatatgtta 480 aaggcgttgt gacaggaatt cctgtcaatg tttcggcaga tgaagtgaaa gcaagcataa 540 caaatgcaaa tgtggtcgaa gctaaacgct taagaacaaa cagaaatgga agtgcttgtg 600 atagtctttc tgttatggtc acatttgatg aggagacact cccgaaaaag atattaatcg 660 ggtatatgtg ctatgatgta agactgtaca ttcccccgcc ccttcggtgt tataaatgtc 720 agagatttgg acatatagca gctttttgca agggaaagca aaggtgtggt aaatgtagtg 780 gggaacacga atatggaaaa tgtgaagagg gtgcaaagtt gaaatgttgc aactgtggag 840 gggatcatag ctcagcgtat cgagggtgtg aggttagcaa aaggcaggca gaagtacaaa 900 gagtaaaagc tgttcaagga atcagctatg cggaagcatt aaaaaaggtt ccagaaatta 960 tgactgtgtc taaacaaaat gaaaacagga acaaaaatac agagacatgt caaaaatgtg 1020 aacaactgaa agcggaaact ctgatagtga gcaaaaatga ttttgtaata ttcatggcag 1080 agataataaa ttgttccgct caaacgaaaa gtaggaatga aagaatcaaa ataataatca 1140 aatcagcaga aaaatacctg gatgtgaagg atctgctctg ggaaacggtt agggacatct 1200 taaatgaaga tacacaatca tcccagcctg ggggaggaac gtcttaaatg ttaaacgttc 1260 tacaatggaa tgcaagaagc ttaattgcga atggccaaga atttaaaaaa tatattgata 1320 atattataga gaaacctaat ataatatgca tacaagaaac atggttaaaa cctcaattag 1380 attttaatat taagggatat aatatagtta gaaatgatag gaatcatagc agaggaggag 1440 gaattgcaac atttataaaa agcggaatga agtttaggat agaacaaata aatacaaagt 1500 atgaatcaat tttaatcaag gtatggacgg atagaggatg tatagacatt attaattatt 1560 ataatccctg tgataaatta aatcaaaaca tattagaaga agtaatgggt gtacgacaag 1620 acagtgtttt atggtgtggg gattttaatt cacataattc attatggggg agtaatagta 1680 atgatgcaaa tgggattctt cttgaagaat ttatagatga aaaatattta gtttgtctaa 1740 ataatggtga aggcacacgg tataattgtt ttaaaaatac agaaagcgta ctagacttaa 1800 catttattag tagttcatta gcagcggtta gcacatggaa agtactcaag cacaatacaa 1860 taggtagtga ccattaccca gtagtaacta aaattggatt aaagataatg tatgaaaaag 1920 aagatagaat tccaagatgg aaattggaga aagtaaattg gaaagaattt caagaattat 1980 gtgataagag agttatgaca atacaaataa taaaccagag agatgtaaac attttaaata 2040 ataaaattgt taatgaaata attcaagcag ccgaagaaat aatacctaaa agtaaaggag 2100 ttggttgtac taaaaatgta ccttggtgga ataatgattg taaagcagct attaaagcaa 2160 gaaataaagc atttagacat cttaaaaaac accattcact ggaagctatg atcatgtata 2220 aaaaggctca gacaatatta agaaaaacta ttaaaacaca aaagcgtata ttctggagag 2280 aatattgcaa tagtcttgga cgagaggtac aattatcaga ggtgtggggt atgattagaa 2340 gaatggcagg tgttagacga aattatgaat taccagtgtt acagtatggc gatatagtag 2400 cgatcagcaa cctagaaaag gcggaactat tagtccaaca ttttagaaat gtacatagtt 2460 cagataatct ctctgaagag gctaggaaat gcagaaatac tacattaact aaacatcctg 2520 atcttttaaa gaaagtaaaa acaacagaaa atcctttaga tttacagttt aatatgtttg 2580 aattaaaacg agcaatcatt agcgcaagac aaactgcccc aggtaaggat gaaatatgct 2640 ataaaatgtt gtcacacatg tcagaggcat cactagaaat agtactaaat ttatttaatc 2700 aaatttggga tatgggtcaa ttacctatag catggaaaca atctatagta gtacctatac 2760 ttaaaccagg gaaaaatcca tcagatcctt ctagctatag acccatttca ctgacatccc 2820 acttatgcaa aataatggaa agaatgatta cagaaagaat tacatatttt ttagaaagca 2880 aaaacctttt ttctccatat cagagtgggt tccgcagagg cagaaatact atggactcag 2940 tattatgttt ggaatcagac atcaggaaag cacagactaa caaagaagtt gtgatagcta 3000 tcttttttga tatagaaaag gcatacgata tgctttggaa agagggattg ctgattaaat 3060 taaaatcttt aggagtcact ggcaaaacat ataattgggt catggatttc ttatttggaa 3120 gaaaaataca agttagggta ggaaaagaat attcacatga atacacagtg gaaaatggaa 3180 ctcctcaagg tagtgtttgc agcccattat tgttcaatat aatgataaac gacattttct 3240 ctcagattga acaaagtata ggaaaatcat tgtatgcaga tgatggggca ctatggatta 3300 gaggtcggaa tgtgtcattt gtacaaaaga aaatacagaa tgcaatattt gaggtagaaa 3360 aatgggcaaa taaatgggga ttcaaactat ctgttgcaaa aactcaggtc atctgtttct 3420 caagacggca taaaaccata tcactcgctt taaaactata tggacaacca ctagaacaag 3480 taaaaactgt aaggtttctt ggggtttggt tcgatgaaaa gctgacatgg aaggatcatc 3540 tgaataaaat cacagaaaaa agtaaaaagg ttattaatgt acttcgttgc ctgtcagggc 3600 aagagtgggg agcgagtaga acatcactac aaaatatata ctgggccctc atgagatctg 3660 tttttgatta tggatgtata gcttacatgt cggcagcaga gtcaaacctc aaaaaattag 3720 atgtattaca agcacaggcc cttagaatct gtagtggatc atttagaacc tcaccagttt 3780 catctatgca ggttgaaatg ggagaaatgc ccttaagcat taggagaatg aaattaatga 3840 tggcatattg ggttaacatt cagggacaaa ttgaatcaca tccaacaaaa agaacattac 3900 tagaatgttg ggagcatgag gaaactaatt tcataagttt cggatggata gctgaggcaa 3960 aagccagaat aattggatta gatcaactgc aatactgtaa ggcagtccca atcccatata 4020 ttccgccatg gttttttcca ttacctaaag tagatttcaa tatacaacag gagttaaaag 4080 ataacaataa tattcttcca acaaaatata tagtgcagaa ttatttagaa aaaaattata 4140 aagaatcaat atttttattc acagatggat ctaaagaccc acaaacaggc catacaggcg 4200 cagcagtgta tatcccagtt aatcaatacc atattaagaa aagaataact aataatatat 4260 cagtatatac aacagagtta attgcaatac taattgccct gcagtggatt gaggagaatg 4320 acatatataa tgtagtaatt gcatcagata gcttttcatc actggaaagt ataagatctg 4380 gaaggtcttc atacagaatg gatattctta ataatatact cagtaagaca tattatatta 4440 aaatgaaggg caaatcagta tatttcattt gggttcctgc tcacgtaggt gtggaaggta 4500 atgagaaggc agactttctg gccaaacaag ctctcagaat tagtaaagta aatctagaag 4560 tcccattgag caaagcagaa gctaaaatca tgataagaac atatgcacaa tcaatatggc 4620 aagtacactg ggataacatt gatacaggta gacatttata taatatacag aaacaagttg 4680 gaactgggag gaaggagaac agaaatcgta gggaaggaag tattataact cggatgagaa 4740 taggtcacac tggactcaat catacattac ataaaatagg gaaacaccca actgggcaat 4800 gtatacattg taatcaacaa gaaacaatag aacatatttt atttcactgt agtaawtata 4860 ataaagaacg aaacgatctt attcagtcag ttaaaaagag caatcttcag cattttacat 4920 tggcaggttt attaggaaac aaatctagtg aggtatataa tgatataata aaatttatta 4980 aagaaaccca gttagaggaa agaatataga gataatatca gaagaattgt tagtccaccc 5040 catcccccat ttttattaca ttttattagt attataattt tttttttttt tttttttttt 5100 tctcttccct ctctccccct cccctcacaa tttcctcccc ctttctctcc cttcatgtaa 5160 tttatacact tcagaaagta ttcaactagt tccatactcc attccagtcg gtggcggtaa 5220 tgcacctata agtctggttg ccaaccgcca gtaaaaccca aagaagaaga agaagaagaa 5280 g 5281 // ID GYPSY5LTR_DR repbase; DNA; ZEB; 458 BP. XX AC AL672044; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE GYPSY5LTR_DR is a long terminal repeat from GYPSY5_DR. XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYPSY5I_DR; KW GYPSY5LTR_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-458 RA Kapitonov V.V. and Jurka J.; RT "The GYPSY5_DR LTR retrotransposon from zebrafish."; RL Repbase Reports 2(5), 19-19 (2002). XX DR Genbank; AL672044; Positions 24108 23651. XX CC GYPSY5LTR_DR is a long terminal repeat from the GYPSY5_DR LTR CC retrotransposon. XX SQ Sequence 458 BP; 92 A; 131 C; 93 G; 142 T; 0 other; tgttgtaaat gagggcgatg ccaaccctct tgtgcaccac cagagggaac catcgccaga 60 attctgattc gactcacgga ctcaaaatcc cataagccct gctacctggc actgattacg 120 gtccaggtgc aactcatcag ctctcgtgta tatataccgc actcacgctc cggttcgttg 180 cgaagtcttg atttgcctgg ctgtcatttc tgagcgttcc atactccctg cttcggactg 240 atctgtgttt ctgaccctgt gcttgttcta cgattacgaa agacatctgc ctgcccctga 300 tctccagcct gttattctga ccagtaagat atccgcctgc ctttgaactt ttgcctgtcc 360 cacgttctgt ctcctggatt gcccctttgt gtttgtttgt atgtgtgctc ttaataaagc 420 ttgcaaatgg attcaatgcc ttctgactca tcacaaca 458 // ID BEL2-LTR_DR repbase; DNA; ZEB; 890 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2-I_DR; KW BEL2-LTR_DR; BEL2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-890 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1356-1356 (2008). XX DR Genome; chr16; Positions 35783496 35782607. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 890 BP; 229 A; 201 C; 186 G; 274 T; 0 other; tgtgcaaaag gcgattaaag aaagtgaaag catgttgttg cggatggtgc ccggaggact 60 tttattatga aagtatagat cgccgctggc cggaaaagaa agtgctttca ctctagcgat 120 cgctagatca tcgcagccat tgctcacgtg cagtcctcca tctgttaaga cccacagacg 180 gacactgcag ctccctttcg tgacagaaga ttgtttatgg tgttgtgtgc gtcctataag 240 agtgagtagc agtccggatg gcatgtaaat agaatgagag aacattccaa acgtgatgct 300 tgtgttaaat cattacagtg cttagcatag caggctaaag ctctcatgtc tagttatgag 360 taaagacatt taatgtttac tgttatataa tatgtttatg gttcactttg ttcttgttta 420 tggggagcag cctatattgt gaatgtgact tattgatgta agagtttaat ttagttttat 480 actttgtgtg tttcagagaa accacacccc ctatacaccc tacaaacatc cctaagtgct 540 cagatcaagg atctagtgtt tcagttcata caagcaagcg cacgccaagt tatcttctca 600 ttggcctaaa tgttgatgta ctaagcccta cggacttcag tggatgcttt aaagctgctc 660 tgcccatact atcaatgttt gctgctggat tagagctcac tccccatcct gttaattctt 720 actgattgat agctgtacag gctggctccc cccccccctt tgtggacaat ttgcagtttg 780 caataaaccc tgtttaactt cacctcatct ctctgtcgtg ttctaaccga cacccctggt 840 tcgctacgct ccctttaact agccctaacg gcaaggtagt cttcactaca 890 // ID TZF28B repbase; DNA; ZEB; 1478 BP. XX AC U51227; XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 12-JAN-2009 (Rel. 14.02, Last updated, Version 2) XX DE Mariner/Tc1 type DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TZF28B. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1478 RA Jurka J.; RT "Mariner-type DNA transposons from zebrafish."; RL Repbase Reports 9(2), 559-559 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1478 BP; 466 A; 289 C; 329 G; 387 T; 7 other; tattcatagc gcttcacttt tccacatttt ttatgttaca gmcttattcc aaaatggatt 60 aaatttaaaa attctacaca caatacccca taatgacaat gtgaaaaatt ttaattgttg 120 caaatttatt aaaaataaaa aatgaaaaat cacatgtaca taagtattca cagcctttgc 180 cgtgaagctc taaattgagc tcaggtacat tctgtttcca ctgatcattc ttgagatgtt 240 tcagcagctt aattggagtt cacctgtggt aaattcagtt gattggacat gatttgaaaa 300 ggcatacacc tgtctatata aggtcccagg gttgcatgtc aaagcacaaa ccaagcatga 360 agacaaagga attgtctgta gacctctgag acaggattgt cttgaggcac aartctgggg 420 aagggtacag aaaaatttct gcwgcwctga aagttccaat gagcacagtg gcctccatca 480 tccgtaagtg gaagatgttt ggaaccacca ggactcttcc tagagctgcc ggccatccaa 540 gctgagcgat cgggggagaa gggccttagt cagggaggtg atcaagaacc caatggtcac 600 tctgtctgag ctccagcgtt cttctgtgga gagaggagaa ccttccagaa ggacaaccat 660 ctgtgcagca atccaccaat caggcctgta tggtagagtg gccagacgga agccactccc 720 catggaattt gtcaaaaggc atctgaagaa ctctcagacc ataagaaaca aaattctctg 780 gtctgatgag actaaaattg aactttggag taaatccagg tttggagaaa accaggcacc 840 gctcatcacc tggccaatac catccctaca gtgaagcatg gtggtggcag catcatgctg 900 tggggatgtt tttcagcggc aggaactgga agactagtca ggatwgaggg aaagatgaat 960 gcagcaatgt acagagacat ccttgatgaa aacctgcttc agagtgctct ggacctcaga 1020 ctggggtgat ggttcatctt tcaacaggac aatgacccwa agcacacagc caaaataaca 1080 aaggagtggc tttacaacaa ctcwgtgaat gtccttgagt ggcccagcca gagcccagac 1140 ctgaacccga ttgaacatct ctggagagat ctgaaaatgg ctgtacaccg aagacaggtg 1200 tgccaagctt gtggcatcat attcaaaaag acttgaggct gtaattgctg ccaaaggtgc 1260 atcaacaaag tattgagcaa aggctgtgaa tacttatgta catgtgattt ttcaggtttt 1320 tttattttta ataaatttgc aacaatttca aaatcttttt ttcacattgt cattatgggg 1380 tattgtgtgt agaattttga ggaaataaat taatttaatc cattttggaa taaggctgta 1440 acataaaaaa atgtggaaaa agtgaagcgc tatgaata 1478 // ID BEL5-I_DR repbase; DNA; ZEB; 6716 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL5-I_DR; KW BEL5-LTR_DR; BEL5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6716 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1361-1361 (2008). XX DR Genome; chr1; Positions 53863030 53869745. XX CC Positions [5602-6165] - Integrase core CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 421..6516 FT /product="BEL5-I_DR_1p" FT /translation="MSQTGGTPCSVEEMGKKRESKLTQKALLNKIERLQCE FT RKRAVDKIKGLIPKMKEYMKQKGNILQIKQCLMTLNTMCESAVNAHDELMP FT LLPEDEFLKQKEWFLSIMNYSSVFQNQTEQWISETEQTQGVSQNATEMNEN FT VISAKSNEVENEDEINPRDSVSNISCNKSSLSKHSSTSSARLKAQAELAAL FT AMKKKLLEERHALEEEEEKLRKRKERLQLDAEIAGEQAKLEVLKTHSTTTS FT VILTSNVSDGMNSYVAKQSKQPFNVDAGEFIPSDSNKERYLGAKPKVKLTP FT VLDNITPSNIIDPPFSQNHLNSFDMQDSQGPHDFNTDIFMDSETKNCVLGL FT MKKQNEVTTLLMQQQGLSALPKREIPIFDGDPLKYHSFIRAFENGIEKNIN FT DNCDRLYFLEQYTRGHAKALVGSCHYMNPDRGYLKAKTLLKEQFGDEQKVA FT SAYMDKALLWSPIKTEDVKALQDFSFFLRGCCNAMEDVQYLHDLDMPSNML FT CIIKKLPYKLRDRWRSQVCELQESNNQRVKFKDIADFIEKQVKILTDPVFG FT NIQDTTSVSGNKGMNKLKSQYRSGNKGISFATSVGAVESKCQSEKKRKEII FT TTDKKVCMCCGGGHTLDMCVQLGKMVHEKKIGFLRENGICFGCLCTGGHIS FT KNCRKRISCSKCNLKHPTVLHREPAHVIEQTERNTVVPVDNMLVSSGLTGA FT GDEDCKLPIVPVQVKSNKGSKIINTYAFLDQGSTAVFCTENLMHKLHLTGR FT KGRILLRTMGQEKVVSSNIISGLEVAALEGDSFLELPKVYTQESMPVHRGN FT IPTNRDIKRWSYLKHIHLPQIDSEIELLIGTNVPKALEPLQVVCSVNDGPY FT AIRTMLGWTVNGPLMGDSGERVNWKQPLVMVNRVSAVNLDEIWHQQFKTDF FT PENRLDDYAGMSREDQKFLELVSNSVKHVNGHYQIALPLRSIDVSMPKNKN FT IVEQRLHHLKRRFQKDSLFHAEYNNFMNDLLLKDYAEKVPVEELDRSDGKV FT WYIPHHGVFHPTKGKMRVVFDCAASYQGTSLNAQLLQGPDLTSSLIGVMTR FT FRKEPVVIMADVESMFYQVRVSSDYSDLLRFLWWPNGDVEQQPVEYRMKVH FT LFGAASSPSCANYALRRCAEDYGCHYSENAVDKLLNSFYVDDCLVSVVTEK FT EAVSLYQELVSLCARGGFSLTKWITNRPGVLEAIPESHRARGMEGLNMELD FT SLPVERVLGLEWCIKSDCFKFKIVLKDRPLTRRGILSTISSIYDPLGMLSP FT VILTAKKVLRDLCRREIGWDDTVPESVSKDWMKWLQQLHLLDVFKVDRCVK FT PSCFGDVVTAQLHHFSDASQDGYGTVSYLLLKNQHSVMHSAFIMGKARVAP FT LKSVTIPRMELIAATMASRMDILWRKEMHMDLLDSVFWTDSTSVLKYIRNE FT TSRFKVFVSNRVSQILKVSSPEQWRYVDTASNPADMASRGVKVDVFIQNTT FT WVSGPHFLLHPESEWPINKEDFNYLSPEDPEVKNVAATNVTQLKEDPVTYL FT IKYFSSWTRLKKSVAWFLRIKDWLMSCLKETRHFHQTVVQSDNNAQQSSSV FT AEEIDYFKRTSSKLTVEDLDRAELAIIKFCQGQRFPEELASLEKGQHIKIS FT SHLHKLCPQLQDGILRVGGRLCRLAMPVEVKHPIILTKDLHISELLLRHIH FT QEVGHGGRNHMLSKLREKYWITGVSVAIRKVLSKCIVCRRLNALPVYQQMA FT DLPHERIVPDEPPFTRVGVDYFGPFEVKSRRSMVKRYGVIFTCLAIRAVHL FT EVAPSLDTDSFINALRRFISRRGQVREIRSDNGTNFVGAERELKAAIKQWN FT QGQINDLLLQKGIRWSFNPPAGSHHGGSWERLIRSVRKVLNSMFKVQNLDE FT EGLHTVLCEIEAIINSRPITKASMDPNDLEALTPNHLLQLKTSPSLPSGLF FT QPTDMYALRRWKQVQYMSDLFWKRWVKEYLPQLQERQRWIGVKRNLVVGDL FT VIIMDSTAPRNSWPTGRVIQTFPDRKGFVRQVRIKTKTSCLERPITKVCLL FT QEAEFD" XX SQ Sequence 6716 BP; 2097 A; 1210 C; 1603 G; 1806 T; 0 other; atattttgtc aacaaaaaac ggcatcttat caggaaaatc atcgctcgca aaatcagcgc 60 acagacgaat caggaagacg ccaagtaagt gatgcgattg caaaccaaat gggccacgtc 120 atgtgttcgt ttacatagga cagataagtt tatctgtgga tatatgtgtg gatgagaaat 180 ctggttggca ttggtgcatg ttaaagttgt ttggaatgca aatcagatgc gcgaggtcgc 240 gcgacttact ttggtgtctg taggaagtct gcactttgcg gcttgtttgt gtgatgccgt 300 gatgttgttt ggaaggaaaa tgaactgctg ttgaattgaa tcgaatcgtt tgagcgtgtt 360 caaacgttgc attgtgtttg gatgttgtga ttcactgaac actgaatcaa aagagtcact 420 atgagtcaaa ctggtggtac cccgtgctcg gttgaagaga tgggaaagaa aagggaaagc 480 aaactcactc aaaaggcctt gctgaataaa attgagagat tgcaatgtga acgtaaacgt 540 gctgttgata aaatcaaagg gttgatacca aaaatgaaag aatatatgaa acaaaaggga 600 aacatccttc aaataaagca atgtttaatg actttaaaca caatgtgtga gagtgctgtt 660 aatgcacatg atgaacttat gccattactt cctgaggacg aatttcttaa acaaaaggaa 720 tggtttttaa gcataatgaa ttacagcagt gtgtttcaaa accaaactga gcaatggatc 780 agtgaaactg agcaaaccca gggtgtttca cagaatgcaa cagaaatgaa cgaaaatgtg 840 atttctgcaa aatcaaatga agttgaaaat gaggatgaaa ttaatccacg tgatagtgtt 900 tcaaacataa gctgtaacaa atcatcactg tcaaaacatt cttcaacctc atctgcacgt 960 ttaaaagccc aagctgagtt ggctgcttta gccatgaaga aaaaattatt agaagaaaga 1020 catgcactgg aagaagagga ggaaaagttg cgtaaaagga aagaaagatt gcaactggat 1080 gctgaaattg caggagaaca ggcaaaactt gaggttctca aaacacacag tacaacaaca 1140 agcgtgatac tgacatcaaa tgtttcagat gggatgaact cttatgtggc aaaacagtct 1200 aaacaaccat tcaatgttga tgcaggtgaa tttattccat cagattcaaa taaggagaga 1260 tatttaggag ctaaacccaa agtcaagctt actccagtgt tggacaatat aactccatca 1320 aacatcattg acccaccatt ttcgcaaaac cacctgaatt ctttcgacat gcaagattca 1380 caaggaccac atgatttcaa tacagatatt ttcatggatt cggaaaccaa aaattgtgtt 1440 cttggtctca tgaagaaaca gaatgaggta acaacactgt tgatgcagca acaggggctt 1500 tcagctttac caaaaagaga aatacccata tttgatggtg acccactgaa ataccactct 1560 tttattagag cttttgagaa tggtattgag aaaaatataa atgacaactg tgaccgtttg 1620 tatttcttgg agcaatatac aaggggccat gctaaagcac ttgtggggag ctgtcattat 1680 atgaatccag atcgaggata tttgaaagca aaaactttgt tgaaggaaca gtttggggac 1740 gaacagaaag tggcctctgc ttatatggac aaagcccttt tatggtctcc aattaaaacc 1800 gaggatgtga aggctctgca ggatttcagt ttctttctca gaggctgttg taatgcaatg 1860 gaggatgtgc agtatcttca tgacttagac atgccttcta atatgttgtg tattattaaa 1920 aaacttccat ataaactcag agatcgatgg aggagccagg tttgtgagct gcaggaaagc 1980 aataaccaga gagttaagtt caaagacatt gctgacttca ttgagaagca agtgaaaata 2040 ctcacagatc cggtgtttgg taacatacaa gacactacat cagtaagtgg aaataaagga 2100 atgaacaaac ttaagtcaca atatcgatca ggaaacaaag gaatcagctt tgcgaccagt 2160 gtgggcgctg tggaaagtaa atgtcagtct gaaaagaaaa ggaaggaaat tataacaaca 2220 gataagaaag tgtgtatgtg ctgtggagga ggacacaccc tagatatgtg tgtacagctg 2280 gggaagatgg tacacgagaa gaagataggc ttcttgaggg aaaatggtat atgttttggc 2340 tgtttgtgta caggaggaca catcagtaaa aattgccgta aaaggatttc ttgttcaaag 2400 tgtaacctta agcatcctac agtacttcat agggaacctg cacatgttat tgagcagact 2460 gagaggaaca cagtggtgcc tgttgacaac atgctggtgt caagtggtct tacaggggct 2520 ggagatgagg actgcaagct acccatagta ccggttcagg taaaatctaa taaaggaagc 2580 aaaattatta atacctatgc atttttggac caagggagca ctgcagtgtt ttgtacagag 2640 aacttgatgc acaagcttca ccttacagga aggaaggggc gtattcttct acgaaccatg 2700 gggcaggaga aggttgtgag tagcaacatt atatctggtt tggaggttgc agctctggaa 2760 ggagatagtt ttcttgagct gccaaaagtt tatacacaag agtccatgcc agtgcaccga 2820 ggaaatattc caactaatag ggatattaaa agatggtctt acttaaaaca cattcattta 2880 cctcagatcg attccgaaat agaactcctg ataggaacca atgtccctaa agcactggaa 2940 ccattacaag tggtatgcag tgtaaatgat ggaccatatg ccattagaac catgttgggc 3000 tggactgtta atggaccact gatgggagac agtggagaaa gggttaactg gaaacagccc 3060 ctggtaatgg ttaatagagt gtcagctgta aatttggatg agatttggca ccaacagttt 3120 aaaactgact tccctgaaaa ccgtctcgat gattatgctg gtatgtcaag agaagatcaa 3180 aaattcctag aactagtgag taattcagta aaacatgtga atgggcacta ccaaattgca 3240 ttaccattaa ggagcattga cgtcagcatg ccaaaaaaca agaacattgt cgaacagcgt 3300 ttgcatcatc tgaaaaggag gtttcagaaa gattcattat ttcatgctga atataacaac 3360 tttatgaatg atctgcttct taaagattat gctgagaagg tgcctgtaga ggaactagac 3420 cgcagtgatg gaaaggtatg gtatatacca caccatggag tgtttcatcc caccaaggga 3480 aagatgagag ttgtgtttga ctgtgcagcg agttatcaag gaacatcgtt gaatgctcag 3540 ctcttacagg gcccagatct tacgagttct ttgataggag tcatgactcg ttttagaaag 3600 gaaccagtag taatcatggc tgatgttgaa tccatgtttt accaggtacg ggtctcttcg 3660 gattactcag acctgcttag gtttctttgg tggcctaatg gagatgtgga acaacaacct 3720 gttgaatacc gtatgaaggt gcatttgttt ggggcagcat catcacctag ttgtgccaat 3780 tatgccctca gaaggtgtgc agaggattat ggatgtcact acagtgagaa tgcagtagac 3840 aaattattga actcttttta tgtagatgac tgccttgtgt cagttgttac agagaaggaa 3900 gcagtgtcac tttaccagga gctggtttct ctgtgtgcca ggggtgggtt ttccctcaca 3960 aaatggataa ccaacaggcc tggtgtattg gaagctatcc ctgaaagtca cagagcaaga 4020 ggcatggaag ggttaaacat ggagttggat tcattacctg tggagagagt gctgggtttg 4080 gaatggtgca taaaatcaga ctgttttaag ttcaagattg tactaaagga tagaccactc 4140 accagaagag gaattctctc tactatcagt tcaatttatg accctctggg aatgctaagc 4200 cctgtcattt taacagcaaa gaaagttctg agagatttgt gcaggagaga aattggctgg 4260 gatgacactg tgccggagtc cgtgtcaaag gattggatga aatggctgca gcagctgcat 4320 ttgttggatg tgtttaaggt ggaccgatgt gtgaaaccat catgctttgg agatgttgtt 4380 acagcccagt tacaccattt cagtgatgcc agtcaggatg ggtatggaac agtttcttac 4440 ctgttgctga agaaccagca ttcagtaatg cacagtgctt tcatcatggg gaaggcaaga 4500 gtggctccgt taaaatccgt aaccatccct agaatggagc ttattgctgc taccatggcc 4560 agtcgcatgg acattctttg gaggaaagag atgcatatgg atcttttgga ttctgtgttt 4620 tggacggaca gtacatctgt gctcaaatac atccgaaatg aaacctcacg attcaaagtt 4680 ttcgtttcta atcgggtctc tcaaatcctg aaagtttcca gtcctgagca gtggaggtat 4740 gtggacactg caagcaatcc agctgatatg gcatctagag gtgttaaggt ggatgtgttt 4800 attcaaaaca ctacatgggt gtcagggcct cactttcttt tacatccgga gagtgagtgg 4860 cctataaaca aggaggactt caactacctt tcacctgaag accccgaggt caaaaatgtt 4920 gctgcaacaa acgttacaca actcaaagaa gatccagtca cttacctgat caagtatttt 4980 tcttcctgga cacgccttaa gaaatctgta gcatggttct tgaggatcaa ggactggctc 5040 atgtcttgtt taaaggaaac aaggcacttt catcagaccg ttgtacagtc tgacaacaat 5100 gcacagcaga gctcctccgt ggcagaggag attgattatt ttaaaaggac aagcagtaaa 5160 ctcacagtgg aggacttgga ccgggctgag cttgccatca ttaaattctg tcagggacag 5220 agatttcctg aagagctagc cagtctggag aaagggcaac atattaaaat atccagtcac 5280 ctccataaac tctgtccaca gctgcaagat ggaatcctga gagttggtgg tcgtctctgt 5340 aggttggcta tgcctgtgga agttaaacac cctataatat taacaaaaga tcttcacatt 5400 tcagaactct tgctgagaca tatccatcag gaagtggggc atggtggacg gaaccatatg 5460 ctgtccaaat tgagggagaa gtactggata actggtgtca gtgtagccat aaggaaagta 5520 ctgtccaaat gcatcgtctg tcgccgtcta aatgctctgc cagtttacca gcaaatggca 5580 gacttgccac acgaaagaat agtcccagat gaaccaccat ttactcgagt gggagtggac 5640 tattttggac cttttgaggt taagagccga aggagcatgg taaagcggta tggggttata 5700 tttacctgtc ttgccatcag agctgtacac cttgaggttg ctccatcact ggatacggac 5760 tcattcatta atgcactcag gcgtttcatt tccagaaggg gacaggtgcg agagatccgt 5820 tcagataacg gaactaattt tgtaggtgca gaacgtgagt tgaaagctgc cattaaacag 5880 tggaaccagg ggcagatcaa tgatcttttg ctgcagaagg gaatcagatg gagttttaac 5940 ccaccggctg gctctcatca tggaggatca tgggagaggt taataagatc agtgcgtaaa 6000 gtccttaatt caatgtttaa ggtgcaaaac ctggatgaag aaggtcttca cacagtgctc 6060 tgtgagattg aagccatcat caacagccgt ccaatcacca aggcttccat ggatccaaat 6120 gacctggaag cactaactcc aaaccacttg ttgcagttaa agacttcacc atcattaccc 6180 tcagggttgt tccaaccgac agacatgtat gcacttagga ggtggaaaca ggtacaatac 6240 atgtccgatc tattctggaa gagatgggtt aaggaatact tacctcagct acaggagcgt 6300 cagagatgga ttggagttaa gaggaacctc gttgtgggag atcttgtgat cattatggac 6360 agcacagctc ctagaaattc ttggccaaca ggtcgagtga tccaaacctt cccagaccga 6420 aaaggatttg tccgtcaagt acggattaaa acaaaaacca gctgtttgga aagacctatt 6480 accaaggtct gcttgctgca ggaggctgag tttgattaaa tatgaactac agttaaattc 6540 ttttcttttt tttttgataa ttccttttgt ggattatttt ttgtttttat ttgtggattt 6600 tgccattagg acacaaagaa gaaaagtatg gacattttac tgttggactt tacagttgaa 6660 cttttgtgtg tttgatttat gtctcctatc tgtaattttg gataattatt atgtgg 6716 // ID DIRS-10_DR repbase; DNA; ZEB; 5763 BP. XX AC . XX DT 17-JAN-2009 (Rel. 14.01, Created) DT 09-MAR-2009 (Rel. 14.01, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW reverse transcriptase RNase H; DIRS-10_DR. XX NM DIRS-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5763 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 9(1), 16-16 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 3929..4693 FT /product="DIRS-10_DR_1p" FT /translation="CVDGGTVGRHNLIIRFLRGARRMVPPRPPLMPSWDLA FT VVLTSLREEPFEPLDSVSLRFLSLKTALLVALASVKRVGDLEAFSVSDSCL FT EFXPAYSHVVLRPRPGYVPKVPTTPFKDQVVSLQALPAEEADPALSLLCPV FT RALRTYVDRTQSFRSSDQLFVCYGGRQKGSAVSKRRLAHWRVDAISLAYLS FT QGEPCPPGVRAHSTRSIASSWALARGASLTDICRAAGWATPNTFARFYNLR FT MEPVSSRVLGNPW*" XX SQ Sequence 5763 BP; 1074 A; 1711 C; 1597 G; 1365 T; 16 other; ttccccttcg gatggggaac tccaatgcta tgtggaaaac cttccacaat atggggtttt 60 cgtcagaaac caatcatctg aaagagtata aaatcgggcc aatgaaatgc caaatgaatt 120 ggcagcgtca gcatgcacag ctggcgtcaa tgacaatcag tcaagtatat aagacgcggc 180 tagtgcaatg ctcgacatcc ttttcgcttt cagagccttt cactagcttc tgagagagtc 240 tttgagggtt ctccwacctg tgtctacaga gagagatcga gaagcagctt ctcccggtcc 300 agagcgcgta tacgcagtgg cagacggtcg agctgggttt ttctcccttg cctggcgttc 360 tttgggtccg gtcctccaag agcggtttgt atatagggaa aaagttttcc taaaagagca 420 acacggtttt gcagcgcgtc tttttcaaga cgtcgctccg accgtgcgtt tctggatgcg 480 gtggtttcct atccccggat gatgggcacg agcactgcgt ttcatgtctg ggggtccagc 540 atgttaatgc ggtgctcgcg ggcagtgcat gccgtctctt tgatgtcatg tctgctgcgc 600 agttaagatc gcggctcgct cttgcaaaag agcgacccac cccagttgtc ccccgcactg 660 cggttggcac tcgggcagat ctgaggatta cagtgggagt aaatccgccg ccttcgggct 720 gcagacctct cgctcctcca cgcgctccat ccaagcttca ggtgagaata tgcgctccct 780 cttcggatgg tcgctctctc acttgatgac accgaggatc agatgtccat cgctgcatcg 840 gaggatgggc tgtcattgtc tgatgaggat gcgaacccgc tcgctccctc cctccggggt 900 ggagagcacg gcgttggcat ctaaaaagca ggcatgatgg ccgtgctttc ccgggctgct 960 tcggccgtgg gcctggtgat ggtttatccc ccagccccgc gaccggaccg actagatggg 1020 tgttatgtgg aggattcaaa gccttccgtc cccttcttcc cggaagtgca cagtaagctc 1080 acgcagtttt gagggcactt tttttctgcc cgatctgcat gcgcttccat cctcaccatc 1140 cttggagggt gggctgtcaa tgtctgatga ggattcgaac ccgctcctcc ctccgggttg 1200 atgagcgcgg tgtcgaatct agaagcagac tttgtggccg tgctttccca ggctgcttcg 1260 gccatggctc tggagagtgg ctctggagat ggtttatccc ccagccccgc ggccggactg 1320 actagagggg tgtaaaaacc tttcttcctg gatgcgcaca gtaggcttac gcagtgtgct 1380 gcgtgcgcct ccaccctcac cttgcatgct atggccacct accagcgcta ccaagcgcag 1440 gcgctggccc agctgcggcg aggatggttc cgacccagga ctgggcatga gctccgcacc 1500 ctgggaagga cgatggccac tttagtggcc aggaacgcca cctcaggcta aatctggtga 1560 tgtgtgtgat gttgacaaag ttcgctttct taactcaccc atatcccggg ctggcctgtt 1620 cggcgacacc gtcagtgaac tcgcccagga gttcacgccg gtgatggagc agtcggaggc 1680 gatgggttat aatctatcgg cgggatcgta agaccgctcc ctcccaccga gccatccaca 1740 tccactgctt ttcgccgagg gcgctcgcct gtagcttttt gctccgccgc cccgcctgtg 1800 cctccggcca agcggctgcg ccgagcatct cgcaggcaac cagcgccccc ctgccccagg 1860 gtgccgctaa gttcggtaaa cagaccgcga agcatccctg agacgggcca tccggagagg 1920 agggaatttr ctctttcccc gctggagggt ggggctctay atttaaangc ngwaaawaaa 1980 aaaaaaaaac gccatcaaat cttcaaagag cttttttctc tttcctcgga tgtgacagcc 2040 tgaacactgc cagtttggga cgctatgctt tccagctcgc aggatcggtg catttcgcca 2100 atggctcaca gagcgcgaga gaacggtctc ctttctctcc ctctcgcagc ccctcctccg 2160 gagtttgggt gcgagaccag agcgagtctc tcgcctctcg ctctcccgcg ggaccccagc 2220 gctccccggg tgagcccacc cactccacgc tgcctcaccg ctggcatgtc agcgatcgtg 2280 cgggcgggtt cacttgcgag ggctctgcct gcctggttag cgcgggccag cccatcgcaa 2340 tggctcatcc gtacgatcag actcggctat gcgatacagt tcgygaaacg gctttccaag 2400 ttcacgggcg tgtatttcac caggctcagt cctgcgtccg cccctgtctt gagggacacg 2460 tatatgcatt tctccatact tcctcgcggg cgtctgcatg ctcagttctc tcgacatttg 2520 gctgatttta gcccactctc ggggacaatt gactatgcac agagacaagg tgctccggca 2580 cctccacctg ttggggtttt agatcaactc gagaaaagag ctggctcgcc cccgtgcaga 2640 gcccctcctt tctcgggttg gagctggact cgatcaccat aaaggcgtgc ctctcacgag 2700 agcgcaccga gccagtgctg gactgtctga gagagtttga cagaaaaatg tggtccccct 2760 gaaatctttt cagaggctcc tggggcatat ggcatccgtg gccgcgacct ccccgctcgg 2820 attgctccat atgagaccac cacggcattg gcttcacgat cgggtcccca gacgcgcatg 2880 gcaggcgggc acataccgag tgactccact ccgctgtgtc gcctcgcccc caccccctgg 2940 agggacccct ctttcctacg ggccagagtg cccctgggtt aggcgtccag gcatgttgtc 3000 atttygacag atgcttccag tacgggttgg ggggccgtgt gttgcgggca tgctgctgcg 3060 gacctgcgga agggaaccca gctgcattgg catatcaact gcctagagct gttgacagtg 3120 tttctcactc tgcgccgctt tttaccggcg ctgagggggc aacacgtgct ggtcaggacg 3180 gacagcacgg cgacgggagc gtatatcaac cgtatggggg atgtgcgctc ccgccgcatg 3240 tctcagctcg ctcgccgtct gctcctccgg agtcacacgt ggctgaagtc gatgcgtgct 3300 gttcacatgc cgggcgagcc caaccgtgcg gccaactggc tctcacggca gctccttgcc 3360 ccgggagaat ggcgactcca ccccgagtct gttcagctgt catgggcact gccccacagc 3420 tggcctcggg gcacgcgcaa acttgcgttt tccccagtga gcctgctcgc gcagttactg 3480 tgcaaaccca gggaggacga ggagcaggtc ttgttagttg cgcctctctg gcccaaccgg 3540 acttggattt ttgaactctc cctcctcgcg acggcccccc cctggrgggt ccctttgaga 3600 gagcacctac tctctcaggg acagggcacc atcgggcacc ctcgcccaga tctgtggaac 3660 ctccacgtgt ggtccataga cgcgaggaag acttaggtaa cctaccgatg gcggtggtta 3720 ataccgtcac tcaggctaga gcaccctcta cgaggcatgc ctatgccctg aagtggagtc 3780 tattcactga gtggtgcgct tctcgctgag aagacccccg atcttgccag atcagtgttg 3840 tgctttcttt ccttcaggac aggctggagc gaaggctgtc gccctccaca ctgaaggtct 3900 acgtggccgc tatttccgct catcatgatg cgtagatggc ggcaccgtgg ggaggcataa 3960 cctcatcatc cggttcctca gaggtgcgag gcgtatggtt ccaccccgcc cccctctcat 4020 gccctcttgg gacctcgcgg tagtgctaac gagcctacgt gaggagccct tcgagccact 4080 cgattcagta tccttgagat ttctgtcctt gaagacagct ctgctggtcg cgttggcatc 4140 ggttaagagg gtcggggacc tggaggcatt ttcggtcagt gactcgtgcc tagaattcrg 4200 gccggcctac tctcacgttg tcctgagacc ccggcccggc tatgtgccca aggttcctac 4260 cacgccgttt aaggatcagg tagtgagcct gcaagcgctg cccgcggagg aggcagaccc 4320 agccctttca ttattgtgtc ctgttcgcgc tttgcgaacc tatgtggacc gcactcagag 4380 ctttagatcc tctgaccagc tctttgtctg ttacggtggt cggcagaagg gaagtgccgt 4440 atctaaacgg aggttggccc actggagagt ggatgccatc tccctcgctt atctgagcca 4500 gggtgagccg tgtcccccag gggtgagagc gcactccaca cggagcatcg catcctcttg 4560 ggcgttagca cgcggcgcct ctctgacaga catttgcaga gctgcgggct gggcaacacc 4620 taatacattt gctaggtttt acaatctgcg aatggagccg gtttcctcaa gggtattagg 4680 taacccttgg tgattgagga aacaattygg ttggggtgtt gaaacacgct tgctgcgcca 4740 ttctccctaa cacggaggta cgtgcacttt ttcagctttg tcagttcagt tccccgttcg 4800 gtgaacccta cagagttcct ccgaggcccc cagcatctga ctcagcggag gagtcagacg 4860 ttggcccgtt acgttgttgg catgcccgct ggtcagccmg cccgcattgt tctgggtata 4920 ggtgcctgct atgygtgatc ccctgcgggc gatcccataa gctcactcaa ccacggttta 4980 gtcccccctt gtgttagggc gggctcgtgt cttccctccc cgctaaccat cctctttatg 5040 tacccctccc ccatttccgg ggctgggcca caggctgtca ccaggtctcc cctccttggg 5100 tagcaggtga actccgcagc gtcctcccta tcgggactga acgctttccc aacgtactgt 5160 cgtattaaaa cccttttact gggttatttt cgactccccg aaaaatatag ctaaacctga 5220 acaggtaagt aaggtragta agggccaggg gacacgttgg aagactgcat ctcgcggcgt 5280 tgtaggtgcg ctcgctctac tgcrtggcgc acccttcgcc agggacgcgg taaggtgctt 5340 tcgttgtggc gttttccata gatttcccca tattgtggaa ggttttccac atagcattgg 5400 agttccccat ccgaagggga acgctacggt tactaaagta accctcgttc cccgaggggg 5460 ggaacggaaa tgctatgtac cttcgccaca acgattgtcc cttagctgtt gagcgtgaaa 5520 gtctcctcag ctcaaaagga tgtcgagcat tgcactagcc gcgtcttata tacttgactg 5580 attgtcattg acgccagctg tgcatgctga cgctgccaat tcatttggca tttcattggc 5640 ccgattttat actctttcag atgattggtt tctgacgaaa accccatatt gtggaaggtt 5700 ttccacatag catttccgtt ccccccctcg gggaacgagg gttactttag taaccgtagc 5760 gtt 5763 // ID Copia-6-I_DR repbase; DNA; ZEB; 2273 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Copia-6-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2273 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1583-1583 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2273 BP; 321 A; 948 C; 603 G; 394 T; 7 other; gaacgacgcg atcacaatat ggatccagca gataatataa tagtgcgtat aaaacaaggt 60 aaccgctcaa ttgagcgtta tataaaggag tttttggagc tagccccgaa gagctctcac 120 agtgacattt tgctkatgat gttttttcgg ggaggacttt tggaaccagt ccgttcccwc 180 atgccaccgt atgaggakga ctggaacctt tatcggttca tygaggyggc cttattgatc 240 tctggctccc ctctctctgt tgccctgaag gaggagaacc ctcaggaggg aggcctgagt 300 ggagggcaac agagagggat attttcgggg ccagctgatt tatctaaggg aaggaggaga 360 gaggctggac tggagttgcg gacttatccg cgtcgggctg tccggagacc acccttttcg 420 gagcctctgg agagcctgct agaactgtgt ggaaccccag tgtctgctcc agtcccacaa 480 tgcccgccag tgtcggctcc agctccagag caagatccct ctcctccctc tcctgatgac 540 aacgctctcc ctctttctca aaccaccaga cggaggaggc gtaggaacag gactgctcat 600 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 660 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 720 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 780 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 840 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 900 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 960 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 1020 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 1080 gccccagagc gcccgccagt gtcggctcca gccccagagc gcccgccagt gtcggctcca 1140 gccccagagc gcccgccagt gtcggctcca gccccagagc gcagtccaat gccggttcca 1200 gtctggctcc tggccctgcc ggcgccaccc aggctcctcg ccctgccggc gccacccagg 1260 ctcctcgccc tgccggcgcc acccaggctc ctcgccctgc cggcgccacc cagacgcctt 1320 gccctgccgg ccccagcccg gctcctygcc ctgccggcgc cacccaggct ccttgccctg 1380 ccggccccag cccggctcct tgccctgccg gcaccaccca gacgtcttgc cctgccggcc 1440 ccagcccggc tcctcgccct gccggtccca gcccggctcc tggcattacc aacgccatcc 1500 agacgtcttg ccctgccggc acctcccaga cgtcttgccc tgccggcccc agtccggctc 1560 catgccctgc cggctccagt ccggctccct gccctgccgg ctccagtcca gctgcctcca 1620 gtcccgccgg ctccagtcca gctgcctcca gtcccgccgg ctccagtcca gctgcctcca 1680 gtcccgccgg ctccagtcca gctgcctcca gtcccgccgg ctccagtcca gctgcctcca 1740 gtcccgccgg ctccagtcca gctgcctcca gtcccgccgg ctccagtcca gctgcctcca 1800 gtcccgccgg ctccagtcca gctgcctcca gtcccgccgg ctccagtcca gctgcctcca 1860 gtcccgccgg ctccagtcca gctgcctcca gtcccgccgg ctcctgtcca gctgcctcca 1920 gtcccgccgg ctcctgtcca gctgcctcca gtcccgccgg ctcctgtcca gctgcctcca 1980 gttccgccgg ctccagtcca gttgcctctc cagccggctc tcctaaaacc ccctgccttg 2040 tgtctcctgc gggatcatcc atgggtcatc cctcccgccc ctccctggtg gtccttccca 2100 ctgtttggga cggattcccc ggctgcaccc tggcttcctg ctggggcgcc tgacccacgg 2160 acctaccctg atctggccct cccatccctc cccttgttcc tgctcygctt cgcccacccc 2220 cccccgggac tattctggga gcgtctggaa gccgctcttt aaaggggggg taa 2273 // ID hAT-N49_DR repbase; DNA; ZEB; 1111 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N49_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1111 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1318-1318 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1111 BP; 346 A; 195 C; 210 G; 360 T; 0 other; cagagttggg tgtaacgcgt tccacagtaa cgcgttactg tattctaatt acatttttgg 60 ggaacgcagt aatgtaacga attacatttt aaatttgtgt aatttgatta cagttactaa 120 agtcaatgta attgcgttac ttacgttaca aatatagttt ttagaaaaaa aaatgcttct 180 tttaaatcac gttttccgct gcaacgtcag ttaataagca tgcacttttt aaattgctgg 240 agaacgcgag ctgaaatggt tgctgtcgag agtggttgct tcttcaagtg gaaatacaga 300 cattactttg atttcattga gcgtaaaaac aaaaatattg ctgtgaaatg cagtctatgt 360 ccagtctcta aagaactttc gactgctttt aaactcgacc agcaacttgt tcaagcactt 420 gaatagaaag cattctatga caatactcgt caccaaggag gacaccggcg cccaaaaaca 480 cggcggccca actcagccta aacaggctaa attgaatttt tgtgcaggat cgggagtgaa 540 ggtatcttct gaatgtgaag agaagcgtag ctgcttatgg ggccgttcac atatcgcgtc 600 ttttgcgcgc gcaagttcgt tatttccaat ggaggcgcgc ggcttgcgta caaatattca 660 gagtgcatat tgcgtgcgct ctcattttcc aggcacacca tttgaaaaac acaagtcatg 720 caacaagaac tgaccagtca gcttctaatt acctcagaca aacatatcca gttttcttct 780 aaatgtcttc ttttgtgtaa aaaaaaaaaa caggtttgga acaagtgttt gggcgaacac 840 tttaagtctt taagtctttt gaatatgtca agctgctgtg atcaacccat tgtaatgttt 900 ttcagtgaat attaaattac tgaaagagct gcagtttatt ttgtattttt ataggtatat 960 tttatgtttt aaaagtaact tcaaagtaaa gtaattagta atctgattac tttttacatg 1020 aagtaattag taatgtaatc agattacaat tttccagtag taatcagtaa tttgtaatct 1080 attacttttt aaagtaactt acccaacact g 1111 // ID BEL35-I_DR repbase; DNA; ZEB; 7317 BP. XX AC chr16; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL35-I_DR; KW BEL35-LTR_DR; BEL35_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7317 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1413-1413 (2008). XX DR Genome; chr16; Positions 11731903 11739219. XX CC Positions [5636-6214] - Integrase core CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 815..6574 FT /product="BEL35-I_DR_1p" FT /translation="MESHSNEEGKTQVFEIRSEASMSKSSRRSSRSSASVA FT ATRARAKAEAARAKASYAEKEAAMMKKKAEVEADLFVLQSQKDATAAAAEA FT AIYEAAADIEEGNIDELLSDTRSIQSQRTREYVQKHATEQHEQQLIDTLIQ FT QPSSRSAETLHQEAMVTSRSENPSYRVKELEIKQEKINNRKDKVHEYRSTF FT QTSPTPSVKAPMFSLPYTSDLANYMVRKEMGSSGLTKFDDCPENYWAWKSS FT FQDVTKDLDLTAREELDLLAKWLGPESSIQAKRIRSVHAHYPIAGVRMLWQ FT RLEECYGSPEVIENALLSKLERFPRIANRDSHRLRELGDILLELESAKSSG FT HLQGLAILDTARGVNPILGKLPFNLQERWITYGAKVKQHLQVSFPPFGFFV FT QFVCEQAKIRNDPSFALPMHVHPMQEKCPQFNKIPVSVRKTEVLATVSNSK FT NKLCSEQKMLEPSVQCPIHNKPHPLSKCCGFRAKSLDERKTYLKGLSICFR FT CCGSTKHIANTCGLAVKCRECNSDRHLSAMHPGPAPWIEQTLVTEQEQGEE FT LATDMLSDVTSECTEICSGAPRPKSCSKICLVDVYPFDCPDKAERMYVVLD FT DQSNRSLAKSAFFKLFGINTNPYPYTLRTCGGTAKTMGRRADNFIVRSLDG FT NTEVSLPTLLECNMLPEDRDEIPTPEVARYFSHLKTVADKLPPFDENAPIL FT LLLGRDILSVHKVREQYNGPGDMPYAQRLDLGWVIVGEMCLGGVHKSNSAN FT VYKTNILQNGRASFFPSCSKRIQVKEQFGTPVHQYQSCLPCHPVISSCTDH FT LGIGVFQRTPSDNKLSLSVEDEIFLDVMNKEMHLDESNHWVAPLPFRLPHK FT PLPNNRDQALKRLNILQRTLQKKPEMGEHFVEFMQNMFINKHAELAPPLLP FT DEEYWYLPIFGVYHPLKPGKIRVVFDSSAQCKGISLNDTLLRGPDLNNKLL FT GVLMRFRKEQVAVTADVEQMFYCFKVRKDHQNFLRFLWFKDNDLSKEITEY FT RMSVHVFGNSPSPAVAIYGLRRAADWGEEEYGEEAKQFIHRNFYVDDGLIS FT VSSAAEAISLLKNTKNMLSESNIRLHKISSNSHQVMEAFPASDRANDLKDL FT DLTIDPLPLQRSLGVSWSLETDCLTFQVSPVNRPFTRRGILSTVNSLYDPL FT GFVASVTIQGKALVRELSSDQVDWDTPLPTDKETQWKVWTNSLADLRKVQI FT PRPYIPISMSQTSEREVIIFSDASTLAIAAVAYLRVASIEGQCHVGFIMAK FT SKLAQYPAQTVPRLELCAAVLAVELAELITEELDMELTTVKFYTDSKIVLG FT YIYNTSRRFYVYVANRVARIRSSTKPEQWHHISSELNPADLGTRFVPADIL FT PHTNWFCGPEFLCKPDLGAKEDSFDLVEPGKDQEIRPQVTTLVTNVTDQSL FT GSHRFKRFSEWTSLVRSIANLIHIVQSFSDMKCTLDCKGWHLCQKSHQVDW FT TQAKTFIIQCVQQEHYSEELKKLRKREEISQQSSLKKLAPFVDGDGLIRVG FT GRLQLAELSEQEKHPLVIPACQHVSTLLVRHYHERVAHQGRHLTAGAIRSA FT GLWIVKGTRLINSVIHQCVTCRKLRGRIEEQRMSDLPMDRVTVDPPSTYVG FT LDVFGPWSIMSRRTRGGSAESKRWAVIFSCLSTRAVHLEVIESMSTSSFIN FT ALRRFLAVRGPVKHLRSDRGTNFIGACRELNIKTDDPELKGYLQEQGCTWS FT FNVPHSSHMGGAWERMINTARCILDGLLLKSSCTRLTHEVLTTLMSEVMAI FT MNARPLIPISSDPDTFEVLSPSMLLTQKASTVPASLGDFDIKDLYKKEWRK FT VQCLADAFWTSWKRDYLATLQVRRKWTEEKQNLREGDVVLLKDSNVKRNDW FT PIGLIVKTMASSDDRVRKVEVKIVKEGVAKVYLRPISEVVLLLPKKP" XX SQ Sequence 7317 BP; 2228 A; 1443 C; 1611 G; 2035 T; 0 other; ctaaagttgt ttagccactg tttccttggt tactatatgt gtggtcttct aaagccattt 60 gtgggtttgg cgctccacct accggacatt aaaggcactg cactatatgt acaattaaga 120 ttcactttcg ttttccttca gtgcatacag tcaaggacaa ctcacgcacg cggttctttc 180 ggtgcctgat taaaacgtgg aaaccttgga aatatatgtc cgtaagtcca caatttgatt 240 aaacagcaat tgtatgttca aaggttgcat gttaaagagt cattactggt ttattagatt 300 tattattcct gtatttgctc tccaatcaca tgttcataat tagtcagaca agttcataat 360 gctgagtggc gttacaaata tgttaatttg ctgacatttg atacaaaatg tgttttattt 420 aatagatttc cttatgtgtg tacaagattt gtgtaaaaaa gaactatgta cattagttgt 480 agaataatgt ctagctgatt gaactgaaca tgttgggaat atcattgtca tgtgaataag 540 attgtattct tatttttcct tttccttttg ttgtagtttt acaaacagaa tagaagatat 600 tgtgaggaac aatacccagt aaagtcactc aagttatcca actgctttct ggtgttcttt 660 tctggagtat gctatctgtt aaaccctttg ctcaggaaaa cgcaattggt tgggacagaa 720 tagtaaaagg acattgatac agcgggtgga aacaacgcaa tcacacaccc attgtgacct 780 ttgcagtgta cagcttatta gaagctcact aattatggag tcacacagta atgaagaggg 840 aaagactcaa gtctttgaaa taagatccga agcttctatg tcaaagtcat ctcggagatc 900 aagtaggtca tcagccagtg tggcagcgac ccgagcgaga gctaaagctg aagcagcaag 960 agctaaagca tcatacgcag aaaaagaggc agctatgatg aagaaaaaag cagaggttga 1020 agcagatcta tttgtactac aatcgcagaa agacgcaact gcagcagccg ctgaagcagc 1080 aatttatgag gcagcagctg acattgaaga gggtaatata gatgagctgc tcagtgatac 1140 acgctcaatt caatctcagc gtactaggga gtacgttcag aagcatgcta ctgaacagca 1200 tgagcagcag ttaattgata ccctgataca acagccatcc tctaggagtg ctgaaacact 1260 tcatcaagag gcaatggtga cttctcgatc agaaaatcca tcatataggg tgaaggagtt 1320 ggaaatcaag caagagaaaa ttaataaccg gaaagataaa gttcacgaat atcgctctac 1380 ctttcaaaca tctcctactc ccagcgttaa agctcccatg tttagtttgc cctacacatc 1440 ggatttagct aattacatgg tgaggaaaga gatggggagt tctggtctca caaagtttga 1500 cgattgtcca gaaaactact gggcttggaa atcttccttt caagatgtta caaaggacct 1560 tgatttaaca gccagagaag agcttgacct acttgcaaaa tggctcgggc cagagtcatc 1620 tatacaagcc aaacgcatca gatctgtgca tgcacactat cctatagcag gcgttcgcat 1680 gctttggcaa cgtttggaag agtgctatgg aagccctgaa gtaattgaga atgcgttgtt 1740 gagtaaactg gaaaggttcc ctaggattgc taacagagac agtcacagac tcagagagct 1800 aggagatata cttttggaat tagaatcagc aaagtcaagt ggacatttac aggggttagc 1860 aattcttgac acagcaaggg gagtcaatcc catccttgga aaacttccct ttaacttgca 1920 agagagatgg atcacttatg gagcgaaggt caaacagcat ctacaagtgt ctttcccccc 1980 atttggcttc tttgtccaat ttgtttgtga acaagctaag ataagaaatg accctagttt 2040 tgctttacca atgcatgttc atcctatgca agagaaatgt ccacaattta acaaaatacc 2100 tgtgtcagtg aggaaaactg aagtgttagc cacagtttca aattccaaaa acaagctatg 2160 ttctgaacaa aagatgctgg aacctagtgt gcaatgtcca attcacaaca agccgcatcc 2220 attgtcaaaa tgctgcggat ttagagcaaa gtctctagat gagagaaaga catacctcaa 2280 aggtctgtct atctgtttca gatgttgtgg ttcaactaaa cacatagcaa acacatgtgg 2340 gttagctgtt aaatgtagag aatgtaacag cgatcgtcat ctctccgcaa tgcacccagg 2400 gcctgctccc tggatagagc aaactctagt taccgagcaa gagcaaggcg aggagttggc 2460 tacagacatg ttatcagatg ttacctctga gtgcacagaa atctgcagtg gagctcccag 2520 accaaaatca tgttcgaaaa tctgcttggt ggatgtttat ccattcgatt gtcctgataa 2580 agcagagaga atgtatgtgg tactcgatga ccagagtaat cggtccttag cgaagtctgc 2640 ctttttcaag ttgtttggaa tcaacacaaa tccttaccct tatactctac gcacatgtgg 2700 tggtacagca aagacaatgg gtagaagagc agacaacttc attgtcaggt cacttgatgg 2760 aaatactgag gtgtcacttc caacgctctt ggaatgtaat atgttgcctg aggatcgaga 2820 tgaaattcca acacctgaag tagcgcggta cttctctcat ctcaagacag tggccgacaa 2880 gctacctccc tttgatgaaa atgctccaat tcttctcctt cttggaaggg acattttaag 2940 tgtgcataaa gtaagggagc agtacaatgg acctggtgac atgccgtatg ctcagcgtct 3000 tgacttaggc tgggttattg taggagaaat gtgtctggga ggagtgcata agtcaaattc 3060 tgcaaatgtg tacaagacaa acattctgca aaatggacgt gcctcttttt tcccatcctg 3120 ctccaaaaga attcaggtta aagaacaatt tggtactcct gttcatcaat accagtcctg 3180 tctaccttgt catccagtta taagctcttg tacagaccac cttggtatcg gagttttcca 3240 aaggacaccc agtgataata agctctcatt gtctgttgaa gatgaaatct ttcttgatgt 3300 catgaacaag gaaatgcact tggatgaatc aaaccactgg gtagctccct tgccattccg 3360 tttaccgcac aagccattac caaataaccg agaccaagca ttaaagcgtc ttaatattct 3420 gcaacgtaca ctacaaaaaa aacctgagat gggtgaacat tttgtggagt tcatgcagaa 3480 catgttcatc aacaagcatg cagaacttgc tcccccactc ctgcctgatg aagaatactg 3540 gtatttaccg atatttggtg tgtaccaccc tttgaaacct gggaagatta gggtggtgtt 3600 cgattccagc gctcagtgta aaggcatttc tttaaatgac acacttcttc gtggcccaga 3660 cctgaacaac aagttacttg gggtcttgat gcgtttccgg aaggaacaag tcgcagtaac 3720 agctgatgta gaacaaatgt tctactgctt taaggttagg aaagatcacc aaaactttct 3780 gagattcttg tggtttaaag acaatgatct ctctaaggaa atcacagagt acagaatgtc 3840 agtgcatgtt tttgggaaca gcccatctcc agctgtggca atttatggat tgagacgggc 3900 agcagattgg ggagaagaag agtatggtga agaggccaaa cagttcatcc atagaaattt 3960 ttatgtggac gatggtctga tctctgtctc tagtgcagct gaggccatca gtcttctgaa 4020 aaacaccaaa aacatgttgt ctgagtctaa cattaggctt cacaagattt cgtcaaatag 4080 ccatcaagtt atggaagcat ttcctgcatc tgatagagct aatgacctaa aggatctgga 4140 tctaaccatc gatcccctac ctcttcaacg aagcttgggg gtgagctgga gtttagagac 4200 cgattgttta acatttcaag tatcaccagt taatagacca ttcactcgac gaggcatact 4260 ttcaacagtt aacagtctct atgatccttt ggggtttgta gcttctgtca ctatacaggg 4320 caaagcctta gtaagagagc tgtcatcaga ccaagttgac tgggacacac cactcccaac 4380 agataaggaa acccaatgga aggtttggac aaattcatta gcagacctta ggaaagttca 4440 gattcccaga ccctacatac ccatctctat gtcccaaact tctgagagag aggtgatcat 4500 tttttctgat gcgtctacac tggccattgc tgcggttgcc tatctcagag ttgcatctat 4560 agagggtcag tgtcatgttg ggttcatcat ggctaagtca aaattggcac aatatcctgc 4620 tcagacagtg ccacgtttag aactgtgtgc ggcagtccta gctgttgaac tggcagaact 4680 aattactgaa gagctggata tggaattgac aacagtaaaa ttttacactg atagtaagat 4740 tgtcttggga tacatctaca atacctctcg aagattctat gtgtatgttg caaaccgagt 4800 ggcacgaatt agaagctcca caaaacctga acagtggcat cacattagtt cagagctgaa 4860 cccagcagat cttggaacta ggtttgttcc agcagatatc cttcctcaca ccaattggtt 4920 ctgcggtcca gaattcctgt gtaagcctga cctcggagct aaggaggatt catttgacct 4980 ggtagaacca gggaaggatc aggaaatccg tcctcaagtg acaacattgg tcactaatgt 5040 aacagatcaa tctcttggat cacacaggtt caaacgcttt tcagaatgga catcacttgt 5100 aagaagtata gcaaatctca ttcatattgt gcaatccttt tctgatatga aatgtacact 5160 agattgcaaa ggctggcact tgtgccagaa aagtcatcag gttgactgga cacaagcgaa 5220 aacttttata atacagtgtg tgcagcaaga acattacagt gaggaactca aaaagttgag 5280 aaagagagaa gaaatctcac aacaaagttc tctcaaaaag ctagctccct ttgtagatgg 5340 tgatgggcta attagagttg gaggccgttt gcaattggca gaactgtcag aacaagaaaa 5400 gcacccatta gtaatcccag catgccaaca tgtctctaca ttattggtca gacactacca 5460 tgagcgggtg gctcaccaag gtcgtcattt aactgctgga gcaatacgat cagctggtct 5520 ctggattgta aagggaacaa gactaattaa cagtgtgatt catcagtgtg ttacatgcag 5580 aaagttgaga ggaagaatag aagagcaaag aatgtcagac ctacccatgg acagagtgac 5640 agtagatcca ccttccacgt acgtgggtct agatgttttt gggccatgga gcataatgtc 5700 acgccgtaca agaggtggca gtgcagaaag taaaagatgg gctgtcattt tctcctgtct 5760 gagcacaagg gcagtccatc tggaggtcat tgagtcaatg tcaacttcca gttttattaa 5820 tgccctcaga cggttcttgg cagttcgtgg accagttaaa catttgagat cagacagagg 5880 aactaatttc attggtgcat gtcgagaact aaacatcaag actgatgatc cagagcttaa 5940 gggatatttg caagagcaag gttgtacatg gtcatttaac gttccccact cttcccatat 6000 gggaggagct tgggaaagga tgatcaacac agcacgatgc attcttgatg gcctgctgtt 6060 gaaaagcagc tgcacacgtc ttactcatga agttttgact acactcatgt ctgaagttat 6120 ggcaataatg aatgctagac cgttaattcc tatttcttct gaccctgaca catttgaagt 6180 tctatctcct tcgatgctcc tcacacaaaa ggcaagtact gtaccagctt ctcttgggga 6240 ctttgatata aaggaccttt acaagaaaga atggagaaag gttcaatgtc tagcagatgc 6300 tttctggaca tcatggaaaa gggattatct tgcaacttta caggtccgaa gaaagtggac 6360 agaggaaaaa caaaacctca gagaagggga tgttgtcttg ctcaaggact ctaatgtgaa 6420 acgaaatgat tggcctattg ggcttattgt gaaaaccatg gccagcagtg atgatagagt 6480 gagaaaggta gaagtaaaga tagttaaaga aggtgtggcg aaggtatatc taaggcctat 6540 ctcagaagta gtgcttctcc tgccgaagaa gccataaagg gataaatagt agtataatag 6600 gttgttatac caggcgagga gtgtgtggtc ttctaaagcc atttgtgggt ttggcgctcc 6660 acctaccgga cattaaaggc actgcactat atgtacaatt aagattcact ttcgttttcc 6720 ttcagtgcat acagtcaagg acaactcacg cacgcggttc tttcggtgcc tgattaaaac 6780 gtggaaacct tggaaatata tgtccgtaag tccacaattt gattaaacag caattgtatg 6840 ttcaaaggtt gcatgttaaa gagtcattac tggtttatta gatttattat tcctgtattt 6900 gctctccaat cacatgttca taattagtca gacaagttca taatgctgag tggcgttaca 6960 aatatgttaa tttgctgaca tttgatacaa aatgtgtttt atttaataga tttccttatg 7020 tgtgtacaag atttgtgtaa aaaagaacta tgtacattag tagtagaata atgtctagct 7080 gattgaactg aacatgttgg gaatatcatt gtcatgtgaa taagattgta ttcttatttt 7140 tccttttcct tttgttgtag ttttacaaac agaatagaag atattgtgag gaacaatacc 7200 cagtaaagtc actcaagtta tccaactgct ttctggtgtt cttttctgga gtatgctatc 7260 tgttaaaccc tttgctcagg aaaacgcaat tggttgggac agaatactat aagagct 7317 // ID L1-1_DR repbase; DNA; ZEB; 5811 BP. XX AC AL645691; XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 18-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE L1-1_DR is a non-LTR retrotransposon from the L1 clad. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1 clad; KW L1-1_DR; endonuclease; reverse transcriptase. XX NM L1-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5811 RA Kapitonov V.V. and Jurka J.; RT "L1-1_DR, a non-LTR L1-like retrotransposon from zebrafish."; RL Repbase Reports 2(4), 18-18 (2002). XX DR Genbank; AL645691; Positions 114238 120048. XX CC This element is characterized by 12-bp target site duplications. CC It can be an active element. It encodes two proteins: CC 294-aa L1-1_DR1p (positions 172-1053) and L1-1_DR2p (positions CC 1665-5489). These proteins are most close to corresponding CC proteins CC encoded by other L1-like elements. L1-1_DR1p is a putative CC RNA/DNA CC binding protein, and L1-1_DR2p is composed of the AP endonuclease CC (aa positions 1-200) and reverse transcriptase domains. XX FH Key Location/Qualifiers FT CDS 1665..5489 FT /product="L1-1_DR2p" FT /translation="MVKPHNVNASGICQVNLISWNVKSLNHPVKRGKVLSH FT LKQLNTDIAFLQETHLKTFDHFRLRGGWVGQLFHSTFHSKSRGTAILISKT FT VSFEASKIEADPAGRYIMVVGRLNNTPVVMVNVYAPNWDDSAFFTGLFSRI FT PNIDTHHLILGGDINCVLSPSLDRSSLKPMIPSRTTQVINQLLKTYGMIDV FT WRFQNPGCRGYSFYSPVHKTYSRIDYFFLDSELLPLVSECKYNAIVISDHA FT PLLITLDMPITSNNYRPWRFNTLLLSDVEFVKFISSEIREYLVHNQTPGIS FT SSLIWESLKAYLRGQIISYSARLKKKQHERLKKIENDIFKLDEILAHSSTP FT DMFRQRLALQSEFNLLCTKQTENLLIKSRHKMYEHGEKIGKILAHQLRQQN FT AAHSIMSVNDNTGTKLTNPLEINHRFREYYSQLYTSESCKDESLFDSFFKK FT ISLPTIDQEFALDMENPFSKDEFIRAVSSMQNGKSPGPDGFPSEFFKKFSG FT ELAPILLSLYEESSVTGSLPETMNQAIISLIYKKDKNPSECSSYRPISLLN FT VDSKIFAKILAHRLEIVLPTIVSGDQTGFIKNRYSFYNIRRLLNILHHPTP FT SDVPEVLLSLDAEKAFDRVEWDYLFYTLKKFGFGTKFISWIKILYSSPMAA FT IRTNCHISPFFSLERGTRQGCPLSPLLFALVIEPLSIAIRNDINIKGIQRD FT NFEHKISLYADDTLLYISEPLTTLPQIMTLLTAFGKISGYKINMQKSELMP FT INNAGRKIIFTSLPFKITKDKFKYLGIWITNKYKHLYKVNFPPLIDSIKKD FT LERWNPLPLSLGGRINTIKMNILPRCLYLFQCIPVFLTKSFFLLLDKLISS FT FIWNGKNARIRKNILQRHRDHGGLSLPNIQQYYWAANIRAMLHWSNPSYDS FT GPNWLSLENTSNFSTSLHALLCSNFPTPEPLSKYSLNPVVKHSLKIWAQFR FT RSFALKGLSAYAPIARNHMFTPSTIDKTFDIWSMKGLKILKDMFIDGQFAS FT FQQVKVKFQIPNSHFFRYLQLRSFVSSSMSHYPSLPPPSLLDSIMELSPYS FT KGLIGKIYSIINSHNLEPLVKLKRKWEVELEIELSEDMWQSVLDNIHSSSI FT CLKHRVIQFKVVHRLHWSKVKLAKFKPNIDPNCSIEPATLSHMFWACSKLK FT KFWHLIFKFLSDALNTYVEPEAIISIFGITPQSLCFNKSKINVIAFATLLA FT RRLILLKWKEKLPPTFKQWLMELLHHLTLEKIRYTFGGCTDMFFLTWQPVL FT DHVKKMDPSVILEE" FT CDS 172..1053 FT /product="L1-1_DR1p" FT /translation="MAGKLRKYKYTGKTAANQEDANTPSMMSEPAEHTDIL FT EIKAELISSIKTEITSLFQKELKTALSNEFEMVKAELQAVKSEIASNASAV FT RSDLEAIKTTVSDMERGLSSCSDDVTELQNTVRKLEKNVVTLQEKCLDMEG FT RMRRSNIRILNVAEDPGACTPASVSKLLKDTLKMDKDILIDRSHRTLQAKR FT ADGKPRAIVAKLHYYQDCVEILRRVRETGPLHHNGATIFIFPDYPPSVARA FT RSAFNEVRKLLRGKDGVRYGILHPARLRITHNGTEKQFQDAAEALTYVKNN FT IL" XX SQ Sequence 5811 BP; 1884 A; 1132 C; 979 G; 1816 T; 0 other; gtgcccggcg ctactgagca ggcgatggag taaggtaaaa gttgggtgag ctctgcaaaa 60 aacgtaatat aacttataaa tttaaggcgc tttaatcgaa aatatgagat gctacatcta 120 tctaagactt tactacttca cataaagcag ctgggttctc tataaatcca aatggcgggc 180 aaactacgta aatataagta cacggggaaa accgctgcta atcaagagga tgctaataca 240 cctagcatga tgtccgagcc agccgaacac actgacattt tggagattaa agctgagctg 300 atctcatcga tcaaaacgga aataacctcg ctctttcaaa aagaactgaa aactgcactg 360 tcaaacgaat ttgaaatggt caaagctgaa ttacaagcag ttaaatccga gatagcaagc 420 aacgcttcgg ctgttcgctc agatcttgag gcaattaaaa cgacagtgtc agacatggaa 480 cgaggcttat ccagctgctc agacgatgtt acagaactgc aaaacaccgt gcgcaaactg 540 gagaaaaatg tagtaaccct acaagagaaa tgcttggaca tggaagggcg aatgaggagg 600 tcaaacatta gaatactgaa tgtagccgaa gatcctggcg catgcactcc agcctcagtg 660 tcgaagctgc tcaaagacac cctcaaaatg gataaagata tactgatcga tagatctcat 720 cgcactctcc aggcaaaacg tgcagatggt aaacccaggg ccattgttgc aaaattgcat 780 tactaccagg actgtgtgga aattctacgc cgggttcgtg aaaccggacc ccttcatcat 840 aacggcgcaa ctatattcat cttccccgac tacccgccga gcgtggcccg tgcaagatcg 900 gcttttaacg aagtccgaaa actactacga ggaaaagatg gtgttcgtta tggcattctt 960 cacccggcca ggctccgaat cacacacaac gggacggaaa agcagttcca agatgcagca 1020 gaagccctga cttatgtgaa aaataatatt ctttaagacg gctcgaccgt ctctgattga 1080 gtagtaccaa gccacttcag tgactttgtt ttttttcata cactcacact cccatctaaa 1140 tgaccatgtt tgagtgagta tatgaacatt acatctatgc agagactgac tgaattccta 1200 tttcatggat gaggagtgaa aaacaatatt actgatacaa aatcacttac tattttcagt 1260 taaacttaaa agcaggaaat ataattggga cagtacaatt actgagttta ttattatcat 1320 tactatgctg ttatgtaact gtcatccttg ttttaatagg tgattgttaa taatatgcgc 1380 caatttaatt tattattttt taatttcatt tttgttttct ttctttaata atgtgcagag 1440 caacttgtga ggttaaaact ccccaagtaa gcactttatg tgtggatatt gttgcgaggg 1500 gttaaagttg caccatgttc tatttggtgt ttgggaatgg gtaaatgtcg cacttcattt 1560 tatacttcta cttcttgttt tgttttctac aatcttattg gaagggtctt ttctgttata 1620 tttaaacgaa gcgtttgtat gtaagcttac attttttaag acgtatggtt aaaccacaca 1680 atgttaacgc atcaggcatt tgccaggtga atcttataag ttggaatgtt aaatctttga 1740 atcatccagt gaaacgtgga aaggttctct cacacttaaa acagttaaat acagatatcg 1800 ctttcctaca agaaacccac ctgaaaactt ttgatcactt tagactaaga ggaggatggg 1860 tgggacaact ctttcactcg acttttcact ccaaatctag aggaacagca attctcatta 1920 gtaaaacggt ttcatttgag gcatcaaaaa tcgaagctga tccagcaggc cgttatataa 1980 tggtagtggg tagactaaat aatactccgg tagttatggt aaatgtatat gcacccaatt 2040 gggatgacag tgcattcttt acgggtctct tctcacgaat acctaatata gatactcatc 2100 atcttatatt aggaggagat attaattgcg tactatcacc ctcactggat cgcagctctc 2160 tcaaaccaat gataccaagt cgtacaactc aagtgattaa ccaacttctt aaaacctatg 2220 gaatgattga tgtttggaga ttccaaaatc ctgggtgtag aggttattca ttttattcac 2280 cagttcataa gacatattca cgtatagatt atttttttct ggacagtgaa ctacttcctc 2340 tagttagtga atgcaaatat aatgcaatag tgatatcaga tcatgcgcca ttattaatca 2400 ctctagatat gccaattaca tcaaacaact atcggccatg gcgatttaat acactacttc 2460 tctctgatgt ggagtttgtt aaatttatat catcagaaat tagagaatat ttagtgcaca 2520 atcagactcc aggaatatct tctagtctta tttgggaatc tcttaaagcc tatcttcgag 2580 gccaaattat atcatatagt gccagattaa agaaaaaaca acatgagcgg cttaaaaaaa 2640 ttgaaaatga tatttttaaa cttgatgaaa ttttggcaca ctcatctaca cctgacatgt 2700 ttagacagcg tttagctctt cagtctgaat ttaatttatt atgtacaaaa caaacagaaa 2760 atcttttaat taagtccagg cataagatgt atgaacatgg tgaaaagata gggaagatct 2820 tagctcacca acttcgacaa caaaatgcag cacattccat tatgtcagtt aatgataaca 2880 ctggcactaa attgacgaat cccttagaga tcaaccatcg gtttagagaa tactattcac 2940 aattatatac ttcggagtct tgtaaagatg agtcattatt tgattctttt tttaagaaaa 3000 ttagtctacc cactattgat caagagttcg ctctagacat ggagaatcca ttttcaaaag 3060 acgaatttat tagagcagtg tcatctatgc aaaacggaaa atcaccaggc ccagacggtt 3120 ttccaagtga attctttaaa aagttctctg gcgaacttgc ccctattcta ctttccctat 3180 atgaagaatc ctcagtcacg ggctccttgc cagagactat gaatcaagca attatttctc 3240 taatctataa aaaagataaa aatccatcag aatgcagctc ttatcgacca atttcattgc 3300 tgaatgttga cagtaagata ttcgccaaaa tattagcgca tcggctggaa atagtgctac 3360 ctacaatagt ttctggtgac cagacaggct ttattaaaaa ccgatattca ttctataata 3420 tacgcagact tctaaatatc ctccaccatc ccactccatc tgatgttccg gaagtccttc 3480 tctcacttga tgctgagaag gcttttgatc gggtggagtg ggactacctc ttttacactc 3540 ttaaaaaatt tggatttggc acaaagttca tttcatggat taaaatctta tactcatcac 3600 ctatggcagc aatacgtaca aattgtcaca tttctccttt cttttcgtta gaaaggggaa 3660 ccagacaagg ctgccctctg tcccccttat tatttgcatt ggtaattgaa cctctgtcca 3720 ttgcgatacg aaatgatatc aatatcaagg gtatacagag ggacaacttt gaacataaaa 3780 tttctctcta tgcagatgac accctcctat atatatctga accactaaca actctaccac 3840 aaattatgac attactgact gcctttggga aaatatcagg ttataaaata aatatgcaaa 3900 aaagtgagct tatgcccatt aataatgctg gtagaaagat tatttttacc tcactaccat 3960 ttaaaataac taaagacaaa ttcaaatatt taggtatatg gatcactaat aaatacaaac 4020 atttgtacaa agttaatttc cctccactga tagattccat aaaaaaagac cttgaacgtt 4080 ggaatccgtt accattgtca ctgggaggta gaataaacac tataaaaatg aatatattac 4140 ccagatgttt atatcttttt cagtgcatac ccgtattctt aacaaaatca tttttcttac 4200 ttttagataa attaatatca tcttttatat ggaatggaaa aaatgcacgt atccgtaaaa 4260 atattttaca acgacaccga gaccatggag gattgtcatt acccaacatt cagcagtatt 4320 actgggcagc taatattcga gcaatgctac actggtcaaa tccatcatat gacagtggcc 4380 ctaattggtt atctttagaa aacacatcaa atttttcaac ctctctccat gctctgctat 4440 gctcaaattt tccgacacct gaacctttat ctaaatactc tttaaaccca gttgtcaaac 4500 actcactcaa aatatgggca caatttagaa gaagttttgc acttaaagga ctatcagcct 4560 atgcccccat agcaagaaat catatgttca ccccctctac tatagacaaa acttttgaca 4620 tctggtctat gaaaggtctt aagatattaa aagatatgtt tattgatggg caatttgctt 4680 cattccaaca agtaaaagtt aagtttcaaa ttccaaattc ccacttcttt agatacctcc 4740 agctgcgaag ttttgtgtcc tcctcaatga gtcactatcc ctcactgcct cccccctccc 4800 tgcttgactc tattatggag ctaagcccat actcaaaagg acttattggc aaaatatatt 4860 ctataattaa ttcccacaat ctggaacccc tagtaaaatt aaaaagaaaa tgggaggtgg 4920 agctagagat agaactatca gaagatatgt ggcaatccgt tttagacaat atccactcat 4980 cttcaatttg tttaaaacat agagttatac aatttaaagt agtacataga ttacattggt 5040 ccaaagtgaa actagccaaa tttaaaccaa atatagaccc taactgtagc attgagccag 5100 ctactttatc tcatatgttt tgggcttgtt caaaattaaa gaaattctgg cacctaatat 5160 tcaaattcct ctcggacgca ttaaatacct atgtagaacc tgaggccata atttcaattt 5220 ttggaatcac accacagtcc ttatgtttta acaaaagcaa gataaatgtg attgcctttg 5280 ctacgctttt agctagaaga ttaatattgc tgaaatggaa ggaaaaactt cctccaacct 5340 ttaagcaatg gcttatggaa cttctacacc acctgacctt agaaaaaata cgatacactt 5400 ttggaggctg tactgatatg ttttttctca cctggcaacc tgttttagat cacgtaaaaa 5460 agatggaccc ctcagtcatt ttagaagagt agaacctttc cttgtttgtt tgtttgtttg 5520 tttgtttgtt tttttttttt tttttctctt ttctcttcaa ttttttgtat tttctatttt 5580 cttttttctc cccttattaa tgagagtaac atatgtttta cttattataa ctatttattt 5640 gtattttttc ttaatattat ttatttattt tttttttttg aacctaaatg tatgtgcggc 5700 aggttttgtt tgtttgttgt gtaaaaaaaa aaaaaaattg aaaagctatt ttgtaatgta 5760 tgtttcatat ctaatatgtt caataaaaat acttttggaa aaaaaaaaaa a 5811 // ID Gypsy127-LTR_DR repbase; DNA; ZEB; 715 BP. XX AC chr18; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy127-I_DR; KW Gypsy127-LTR_DR; Gypsy127_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-715 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1287-1287 (2008). XX DR Genome; chr18; Positions 8831665 8832379. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 715 BP; 147 A; 134 C; 160 G; 274 T; 0 other; tgttacgcaa gtgcgtattg ggttgtttct gtgctgctgt gtgggtgtgt ctctcttttt 60 ccctctctcg tcctgtcccg ttcctgtttc caggttaccc cgccctaatt ggacgtgctg 120 cgtgcgtgcg ctgtggttgg tgctgggaga agtcagcatc ttttgaaaag gaaccgacgg 180 agtcagatcg ctggggggag agacggagag ttttctaaga gtgtggcgtt cgtgtgaaac 240 atatcatagt gctactggtt gtttataaat tgtgaacatt aaggtgttct ctcccttttt 300 cctctcccct ccaatttttg tgattgtcaa aataccgttt gataaagctg tgagctgagc 360 gccgtagggg ctgagcgcca tttactttga cactgatttt ctcctgtttt ttcgtttagt 420 taaggagtaa gggtagtttt tgtaatttat tttcttcatt tatttttgat tcagtaggta 480 agttatcttc agggtggagt tagtaaatgt tttgtttgtt attttggcct tgctcagccc 540 tgaagctttt tgtttgcgct catttatttg tttattgtaa atacaaacct tttccacctt 600 ctcacaataa aagaacctta aaggtcattt aaaaatcaaa cctgctaact gtttatttat 660 tttctctata atttttatgt tcgacttcag ccctagacat tttgaggtcg taaca 715 // ID DNA-TA-1_DR repbase; DNA; ZEB; 812 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 03-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE non-autonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TIR; DNA; KW DNA-TA-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-812 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1245-1245 (2008). XX DR [1] (Consensus) XX CC The consensus was built from copies which are 80-90% identical to CC the consensus sequences. TSD 2-bp (mostly TA dinucleotide). TIR CC is 11 bp long. The 5' 180 bp is 88% identical to the CC corresponding region of DNA11TA1_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 812 BP; 260 A; 111 C; 144 G; 297 T; 0 other; cactcaaaaa aatgattact gctgcttgtt caaactacct gtttaaaatg agctgaaaca 60 acacaattct tgtaatttct ttggccaatt tatttgtttt atgttcagtc cacttaaatt 120 tgtaaaccat taagttaact taagcgtttt gtgttgggac aacatgaagg aattgtgttg 180 gtccaactac ctggttaggg gatttgtagt tcccagcatg ctttgcgtgg gattgaatta 240 agagagggaa atgttgacaa taaaagtaat cttaggtgtt taaagttaat agaaagactt 300 gttagagttt aatcttcact ttagtttgaa gatttagaag attttcatgt aatgctttga 360 gttttgagat taccaccttg cagaagcacc catgctttgg gtgttggcag cttaattagc 420 aaaaatgcag tttgttgctt tgctcagtga gcaataactg tcctaaaatt agttctgaga 480 tcagtctcaa tcaactgtat atgtaactca tgaaatatgc ttaaaaatgg ctcttttagt 540 tcatttagga taacttcaaa taaaactaag agactttgaa atgtacgaca cataatagac 600 atacatgata ttatcagacc tttgagttta agttaaataa aaatacaaat gtatttaaac 660 ttttatttga taaaatcatg ttggaccaac tcaattgcat ttaattgtgt aaatagtttt 720 tttttttttt tagttggtgc taaacaaatt tattggatgg aaatcctgcc ctcaattaaa 780 ttgagttcat ccaatgagtt attttttgag tg 812 // ID Gypsy-116-I_DR repbase; DNA; ZEB; 4847 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-116-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4847 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1787-1787 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 204..4796 FT /product="Gypsy-116-I_DR_1p" FT /translation="MELDIDEFISNPSVQQFNKFRKRDLLQIADVLNISVV FT STASKQVIKSAVFIELVKQGXLCEESEQKTEMEKSAVEAAVSADSVTVESD FT LVLQIQLKELELQIKRQEHENHVLQIRKLEMTQKEASSLIDPPAVSSALTQ FT PRDERTSAVLFDPSKHIKLVPPFRESEVDAYFIAFERIAAKLKWPKDMWAL FT MLQCSLVGKAQEVCSSLPIEESLNYDSVKAAVLRVYELVPEAYRQKFRNYT FT KSVKQTFVSFARDKKVLLEKWCAASKTTTFEQLQELILLEDFKSCLPDNLV FT VYLNEQKVNSISAAAVLADEYTLTHKTAFSSASSQSYGAPVLVNTEKTFTH FT SKFGARSIEPRKEQGKNIENRRVCFYCLDPNHLIADCKLWKKKNTVKSKSV FT ANVVLETVSNSHDQSCSDASVFSPFVFKGTVSVNPDSATKTINILRDTGAA FT QSFILENVLPFSSETYSGTDVLVRGIELGCVKVPLHTVCLTSDLVSGPVQL FT GVRAKLPVEGVSLILGNDLAGGKVFTHPVVAARPVCEVELGTKFPSIFSAC FT AVTRAQARNSGSDIDLSDTFMSFPDNPTEFKLSVQPSVFKTDDVTGHEALS FT MGREQLGKAQRSDPSLVRCVEAAEGKVESDTDDGVQYFWDRGVLMRKWLSQ FT KAKEVGFSPDYQIVLPSEYRTAVLKLAHDHILSGHLGINKTFTRVAKYFYW FT PGLRSAVSDFCRACHVCQIAGKPNQNIPPAPLQPIPVMGEPFERLILDCVG FT PLPKSKSGYQYILTLMCAATRYPEAIPLRNLKAKTIVKELIKFCSTFGLPR FT VIQTDQGTNFKSKLFEQVLQGISVSHVVSSAYHPQSQGALERFHQTLKSMM FT RAHCVESNSDWAEDLPLLMFAIRETVQESLGYSPAELVFAHGLRGPLKLLS FT EQLVADNKQMVPVSEYVNSIRERLRNACEKARTNLAEAQAEMKVRYDRKSV FT SRSFQPGDSVLVLLATPGSALQSKFSGPYMVTKKLSDTNYLVCTPDRRKKS FT RMVHVNMLKAYVKSDAHLPAVKPSLVASSVADEIEQDFSVPCGRLTNSVIL FT NDLDSHLSYLSSDQKSDIVELLENHRSLFSDIPSQTTVLTHDIDVGDARPI FT KQHPYRLNPKKRDLMKAEVEYLRQNHFASPSLSAWSSPCLLVPKSDLSVRF FT CTDYRKVNAVTKPDSFPLPRMEDCIDRVGPARFVTKLDLLKGYWQIPLTSR FT ASEISAFVTPDDFMQYSVMAFGMRNAPSTFQRLMRIVLSGVEKCEAYLDDI FT VIYSSSWEEHMVSLREVFARLVNASLTLNLAKCEFAKATVVYLGKKVGQGH FT VCPVDAKIASIVEFPVPGNKRELRRFLGMSGYYRGFCRNFASVVSPLTDLL FT STERVFVWSDACDQAFRAAKDLLCNAPILSAPDFERPFKLEVDASATGAGA FT VLIQESEMGIDHPVCYFSKKFTACQRRYSTIEKEALALLLALQHFEIYLGG FT SSFPIRVYTDHNPLVFLDRMRNSNQRLMRWSLEVQEFNLDIRHKKGSQNVV FT ADALSRVHGAEVEV*" XX SQ Sequence 4847 BP; 1294 A; 892 C; 1118 G; 1537 T; 6 other; attgggggct cgtctatttg gtttttgttt gtataaatga caattgaggc cctgtttgct 60 ttttgagcat atmcaattgt tgtttatcta gtctttttgg atgtttstta ttggcactct 120 caaatatgct gtatttgaag attcagttaa ccctgtttat ttsatgtttg aaaaggaatt 180 tcagatgtat stcttgattc attatggagt tagatattga tgaattcatt tcaaatccga 240 gtgtgcagca gtttaataag ttccgtaaaa gagatctatt gcagattgca gatgttttga 300 atatttctgt agtaagtact gcttctaaac aagtaattaa atctgctgta tttattgaat 360 tggttaaaca agggttkttg tgtgaagaga gtgaacaaaa aactgagatg gagaagagcg 420 ctgttgaggc ggctgtgtcg gctgattccg ttacagttga atctgatctt gtattgcaaa 480 tacaacttaa agaattagag ttacaaatta aacgacaaga acacgaaaat catgttttgc 540 aaattcgtaa acttgagatg actcaaaagg aagctagttc acttattgat ccacctgctg 600 tatcttctgc attgactcag cctagagatg aacgtacatc agctgttctg tttgatccaa 660 gtaagcatat caagcttgtc cctcctttta gagagtcaga ggttgacgca tattttattg 720 cgtttgagcg aatcgctgca aagctgaagt ggcctaagga tatgtgggcg ttaatgctgc 780 aatgcagttt agtaggtaag gcgcaggagg tttgttcttc tttgcctatt gaagaatctc 840 taaattatga ctctgtgaaa gctgcagtgc tgcgggttta tgaacttgtc cccgaagcgt 900 atagacaaaa gtttcgcaat tacactaaat ctgtcaagca gacgtttgtt agttttgctc 960 gcgacaagaa agttttgctg gagaagtggt gcgcagcatc taaaacgact acttttgagc 1020 agttacaaga attaattctg ctagaagatt ttaagagctg tttgcctgac aatttagttg 1080 tttatcttaa tgaacaaaaa gtgaattcta tttcagctgc agcagtgttg gcagacgaat 1140 acacgctcac tcataagact gctttttcct ctgcttctag ccaaagttac ggtgcgccag 1200 tgcttgtgaa cactgaaaag actttcacgc actcaaaatt tggtgcgcgc tccatcgagc 1260 cgcgtaaaga gcagggtaag aatatcgaaa atagaagggt ttgtttttat tgtttagacc 1320 cgaatcattt gattgctgac tgtaaattat ggaagaagaa aaacactgta aaatctaaaa 1380 gtgtagctaa tgtggttctt gagacggtgt ctaacagtca tgatcaaagt tgttcagatg 1440 cttctgtttt ctctccattt gtgttcaaag gcactgtatc agttaaccct gattctgcaa 1500 cgaaaaccat taacatcttg cgagacacag gtgctgctca atcttttatt cttgaaaatg 1560 tgctaccttt ttcaagtgaa acctactctg ggactgatgt gctggttcgt ggcatagagc 1620 tgggctgtgt aaaagttcct cttcatactg tgtgtttgac atctgatttg gtttcaggtc 1680 cagttcagtt gggagtgcgt gcaaaactgc cagtagaagg ggtcagttta attttgggga 1740 atgacctggc cggtggtaag gtttttactc atccagtggt ggccgcacgt cctgtttgtg 1800 aggtagaact tggtactaaa tttccttcaa tattctctgc gtgcgcagtt actcgtgcac 1860 aggcgcgaaa cagtggtagt gatattgatt tatctgacac atttatgtct tttccagaca 1920 atccaacgga gtttaaatta tctgttcaac ccagtgtgtt taaaacggat gatgttactg 1980 gtcacgaggc tttgtctatg ggaagagaac aacttggtaa agctcaaagg tcggatcctt 2040 ctcttgtgcg atgtgttgaa gctgccgagg gcaaagtaga gagcgacaca gatgacggtg 2100 tgcaatactt ctgggacaga ggcgtactaa tgcgtaagtg gctgtcacaa aaagctaaag 2160 aagtgggttt tagtccagat taccaaatag tgttaccttc tgagtatcga acagcagttc 2220 ttaaacttgc tcatgatcac attctatctg gtcatttggg cattaataaa acgtttacac 2280 gagtggctaa gtatttttat tggcctggtt tacgctctgc tgtgtcagat ttttgtcgtg 2340 cgtgtcacgt ttgtcagatc gctgggaaac ccaatcagaa cattcctcca gctcctttgc 2400 agcctattcc agtgatggga gagccttttg agcgcctcat tttagactgc gtcggccctt 2460 taccaaaatc aaaaagtggt tatcagtata ttttgacgtt aatgtgcgct gcaacgcgct 2520 accccgaggc aataccgtta cgaaatctta aagcgaaaac tatcgttaaa gagctgataa 2580 agttttgctc cacttttggt ttacctcgcg ttattcagac agaccaggga acaaatttta 2640 aatctaaatt gttcgaacaa gtattacaag gaatttcagt aagtcatgta gtgtcaagtg 2700 cataccaccc acagtcgcag ggcgctctcg agcgatttca tcaaacgctc aaatcaatga 2760 tgcgtgctca ttgtgttgag tccaactcag actgggctga agatcttcct ttactgatgt 2820 ttgctatcag agagacygtt caagaatcat tgggttatag ccctgctgag cttgtttttg 2880 ctcatggttt acgcggaccc ttaaaattat taagcgaaca attagttgct gacaataaac 2940 aaatggtacc tgtatctgaa tatgtgaatt caattaggga acgtcttcgt aatgcttgtg 3000 aaaaagcaag gactaattta gctgaggctc aggctgaaat gaaagtgcgt tatgatcgca 3060 aaagtgtgtc tcgttcattc cagccgggtg attcagtttt ggtgttgctg gctacgccag 3120 gctcggcttt acagtcaaag ttttcaggtc catacatggt gaccaagaag ttgtctgaca 3180 ctaactattt agtttgcact cctgaccggc gaaaaaaatc tcgtatggtc catgtcaata 3240 tgcttaaggc ttatgtgaaa agtgatgctc atctacctgc tgtcaaacca agtttagttg 3300 cgtcctctgt agctgatgag attgaacagg atttctctgt accctgcggt cgtttgacaa 3360 attctgttat attaaatgac ttggattcgc atttaagtta tttatcttca gatcaaaaaa 3420 gcgatattgt tgagttactt gaaaatcatc gctctctctt tagtgacatt ccttcccaaa 3480 ccactgtgct tacacatgat atagatgttg gtgatgcacg accgataaaa caacatcctt 3540 atcgtcttaa tccgaaaaaa cgtgacttga tgaaagctga ggttgagtat ttgcggcaaa 3600 atcattttgc gagtcccagt ttgagtgcat ggagttcacc ttgtctatta gtccccaagt 3660 ctgacttgtc ggttcgtttc tgcactgact atcggaaagt caatgcagtt acgaagcccg 3720 attcattccc attgccacgt atggaagatt gcatagatcg agtcggtcct gccaggtttg 3780 tgacaaaatt agatcttctt aaagggtatt ggcaaattcc acttacgtca cgagcctcgg 3840 aaatttcagc ttttgtgacc cctgacgatt ttatgcagta ctccgtaatg gctttcggga 3900 tgcgcaacgc accgtcaacc tttcagcgat taatgcgcat cgtattaagt ggtgttgaga 3960 aatgcgaagc ttatctcgac gatattgtga tctattcttc tagctgggag gagcatatgg 4020 tttctttacg tgaagtgttt gcacgtctgg ttaatgcctc tctaacttta aatctagcta 4080 aatgtgaatt cgccaaggcc accgtagtgt acctaggaaa aaaggtaggt cagggacatg 4140 tgtgtccagt tgatgcaaaa attgcgtcta ttgtggaatt cccagtgcca gggaataaaa 4200 gagagttgcg tagatttttg ggtatgtcag gatactacag aggattctgc cgtaactttg 4260 ctagtgttgt gtctccacta actgatcttc tgagcaccga aagagttttt gtgtggtctg 4320 atgcatgtga tcaggctttt cgagcagcaa aagatttgct gtgtaacgct cctattcttt 4380 cagcaccgga ctttgagcgt ccctttaaat tggaagtgga tgcttctgct actggggctg 4440 gagctgttct aattcaggaa agtgagatgg gtattgatca tccagtgtgc tatttctcaa 4500 aaaaatttac agcatgtcaa cgcagatata gtacgatcga gaaggaggct ttagcactac 4560 ttttagcgtt gcaacatttt gagatttatt tgggtggaag ttcgttccca attcgtgtgt 4620 acactgatca taatcctctt gtttttttgg ataggatgcg caattcaaat cagcgattaa 4680 tgcgatggtc tttggaagtc caggaattta atttggatat acggcataag aaaggttcgc 4740 aaaatgtggt tgcggatgcg ctgtccagag ttcatggtgc tgaagttgag gtttaagtta 4800 gttgggtaga tacaaactgc ttgtttgcat ctttttggtg gggaggt 4847 // ID ERV1-N4-LTR_DR repbase; DNA; ZEB; 821 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, LTR - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR; ERV1-N4-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-821 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1782-1782 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 821 BP; 214 A; 160 C; 154 G; 292 T; 1 other; tgtaagctga gaaaatatta tatacttttc cttctatgct aatgaaggac tactgagaca 60 ttccaagcat gggatgggtc ctcctggtga ggcagagttc ctcacagctc cgatacacat 120 ctctgccatt tgcttacagt aaccatgtgc tcttcaccat acacaattca tctgcttgtg 180 ttcttcyttc agataacttt gtcatgtttg gtgtcatttg acagtttatg acttcagtgt 240 tgtagtgatg aaatcaacag agacattgat taattgtatt tcaatctaca atatatttac 300 tattgttggg tcatggcctg aggagagtat gttaatgtac ttctccccca cttcatctct 360 tcaactttgc tctcatgttc agctggtttt gggggttggt tttcactcat gtttgacaat 420 gctttgctca gggtataata gggaaaggta actgttgatc tgggagaatg gcttttactc 480 tgcatctccc gcacaccggt gcgtagcctc atatgctaac aaaggcagtt tcactgtctt 540 tgttgctcag cagtgcatat gttacagcta atttctttat acagtatgtt agtttgtttc 600 attctaatgt ctgtatattt aaacatgtaa cttaataaac aactttgcct gctttaagac 660 gtagagattc tttctcttta tcaatgatga taacaacttt aatggagtca gattaattac 720 aatggagtca gataaataat ggatttaaac acgccatcct tcaactgctt tctgtttccg 780 attattggtt cagaatagca gcgtaagctt caagccttac a 821 // ID HATN10_DR repbase; DNA; ZEB; 455 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN10_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000119; Dr000184; Dr000228; HATN10_DR; KW nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bao Z.; RT "Dr000119, Dr000184 and Dr000228, unclassified and RT uncharacterized repeats."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-455 RA Kapitonov V.V. and Jurka J.; RT "HATN10_DR, an ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 9-9 (2002). XX DR [2] (Consensus) XX CC HATN10_DR is a family of nonautonomous DNA transposons that CC belongs CC to the hAT superfamily [2]. CC Several thousand copies of HATN10_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~12% divergent from the consensus sequence. CC This element is characterized by 8-bp target site duplications. CC HATN10_DR has imperfect 16-bp terminal inverted repeats (two CC mismatches). XX SQ Sequence 455 BP; 151 A; 70 C; 76 G; 156 T; 2 other; taggcatggg acgataaccg ttttcaaggt ataccgcggt ttggaaaagt caaggtttta 60 aaaccgccaa aattttctgc tataccgttc ctaaggtatg tgtaagattt tttatttact 120 tttttttwgt tttttaggac aacagtatct ccagcagaaa agatatccaa agatgccgtt 180 ttaaattgta aagaaatctg tgtttttgaa actaatgaag acagcagaag tcaatgattc 240 atttgaatta tttagcctga catgtttact gytccaaaat attaaatgtt tctcaaaata 300 aaatatattg tgttcaaagg ggaaaaaagt ttttgttttt ttacccagac atttaaaaag 360 aatatatttt agagcagtaa tcacaatacc gtgaaaccgt gatattttta tccaaggtta 420 tcataccgtc agaatcttat accggcccat gccta 455 // ID hAT-N29_DR repbase; DNA; ZEB; 1580 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N29_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1580 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1299-1299 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1580 BP; 446 A; 276 C; 352 G; 506 T; 0 other; caggggcgga ctggccatct ggcagaccgg gcactttccc ggtgggccga cgtacttttt 60 gggccgggct gatcatcgtc ttatgatctg atcggcccat aaaaggctta gcagcctatc 120 gttttttttc tgccttatta attgacgctg ctgtgatctg tgactctctg aaagtaagat 180 gctttttttc ccggacagac agaccgggcc ggcccatgtg acactctgca gcccattggc 240 tcttttctgt attgacattg ggctggccca atcacaaact cctattttgg actccgtgtg 300 agcgtgcgtc gtctgtgtgt atttgtcaaa atagtcgaga gtcagagaga agaaacgcaa 360 gggaggcgca gagaaatcgc gggaaataca ccggcgtttc atttagcggt tctgaaaagt 420 tctccgttca ttctagtaca cggctaaaag cgcaaatcac gtgaccacac acatgcccac 480 tctctgccaa gagctgcagc cagtagttca gcgggtgagc ataaacccca ggtgagagta 540 tagtgcatgt cagacagttc atctgctgga tgatctcatc tcactatagt tgttaaacgt 600 gatattgaat tcatcttgtt gtctctatct aacaattctt atgtcttttg aatcacttgt 660 tctcagttaa ctgttttggc tgagtgcaaa gataagctaa tatattaatt tatgtaacca 720 catacactga ttctgcacat tgactgattg cttaccttta tcgtttaaat ttaatgtacg 780 ttatactgtt ataatggcca ttattggtta ttaaaactga tattctgcaa aagagacagg 840 gtaaagttca ttcttttcaa tggaaaaaat gaaaggagac agttatattt aagccctgtt 900 ttgttataaa tatgcagagt gaagatgatg ctcatataaa tatgcagcta cacgaggctg 960 tttggtgtct gttaatcata atatcaaaat gaaacaaatt tgacttatat tatgttttca 1020 ttgtttagat agtgaaacag caagcaggat gaggtgaaag aggttaaaaa gtaacactgt 1080 tccctttgaa gatttaccca tgcattagca ggcagttttt gttatgttta ttattgaata 1140 taggctgagt gaatagtgta ttgaataagc taaattggct gtagtgtatg agtgtgtgtg 1200 aatgagtgtg tatgggtgat ttccaatatt gggttgtggc tggaaaggca tctgctgcat 1260 aaaacatgct ggaatagttg gcagttcatt ccactgcggc aacccctgat aaataaggga 1320 cttatgcgaa gaaaaataaa tgaatgaata gtgtattttt tagaactcaa caaatatctt 1380 tatggacagt atacagaaat taaattgaga ttaaagtttg tatttattta tttatttcat 1440 atatatggct tttgtgtttt atagaatatg agttgataaa aatattggac gtgcatagat 1500 agatagcatt ttgattgaat ttagtgggcc gctctggccc aaaatgccag ggccgatttt 1560 tttgccccag tccagccctg 1580 // ID DIRS-4_DR repbase; DNA; ZEB; 6796 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 04-JUN-2010 (Rel. 15.07, Last updated, Version 3) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW reverse transcriptase RNase H; phage integrase; DIRS-4_DR. XX NM DIRS-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6796 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(10), 1271-1271 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 523..3816 FT /product="DIRS-4_DR_1p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MEGTNTPALPETPAQTQQQTQINTEAPIRGRRPIRST FT VTRTHRRTQSPSPNRNLPSPASSYASARSSSITSNKMTVSELRQSLTNAGI FT SIPTRCNKSELLKLYEAIPSPTPPPQDSRPTRSRHTPYPQPSATQHSRNPP FT GPPKKATKKTNKKQPQATGQXAPSTNQHTVNPPDNYATPGLPTPLLWPPAP FT QSSENSSPTLQAIPPTLNPPQFSLSSNLPHSSTQLIPNLSSNLPHSSTQLI FT PTQSFPTSSNALPLPANFSSTNPPFFPSTSLQAPTSITNPPQQNAFCTNTS FT SARAPFTLATATPLPIPHNAPVLEPPQISNTVRNLILSGADIDLSTLLSPI FT APPSADRQVDCGEFTITLKSPVSSQPRTLTIAEFHVAFSRYTDTICSVFPH FT RRRELNDYMAIISELALSYGGTHFYTYHKLFSAKCAIRVAQWNQCSYWGAL FT DTDLHNRVFLGCRNLSCAVCRSNLHPTTSCPFIIPSTEKELQTPRSTSYVP FT RPSTSAIPALLPPPSSQNPPSSLACNNFNAARCFRHPCKYLHICSYCGGAH FT ARVVCQVWKANKKHRSYLSTPVNISNLYHELCMHPDPNFSEFLISGLSNGF FT HPGVSTLPSYNLACPNLQSANAEPEVVEQLIKKEIDNKFMIGPFLAPPFST FT YRVSPIGVATRKFSGKKRLIIDLSSPHNSAYSSVNSIISPDEFSLNYHDID FT QAISLIKLVGRDAWLAKVDITSAFKIMPLHPEFWHLFGINWKSQFYFAVRL FT TFGCRSSPKIFDMLSEALCWILANNYGIPHVVHLLDDFLIISPPNSPPAKH FT LEITKAVFAKLGIPLAEEKTAGPSTFIEFLGINLDSNKFQASLPKEKVDRI FT ISLSSIFLEKQECSKRELLSILGHLNFAMRIIPQGRPFVTHLLQLAASVQS FT LEENISLSDPCRNELSLWISFLKCWNGCSFFYSDLISSPVDIHLYTDAAPS FT IGFGGYYQGRWFASDWPPQMLEVPSHQYSSALFELYPIVVATLLWGDEWSA FT SSILIHCDNEAVVHCINRGRSHSPALMPLLRRLIWTSAKKQFILTAVHVPG FT FHNQIADSLSRLHFQKFRELAPEAEQHPTPIPPYSEMIFQ*" FT CDS 3825..4916 FT /product="DIRS-4_DR_2p" FT /note="tyrosine recombinase." FT /translation="MHDLHQASISLIMQAVAPRTLQAYLTAWKTFKHFHSL FT YNTTFPNFSLLTITSFITYLHSHKHIQANSIKSYLSGIQFFHKLMYGSSSE FT SITNSQTSLLIKGIQKTRPPLPDTRLPITHNILAKCISTLRKGYFSFHTDH FT TLDAMFILAFFGFLRCSEFTVTSKFDPSIHPTIADLTLIDEETIAFLIKQS FT KTDQSRKGHYIYIFNIPSPTSPFQTLLAYTHYRKTLSASPLDPLFIDDTHH FT PVTRFWFQKHLKYVLTNSGFPSESYSSHSFRIGAATTAAHKGLSQQHIQTL FT GRWSSDAFKTYIRLSHSHLREAQRTLTSRCSYPSGQRHEPSTRKEHNPAIP FT ASRGQRNYQVSQGRGHDPAI*" XX SQ Sequence 6796 BP; 1875 A; 2179 C; 1015 G; 1725 T; 2 other; aagtgaagtt tttaaactaa tttcgagagg agcacgtgat ataattgacc gcagctggcc 60 gccwatctac actcattagt tagccaatca gatctattcc aaattactat aaatagccta 120 gctagatatt actcccttac cttcgttttc cgaagaacaa ggacaaaccc tgctcctaac 180 aaactaccga aaccctcgac aacaaacaac aacaacaaca acgacaacag ctacaacaac 240 agctacaaca acagctacaa caacagctac aacaacagct acaacaacag ctacaacaac 300 agctacagct acaacaacag ctacaacaac aacaacaaca acaacaacag caacaacaac 360 agcaacaaca gcctacatca acaataacag ctaacgcttc aacaacaaca gcttcaacaa 420 caacagctac tacaacaaca gctacaacaa aacagcgaca acaacagaaa gaactcacaa 480 cctaaacaga acaatccaac atcaaagcca tcaacaagca acatggaagg aaccaacaca 540 ccagctctcc cggaaacacc agcccaaaca caacaacaaa cccaaataaa caccgaggct 600 cctatcagag gccgaagacc catccgctcc acagtcacaa gaacccatcg ccgcacgcaa 660 tctccatctc caaaccgcaa cttaccgtct cccgcttcat cctacgcctc tgcaagatct 720 tcatccatca catccaacaa aatgactgtt tctgaactcc gccagtcact cacaaacgcc 780 ggaatttcca tccccacccg ctgcaataaa tccgaacttc tgaaactgta cgaagccatc 840 ccgtcaccaa ctccgcctcc ccaagacagt agaccaactc gctcccgcca caccccctat 900 ccacaaccct ccgctactca gcactcaaga aacccccctg gaccacccaa gaaagcaacc 960 aagaaaacta ataaaaagca acctcaagct acaggacaga magcaccttc taccaaccaa 1020 cacacagtga atccaccgga caattatgcc actccaggac ttcccacccc cctcctttgg 1080 cctccagccc cacaatccag cgaaaactcc agtccgactc ttcaagcaat tcccccgact 1140 ctcaaccctc ctcagttctc tctttcttct aatctccctc attcttcaac tcaacttatt 1200 cccaatcttt cttctaatct ccctcattct tcaacccaac ttattcccac tcaatccttt 1260 cctacaagct ccaacgctct ccctctgcct gctaattttt catctactaa tccccccttt 1320 tttccctcta catccctcca agcacccact tccattacta accctcccca acaaaatgct 1380 ttctgtacta acacatcttc cgcacgagcc cccttcaccc tagccacagc cacacccctt 1440 cccattccgc ataacgctcc agtcctggaa ccacctcaga tctccaacac agtcaggaac 1500 ctcatcctat caggtgcaga catagacctc tctacactcc tttcacctat tgcacctccc 1560 tcggcagatc gacaggtgga ttgcggcgaa ttcaccatta cacttaaatc accagtcagc 1620 tctcaacctc gcacactcac aatagccgaa ttccacgtag ctttctcacg ttatacagac 1680 accatctgct ctgtctttcc ccataggagg cgcgagctga acgactatat ggctatcatt 1740 tcggagctcg cactctccta tgggggaacg catttctata catatcacaa attattttca 1800 gcaaaatgcg ctattcgcgt tgctcaatgg aatcagtgtt cttattgggg ggctttggac 1860 actgatctcc ataacagagt ttttctagga tgccgcaatc tttcctgcgc ggtctgccgc 1920 tcaaaccttc acccaaccac ttcctgtccc ttcataatcc cctccactga gaaagaacta 1980 caaaccccaa gatccactag ttacgtaccc cgcccttcta cctctgctat ccctgctctt 2040 ctcccccctc cctcctctca aaaccctcca tcatctctag cttgcaataa ctttaacgca 2100 gccagatgtt tccgccaccc ttgcaaatac ttacacattt gcagttactg cggtggcgct 2160 catgctcgag tggtctgcca agtgtggaaa gcaaataaaa aacatagatc ctatttgtcg 2220 actcctgtca atatttctaa tctttaccat gaattatgca tgcaccctga tcctaacttt 2280 tctgaatttc tcatttcagg tctgtctaat ggattccacc ccggtgtttc gactcttcct 2340 tcctataacc tcgcatgtcc taaccttcaa tctgctaacg ctgaaccaga agtggtggag 2400 caattaataa agaaagagat cgataataaa tttatgatcg gtccctttct tgcccccccg 2460 tttagcacct atcgagtcag cccaattgga gtagcgacca gaaaattttc gggcaaaaaa 2520 cggctaatta tcgacctgtc ttctccccat aattccgcct attcaagtgt caacagcata 2580 atttcacctg acgaattctc tctgaattac cacgatatag accaagccat ttctttaatt 2640 aaactcgtcg gacgcgacgc ctggctcgcg aaagtagaca tcacgtcagc tttcaaaatt 2700 atgccattgc atcccgagtt ctggcatctc tttggcatta attggaaatc ccaattctac 2760 tttgcagtcc gtttaacctt cggctgcaga agtagcccca aaatcttcga catgctttca 2820 gaagcattat gctggatcct cgctaacaat tacggcattc cgcacgtagt ccacctacta 2880 gatgatttcc tcataatttc ccctccaaat tccccacctg ctaaacacct agagattacc 2940 aaagcagtgt ttgccaaact cggcatccct ctagctgaag aaaaaaccgc cggccccagc 3000 accttcatag aattcttagg catcaatttg gactctaaca aatttcaagc atctttaccc 3060 aaagagaaag tcgatcgcat catttctcta tcttccatat ttttggagaa acaagaatgt 3120 tctaaacgcg aactgctgtc aatattagga catttaaatt tcgccatgcg catcatacct 3180 caaggacgcc cgttcgtcac tcacctcctt caactcgcag catcagttca gagtctagaa 3240 gaaaatatat ccttatccga tccatgccga aacgaactca gcctctggat ttccttcctt 3300 aagtgctgga acggctgttc tttcttttat agtgatttaa tttcatcccc cgtagacatc 3360 catctttata cagacgctgc accctccata ggatttggcg gttactacca aggccgctgg 3420 ttcgcatccg attggccccc ccaaatgtta gaggttccat cacaccaata ttcatctgca 3480 ttattcgaac tataccccat agtcgtcgcg accctattat ggggagatga atggtctgct 3540 tccagcattc tcattcactg tgacaatgaa gccgtcgttc actgcattaa tagagggcgc 3600 tctcactccc ccgctctaat gccgcttctc cgtcgcctta tttggacttc agccaaaaaa 3660 cagtttattt taactgctgt acatgttcct ggttttcata atcaaattgc tgactctctc 3720 tctcgtcttc attttcagaa attcagagaa ttagcgccgg aggcggagca gcacccgacg 3780 cccatccctc cttattcaga gatgatattc caataaatca tcccatgcac gatctgcacc 3840 aagcatccat atctctcatt atgcaagcgg tggctccaag aaccttacaa gcttatctca 3900 ctgcatggaa aacattcaaa catttccatt cactatacaa cactacattc cccaatttct 3960 ccctacttac aatcacatca tttatcactt accttcattc tcacaaacat atccaggcaa 4020 actcgattaa gagctattta agtggcattc agttttttca caaactcatg tacggctcca 4080 gttctgaatc tatcactaac tcacaaacta gccttcttat taaaggcatt cagaagaccc 4140 gcccccccct cccagacaca aggctaccca tcacacacaa catactagct aaatgcattt 4200 ccacactcag gaaaggctat ttttcatttc atacagatca taccctagat gcaatgttta 4260 ttcttgcctt ttttggattt ctaagatgtt ctgaatttac agttacatct aaattcgatc 4320 cctctatcca ccctactata gcagatctga ccttgattga tgaggaaaca attgctttcc 4380 tcattaagca aagtaaaaca gatcaatcca gaaagggaca ttacatctac atattcaaca 4440 ttccctcccc cacaagccca ttccaaactc ttctagctta cacacactac aggaaaacac 4500 taagtgcaag tcccctagac ccccttttca tagacgacac acaccaccca gtgacacgct 4560 tttggttcca aaaacacctt aaatatgtcc taaccaactc aggcttccca tcagaatcat 4620 actccagtca ctcattcaga attggagccg ccactacagc agcacacaaa gggttatcac 4680 aacaacacat acaaacacta ggaaggtggt cttctgacgc cttcaaaacc tacatccgac 4740 tcagccacag tcatctcagg gaagcccaga ggaccctcac cagccgttgc agttatccca 4800 gcggccaaag gcacgagcct agtacaagga aagaacacaa cccagctata ccagcttccc 4860 gagggcagag gaactaccaa gtctctcaag ggcgcgggca tgacccagcc atctaatttc 4920 ttttcttcct tccagctgat ttgcactcag ccttctcccc cttttcactc acccaactac 4980 agtaagagtt ctttccctgc ccaagccccc ccgtcacccc cgccccccct ggccgctgcc 5040 acagaagttt tcactgccca ccaacttttc taacttctgt aggaccccgc cccccccatg 5100 gctctggccc ccgcaggggc gttaccccga gcttcgattc ccgcaggaat catctcacgg 5160 cccggcctta gctttttata ttatatgtca tttcaatgac atatagtata gcactattta 5220 ttttctcttt gtttgatttt atttgtataa attcacatct atatgcaccc acgcatataa 5280 atatgaattt atatatagtg ctgtcaccct cacgctctgt tcccgcagga acacccccag 5340 agcacctata gcgcccgtca ccctccagaa agagtcttca ccttcccctc atctttccag 5400 actcctactg gagccagcca aatagctcac ccagccccga gctgtgacac tagccatgtc 5460 accgaccctc cccggcccta gcttttattc atttatttct gtttatttct cacatttatc 5520 ttttattttt taaatttatt tatatatatg tatatatata tgcacccacg catataaata 5580 tatatttata tatagtgctg tcaccctcac gctctaactc ccgcggagtt aatcccgagc 5640 accggacccc cgcaggggtc atcgcccaac tgccatttca ccctccagct ggggcttcac 5700 cgaccattcc ttttcccgac tccagctgga gatggcacat acagctctct ctcccgcagg 5760 agagccaaga gcttcgactc tctcaagagt cagtaaaaaa cgcccccccc caaggcccta 5820 gattacccca ttatatattt atatatctat atgtttcata taaatatata attatatata 5880 gtgctgccac ctcccagctc aatctccgca aggagtgttc ctcgagcaaa ttactccttt 5940 ggagtccccg cccccccctg ccccccttca cccccctctc cagccggagt ccttcactcc 6000 ccttcccttg taatgactcc agcaggattc ccgcccaccc catggctctg acccccgcag 6060 gggtctcccc gagtctctac tccagcagga gtattcacag cccaagccaa ctctgctcgg 6120 gttcccgcag gaacctgtgt caccctttgc tccaaggagc cctctttaca tttcctttca 6180 aataactata tccagcagcc ggatatagca tttcaagcct tttggggagt ttcttcgaat 6240 acacggctgc tgtcccgagc ttcatgcatt tggggagctc tcgagaacca cctgatctcg 6300 tactcccctt acatgctcta tggacctggc gggagccctg ggctcaacta tctccgagct 6360 cagggttctc tcccgggaca gcatgccaaa cctgcttaca gtcgtcaagc aatatctaag 6420 tgtgaactct tgaagtgaag tttttaaact aatttcgaga ggagcacgtg atataattga 6480 ccgcagctgg ccgccaatct acactcatta gttagccaat cagatctatt ccaaattact 6540 ataaatagcc tagctagata ttactccctt accttcgttt tccgaagaaa ccccccatcc 6600 accccctatc tcctcctttc ctccctttaa aaaggggagc tctcgagaac cacctgatct 6660 cgtactcccc ttacatgctc tatggacctg gcgggagccc tgggctcaac tatctccgag 6720 ctcagggttc tctcccggga cagcatgcca aacctgctta cagtcgtcaa gcaatatcta 6780 agtgtgaact cttgaa 6796 // ID DNA-8-20_DR repbase; DNA; ZEB; 1311 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-20_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1311 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2143-2143 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1311 BP; 362 A; 292 C; 248 G; 409 T; 0 other; tagggatgca ccgaaatgaa attcttggcc gaagccgaag ccgaataata atgaattgct 60 tggccgaagg ccgaataccg aatgcggtgt tttgcatttt tttccattta gtttgccaat 120 tttttcacca ttacaataat taaatagtaa aaattagctt tttactattt tgtgttgctt 180 ttcagagaaa taaatcaaat aacaaaaata caatttcaaa aatatttatt taacactgaa 240 cattttttta acattccagc agataatata caaacaaagc acaatataac ttaaaataaa 300 taaattagta aaataaaaaa ataattttta tttggccatc tttgaagccc cccttcttga 360 atagcctatg ttaggcctat aactgactgc tgaaagaatg taactgtatg tctacaacaa 420 caaatgtgca ttgaatagtg caaaaaatgt ggatgacaac aaatgaataa ctgtcctaga 480 cataagccta cttagcctac ttcttcagga aaagtggcag gttcttcttt ataaaaagta 540 gcttctctgc tgtctcgcat gaaagtcggt tcctcttctc atcgatgaca tgagatgcag 600 cactaaacag tctctcgctg tcggctgtcg gtgcgcggct catctgtagt acgagcccgt 660 ttactctctg cgctgttttc atctcctgcg ctgtgcatca cctgaccgtc cgtctccaat 720 acctagcggc tttcccaagt ccaactcggc ctggatcatt tctcgtgtgc gcagcttttt 780 ccccatatcc aagtaatgat attttatatc gcggatcaag cacagtcgcg atgaagtaca 840 ggtgttctga gtccgcctga ctaaatcgtg cgctgacagc ttctaagagc gcacttttag 900 ctgttttcac tccgtggtct gtttcagcct ctttgtttaa aagacgcttt agtgctgcaa 960 gtaaatgtat gacgtctgcc acagatgagc ttatctctct tgttatctgc tcaaaggggg 1020 cgagaagaga gagaacgttc ctctggacaa ttcagcagaa cagtgtctgc aaacggccgt 1080 ttttgcctct ttctcttaca ctttaaagcg cttccacact gctgacatgt ttgctgcata 1140 aagcgcgcgc ctctcactct acgcgggtta ctttttgatc gcggcattaa aaacgtcatt 1200 gttcggcaaa atttattcgg cctttttact tattcggccg aacaccgaaa gggctttttt 1260 gactattttc ggccgaataa tttcggttgc cgaatattcg gtgcatccct a 1311 // ID DNA-2-8_DR repbase; DNA; ZEB; 2496 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2496 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1221-1221 (2008). XX DR [1] (Consensus) XX CC TIR is 40 bp long. TSD is 2 bp long, mostly TA. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2496 BP; 842 A; 433 C; 388 G; 833 T; 0 other; cactgttaaa aattgtcccg ttaaaaaaca gtaaaatact ggcagctgca gttgccagca 60 atgtactgtt attttacagt atgttgctgt aaaaatacat ggactacatt gatttacaca 120 atactcaagt gaactcattt tgctcactga aagcaactgc ctcaacttgg tcaactgctg 180 aatgagttta tgaagtaatg tgctatatat gcttagaagc atacttataa tgataactta 240 ttttagttaa ttagaaaagt tcggtttaac tctaatgtct tatttttcac aacagcacac 300 atacatgtaa atctggaatc accagagaga ttctgactgc atcagcaact aaacagccgc 360 tatctttaaa tagagaaaca tgcagaataa catcagcagt tcattgttca tctttaactc 420 atacactggc tctactctcc tccacaacgg tgacagacag aagaaattag cttcaggttt 480 tacagtaaaa tactgttttt tccttgattt aacagtaatc tactggcagc tgtggttgcc 540 agcaatatac tgttttttta cagtctactt ccgttgtgta aattaacagt atattactgt 600 taaaatacat ttttacactg tgtttttacc tctcacacct ttaacccttt aaaccccaga 660 gcatatactt cagttttaat tgaagtgtcc acactactga gtgttcaaga aacatggagc 720 aaagctgaag taaattcatt aattaactga ctaattaaat gataattgag gattaacaat 780 gaacaaatga agaaatactg aagggaaaaa acaagaacag aacatacaaa actttagtca 840 cagctttata atgaaataac ctgaagaaca acaaatgatt aaatcattta aatgatcagc 900 ggatgattaa acaactctac aaacatcatc accagctaca cttattactt aatacatgta 960 tttttgtata acatctacta aagttcttct tgagaaaaag ttgaaagttt tacgtcacca 1020 tcatggagaa cagagtttgc tttagttggg ctcttagccc tgtcattttt aacatttata 1080 tatcttggct gctgcaattg ttaagaactt tgtttaaaaa gctcctttgt tgtggcaagc 1140 cagccaaaaa cctaaataag atctggtgat gttcatgttg ctattaaatg gcagagtctg 1200 ttctcattta gtataataaa atttactgct cctattcaat gttaaatcta ttgttttaca 1260 ttatatgtat atatatgtat ggcaatgatt tctggaagtt tttaagaata tcttggacaa 1320 taacactgct ctatggtgta aatcgcactc aaagagacat caataatcag catatgaacc 1380 tcaataatgg tgacaacaaa tttaacaaaa tttaacaaaa ttaacagttt aactcacaaa 1440 caggcaactg aaagtctaaa cataaacaac agcagaatca aacacaaata aagaaaactg 1500 ttagaccatt atacaacatt tatttatacc gcaatgcatg ctgggatatt tgtacagtgc 1560 cactcccagc atgcattgca gcatgaaaca tttgagatgt taccattgtt gagatacaag 1620 gtcagattca tgaggtctga gtgtgtattt gtcataactg aactgttttt gtatgttatt 1680 tcttaactgt taagtaatta cggaggtatc ttttctgttt ccacttctca ttaattaaat 1740 taatacctgc aactaagcat aaaatctatt gcatgaggcc cttcttccag atttatacca 1800 aaacatttat cagaactaat ttcagataaa tagatttctc tatttattga cttcccaact 1860 ttattgctat agtacaaacg ttttgtaaac tctgtattac agcagttgga aagtcttctt 1920 aaatgagttc aagggtgaag agcccaacta aaccagcccc tcactccatt acggtgacac 1980 aaaaaacacc ttaatttctg ttggatctca atgagaatag tatggaagtc aaatcagtca 2040 gtaataagtg tgtgtctgct gttgtttgtg gagttgttta atgatcctct gctgagactg 2100 aagctgtttt attttatttg attttattta atgttgtaac tcaagtcact gtttgtgttt 2160 cttgtttatt ttcttcagta attgtaatta ttaacagcag gtgttcatca gtgtctgaat 2220 tcactcatta gtgttcatta aaactgtcag gtgaactata ttagttaact agtgtatgaa 2280 atagtgaaca aggacacaaa gtgtgatttc aaacacagac ttcaaaacat cgaatctgaa 2340 ctctgttagc tcacagtttt ctgcagttgg ttactttctc gaaaacaaaa atgttaccca 2400 gaaagcactg tttttaacag aatattactg ttttagtgaa aaaacattat tttaccgtag 2460 aatctgaccg ttttttaaca gcaatttttt acagtg 2496 // ID LTR-8_DR repbase; DNA; ZEB; 612 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Putative long terminal repeat (consensus). XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW solo-LTR; LTR-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-612 RA Bao W. and Jurka J.; RT "Long terminal repeats from zebrafish."; RL Repbase Reports 8(11), 1771-1771 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 612 BP; 207 A; 119 C; 91 G; 195 T; 0 other; tgttggaata tttaatcaat tccatgcgtg ttcaataaaa aaagggtcag gagagggttt 60 tcttcaatgt caaaaatatt agtaatttat tggatacaaa tgaataattc gattcacaga 120 attctggtat cctcaatgca atgcaagaat tacattcagg aggtgcgcaa caattttcat 180 ttcctgactc caacttttat aggcagctga tattaacagg tggagtttag tgaaattcgt 240 cacttgtcaa tcattctcca gtcctccatt ggccgcaccc atacatacgt cctctttttc 300 tacgaattct ctgcacaaca ttaaaagcat aagactttct caactgcaca tattctgtgt 360 tcagtttcaa cccatctctg ctcacaggaa gtttagctgc agagctcaag ttaattttag 420 acaatcaggc ctttgcattt ctatcagctt catctacttc tgcaaaaatg cttcattagt 480 atgcaaggca tatcacactc aagagaagaa gtttagttaa tatatcacct ttaagcaata 540 caaatacaat tgtttaatga aaagaaaaag agacaaaaat aataaaaata tatatattcc 600 tttttcacaa ca 612 // ID hAT-N21_DR repbase; DNA; ZEB; 1846 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.09, Created) DT 03-OCT-2008 (Rel. 13.09, Last updated, Version 1) XX DE hAT-N21_DR is a nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Interspersed repeat; KW hAT superfamily; nonautonomous; hAT-N21_DR; hAT-2-DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1846 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Direct Submission to Repbase Update (03-OCT-2008). XX DR [1] (Consensus) XX CC consensus built from dozens of 2-3% divergent copies. TSD=8 bp, CC showing NNNTANNN specific pattern. The TIR=16 bp. The left 1-76 CC bp is 76% identical to that of hAT-2_DR autonomous hAT CC transposon, therefore, classified as hAT superfamily. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1846 BP; 644 A; 294 C; 321 G; 583 T; 4 other; cagtggttcc caacctgggg tccgcgcccc cctaaggggg gcgccagagt tcacaagggg 60 ggcgcgggag aggataagtg ttgaggtaga aaaaggcata aaaatgctcc actattatat 120 agggctgtac aattaatcga aaatccgatt tagatttcgr tttcaaacga ttataaaaaa 180 gcattaatcg agataaatga ttatatgcat catataccgc cccctttcca gttgtacacg 240 tttgttgctc ttaaaagcgc gaaagaccgc gtgcttatgt gtgtgtgtgc agcacgcaca 300 cgcagtcaga agcatgcggc cggtcagcat tcgctctcat tcacacactg agacgagcag 360 aggagcgcag acatctaaag tcatcttaac gtgagcgctt taatggtcaa ataggtgtca 420 aaacgcggtg tttagtgatt atttatatta accctcattt gtgtaataaa caaacgagtt 480 gaggatcaaa agacgtgaaa gagaaaccat taaagagaca gcgcgaaata ttcctgctgc 540 cgcctgtttt tatcattatt aatcaaacat aacgagaaaa tgcctcgctg ctcttgactg 600 aaggactgct gtagctaaag tgtttttctt acagtgaaga tgcttaaagc acagtttgtt 660 ttatattttt attctattgt atttatttct ctttcctttg cagggtggaa aataactgta 720 tgtatacact tgaagtcaga attattagcc ctcctgaata ttagcaaccc ttttcctgcc 780 caatttctgt ttmatggaaa gaagtwtttt tttaacttat ttctgaacat aatagttttg 840 atatctcatt tctaattact gatttctttt tatctttgct atgatgacag cacataatat 900 tttactagat attttttcaa aatacaagca ttcagattaa agtgcagttc aaaggcttaa 960 ttagggtaat aaggcaagtc attgtataac agtagtttmt tctgcagaca atcaaaaata 1020 tatattgctt aagggggcta ataatattga ccttaaaatt ggtttcaaaa tatttaaacc 1080 tgtttttatt ctagctaaaa taaaacaaat aagcctagaa gaaaaaaaat attataggaa 1140 atactgttaa aattccttgc tctgttaaac ataatttggg aaatatttat taaaaaaaaa 1200 aaaattctca ggagcgctaa taattttgac tttaactgat aatcgttttg aataatcgtg 1260 attacaatta tgaccaaaat aatcgtgatt atgatttttc ccaaaatcga gcagccctac 1320 tattatatat gtatttttaa gtaccaaaaa aaaaaaaatc taaacacatg cttaaaattc 1380 caacataata gtaaaaataa ttaaaataaa taactttaaa taattattta aattttgctc 1440 actcgcgcaa tgtgcttgtc caacgtgtcg taaaggcaaa atcaaaacaa aaatggcaga 1500 gaaacgtaaa tgcagtgatt cttcagaggc gaagaaaaag attagacagt atgacaaagc 1560 ctacctaaat tttggtttta ttgaaggcca agacaagtta aaactgatta attaatattt 1620 tctatacata ttactatata ttaatattac acaaacacac acatacagta tatatatata 1680 tatatatata tatatatata tatatatata tatatatata tatatatata tatacacaca 1740 cagtacatat gtgtgtgcgt gcgtgtgtat ttatatggtg ggggggctcc aatgtttgta 1800 tatgttcatg ggggggggcc ccaaagaaaa aggttgggaa ccactg 1846 // ID BRSATI repbase; DNA; ZEB; 197 BP. XX AC . XX DT 22-DEC-1995 (Rel. 1.11, Created) DT 09-JUN-2000 (Rel. 5.05, Last updated, Version 2) XX DE Zebrafish satellite type I DNA. XX KW SAT; Satellite; Simple Repeat; BRSATI; RFAL-1; Repetitive DNA; KW satellite type I. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-190 RA Ekker M., Fritz A. and Westerfield M.; RT "Identification of two families of satellite-like repetitive DNA RT sequences from the zebrafish (Brachydanio rerio)."; RL Genomics 13, 1169-1173 (1992). XX RN [2] RA He L., Zhu Z., Faras J.A., Guise S.K., Hackett B.P. RA and Kapuscinski R.A.; RT "Characterization of AluI repeats of zebrafish (Brachydanio RT rerio)."; RL Mol Mar Biol Biotechnol 1(2), 125-135 (1992). XX SQ Sequence 197 BP; 57 A; 38 C; 33 G; 65 T; 4 other; ctcattttca acgtccaatt cagaatgtna taaaaccaga tccagccant aaaatgcatc 60 attctttttt gttttagaca acatttcatg cactgttaaa catgttaaag caagttgcaa 120 gtgaaaatct atgtctctga ctgagtttgc attactgntg atttngacct ctctgctggc 180 tgactggctg agataag 197 // ID Gypsy38-LTR_DR repbase; DNA; ZEB; 416 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy38_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; KW endogenous retrovirus; Gypsy38_DR; Gypsy38-I_DR; Gypsy38-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-416 RA Dib M.R. and Naveira H.F.; RT "Gypsy38_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 807-807 (2007). XX DR [1] (Consensus) XX CC Gypsy38-LTR_DR is a long terminal repeat of the Gypsy38_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy38-I_DR. XX SQ Sequence 416 BP; 81 A; 69 C; 78 G; 188 T; 0 other; tgttatgggt tttgtgattg gttcgttctg tttcctgggt tttgctttct gtttcttaac 60 aggtgactcg ttatgatgca gcaggtgcgg ctaatcttca ggctttataa aaggagcttc 120 caaactgtgt tggagagact gctttggagg accgtttttt cgtttgactt cctttgcctt 180 acccaccgct tatattaatt ttgtttattt gaactttgct cctgtataat ttgtcgtatt 240 gaatattgtt atgttagaca attaatttgt ttactttgat accccttttt gcacattatt 300 ttattttatc atttttgtct ttttgtaacc tttaaataaa tatctttttg cattattata 360 ttcaacttgg tttgggttcc cttcattttg ttgcgtgctt gagccggcac gtaaca 416 // ID Gypsy-20-I_DR repbase; DNA; ZEB; 6048 BP. XX AC chr12; XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-20_DR LTR retrotransposon - a DE fossilized sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-20-I_DR; Gypsy-20-LTR_DR; Gypsy-20_DR; KW endogenous retrovirus; gag; integrase; protease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6048 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-20_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 13-13 (2005). XX DR Zebrafish.; chr12; Positions 34568715 34574762. XX CC Gypsy-20-I_DR is an internal portion of the Gypsy-20_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-20-LTR_DR. The genome contains one one copy of CC Gypsy-20-I_DR that is flanked by 3% diverged LTRs. The CC internal portion contains two ORFs encoding Gypsy-like gag and CC pol proteins. The second ORF contains several stop codons. The CC first ORF encodes the 551-aa Gypsy-20_DR1p gag (pos. 170-1822). XX FH Key Location/Qualifiers FT CDS 170..1822 FT /product="Gypsy-20_DR1p" FT /translation="MDFCNRFDVDGQNSLYVTGVEGSCSMEVISGFFEVNG FT EIEKVVRVPDEPGQTVGRVFIQYSSESSILKIEPDSLGDLPSPNDTAVTWH FT VRTVRDISQEKLGEELAQRYLDELNTIPGSGRAAFLLALQKELQSVQLHSG FT YLQSPEAGPHTTVHSPHDLGKINVEIDHAESIAAQPAVTNFSPSYVSHSPV FT HIDESMLNPPQIQKVVVEHVIRNESAHSPMRQSRIRTFSGRTPKPNGEVDY FT ETWRTQVDLLLSDPSSPDPQKVRIILESLLAPAADIVKHLGLNSSPSSYLN FT QIESAFGEVEDGEELFATFLNLNQNAGEKPSAYLQRLHTLLTRAISRGGAN FT ATDSRKHLLRQFCRGCWDQTMIIGLQLEHLKDHPPSFSELLLSVRTEEDKR FT AAKLDRMKKHLGSTRAAAHAHSVYGMPVNVEAQSEPFKKCQQDETQKLKGE FT IATLKKQVAYLLKKGEVKECEIKSEKNSREDTVTGDCLVVSSNVPSEFTTM FT PRPWFCFKCGEDGHIANRCSKEPNPELVRKKNLQLRQRREKYRELHETSPL FT SLNL" XX SQ Sequence 6048 BP; 1776 A; 1242 C; 1380 G; 1650 T; 0 other; taaaagtggc gagccagccc aggagagctt tggttatttg ttacatgtag taatagctgc 60 tctagcgaac tgctctaatt ttcttaaata ggttttcatt ttattgtttt tggagagaga 120 ttttaaattt tccaattgat ttgttaaaac ttctattctc cagaaagaaa tggatttttg 180 caatcgtttt gatgtagatg gacaaaacag tttatatgtt acaggggtag aaggatcctg 240 tagcatggaa gtcatctcag gttttttcga ggtaaatggg gaaattgaaa aagtagtcag 300 agtccctgat gaaccaggcc agacagtagg tagagttttc attcagtatt catctgagag 360 ttcaatttta aagatagaac ctgatagttt aggagacctg ccgagcccaa acgatactgc 420 tgtaacatgg cacgtgagaa cagtccgtga tataagccaa gagaaattag gtgaggaact 480 ggcacagaga tatttagatg aactgaacac tattccagga agtggcagag cagcattcct 540 gcttgcttta cagaaagagc ttcagagtgt ccagcttcat tcaggctact tacaatctcc 600 agaggctggt ccacatacta ctgttcacag tccccatgat ttaggaaaga tcaatgtcga 660 aatcgaccat gctgagtcaa ttgcagctca acctgcggtt acaaactttt ctccctcata 720 tgtatcacat agccctgtac atattgatga aagtatgctt aaccctcctc agattcaaaa 780 ggtggttgtg gagcatgtca tccgtaatga gtctgcacac tcacccatga ggcagagccg 840 gattcgcaca ttctctggca gaacccccaa gccaaatgga gaagttgatt atgagacatg 900 gcgtactcaa gttgatcttc tcctttctga cccatcttca cctgaccctc agaaagtgag 960 gataatctta gagagtctcc ttgccccagc cgctgacatt gtcaagcact taggtttaaa 1020 ttcttcacca agttcatacc tcaaccaaat tgagtctgct tttggtgaag tagaagatgg 1080 agaagagctc tttgcgactt ttcttaactt aaatcaaaat gctggcgaaa aaccatcagc 1140 atacctgcaa aggttgcata cactacttac tcgcgctata tctagagggg gagctaatgc 1200 aactgactcg cgtaaacatc tgcttcgcca gttttgtaga gggtgctggg accagaccat 1260 gattatagga cttcagctag aacacctcaa agaccatccg ccatctttct ctgaactttt 1320 gctctctgtt agaacagaag aagataaaag agcagcaaag ttagacagaa tgaagaaaca 1380 ccttggtagc actagagctg ctgcacatgc tcactctgtg tatggcatgc cagtcaatgt 1440 agaagctcag tctgagcctt tcaagaagtg ccaacaagac gaaactcaga agcttaaagg 1500 agaaatagcc actcttaaga aacaagttgc ttatctttta aagaaaggtg aagtaaaaga 1560 gtgtgaaatt aagagtgaga aaaactccag ggaggacacc gttaccggtg actgtcttgt 1620 tgttagttcc aatgtgcctt ctgaatttac aactatgccg agaccctggt tttgttttaa 1680 atgtggagag gatgggcaca ttgcaaaccg ttgtagtaag gaacccaacc ctgaacttgt 1740 tcgcaaaaag aatttacagt tgaggcaaag acgagaaaag tatagagagt tgcacgaaac 1800 atctccgctt tctttaaacc tgtagcagct cctgctgtgg gacacccagg agctgacagc 1860 cattttactc gtcccacact aatggtccat gaagctgtca atcatgctaa tgaaaggcaa 1920 ggaaattatg ggtttagaga tgagttcttg ccacatggtc ttgttggtca tcgttgcatt 1980 gcttctgttt ttgttgaagg tattgaatgt gagtctcttt tagacacagg gttgcaagtc 2040 actactgttt cagaaacatt ttacttgacc catttctctt accttccaat tcaaccaatt 2100 catgctttgt ttgaggttga aggtgcggga gggcagcaca ttccttacct gggctacatc 2160 caaattgtca taacttttcc ttgtactgtc ttgggtgtcg aggtagaact tacttctctt 2220 gtacttgttg ttccagattg tcacttcaac agcagagtcc cactgctagt cggaacaaat 2280 gtccttgacc gcctgtatca gcagggtata gataaaaaag ggtccaagtt ctttcaaaga 2340 gccagtggtg actatgcttt gcttttccaa tatgttgccc aaaattatga gaatggaaat 2400 aagtcatttc aagttagagc atcaggaaaa aaacacatca ctattcctgc caaccaaatg 2460 atctgtatca ctggtcaagt aaagactaga aagaacattt caaagcctct atttgttcta 2520 gagcaacctg agacgttccc cttgccaggt gggctattag ttgaaaatgc tctagtgaat 2580 gtcccattcc atgcttgttc caagatccca gtcactctaa aaaacatgac tgaacatgat 2640 attactcttc atcccaatcg ggtgattgca gacatggcag ctgctcatgt tcttccattg 2700 aaacctgaag tatcatcttg ctccaatcag gtgcccagtg gcaagtcagt gttcaatctt 2760 gatcactcgc caattccaaa agagttgaag gaacgtataa tagacaaact gaattcctta 2820 aaagacgtat ttgcaattgg tgagctgtca tatggtcata ccacggctgt gaagcaccat 2880 atcagactac aggatgagac accttttaag gaacgtccca gaccaattca tccgtgtgac 2940 agagaagccg ttaggcaaca tctatgggaa ctcctggaag ctgaaataat cagagagtct 3000 gaaagtccct ttgcgtctcc aattgttctg gtccgcaaaa aaaaatggac aaattaggct 3060 atgtgttgat tacaggaagc tcaatctgcg cacaataaaa gatgcatatg cccttcctaa 3120 tatagaagag actttttcag cactcagtgg ttctaaatgg ttttctgtca tggaccttaa 3180 atctggatat taccaggttg aagtagccga agaggataag cacaagacag cttttgtttg 3240 tcctcttggc ttctttgaat ttaaccggtt gcctcaaggt gtaacaaatg cacctagcac 3300 cttccagagg cttatggaga agtgtgtagg agacttgcat ttgagagaag tactggtgtt 3360 tctggacgat cttattgtat tttccaaaac acttgaggaa catgaagccc gattgatgaa 3420 agtgttaaat cgtttaaaag aatttgggtt gaagctgtct ccagacaaat gccagttctt 3480 taagtcctct gtgaagtatc tggggcatgt tgtggatgct gatggagttc acacagatcc 3540 agaaaagatt tcagctttaa aagattggcc tcgtccttca acacggcgag agctaaaatg 3600 ttttctaggc ttctctgggt attatcggcg ttttgtcgag ggttactcta aaatagccaa 3660 gccactaaat tgcctaactg caggctattg tcccccaaga aaaagtggaa agatctataa 3720 aagagagaaa ccgaacactc ctgtcaatcc caatgcttct attgctgagg agtggacagt 3780 tgagtgtgaa aatgcattca ggactttgat agacgggcta acctctgcgc cgatccttgc 3840 ctttgcagac ccacaacttc cctatatctt gcacacagat gcttgaagag aggggttggg 3900 tgcggctctt tatcaggagc aggagggcaa gcttcgtgta gtagcctatg ccagtagggg 3960 actttccaaa agtgaaagaa attatcctac acataaatta gagtatttag ccttaaaatg 4020 ggctgtctgt gagaagtttg gagattatct ttatggcacc gaatttacag tactcactga 4080 caataatcct ctgacgtatg tgttaaccac tgcgaaactg gatgctgcag gacatcgctg 4140 gcttgcacaa ttatccacat accgattcaa cataaagtat cgagctgggt ctgtcaataa 4200 ggacgctgac ggattgtcca ggagaccaca gaatccgcct gaggaagatg atgcatttct 4260 tgaagaggag aggtctattg aagctttaaa aaggcgacta tgtgagaaaa cagaggatat 4320 ctccaatgaa acactctctg cactgtctga ccgccatgca gttgtgttgt ccaatgaaac 4380 ttatgaggca ccagggtttg ttcttgcaga gtccttggct ttaaatgcct cgtccattcc 4440 tgatagtttt gccacctcag gctatgagac cattcctgga atgacctgtg aggactggta 4500 ccgtgctcaa agagaggatt catctttgaa aagaataatt tcttttattg aactaggtca 4560 aaaacctaac ttcagacaga ctagactgga accccttgaa gttaagttgc ttctgagaga 4620 gtggaaaagg cttgagctac gagatggagt gttgtatcgg aagtgggcag ctaatggggc 4680 tgtgcttcaa cagatagttc tgcccctaca gtgcagggaa agggccatga agggtgttca 4740 tgatgaaata gggcatcttg gatatgagcg tgcacttgat ctagctcgtt ctagattttt 4800 ttggccaaag atggcaagag atattgagaa caggtgtcat acttgtgaaa ggtgtttcag 4860 aagaaaggca aaccctcaaa aagcagctcc aatggaatcc atacggacaa catttcctct 4920 tgaattagtg tgtatagatt atttatctct cgagccagac aatcatgaca tcaggaatat 4980 attggtcatt actgaccact taactaagtt tgctgtggcg ataccaacaa gggaccagaa 5040 agctaagacc attgccaaga cactctggaa aaacttcatt gtttattatg ggtttccaag 5100 tcgactactt agcgatcaag gaagggtgag tgaaaaaaac aaagcgaggt ttgatctaaa 5160 agtcagagcg gctgaactga tgcctggtga tagggtcttg gtgagaaatt taagtgtgag 5220 gggcaagcac aaactcgccg atagatggga aagggttgta cacacagtag tcaggagagt 5280 ttcaaatggc cctgtttaca cagtgaagcc agaaaagggt aatggtcccc atcgtaccct 5340 tcacagggac ctcttactgc cctgtgggtt cttgccagca agctcagaag atcagatgcc 5400 tagaacagct tcagctgtga aagggaaaca taagatgaga acaagggcta atactgaagt 5460 atgcgaaggt gacgaatgtg aggggcagtc tagtgatgaa gaggaaagtt actttcaaat 5520 acccatccct gaaatactca caagaagtcc cttcattgaa caggtgaatg atgggtcacg 5580 tagctctcat aatgtttctg cgaaattatc tgatgagttg aaccctcatg caattgagtt 5640 tgctccacgt actacagttg ttcccagacc agaagctatt catcccgtaa ctgtaccttc 5700 tagagaagct tctaaaattt cctctaggaa gtacaccagt atttcctgtg ccagaatcta 5760 gtgagattga acatgtccct gattatatag tcattgagat cccagaggag atagtaccag 5820 ttgctgaggt agaatctcca gaattgcccc atggcccagc tgatgctgaa tccgtacccc 5880 tcagacgttc aagtcgagag agacggccac cgaaggtttt agagtatgag gaattaggga 5940 agcccatatt acttgctctc acatcattct ttgattccct aggaaccatt tttgcggcca 6000 gcctagttaa aacgcatgca gggactcatg cggtttagag ggggagag 6048 // ID hAT-N69_DR repbase; DNA; ZEB; 1273 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N69_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1273 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1338-1338 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1273 BP; 435 A; 223 C; 213 G; 400 T; 2 other; taggatgtaa cgattcacct gactcacgat tcgatacgat tcacgatact aatctcacga 60 tacgatttag tcacgatttt ttaacaaaat tatttgaaac aaatttaagg tgaagagccc 120 tttatttttt tcttaaatgc tgcacattaa catattaaca taactaaatt gctattttaa 180 aacaaattgc taataaaaaa taaataaatc aaaaaataaa aakaactatt taatataaac 240 aaactaaaac tttcacagct aaattaagaa tttaaaacaa attctacatt aaaaatgaat 300 aatacaaact aaagaagact cattaatata aacaaactaa aactgtgact gtgctgggat 360 tttacatttt taaaaagaaa aaaagaaata tcagcatatc tatgttttct ggcataagct 420 gaggtctttg cacatttaca atgtcccctg ctgatgaaaa aactctttca ctggggactg 480 agatggctgg tgtggagagg aaagcctttc ctaaaccaga gagcagtgtg tacagaggtg 540 gtcaataggc ccttctgatc caggggactg gccactggca taaaatactg attataaaat 600 tactaattat atagtaggat agagacagga gttgaacatg attatgaagg tgaataatta 660 atattacttg tataattaat tagtataagt atcagtgtga tacacttaag aagagataat 720 cttgaacatt tttaaatatt caataataat aagcaaatta ttaaaatatg cataaactaa 780 tttgaaaagg agagggtaaa aataatctaa tacctgctgt tctgtaacag aacaactttc 840 tctttcagtc tgaaaaacat cttcacactt tcgctatgaa aacactgatg caccgtgtga 900 tctccgtagc actcgtcgga gcaagcggag ctgcaaatca tccggtccgc gtggtttggc 960 ttattaatag ttttttgcgt atagacgtct gtgacagtag tttttcactg tgctgacggt 1020 ttatgtgcat catattcgtt gttctgttag tgtaagtaaa cgtctttttg cagtatttgc 1080 acacagtttg tgttttgtct ggcgcacttc accgctgtca ttttactctg ccgccccacc 1140 tcttgctcgc cttcgaacag aacgcmctct gttcaaaaaa tgtatcgcga ttcgttcaac 1200 atttctaccg atttgaatcg tcacatattt gaatcgattt tcaaccggct cacggtgaat 1260 cgttacatcc cta 1273 // ID Gypsy46-I_DR repbase; DNA; ZEB; 5358 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy46_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy46-I_DR; KW Gypsy46-LTR_DR; Gypsy46_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5358 RA Dib M.R. and Naveira H.F.; RT "Gypsy46_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 823-823 (2007). XX DR [1] (Consensus) XX CC Gypsy46-I_DR is an internal portion of the Gypsy46_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy46-LTR_DR. CC Gypsy46_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1517-aa polyprotein CC Gypsy46_DR1p (pos. 701-5251) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 701..5251 FT /product="Gypsy46-I_DR_1p" FT /note="ORF." FT /translation="MASVVDTFRESPSEELLLSCTKEQLLQIAESFEIEIA FT PRYKTLKETLRNVLKGYLVELGVLEMPAETASLVEEVADPSVSEHAIRLRE FT LALKEKEIELENAKLELRKREIDHQYELKRMELEMSGKSVVSNEFDVGRNR FT SMVPPFCEKDVEKYFCHFERVAASLKWPENVWSLLLQCVLTGKAQEAYASL FT SIEETADYKLVKAAILRAYELVPEAYRQRFRHYAKLSNQSYVEFAREKEML FT FDRWCTSQKAESKEQVRQLILLEEFKNCVPSALSTYLNEQKADTLHKAATL FT ADDFVLTHKITFREKPREKVFAEAPVSPPRKSLRIRSTVPTDERACFYCKN FT SSHLIASCPVLRKKGKAKHVGLISVDSSSNFNCTSESDFNPTKDGYAPFLH FT YGTVSISSMSKPVPVRILRDTGASLSIILEGVLPLSEETATGSAVLVRGFE FT MGVADVPLHKISLQSELVSGDVIVGVRASLPIAGVTFILGNDLAGGSVWSD FT INVSPEVVPVPLAGTDDCALKYPDVFAACVLTRSMIKSAETSDSVDLCDTF FT MVNSDTTDLFSVPLQEKADLFPVSAEKDAIDCADLSLCPEKLIDAQKSDST FT LTSLFELVRPEQENTCAPQMYFLRNGILMRKWVSPKSSEDWNVVNQIVIPV FT DYRDMILDCAHNGSAGHLGILKTYNRILRNFYWPGLKRDVARFCKTCHVCQ FT VVGKPNQTIKPAPLYPIPMVSEPFEHILIDCVGPLPRSKSGFKYLLTIMCA FT TTRFPEAIPLRSVTTRAITKALVKFFTLFGLPKILQSDQGTNFTSRTFSQV FT LKRLGIKHNVSTAYHPESQGALERFHQTLKSMLRTYCFELEGDWEEGLPWL FT LFASREVIQESLGFSPAELVFGYNLRGPLAVLKERWLSDSKSTTVPEYFSQ FT FRTRLYRVRELAKQNLEKSQEKMKTWFDKKARERSFSPGDKVLVLLPVSGG FT SLQARYSGPYEIKRKLSDRDYVIHTPDRRKSLRVCHVNMIKPYYERKSDTE FT KVKNVIQSPVQESLSVGVFTSEPVDEKDVEPSRCIIEGRLKNSEMLENLTS FT KLPHLSQTEKAEVIGLVRSFPMLFSDIPGLTSVIEHDIDVGFTQPIKQHPY FT RVNPLKRSLLQKEVEYMLENHIAESSTSPWSSPCLLVEKSDGTFRFCTDYR FT RVNAVTKSDCYPLPRIDDCVDRVGSATYVSKLDLLKGYWQVGLSERAREIS FT AFVTPDAFLNYRVMAFGMKNAPSTFQRLVNTVLAGVPNCEAYLDDVVLYSS FT TWQEHMSLLRQVFNRLAEAKLTVNLAKCEFGKATIDYLGKVIGNGEVRPVA FT AKVTAICDFPSPKDRKQLRRFLGMVGYYRSFCKNFATVVSPLTDLLSPKIQ FT FEWSDACQCAFENVKSLLISAPVLAAPDYLKPFSLAVDASDVGAGAVLQQR FT GAEVEHPVCYFSKKFTSTQRKYSTIEKEALAMVLAIQHFEVYLSNYHPIVV FT YCDHNPLTFLNTMRNSNQRLMRWSLFLQPYDLDIKHVRGCDNVVADALSRV FT E" XX SQ Sequence 5358 BP; 1437 A; 940 C; 1274 G; 1707 T; 0 other; ttaattggga gctcgtccaa aaagtttaga ttatgttcgt cagctgttca gtgtgggagg 60 agagatgtgt ggccaccccg ttcattttgt gattcgtctg gatatgcttt cgtttgagtg 120 atctcttgca tcccatatgt gagtatttgc cctatttaat ttagccgccg ggtaggttgt 180 ggacctgcca ctttgttatt ttgcctttct tatttttgtg gttattcgga gtggaggtat 240 gcttcagagc acttgctgct gatctatcaa ttgggtcagt atgcagtgtt catgatttca 300 gcatttaaag tcgcctcgag cccctaacgc cactgaagac tagggaaaaa tattgtgtta 360 aatggggtaa atacattagt tttttttagg ggattgcatt tggttacgat tttgtgaaat 420 ttaaattgat aacctttcaa tttcgtgtaa tttgtggact tggtattagg ttacagcctg 480 gctgtgtgtg tgcgcgcgcg gctgtgtgtg gatgcggctg tatgtgtgtg attaagttgg 540 aaaaattaaa tttcgttgat atttttgctt aatcaggtga gttgatgctc tttcgtttag 600 attatttgat ggtttaaata tttgtaaacg tttggtttaa aatatttatc gttaaaacca 660 ttatttaatt gttttgtgag cataactgtg tacgtcggag atggcgtctg tagtggacac 720 atttcgtgag tctccttctg aagaactgct tttatcgtgc acaaaagagc agttattgca 780 aattgcagag tcttttgaaa ttgaaattgc tccaagatat aagactttga aggaaactct 840 acgaaatgtg ttaaaaggtt atctggtgga attaggcgtt ttagaaatgc ctgcagagac 900 tgctagtctt gtggaggaag tggctgatcc ttcagtaagt gaacacgcta ttcgattgag 960 agaattagca ttgaaagaaa aagaaattga gttagaaaat gctaaattag agttgcgaaa 1020 gcgtgaaatc gatcatcaat atgaactaaa gcgtatggaa ttagagatga gcggaaaaag 1080 tgtagtctct aatgaatttg atgttggtcg aaatcggagt atggttcctc ctttttgtga 1140 aaaggatgtt gagaaatatt tttgtcattt tgagagagtc gctgcttctt taaagtggcc 1200 ggaaaatgtt tggtcgctgc tattgcagtg cgtgttaacc ggtaaagcac aagaagcata 1260 cgcttcgttg tctattgaag aaacggctga ctataagctg gtgaaagctg ctatattgag 1320 agcatacgag ttagttccgg aggcatatcg acagcgtttt cggcattatg caaagctttc 1380 aaatcagtct tatgtcgagt ttgcacgaga gaaagagatg ttgtttgatc ggtggtgcac 1440 gtctcaaaaa gctgagagta aagaacaagt gagacaactt attcttttgg aggagtttaa 1500 aaattgtgta ccatctgctc tttctacgta ccttaatgaa cagaaagctg atacactcca 1560 caaagctgcc actttggcag atgattttgt tttaacgcac aaaatcactt ttagagagaa 1620 accgcgagag aaagtttttg ctgaagcacc tgtttcacca cctcgtaaat cactccgtat 1680 taggtcaact gttccgactg atgaaagggc ttgtttttat tgcaaaaatt ctagtcattt 1740 aattgcaagc tgtccagtgt tgaggaagaa agggaaagcg aaacatgttg ggttaatttc 1800 tgttgactcg agttcaaatt ttaattgtac gtccgagtcc gattttaatc ccactaaaga 1860 tggttatgcg cctttcttgc actatggaac tgtgtccatt tccagcatga gtaagcctgt 1920 gcctgtacgc attttacggg atactggagc aagtctttcc ataattttgg aaggagtttt 1980 gccattatca gaggaaactg ccactggtag cgcggtatta gtgcgcggtt ttgagatggg 2040 agttgctgat gttccgttgc ataaaatcag tctacagtct gaattagtga gcggtgacgt 2100 gatcgtcggt gttcgtgctt ctcttcctat tgctggtgta acattcattc taggcaatga 2160 tttagcggga ggtagtgtat ggagcgatat taacgtgtca cctgaagttg ttcctgtacc 2220 actcgctggt actgatgact gtgcgcttaa gtatccggat gtatttgctg cgtgcgtgct 2280 tactcgatct atgattaaat ccgctgagac ttctgatagc gttgatttat gtgacacatt 2340 catggtaaat tctgacacaa ctgatttgtt ctctgtccct ttacaggaaa aagcagatct 2400 gtttcctgtg agtgctgaga aggatgctat agactgtgct gatttgtctt tgtgtcccga 2460 gaagttaatt gacgctcaaa aatctgactc cactcttacc tcgcttttcg aattagttcg 2520 accagagcag gagaatacgt gtgcgcctca gatgtacttt cttagaaacg gcattctgat 2580 gagaaagtgg gtatctccaa aatcttctga agactggaat gtagttaacc aaatcgtaat 2640 tccagtagat taccgagaca tgattttaga ttgtgcacac aatggttcag ctggtcactt 2700 gggtatttta aaaacataca atcggatttt gcgaaacttt tattggcctg gattaaagcg 2760 tgatgttgca cgtttttgta agacttgtca cgtttgtcaa gttgttggaa aacccaacca 2820 aactatcaaa cctgcaccac tttatccaat tccgatggtt tctgagcctt ttgaacacat 2880 tttaatagac tgtgtaggac cgcttccacg ttcaaaatct ggctttaagt atcttttaac 2940 gattatgtgc gcgactacac gtttcccaga ggcaattcct ttgcgttctg taaccactag 3000 agcgatcacc aaagcgttgg tcaagttttt cacgctcttt ggtttgccaa aaattctcca 3060 gagtgatcag ggaacgaatt ttacatctcg aacgttttct caagtactta aacgacttgg 3120 tataaaacat aacgtttcca ctgcgtatca tccagaaagt caaggtgctt tagagaggtt 3180 tcatcaaact ttaaagtcaa tgttgcgtac gtactgtttt gagcttgagg gagattggga 3240 ggaaggactt ccatggttgc ttttcgcttc acgagaggtg attcaagaat cactgggctt 3300 tagccctgca gagctcgtgt ttggctacaa tttgcgtggc cctttggccg tattgaaaga 3360 gaggtggctc tctgattcaa aaagcaccac tgtaccggag tatttttcac agtttcgtac 3420 acgtctatat agagtccgtg aattggcaaa gcaaaacctt gaaaaatcac aagaaaagat 3480 gaaaacttgg ttcgacaaga aagcgagaga acgaagtttt agccctggtg acaaagtttt 3540 agtgttattg ccagtatctg gtgggtcgtt gcaagctcgt tatagtggcc cttacgagat 3600 caaaaggaaa ctgtcagatc gagactatgt tatccataca cctgatcgtc gaaaaagtct 3660 gagagtttgt catgtaaaca tgattaaacc ctattatgag agaaagtcgg atactgagaa 3720 agttaaaaac gtcatacagt cacctgttca agagtctcta tctgtgggtg tgtttacatc 3780 tgaaccagtc gatgaaaaag atgtagaacc atctagatgt atcattgagg gtaggttgaa 3840 aaactctgag atgctcgaaa atttaactag caaactacca cacttgagtc aaactgagaa 3900 agctgaagtc attgggttag tgagaagttt cccaatgttg ttcagtgaca ttcctggttt 3960 aacttcagtg attgagcatg atattgatgt tggttttacc cagccgatta aacaacatcc 4020 atatcgtgtt aatccactta aaaggtcgct tctgcagaag gaagttgagt acatgttgga 4080 aaatcacatt gctgagtcta gtactagtcc atggagttca ccgtgtctgc tggtagagaa 4140 atctgatggc acatttagat tttgtacaga ttatcgtcgt gtaaatgccg taactaaaag 4200 tgactgctat cctttgccaa gaatagatga ttgtgtggat cgtgtaggtt ctgctacata 4260 tgtttcgaaa ctcgatctat taaagggata ttggcaagtg gggttaagtg aacgtgctag 4320 agaaatatca gcatttgtca ccccggatgc ttttcttaat tatagggtga tggcattcgg 4380 gatgaaaaac gctccaagca cttttcaacg gctcgttaac accgtgttgg ccggtgttcc 4440 aaattgcgaa gcgtacctgg atgatgtagt gttgtatagt tcgacatggc aagagcacat 4500 gagtttgctt cgacaggttt ttaatcgatt ggcagaagct aaacttaccg ttaacttggc 4560 caaatgtgaa tttggtaaag ccactattga ttacctggga aaagttattg gtaatggtga 4620 agtgcgccca gttgctgcta aggtgactgc aatttgtgat tttccctctc cgaaggaccg 4680 taagcaattg cgtcgttttc ttggtatggt aggttactac cgttcatttt gtaagaactt 4740 tgctacggtt gtttctccgc taaccgatct gttgagtcca aagatccaat ttgagtggtc 4800 cgatgcatgc caatgtgcat ttgagaatgt aaagtctttg ttgatttcag ctccagtgtt 4860 ggctgcacca gattatttga agccattcag tcttgcggtg gatgcaagcg atgttggtgc 4920 gggagctgtc cttcagcaaa gaggtgctga agtagaacac ccagtatgtt acttttcaaa 4980 gaagtttact tctacacaac ggaagtattc taccattgaa aaggaggctt tagcaatggt 5040 tctagccatc cagcattttg aagtttattt gtccaattat cacccgattg ttgtatattg 5100 tgaccacaat cctcttacat tcctaaacac catgcgtaac tctaatcagc gtttgatgcg 5160 ctggagttta tttttgcagc cctacgacct tgacattaag catgtccgag gttgtgacaa 5220 cgtggtggct gatgcactgt cccgggtgga ataagtgggt aaggtgcatg aaatggtgtt 5280 cgggagaagg tgtttgaaaa tgtttttttt tgacttagtt tgtgttttgg tgatgaaaac 5340 acttttgggg gatgaagg 5358 // ID DIRS-5_DR repbase; DNA; ZEB; 6895 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-5_DR. XX NM DIRS-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6895 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(10), 1272-1272 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(320..2326,2271..3593) FT /product="DIRS-5_DR_1p" FT /translation="MERDDTPAPLGTPAQGQQAPTGNQTNTEAPIRGRRPI FT RSTVSRTHRRTQSPSPITPNRNLPSPSSSYASARSSSLISNKMTVAELRQT FT ITNAGISIPNRCNKAELLKLYETIPSPTPPPQDSRPTRSRHTPYPQPTSAQ FT QATNHPGPPKKATRKTNKKLPQATGQSAPVTITFQNTDNPQENHATPGLPT FT PLLWPPAPLSSGNSIPALPDISPSLNPPHSILPSNLPHSSTQFFPTQTFPT FT IHNAVPLPTNFPSSTTSFFPSTSLHQAPTVITNPPQQSTLRTNISSARPPF FT TLSTATPLPIPQNAPVLEPPPISNAIRNLILSGADIDLSTLLSPIAPPSAE FT RQVDCGEFTITLKPPVSSQTRTLSIAEFHVAFARYTETICSVFPHRRRELN FT DYMAIISELALSYGGTHFYTYHKLFSAKCAIRVTQWNQCTYWGALDTDLHN FT RVFLGCRNLSCAVCRSNLHPTTSCPFVIPSADKELQTPKSTSYVPRPSTST FT IPSLLPPPSSQNPPSQICQSFNIARCFRHPCKFLHTCSYCGGAHARVVCQV FT LKANKKHRNYLSTPVDISNLYSELCLHPDPNFSEFLISGLSNGFHPGVSTL FT PSYNLACPNLQSANAEPDVVDHLIKKEIDNKFMIGPFLAPPFSTYRISPIG FT VATRKFSEEKTTNNRPVFSSQPENFLRKKRLIIDLSSPHNSAFSSINSLIS FT PDEFSLNYHDIDQAISLIKLVGRNAWLAKVDITSAFKIMPLHPDFWHLFGI FT NWKSQFYFAVRLTFGCRSSPKIFDMLSEAICWILANNYGIPHVVHLLDDFL FT IISPPHTPPAQHLATTKAVFARLGIPLAEEKTAGPSTRLEFLGINLDSQKF FT EASLPKEKIDRIISLSSIFLEKHECSKRELLSILGHLNFAMRIIPQGRPFI FT THLLQLAASVHSLEDNITLSDPCRNELSLWISFLKCWNGCSFLYSDLIASP FT VDIQLYTDAAPSVGFGGFYQGRWFASDWPSQMLETPLPQYSSALFELYPIV FT AAALLWGDEWSASSILIHCDNEAVVHCINRGRSHSPALMPLLRRLIWTAAK FT KQFIITAIHVPGFHNQIADALSRFLFQRFRQLAPEAEQHPTPIPPYSEMIF FT Q*" FT CDS join(3602..4663,4694..5233) FT /product="DIRS-5_DR_2p" FT /translation="MHELHQTSISLILQAVAPRTLQAYLTAWKTFKHFHFT FT YNTIFPDFSLLTISSFITYLHFHKNMQANSIKSYLSGIQFFHKLMYGSISE FT SIANSQVSLLIKGIQKARPPTPDARLPITHNILSKCISTLRKGYTSFHTDR FT TLDAMFILAFFGFLRCSEFTVTSKFDPSIHPTIADLTLIDEETISFLIKQS FT KTDQSRKGHCIYIFNIPSSTSPFQTLLAYIHYRKSLSNSPLAPLFIDDTHN FT PVTRFWFQKHLKATLHHSGFPSESYSSHSFRIGAATTAAHKGLTQQHIQTL FT GRWSSDAFKSYIRLSHSHLKEAQRTLTSRNANPSGQGHRHNPGTSQDPAMP FT APQGRKSYPIMRFPFFQLELHSASLPRHSPYYSRRSSCPSPLLPLRPPRPL FT PQKSPLPPPHSRTSVGPRPLALNPYNVTQSFDSGRNHISLAPALAFIYYIS FT FLIMLYNIVLFYLFSLFDLFYINVYSYAPTHININLYIVPPPSCSVPAGTL FT PEHLFRPSPSSRSLHLPIISPDSYWSWPQISSPNPEL*" XX SQ Sequence 6895 BP; 1832 A; 2196 C; 1072 G; 1794 T; 1 other; aatgaagttt cataaactaa tttcgagagg agcacgtgat atgattgact gcagccggcc 60 actcatctaa gctcattagc tagccaatcg gaacgatcca aacccactat aaataaccta 120 gctaaaatgt acacccctat cttcgttttc cgaagaagca cagaaggacg gacacagctc 180 ctcttcaaat cctcaaatct tcatctacca taccatcgcc tgcgacatta acttcaacct 240 actacaaatc aacaacgaca gcaacaacaa aactcaaaaa attcaacgga cacgccatca 300 tcaactaaac aacaacatca tggaaagaga cgatacaccc gctcccctag gaacaccagc 360 acaaggacaa caagccccaa ccggaaacca aacaaacacc gaagctccca taagaggcag 420 aagacccatc cgctctacag tttcaaggac ccatcgccgt acacagtctc catcccccat 480 aaccccaaac cgcaacttac cttctccatc ttcatcatac gcatctgcaa gatcctcatc 540 cctcatctca aataaaatga cagtcgccga actccgtcag accattacaa acgccggtat 600 atccatcccc aaccgctgca acaaagctga actgttgaaa ctctacgaaa ccatcccttc 660 accaactcct ccccctcagg acagcagacc aactcgatcc cggcacaccc cctatcctca 720 accgacttct gcacagcaag caactaacca ccccggacca cccaagaaag caaccaggaa 780 aacaaataaa aagctacctc aagctactgg acagtctgca cccgttacca ttacctttca 840 aaacacagac aatccacagg aaaatcacgc cactccagga cttcccactc cccttctctg 900 gcctccagct ccactttcca gcggaaactc cattcctgct cttccagaca tctctccctc 960 tctcaaccct cctcattcta ttctcccttc taaccttccc cattcttcaa ctcagttttt 1020 tcccactcaa acttttccta cgatccataa cgccgttcct cttcctacta atttcccctc 1080 ttctactacc tcctttttcc cctctacatc cctccaccaa gcacccactg tcattactaa 1140 ccctccccaa cagtctactc ttcgtactaa catctcttcc gcacggcccc ccttcactct 1200 aagcaccgcc acaccccttc ccattccgca aaatgctcca gtcctggaac cacccccgat 1260 ctccaatgcc atcagaaacc tcatcttatc aggtgccgac atagaccttt caacactcct 1320 ttcacccata gcacctccct cggcagagcg acaggtggat tgcggcgaat tcactattac 1380 ccttaaacca ccagtcagtt cacaaactcg cacactctcc attgccgaat ttcacgtagc 1440 cttcgcacga tacacagaaa ccatctgctc agtttttccc cataggaggc gcgagctgaa 1500 tgactatatg gccatcatct cagagctcgc gctctcctat gggggaacac atttctacac 1560 atatcataaa ttattctcag ctaaatgcgc aattcgcgtc actcagtgga atcagtgtac 1620 ttattggggg gctttggaca ctgatctcca caacagagta ttcttaggat gtcgcaatct 1680 atcctgcgcg gtctgccgct ctaaccttca cccgaccact tcctgtccct tcgtaattcc 1740 ctccgccgat aaagaactac aaaccccaaa atccaccagc tacgtacctc gcccttctac 1800 ttccactatc ccctctctac ttcctcctcc ctcctctcaa aaccctcctt ctcaaatctg 1860 tcaaagcttt aatatcgcta gatgctttcg ccacccgtgc aaattcctgc acacttgtag 1920 ctactgcggc ggcgcacacg ctcgtgtcgt ctgccaagta ctaaaagcaa ataaaaaaca 1980 tagaaattac ttgtcgactc ctgttgatat ttctaatctg tattctgaat tatgcttgca 2040 ccctgatcct aatttttctg aatttctcat ttcaggtctg tctaatggat tccaccctgg 2100 tgtttcgacc cttccttcct ataacctcgc atgtcctaat ctccaatccg ctaacgccga 2160 accagatgtg gtggatcatc taatcaagaa agagatcgat aataaattta tgatcggtcc 2220 ctttcttgcc cccccgttta gcacctatcg gattagtcca atcggcgtag caaccagaaa 2280 attttctgag gaaaaaacga ctaataatcg acctgtcttc tcctcataat tctgcctttt 2340 caagcattaa tagtttaatt tcacccgatg aattctcatt gaactaccat gacatagacc 2400 aagcaatttc tctaattaaa ctcgtcggcc gtaacgcttg gctcgctaaa gttgacatta 2460 cgtcagcttt taaaattatg ccgttacacc ctgatttctg gcacctcttt ggcatcaatt 2520 ggaaatccca attttatttc gcagtccgtc ttacgttcgg ctgcagaagc agccccaaaa 2580 ttttcgacat gctttcagaa gctatatgtt ggatcctcgc taataattac ggaatcccgc 2640 acgtagtcca cctccttgat gatttcctca tcatctctcc cccccatacc ccacctgctc 2700 aacacctagc gactactaaa gcagttttcg ctaggctggg tatccccctt gcagaagaaa 2760 aaaccgctgg acccagcact cgcttagaat ttctaggcat taatttggac tcccaaaaat 2820 ttgaagcttc gctgcccaaa gagaaaattg atcgaatcat ttctctatct tccatatttt 2880 tggagaaaca tgaatgttct aaacgcgaac tgctatcaat attaggacat cttaatttcg 2940 ccatgcgtat cattcctcag ggacgcccgt ttatcactca cctcctacaa ctcgcagctt 3000 ccgtccacag cttagaagat aacataacgt tatccgaccc ctgccgcaat gaactcagcc 3060 tgtggatttc cttccttaag tgctggaacg gctgctcatt cctgtatagc gatctaattg 3120 catcccccgt agacatccag ctatacacgg acgcagctcc ctcggtagga ttcggtggtt 3180 tctaccaagg ccgctggttc gcctctgatt ggccctctca aatgctggaa actcctctac 3240 ctcaatattc gtctgcttta ttcgaattat accccatagt agccgctgcc ttattatggg 3300 gagacgaatg gtctgcctct agcattctca ttcactgtga caacgaagcc gttgtgcact 3360 gcattaacag agggcgctct cactctcccg ctttaatgcc gcttctccgt cgccttattt 3420 ggaccgcagc caaaaaacaa tttatcataa ctgctataca tgtgcccggt tttcataacc 3480 aaattgctga cgctctttct cgctttcttt tccagagatt cagacaacta gcgccggagg 3540 cagagcagca cccgactccc atccctcctt attcagagat gatattccaa taaatcatcc 3600 aatgcatgag ctgcaccaaa catccatatc cctcattctg caggctgtgg ctccaaggac 3660 cttacaagca tatctcactg catggaaaac attcaaacac tttcatttca catacaacac 3720 catattccca gatttctccc tgcttacaat aagctcattt attacatacc ttcattttca 3780 taaaaacatg caggcaaact ccattaagag ctatttaagt ggtattcagt tttttcacaa 3840 actcatgtac ggctccattt ctgaatccat tgccaactct caagtcagcc ttcttattaa 3900 aggcatacag aaagcacgcc cccccacccc agatgccaga ttgcccatca cacataacat 3960 actctccaaa tgcatttcca cgctcaggaa aggctacaca tcttttcata cagaccgcac 4020 actagatgca atgtttattc ttgccttttt cggatttctc agatgttctg aatttacagt 4080 aacatcaaaa tttgatcctt ctatycaccc cactatagct gatctgaccc tgattgatga 4140 ggagacaatt tctttcctca tcaaacaaag caaaacagat caatcaagaa aaggacattg 4200 catctacata tttaacattc cctcctccac aagccccttc caaacactcc tagcttatat 4260 acactatagg aaatcactaa gcaacagtcc cttagccccc ctgttcatag acgacacaca 4320 caacccagtg acacgctttt ggttccaaaa acacctcaaa gctaccctac atcattcagg 4380 cttcccatca gaatcatact ccagccattc attcagaatc ggagccgcca ccacagccgc 4440 acacaaaggg ttaacgcaac aacacataca aacacttgga agatggtctt ccgacgcctt 4500 taaatcttac attaggctga gccacagcca tcttaaggaa gcccagagga ccctcactag 4560 cagaaatgcc aatcccagcg gccaagggca caggcataat ccagggacaa gtcaagaccc 4620 agccatgcca gctccccagg ggcggaagag ctacccaatt atgtaaagga gcaggcacga 4680 cccagcctcc taacggtttc ccttcttcca gcttgagttg cactcagctt ctctacctag 4740 acactcacct tactacagca gacgttcctc ctgcccaagc ccccttctgc cactccgccc 4800 ccccaggccg ctgccacaga agtctccact gcccccaccc cattctagga cttctgtagg 4860 accccgcccc ttggctctga acccgtacaa cgttacacag agctttgatt ccggcaggaa 4920 tcatattagt cttgccccag ccctagcttt tatatattat atatcatttc ttataatgct 4980 atataatata gtcttatttt atttattttc tctgttcgat ttgttttata taaatgtata 5040 ttcatatgca cccacgcata taaatataaa tttatatata gtgccgccac cctcatgctc 5100 agttcccgct ggaacactcc cagagcacct attccgcccg tcaccctcca gtagaagtct 5160 ccatctcccc atcatctctc ccgactccta ctggagttgg ccacaaatta gctcacccaa 5220 ccctgagctg tgacacgagc acgagtcact gaccctcccc ggccctagac ctttattttt 5280 ctttatttct tatttatttt tcacatttat cttttatttt ttaaatttat ttatatatct 5340 atatatctat atgcacccac gcatataaat atatacttat atatagtgct gtcaccttca 5400 agctctaaca ctcgcaagag ttgctcacga gcattcgacc cccgcagggg tcatcgccca 5460 aacgccactc acccttcagc tggggcctcc acccgccctc catcgttccc gtccctagct 5520 ggagatggca cttgcagctc tatctcccgc tggagagccc aaaagagcta gactcccgct 5580 ggagtcaagt aaaatcgccc cccccaaggc cttagttttt ccccattata tatatatata 5640 tatctatata tcacatatac atatatattt atatatagtg ctgtcaccct caagctctaa 5700 ctctagcaag agttgctccc gagcattcga cccccgcagg ggtcatcgcc caaccgccac 5760 tcacccttca gctggggcct ccacccgccc tccatcgttc ccgtccctag ctggagatgg 5820 cacttgcagc tctatctccc gctggagagc ccaaaagagc tagactcccg ctggagtcaa 5880 gtaaaatcgc ccccccaagg ccttagtttt ccccattata tatatatata tatatctata 5940 tatcgcatat acatatatat ttatatatag tgctgtcacc tcacagctct atctccgcaa 6000 ggagtgttcc tcgagcaatt actccttagg agcccctgac ccccccgcag ccctagtaac 6060 cccctcatcc agctggagtc ctcacttttc actcctcatc attttgactc caactggatc 6120 cccgtccatc ccacagctct gacccccgct ggggtttccc ttagtttcta atccagctgg 6180 agtatttata gcccacgcca actcggcacg ggctcccgca agagcccgta tcccccttgg 6240 ctcccatcgg agccccttca ctttcaacca ctatatccag cagccggata tagcatttca 6300 agcctttcgg ggagtttctt cgaatacacg gctgctgtcc cgagtctcat gcatttgggg 6360 agctctcgag aacacctgac ctcgtactcc cctcacatgc tttatggacc tggcgggaac 6420 cctgggctca actatctccg agctcagggt tctctcccgg gacagcatgc caaacctgct 6480 aacttgctaa caagttgtca aacagtatct aagtgtgaac tcttgaaatg aagtttcata 6540 aactaatttc gagaggagca cgtgatatga ttgactgcag ccggccactc atctaagctc 6600 attagctagc caatcggaac gatccaaacc cactataaat aacctagcta aaatgtacac 6660 ccctatcttc gttttccgaa gaaacccccc atccacccct tctcctcctt tcctcttttg 6720 ccgaggggag ctctcgagaa cacctgacct cgtactcccc tcacatgctt tatggacctg 6780 gcgggaaccc tgggctcaac tatctccgag ctcagggttc tctcccggga cagcatgcca 6840 aacctgctaa cttgctaaca agttgtcaaa cagtatctaa gtgtgaactc ttgaa 6895 // ID BEL27-LTR_DR repbase; DNA; ZEB; 509 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL27-I_DR; KW BEL27-LTR_DR; BEL27_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-509 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1400-1400 (2008). XX DR Genome; chr20; Positions 22704506 22705014. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 509 BP; 136 A; 90 C; 121 G; 162 T; 0 other; tgtaaataac cccaagtcat ttaagttagg tttcagtgtt aattatttat gctgctttta 60 attacttctc tatttcttta aaagttgttg tgtttttgtt atttatttga actccgcttt 120 gcactaagtt atttagccta tagagaccct tattttgacg cgcggcttgc cggaagtggt 180 caggtgcgcg agagcgcgca gatcacggaa gcaaggagat cggagaagcg cgcctagcac 240 gagggagaca gcgatcgctt gcacagagaa atgcacggag ctaatgttac atttaaggac 300 tgatttagga gagatttaat gcatatttga gtttgttttt tgttctgttc atcttcattg 360 tttgaacacc tggttggaag ggagaacaag ctcttacggt gttgtacagt ctttaaggac 420 ggcaaatgca acaataaagg cacaaaggag gacagtgcaa catcagtgca tgagtttaat 480 ccttttattt ctccaaccgg gctgttaca 509 // ID DNA-4-4_DR repbase; DNA; ZEB; 680 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-4-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-680 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1714-1714 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. Copies are ~90% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 680 BP; 238 A; 103 C; 103 G; 235 T; 1 other; gggacattct accagaaacg tacattttca cttataaaaa atgtgacagg gcctgacctt 60 taagcttgtc gtcctcaaaa aataaaaaaa caaaaacaaa acaagcataa aatcatacaa 120 ttcaatgata gaacgcatcc ttgatcactg tcagcttctc ctccatcaaa tgatgtcact 180 tcctttgagt catagcctta aaggtgtatt gcacrcattt tatgttaaat ttaatgcaaa 240 tgtgacagga ctgacagaaa catggggaca attgtaaatt tatttgtatt atatatttgt 300 agtatttatt tgtagaattt atttataatt tcatgttttt aatcaattga tgtgaatgat 360 aacaacttaa acttgctatt tctgaagttt tttttttcta aataacagtt tttagcataa 420 caaaactatt aaagctgtag gttcaattgg gctgaggaca aggacaatga cagggtttgt 480 acatttgtaa agtgttttta ataaagaaca aaaaatctga gaaccaaatg aatgacatat 540 tcctttacag aacaactcac agaaatcaac catcagtcca ttttagaaat ttttcatttt 600 tgggatgaaa tactttttat tcactgtatt tatgttttta tttcagggac agataatgta 660 cgtttctggt agaatgtccc 680 // ID Gypsy105-I_Dr repbase; DNA; ZEB; 4282 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.05, Created) DT 28-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy-105_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW gag; reverse transcriptase; integrase; Gypsy-105_DR; KW Gypsy-105-LTR_DR; Gypsy-105-I_DR; Gypsy105-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4282 RA Dib M.R. and Naveira H.F.; RT "Gypsy105_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 946-946 (2009). XX DR [1] (Consensus) XX CC Gypsy105-I_DR is an internal portion of the Gypsy105_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy105-LTR_DR. CC Gypsy105_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 306-aa gag CC Gypsy105_DR1p (pos. 18-935) and 1058-aa polyprotein (pos. CC 842-4246, conceptual translation) composed of the protease, CC reverse transcriptase, and integrase domains. CC Some insertions fairly recent, according to the hight identity CC between their flanking LTRs. CC Consensus obtained after the alignment of at least three CC independent insertions bearing at least 85% Homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 18..935 FT /product="Gypsy105-I_Dr_1p" FT /note="Gag-protein." FT /translation="MDTAGQDSVRSALTQQGALLGQHESRLTSTSREVEYL FT TNQVADLTARLHELQFETLQGAQASHQAPEPRCNNPPTYDGDPTSCRAFLS FT QCSVVFTLQPRTYALEETKVAFVLTLLTGRARDWGTSVWEARAPCVSSFSS FT LRHEMIRLFDRSARGQEAADQLARLRQEGQSVTQYAIKFKTLAASCDWNEG FT ACHAMFRAGLDEDIQDELATHDLPQDFDTLTNLALRIEGRLRRRDHRRFLR FT SSTRREDPGSLSTMVPPSDHEPMQVGRLHLTSHEKQQRLAQGLCLYCGKPG FT HFAAKCPVKAKAHQ" FT CDS 842..4246 FT /product="Gypsy105-I_Dr_2p" FT /note="Polyprotein." FT /translation="ETAAPGSGSVPILWQAGAFCSEVPSKSQSSSVRRGIL FT VGAIPFSSPSSRTILPVVVQYDGISGSCSALIDSGAEGNFMDRSVAAQWGI FT PTIPLAESVSAHSLDGTLIATVSHTTPSVSLIVSGNHREVITLYLLDSPNA FT PIILGHPWLVKHGPHVDWSGDSVLSWSQSCLKSCLGPAPFPVSVSSVLQVE FT SVDLTGVPAEYHDLHLVFSKSRATSLPPHRPYDCAIELLPGTSPPKGRLYS FT LSRPEREAMDKYINESLKAGLIRPSSSPAGAGFFFVKKKDGSLRPCIDYRG FT LNDITIKNRYPLPLMSSAFELLQGAQVFTKLDLRNAYHLVRIREGDEWKSA FT FNTPTGHFEYRVLPFGLTNAPAVFQALVNDVLRDMVNQFVFVYLDDILIFS FT PSLQAHTQHVRQVLQRLLENQLFVKAEKCVFHTQSVSFLGFLISAGEISAD FT PAKVRAVAEWPTPDSRKALQRFLGFANFYRRFIRNFGQIAAPLTALTSSKV FT LFRWGDKAQEAFDKLKSRFISAPVLSIPDPKQQFIVEVDASEVGVGAVLSQ FT RSLQDGKVHPCAFFSHRLTPTERNYDIGNRELLAVRLALGEWRHWLEGAEQ FT PFVVWTDHRNLEYINSAKRLNARQARWSLFFSRFNFTLSYRPGSKNVKPDS FT LSRLFEAPERVVSKDTILPRGVVVASLSWEIESRVLKALRECQPPQRVPAG FT KLFVPSALRSQVIQWGHSSKLVCHPGVRRSLAAIRQRFWWPSMARDVRQFV FT FACSVCAQNKSSNSPPTGLLRPLPIPSRPWSHLALDFVTGLPVSKGNTVIL FT TVVDRFSKAAHFIPLPKLPSAKETAQAVVDHVFRIHGLPVNVVSDRGPQFV FT SRFWKEFCRQIGASTSLSSGFHPQTNGQSERANQDLERTLRCLASQNPSSW FT CQQLSWVEYAHNTLPVSSTGMSPFQCSVGYQPPLFPAQEPDAAVPSALAFV FT QRCRRTWSRAKEALAQASRRTKAAADRHRTPAPRYVSGQRVWLSTKDLPLR FT VASRKLAPRFIGPYRITKVLSPVAVRLKLPPTLGRVHPVFHVSRVKPVLYS FT PLVPTVPTPSPPPPRLVDGAPAYTVRKLLDVRRRGRGFQYLVDWEGYGPEE FT RSWVPARDILDRGLVEEFRRRRGEPPP" XX SQ Sequence 4282 BP; 847 A; 1245 C; 1106 G; 1084 T; 0 other; gtacgatctg accaacgatg gacacagcag gccaagattc tgtcaggtcg gctttaactc 60 agcaaggagc tctattgggt cagcacgagt caagactcac cagcacctcc agagaagtgg 120 agtatctcac caatcaggtg gctgatctca cggcccgtct acacgagctt cagttcgaga 180 ccctgcaggg ggcccaggcg tcccatcagg cccctgaacc acgctgcaac aatcccccta 240 cctatgatgg ggatcctact tcatgccggg cattcctgtc ccagtgctcg gtggtcttca 300 cgctccagcc acgcacctat gccttggagg agactaaagt ggccttcgta cttacccttc 360 tcacgggcag agctcgcgac tggggaacgt ctgtctggga ggccagagca ccatgcgttt 420 cctcttttag tagtctccgt catgaaatga tcagactctt tgaccgatct gccaggggcc 480 aggaggcggc agatcaatta gctcgtcttc gtcaggaagg tcagtctgtc actcagtacg 540 ctataaagtt caagacgttg gcggcctcat gtgattggaa cgagggagcc tgtcatgcca 600 tgttccgggc tggactggat gaggacatcc aggatgaact ggcgacccac gacctacctc 660 aagattttga caccctcaca aacctggcgc ttcgcataga gggccgtctt cgccgtcgag 720 accatcgaag gtttctgcgc tcttcaacca gaagagagga ccccgggtcc ctttccacca 780 tggtacctcc atcggatcat gaacccatgc aagtgggacg tctccatctc acgtctcatg 840 agaaacagca gcgcctggct cagggtctgt gcctatactg tggcaagccg gggcattttg 900 cagcgaagtg cccagtaaaa gccaaagctc atcagtgaga agagggatcc tggtgggcgc 960 tattcctttc tcttccccct catctcgcac catcctccct gttgtggtcc agtatgatgg 1020 catttctggt tcctgttctg ctctcattga ctctggggca gaggggaatt tcatggaccg 1080 ctctgtggcc gcccagtggg gaatcccgac catccctctc gctgaatctg tttctgccca 1140 ttcccttgat ggtactctta ttgccaccgt ctctcacact acccctagtg taagtcttat 1200 tgtatccggc aatcaccgtg aggtcatcac gctgtatctt ttagactcac ccaatgcacc 1260 tataatttta gggcatcctt ggttggtaaa gcatggtcct cacgtggatt ggtctggaga 1320 ttctgtcttg tcttggagtc aatcatgtct taagtcttgt cttggtcctg ccccgtttcc 1380 tgtttctgtg tcttctgtgt tacaggtgga gtcggtggat ctgaccgggg ttccggcgga 1440 gtaccacgac ctgcacctgg tcttcagcaa gtcccgggcc acctccttgc ctccgcatcg 1500 accctacgac tgtgccattg agctcctccc aggcacttct ccgcctaagg gacgtttgta 1560 ttctctgtcc cgtcctgaaa gagaggccat ggacaagtat attaatgaat cccttaaagc 1620 cggcctcatc cgcccctcct catccccagc gggagctggg ttcttcttcg tcaagaagaa 1680 ggacggctcc ctccgtccct gtattgacta tcgagggctg aatgacatca ccattaagaa 1740 caggtacccc ttacctctta tgtcctcagc ctttgaactc ttacagggag cccaagtttt 1800 caccaagttg gacctccgta atgcttatca tctggttcgc attcgggagg gggatgagtg 1860 gaagtcagcc ttcaacacac ccacgggaca ctttgaatat cgggtccttc cttttggtct 1920 gaccaacgcc ccggctgtct tccaggccct ggtcaatgac gtgttgagag acatggtaaa 1980 ccagtttgtc ttcgtgtacc tggatgatat cctaatattc tctccctctc tgcaggcaca 2040 cactcagcat gttcgccagg tgctgcaacg actcttagaa aaccaactgt ttgttaaggc 2100 ggagaagtgc gtcttccaca cccagtcagt ttcgttcctg gggtttttga tttcggcggg 2160 tgagatcagt gccgatcctg caaaggtaag ggctgttgcc gagtggccaa ctccagactc 2220 tcgcaaggca ctgcagcggt tcctgggatt cgccaacttt tatcggcgtt tcatcaggaa 2280 ctttggccag attgctgcac ccctaacagc actgacctcc tccaaggtac tgtttaggtg 2340 gggggataag gctcaggagg cctttgataa actcaagtcc cgttttatct ctgctcctgt 2400 cttgtctatt ccagatccta aacagcagtt tatagtcgag gttgatgctt ctgaggtcgg 2460 agtaggcgca gtcctatcac aacgatctct tcaggatggg aaagtacacc cctgtgcttt 2520 cttctcccac cgtttgaccc ccacggaacg aaactatgac ataggcaatc gggagttgtt 2580 ggcagtcagg ctcgctttgg gtgagtggcg ccattggctg gagggcgcag agcagccttt 2640 tgtggtctgg acagaccaca gaaaccttga atacataaac tctgccaaaa ggctcaatgc 2700 aagacaagcc cgctggtctc tcttcttcag ccggttcaat ttcacactct cgtacaggcc 2760 tgggtccaag aacgttaaac ccgactctct ctctcgcctg tttgaggccc cagagagggt 2820 ggtttcaaag gacaccatcc tgcccagggg tgtggtggtg gcctccctct cttgggaaat 2880 tgagagtcgg gtcctgaagg ccttacgaga gtgccaacct cctcagagag ttccagcggg 2940 caaattattt gtcccttctg ctctgcgttc acaggtcatc cagtggggtc actcatccaa 3000 gttagtctgt catccaggag ttcggagatc gctggcagcc atccggcagc gattctggtg 3060 gccatccatg gccagggatg tcaggcagtt tgtgtttgcc tgctcagttt gcgcccagaa 3120 caagagttct aactcgcctc ccactggtct gctccgcccc cttcccatcc cttcccgccc 3180 ctggtcacac cttgcccttg attttgttac tgggttacca gtatcgaagg ggaacactgt 3240 catccttaca gtggtggatc gattttccaa ggcggcccac ttcattcctc ttcccaaatt 3300 gccctcagcc aaggagactg ctcaggcggt ggtggaccac gttttccgga ttcatggcct 3360 tccggtcaac gttgtttctg ataggggtcc ccagtttgtc tcccggttct ggaaggaatt 3420 ctgtcgacag atcggggcct ctacgagtct gtcatcggga tttcatcctc agaccaatgg 3480 gcagtctgag cgggcaaacc aggatctgga acgcactctc cgctgcctgg catcccagaa 3540 tccgagctcc tggtgtcagc aactgtcctg ggtagagtat gctcataata ctctaccagt 3600 ctcctccaca ggtatgtcac catttcagtg ctctgttggt tatcaaccac ctcttttccc 3660 cgcacaggaa cccgatgctg cggtcccgtc tgccttggct tttgtccaaa ggtgtcgacg 3720 tacctggagt agagccaagg aagccttggc ccaggctagt agacggacca aagcagcggc 3780 tgaccgtcat cggactcccg ctccccgcta tgttagtggt caaagagtat ggctgtctac 3840 caaggacctg cctctcaggg tagcctcacg taaattggct cccaggttca ttggcccata 3900 ccggatcacc aaggtgttga gtcctgtggc agttcggctc aagctccctc ctacgcttgg 3960 tcgggtacac ccagtctttc atgtttctag ggttaaacct gttttgtact ccccccttgt 4020 tcccactgtc ccgacccctt ccccccctcc cccccgtcta gtggatggtg ctccagcata 4080 tactgtcaga aagttactgg acgtcagacg caggggtcgt ggtttccaat acctagtgga 4140 ctgggagggc tatggcccgg aggagaggag ctgggtcccg gcccgggata tcttggaccg 4200 ggggttggtc gaggaatttc gtcggagacg aggtgagccc cctccctagg gcgcctggtg 4260 gcgtccgtgg ggaggggggt ac 4282 // ID DNA-4-3_DR repbase; DNA; ZEB; 1421 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-4-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1421 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1232-1232 (2008). XX DR [1] (Consensus) XX CC TIR is 22-bp long; TSD is AATT specific. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1421 BP; 459 A; 274 C; 294 G; 392 T; 2 other; gacccttcgc taagccccgc cctccttagt tactgttgct acgcctgtca agctttcgtg 60 cctggcacgt ctattacaat atgtatgcgc cggtgtcaga cattgccagg gatataatta 120 gtcatttttg gcgaacaggt gagatatctg agaaagccag acaaaaaagg actgcagtat 180 gcattacagg ggtatattca cgacataata ggcaaccgat tagaaaataa tttaatcaaa 240 attgaagcta ggttagccta gccgcctcat acgctgacca ttcaacagct tagttcatgc 300 acagcaggag aggtgaaatt taaaaataat ctaataataa caaattaaac cgtgatttct 360 cttttttata gccaaaaagc ctgataaatt wattgttgtg caatgaaata tagcgttact 420 aaccgacgag gaaacacgca tggacagtgc ttctacataa ttcattcata gaaagaacag 480 ccagaaaggg gaaattgatg gatttgtgcc gaatattagc atcattgacc cataacaatg 540 tacagtacct ataactgtca gtatgtttgt gtgcttttta tacagtttct attctaattt 600 gcagttaagt ggaaaaaata aataaattga aatgcaaatc aaagtataaa tagcagaagt 660 gaacaaatag attttcccta ccagcaaata ttaatccgac accaaatata aaagcagaca 720 ctaacccgct ataacatcgt caccaatgac taaatctcca aaagactgac gaaaacatcg 780 gtgaattgta gctctgacat ggacatgagg gttataaatg actgaaaaac agctgcgggt 840 gaagtatatt gatcgcaagt gctcagattg tgatcacatc tcggttttgt tctgttgata 900 tgtaatttca aatattcgta gcttgaaaca gatggaaata tgattgctat tagtatgact 960 actatggcga gggcttaaac ggagcagtgc tcataatatc gaggagaaaa aaaaaaaaga 1020 aaaagacagc tgggaaacat aacctgcttt gtatttttgt aggaaacaca gtttagatct 1080 ktttagggta agaggtgtgt gagttattgc cgtctatcac tgaatgtgtc aggaatatcg 1140 aaaacaaaac caacttaact ccgctctttt agcgcccctc acagagcttg atccacgctg 1200 gcgtttctcc ccttgggttt taggttttga aaagggaaaa aaaattccac caccccgtcc 1260 aaacttttag gataccgggt atcagatttg cacaatccat atgcacaacg ctttggcttg 1320 tttccctcgc tcgaaagctt gacagacctc ctgcgcgtag ctacggttgc taagccacga 1380 ttggtgtgtg gcggtttttg ggtgtggctt agcgaagggt c 1421 // ID Gypsy161-LTR_DR repbase; DNA; ZEB; 380 BP. XX AC chr7; XX DT 13-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy161_DR; KW Gypsy161-I_DR; Gypsy161-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-380 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1603-1603 (2008). XX DR Genome; chr7; Positions 36132701 36132322. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 380 BP; 88 A; 125 C; 72 G; 95 T; 0 other; tgtagcgagc gattacatgc cacgtcgccc tggtcgctaa tttcacccag ctggaccatc 60 aagccaggat cgctcacagc tggacggcat caagcggaga gacatataag cccaacctac 120 gccaggagaa gatgagcttc attctttcta tgactccctg tgctaatgct tgtgtctctg 180 tctctccgta gcagactcca gctcgtgacg atccttcacc cgactactca ccgtccttca 240 cctacttccc ggagcaccag agcacctcac ctcaaagaag agcacctttc acacaaccta 300 cactttgtaa ataaagcacc cttttgactt gtctgtaaac tcatcctgtt gtgccgctgt 360 tttccctctg cctcgctaca 380 // ID Gypsy139-I_DR repbase; DNA; ZEB; 4231 BP. XX AC chr15; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy139-I_DR; KW Gypsy139-LTR_DR; Gypsy139_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4231 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1539-1539 (2008). XX DR Genome; chr15; Positions 31060996 31065226. XX CC Positions [2937-3359] - Reverse transcriptase CC Positions [1833-2309] - Integrase core CC 'GTAAC' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 157..1818 FT /product="Gypsy139-I_DR_1p" FT /translation="MEDIMRRLSEITTRQQLFTEQLSVRQQRIEERLFQMA FT EHLPLPEARASAHRHLTKLGDLDDIDAYLHTFEVIAKREEWPESEWAQILA FT PFLTGESQRAYFSLDSPQNEDYTVLKGEILARVGLSPVRAAQQFATWAYEE FT GAPVRAQAAQLSKLARLWLLGGSPTATQVAEKVVIERLLRALPRRLRGPVS FT MRNPASLAALVEAVELAEATLTRDIIERAAVPSRRVNSPWRQVESASRPVS FT RPAVPSPADEPMPTEPVNSTARAWLAGCIVHRTLPSGAPSRRVKLEGKTIT FT AVLDTGSSVTLVQPGLIKPRVGSKATIPITCVHGDTRYVPAQRVTIAAGNG FT AWPLEVGIVADLPVPLLLGRDWPGFEELLSLPAATPFQTRRRPRARTQRVR FT QPALLATESDRGGECSNQSSNVFMDLFQQVSRGGSFGQAQREDDTLRNCWS FT QVRVVEGQERLPAPHPLPHFIIQNGLLYCVAERREERRTLLVVPKSKTSTI FT LELAHTHPMAGHLGAANTIQRIRDRFHWPGLNGEVKRYCQACPTCQKTAPQ FT RPPPPVP" FT CDS 1767..4229 FT /product="Gypsy139-I_DR_2p" FT /translation="MPDVPENCSPTSPPPSPLIPLPIIEVPFDRIGLDLIG FT PLPKSSRGHEHILVILDYATRYPEAIPLRKATSNAIAKELFLLCSRVGIPS FT EILTDQGTPFMSRLMADLCHLLKVKQLRTSVYHPQTDGLVERFNQTLKRML FT RRVVAEDGRDWDLMIPYVLFGIREVPQASTGFTPFELLFGRQPRGLLDVAR FT QAWEQEPAPQRSLIEHVQDMRQRIERVMPLVRQHLTEAQHAQRRLYDRPAQ FT AREFQPGDQVLVLVPTATSKFLASWKGPYVVVEKVGPVNYRVRQPGRRREE FT QSYHINLLKRWVSPSSQLVTFAESPHPTVHLGEQLSAIQKTELETLVSQFK FT DVFSEHPGRTTVIQHEIKTTPGVIVQQRPYRVSEARRLAIEEEVQKMLKLG FT VIEPSRSPWSSPIVMVPKPDGTLRFCNDFRKLNEASSFDGYPMPRVDELLD FT RLGSARFISTLDLTKGYWQVPLAPGAKEKTAFTTPSGHWHYRVLPFGLHGA FT PATFQRMMDILLRPHQSYAAAYLDDVVVHLMCWEEHLTPLRRVLLELRRAG FT LTANPKKCHLGLAEAKYLGYHIGRGLIQPQQAKVEALQKTPRPTNKSQVRA FT FLGLAGYYRCFIPNFSSVASPLTDLTKKGQPERVTWNPVAEHAFQTLKKAL FT ASSPILHAPDFNCPFILQTDASDSGLGAVLSQLHEGEEHPVVYVSRKLTTA FT ESKYAAVEKEALAIKWAVLELKYYLLGRSFTLVTDHAPLQWMASAKNTNAR FT VTRWFLALQDFHFKVQHRAGAAHGNADGLSRMWSGWTGLAKHSPSHTPLTA FT PFFHNRMTTQTRKRQRGGG" XX SQ Sequence 4231 BP; 985 A; 1245 C; 1125 G; 876 T; 0 other; actggtggag aatgcgggca gattgagtgg ataaaacccc acatctggta acattgaatt 60 atttattttt tatttatttt tttttttagg cgagcggctg ctctctctct ctattccctc 120 aaaacggtgt ggaggaatac acccctctgt gacaaaatgg aggatatcat gcgtcgtctc 180 tctgagataa ccacacgaca acaactgttc accgaacagc tctccgtcag acaacagaga 240 atcgaagagc gcctgttcca gatggctgaa cacctcccac tcccagaggc acgcgcctcc 300 gctcaccgcc atttgacaaa gctcggtgac ctggatgata ttgatgcata tctgcatacg 360 tttgaggtca tcgcaaagag agaagagtgg cccgaatccg agtgggctca aatcctggcc 420 cccttcttaa ccggagagtc acagcgggcg tacttctcgt tggattcccc ccaaaatgaa 480 gattatactg tactaaaagg agagatcctg gcacgtgtgg ggttatcgcc ggtgagagca 540 gcgcaacagt tcgctacatg ggcgtatgag gagggtgccc cagtacgggc gcaggcagct 600 caactcagca agcttgcccg cttatggctc ctcgggggat ccccaaccgc cacccaggtt 660 gccgagaaag tcgtcattga gagattgctg cgtgctcttc ctagaagact ccgggggcct 720 gtaagcatga ggaaccctgc aagtttggca gctttagtgg aggcggtgga gttggcagag 780 gccacactga cccgggacat tatagagaga gcagccgttc cctcccggag ggtgaattca 840 ccatggcggc aggtggagag tgcctcacgg cccgtcagca gaccggctgt cccgagtccg 900 gcggatgagc ccatgcccac tgaaccagtg aactccactg cacgtgcctg gttggctggc 960 tgcatcgttc accggaccct accatcaggt gctccgtctc gcagagtgaa gcttgagggg 1020 aaaactatca cggctgtatt ggatactggg agctccgtga cgctggtaca gccaggtttg 1080 atcaagcctc gagtgggatc taaagccaca atcccaataa cctgtgtgca tggtgacaca 1140 cgctacgtcc ccgcacagcg ggtaacgatt gcagccggca acggcgcatg gcccctggaa 1200 gtgggcattg tagctgatct ccccgtcccc cttttgcttg gccgggattg gccagggttt 1260 gaggagttat tgtctttgcc agcagcaacc cccttccaga cacgtcgccg cccccgggct 1320 cggactcaac gggtccgcca accagctctc ctggccacag agagcgaccg agggggtgag 1380 tgttcgaatc agtcttctaa tgtctttatg gatttgttcc aacaggtctc taggggaggt 1440 tccttcggac aagcccagcg tgaagatgac acacttagaa actgctggtc ccaagttcga 1500 gtagtggagg gccaggaaag gctccctgct cctcaccccc tcccacactt tatcatccag 1560 aatggtctgc tgtactgtgt tgctgagcga cgggaagaga ggaggacatt gctggtggta 1620 ccaaaaagca aaacgtccac catcctggag ctcgcccaca ctcacccaat ggcgggtcac 1680 ttaggggcag ccaacacgat tcagcgcatt cgggaccgat tccactggcc ggggctaaat 1740 ggtgaggtca agagatactg ccaggcatgc ccgacgtgcc agaaaactgc tccccaacgt 1800 cccccccccc cagtcccctg atcccgttac ccatcattga ggtacccttc gaccgcattg 1860 gtctggacct gatagggccc ttgccaaaat cctcccgagg ccacgaacac atcctggtca 1920 tcctagacta tgccacccga taccctgagg ccatacccct acgcaaggcc acctccaacg 1980 ccattgccaa agaactcttc ctgctgtgta gccgggtggg catcccctca gagatcctga 2040 ctgaccaggg tactccattt atgtctcggc tgatggcaga cctctgccac ctcctgaagg 2100 tgaaacagct gcgaacatcc gtttaccacc ctcagacgga tgggctcgtt gagcggttta 2160 accaaaccct aaaacggatg cttcgcaggg ttgtggcaga agatggtcgt gactgggacc 2220 tcatgatccc atacgtactg ttcggcattc gtgaggtgcc ccaagcctct actgggttca 2280 ctccttttga gctactgttc ggccgacaac cccgcggcct cctggatgtg gcacgacagg 2340 cctgggaaca ggagcccgca ccccagcggt ccctcatcga acacgtacaa gacatgaggc 2400 agcgaattga acgggtaatg cccctggtac gacagcatct cactgaagcc cagcacgccc 2460 aacgacgtct ctatgacagg cctgcccaag ctcgagagtt ccaacccgga gaccaggtcc 2520 tggtgctagt gcccacagca acttccaagt ttctggcttc ctggaaaggg ccctatgtag 2580 ttgtcgagaa ggtcgggcca gtcaactatc gtgtccgtca gccgggacga cgtcgagagg 2640 agcagagtta ccatattaac ttactgaaac ggtgggtttc cccttctagc cagctggtaa 2700 cctttgcaga aagcccacat cctacggtac atctaggcga acagctctca gccattcaga 2760 agacggagct tgaaaccctg gtcagtcagt ttaaggatgt gttcagtgag caccctgggc 2820 gcaccactgt cattcagcac gagatcaaaa caaccccagg agtcattgtg caacagcgac 2880 cctatcgggt ttcagaagct cgccggctgg caatagagga agaggtacag aagatgctga 2940 agttgggggt catcgagcca tctcggagcc cgtggtccag ccccattgtc atggtcccaa 3000 aaccggatgg caccctccga ttttgtaatg acttccggaa gctaaatgaa gcatccagct 3060 tcgacgggta cccaatgcct cgggtggacg aactcctgga cagactcggt agtgccaggt 3120 tcatatccac cctcgacttg accaaaggtt actggcaagt tccgctagcc ccaggggcaa 3180 aagagaaaac cgcctttacc actcctagcg gccattggca ttaccgggtc cttccctttg 3240 ggctgcatgg ggcaccggca acgttccaac gaatgatgga catattgctc cggccccacc 3300 aaagttatgc cgcggcctac cttgacgacg tagtagtcca cttgatgtgt tgggaggaac 3360 atcttacccc tctgcggagg gtgctcctag agttacgacg ggctgggctc acagctaacc 3420 ccaagaaatg tcatctcggg cttgcagagg caaagtacct gggatatcac attggtcggg 3480 ggttgatcca accgcagcaa gccaaagtag aggctctaca gaagactccc cggcctacca 3540 ataagtccca ggtacgtgcc tttttggggt tagcgggtta ttaccgatgt ttcatcccca 3600 acttctcttc cgtagccagc ccattgacag acctgacaaa gaagggacag ccagaaagag 3660 tcacctggaa ccctgtggct gaacatgcat tccagaccct taagaaggcc ctggcctcct 3720 cacccattct tcatgcccct gacttcaact gccccttcat cctgcagact gatgcttccg 3780 actctggact gggtgcggtg ttgtcacaac tacacgaggg agaggagcac ccagtcgtct 3840 acgtgagtcg gaagctcacc acagcggaat ccaagtacgc agccgttgag aaagaggccc 3900 tggcaataaa gtgggcagtc ctggagctga aatactacct cctgggaagg tccttcaccc 3960 tggtgactga ccatgcgccc cttcaatgga tggcctcggc aaagaataca aatgcccgcg 4020 tcaccaggtg gttcctagcg ctccaagact tccactttaa agtccaacat cgagccgggg 4080 ccgcccatgg gaatgcagac ggcctctcaa ggatgtggtc tggatggaca ggtctggcaa 4140 aacattctcc ctcccacaca cccctaaccg ctcccttctt tcacaacagg atgacaactc 4200 agaccaggaa gagacaaagg ggggggggga a 4231 // ID Gypsy144-LTR_DR repbase; DNA; ZEB; 1938 BP. XX AC chr6; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy144-I_DR; KW Gypsy144-LTR_DR; Gypsy144_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1938 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1550-1550 (2008). XX DR Genome; chr6; Positions 43200726 43198789. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1938 BP; 618 A; 240 C; 376 G; 704 T; 0 other; tgtaacccag gttaccaaca gaaacaggat tagaattaat taattaaata tgctgaactt 60 atagtttttc tctttctgtt ttgcaaatta cctttatagc caattagaat acattaaaat 120 atttttttac ctgtaaaaac gatctaaaag agtttagaat atcatgtgac gtctttccgg 180 ttccgatcac tgacgcgggt agatttcgcg tgcactcgca tcggcggagc agacgagagc 240 agacgcagat ttagattctc tctctacatt aagttgtatt gaactggaaa ataagagatt 300 ttttgattta atgttgaatt atatgtgtct aaagacaata tagaaggaag tgatgctaga 360 agaaaccgtt tatgtgagtt aaatatagtt taattagtta atgatcgtga cggagctgat 420 tacaagcagc ctggttaatg tgtaactagc tccattagga gatcgtctga aaggaaacgc 480 gattaaagtt aaatgctgaa aaagagaata agcaaaagga ttgagtgaag aatttcttcg 540 cattaaaaag attaaaagtt catctggttt tggtgagtta acttttgttt catttattag 600 ttaataaatg taaatgtatt agaaatagtg ctcattgtaa gtcaaatgta agtgaatatg 660 tattataagt tcatttaatt cttaatgata ttataagaaa atgtgaatat ggttttatat 720 gtgaaatgta attcctattt tgatgctgta atagatacca catgtgtgtt atgaaattta 780 tattaatgat gattttgtga ccttgttttc aaggtcaggt tttttttgtt gttgtttgtt 840 tgttttattt ttcctccttt acctggattg gtgccatcgt tgtgctcctg tgtgatctgt 900 tgatttgatg ctggattggg attggagctt attggattct ggattcgttc taactcctgt 960 gttcgctaaa ggggttggga ctggcgagcc atacgaaccg aacttcaaat tgtcattgtg 1020 tgaaactgat acttgttgcc gatcaggacg gtaggaaact gaactacaag cacacacagt 1080 cgttcaatga tttcttttgg gaaaaggatt cactcacaga ggagaaactg agtgacagac 1140 atttcgttgt tgctccttga gatattttat attcactaaa aaggtgaata tacacatact 1200 gactgttttt tgcaggaaaa gaaaagactt ttaagaaact gaatttattg aaagaaaaga 1260 gggaactgtg aagtgattac aagtgaactg ttttataggt tcgttatttc atttattgtt 1320 gatataccaa gtggtaatag gtttgggttt gtttattagt ttttttttta aacgcacaga 1380 ttagagaagt agaatacata cacaaaattg agaaattaat tcagaaaata acttctttaa 1440 tattttattt aatcaagatt gaattcttta cttacaaggg aaatttgttt tactgataaa 1500 gagtgaagtt tattaatttt tttctgagaa attgaaaaga gttgaaagtg aaacctgtga 1560 aaacatagga ttattttagt tatattacat caacagtatt gagtaaaggt ggttatactt 1620 acctaaaata tacttacctg ttgagtaagt tctggatttg aaacctgaaa tcaaaagaga 1680 aacatttgtg aaacattatt ttattatttg atattgaata tttttgtacc ttgtgacaat 1740 tttatttttg tttcttatga tttatttgtt gataaattga gatttaacac ttgaatacaa 1800 gacaatacaa tcagttaaaa accttattcg ttgtgtgagt ggttcattat tgttgtgaca 1860 ccctggttct ctgcagacga atctaggctt ctgatttcac tggtccatat tcaaagcctt 1920 tctcaggaaa gtgttaca 1938 // ID IS4EU-1_DR repbase; DNA; ZEB; 5004 BP. XX AC . XX DT 29-APR-2007 (Rel. 12.04, Created) DT 01-MAY-2007 (Rel. 12.04, Last updated, Version 1) XX DE A family of autonomous IS4EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; IS4EU; KW Interspersed repeat; IS4EU-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5004 RA Kapitonov V.V. and Jurka J.; RT "IS4EU, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(4), 147-147 (2007). XX DR [1] (Consensus) XX CC DNA transposons from the IS4EU superfamily are characterized by CC the TA target site duplications. These transposons are wide CC spread in metazoans, including fish, frogs, lancelet, sea CC urchins, sea squirts, insects and cnidarians. Autonomous IS4EU CC transposons encode two proteins: the IS4EU-TR transposase, which CC is similar to the IS4-like bacterial transposases, and the CC ISEU-EX DNA exonuclease. Based on the conservation of both CC proteins in highly divergent transposons, is is clear that they CC are necessary for transpositions. CC IS4EU-1_DR is a consensus sequence of a very young family of CC autonomous IS4EU transposons that were active in the zebrafish CC genome in a last few million years. The IS4EU-1_DR transposon is CC characterized by 11-bp imperfect terminal inverted repeats, TA CC target site duplications, and it encodes two proteins: (i) the CC 464-aa transposase, IS4EU-1_DR1p, composed of the THAP CC DNA-binding domain and catalytic "DDE" domain, which is conserved CC in all IS4EU transposases, and (ii) the 505-aa IS4EU-1_DR2p CC exonuclease. Questions and comments send to Vladimir Kapitonov. XX FH Key Location/Qualifiers FT CDS join(108..441,1034..2091) FT /product="IS4EU-1_DR1p" FT /note="IS4EU-TR transposase." FT /translation="MSSIACSVRGCHNNWIKRRNQLQQQCYEHRVTRSECC FT GAPYDLHPPPKDDEHLRQWLKALNLKQPPKRPYVCSYHFVDGKPTDRHPYP FT EKWLGYEAPVKKPRRVLERLYDSDPSVAASNAVDSCEDEPVCMPLNCDAET FT QWEDLCVSEHSYTKSQLNLKPPTRDMQTQCNESQPLYITLLRKNDLCQLYT FT GLTLDAFHSVAEHLTNAYSNSFQLHPWDQLLMTLMKLRLNLLQGDLAERFA FT VSQSIVSKVISCWIDIMEENMRDYVPWLPKETIQATMPQCFREQFPNTTCI FT IDCSETPLQKPHNLDSRGESYSHYYGQNTIKYLVSIAPCGLIMFISPAYGG FT RCSDKFITANSGFLEYLRPGDEVMADRGFTISDLLYEKKVKLVIPAFTKKG FT MQLSEEDTTNTRRIANVRVHVERVICRLKTFKIISQTVPINLTPKIDKILR FT ICAALCNLRSDIISDVEDE" FT CDS join(4892..4449,3780..2710) FT /product="IS4EU-1_DR2p" FT /note="IS4EU-EX lambda-like exonuclease." FT /translation="MAFNVSSGNVLSWEYGETSLQRSDGLILPHPSSPLMW FT VDDMKKWPEVTYGDIFNYFVLSQGVDGASMKNYKSTEAYQYLHSGKVGRVL FT LHSEGDFVFLKANVHPSQSSSPCHSAWVLLSKQGSVETTGCSCIAGLGRSC FT SHAASILWKVENAVSSGLTGIACTDEQRLWNMGTQRNLCLKRLMDIKFSHH FT QATDVLRMPEGMPKNAPLPPTPLFSTQKDLREGLKHLRLPASSLLHKCVSA FT VQGEESEQPASSPHANHDGTKNCQRCMLFYDSLVAIDPSKCAALERVTLEQ FT SSSHLWHDSRKLRITASTAKKVPIRGNPQTFIREHLYPRFHGNAATNHGLE FT SEASAIQWLESSGFTVSHRGTVVCGSEPWLSASPDGVLNLHELLEIKSPLL FT KCDESLEDLFRSQRYDVRMVDGIPQLQPNGPRGFYLQVQLGMFCTGLRSCK FT LLVWVPSQQVLLQVPYNEQFCSKTVARLKTFYFKYMLPQVTDEFQAGRLLL FT STRYLQLCK" XX SQ Sequence 5004 BP; 1557 A; 1047 C; 910 G; 1490 T; 0 other; ttgaccttat tttccacgac aacaagggtg gcgccattac gctcattttg tcatctcact 60 tccggtttcg aaggatctaa cgttagtcgt aagcttctac agcaagaatg tcaagcattg 120 cgtgttcagt tcggggctgc cataacaatt ggataaagag gagaaaccaa ttacaacaac 180 agtgttatga acacagagtg acccggtcgg agtgttgtgg tgcaccttac gatttacatc 240 caccacctaa agatgatgag cacctgaggc aatggctgaa ggcgctcaat ttaaaacagc 300 cacccaagcg cccttacgtg tgttcttacc atttcgttga cggaaagcca actgatagac 360 acccttaccc cgagaaatgg ctcggctacg aagctccggt aaaaaagcca cgccgggtgc 420 tggaaaggtt gtacgattca ggtaagccaa cttgtgtagt gattgctcgc ctaacgtcat 480 atttctgaaa tgcattctat aaacacgttt aagctcagca atgatgagtt aaaagtaatg 540 ttacaagaag tcttgtatga ttacaattgg ccacatgatg ttatatttca tatttttgaa 600 tgcatttcat agtgaaggta atacaggaat catagacgtt agcaggccag ctaacttacc 660 aatctagcac agacatttgg gctgtgacta cacatctatt aagctttcag ctcgaatata 720 ccaacatcta aaaatcatta aacatatatg tttaaaaaaa actacttatt ataaacattt 780 aaaagaataa ctttaaataa atttaaggaa tgaaattaaa atacattttg tagtgttcct 840 ttaattttac tttacatatt cctaaagtcc attttctcaa actacatttc ttattcagat 900 aataataccg acctgtatta ctattgaatt gactaataca cacaatgttc tattggtgca 960 ataatgtatt aattgtcatt atgtatatta tgtactaatg taaaaatgat tatgattaca 1020 tttatttcaa cagatccatc agtggctgcc agtaatgcag tggacagctg tgaagatgaa 1080 cctgtgtgca tgcccctaaa ttgtgatgca gaaacacaat gggaggatct atgcgtctct 1140 gagcacagct acacaaagtc acagctcaac ctgaagccac cgacaaggga tatgcaaacc 1200 cagtgcaatg agtcacagcc gctgtatata acgctactaa gaaaaaatga cctttgtcag 1260 ctctacacag ggttgacatt agatgcattt cattcagttg cagagcacct tacaaatgca 1320 tacagcaaca gcttccagct tcatccatgg gatcagctcc taatgactct gatgaagttg 1380 cgtcttaatt tactgcaggg cgatcttgct gaaaggtttg ctgtttctca atccatcgta 1440 agcaaggtga tttcatgttg gattgacatc atggaggaga acatgaggga ttacgtacca 1500 tggcttccca aggaaacgat ccaagcaaca atgcctcagt gcttcagaga acagtttcca 1560 aatacaacct gcattattga ctgttctgag acaccacttc aaaagccaca caaccttgac 1620 tcaagaggcg agtcctacag tcactactat gggcagaaca caataaaata cctggtctcc 1680 attgcaccat gtggactgat tatgttcatc tcacctgcat atggaggaag gtgcagtgat 1740 aaattcataa ctgcaaactc gggtttcctt gaataccttc gtccaggaga tgaagtaatg 1800 gcagatagag gcttcacgat tagtgatctc ctctatgaaa aaaaagtaaa actggttatt 1860 ccagcattca ccaagaaagg catgcaactc tcggaggaag acaccaccaa cactaggcgc 1920 attgccaatg tacgtgtcca tgttgagcgg gtaatctgta gattgaaaac cttcaaaatc 1980 atctctcaaa cagttccaat caaccttacc cctaaaattg acaaaatttt gagaatttgt 2040 gcagccctgt gtaatttacg tagcgacata atttctgatg tagaggatga ataaatgttt 2100 atttgtaggc gcattgtatt tgtcattaaa ttttgttgtt ttgtctttaa cagttgtgta 2160 aataaaaaaa ttaagcaggt attttaataa acacttacaa tgatgagatg ttacatatta 2220 acatgttagc tgtaacaaat actattgaat tccatggtta cagatttcct gcattcttta 2280 aagtatcttc atttgtgttc aacagaacaa agtaagtgta acaggtttga aacaaataaa 2340 ggcagagtaa attgacaatt ttaattttga actgtaattt ttaaactgtc cctttaagtt 2400 tgactttaaa aaataagaca tcaagtttgc aaaatatttt ttattacata gtaaaattaa 2460 atggttaaat cataccttta caagaaggtc atcatgtgta tctataatcc tggtaaaaaa 2520 caaacttaca acttaaacat ttaaactgat tttacgcaat ggtttacata acacatttta 2580 tgataataat acacaaatta tcacaatctt gcttatcaaa actgatataa aacaactaat 2640 ataaaatggt tcattttaaa tgattgtctt caccttctac accccagctt atttaaaatg 2700 caatccttat ttgcaaagct gcaagtatct tgtggaaagc aacagacggc ctgcttgaaa 2760 ctcatctgtc acctgaggga gcatatactt aaaataaaag gtttttagtc tagctacagt 2820 cttgctacag aactgctcat tgtatggaac ttgtagtaga acctgctgtg atggaaccca 2880 gacgagtagc ttacaggacc tcaaacctgt acaaaacatg ccaagctgca cttgcaaata 2940 gaagccccgt ggaccatttg gctgtagctg aggtatgcca tccaccatcc tcacatcata 3000 cctctgactt ctaaagaggt cctccaaaga ctcatcacat ttaaggagag gggatttgat 3060 ctccagcagt tcatgcaagt tcaaaactcc atcagggctt gctgacagcc aaggctcact 3120 tccacacaca actgtccctc tgtgactgac agtaaatcct gaactctcta gccactgtat 3180 tgcagatgct tcactttcta agccatggtt agttgctgca ttcccatgga acctggggta 3240 gaggtgctct ctaataaatg tctgagggtt acctctgatg ggtaccttct ttgctgtgct 3300 tgcagtaatt ctaagcttac gagagtcatg ccacaaatga gaagaactct gctcaagtgt 3360 tacacgctcc agtgctgcac acttgctagg atcaatggcc actaagctgt cataaaacaa 3420 catacacctc tgacagtttt tggtgccatc atggttagca tgtggtgagg acgcaggctg 3480 ttctgattcc tctccctgca cagcactcac acatttgtgc agaaggcttg acgcaggcag 3540 ccgaagatgc ttgagtccct ctctgaggtc tttctgggta ctaaataagg gagtgggagg 3600 aaggggggca tttttgggca ttccctctgg catccgcagc acatcagtgg cttgatggtg 3660 ggaaaacttt atgtccatta accgcttcag gcagaggttc ctctgggtcc ccatattcca 3720 gaggcgctgc tcatctgtgc aggcaatgcc agtgagccca ctggacacag cattttcaac 3780 ctgacaccaa acataattac tgacaaattt gtattatttt gtattaagat atgtcaacat 3840 gttctattat cttatttcaa tataatttat tacaaattac aatttatgtt tacctacctt 3900 gttttttttc aggtgatagc atgattagac tttttacact taatatatgt gtaccaaatc 3960 ttacctttag tatattcaca gtgtgcataa ttactatgca agtacatttt cttatatttt 4020 tatatcttat gtctttcaaa tgtatgtatt tatacatgca catctatcta tctatctttc 4080 tatctctctt tatttatata tatatatata cccatacatc tagcaatctg ggcttttttt 4140 tttaaatctg ccttaataat tttgcacact taaatatagg actttttcac ttccaagttc 4200 gagcctcctg tgcaattata caacacataa tttataaaca ttcacttacc ttacagacct 4260 tacctcataa ctcacaaatt aagagatttg accagatact tagtctgcag atggtaacca 4320 agtcagcagc tgtatgttat gacgcaaata cgttaacata ctattataca ttaattgctt 4380 acttatgtca acacaaacgt gacattacca tacataataa ataataacaa tgagtattat 4440 ttacttacct tccataatat agaggctgcg tggctacagg acctacctaa accagcgatg 4500 cacgaacatc ctgtagtctc cactgatccc tgcttactaa gtagcaccca cgctgagtgg 4560 catggactgc tcgactggct ggggtgcaca tttgctttta aaaagacgaa gtcaccctcg 4620 ctgtgcaata gcacacgacc cactttccca ctatgcaaat attggtaggc ctcggtactt 4680 ttataattct tcatagacgc tccatctact ccttgtgaca ggacaaagta attgaaaata 4740 tccccgtaag ttacctctgg ccactttttc atatcgtcta cccacattaa cggtgaagat 4800 ggatggggca atatgagacc atccgatcgt tgtagtgagg tttcaccgta ctcccagctc 4860 aaaacattcc ctgaactaac attaaatgcc atttttctaa aatctgaaac aataatctga 4920 tgaactgtgt cagcgtaccg gaagttggat gacaaaatgc ggaagagcga agaataaaaa 4980 gtgggcgggg cggaataagg tgaa 5004 // ID TDR20 repbase; DNA; ZEB; 245 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR20. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-245 RA Jurka J. and Drazkiewicz A.; RT "TDR20: Zebrafish non-autonomous DNA transposon."; RL Repbase Reports 2(3), 13-13 (2002). XX DR [1] (Consensus) XX SQ Sequence 245 BP; 77 A; 35 C; 48 G; 85 T; 0 other; tttggtcaca ctttattttg atggtccgtt tgttgaattt aagttacatt gcatctacat 60 gccaactaat tctcattaga ttataagtag actgttaggt tggggttagg gttagtgtaa 120 gttgacatgt acttgcaaag tttcttatag tcagttaaat gtctgttgaa ggagcagtat 180 caacagatat taagcagaca gtctactaat actcaaatgg accatcaaaa taaagtgtta 240 ccaaa 245 // ID DNA-AT-1_DR repbase; DNA; ZEB; 392 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE non-autonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD AT; KW DNA-AT_DR; DNA-AT-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-392 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1243-1243 (2008). XX DR [1] (Consensus) XX CC The element is characterized by the AT TSD (98% of all elements), CC and a ~76 bp TIR. The classification is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 392 BP; 126 A; 71 C; 78 G; 117 T; 0 other; ccgcttgtta agtgtgacgt cacgcgaagc ggcttccggg tccaagcgct ctattcaact 60 gaatggggag actcatgaaa tggtaataat aaacgtttac aaagcgattt aatgctttcg 120 aaaatcacga tcgcagtata tatgtccatg cctaatatcc gatggccaga aagtgattaa 180 ttttttttat aaattgttaa atttttggta tttgttatgc agcaagccca gagattgttg 240 tgtacactat gattttatat aaaattaact ttaatgtgtg ataggaataa aacgtgatca 300 taaacgaatg atttctccac tcaaatgaat ggcggcttgg acccggaaac agtattacat 360 acgtcacaaa cacgtcacca cttaacaagc gg 392 // ID hAT-N80_DR repbase; DNA; ZEB; 812 BP. XX AC . XX DT 27-FEB-2009 (Rel. 14.02, Created) DT 27-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N80_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-812 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(2), 546-546 (2009). XX DR [1] (Consensus) XX SQ Sequence 812 BP; 263 A; 142 C; 146 G; 261 T; 0 other; caagggtgta gaattagcat ggacggaggg gacgtgtccc caccaatatc cacagattat 60 taaaatgtcc ccaccaataa tttaatcgcc tccaaaataa aataatgctt tgtggaaatt 120 ttccagtcag gcttactgtg caagactcag gtgagacatg gacataagca gctacatttc 180 aaagaaaagt gtcagtcact tgaagtcaag ttcgctacaa gtccaccata atagtaacca 240 tggatgtgca atttaataaa aaatttgttt tcgattttgg cctccacgac gattatgaaa 300 aaacaacaat cgagataaaa tggattattg tgtcatatgc ctcccccttt ccagaagtct 360 gcattcattc cttgtcaaag taaaatcatg taaatttact tttgcaccat aagacatgct 420 tgatttaatg ttgtttttat tactaattgt atttgattta tttatttaag tgtttttttt 480 aaagtgcgaa atagaacatg tgctgttctg tttgcacgct cagagaagga aaagaacatg 540 gacatgtaaa gtcatctttt agcttaaagt gcataaatgg caatattata cttgcaaaaa 600 tgtgttcaga tgcggtggta agtgattatg cacaaataag ccctcagtaa acggaagtct 660 ttattaaact ttgtaaacct ttttgtcaga cattccttta actttacatt ttaaatgtga 720 agtcactaca tttaaatttt acaatttttc gggggtgggg gttggtgtca aattaggtcc 780 ccaccaatat caaaagcaaa tctacgccct tg 812 // ID TE-X-2_DR repbase; DNA; ZEB; 428 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous transposable element from zebrafish - a consensus. XX KW Transposable Element; Nonautonomous; TE-X-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-428 RA Bao W. and Jurka J.; RT "Transposable elements from zebrafish."; RL Repbase Reports 8(11), 1810-1810 (2008). XX DR [1] (Consensus) XX CC TSD is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 428 BP; 124 A; 61 C; 147 G; 96 T; 0 other; ttcaccagca tggaccacgc cacactttga gctagacttt attgtaggga tgtagggatg 60 aaggggatgg taagagaaca gtgggatgaa ggggaaggga agtaagagga taaacagtga 120 acaggtaggt aggttgatcg ggcgtcctac gatgatgccc agagactcag agtgggggta 180 ccgtctctgt aatattttgg tagagtgatt ggacccccct gattcaacct aggagtgaag 240 agagggttat aggattgtga atgggaagca aggaaataga gggagggagg gtgggttcga 300 gagaacgaga agaacagtgc tagagggctg gtctgcctta aataagtttt agggagtagg 360 tgattggtta aggagacaaa gtgaggcgtg gttccgctgc cgaacctttg ttaacaagtt 420 aactccat 428 // ID Kolobok-N7_DR repbase; DNA; ZEB; 1176 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 02-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE putative nonautonomous Kolobok transposons from zebrafish - a DE consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-N7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1176 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 9(2), 550-550 (2009). XX DR [1] (Consensus) XX SQ Sequence 1176 BP; 338 A; 209 C; 251 G; 377 T; 1 other; aggtgcagta ggtgattgtc ttcagaaaca ttttttgttg tgctggttga aagtctcttc 60 acattccaat agtaatgatt aaagtaaatg atctaaatgt atttatatgt atttttatat 120 tctgggtaag gcataagact aaaaaatgtt catccaatta aaaaattgtc aggccgacaa 180 ttcccataat tctgataagt agcccaaact gtctgtcaac aaatgtagat ttgtacatct 240 gcgcacctgt tcacgcagat ctgccgtttg cgcgtgcacg cgcgccgcgt tcacgtgcac 300 acgagagagt gacagcggta aaaacaaatg ctgaatcaaa actttttaaa atcctgaatc 360 aatattggag ttacttttgc acgctggagg aaggatgaca ccatggctga agtatttctt 420 ttagacaggt aatgttatgt tttaaaacta ttttagtcac gcaaagctga tgtagattgt 480 gttgttttat gaatgggtta tatgcacaga agtgttgttc agccactgaa atcttccggc 540 gaaagattga tgtgactatt ttcagtttca taagggacct tttacagcat cgataatgta 600 gatttttatt tagtttaaca acaaaacatc agtaaatagc gctattccgc ctagcctgct 660 ttactgaggt gaagttggtt ttatattcac attggttcgt tcatttttga acaatgacaa 720 cctgtgatgc gctccctata ttgtttacca tgctactctg aatattcggt ctgaaatagc 780 atgtaagtat ggcttagtaa taagtgtaat tcctgcaccc cgccggccaa ccactaagca 840 tacagtagtc gctttaggac aaagcgtgtg agagagtgag accagcgatc cttgcatgca 900 tgaaatattg cacattatga caagttttgc ttgctacaca actgaaaacg tgctagatat 960 aatacagttt tttcrttcag aattgtagta ttataaagta ctataaagtt ttctaactgg 1020 cgaatgacat gatctagtgt agtgggtggt ctctgtcatc tttaatgtgc gcgttcgtga 1080 tttcaggagg cgtggctttg gacggcaggg gagggactgt gtttcaaaga tattatgcta 1140 accggttagc atttaggcag atcacctact gcacct 1176 // ID hAT-4_DR repbase; DNA; ZEB; 3176 BP. XX AC . XX DT 04-MAY-2006 (Rel. 11.05, Created) DT 04-MAY-2006 (Rel. 11.05, Last updated, Version 1) XX DE hAT-4_DR is a autonomous DNA transposon - a consensus sequence. XX KW hAT; DNA transposon; Transposable Element; Interspersed repeat; KW autonomous; transposase; DNA8-9_DR; hAT-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3176 RA Kapitonov V.V. and Jurka J.; RT "hAT-4_DR, a family of autonomous hAT DNA transposons from RT zebrafish."; RL Repbase Reports 6(5), 243-243 (2006). XX DR [1] (Consensus) XX CC hAT-4_DR is a young family of autonomous DNA transposons that CC belongs to the hAT superfamily. The genome harbors several copies CC of hAT-4_DR that are over 95% identical to each other. DNA8-9_DR CC is a nonautonomous transposon derived from hAT-4_DR. The CC consensus sequence encodes a 642-aa transposase (putatively by CC three exons). XX FH Key Location/Qualifiers FT CDS join(355..1827,1917..2028,2169..2509) FT /product="hAT-4_DRp" FT /translation="MSDAETIYSPLGFKSKVWQHYGFYKKDGRLDKTDAIC FT KMCRASVKYTGSTTNLISHLKRRHGVVVDSVEASSSVPASSASCSDSPVAN FT SSKSGEKSIESFFHAPLANSSARSTAITDAIALFICKDIQPYSVTENEGFK FT HLLHVLEPRYKIPNRKLFSDKQIPALYDKVRREIEESLHKAQRVAVTVDGW FT TSCATASYVTVTAHYIDDEWVLKNHVLQTRVFNEAHTGNNLAVLLQDVCRE FT WKIEEKSPALVTDNARNMILAGAGAKMDPHVRCIAHTLNLASQKAFKVDSV FT SELVVKVRKLVSFFHKSPKATDLLREIQTQLHLPNHKLIHDVSTRWNSTLD FT MLERFWEQQPAVLNTLLSRKIKRREVMASLTEDDMTLIPEIIKLMSPLKMA FT TTLLSEEKNPTISIISPIQAKLQRQFQPDEGDLEVISHMKDRFRQDFDGRY FT TYLEGTLNCASALDPRFKDLAFLDDNDAKDMVFMKITTEVVQMNGEEEVNV FT TIPNEDQKAGTLEEDQTAETESHKSPSKEDKTDDAPPKKKTAMDQMFGDFL FT SARPPAKTIREKAKDEILKYRERDTLGLDGDVLQWWRLQVDLPLLSALAKR FT YLSIPATSVPAERVFSTAGDIVTAQRSLLHPDHVDQLIFLKKNL" XX SQ Sequence 3176 BP; 989 A; 626 C; 677 G; 884 T; 0 other; tagggatggg aagataaacc gatatgtatc gatacgcggt catgcgcgtg cacgatgcga 60 gtgcatcggt agagcagcag aggatgaatg aaatattaaa agcaaatcga gatgcatcgg 120 tttttgcgag atgcatcgat ttttccaaga tacaacttat ctttttaata tagaatgtat 180 tttttattta ttaacaagtg ttgtcaaact gtttttagcg cataatttaa tcgacctaca 240 taaagtatgc cttagcaacg gatttgcgga tgtgtacact acactacaac tcaggtcagg 300 tgtgcggatt aagctagtgg tgcgccctca gaaagcgcga gaagcaaaat aaacatgtcg 360 gacgccgaga caatttattc cccactgggt tttaaatcta aagtatggca acattatgga 420 ttttacaaga aagatggacg acttgacaag acagatgcaa tttgcaaaat gtgccgcgca 480 tctgtaaaat acacgggcag tacgacgaat ctcatttctc acttgaagcg gcgccacggt 540 gttgttgtgg attctgtgga agcatcttcc agtgttcctg catcctctgc ttcttgctct 600 gattcacctg tagctaacag ctctaaaagt ggtgagaaaa gcattgagag ttttttccat 660 gccccgcttg ctaacagttc cgcgcgctct acggcaatca cggatgctat tgcattgttt 720 atatgcaaag atattcagcc ttatagtgtc actgagaacg aaggttttaa acacctcctc 780 catgtcttgg agcctcgtta taagatacca aaccgaaagc tattctccga taaacagatt 840 cctgctctgt acgacaaagt aagaagagag atagaggagt ccctgcacaa agcccaaaga 900 gtcgcggtta cagtggacgg ctggacatcg tgcgctacag catcatatgt aactgtcaca 960 gcacactaca tagatgacga gtgggttctc aaaaaccacg ttctgcagac cagagtattt 1020 aacgaggctc atacagggaa taatttggcc gttttacttc aggatgtgtg ccgcgagtgg 1080 aagatcgaag aaaagagtcc cgcgttagtc acagataacg ccagaaacat gatcctggct 1140 ggtgctggtg ccaagatgga cccacacgtg agatgcattg cccacacatt gaacttggca 1200 tctcaaaaag cctttaaggt ggacagtgtc tctgagctgg tggtgaaagt taggaagctt 1260 gtttcctttt tccacaaaag ccccaaggca actgaccttc tgcgagaaat acagacacag 1320 ttacaccttc ccaaccacaa gctcatccat gacgtttcca ccagatggaa cagtaccctc 1380 gacatgttgg agcgtttctg ggagcagcag cctgctgtac tgaacaccct gctgtccagg 1440 aagataaaga ggcgtgaggt tatggcgagt ttgacagaag atgacatgac actgatccca 1500 gagatcatca aattaatgtc acctctgaaa atggcaacca cactccttag tgaggaaaaa 1560 aatcccacaa tctccattat ctctccaatc caagccaaac tgcagagaca attccaacca 1620 gatgagggtg atctagaggt tatttcccac atgaaagaca gattcaggca ggactttgat 1680 ggccgttata catatcttga aggcactctc aactgtgcct cagcacttga cccacgtttt 1740 aaagatctgg cttttttgga tgacaatgat gccaaggata tggtattcat gaagataaca 1800 accgaggtgg tacagatgaa tggagaggta agaacattta aagtaagatc attattgcag 1860 ttaagaacta ataatatatg catatgcatt aaattccatg tttatgtatt tttaaggaag 1920 aggtcaatgt cacaataccg aatgaagacc agaaagcagg cacattggaa gaggaccaga 1980 cagctgaaac agaaagccac aaaagccctt ccaaagaaga caagacaggt tggtttaaat 2040 ggagaaaatg caacaaatgt acagaagtat gtagtgaaga tgcaatacag taaagtaatt 2100 ttcagattgt aaattaaagg aatattttta gcagaagtga taagcctctc ctctgccctc 2160 ttctgtagat gatgcccctc cgaagaaaaa gactgccatg gaccagatgt ttggagattt 2220 cctgagtgca agaccaccag caaaaaccat cagggagaaa gctaaggatg aaatattaaa 2280 atatagggag agggacactt taggtttgga tggggatgtg ttgcagtggt ggagactgca 2340 agtagatctt ccactactat cagctttagc aaaaagatat ctgtccatcc cagcaacaag 2400 cgtaccagct gaaagagttt tcagcacagc tggagacata gttacagctc agcgtagctt 2460 gcttcaccct gatcatgttg atcagttaat atttctgaag aaaaaccttt aactgaaagt 2520 gactgaccaa tgtatactat tcaaaaatgc cacaaggtgt cctcctgttt actttatgga 2580 atttaattta attttaattt aattttattt tattagcctg ttgtttacag tgacagggat 2640 gtttcaaagc agcataatac tgctagttca cggtgttacg ttttgaatat tcagtggcaa 2700 aatatatgtt tgtaaaactt attttcatag ttgaaaagtt aaatcacaat tcttgttgtg 2760 caatagtaaa cattctttat ttattatttt attatatttt tttaatatat attgcactta 2820 tacattctgt ttatcttgga aatacttaaa taagatgtgc catatgttct aaaatggccc 2880 tctactgtaa accgacactc gggctctgag agaaaagcca gtttgtataa agtcttggtt 2940 ttacttggtt aataaataat caaactcatt caatggcacc gcatcttctt tcacatcatc 3000 acataaactt gatcttatta gttagtgctg aattatcgca ttgtatcgtg atatgggtgt 3060 gaatcgtatc gtgttgcatc gcaatatgtc acaaatgtat cgttaatata tcggatcgta 3120 tactttgtat cgagatgcgt atcggatcgg cattatagct tagatgccca acccta 3176 // ID ERV1-I_DR repbase; DNA; ZEB; 5976 BP. XX AC AL713862; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE ERV1-I_DR is an internal portion of the ERV1_DR endogenous DE retrovirus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1-I_DR; KW LTR retrotransposon; RNase H; class I ERV; endonuclease; env; gag; KW protease. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5976 RA Kapitonov V.V. and Jurka J.; RT "The ERV1_DR endogenous retrovirus from zebrafish."; RL Repbase Reports 2(5), 11-11 (2002). XX DR Genbank; AL713862; Positions 66855 60880. XX CC ERV1-I_DR is an internal portion of the ERV1-LTR_DR endogenous CC retrovirus. It is flanked by 100% identical ERV1-LTR_DR. CC ERV1_DR belongs to the class I of endogenous retroviruses (4-bp CC TSD). XX SQ Sequence 5976 BP; 1868 A; 1190 C; 1220 G; 1698 T; 0 other; gttttggtgc cgctgacccg gatagagact gaaatcatcc acttctcaac aatttgcaac 60 aaaagactga aatccacctg aaattcaact tgaaccttca agatcaagga aacttcactt 120 cagacggaat cttcagctca actgcaattt gaagtcactc agcaatacaa agatcagacc 180 acttcagact gaaccttttc agttcaacca cgatttggtg agtttcactc tataaaagaa 240 ccgcaaaacc agcacatcta gttaccctct gcgccgtcag agaagagtta acaaacagct 300 gtgggtttag ttagctaagt tatcctctaa attttatttt agagaggaat aagtaatcct 360 ctgcattgcc agagaagttt agcatttttc tgctaaaatc tgtatcctct gttttaggca 420 gggaagttta gcctttcgtg ctaaaatctg taacctctgt tttaggcagg gaagtttagc 480 ctttcgtgct aaaatctgta acctctgttt tgacgcaggg aagattaacc atacgtgtta 540 atattttgtt ttgtctgtgt ttagcctttc gtgctaaaat ctgttatcct ctgtaaagca 600 gggaagatta agctaaaatc tgtatcctct gttttaggca gggaagttta gcctttcgtg 660 ctaaaatctg tatcctctgt tttaggcagg gaagtttagc ctttcgtgct aaaaatctgt 720 aacctctgtt ttgacgcagg gaagattaac catacgtgtt aatattttgt tttgtctgtg 780 tttagccttt cgtgctaaaa tctgttatcc tctgtaaagc agggaagatt aaccttacgt 840 gttaatattt tttgttttgt tgtaagtctg ctatcctctg taaagcaggg aagattaacc 900 ttacgtgtta atattttttg ttttgttgta agtctgctat cctctgtaaa gcagggaaga 960 ttaaccttac gtgttaatat tttttgtttt gttgtaagtc tgctatcctc tgtaaagcag 1020 agaagattaa ccttacgtgt taatatttgt tttgtttttg ttttagtgtg aagtactatc 1080 agtttttttt tccctctgca ttgccaggga aggttgactt tgttaacata cagttttttg 1140 tatcagaaaa gtttagtgtt ttgtttttgt agtaattgag aaactcacaa aaatgtttaa 1200 gaagaacgct agattgatcc ccacaaatga aaaaggtggt cttgcaactc cgttgtggtc 1260 agataaggaa tttaaatcat tgttaggagt acagatggac ctaatagtca ctgagaaagt 1320 tagaagagag ctaactcaga aatatgagat ccatccagac aagacttggt ctgtagatga 1380 atgtaaaaaa gtactaggtg ccagtatccg taaaaacaat gtaaaaggca ttatctgctg 1440 tcacagagaa tttgctttat ccttagccac acaacaagct aagcgcaact cagacctaga 1500 ggaacagaac aaacagctcc gtgctagagt aacatctctg acaaagaaac tgaaccgcaa 1560 caaagcttca gagaaagctg atgtagaaga aatcagtgag tctgataatg tttatccaga 1620 cttacaaact ctctttgaaa cagatgtacc tgtagagtca gtgaatgttt gtggggcgaa 1680 aaggaaatcg cagagaattg aggaaattga aagtgtttct cccaatgctt ctgttgtcca 1740 agttcaatca gttaccaaag ccttagggcc taaagacatt gaaagactgt ctcagggttt 1800 accttcagcc cgcataaatt tttctgagtt taggagagca ctaatcagca aaatgcgtct 1860 ttatgacatg tcgttgacag aagttacaca gctattgtct caaattctaa cagaatctga 1920 attcaacagt tttgaatctg ctgtttctga tctgcaacat gcagtaagga tgatttgaga 1980 gagggtgttc tgaaaactct taaaaatatt atggggccca aaatagattg gtctagagtg 2040 actagctgtg ctcaaaagaa agatgagtct gtgaatgaat tcactgagag gttttgtcaa 2100 acagccatag cttacagtgg aattgttgac aactctgaaa gtgttttaca tgaaaaagga 2160 ccactagttc gtatttggtc tgatggcctt gtaacagagt acaggaaagc attacctttt 2220 ctagatctta catggtttga caataccatc agaagtaatc taggcaagtt agctgcatgg 2280 gaaaaagatg ctgatgtcaa agcaacagtg agaatagcag caactacatt tagttcaaaa 2340 aacctagaca ataggtggcc taaaagagaa ggcaaatgca actactgtgg gaaattaaga 2400 cactgggaga aagagtgtag gaaaaagatg caggatagta aaagacatgc tatgcataat 2460 tctgctccct ctcagcctgc ctacaaccct gaagtagttc agccattgtc tacagcagct 2520 ttaggacagc tttctcaggc tcttctaaaa gcacaacaag aacaggaaaa aaactaattg 2580 ttggggctgt gagtagctac ctttcccctg taatccatca caatgacaaa agactgtttg 2640 tgaaaggtag catacaagat aaagagattg attttttgac cctcacagcc ccgaatgttg 2700 ttatacagcg tgcaccaatc acaaacccat cctcatgtat gatgtctgca ttgactgaag 2760 ttgtgttaga ggatgaggga gaaatgactc atgattgtgt aacgcttact tatacaccta 2820 caagtgaagt agcagaaaca cccatagaaa atgcagaatg ggaattgttc gttgatggtt 2880 cagcacaaat cattgatggt aacagaagag caggttatgc ggtaacttcc accactgaag 2940 taatagctac aggccgtctt ccagatcact tttcagctca agctgcagaa ctagtagctt 3000 taactagagc atgcacgcta gcgtcaggat cagttacaaa tatctacact gattccagat 3060 atgcttttgg ggtaattcat gattttggtg tcatttggca aagcagacag tttttaactt 3120 ctgccggatc ccccattaag catgctggat tagtaaaaga cctgatgttt gctatgaaac 3180 ttcctaagaa attggcggtg attaaagtga aagcacatct cacagccaat acaatggaag 3240 ctaaaggtaa tgctcatgct gatgtagctg ctaaacaggc ttgtcattat gctaatgtac 3300 aagtatgttc aggtagtaca gcacagaaag caattctacc tcctgaatca atcattgatc 3360 tgtacaaaga tgttccttta tatgaagcat ggacttggtt agacaaagga gccacagtgg 3420 attcatctgg ctgctggact aaagagggaa aatatgtcgc tcccgaatca ctgttgccat 3480 acttggctca acaaatacac aatttgggtc acagtggtcc agcaaccatg aatcacaggt 3540 tctcaaatca atggtggaat ccaaaattca gaaacatagc cactgagaca gtgaaaagat 3600 gtgttacatg ccagaaaaac aatgacatgc cagcagcaac tacagcagca acacataccc 3660 cagctccacc ggggccattt cgtcatctgc aggtcgatta catatcatta cctccctgta 3720 aaggaaaaac tgatgttttg atagtaattg ataagttctc gagatgggtt gaagcttatc 3780 caacaggacg tgctacagct gcacatactg ctaaatgtct aatcactgat ttcattccca 3840 gatgggggtt accagattcc attgactcag atcagggtac acacttcaca ggacaggtag 3900 tcaaggaagt gtctaaaatg ctgaagatta agtggaatct tcactgcccc tacaggccac 3960 aagcatcagg acaggttgaa cgagctaata ggacaattaa aaccaggcta agcaaaatgc 4020 atcaggaagg agtttcatgg gttgaagcac ttccagcggt actgtgtagt atgagagcat 4080 cacctaacag atcagtagga ctgagccctc atgagattat tactggacgc ccaatgcaga 4140 tgccaggtgt aattgatctt agaaatgctg atgtgcacat cgtctcagat gccctgatag 4200 cttactgtga aaaccttacc aagtctgtac agaatgccag agagagagtt gtgtcgtgtt 4260 ggcagaatcc accggaaggt ggacacacaa tcatcccagg tcagtgggtc atgataaaat 4320 cattcagaaa caagccatta gagcctaagt ggcttggacc acatcaagtg atgttgatta 4380 cagcagctgc agtgttgtgt cagggaagaa aaacttggac acatgtctca cattgcaaag 4440 ttgttccacc acctacaggg ataggatagg acacacagaa cagtggggta cccagggaag 4500 aactgatcct ttagggattg ggggccaccc ctgggcgaag acatcccaat agaatatccc 4560 ccatttatta gtccatccta accttactgt tctttaggtg tcatgggaat taagaaatag 4620 ggaaagaagg aacaactata gcagaattag aatgaacaga ggatacaaaa ctagtgtttt 4680 atttttactt ttttttattt ttaccttttt tctttttttt cttttattta tttattttta 4740 ttttttcctt tttttgcaac aaatactgtt tcgttgcttt tgtttccttc ttattctgtg 4800 cagatacaac ttgttgtcct gagacccata gacaccaacc ttctcaccta caagtccagt 4860 cacaaccttc aagaggccac tgctaaacca accatcacag aagaaaacac aacgcagaga 4920 accacacaca ggaacttcag tcatccatca atatgatcaa gattgccgta ctagtgatca 4980 tcattaacat cgcacaagcc atcgactcac gaaacaacat attcttagag ctgatgaata 5040 tgtcaagaaa tgccttgttt gctggaaaga acatctgcat gccacaccca ccttcagtag 5100 gagcaggaat cccctgggtg gcacaacccg tgtccaactg tgacatgtgt acgttctacc 5160 agctatcaga gaaaatacac acgccacacc aagctctcta catctgcata actccgctcc 5220 caccacaaag gaagaagatg gctgcattcc aggactcact tgtgcacaaa catggtggtc 5280 aagaactctt ggagcaataa ttccttctta tggtgtgatg caagcccttg atcaagtcag 5340 aagcttatca aattctgttc agaaactggc caatgataca gcttttgccc tgggtaatat 5400 cacaaacact cttgtttcac acaagataat gatcttacag aacagagtgg ccttggatta 5460 cattctagca aagcaagggg gagcttgcac catcataggc ccagagtgtt gcaccggatt 5520 aatggatcca accaagaact tgaacaaaat ccaacaagac atacttgatc tttcagtaaa 5580 attgcaccgt atgactgaag acaattcttc atggtttgga aacctgttag gaaaaccctg 5640 gctgtggatc aaggaaatag ctattcttct attgcttttc ctactgttgt actatctatg 5700 catccacagt atcaaatgtt ttactcaaca tttgacccac gcccatgacg aagaaatgac 5760 agttccaacc agaacagaat attatctata agaaatccat ggattttaac ggcatgtttg 5820 ttcctgtgtc aacatgtcat ttgtaaatat agacagtaaa aactgagcaa gtagtatgcg 5880 cctgtaacca taatacgtta caagaatgaa ctttttttta gtgtataaat ttatttattg 5940 tgagagtttt ttttttataa ctctcaaagg gggaac 5976 // ID EnSpm-4N1_DR repbase; DNA; ZEB; 2901 BP. XX AC . XX DT 28-OCT-2008 (Rel. 13.1, Created) DT 28-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE EnSpm-4N1_DR is an nonautonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW Autonomous DNA transposon; EnSpm-4_DR; EnSpm-4N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2901 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(10), 1259-1259 (2008). XX DR [1] (Consensus) XX CC TSD is 2-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2901 BP; 951 A; 452 C; 479 G; 1017 T; 2 other; cacagcaaat tcattggtgt taaatttcaa gtgttgtggt aattcagagt aaagtgttaa 60 aattctagag ttagataaag gtaaaataac cctctcggcg ttagaagctg atttgcctcc 120 ttgtcacaca gagtaacatg tatctacctt tgtagaaaac ccttgtattt gagaaactaa 180 tcagagtgtc tgaaatctcg ccacaccctc attcacctgg cccttaaatt agtcaattag 240 tttagtccac tagacagagg gaatgaccac aaataagtga attcagacac ctgctgttga 300 acacctgctg ttaacaagca caatcactga aggaataaga aattgaagga agaaataaaa 360 ctacagacac agcctcacac caaaccaact gaattaaaac agaggattaa acaactccat 420 taaataatag cataagcagc ttcactgatt acagtttgac ttcatttctg taagaatatt 480 tgacaatgaa cagaggttta tattttttta ttaaaaatat ttcatttgac ctcaccatca 540 tagagatcag aggctgctta tttgagctct tgaagcttta ctcttatatc ctaatgtttc 600 tctgactgtt atagctgtgt ttctaaaact catcagctat agcaagaaag gtcaagagaa 660 actatactgt ttatgcttaa ttgttataga tataatttca ggtgtttaat aagttgattc 720 ataaagaata tgcaaatata gttgtattaa aactgcatgg gacaatacta ctggcagtct 780 gctgctcttc atagaagatt caacaaacaa ttaggatgca attaataatt agagtgtcat 840 gcactaaaca ttcaagctcc agcttgctca ttatgacaca caggcatcaa gaatcaggat 900 atgaacctca accgtggttg ctaaaaaaaa aaaaaagctg catagttcat gccgcaatgc 960 atgctgggtg ccagcatagt acaaagctcc cagcatgcat tgcagcatga ataaattatg 1020 cagctttatg ttcttacaat ctctttcttt tcctttattt tgtgttgttt ttgggaggtg 1080 atatttcagt agtgtttttt ttttttattg atttttattt ttttgtttgt caccatcatt 1140 gagattcaaa gactaattgt tgatgtctgg gtgtttttaa gttctgccat agatcaaaca 1200 ttctcattgt aaatattgtt tttaaacctg ttataaggtc attaattatt ggagcttcag 1260 tggtttcatt tatgtatatt atttatttct cacacataac taatatgata ctgaattaga 1320 aaacaagcag gaaaggatgg ctatgatatc ataaataatc tcttcagact gacactttaa 1380 ttttcttttg gctttccatg ctatatgtat aaatagtgta attaacacta actatagcag 1440 ccaaaataaa agacatttat agtaagaagt tgaagagcct gactaaagca gactctgtcc 1500 tccatcatgg tgacttcaaa tgaacaacag aaatttcatt aagagctttt gttcacatga 1560 cagaaataaa gtcaaaggtc agtaataggt gaagctccat gataagttgt ttaatgtgat 1620 gagttttttt tttaaagatg ttgaaaaata acatttttta ttgtgtaatt tgttttattt 1680 ttattgatta ttattttatt tagagttttt ttttttctca gttgatttca tctttggttt 1740 tggcttgtgt ttttctgtcc ttcagtgatt ctgcttgtta acagttgttc atcaayaatt 1800 ctcaatcctc ctttaattag acacttaatt gtctcattaa cttcatcact gtctgagtgt 1860 tcagccctct gtagtgagga cactagttag gattttgctc tggaatttaa ggcttacgtg 1920 tgttcgaatg aaaaatacag actatgggat ttgtttttaa cagaaatatt ctggaaactg 1980 aatttacgaa tgtaaaatgt tgaaaacagc agattactgg caaccactgc tgccagtatt 2040 ttgacataca tttaacagat attttacaca ataagagttt gctaattaac actctgggtg 2100 ttaaaaattt taacactttc aaaagtgtta ttttaacact tatagtggtt cccatataaa 2160 cactgaggga gtgttaattt taactctaag gtagttaaat ctaccaaatc taaaatttgc 2220 agtgtgagat cgataaaaaa tacagactgt tgaatgtgct tttaacagaa atattctgta 2280 aactgaattt atgaatgtta aaaacagcag attactggca accacagctg ctggtatttt 2340 tccgtaaaat ctaaaaatga aaaaaaaaaa aaagaaaaac agcaaaacgg tttttgtgtc 2400 accattgtgg aggataatag cgtctctgtt caagtccaag atgaactgag agtatttgat 2460 gttatgctgc atcttctttt gtttaaaggt aactgtgtag ttactaatgc agtcaaaatc 2520 tgtttgctaa ttgcaatcac acatgaaaac attattgcat ctaaagactt tacatttttt 2580 tgcactaagc tatgattttg taaactgaac attgtattag ttcatgaaat atggttattt 2640 tttgtaaatt tatacagtta tataaaactt ccatgttttt cacagaaaga ttctggcaac 2700 cacagctgct ggtatttttt cggaaattta acagatattt tacacaataa gagtttgcta 2760 attaacactc tcttggtgtt gacaatgtta acactttcaa aagtgttatt tttaacactt 2820 atagtggttc ccatataaac tctgagggag tgttaatttt aactcyaagg tagttaaatc 2880 tactatctaa aatttgctgt g 2901 // ID HAT1N_DR repbase; DNA; ZEB; 965 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE HAT1N_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HAT1N_DR; HAT1_DR; nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-965 RA Kapitonov V.V. and Jurka J.; RT "HAT1N_DR, a family of nonautonomous hAT-like DNA transposons RT from zebrafish."; RL Repbase Reports 2(5), 21-21 (2002). XX DR [1] (Consensus) XX CC ~1000 copies of HAT1N_DR are expected to populate the CC zebrafish genome. HATN1_DR copies are ~90% identical with CC the consensus sequence. CC This element is characterized by imperfect 9-bp terminal inverted CC repeats and the 8-bp target site duplications. CC The HAT1N_DR family is a nonautonomous derivate of some CC autonomous CC element similar to HAT1_DR. CC The HAT1_DR consensus sequence and HAT1_DR share the ~85% CC identical CC ~240-bp and ~140-bp 5' and 3' termini, respectively. XX SQ Sequence 965 BP; 292 A; 191 C; 185 G; 281 T; 16 other; tagggatggc tgacgtgaaa ctgatgtttc gacacagtgt cgagatcccg aagcgcaagt 60 gtttcgaaac actgcaccga agcatgatcc gaagcaccca agtcacgtga ctaaagtgtt 120 tcgaaacacc tggtcacgtg actaaagcga ttcaaagcat tgatcagctt agaagcgttt 180 cgagacctgg cgaatctgca tttgactgcc aaggtcagca aaaatctctg tctcatatgg 240 cytagwggac tgtgcgtgtg cncattgatt ctatgcttcg catgatgttt tgggttcgaa 300 tcccgacttg tacaaatact ctgaaatcat ggttttatgt attttaatca tgtttctgtt 360 ttgtgtagcc taaataggat acagctgcag ttacgccatc aatttagcat catttttgta 420 acactgtaaa acaataataa atatttttac agtatggttg ggtttagggt ttgggtagac 480 attaataaaa tacaataaat gggaaattta atgaataata taaataattc ttgataactt 540 caggccaccg tttttgatct agcaacaacc ctgttttaag acmaaaacaa gacatattta 600 ttcacaaakt gtttattcgg atrtcagacc aatgttgaaa caaaacgata caagarcaaa 660 gcattaccaa ctggtattaa agcatttcaa aatarctgcg acagcctggm ttcaaaccca 720 ctayccccgc atggaagtct gacaccttaa ccactgtact anatmacttg aagrwtcaag 780 ymtacaaagt ggggtttaat acctcaattt gctcaactgt tctttgctga agctgtttca 840 gtgcaataca taaaaaatga ccactagatg tcactgtaga gtggggtttc gaaacgtttc 900 gaagcttcga cacatttgct tcgactgttt cagtgtttca cgaagcctcg ctttgcccac 960 cacta 965 // ID Copia3-LTR_DR repbase; DNA; ZEB; 362 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia3_DR; KW Copia3-I_DR; Copia3-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-362 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1589-1589 (2008). XX DR Genome; chr16; Positions 8435677 8436038. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 362 BP; 97 A; 54 C; 73 G; 138 T; 0 other; tgttgagttt tgtgttctca ttaatcctat aggtttatgt tttgttatgt attttgttta 60 tttatttctt ggattatact acttactgct gtatgccaaa aatgcgcaag agttcattgt 120 gcgcaagtgt aaacaaacag gacttttatt ttgactccgg ttgtgaatgt tgaacttact 180 tccgagtact tttatgtgat ttccttcctc ttcgcggcac tgatgttttt ttttgccggt 240 gcgttgtagt gaaagcgagc aaataaataa tattgagaag ttgaatctgg attgttcttt 300 catggatgga tgacaacaac atattgcaca tagagatgac agatagaaat gaaaacccta 360 ca 362 // ID Gypsy48-I_DR repbase; DNA; ZEB; 4410 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy48_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy48-I_DR; KW Gypsy48-LTR_DR; Gypsy48_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4410 RA Dib M.R. and Naveira H.F.; RT "Gypsy48_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 827-827 (2007). XX DR [1] (Consensus) XX CC Gypsy48-I_DR is an internal portion of the Gypsy48_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy48-LTR_DR. CC Gypsy48_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1437-aa polyprotein CC Gypsy48_DR1p (pos. 99-4402) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC three independent insertions bearing at least 85% homology over CC at least 1000bp. XX FH Key Location/Qualifiers FT CDS 92..4402 FT /product="Gypsy48-I_DR_1p" FT /note="ORF." FT /translation="MEEVAWDTVDEDEEVTSEKGAKSSPTRDPPVATTIEE FT DVVAMMRTFLTGQQKREEGLIYELRGLRETLRQQSTHETSQSRRWDLPTPV FT VTHTRRTEAAHHQTPVPPQPMRPETQMPTLQEGEDVENYLRRYERLAKSWG FT WPEVEWACRLVPLLSGKALEAYLAMDEDRADVYEDLREALLEKFDISTESY FT RQRFRQTVTPAGETPTETYNRLKGLYRRWVQPERSTKAEIGEVIILEQFLR FT VLPYEVRTWVKEHEPMDGLGAARLATQYLNAHRGQTHTRATANRGAKHTDK FT GDEGTVSFVPNKKLVCFYCQQEGHKASNCPVRKSKLTSLCYVPREGYEKDT FT VIHAKQHIDVVVNGQTLNALLDTGSNMSLIKQCYVPATRIDFTHTQNILCV FT HGDQKNYPTTEITVLIKDQPYLMTVGVVEGLPVDMVLGWDLPIVSELLKKT FT DGCTDVTSGVSCPVLTRAQAKVGLEPLPNLDDSLIQGEGKTKKSRRKRCME FT KQLGSPIQKEKTEGLHVTDWDIPTNIAQLQADDVTLRPLFKKVCEGANTIE FT ILGAEKYVVENDVLYVLSDGVKRLVVPECCRPLILHLAHTIPWSGHLAHQK FT TYYRISTCFFWPSMYTDVHTYCSTCAVCQKTSSVRRSARAPLHPLPVVTTP FT FKRIAMDIVGPLEKSSTGNQYILVICDYATRYPEAFPLRSITTAKVITALI FT QLFSRVGFPDEILTDQGTNFTSSLMKLLHEQLGIQSIRTSPYHPQTDGLVE FT RFNQTLKRMLRKFVNDTGKDWDKWLPFLLYAYREVPQASTGFSPFELLYGW FT RVQGPLELVRKSWEAPNTAGEGETNIVRFVLQMRDRLDTYREQAQENLEKA FT QKAQKLWYDQWSRQREYQPGQKVLLLLPSSTNKLLAKWQGPYDILRKTGPV FT TYEVLHPDRGKKSQIYHANLLKEWREREPMMKKAMLTCKVDAPEEEEEDKL FT DEETPRRFQSVVSVDHLEEKQRVELMDVLHSYPELLRTQPGRTNLVEHSIY FT LTDPTPIRQRPYRVPESLITPLRAEIKMMMDMGVIESSTSAWSSPIVLVPK FT KDGTLRLCLDFRKLNAVSKFDAYPMPRIDELVERIGRAKYITTLDLCKGYW FT QVPLEKTSREYTAFRTPVGLYQFTTMPFGLHGAPATFQRLMDLILQDCEDC FT SAAYLDDVVIYSNTWKDHLQHLHKILHKIQKAALTLNVTKCEWAKHDTKYL FT GFQLGNGEVRPQLDKVEAIRNCPRPRTKKEVRSFLGLVGWYRRFIPQFSEK FT AVPLTNLTTKMVKNPVPWTEDCEHAFTYLKECLCTFPVLRSPDFAKRFLVQ FT VDASARGIGAVLLQEDRGQKYPVLYLSRKLLPRETRYATIEKEGLAIKWAL FT ESLRYYLLGREFDLETDHRALSWIHSMKDRNARVTRWYLSLQPYKFQIRYK FT AGKENIIADYLSRLSHTFNPGEG" XX SQ Sequence 4410 BP; 1373 A; 990 C; 1085 G; 962 T; 0 other; agtggtgtca gaagtgagat ggcaagtcga aaatcagtgc attcaggaaa acagacaagg 60 aagggtctga gatcccagaa aaaaaacagc gatggaggag gttgcatggg acacggtgga 120 tgaggacgaa gaggtcacca gtgagaaggg tgctaaatcc agccctactc gagacccacc 180 ggttgcaacc accatagagg aagacgtagt agcgatgatg aggacctttt taacaggcca 240 gcaaaaaagg gaagaaggtc tcatctacga gctacggggg ctacgagaaa cactgcgcca 300 acaatccaca catgagacct cacagagccg gcgatgggac ctacccacac cagtagtcac 360 ccacacccgg agaacggaag ctgctcacca ccagacacca gtgccaccac agccaatgcg 420 accagaaact caaatgccaa cgttgcaaga gggggaagat gttgaaaact accttcgtcg 480 atacgaacgt ctggcaaaat cctggggttg gccggaggta gagtgggcgt gtcgactggt 540 tcctcttctc tccggaaaag cactagaagc atacctcgct atggatgaag atcgggctga 600 tgtctatgag gatcttcgtg aggcactgct ggaaaaattt gacatctcaa cagaatcata 660 ccgacaacga ttccggcaga cggtgacacc tgctggagaa acccccacag agacctataa 720 tcgacttaag ggcctctacc gacgctgggt tcaaccggag cggagcacga aggcagagat 780 aggagaagtc atcatcctgg aacagttcct aagagtgctg ccatatgaag tacgaacctg 840 ggtgaaagag catgagccca tggatggact aggtgctgcc aggctggcca cacaatacct 900 gaatgcacac agaggtcaga cgcatacacg ggctactgca aaccgagggg ccaaacacac 960 tgacaaagga gatgaaggta ctgtttcttt tgttcctaac aaaaaacttg tgtgttttta 1020 ctgccaacaa gaagggcata aagcatcaaa ctgtcccgta cgaaagtcca aactaacaag 1080 tctttgttat gtcccaagag agggttatga gaaagatacg gttatacatg caaaacagca 1140 tatagatgtt gtggtcaatg gacagacatt gaatgcactc cttgacactg gcagcaatat 1200 gtctttgata aaacaatgtt atgtgccagc aactcgcatt gactttacac acacacaaaa 1260 cattctatgt gttcatgggg accagaaaaa ttaccctacc actgagataa ctgttctaat 1320 taaggaccaa ccatatttga tgactgttgg agtggtagaa gggttacccg tagacatggt 1380 attaggatgg gatctaccaa ttgtgagtga actgttaaaa aagactgatg ggtgtacaga 1440 tgttacttct ggtgtgtcat gtcctgtact gacacgtgca caagctaaag tgggtttaga 1500 acccctacca aacttggatg acagcttaat tcaaggggag ggaaaaacta agaagtcacg 1560 gcggaaacgc tgcatggaaa aacagttggg ttcacccata caaaaagaga aaactgaggg 1620 gttgcacgtt actgattggg acattccaac caacatagcc cagttacaag ctgatgatgt 1680 taccttaaga cctttgttta aaaaagtatg tgagggggca aatacaattg agattcttgg 1740 ggcagaaaag tatgtagttg aaaatgatgt actgtatgtc ttgtctgatg gtgtcaaacg 1800 tcttgttgtt cctgaatgtt gcagaccact gattttgcac ctagcacaca ccatcccatg 1860 gtcaggacac ttagcacatc agaaaacata ctatcgcatc agtacatgtt tcttttggcc 1920 atccatgtat actgacgtac acacatactg ctctacatgt gctgtctgcc agaaaaccag 1980 ttctgttcgc aggtcagcac gagctccttt gcatccattg ccagtggtca ccactccctt 2040 caaacggata gcgatggata ttgttgggcc tctggaaaaa agcagcacag gcaaccagta 2100 cattttagtg atctgtgatt atgccacccg ctacccagaa gccttccctc tacgtagcat 2160 caccacagca aaggtaataa cagcactaat acaactgttt tccagggttg gattcccaga 2220 tgagatctta actgatcaag gtacaaactt tacctcaagc cttatgaagt tactacatga 2280 gcagttgggc attcagtcga ttcggacaag tccataccac ccacaaactg acggtttagt 2340 tgaacggttt aatcaaaccc tcaagaggat gctgagaaag tttgtgaacg acaccgggaa 2400 agactgggac aaatggctgc cttttctgct ttacgcctac cgtgaggtgc cacaggcatc 2460 tacagggttc tcaccctttg agctcctcta tggatggcga gtccaaggac ccctggagct 2520 ggtgaggaaa agctgggaag caccgaatac tgcaggagaa ggagagacca acattgtcag 2580 atttgtatta cagatgaggg accgactgga cacataccgc gaacaagcac aagaaaacct 2640 agagaaagcc cagaaagctc agaaattgtg gtatgatcag tggtccagac agcgcgaata 2700 tcaacctgga cagaaggtac tattgctcct accatcatcc acaaataaac tgctggcgaa 2760 atggcaggga ccttatgaca tcctcagaaa gactggacct gtaacctacg aagtactaca 2820 ccctgacaga gggaaaaaga gtcaaatcta ccacgccaac ctcttaaaag aatggagaga 2880 acgtgagcct atgatgaaga aggctatgct tacttgcaag gtggatgcac cagaagaaga 2940 ggaagaggac aagttagatg aggagactcc gagaaggttc cagtcagtgg tgagtgtaga 3000 ccatctagag gaaaaacaga gagtcgagtt gatggatgta ttacacagtt acccggaact 3060 tctaaggaca cagccaggaa ggactaacct ggtggagcac agtatctatc tgactgaccc 3120 aacaccaatc aggcaacgac cttatcgagt accagagagc ttgataactc cattgagggc 3180 agaaatcaag atgatgatgg acatgggggt gattgaatcc tctacaagtg cctggagtag 3240 tccaatagtc cttgtgccaa aaaaggatgg gacattaagg ctatgcctgg atttcagaaa 3300 gctaaatgca gtatcaaaat tcgatgccta ccccatgcca cgtatcgatg agctcgttga 3360 aaggataggg agagcaaaat atattacaac gctggaccta tgcaaaggct actggcaggt 3420 gccactagag aagacctccc gagaatacac cgcttttcgg acaccagtcg gcctctatca 3480 gtttaccacc atgccctttg ggttacatgg ggccccagca acctttcaac gccttatgga 3540 tctaatactg caagattgtg aggactgcag tgcagcttac ctggatgatg tggtaatcta 3600 cagtaacacc tggaaagacc atctccagca tttgcacaag attttacaca agatccagaa 3660 agcagcactg accctgaacg tcacaaaatg tgagtgggcc aaacatgata cgaaatacct 3720 gggatttcag cttgggaatg gtgaggtgcg tccacaactg gataaggtgg aggccatccg 3780 caattgtcca cgaccgagaa ccaagaaaga ggtacgatca tttcttggac tagttggatg 3840 gtaccgtcgt ttcattcccc aattttctga gaaagctgta cccttaacaa atctgacaac 3900 caaaatggta aagaatccag taccatggac ggaggactgt gagcacgcat tcacctactt 3960 aaaagaatgt ctctgtacct ttcctgtgct gaggagccct gactttgcaa agagatttct 4020 ggttcaagta gatgcatcag ccagaggaat aggagcggtg ctgttacaag aagacagagg 4080 tcagaagtat cctgtgctgt atctaagtcg gaagctgttg ccaagggaaa cccgatatgc 4140 aacaatagaa aaagagggat tggcaatcaa atgggccttg gaaagcctcc ggtattattt 4200 actaggcaga gaatttgact tagaaactga ccatagggca ctgtcatgga tccattcaat 4260 gaaagatcga aatgcccgag tgactcggtg gtacctgtcc ttacagccat acaaattcca 4320 aatacggtac aaggcaggga aagagaacat catagccgat tacctgtctc gattgtcgca 4380 caccttcaat cctggagaag ggtgaggtaa 4410 // ID Copia-7-I_DR repbase; DNA; ZEB; 3222 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.11, Created) DT 17-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Copia-7-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3222 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1778-1778 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3222 BP; 573 A; 1304 C; 759 G; 581 T; 5 other; gaagactcgg ccacacaatg gatccagcgg gtatcaagat catgtgtctg cgtcaggaag 60 cccggtccat agaagaatat gtggaggayt tcatccatst ggcccattta acatctttgg 120 atgagttatg cctcatgatc ttttttcgtg gaggactatc tgagccactg tattccacaa 180 tgccattgca cgaaccccay gggacattag aacattatat tgatttggca ctacaaatga 240 gtgggtcccc cttcactgta ggagaggtgg aggacacccc taaatatttt ttgggggggg 300 gcaaaggaca gcaagacccg ctggcaacgg ggcttgctgc tcattccacc accacacacc 360 cagctcagaa aatggccgca tctagtcctc cagctgacaa gatggctgct ccagttcaca 420 agatggctgc ctctagtcct ccagctcaca agatggccgt ctccagtcct ccagcttaca 480 agatggctgc tccagctcac aagatggccg actccagtcc tccagctcac aagatggccg 540 actccagtcc tccagctcac aagatggccg actccagtcc tccagctcac aagatggccg 600 actccagtcc tccagctcac aagatggccg actccagtcc tccagctcac aagatggccg 660 actccagtcc tccagctcac aagatggccg actccagtcc tccagctcac aagatggccg 720 actccagtcc tccagctcac aagatggccg actccagtcc tccagctcac aagatggccg 780 actccagtcc tccagctcac aagatggccg actccagtcc tccagctcac aagatggccg 840 actccagtcc tccagctcac aatgtggtca caaccagccc tccaactcac aatgtggtaa 900 gcgccaatct gccaccagct cacaagatgg tttcctgttc cttgtccgtt cctgccatag 960 ttcctgttct agttcctgaa ataccatcac ctgagccacc agaatggcca ccaccacctg 1020 agttacaaga gctgatgcca ccaccagagc tgttactgcc agagctgcca ccgccacctc 1080 cagagctgcc accgccacct ccagagctgc cagacccgcc agagctgcca gaccctccag 1140 agctgccaga ccctccagag ctgccagacc ctccagagct gccagaccct ccagagctgc 1200 cagaccctcc agagctgcca gaccctccag agctgccaga ccctccagag ctgccagacc 1260 ctccagagct gccagaccct ccagagctgc cagaccctcc agagctgcca gaccctccag 1320 agctgccaga ccctccagag ctgccagacc ctccagagct gccagaccct ccagagctgc 1380 cagaccctcc agagctgcca gaccctccag agctgccaga ccctccagag ctgccagacc 1440 ctccagagct gccagaccct ccagagctgc cagaccctcc agtgctgcca gaccctccag 1500 agctgccaaa gctgccagat cctccagagc tgccagtgcc gctgccgcca gagctgccag 1560 agccaccgcc acctgaacct ccagaatggc cgccgcctcc tgagcttcct gaatggccgc 1620 cgcctcctga gctcgctgaa tggccgccgc ckcctgagct cgctgaatgg ccgccgccgc 1680 ctgagctcgc tgaatggccg ccgccgcctg agcttcctga atggacgccg ccgcctgagc 1740 ttcctgaatg gccgccgccg cctgagcttc ctgaatggcc gccgccgcct gagcttcctg 1800 aatggccgcc gcctcctgag cttcctgaat ggccgccgcc tcctgagctt cctgaatggc 1860 cgccgcctcc tgagcttcct gaatggccgc cgcctcctga gcttcctgaa tggccgccgc 1920 ctcctgagct tcctgaatgg ccgccgcctc ctgagcttcc tgaatggccg ccgcctcctg 1980 agcttcctga atggccgccg cctcctgagc ttcctgaatg gccgccgcct cctgagcttc 2040 ctgaatggcc gccgcctcct gagcttcctg aatggccgcc gcctcctgag cttcctgaat 2100 ggccgccgcc tcctgagctt cctgaatggc cgccgcctcc tgagctycct gaatggccgc 2160 cgcctcctga gctccctgaa tggccgccgc ctcctgagct tcctgaatgg ccgccgcctc 2220 ctgagctccc tgaatggccg ctgcctcctg agcttcctga atggccgccg cctcctgagc 2280 ttcctgaatg gccgccgcct cctgagcttc ctgaatggcc gccgcctcct gagcttcctg 2340 aatggccgcc gcctcctgag cttcctgaat ggccgccgcc tcctgagctt cctgaatggc 2400 cgccgcctcc tgagcttcct gaatggccgc cgcctcctga gcttcctgaa tggccgccgc 2460 ctcctgagct tcctgaatgg ccgccgcctc ctgagcttcc tgaacggccg ccgcctcctg 2520 agctccctga acggccgccg cctcctgaac tccctgaacg gccgccgccg cctgaactcc 2580 ctgaacggcc gccgccgcct gaactccctg aacggccgcc gccgcctgaa ctccctgaac 2640 ggccgccgcc gcctgaactc cctgaacggc cgccgccgcc tgaactccct gaacggccgc 2700 cgccgcctga actacctgaa cggccaccgc ctcctaagct tcctcaatgg tcgcagcctc 2760 atgatccagg gccactgcag ctccacgctc caggccctcc gcagctccac gctccaggcc 2820 ctccgcagct ccacgctcca ggccctccgc agctccacgc tccaggccct ccgcagctcc 2880 acgctccagg ccctccgcag ctccacgctc caggccctcc gcagctccac gctccaggcc 2940 ctccgcagct ccacgctcca ggccctccgc agctccacgc tccaggccct ccgcagctcc 3000 acgctccagg ccctccgcag ctccacgctc caggccctcc gcagctccac gctccaggcc 3060 ctccgcagct ccacgctcca ggccctccgc agctccatgc tccagtcctg cccccgctgc 3120 atggtcctgg ccctccatcc ctccccctgt tccgcctccg ctccacctcc cgcctgaact 3180 gtatttggag cgtctggaag ccgctcttta gagggggggc ta 3222 // ID EnSpm-N7_DR repbase; DNA; ZEB; 2239 BP. XX AC . XX DT 20-FEB-2009 (Rel. 14.02, Created) DT 20-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE EnSpm-N7_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-N7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2239 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 9(2), 538-538 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2239 BP; 712 A; 434 C; 437 G; 647 T; 9 other; cccagcaggc acacaacrtc ataagatgtt aatattaggt tagatttagg tcgtgatgtc 60 aggtgaccaa aattcaatgt ctagccagcg tctaaggaca acgttatttt gatgtccaat 120 aacgacgtca aatgacgttg atatttggtt gattttaggt tgtgttggaa agtgaccaaa 180 atccaacgtc gagccaacat cataaatcaa tgtcataata ctgacattta tttgtcaggt 240 atggcaacca aaatccaacg tctgatagat gtcatagtgg taacgtccac acaacgtcaa 300 gctgtaacat cattagacgt tgatatttgg ttgattttag gttgtgttgg aaagtgacca 360 aaatccaacg tcgaacagay atcttagtgg taacgtccac acaacgtcaa ratgtaacat 420 aattagacgt tgatatttgg ttgattttag gttgtgttgg aaagtgacca aaatccaacg 480 tctgatagac gtcatagtgg taacgtccac acaacgtcaa gctgtaacat cattagacgt 540 tgatatttgg ttgattttag gttgtgttgg aaaaagtgac caaaatccaa cgtctagccg 600 atatcatagt ggtaacgtca acatcatatt gacatcaaay actgatattt agttgatttt 660 aagttatggy aaccaaaatc caacgtctga tagacgtcat agtggtaacg tccacacaac 720 gtcaagctgt aacatcatta gacgttgata tttggttgat tttaggttgt gttggaaaaa 780 gtgaccaaaa tccaacgtct agccaacatc ttaaaccaac gtcatattga cgtcataata 840 ctgacattta ttcgtcaggt atggcaacca aaatccaacg tctgatagat gtcatagtgg 900 taacgtccac acaacgtcaa gctgtaacat cattagacgt tgatatttgg ttgattttag 960 gttgtgttgg aaagtgacca aaatccaacg tcgagtcaac atcttaaacc aacgtcatat 1020 tgacgtcaaa tactgacatt tatttgtcag gtatggcaac caaaatccaa cgtctgatag 1080 acgtcatagt ggtaacgtcc acacaacgtc aagctgtaac atcattagac gttgatattt 1140 ggttgatttt aggttgtgtt ggaaaaagtg accaaaatcc aacgtctagc caacatctta 1200 aaccaacgtc atattgacgt caaatactga catttattsg tcagttatgt tgcaaccaaa 1260 atccaacgtc tgatagacgt catagtggta acgtccacac aacgtcaagc tgtaacatca 1320 ttagacgttg atatttggtt gattttaggt tgtgttggaa aagtgaccaa aatccaacgt 1380 ctagccaaca tcttaaacca acgtcawatt gacgtcataa tactgacatt tatttgtcag 1440 gtatggcaac caaaatccaa cgtctgatag acgtcatagt ggtaacgtcc acacaacgtc 1500 aagctgtaac atcattagac gttgatattt ggttgatttt aggttgtgtt ggaaagtgac 1560 caaaatccaa cgtcgagcca acatcttaaa ccaacgtcat attgatgtca aatactgaca 1620 tttattcgtc aggtatggca accaaaatcc aacgtctgat agatgtcata gtggtaacgt 1680 ccacacaacg tcaagctgta acatcattag acgttgatat ttggttgatt ttaggttgtg 1740 ttggaaagtg accaaaatcc aacgtctagc caacatctta aaccaacgtc atattgacgt 1800 caaatactga catttattca tcagttatgg caaccaaaat ccaacgtctg atagacgtca 1860 tagtggtaac gtccacacaa cgtcaagctg taacatcatt agacgttgat atttggttga 1920 ttttaggttg tgttggaaag tgaccaaaat ccaacgtcta gccaacatct taaaccaacg 1980 tcatattgat gtcaaatact gacatttatt catcaggtat ggcaaccaaa atccaacgtc 2040 tgatagaygt catagtggta acgtccacac aacgtcaagc tgtaacatca ttagacgttg 2100 atatttggtt gattttaggt tggacattgg cctgacgttg ggtcaacccg attttcattt 2160 ccaaacaaaa tgcaacgtcc ccaygacgtt ggggtacaac gtcaatctga cgtcatgttg 2220 acgtcctgtg cctgctggg 2239 // ID piggyBac-N1_DR repbase; DNA; ZEB; 1005 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE piggyBac-N1_DR is a putative piggyBac nonautonomous DNA DE transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW piggyBac-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1005 RA Bao W. and Jurka J.; RT "piggyBac-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1616-1616 (2008). XX DR [1] (Consensus) XX CC The element is characterized by 15-bp TIR and TTAA TSD. It has CC the 5'-CC terminus common in most piggyBac elements. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1005 BP; 332 A; 180 C; 188 G; 305 T; 0 other; ccacttaaac tctgctgcta tttggggatt tccgcctgga ttttgcctac ccaaatttaa 60 aagcttccca aatccacatg cagaggtgta aatgcaaaaa tttggtatca ttttaaagaa 120 aaccctttga attttcataa aacactattg aaagtgttta aaataactgt atatgttgtc 180 tgtgttataa taaacaccta aaaaaagagg cgctttttgt attttttttt tataaactca 240 aatttgaaag tgcgcctttt aggttctgtg tggtctagcg tgctgtaatt aattttggtg 300 gttcctgcac atgtctgtaa tcataggaaa aaagaaaaat gtctccacca taatctatgc 360 aaaagttatt gtattccaac tgatgagagg tgctgtacaa gccacaggga ccgatcattg 420 tttacatatt tcactattct tttgtttgat caaacataat tcactgtgtt tggaccacat 480 cagacatata aaaggattac ttatgcacat cacctcaaaa acagaggaga aatggccctg 540 aagggcacag cataaggtaa gagatgacag ctgtctgtgc tatctggggc tgcttattga 600 tcatgaatgt attgtttaca tttctgcgcc atggaaacac cacaattcat tcaacaactg 660 tttggcatat gaatataatt cagtaaaaag aatgctagaa ctgtatgagc accggcaaga 720 gctttcattt gagctataac ttgtacatgt gtcatataaa aaaatatgaa aatgaaccaa 780 tgtaaaatac ctagctcggg tatccaaaat actgtgtatt taagtgtaaa taacttttat 840 acagtagaat aaaaaccaaa gtgatgcata tgtgcataaa atatagattc tacactttca 900 aacgacacca cttacggggg tctggtgcaa cgctagccct ttaaatctga aagcgaaagt 960 cgatgacgtc acggacccgg taccgggtcc gcagagttta agtgg 1005 // ID Gypsy61-LTR_DR repbase; DNA; ZEB; 277 BP. XX AC chr15; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy61-I_DR; KW Gypsy61-LTR_DR; Gypsy61_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-277 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1438-1438 (2008). XX DR Genome; chr15; Positions 7134044 7133768. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 277 BP; 65 A; 52 C; 78 G; 82 T; 0 other; tgtagggaat atgttataac tcttcctttc ctccgggggg agctgtgtgt tgtgttaggg 60 agatgggctg cagtttggcc tcagttgtgt ttccctgtgg gaaagatgtt gcagtgtcat 120 ttctgctgtg caagtgacaa aatgaagaga ggagtccatt aaagggaaag ctgctctgtt 180 tggctcaacc agggtcttgg tgtccttctt attcatcctt ttacacatac acaacgcaga 240 gacattttgg ggaagcacac caacgaagag ggttaca 277 // ID hAT-N34_DR repbase; DNA; ZEB; 949 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N34_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-949 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1303-1303 (2008). XX DR [1] (Consensus) XX CC TSD is 8-bp long. The TIR is not perfect. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 949 BP; 299 A; 185 C; 214 G; 251 T; 0 other; tagagctgtg aacctacact agtctcacgg ttcggttcgg ttacgattat catgccatcg 60 attcggttca attcgatatt tcggtgcatc acggtgcatt gacggtgctt tccatacaca 120 gttttatatt ttcttcacag cagcatttct tgtattaaaa tgtatgaata tatttatatt 180 tatatactat ttgtaataca attttgtcat ttaatacaaa cagtcagata tataaactgt 240 aactttaaac aaaacgagca tttagcaaat aatataaaca aatataaata tccagctcaa 300 tttctgatcc ttgtctagtt ctcttacacc cctttgattg gtcacaccct caaaaaaaaa 360 acggttgcga ttggctcttg cgcgctgcgt tcttcacaga tgagtgatac tgataagcgg 420 caggcggcag cggcgatctc acagctgatg tatgatagac acggtggaga aaactctgca 480 gacacgcgct cgtttctgta tgcaaaagta acgctgtaaa acagccgaga agagaagaaa 540 agccacgcca gcaggtcaaa tgggccactg tgtgtgtgtg agaaagagag agagagacag 600 agagagagag acagagcgag agcgcgcgag agagagagag agagagagag agagagagag 660 aaatggagta ggctactttc attctctcga tctcagtgaa ataggggcta ttgttgtata 720 tgtcagtgaa agactgatcc agcaaacaca cacagtgaaa ttgtgcacag tttatgagtt 780 ttaagtagtt aaagcgctgt aaatactcgc gatcgcctcc ctcgcgacag agcgcatatg 840 aggtaaatga cgtcagtaat aaccggttat gattattact gaaccgatac cgaattgtcc 900 gcgtctgcat cgcagtgcac cgaaaaaacg attatttttg acaccccta 949 // ID DNA-TA-5_DR repbase; DNA; ZEB; 958 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD TA; KW DNA-TA-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-958 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2157-2157 (2008). XX DR [1] (Consensus) XX CC Copies are ~86% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 958 BP; 251 A; 206 C; 219 G; 278 T; 4 other; cagtggtccc tcgctataac gcggttcacc tttcgcggcc tcgcagtttc gcagwttttt 60 tttgtgcaat ttgacatgct tcatttattt tttttacagc gcattgtgtt ctgcgtcctg 120 attggctgta gaccattgtc aatcaatctc ctctgtgtct cctgtacagt acagaatgcg 180 ttcagcttgc caaattaaca taaatcttcg atcgctagca gtgtgagtty caacaaggat 240 gcaaaaaggg ttgtaactgt ccgcagcaag accatcgtaa agggggtcgc acaccagatg 300 cgtsgcgcca cgcagcgcca tgcattttag aattgtaaac ataggtttat atcagggtac 360 gcacaccggc gccgcaagtc agtggctgtc tgcggtgccc agctaygact caggacactg 420 ttcatatttc tgccgcgcca cagagcgcca tctgaataat ttcattttaa ataacatgcg 480 aatgtgcgca tccggtgtgt gttacttcca actgtcatgt gcgcggcgca tccggtgtgc 540 gaccccctta aggatggagt ctgctttagc tttgtggatc agtgactgca ggaaaaagaa 600 cattacgctg gatgccaaca ttatccgcac aagcgcaaaa gctaaaaagc tttatgaacc 660 ttttgctgac agcgatgaat gtttgcgcct gacaacaggt tttgatcttt ggtttcattc 720 tataatactg gacttatttt tctacgaaag tttgaacttt gagagtgttt aaacaagaga 780 gaaaagtgtg aaaatgttaa tgcctgtctg agaaaagtgt ataaagtgtg tagtgagggg 840 ttttacagcc ttaaaacatc tataataatt gtaaaaaata aagctgacta cttcgcggat 900 ttcgcctatt gcaggttatt tttagaacgt aactcccgcg aataacgagg gaccactg 958 // ID DNA2-1_DR repbase; DNA; ZEB; 5547 BP. XX AC . XX DT 01-DEC-2004 (Rel. 9.11, Created) DT 01-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE DNA2-1_DR is a nonautonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW DNA2-1_DR; HAT1N_DR; HE1_DR1; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5547 RA Kapitonov V.V. and Jurka J.; RT "DNA2-1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 250-250 (2004). XX DR [1] (Consensus) XX CC DNA2-1_DR is a nonautonomous DNA transposon. Its is CC characterized by 11-bp terminal inverted repeats and 2-bp CC target site duplications. It is expected to be a member of the CC Mariner superfamily (however, this transposon has terminal CC inverted repeats that are shorter of TIRs in Mariners). CC This is a composite transposon, it contains copies of CC HE1_DR1 (pos. 4138-5096) and HAT1N_DR (pos. 4138-5096). XX SQ Sequence 5547 BP; 1915 A; 942 C; 903 G; 1787 T; 0 other; cactgtaaaa aaaaaaagtt gagtaaactt aaaataatta ggcaacctat cacacatagc 60 tttttgagtt aactcaacaa acagggtttg aagaccagtg aactcaaaat attttgttga 120 attctttagt tctctgaaca aaaagtatca tgttcaccca acaagatatt tttgtttgct 180 gaacaaaaaa tatcatgttc aaccaacagg atatttttgt ttgctgaaca aaaaatatca 240 tgttcattca acaagacatt tttgtttgct gaacaaaaaa tatcatgttc aaccaacagg 300 atatttttgt ttgctgaaca aaaaatatca tgttcaacca acaggatatt tttgtttgct 360 gaacaaaaaa tatcatgttc aaccaacagg atatttttgt ttgctgaaca aaaaatgtca 420 tgttcaccca acaagatatt tttgtttgct gaacaaatat tatcatgttt actcaacaag 480 atcttttgtt tgctgaacaa aaattttcac gtttattcaa caagatcttt tgtttactga 540 acaaaaatga ttatgtttac ccaacaagat ttttttagtt tactgaacaa aataaaacaa 600 actgattcat tatgcattta ctattaattt ttaggattca aattaaatac acaagaatag 660 tcaattcact tttttttatt tttaactgtg ttcaaatcac aatccaaaac atttgcatac 720 aaaatcagca tttgacatta gccgtgttca gcattgggct agtttgtaca caactagtac 780 tgaatgtttc cagatttgaa aagattgtac actgcctcaa cataaaataa ctggcaacct 840 tactaaaacc aagctattag tgtgtctaaa cactcaaacc aggtgaatat acaatcatct 900 aatcagagta aaccctaaaa gccactccat tcatacagtc tttaaagtgt actgcacttt 960 tgtcaacaaa gtacataaag tccaaaaaca gacctcaaaa cattgaaaca gtggcacatt 1020 aggaagtggt cacgggtcag ttgccgttac tgtgtggagt ttgaatgttc tccctgcgtt 1080 cacgtgggtt tcctttgggt tctccggttt cccccacagt ccaaagacgt gtggcacagg 1140 tgaattgggt aggctaaaat tgtccgtagt gtatgtgtga atgagtgtgt atggatgttt 1200 tccagagatg ggttgcagcc ggaagggcat acgctgcgta aaacaagttc attgcgctgt 1260 ggcgacccca gattaataaa gggactaagc cgaaaagaaa atgaatgaat gaatgaactg 1320 aactctcgca gatcatgtgc caccaacctc cataaaacat gctgaatcac ttaaaaagta 1380 tatttgagtt cttttgggtg ttccatattc agtctataca agaggctgaa caagtaagcg 1440 aaggctgcag gcagatgtgt gatatcctga agtactatgg cctcctccaa gattttagcg 1500 aggttcatca catattgcag ggatgcagaa gtgccattat catcttccac cacagtgaag 1560 attccaatcc tttaatctct ggtgctgtgg tcttcagggt ccgtgtcctt ataggagatt 1620 aagtaatggg aaaattaatt gatgtgtaat attaattact acttattaat gttattacta 1680 ttatgcaaag aaaataaaaa catgtaatac tttcgtacat acaaacagta acactttgaa 1740 tagtctataa tatataataa ttgaacaata tattgacaac tttttaatat ttagcaattt 1800 tttaaacttt tattttacat aaaattgcaa agatttactt ttatttgata cagtttcaat 1860 tgtgaaagtt gtttattgtc ctgtttagaa tataactgag cctttaattg tcattcataa 1920 gggatttata aagcataaat agtcctcact ttagattagg tcacaactgg atgaagttaa 1980 gctcataaag catatactaa catacaaagt aactattatt acatacctta atattaggat 2040 atttaaatgt taatttgaat agtttattaa tcgtgtactt aagcattctg aatgatctta 2100 aaaacaccaa ctattcttaa ataactggtt tgtaaataat gcaatactta aattagtaat 2160 acaaatgaat cattaaagtt aaaaaaaata caatcattga gcacttcatg tagagttatg 2220 cttaacagaa aataaattaa ctattcgcta atactaaata aattatttat agcgtgcagc 2280 tattaaaagt gttaccaagc agactaataa atgtagaata gggtatgaaa atgggtaact 2340 taccagacat atcttgagaa agcaagcaga atttttacga agaaaaattg gcaatccctc 2400 aagagctgct ctttctcgat cggtgacaat atcagtgacc taaaggaaag tcatttacaa 2460 agggtgtttt acaatcttat tgttttgtaa gtaaaaatgt acttagtaca tttaaatgag 2520 catagtattt tattcatgct gttgaaaaaa cacatggctt tatttttaaa aaagaatttt 2580 gtggtttcca tacaatggaa tttgataatg gttgtgttgg caaatttaga aaaatgttta 2640 gtttcatgtg acaattactt tgagttaaaa actacagcta ttattttcta ttttcctctt 2700 tgaatgtatt catttgaaat ggtcactcaa aacaagtgaa acatctgtta tttatcttac 2760 ttccatctca caatcaaaac acaacttgac acaattaaaa ttaatgcaaa agggctggac 2820 aattaaattg ttgcaattac gttttattag actgtattac ctgagcatga aattcttgac 2880 aggacaaaat atgttgcaca tgttaaaaac atcagacggg tgtataagaa aaaaaaaatc 2940 ttaatgagta cctgctcatc catcttgttt aaagttgtat ccatctctgg acccattgaa 3000 tcccttcttt tctgtagagt ttcaggagag gtgctgtata tttatcaagt cactcgatta 3060 actgtctcag gagatctaca ttacagatct gagaaaactc tgtgcagatc tacaatatac 3120 aaaaaaaaca tacatattaa tttttacaca taatatatat atatatatat atatatatat 3180 atatatatat atatatatat atatatatat atatatatat gtaaatgtaa atatttaaat 3240 gttcaaatgc tgaatacaga tgtattttga attacctata ataataatgt tgtttgttat 3300 ttttagcatt atcattacta ccttgaatga aaccaaagca attgcaatat agtgacttac 3360 ctcttcctta aagtccaggt tatctctctc taatttcttc ctggctaaat ctcaacaact 3420 tcttttgggc aaagagaaaa ggtttcttgc atctgctggt gaatgaccgt caaatttcac 3480 tgatgataac aagtctcacc tcctcaagga aatcttgcat tgaattatgt cattgtcaat 3540 atttcagtcc ttttgagcaa aaagaaagaa aaatatatat ttagcccttt atatttttta 3600 tgtaaataaa agctagctga ctgttaaagc aagttggtct gaatatgata gacatttagt 3660 cgagctaaca aatcaagcaa tgaattacat gaaaatcaca cgattttcac acaaattaca 3720 cagttaaaca cgtcaccgtt tgttccactg gctctggttc gaagggaaaa aaggttatga 3780 tgtgcgcaat gctgacagca atctagataa acaggacata atgttatatt tgacttggat 3840 acctgatctg gtatgttata tgatactcat tgtattaatc gtgagttatt aaagcaaatg 3900 actcgattga aatcgctatt tgtttcaaat acattcactc taaactttta ttgatgtaaa 3960 tagccaaaac caagtgcacg aaacaagtaa ctcacggcta aattataaca aatagttccg 4020 tattaggata cacgcgcaca tcaaagccaa catatcaaat aatataacgt tagatgacaa 4080 caaaaaaaga catacatacc tatgatttag cagaaacgtc gttggcctgc tcttaactag 4140 ggatggctga cgtgaaactg atgttttgac acagtgtcga gatcccgaag cgcaagtgtt 4200 tcgaatcact gcaccgaagc atgatccgaa gcacccaagt cacgtgacta aagtgtttcg 4260 aaacacctag tcacgtgact aaagcgattc aaagcatcga tcagcttaga aacgtttcga 4320 gacctggtga atcctaattt gactgccaag gtctccaaat tatctctgtc tcaaatggct 4380 tagagcactg tgggtttgct tattgattct atgcttagca tgatgttttg ggttcgaatc 4440 ccgacttcga aatcatggtt ttatgtattt taatcatgtt tctgttttgt gtagcctaaa 4500 taggatacag ctacagttac gccatcaatt tagcatcatt tttgtaacac tgtaaaacaa 4560 taataaatat ttttacagta tggttgggtt tagggtttgg gtagacatta ataaaataca 4620 ataaatggga aatttaatga ataatataaa taattcttga taacttcagg ccaccgtttt 4680 tgatctagca acaaccctgt tttaagacaa aaacaagaca tatttattca caaattgttt 4740 attcggatat cagaccaatg ttgaaacaaa acgatacaag agcaaagcat taccaactgg 4800 tattaaagca ttttaaaaca actgctactg actggcttcg aaacctacta ctcctgcatg 4860 gatgtctgac accttaacca ctgtactatg ttacttgaag gattccactt tcacaaagtg 4920 gggtttaatc cctcaatttg ttcaatcgtt ctttgctgaa gctgtttcag tgcaatatat 4980 ataaaaaatg accactaaat gtcactgtag agtgaggttt cgaaacgttt cgaagcttcg 5040 acacatttgc ttcgactgtt tcagtgtttc acgaagcctc gctttgccca ccactactct 5100 taaccgctct atcacggtct ctcacatcat taatacactg gtaaaataga tcgggcagtc 5160 agatgggtag cacatcgcac aaaaacatgt atacaattat atacaatcga ttatgaaaac 5220 gattgacacg tagctccgca gaaagaaaga tggttcgttt tgtcgcgcgc acgtgacagg 5280 taccccctgc agtgagctat ggaagtaaat tgcgcatgcg ccagaagctt ataaggcgca 5340 tgcgccaaca agaagggcac gcatcaacgc aaaactaaga attttgtatt agttcaacga 5400 aatattttaa gtttttcaca ttgaagcacc atatttgttg aacaaactaa ctattatttg 5460 tcagcatttc aaatgaaaat atttaaattc agttaactta ttaatctttg ttcagaggac 5520 tttttgaaag ttggcgtttt tacagtg 5547 // ID BEL3-I_DR repbase; DNA; ZEB; 6021 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL3-I_DR; KW BEL3-LTR_DR; BEL3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6021 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1357-1357 (2008). XX DR Genome; chr1; Positions 36993981 36987961. XX CC Positions [5042-5602] - Integrase core CC 'CAGTT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 39..3245 FT /product="BEL3-I_DR_2p" FT /translation="MDSATVVVSDPAARPKRKGKLPSHLGDYEVGYTLPEG FT PPSITASRCHSQHKSQSRKTSHSKASLHSKSSSFSTTSRAQATLALTSSQA FT AIVEEKIKQRQYDNLLEQMEEDKLAEMEYHRLRTQAKEAQRVQEEALEAKE FT ALANQLERQRKLKKAEADLEVAKLVNSLLSQDLSVIDAEPQASPDTTAPPP FT KSLCASPAPSSVLQQSSCSAPPRMQIIQTVPEAQSTLKDPYMAHSTPYFPP FT GALQATKLTQAPAHTAQMANQHPPNPISVPRPTFNPMHTTVTHNAMPHTDS FT DIIPPTAPHQRVTVASNMPICPQPYANSPVSQTGYLPQPGTELLMAAAYGI FT PRPTLPVFESGTESDFALLKLALDSLLSPHTHISEQYKYQVLMSHLKFASA FT QQLAKAYMHHPQPYTAALQALQAKYGQPRQLVQSELGAIMSTPPLKIGDAN FT SFDSFALSVQSLVGMLRTLEGQNGFELMCGSHVDRLLGKMPPAYRDSFVEY FT CFSHGILQSGTDRTYTLPDLAAWLQTKSQAKRISSRAVAMFQSDITKTSGR FT SKSSSYHWERPTTSFLTSENAVKPPAATPAKVSPKPKPYCPYCDNPDHYLN FT SCDKFKKLTTDQIVSWITRDKRCWKCGRNHSVNTCNLKRPCKICKEIHLTI FT LHDSIKDTSRAVLMVSLPSTQIYLDRPNRSQKVMLKVARVLLHSQTQTIEA FT YAVLDDGSERTIVLPQVVRQLGLSGVPEVLSLQTIQRNHTDLNGSKVTFEV FT SPFAKPSERYVIRNAFSAPGLCLAEHNYPVAALQKSYHHLKGLPLQPVDRV FT KPLLLIGSDLPHLLTPTQPVRRGPDGGPVAIHTKLGWVLQGPMSPIQPPKG FT LQQCLHIMTAPSREELSQHVERLWQIDTLPYNEKLITRSQQDNQCHNLLQT FT TTVRVDVNGIQRYATPLLRRTPITLLHTGIEAVLPSLRNLERKLAKNPEQA FT QVYCSEIQKLESAGYIAKITPQEANHSTESWFIPHHLVHHNGKDRIVFNCS FT FQYNGQSLNEQLLPGPTLGPSLLGVILRFRQHAVAVSGDIKGMFHQVRLLL FT GTGLCCDSSGGI" FT CDS 3245..5980 FT /product="BEL3-I_DR_1p" FT /translation="MNREAVPDIYEWQVLPFGTTCSPCCAIYALQCHSQEN FT RESRPDLADIVETSFYVDNCLHSTPNQDEAKAIIDGLRESLLKGGFEIRQW FT ACNVPSVLKHLPSEAISASSECWLSQSSSYLQELTLGLQWNCIDDTLGYKT FT RIAEPIQPTMRNLYKTLASQFDPLGFIIPFTTRAKTLIQDLWKQNLSWDDP FT IEPLYLRDKWSTWVAELSHITTVQFARPYAPAMSDTPSTIRELHVFCDASE FT RAYGSVAYLQIRNEEKVHVSFVLARSRVAPRKCLSMPRLELCAAHSGAQMS FT KVIQAELTIPIHQVTHWTDSTTVLHWLKSESCRYKVFVGSRVAEIQTLTDV FT SKWRYVDSNRNPADHITRGLTLEEIAGPHQWRYGPSFLTQPQEVWPVTPST FT KSEPDMNEVKRASFIAAIRVTPSPVPADLSQFRTWKELLQATVTSCDGAAD FT AETTNDASKYIQAEKLLLAHAQLDSFPEEFRALKAGQPIPTDSRLSSLAPE FT FDKDAELIRVGGRLRRAETLSFDTIHPIVLDSHHQVTKLLIQDIDQQLHHP FT GAERVLAELRRRYWVLRGREAVRKHQHFCQNCQFWRAKPQNPQMADLPPCR FT LRLFKPPFYSTGIDCFGPFHIKIGRRREKRWGILFKCLTTRCLHLDLLEHL FT DTDAFLLSLRRFISRRGKPAEILCDNGTNFVGGDHELRDSFKQMAPKLQTQ FT LAQQQIQFRFNPPNAPHFGGTWEREVKSVKAALQVILREQCVPESVLQTLL FT VEVEGILNSKPLGYVSSDIADPDPVTPNMLLMGRHDASLPQVLYDSNEMLG FT KRRWRHSQVLADQFWAAFVHRYLPELQGRQKWRTDGKGLVVGQVVLLIDNQ FT LPRALWPVGTVTETYAGADGRIRTAKVQVKDKSYLRPVVKLIQIPEMKDKD FT TETKGGLS" XX SQ Sequence 6021 BP; 1717 A; 1562 C; 1324 G; 1418 T; 0 other; agttggtcct tcgagccgga gctggcaaag caaacaatat ggacagtgca actgtagttg 60 tatcagaccc cgcagccaga ccgaagagaa agggtaaact gccatcacat ttgggagatt 120 atgaagttgg ctatacactt cctgagggcc ctccttcaat cactgcttca cgctgtcaca 180 gccaacataa aagccagtca aggaaaacgt cccatagtaa agccagcctt cattctaaat 240 ccagtagttt ttccactact tctcgtgcac aagctacttt agcactcaca agcagtcagg 300 cagccatcgt ggaggaaaaa attaagcagc gtcagtatga taaccttctt gagcagatgg 360 aggaggacaa gctggccgaa atggaatatc acagactgcg aacacaagct aaggaggctc 420 agcgcgtaca ggaagaggct ctggaggcaa aggaagcatt agctaatcaa ttggaaagac 480 aacggaagct gaaaaaggcc gaagctgact tagaagtcgc taaacttgta aattctttac 540 tcagccaaga cctaagtgtg atcgatgctg aacctcaagc atccccagac accacagcac 600 cgccaccaaa gtccctttgt gctagcccag ctccatcttc tgtcctacag cagtcttcat 660 gctccgcacc gcccagaatg cagatcatac aaactgttcc agaggcacag agcacattaa 720 aggatcctta tatggcccat tcaacaccct actttccccc aggggcgcta caggcaacta 780 agttaacaca agctccagct cacactgccc aaatggccaa tcagcatcca ccaaacccaa 840 tttcagtgcc acgacccact tttaacccta tgcatacaac agttactcat aatgctatgc 900 cccacacaga cagtgatatt ataccaccaa cagctcctca tcagagggtg acagttgcat 960 ccaacatgcc tatttgccca cagccctatg caaattcacc tgtttcccaa actggctact 1020 tgcctcaacc cggaacagag cttttgatgg ccgcagccta tggcatccct cgcccaacac 1080 tacctgtgtt tgaaagtggt acagaaagtg actttgccct cttgaaattg gccttagaca 1140 gcctactaag tccccatact cacattagcg agcaatacaa ataccaagta ttaatgagtc 1200 atctgaaatt tgctagtgca caacagctag ctaaagctta tatgcaccac ccacaacctt 1260 acaccgcagc cttacaagct ttacaggcga agtatggcca gcccagacag ctagttcaat 1320 ctgaattagg ggctattatg agtacacctc cacttaaaat aggtgacgca aactcatttg 1380 actcattcgc cctgtctgtc caatccctag tcggtatgct gaggactcta gagggccaga 1440 atggatttga gctcatgtgt ggttcccatg ttgaccgcct tttaggaaaa atgccccctg 1500 cctacagaga tagctttgtg gagtactgct ttagtcatgg catccttcaa tctggtacag 1560 acagaacgta tacactacct gacctagcag cctggttaca gacaaaatcc caagccaagc 1620 ggatctccag tagagcggtt gctatgttcc agtcagatat tacaaagaca tctggcagga 1680 gtaaaagttc ttcatatcat tgggaacgtc caacgacttc gtttctgact tcagaaaacg 1740 ctgttaagcc accagcagca acaccggcaa aggtcagtcc caagccaaag ccgtattgtc 1800 cgtactgtga caaccccgac cactatttga actcatgtga caaattcaag aagctgacca 1860 ccgatcaaat cgtcagttgg atcacaagag acaaacgctg ttggaagtgt gggaggaatc 1920 actcagttaa tacttgtaac ctcaagcggc catgtaaaat atgcaaagaa atccatctta 1980 ccatactgca tgactcaatt aaagacactt ccagagcagt tctaatggtg agtttgcctt 2040 cgacacaaat ctatctggat agacctaacc gctcacagaa ggtgatgctc aaggtcgcac 2100 gagttctcct gcacagtcaa acccaaacta ttgaggctta tgccgtgctc gacgatggct 2160 cagaaaggac tattgtcctc cctcaagtag ttcgacagct tgggcttagt ggagttccag 2220 aagtgctttc tctgcagaca atccagcgaa accatactga ccttaatggg tccaaagtca 2280 cctttgaggt ctctccattt gccaagccat cagagaggta tgttatacgc aatgcctttt 2340 ctgcccctgg tctatgtctt gctgagcata actaccccgt agcagcacta cagaagtcat 2400 accaccatct gaaaggtctg cctttacagc cagtagacag agtcaagccg ttactcctca 2460 tagggtctga cctgccacac ctcctgacgc caacacagcc agtacgaagg ggaccagatg 2520 gtggccccgt tgctatacat accaagcttg gttgggtact acagggacca atgagtccaa 2580 ttcagccccc caaaggcctg caacaatgcc tacatattat gacagcccca tcacgtgaag 2640 agctgtctca gcatgtggag cgcttgtggc aaattgacac attgccatac aatgaaaagc 2700 tgataacaag atcacagcaa gacaaccaat gccataacct cttacagact actactgtca 2760 gagtagacgt aaatgggata cagcgttatg ctactccact gctaagacgt acccccatca 2820 ctttgctcca tacaggcatt gaagctgtat tgcccagcct cagaaacctt gaaagaaaat 2880 tagccaaaaa tcctgagcag gcacaggttt actgttctga gatccaaaag cttgagtcag 2940 caggttacat cgcaaagata acaccacaag aggcaaacca ctccacagag tcatggttta 3000 tcccacacca cctggtgcat cataatggaa aagatcgtat cgtctttaac tgctccttcc 3060 agtataatgg ccaaagtcta aatgaacaac ttcttcctgg gccaacgcta ggaccatctc 3120 tgttgggggt gattttgagg ttccgacagc atgccgttgc agtgagtggg gatatcaaag 3180 gcatgtttca ccaagtgcgt ctgttactgg ggaccggcct gtgctgcgat tcatctggag 3240 gaatatgaat agggaggcag ttccggacat ttacgagtgg caggttcttc catttggcac 3300 aacatgcagc ccgtgctgtg ccatttatgc ccttcagtgc cattcccagg aaaacagaga 3360 aagtaggcca gacctagctg atattgtgga gacctcattt tatgtcgaca actgccttca 3420 cagtaccccc aatcaggatg aagccaaagc tataattgat ggcctacgtg agtcactgtt 3480 aaagggaggc tttgaaatcc gacagtgggc gtgcaacgta ccttctgtcc tgaagcatct 3540 tccatcagag gctatatcgg ctagtagcga gtgctggttg tcccagagta gctcttacct 3600 gcaggaactc acacttggct tgcaatggaa ctgtattgat gacacccttg gttataaaac 3660 ccgcatagcc gaacccatcc agccaacaat gagaaacctg tacaaaaccc tggctagtca 3720 gtttgacccc ctgggcttta ttataccctt taccactagg gctaaaacct tgattcagga 3780 tctctggaaa caaaacctaa gctgggatga ccctatcgaa cccttatact tgcgtgataa 3840 atggtccaca tgggttgcag aactgtccca tatcactaca gtacaattcg ccagaccata 3900 cgctcctgca atgtcagata ctccctccac catcagagag ttacatgtat tctgtgacgc 3960 ctcagaaagg gcatatggct cggtggccta tcttcaaatc cggaatgagg agaaggtcca 4020 tgtatccttt gtcctagctc gctctagagt ggctcctcgc aaatgtttat cgatgccccg 4080 cctggagttg tgtgctgcac actctggcgc tcagatgtca aaggtcatac aagctgagct 4140 tactatcccc atacaccaag ttacacattg gaccgactca actactgtgc tgcattggct 4200 aaagtcagaa tcttgccggt acaaagtgtt cgtagggtcg agagtagctg aaattcagac 4260 actgacagat gtgtccaagt ggaggtatgt tgactccaat agaaaccctg ccgaccacat 4320 taccaggggg cttactttag aggaaatagc aggccctcac cagtggagat atggaccatc 4380 attcttgact caaccacagg aagtatggcc agtgacaccc agtacaaagt ctgaacctga 4440 tatgaatgaa gtcaaacgag catcgttcat tgcagccatt agagttaccc caagcccagt 4500 accagcagac ctcagtcagt tcagaacttg gaaggaactt cttcaagcga ctgtcacatc 4560 ttgtgatggg gcggctgacg ctgaaactac aaacgatgcc tcaaagtata tacaggcgga 4620 aaagcttctc ttagcgcatg ctcagcttga ttcattccca gaggaattca gagccctgaa 4680 agctggacag cccattccga cagatagccg tttgagttct cttgctccag aattcgacaa 4740 agatgctgag ttgataagag tgggtggacg tcttcgtcga gctgaaaccc tgtcatttga 4800 tactattcac ccgattgttc tagactctca ccatcaagtc acaaagttac tgatccaaga 4860 catagatcaa caactccatc accctggtgc agaacgagta ctagcagagc taaggcgtcg 4920 atattgggtg cttagaggtc gcgaagctgt gcggaaacac cagcatttct gtcagaactg 4980 ccaattctgg cgtgccaagc cacagaaccc acaaatggcc gacctaccac catgccgact 5040 acgtctgttt aaaccaccat tctactctac gggcatagac tgcttcggac cctttcacat 5100 aaagataggc cggcgcaggg agaaaagatg ggggatctta ttcaagtgtt taaccaccag 5160 atgtttgcat cttgacctac ttgaacatct agatactgat gccttccttc tgtctctacg 5220 acgtttcata tcccgtagag ggaagcctgc tgaaatactc tgtgataatg ggactaactt 5280 tgtaggaggg gaccacgagt taagagactc atttaagcaa atggccccaa agcttcagac 5340 acagctagct caacagcaga ttcagtttcg atttaaccca ccaaacgcac ctcactttgg 5400 tggaacctgg gagcgagaag tgaaatcggt taaggctgct ttgcaagtca ttcttcggga 5460 acaatgtgta ccagagtctg tcctacaaac cttactagtg gaggtggagg gcatactaaa 5520 ttccaaaccc ttgggctacg tctcatctga cattgccgac ccagatcctg taaccccaaa 5580 tatgttacta atgggacgtc atgatgcctc cttgccccaa gtcctttatg actctaacga 5640 gatgctgggt aaacgcaggt ggcgacacag tcaagtactg gctgaccaat tctgggctgc 5700 atttgtccat cgctacttac cagaactaca gggcaggcaa aaatggagaa cagatggtaa 5760 agggctggta gtcggccaag ttgtactact tatagataat caactcccgc gtgcgctgtg 5820 gcctgtggga actgttacag agacctatgc aggggcggat gggaggattc gaactgccaa 5880 agtccaagtc aaggataagt cgtatctgag gcctgttgta aaactcattc aaattccgga 5940 aatgaaggat aaagacacgg aaaccaaagg tggcctttct taaggtttac aattatcttt 6000 atcgataatt gggggcggct a 6021 // ID CR1-4_DR repbase; DNA; ZEB; 1902 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 20-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE CR1-4_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; CR1 clad; KW CR1-4_DR; ORF2; reverse transcriptase. XX NM CR1-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1902 RA Kapitonov V.V. and Jurka J.; RT "CR1-4_DR, a family of CR1-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(5), 6-6 (2002). XX DR [1] (Consensus) XX CC CR1-4_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish recently. CR1-4_DR copies are ~5% CC divergent CC from the consensus sequence. There are ~100 copies CC of CR1-4_DR present in the zebrafish genome. CC The 5' portion is incomplete; the consensus was built from CC five copies. CC The consensus encodes CR1-4_DRp, a 566-aa portion of the reverse CC transcriptase. CC There is only a ~60% identity between CR1-4_DR and other CR1-like CC elements from zebrafish. XX FH Key Location/Qualifiers FT CDS 2..1696 FT /product="CR1-4_DRp" FT /note="portion of reverse transcriptase" FT /translation="LCNSFLTFFETKIKNIHHQLHSNSSAPDFSQHVSEII FT THFSFDTFTLPSTAEIVGHIRKSKTTNCLLDPLPTCLVKTCLPSLSTLITN FT IIHKSLDSGSVPSLFKTAVITPVLKNLELIHPIWPTTDQFPNLPFVSKILE FT KCVASQIHNYLSINNLFELFQSGFRPNHSTETALVRITNDLLLAADSGLLS FT ILILLDLSAAFDMVLHEVLLNRLASLGISGTLLLWFKSYLEDRTQYVQIKD FT FKSRSQIVTTGVPQGSVLGPLLFIIYLLPLGHIFRKYNIQFHCYADNTQLY FT LSTKPSCSFPPSALSRCLAEIKIWLSANFLKLNSDKTEALLIGTKSVLDKA FT DNFTIDIDNSTIFPSVQVKSLGVILDSTLSFEGHINNITRTAYFHVRNITR FT LRPSLTTNNTAIFIHALVTSRLDYCNALLSGLPSKLLRQKLQLVQNHAARV FT ISRTPSHEHVTPLLYQLHWLPVKYRIDFKILLLTFKALHNLAPQYLTELLH FT IYTPSRTLRSANNFTLVPPRTRLSTMGDRSFSSMAPRLWNSLPLDLRSSDS FT LHTFKSRLKTHLFKQAFL" XX SQ Sequence 1902 BP; 509 A; 461 C; 277 G; 655 T; 0 other; gttgtgtaac agcttcctga ctttttttga aacaaaaatt aagaacattc atcatcaatt 60 acattcaaat agttcagccc ctgacttcag ccaacatgtt tctgaaatca ttacacattt 120 ctcttttgat acttttacct taccatctac cgctgaaata gttggtcata tacggaaatc 180 caaaaccacc aactgcctgc ttgatcctct tcctacatgc ttagttaaga cctgtcttcc 240 atcattgtcc acactgatta ctaacattat tcacaaatca ctggattctg gatctgtccc 300 atctttattt aaaactgctg taatcacccc agtactaaaa aacctggagc tgattcatcc 360 aatttggcca actacagacc aatttccaaa tttgccattt gtctcaaaaa tacttgaaaa 420 atgtgttgcg tctcaaatcc ataactatct ctctattaac aatttgtttg agctcttcca 480 gtctggtttt cgtcccaacc acagcactga gactgctctt gtcaggatta ctaatgatct 540 actactggca gcagactctg gtttactgtc aattcttatt ctcctggact tgagtgcagc 600 ttttgacatg gttttgcacg aggttctttt gaataggctt gcctcactag ggatctctgg 660 cacccttctt ttatggttta agtcatatct cgaagatcgt actcagtatg ttcaaattaa 720 agattttaag tcaagatcgc agattgtcac tactggtgtc ccacagggtt ctgtactggg 780 tccactcctg tttatcatat atctactgcc tcttggtcac attttcagaa aatacaacat 840 acaatttcac tgttatgctg acaacactca actctacctg tccaccaagc cctcctgttc 900 ttttcctcct tctgctttaa gcagatgttt agctgaaata aaaatctggc tttcagctaa 960 ctttttaaaa ttaaacagcg acaaaactga agcccttctc atcggcacta aaagtgtttt 1020 ggataaagct gataatttca caatagacat tgataacagc acaatttttc cctctgtgca 1080 ggtaaagagt ttgggtgtca tcttggatag cacactctca tttgaaggtc acattaataa 1140 tattacacgt actgcatatt tccacgtgcg taatatcact cgtctccgcc cttctctcac 1200 aactaataac acagccattt ttatccacgc attagttact tcacgtcttg actactgtaa 1260 tgcacttctt tctggacttc cttccaaact tctccgtcaa aaactccaac tggttcagaa 1320 ccatgcagct cgtgtcatct ctaggacccc atctcacgag cacgtcacac cactcctcta 1380 tcagcttcac tggcttccag taaagtatcg tattgatttt aaaatattac ttctaacttt 1440 caaggcactt cataatctcg ctcctcaata tctcaccgaa cttctccata tttacacccc 1500 ttctcgtacc cttagatcag ccaacaactt caccctggta ccacctcgca ctcgattgag 1560 cacaatggga gacagatcct ttagctctat ggctcctcgg ctatggaact cgcttcccct 1620 agatctaagg agcagtgata gtcttcacac ttttaaatcc cgtctaaaga cccatctttt 1680 taagcaggct tttctttaac aatttttttt gtcatgtttt ttattatgtt cttttatatt 1740 cgctattgcg ttttaacctg tttggtcaat gattgttttt agtaatgttt aatttgtttt 1800 agcatgtctg ttgatgctta ttgtaaggcg accttgggtg tcttgaaagg cgccatttac 1860 aaataaaatg aattattatt attattatta ttattattat ta 1902 // ID L1-8_DR repbase; DNA; ZEB; 5248 BP. XX AC . XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 04-AUG-2002 (Rel. 7.07, Last updated, Version 1) XX DE L1-8_DR is a non-LTR retrotransposon from the L1 clade - a DE consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1 clade; KW L1-8_DR; LDR1; endonuclease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1248-5059 RA Jekosch K.; RT "LDR1: LINE-like element from Dano rerio."; RL Repbase Reports 2(2), 15-15 (2002). XX RN [2] RP 1-5248 RA Kapitonov V.V. and Jurka J.; RT "L1-8_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 28-28 (2002). XX DR [2] (Consensus) XX CC L1-8_DR is a family of L1-like non-LTR retrotransposon. There CC are ~500 copies of L1-8_DR in the zebrafish genome. They are CC ~2% divergent from the consensus sequence. The consensus CC sequence is incomplete (a ~300-bp 5' end is not present). CC The consensus sequence encodes two proteins: CC a 290-aa L1-8_DR1p (positions 1-820) and a 1271-aa L1-8_DR2p CC (positions CC 1248-5060). These proteins are most close to corresponding CC proteins CC encoded by other L1-like elements. L1-8_DR1p is a putative CC RNA/DNA CC binding protein, and L1-8_DR2p is composed of the AP endonuclease CC and reverse transcriptase domains. XX FH Key Location/Qualifiers FT CDS 1248..5060 FT /product="L1-8_DR2p" FT /translation="MHGTAKYLKIISWNINGSHNPVKRRKWLGYLKSKDVD FT IALIQETHMMGTEAEKLKRDWVGQVFHNSYNSKRNGVAILVHKRVNFVMIK FT QKKDEEGRFIWLEAMVDDQKVNICNIYAPNKEDSVFFHTVNKMIGGQAGNQ FT LIIAGDFNQVQDAYLDRTTYHKNMPRDRLAIQLMMEDLGLVDIWRLVNPRE FT REYTFFSHNHKSYSRIDYFLVSGDLVESVVDCKIGVIALSDHAPVEMTLDT FT NSRTIKQNRWRFNISLLQDLNFSTKLGADLKEFFEINVGTTERLGTVWEAS FT KAYVRGNVIQYASLVKRLNKEKVKDIEARIKVLDRQLSLKFTDSILKQVCD FT LKYQLNDIYNRKAEYAMFRMRTAFYESGEKADKLLARQLKQKDASFLISAI FT KNEKDEVVTANMDINKVFEKFYSKLYEAESSPDCTKYRDFFSKITLPTLSS FT DQLEMLDAPIEESEIGAAIMSMKAAKSPGLDGFPAEYYKKYIDIVAPILEG FT VYKETLLLEQMPPTFNDALITLILKKDKDLYDPGSYRPVSLENVDCKILSK FT VLALRLEGILSNIVHSDQVGFIKGRSSSDNLRSLLHLIWQSRNENVPVAAF FT SLDAMKAFDRVEWGYLTYTLQMFGFGPTFLKWVKVLYSAPRAAVLTNGIIS FT PFFKLKRGTRQGDPLSPLLFTLFLEPLAVAIRNDIRVNGVHLGEREYKCFL FT YADDILLLLSNPSTSIPAVMDTIEHFSQISGYKVNWVKSEVMPVSVGCSLA FT DVSAFSFKWIPTGMKYLGIRLSRDLYDVVQMNITRMLQNVSTNFDKWKVLN FT LSLWGKVNAIKMMVSSRINYISIMIPLEFPLSIFKQYNQLVKDFLWEGKKP FT RISMKNMFTTRIKGGLALPNIELYNTAFEMIKICKHWSGDNVEGISWIEIE FT KTLTFPFSVIDALSQKSLYSIMNGEVNPILEHSQQVWKKIHKIFNLSHYKQ FT GYSSLWNNPAIKIGKKTVYWKDWCKKKGLKTVDDLYGQGALYSFQELKDKF FT NLVDKGDFWKYIQLHSSIRTVGYKPGAEENVLLGFLNMPKSMQTTSFVYKI FT AANNMYGKSDHLKIIWEKDLEVEFEEGEWEAVVSGRGGTVRDVRSKLIHYK FT IINRYYWTPVRLHRIGLKENNHCWKCGQSVGTFLHLTWSCHLVAPFWTRVI FT QNLEKWLGQPLPYSPRVCLLGDTSTLQNGISKTQAGLVVAGYIIAVRLVLR FT NWKNSDTPSFKDWIELMTSTASYMNVCWQDFRIPPIPLIRNGVAFCNIWRA FT HKKKIEKLSI" FT CDS 1..870 FT /product="L1-8_DR1p" FT /translation="MTSKFTSKARAAKQGAMASDGPAEIREKDARLGHVTT FT PPGFAELQAALKAAMDEMARFSANLSTLQADVTSVKTTQFQMKADAAAMAQ FT RLDEAEARIGVLEDENERLQQMAEKSVKECAELRDSVSDMVNRERRMNLRL FT IGLKEKSENGNLRECVRLILSEALDVDISETELQRVHRSLVPMPDENKPPR FT PVIMRFHSFLERERVMAAVRLKVREGGSIKWRSSKISLFPDMTRDVAEKRR FT RFTAVRKRLHELDIRFTLAYPAVIRFTWKGQRVSFDDHSKALELLSEKD" XX SQ Sequence 5248 BP; 1671 A; 788 C; 1233 G; 1556 T; 0 other; atgacatcca agtttacatc gaaggcgcga gctgcaaagc agggcgcaat ggcgagtgac 60 gggcctgccg agattcgaga aaaagacgct cgattgggac acgttactac accaccaggg 120 ttcgcggagt tacaggctgc gttaaaagca gcaatggatg aaatggcacg atttagtgca 180 aatttgtcaa ctttgcaggc agatgtaacg agtgtgaaga ctacgcagtt tcagatgaag 240 gcggacgcgg ccgccatggc gcaacggctt gacgaggcag aggctcgaat tggagtgctt 300 gaagacgaga acgagcgcct gcaacaaatg gccgaaaaaa gtgttaaaga gtgtgcagag 360 cttcgggatt cagtttcgga tatggtaaac agggagcggc gcatgaatct acgtctgata 420 ggacttaagg aaaagtccga gaatggtaat ctgcgggaat gtgtaaggct gattctctcc 480 gaggctctgg atgtggatat ctccgaaacg gagttacagc gtgtccaccg atctttggtt 540 ccaatgccgg atgaaaacaa gccccctcgc cccgttatta tgcgctttca cagttttctg 600 gaaagagaga gggttatggc agcagtgaga ttgaaagtgc gggaaggtgg cagtatcaag 660 tggaggagtt ctaaaatatc gctgtttcct gacatgacga gagatgttgc ggaaaaaaga 720 cggcggttca cagcggtcag aaagcggctt cacgagctgg atatccgttt tactctggcg 780 tatcctgcag taatacgctt cacatggaaa ggacagcgag tgagctttga tgatcatagc 840 aaggcgctgg agttattgtc tgaaaaggac tgaggaacgg aagttgtttt ctgtggacaa 900 aagttatgtt atttgatgct aagataaaaa caaacagact aggtctgtaa ccaatgtaag 960 gacaaaggga tctgcagcaa gtttgatggg gtgttaacat gcgatttaca cgaacgttct 1020 ggggcggggt cgacaggtgg ttccatcatc agcgcttcac tgctcgtgta tggggagggg 1080 ccttctttta gtgataggat gtaacttggg agggctcaag gggtttaagt tcttgaagtc 1140 aatgtttaca gagttctttt atgtgtttta tttttatttt tttacttttg tatgttactg 1200 ttgtatgttt tttacagaca gcaagtgttt atatatagta gggagacatg catggaacgg 1260 ctaaatattt aaaaataata tcctggaata taaatggttc tcataatcca gttaaaagaa 1320 gaaaatggtt aggttacctt aaatctaaag acgtggacat tgctttaatt caagaaacgc 1380 atatgatggg tacagaggct gaaaaactta aacgtgactg ggtgggacag gtgtttcata 1440 attcatataa tagtaagagg aatggggtag caatattggt gcataagagg gtaaattttg 1500 tcatgattaa acaaaagaag gatgaggagg ggagatttat atggttggaa gccatggttg 1560 atgatcaaaa agttaatatt tgcaatattt atgctccaaa taaggaggat agtgtatttt 1620 ttcatacggt taataagatg attgggggac aagcaggtaa tcagttaata attgcaggtg 1680 attttaatca ggtgcaagat gcctatctgg atagaacaac ctaccataaa aatatgccca 1740 gagacagatt agctatacaa ttaatgatgg aagatttggg gttggtggac atatggaggc 1800 ttgtcaatcc tagagagaga gagtatacct tcttttcaca taatcataaa tcttactcaa 1860 gaatagatta ctttttggtt tctggtgatt tagtcgagtc agtagtagac tgtaagatag 1920 gcgtgattgc tttgtcagat cacgcaccag tggagatgac attagacacc aattctagga 1980 caataaaaca aaacagatgg agattcaata tatccttgct tcaggactta aattttagta 2040 caaaattggg agctgactta aaggaatttt tcgagattaa tgttggtacc acagagaggc 2100 tgggaacagt atgggaagca tcaaaagctt atgtcagggg taacgtaata caatatgcta 2160 gcttggtaaa aagactcaac aaggaaaaag ttaaggacat tgaggccaga attaaagttt 2220 tagacaggca attatcacta aagtttacag atagtatttt aaaacaagtc tgcgatctga 2280 aatatcagct taatgatatt tacaatagga aagcagagta tgcaatgttt aggatgcgca 2340 cagctttcta tgaaagtgga gaaaaggctg ataaattgtt agcaaggcaa ttgaaacaaa 2400 aggatgctag tttcttaatt tcagctatca aaaatgaaaa agatgaagtg gtgactgcaa 2460 acatggacat taataaggtc tttgaaaagt tttatagtaa attatatgaa gcagagtcat 2520 ccccagactg tactaaatat agagattttt tctccaaaat tacacttccc accttgtcct 2580 cagaccagct tgagatgtta gatgcgccaa tagaagaatc tgaaattggg gctgcaataa 2640 tgtcaatgaa ggccgctaag tcaccaggtt tagatggctt tcctgccgaa tactataaga 2700 agtatattga cattgttgca ccaatattgg aaggggtgta taaagaaacc ttgttactgg 2760 agcaaatgcc cccaacattt aatgatgcgc taattacgtt aattcttaaa aaggataagg 2820 atctttatga tcctgggagt tatagaccag ttagtttaga aaatgttgat tgtaaaattt 2880 tatctaaagt attagcattg aggttggagg gcattttatc caatatcgta catagcgacc 2940 aggtaggttt tataaaaggg agatcttctt ctgataatct tcggtcacta cttcatctca 3000 tctggcaaag ccgcaatgag aatgttccag ttgctgcttt ttcattagat gcgatgaagg 3060 catttgatag agtagaatgg ggctatttaa cttatacgtt acaaatgttt ggctttgggc 3120 caacttttct taagtgggtc aaggtgctat attccgcccc acgtgcagct gttcttacaa 3180 atggcattat ttctcctttc tttaaattaa agcgaggcac cagacaaggg gaccccctgt 3240 ctcctttgct ttttactttg tttctggagc ccttagcagt tgcaatcagg aatgacataa 3300 gagtgaatgg tgtccattta ggagagaggg aatataagtg ttttttatat gctgatgata 3360 ttcttctctt gctttcaaat ccaagtacat ctatacctgc tgtgatggat actattgaac 3420 atttttctca aatatcaggt tacaaagtta attgggtgaa atctgaggta atgccagtgt 3480 ctgtgggatg ttcgttggcg gatgtgagcg ctttctcctt taaatggata ccaactggga 3540 tgaagtattt aggtattagg ctctcaaggg acttgtatga tgttgttcag atgaatataa 3600 cccgtatgct tcaaaatgtt agtacaaact ttgataaatg gaaagtgtta aatttgtctt 3660 tgtgggggaa ggttaatgca attaaaatga tggtatcatc aaggattaat tatatctcta 3720 taatgatccc tttggagttt cctttatcca tctttaagca atacaatcaa ctagttaagg 3780 actttctgtg ggaagggaag aagcccagaa taagtatgaa aaatatgttt accactagaa 3840 taaagggggg tttggcattg ccaaatatag agctttataa tactgcattt gagatgatta 3900 aaatatgtaa acactggtca ggtgataatg tagagggtat aagctggatc gaaatagaga 3960 aaacgctaac tttcccattc agtgttattg atgctttatc tcagaaatct ttatattcta 4020 ttatgaatgg ggaagttaac cctatactgg aacactcaca gcaggtctgg aaaaaaatac 4080 ataagatatt taatttgtcc cattataaac aaggctattc ttcattatgg aataaccctg 4140 ctattaagat aggaaagaaa acagtctatt ggaaggattg gtgtaaaaaa aaaggcctga 4200 aaacagttga tgatttatat ggacaaggtg cactgtattc atttcaggag ttgaaagaca 4260 aattcaatct agtagataaa ggggatttct ggaaatatat acagctgcat agtagtataa 4320 gaactgtggg atataagcca ggagcagaag aaaatgtttt attagggttt ctaaatatgc 4380 caaagtcgat gcaaaccaca tcttttgttt ataagattgc tgcaaacaac atgtatggaa 4440 aaagtgatca tttaaaaatt atctgggaga aagacctgga ggtggaattt gaagagggtg 4500 aatgggaggc agtagtttct ggtcgtgggg gtactgtgag agatgttagg agtaaactca 4560 tacattacaa gataattaat cgatattact ggacaccagt aagactgcat aggataggat 4620 taaaggaaaa taatcactgt tggaaatgtg gtcagtctgt gggcactttt ttacatttaa 4680 cgtggagctg tcatttggtg gctccattct ggacaagagt tattcaaaac ctagagaaat 4740 ggctgggaca acctttacct tattccccaa gagtctgtct tcttggtgat acatcgactt 4800 tacagaacgg aatatctaaa acacaggctg gactagtcgt cgcaggatat attattgctg 4860 tgagactggt gctgcgaaat tggaagaact cagacactcc ctcttttaaa gattggattg 4920 agctgatgac ttctactgca tcatatatga acgtatgttg gcaagacttc aggattccac 4980 ccataccttt aatcagaaat ggggtagctt tttgcaatat ttggagagca cataaaaaga 5040 aaattgaaaa gttgtcaatt tgactgtctg ttatattgtt ttgtactatt ttatatttgt 5100 tataattttt gttttgccat tatttttatt ataatttgtc tacattattt cttgaatgtt 5160 gattttgtat tgtaaatttt atgttgtgtg ggtataaaag ttcaaaataa aataaaaaaa 5220 gttaaattac aaaaaagaaa gaaaaaaa 5248 // ID LOOPERN3_DR repbase; DNA; ZEB; 1011 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 02-MAR-2009 (Rel. 14.04, Last updated, Version 3) XX DE LOOPERN3_DR is a nonautonomous DNA transposon - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; LOOPERN3_DR. XX NM LOOPERN3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1011 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN3_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(5), 27-27 (2002). XX DR [1] (Consensus) XX CC About 1000 copies of LOOPERN3_DR are expected to populate the CC zebrafish genome. LOOPERN3_DR copies are ~10% divergent from the CC consensus sequence. This element is characterized by 10-bp CC terminal inverted repeats and putative TTAA targets site CC duplications (less likely, TTTAA). It is more likely belong to CC the Kolobok superfamily based on the features of the termini, CC although it is expected to be a member of the piggyBac/Looper CC superfamily. XX SQ Sequence 1011 BP; 290 A; 190 C; 218 G; 286 T; 27 other; agggcaccta tggtraaaaa tctacttttc aagctgtttg gacagacmtg tgtgtaggta 60 tagtgtatag accgtcatat tggggtgata taaacacacc cagtcctttt tttttcaatt 120 taactacata aaaacggtsg accaattgga gcggttttca gatcgaccgc aactttacgt 180 aggagtgcgg tccccccgcc caccgaattg attgacagct gcgcgtaaca tgttccggta 240 gtcatgtgta tatgtcaaca agaccagacg tgcgcaaagc aaccgggaat aaaaggtctg 300 ttcagttcgc taggatcatc aatcatcatc aaatgtgaty aagagtaagt ttcacatgtt 360 taaaatgttt taaaacagtg catgtgtgta atkaattaca gcgatttact tcagctttac 420 ttcatcagca cagccgcgtg tcagaacaat tataaaagaa gacgcttcaa tcccggtttg 480 tggacgttaa atcaggttta ttttgtacat taacataaca gatatccaca cagyastkga 540 grttagccta tcctgacaca tttgcgtgca aaaacagtgc taagctaagc gcgctctgtc 600 tgtctgcctc tgtgtgtgtg tgtgtctctg tgtgtgcgtg tgtgtgtgtg tgtgtgngtk 660 aactttgtaa cgatattgtg tgtgactcat caatgcaact kcacaatact sattrgtaaa 720 gttcttactg tagtatctca caaacgctac gtgagacctt cttcctttaa gtctgtctgt 780 tgtctgacgc agcygaggga ggaggcatgt agaaatagta ggcgggrarg actcgyctta 840 aaggcgcagt acgacaaaac maccccctgs tgraaaamwg tataaaacag satctwgtaa 900 aaggtataat gaaaaatctg atgggtrttt tgakctgaaa ctttatatac acattctaga 960 gacgcaaaag acttatatta aatctgaaaa aaggggtaac ctaggtgccc t 1011 // ID SUSHIIDR1 repbase; DNA; ZEB; 5030 BP. XX AC AL591175; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Sushi-ichi Danio rerio 1. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; SUSHIIDR1; KW Sushi-ichi; retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5030 RA Jekosch K.; RT "SUSHIIDR1: putative LTR-retrotransposon (internal region)."; RL Repbase Reports 2(2), 16-16 (2002). XX DR [1] (Consensus) XX CC Putative novel GYPSY-like retrotransposon similar to Fugu CC rubripes CC retrotransposon Sushi-ichi, with perfect 156 bp LTRs CC (SUSHIIDR1_LTR) CC and a reading frame in pos. 224-4476 with 3 frame shifts. XX SQ Sequence 5030 BP; 938 A; 1438 C; 1223 G; 1431 T; 0 other; ttactggtgc ttgagactgc cgtctgccct ctagaggcca tccttggttt tcttattttt 60 ttctttattt tttgcctgga tttttcatcc ctctgtgggt tccccagctc cacctgctgg 120 aggttttttg ttattgccca gtcagcaaga ttaataaaat tgtttttttt gacctgcact 180 tgaatcctct cattctcttt cctgacagaa cgataaggtc agaatggatt cagcaggcac 240 ggatcaagtg cgaactgcac ttgcccagca gggtgcactc ctgggtcaac atgccaccca 300 gctcaccacc acctccagag agttggagat gctcactgcc caagtggtcg agctcaacgc 360 acgaattgaa tcactccagc atgaagctgc aggacccagg caggttgctc cacctatcgc 420 cacccaccac cacgatccag agccccatgc caacaatcca cctccgtatg atggagatcc 480 cagttcctgt cgggcctttt tatcgcagtg tggccttgtt tttgccctgc aaccccgccg 540 ttatgccacc gaggcctcca gagtggcata tgtcctgacc ctgctcactg ggagagcacg 600 tgagtggggc actgcagtct gggatgctag ggcacccttc tgccgttttt ttgatgattt 660 taaggacgaa atgattaagc tctttgaccg gtccgcacag ggagatgagg cggctgccag 720 gttggcacga ctgacccagg agggccgttc tgtcaccgac tattctatcg tgtttaatac 780 tttagcggcc acctgtgact ggaatgaagg ggctctccga tccagatttt ttgagggtct 840 gaatgaggag atccaggatg agatcgcctc tcatgagctt cctcgccacc tggagactct 900 tattgactta gctctacgtg ttgaggggcg ccttcgccgc cgccgccagt ggcggtctgt 960 ccattcttcc tggagggctg atgacatctt cccccaacaa gcctcctcta ctcctccctc 1020 ttccgatcct gagcccatgc agatgggacg tatgcacctc acgcctcagg agaagaggga 1080 ccgcctagcc cgtggcgtgt gtctctactg tggcaaacca gggcactttg ccatcaagtg 1140 tcctttaaaa gccagggccc atcagtaaac cggggaatcc tggtgggcgc ctccccctgt 1200 ctgaattccc ctctgtctcg caccctattg cctgtcacca tacaacatga aggatccctc 1260 aactcctgct ctgccttgct cgactccggg gccgaaggca atttcctgga ctgtctcact 1320 gccaagcagt ggggtatccc cgccatcccc ctcccttctc ctatttctgt ctggtcgtta 1380 gcgggtcagc tcatctctac cattacccac atcacccctc gtgtaagtct tgttgtttcc 1440 ggcaatcacc gtgaggacat tgaactgtac cttcttgatt caccaggttc ccctattgtc 1500 ttggggcatc cctggctggt acggcacaat cctcacgttg attggtccgg aaattctgtg 1560 ttggcctgga gtcagtcttg tcttgcttct tgtttgggtt ctgccttgtc tcctggttct 1620 ttgtgttctg tgttgcagga ggagactgtt gatgtaacaa aggtcccggt tgagtaccat 1680 ggtcttcgcc aggtctttag cagagctcgg gccggatctc ttcctcctca tcgaccatat 1740 gactgtgcca ttgacctcct cccaggcact tttccgccca agggtcgctt atattccctt 1800 tctggtccag aaagagaggc tatggacagg tatattcagg aatcacttag tactggtctc 1860 atccgtccct catcatcacc tgcgggtgca gggttcttct ttgttaagaa gaaggatggc 1920 tccctgcgtc catgtatcga ctacagaggg ctgaatgaca tcacaattaa aaacaggtac 1980 cccttacctc ttatgtcttc tgcctttgag ttgttgcagg gagcccaggt ctttaccaag 2040 ttagacctcc gtaacgctta ccacttggtg cgcatccgtg agggagatga gtggaagaca 2100 gcctttaata cccctacggg acacttcgaa tatcttgtct taccttttgg tctgaccaac 2160 gccccagctg tcttccaggc cctggtcaac gacgtgctga gagacatggt aaacaagttt 2220 gtctttgtgt acctggatga tattctcatt ttttcatcct ctttacaggc acacactcat 2280 catgtacgtc aggtcctcca acgactatta gaaaaccaac tctttgttaa ggcggagaag 2340 tgcgagttcc acacgaagtc ggttacattc ctggggtatg tcatctcggc tgagggaata 2400 aaacctgata ttgctaaggt aagggccgtt gccaagtggc cagtccccaa cacccgcaag 2460 gctctgcagc gtttcctggg tttcgccaat ttctataggc gcttcatcag gaactttagt 2520 cagttgctgc acccttaacg gctctcacct ccactaaagt tatgttcagg tggaatacac 2580 aggctcagga ggcctttgat gttcttaagt cccgtttcat ctctgctcct gtcctttcta 2640 ttccagatcc tgaacaacaa tttattgtcg aggttgatgc ttctgatgtt ggggtaggcg 2700 cggtcctatc tcagcggtcc cccaaggatg ggaaggtgca tccttgtgcc tttttctctc 2760 atcgcctgag cccagcagaa cgtaattacg acataggtaa taaagagctg ttggcagtca 2820 agctggcttt gggtgagtgg cgtcactggc tggagggagc agtgcatccc ttcttggtct 2880 ggacggacca taaaaacctt gaatatgtcc gttcggccaa gaggttaagc gcacgccagg 2940 cccgctgggc actcttcttt ggtcgattca acttcagtct ctcgtaccgg ccagggtcca 3000 agaatactaa acctgatgcc ctctctcggc tgttcgaggt ccccagtggg gagtcctcaa 3060 ttggacccat cctccctgag ggagtggtgg ttgggtctct ctcctggggc atcgagcagc 3120 aggtaattaa ggccggtcga gggattgagg tgccagaagg gtgtccggat ggtcggctgt 3180 ttgtgccggt tgcgctgcgt tccgaggtcc tccggtgggg gcatgcatcc aggctggttt 3240 gtcatccagg aatttggagg tcactggctg ccatccgcca gcgattttgg tggccctctc 3300 tgaggcagga tgtcaggcag tttgtgttgg cttgctcggt ttgtgcccag aataagacct 3360 ctaaccgagc acctgttggt ctgctcaagc ccttgcccat tccctctcgt ccatggtcac 3420 acctggccct tgactttgtt tctggccttc ccccctcaag aggaaacact gtcatcatga 3480 ctgtagttga ccgtttttct aaagcagctc attttattcc tttgcccaaa cttccctcag 3540 ccaaggaaac agctcaagct atggttgacc atgtcttccg gattcatggc cttccggttg 3600 atatagtctc tgatagggga ccacagtttg tttcccggtt tgagagagtt ctgtcgacag 3660 atcggagcct ctacgagtct gtcttcaggt tttcaccccc agaccaacgg gcaatgtgag 3720 cgagccaacc aggatctcga aagaactttc cgctgcctga cctcccagaa tccttgttct 3780 tggagccaac agctctcatg ggttgagtac gcccataatt ccctgccggt tgcgtctact 3840 ggtatgtccc catttctctg ttccatgggt tatcaacctc ctttgtttcc ctcacaggaa 3900 cctgacgctg cagtcccgtc tgccttggcc tttgtccaac ggtgccgtcg cacctggatg 3960 agggctaagg aggtcttggc tcggacttct agacggacta aagcagcagc cgaccgtcat 4020 cggaccctcc cacctgaata tgtgtgtggt cagaaggtat ggttatccac caaggacctg 4080 cctcttaggg tagcctcacg taagttggct cccaggttca ttggcccata cctcatcacc 4140 aaggtcttga gtccggtggc agttcggctc aagttacctc ccactcttgg tcgggtacac 4200 cctgtttttc atgtatccag ggtcaaacct gtgtttagat ccattcttaa ccctgttgtc 4260 tcttcccctg cccctccccc ccctcgccta gtggatggct ctcctgcata cagtgtcagg 4320 agactgctgg atgtcagacg ccggggccgg ggttttcagt atctagtgga ctgggagggt 4380 tatggtcctg aggagaggag ttggattccg gctcggcatg tcctggatcg ggcattgatt 4440 atggactttc accggcgacg gggtaagccc ctcccctgag gcgcctggtg gcgctcttag 4500 gggtgggggt actgtcatgg tctgtcacct gcatgttgtc gtcacattca tttgttttgt 4560 tttctcagcg cacatggcgt tattttgaca gatcgccatg cgctctgctg tctcacccca 4620 cccacttgtc attctacgca ttagatcatt agttccttca cacctgtctg tagttttcct 4680 tcattagtct ccctatttat tcctcttgtt ctcactttcc tgtaccttat cgttcgcgtg 4740 tcttacctta cgatcctgtt tcatgatctc gtccagtctc gtctcgtctc gttactgttt 4800 aggctccagt gccagtcttt aactgttgtt ttggattttg gattttactt tttctttctc 4860 cccagtcaga ccgttcctgc tgtagatcag ctgtgtttct cctcacctct cggcacttcc 4920 tgctccagtt ccgcattaac gcccaccagc gatctcctct ccagccgact ttctctcctc 4980 ctgctccctc tcgtcctggt tgccagcatc gcagaataaa ccccttgctg 5030 // ID Gypsy161-I_DR repbase; DNA; ZEB; 4215 BP. XX AC chr7; XX DT 13-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy161_DR; KW Gypsy161-LTR_DR; Gypsy161-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1602-1602 (2008). XX DR Genome; chr7; Positions 36136916 36132702. XX CC Positions [2930-3352] - Reverse transcriptase CC Positions [1826-2302] - Integrase core CC 'GTAT' target site duplication CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 158..3730 FT /product="Gypsy161-I_DR_1p" FT /translation="MSIEEVVRHLAEISRRQQVITEQLTARQDRMEQQLRQ FT AAGSSQFSEVSAHKFITKLSDLDDIDAYLHTFEVITERERWPKESWARMLA FT PFLSGEAQRAYFALETPKNDDYKALKKEILARMGLSNISAAQQFSQWSYDD FT KQPVWTQAANLSRLGRLWLLGGDPTAVQVAEKVVIEKMMRALPRRLRTLTS FT MRNPDSLAALVEAVELAEAHVARETGERAALPPRRVNAPWRPVEGTARPGS FT RPAVPSPVDEPMPTEPTSHSTPAWTAGCVVHRNVPPEAPTRKVCLDGKTQT FT ATLDTGSAITLVHPRTLKCHQEGKSRVPITCVHGDTRHVPSQRVTIATKSG FT SWRIEVGVVPDLPVPLLLGRDWPGFDDLLTHHQARSARAKKNSKGRAQRDR FT QPALMATESDRGGESSSANLYYDLFQQITAGGDFGRAQREDETLKHCWPQV FT RIIDGNERFSSPHPLPHFIVQNGLLYCVAERRGETKTLLVVSRTKREMVLE FT LAHTHPMAGHLGAANTIKRIRDRFHWPGLDGEVKRYCQACDICQRTSPQRP FT PPSPLIPLPIIEVPFNRIGMDLIGPLPKSARGHEHILVILDYATRYPEAIP FT LRKATSSAIAKELFLLCSRVGIPAEILTDQGTPFMSWLMADLCRLLKVKQI FT KTSVYHPQTDGLVERFNKTLKQMLRRVVAEDGRDWDLMIPYVLFGIREVPQ FT GSTGFTPFELLFGRQPRGLLDVARQAWEQEPAPQRSVIEHVRDMRGRIEKV FT MPIVKQHLTEAQRAQQRLYNRPAQPREFHPGDKVMILIPTTTSKFLASWKG FT PYTVVERVGPVNYRVRQPGRRREEQLYHINLMKKWVAAPGHLVAFSEETSP FT VVHIGEQLSPNQKAELQALVGQFRDVFSEKPGRTTIIQHDIITPPGTIVRQ FT RPYRVPEARRLAINEEIQKMRKLGTIEPSRSPWSSPIVMVLKPDGTLRFCN FT DFRKLNEISKFDGYPMPRVDELLDRLGGARFISTIDLTKGYWQLPLSESAK FT EKTAFSTPGGHWQYRVLPFGLHGAPATFQRMMDILLRPHQPYAAAYLDDLI FT VHSESWEEHLSRLRRVLLDLRRAGLTANPKKCHLGLAEAKYLGFHIGRGLI FT QPQQNKIRALQETPQPTTKTQVHAFLGLAGYYRCFIPNFSSIASPLTDLTR FT KGQPERIQWNRSRWRVPSPEDGTDILTGTART" XX SQ Sequence 4215 BP; 1133 A; 1185 C; 1091 G; 806 T; 0 other; actggtggag aatgcgcgct ttttcagagg gaaaaaaatt cagcacttac cagtaagaag 60 aagtaaccac tgaattttct ctgtttttct tctgtttaca gacaggcatc cgctctcact 120 cacgctctct tactcggctg tcatcccctc ctgagtcatg tcgatcgaag aggtagtacg 180 ccatctagcg gaaatctcga gaagacaaca agtgataact gaacaactca ccgccagaca 240 agaccgtatg gaacaacagc tccgccaagc ggccggttcc agccagttct ctgaggtgag 300 cgctcataaa ttcatcacaa aactcagcga cctggacgat attgatgctt acctgcacac 360 cttcgaggta atcacggaaa gagagagatg gccgaaggaa agttgggcga ggatgttggc 420 accattcctg tctggagaag ctcaaagagc ttatttcgca ttagagacac cgaaaaacga 480 cgactataaa gcattaaaga aggagatact cgccagaatg ggactctcca acataagcgc 540 agcacaacaa ttttctcagt ggtcatatga tgacaaacag ccggtatgga cccaagcagc 600 caacttatcc agattgggaa gattatggtt gctaggagga gatccaaccg cagtccaggt 660 cgctgagaag gtggtcattg aaaagatgat gagggcgcta cccagacggc tccgcacact 720 taccagcatg aggaaccccg actcgctggc tgctttggtg gaggcggtgg agctggcaga 780 agctcacgtg gcccgggaga ctggggagag agcggctctg ccaccccgga gggtaaatgc 840 gccatggcga ccggtggagg gcacagcacg accaggcagc agaccggcgg tccccagccc 900 agtcgacgag ccgatgccca ctgaaccaac atcacactcg accccggcct ggacggcagg 960 gtgcgtggta catcgcaacg tccctcccga agctcccacc cgaaaagtat gcttagacgg 1020 gaaaacgcag acggccacat tggatacagg aagtgcaata accctggttc atccaagaac 1080 actaaaatgt catcaagaag gaaaaagccg agtcccgatt acatgcgtac acggggatac 1140 ccgccacgta ccctcccaaa gagtcaccat agcaacgaag tcgggaagct ggcgcatcga 1200 agtaggagtg gttccagatc ttccagtacc ccttcttctg ggcagagact ggccggggtt 1260 cgatgatctc ctcacccacc atcaggctcg atcggctcgt gcaaagaaga acagcaaggg 1320 acgggctcaa cgggaccgcc aaccagcgct gatggccacc gagagcgaca gagggggtga 1380 gtcatcatcg gctaacctgt actatgatct tttccaacag attaccgcag gtggcgattt 1440 cggaagagca cagcgtgaag acgaaacgtt gaagcactgc tggccacaag tccggatcat 1500 agacggtaac gaacggtttt ccagccctca ccccctccca cattttattg tacaaaatgg 1560 tctgctgtac tgtgtcgcag agaggcgggg ggaaacgaag acgctattgg tcgtctcgag 1620 gaccaaaagg gagatggttc tggaactggc acatactcac ccaatggcag gacatctagg 1680 agcagccaac acgatcaaaa ggatcagaga tcgtttccac tggcccgggt tggatggaga 1740 agtcaagagg tattgtcagg catgcgacat ctgccaaaga acgtctcccc aacgaccacc 1800 ccccagccct ctgataccat tacccatcat tgaggtgccc ttcaaccgca ttggcatgga 1860 cttgataggg cctttgccga agtcggcccg gggacatgaa cacatccttg tcatcctcga 1920 ctacgccacc agatatccag aagcgatccc tctgagaaaa gccacgtcat cggcaatcgc 1980 taaggagctg tttttattat gcagccgagt aggaatacca gcagagatac tgaccgacca 2040 gggcacccca ttcatgtcct ggttgatggc agacctctgc cgcctcctaa aggtaaagca 2100 aataaaaact tctgtttatc atccgcagac tgatggcttg gtggagcgct ttaacaagac 2160 gctgaagcag atgctccgca gggtggtggc agaggatggg cgcgactggg acctcatgat 2220 cccgtacgtg cttttcggga tcagagaagt tccccaagga tctacaggtt tcacaccctt 2280 tgaattgctg ttcggccgcc aaccacgagg gctattggac gtggctcgtc aagcttggga 2340 acaagagcca gccccacaac ggtcagtgat tgagcacgtg cgggacatga gaggacgaat 2400 agaaaaagtc atgcccatcg tcaaacaaca tctgaccgaa gcccagcgcg cccaacagag 2460 attatataac cggcccgccc aacccagaga gttccaccca ggggacaagg taatgatcct 2520 gatacctacc accacctcga agtttttggc atcctggaag gggccatata cagtggtaga 2580 aagggtaggg ccggtaaact atcgagtccg tcagccggga cgaagaagag aagaacaact 2640 ttaccatatc aacttgatga agaaatgggt tgcagctcca ggtcatctag ttgccttctc 2700 agaagaaact tctcccgttg tccatatagg tgagcaactc tcaccgaatc agaaggcgga 2760 gctgcaagcc ttggtcggtc agttcaggga tgtgttctca gagaaaccgg gccgaaccac 2820 catcatccag cacgacatca tcaccccacc tggcaccatc gtccggcaga ggccttatcg 2880 agtaccagaa gctcgccggc tggctatcaa cgaggagatc caaaagatga ggaaattggg 2940 aaccatcgaa ccatctcgta gcccgtggtc cagccccata gtgatggtcc tcaaacccga 3000 cggcaccctc cgtttctgca atgacttcag gaagctgaat gaaatttcca aattcgacgg 3060 gtaccccatg cctcgggtgg acgagctgct ggatcggctg ggtggagccc gattcatatc 3120 cacgatcgac ctaaccaaag gctactggca attaccacta agtgaaagcg ccaaggagaa 3180 aaccgccttc tccactcccg gtgggcactg gcaataccgg gttcttccct tcgggctcca 3240 cggggctcca gcaacattcc agcgaatgat ggatatcctg ctgagacccc accagccgta 3300 tgcagcagca tacctcgacg accttatcgt ccactccgag tcatgggaag aacatctatc 3360 ccggttacgg agggtgctcc ttgatcttcg tagggctggg ctcacagcta atcccaagaa 3420 atgccacctg ggtctagcag aagccaaata cctcggtttc cacatcggta gaggtctcat 3480 acagccacaa cagaacaaga tcagagcact acaagaaact ccacaaccca ccacaaagac 3540 ccaggtacat gcatttctgg ggttagcggg ctactataga tgtttcattc ccaatttctc 3600 atccatagcc agccctttga cagacctgac cagaaagggg cagccagaga ggatacaatg 3660 gaacagaagc agatggcgcg ttccaagccc tgaagacggc actgacatcc tcaccggtac 3720 tgcacgcacc tgactttggc tgccccttca ttctacagac agacgcttcc gactcgggcc 3780 tgggcgcggt cctctcccag gtccacggcg atgaagaaca tcccatcatg tacgtgagtc 3840 ggaagtttac ccctgcagag acccggtacc caacggtaga aaaagaggcc ctggcgatca 3900 agtgggcaat cctggagctc aggtattatc tccttggcag aaaattcacc ctggtgaccg 3960 accacgctcc acttcaatgg atgtcaacag cgaaaaataa caacgctcga gtcaccagat 4020 ggttcctgtc gctccaggat tataacttca cagttcaaca tcgagccggg gcctcccacg 4080 gaaacgccga cgggctctca aggctatggt caggatgggc aggtctgtca aaacattcta 4140 ccccccctct caatactcta ccttttctcc gcaggacacc caggaccagg acgacgctaa 4200 gggggggggg gggga 4215 // ID SAT-4_DR repbase; DNA; ZEB; 120 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Satellite-like DNA - consensus. XX KW Satellite; Simple Repeat; SAT-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-120 RA Jurka J.; RT "Satellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2026-2026 (2010). XX DR [1] (Consensus) XX SQ Sequence 120 BP; 28 A; 24 C; 32 G; 36 T; 0 other; ggcgctctag gctagttttt aacagcagat ggcgctctag gctagttttt aacagcagat 60 ggcgctctag gctagttttt aacagcagat ggcgctctag gctagttttt aacagcagat 120 // ID Gypsy82-I_Dr repbase; DNA; ZEB; 5577 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.05, Created) DT 20-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy82_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; endogenous retrovirus; Gypsy superfamily; KW gag; integrase; reverse transcriptase; Gypsy82-I_DR; Gypsy82_DR; KW Gypsy82-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5577 RA Dib M.R. and Naveira H.F.; RT "Gypsy82_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 958-958 (2009). XX DR [1] (Consensus) XX CC Gypsy82-I_DR is an internal portion of the Gypsy82_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy82-LTR_DR. CC Gypsy82_DR is characterized by 4-bp target site duplications. The CC internal portion encodes two proteins: the 415-aa gag CC Gypsy82_DR1p (pos.121-1365) and 1400-aa polyprotein (pos. CC 1335-5534, conceptual translation)composed of the protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, CC according to the hight identity between their flanking LTRs. CC Consensus obtained after the alignment of at least five CC independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 121..1365 FT /product="Gypsy82-I_Dr_1p" FT /note="Gag-protein." FT /translation="MLLCECGCFTAAKQHRAVILKEAFQDSPLFRLLVSIC FT FVFCVFFFCLFFFFTLMMESELDELRRQVQSLQSENEQLQHQANSAARPSV FT SASSVVDTNAHASGVSDASRPIAERIILIPRERRCSTFSGRGDEDVFEWIE FT EMKSNLRARNLPAREEALFILDHLGGSARSEIKFRPRVEREDPEKVFSVLR FT ELYGCAYSYISLQEQFFSRKQEEGESLQDFSHALFALMEKVVQCAPGGVPN FT SAILLRDQFVEHVVDNSLCRELKRYVRLHPQSTILDVRKEAIRWVDEGFRP FT DTRERSHSVPSLATQYRVQGHGPPSSSRGNSTEIAELKEMLKAQQEQLNRL FT TQGLQQLQSQHAGNPSRRVNPIICRRCNQPGHIARNCIVDFRRSTEQPRVV FT SQQSNAVGASRPSENFNPLNC" FT CDS 1335..5534 FT /product="Gypsy82-I_Dr_2p" FT /note="Polyprotein." FT /translation="AVGKLQPPELLSHTSGGVATGSQPSELKGKQALCKLV FT GNCPSVTITMGGVEVPCLLDTGSMVTTISEGFFFQHFEPWGKEKLQRCGWL FT QLRAANGLEIPYLGYLELDFVVLGKNIQNKGVLVVKDLPNSLPGGFPGLLG FT MNVIQECFRELFSQHGSSLFDLPTVREVPAWYSALQYCRETETDNCEITER FT LARVGGREAVRIPADSLRFVPVTFSLPIGPSPLLLFEPLGTENSLPAGLLA FT SVSLVPIVRGVAFVPIVNVGEQDSLLQPHVVVGTLCKVQTVPQLDVMTPPE FT EQAHLQATTALQMVRGDTVEEAIHTLDLPQLSLDQQAQVKSLLSKYRSVFS FT AHEGDLGCTELITHQIPLLDSEPIRQRYRRLPPSDYEAVKEHIHQLLQNQV FT IRESSSPYASPIVIVRKKDGQIRLCVDYRQLNGKTRKDAFPLPRIEESLDA FT LSGAQWFSTLDLASGYNQVPVAEGDKCKTAFCTPFGLFEFNRMPFGLCNAP FT STFQRLMERMFGDQRFQTLLLYLDDIIIFSSTIAQHLERLEMVFVRLQREG FT LKAKLEKCCFFQQKVSYLGHVVSKEGVSTDPAKISVVAEWKRPTNVTELRS FT FLGFTSYYRRFVEGFARLAAPLHRLVADLVGTKKKRGSGKSIESCWTEDCE FT LGFQNLKQRLVTAPVLTYADFSLPFVLEIDASHSGLGAVLSQEKDGKLRPV FT AYASRGLRPTERNMQNYSSMKLELLALKWALTEKFRDYLLGQKCVVYTDNN FT PLSYLRTAKLGALEQRWASQLAAFDFDIRYRPGRVNGNADALSRQYTALPS FT SQNDCGTTLPAVLVELLHKEPVVLAQNTISVFPTTSTVDLQREQERDSVIG FT PVLNFFSSKVYPTKEQRQAMSNRCRELLRQWSRLFKRDGLLYRHLKLPGGG FT EEIDQLLLPEVLQAEVFQQLHGSHGHQGRERTYELIRNRCYWPGMEADVRK FT RCQECSQCAVSKLNQPLARAPMGHLLASRPNQILAVDFTTLERASDGREHV FT LVITDVFSKYTQAVPTRDQKAITVANILIHEWFYRFGVPAQIHSDQGRNFE FT GAVVSQLCQLYGVQKTRTVPYHPQGNGQCERFNRTLHDLLRTLSVEQKRSW FT TCHIAQVCFAYNTTPHQTTGESPYFLMFGQVPRLPVDFLLTAVDAPVSGQV FT KDWVDEHRETLKETYSRVRARLNRAAELRRKRHGQVGKESALQEGQEVYLL FT DQGVRGRHKIQNHWSSTVYKVVQSPIGKGGVYTIVEADNPSRVKQVHRSHL FT RLVPNTGGSRHVVEPEMGVSEDNWDKVLDSSVAVDEVDDLDVVLIGVNPFA FT SGVVEVEPPLEQERLVSAETPLPAINPPRETTEGDRRRTTRATAGKHSNPH FT HLPRAAVVVQHVHLNREGVDVSLQPSQAQNSPFRPWL" XX SQ Sequence 5577 BP; 1395 A; 1099 C; 1501 G; 1582 T; 0 other; tttggcgtag tcggcagggt tcctttacaa gcaggggtgt gtactcaatc cttgggcact 60 ttgtgtgttt gttgagaaac ctttttttat tatttcgttt cattttgttg tttattgtga 120 atgcttttgt gtgagtgtgg ttgttttaca gcagcaaaac agcaccgggc agtgatttta 180 aaggaagcct tccaagactc accgctgttc aggttattag tgagtatttg ttttgttttc 240 tgtgtttttt ttttttgttt gttttttttc tttaccttaa tgatggagag tgagttggat 300 gagctgcgca ggcaggttca gtccctgcaa tcggagaatg aacagttaca gcatcaagct 360 aactctgcag ccaggccttc ggtgagtgct agttctgttg ttgataccaa tgcccatgcc 420 tctggtgtgt cagatgcttc tagacccata gcagagcgta ttattcttat tcctagagag 480 cgacgatgtt ccacgtttag tggaagggga gatgaggatg tgtttgagtg gatagaggag 540 atgaagagta atttacgggc cagaaattta ccagcaaggg aagaagccct ttttatcctt 600 gatcatctgg gtgggtctgc tagaagtgaa attaaattta gaccaagggt tgagagagag 660 gatccagaaa aggtgttttc agtgttaaga gagttatatg gatgtgcata ctcttatatc 720 tccctccagg agcagttttt ttctcgtaaa caggaagaag gcgagtcctt gcaagatttt 780 tcacatgccc tgttcgcttt gatggagaag gttgtacaat gtgctccggg tggtgtccct 840 aattctgcca tcttacttag ggatcaattt gtagaacacg ttgttgacaa tagtctatgt 900 cgtgagctta agcgttatgt tcggctccat ccacagtcca ctattctgga tgtaaggaag 960 gaggcaatta ggtgggttga tgagggattt cgtcctgata cgcgtgaacg aagtcattct 1020 gtcccctccc ttgctaccca gtatagggtt caaggacatg gtccaccatc aagcagtcga 1080 ggtaattcaa cagagatagc tgaattaaag gagatgttaa aggcccaaca agagcaactt 1140 aatcggctca ctcaaggcct gcagcagctg cagtcccagc atgcaggtaa tccatctagg 1200 cgggtgaacc ctattatctg cagacgctgt aaccaaccag ggcatattgc ccggaattgt 1260 atagtggatt ttcgtagatc cactgaacag ccgagggtgg tgtcacagca gagtaatgca 1320 gtgggtgcta gtaggccgtc ggaaaacttc aaccccctga attgttgagc cacacttcag 1380 gaggggtagc cactggctca cagccctctg agttgaaagg gaagcaagca ctttgtaagt 1440 tagtaggtaa ttgtccttct gtcactatta ctatgggtgg ggtagaggtt ccatgtttac 1500 tggacactgg gtccatggtg acaactattt cagagggatt ttttttccaa cattttgagc 1560 cttggggcaa agagaagtta caacgctgtg gatggctcca gctgcgtgct gcaaatggtt 1620 tagaaatacc ttatttgggt tacctggagt tagattttgt tgttcttggg aagaacatcc 1680 agaataaggg ggtacttgtg gtgaaagacc tgcccaattc cttaccaggt gggtttcctg 1740 ggctgttggg catgaatgtc attcaggagt gttttaggga gttgtttagt cagcatggtt 1800 ccagtctttt tgatctccct actgtgcggg aggttcctgc atggtattct gctttgcagt 1860 attgtaggga gactgaaact gacaattgtg agattacaga aagattggct agggttgggg 1920 gtcgagaagc agtgcgaatc cctgctgatt cgttgcgttt tgtgcctgtc acattttcct 1980 tacccatagg cccttcacct ttactgcttt ttgagccact agggacggag aattcacttc 2040 cggctggtct actggcctct gtatctttag taccaattgt tcggggtgtg gcttttgtac 2100 ccatagttaa tgtgggggaa caggattccc tgttgcagcc gcatgtagta gtagggacat 2160 tgtgtaaagt tcagactgtt ccacagctag atgttatgac tccaccagag gaacaggccc 2220 atcttcaagc aacaactgca ttgcaaatgg tgagagggga cacagtggag gaggccatcc 2280 atactttgga tctgcctcag ctgtctttgg atcagcaggc tcaggttaaa tctctgctga 2340 gtaaatatag atctgttttt tcagctcatg agggtgattt gggttgtact gagctcataa 2400 ctcatcagat tccgttgctt gatagtgagc caatacggca gagatatcga cgcttgcccc 2460 cttctgatta tgaggcagtt aaggagcata ttcaccagtt gttgcagaat caggtgatta 2520 gagagagctc tagcccgtat gcctctccaa ttgtgattgt gcggaaaaaa gatgggcaga 2580 ttcgtctttg tgttgactat aggcagttga atggtaagac gcggaaggat gctttccctc 2640 tgccacgaat tgaagaatca ctggatgccc tgagtggagc ccaatggttt tccacattgg 2700 atcttgccag tggctataat caagtaccag tggcagaggg agataagtgt aagaccgctt 2760 tttgcacgcc ctttgggcta tttgaattta atcgaatgcc ctttgggttg tgcaatgccc 2820 ctagtacctt tcaacgcctt atggagcgca tgtttgggga ccaaaggttc cagactttgt 2880 tactctattt agacgacatc ataatttttt cgtccactat tgcacaacac ttggagaggc 2940 tggaaatggt ttttgttcga ctacagcgag agggtcttaa ggccaaacta gagaaatgtt 3000 gctttttcca acaaaaagta agttacttgg ggcatgtagt gtctaaggag ggggtatcaa 3060 ctgacccagc caagattagt gtggtggcag agtggaagcg tccaactaat gtgactgagt 3120 tgaggtcctt tttaggcttt acaagttatt acagacgctt tgtagaggga tttgcacggt 3180 tagcagcccc gttacatcgc cttgttgcag atttggtggg gacaaagaag aagaggggtt 3240 caggaaagtc aattgaaagt tgttggaccg aggattgtga actgggcttt caaaatttga 3300 agcagcgact ggtgactgct cctgtactga cttatgcaga tttctctctc ccgtttgtgt 3360 tggaaattga tgcgagtcat tctgggttag gggctgtgct gagtcaggag aaggatggga 3420 agctgaggcc agttgcttat gctagccggg ggcttcgtcc cacagaaaga aatatgcaaa 3480 actatagctc catgaaattg gagctattgg cattaaaatg ggcgctaacc gaaaagttta 3540 gggactacct cttaggacag aagtgtgtag tgtacacaga taacaatccc ttgagttatc 3600 ttaggacagc caagttgggt gctttagagc agcgctgggc ttctcagctt gcagcttttg 3660 attttgatat ccgatatcgt ccaggccgtg ttaacggtaa tgcagatgcc ttgtcccgtc 3720 agtatacagc gctaccatct tcacagaacg attgtggcac cactttgcct gcagtgttgg 3780 tagaacttct gcataaggaa ccggtagtat tggcacaaaa caccatctct gttttcccta 3840 ccacttcaac tgtagatttg caacgggagc aggagcggga ttcagttatt ggcccagtgc 3900 tcaacttttt ctcttctaaa gtgtatccaa ctaaagagca acgtcaggct atgtccaata 3960 gatgtcggga actgctacgt cagtggagca ggttatttaa gagagatggg ttgctatatc 4020 gtcatctaaa actgcctggt gggggggagg aaatagatca gttgctgctc cctgaggttt 4080 tgcaggcaga ggtttttcag caattgcatg gctctcatgg tcatcaaggt agagagcgta 4140 cctatgagct gatacgcaac agatgttatt ggcctgggat ggaggctgat gttcgaaaga 4200 ggtgccagga atgtagtcag tgtgcagtct caaagttaaa ccagccacta gcccgtgcac 4260 ctatgggcca tttgctggca tcaagaccaa atcaaatctt ggcagttgat tttactacat 4320 tggagcgggc tagtgatggg cgagaacatg tcctggtgat tacagatgtg ttttcaaaat 4380 acactcaagc tgtgcccaca cgggatcaga aagcaatcac agtagctaac atcttgatcc 4440 acgagtggtt ttataggttt ggggtacctg cgcagattca ctcagatcaa ggtcgaaatt 4500 ttgagggtgc agtggtctct cagctttgtc agttgtatgg ggttcagaag actcgcactg 4560 ttccatatca ccctcaggga aatgggcagt gcgagcgttt taatcgaacc cttcatgact 4620 tattacgtac attgtctgtg gaacaaaaga ggtcatggac ttgccatatc gcacaggtat 4680 gttttgccta taatacaacg cctcaccaga ctactggtga gtcgccgtat ttcttgatgt 4740 ttgggcaggt tcctagactt ccagttgatt tcttgctcac agctgtggat gcacctgtca 4800 gtgggcaggt aaaagattgg gtggatgagc atagagagac cctaaaagaa acgtatagca 4860 gagtgagggc tcggctgaat agagccgcag aacttcgtag gaaacgccat ggtcaggtag 4920 gaaaggagtc tgcattgcaa gaaggccaag aagtttattt gcttgaccaa ggtgttcgtg 4980 gtcgtcataa aattcaaaat cattggtcca gcactgttta caaagtggtc caatccccga 5040 taggtaaggg tggtgtatac accatagtag aggctgataa tccctctcgg gtgaagcagg 5100 tgcataggtc ccatttaagg ttagtaccta atacaggagg ctcgcggcac gtggttgaac 5160 cagagatggg agtgtcagaa gataattggg ataaggtttt agactcctca gtggcagtgg 5220 atgaggtaga tgatttggac gtagtcctta taggggttaa cccttttgcc tcaggtgtag 5280 tcgaggtaga gccaccattg gagcaggaaa ggttggtgag tgcagagact cctctccctg 5340 ctattaatcc cccaagggaa acaacagagg gcgataggcg acgaactact agggccactg 5400 ccggaaaaca ttcaaacccg catcatttgc caagggctgc agtagtcgtt caacatgtac 5460 atctaaatag agagggggtg gatgtctcat tgcaacctag tcaggctcag aatagtcctt 5520 tccgtccatg gttgtaagta gtatcatcgg gacgatgata gaaaagccag gggtaaa 5577 // ID DNA-8-16_DR repbase; DNA; ZEB; 1891 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-16_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1891 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2139-2139 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1891 BP; 554 A; 344 C; 410 G; 582 T; 1 other; tagagatgcg cggttggcgg ttatagccgc ggactccgcg gataaaccgc ggatcgggcg 60 gatgacgtta cgaaaaaata atattttaat taaattcggg cgggtggcgg gcggttgtac 120 tgtttttata ggcatagctg gcgcaccaac gaaaaagcct aggactgact tcacagaatg 180 ggaggaggaa tcggatacac taattctgga tgaagtgtac agaagtattc ctctacaaac 240 ttccgtatag acggatcagc agaggaaaat ctactttcct tttgggagaa ataaggccag 300 tcattcccac gactacagca cctttccaag agaattctat gcatttcagc aacaagtgct 360 gcgagcgagc gctccttcag tgcagcaggg cgcattatag tggctatgca ggcgctcccg 420 tctgaatcct gacactgtag atgctatttt attcctgcat tgtgccaaga aaaaaatcaa 480 actagaccta aggtcaggca tattaaacca attagcctat gttatttagg ctacacatat 540 gttcaatata aacttttgag ttcggtcagc tgataatttt acgttcatat tggtttagcc 600 tattctattc tagacatgtt ggcctcgctt ttacgttccg aggctaaggt tattttgcca 660 gaatttcttt gtcgcaactt taatggcgta gtatttcgtt atgcttcaag tttgagggga 720 atggggatga tcctaagtcc atgggttatt ggggcacttg tcatacatgc actttagcct 780 ggtcagactt tatgttcgct caaagaggga tggcatatct atttttctaa tttaatttct 840 aattgtgggg gtgactgagc ctacaaggct taggcacgat atgacgtttt tattaaaaaa 900 tttttcaact gtttgccaaa gcatgcgact atagcctatg cctacattaa gatatttatg 960 ttaatatttt ttttactgcc tgttgtttct tttggcatat aaaataggta ttgtctgata 1020 gagatatcaa taaaggttca tctgtgtcgc agaactggtg ttgtttccga tttctacaag 1080 ctcaataaca tccacagcaa aaaataaaaa ataaataaat aaaaaaatag tcgaaaaact 1140 gtgcccatta attagatgtg caaggcaagc aggtacgttt ttggggttgc tatggacaac 1200 tacaaatgta aacattatta ggctataatg taaatgactt attattctat ctatttacta 1260 aaaaataaag tgaaataggc atttagtagt aggctaaata gcagcatgtt gaaatgatgt 1320 gtcaatgcgt tttctattag gctacaataa aaaaaagttg tatagtacag tgcgtttaat 1380 ttaggctatt tatttatttt tgtccctgtg cgccgattgg agacggagtt cactgctgat 1440 attctgagag ctcgggtgct tctcatttct tatttgtgca tcctcgtttc ctttccttgc 1500 tttctttcct cgtgtttcca ccccaccggg ataacgktac gagggaagga cgcaaggaaa 1560 agactcaagg accgaggaat cgaatcaagt gaaaagacac gtcctcgtat ctttagcgtc 1620 acttcaaagc gtcgtcaatt aatgacttgc acgtttggat taactgcatg tctacattaa 1680 agtcattctc tattctataa tgatcaataa agaaacattc atttaataat aaaaaggaat 1740 aagccattca aataaagagc ccaatcagca gatggcgggc gggtgcggtt ttaaaaattg 1800 gtcaaaaatg ttactgcgga tggatggcgg acggatgatg aattttgtca tgcggttgcg 1860 gatgaaataa tagcccatcc gcgcatctct a 1891 // ID Gypsy-29-LTR_DR repbase; DNA; ZEB; 302 BP. XX AC . XX DT 29-NOV-2006 (Rel. 11.11, Created) DT 13-DEC-2006 (Rel. 11.11, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-29_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-29-I_DR; KW Gypsy-29-LTR_DR; Gypsy-29_DR; LTR; endogenous retrovirus; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-302 RA Dib M.R. and Naveira H.F.; RT "Gypsy-29_DR, a family of LTR retrotransposons from zebrafish."; RL Direct Submission to Repbase Update (29-NOV-2006). XX DR [1] (Consensus) XX CC Gypsy-29-LTR_DR is a long terminal repeat of the Gypsy-29_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-29-I_DR. XX SQ Sequence 302 BP; 78 A; 42 C; 88 G; 94 T; 0 other; tgtagcatgc tagaaaaggt ttaaatacag ttcctggttg ttgtagtttg gcgtgtattt 60 atgttgggta atgtagggct ttaaggggct ggattacggg aaaaaggagt ctgcgcactt 120 gagagggagg gggcgggttt tcgagagatt gacaggtatc tggggttttt ctttggttca 180 caagagagag agttgtttta tgtgaagagc taaaaggatt actctgtctg caacgtgcaa 240 aagttcaata aaagacaagt tttgactgct aaacgcttcc tgaccttctt cagtctgcta 300 ca 302 // ID Gypsy-166-I_DR repbase; DNA; ZEB; 4298 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.12, Created) DT 17-NOV-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the Gypsy-166_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; Gypsy superfamily; integrase; Gypsy-166_DR; Gypsy-166-LTR_DR; KW Gypsy-166-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4298 RA Dib M.R. and Naveira H.F.; RT "Gypsy-166_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2164-2164 (2008). XX DR [1] (Consensus) XX CC Gypsy-166-I_DR is an internal portion of the Gypsy-166_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-166-LTR_DR. CC Gypsy-166_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 311-aa gag CC Gypsy166_DR1p (pos. 18-950) and 1132-aa polyprotein (pos. CC 878-4273, conceptual translation) composed of the protease, CC reverse transcriptase, and integrase domains. XX FH Key Location/Qualifiers FT CDS 18..950 FT /product="Gypsy-166-I_Dr_1p" FT /note="Gag-protein." FT /translation="MDSAGADQVRNAVTQQGILLGQQATQLTATSQEVEFL FT TARVAELNDLVQDLQLNLRAGRQAAFPLHPDHEPHANCPPQYDGDPNSCRS FT FLSQCSLVFALQPRRYATEESRVAFVITLLKGRARDWATAVWDARAAFCAT FT FDDFRNEMTKLFDRSAHGDEAASLLAQLSQNGRSVTDYSVQFRTLAAVCAW FT NEAALRARFRDGLDDEIQDEIATHDLPHDFDALVDLALRVEGRLRRRQHRR FT ATRSSWRAEEGFSNLLHTPPPIPDQEPMQLGHLRLTTQEKRHRLARGLCLY FT CGKPGHQAVRCPLKAPAHQ" FT CDS 878..4273 FT /product="Gypsy-166-I_Dr_2p" FT /note="Polyprotein." FT /translation="GLVPVLWETGSSGCPVSVKSPRSPVVRGILVGTIPCS FT DTPTYRTSLPASLQFGDVVHSCSALIDSGAEGNFLDSTTAESWGIPAIPLS FT SPITVRSLNGLTITSITHSTPSVSLIISGNHREDISLYLFDSPSAPVVLGH FT PWLTKHNPHVDWSTSSVLSWSPSCLALCLGPAPCSGSVSHVLQVEAVDLSG FT VPSEYHDLRQVFSKSRATSLPPHRPYDCAIELLPGSSPPRGRLYSLSRPER FT EAMDRYIQESLKADLIRPSSSPAGAGFFFVKKKDGSLRPCIDYRGLNDITV FT KNRYPLPLMSSAFELLQGAKVFTKLDLRNAYHLVRIREGDEWKTAFNTPTG FT HFEYRVLPFGLTNAPAVFQALVNDVLRDMVNRFVFVYLDDILIFSPSKQVH FT TQHVRQVLQRLLENQLYVKAEKCVFHTKSVSFLGHIVSTEGIKADPAKVRA FT VAEWPIPNSRKALQRFLGFANFYRRFIRNFSSVAAPLTALTSPKVPFIWNS FT RAQEAFDVIKSRFITAPVLSLPDPERQFIVEVDASEVGVGAVLSQRSLRDG FT KVHPCAFFSHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAHPFLVWT FT DHKNLEYIRSAKRLNSRQARWALFFGRFTFSLSYRPGSKNIKPDALSRLFD FT VPGENATPVAILPRGVVVGSLSWDVERRVREAGQGGETPEGCPGGRLFVPI FT RLRPEVLQWGHESRVACHPGIRRSLATIQQRFWWPSMAQDTKQFVLACAVC FT AQNKTSNQPPIGLLQPLPIPSRPWSHIALDFVSGLPPSRGNTVVLTVVDRF FT SKAAHFIPLPKLPSAKETAQVVIDQVFRIHGLPMDVVSDRGPQFVSRFWKE FT FCRQIGASTSLSSGFHPQTNGQTERANQDLERTLRCLVSQNPGTWSQQLSW FT VEYAHNSLPVASTGMSPFHCSVGYQPPLFPAQEPDAAVPSAFAFVRRCRRT FT WKRAKEALAQASRRTKAAADRHRTPAPPFVCGQKVWLSTKDLPLRVPSRKL FT APRFIGPYQITKVLSPVTVQLKLPCTLGRVHPVFHVSRVKPVFRSQLNPIG FT HSPAPPPPRLVDGVPVYTVRRLLDVRRRGRGFQYLVDWEGYGAEERSWIPA FT RDILDQSLIGEFRRRRGEPPLEAP" XX SQ Sequence 4298 BP; 831 A; 1221 C; 1131 G; 1115 T; 0 other; gaacggtctg accagttatg gattcagcag gtgcagatca agtgagaaat gcagtgaccc 60 agcagggcat tctcctagga caacaggcta ctcaattaac tgccacttca caagaggttg 120 agtttctgac ggctcgtgtt gcggaactca atgatctagt ccaggatcta cagctaaacc 180 tcagagccgg taggcaggca gccttccctc tccaccctga tcatgagccg cacgctaatt 240 gtccaccaca atatgacggg gaccccaact cctgccgatc tttcttgtct cagtgttcac 300 tggtcttcgc tctccagcca cgccgttacg ccactgagga gtccagagtg gcatttgtga 360 ttacacttct taagggcagg gcccgagact gggccaccgc tgtgtgggat gcacgtgctg 420 ccttttgtgc cacttttgat gactttcgta atgagatgac caagctgttc gatcggtcag 480 ctcacgggga tgaggcagct tctcttttgg cccagctaag ccagaacggg cgttcagtga 540 ctgactactc tgttcaattc aggaccctag cagcggtctg cgcttggaac gaagcagctc 600 tccgagcgag gttccgggac ggtctggatg atgagattca ggacgaaatc gccactcatg 660 acctccctca tgacttcgac gccctggttg accttgctct ccgtgttgag ggccgcctcc 720 gccgccgcca acatcggcga gcgacccgtt cctcctggag ggctgaggag ggcttttcaa 780 acttgctcca cacccctcct cccattccgg atcaggagcc gatgcagttg gggcatctgc 840 ggctaaccac acaggagaaa cgacatcggt tggctagggg cttgtgcctg tactgtggga 900 aaccgggtca tcaggctgtc cggtgtccgt taaaagcccc cgctcaccag tagtacgggg 960 gattctggtg ggcactatcc cttgctctga cacccccaca taccgtacct cactgccagc 1020 ctctctacaa ttcggagatg tcgtccattc gtgctctgcc ctgatcgact ccggagccga 1080 gggcaatttt ttagacagca ccaccgctga gtcctggggc attccggcta tccctctttc 1140 ctctcctatc actgtccgct ccctcaatgg tcttaccatc acctccatca cccattccac 1200 tccttctgta agtttaatta tttctggcaa tcaccgtgag gatattagtt tatatttatt 1260 tgactccccg agtgctcctg tagtattggg gcacccgtgg ctaaccaaac acaatcctca 1320 tgttgattgg tcaactagtt ctgtgttatc ttggagtccg tcttgtcttg ctttgtgtct 1380 tggtcctgcc ccgtgttctg gttctgtttc tcatgtgttg caggtcgaag ctgtagatct 1440 ctccggagtc ccgtcggaat atcacgatct gcgtcaggtc ttcagcaagt cccgtgctac 1500 ttccttacct ccccaccgac catatgattg tgccatcgag ctcctcccag gctcttctcc 1560 gcccaggggt cggttgtatt ccctttcacg tccggaaaga gaggccatgg acaggtacat 1620 tcaagagtct ttaaaagccg acctcatccg tccctcttcc tccccggctg gggctgggtt 1680 cttctttgtt aaaaagaagg atggctctct gcgtccttgc attgactaca gaggtcttaa 1740 tgacatcact gttaagaaca ggtaccctct gcctttgatg tcttctgcct ttgagttatt 1800 gcagggagcc aaggtcttta ctaagctaga cctccgcaac gcctatcact tggtccggat 1860 acgtgagggg gatgagtgga agaccgcatt taacacaccc acgggacact ttgaataccg 1920 ggtcctacca ttcgggctca ctaatgcccc ggctgtcttc caggccctgg tcaatgacgt 1980 gctgagagac atggtgaata gatttgtctt tgtgtacctt gatgatatcc tcatcttttc 2040 cccttccaag caggtacaca ctcagcatgt tcgacaggtg ctgcagcggc tgctggagaa 2100 ccagctttat gtcaaggcgg agaagtgcgt gttccacact aagtctgttt cgttcctggg 2160 gcatattgtc tcgacggagg gaatcaaggc tgatcctgct aaggtaaggg ccgtggccga 2220 gtggccaatc cccaactcta gaaaggctct gcagcggttc ttggggttcg ccaacttcta 2280 taggcgattc atccggaact tcagctcggt cgctgcaccc ttaacagctc tcacctctcc 2340 taaggtaccg ttcatatgga acagtcgagc tcaggaggcc tttgatgtta taaagtcccg 2400 tttcatcact gctcctgttc tttctcttcc agatcctgaa cggcagttta ttgttgaggt 2460 ggatgcttct gaagtggggg taggcgcggt cctttctcag cggtccctta gggatgggaa 2520 ggtgcatcct tgtgccttct tctcccaccg cctgagccca gccgaacgaa actatgacat 2580 aggtaacaga gagttgctgg ctgtcaggct ggccctgggg gagtggcgcc actggttgga 2640 aggagcagca cacccattct tggtctggac ggatcataaa aatttagaat atatccgttc 2700 agccaaaaga ttgaattccc gtcaggctcg ctgggcactc ttctttggtc gatttacctt 2760 ttccctctcg taccggccgg gatccaagaa catcaaacct gatgctctct cccggctatt 2820 cgacgtccct ggagagaatg ctaccccagt agccatcctc ccgagaggag tggtggtggg 2880 gtctctctct tgggacgtcg agcggcgggt gagagaggcc ggtcaaggag gggaaacacc 2940 ggaagggtgt ccggggggtc gcctgtttgt gccaattcgg ctaagaccgg aagtcctcca 3000 gtggggtcat gaatccagag tagcctgtca cccaggaatc cggaggtcgc tggctaccat 3060 tcaacagcga ttttggtggc cgtctatggc ccaagatacc aagcagtttg tgttggcttg 3120 tgcagtatgt gcccagaata agacttctaa tcagccgccc attggtctgt tacaacccct 3180 ccccattccc tctcgcccct ggtcacacat tgcccttgat tttgtctctg gcttaccccc 3240 atcgagaggt aacaccgttg tgttgacggt ggtggaccgc ttctctaagg ctgcccattt 3300 tatcccctta cccaaacttc catccgccaa agagactgct caggttgtga ttgaccaggt 3360 ctttcggata catggccttc ctatggatgt ggtctctgac agagggcctc aatttgtctc 3420 tcggttttgg aaagagttct gcagacagat cggggcctct acgagtctgt cttcaggttt 3480 tcacccccag accaatgggc agactgagcg agccaaccag gatcttgaaa gaactctccg 3540 ctgcttggtg tcacagaatc cgggtacctg gagtcagcaa ctatcttggg tcgaatacgc 3600 ccacaattcc ttgccagttg cttcaacagg tatgtcccct ttccattgct ctgttggtta 3660 tcaaccccct ttattccctg ctcaggagcc cgatgccgcg gtcccgtctg cctttgcctt 3720 cgttcgtcgg tgtcgccgca cctggaagag ggctaaggag gccttggccc aggcttccag 3780 acggaccaaa gcggcggctg accgtcaccg gacccctgct ccgccttttg tttgtggtca 3840 aaaggtttgg ctatctacta aggacctacc tctccgggtg ccttctcgca agctggcacc 3900 caggttcata gggccatatc agatcactaa ggtcttgagt ccggtgacag tacagctcaa 3960 gcttccgtgc actcttggtc gggttcaccc tgtttttcat gtatccaggg tcaagcctgt 4020 gttccggtcc caacttaatc ctattggtca ttccccggcc cctcctcctc ctcgtctagt 4080 agatggcgtt ccggtctata cggtgaggag gcttcttgat gtgagacgcc ggggccgggg 4140 ctttcagtat ctggtggact gggagggcta tggtgcggag gagaggtctt ggattccggc 4200 tcgggatatt ctggaccagt cgctgatcgg ggaatttcgt cggcgacgag gtgagccccc 4260 tctagaggcg ccgtgaggcg cttctggggg gggggggg 4298 // ID DNA8-1_DR repbase; DNA; ZEB; 2164 BP. XX AC . XX DT 01-DEC-2004 (Rel. 9.11, Created) DT 01-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE DNA8-1_DR is a non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2164 RA Kapitonov V.V. and Jurka J.; RT "DNA8-1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 251-251 (2004). XX DR [1] (Consensus) XX CC DNA8-1_DR is a nonautonomous DNA transposon. Its is CC characterized by 8-bp terminal inverted repeats and 8-bp CC target site duplications. It is expected to be a member of the CC hAT or P superfamilies. XX SQ Sequence 2164 BP; 701 A; 410 C; 371 G; 682 T; 0 other; cagtgttgtg caagttactt ccaaactgta atacattaca gattacttat tactgttatt 60 taaatgtaat cccttacctt acaatattac tgtctcagaa ttgtaatatg ttacattact 120 cttatattac tttttagtta ctttcaccaa aataactaca gaattagaac ttaacattat 180 aaaattctat aatgttctat aatgagcatt ttacatccag tagaaagtga tttgatgtag 240 cagaatgact gtgacctctc caggctacta acaatatagt atataattgg caatgtgaag 300 gctcaagaca atgcaagaaa ggatgacagc cagaattaca aatgatctgt taaaagtatg 360 gtagaggtaa agtgattatc tggcaatatc tgtgaatggc ttggctatat tcatattaga 420 cacaacaggc cgatatgtaa actccaatgc catgacaaaa tgcagatttt gtttcttttc 480 ttattgttgc cattattgta tttactttac attcaagtgg tttacaacac aaaaccgtcc 540 tgcacaaagt gagcatgatg aacagctttc tcagtataat gtttgtgcac cgatttcctg 600 tttgtggaca aaactgcttg cgagctctca catacgctgc tcgcgcacaa attctcacga 660 tctctcaaat atgcactgct cacgcacaag ttttcttgcg tgctctcagt acactgctct 720 ctcagtgaga ttatatgcag actcacaatt tgtgtcttgt gttccctctt cctttcaagc 780 tttttgatat gaattatatt agtacactat ttattaacaa taaccaaaat actaaaatag 840 acttataatt tttaatctaa taaaccataa cattgttggc tgtatgttat atgatgagat 900 atttcaaata aaaaagcatt tagacaaaaa taaaaactaa cgcaagcaga aatgtaacta 960 atagaaacac ttgataaggt attatagcct actgtactat tcactcttca aataaatctc 1020 attatcaatc acattttttt atggcaacta aaataaagtg aactagtgtc tgttgtcttc 1080 cacattgttg atgagattgt ggaggacatt ttccctgcca tcatcttcat tttgagaaga 1140 agctgttcca cagggcccct gaacacacat attgtcatta agtactttaa ttactctgtt 1200 gtggtaatac tatagttgct atggtaacac aacaagtgta atataaacaa atgatccagt 1260 gctgtagttt ttttttttta caatatgggg tagattgtac tacaattcac cacactttac 1320 tgtagtaaaa ctacacttta tatttaattt tatgagttca ctattcttaa tacgacagta 1380 tgcttaagca ttcattgtaa atactagcct aaaacatagg cagtatacct tacactcaaa 1440 aacactagta tttattatag aatttatctt aacctgtaag taaatagtgt agcaatatat 1500 aaaccatata aacattcttg ggataacaat taaataggag gatctctgga gagcaacaga 1560 acgaaaggaa gaggaaacac aaggcacaga ttgtgagtct gcatcatctg acggagcgag 1620 agcaggtcgt tagcacgtga gcagcccagc cagtgttgag tgaacgcgag gagttttgtc 1680 cacagaggaa accggcacgt tcactcgctc gtattacatg aatgagctta cgacatttgt 1740 cagtgaaata acgccatatg cacttacgtc cgtctcgcga atgcactgtc ttatgccatt 1800 ttcatctatt tgccgctttt ccgctgcgcg tactgcgaac cgggctgagc caatagcccc 1860 ggaccttgag ctcagaagct gaaatcattc cgcgttcaaa gtaaaaaggt cccatagcca 1920 gagggagatt aatgacagca cgcgcatata ttctctagcc acgaacagga aatagaactg 1980 cggtattttt tatttgttaa tgtcttgcgg gcgaaaagtg tgttgtaacg cacgcgttac 2040 tgaacatgta ccgagtaaaa tattactgaa aatgtattag taatgcctta cactactgcg 2100 ttacaggaaa atgtaataca ttactgtaat acattacttt tgtaacgcat tactcccaac 2160 actg 2164 // ID Gypsy-16-I_DR repbase; DNA; ZEB; 6775 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portion of the Gypsy-16_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-16-I_DR; Gypsy-16-LTR_DR; Gypsy-16_DR; KW endogenous retrovirus; gag; integrase; protease; KW reverse transcriptase. XX NM Gypsy-16-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6775 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-16_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 5-5 (2005). XX DR [1] (Consensus) XX CC Gypsy-16-I_DR is an internal portion of the Gypsy-16_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy-16-LTR_DR. The consensus sequence was reconstructed CC based on multiple alignment of nine proviral copies (they are CC less than 1% divergent from the consensus sequence). CC Gypsy-16_DR retrotransposons are characterized by 4-bp CC target site duplications. The internal portion contains two CC ORFs encoding the 645-aa Gypsy-16_DR1p gag (pos. 109-2043) CC and 1601-aa Gypsy-15_DR2p pol proteins (pos. 1971-2043) CC composed of the protease, reverse transcriptase, and integrase CC domains. The second protein, including the protease domain, CC does not start from Met. Presumably, the gag-pol fusion protein CC is formed originally due to a ribosomal frame shift. This family CC is likely still active in the genome. Each of all nine proviral CC copies is flanked by identical LTRs. XX FH Key Location/Qualifiers FT CDS 109..2043 FT /product="Gypsy-16_DR1p" FT /translation="MDIIEKENVDISKAVIVGGMTLTETDSDLESWLLRYG FT SINRHLLIDDPDCEFHRHAIIEFTHNSAMKTLMPLLPLTVVSMSNPSTTFM FT VRALSCVYPHIASDSATNGYLEELQNIASFSGKSIEEVLQTELLKIKFGPS FT HAESLPVLDKKLEFPNAARSQILDRSTVSSPNRLLSPVISQSMITEQTAFP FT SSRISPFHEVESTNSKNLSKESLNHRSTKPTVTVSSHPALTMDIIDPPSVQ FT KVVVEHIVRTNDTAPMHHTSFRLRSFSGKIPRPVNEPDFDTWRASVDLLLT FT DPSISDLNRARKIIDSLLPPAADIVKHVSPNSLPAVYLELLESVYGSVEDG FT DELLARFMNSFQNNGEKPSTYLHRLQVLLSTAIRRGGIFEEERNRYLLKQF FT CRGCWDSSLIADLQLERRKATPPSFAELVVLIRTEEDKNASKEERMRKHLG FT LNKHYPAPSKFRLSAHQISAHQSETQDDQTDTSLAKQVCELQAQVVALQKP FT SSQKEKKKNAKPDEVSELRNVVTELQAQITAMQTTATPKIKSDVEATEIAD FT LKRQIADLKVQLTAPDMYRNRTRNLLPEPRATDCYRASKLPESRPRPGYCF FT RCAEDGHLASSCSNAPDPTKVAEKKRKLRERQAQWDTQQVAIMNPLN" FT CDS 1971..6773 FT /product="Gypsy-16_DR2p" FT /translation="EKTQVKGATSPVGYPTSSNHESFKLRTVSVEGHTETK FT RNNNCPEKRKQLFNQNACDAPPLRNLPRGLVGVKCTAQITVGNKRVSCLLD FT TGSQVTTVPWSFYQENLSNCPLKSLDNLLEVEGANGQTVPYLGYVELTLKF FT PREFLGTETEVPTLALVVPDLMNTPQVLIGTNSLDALYSNYVQQSASFPQS FT NFHGYRAVQKVLEARYKQASADVVGCIKFKGHVPEVVPAGCTVVLDGHVLV FT NCPHVGKCVALESPTSPALPGGLLVASCLHSLPSKRHQQLPVVLRNETQTD FT ITIYPRTIIAEMRAVQEVIKSGQVNSSTVNKELSACSNLKFDFENSPLTPE FT WKKRITDQLNSMPEVFALHDLDYGHTNKVTHRIKLNDETPFKHRPRPIHPQ FT DIDAVRKHLQDLLAAGIIRESESPFASPIVVVRKKDNSVRLCIDFRKLNSQ FT TIKDAYALPNLEEVFSALTGSKWFSVLDLKSGYYQIEMEEADKSKTAFVCP FT LGFWEFNRMPQGITNAPSTFQRLMERCMGDLNRKEVLVFIDDLIIFSESLE FT EHESRLMHVLKRLKEYGLKLSPEKCKFFQTSVRYLGHIVSENGVETDPVKI FT EALKTWPRPRNLKELRSFLGFSGYYRRFIQDYSKIIKPLNDLTVGYPPLQK FT RHLQENKNKQYLDPKKEFGDRWNQPCQQAFDMIIEKLTSAPVLGFADPKLP FT YVLHTDASTTGLGAALYQKQEGQMRVIAFASRGLTRSESRYPAHKLEFLAL FT KWAVTSKFSDYLYGTEFVVVTDSNPLTYILTSAKLDATSYRWLSSLSTYNF FT KLQYRAGSQNCDADGLSRRPHGELLDDPASQKERERIKQFTLHHLDEFGVE FT DSLILPEAIKAICDRHQIGNSSHKCKFSNPSIALVESLALHADVLPNEFEQ FT ENEHGLPVIPYLSNEELKRQQRMDPDLKFIIDCLQRNEKPSSSKDQSLAVT FT LWIREWSRLELRDGLLYRKKQDQESTHYQLALPVALRGTVLKSLHNDMGHM FT GMERTLDLVRTRFFWPKMSSSVEEKIKTCERCVRRKAFPEKAAEMMNIKTT FT RPLELVCMDFLSLEPDQSNTKDILVITDHFTKYAVAVPTRNQKAQTVARCL FT WENFLVHYGFPERLHSDQGRDFESSLIKELCLVAGIHKVRTTPYHPRGNPV FT ERFNRTLLQMLGTLENKKKSCWKEFVKPLVHAYNCTRNDVTGYTPYELMFG FT RQPRLPVDLAFGLPVDRSTKSHSQYVKDLKEGLRESYEIAIKNSAKVAQRN FT KRRFDKHVVVSTLDVGDKVLVRNLRLRGKNKLADKWEPDVYVVIRKAGDLP FT VYVVQPDGKTGPVRTLHRDLLRPCGYLSENEIEEMSPPNVQRKPRTRSSSA FT LEYAPKEHQMSDQSESEDDSLYIRNAGRQLESITTTVLPSSQSPVLVRNLP FT GIEPIEPLPVVVNPEKETLPDSRLEEDLTENQRDDVNENFLPVLNPADIDP FT KEIEPERSGNSVEVQIHRRALELDPVDVPHSNDQNPHVRNVSSNQPIVDED FT LDTSGPRRSKRQCRPPNKLEYHKLGNPLTLVIQSLLQGLSSAFTTSLEEPI FT LTRDQPFVVPDPFPIAVTTQPRTCPRTCLNSGGE" XX SQ Sequence 6775 BP; 2156 A; 1428 C; 1416 G; 1775 T; 0 other; taccaaaaag tggcgagagc cagccaggag agagattgca acaacagtgt ctaattacag 60 tattcgagtt cactataaat attacaaatc ggagagaact ttaacgtcat ggatatcata 120 gaaaaagaga atgtagatat ctcaaaagca gtaattgtgg gtggaatgac actgactgag 180 acagactcag atttagagtc atggctttta agatatggta gtattaaccg acatcttcta 240 attgatgacc ctgactgtga gtttcatcga catgctatca tagagtttac acataactcc 300 gcgatgaaaa cattgatgcc tcttttgcct ttaactgtag ttagtatgtc aaacccaagt 360 accactttca tggtacgtgc tttaagctgc gtttaccccc atattgctag tgatagtgct 420 actaatggat atctggagga attgcaaaac attgctagtt ttagtgggaa atccattgag 480 gaagtactcc aaacagagtt actgaagatt aaatttggtc cttctcatgc tgagtcacta 540 cctgttttgg ataaaaagct tgaatttcca aatgcagcac gttctcaaat acttgatcgt 600 agcacagtca gttcaccaaa tagactgctg tccccagtca tatcacaaag tatgattact 660 gaacaaacag cttttccttc atctagaatt tcaccatttc atgaagttga atcaacaaat 720 tcaaaaaacc tgtccaagga aagccttaac catagaagta ctaaacccac agtaacagtg 780 tcatcccatc cagcacttac catggatata attgatcctc ctagcgtgca aaaggtagta 840 gttgagcaca ttgtccgcac aaatgacaca gctccaatgc accatacctc ttttcgcctc 900 cgatctttct ctggaaaaat tcctagacct gttaatgagc cagattttga cacttggcgt 960 gccagtgttg atctcctact gacagatcct tctatatctg acttaaatcg agccagaaaa 1020 atcatagaca gtctgcttcc ccctgctgca gatattgtta aacatgtctc ccctaacagt 1080 ttacctgcag tatatctgga attgctggag tctgtatatg gctctgtaga agacggagat 1140 gagttattag ccagatttat gaatagcttc caaaacaatg gtgagaagcc ttcaacttac 1200 ctgcacagat tacaagttct cttaagcaca gctattcgac gaggtgggat atttgaagaa 1260 gagagaaacc gatatcttct aaagcagttt tgtcgcggct gttgggacag ttccctcatt 1320 gctgaccttc aattagaaag gagaaaagcc actcctcctt catttgcaga attagtagtt 1380 ctcatccgta cagaagaaga taaaaatgcc tctaaagaag aaagaatgag aaaacattta 1440 gggctaaata aacactatcc tgccccctcg aaattcagac tgtcagctca ccagatatct 1500 gcccaccaaa gtgaaacgca ggatgatcaa actgacacat ctctcgcaaa gcaagtgtgt 1560 gaacttcaag ctcaagttgt tgcactgcaa aagccttcaa gccagaaaga aaagaaaaaa 1620 aatgcaaaac cagatgaagt gagtgagctg agaaatgttg tcactgagtt acaggcacag 1680 attacagcca tgcaaactac agccactcca aaaattaaaa gtgatgtaga agcaactgaa 1740 attgctgact taaagagaca gattgctgat ttaaaggttc aactgactgc ccctgatatg 1800 tatagaaacc gcaccagaaa cttgctgcct gaacctagag caacagattg ttacagagct 1860 agtaaactac ctgaaagtag acctcgtccg gggtattgtt ttagatgtgc ggaagatggt 1920 catcttgcca gcagctgtag taatgctcct gaccctacta aagttgctga gaaaaaacgc 1980 aagttaaggg agcgacaagc ccagtgggat acccaacaag tagcaatcat gaatccttta 2040 aactgaggac ggtctctgta gaggggcata cagagactaa aagaaataat aattgccctg 2100 agaaacgtaa acaattgttc aaccaaaatg cgtgtgacgc accccctttg agaaatttac 2160 caagaggatt agtgggagtg aagtgtactg cccaaataac tgttggtaat aaaagagttt 2220 cctgccttct ggacacaggg tcccaagtaa ctactgttcc ctggtcattt tatcaagaga 2280 atttatcaaa ttgtccactt aaatcattgg ataacttgct ggaagtggaa ggggcaaatg 2340 gtcaaacagt gccttatctt ggatatgtgg aattgactct taagtttccc agagagttcc 2400 ttggaacaga gacagaagtg cccactttag ccctggtagt cccagatttg atgaacactc 2460 cccaagttct aattggcaca aattcattag atgctcttta cagcaactat gtccaacaat 2520 ctgcttcctt tcctcaatct aacttccatg gttaccgtgc agtgcaaaaa gttttagaag 2580 caagatacaa acaagcaagt gctgatgtag tgggctgtat caaattcaag ggacatgttc 2640 cagaggtagt acctgcagga tgtacagtgg ttcttgatgg acatgttcta gttaattgtc 2700 ctcatgtagg gaaatgtgta gctctagagt caccaacttc acccgcttta cctggtggtt 2760 tgctagttgc cagctgtttg cattccttac ccagcaaaag gcatcaacag ttaccagttg 2820 tgttacggaa tgaaactcag accgacatta ccatctatcc cagaactata attgctgaaa 2880 tgcgggcagt ccaagaagta ataaagagtg ggcaagtaaa ttccagcact gtcaataaag 2940 aactttctgc ttgttccaat ctcaaatttg actttgaaaa ttccccattg acacctgaat 3000 ggaagaaacg aataacggat caattaaatt ccatgcctga agtcttcgcc ttgcatgact 3060 tagattatgg acatacaaac aaagtcactc accgaataaa gcttaatgat gagactcctt 3120 tcaaacacag acctcgaccc atacatcctc aggacattga tgcagtacga aaacatttgc 3180 aagacttgtt agcagctgga attatccgag agtcagaatc cccctttgcc tcccccatag 3240 tagttgtaag aaagaaagac aattctgtac gtctttgcat tgacttcaga aagctgaact 3300 cacaaaccat taaagatgcc tatgccctgc caaatctgga agaggtcttt tcagcactaa 3360 ctggttcaaa atggttctct gtccttgact taaaatcagg atattatcag attgagatgg 3420 aggaagctga caaaagtaaa actgcctttg tgtgtccctt ggggttctgg gagttcaata 3480 ggatgcccca aggcattacc aatgccccaa gtacgtttca aaggctgatg gaaagatgca 3540 tgggtgactt gaatagaaaa gaggtgttgg tcttcatcga tgatctgatc attttctctg 3600 aaagtttaga agagcatgaa tcaaggctga tgcacgtttt gaaaaggctc aaagaatatg 3660 gactgaagct atcgcctgaa aagtgcaagt ttttccagac ttctgttcga taccttggtc 3720 atattgtatc agaaaatgga gtggagactg atccagtgaa aatcgaggcc ctaaaaacct 3780 ggccaagacc aagaaatctc aaagaattaa gatcttttct gggattttct ggatactata 3840 ggaggttcat tcaggattat tccaagataa tcaaacccct taatgacctc acagtagggt 3900 atccacctct tcaaaaacgt cacctacaag agaacaagaa taagcaatat ctggacccca 3960 aaaaggaatt cggagacaga tggaatcagc cctgtcaaca ggcctttgac atgattattg 4020 agaaactcac ctctgcacct gttctgggat ttgcagaccc aaagcttcct tatgttctgc 4080 atactgacgc cagtaccact gggcttgggg cagccttata ccagaaacaa gagggacaaa 4140 tgcgggtcat tgcttttgca agcagagggt tgacaagaag tgaaagccgg tatccagctc 4200 acaagctaga atttctagct cttaaatggg cagtcacatc taaattcagt gactatttgt 4260 atggaacaga atttgtggtc gtaactgata gcaacccttt aacttacatt ctgacatctg 4320 caaagcttga tgctaccagt tatcgctggt tgtcaagtct gtccacttac aatttcaagc 4380 tccagtatag ggctggaagt caaaactgtg atgcagatgg cctttcaaga cgaccacatg 4440 gtgagctttt agatgaccct gcatctcaga aagagagaga gagaattaaa cagtttaccc 4500 ttcatcattt agatgaattt ggagttgaag attctcttat cctcccagag gccataaaag 4560 ccatctgtga tcgacatcag attggaaatt cctcacataa atgcaaattt tccaaccctt 4620 ccattgccct tgttgagtcc ttagccctgc atgcagatgt attaccaaat gagtttgaac 4680 aagaaaatga gcatggtctt ccagtcattc cttacctgtc caacgaggag ttaaagagac 4740 agcagagaat ggatcctgat ctcaaattca ttatcgattg tttacagcgg aatgaaaaac 4800 cttctagttc aaaagaccag tcgcttgctg ttactctgtg gataagggaa tggagcagat 4860 tagaattaag ggatggattg ctttatagga agaagcaaga tcaggaaagc actcactatc 4920 aattagcctt acctgtagct ttacgtggaa cagtgttgaa gagtctccat aatgatatgg 4980 gacacatggg catggagagg acacttgacc ttgtcagaac cagattcttt tggccaaaaa 5040 tgtcatcatc tgtggaagag aaaattaaga catgtgagag atgtgtacgt agaaaagcgt 5100 ttcctgaaaa agcagctgaa atgatgaaca tcaagactac cagaccattg gagttggtct 5160 gtatggactt cttgtcttta gagccagatc agagtaacac caaagatata ttagttatta 5220 cagatcactt caccaaatat gcggtggctg tgcctaccag aaaccagaag gcgcagactg 5280 tggctagatg tttatgggaa aactttctag tacattatgg atttccggaa agactgcaca 5340 gtgatcaagg acgagatttt gagtcaagcc tcatcaaaga gctatgtctc gtcgcaggta 5400 tacacaaagt gagaactact ccttaccacc caagaggaaa tccagtggag agattcaata 5460 ggacccttct ccaaatgttg ggtactcttg aaaacaaaaa gaagtcatgc tggaaggagt 5520 ttgtcaagcc tttggtgcat gcctacaatt gcactcgaaa tgatgtaaca ggatacactc 5580 cctatgaact tatgtttggt agacagccca ggctgccagt cgacttagcc tttgggttac 5640 ctgtggatcg ttctaccaaa tcccactctc agtatgtaaa agatctgaaa gaaggtttaa 5700 gagagagcta tgaaattgcc atcaaaaact ctgcaaaagt agcccaacgt aacaagcgca 5760 gatttgacaa acatgtggtt gtttctactc ttgacgtggg agataaagtc cttgtgcgaa 5820 atttgaggct aagaggcaag aacaaactgg cagacaaatg ggaaccagat gtctatgtag 5880 ttatccgtaa agctggagat ctcccagtat atgtagtcca gcctgacgga aagaccggtc 5940 cagttcgaac tttacacaga gacttacttc gaccttgtgg atatttgtct gaaaatgaaa 6000 ttgaagaaat gagtcctcca aatgttcaac gtaagcccag aactaggtct agctctgctc 6060 tagaatatgc tcccaaagaa catcaaatga gtgatcagtc agaatctgag gatgactctc 6120 tatatattag aaatgcagga cgccagttgg aatccattac aacgactgtg ttaccttcat 6180 cacaaagtcc agtgcttgta aggaacttac ctggcataga gcccattgaa ccactccctg 6240 tcgtggtgaa ccctgaaaaa gaaaccttac ctgattccag attagaagaa gacttaactg 6300 aaaatcaaag agatgatgtc aatgagaatt tcttacctgt gctgaaccct gctgatattg 6360 accccaaaga aattgaacct gaaagaagtg gaaatagtgt tgaagtacag atccatagaa 6420 gggcgcttga attagatcct gttgatgtac cccattctaa tgatcaaaat cctcatgtca 6480 gaaatgtttc aagtaaccag ccaatagtgg atgaagacct agataccagt ggccctagac 6540 gctcaaaaag gcaatgtaga cctcctaata agcttgagta tcataaacta ggaaatccct 6600 tgacactagt cattcagtcc ttactacaag gtctgagttc tgcctttacc acatcgttag 6660 aagaacccat actcactaga gatcagccct ttgttgtgcc agaccctttt ccaattgcag 6720 tgacaaccca accccgtaca tgcccgagga cgtgcctgaa ttcagggggg gaatg 6775 // ID Copia4-I_DR repbase; DNA; ZEB; 4207 BP. XX AC chr15; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia4_DR; KW Copia4-LTR_DR; Copia4-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4207 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1590-1590 (2008). XX DR Genome; chr15; Positions 7905277 7901071. XX CC 'TTTAT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 78..4106 FT /product="Copia4-I_DR_1p" FT /translation="MVSRAGGYLAPQFLFDGSEEKFDLWETRFLGHLHTLK FT LKETILHQPTDAEAEQIAEDRRKNADCYAELIRLIDDKSLSLVRHDAADDG FT RKALKILKEHYSGKSKPRIINMYTALTQLHMSDNESVTDYLIRAENNITAL FT REAGELMSDGLTIAMVLRGLPDSFKPLAVHITQNEDDVTFTDFKRRLRVYE FT ETEKMKVATTDNVMKTCTRQGQGFNKPHTRDRKDEDINIICYKCGIKGHRA FT RKCYKRVWCNHCKNSTHAEFLCKKKTDRDGVRKFKDECDSDQDHLFKVKNL FT EYENMSPNINEKGIMVDAGATSHIVNDIRKFKDFDSSFRPQTHSVELADGT FT RCNGIAQQRGTAVIHFLDESGHQHRAQLRDALYMPSYPHDIFSVARATNGG FT ATVIFKRGNSRMITKDGNKFNIHETGNLYYLPTVEENADQCKVCHDLQTWH FT EILGHCNYEDVRKLQDVVRGMNIKGDTAKKNWLCEICTKGKFTQTRSREPD FT RRAEKPLELIHTDLTGPMPTQSREGHKYAQSFIDDYSGTILVYFLKSKSDT FT VQATEKFLADIAPYGDVKCIRSDNGTEFTCRDFQTLLMKNKIRHETSAPYS FT PHQNGTAERGWRTLSDMTRCLLIESKLPEELWNYAMQTSAYVRNRCYCSRT FT KKTPYELFTGRKPDVSKLQKFGSTCFAYKQEKGKLDSRCEEGVFIGYDKNS FT PAYLVYYPNTRKIQKQRLVKFKIRLMREKETQTSETSTEFEIISPKISSEG FT KDEGDLQDKPAQEIQSDTSETLSEQTEHTQPDKTTETAMRKNPTRVRRKPA FT HLQDFETDNQEDKLQTCIDSCYRVICDVPQNYQEAITSTNSMHWRNAMDKE FT MKSLKENQTFTLTQLPQGKQPVGGRWVYTLKRDTDGSEKYKARFVAKGYNQ FT KPGTDYDETFSPTADMTSVRVVMQKSAQENLILHQMDVETAYLHAPIDHEV FT YVEQPEGYEQESETGEKIVCKLEKSLYGLKQSGRNWNAVLHECLTENGFMQ FT NPADNCVYTREKPNEKVILIVWVDDLIIAASSESILESVKGMLTERFKMKD FT MGRLKHFLGIDFEQTEGLVKMSQEKYVNKILERFEMQNCKSRETPCELKLD FT FSEDAEKMKDKRKFREAVGSLIYLSTCTRPDLSFVVSKLSQNFAEPTEQHW FT NTVKHVFRYLKGTTKHELSFRKNDTEKLGLKVYSDADWASDTQDRRSTSGY FT CVSLSEESSLISWKTRKQSTVALSTCEAEYMSLASAIQECIYLKQLLRNMD FT RYQYAPTKVYEDNQGTIALARNPVCRQRCKHIDIKYHFIRETVNDGKVILE FT YCPTEEMIADIMTKPATKQKLKRFYRYIFDT" XX SQ Sequence 4207 BP; 1558 A; 750 C; 939 G; 960 T; 0 other; ggttatgggc ccaggcagcg agacaaacga tcgtttttga gaaagtaagg ttaacgcgac 60 gtattacgcg tgcaaagatg gtaagccgag cgggaggata cctagcgccg cagtttctat 120 tcgatggatc agaagaaaaa ttcgatctat gggagacaag attcctggga cacttacaca 180 ctctaaagtt aaaagaaact atattacatc agcctacgga tgctgaagcc gaacaaatag 240 ccgaggacag acggaagaac gctgattgct atgccgaact gatcagactg atcgatgata 300 aaagcctatc actggtaaga cacgatgctg ccgatgacgg aagaaaagcc cttaagatac 360 tgaaagagca ttattcaggc aaaagtaaac cacgcataat taacatgtac acagcattga 420 ctcaacttca catgagtgac aacgagagcg tcacagacta tttaatacga gcagaaaaca 480 acattacagc tctaagggag gctggagaat taatgagtga tggactgacg atagcgatgg 540 ttctccgagg gctaccggat tcattcaagc cactagccgt ccacataaca caaaatgaag 600 acgatgtcac atttacagat tttaaaagga gactccgagt ttatgaggag acagagaaaa 660 tgaaagttgc aactacagac aatgtaatga agacctgcac gagacaaggg caaggcttca 720 acaaaccaca cacaagagac agaaaagatg aagatataaa tataatatgc tacaaatgtg 780 gaataaaagg gcacagggcc cggaaatgct acaaaagagt gtggtgtaat cactgtaaaa 840 atagcacaca tgctgaattc ttgtgcaaga aaaagactga ccgagatggt gtccgaaaat 900 ttaaagatga atgtgacagt gaccaggatc atctatttaa agttaaaaac ttagaatatg 960 agaatatgtc acccaacata aatgagaaag gcattatggt agatgcagga gcgacatcac 1020 acattgtgaa tgacataaga aagtttaaag attttgacag ctctttccga ccacagactc 1080 actcagtaga gctagctgac ggaacaaggt gcaacgggat agcacagcag aggggaacgg 1140 ctgtaattca ctttttagat gaaagcggac accaacacag agcacaacta agggatgcac 1200 tatatatgcc ttcctatcca catgatattt tctctgtagc aagagcaaca aatggcggag 1260 caacagtcat ctttaaaaga ggaaacagcc gtatgatcac caaagatggt aacaaattta 1320 atattcatga gactggaaat ttgtactatt tgccgactgt agaagaaaat gctgaccagt 1380 gcaaagtgtg ccatgatcta caaacttggc atgaaatttt gggtcattgc aactacgagg 1440 atgttcggaa attacaagat gtagtaagag gcatgaatat aaaaggagat acagctaaga 1500 aaaactggct atgtgaaata tgcacaaaag gaaaattcac acagacaagg agcagagagc 1560 cggacagaag ggcagagaaa cctttagagt tgattcacac agatttgact ggtcctatgc 1620 cgacacagag cagagaagga cacaaatatg cacaatcttt catagatgac tactctggga 1680 ctatattagt ctactttctg aaatctaaat cagatacagt acaagccaca gaaaagtttt 1740 tggcagacat cgcaccttat ggagatgtca agtgcattcg ttctgataat ggcactgaat 1800 ttacgtgtcg tgattttcag acacttctga tgaagaataa gatcagacat gaaacgtccg 1860 caccgtattc acctcaccag aacggtacag cagagagagg ttggagaaca ctctctgaca 1920 tgaccagatg tttattaatt gaaagcaaac tgccagagga gttatggaac tatgcgatgc 1980 agacatcagc ttacgtgagg aacaggtgct actgtagtcg tacaaagaag acgccatatg 2040 agttattcac aggcagaaag ccagatgttt ccaagctgca gaagtttgga tcaacatgtt 2100 ttgcttacaa acaagaaaaa ggcaaacttg actcgagatg tgaagaggga gttttcattg 2160 ggtatgacaa aaacagtcca gcgtatctag tgtactaccc caacacaaga aagatccaaa 2220 aacaaaggtt agtgaaattc aaaatcagac taatgagaga aaaagaaaca caaacatctg 2280 agacaagtac agaatttgag attataagtc ctaagatcag cagtgaggga aaagatgaag 2340 gagatcttca ggataaacca gctcaggaga ttcagagcga cacttcagag actttatctg 2400 aacagactga acatacacag ccagataaaa ctactgagac tgcaatgaga aagaacccta 2460 cacgagtgcg gagaaaacca gctcatttac aggactttga gactgacaat caagaggaca 2520 agttgcaaac atgcatcgac tcctgttata gagtaatatg tgacgtacca caaaactatc 2580 aggaggctat aacatccaca aactcaatgc attggaggaa tgcaatggac aaagagatga 2640 aatctcttaa agagaaccag acttttacac tcactcagtt gccacaagga aaacaaccag 2700 ttggaggcag atgggtttac acactgaaaa gagacactga tggatcagaa aagtacaagg 2760 caagattcgt tgcaaaagga tacaatcaga aaccagggac tgattatgat gagacattct 2820 ctcctacagc cgatatgaca agtgtgcgag tggtaatgca aaaatcagcg caagaaaacc 2880 tgatcttgca ccagatggac gttgaaactg cttatttgca tgcaccaatc gaccatgaag 2940 tttatgtaga acagccagaa ggttatgaac aagaatctga aacaggtgaa aagattgtat 3000 gtaagctaga aaaatctctg tatggtctga aacaatctgg tagaaactgg aatgctgtat 3060 tacatgaatg tttgactgag aatgggttta tgcaaaatcc agctgataac tgtgtatata 3120 cacgagaaaa gcccaatgag aaagtaatcc taattgtatg ggtggatgac ctaataatag 3180 cagctagcag tgaaagcatt ctagagagtg tgaaagggat gctcactgag aggttcaaaa 3240 tgaaagacat gggaagattg aaacatttcc tgggtataga ttttgaacaa actgaaggtc 3300 tagtcaaaat gtcacaggag aaatatgtta acaaaattct ggaacgtttt gagatgcaaa 3360 actgcaaatc tagggaaacc ccatgcgaac taaaacttga tttttcagaa gatgctgaaa 3420 agatgaaaga caaaagaaaa tttagagagg cagtgggaag tttaatttac ttatccacat 3480 gcacacgacc tgatttgagt tttgttgtca gtaaactttc ccaaaatttt gctgaaccaa 3540 cagaacagca ctggaacaca gtaaaacatg tgttcagata cctaaaaggt acaacaaaac 3600 atgagctatc tttcaggaaa aatgacacag agaaactagg tctgaaagta tacagtgatg 3660 ctgattgggc atcagacaca caagacagac gtagtacatc aggatattgt gtcagtctta 3720 gtgaggaaag ttctttgatt tcctggaaaa caagaaaaca atcaacggtt gcattgtcaa 3780 cttgtgaagc agagtacatg tccttagcat cagctataca ggaatgtatc tacctgaaac 3840 agttactcag aaatatggac aggtatcaat atgcaccaac caaagtgtat gaggacaacc 3900 agggtaccat agcacttgca agaaacccag tctgtagaca aagatgcaaa cacattgata 3960 tcaagtatca tttcataagg gaaactgtaa atgatggtaa agtaatcttg gaatattgtc 4020 ccactgagga aatgattgct gatatcatga caaagccagc cactaagcag aaactaaaga 4080 ggttttatag gtatattttt gatacttgaa gagtaaattt ttgtttacat tattgatgta 4140 aaaggaaaag ccttatttga tttgttatct attttgagtg atacaataag ctacgagcga 4200 gtggggg 4207 // ID Gypsy-34-I_DR repbase; DNA; ZEB; 4439 BP. XX AC . XX DT 01-JUN-2007 (Rel. 12.06, Created) DT 05-JUN-2007 (Rel. 12.06, Last updated, Version 1) XX DE An internal portion of the Gypsy-34_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-34-LTR_DR; KW Gypsy-34_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase; Gypsy-34-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4439 RA Dib M.R. and Naveira H.F.; RT "Gypsy-34_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(6), 189-189 (2007). XX DR [1] (Consensus) XX CC Gypsy-34-I_DR is an internal portion of the Gypsy-34_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-34-LTR_DR. CC Gypsy-34_DR is characterized by 4-bp target site duplications. CC The internal portion encodes one polyprotein the 1473-aa CC polyprotein Gypsy-34_DR1p (pos. 19-4437) composed of the gag, CC protease, reverse transcriptase, and integrase domains. Some CC insertions fairly recent, according to the high identity between CC their flanking LTRs. Consensus obtained after the alignment of at CC least five independent insertions bearing at least 85% homology CC over at least 1000bp. XX FH Key Location/Qualifiers FT CDS 19..4437 FT /product="Gypsy-34-I_Dr_1p" FT /translation="MASRRKVTSGKQARKGLRSQKPVLEEVAWDTMDEEET FT ASAEGAEAMSPRVKPESKGMESDVVALMRTFLTAQSKREEGLIYELRGLRE FT SLQRSVIPETSPSQSLRLDLPTPAASRIRGPSVTQNPTLSTQRAADSPPPP FT PMRPESRMPAFQEGEDVENYLRRFERLAKTWGWPEEEWANRLVPLLTGKAL FT EAYLAMDEDRAEVYKDLREALMAKFDLSTETYRQRFRQTSIPAGETPTETY FT NRLKGLYKRWIRPEQCTKEEVGEAIILEQLLRVLPLEVRTWVREHEPTDGL FT NAAKLAVQYLNAHRGGQMYARSTPTRGAKHTDTSGKGTVSVFPTKKLICYY FT CQQEGHKASNCPVRKPKLTSLCYVPREGDVKDKVTYTNRHIDVTVNGQNIN FT ALIDTGSTFTLIKQCYIPRNTIDFTHTQNIQCVHGDKKDYPTTEVTLVIND FT QPYLMSVGVVDALPVGMVLGWDLPVLNELLSNAVESTNVYDDISCPVLTRS FT QTRAGLEPLPAMDDSLIQGEGGKPKKSRRQRRLEKQLGSPIVEVSTEGLKV FT TEWDIPKNIAQLQAEDVTLKPLFSKVCEGEKQIETLGAEKYVIEKDVLYVL FT SDGVKRLVVPTCCRSLILHLAHTIPWSGHLAHQKTYQRIGTRFYWPSMYTD FT VHTYCSTCIVCQKTSSVHRSARVPLHPLPVIETPFKRIAMDIVGPLERSST FT GNQYILVICDYATRYPEAFPLRSITTAKVITALTQLFSRVGFPQEIITDQG FT TNFTSRLMQLLHKQLGIHSIRTSPYHPQTDGLVERFNQTLKRMLRKFVSDN FT GKDWDKWLPFLLYAYREVPQASTGFSPFELLYGWQVQGPLDLVKRSWEASS FT TTAEGETNIVKFVLQMRDRLEKYREQAQENLEEAQKNQKLWYDKWSRRREY FT QPGQKVLLLLPSSTNKLLAKWQGPYEITKKKGPVTYEVFHPDKGKKNQTYH FT VNLLKEWREPGPELKKGMLACKVETPEAEDDDKVDFETPEHLQSAVSLEHL FT QERQRVELTEVLNTYPELQRGRPGRTNLVEHKICLTEATPIRQRPYRVPES FT LIKPLKEELKMMLEMDIIEPSTSAWSSPIVIVPKKDGTLRVCLDFRKLNAV FT SKFDAYPMPRIDELVERIGRAKYITTLDLCKGYWQVPLEKTSREYTAFRTP FT VGLYHFKTMPFGLHGAPATFQRLMNQVLRNCEEYSAAYLDDVVIYSTTWTD FT HVHHLHKILQKIQKAGLTLNVAKCEWARQETRYLGFQLGNGEIRPQVDKVE FT ALRNCPRPRTKKEVRSFLGLVGWYRRFIPQFSEKAAPLTNLTAKAIKNPVP FT WTEECEKAFITLKERMCTVPVLRSPDFTKKFLVQVDASARGIGAVLMQEDE FT GQEHPILYLSRKLLPRETRYATIEKEGLAIKWALDSLRYYLMGREFTLETD FT HRALTWINTMKDQNARVTRWYLSLQPFKFNVRYKAGKSNVTADYLSRLPDV FT SNLGEVGG" XX SQ Sequence 4439 BP; 1364 A; 972 C; 1142 G; 961 T; 0 other; agtggtgtca gaagtgggat ggccagtcgt agaaaagtga cttctggaaa acaggccagg 60 aagggtctgc ggtcacaaaa acctgtgctg gaagaggtag catgggacac tatggacgaa 120 gaggagacag ccagtgcaga gggtgccgaa gccatgtccc ctagagttaa acctgaatca 180 aaaggtatgg agagcgatgt agtggcattg atgaggacat ttctgacagc tcagtcaaaa 240 agggaagaag gcctcatcta tgagctacgg gggttgcgag agtcactgca acgctcagtt 300 ataccagaaa catcaccaag ccaaagccta cgcctagatc tacccacacc tgcagcttca 360 cgaattcggg gaccgtcagt cacgcagaac ccgacactgt cgacacagag ggcggcagac 420 tcaccaccac caccaccaat gcgaccagag tcacggatgc cagcatttca agagggagag 480 gacgtggaga attacttgcg tcgttttgag cgtctggcaa agacctgggg ttggccggaa 540 gaggaatggg cgaatcgact agtacccctt ctcaccggaa aagcgctgga ggcatacctc 600 gcgatggacg aagatcgggc ggaagtctat aaggatcttc gggaggcact gatggcgaaa 660 ttcgatttat caacagagac ctaccgacaa cgatttcggc agacatcaat tcctgctgga 720 gaaacaccga cggagaccta caatcgcctc aaggggctct acaaacgatg gatacgtccc 780 gagcagtgca caaaggagga ggtaggagag gctatcatcc tggagcagct gctacgagta 840 ctgccattgg aggtacggac ctgggtgaga gagcatgagc cgaccgatgg gctgaatgca 900 gcaaagttgg cagtccagta cttgaatgca caccgaggag gtcagatgta tgcccgttct 960 acaccaacac gaggggccaa gcacactgat acatctggaa aaggtactgt ctctgttttt 1020 cctactaaaa agcttatttg ctattactgt caacaagagg ggcacaaggc gtcgaactgc 1080 ccagtacgca aacccaaact caccagtctg tgttatgttc caagagaggg ggacgtaaaa 1140 gacaaggtga cgtacactaa tcgacacata gacgtcaccg tgaatggaca aaacataaat 1200 gcactcattg acactggcag cacatttaca ctgataaaac aatgctacat tccaagaaac 1260 accattgact ttacacacac acagaacatc caatgcgtac atggggacaa aaaagactat 1320 cctaccactg aagtgactct tgttatcaat gatcaaccat acttgatgtc agtgggggta 1380 gtggatgctt tacctgtagg tatggttttg ggatgggacc tacctgtgtt gaacgaactg 1440 ttaagcaatg ctgttgaaag taccaatgtg tatgatgata tatcatgtcc tgtgctaaca 1500 cgctcacaaa ccagagcggg tttagagccc ttacctgcta tggatgacag tttgatccaa 1560 ggggaggggg gtaaaccaaa aaagtcacgg cgacagcgcc gtctagagaa gcaattgggt 1620 tcacctattg tggaagtgtc aactgaaggg ttgaaggtga ctgaatggga tatccctaaa 1680 aatattgcac agttacaagc tgaagatgta acattgaaac ctttgttttc taaagtgtgt 1740 gagggggaaa aacaaattga gacgctaggg gcagagaaat atgtgataga aaaggatgta 1800 ctatatgtgt tgtctgatgg tgtcaaacgt cttgttgtgc ctacatgttg cagatcactc 1860 atattacatc tagcacacac cattccttgg tctggacatt tagcacacca gaaaacctac 1920 caacgcatcg gtacacgatt ctattggcct tccatgtata ctgacgtaca cacatattgc 1980 tccacatgta ttgtttgcca gaaaaccagc tctgttcaca ggtcagcgcg agttcctttg 2040 catccattac ctgtcatcga aactccattc aaacggatcg ccatggacat tgttgggcca 2100 ctggaaagaa gcagtacagg caaccagtat atattagtca tctgtgatta cgctacacga 2160 taccctgaag cattccctct acgcagcatc acaactgcta aggtaatcac tgcattaaca 2220 caactgtttt ccagagttgg attcccgcaa gagatcatca ctgatcaggg cactaatttc 2280 acatcaagac tgatgcagtt actgcacaag cagctgggca tccattccat ccgcacaagt 2340 ccctaccatc cacaaaccga tggattagta gagagattta atcaaaccct caaaaggatg 2400 ttgaggaaat ttgtaagtga caacggcaaa gactgggaca agtggcttcc gttcctgcta 2460 tatgcatacc gagaggtccc acaggcatcc acaggctttt cgccttttga actcctgtac 2520 ggatggcagg tgcaaggtcc tctcgatctg gtgaagagga gttgggaagc atcaagcact 2580 acagccgaag gagaaactaa cattgtaaag tttgtattac agatgagaga cagacttgag 2640 aagtaccgag agcaagctca ggagaatctg gaggaagctc agaagaacca gaagctgtgg 2700 tacgacaaat ggtctcgacg gagagaatac caaccaggac aaaaagtgct gttactatta 2760 ccatcttcca caaacaagct attggcgaag tggcagggtc cctatgagat cactaagaag 2820 aagggtcctg ttacctatga ggtgtttcat ccagataagg ggaagaaaaa tcaaacatac 2880 cacgtcaatc ttttgaagga atggcgggag ccaggacctg agttgaagaa aggtatgttg 2940 gcctgtaagg tggagacacc tgaagcagaa gatgacgaca aggtggactt tgaaacccca 3000 gaacacttgc agtcagcggt aagcctggaa catctgcaag agaggcaaag agtggaactg 3060 accgaagtgt taaacaccta cccagaatta cagagaggaa gacctggaag aacaaacttg 3120 gtggagcaca aaatctgtct aacggaagca acacctattc gtcaaaggcc atatagagtg 3180 cctgaaagcc tgattaaacc cttgaaggaa gagctgaaga tgatgttgga aatggacatc 3240 atagaaccat cgactagtgc ttggagtagt ccaatagtca ttgtacctaa gaaggatggt 3300 acgctgagag tatgcctgga cttcaggaag ttaaatgcgg tgtccaagtt tgacgcttac 3360 ccgatgccac gcatcgatga actggtcgag aggattggga gagccaaata cataactact 3420 ttggacctct gtaaaggata ctggcaggta cccttggaga agacgagcag ggagtatact 3480 gcatttagga caccagtggg gctataccat tttaagacca tgccctttgg tcttcatgga 3540 gctcccgcaa cgttccaaag actcatgaat caggtattaa gaaactgtga ggagtacagt 3600 gccgcatacc tggatgacgt agtcatttat agtactacct ggaccgatca tgtacaccat 3660 ctgcacaaga ttctgcagaa gatacagaaa gctggtttga cactgaacgt cgccaagtgc 3720 gagtgggcca gacaagaaac gaggtatctt gggttccaac tggggaatgg agagatacga 3780 ccacaagtgg acaaagtgga ggcactccgc aactgtccta gacctcgtac caagaaggag 3840 gtacgatcct ttctcggact ggtgggatgg tatcggagat tcattcctca attctctgaa 3900 aaagcagcac ccttgacgaa tttgacggct aaagccataa agaatccggt tccgtggacc 3960 gaagaatgtg agaaggcttt cattaccttg aaagaaagaa tgtgtactgt acctgtttta 4020 cgaagcccag actttacaaa gaagttcttg gtacaggtgg atgcctcagc gagaggaatt 4080 ggagcagtgc ttatgcagga ggatgaaggt caggaacacc ccatcctata cctgagtaga 4140 aagctgttgc cacgagagac caggtatgcc accattgaaa aagaaggatt ggctataaag 4200 tgggcactgg acagtttgag atactacctg atgggtagag agtttaccct agaaacggat 4260 catagagcct taacctggat caatacaatg aaagaccaga atgccagagt gaccagatgg 4320 tacctgtcac tgcagccttt caagtttaat gtaagataca aggcgggaaa gagcaatgtc 4380 acagcagatt acctgtcacg attgcctgac gtcagcaatc taggagaggt gggaggtaa 4439 // ID Gypsy60-LTR_DR repbase; DNA; ZEB; 723 BP. XX AC chr15; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy60-I_DR; KW Gypsy60-LTR_DR; Gypsy60_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-723 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1436-1436 (2008). XX DR Genome; chr15; Positions 21141391 21140669. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 723 BP; 205 A; 102 C; 160 G; 256 T; 0 other; tgtaacgaag ttataatttt cattgtgaat cctcctgaat tatgcagcag ttatgtgttt 60 tatagagtaa gggcgccctc atgtggtttg tttgggtacc gcagacatac aagttaggca 120 ggaagtgacc tagcgcagca gatgctgtga gacagccatg ctgtgggtag gtgtttttca 180 tgctccgtca tagtttatta atattagctt aagaaagtag cgatttcaaa catttagttt 240 ataatttggt tatattatca ttaagcatta agtttgactt gtgacatgta gtttttatgg 300 ctttttataa atgtctactg gattatgctc tttagcaagg ccatgcggtt gatttgattt 360 tctatttcat ttagatgttg tatgtctgca gcaagcaatc cactaataag ggctgtactg 420 tttattattt caggtatgtc aatgtttatt tgttaattgt gaatgttacg ttttatgtta 480 aatgttcata tgattatcag ttaatatatg gatattaatg atgtaatagg cagcagatgc 540 tgtgagacag ccatgctgtg gatgttgtat gtctgcagca agcaatccac taataagggc 600 tttactgttt attatttcag tatatcagta aaaaggagta atcaaatact gtgtccgggc 660 cttcattaaa gagcattgca cacaacgcag aggaagaaga gttgaaggac cgaagacgtt 720 aca 723 // ID hAT-N36_DR repbase; DNA; ZEB; 1740 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N36_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1740 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1306-1306 (2008). XX DR [1] (Consensus) XX CC TSD is 8-bp long. Most elements are inserted into TA satellite. CC The consensus contains an insertion of HE1_DR1 SINE element CC (masked out). CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1740 BP; 567 A; 310 C; 352 G; 510 T; 1 other; taagggtgga gccgaacccg aatacggtat tcggaaaggc acgaatagcg tgtttttacg 60 aatacttatt tcgaacaaat acttgaaaaa ttatttgtat tcgggagcaa gaaaaacact 120 atatcaaaaa gcagcgtttc ctcatgagac cacagtgcat gcccgcgtga gtgagtgagt 180 gagtgagaga gagagagaga gagagagaga gagagagaga gagagagaga gagagcgcgc 240 agagagagag agagagagtg agtcagagcg agagagacgc tcagtcggag ggcggggcgg 300 aggtaaaagg tgtctggcac aagcttctcc acagatacag acagagaagg ctcggctgag 360 cgaaaaaaat acttcactgc atcactattt ttgttaaata gatttttgta ccttaattgg 420 gttccagatg cagtttataa acttgtagcg gagtttgatc gggattttcc attacgctgc 480 atttttgaag tttgcaatcg ggtgctctct gtttacgcaa cgttagctct gttagcgcgg 540 taatttagac aatagactgt tgacagagta acgttaacgt tcgtttataa aggttaaaac 600 gatgcttgtg cagttgtgtc gaatagtatg cacttatggg agttatatgg tacgtctaca 660 ttactgtctt ttgcagacag caagatatat gctataggct agttaacctt agcatagctt 720 cccgtcactt tttattaact tgaaactcct caaatacatt tgggcacccg aatagaaaaa 780 gagtgagact gctgctgagc catttcttgg tcctccacca gtcaaaaaaa aaaaaaaaaa 840 aacatgttct ttgtggaaga attttacagt cagtggagct gcagataaaa caaactgata 900 ctgtataatg tgtgcaaaag tgaaatcagc agaggcagag atattaaaca tctgtcaaca 960 tcttctatgc ataataacct tctatgtaat gtattatcac atgcactaaa agcatcctct 1020 cctgatactg tctgtgaacc ccccacaagc atcaatcccc tcaatcagca cagctatgtt 1080 ctgctaaatc ctccacaagc accaaaccat caacacagcc acattctgcc ccagcaagtc 1140 agtttcaaac ataaaaaart aaaaaaaaaa aaaaactttg tgtctgtttt gtggcaggca 1200 gatgacaata gcagggctgt atcttttagt attatataga atacttttgt tctgccagat 1260 ctcccaggca gggttttatt tagatttatt tagttaattt tagtttttgg aattctgtcc 1320 attggaaaga agttctttca gaagaagaac agttttttga agtattattt gtttttattc 1380 tgtctattca gtgtccttca acaagaacgg tggtggtggg gttcataata ttcacaatgg 1440 tattttatta gttgttggtt taaccatgga tgagataatg gcttctatgt tattttcttt 1500 tcaatgacat ttcttgtcac atgttcaaaa acaacaacca atattgcatg tctataattt 1560 atataatggg tataattaaa caatactttc actataacgt aaattagaag tgtaatagaa 1620 aaaatatata tttttgcgcc attttgaatt ttactcgaat acaaatacaa atacaaatac 1680 ttttcccccc tcaacaaata caaatacaaa tacaaatacc ggctgctccg cacatcccta 1740 // ID Gypsy-27-I_DR repbase; DNA; ZEB; 5177 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portion of the Gypsy-27_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-27-I_DR; Gypsy-27-LTR_DR; Gypsy-27_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy-27-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5177 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-27_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 27-27 (2005). XX DR [1] (Consensus) XX CC Gypsy-27-I_DR is an internal portion of the Gypsy-27_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-27-LTR_DR. Gypsy-27_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1632-aa Gypsy-27_DR1p (pos. 71-4966) composed CC of the gag, protease, reverse transcriptase, and integrase CC domains. The consensus sequence was built from 5 copies less CC than 1% diverged from the consensus sequence. XX FH Key Location/Qualifiers FT CDS 71..4966 FT /product="Gypsy-27_DR1p" FT /translation="MEAENFLINFPDAAEGSCEGPSHNSPVTPQIDRSLLD FT EDTGMEQPFIGAPILSSESRTDDLAVLVQAIDSLKSVFVDTVDRQKKWLQD FT STELCTHEVPKKTLTVVEKAHSEFLDVLSDRMKEFTKHTIEAIQWNSKQTH FT NEFKAMFDEANSSMFSSFKRALEESRVGEKTLQEEVCSLRKELLAMQTSLD FT SFMQQRISLDTKVDEKVLVSGSQVCSNTSSHVDSSESISQPALSNLTCGRI FT PPIKMVFPTFGGVNDESDPVIYLERCNDFLALRPMSNSEILATMRSVLHAS FT ARDWWETVRFKINSWESFQKAFLAVFLPEDYQDVLEEKVRNRLQGTNESVR FT DFAFSFQALFKRWKKEATDTEVLKALLKAMNPCYASQLRGRAQTVEELVRL FT GTQLERDYDHQIKYNQLQLPKSSFLDQHLSKKSKGVEVKSNLPPEQSLTSL FT LCWRCHENHSPGSCSKFKAVLSGKQTGGQFQGTKEFQPQNQQRGMLSAYDT FT QRFVKKDVSSGRKLDKRDGIKILRQLVIPMMVRNCSGKALVDTGATYTLLN FT VDLWNKIKEPNERLDVWHEGPLYLANGDATVPLGQKVLDFYLQDLHFQVPT FT VVLPAQNLVYSMVLGLDFIALTGLQLNIKDQLYSFSDDPQGRVFPFQPPIT FT PGDSWKCCKPPGNYNPSASLYSAIPPVQFVCTANECVDKSDVGESGSVLME FT LLQSKIKESNLSERESHVLFTLLYDHPEVCTPNLGRTHLVQHKIAVSTDVV FT VAQKPYRLPIHKKEIVKEQIDDMLNQDIIQPSHSPWASPIVLVPKKDGGQR FT FCVDYRKLNAVSESDAFPLPTVNEILESLSGSGIFSTLDLNSGYWQVSMHP FT DSMAKTAFVSPFGLYEFKVLPFGLKNAPATFQRLMNRVLADYLGQCCLVYL FT DDIVVYSANFHQHVLDLQKVLRCLQRAGLTLKLPKCHFCLTEIKFLGHIVT FT TDGVKADPAKTEAIQNFPVPTNLKELQRFLGMSGWYHRYVQNFSDIAEPLN FT ALKKKGVRFQWTAECQVAFDCLKRHLSSPPVLGHPNHAHTFVVYTDASSTG FT LGAVLAQRPSTFGASEEVLAYASRTLTSAEKNYSTTERECLAVVWAVERWR FT HYLEGKSFIVVTDHASLLWVFNTTKTNSRLIRWALRLQEFEFILEYRKGKL FT NSAPDALSRIDVPDSCPMVASYVPKQSTESMVSLFPLCDEDIWIAQQQDVE FT IQRIYQSLAEGKQSDEGSGSEFVILEDKVYRKVSNPTKGTHFQIYVPQTLR FT EILLEAYHSNPLSGHFGRYKTQKRLMQVAFWPNMWRDVSDFVKNCTSCQQN FT KPECRKPAGKLQQTEVKEPWEMLGVDLMGPLPRSTLGNTQLLVVVDYYSHW FT VEMFPLRKATAGVIAQTLRKEVLTRWGVPKFLLSDRGPQFTSEILKDLCSR FT WGVVQKLTTAYHPQTNFTERVNQVIKVMISSYVFGEHNRWDHYLPELRYAI FT NSAVQESTGYSPAELLLHRNLRGPFELVLEPHQTGLRVLKDLQEVVKRNVR FT RAKEKQKRLYDARRRDVHFTRNDRVWMRAHPLSKASQAFAAKFAARWIGPY FT RIVEKLGPVNYRIVREDNGEDLRTVHVCNLKPAFPSAGELDRRERERVLKI FT FVEESEDEEFLGFE" XX SQ Sequence 5177 BP; 1481 A; 944 C; 1190 G; 1562 T; 0 other; gatttggcgc ccaacgtggg gccctgagat atgctaaata tttagtggtt tttgtaattt 60 ttgcttaata atggaagctg aaaactttct tattaatttt cctgatgcag cagaaggtag 120 ttgtgagggc ccatctcata acagtcctgt tactccacag attgaccgat cattattaga 180 tgaggacact gggatggaac agccatttat tggggctccg atcctctcct ctgaaagtcg 240 gacagacgat ttagcagttt tagtccaggc aattgattct cttaaaagtg tatttgtaga 300 tacagtagac cggcagaaaa aatggcttca ggacagtact gaactttgta ctcatgaagt 360 tcctaagaaa acattaactg tggttgaaaa agcacattct gaatttttag acgttctttc 420 tgaccgtatg aaagagttta ctaaacatac aattgaagct atacagtgga actcgaaaca 480 gacacacaat gaatttaagg caatgtttga tgaggctaac tcttcaatgt tttcttcctt 540 taagagagct ttggaagaat ctagggtagg agaaaaaaca ttacaggaag aagtatgcag 600 cttgaggaag gagctacttg ccatgcaaac ttcgcttgat tcctttatgc agcagaggat 660 ctcacttgac actaaagtag atgaaaaggt tctggtttct gggtctcagg tatgttcaaa 720 tacatcttca catgtagact ctagtgagtc cattagtcaa cctgctcttt caaatttgac 780 ttgtgggcgt attccgccta ttaaaatggt atttcctaca tttggaggag taaatgatga 840 gtctgatcct gttatttatt tggagagatg taatgacttc ttagctctaa gaccaatgtc 900 aaacagtgag atacttgcca ccatgcgtag tgttctgcat gcctcagccc gagattggtg 960 ggagacagta agatttaaaa ttaattcttg ggaaagtttt cagaaggctt ttttagcggt 1020 tttcctccca gaggattatc aggatgtgct cgaagaaaaa gtgcgtaatc gactgcaagg 1080 gacaaatgaa agtgttcgag actttgcttt ttcatttcaa gctttattta aacgctggaa 1140 aaaagaagct actgatactg aagttttgaa agctctttta aaagcgatga atccttgtta 1200 tgctagtcaa cttcgtggcc gtgcacagac tgtagaggaa ctggtgagat taggaacaca 1260 attagagaga gattatgatc atcaaataaa gtacaatcag ttacagttac ctaagagttc 1320 ttttcttgac cagcatttgt ctaaaaaatc aaaaggggta gaagtaaaat caaacttacc 1380 tccagaacaa agtttgacta gtttgctttg ttggcgatgt catgaaaatc attccccagg 1440 ttcttgttcc aagtttaagg ctgtcctgag tggaaagcaa actggtggac aatttcaagg 1500 aactaaagag ttccaacctc aaaatcagca acgaggtatg ctgtctgctt atgatactca 1560 gcgctttgta aaaaaagatg tgtcatctgg taggaagctg gataagagag atggtattaa 1620 aattcttcgt cagttggtta tccccatgat ggttcggaac tgtagtggta aagcacttgt 1680 tgatacagga gctacttata cattgctcaa cgtcgattta tggaataaga taaaagaacc 1740 caatgagcga ctagatgtat ggcatgaggg accactttat ctagctaatg gcgatgctac 1800 tgttcctctt ggccaaaaag tactggactt ctatttgcag gatttgcatt tccaagttcc 1860 aactgtagtt ctgccagctc agaaccttgt gtattcaatg gttttgggtt tagattttat 1920 cgcactgact gggttgcaac tgaacattaa agatcaactg tacagttttt cagatgatcc 1980 acaaggtcgg gttttccctt tccaaccgcc tattacacca ggggactctt ggaagtgttg 2040 taaaccacct ggtaactata atccatctgc ttctttgtat tctgctatac ctcctgttca 2100 gtttgtatgt acagcaaacg aatgtgtgga taagagtgat gtgggagagt ctggttctgt 2160 attgatggag cttttgcaaa gtaaaatcaa ggagagcaat ctttcagaac gtgagtctca 2220 tgttctgttt actcttttgt atgaccatcc tgaagtttgt actcctaatt tgggtagaac 2280 ccatcttgtc cagcataaga ttgcggtttc aacagatgtt gttgtagctc aaaaaccata 2340 cagactaccc atacacaaaa aggagattgt aaaggaacaa attgatgata tgctcaatca 2400 agatatcatc cagccatctc attcaccctg ggcatctcct attgtgttag ttcctaaaaa 2460 agatggaggg cagagatttt gtgttgatta tcgtaaactg aatgcagtca gtgaaagcga 2520 cgcatttcct cttcccacag tgaatgagat cttggagtcc ctttctgggt cagggatatt 2580 tagtactctg gatttaaata gtggatattg gcaggtgtcc atgcatccag atagtatggc 2640 caagaccgct ttcgtctcac cttttggcct atatgagttt aaggtgttac cttttggcct 2700 gaaaaatgca ccggccacat ttcaaaggct tatgaataga gtcttggctg actatctggg 2760 gcagtgttgt ttagtgtatc tagatgacat agttgtctat tcggctaact ttcaccaaca 2820 tgtcctagac ctccagaaag ttttgagatg tttgcagaga gcaggactaa ctctcaaact 2880 tccaaaatgt catttctgtt taacagagat caagtttctt ggccatattg tgactactga 2940 tggtgtaaag gcagaccctg ctaaaacaga agctattcag aattttccag tcccaacaaa 3000 tttaaaggaa ctccaacggt ttctgggaat gagtgggtgg taccataggt acgttcaaaa 3060 tttttcagat attgccgaac cacttaatgc tctaaagaag aaaggagttc gttttcagtg 3120 gacagctgag tgccaggtag cgtttgactg cctcaaaagg catctttcct caccacctgt 3180 acttggacac ccaaatcatg cccatacatt tgtagtttat actgatgcca gttcaaccgg 3240 cttgggtgct gttcttgctc aacgaccctc cacttttggt gcatctgagg aggtgttggc 3300 gtatgctagc cgtactctga catcagcaga gaaaaactat tctacaactg agcgggagtg 3360 tttagcagtg gtttgggctg tagaacggtg gcggcattat ttggagggaa aatccttcat 3420 tgtagtcact gatcatgcct ctcttctttg ggtgttcaac actacaaaaa caaattcccg 3480 actgattcgc tgggctttga gactacagga gtttgagttt attcttgagt atcgtaaggg 3540 gaagcttaat agcgccccag acgctttgtc tcggattgac gttcctgatt cttgtccaat 3600 ggtagcttcc tatgttccta agcaaagtac agagagtatg gtgtcactat ttcctctttg 3660 tgatgaggat atctggattg cacagcaaca agatgtagaa atccaaagga tatatcagag 3720 cttagcagaa ggaaagcaga gtgatgaggg atctggctct gagtttgtca ttttagagga 3780 taaggtctat aggaaggtgt ctaatcctac aaagggaact cattttcaga tttatgttcc 3840 acagactctc agagaaattt tgttggaggc atatcactcc aatccattga gtggtcattt 3900 tgggcgttat aaaactcaaa aaaggttaat gcaagttgcc ttttggccta atatgtggag 3960 agatgtatct gactttgtga agaactgtac tagttgtcag cagaataaac cagagtgtcg 4020 taagcctgct ggaaaacttc agcaaactga agtgaaggag ccttgggaaa tgcttggtgt 4080 ggaccttatg gggccactgc ctcgtagtac cctgggtaac actcaattgt tagttgttgt 4140 ggactattac agtcactggg ttgagatgtt tcctcttcgc aaagctactg ctggagtaat 4200 tgcccagaca cttaggaagg aagtactgac tcgatggggt gttccaaagt tcctactatc 4260 tgacagggga cctcaattta catctgagat tttgaaggat ctgtgtagca gatggggagt 4320 ggtacaaaaa ttaacaacag catatcatcc ccaaactaat tttacggaac gtgtgaatca 4380 ggtaattaaa gtcatgatct cctcttatgt gtttggtgag cataatcggt gggatcatta 4440 cttaccagag ttgagatatg ccatcaactc tgctgttcag gagagtactg gatactcccc 4500 agcagagtta ttattgcaca gaaatctgag aggacctttt gaacttgtgt tagagcctca 4560 ccagactggt cttagggttc ttaaagactt gcaggaagtg gtaaaaagaa atgtgcgtcg 4620 ggctaaggaa aaacagaagc gtttgtatga tgcgagacga agagatgtgc atttcacaag 4680 aaatgataga gtttggatga gagcacatcc tctttctaag gcttctcaag catttgcagc 4740 taaattcgca gctagatgga ttgggcctta tcgaattgtg gaaaagcttg gtccggtgaa 4800 ctaccggatt gttcgggagg ataatgggga agatttacgt acagttcatg tatgcaattt 4860 aaaacctgca tttccctccg caggggaatt agatcgtaga gagagagaaa gggtcctgaa 4920 aatttttgtc gaagagtcag aagatgagga atttctcgga tttgaatagt cctgagatcc 4980 agtaatcact taactgattt ccatatttat agatcagatt atttggtgag tagagttatt 5040 ggtatggaaa gctgttgagt gtctttttat tgctgcataa tttttaggtc ttaaaaaacc 5100 cctaaagatt tcttgtaagt aacttataga tcacaacctc tttaagtttg actgatcttt 5160 ttccatgggg gggagag 5177 // ID Gypsy148-LTR_DR repbase; DNA; ZEB; 708 BP. XX AC chr5; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy148-I_DR; KW Gypsy148-LTR_DR; LTR71_DR; LTR_Retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-708 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1558-1558 (2008). XX DR Genome; chr5; Positions 32565351 32566058. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 708 BP; 139 A; 135 C; 149 G; 285 T; 0 other; tgttacgact gaagtttatt tatttttttt ttattattgc ggattgtaca cctttttctt 60 ttgtcccgcc tccttgctgt tttctgatgt cttataggct cccacgccgg cgtgattccc 120 tgctctgatt ggctgctggt ctacgaggcc tgtataaaag ggagcttccg gcaggaatcg 180 gagagctcat ttctgctgcg ttcggtgttt ggagtggagc tcctgggttt ccctcacccc 240 ttcggccgac gaccaacaat tatcactttt gtattgatga ccatcctatg ctttaatatt 300 tattgagagt ttgagcgtgt aggggtgacc tgcctattta tttgactact tttaactttg 360 tttatattag ggagtcaggt aagattcttg tatgtttctt taaatagtaa taggtaattt 420 agtggattta ttttatagat tcttttgttt gttattttgg cctgtggtca ccctgacgaa 480 atgctcgttt atacaaaccc tatttttcta taataaatat ctttatatat actctcaatt 540 gagtgtgtga gtttggttgt cgaccgaggg gaatattttg ctttaagttt tgtttagttt 600 gggaaatcca ccaccccaac acatttgttt gttcttcgtt tattatgccc ttttcgtttc 660 ccctagactt ggggcgtaat aattgggggc tcgtccgttt tttcctca 708 // ID Gypsy116-LTR_DR repbase; DNA; ZEB; 370 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy116-I_DR; KW Gypsy116-LTR_DR; Gypsy116_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-370 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1520-1520 (2008). XX DR Genome; chr20; Positions 51684865 51684496. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 370 BP; 113 A; 56 C; 85 G; 116 T; 0 other; tgtaacgggg gaaaaatgaa actgtgaatt aagtatgaac ggtgtcaaaa tgcttgtttt 60 tgttaaaaat atctgtctaa ttaaatcaaa atcctatgtt cactgtttaa ctgtgtaata 120 catcatgggg aaaaacagac acctacaggt gagaaagggg gtataatggg tggagtcagc 180 cggccatttt gtgagttcac ttgttcaaga gctgttgaga gtgatttttt ctgatgacca 240 gcccttgtac acttattgtc tgtgtggagc cctgtttttc ttttggagaa cactataaag 300 tgaacttgtt caggtaaact gcttccgcgt gatcattata aaaaacatat acgaactaag 360 aggggttaca 370 // ID Gypsy-28-I_DR repbase; DNA; ZEB; 4299 BP. XX AC . XX DT 31-MAY-2005 (Rel. 10.05, Created) DT 31-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE An internal portion of the Gypsy-28_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; endogenous retrovirus; gag; integrase; KW reverse transcriptase; Gypsy-28_DR; Gypsy-28-LTR_DR; KW Gypsy-28-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4299 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-28_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(5), 118-118 (2005). XX DR [1] (Consensus) XX CC Gypsy-28-I_DR is an internal portion of the Gypsy-28_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-28-LTR_DR. Gypsy-28_DR is characterized by variable 5-6-bp CC target CC site duplications. The internal portion encodes one CC polyprotein: the 1364-aa Gypsy-28_DR1p (pos. 18-4109) composed CC of the gag, protease, reverse transcriptase, and integrase CC domains. XX FH Key Location/Qualifiers FT CDS 18..4109 FT /product="Gypsy-28-I_DR1p" FT /translation="MDPASAAEIQEVLSRSVARMDHQDQQMKMTSQAIQAL FT VNQVSELTTQVQRLSSGLSAGSSLPAPSVDPPASPLVNETPRTTEPRLPPP FT QPYAGEPHLCRAFIAKCSLYISLQPSSFPTEESKVAFLINLLTGKAALWGT FT TAWEKKLPCCLSFKSFSEELKTVFDQAASGREASRRLAELRQEDHSVADYS FT IDFRTLAAECGWNSEAQWDMFFHGLADHIKDEIYALELPKTLDGLISLAIR FT VDNRLQRRGVHKEFRPQACFPEATTPDPEPEPMQVGRFRLSSEERRRRRTG FT GLCMYCGVGGHYAAVCPVKNKPSTGKSFLVGEILTGKDTSGSTFLPIKLGV FT NSNFHSCDALIDSGAEGSFIDSDLVNKLKIPCFPLSSPIAVHALSGLPLPT FT ITHITAPIRLITSGNHTETISFFLTKTVKNPVILGHPWLVLHKPHINWGHN FT TVFSWSESCHKSCLLSACSTVSCSVFQEEQVDLSNVPREYHDLKRVFSKSR FT AASLPPHRPYDCAIDLLPGTSPPKGKLYSLSVPEREAMEKYISDSLAAKII FT RPSSSPAGAGFFFVKKKDGSLRPCIDYRGLNSITVKNTYPLPLMSSAFERL FT QGANFFTKLDLRNAYHLVRIRPGDEWKTAFNTPRGHFEYCVLPFGLSNAPA FT VFQALVNDVLRDMIDQFIYVYLDDILIFSHSLQEHIQHVRRVLQRLLENGL FT YVKAEKCVFHAQSVQFLGHIVSVEGMRMDPEKIQAVVDWPTPDSRKALQRF FT LGFANFYRRFIRNFSQLATPLTSLTSSKTPFRWSSAAEAAFSKLKGCFVSA FT PILIAPDPSRQFVVEVDASEVGVGAILSQRSASDGKVHPCAYFSHRLSPAE FT RNYDIGNRELLAVKLALEEWRHWLEGSGVPFIVWTDHKNLEYIRSAKRLNS FT RQARWALFFGRFNFTISYRPGSKNIKPDALSRLFDPSDRKSSSDSVLPQNI FT VVANISWEIESRVRAALDGVTPPIGCPPSCLFVPEGLRSDVVRWGHSSKVA FT CHPGVSRTLFVIKQRFWWPAMARDVRDFVLACSVCAVSKSSNRPPAGLLQP FT LSVPSRPWSHISLDFVTGLPSSNGNTVVLTVVDRFSKAAHFISLPKLPSAR FT ETAVAVIDHVFRIHGLPTDVVSDRGPQFVSKFWREFCRLLGATVSLSSGFH FT PQSNGQTERANQDLERTLRCLVSQNPSSWSQQLSWVEYAHNSLPVSATGLS FT PFQCSLGYQPPVFPSLESEVAVPSVHAFVQRCRRTWNRARQTLLQVGKRTK FT AKADRHRSKPPVYVVGQKVWLSTKNIPLRTVCNKLAPKFIGPFIVTKIINP FT VAIRLKLPPAYRRIHPVFHVSKVKPVFHTAINPHTPVPPPAASRRWGDCLF FT G" XX SQ Sequence 4299 BP; 950 A; 1100 C; 1073 G; 1176 T; 0 other; agaacgaact gaccaacatg gacccagcaa gtgccgcaga aattcaggag gtgctttcac 60 gcagtgtcgc caggatggat catcaagacc aacagatgaa aatgaccagc caagccatcc 120 aagcactggt caatcaagtt tcggagctga ctacacaagt ccaacgactc agctcgggtc 180 tcagcgccgg atcctcctta ccagctccct ccgtcgaccc accagccagc ccgttggtca 240 acgaaactcc ccgaaccacc gaaccacgac tacctccacc acaaccttat gcaggtgaac 300 cccatttatg tagagccttt attgccaaat gttcacttta catatctctt caaccatcat 360 cttttcccac cgaagagagc aaggtggcgt ttttaattaa tctattgact ggcaaagcgg 420 cattatgggg aaccacggcg tgggagaaga aacttccctg ctgtctgtct tttaagtctt 480 tttcagagga gctcaagacg gtgtttgatc aggctgcttc ggggagggag gcatccagac 540 gtctcgcgga gcttcgccag gaggatcaca gtgtggcgga ttactccatt gatttccgca 600 ccctggcggc tgagtgtggc tggaacagtg aagcccagtg ggacatgttc tttcacggac 660 tcgcggacca cataaaagac gaaatttatg cattggagct gcccaagacg ttggacggtc 720 tcatcagtct ggctataagg gtggacaacc gtctgcagcg cagaggcgtt cacaaggagt 780 tccgtcctca ggcctgcttt ccagaggcta caacccccga tccggaacct gagcccatgc 840 aggtgggaag atttcgcctc tcgtcggagg agagacgtcg tcgtcgcaca gggggactgt 900 gcatgtactg tggggttggt ggtcattatg cagccgtatg tccagtcaag aacaaaccat 960 ccactggtaa gagttttttg gtgggcgaaa tattgactgg gaaggacact tcaggttcca 1020 cttttctacc cattaaattg ggagttaact cgaactttca ttcctgtgat gcgctcatcg 1080 attcaggagc tgaagggagt tttatcgatt ctgatctagt taataagtta aaaattcctt 1140 gttttcctct ttcttcaccc atagctgttc atgcacttag tggactacct cttcctacta 1200 tcacgcacat cacggcccct attagactta ttacttcagg aaatcatact gaaactattt 1260 ccttttttct cactaagact gttaagaatc ctgtcattct gggacatccg tggttggtgt 1320 tacacaagcc acatattaac tggggccaca acacagtttt ttcttggagc gagagctgtc 1380 ataagtcttg tttattgtct gcgtgttcca ctgtgtcttg ttctgtgttt caggaagagc 1440 aggtggatct gtcaaacgtg ccccgtgagt accatgacct gaagagagtg ttcagtaagt 1500 cccgggctgc ttctctacct cctcaccgtc cctatgactg tgctatagac ttattgccag 1560 gtacgtctcc gcctaaaggc aagttatatt cgctgtctgt tccagagagg gaggccatgg 1620 agaaatatat ttctgattct ctagcagcca agatcatccg cccgtcttct tcaccggcgg 1680 gggcgggatt tttttttgtg aaaaagaagg atggttccct tcgtccgtgc atagactatc 1740 gagggctgaa cagcatcacg gttaagaata cttatccttt gccgctgatg tcttcagcct 1800 tcgagcgtct gcagggggcg aactttttca caaaattaga tcttcgcaat gcttatcatt 1860 tggttcgcat aagacccggg gatgagtgga aaacagcgtt taacacccct agagggcatt 1920 ttgagtattg cgttctgccc ttcggccttt ccaacgctcc ggctgttttt caggcactcg 1980 tcaatgacgt gttgagagat atgatagatc agtttattta tgtctacctg gatgacattc 2040 tgattttttc ccactctctc caggagcaca ttcaacatgt cagacgagtg ttgcagaggc 2100 tgttagagaa tgggctttat gtcaaggcgg agaaatgcgt ttttcatgca cagtcggttc 2160 agttcttggg acatatcgtg tcagtcgagg ggatgcgcat ggatccagag aagattcagg 2220 ctgtggtaga ttggccaacc ccggattccc gtaaggccct gcagaggttt ctgggttttg 2280 ccaatttcta ccggcgtttt atccgcaatt tcagccagct cgccacacct ctgacttcct 2340 tgacctcctc caaaacgccg ttcaggtggt ccagtgcagc cgaggctgcc ttctccaaac 2400 taaaaggctg tttcgtttca gccccgatcc tcattgcccc tgacccttcc aggcagttcg 2460 tggtggaggt cgatgcgtcg gaggttgggg tcggagccat cctgtcccag cgttccgcct 2520 cggatggcaa ggtccatcct tgcgcgtatt tttctcatcg attatctcct gcagaaagaa 2580 attacgacat tggtaatcga gagttgttgg ccgtcaagct tgctttggag gagtggcgtc 2640 actggttaga aggctcgggg gtgcctttta tcgtctggac cgatcataaa aatcttgaat 2700 atattagatc cgctaaaaga ttgaactcca ggcaggctcg gtgggcatta tttttcggac 2760 ggtttaattt taccatctcg tacagaccag gttccaaaaa catcaaacct gatgcgttat 2820 cacgtctttt cgatccttca gatcgcaagt catcttctga ttctgtactt cctcagaaca 2880 tcgtcgtagc taacatatct tgggagatcg agtcgagggt ccgtgcagcc ctggatgggg 2940 taacgccccc gatcggatgt ccaccgagtt gtttgtttgt gccagaggga ttacggtccg 3000 acgtcgtccg gtggggtcat tcctccaagg tagcttgtca cccaggggtg agtcgcacat 3060 tatttgttat caaacaacga ttttggtggc cagctatggc tcgcgatgtg cgtgattttg 3120 ttttggcttg ctctgtctgt gcggtttcca agtcttccaa tcgcccacct gctggactcc 3180 tccaaccgct gtcagtgcct tcgagaccct ggtcacacat ttcgctagat tttgttacgg 3240 gtcttccatc ctctaacggt aacacggtag ttttgaccgt ggtggaccgg ttctcgaagg 3300 ctgctcattt catctctctg cccaaattac catcagccag agagacagcg gttgctgtca 3360 ttgatcatgt ctttcgcatt catggcctcc cgacggatgt ggtctctgac agggggcctc 3420 agtttgtctc taaattttgg agagaattct gccgtttatt gggggccact gttagtcttt 3480 cttctggttt tcatccccag agtaacggcc agaccgagag agccaatcag gatcttgagc 3540 gcacattgcg atgtttggtc tcgcagaatc catcctcttg gagccaacaa ctttcgtggg 3600 tggagtacgc acataattcg ttaccagtgt ctgccacggg cctttctccg tttcagtgta 3660 gtctaggtta ccagccacca gtttttccta gtctggaatc tgaagttgcg gttccctctg 3720 ttcacgcctt tgtccagagg tgccgccgca cttggaatag ggccagacag accctcctcc 3780 aagtggggaa gcgcaccaag gctaaagccg atcgccaccg gtctaagcct cccgtttacg 3840 ttgtcggtca aaaagtgtgg ctttctacga aaaacattcc gctccgcacc gtctgcaata 3900 agttagctcc taaatttatt ggccctttta ttgtcactaa gatcattaat cctgtggcaa 3960 tccgcctcaa attacctcca gcgtacagga gaattcatcc cgtgttccat gtatccaaag 4020 taaagcccgt ttttcatacg gcaattaatc cgcacacacc agttcccccc cccgccgcgt 4080 ctcgtagatg gggagactgc ttattcggtt aagcgcattc tggactctag acggagggga 4140 cgaggatttc agtacttggt ggactgggaa ggttacggtc cggaggagag aagttgggtt 4200 ccggccgggg acatattgga ccactctctt atcgatgatt acaatcgcca ggtaggctcg 4260 tctggggacg ccaggaggcg tccgtagggg agagggtaa 4299 // ID Tc1N1_DR repbase; DNA; ZEB; 479 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE Tc1N1_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW Dr000102; Dr000304; nonautonomous DNA transposon; TA target site; KW TIR; Tc1 superfamily; Tc1N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 11-468 RA Bao Z.; RT "Dr000304, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-479 RA Kapitonov V.V. and Jurka J.; RT "Tc1N1_DR, an ancient family of nonautonomous Tc1-like DNA RT transposons from zebrafish."; RL Repbase Reports 2(6), 24-24 (2002). XX DR [2] (Consensus) XX CC Tc1N1_DR is an ancient family of nonautonomous DNA transposons CC that belong to the Tc1 superfamily. CC Tc1N1_DR copies are flanked by the TA target site duplications CC and it has ~180-bp imperfect terminal inverted repeats. CC There are several thousand copies of Tc1N1_DR harbored CC by the zebrafish genome, they are ~13% divergent from CC the consensus sequence. XX SQ Sequence 479 BP; 166 A; 70 C; 62 G; 181 T; 0 other; cactgtaaaa aatatccgta aattagcagt tttccgtatt ttgtgattca tgttttcaat 60 tttaatttat gcttttgaat tgcattatgg gaccttgatc tttcttccaa caacttttaa 120 ccttaaaaag tttgaaaaag tgacttttat taacattttt aatagtttaa agtgatatat 180 tgtctgggtt ggtgttgtat attacggtac aaaaaccttg taataaactg ccagtacatt 240 ttctgttatt ttacagactt atttctctat atattatttc ttatacatta tattatttca 300 aactactaaa atgtcaataa aagtcacttt gttaaactgt agagttgaat tttcaacatc 360 aaaagtcgac agagcagaga tcaacatccc ataatgcaat tcacaaccgt aaataaatgg 420 aaaacacttg tgaatcacaa aatacggaaa ctgttaatta acagatattt tttacagtg 479 // ID Gypsy137-I_DR repbase; DNA; ZEB; 6687 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy137-I_DR; KW Gypsy137-LTR_DR; Gypsy137_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6687 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1535-1535 (2008). XX DR Genome; chr3; Positions 1046833 1040147. XX CC Positions [2928-3467] - Reverse transcriptase CC Positions [4938-5414] - Integrase core CC 'AATA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 295..1695 FT /product="Gypsy137-I_DR_1p" FT /translation="MEHAKLTKSHYVGTPVGIGRGRGLLATPIPFPDISPS FT PTMCGVNNNVGLKADVIHCQDSNVDPVYVNYTSSDSAQPINTSTPVSANDM FT MSQMGNIVQLVGQQLADSILTHLNLHSQTEGASRHTQTDHGSKYTPEQKSM FT LNASQIQVVRQREIRDPPIFRGDKTDTVTIEEWVELMKNFIRKGVLPIDEQ FT GEEILIHLRGKAKDVVKVGMLSSSLDIKSNPDAVYTLLRKHFSCQQYSPIP FT LQDFYTTLPEPQEDPFDYWLRLNRAADITAECLKQQGKVLDNQLIEVTRMF FT IRNCPNSELALTFRSKTIDKWTAHEVHEILNEYHLEKNLRASGKAHGLFQC FT EKKFSVNEMHVTPSSCTVKVEQDSLQSKPSENMALEKVIDMLERVLMNNSS FT NAQATKQNWKRSNMPRIPGLNDSPCLVCKDPSHSAFTHCRDNRLCFLCFSS FT GHARNKCPKERRNSTSVNQQAN" FT CDS 1770..6335 FT /product="Gypsy137-I_DR_2p" FT /translation="MDYESLCQMISETSPHKTVIFQGVQKVPRADSLFHTS FT VSVEKDLILKSLIDSGSMACTISESTEKKLLESNPTITSQSAQDIVIVGCG FT GHHITPTAVYDLKATVYGCPMLIPAMVVPGQTEEMILGTNVIKELLMHLRE FT TNGYWRLMSKSNDNESDECSMFLSLLSNTERWRGESIPNKVGTVKLLRSVV FT LEPQTEHLVWGKLPQSTVLSVGSTVVVEPTMFRSRPRNILVGRIITPLWGD FT GSIPLKVINPTNRRIVLKRNAKIADVSPCIAVQDLPLPRQIHSNLQCTEKP FT LPVRSDDEMKQVLSDLNLGDLDLESCEVSARWKDNLLSIIEKYESVFSRDK FT MDCGEAIGFVHRINLTDEKPFRFPYRRVPPSQYAKLRTALNEMEEKGIIRK FT SHSDYASPLVLVWKKNGDLRICTDFRWLNARTVRDAYPLPHQSDVLAALGG FT NSFFSTMDLTSGYYNVPLEEGHKKYTAFSSPFGLHEYNRLPQGLSNSPATF FT MRMMMSVFGDENFSSVLCYLDDLMVFAPSESVALQRLEMVLSRLSHHNLKL FT APKKCCFLRRSVKFLGHIISEEGIKTDPGKVEAINMIQASDLMEPDGKTPC FT QKKIRSFLGMVLYYQHFIEGCSAKAKPLFKLTSGTVKQTPVMKGRRPKKKV FT TYLKLSPADWTAECEEALQTLKVDLTKNVTLAHPDFDQPFILAVDASFDGV FT GAVLSQVLPGEDIARPIAFASRTLSRSQMNYPAHRLEFFALKWAITEKFSH FT WLRGRHFTAWTDNNPLTYILTKPRLDACEQRWVAKLASYSFDLKYVPGTKN FT VVADALSREPFVESCIGHRLITEPYLSLLKDVSGVVDNSVQDAFKCVNNHQ FT SVKKVGENSCDSTSDESLHSGSFGSQEVSAVLAAHSTGGLSEVMGAVPPLF FT QPQQDSPPLLHSTIISEQGQDTTLHRILYYIERRRKLSKKEQAKETKSVRQ FT LLKHHDKLAICNGVLYKVKRDPKLNKKLYLVVVPASLKAQVLHGIHDAAGH FT QGRSRSLSLARQRFFWIGMKKDIDDYVKTCHRCVVGKTSEPNACAPLESIR FT TSEPLELVCIDFWSAELRDGKCVDVLIVTDHFTKMAHAFPCQNQSAKQVAK FT RLWNDFFLVYGFPKRIHSDQGANFESKLIKELLTMAGVDKSHTTPYHPMGN FT GIAERFNRTLGSMIRTLPPKTKSKWPQMLQQLTFCYNCTEHETTGFAPFFL FT MFGRVPRLPIDVLFQNVLLNEDVVDYKDFVSTLRSDLREAARIAQMHTLKE FT TDRHGRLYNRKVKGCPLAVGDRVLVANRGEKGRRKVADKWESSPYEVITVY FT PNINVYRIREVNSDKVRVVHRNLLLPVNFLPVDEPQEQDEQEDDRNDEIVA FT CGEESDIDGRTANWILSNPEDFVEDVDQSMDANLTNVSDCSLDMLDSASLD FT AGNLNENDVEEINTVERSDATVTGDVDLPEEGGQNLSNLTEREQFTKATEE FT LVVCTQPKCLDSAVAQTRVGRIIKPPKRLICEMNEQHMDSSDSTVSSFVYL FT VKSFFT" XX SQ Sequence 6687 BP; 2035 A; 1290 C; 1486 G; 1876 T; 0 other; ttggtgccgt gactcaggac ttcaatcaag gtggtcgaag atcatccaaa catcgaaccg 60 gtgggccagt cttcatcctt cactgcacaa gcccagcagc taagctttta ttagttaatc 120 caggacttta tatcaaggac tttaaacaac aaacggaaaa aaaaagcact gatagatgga 180 ctatctgaca agtttacacc aaaaagtggt ttggtattat tgaaacttta tagcacatca 240 attttatgat ttttttactt tctttttcac actgaatgca ctaattaatt gcttatggag 300 cacgcaaagt tgactaaatc ccattatgtg ggcactcctg ttggtattgg tcgaggaaga 360 ggtttattag caacaccaat tcccttcccg gacattagcc ctagccctac tatgtgtggt 420 gtaaacaata atgttgggtt aaaagcggat gtaattcatt gtcaggattc taatgtggac 480 cctgtttatg tgaattatac tagttctgac agtgcccagc caattaacac ctcaacccca 540 gtctctgcca atgacatgat gagtcagatg ggtaacatag tacagctagt aggtcaacag 600 ttagcggata gcatacttac acacttgaat ttgcacagtc aaacagaagg ggcatcaagg 660 catactcaaa ctgatcatgg tagtaaatat acacctgagc aaaagtccat gttaaatgct 720 tcacagatcc aagtagtacg gcaaagggaa attagagacc ctccgatttt cagaggcgac 780 aaaacagaca ctgttacaat cgaggagtgg gtggagctca tgaaaaactt tataaggaag 840 ggagttttgc ctattgatga acagggggag gaaattttga tacatttgag gggcaaagcc 900 aaagatgttg ttaaagtggg aatgctgtca agcagtttgg atatcaaatc aaatcctgat 960 gcagtttaca ctctattacg aaagcatttc agttgccaac agtattctcc tattccatta 1020 caagatttct acactacact cccagaacct caggaggatc cctttgatta ctggctgcgt 1080 ttaaaccgtg ctgcagatat cacagctgag tgtctaaaac aacaaggaaa ggtgctagac 1140 aatcaattaa tcgaagtcac acgcatgttt attcggaatt gtccaaattc agagcttgca 1200 ctgacatttc gctcaaaaac catcgacaaa tggacagctc atgaagtgca tgagattctg 1260 aatgagtatc atttggaaaa aaacctcaga gcttctggaa aagcacatgg gctttttcaa 1320 tgtgagaaga aattttctgt gaatgagatg catgttactc ctagctcttg cacagtaaaa 1380 gttgaacagg attcacttca gtctaaacca tctgaaaaca tggctctaga aaaggtgatt 1440 gatatgctgg aaagagtact gatgaataat agtagcaatg cacaggctac taagcagaat 1500 tggaaaagaa gcaacatgcc cagaattccg ggtttgaatg attcaccatg cctagtgtgc 1560 aaagacccct cacattctgc ttttactcac tgtagagata acaggctgtg ctttctgtgc 1620 ttctcatctg gccatgccag aaacaagtgt ccaaaggaaa ggagaaattc tacctcagtt 1680 aatcagcagg caaactaaat gatctacgtg tgagggagga tcacgtagat cttaagagtg 1740 aatctcccat aactgatttt gatgatgtaa tggattatga gtctttatgc caaatgatca 1800 gtgaaacatc tccacataaa actgttatat ttcagggggt gcagaaagtt ccaagagctg 1860 atagtttgtt ccacacgtct gtttcagttg aaaaagatct tatccttaaa tctttaatcg 1920 atagcggatc aatggcgtgt actattagcg agtccactga aaaaaagcta ctggaaagca 1980 acccaacaat taccagtcag tcagcacaag acattgttat agttggttgt ggtggtcacc 2040 atattacccc taccgcagtt tatgatctca aagctacagt gtacggctgc cccatgttaa 2100 ttcctgctat ggttgtgcct ggacaaactg aagagatgat tttgggcaca aatgtcataa 2160 aggaacttct gatgcatttg agagaaacta atggctactg gaggctcatg tctaagtcaa 2220 atgacaatga gagtgatgaa tgctctatgt ttctatctct tctctctaat acggaaagat 2280 ggagaggaga atccatacca aacaaagttg gtactgtaaa gttactcagg agtgttgttt 2340 tagagccaca gactgaacat ctggtttggg gtaagctgcc gcagtctact gtcttgtcag 2400 ttggcagtac tgttgttgta gagccgacaa tgttcagatc aagaccgaga aacatactgg 2460 ttgggaggat cataacacca ctatggggag atggttctat ccccttaaaa gtcatcaacc 2520 ccactaatcg cagaattgtg ttaaagagaa atgcaaaaat agctgatgtg tcaccttgta 2580 ttgccgttca agatttaccg ctgccaagac agattcactc aaatttgcag tgtactgaaa 2640 aacccttacc ggttaggtca gatgatgaga tgaagcaggt cttgagtgac ttgaatcttg 2700 gggacctgga tttggagtca tgtgaagtgt cagctcgctg gaaagacaat ttgttgagca 2760 ttattgagaa gtatgagtct gttttctcta gagacaaaat ggattgtgga gaggcaatag 2820 gatttgtcca tcgaattaac ctcactgatg aaaagccttt tcgttttcca tacaggcgtg 2880 taccaccaag ccagtatgcc aaactgagaa cagcattaaa cgaaatggag gaaaaaggta 2940 taattcgcaa gtcgcatagc gactatgcgt ccccactcgt actcgtttgg aaaaaaaacg 3000 gcgatctcag gatctgcact gactttaggt ggttaaatgc gaggactgtt agagatgcct 3060 acccgttacc tcatcagtct gatgtgttgg ctgctttggg tggaaactca ttcttctcaa 3120 caatggatct cacctcgggg tattataatg ttccactgga agaaggacac aaaaagtaca 3180 ctgctttctc ttccccattt ggacttcacg agtataatag gcttcctcag ggtctctcta 3240 atagccctgc cacttttatg agaatgatga tgtcagtctt tggagatgag aatttcagca 3300 gtgttctctg ttaccttgat gatttgatgg tgtttgcccc ttccgagtct gtggcccttc 3360 aacgtcttga aatggttctc tcacggttgt cccatcacaa tctgaaactg gctccaaaga 3420 aatgttgttt tctgagacgc tctgttaaat tccttggaca catcattagt gaggaaggaa 3480 ttaagactga cccagggaaa gtcgaggcaa taaacatgat tcaagcttct gatttaatgg 3540 aacctgatgg aaaaacaccc tgtcaaaaaa aaatcagatc ctttctcgga atggtgttat 3600 attatcaaca tttcattgaa ggatgttcgg caaaagctaa gcctctattc aagttgacat 3660 caggcacagt caaacagact cctgtaatga aaggacgcag accaaaaaag aaagttactt 3720 acctcaaact ttccccagcc gactggacag cagaatgtga agaggcttta cagactctga 3780 aagtagactt gacaaaaaat gtgacactgg ctcacccaga ttttgaccaa ccttttatct 3840 tggctgtcga tgcatctttt gatggtgttg gagctgtgtt gtcgcaagtt ttgccaggtg 3900 aggatattgc cagacccata gccttcgcta gccgaacttt atctcgctcg caaatgaact 3960 atccagccca cagattggag ttctttgctt taaagtgggc aatcaccgag aaatttagcc 4020 actggttgag aggaagacac tttacggcat ggactgacaa taatcctcta acatatattt 4080 taactaaacc acgacttgat gcatgtgagc aaaggtgggt ggcaaaactt gcttcttaca 4140 gctttgatct aaagtatgtc ccaggaacaa aaaatgttgt tgcggatgct ctgagccgtg 4200 agccatttgt tgagtcttgc ataggacacc gcctcattac tgaaccatac ttatctctct 4260 tgaaggatgt cagtggtgtt gttgataatt cagtccaaga tgctttcaag tgcgtaaata 4320 accatcaatc agtaaagaaa gttggtgaaa attcatgtga ctctaccagt gatgaatcct 4380 tgcattcagg atcttttgga agtcaggaag tttctgctgt actagcagca cattcgactg 4440 gtggtttgag tgaagtgatg ggggcagttc ctcctctctt tcaaccacaa caggatagtc 4500 ctcctttgct acacagtacc atcattagtg agcaggggca ggataccact ttgcatcgga 4560 ttctctatta tattgagcga cgtcgaaaac tatctaagaa agaacaagca aaggagacta 4620 aaagtgttag acaactgcta aaacatcatg acaaactggc tatttgtaat ggagtgcttt 4680 ataaggtgaa aagagatcca aaactgaaca agaaattgta ccttgttgtt gtacctgctt 4740 cattgaaggc tcaagttctt catggaatcc atgatgcagc aggacatcaa gggcgtagta 4800 gatctctatc actggcaaga caacggttct tttggatcgg catgaaaaaa gatattgatg 4860 attatgtgaa gacctgtcat cgatgtgtag ttggtaaaac ctcagaaccc aatgcttgtg 4920 ctccactgga aagtatccga acatctgaac ccttggaact agtctgcatt gacttctggt 4980 cggcagagct cagagatggg aaatgcgtgg atgtgctaat agtgactgat cattttacaa 5040 aaatggctca tgcattccca tgtcagaatc agtcggccaa gcaagttgca aagaggctct 5100 ggaatgactt ctttctggtt tatggattcc ccaaaagaat tcactctgac caaggggcta 5160 atttcgagag taaactaatc aaggaactat tgaccatggc aggtgtagac aagtcacaca 5220 ctacaccata ccaccccatg ggtaatggca ttgcagaaag gttcaataga acacttggaa 5280 gtatgatcag gacgttgcct cctaaaacaa agtcaaaatg gccacaaatg ctacagcaac 5340 ttaccttctg ttataactgc acagaacatg agacaacggg ttttgcgcca ttctttctca 5400 tgtttggcag agttcctcgt ctcccaattg atgttttatt tcagaatgtg cttcttaatg 5460 aggatgttgt ggattacaag gactttgtgt ccacactgag aagtgacttg cgagaagcag 5520 ctcgtatagc tcagatgcac actctgaaag aaacggacag acatggcaga ctttacaaca 5580 ggaaggtgaa gggttgccca ttagctgttg gggatagagt gttggttgca aacagaggtg 5640 aaaaaggaag aagaaaagtt gctgacaaat gggagtcatc tccatatgaa gtcataacgg 5700 tgtatcctaa tatcaatgtt tacaggattc gggaagtcaa ctctgacaaa gtgagggtag 5760 ttcatagaaa tctgctgtta cctgtgaact ttcttccagt tgatgaaccc caagaacagg 5820 atgaacaaga ggatgacagg aatgatgaaa ttgttgcctg tggagaggaa tctgatattg 5880 atggcagaac ggcaaactgg atattgagta atccagagga ttttgttgag gacgttgatc 5940 aatcaatgga tgcaaactta accaatgtgt cggactgttc tctagatatg ttggactctg 6000 caagccttga tgctgggaat ttgaatgaaa atgatgttga agagataaat actgttgaaa 6060 ggagtgatgc aactgtgaca ggtgatgttg acttacctga agaaggaggt caaaatctct 6120 ctaatttgac tgaaagagag cagttcacaa aggccactga ggaacttgtg gtgtgcactc 6180 aacctaagtg cttggactca gcagtagctc aaaccagagt tggtagaatc attaagcctc 6240 ctaaaagact aatttgtgaa atgaatgaac aacacatgga cagttctgac tctactgtga 6300 gttcttttgt ttatttagtt aaaagctttt tcacatagaa ttgatacaat ttttttttga 6360 aaggaacaaa ggagaaacag gttgaaacag tttagggatc tagcaaggag ttagtacatg 6420 attaaatatt tgaggtgtaa caggcatctc tttagtctgc tataagattg gggttaggcg 6480 cctcccttat cttattgtac tcctcatagg agaatatatt gattgacaga atcatactcc 6540 atatgttttt ttccccttct tcctttattg attgaataca tgaagtagat gtcccatgta 6600 ttgaattttt tttcttttct tttcttcaat gttagtttgt ttggttaaca tacatgtccc 6660 cattaagcag gatttaaggg gggtgta 6687 // ID Mariner-N6_DR repbase; DNA; ZEB; 1652 BP. XX AC . XX DT 18-NOV-2008 (Rel. 13.11, Created) DT 18-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Mariner DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW mariner; Mariner-N6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1652 RA Bao W. and Jurka J.; RT "Mariner transposons from zebrafish."; RL Repbase Reports 8(11), 1795-1795 (2008). XX DR [1] (Consensus) XX CC Copies are ~90 % identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1652 BP; 546 A; 310 C; 358 G; 437 T; 1 other; cagttttatg caaaaatttg ggcacccctc tgtggctgca taataatgtg ctctttcttt 60 ataagagaag atcacagtga aatgccattc ggtttctaga gaaagctaga taatgtcatg 120 tttttcagac agaaatacgt agtgtagcag tattgagatg tgtgaaatta aattgaaact 180 gaaaaatagg ctatgcaaaa gtttgggcac cctctctttt gtgtggattt caaaacctgt 240 agtcacttaa tgctgattga ttacaacaca aaattgataa tttatgttat ttgagtgamt 300 catgactttg tgcagcacaa ccttaaccaa atcctaacac aggtgcaacc aatcatgaaa 360 aaggatattt aagatagctg attgctagtt gtgcttctcc ttattgtgaa ccagaaagta 420 gcaacatggg aacctcaaaa gaacttccta atgacctaaa aactaggata attcaccaac 480 atgaattagg agaaggatgc aaaaagctat cacaaaggtt taaagtctct atctccacag 540 tcagaaatat agtaagaaaa tggaaggcca caggaacagt tcttgtgaag gaaagatgtg 600 ctagaccaag aaaaatatct gaaaggcaaa ggcgaagaat ggttagaatg gtcacagaca 660 aaccacagac cacctccaaa gagctccagg aacatcttgc tgctgataat gtcattgtac 720 atcgttccac agtccagcgc actcttcaca aagaacagct caatggaagg tgatgcagaa 780 gaagcctttt ctgcattctg ccaacaagaa gagtcgtttg aggtatgcaa aggctcatct 840 ggacaagcct gaatcatttt ggaaaaatat actgtggaca gatgccacaa ccataggcgc 900 tttgcatggc ggaaaaagaa cacagcattc caggagaaga acctgctccc tactgtaaaa 960 tatggtggag ggtccatcat gctgtgggga tgtgtggcaa gcacaggtac tggaaacctt 1020 gtcagagtta aaggttgcat ggattccacc cagtatcagc agattctgaa gaacaatgtt 1080 caggaatcag tcaagaggct gaagttacgg cagggttgga tgtttcagca ggacaatgac 1140 ccaaagcaca gttccagatc taccaaggaa ttcatgctca gacatgatac aatgttctag 1200 aatggccatc ccagtcccca gacttgaaca tcatcgaaaa tctatggatt gatttgaaaa 1260 gggccgttca tgcttggcac ccatcaaacc tgactgaact ggagaaattt tgcaaggaag 1320 aatggcccaa aatgccttca gcaagaattc agggacttat ccttgactat aagaagcatc 1380 tacaggctgt tatttcagca aaaggtggct gtacaaaaga ttgatgtcat tattctgttg 1440 gggtgcccaa atttttgcac ctgtctgttt ttgttatgac ttaaattgca ttgcatctgt 1500 tgattaaata aatgttatgt cagaactgaa atgttgcttt ttccctaggg tataaaatat 1560 atccaaatga aattgctgat ttgaaaggcc agcaggctat ggcttgcaat tatgaaaatt 1620 atcaggggtg gccaaacttt tgcataagac tg 1652 // ID DNA25TWA2_DR repbase; DNA; ZEB; 802 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE DNA25TWA2_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA25TWA2_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-802 RA Kapitonov V.V.; RT "DNA25TWA2_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(5), 9-9 (2002). XX DR [1] (Consensus) XX CC ~2000 copies of DNA25TWA2_DR are expected to populate the CC complete CC zebrafish genome. They are ~3% divergent from the CC consensus sequence. CC This element is characterized by 25-bp terminal inverted repeats CC and the TWA targets site duplications. CC Its classification is not certain yet, although it is CC expected to be a member of the Harbinger/IS5 or Mariner/Tc1 CC superfamilies. XX SQ Sequence 802 BP; 213 A; 198 C; 165 G; 226 T; 0 other; tgccgagttc agactgcatg attttcaaag tagtcgtgtc acagatgttt tcacactgca 60 tgactatctg ggctagcgtt tcgtcgctgc tttgtttaca ctgcaagatg gatcggcgac 120 agggacattc acattgcatg actttactat aggaagaatc gccgacaact tcgtccaaac 180 tacgtctcac agccaaaaac acgtagtata tcttttgtta ttaactacat aatgagaaag 240 aagcctttaa tggggtagaa catgtacatg tttgctcacc tgggtttaaa gggaattagc 300 catttctcct caacgttgat aataaactaa tttctttctg tatgaaacgt caaacagaca 360 cggttgctcc tgagtcctgt caaacctcca ctagtttttc ctccatttcg tgggtccaaa 420 taaaccgaaa aagagcgctt ttaacttctc ccccagcctc ccgctggcct gcagcaggta 480 tacacacacg cacacacaag tgaatgctgc tctctcattg gctgtaggcg atcgctgatg 540 ttattttcag tcaaaactca attcacacgg catgatttga atcgccgaca gctccagata 600 tttagcatgc caaatatctc acaggcatcg gcgactcatc ggcgattctc tcagatcgcg 660 tctttgatag ttcatactgt gtgattgtca ctcacgtgca cgagcagcga tttgcctgtg 720 atttcaggca tttgtcggcg atttctcaaa acctgtcggc gagccaaaat cggggctaaa 780 atcacgcagt ctgaactagg ca 802 // ID Gypsy122-LTR_DR repbase; DNA; ZEB; 833 BP. XX AC chr16; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy122-I_DR; KW Gypsy122-LTR_DR; Gypsy122_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-833 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1281-1281 (2008). XX DR Genome; chr16; Positions 51314858 51314026. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 833 BP; 203 A; 140 C; 165 G; 325 T; 0 other; tgtgacggcc tgaaaggttg gccgcacagt tctgtgtaac agcgtgcgta tttatacaca 60 cacacacaca accacacaca caaccacaca tacacacacg caaccatgca aacatattca 120 cagttatgca gacatgacat ttcattttat agtttctttc tccatatcta ttatacaaat 180 cacaacacac attagtttct ttcttgtgct ttatttatag aaaccataca tttttgttac 240 tactaaatta atttggagtg cacacaaatt agtttcgttg tatgattaat gtttgtttct 300 tttgagttac cgagtggtgc gctggagaca gaaagtggtg gccgggtggc ttccggtgcg 360 gcaaagattg gagtccgggc gggaacgccg gatttatgca aaatggttcc gcccgggcaa 420 accaactgct tgaataatgc agggggagaa gttattttag atatcctctt tgattaaata 480 tttatgtatt gattttgtta atttgtttat attggttaat gtttgatttt attatttttg 540 tggtaatgtt gttattgcat ttattttgtt tttgtgatgt atgtagtaat gtgaattttt 600 gtctttatcc ctttgtaatt taaatgagct catccttttc tttataggcc tggctgttct 660 tcattgtagg aggtcatttt gtgttggttg gttatgttgg cctcatgttc ctttactttc 720 ttttttgtta aattatatta atttgaggac ttatgtccta tttgttgaga ataaatctca 780 agtggaggat ttttggcttg tgttctgttt cctttgaccc tcggtccctt aca 833 // ID hAT-N24_DR repbase; DNA; ZEB; 369 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N24_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-369 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1294-1294 (2008). XX DR [1] (Consensus) XX CC 8bp TSD. 99% identity to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 369 BP; 125 A; 44 C; 76 G; 124 T; 0 other; ggtcccactt tatattaagt gtccttaact actatgtact tacatcaaaa aataaataca 60 atgtacttac tgtgtttata atgtatttga gaacacttgt ggtgcttttg agttgggata 120 gaggttgggt tatggacagg tttggtggca tgggtaggtt taagggtggg ttaaggtgta 180 agggatggtc aacagtgtat ttacaaatgt aattacaaaa gttaattaca gatgtaatta 240 catacatgta tttaatcaag cataagtaca cagtaaatac atgtatttac acaataagta 300 cattgtaaca aactattaat tcctgtgtaa gtacatatta gttaaggcca cttaatataa 360 agtgggacc 369 // ID Gypsy-170-I_DR repbase; DNA; ZEB; 4313 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-170-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4313 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 700-700 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 19..966 FT /product="Gypsy-170-I_DR_1p" FT /translation="MESAGSDQMRNALAQQGALLGQHATQLNTTSREVEML FT TAQVVELNARIEELQHNVTVPRQASSSPTTPHHDPEPHANNPPPYDGDPNS FT CRAFLSQCALVFALQPRRYASEASRVAYVLTLLTGRAREWGTAVWDARAPF FT CHFFEDFREEMIKLFDRSAQGDEAAARLARLSQEGHSVTDFAIQFRTLAAS FT CDWNEGALRSRFFEGLNEEIQDELSSHELPHNLETLINLALRVEGRLRRRR FT HWRSIHSSWRVNDLPLQQSSSSTPPSPEPEPMQMGRMRLTPQEKRDRLARG FT LCLYCGKPGHFAVKCPLKAKAHQ*" FT CDS 897..4280 FT /product="Gypsy-170-I_DR_2p" FT /translation="MPLLRKTRAFCRQVSFKSQGPSVNRGILVGAFPCLDS FT PRSRTLLPITIQYEGSTNSASALLDSGAEGNFLDRAIAKQWGIPAVPLPSP FT ISVWSLAGQLLSTITHITPCVSLVVSGNHREAIELYLFDSPGSPVVLGHPW FT LAQHNPHVDWSGDSVLAWSQSCHVSCLGSALSPGSLCSVLQEETADLTKVP FT VEYHGLRQVFSRSRAGSLPPHRPYDCAIDLLPGTSPPKGRLYSLSGPEREA FT MDRYIQESLNAGLIRPSSSPAGAGFFFVKKRDGSLRPCIDYRGLNDITIKN FT RYPLPLMSSAFELLQGAQVFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHF FT EYLVLPFGLTNAPAVFQALVNDVLRDMVNKFVFVYLDDILIFSSSLQEHTQ FT HVRQVLQRLLENQLFVKAEKCEFHARSVAFLGYVISAEGIRADPDKVRAVA FT KWPVPNTRKALQRFLGFANFYRRFIRNFSQIAAPLTALTSTKVLFKWNTQA FT QEAFGALKSRFTSAPVLSIPDPEQQFIVEVDASEVGVGAVLSQRSSKDGKV FT HPCAFFSHRLSPAERNYDIGNRELLAVRLALGEWRHWLEGAAHPFLVWTDH FT KNLEYVRSAKRLSARQARWALFFGRFNFVLSYRPGSKNIKPDALSRLFETP FT SGETPVESILPEGVVVGSLSWGIEQRVIKAGQGIEVPEGCPEGRLFVPVAL FT RSEVLRWGHESRLVCHPGIWRSLAAIRQRFWWPSLGSDVRQFVLACSTCAQ FT NKTSNKAPVGLLRPLPIPSRPWSHLALDFVSGLPPSRGNTVIMTVVDRFSK FT AAHFIPLPKLPSAKETAQAMVDHVFRIHGLPVDVVSDRGPQFVSRFWKEFC FT RQIGASTSLSSGFHPQTNGQCERANQDLERALRCLTSQNPSSWSQQLSWVE FT YAHNSLPVTSTGMSPFLCSMGYQPPLFPSQEPDAAVPSALAFVRRCRRTWM FT RAEEVLARTSRRIKAAADRHRTSPPQYVCGQKVWLSTKDLPLRVASRKLAP FT RFIGPYCITKVLSPVAVKLKLPPTLGRVHPVFHVSRVKPVFRSNLNPIISS FT PAPPPPRLVDGSPAYTVRRLLDVRRRGRGFQYLVDWEGYGPEERSWIPARH FT VLDRALVVDLHRRLGKPLP*" XX SQ Sequence 4313 BP; 827 A; 1257 C; 1123 G; 1106 T; 0 other; gaacgacccg gtcagaagat ggaatcagca ggttctgacc agatgcgcaa tgcactcgcc 60 caacaaggtg cacttttggg ccagcacgcc acccagctca acaccacctc cagagaggtc 120 gagatgctca ctgcccaagt ggtcgagctc aatgcccgaa ttgaggagct ccagcacaac 180 gtcacagtac ccaggcaagc ttcttcttcc cccaccacac cccatcatga cccagagccc 240 catgcaaaca accctccacc ctacgatggg gaccccaact cctgccgggc ctttttatca 300 cagtgtgcct tagtttttgc cctacagccc cgccgatatg cctctgaggc atccagggtg 360 gcatacgtct tgaccctgct cactgggaga gcacgtgagt ggggcaccgc tgtttgggat 420 gccagggcac ccttttgcca ctttttcgag gacttcaggg aggaaatgat caaacttttt 480 gaccggtctg cacagggtga tgaggcggct gcccgcttgg cacgactatc ccaggagggc 540 cactccgtca cagactttgc aatccagttc agaactctag ctgcctcctg tgattggaac 600 gagggggccc tacgttccag attttttgag ggcctgaacg aagagatcca agatgagctt 660 tcttctcatg agctccccca taaccttgag acccttatta acctggcact ccgtgtggag 720 gggcgcctcc gccgccgccg ccactggcgg tccattcact cctcctggag ggttaatgac 780 cttcccctcc aacaatctag ctcttctact cctccctcac ctgaacctga gcccatgcaa 840 atgggtcgta tgcgccttac acctcaagaa aagagggacc gtcttgcccg tggtctatgc 900 ctctactgcg gaaaaccagg gcattttgcc gtcaagtgtc ctttaaaagc caaggcccat 960 cagtgaaccg ggggatcctg gtgggcgcct ttccctgtct agactccccc cgctctcgca 1020 ccctgttgcc catcaccatc cagtatgaag gatccactaa ttcggcctct gccttgcttg 1080 attcaggggc cgagggtaac tttctagacc gtgccattgc caaacagtgg ggcattcctg 1140 ccgttcctct cccttccccc atttctgtct ggtcattagc tggtcagctt ctctccacca 1200 tcacccacat cactccctgt gtaagtcttg tcgtttccgg caatcaccgt gaggccattg 1260 agctgtacct ttttgactcg ccgggttccc ctgtagtcct ggggcatccc tggctggcac 1320 agcacaatcc tcacgttgat tggtctggag attctgtgtt ggcctggagt cagtcttgtc 1380 atgtatcttg tttgggttct gccttgtctc ctggttcttt gtgttctgtg ttgcaggagg 1440 agacagccga tttaaccaag gtcccggttg agtaccatgg tcttcgtcag gtcttcagca 1500 ggtctcgggc tgggtctctc cctcctcatc gaccatatga ctgtgccatt gacctcctcc 1560 caggcacttc tccgcccaag ggtcgcttgt attccctttc tggtccagag agagaggcta 1620 tggataggta tattcaggag tcacttaatg ccggtctcat ccgtccctca tcatctcctg 1680 caggtgctgg gttcttcttt gttaagaaga gggatggctc cctgcgtccc tgtattgatt 1740 acagagggct aaacgacatt acgataaaga acaggtaccc cttacctctc atgtcgtctg 1800 cttttgagct gttgcaggga gcccaggtct tcaccaagtt agaccttcgt aacgcttatc 1860 acttggtacg cattcgtgag ggagatgagt ggaagacagc ctttaatacc cctacgggac 1920 actttgaata tctggtccta ccctttggtc tgacaaatgc accagctgtc ttccaggccc 1980 tggttaacga cgtgctgaga gacatggtaa acaagtttgt ctttgtgtac ctggatgata 2040 ttctcatttt ctcatcctct ttacaggaac acactcagca tgtgcgtcag gtcctccaac 2100 ggctactaga gaatcagctc tttgttaagg cggagaagtg cgagttccac gctaggtcgg 2160 ttgcgttcct ggggtatgtc atctcggctg agggaatccg agctgatcct gataaggtaa 2220 gggccgttgc caaatggcca gtccccaaca cccgcaaggc cctgcagcgg ttcctgggtt 2280 ttgccaattt ctataggcgt ttcatcagga actttagcca gattgctgca cccttaacgg 2340 ctctcacctc caccaaggta ttgttcaagt ggaatacaca agctcaggag gcctttggtg 2400 cccttaagtc ccgtttcacc tctgctcctg ttctttctat tccagaccct gaacaacaat 2460 ttattgttga ggtagatgcc tctgaggttg gggtaggcgc ggtcctgtct cagcgctcat 2520 ccaaggatgg gaaggtgcat ccatgtgcct ttttctctca tcgcctgagc ccagcagaac 2580 gtaattacga catcggtaac agggagctgt tggcagtcag gctggccttg ggtgagtggc 2640 gtcactggct ggagggtgcg gcgcacccct tcttggtctg gacggaccat aagaaccttg 2700 agtatgtccg ttcggccaag aggttgagcg cacgccaggc tcgctgggca ctcttctttg 2760 gtcggttcaa cttcgtcctc tcgtaccggc cagggtcaaa gaatatcaaa cctgatgccc 2820 tctcacgact gttcgagacc cccagtgggg agaccccggt ggagtccatc ctccctgagg 2880 gagtggttgt tgggtccctc tcctggggca ttgaacagcg ggtaataaag gccggtcaag 2940 ggattgaagt accagagggg tgcccggagg gtcggttgtt tgtgccggtt gcgctgcgtt 3000 ccgaggttct ccggtggggc catgagtcca ggttggtttg ccacccagga atttggagat 3060 cactggctgc catccgccag cgattttggt ggccctctct agggtcggac gtcaggcagt 3120 ttgtgttggc ttgctcgaca tgtgcccaga acaagacctc taacaaggct cctgttggtc 3180 tgctccggcc cttgcccatc ccttctcgtc cttggtcaca cctggccctg gacttcgttt 3240 ctggcctccc tccctcaaga ggaaacactg tcataatgac tgtagtggac cgtttttcta 3300 aagcagccca ttttatcccc ttgcccaagc tcccctcagc caaggaaaca gcccaagcta 3360 tggttgatca tgtcttccgg attcatggtc ttccggttga tgtggtttct gataggggac 3420 cacagtttgt ttcccggttt tggaaggaat tctgtcggca gatcggggcc tctacgagtc 3480 tgtcctcagg ttttcatcct cagaccaacg ggcaatgtga acgagccaac caggatctcg 3540 aaagagccct ccgctgcctg acctcccaga atcctagttc ctggagccaa cagctctcgt 3600 gggtggagta tgcccacaat tcactgccgg taacgtctac aggtatgtcc ccattcctct 3660 gctctatggg ttatcaaccg cctttgtttc cttcacagga gcccgatgct gcagttccgt 3720 ctgccttggc ctttgtccga cggtgccgcc gcacctggat gagagctgag gaggtcttgg 3780 cccggacctc tagacgaatc aaagcagcag ctgaccgtca tcggacttcc ccacctcaat 3840 atgtttgtgg tcagaaggta tggttatcaa ccaaggactt gcctctcagg gtggcctcac 3900 gcaagttggc acccaggttc attggtccat actgcatcac caaggtcttg agtccggtgg 3960 cggttaagct caagttacct cccacacttg gtcgggtaca ccctgttttt catgtatcca 4020 gggttaaacc tgtgttcaga tccaatctta accccattat ctcctcccct gcccctcctc 4080 ccccccgcct agtggatggc tctcctgcat acactgtcag gaggttgttg gatgtcagac 4140 gccggggccg gggtttccag tatttggtgg actgggaggg ttatggtcct gaggagagga 4200 gttggattcc agctcggcat gttctggatc gggcgttggt cgtggacctt caccggcgac 4260 tgggtaagcc cctcccctga gacgccaggt ggcgttcgta gggggggggg tac 4313 // ID Gypsy6-LTR_DR repbase; DNA; ZEB; 1524 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Gypsy6-LTR_DR is a long terminal repeat of the Gypsy-6_DR LTR DE retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy6-I_DR; Gypsy6-LTR_DR; Gypsy6_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1524 RA Kapitonov V.V.; RT "Gypsy6_DR LTR retrotransposons from zebrafish."; RL Repbase Reports 4(10), 262-262 (2004). XX DR [1] (Consensus) XX CC Gypsy6-LTR_DR is a long terminal repeat of the Gypsy6_DR CC retrotransposon whose internal portion is deposited in Repbase CC as Gypsy6-I_DR. XX SQ Sequence 1524 BP; 285 A; 294 C; 367 G; 570 T; 8 other; tgtgatggtg tagtgcagct gattgatatg agtaaccaaa atgtgatggt gctgtgcagc 60 tgattggcat tgacatcggc caatcggcac gtagcaccgc ctttttaaac ctgattggtt 120 ggtgcttggg atacgtcacg tccgactccg acgcactggt ttgggtgtct gtagycgcgg 180 ctacaggtag aggctctgcg ctcgttctga taattttttg taagttgttt gaatcgaccc 240 acgttaagct ctcttttggc attgttctag tgcatcgtgt ggcggtggct tttcccttgg 300 ctgtggtggc tggatgactg ttcagttgca ggtatgtgat ggcttttaac tggctgagta 360 atggggttta gccgcattta acctgcgtgt ttcatgccgt acagctatta attccggcac 420 ctcgctgtat tgtatcgacc aacgagtgat tagaacgtgt ctgagtgtgc gcccattggt 480 aagtgaagca cgtatgatwt ggtgtgatca tttaaaggac catgttaatg ggcttgtttg 540 cgcttttcct ccttaggatc ctcccttttc ccctttctcc cctttctctt ttacgtcgga 600 tcttcctctg gtatcatttg cttactagct cccgtgttca tcgctattgc cgtggttacc 660 ggagaycgtc tkcaactgct gctaatgcta acgccggcgt attggctgcc gcttggacac 720 caccaatgca gctgytagcg gggactgcct gcaataattc ttagcgtgct gctttgtgct 780 ccccttcatt ctttcatttg tttatttgtk ttggataatt gatttgttaa cattagcart 840 gttctgcttt ttgttgtgtc tttaatttat gcaaagcatt gatttgtttt cgttattaag 900 attattcttt tgtgtgttta ttagtcttat ttcgagtaac agttaatgtt tgggtttagt 960 gtaggtcaga tggtgtaact acagctgtag ctrtctttca attagatcat gtaaagtagg 1020 tgttaacctg attgacccac ttatttgtgt tttgttggtt atttattttt atttatttca 1080 ggaactgagg tttcccccgg tggatgacgg gtcactttcc actggtggtg gtgtttctct 1140 atcgtgtgct gtggacactg agtgcgtgat agcgtgattt ttgagtgcac tgtggtgtgt 1200 gcgtgtgctg gggtaaagtc tgtccactcg tacccagtca gtgggaacct ccaatcctgg 1260 tttggttttg tttatttatt gtttcttttg tgtttaataa tgaagtggga gtttttgatg 1320 tgtttttaag aaccgtacaa tttgacagtt ttattccaaa taaatatttt tgtacttttt 1380 cattaactca cgtctctgtg cctttctggg agaaacgaac ctgtgtcctt ttctaatagt 1440 tgtttccctg ggtaactccc ggggtggcgt agtcggttat ttttatttaa agagatttaa 1500 accctactct ataccctcgt caca 1524 // ID hAT-N52_DR repbase; DNA; ZEB; 821 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N52_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-821 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1321-1321 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 821 BP; 283 A; 140 C; 145 G; 248 T; 5 other; tagggatgct ccgatcagga tttttgcagc cgataccgag taccgattcc ttgtcatggt 60 gatcggccga tacagagtac cgattctgat gctttaagct ttataatgca taaagcacat 120 tttccctcta agtatgatac ttaagtggag atctctcctt acctaagaga ataaatgaag 180 gatgttgcat ataatccctt aaacacgtct cttataacca tttaggtgtg gacatgtcat 240 ggtcagaagc agctttacag aaaataatag ataaaacata taaaaaaatg gtaagaaaaa 300 tgactaacaa agcaatttgg aaacattgca aatgatggaa gaagaattca acatgtctca 360 atgaagaaaa agtttggagc gcatatttga tgcataagaa gttaaaatgc acacctataa 420 atccattact tctttactaa aaggaaatag gcctataaaa ctactgttta ttgcaataag 480 tatattacaa taagttatat tattaaatat tcaagttaaa aaataaaatt tttaagtttt 540 ttttaattta tctaatattt ccagccaggt tttagtgaac ttgcaatatt gtmaattaaa 600 cacagwagaa ctttcagttc ttatatgtaa aaaggcctaa atacatgcag taggtcmttt 660 atatatarcy caggtgctca cgtgcatctt cctctgcttg taaactcata aacacttgtg 720 atcggttcat gagatcggcc agatcggcga gtaccgatcg agtcatgaaa tgtgattatc 780 ggccgatacc gatctctggc cgatcgatcg gagcatccct a 821 // ID Gypsy148-I_DR repbase; DNA; ZEB; 4828 BP. XX AC chr5; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy148-I_DR; KW Gypsy148-LTR_DR; LTR71_DR; LTR_Retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4828 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1557-1557 (2008). XX DR Genome; chr5; Positions 32566059 32570886. XX CC Positions [2393-2869] - Integrase core CC LTRs are 96% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(1862..3256,3260..4795) FT /product="Gypsy148-I_DR_1p" FT /translation="MIDLSDTFIGSAQDVTLETQGADSLSMSCGAETLSDD FT LPLSADKVLISVEQKNDSSLSRCRTTALTAEAIKNKPIGYFWDDDVLKRKW FT TPTVSDGLGWDVVFQLVVPKCLRRQVLSVAHDNVAGHLGITKTYHRILRYF FT FWPGMKADVAKYCRSCHTCQIVGKPNQKIPPAPLKPIPVVDEPFSRVILDC FT VGPLPRTKSGHVYLLTLMCTTTRYPEAIPLRSLKAKVILKALITFFSTFGF FT PKVIQTDQGTNFMSRVFKQVLSQMNIKHVTSSAYHPESQGALERFHQTLKS FT MLSKYCAESSKDWDEGLSLLLFAIRESVQESLGFSPAQLIFGYDIRGPLKL FT LHESWVSEPKQECHLLDYVCTFRERLHNACKLAQISLGNAQAKMKERYDKK FT AVSRSFSAGDKVLVLLPVQGAALEAKFSGPYVIDKKLCDTNYVVRTPDRRR FT KCRVCHINMLKHYVSREDDSSKSSVIAPVAVVITSDSGSDEDGLSMQKSLI FT SCGKLNNSQALSVLKSQLLHLSDSQQSEMLDLIHCFPSLFLDVPSRTSVLK FT HVIDVGECRPIKQSAYRVNPAKRKVMGDEVKYLVTNGLAVPSSSAWSSPCI FT LVPKSDGTQRFCTDYRKVNAVTKPDSYPLPLMEDCIEKVGSARFVTKLDLL FT KGYWQITLTERAAEISAFVTPDDFLNYSVMAFGLRNAPATFQRLMNIVLSG FT VSNCKAYLDDIVIYSDTWDEHLSTLRTVFCRLRDASLTLNLSKCEFGKATV FT TYLGKQVGQGEIRPVALKVQAIVDFAVPRTKRDLRRFLGMAGYYRSFCRNF FT SDVVLPLTNLLRISNEFEWTDECQSAFDSVKSLLSNAPVLMAPNYSHVFKL FT DVDASATGAGAVLLQDDSNGVEHLIGYFSKKFLKHQLNYSTIEKETLALLL FT ALQHFEVFIGSSSLPVLVFTDHNPLVFLSRMKNANQRIMRWSLFIQDFNMD FT IRYKKGKDNVLADALSRSFIVE" XX SQ Sequence 4828 BP; 1250 A; 943 C; 1139 G; 1496 T; 0 other; tccaaattta gtttaatttg atctgtttat tcgtttaaat ttcgcgtccg gagagaaatc 60 ttgtgacatt gctggtgagt accatttata tattctctat ttttggtgac tggaaatttg 120 atcttgtgct gtggggaaag gttgagatta tggagtttag catagaagcg tttaaacaag 180 ctccttcgtg ggaacttttg gagtcatgcc gtaaggttga tttattttta atggctgatt 240 tctatgaaat ctccgtggta aaaactgcta agaaaaaaga agtgtgagat ttaatttata 300 attcgttggt gcagcaaggt gttttacagc ctcggcctgc tggagcagcg caaaaagacg 360 ccggtgctga tagccacgcg acagatgatt ctatgttgca ttcgccaagt ggaacaaatt 420 tagatgattt aaaattggct attcagttga aacaattaga tctcgaaatt aagtgtcaag 480 aacatactac tcagcttcta cgttttaggc aatgcgagtt agaaacgcaa gctgaccgtc 540 gttcaccagt cttgcctcct aaattgtctc ctatacctac tgtttcacca tcagcttcac 600 ttgctgttaa ttctaatcca accccggtta atttgttgtc tgtgtcctct aataattatg 660 ctgactttga catcagtcgt catattacgc tcgtaccccc atttcaggat aacgaagtgg 720 attcatattt cggcgctttc gagcgaatag ctgcggcgtt gcgctggcag aaagagactt 780 ggtctctttt cattcagtgc aaattaacgg gaaaagccca agaggttagt tccgctttat 840 ctgttaacga cagcttggat tatgataaac tgaaaagtgc ggtgctgaaa gcatatgaat 900 tagtccccga ggcctatagg caaaaatttc gttcgcattc gaagggttcc agtcaaactt 960 ttgtggagta tgctcgggag aagagcatgc tattcgataa gtggtgtcac tccagtaaag 1020 tcactgattt tgagcaactt cgtgaactaa ttttgattga ggattttaaa aatccgttgc 1080 cggacaaaat tgttgttttt ctgaatgaga aaaaaaattt cgactcttgc tgaagccgcc 1140 gtttgcgcgg atgagtttgt tttgacgcac aagagcacgt ttgtttcacg tcgtgagtca 1200 aatttttcgc cagtcttcga aaaaaatgga aaagcttcaa aatcgctaaa gcaggtaaaa 1260 actgcggaat ctcgcgaatg cttctactgc cacgatttag gtcatttaat ttctatgtgc 1320 cctagtttga gaaagaaaaa tcaggtaaag gcagctaaga gtgttggctt cgttagtagg 1380 gaaagtaagc ccgattcgga tattgatgct ttgtatgatc catttgtgtg tcagggaacg 1440 atttctctga gtggattggc acaagattca gttcctgtgg ccattcttcg agatacaggg 1500 tctgtacagt cctttgtatt atttgattcg ttacattttt tcggacgaaa gttattgtgg 1560 atctgatgtc cttattcagg ggattgagtt aggggttctg aaagtgccac tgcataaggt 1620 atacattcag tctgatttgg taacaggatt tgtgaaactg gctgttcgtc accagttgcc 1680 cgtgaagggt gttgctgtta tcattggtaa tgatttagcc ggaggaaaag ttcttccata 1740 tccagaggtt attgagaatc ctctttgtga gacggttaat tctaatgatt tagtttctga 1800 atttccgacc gtcttttctg cttgtgtaat cacgcgggcg caagctcgta aattcggcga 1860 tatgattgat ctatctgaca catttattgg cagcgcgcaa gacgtgacac ttgagacgca 1920 gggtgctgat tcgctgtcca tgtcctgtgg tgctgaaacg ctatcggatg acctcccgct 1980 gtccgcggat aaggtgctga tttctgttga acagaaaaat gactcctctt tgtcacgttg 2040 ccgaactacc gctttaactg cggaggcgat taaaaataag ccgattggtt atttttggga 2100 cgacgatgtg ttaaaaagaa agtggactcc aacagtgtct gatggtttag gctgggatgt 2160 tgtttttcag ttggtagtac ctaaatgttt acgcagacag gttctgagtg tggcacatga 2220 taatgtcgct ggacatttgg gcattaccaa aacatatcat cgtattttgc gatatttctt 2280 ctggcctgga atgaaagcag acgttgcaaa atattgtcgt tcatgtcaca catgccagat 2340 tgtcggaaaa cctaatcaga aaattccgcc tgcacctctt aaacctattc cagtcgtcga 2400 tgagcccttc tcacgagtga ttttagattg tgttggccca ctaccacgta caaaatctgg 2460 gcacgtttat ttgctcaccc tcatgtgcac cactactcgc tacccagagg ctataccgtt 2520 gcgctcatta aaagcgaaag ttattttaaa agcactgatc acattctttt ccaccttcgg 2580 ttttccaaaa gtgatccaga ctgaccaggg cacaaacttt atgtcccgag tgttcaaaca 2640 agtgctttct caaatgaaca ttaaacacgt tacatcgagt gcctaccatc cggaatctca 2700 gggggcatta gaaaggttcc atcaaacttt gaagtccatg ttgagtaaat actgcgctga 2760 gtcgagcaaa gactgggatg aggggttgtc cttattgtta tttgcgatcc gtgagtctgt 2820 ccaggagtca ttgggcttta gcccggccca attaattttc gggtacgata ttcgagggcc 2880 cttaaaatta ttacatgagt cgtgggtttc agagccaaaa caagagtgtc atcttcttga 2940 ttatgtttgt acttttcgcg agcgtttgca caatgcgtgt aaactagctc agataagttt 3000 ggggaatgct caggcaaaaa tgaaagagcg ctatgataaa aaggctgtgt ctcgttcgtt 3060 ttctgcagga gataaagtct tggttttgtt gccagttcaa ggtgctgcat tagaggcaaa 3120 gttttctgga ccgtatgtca ttgataaaaa gctgtgtgac acgaattatg ttgtgcgcac 3180 acccgacaga cgtagaaaat gccgtgtgtg tcacattaac atgttgaagc attacgtgag 3240 tcgtgaagat gattcgtagt caaaaagctc tgtaatcgct cctgtggctg tcgtgatcac 3300 ttctgacagt ggctcagatg aagacggtct aagtatgcag aagtccttaa tttcgtgtgg 3360 caaattgaat aattcccagg ctctgagtgt tttaaagtca cagttgttgc atttgtctga 3420 tagccagcaa agcgaaatgc ttgatttaat tcattgtttc ccttcactat ttttagatgt 3480 cccctcacga acttcagtat tgaagcacgt tatagacgtg ggcgaatgcc ggcctattaa 3540 acagagtgct taccgcgtga atcctgctaa acgaaaggta atgggagatg aggttaaata 3600 tctcgtcaca aatggtctcg cagtcccgag ctcaagcgcg tggagctcac cgtgtatcct 3660 cgttccaaag tctgatggca cgcaacggtt ctgcacagac taccgaaagg taaatgccgt 3720 gacgaaaccc gattcgtatc cccttccatt aatggaagat tgtattgaga aagtaggctc 3780 tgcacgtttt gtcacaaaat tagatctgtt gaagggttac tggcaaataa ccttgacgga 3840 gcgtgctgct gaaatttctg cttttgtgac gccagatgac tttctgaatt actcggtgat 3900 ggctttcggg ctacggaacg ctcccgccac ctttcagcgc ttaatgaata tagtgctcag 3960 tggagtgtct aattgtaaag cgtacctcga cgacattgtc atttattctg acacatggga 4020 cgaacatctg tcaacactga gaactgtgtt ttgtcgtttg cgagatgcct ctttgacact 4080 taatttaagt aagtgtgagt ttggtaaggc gactgttacc tatttgggaa agcaggtggg 4140 acagggggaa attcgacctg tcgctttgaa agtgcaagcc attgtagact ttgccgttcc 4200 ccgaacgaaa cgtgaccttc ggagattttt aggaatggcc ggttattatc ggagtttttg 4260 caggaacttt tcagatgttg tgctgccttt gacgaatttg cttcgcattt caaatgagtt 4320 tgagtggacg gatgaatgtc agtccgcgtt cgattctgtc aagtctcttt tgagtaatgc 4380 tccagtgtta atggctccaa attattcaca tgtgttcaag ttggatgtag atgcgagcgc 4440 tacgggcgca ggtgctgtcc tacttcagga tgactcgaat ggtgtggaac atcttatcgg 4500 ctatttctca aagaaattcc tgaagcacca gctcaattat agcacaattg aaaaggagac 4560 gttagccttg cttttagctt tgcaacactt tgaagtgttt attggctcta gttctctgcc 4620 cgtgcttgtt tttaccgatc acaatccact tgtcttcttg tctcggatga agaacgcaaa 4680 tcagcgcatt atgcgctgga gtctatttat tcaagatttt aatatggata ttcggtacaa 4740 gaagggaaaa gataatgttc tcgccgatgc tttatctcga tcatttattg tcgagtaatc 4800 tactacgtag attctttgtg tgtgggag 4828 // ID DNA-TTAA0-3_DR repbase; DNA; ZEB; 935 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 0-bp; KW target TTAA; DNA-TTAA0-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-935 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1249-1249 (2008). XX DR [1] (Consensus) XX CC Elements are Inserted specifically between 5'-TT and AA-3', CC producing no TSD. TIR is 23-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 935 BP; 269 A; 172 C; 162 G; 331 T; 1 other; acagtttttc tagtttgctt tgacctggtt aaagaaacta ggttcctctt cattttcatt 60 atcactatcc cagcagagca gaggaccggt cttgcaaatg tgtaaaccct ttctcattgg 120 gttgacacat tttccccacc atttttccaa acagttaaca cagatgtcat tcaagcagtc 180 caaactccat tgttttagtt ttggtaacca aacagaatcc atctattcct aactattaaa 240 aactaccaac atcagtatga aatcactgaa gcaccggttt gacactcctt cacacaaatc 300 ttcaattagc catcattctg caaaccttat aaaaacgatg gaaggtctgt gttgttctgt 360 gccagcattg atggaggagg tcaatatggc tatgtcctat actcccaata gatttgactg 420 ttttgactgt atattagttt acatgtgttt tctctaatat ttacagtatt ttattacttt 480 tgtctgtttt ttttatatac agtatttcag ttttcagcca cagtttgaaa aatgtcatga 540 attcaatata tattcaatca aagcacatct tattcttgat ccaattcttt gcaaaaaggc 600 gaatttgaca gcccaaatag agacaacaaa tgccattggc tataagctac aatggcaagc 660 aatggtggtg cattttgagt tctgtatttt gtattttgtt gtccattgtg taatgattaa 720 ttgaaagagc tcatatactt gtttgcaagt tgtgtwgttt gaaggtaaaa ctagcttttg 780 ttgcatatgt taaatgtttt gacacgatat gtgccttttg caagcaaaat gtgtcatttt 840 gaccatgagt gccgctgttt aggcctgtgt gttaactgtt ttgaaaatgt ggagtttcag 900 ttgacaactg catccaagca atcgagaaaa actgt 935 // ID Gypsy103-I_Dr repbase; DNA; ZEB; 4286 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.05, Created) DT 21-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy-103_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW gag; reverse transcriptase; integrase; Gypsy-103_DR; KW Gypsy-103-I_DR; Gypsy-103-LTR_DR; Gypsy103-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4286 RA Dib M.R. and Naveira H.F.; RT "Gypsy103_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 942-942 (2009). XX DR [1] (Consensus) XX CC Gypsy103-I_DR is an internal portion of the Gypsy103_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy103-LTR_DR. CC Gypsy103_DR is characterized by 4-bp target site duplications. CC The internal portion encodes one polyprotein the 1422-aa CC polyprotein Gypsy103_DR1p (pos. 19-4284) composed of the gag, CC protease, reverse transcriptase, and integrase domains. Some CC insertions fairly recent, according to the hight identity between CC their flanking LTRs. Consensus obtained after the alignment of at CC least five independent insertions bearing at least 85% Homology CC over at least 1000bp. XX FH Key Location/Qualifiers FT CDS 19..4284 FT /product="Gypsy103-I_Dr_1p" FT /note="Polyprotein." FT /translation="MDAAGSDTVRTAVTQQGALLGQHEARLTNTTREVEFL FT ANQVAELTALVQDLQHEAAQGGPLRHHDPEPRCNNPPPYNGDPNSCRAFLS FT QCAVVFTLQPRTYASEESKVAFVLTLLTGKARDWGTSVWETRAPCCASFED FT FRQEMVRLFDRSVRGQEAADQLARLRQAGQSVTEYAIAFKTLAASCDWNEG FT ACRSMFRAGLQDDIQDELATQDLPQDLDDLVNMALRIEGRLHRRRQRLTVR FT PPWRVEDSRPVLAAEASGSAPMDPEPMQVGRLRLTPLQRQQRFVQGLCLYC FT GKPGHFAVACPLKRQASPVKRGILVSTTPFLHSSSRTLLPVSVHFSDSSNS FT CSALVDSGAEGNFMDSSLAALWGIPAIPLPDPIPARSLNGSLITTVSHSTP FT PINLTVSGNHHEVTTLLLLDSPSASIVLGHPWLVKHGPHVDWSGNAILSWS FT QYCLSSCLGSAPFPVSVSSVLQAEAADLTGVPVEYHGLRQVFNKSRATSLP FT PHRPYDCAIELLPGTSPPKGHLFSLSGPEREAMDRYINESLKTGLIRPSSS FT PAGAGFFFVKKKDGSLRPCIDYRGLNDITVKNRYPLPLMSSAFELLQGAKV FT FTKLDLRNAYHLVRIREGDEWKTAFNTPTGHFEYRVLPFGLTNAPAVFQAL FT VNDVLRDMVNRFVFVYLDDILIFSPCLQIHIQHVRQVLQRLLENQLYVKAE FT KCVFHAQSIPFLGFIISAGEIQADPCKIRAVAEWPTPDSRKALQRFLGFAN FT FYRRFIRNFGQIAAPLTALTSPKVWFKWNSDAQEAFDELKSRFVSAPVLSI FT PDPEQQFIVEVDASDVGVGAVLSQRSCLDGKVHPCAFFSHRLNPSERNYDV FT GNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKNLEYIRSARRLTPRQARW FT ALFFDRFKFTLSFRPGTKNVKPDALSRLFEVPGKEKSVDAILPKEMVVASI FT SWDIERRVEKAIQKISVPGRVPAGRLFVPTRLRSEVIQWGHSSRLACHPGV FT RRSLALIHQRFWWPSMAKDVRQFVAACSVCAQNKTSNAPPVGLLRPLPIPS FT RPWSHVALDFVTGLPESKGNTVILTVVDRFSKSVHFIPLPKLPSAKETAQV FT VIDHVFRIHGLPVNVVSDRGPQFVSRFWKEFCRQIGASTSLSSGFHPQTNG FT QSERANQDLERTLRCLASHNPSSWCQQLSWVEYAHNSLPSSATGLSPFECS FT IGFQPPLFPSQEPEAAVPSALAFVRRCRRTWRKAREALVRVGRRTKAAADQ FT HRTPAPHYICGQRVWLSTKDLPLRVPSRKLAPRFIGPYQITKVLSPVVVRL FT KLPPKLGRVHPIFHVSRVKPVMYSPLVPSAPSPPPPQLVDGLPAYTVRRLL FT DVRPRGRGFQYLVDWEGYGPEERSWVPARDILDQALVEDFHRRRGKPLPAA FT PGGARRGG" XX SQ Sequence 4286 BP; 877 A; 1218 C; 1091 G; 1100 T; 0 other; gtacgatctg accagagaat ggacgcagcg ggttcagaca cggtcagaac tgcagtcacc 60 cagcaaggtg cactcttagg ccagcatgag gctagattaa ccaacaccac cagggaggtg 120 gaattcctgg ccaatcaagt ggcagagcta actgcactcg tccaggacct acagcatgag 180 gcagcccagg ggggtcctct ccgtcatcat gatccggagc cccgttgcaa caaccctcca 240 ccctacaacg gagaccctaa ttcctgtaga gccttcttgt ctcagtgtgc agtggttttc 300 accttacaac cacgcactta tgcatctgag gaatctaaag tggcgtttgt attgaccctc 360 ctcaccggta aagctcgcga ctggggaacg tcagtgtggg agactagagc cccttgctgt 420 gcttcgtttg aagactttcg ccaagaaatg gtgagattat ttgaccgatc agtccggggc 480 caggaagcag cggaccagct ggcacgccta cgccaggcag gacaatccgt tactgaatat 540 gccatcgcct tcaagactct agcggcctcc tgtgactgga acgagggcgc gtgcaggtct 600 atgttccgcg ccgggttaca ggatgacatt caggacgagc tggccacaca ggatcttccc 660 caggaccttg atgacctagt taacatggcc ctgcggattg agggacgtct ccatcgtcgt 720 cgtcagcgtc taacggttcg ccccccatgg agagtggagg attcccgtcc agtcttggct 780 gcagaagcat ctggttctgc tccaatggat ccggagccca tgcaagtagg gcgtcttcgt 840 cttaccccac tacagagaca gcaacgcttc gtccagggac tctgccttta ctgtgggaag 900 cctggacatt ttgctgtagc ctgcccactc aaacgtcaag cctcaccagt gaagaggggg 960 atactggtga gcactacccc tttcctacac tcttcctccc gcactcttct tcctgtttct 1020 gtacacttca gtgattcctc taattcttgt tcagccctgg ttgactcggg ggctgaagga 1080 aattttatgg actcctccct agctgctttg tggggaattc ctgctatccc tcttcctgat 1140 cctatccctg ctcgctccct caacggctcc cttataacca ctgtttccca ttctacccct 1200 cctattaacc tgactgtttc aggcaatcat cacgaggtga ccactctgtt gcttctggat 1260 tccccgagtg cctccatagt gttggggcac ccgtggttag taaagcatgg tcctcatgtt 1320 gattggtctg gtaatgccat tttatcttgg agccagtatt gtttgtcttc ttgtctgggc 1380 agtgcccctt ttcctgtttc tgtgtcctct gtgttacagg cagaggctgc tgatctgact 1440 ggggtcccgg tggagtacca cggcctgcgc caggtgttta acaagtcccg ggccacttcc 1500 ttgcctcctc acagaccata cgattgtgcc atcgagctcc tcccaggcac ttctccgcct 1560 aagggacact tgttttccct ttctggtcca gagagagagg ctatggacag gtacattaat 1620 gaatccctta aaaccggtct catccgaccc tcctcgtctc ctgcaggtgc tggattcttc 1680 tttgttaaga agaaggatgg ctccctgcgt ccatgcattg attatcgagg gctcaacgac 1740 attaccgtta aaaacaggta ccccctacct ctcatgtcgt ctgcttttga gctgttgcag 1800 ggagccaagg ttttcaccaa acttgacctc aggaatgcct accatcttgt ccgcatacgt 1860 gagggggatg agtggaagac agcctttaac acccctacgg gacactttga ataccgggtc 1920 ttaccgttcg gtctgaccaa cgccccggct gtctttcagg ccctggttaa tgatgtgttg 1980 agagacatgg ttaatcgttt tgtctttgtg tatctcgatg atattttgat cttttccccg 2040 tgcctgcaga tacatattca acacgtacgc caggtgctac aacggttgtt ggagaaccaa 2100 ctttatgtca aggcggagaa gtgcgttttc catgcccagt caattccgtt tctggggttc 2160 atcatctcag cgggagagat ccaggcagat ccctgtaaga taagggctgt tgccgagtgg 2220 ccaacccctg actctcgcaa agctctgcag cggttcctgg gctttgcaaa cttttaccgg 2280 cgattcatca ggaactttgg ccagattgct gcacctttaa cagcactcac ctctcccaag 2340 gtatggttca agtggaatag tgatgcacag gaggcctttg atgaattgaa gtcccgtttt 2400 gtctctgctc ctgttttgtc cattccagat cctgaacaac aatttattgt tgaggtggat 2460 gcatctgatg tcggggtagg cgcagttcta tctcagaggt cttgcctgga cgggaaagta 2520 catccgtgtg ccttcttttc ccaccgccta aacccttcgg aacggaatta tgacgtaggg 2580 aaccgagaat tattggcagt caggctggct ctgggggagt ggcgtcactg gctagaaggg 2640 gcagcccaac ctttcctggt ctggacggat cacaagaatc tcgaatatat ccgttcggcc 2700 aggaggctga cccctcgaca ggctcgctgg gccctcttct ttgaccgctt taaatttacc 2760 ctatcattca gacctggaac caagaatgtc aagcccgatg ccctctcccg tctctttgag 2820 gtcccaggaa aggagaaatc ggttgatgcc attttaccaa aagaaatggt ggtggcgtct 2880 atctcctggg acattgagag acgggtggag aaggccatac aaaagatctc agttccaggg 2940 agggtcccag cgggccgtct gtttgttcca accagacttc gctcagaggt catccagtgg 3000 ggacactcgt ccaggctagc ctgtcaccca ggagtgcgga gatcgctggc tttgattcac 3060 cagcgattct ggtggccatc catggccaag gacgtcaggc agtttgtggc ggcttgttct 3120 gtttgtgccc agaacaagac ttctaatgca cctcccgttg gtctgcttcg ccccctaccc 3180 atcccgtccc gcccatggtc acacgtggcc cttgattttg tcactggtct cccagaatct 3240 aaaggaaaca ccgtcattct cacggtggtg gaccgtttct ctaaatcagt ccatttcatt 3300 cccctcccca aacttccttc agccaaggag actgcgcaag tggtgattga tcacgtattc 3360 cggattcatg gtcttccggt taatgtggtt tctgacaggg gtccccagtt tgtctcccgc 3420 ttttggaagg agttctgtcg acagatcggg gcttctacga gtctgtcatc aggattccat 3480 cctcagacca acgggcagtc tgaacgagcc aaccaggatt tggaacgaac tctccgctgc 3540 ctggcgtccc acaatcctag ctcctggtgc caacaactgt catgggtcga gtacgctcac 3600 aactctcttc cgtcgtcagc tacaggtttg tccccatttg agtgttcaat tggattccaa 3660 ccccctttgt ttccctcaca ggaacctgaa gctgccgttc cgtctgccct agctttcgtt 3720 cggcggtgtc gacgcacctg gaggaaagct agggaggcct tggtccgagt tggcagacga 3780 accaaagcag cggccgacca acaccggact cctgcccctc attacatttg tggtcaaaga 3840 gtatggctct ccaccaagga cctgcctctc agggtgcctt cacgcaagct ggcccccagg 3900 ttcattggtc catatcaaat cactaaagtc ttgagtccgg tggtggtaag gctcaagctt 3960 ccccccaagc ttggtcgggt acaccctatt tttcatgtat ctagggttaa acctgtgatg 4020 tattcccccc ttgtcccttc tgccccttcc ccccctcccc cccaactagt ggacggctta 4080 ccagcctaca cagtacgaag gttgcttgat gttagaccca ggggcagggg tttccagtat 4140 ctagtggact gggagggcta tggtccagag gagaggagtt gggtcccagc ccgggacata 4200 cttgatcagg cgctggtcga ggatttccat cggcgacgag gtaagcctct ccctgcagcg 4260 cctggtggcg cccgtagggg gggggg 4286 // ID GYPSYDR1_LTR repbase; DNA; ZEB; 387 BP. XX AC AL591405; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 13-OCT-2008 (Rel. 13.11, Last updated, Version 2) XX DE LTR of putative novel retrotransposon GYPSYDR1. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW retrotransposon; GYPSYDR1_LTR. XX NM GYPSYDR1_LTR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-387 RA Jekosch K.; RT "Gypsy-like element from D. rerio (LTR)."; RL Repbase Reports 2(2), 11-11 (2002). XX RN [2] RP 1-387 RA Jurka J.; RT "Expanded LTR consensus sequence."; RL Direct Submission to Repbase Update (13-OCT-2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 387 BP; 93 A; 128 C; 73 G; 92 T; 1 other; tgtagcgagt catgtaatac cacgtcgccc tgggcacaaa tccacaccag ctggatctct 60 catcaacctc ggatcgctca cagctggacc tcatcagcca gggagagata taagccagcc 120 ccacacagga aggaagtgag cttcatttga catgactccc tgcgctaacg cttgtctctc 180 tctgtctctc ccacagccga atccagctcg tgaccgatcc aacactctac ytctcaccag 240 ccttcaccat ctccccggag cacttgagca cgcaccttta agaagagcac tcattcaccc 300 caatatcact tgtaaataaa ggcaccctcc ggggcattgt ttgtaacatt atctacggtg 360 tttgtgtttt ccctctgcct cgctaca 387 // ID Gypsy67-LTR_DR repbase; DNA; ZEB; 175 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy67-I_DR; KW Gypsy67-LTR_DR; Gypsy67_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-175 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1450-1450 (2008). XX DR Genome; chr14; Positions 28463331 28463157. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 175 BP; 39 A; 21 C; 46 G; 69 T; 0 other; tgtggtgtat tgggataatg tgtgtgtgta atgatctggg tgttggtctc gctattgatt 60 ggttcttgac aactatttaa gccatgtgat gtattgatgt gatgacgttg tttatatcta 120 agtaaagcac gcttgatggc atctctgtgt gtgtttgcct cgtaaatata taaca 175 // ID Gypsy43-I_DR repbase; DNA; ZEB; 4725 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy43_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy43_DR; KW Gypsy43-LTR_DR; Gypsy43-I_DR; endogenous retrovirus; gag; KW integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4725 RA Dib M.R. and Naveira H.F.; RT "Gypsy43_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 816-816 (2007). XX DR [1] (Consensus) XX CC Gypsy43-I_DR is an internal portion of the Gypsy43_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy43-LTR_DR. CC Gypsy43_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1507-aa polyprotein CC Gypsy43_DR1p (pos. 141-4661) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% Homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 141..4661 FT /product="Gypsy43-I_DR_1p" FT /note="ORF." FT /translation="MEAEVKAFIDSPSLEQLDRFRKKDLLQVAEIYKISVS FT SGAAKAEIKAELLSELTSQGVFSEELEAAASPAADYDVNVALKLKELEFKI FT KQQEYETHLLRIRELELRQEAREVQTAESSRFDPSRYMKLVPQFRESEVDA FT YFIAFERVAGKLGWPKEMWAIMLQSSFVGKAQEVCSALSMQDSLKYDVVKA FT AVLKAYELVPEAYRQKFRGHIKKTQQTFLEFARDKKTMLEKWCTASEVTTF FT EGLQELILLEDFKNGLPENMVLYLNEQKVTTLSKAATLADEYVLTHRAVIP FT AARPHRNFKVDSEKDFTNTCPVKPETARTRAPAGGVNSRTEKRLCFYCLDP FT NHFIAECKAWKLKNASGRTKSVAHAVIEPRMMKAANTGLHNDVFKPFLLEG FT WVSTSPGETEKAVSILRDTGAAHSFIREGILHLSERTYTGSDVLIRGIELG FT CVRIPVHSVFLKSALVTGAVKVGVCAQFPVEGVDFILGNDLAGGSVFPCPI FT VSDFPDNNIAEVTDTSKFFPACAITRSQSRKFEDTVDLSDSFLLRSDPTEF FT VLSVDDKVVNDNEIKLDNQIIKIGRKQLSEAQKSDLSLSYCVDSAVQTESE FT LDGVRIGYFWDGDILMRKWTSASVVDPDTPPTYQIVLPACYRSAVLKLAHD FT HVMAGHFGVNKTFSRITKYFFWPSLRSSVGNYCRSCHACQVAGKPNQVVRP FT APLHPIPVMGEPFERLILDCVGPLPKSKQGYQYILTLMCAATRFPEAVPLR FT NIKTQSIVKELIKFCSLFGLPRVIQTDRGTNFMSTQFKRALEAISVSHQVS FT SAYHPQSQGALERFHQTLKTMLRTHCVDADKDWAESLPFLMFAVRETVQDS FT LGYSPAELVYGHVFRSPLKLLNEQLLASESSTVPVTEYVDSLRKRLSRSCE FT LAKIHLTNSQASMKEHYDKRSVTRSFEPGDLVLALLPVPGSALQTKFEGPY FT VIKRKINDTNYVIKTPDRRKKIKMCHINMLKLYHSRNPSEVKGKSAVAVAV FT PADSKESDMDVEICDSVCVGSLQNSVILKDLNSRLSYLGDGQKDSVINLIH FT SFPEIFTDVPSRTNVLSHDIDVGSASPIKQHPYRVNPFKRKIMKTEVEYML FT AHGLAEPSQSAWSSPCLLVPKSDASFRFCTDFRKVNAVTKPDCFPLPRMDD FT CIDKIGPAKFVTKLDLLKGYWQISLSARAAEISAFVTPDFFGQYSVMAFGM FT RNAPSTFQRLMNIVLRDIRNCDSYLDDIVVYSSTWEEHIETLREVFSRLKA FT ASLTVNLAKCEFAKAYVTYLGKRVGQGQVSPVNAKVEAIMNFPKPSNKREL FT RRFLGIAGYYRGFCRNFASLVAPLTDLLRPSNVFRWSPECELSFNGVKVLL FT SNAPVLTAPNFERPFQLEVDASAVGAGAVLLQSAETGIVKPVCYFSKKFNR FT NQCNYSTIEKEALALLLALKHFEVYLGGSSFPIKVYTDHNPLVFLNRMFNA FT NQRLMRWALLLQEFNLEIVYKRGTENVIADALSRAPGVESSV" XX SQ Sequence 4725 BP; 1247 A; 974 C; 1144 G; 1360 T; 0 other; taattggggg ctcgtcctaa acttcgtgtg aaagctcgtc tgccttttgt ttgtctgcgc 60 atcaaattgg gtgagtaaaa cttctctctt tacatcgtgt cgtgaattgt gttaaaactt 120 tgggtgtggg gggaggaaat atggaagctg aagtgaaagc ttttattgat tctccgtcgt 180 tagagcaatt agatcgcttc cgaaaaaaag atttgctgca agtggcagaa atctataaga 240 tatccgtttc atctggtgca gcgaaagctg aaattaaagc cgaacttctc tctgaactga 300 catcacaagg cgtcttttca gaggagctcg aagccgctgc ttcccctgca gcagattatg 360 acgtgaatgt ggcactgaag ttaaaagaac tcgagtttaa aattaaacaa caagaatatg 420 agactcactt attacgtatt cgcgaattag aactacgcca ggaagcgaga gaagtacaaa 480 ctgctgagag tagtcgtttc gatcccagtc gatatatgaa actcgttcct caatttagag 540 agtcggaagt ggacgcttat tttattgctt tcgagagagt ggcaggcaaa ttgggctggc 600 caaaagaaat gtgggcgatt atgctgcaga gcagttttgt gggcaaggcg caggaggtgt 660 gctccgcgtt gtccatgcaa gattcgctca aatatgacgt ggttaaagct gctgtattga 720 aggcttacga attagtgcca gaggcatata ggcagaagtt caggggccat attaagaaaa 780 cgcaacaaac ctttttagag tttgctcgtg ataaaaagac catgttagaa aaatggtgca 840 cggcatctga ggttaccact tttgaaggtc tccaggaatt aattttgttg gaggacttca 900 agaatggttt gcctgagaat atggttttgt atttgaacga acaaaaagtc actaccctgt 960 ctaaagccgc tacactggct gatgagtacg tactcactca ccgcgctgtc attcctgcag 1020 cccgcccgca tcgaaatttt aaagtggata gtgagaaaga tttcacaaat acgtgtcccg 1080 tgaagccaga gactgcccgc acgcgagcac ctgcgggtgg agtgaatagt cgtacagaaa 1140 aacgactttg tttttattgc cttgatccga atcattttat tgcggagtgc aaggcatgga 1200 aattaaagaa tgcgtccggt agaactaaaa gtgtcgcaca tgcagtaatt gaacctcgaa 1260 tgatgaaggc tgcgaacact gggctgcaca atgatgtctt caagcctttt ctgctagagg 1320 gctgggtttc cacgtctcca ggggagaccg agaaagccgt atccatcttg cgggacactg 1380 gtgccgccca ttccttcatt cgggagggca ttctgcattt gtcggagagg acttatactg 1440 gttcagatgt tttaattcgc ggaattgaac tgggttgtgt gagaattccg gttcattctg 1500 tgtttttaaa atctgcactt gtcactggag cagttaaagt cggggtgtgc gcacaatttc 1560 ccgttgaagg ggtagatttt attctgggga acgatttggc cggaggtagt gtgtttccct 1620 gtccaattgt gtccgatttc cccgataata atattgctga ggtaaccgac acgtctaaat 1680 ttttcccagc ctgtgcgatt acacgctcgc agtcacggaa atttgaagac actgttgatt 1740 tgtctgattc gtttttgctg cgctcggatc ctaccgagtt cgttctgtca gtggatgaca 1800 aagtggtgaa tgataatgaa ataaaattgg acaatcaaat tattaaaatt ggtcgtaaac 1860 aattgagtga agcgcaaaaa tccgacctgt cattgtctta ttgtgttgac tctgctgtcc 1920 aaaccgagtc ggagttagac ggagttagaa ttggatactt ttgggatggt gatattctga 1980 tgcgtaaatg gacatcagcg tctgtggttg atccagacac accgcccacc tatcaaattg 2040 tgttgccagc ctgttaccgg tcggcagtgt tgaaattagc gcatgaccac gtcatggcag 2100 gacactttgg tgttaataaa acttttagtc gcatcactaa gtatttcttt tggcctagtc 2160 tgagatcaag tgtcgggaat tactgtcgtt catgtcatgc gtgtcaggtc gccgggaagc 2220 ctaatcaagt cgtgcggccg gctcccttgc atcctatacc agtgatgggt gagccctttg 2280 agcgtttgat tttggattgt gttggtcccc ttccaaaatc taaacagggt tatcagtaca 2340 ttttaactct aatgtgcgcg gccactagat ttcccgaagc tgtaccttta cgcaacatta 2400 aaactcagtc tatcgtgaag gagctgatta aattctgttc cctctttggt ctgcctcgcg 2460 taattcaaac cgatcgtggc acgaatttta tgtcaactca gtttaagcgt gcgctcgagg 2520 caatttcagt gtcacatcaa gtctccagcg cgtaccatcc gcaatcacag ggagcgttag 2580 agcgcttcca ccaaacgctt aaaacaatgt tgcgtactca ctgtgtcgat gcagacaaag 2640 attgggcaga gagcctcccg tttttaatgt ttgctgtgag agaaaccgtg caagattcgc 2700 tcggatatag tccggcagag ctcgtttatg gccatgtttt tcgcagccct ttgaagcttt 2760 taaatgagca gttactagcg agtgagtctt ctacagtacc agtaacggag tacgtggact 2820 cgttacgtaa acgtttgtcg cgctcatgtg agcttgccaa aattcacctt actaattctc 2880 aagcgtctat gaaggagcat tatgataaac ggagtgttac ccgttcattt gagccaggtg 2940 atttagtgct cgccttactg ccggtgccag gctccgcgct ccagactaaa tttgagggcc 3000 cgtatgtgat aaaaagaaaa atcaatgaca caaactatgt gataaaaact cctgatcgtc 3060 gtaagaaaat caaaatgtgc catatcaaca tgctgaaact ctatcactca agaaacccta 3120 gcgaagtgaa aggtaaatct gcagtggcag tggcagtgcc agctgattcg aaagagagtg 3180 atatggatgt ggaaatttgt gactcagtgt gtgtggggag tttacaaaac tccgtaatat 3240 taaaggactt gaattcccgt ctgtcttacc taggtgatgg tcaaaaagac agtgtgataa 3300 acttgataca cagtttccca gagattttca cagatgttcc gtcccgcact aacgttttga 3360 gtcacgatat tgatgttggc agcgcatctc caatcaagca acacccatat agagtcaatc 3420 cttttaaacg caaaattatg aaaacagaag tggagtacat gctcgctcat ggtttagcag 3480 agccatctca gagtgcatgg agttcgccct gccttttagt tccgaaatct gatgcatctt 3540 ttcgtttttg cactgatttt agaaaggtaa atgcagttac caaacctgac tgttttccgt 3600 tgccgaggat ggatgactgt atagacaaaa ttggacctgc aaagtttgtt acaaagcttg 3660 acttgcttaa agggtattgg caaatctcac ttagtgcacg cgctgctgaa atttctgcct 3720 ttgtcacccc agactttttt ggtcagtact ctgtgatggc gttcgggatg aggaacgccc 3780 cttccacttt ccaaagactt atgaatattg tcttacgaga catcagaaac tgtgactcgt 3840 atttggatga tattgtggtg tactcttcca cgtgggagga gcacattgaa accttacgtg 3900 aggtattttc aagattgaaa gctgcgtcat tgactgtaaa cctcgctaaa tgcgaatttg 3960 ctaaagctta tgtcacctac ttgggtaaac gtgtggggca gggtcaagtt tcccctgtaa 4020 atgctaaagt agaggctata atgaattttc ccaagccttc taacaagcga gagctgcgcc 4080 gttttctcgg catagccggg tattacagag ggttttgccg caactttgcc agcctggtgg 4140 ctcccctgac tgatcttctt cggccctcca acgtatttcg ctggtctcct gagtgtgaac 4200 tttcttttaa tggtgtaaaa gttctgctga gcaacgcacc tgttctcact gctccgaact 4260 ttgagcgccc atttcagctg gaggtagacg caagcgctgt gggagcggga gctgtccttt 4320 tacaaagtgc tgaaactggc attgtaaaac ccgtgtgcta cttctcaaag aaatttaatc 4380 gaaaccagtg taactatagc actattgaaa aagaagccct tgctttgttg ctagctttga 4440 aacacttcga agtttatttg ggtggttcaa gtttcccgat caaagtttac accgaccaca 4500 accctttggt gtttctaaat cgcatgttta atgcgaatca acggttgatg cgttgggcac 4560 ttttactgca agaatttaat cttgaaattg tgtacaagcg cgggactgag aatgtgatcg 4620 cggacgcgct gtccagagct cctggtgttg aaagctctgt gtaaagtaaa aaaaaaaaaa 4680 ttaggtgtaa tttttttttt tttttttact atgggtgggg aggag 4725 // ID Gypsy4-LTR_DR repbase; DNA; ZEB; 1266 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.05, Created) DT 17-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy4_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW LTR; Gypsy4_DR; Gypsy4-I_DR; Gypsy4-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1266 RA Dib M.R. and Naveira H.F.; RT "Gypsy4_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 955-955 (2009). XX DR [1] (Consensus) XX CC Gypsy4-LTR_DR is a long terminal repeat of the Gypsy4_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy4-I_DR. XX SQ Sequence 1266 BP; 373 A; 180 C; 259 G; 454 T; 0 other; tgtggtgcac ccaaatttgt gcaaccacat ttggtgatat aaatatatat atatattcta 60 taaatagata atactgttta tttcatttaa attaagacta gcatctttta ttaattcagc 120 aaattgttat tttgtcattt agtgcaatgt cctgcaaaaa ggaaaaaaaa aagagtaata 180 atgaatgaga cagcataggg ggaccggaac ctttaatgcg attatttgtt cccgggtgcg 240 tatactgacg gaacctccgg tttctggtta gacgtaatct ggagaaagaa gcgcatggtc 300 ctgtcttgct ttcgtctcag tgttttcatt cattttacag gtcagtactg tgttaaaact 360 gtcaaaggat aaagttttag acgttgtgta aagaatatat ttgttggaaa ctgtacagtg 420 aagatttggg taacttttgg atgagttaaa gatgaaacac tttaggtaat ggtgtaaggt 480 cgggccgcga gtaatctcgc gatgactcgc gctattatcg cgagagttcg ttggcgcgcg 540 gtcattcagt ccgtctgtag tgtaagttaa atttaaacac tttatctttt gcttgaaagc 600 aacacagatc gctccagttg atgcttaaag tctcccaata tgtgtttaca tgttcagtga 660 gctgttttaa agcagagtga attcataata gtgtgttaag gaagcatatg agagttttat 720 atataaatat atgctatttt gtttacagtt ttgataatat tattactact aatagcttta 780 tttcttcagt gttatttcat aaggtctgtt aaagtcactg tacactttca aatgcaatgt 840 cccagtttaa gtaagctaat ttcttgcaca tacagtgtat ttttgagtgt taaataatac 900 ttttgaatga tttacatgtg tgatagtact atatctcagg gaaattaatg cagtattgca 960 ctgttgtttg taactagtag gatatctact ctgatttgag tgaatgtgca gcagaaagat 1020 tttatatatc tgattggtga agctaattca tgtaaatgca ctttatttgt gtttaatatg 1080 cagtgaatat gttagagaga gaattcattt attgtaaaag atgtatttga gtttaaatga 1140 tttaatagac gtaatctgga gaaagaagcg catggtcctg tcttgctttc gtctcagtgt 1200 tttcattcat tttacagatc acaattcttt ttgttattgc ggccaccgcg cctggggccc 1260 gtaaca 1266 // ID BHIKHARI-3-I_DR repbase; DNA; ZEB; 3361 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Internal portion of retrotransposon bhikhari from Danio rerio - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW internal portion; LTR; TSD 4-bp; BHIKHARI-3-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3361 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1792-1792 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 614..2320 FT /product="BHIKHARI-3-I_DR_1p" FT /translation="MADITVEAVSQAVKAAPFLKLLFSLAQNLPVQPNQDI FT SLPQDQNKVLYLKFEELKNELADLRRDNLKLKAEICRLSSETKSLHAITES FT QEEQLEGVKTDMAALQTELVQTQTDFSAQTEELNSLRSRETRTKAEVRSAL FT RDISELREKQEMDKEDWIREMDWIVDQISEFEGARHFSPKHPESQTEHPRS FT FDSAGREPLLCSFENRKRYQGSDNVRARSGLSPSADPFSNYEVASNSHNTV FT LSQPKSPSMPPPSDHLSLLKELACDIEVFNPETPGNSIELYLKDVHFALSY FT LPDITMSDKLFILRTTTTQAVHSFIDRQSPAISNDFNELCKALAAEFTLHE FT NPYTSRLSAFKIKQGRHESPREYYERLREMYFAGKDYPDAEEDPLFKHLFV FT ANLYPTVRKQLALFNIEKTWASELRQLAIRAWDSGKVNADRKNKEYAYISE FT PGQDRKNKKFPAHLDVVSEPDQNRIPSKYKFFPQSKSFDKNAKKPSAGEKR FT QHKDRRESRQKRKRDDPGNNWNMGNGIRQSRGFESRHDDDLLSKMHNMLQE FT HLKSLNSEEHSSTDRDQVTSA*" XX SQ Sequence 3361 BP; 1058 A; 659 C; 678 G; 946 T; 20 other; cactacatat ttggtgcccc gtgtgaggcc tggcttagat tgaaattgag ccttctgttt 60 tatcaacaca atgtttatgc actaataaaa ctgcaattga ggtaacttta gcaggaaact 120 atttcagtca tcaaatttag acagtaacca gttgcmaggt tccctytaag cctgacttga 180 ttgttgtccc tgctcaaata cttggtacta tttatgaata gggtgtttat gaagtgataa 240 aagtttgtga tttgccatta agtcctgaac caactttaty atggcagaga ttttctgatt 300 tcaaaatcag taaatygtag tcttgtaatt ctgaattata actgaattgc tacttgcatg 360 tataacaaag gtttggttca ccttttgagt ctatttaaat tcagagtktc tttgtgaaac 420 cggtcaggca acgttaacac ctctgaccat tcagtgataa gtgctaaatt aatcaagccc 480 cattcagtgt gtgacaccca ttgtcttagt kgtagactag ccggcactta tgtgttaaac 540 attccacagt ttaccttggt tggggggtgc atttatttta tatagaaaac acaagtttat 600 atattaaata aatatggctg atataacagt tgaagcagtg agtcaagctg taaaagctgc 660 tccatttttg aagcttcttt ttagcctagc acaaaattta cctgtacaac caaatcagga 720 tatttccctc ccccaagatc agaacaaagt tttgtatttg aagtttgagg agctcaaaaa 780 cgaactggct gatttgcgta gggataattt gaagctaaaa gctgagatat gtagattgtc 840 ttcagagaca aaaagcctac atgctataac tgagagtcag gaagaacagc ttgaaggtgt 900 gaaaactgac atggccgctc tgcaaacaga gcttgtccag acccaaactg acttttcagc 960 acaaacagag gagttaaatt ctctgaggtc tagagaaact aggaccaaag ctgaagtaag 1020 atcagcattg cgtgacattt ctgagctgag agagaagcaa gaaatggaca aggaagattg 1080 gataagagaa atggattgga ttgtggacca gattagtgaa tttgagggag ccagacattt 1140 ttcccccaag caccctgaat cccaaaccga gcacccccgc tcctttgatt cagcaggtag 1200 ggaacccctg ttgtgtagct ttgaaaacag aaagcgttat cagggtagtg ataatgtcag 1260 agcccgtagt ggtctaagcc caagtgctga cccatttagt aactatgaag ttgcatccaa 1320 ctctcataat actgtcctat ctcaacctaa gtctccaagt atgccacctc ccagcgatca 1380 tttgtcatta cttaaggaac tggcatgtga cattgaagtg tttaatcctg aaacaccagg 1440 aaacagcatt gagctgtacc ttaaagatgt ccattttgcc ttgtcatacc tccctgacat 1500 tactatgtca gataaacttt tcattttgag gacgacaaca acacaagctg tccatagttt 1560 tattgacagg cagagtccag caatttccaa tgatttcaat gaattgtgca aagcccttgc 1620 agcagaattt actctacatg aaaaccccta tacctctcgt ctgtcagctt ttaaaatcaa 1680 gcaaggccgc catgagtctc ctagagagta ctatgagcgt ttaagagaga tgtattttgc 1740 tggtaaagat tatcctgatg cggaggaaga ccctttgttc aagcacctgt ttgttgctaa 1800 cctttacccc acagttagaa aacagctcgc acttttcaac attgaaaaga cttgggcaag 1860 tgagttgagg caacttgcaa tacgagcttg ggatagtggg aaggtaaatg cagatagaaa 1920 gaacaaagaa tatgcttaca tttctgaacc tggacaggac agaaagaaca aaaagtttcc 1980 cgcacatttg gatgtagtgt cagaacctga ccaaaacagg attcctagca aatacaaatt 2040 cttcccccaa tcaaagtcct ttgacaagaa tgccaaaaag ccctctgctg gagagaaaag 2100 gcagcataag gacagaaggg agagtagaca gaaaagaaag agagatgatc caggaaacaa 2160 ctggaatatg ggtaatggaa ttaggcagtc cagaggtttt gaaagtaggc atgatgatga 2220 tttattatca aaaatgcaca acatgcttca agagcacctc aaaagcctta attctgagga 2280 gcatagctcc acagacagag accaggttac ctctgcttga ccaggtcagg tcaccttcct 2340 ccccacctaa ttcacacctc agagagtgaa tgtcagccat agggagggrg gggggggggg 2400 ggtagcttaa aacgaacagg gccttcaaag tcagcagact tttaattgcc cctgtcatca 2460 cattccttta attcaggaaa ggagtcataa ttctttgtct cagttgaaat attccaatca 2520 gtctctaatt gtacatgcaa actgatattt tttcttcatg cctttgacaa tcctctagat 2580 ggtcaaacca acagacaaga ccactcataa actagttact tggcacctta tgtaaaatat 2640 tgcaaatcac agccaatgtt gtaaaaattg ttgctagttt cagccttgca atctttcctt 2700 ataggacatc cttccaaaga gaagaatagt ttgattgaac ctgtagtacm gctgcagaag 2760 caggcagaac cttyayattt taaacagacc ttaccatacc attagcaggg taattaagat 2820 aatggatgtc atatagaatg tgacttgacc cattagttag tctgtaaaag aatgcaaatt 2880 ttacttaatg ccaaagtgag cttygctcct gataaaatgg caacacatka catcmtgtca 2940 tatttggtac caagtgaact tgtttgggga atcggtcagt cgggcaagga aaggaactat 3000 gcataatatg caagtgcack cagtgtacat ccttgacagt scaacaccaa attatgtcat 3060 tatcaaattt ggtttcttgc taaagatgct ttcgtttcag acaaacattt ttcagtatty 3120 acttytaaaa ctyattgtca tgtaactgtt aakcccyttg cataaaattt ccatttccac 3180 ataaactctt caaatagttc ttaaattatt tggaaattca gtcatattac tgctatgcca 3240 atggtcaaaa gggaaagtta agtggtatgt tttgttcctg ttttacttta ttccattgtc 3300 cagaccaacg acttcagtcc tcgattcgaa caaggagggg ggatgtagtg atcagcggta 3360 a 3361 // ID hAT-N63_DR repbase; DNA; ZEB; 1710 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N63_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1710 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1332-1332 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1710 BP; 457 A; 296 C; 307 G; 650 T; 0 other; tagcgctgtc aaaattaacg cgttaacgca tgcgattaat ttgaaataat taacgcgtaa 60 aaaaaattta cgcaattaac gcagtttttt tttttttttt cctgttgtgg ccaggggcag 120 acgccgggca gtttcccgct gtttcgacgt actttttggg ccgggctgat catcttctta 180 tgatctgatc ggcccataaa acactcggca gactgtcatg tttttctgcc tcagtgattg 240 acggtgctgt gtgagctgtc acgctctgaa agtaagatgt ttcttttccg gacagagcgg 300 ccgtgtgaca taatctgcag cccattggtt cttttcttta ttgacattgg gctgacccaa 360 tcacaaactt ccatgttggg ctctgcatgt tgtaacgtag gtgtgtattt gtcaaaatag 420 ccgagagtca cgcctgtttc tcacgcgcgg gtccgcgccg cggcggagag caggcttttg 480 cgtctgcact ctgcagcggg gcacagatca atatattcct gtctattcta tctagttacc 540 tatctgtcca tataaatata ctttttttat acttttaatc tgcaccacgg cccgcggcaa 600 cttcctccta taatgctgtt tccttaacct gcaagtcttt attttacaaa ttataaagaa 660 tgtctgcttc tcaagctcta tgaggggtgt cattcatgta ttttaagatg cactcggtca 720 gaagacaagc gcatgagata tcatgacatt gtgtttttga tcgtcagcat gatgtaggct 780 tatagtgaca ataatattta tacaatatgg ttattataat gatatttata catgaccaaa 840 ctagtgtgca acttaagcaa cttttttttt acatttagtt ttgtagttct gcccaaacaa 900 atatttgttg cattttttaa ttgtttaagt tcatttgatt gacaatgtaa aagcttttag 960 ctacgtttga tgtttcattg ttgagttaaa catttttttt agggtctctg acacattgct 1020 tttattattt atctttattt tattaatcaa cattaactgt gtgcgtcagc aggcagtttt 1080 tgttatgttc attatttatt aggctccctc aaaacacaca ggcacccaca tttccagtgt 1140 ggattataat tttattttaa taataaataa aaaaaaatgt agttttttaa tttgtcaact 1200 ctcgttattt cctatatatg gcttttgtgc ttttatagaa tgggagttgg taaacctatt 1260 caagggcatg cacagatttt caatacttta ttattattat tataattttc catatctgca 1320 acgcctgtat tttggcattt ttttgcagtc cacttagaat ccaacatgga aatcaatgtt 1380 ttctttattg gcattgattg ttttgaaatt caaatggcat taacatgcct gtgtttttat 1440 ttctgtaata aatatggctg tcaagccagg atcttgatgg tttattgtgg gtatgttgtt 1500 tacatgaaga aatctgtgtt acaagttaaa caaaaattct aataatgaat tatattttga 1560 atttaaatag tatttgtctt gcgtttacat taattttaca ctcgaatagc caaaattaca 1620 agtttcagtc ttttaaatgc gattaatcgc gattaatttt tttaaaagtg cgattaatta 1680 gttaattttt ttaatcgatt gacagcacta 1710 // ID BHIKHARI-4-LTR_DR repbase; DNA; ZEB; 1709 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR of retrotransposon bhikhari from Danio rerio - a consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; LTR; KW BHIKHARI-4-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1709 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1776-1776 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1709 BP; 493 A; 386 C; 290 G; 539 T; 1 other; tgtaacgaat atggccgacc agacacataa atattttgca ttttattcca caaccgagcc 60 aactttgccc agagaagact cgaaacctga ttccaaggta tttacgaccc aaaggaatgt 120 gccatgcggc attaaacaca ggaatgactt acaaaggact ctttaaactg cttgtttaac 180 actgatgttt tcatataaac tttactctct ttatgtttaa tcatatgttt tgcaaagttt 240 ccctttctct tgtattaata tttttggttt taatacccct gcatataatg tttaactgta 300 cctaagctaa acaacctctt atactgttca agtttggact tttcgaaaag cttactcgct 360 gtacaaattg atcgttgaca ttttatttta atatgagtgc aacatattaa cgttttaaga 420 gactttaaca tttctgtccc gatttgcata aaatttgcct atgcaaattc ttctgcctcg 480 gacaaagagc cgtaaattta atattccagg aagttagctt ctaattggtc aactttccat 540 ggagggtgga tccgaacccg ccctataaaa gtttaggacc aagctaaact acgctctctc 600 tgttcaaacg ctcttgggat ctctcttcga actctctctg ttcaaactct ctctctgttc 660 aacctctctt gggacccctc tctgctctga aacacgctgc ctggaagctc cccctctctc 720 tcccaactcc aagcaacccc cctctctgct ctggaaacac gttgttttga aaccttcttc 780 aaactcactg caacttcctc acaaacctaa gtattgaaac cttcaagaaa gcaggcggaa 840 acccagacgg aaagggttaa tcgcaccgcg ctcggacatt tcactgtcca gaatgccttt 900 tgcggcttca acttgaaaca ccgactcccc cactacaacc ctcgtgagaa acaaaggatc 960 agccaaccaa ccacgtgctc aacaacggga gctcggacat cacgaacaga ccgcaagtat 1020 cacctttctc ccaacatcta aactggtgtg aactaaattc aaccctaaag agacaataga 1080 agggttatat tgaatttgat tacgtttggt ttgcttggtt tgttcttaca aactcagagg 1140 ttttcccatc attacatctt ctcaacctct tacctcacgt ttgttactct tccctatgct 1200 tgtatgaatg tgtgtttgtg ttaggtgtta gtttctgtta gattagtcaa taaagtttaa 1260 ttttgtataa agaaaggtgc ctgtgtatat taatgaatct aatgtcttga actttgatct 1320 tgctaccctg ctcaaatttt atgttgatag catcgctgaa cgattcgttt gttcggatca 1380 aaagagtcga ttcatttgaa gccccgttca aatgaatctt cctcccgcta agctagtccg 1440 ctatataatg ttccagaagg tacgcgagca ttgaaatcta aatcataact cgcaacatta 1500 taacggcggg aaaggcgcgg gaagaataac cttgagctaa tttactacag taataatggt 1560 gtagatatgc gagcacttta atctacagta taatttgtca cagattatta atgtggatat 1620 gcgtgcaagt taatctayat tattattatg atgtgctacc tgtttatttg tggagaataa 1680 acaaagcatg aaatatgtaa tttattaca 1709 // ID TDR24 repbase; DNA; ZEB; 444 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR24. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-444 RA Jurka J. and Drazkiewicz A.; RT "TDR24: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(3), 17-17 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. CC Several hundred copies in the genome. XX SQ Sequence 444 BP; 168 A; 64 C; 75 G; 127 T; 10 other; taatggaccc ttttcacatt tacaggtttc tcartagcag aagttntcat agttgggtaa 60 acttngagcg cagtgaatgg gagagtacaa caaatatttt ttttacaatc ttatttgctg 120 aaataataaa agaaaaactc cacgatggtg ttatcaagac tttyagaaaa aggaaaaaaa 180 ttgtgtgaga ctgataacgg cagccagaag agagaanaac gtcaaattta ctgtcctgcc 240 caatagacnc tgcattagaa acacctcgca caagcggtaa ttagtttttt gtaatttgat 300 gcaaagatga tataatayat acataaatac atataaatgt acatacattt twnaaaaaaa 360 atctaaatat taaaaacttt caaggattta agatgctgat gaccgttaaa crtgtcataa 420 cagtcttgtg aaaagggtcc atta 444 // ID Gypsy38-I_DR repbase; DNA; ZEB; 5385 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 28-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE An internal portion of the Gypsy38_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; Gypsy superfamily; integrase; Gypsy38_DR; Gypsy38-LTR_DR; KW Gypsy38-I_DR. XX NM Gypsy38-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5385 RA Dib M.R. and Naveira H.F.; RT "Gypsy38_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 806-806 (2007). XX DR [1] (Consensus) XX CC Gypsy38-I_DR is an internal portion of the Gypsy38_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy38-LTR_DR. CC Gypsy38_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1525-aa polyprotein CC Gypsy38_DR1p (pos. 594-5168) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 594..5168 FT /product="Gypsy38-I_DR_1p" FT /note="ORF." FT /translation="MSSELDIFFESPNEESFDKLTKEQLIELANKYDVHLT FT TKDKKLKESIAVVVKAGLVRLGLFEAEKSVVGEEREIFTPSSKYETSLSFE FT QQKELMLLELEKAKINGELEMRRLEVEALRFRLIGEGKLGESANSAIVSAK FT KGLDVTNNIKLLPKFSETDVDTFFRLFERLSTSMAWSEQEQTMMLQCVLVG FT KAQKAYAALSAVDSGSYSKIKEAVLKAYELVPEAYRQRFRNIRKSNAQTYS FT EFVSELKLQLDRWCSASEVKTYENLYELVLLEQFKATLPEHVVLFLTERKV FT KSAEDAAVISEYVLTHKVERMGSGVKNKLRSDVFGESAALRPSAFEMSKTE FT KADKALKPYGDNNDRCAYCHMKGHWKKECLVLKGKLSRVKSEQKAVLTVSS FT VQTDENANDDPVVLQKPVMMVRAANESYSPFITDGYVSLKENAEKTPVKIL FT RDTGASESFILESALPFSQETSTGSNVLIQGIGLNTMSVPLHRLFLQSELV FT SGPVVLGIRPSLPVEGVSVILGNNLAGDRVWPDVPPPPVVTTTPVLGETDI FT STDFPETFVSCAFTRAMRKQGIDEQELLEREIPVTSNKSVPGFIAVPSITR FT HDWEVAQCEDVSLKPLFDVALSQDAAESSSSCYFVHNNVLLRKWTPNKEED FT LGGAVIQVVVPVSLRDAVLAAAHGGMSGHVGVNKTYQHLLQYFYWPRVKSD FT IRRYIKICPTCQKTGKPNQTLKPAPLYPIPVLEPPFQHLIVDCVGPLPPSK FT SGSKYLLTVMCQSTRYPAAYPLSSITTRSVVKALSQFISIFGIPKIVQSDQ FT GTNFTSKMFSEVLQQLGIRHNKSSAYHPESQGALERFHQTLKSLLKAYCTE FT LKGDWELGLPWLLLAAREFVQESLGFSPNQLVFAHSVRGTLSVMTDSVVPN FT EPPQSLLKYVLGFRRRLLLAGELAKEKLEKAQKKMKGWFDRKSAVRKFSSG FT DQVLALLPLPESPFCAQFSGPYTVLRSVSDQNYVLSTPERRKSSQLCHVNL FT LKPYYSRDGIETSKSPVMLANTTLVSESENDVKIPDDAILHPRLNNSESLK FT RLDELLKHLPEKHCAELTSLLLAFSNLFSDTPTRTDVIEHDIDIEGSKPVR FT QRFYRVSLDKQKKLEAEVNYMLQNNIAKPSFSDWASPCLLVGKPDGSQRFC FT TDYRKVNAITKPDSFPLPRIEDCVDQVGSASYVSKFDLLKGYWQVPLTPRA FT QEISSFITPFGLFSYSVMSFGLRNAPATFQRLMNRVTSGLEGCAVYLDDIV FT IYSDTWDQHLTRIRDLFTRLTAANLTVNLAKCEFARATVTYLGKVVGRGEV FT RPVRAKVLAIDNFPPPETKRELMRFLGMVGFYRSFCSNFSSVVAPLTDLLK FT SKVKFDWTKKCEDAFENVKRMLTSSPVLAAPRLADPFKLQVDASHIGAGAV FT LLQADENGIDRPISYFSRKFNSYQLNYSIIEKEALALIWALQHFEVYLTSG FT ITPIVIYTDHNPLTFLHSLQNPNQRLIRWSLFLQPFALDIRHIKGVDNVLA FT DTLSRAPYG" XX SQ Sequence 5385 BP; 1521 A; 920 C; 1251 G; 1693 T; 0 other; aatgggggct cgtcctcaag taacttggga accttgatta aaaacacctg ttagatttcc 60 atgtggaagc tccaggagtt ttgttagtgt gcggtggcgt ttgttttgtt ttgttttgca 120 gtcccttgct ttctcttgca ttgctttgag aaatttgaaa atagcatcgg aagtgggtaa 180 gtttattttg atatctatgc tatcacgaat gtgtttctat ttagcccgat gatatttggg 240 agtgtagttt gaggtgatag tgtatttgat cacaaccgtt tgatgtttct aacacagggc 300 tgatatcccc atgttaggca tagcaacgct ttgcgttggt ttttgttgtt ttagtttgtt 360 atttttgtgc taacgccgag ttaagagcac tgccgctttt gtcggtctag tggtttgtcc 420 gtagatttag acatctgccg gtatttgagt gcttaaagat acttgactct tggtgaaagc 480 atgaagatta gggtgagtta gaatagggaa aaataacaaa agcatagggg gataatcagg 540 ttgtgtttga aatttataga aattcgtttg aaaaggaaaa aataaaggga aaaatgtcct 600 ccgaattaga catatttttt gagtcaccaa atgaagaatc atttgacaaa ctaacaaaag 660 agcagttgat agagttggcg aataaatatg acgtacactt gacaacaaaa gataagaaat 720 tgaaggaatc tattgctgta gttgtaaagg caggtttagt tcgtttaggg ttatttgaag 780 ctgaaaagtc tgttgttgga gaagaacgtg aaatatttac accgagttca aaatatgaaa 840 catcattgtc atttgaacaa caaaaagaat taatgctttt ggagttagag aaagcaaaaa 900 ttaacggtga actcgaaatg cgcagattag aggttgaagc actgcgcttt cgactaattg 960 gtgagggaaa gttaggtgaa agtgcgaaca gtgcaattgt gtctgctaaa aaaggtttgg 1020 atgtgacaaa taatattaag ttgttaccga aatttagtga aacagacgtt gacacatttt 1080 tcagattgtt tgaaagatta agcacatcga tggcctggtc agagcaggaa cagactatga 1140 tgcttcagtg cgtacttgtg ggaaaagcac agaaagcata tgctgcttta tcagcggtag 1200 atagtgggag ttatagtaag atcaaagaag ctgtgctaaa agcttacgag ttggttcctg 1260 aggcttacag gcagagattt agaaatattc gaaaatctaa tgcacaaaca tattctgaat 1320 ttgtgtctga actaaaactt cagttagatc gttggtgctc tgcgtcggaa gttaaaacgt 1380 atgagaattt gtatgaacta gtcctgttgg aacaatttaa agctacgctt cctgaacatg 1440 ttgttctttt tttgacagaa cgcaaagtga agtctgcaga agatgctgct gttatctctg 1500 agtatgtact cacgcataaa gttgagagaa tggggagtgg cgttaaaaat aaattaagat 1560 ctgatgtttt tggcgaaagt gcagctttac gaccttctgc ttttgaaatg agtaaaactg 1620 aaaaggcaga caaggcttta aagccttatg gtgataacaa tgaccgctgt gcgtattgtc 1680 atatgaaggg tcattggaaa aaagagtgtc ttgtactaaa agggaaattg tcacgtgtaa 1740 aatctgagca aaaagctgtc ttgactgtat cttcagttca gactgatgaa aatgctaatg 1800 atgatcctgt tgtgttgcag aagccagtga tgatggttag agcagctaat gaaagttatt 1860 ctcctttcat tacagatgga tatgtgtcac ttaaggaaaa tgctgagaaa acacctgtaa 1920 aaattcttcg tgatacgggt gcatctgaat catttatttt agaatctgct ctaccttttt 1980 cacaggaaac ttcaaccggg agtaatgtgt taatacaagg aattggttta aatacaatgt 2040 ctgtcccttt gcataggttg tttcttcagt ctgaattagt gagtgggcca gtggtgttgg 2100 gaattcgtcc ttctttacca gtagagggag tgtcagttat tttggggaat aatctggcag 2160 gtgatcgggt atggcctgat gttccaccac ctccagtggt gacaaccact cctgttttgg 2220 gtgagactga tatttctaca gattttccag agacatttgt gtcatgtgca tttacacgtg 2280 ctatgcgaaa acaagggatt gatgaacaag aattgttgga gagggaaatt cctgtgactt 2340 ctaataaatc tgttccaggc tttattgcag taccctcaat tactcgtcat gactgggaag 2400 tggctcagtg tgaagatgtt tctctgaaac ccttgtttga tgtggctttg tcccaggatg 2460 cagcagaaag ttctagttca tgctattttg ttcacaacaa cgtgctgcta cgaaaatgga 2520 ctccgaacaa agaggaggat ttgggtggag ctgtgataca ggttgtagtt ccggtttctt 2580 tgcgtgatgc cgtattagca gctgctcatg gaggtatgtc tggtcatgtg ggggtgaaca 2640 aaacttacca acacttgctg cagtatttct attggcctcg tgtaaaatct gacataagac 2700 gatacattaa aatatgccca acatgtcaga agactgggaa acccaatcag actttgaaac 2760 ccgctccttt gtatcctata cctgttttgg aaccaccttt tcaacatctg atagtggact 2820 gtgtagggcc attaccacct tctaagtcag gaagtaaata tttactaaca gtgatgtgtc 2880 agagcactcg atatcctgca gcatatcctt taagttctat tacaactaga tctgtagtga 2940 aagcgttgtc acagtttatt tcaatctttg gaattccgaa gattgttcag agtgaccaag 3000 ggacaaattt cacatcaaaa atgttttctg aggtgttaca gcaattagga atacgtcata 3060 acaagtcaag cgcttatcat cctgagagcc aaggtgctct ggaacgcttt caccagacat 3120 tgaaatcttt attgaaagcc tattgcacag agttgaaagg agattgggag ctgggtttgc 3180 catggttgct attagcagcc agggagtttg ttcaagaaag tctgggtttt agtccaaatc 3240 agctcgtttt tgcacactct gtacgaggaa ccttgtctgt aatgaccgat agtgttgtgc 3300 caaatgaacc acctcaaagt ttgctcaaat atgttttagg ttttcgaaga cgtttgctgc 3360 tggctgggga actagccaag gaaaagctgg aaaaagccca gaagaaaatg aagggttggt 3420 ttgataggaa gtcggctgtt cgtaagttta gttcaggaga tcaggttttg gctttgcttc 3480 ccctgcctga gtcgcctttt tgtgcacaat tttcaggacc ctacacagtg ttgcgatcag 3540 tgtctgatca gaattatgtg ttgtctactc ctgagcgtag gaagtcatct cagttgtgtc 3600 atgtgaatct attaaagcct tattatagta gagatggaat tgagactagt aaatcaccag 3660 tgatgttggc taacactact ttagtgagcg aatctgaaaa tgatgttaaa attccggacg 3720 atgctatttt acatccccgg cttaacaatt cagagtcttt gaaacgctta gatgagttgc 3780 tgaaacatct gcctgagaaa cattgtgctg aactgacttc actgttactt gcattttcta 3840 atctattttc agatacgcct acccgcactg atgtaattga acatgacatc gacatagagg 3900 gatcaaaacc agttagacag cgattttacc gtgtgtcttt ggacaagcag aagaagctgg 3960 aagctgaagt aaattatatg ctgcagaata acatagcgaa accttccttt tcagattggg 4020 cgtcaccctg tttgcttgtt ggtaagcctg atggctcgca acgattttgc acagattata 4080 ggaaggtaaa tgcgataaca aaaccagatt catttccttt acctaggatt gaagattgtg 4140 tggatcaggt gggttctgcc tcctatgtga gcaaatttga tttgctcaag ggatattggc 4200 aggtaccact gactccacgt gcccaagaaa tatcttcatt tattacacct tttggccttt 4260 tttcttattc tgttatgagt tttggtttgc gaaacgctcc agctacgttt cagcgattaa 4320 tgaacagggt aacatcagga ttggaagggt gcgctgttta tctagatgac attgtcatct 4380 atagtgacac atgggatcag catctaactc gcattcgtga cttattcacc cgtctgaccg 4440 ctgcaaatct cacagtgaat ctagcaaagt gtgagtttgc tagagccacc gtgacctatt 4500 tgggaaaggt agtgggcagg ggggaagtcc gacctgttcg agcaaaggtt ttggccattg 4560 ataattttcc acctcccgaa acaaagagag aattgatgcg atttttaggt atggtagggt 4620 tttaccgaag cttttgttcc aatttttctt ctgtggtggc tcccctcaca gatttgctta 4680 agtcaaaagt aaaatttgat tggacaaaaa agtgtgaaga tgcgtttgaa aacgtaaaga 4740 gaatgctaac ttcatctcct gttttggcag ctccacgact ggctgatcca tttaaacttc 4800 aggtggatgc aagccacatc ggggctggtg cagttttgtt acaggctgat gaaaatggta 4860 tcgatcgacc tattagctat ttttcacgaa agtttaactc gtaccaacta aattattcaa 4920 ttatcgagaa ggaagcttta gcattgattt gggcacttca acattttgag gtttacctga 4980 cttctggtat caccccaatt gtgatatata ccgatcataa tccccttacc ttcttgcatt 5040 ctctacaaaa tccaaatcag cgtctgattc gatggtctct ctttttacaa ccatttgctc 5100 tggatattcg tcacataaag ggcgtagaca atgtgttggc tgacaccttg tctagagccc 5160 cttatggcta gtggtttgaa tgtgtttctt ttttttagtt ctaggatgtg tggctttccc 5220 ttaaattgct gcctccagat attgggagat tgctggtgat tgctcaacag ttttctaaga 5280 aaaaggaaag tttcattttc attttgtttt tgttttaatt tatttgggag agtttataaa 5340 gttaaggaca gagattatta taatctcttt cttttgaggg ggagg 5385 // ID Gypsy62-I_DR repbase; DNA; ZEB; 4293 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy62-I_DR; KW Gypsy62-LTR_DR; Gypsy62_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4293 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1439-1439 (2008). XX DR Genome; chr14; Positions 6791700 6787408. XX CC Positions [1667-2122] - Reverse transcriptase CC Positions [3140-3619] - Integrase core CC 'GGTAG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 68..4294 FT /product="Gypsy62-I_DR_1p" FT /translation="MAHQQQLARLNSLTEELVKALQGLNITPSPSPENPAP FT TVPVAPMSTIPAINPPLPFPDKFDGNPSRCKGFLMQCTLFVTQQPNLYATD FT NSKIACVCSLLTGRALDWATAVWHNQTSTFLSFNEFVKKFREVFDHPEGGR FT DAGEELLALQQGNLTAAEYALQFRTLAAQAGWTEKPLRVLYRKGLKTELQT FT ELACRDEGKTLDQLIEISIRLDNLIRSRTPGTSIRSFTTPSYSSEPEPMQI FT GRTQLSPEERERRLRNNLCLYCGQSGHLKSSYQARPPSKPLPVSSANILSV FT NNQCLEITTTISFNQQRMSVPAMIDSGAAGNFMDHNFARINNIPLTSCCSP FT LTVTALDGRPLGKGKVQYTTQPIILQAGAIHSETLYFFIVDSPNHPVILGL FT PWLQKHDPVISWKNREIIQWNEVCFQECLTPALPLQLNAVNITYPESYSRI FT PSVYQDLFEAFNEQKATQLPPHREYDCAIDLLPGSTPPRGRIFPLSQPESE FT AMKQYIKEELSKGFIRPSTSPASAGFFFVEKKDGGLRPCIDYRNLNAITCK FT FRYPLPLVPAALEQLRTAQYFTKLDLRSAYNLIRIRPGDEWKTGFSTCTGH FT YEYLVMPFGLVNSPSVFQSFVNDIFRDMLHKWVIVYIDDILIYSDSLEDHI FT THVRAVLKRLIDNKLYAEVEKCEFHQTSISLLGYVISQEGVAMDEQKVNAV FT LKWPKPSTVKELQRFLGFANFYRRFIRNFSTIAAPLTSLTKRSGKQLKWNT FT TAELAFIHLKDRFTTAPILSHPNPDKPFIVEVDASNTGIGAILSQRQDASS FT VLHPCAYFSRKLNPAERNYDVGNRELLAIKAALEEWRHWLEGAKHEFIVMT FT DHKNLEYLRTAKRLNPRQARWALFFSRFQFSVTYRPGSKNTKADALSRQFE FT DTPSEPNNEPILNNTLILAPVQWDISTEIRQAQTQEQTPPNCPPDKTFVPR FT SIRDKVLKLVHTSPCSGHPGIRGTLQLIQNQYWWPSIQTDITKFVHKCTTC FT NMNKSSHQRPAGLLQPLAIPNRPWSHIAVDFITNLPPSQGNTVILTIVDRF FT SKACRLIPLPKLPSAMETAEVLCNYVFRFYGLPEDIVSDRGSQFTSRVWSA FT FFKNLNINISLTSGYHPQSNGQTERLNQEVSRFLRTYCHNNQTDWSRYLLW FT AEYAQNSLRKTSTGLTPFQCVLGFQPPLFPWSGEITEIPAVNDWMQRSEAT FT WNLAHHHLQRAVRRFKEQADRNRRRNPTYAIGDWVWLSTRDLRLKLPSKKL FT SPRYIGPFKIIKQITPVSYRLLLPNHYRISPTFHVSLLKPAASTVEAEREV FT AVSEQTQEPLIIDGEEAYQIHQILKSRRRGGVLQYLVDWEGYGPEEQSWVD FT RKDILDPTLLLEFHQHHPDQPAPRPRGRPRHHQFPRIRSCSQGGGS" XX SQ Sequence 4293 BP; 1235 A; 1120 C; 880 G; 1058 T; 0 other; gaagacttcg ccacccaccg atccagcagc tattgtacaa gaattatccg cgcagggtca 60 ggttctgatg gcccatcaac aacaactagc acgcctcaat tctctcacgg aagaactggt 120 aaaggccctc cagggactca atattacacc atccccatct cctgaaaatc cagcgcccac 180 cgttcctgta gctcccatga gtactattcc ggccattaat cctccactgc catttcctga 240 taaatttgat ggcaatccat caagatgcaa gggatttctt atgcaatgta cgttgtttgt 300 aacacaacaa cccaatttat acgccacaga taatagcaaa attgcatgcg tttgttctct 360 gctgactggt agagctttgg attgggccac ggccgtctgg cataatcaaa catcaacttt 420 cttgtctttt aatgaatttg taaagaaatt tcgggaggta tttgaccacc cggagggagg 480 aagggacgct ggagaagagt tattggcgtt acaacagggg aatctcacgg ctgccgaata 540 cgccctacaa tttcgcactt tggctgctca agcggggtgg acggagaaac cgctaagggt 600 gctttaccgt aaagggttaa aaacagaact tcaaacggaa ctcgcatgcc gtgatgaggg 660 aaagacattg gatcagctca ttgaaatttc cattcgcttg gacaatctca tccgctcccg 720 cacgcctggc acatctatcc gctccttcac gaccccatca tattcaagcg aacccgaacc 780 catgcaaatt ggccgtactc agttatcccc ggaggaaaga gaaaggagac ttcgcaataa 840 tctctgtcta tattgcggcc aatctggtca tttgaagagc agctaccaag ccagacctcc 900 cagcaaacca ctcccggtga gttctgctaa tatcttatct gtcaataatc agtgtttgga 960 aataacaact actatatctt tcaaccagca acgtatgagt gttcccgcta tgattgactc 1020 cggagctgcc ggaaatttta tggatcataa ttttgctcgc attaataaca tcccactaac 1080 ctcctgttgt tctccactca cagtcacggc gttagatgga cgtcctttag gcaaaggaaa 1140 ggtccagtac accacacaac ccatcatcct acaggctggg gcgatacatt cagaaacttt 1200 atacttcttc atagttgact ctcccaacca ccctgtcatt ttaggattac cttggctaca 1260 gaaacacgac cccgttatct cctggaagaa cagagaaatt attcagtgga atgaggtatg 1320 ctttcaagaa tgtctaactc ctgcattacc gctacaactt aatgctgtca atataactta 1380 tccagaatcg tactcaagaa ttcccagtgt atatcaagat ctatttgaag cattcaacga 1440 acaaaaggct acgcaattac cacctcaccg tgaatatgac tgtgctatag atttattacc 1500 cggaagcact cctcctcgcg gaaggatctt tcccctttct caaccagaat ccgaagccat 1560 gaagcagtac attaaggaag agctgtccaa gggctttatc cggccatcca catcccccgc 1620 ctcagccggc ttcttctttg tggaaaagaa ggatggagga ttacgccctt gtatcgacta 1680 tcggaacctc aacgcaatta catgtaaatt ccgctatcct cttccccttg taccagcggc 1740 tttggagcag ttaaggacag ctcaatactt cactaaactt gatctccgga gtgcttacaa 1800 tttaatccgg atccgtcccg gagatgagtg gaaaacgggg ttctccacat gcaccgggca 1860 ctatgaatat ctcgttatgc ccttcggcct agtaaacagt ccttctgttt ttcaatcatt 1920 tgttaatgat attttccggg acatgttaca caaatgggtc atagtttaca tagatgacat 1980 tctgatttat tcggattccc tcgaggatca catcacccat gtcagagctg tgttaaaacg 2040 cctgattgat aacaagctct atgctgaagt tgaaaaatgt gaatttcatc agacctccat 2100 ctctttactg ggttacgtca tcagtcagga gggcgtggcc atggacgaac agaaggtaaa 2160 cgcggtatta aaatggccta aaccaagcac tgtaaaggaa ctacagcgat ttttggggtt 2220 tgccaacttc tatcgccgat ttataaggaa tttcagtacc attgcagccc ccctcacatc 2280 cttgaccaag agatcaggca aacaactgaa gtggaacacg acggcggaat tagcgtttat 2340 acacttaaag gatcgcttca ctacggctcc catcctgtca cacccaaacc cagataaacc 2400 ctttattgtg gaagttgatg catccaatac gggcatagga gctatactgt cacaacgtca 2460 ggacgcaagc agcgtactcc acccctgtgc ttacttctct cgcaaactca acccagcaga 2520 gcgcaactat gatgtgggga accgagaatt gctagccata aaagcagccc tagaggagtg 2580 gagacattgg ctggagggag cgaagcacga gtttattgtc atgacagatc acaagaactt 2640 agaatacctc cgcacagcaa aaagacttaa ccccaggcag gcaaggtggg ctttattttt 2700 ctcccgtttc cagttttcag taacctatcg ccccggttca aagaacacta aagcggatgc 2760 cctgtcgcga caatttgaag acacaccatc tgaacccaac aacgaaccta ttctgaataa 2820 cacattgatt ctagcacccg ttcagtggga catctccacc gagatcagac aagctcaaac 2880 acaggaacag acaccaccta actgccctcc tgacaaaaca tttgttccgc gaagcatcag 2940 agacaaagta ttaaaattag tccacacaag tccttgttca ggtcatccag gtataagggg 3000 cacattacaa ttaatccaaa atcaatattg gtggccgtct atacagacag acattaccaa 3060 gtttgtccac aaatgcacaa cttgtaatat gaacaaatca tcccatcaac gtccagcagg 3120 attactacaa ccattagcca tcccaaatag accctggtca catatcgcag tggatttcat 3180 caccaatctc cctccatcac aaggtaacac agttatatta acgattgttg acagattctc 3240 gaaagcatgc agattaattc cccttcctaa acttccatct gccatggaaa ccgccgaagt 3300 actgtgcaac tatgtgtttc gtttctatgg tttaccagaa gacatagtct ctgacagagg 3360 atcgcaattc acatcaagag tctggtcagc attttttaaa aatctgaaca tcaacattag 3420 cctgacgtca ggatatcacc cacaatcgaa cggacagacg gaacgactta atcaagaggt 3480 atctcgattt ctaagaacct attgccacaa caatcagact gactggagtc gttatttatt 3540 atgggcagaa tacgcacaga attcgctgcg caaaacttcc actgggctaa ccccgtttca 3600 gtgtgtgctc ggatttcaac ctcccctatt tccctggtcg ggggagataa ctgaaattcc 3660 agcagtaaac gactggatgc aaagaagtga agctacgtgg aatctggccc atcatcattt 3720 acaaagggct gtcagacggt tcaaggaaca ggcggaccgc aatcgccgtc gcaacccaac 3780 ttatgccata ggagattggg tatggttatc cacccgggat ctccgactga aacttccaag 3840 caagaaactc agccccaggt acatcggtcc atttaaaata attaaacaaa tcactcctgt 3900 atcctatcgt cttttattac ctaaccatta tcgcatttct ccaacctttc atgtctcact 3960 gcttaagccc gctgctagca cggtcgaggc ggagagggag gtggcagtca gtgaacagac 4020 ccaggaaccc ctcatcatcg atggcgagga ggcatatcaa atccatcaga tcctgaaatc 4080 caggcgtcgg ggtggtgtac tgcaatatct agtggactgg gaggggtacg gaccagagga 4140 gcaatcttgg gttgacagga aagacattct ggaccctacg ctactgctcg aatttcatca 4200 acatcaccca gatcaaccgg cccctcgtcc tcgtggtaga ccccgacacc atcagtttcc 4260 tcgcatcagg agctgctcgc aggggggagg ttc 4293 // ID ERV1-N1-I_DR repbase; DNA; ZEB; 5517 BP. XX AC . XX DT 22-OCT-2008 (Rel. 13.1, Created) DT 22-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, internal DE portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N1-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5517 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1596-1596 (2008). XX DR [1] (Consensus) XX CC The TSD flanking the LTR is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5517 BP; 1726 A; 828 C; 1213 G; 1749 T; 1 other; gttttggtgg aggacgcgtg gcacttgttt actggtaaca ccattgatca atcaacaaaa 60 tgaaggtagg aagatttaaa aattataaga ctagggattg tctgtgaagt agtgggtagt 120 aacggaacgt ttttggttct catttagacg taaatgaatg ttgttggttc tcattcagac 180 gtgaatgaaa ggcgccggtg gattcacaga attttgtggt agattgctgt ggaatccctt 240 agcgagctgg tgcatttttg tggtacacct atagcgatac tgataagtgt aacagtagtt 300 gagaggtgta ccttctcctt cacccggagt tagagtactc atttgatggg tgtttaggag 360 gcttttggga cattctcggg cataaaacct ggactggttt tctatttagt agtggactaa 420 attaacccga gattgtaatt gagttccacc actgctatct aactgcatca cagacaaaag 480 gcatgccata agaatagtgt aggtcagatt attagggaaa gaaataattg aataaaaata 540 acaaataaca ttataatagg gatccattag atagtagtgg ccacaaaagt cagacaagat 600 acctagaaat atcagggtga ggttgattga tcaatgatgc accaggagca atcccgagaa 660 aattatgacc ctaactggat tggaccggtc tggacagttt gcacatcttg ggatggtttg 720 cagattcatc aagcagaagc caagcgctgt tggtaagagt gccctctcaa tttcttcttg 780 cttttggtgt tgaactcaga tataaaaaga aatagatgag agcagaatgc agtaaatggc 840 tcagctaagc tctcaggttt aaaatgattt tagaactgat tttaagaaaa aaaaaaaaaa 900 aaaagtattc ctttggcatt taattgtcat tgaaaagcta aaatctaaat attgaggggc 960 tagtagcaaa aacgtaagca cagaaactga cagtaacatt atgggaataa agaatagctt 1020 atgttttgtc ttagaagata agattctgtg tgactgcact gctatcttgg tttagccctc 1080 aacatgactg gttttggctg taaaagtaac agagacatgt ctgaataaaa atatgccatg 1140 taagtctgtg caaacaagca gcttcatggt tggatttata tatttttaat aatatatatg 1200 taacataaat gggggtttca aataatactt tgatggatgc agttaacccc tgaaagatct 1260 aaaactattg ttgactgtaa cagcattact ttgaaaatgc atgtcaactc tgttttatat 1320 gttttaggac gggcttgagt aaaaagatat tacaatgaaa ttatgttatt gaagttaatg 1380 ttatgtttaa tgtatgttta aatttagtaa acatgcattt tcgttctaga ttgctcaaaa 1440 ctctctttct ctctcattag atgtcagcta agagctgcta tctgcctgtt ccataacaat 1500 tgataagcac agtgctcagt tggaaatcac aaaacacaga tgtaaaggaa atgctagaat 1560 gaatacagaa attctacatt tcaatttgta tacactaata gaggtattag taattaaaaa 1620 tcaaagttaa caaataagag attgcaaaat gcttttgttt gttttcccag tgatgggttg 1680 cggctggaaa gggcaccagc tgcataaaaa cgtgctggat aggttggcag ttcattccgc 1740 tgtggcgacc ccggattaat aaagggacta agtgaacaga gaaatgaatg aatgaaaatg 1800 cttttgttaa agcatcttga aaaacactaa tatacccgct gagctataga gataattaat 1860 ttgaacagtt gttattactg tttggaaatt tcttgtcttg tcttcagtgg tttattactg 1920 tttgtaactt ttgtcttatc tttagtctcc accatcacat acagggcctg atggctatac 1980 tgaggtagca gaccctgaca ctgaatggat cagggtgata tctaaactcc ctgaataatc 2040 tcagcttgat attcagttga tttaagtgtt tgattaacct gtctcaatgt ttgaattcta 2100 gtaggtagtc tcaatggctt tcatggttct tgactgcact ttcccaagtt tgcagaaaat 2160 tctgaatcaa ggaggaagcc tgcacaggag cttgcagtgt gagtggcagt tgaccggtgg 2220 gcacgaaggg gggttcccta agacgggtaa gattttccct tggccaaagc ggaaaacaac 2280 ctaaggcatg ggacccctcg taaaccccgg ggcatctacc cgaaaagggg gatgtttaat 2340 gtgaaagatg gtgctggact ggggtgatgc tctatcggcc taaagcatcc ttagagggga 2400 actgtttgaa gttacagtag tctgcctcag tggccccgtc cagacctgat cacgagaatc 2460 ctgcctttat tgaggacatg tctgtccttg tgacaagcat caacggacta tgccagggta 2520 tgataccctg aaggagatga ttggtaaaaa gataatggca atgggctttc aagaaccatt 2580 ggagacagta aagtttctta aaattttacc aatatcaatg tggttggctg ctttaacagc 2640 actggatatt ttatatgtct gaattttaag aatgctgaat tcctgaatgt gaagctgaga 2700 tgatttttaa ctgatgtgtc tgatttttca tagtatttta ctgatgcagg tccttattca 2760 tggcaaagga cattttcgag atggtgttgg ctaaagataa agactggctg tgagatggtt 2820 tggataacac agctgaatgg aattaagcag ttaaggtaaa ataattacaa gaaattctct 2880 tagtctttaa atagttccaa ctgagttttg agagataaaa acttttgaca tttggaacgt 2940 tggtgaatga ttaaagattc tgaattgtaa taattttaat gttatagtct aagactattg 3000 aaaatttatg tgcatgaaac agagagacag agactgtttg agtgaagcca tgagttgtgt 3060 tttgtttctg ttatcgactt gcttagcaac gatattaaga aaacaaaatg caccagctga 3120 ttgtctcaga aaagacattg attatgaaca gaaactttga aacttgataa aagatcatgt 3180 aaatccagtt gaaactggac agagaatttt aagaataaat ttgtgcgagt tcaacaccta 3240 aagcaaactt agttatgagc tgtcacaata ttcatatcta tggtacaata atgcttattg 3300 tcctgaggga atgattcgtc cctgttgtat gaatattgtc actaacctct ggaatgagtg 3360 ttatattatt acagattgta taagcgaaat gccagacatt gcagaagtca ataatgctga 3420 gtgaatctga tgaaaagatg atgcagacca tgagatgagc tttaaatgga tgggttatcc 3480 agctctgttt gcaatcgtta ataatgactt cttttcctga ttcagattac tctctggaat 3540 aataataatt ttaaaacaag ttttattagc ttatagctgt gaaattgaag actgccatcc 3600 atatcaggtt acaaacccat gtctaaattg actgagaagt tcaaaggtta attacagtta 3660 attctttgct atctgaatat taatgttaaa tattttagac ttgatggcat ttaaacagtg 3720 acattggcta agtcactaac cacaatagtc cagaaagcat cctcactcta gcataggaaa 3780 aataagcaga gaattgagtt gaagtcattt gtgttaaagg caaatgctgc agtaaatcac 3840 tctgtctatg tgtgtgtgtc ctcaatggta gaggaacata attgtttgga tgttgtgtag 3900 aacagagaga taaagttgaa tgaaagtgtt tgcatctgat ggttgaattc agtaagaagt 3960 atttatactg ctgcagcgaa aattaacatt ttagtaaaca gaaatgcagc gtgaacttta 4020 acagcatttt taagcagtca agtcaactca agtcaatatt attattgtga atactgacaa 4080 gttttacaga aacttagact gtctttaaat gaaaatttag tagtccatat ccaagcaaga 4140 gtcttgatta tatgtggtaa aagagttgtt aactcagtct gttcttgttc aattggtaat 4200 gctttcaaac aggttttact gtcaaatgac aggtatgaaa gattttcaaa taatcaaaag 4260 tttgagcaaa tagctaacct gtgagcattt ctaattgacc agacagtaag aatcaaatga 4320 attttataca aaatcatact gtttgcttct gattatgttg aatggtttag tcagagtaaa 4380 gtaaatggta caaatgacaa gttgagaaga taattataag gtttagtctt atttctgtta 4440 aaagatgctt gagtttgaat tactcaggtt tgatctttgt tgaatttgac taagattttg 4500 ttaaacccct aaaacgtggg aacaaaagaa atggattcac aatccctctt attgcatgtt 4560 aatgacttgg attaatagtt tgtaaaaacc attacatctt tcttggtctt taggtaacag 4620 gtaatatcta tatctagtct ctttgaggtg actgagggtc tggctcaatt catttagccc 4680 ggcttccaga gagtggattt gatattagtt tgttaactgt gaaggtgaat ttggctcatt 4740 atggccagga caagggtgct tgtacacttc ctatcaatga tacagaggaa tgtgttgcat 4800 cactacaatc acatcattat ttgaataaaa akatatttta agtggacttg actgacatgg 4860 agttcagcaa ggtgataatt aaacatggta atggtgctga attggactat tgtgctgttg 4920 gcatgtttaa tgtgtgttaa tgtaaaaatg cttcattatt ctaatttgaa gttacagaac 4980 acgctgatga aggttttaaa actggtgaag ataaagatgt ttttgcaggt catgaatttg 5040 aatcctgtgt ggcttaggtc gagtggatga ccttttgtgt ctgatcagag gatctctgag 5100 cactagaatt agcctgtatg tttggattta gagttggagc aagttgtaga tgcatcagag 5160 gaagtatcga tggaaattat gagttgttta cttctggagg acttttaggg cttgcgcatt 5220 tgtgttcaca ttctggactt tggtccgctg ttgttgtttt ttgtatgttt tcacttctgc 5280 ttgcaggacc cttaatccca ccgatgatga gaagcgtgaa ctccttttgt gggtggagag 5340 tgaaagagcc cttagcttgg ggactggaat ttttaggttc cagcttgtcc aagaaggggt 5400 gaagaaaact ctgcgaactt attaagggag catgagtcat tactttcact ctccttagtg 5460 acctcactgg ttgaccggaa gcggagtttg gactccattc gggtcaaaag gggggac 5517 // ID DNA-TA-8_DR repbase; DNA; ZEB; 1308 BP. XX AC . XX DT 08-APR-2009 (Rel. 14.04, Created) DT 08-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE non-autonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD TA; KW DNA-TA-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1308 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(4), 859-859 (2009). XX DR [1] (Consensus) XX SQ Sequence 1308 BP; 436 A; 201 C; 203 G; 466 T; 2 other; gacccttttc acaatgacgt cacttcactt ccccctatcg gcaacgcagt gtatttttag 60 ttccagttta cctttgaatg aatgggagac ttctctggaa aatttacaaa tgaaaatatt 120 aacattgcac gatctgattt aagactatta atagtattgg gcatgtgtcc aaatgtttct 180 ctttcaatac tgaactgctt tctgagtttg aaaatgattg tttaagttat taagaatagg 240 aataagctat tgtttatcaa ggaaagtatg tttgcaacct tttcacttat taattaagtt 300 tatttcctca ataatggatt tatttttact ttgatgctgc tatgaatata tyaatttata 360 ttatctgtta tatgtaaatg ttccagcatg agtacaattc tttattaaat cttctcatat 420 tacctggtaa gtatgtctat aaatgcaatc agtttcatca tttattattt actgatctat 480 taatgttttg ttcagtttct ccatttgaat gaattaagca ttaattaaaa cagtgtccta 540 ccgccaccta ctgggtggat ttaatttgat atctgtacta ccttttaatt aataatgtta 600 caaataaatt gacataatct aataatactg aattaaaatg ccgtcttctt ggtgttattt 660 ttgcaattgt acatttttat aatgattttg tttgattatt trttagtaag agtcacatac 720 aacattgtta ttctatttca atttaatttg aatattaaat agacataaat gttttatgtt 780 catgaataaa atacctatat agaaaaaaat ggcagagaac tgagcagtag aatgtgaata 840 atatgaatgt gtttgtgaat cagctgtcca ctgtaaacaa ccacagtaaa acacccacat 900 ttttatttat ttattcttta ggacaaaaaa aaaaataaaa ataaatttga cataaaaaat 960 gttcagcacc cgtgtaaaca tctgaaatct ttatttgaca aatgctgaca agtacatacc 1020 aacttcatct gcagtaaaga cagcaacaca aagcaataga ctcagtttaa agtcccattt 1080 tcttggttca agcagaaatc tggatgatct gtaatttttt tttagatgtc agatgtctca 1140 ccttgaatgc agacaacaga aaacagccgt agtcacagtc taaccatatc tgtagtaaat 1200 atacagctga ctgggtgagt tctctatggg gagttctgtg taggcggaag taacgataca 1260 ctgcgtggag tgaaaccgga caccatgtga catcatgtga aaagggtc 1308 // ID BEL-38-LTR_DR repbase; DNA; ZEB; 314 BP. XX AC . XX DT 09-APR-2009 (Rel. 14.04, Created) DT 09-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat - DE consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL-38-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-314 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(4), 863-863 (2009). XX DR [1] (Consensus) XX SQ Sequence 314 BP; 114 A; 40 C; 59 G; 101 T; 0 other; tataagagcc ataagttcat tgttttattt atgttaaatt gataataatg taaaaagcaa 60 acttgaaaat attttactaa aagtaattta atttttaata tagagtatat gctgatgtca 120 cttccggtaa ggaagaagtt gtgtcatgaa gatggaagca agacagtttt ttgaccgtgt 180 attatagttc agttaagttg ttattaaaca tgttcgaaat gacatcactg gactgtctta 240 tgttacccgc gttcaaatcc tgctaattca aagagagaaa caagacaaga agagacggat 300 cgatggcaca taca 314 // ID DNA-2-29_DR repbase; DNA; ZEB; 132 BP. XX AC . XX DT 09-JAN-2009 (Rel. 14.01, Created) DT 12-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-2-29_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-132 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 7-7 (2009). XX DR [1] (Consensus) XX CC 2bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 132 BP; 43 A; 27 C; 19 G; 43 T; 0 other; acagtagtca acatttgaag tggatcaaaa cctttcatca aagttgtcct aaaactattg 60 aacaacaccc attcttgtct taggacgatt ttgaaaaacc gtttttgatc cacttcaaat 120 gttgactact gt 132 // ID Gypsy63-LTR_DR repbase; DNA; ZEB; 1214 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy63-I_DR; KW Gypsy63-LTR_DR; Gypsy63_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1214 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1442-1442 (2008). XX DR Genome; chr14; Positions 13338612 13337399. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1214 BP; 374 A; 163 C; 264 G; 413 T; 0 other; tgtagagccc tactaatata ggatctgttg atattaagca gtttttgttt ttcttttatt 60 tttttcaaca aactatttta agaactttct gctaaataac aaaggtgttt aaattctgat 120 atcatagtta tttatgaaaa atattatcta ataaataaat aaatatgttt tatgtgtcga 180 gtttgatgac ctctagaggt tagggggcgg agcttggaag acgggaagga agtgaaacca 240 agagcagaga gtcagatgcg ttggatttag agtttttgtt ttaatagttt gccttgaaat 300 agttacacaa aagaaaattc atgtgttagt tctgagttta caaagactgt taacccacat 360 ccttagtgca tttgaggatc gctggacaat ctgtgtgggt aattgttcag agagagagag 420 agagtgaacg tcagacctgc ggactgaaag atcgattgga gagagactga ggctttgcta 480 atcgggcctg tttgtttcgt gatcggagga tttacaactg cggtgatgca attcgggatc 540 gcggatggac gtcagagacg tttggagtgt gtctggttct gtgaatgggt actgtgattg 600 tgaccttcat tatttctacg gattctgtga gtactgaaaa gtgttttttt ccaacgtgag 660 ctccaacatt gcgcgccaac gtaagactta aatcccagct gggtaacatt ttattttggg 720 ttttatttga tgctataatg ttgctcaaat gtgtgtttgc tttgtatgtc tgattaagtg 780 aagctgtgtt gtcactgaaa gtgaattgag tcctacagct ttaaagtgat ttaaaacaca 840 atgaaagaaa attaacacaa tagcctcaca atatttaata aatgtaaaaa gttattatat 900 gccttgcatt tgtgattttc ttttacagaa agggtgaaaa ttaatggtta aataaaccag 960 tgtgtaaaat aacttgtaaa tgagagagtt aaattatatt atgcaataga aaaagaaaac 1020 aaatagtcac ttctactctg aataacacta catatatatc tgaggcccgg tattattaag 1080 tgtttgttat taccatttga tgaacagaca ttgataattt taaagttata gggatgtgcc 1140 tctcaccttg aattttattt gaatttccct agacatatta ctttttctga gctagttaag 1200 gggagagggc taca 1214 // ID Polinton-1_DR repbase; DNA; ZEB; 18485 BP. XX AC . XX DT 14-MAR-2006 (Rel. 11.02, Created) DT 14-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Polinton DNA transposon; Maverick; Tlr; Polinton-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-18485 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This transposon is characterized by ~350-bp terminal inverted CC repeats and 6-bp target site duplications (some copies have extra CC 5'-AG and TC-3' at their termini). The consensus sequence was CC built based on multiple alignment of several copies that are >90% CC identical to each other. It encodes a family B DNA polymerase CC (POLB-1_DR), retroviral integrase (INT-1_DR), ATPase (ATP-1_DR), CC cysteine protease (PRO-1_DR) and additional four unclassified CC proteins (PX-1_DR, PW-1_DR, PY-1_DR, and PZ-1_DR), conserved in CC Polintons from different species. XX FH Key Location/Qualifiers FT CDS 2875..3954 FT /product="INT-1_DRp" FT /translation="MSDIMKKVYYDPSSPGALGGKKRLKDAVFKETGVRLS FT DKQVSGWLAGEDAYTLHKTAPIKYKRNRVVVYGMDTQFQADLVDMTAYSTE FT NDGNKFMLTCIDVFSKYAWSRVLKNKSGVEVTKAFESILEEGRVPRKLQTD FT QGKEFFNKHFQDLMKKYDINHFATATDLKASVVERFNRTLKSRMWRFLTAT FT NSRRYIDVLQDIMTGYNNSYHKTIKMRPIDVCKENEATVFQNSYGNIKIQK FT PVFKFKVGDVVRISKVRGPFAKGYEQNYTEEFFTVSECIPRQPPVYKLKDY FT DGDVIEGVFYEQEIQKIFVNKNKSFKVEKILGEKKQGRSTLVLVKWLGWPS FT KFNSYIDKKTLVDLQKP" FT CDS 4005..4844 FT /product="PX-1_DRp" FT /translation="MDNNGFYVTLPCNASLTVYPENRISSYKTNLSKTIHL FT KGDWEVALAEIEYPRSWYNITDDDGIFNLNMYPDHDNEEKKKKLEANVLSD FT LHGFYVSRRYQIPGGYYKDIPAVIKMINDAIKPSGILFYDELKNKINLIAK FT PNIFIGFYEKLGAILGVRPEEGLGQTAYHEKADFTRTVSTYAPHQADIKGG FT FYTLYVYTDIIDYQSVGDSYVPLLRCVHITGENNDIVSVRYDKPHYASVNK FT SIITDINIEIKDDQNREVRFAYGKVIVKLHFRPVKQTSL" FT CDS 4850..5302 FT /product="PW-1_DRp" FT /translation="MPYYNTHHGEISDYVNYYQNQAGGGLPGYAGGGVMYG FT SGLGGLFRGLFRMAVPLLKKGFNIAKPHLKTAAKNIASEVISNVISRNRDE FT RQQDGSGLMVMARKRMSKPPGVRRRGRAIKKKRTVKKTSLTRRKRKKTVRR FT NTFAKRNINTIF" FT CDS 5307..6611 FT /product="PY-1_DRp" FT /translation="MALLHRMSGECIKSELDLFTVPLTQTAIEKNTYVEVP FT PLSAISDSSPLEFFIAGSGEEYIDLNNTLLHLRLKITKPNGGEIADPAKVA FT LINYPAATIFSQVDVSLGDRLISQSSSTHPYRCIIESLINYDKDTLESTFS FT AGLFFKDTAGHMDVKDPLGNNQGLLKRSTYTSRSKIVDLMGPIHSDIFFQE FT KLMLNGVDIKIRMIRGKDEFCLMRSDDVAYKLKIVSASLFVKKVSVAPNVR FT LAHAQALLSTNAKYPIDRVCLKNFSIPEGARVSNQENLFLGTLPKSIVLGM FT TDNDAFTGSYDKNPFAFKHYDLEFLAIYVDGQQIPAKPLQPNFTDGSVVRE FT FYQLVTATGRHLKNHALSISRSEFARGYSLYAFNLTPDEDCGQHVSLIKSG FT NIRLEARFRQPLPNTINLIIYSVFDSIIEVSNRRQILVDYY" FT CDS 6621..7154 FT /product="PRO-1_DRp" FT /translation="MNTIQLNAIMDKITCNTHFLGTLASDQLPERPLTNLP FT SMVIINTHPSELPGEHWLAVYLSEDGVGCFFDSFGNKPDSDGFPIAIKNFL FT MRNSTVTRYSTKRVQDFMSNVCGQHCVFFLYHMSRGRDYNYVMNLYSENYV FT KNDKMVELFVKKLKANVCSDNMYLCNHCVQTCNSSMMYK" FT CDS 13216..9068 FT /product="POLB-1_DRp" FT /translation="MDDENRTDDTYNIQPDQTLSVRLQRGVEMDADRDEDI FT FASLQHMIDMYPNSRDITFNNIPISQITNLDDLEMYIEQELNLNLPEDVES FT FLNSEQSGGAVTQETSASATVNSQLTDDGFDDDDDDDDDSHSTSHATHGRT FT DIESTHQSNVDDEQPSTSNVAEPLRHRPTRQIDIETLLRDGGNVNEYHVLP FT RPRFNSVSLRRTLNMSEIRSQDLASYHIRLHDTMDDIVSFARQIGGDGSVI FT NLCLRTPTLKSDVNAVLTPGNNYDVNLFTDQIAKILQSDDKLSANETVEIE FT AEVVMNRLGGGGVRRKLTDLAFDQVIKRKKTSLFIPTNISNKLCFSICIAH FT FLDPQLPECELENRASIIHNKVGLAIQDKVGFHDIAKFENMLDIKIVVFYR FT TNTGVLQTYVNNNEPHDKTVYLYLQDEHYYMILNLKSFIGASYVCEFCYKG FT YTSVRNHQCKHVCNVCFDGECYKHPKKIIHCTDCLRYCKSSYCYDAHKKPV FT LEGEKVPCDVIKYCKKCNRRYDKKKSKHICAPNRCDACREELVPDGEHECF FT IKTVPLKDPQNKYIFYDFETRYENARHVANFVCAITFCGERFVAGGSDCVK FT KMIEHFRKPKYEGYCFIAHNASGFDSFLILEYFCKAGLQMDIIMKGCKLIF FT MFDVSFKQRYIDSISFIPMALSKMPAALNLTTTEKGYFPHHFNRLENENYV FT GPYPDKKYYGYENLSEKDQAKFDAWYATTSGEVFDFKEQLCQYGVNDVVLL FT REACMTYRESFIECTQIDPFSYTTLPSCCMGIFKTHYLKDHTIALTHDNAY FT IRQNKTFSSVSIEWLEYLKKTRNVDIHHALNHGEMQIGKYFLDGYYEQGDS FT RYGLDFLGCLFHAHQCRYEPHKLHPMSGVPFGVLRRQVDEKIEILQNAYGL FT KVEIIWECEWSKMKQTDPSVIEFMSTYSAPERLKPRDALFGGRTNAYKLYH FT KVGEGETISYLDFTSLYPFIMSTKTYPIGHPEIIFNDFQPIENYYGLIKAT FT VYPPRKLLHPVLPYRCGGKLMFPLCRTCAHAENQTSRCNHTDDERALSGCW FT VSVELLKAIEKGYVVVKVDEVWHFPERSDKLFSEYVKTFLRLKQQASGYPS FT NVTTDSEKETYIRKYYEREGIQLDPSQITHNPAQRAINKLILNAVWGRFAM FT QSQHSVTQLVRDPEEFTTIVFGKTDALKYFTFISDDVALVQFHPTEDSHRI FT IRDINVFIGAYTTSWARLELYKLMDKLGDRLLYSDTDSVIFVSKAGDWMPP FT LGDYLGDLTDEIGDGDYITEFCSSGPKSYGYRTASGKVCMKAKGITLNAKN FT SQTIRLDTLIGLVDGYVTSGDDSRYILAQADNIVRNKKHLTLHNKSVVKKF FT KVVYNKRRLLPDYTTLPYGF" FT CDS 8031..7264 FT /product="PZ-1_DRp" FT /translation="MEYAEKMYLVPQNQLDKIQSTSARENIQQVVENDLDT FT AIRNILLRTDLDQREKVKLYSNILTRYLTIVKLGDRESSVLTLSLPTPEDD FT HKVNSDETHAEDKSKDDVTSEVLKNVPTKRVKNSKYILDKMSKAKDLSSWS FT ESGEFVFKGKTIPGSHMLDLINNVTAPQQVRDDRRPKGWTEFLQACAELNI FT PFSTVPNQQVRSKIISLKNNPIDDDQEILSTKKTTRKKRKNQQRTQLLDED FT MFVSPTRDKSRWMHF" FT CDS 9048..8383 FT /product="ATP-1_DRp" FT /translation="MREMVDFDFRLQHPFSCVISGPSNSGKSFFVKMLLEN FT AVNMISKKIDNVVFLYDCWQPLYDELLKLYDIKFIEGIPQSLNDDHLLPRD FT KTNLLIIDDMMKEASGNSEIEKVFTQYVHHRNLSAILIVQNLFVQGKSSRT FT ISLNTNYLILFKNPRDANQVAVLGRQMYPGNTKYFMECYQDAVKSPFGYLM FT IDYKAKTPEQFRLRTGLFSDRQVVYLQKKRRS" XX SQ Sequence 18485 BP; 6054 A; 3380 C; 3240 G; 5811 T; 0 other; agagagaatt atgggggggc acctgtcaaa aagtttgatg gattgtagcc ccgcccctaa 60 ggcgggactt ccggtctaag ccccgcccct tttatataaa ctttatagtt ccggtaatta 120 gattcttatc ggtttaatat aaaataataa ttccggtaat tagattctta tcggtttaat 180 ataaaataat agttccggta attagattct tatcggttta atataaaata ataggctatt 240 tccaggtaat gaaaatattg gtaattaaat tctaatcaaa cagaaataaa attaatatta 300 taataataac ggtaattaaa ttgatggtat taaaatcatt atttaattat tatattataa 360 tataattaat aaattatact aaatgcatca cttacccata tgatgttgtc tccatggcgg 420 tatgaacata ctgactgtat gtgataagga aaacgttaga caacacccat caactgcttg 480 tgaaagatag atttgtatca gctgtagaat atgtatatct gattgtatat gttgggtgtc 540 tcccactaaa tactatattt tgttaggtga taaggaaaaa atgttgaccg ttagacaacg 600 ccctggcaga tacatcaggc cgatagaatg ataagttgtt atttgaacta atgttaaata 660 ttgataagaa agtattgacg caacatataa ctgatagagt acgggtgtgt cactacagat 720 ggtatcaaca tacatgtgta tatgtatata tagagctaga aatagtttga gtgttatgtc 780 aaagtcatgg ctgcctttca agtaaacact agcacaccgg gtcagatttg gaccgacgat 840 ctcaataccg acttaaacgg taatcctatc gctatggagg aaaattggga agaccatatg 900 ctgagaaggc tgacgatggc tcctagaaaa aggatcacca acctgaagct aacaaatttt 960 cttcacgaga ataaacgtta catgaaaaaa gagtttgcta tgcctagcga acgcatttgt 1020 cattacctat ttgacaaaaa acagcgggtt gttaccttgt acacaacgga atctggagag 1080 ccgcacagtc tggatgaccc ctgtctgatc ttaccagtta aagactggtt cttattctac 1140 aatcatgtct ggcctgatat ccagaattct aaacaggagc cgttgtttat cgccgagtgg 1200 gattctgaaa aaccgaacaa tcgatttcga gtggttggaa acaactgtga aaagaatcct 1260 aaagactgta ttttcatatt cttgtgcgat gacaagacaa agcttcgtga gcgttctggc 1320 ataatacatg ctgatgagca taacgagcga tatgaaatgt tgctccccat cctctggcaa 1380 gacgtcaaca agctgtctga aatttttgcc accataaacc aattgttcaa atggtgtgac 1440 atgaatgatc gatacaagga tattaaagtt cagctgtttc acccccagag gtctgtgcga 1500 cgagggaggg aacactccca actgtctccg cccatctact acgaacataa aaacccataa 1560 tcaacaactg ttcaccacag tactacaaca tgtctcaaca actctcttca gaatttggta 1620 tcttcaagcc tgaggatatc aaagcaattc ctcctgatga aaacgctttg ccatgctttc 1680 cgaaattcgg aacttctgca ttagatcgcg acttaccatg ctttgagcta ttcaaatctt 1740 ctgaaccgga caacgatgaa aaaaactggc tggataaatt ttgcaccgaa ctaggatacc 1800 gtaacctgac atacctgacc aacaagtaca aaaaagatga cgatacaaag tttgacgtca 1860 cagatggatg tttctgcagt tccggtttca agatcataaa agcgctgatc gctgtgctca 1920 tgcaacacat catcaacgct cagctgaaaa aggaatgtga gggttgtgct atcgaccacc 1980 ccagccagct gcaacactcc tgtctgtttg aaccctcagc atactacttc gactctcact 2040 ttgcaacact cgtcaataaa ctgatcaaac ctgagtttca cactgctgtg tcacaggcta 2100 tgctgcgtgt tggcttaaag tcacagcctc agacgatcca cggaacagcc ctgggtattc 2160 tacacgaatt gaggagcgaa ccattcatcg ttgcaagact ggaagacatc agaaccaaga 2220 ttctggacga ctcctgtgag aaaatcgtca gtgacgctgt cgatttctgg caccgcagtg 2280 gctagacctc aaaccagctg aaaacaccac aatgggaaac acatgtaaac ctagtaagat 2340 acttgaaaac tttttattag agtactataa aaatcagtta aatggtcttt tatgtaaagt 2400 cattaatgat tccacgaatg atgatcgttt atcacaatcc aaacgtgtga agaattttaa 2460 aaaagaaatg aatgttatta gaagtatgaa taactcatgg gacaatatga ttgatgtttg 2520 tgttaatggt accgaaccat gttgtattat gattaaaaga gttttagatg ctatgtctga 2580 agtatctgat cagtttagag tcggtgatgt tctctgtatg tctacctatg tagcagatgt 2640 atgcatttcg ttgattgcaa aaaatgatga tgatgatata tctgaaattg ttgaatcttt 2700 ggtcacttac attttagata aaaacatgtc atcgtgtgtc caatttcttt cttaccttga 2760 ccgcatgtaa ccattttctg atttgatgtc atgtatgttt cctcgcttta aataaaaaaa 2820 gattcgtaca aacaatgttt tgcattattc tttgacaacc atcgaaacag gaccatgtct 2880 gatataatga agaaagttta ttacgaccca tcaagccccg gtgctttagg tggtaaaaaa 2940 cgtttaaaag atgctgtgtt taaggaaact ggagtccgtc taagcgataa acaggtatcg 3000 ggatggttgg ccggggaaga cgcttacact ttacataaaa cagcacccat aaaatacaaa 3060 cgaaatagag ttgtggtata tggtatggat acccaatttc aagccgatct cgtcgatatg 3120 acggcctatt ctacagaaaa cgacggtaat aaatttatgc taacttgtat cgatgtattc 3180 agcaagtatg catggagtcg ggtgttgaag aataagagtg gagtcgaagt tactaaagcg 3240 ttcgagtcta tactcgaaga gggtcgtgtc ccacggaaat tacagactga tcagggaaag 3300 gaatttttca acaaacattt tcaagatctt atgaaaaaat atgatattaa tcattttgct 3360 acggctactg atttaaaggc aagcgttgtg gagcgattta atagaacatt aaaaagtaga 3420 atgtggcgat ttctaacggc cactaattca cggcgttaca tcgatgtctt acaagacatt 3480 atgacaggat acaacaacag ttatcacaag actattaaga tgagacccat tgacgtctgt 3540 aaagaaaatg aagctacagt gtttcaaaac tcttacggta acataaagat tcagaaacca 3600 gtgttcaaat ttaaagtcgg tgacgtggtt aggatttcta aagtgagagg cccctttgcc 3660 aaaggttacg aacaaaacta taccgaagaa ttttttaccg tttctgaatg tattccacgc 3720 caaccacccg tctacaagct caaagattat gacggagatg ttatcgaagg agttttttat 3780 gagcaagaga tacagaaaat atttgtaaat aaaaacaaat cgttcaaagt ggaaaagatt 3840 ttaggcgaga aaaagcaagg gaggtctaca cttgttttag tgaaatggtt aggatggcca 3900 tcgaaattta atagctatat cgacaaaaaa accctggtag acttgcaaaa accttaaatt 3960 tcacatttct tttcatggtt atttattatt atacaaatca agctatggac aacaacgggt 4020 tctacgttac gctcccgtgt aacgcttctt taaccgttta tccagaaaat cgtatatcta 4080 gttataagac taatttgtct aaaactatac acctgaaagg ggactgggaa gtagcgctcg 4140 ctgaaattga atatcctaga agttggtata atattacaga cgatgatggt atcttcaacc 4200 taaacatgta ccctgaccat gataatgaag agaaaaaaaa aaaactagaa gcaaatgtac 4260 tatcagatct acatggtttc tatgtttcca ggcgttatca aatacccgga ggatactata 4320 aggatatacc ggctgtgatc aaaatgataa acgatgcaat caaaccatca gggatattgt 4380 tttacgatga attaaaaaat aaaattaatc tgattgcaaa acctaacatt tttataggtt 4440 tctatgagaa actaggagct atattaggag tgagacctga agagggacta ggtcaaacgg 4500 cctaccatga aaaagctgat tttaccagga ccgtcagcac atatgcacct catcaagccg 4560 acatcaaagg tggattctac acattatatg tctacaccga tatcatcgac taccaatcag 4620 tcggtgattc atatgtacca cttttaagat gcgtacatat aaccggtgaa aacaacgata 4680 tcgtcagtgt cagatacgac aaaccgcatt acgcatcggt caacaaatcg atcatcacgg 4740 acataaacat agaaataaaa gacgatcaaa accgagaggt gagatttgcg tacggtaaag 4800 ttattgtgaa gctacatttt agaccggtga aacagacgtc tctttaaaaa tgccttatta 4860 caatacacat cacggtgaaa tcagcgatta tgtcaactac taccaaaatc aagctggggg 4920 tggcctcccg ggctatgcag gcggtggggt catgtacgga tcaggcctgg gagggttatt 4980 tagaggtctg tttagaatgg ctgtaccctt gttaaaaaaa gggttcaaca tcgccaagcc 5040 tcacttgaaa acggctgcaa aaaacatagc atcggaggta atttcaaacg tgatttcaag 5100 aaatcgtgac gagagacaac aagacggttc aggtcttatg gtcatggccc gtaagagaat 5160 gtctaaacca ccaggggtca ggaggcgtgg tcgtgctata aagaaaaagc gcactgtcaa 5220 gaaaacttca ctcacacgca ggaagcgtaa gaagactgta agacgaaata ccttcgctaa 5280 gagaaacata aacaccattt tctaacatgg ctcttctcca tcgtatgtcc ggagagtgta 5340 ttaagtcaga gcttgactta tttacagtac ctttaacaca gacggctata gaaaaaaata 5400 cgtatgttga ggtacctcct ctatcagcca tctcggattc gtcgcctttg gagtttttta 5460 tagcaggctc cggtgaggaa tacatcgatc taaataacac attgttgcat ctgagactta 5520 aaatcacaaa acctaatggt ggtgaaatag cagatcctgc aaaagtcgct ttaatcaatt 5580 accccgccgc caccattttc tcacaagtcg acgtgagtct cggagatcga ctaatctctc 5640 aaagctcaag cacccacccg taccgatgta tcatcgagag tctcatcaac tacgacaaag 5700 acacgctgga atcaacattc agcgccggtc tcttcttcaa agacacggct ggtcatatgg 5760 atgtcaaaga ccccttaggt aataatcagg gtctcctgaa aaggtcaaca tacacaagca 5820 ggagtaagat tgttgatctg atggggccta ttcacagcga tattttcttt caagaaaaat 5880 taatgctcaa tggcgtcgat ataaaaattc gtatgatcag aggtaaagac gagttctgtc 5940 tgatgaggag tgatgacgta gcatataaat taaaaattgt gtctgcatcg ctatttgtca 6000 agaaagtatc agtcgcaccg aacgttcgac tggcccatgc tcaggctttg ctatccacca 6060 acgcaaagta cccgattgac agggtatgtc tgaaaaattt ctccatacca gaaggagcac 6120 gagtttcaaa tcaggaaaat ctatttttgg ggactttacc gaaatcaatc gtcttgggga 6180 tgaccgataa cgatgcattt accggttctt atgataaaaa cccatttgcg ttcaaacatt 6240 acgatttgga gttcctagcc atctacgtcg acggacaaca aattcctgct aaacctctac 6300 agccaaattt tacagacggc tcagtggtac gagaatttta tcaactcgta acagctaccg 6360 gcagacatct taaaaatcat gcactgtcta tcagcagatc tgaatttgcc aggggttatt 6420 ctctttatgc attcaacctt acaccggacg aagactgcgg tcaacacgtt tcgctcataa 6480 agtctggaaa tatcagactc gaagcccgtt tcagacagcc attgccaaat accataaacc 6540 tcatcatcta ctcagtgttt gacagtatca tcgaggtgtc taaccgaaga cagattctgg 6600 ttgattacta ttaatcggaa atgaacacca ttcagctaaa cgctatcatg gacaaaatta 6660 catgtaacac ccattttctc ggaactctgg catctgatca actacctgaa agaccgttaa 6720 caaacttacc atcgatggtg attattaaca cacacccttc agaactacct ggggaacact 6780 ggctagccgt ttatttatcg gaagatggtg tcggatgttt ctttgacagt ttcggtaaca 6840 aaccggacag tgatggattt ccaatagcta ttaaaaactt tttaatgaga aatagtacag 6900 tgacaagata ttctacaaaa cgtgtccagg actttatgtc aaatgtgtgt ggtcagcact 6960 gtgttttttt cttgtatcat atgtctaggg gtcgtgacta caactatgtg atgaatctgt 7020 atagtgaaaa ttatgttaaa aatgacaaaa tggtagaact ttttgtgaaa aaactaaaag 7080 ctaatgtttg cagtgataac atgtatttgt gtaaccattg tgttcagacc tgtaattctt 7140 ccatgatgta taaatgaatt taatgagtaa taaaaaaacc ttacaatcat catctgtttt 7200 gttgaatttt atttcaagat agtgatacac aaaacataaa aaacatgaaa catgttgaca 7260 ttaaaaatgc atccatcgac tcttatcacg agtaggtgaa acaaacatgt cctcatctag 7320 tagttgtgtc ctttgctgat tttttctctt tttacgagta gtttttttag tagacagaat 7380 ctcctgatca tcgtcaatag ggttgttttt taaactaata atcttactcc gaacttgttg 7440 gtttggtaca gtcgaaaaag gtatgtttaa ttccgcgcat gcttgaagaa actccgtcca 7500 gcctttaggt cttcggtcat ctcgtacttg ttgcggagct gtaacgttgt ttattaaatc 7560 gagcatgtgt gaaccaggta tggttttacc cttaaacaca aactcaccag actcggacca 7620 ggagctgaga tctttagctt tagacatttt gtctaatata tacttgctgt ttttcacacg 7680 ttttgtcgga acgttcttta gaacttcaga cgtcacatca tcttttgact tgtcttcagc 7740 atgtgtttcg tcagagttaa ctttatgatc gtcttcaggg gttggtagcg atagcgttaa 7800 aacactactc tcacgatcgc ctagtttaac aattgttaaa tatctggtta agatgtttga 7860 gtaaagttta actttttcac gttgatccaa atcagtcctg agtaaaatat ttcttatagc 7920 cgtatcaaga tcgttctcta caacctgttg aatattttca cgagcgcttg ttgattgtat 7980 tttgtccagc tgattctgtg gtacgagata cattttctct gcatactcca tccttcagcc 8040 ctgtctagaa ccgatcaaac ttgaaataaa tggtatagcg acgcttagga gcggtaaaag 8100 aaacccgcca gtctgattaa agaccttttt cttagccgca atgttgattt ttttattagc 8160 gaggaatttt atcgcagccc tcttcttttt caattttctg tattgttggt tgttgatcgg 8220 tataactccg tgtaggatgt tgagagcgag ttcgcagaga gctaaaatga gatcgtcaga 8280 tgctgactgg agaataagtc gtctttgttt agcagtagat ttatgtaaca tttttaatac 8340 aggcaaatgt ttacatagtc tcgcggacat ggttcgttat cacgatcttc tttttttctg 8400 gagatagacc acctgacgat ctgaaaacaa tcctgttctg agtcgaaatt gttcaggtgt 8460 tttagcttta tagtcgatca tgagataccc gaaaggactc ttgacggcat cttgataaca 8520 ttccataaaa tattttgtgt taccaggata catttgacgc cccaacaccg ctacttgatt 8580 agcatcgcgg gggtttttaa acaatattaa ataatttgtg ttcaaactga tggttctgct 8640 ggatttacct tgaacaaaca aattttgtac gataagaatt gcagagagat tacgatgatg 8700 aacgtactgc gtgaagactt tttcaatttc ggaattaccg cttgcttctt tcatcatatc 8760 gtcaataatc aacaagtttg ttttatcacg gggtagtaga tgatcatcat tcagactctg 8820 aggtatacct tcaatgaatt taatgtcata caatttaagc aattcgtcat acaaaggttg 8880 ccaacagtca tacaggaaaa caacattatc aattttttta gaaatcatat ttacagcatt 8940 ttccaatagc atttttacaa aaaaggactt ccctgaattg gatggtccac ttatgacaca 9000 tgaaaacgga tgttgaagtc tgaaatcaaa atcaaccatt tcacgcattc ctctatacac 9060 tgaattaaaa gccataaggc agagttgtgt agtcgggtaa aagtcttcgc ttgttataca 9120 ccaccttgaa ctttttaacg actgatttgt tgtggagtgt gaggtgtttt ttattcctga 9180 caatgttgtc agcttgagct aaaatgtatc tagagtcatc gcctgatgta acatacccat 9240 caaccagacc aatcaatgtg tctaatctta tcgtttggga atttttagca ttcagagtta 9300 tacccttagc tttcatgcag actttaccgg aagcggtacg gtaaccatac gactttggtc 9360 cgctggaaca aaactcggta atataatcac catcgcctat ttcatctgtc aagtccccca 9420 aataatcacc cagaggtggc atccagtcac cggccttgga tacaaaaata acactatctg 9480 tgtcagagta taataaacga tcgccaagtt tgtccatcaa tttgtacaat tctaacctcg 9540 cccacgaagt tgtgtacgct cctatgaata cgttaatgtc acgaataatg cgatgactat 9600 cttcggtagg gtgaaattgg actaacgcta catcatctga gatgaaagta aaatatttca 9660 atgcatcagt tttaccgaaa acaatcgtgg tgaattcttc agggtctcgt acaagttgcg 9720 taacagaatg ctgagactgc atcgcgaaac gtccccaaac tgcatttaaa attagcttgt 9780 ttatagccct ttgggccggg ttatgagtga tctgtgaagg gtcaagttga atcccctctc 9840 tctcatagta tttgcgaatg tatgtctctt tttcagaatc agtggtaacg tttgaagggt 9900 atccggatgc ttgttgtttt aaacgtaaaa acgttttaac gtattcgcta aacaacttgt 9960 ctgacctttc aggaaaatgc caaacttcgt cgactttgac cacgacatac cctttctcaa 10020 tcgcttttaa aagttcaaca ctgacccaac acccactaag tgccctctca tcatccgtat 10080 gattacatct tgatgtttgg ttttcagcgt gtgcgcatgt tctgcataac gggaacatga 10140 gtttaccgcc gcatctgtaa gggaggaccg gatgtagtaa ttttcggggc ggatacacag 10200 ttgccttaat aagaccgtaa taattttcga tcggctgaaa gtcgttgaaa attatttcgg 10260 gatgacctat tggatacgtc tttgtactca taataaaggg atagagcgac gtaaaatcca 10320 aataagaaat ggtctccccc tcccccactt tgtgatacaa cttataagca ttggtacgac 10380 ctccgaaaag cgcatctctg ggtttcagtc tttcaggcgc cgaataagtg ctcataaatt 10440 ctatcactga agggtctgtt tgtttcattt tagaccattc gcattcccag ataatctcta 10500 ctttcaagcc ataagcgttt tgtagaattt caattttttc gtcaacttga cgtctgagaa 10560 caccaaaggg tacacccgac atggggtgga gtttgtgagg ctcataacgg cattggtggg 10620 cgtgaaatag acaccccaaa aaatcaaggc cataccttga atcgccctgt tcgtagtagc 10680 cgtctaaaaa gtatttacca atttgcattt caccatgatt cagagcatga tgaatgtcaa 10740 cgtttctagt cttttttaaa tactccaacc attcaatcga gacactcgag aatgtcttat 10800 tttgccgaat gtaggcatta tcatgggtta gagcaatagt gtggtctttc agatagtgtg 10860 tcttgaaaat tcccatacaa caactaggaa gagttgtgta actaaatgga tcgatttgtg 10920 tacactctat gaacgattct ctgtatgtca tgcatgcttc acggagcaag acaacatcat 10980 ttacaccata ttgacacaac tgttccttga agtcgaaaac ctcccctgaa gtagtagcgt 11040 accacgcatc gaactttgcc tgatcttttt ccgataggtt ttcgtaacca taatactttt 11100 tgtcaggata aggtcctacg tagttctcat tttctaatct attgaaatga tgcggaaagt 11160 agcctttttc tgttgtggtg agatttaatg ctgccggcat cttagacaga gccatcggta 11220 tgaaggatat gctatctata tagcgttgct taaatgaaac atcgaacatg aagattaatt 11280 tacaaccttt catgatgatg tccatttgaa gacctgcttt acaaaaatat tcaagaatta 11340 gaaaggagtc aaatccggat gcgttatgag ctatgaaaca atagccttcg tatttgggtt 11400 ttctgaaatg ttcgatcatt tttttcacac agtcagaccc cccggctaca aatctttcac 11460 cacaaaacgt tattgcgcag acaaaatttg ctacgtgtct cgcattttca taacgagtct 11520 cgaaatcgta aaagatgtat ttattctgag ggtctttgag tggtactgtt ttaatgaaac 11580 actcatgttc tccatcaggg acaagttctt cacgacatgc atcacaacgg tttggtgcac 11640 atatgtgttt agactttttc ttatcgtacc gtctattaca tttcttgcag tattttataa 11700 cgtcgcaagg tactttttct ccttcaagca caggtttctt atgtgcatcg tagcagtaag 11760 aagatttgca gtagcgcaaa caatcggtac aatgaatgat ttttttagga tgcttgtaac 11820 attcgccgtc gaagcagacg ttacagacgt gtttacattg atgatttcgc actgatgtgt 11880 agcccttata gcagaattca catacataac tagcacctat gaatgatttc aaattgagaa 11940 tcatgtagta atgttcgtct tgtaaataaa ggtacaccgt tttgtcatga ggttcgttgt 12000 tatttacgta tgtttgtaac acaccggtgt ttgtcctata aaaaacaaca attttgatgt 12060 ctagcatgtt ttcaaatttt gctatgtcgt gaaaaccgac cttgtcttgg atagcaagac 12120 ccactttatt gtgtatgatt gatgcacggt tttctaattc acactctggt aattgagggt 12180 ctagaaaatg agctatacag atagagaaac ataatttgtt tgaaatattt gtaggtataa 12240 acaagctcgt cttttttctt ttgattacct gatcaaacgc caaatccgta agtttacgac 12300 gaacgccacc tcctcctagt ctgttcatta caacctctgc ttcaatctca acagtttcat 12360 tagctgataa cttatcatcg ctctgcaaaa tttttgcaat ttgatctgta aaaagattaa 12420 cgtcgtaatt attacccggt gttaaaactg catttacatc cgatttcagg gtgggtgttc 12480 gaaggcataa attgatgacg ctaccatcac cgccaatctg tctagcgaat gacacaatgt 12540 catccatggt atcgtgtaaa cgtatgtggt acgatgctaa atcctgagat cttatttcac 12600 tcatattcaa tgtcctacgc aacgatacac tattgaaacg aggccttggt aaaacatgat 12660 attcgtttac attaccgcca tctctcaaca atgtttcaat gtctatctgt cttgttggcc 12720 tatgccttaa aggctccgca acatttgagg ttgatggttg ttcatcatcg acatttgact 12780 ggtgtgttga ttcaatatca gtcctaccgt gtgttgcatg gctagttgaa tgactgtcgt 12840 cgtcatcatc atcatcatca tcaaaaccat catctgtcag ttggctgtta acagtagcag 12900 aagcactagt ttcctgagtg actgcaccac cgctctgttc actgtttaaa aaagattcaa 12960 catcttcagg caaatttaaa ttcaattcct gttctatata catttccaaa tcatccagat 13020 ttgtaatttg tgaaatagga atattgttaa atgttatgtc tctagaattt ggatacatat 13080 ctatcatatg ctgaagagat gcaaatatgt cttcatctct gtctgcatcc atttcgacac 13140 ctctttgcag tctaacagat agagtttggt ctggttgaat attgtatgta tcatctgtac 13200 gattttcatc atccattgta atttttgatt tattttatta ttattttaca aatatttttg 13260 ttaatgttta ttttatttca gacttttatg tttaaattag ttttattggt tattttagaa 13320 ttattttgtt ttgttattag atttatttta ttgtttaact tttttcttct gtattttatt 13380 tattgtttta tttacattta tcgtttattt gattgcttaa gttttcattt tatttatatt 13440 tgttgtttat tttaggatta atatttgctg tgtttataca attcaatctt tatttataca 13500 tatttaacca tgttattaga aatggatgtg tgtcaataat aggaatacaa ttatgaagta 13560 atgtatatga ttagctggaa taaaacacaa tgaaatattt tattacatca attttaaaag 13620 ttcaaacact aagttcacaa taaaaacaaa ctccaaggaa tacaaattta atataaatca 13680 acagcaaaag ttaaaacact aaacagctga gtcaatttaa agttcatcaa aatgttaaac 13740 tgaataaaaa tacatcggta ttacatcaaa tttaaactca tacactaaac aaatgagtaa 13800 gttctcaata aaacaatcac aattgaataa acatttaata tagatcaaca gttaaacatt 13860 taagacaatt tggagtacat caaaatgtta agcagaataa aacttgaaaa tggttttgta 13920 ttacatcaaa agttaaaaca ctaccagtta aatcaattta aagtacatca aaatgtaaag 13980 ttgaatcaaa atacatttgt attacatcaa ttttaaaagc tcaaacacta aacaaatgac 14040 tacgttcgta ataaaaagaa aggcaaagga gtacaaattg aatatagaac aacatctaaa 14100 gttaaaacac taaacaacta aagtcgatta aaattagaaa cgtaagacaa gtatagcatt 14160 tatcagagta taaaatatgg acttttaagt agtacaatga tgaatgatcg tacagatgat 14220 gagaatattt attttaggaa ttattcatcc atcaatgaat gcctaataaa aatgagtagt 14280 ttcgttatta accgatgccc ccgttcactt ttacccatca attgttgaag ctccccaatt 14340 ttttcatcca tacgtcgctg ttgctcagac aagaaatcct tgacattttc tatcacacgc 14400 tgctgctggc caatagaatc cgctagagca ttaattttta cctcatatct gtccagatat 14460 tcacgaatca gacaataagc aggcgactcg taaaagtctt tgatattgct gtgtttgctg 14520 acgctaccgg cctcctgatt ctgctggggg cgctgtggtc tttcgacgtc ttccgcgtct 14580 tcttcttctt cttcttcttc ttcttcttct tctactggtg cttgaaaggg ataaatcggc 14640 tgatgtgcgc ctccctgagt tgattcatga tgaccgctgt tggaagcacc ttgttcgcac 14700 catggatctg tattaaaaag aaatcaaaat attgtttagt attgaacgat aaataatact 14760 taacaataaa ataatattca tttgttttca ttcaacagta attatgttta ttatattctc 14820 attatattgt cataatcaat taaataaatg gaacactaac cttcagataa attaaaatta 14880 atagcagaag gatttggttc tgatggactg gcataaggac ttagtgggct agcaagaggt 14940 gtccatggat ctgtgtgtgt gaaaatatga aaataataac gttacactgt gatttaatga 15000 ttgaaacaaa caaacaaaaa tagcttatta cctgattctg cgtcttcatc atcatcctca 15060 tcggtatcgt attcttcttg cacaggtgtt tcagaaattt tttgcctttt atttacatat 15120 ggtgtgtaaa caattcttgg tctcttaggg gcagaagtcc ccggcgctgc atcaaacttt 15180 tcttctcgca atgattcttg ttgccccgca atcttttctt cttcttcttc caccgtcaag 15240 tcgataactt caattttcgc taaaatagga gaatgacaaa aataaataaa tacaagttat 15300 tactcattcc tacagtaaac aatagtgatc ccacatccgt cagtgctgtg gcctatgcat 15360 agcgcattcg accggtaacc gagccttgtc gcaacaggcg gttcgaatcc actaaacaac 15420 aaaaaattta atttatttta aataaaaagc ataccttttt gcgtttgatt atcaaacatg 15480 tgagtcgaag aggctataag aaaaagagaa acgtaagtaa caataatgat aaaataaaaa 15540 actattgtct aaatatgttc aaaacaataa atataattat ggaataatac ttacaaatac 15600 gattgatggc atcaaatgcc tcactttcgg aaggatttat ttctgtaaga cataagctga 15660 tagaatgaaa atatatcaaa cactggttaa tgaggtagta tacaaaccgc taggcagatg 15720 aaaatatatc aatttctact cacgcaattc catatcttct tcaaaagtga cagagttgac 15780 cctcttcaga agttccactc gttatgacag tgtaactgcg gattcaagtt tactgtgaat 15840 aaggtctgta tagggggcct ttttatgtca aatgtcgaga ggcgttgtcg tatgtttaaa 15900 agctggttcg gaacgataag ggggcaccca aaaaattaaa taacctgagc acgtcaatat 15960 tatactttat tatatgaaac tatatacaat ctaatagata gtgataaaac gtaacgacac 16020 aaaatattca tgtaatgtct ttaatcttgg tgaaaaacac aaatgaactt ggtgtaatgc 16080 tttagtgtag atatttgata taattataaa aaaaaaaact ctttcaatat tttgactgtt 16140 taaaatttat accatacatt atttcatact caggataact atcgttttga tagactataa 16200 taaaacatat attttattta gatatgtgtg tgtgtgtgaa tgaaaattta aaaattataa 16260 atattgttgt attacatttg tattcatatt ctatataaat ggatgaatga taaaaaataa 16320 taattcgttg atgtgataga atatttacat ctcatatttt aacattttcg tttataatat 16380 ataattaaca tagtctctgt tattaccaat attaaaaata cttataatac ataacattaa 16440 attagtatac aattgtttgt taaaagttaa tacaataaaa cgttgtattc acaacttttc 16500 tttcaaagtt accgcgttaa gcaacaaata tattattttc gacaataaaa tataataaag 16560 aatgactagt acatttttca tagtgtgacc aatagatggt gctagatagc cttgtatgaa 16620 ctctgtatgt tttcgtaata tgttacacac cctgtttcgt ggaaaaatac tagtcattta 16680 tcataaacat caatatgcga cgagtcaaca gtcgaacgcg cttttttctt atcgagcaaa 16740 ccatacatgg gttgtgactt gggacatttc atattagcat caaaattaca agactcttta 16800 taaaagttta atattcctgt aatataagac gttattgaaa caaaatatga atcttttcat 16860 gtataaaatt taacgcgtta aaactataca ttacacttta ggaatatatc tattactgtt 16920 cagtacgatt taaatgatat taattaatgt ttaaacatat aaagattgat agtggttatt 16980 aagtgaactg tcaaatatga ttgaaaatta ctcttgacct aaaattgcat agtgcattga 17040 tatgtaatac aactaaaatc gaatatcata ttttagtttt tcaaaaacaa aacaaaaata 17100 gttgtgtata taaatatata tatgtatata cacacacaca cacacacaca cacacacaca 17160 cacacacaca cacacacaca ttatttgtgt aaatatgtta tataaatgtt aaacaaaaac 17220 aaattttaat cttattaccg tttgatcttt atgttatatt gtattcaata ataatagtgt 17280 tatacctgtt aaacacaaaa ccgtatgtta atcatcttat tacagattga ttgttatatt 17340 atattgtttg tattatataa taatcaataa tattaatgtt atacatatgt taaacataaa 17400 aacacatgtt aattatctta ctttctaatg aacatcatgg tgaaacgcta cactataaaa 17460 taataataat attaatacta atattaatac tattaataat aaattagctg atgtatattt 17520 tatttattgt gtgctttaag ccttaatgaa aaatgatcat gaggcccccc tctcactctt 17580 caccctgtag gcttattatc acacacacac acacacacac acacacacac acacacacag 17640 acacacacac acacacacac agatacacac acgcacacat gcacacaggc acgcacgcac 17700 acacacacac tcacacacac acacacacac acacacacat atatgcatac agttgactgt 17760 atgttaatag tttattaatt ataatataat ataatataat aatataataa tataatgata 17820 taatataata taataatgta atactatagt ataattgaat aatcatgata tataatataa 17880 tataattaaa tttaatataa atattaatat tataataata taatataata atatattata 17940 atataagtta ataacataat gatttattat aacttaatat gatataatat aatacaatat 18000 aaaatattat ttaatataat attaatttta tacagtttaa tataatataa tataatttga 18060 tgtaatataa taaatataat ttacaacact aattaaatat aaaataatga ttttaataac 18120 agtaatttaa ttaccgttat tattataata ttaattttat ttcagtttga ttagaattta 18180 attaccaata ttttcattac ctggaaataa cctattaatt taattaaaac gataaaaatc 18240 taattaccgg atctattatt ttatattaaa ccgataagaa tctaattacc ggaattatta 18300 ttttatatta aaccgataag aatctaatta ccggaattat tattttatat taaaccgata 18360 agaatctaat taccggaact ataaagttta tataaaaggg gcggggctag accggaagtc 18420 ccgccttagg ggcggggcta caatccatca aactttttga caggtgcccc cccataattc 18480 tctct 18485 // ID Gypsy97-LTR-B_DR repbase; DNA; ZEB; 403 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 11-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Gypsy97-LTR_DR; Gypsy97-LTR-B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-403 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 557-557 (2009). XX DR [1] (Consensus) XX SQ Sequence 403 BP; 62 A; 88 C; 92 G; 161 T; 0 other; tgtcacggtt gcaggtttgt gcgctcttcc ggagtgtctg tttgatcatg tgggtttgtt 60 ttgtttttgt tgcccacgtg tatttgtttt ggtcacgtgt tatgacggca ctcagctagt 120 ttgattagtt cgccagctga ggctcattat cgtgcctata tatgggcgac gttttctgtg 180 tttccttgtc agttcgttac gtgtgcttat gtgtgctatt tgtctgtgct caggaccgtg 240 aggagttacc actggaccct gttttcctgt ctgatccctg ccctggtttt ctcctagccc 300 tagcaccgca ctcattcctt gcatttattt tgactgtgtc aaataaatat tattattttt 360 cactcgcact tggatccatc tgaacatttc tttgattgtg aca 403 // ID BEL-37-I_DR repbase; DNA; ZEB; 4022 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion - consensus. XX KW BEL; LTR Retrotransposon; Transposable Element; Nonautonomous; KW BEL-37-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4022 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2168-2168 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(823..2109,1992..2441,2378..2803,2943..3584) FT /product="BEL-37-I_DR_1p" FT /translation="MSVQDESAEQNECLLQEQVVLMDDVYEIEGTVRNRYT FT SCTTCTEKSEARFKELNSGSSXVNEVRKSSRDKXLTPKMQELKEQELIQKE FT KKFKTAYESWKVXVRDVRFKLKHECXESDLYDMMDGVEKKESDMKXLYDNI FT XSQVVPSQEXRRKIDACSAVTADLLKIMRVRLVEDGMEFDAEAEKARLRML FT LDCEYARSIYGSTVSRAAALSNHSKPLSESLSISVKRAEAAALLAAKKAEI FT EMEADINEQRQRLKKLENRRDVEVMEARLRVYTEEESRMNSQQCGSACADV FT SNPILXVSNTPDLDKQDPESKVSVTQTLQESVALSRLPVPEPFVFFGDPLK FT FIEWSIXFKSLIERRCPNSADRLFYLQRYIDGEAKSVLEGTFYRKDXQAYQ FT QAWEKLNARYPSWYNVPLERNLVNGLRLVHPNTLNXSLSASMGKVKCQISF FT VVQRAFREKLSKWPKIGASEYIKLREFSDFLQSCSDAMPHVKSLNVLNDCE FT ENQKILLKLPNWVTHRWNRYVTEQLDLDKDYPSFSTFVSFISKEARIACNP FT VSSLYALKTSDGKSIKEIKHPRISSLATTTTMWEVNKGDKASQNKLVGNHY FT DNVDAACDLADSSNSKASNLSSTLVLGASSVECLFCGEGHLIHKCQDLTEK FT TAMEKRGFIMENKLCFACLRKGHFSKDCRNRAICAVCKRHHPTPLHEDRQI FT TDKPTSQRFHKLKKVHPLYVVRLQVTTDPVKLKLSTMLGKDSIVESQRVCG FT LQVRGFFSDSSFCLPPAYRGTLFPLNAHVPTCETAKRWRHLEKIADEMPAQ FT MDCGVGLLIGYDCSRALAPRQVISGGDEEPYAIKTDLGWSIVGDTPHCTKV FT NVTGLCHRISIQELHPVTPAAVIRSSEMDFADTQLGEKIISQEDVQFLHII FT TEGVHQNDGGYNKSLALIRLKHLKWKCQNL*" XX SQ Sequence 4022 BP; 1238 A; 634 C; 948 G; 1182 T; 20 other; ttgtttatat attatttgtt tattttgatg tccttttatt gtgtttgtgt gaaacatttg 60 gtttgtttat ttgtttatct tttagttttc acggtgttca ataaagaaaa tccatatgga 120 catcagtatt tggaagcgtg gagtgtttta ttaaccggcg ggtagttaca gtgaaaacgt 180 ggcccaattc caccttacca aggtgtttat cagcggtgca cagcaacggt gagcagctcg 240 tatctctgga cggacttttg tgcgcggctg ggctgcctct gcggtgacgg aggacgctat 300 tttcggcgcg tgattcctcc gtgaaagtct gggactgtgg aagctattta gcagcggcgg 360 tctgtgagga actggtttcg tgacgtcata caagctgctg ccttggtgtt ggcagcagcg 420 agattcagag cgcagtttgg cgtctttctc acgtggcgga tcctttcatt catcggcgtc 480 atagcaggtc agactgtggt atttgaatgc cgtctttggc gaaaagttga tgttttaaag 540 agttggatcc aagttgttta aaggtaaagt tgcttataac cagagagaga gagaaaaaat 600 atgtgctgat ggtaatggtt tgaaattgct gtgactgtat caatgtttaa gggttaacat 660 agatattaac tttttgtaga aaaaaaggtt ttaaatgcaa gtaaatattt cacagtatta 720 agggtattgt tttgaagtga gtttgagtgt ctaataatct catggttgtg ttgtgtaaag 780 tgtgtgcaca tagaaaataa ttgacaaaaa caaacaaaaa aaatgtctgt tcaagatgag 840 tcagctgarc aaaatgaatg tttattacaa gagcaagtag ttttaatgga tgatgtatat 900 gaaattgagg gcactgtgag aaatcgctat acatcatgta ccacatgcac tgaaaaatct 960 gaggcgagat ttaaagaact taattctggt tcatcarcgg taaatgaagt tagaaaatct 1020 tcccgtgaca agygtttgac accaaagatg caggagttga argagcaaga attgatccag 1080 aaggaaaaga agtttaaaac tgcwtatgag agttggaagg tcyatgtaag agatgttcgy 1140 tttaaactga agcatgagtg tyctgaaagt gatttgtatg atatgatgga tggggtggag 1200 aagaaagagt ctgatatgaa agkgttgtat gataacatty gatctcaagt agtaccaagt 1260 caagaaayta ggaggaaaat tgatgcatgt tcagckgtya ctgcagattt gctgaagata 1320 atgagggtac gcctggtgga agatggcatg gagtttgatg ctgaagctga gaaggcaaga 1380 ctgcgcatgc tgctggactg tgagtatgcg aggtccattt atggatctac agtgtctaga 1440 gctgcagctc ttagtaatca ttcaaaaccc ctttcagaat ctttaagcat ttctgttaag 1500 agagcagagg ctgctgcttt gttggccgca aagaaagctg aaattgaaat ggaggccgat 1560 atcaatgagc aaaggcagcg attaaagaaa cttgaaaatc gcagagatgt tgaggtcatg 1620 gaagcaagac ttcgagtgta cactgaagag gaatcaagaa tgaacagcca gcaatgtggt 1680 tcagcctgcg cagatgtgag taaccctatt ctccytgttt ctaacacacc agatttggat 1740 aaacaagacc cagaaagtaa agtatctgtc acacagacat tacaagagtc agtggcctta 1800 tcccgtcttc ctgttccaga accttttgta tttttyggtg acccacttaa gtttatagaa 1860 tggagcataa ygtttaagtc wcttatagaa agacggtgtc caaattcagc agacagrctc 1920 ttctatttgc aaaggtatat tgatggtgaa gccaaatcag ttctkgaagg caccttttat 1980 agaaaagatg awcaagctta tcagcaagca tgggaaaagt taaatgccag atatccttcg 2040 tggtacaacg tgcctttaga gagaaactta gtaaatggcc taagattggt gcatccgaat 2100 acattaaatt aagagagttc agtgactttt tgcaatcatg cagtgatgcc atgccacacg 2160 tcaagagtct taatgtgttg aacgactgtg aggagaacca gaagatattg cttaaacttc 2220 ccaattgggt gacacatagg tggaaccgtt atgtcacaga acagttggac ttggataaag 2280 attacccaag ctttagcacc tttgtttcgt ttatctcaaa ggaagcacgc atagcatgca 2340 atcctgtgtc atctttgtat gcattaaaga catctgatgg gaagtcaata aaggagataa 2400 agcatcccag aataagctcg ttggcaacca ctacgacaat gtagatgcag catgtgattt 2460 ggctgattct agcaattcta aggcgtccaa tttaagcagt acattagtgc ttggtgcaag 2520 ttcagtcgaa tgtttgtttt gtggagaggg acacttaatc cataaatgtc aagatctaac 2580 agagaagact gcaatggaaa agagagggtt tattatggag aataagttgt gctttgcatg 2640 cttaagaaaa ggacatttct ccaaggactg tagaaacagg gccatatgtg ctgtatgcaa 2700 gaggcaccac cccactccac tgcatgaaga tcgtcagata acagataagc caacttctca 2760 gaggtttcac aagctgaaga aagtacatcc tctttatgtt gtatgaatgg tggagaaaga 2820 ggtttaacat ccatgattgt tcctgtatgg ttgtcttcat ctaattctga gtcggaaatc 2880 cttgtatatg ctttattaga tacacaaagc agtcacactt tcattgatca agaggtatgt 2940 gacgattgca ggtaacaaca gatccagtaa aattgaagct gtccactatg ttgggaaagg 3000 attctattgt agagagccag agagtttgtg gactccaagt tagagggttc ttttcagata 3060 gttcattctg tttgcctcct gcttatagag ggactttatt ccctttgaac gcacatgttc 3120 ctacctgtga aacggctaaa agatggagac atctagaaaa gattgctgat gaaatgcctg 3180 cacaaatgga ttgtggagta ggtcttctta ttggttatga ttgctctaga gcgttggcac 3240 ctcgacaggt catcagtgga ggtgatgaag agccttatgc catcaagact gacctgggct 3300 ggagcattgt aggggatact ccacactgta caaaggtgaa tgtgactggc ttatgccatc 3360 gtatatcaat ccaggagctt catcctgtga cacctgctgc tgtcatcaga tcatcggaaa 3420 tggattttgc agatactcaa ttgggagaga agattatatc acaggaagat gttcagtttc 3480 tgcacattat aacagaaggt gttcatcaga atgatggtgg ctataataaa agtcttgcat 3540 taatacgatt aaagcacctt aaatggaaat gtcagaacct ttaggagtga agtgttccac 3600 ttgtacaaca ggatgccttt atggaagtgt tgcatgtact caacctggaa tcaggaaaga 3660 tgccttgtaa tcatacatat cagtttgacc agagtccaat tcaaaagtta atagtactgg 3720 tgagcagagt ttaaaaattg aaatgctcta tagtttcatt aaattacttg ggtatgaagc 3780 cctcattttg taatataacc tttatactgt ggataaattg agtattaaag taattatcaa 3840 ttttaactag aatgtgtatt taaaaatata aataaataat aataaaaatt gagtacttta 3900 ataaaatgta gatttttaat tagaacatat aaattatgaa ttgagttggt atggtgttaa 3960 ggtttgaaaa ttgccattga ctgtattatt aatttctgaa gtcaaaggta atttggtggg 4020 ag 4022 // ID Gypsy93-I_DR repbase; DNA; ZEB; 4216 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy93-I_DR; KW Gypsy93-LTR_DR; Gypsy93_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4216 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1497-1497 (2008). XX DR Genome; chr18; Positions 27133301 27137516. XX CC Positions [3059-3565] - Integrase core CC 'AGTAG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 32..3979 FT /product="Gypsy93-I_DR_1p" FT /translation="MDSAGVPMPIMNWEANNLPESWRKFQQHAELMFSGPL FT KNKGEDEKCSYLLLWVGEKGRDIHNTWSLSADEAKILKSYYDRFTAYVEPK FT ANVIFARYKFHEKMQGEREPFEQFVTDLRLLVKDCNYANSNEMVRDRIVFG FT IYSPKVREKLLNVGSELTLDRAIDVARSHELAQAQLKTISSSMSASREHVH FT TVFRNASKGAVAKQGNTKYTKKYSDGQQSRNAGSIQSKGCGFCGNKRHPDR FT SLCPARGKVCKICGKLNHFAKVCYSNTEKSIHAIRKENSDLAEECDELFID FT SVTQHKYSSEMEQAFVNIKIGPKQQEIRFKLDTGSSANVIPLYEYVKLDVQ FT HNLQSTSRPLYGYGGERLVVKGKCDLKCKCKDIDMVMAFYVVDTNAPPVIG FT LKGCIDFGLIKMVLSVTETKTEVPVKEEFADVFKGIGLFPGECTIHLQPHA FT TPVVHPPRRIPLALRSRLKEELESMEKQGVIIKVTEPTEWVNSMVVAEKPR FT TGQLRVCLDPRDLNKAIKRPHYSLPTLEDITSRLAGAQYFSVLDARSGYWA FT IKLTDESSRLTTFNTVFGRYRFLRLPFGLVSAQDEFQRKIDETYEGLHGVM FT AIVDDILIHGKTKTEHDRNLRAMLLRSRERGVKLNPEKCIICATEVSYFGH FT LLTAEGIKPDPAKVAAVRDMEPPKDKGELETVLGMINYLSKFAPGLSDVNA FT PLRQLLKESSEFVWDTQQDNAFKKIKELLTCEPGPVLAYFDPLKELSLQVD FT ASKYGLGAVMLQDGRPIAYASKSLSESEINYAQIEKELFAILFGCKRFHQY FT IYGRQVIVESDHKPLESIMRKPMASVPPRLQRMILQLQKYNFTIVHKPGKK FT VVDDHDTGLSEGMDIQVHTVFSSLPVSDSRLEEIKDHTGKDLQFETLKQVI FT QRGWPEEKRKCPQSISEFWNHRDELSYMNGIIFKGEKIVVPTRLRSDMLSR FT IHASHLGIEKSKQRARDIMFWPGMGKEIENMISTCTICLTYRPSNTKEPMI FT SHRIPDRPWQVVATDLFTWNNEDYLVTVDYYSRYFELDKLQTTTSASVISK FT LKAAFARHGIPEAVISDNGPQYSSSEFETFARTWQFKHTTTSPYYPQSNGL FT AEKSVHTAKMLLEKAKAGKGDPFLSMLEYRNSPVDGFKSPAQLLMSRRLRS FT TLPTTNQQLVPKLVSIKDVKTKRVQKQQHQKNYYDRSSRPLSQLKEGQTVR FT VQEQGYWKPAVVIKTANTERSYHVRTVDGREYRRNRRHLLDTKESQNNSLE FT ESELHDGAEPCSTPTDSSLHSDSTHTPKHTDELPEATIYRTRSGRQIKPRV FT ILDL" XX SQ Sequence 4216 BP; 1341 A; 904 C; 948 G; 1023 T; 0 other; tggtgtcaga ataaacggat gacaaacgaa gatggattca gcaggagtcc cgatgccgat 60 tatgaactgg gaggctaata acctgcccga gtcatggcgc aagtttcaac agcacgcaga 120 gctaatgttc agcggacccc taaaaaataa gggtgaggat gaaaaatgca gttatcttct 180 cctgtgggtc ggagaaaagg gtcgcgacat ccataataca tggtcactgt ccgctgatga 240 ggcgaagatc ctcaagtcat attatgaccg ctttacagca tatgttgaac caaaagcaaa 300 tgtgatattt gcacgttata aattccacga aaaaatgcag ggagagcgtg agccgtttga 360 acagtttgtg acagacttgc gtctacttgt gaaggattgc aattatgcaa atagcaacga 420 aatggtccgc gatcgcatcg tcttcgggat atattcaccg aaagtgagag agaagctttt 480 gaatgtgggc tctgaattaa cccttgatag agcaatagac gttgctagat ctcatgagct 540 ggcccaagca cagttaaaaa caatctcaag cagtatgagc gcttcacgtg agcatgtaca 600 cacagtattc cgtaatgcat caaaaggcgc agttgccaag caaggaaata ccaaatatac 660 aaagaaatac agtgatggac agcaatctag aaatgcaggt tcaatacaat ccaaaggatg 720 tggcttctgt ggcaacaaac gtcacccgga tcgctccctg tgtcctgcaa gaggaaaagt 780 ttgcaaaata tgtggcaagt taaaccattt tgcaaaagta tgttactcta ataccgagaa 840 aagcatacat gctataagaa aagaaaattc tgacttagct gaggagtgtg atgagctgtt 900 tattgactca gtgacacagc ataaatacag ttcagaaatg gaacaagcct ttgtgaatat 960 taaaattgga ccaaaacaac aagagattag gttcaagctg gacacaggat cctcagccaa 1020 tgtcatacca ctgtatgagt atgtaaaact ggatgtgcag cacaatctgc agtcaacatc 1080 tcgccctctg tatggttatg gaggtgaacg cctggtggta aaaggcaagt gtgacctgaa 1140 atgcaagtgc aaagacattg atatggtcat ggcattttat gttgttgaca caaatgcacc 1200 ccctgtcata gggttaaaag gatgcataga ctttggcctc attaaaatgg tactatcagt 1260 tacagaaaca aagacagagg tgccagttaa ggaggagttt gcagatgtct tcaaggggat 1320 tgggctattt cctggtgagt gtaccataca cttacagcca catgcaaccc cagtagtcca 1380 tccaccgaga cgcattccac ttgcattacg cagtcgtctt aaagaggagc tggagagtat 1440 ggaaaagcaa ggtgtaatta ttaaagtcac agagccaacc gaatgggtta attccatggt 1500 tgtggctgaa aaaccacgca caggtcagct aagggtatgc ctggacccga gagacctcaa 1560 taaggccatt aaacgacctc actactcctt acctactctc gaggatataa catccaggtt 1620 agccggtgcc caatatttta gcgtcctgga tgccagatct ggctattggg ccataaaact 1680 cactgatgag tcgtcaagac taaccacttt caacaccgtc tttggtcgtt atcgcttttt 1740 gcgtcttcca tttggactgg tctcagccca ggatgagttc cagcgaaaaa ttgacgaaac 1800 ctacgaaggt ttgcatggag ttatggccat tgtcgatgac attctcattc acggaaaaac 1860 caaaactgag catgacagaa acctccgtgc tatgctgctt cgttccagag agagaggtgt 1920 caagctcaat ccagaaaaat gcatcatttg tgccacagag gtaagctact ttggacatct 1980 cctcacagca gaaggcataa agccagaccc agcaaaagtg gctgctgtaa gagacatgga 2040 accacccaaa gacaaaggag aacttgaaac agtattaggt atgattaact acctctccaa 2100 gtttgcgcct ggtctttctg atgtcaatgc accactgcgt cagctcctta aggagtcaag 2160 tgagtttgtt tgggataccc aacaagataa tgctttcaag aaaataaaag agctactcac 2220 gtgtgaacca ggtccagtct tagcatattt tgatcctctc aaagagctca gtctgcaggt 2280 tgatgcctca aaatatgggc ttggagctgt gatgttgcag gatggacgac caattgcata 2340 tgcatcaaag tctctatctg aaagcgaaat caattatgct caaattgaaa aagagctttt 2400 tgcgatactg tttggttgca agcgcttcca ccagtatatc tatggccgac aagtcattgt 2460 tgagagtgac cacaaaccgc ttgagtcaat aatgcgaaag cccatggcat ctgtaccccc 2520 aaggctacag aggatgatac ttcagcttca aaagtacaat ttcacaatcg tacacaaacc 2580 aggcaagaaa gttgtggatg accatgatac gggcctcagt gaaggtatgg acattcaggt 2640 acacactgta ttcagcagtc ttcctgttag cgattcaagg cttgaggaaa taaaggatca 2700 caccggaaaa gacctgcagt tcgagacgct aaagcaggtg attcaaagag gctggcctga 2760 ggagaaaaga aagtgtcctc agagcatttc agagttctgg aatcatcgtg acgagctttc 2820 gtacatgaat ggtataatct tcaaaggaga gaaaattgtc gttcccacac gccttcgctc 2880 tgacatgctg tcacgtatac atgccagtca ccttggaatt gaaaaaagta agcaaagggc 2940 tcgagacatt atgttctggc ctggaatggg aaaagaaata gaaaatatga tcagcacatg 3000 cacaatctgc ctcacatacc ggccttccaa caccaaagaa cccatgatta gtcacagaat 3060 tccagatagg ccatggcaag tagtagccac tgatttgttc acctggaaca atgaagacta 3120 tcttgtgaca gtggactatt acagcagata ttttgagtta gacaagctcc agaccacaac 3180 ttcagcttct gtgataagca aactcaaggc tgcttttgca cgacatggca tcccagaggc 3240 agtaatatca gataatggcc cccagtacag ctcaagtgaa tttgaaacat ttgcaagaac 3300 atggcaattc aaacatacta ccacaagccc atattaccca caaagtaatg ggctagcaga 3360 gaagtccgtg catactgcta agatgctatt ggaaaaggca aaggctggca aaggcgatcc 3420 tttcctcagc atgctggagt atcgcaacag ccctgttgat ggtttcaagt ctcctgctca 3480 actattgatg agtcgccgct tgcggtcaac cttgcccact accaaccagc agctcgtgcc 3540 aaagctcgta agcattaaag acgtcaaaac aaagagagtg caaaaacagc agcatcagaa 3600 aaactattat gacaggtcat ctagacccct gtcccagctc aaggaaggac agaccgttag 3660 agtacaggag caaggatact ggaagccagc tgtggtcatc aaaacagcga acactgaaag 3720 atcatatcat gtgcgtactg ttgatggacg cgaataccga cggaacagac ggcatctcct 3780 ggacacaaaa gagtcacaaa acaacagcct ggaagaatct gaattgcatg atggtgctga 3840 accatgttcc actccaactg acagttcctt acacagtgac tctacacata cacctaaaca 3900 cactgatgaa cttcccgaag ctaccattta cagaaccaga tctggaagac aaattaagcc 3960 cagagttatc ctggatcttt aattgtacaa gtatgtaggt gaaccattga gaaactcagt 4020 gaaatgttca gtgcatttaa aaggtaatgc aaattttatt aacatgtgtg acagttttag 4080 tttattcaaa cttgtgatgc aagtgtcgtt ttattgtatt agttcagtaa catatgggtg 4140 ttcatgcctt ttgggaaact taatttttca ttatctgtca tacattattt gttgtctcac 4200 tttaaaagaa agggga 4216 // ID Gypsy65-I_DR repbase; DNA; ZEB; 5803 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy65-I_DR; KW Gypsy65-LTR_DR; Gypsy65_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5803 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1445-1445 (2008). XX DR Genome; chr14; Positions 18743089 18748891. XX CC Positions [2525-3064] - Reverse transcriptase CC Positions [4379-4855] - Integrase core CC 'TTTA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 276..1412 FT /product="Gypsy65-I_DR_2p" FT /translation="MDAERDMDNADAMAEDIEGAGAAAPGVDAEGGNPISE FT LQSQILELGRRHDQVMSALSNMTNVHTRSYVYIPREKQIVPFSGDLGKDCQ FT NVDEFIEELERVIRVRGLNSEDQVDFILSHLRGSALDEVKLCMGGEDKSAE FT DMFSYLRAAFREKLSTSQLLHTFYARKQLDSEDFRDFSHALSLMLNAALQQ FT APQCVPNVQRALRDQFIEGVRDLALRRELRKVVRERPDATLFEVREEALLW FT CSEERPRSTNVARNRNLVSVKESAENAQMTATVTNDLTLALQDVIKVITQQ FT GKAIGELTNAVRDLTIQRSDSALEKSNKPKFKPRYTRGGQPICLRCESVGH FT IARHCTGQRSQENPVPTTSESLIAGNGAPPLRRAEQ" FT CDS 1610..5755 FT /product="Gypsy65-I_DR_1p" FT /translation="MHCTSKWLKITAANGLPLPYQGYIELDIQVMGLTLPG FT CGFLVIRDQKEGEIDSAPPGILGMNIAQRCKQLILAEFDNALKGNLDAEWR FT AALNKVQEAAAVEANSLVRVASTHKLYIPAASVATVQARTPKILPESKDLC FT LFEPGQSSLPGGLVLIPTLVSPAKRVFPIQVVNLSPEGAWLPPKVKLGVLG FT HGGHVESSSCEVRFHQISVDHEEVMVNQRSMKGVSGDLQCPLERVQIGGTQ FT AQQAELKSLLRRYLDVFAVTEEDLGYTELVKHEIPLNDEFPVSQSYRRIPP FT NQFQEVKDHIAGLLKKRIIQESSSSFASPVVLVRKTDGSLRLCVDYRKLNC FT KTRRDAFPLPRIDESLDALGRAQVFSSIDLASGYHQVAVHEKDRHKTAFIT FT PFGLYEYQRMPFGLCNAPATFQRLMQTIMSDLVFQMVLVYLDDLLVYSSTF FT EEHLVRLETVLQRLREAGLKIKVDKCHFLQSEVRFLGQVVSAQGVSTDPEK FT VSAVNQWPVPNTLKELRSFLGFCSYYRRFIEGFSRVAGPLHDIVNTCVHQK FT SVAQANKLFKDSWTMECQQAFEKLKEKLTCAPTLGYADFTIPFVLETDASN FT LGLGAVLYQQQRGHKTVIAYASRRLRGAERNDRNYSSMKLELLALKWAVVE FT KFRSYLLGSKFTVLTDNNPLCHLNTAKLGAVEQRWAAQLAIFDFEVQYRPG FT RSNTAADALSRRPGMEGMEVVTEDSEYDGCVAVCSELRVGTLVEASIAEQV FT AGSREEGTGQLQEAMEDEHFLENTPTLPGYTKDELCQFQKADPVISLFRHF FT WNQKRKPTKNERVSLSRSVLALLKQWDRIQEKDGLLYRIIDDVFNGECQQL FT LLPACLVERVLQSVHDNMGHQGVERTLGLLRQRCYWVGMYNAVDQWVKKCQ FT RCVLSKMPQPKIKAPWTPFLASHPLEVVAMDFTTLEPASDGRENVLVVTDV FT FTKFSQAFPTRDQKADTTAKIILKEWFLKYGVPQRLHSDQGRNFESAVVAE FT LCKLYGVRKSHTTPHHPQGNAQCERFNRTLHDLLRSLPPDKKRRWPEYLPE FT LVYAYNVTPHSSTGFSPHYMLFGTQPHLPVDALLGQEPMSEKEPAWLSVHK FT ERLQEAHTRAREYAEKKAADRVAKRHNELYCPEVAVGQHVYLRHRPLGRNK FT IQDAWAPTAYKVIEVHGTTYTVEPVGGGPARRVHRSDIRPCPGPVPMPRRK FT IKSVEEVPTSIKGMPSLDAECVVVENTRLHESGSANPVREESVQSQLDLDP FT DSGAMDEMEEAQERSKDRVINEVSAIEELSLPNPERTGSPIETVSARPVPV FT PRKPKTESTHVDAPCPSPRRTQRSTAGVHSNPNKLPKSVCNAVSLSPDILS FT KVLAGILLYTSEKLQEVDD" XX SQ Sequence 5803 BP; 1626 A; 1206 C; 1564 G; 1407 T; 0 other; tctaattttg gtgtcagacg tgggattctg tttgtttggg gctaaaaata aaaataaata 60 actaaaaaaa aaaaatcaaa gagtgcaaac aacggtagtg gtgacacacg actgagtgct 120 ggaaggacct gcagtgccat tctgacggaa gcagctgcag gataaagggg ccagcgcacc 180 aggactgctg tctgcaagga ctttgagaag gacagtgctc gtcgatctga ctgttgtggg 240 ctggtcgaga cattttgaaa ctctgataca aggtaatgga tgctgagagg gacatggaca 300 atgctgacgc aatggcagag gacattgaag gtgcaggtgc tgcggcacca ggagtggacg 360 ctgaaggagg taatcccatt tcagagcttc agtcccagat cctggagttg ggacgtaggc 420 atgatcaagt catgtcagct ctatctaaca tgaccaatgt ccatactaga tcctatgtgt 480 atatcccaag ggaaaaacag attgtacctt ttagtgggga tctaggtaaa gactgccaga 540 atgttgatga attcattgag gagctggaaa gggtaatcag ggttagaggt ttaaactccg 600 aggatcaggt agattttatc ctctcgcact taaggggttc agctctcgat gaagtcaagt 660 tgtgcatggg gggtgaggat aagtcagctg aagacatgtt ctcatatttg agggcagctt 720 ttagggaaaa gctcagcacc tctcagttac tgcacacatt ttatgcacgc aaacaattag 780 acagtgagga ttttcgtgat ttttcacacg cactgtcatt gatgttaaat gctgctctgc 840 aacaagcacc acaatgtgtt ccaaatgtgc aacgggcact aagggaccag tttattgagg 900 gggtgaggga tttagcactc cgcagggagc tccgtaaagt ggttagagag cgacctgatg 960 ctacattatt tgaagtccgt gaggaagccc ttttgtggtg ctcagaagag aggcctcgtt 1020 caacaaatgt agcccgaaac agaaatctag tgagtgtgaa agaaagtgct gaaaatgcac 1080 agatgacagc cactgtaaca aatgacttaa cactagccct acaagacgta attaaagtaa 1140 ttacacaaca agggaaagca atcggggagc ttaccaatgc tgtgcgtgat ttgacaattc 1200 agaggtctga ctcagctctt gagaaatcga ataagcccaa gtttaagccg agatacaccc 1260 ggggtggcca acccatttgt ttgagatgtg agagtgttgg gcacatagcc aggcattgca 1320 ctgggcagcg ctcccaggaa aaccctgtac ctactacgtc tgagtcactg attgcgggaa 1380 acggtgcacc tccattgcga agagccgagc agtgagaggt ggcacagatg gctcaaagac 1440 tggtaccctc acaaaagagc agtttctgga gcgcgcagtg ggcaagtgcc cagaagtgga 1500 aattcatgct gggggagttg caatccggtg cctgttagac actggcagta atgtgagcac 1560 tttaacagag agctttttta aagaacatct tcacggggaa gacaaagata tgcactgcac 1620 aagcaagtgg ttaaagatta ctgccgctaa cgggttgcct cttccctacc aggggtacat 1680 tgaacttgat atccaggtta tgggtttgac tttacctggg tgtgggttct tagtgatacg 1740 agaccagaag gaaggggaga ttgactcagc gcccccaggc atacttggta tgaatatagc 1800 acagcgttgt aaacaattaa ttttagctga gtttgacaat gctttgaagg ggaatttgga 1860 tgctgaatgg agagctgccc tcaacaaagt acaagaagct gccgcagtag aagcaaactc 1920 tttagtccgg gtagctagca cacataaatt gtatatacct gcagcttctg tggctacagt 1980 tcaggccaga acacctaaaa tactacctga gagtaaagac ttatgtttgt ttgagccagg 2040 ccagtcttca ttgccaggtg gtttggtttt gatccctacc ttggtatcac ctgcaaagcg 2100 ggtcttcccg atacaggtag taaatctatc ccctgaaggc gcatggttgc cacccaaagt 2160 gaagctaggt gtactaggcc atggtggtca tgtcgagagt agctcctgtg aagttagatt 2220 ccaccaaatt tctgttgatc atgaggaggt gatggttaat cagaggtcaa tgaagggagt 2280 tagtggagat ctgcaatgtc ccttagaaag agtacagatt ggtggtaccc aggcacaaca 2340 ggcagagttg aagtcacttc tcagaaggta tttagatgta tttgcagtca ctgaggaaga 2400 cctaggttac actgaactag tgaagcatga gatcccctta aatgacgaat tccctgtctc 2460 acagtcgtac cggcgaatac ctccaaatca gtttcaggaa gtaaaggacc atatagctgg 2520 attgctcaaa aagcgtatca tccaagaaag ctccagttcg tttgcatctc cagtggtgtt 2580 agttcgaaag actgacggga gtcttcgttt atgcgttgac taccggaagt taaattgtaa 2640 aactagacgg gatgcgttcc cattgccacg aatagatgaa agtttggatg ctttgggtag 2700 ggcacaggtg ttctcgtcca ttgaccttgc gagcggctat caccaggttg cagttcatga 2760 gaaagatcgc cacaaaacag ctttcattac accatttggc ctgtatgagt accagcgtat 2820 gccgtttggc ctatgtaatg cgccagcgac gttccagcgt ctcatgcaga ccataatgag 2880 tgacctggtc tttcaaatgg tacttgtata tttggacgat ttgcttgtgt actcgagtac 2940 atttgaggaa cacttagtgc gacttgagac tgtgttacag aggttgaggg aggcaggact 3000 taagattaag gtggacaaat gtcacttttt gcaatctgag gtgaggttct tagggcaagt 3060 tgtgtctgct caaggagtgt ctacagatcc ggaaaaagtg agtgcagtga accagtggcc 3120 agtccctaac actctaaaag agctgcgatc ctttttagga ttttgcagct actatcgacg 3180 ttttattgag ggtttctcca gagttgcagg tccgcttcat gatattgtca acacatgcgt 3240 tcaccagaaa agtgttgctc aggccaacaa gttgttcaag gactcctgga ccatggagtg 3300 ccagcaagcc tttgagaagc tcaaagagaa gcttacctgt gctcctacat tgggatacgc 3360 agactttacc ataccttttg tgttagagac agacgccagt aaccttggat taggtgctgt 3420 gctataccaa cagcaaaggg gacataagac tgtgatagcc tatgcaagcc ggaggttgag 3480 gggtgcagag cggaatgaca ggaactacag tagtatgaag ctggagctcc tggcactcaa 3540 atgggctgta gtggaaaagt ttaggagtta cctgcttggt tccaagttta ctgtcttaac 3600 ggataacaat cccctctgcc acctcaacac agctaagcta ggtgctgttg aacagaggtg 3660 ggccgcacag ttggcaatat tcgattttga agtgcagtac cgccctgggc ggagcaacac 3720 tgcagcggat gccctatctc gccgaccagg aatggaaggt atggaggtag tgactgagga 3780 ttcagagtac gatggctgtg tggctgtttg cagtgagctt cgggtaggga ctttagtgga 3840 ggccagcatt gcagaacaag tggcaggatc tagagaggaa ggaacagggc agctacagga 3900 agctatggag gatgaacact ttctggagaa cactccaaca ttgcctgggt acaccaagga 3960 tgagctttgc cagttccaaa aggcagatcc cgttattagc ttgtttagac atttctggaa 4020 ccaaaaaagg aaacccacaa aaaatgagag agtgagcttg tcccgatcag ttctggcatt 4080 gttaaaacag tgggacagaa tacaagagaa agatggactg ctgtacagga tcattgatga 4140 tgtattcaat ggggaatgtc aacagctgct gcttccagct tgtctggttg aacgggtcct 4200 acaaagtgtc catgacaaca tggggcatca gggggtagaa aggactttgg gtctgctcag 4260 acagcgctgt tactgggttg gaatgtataa tgctgtggat cagtgggtga aaaagtgcca 4320 gcggtgtgtg ctatcaaaaa tgccgcagcc caaaataaag gcaccctgga ccccattctt 4380 agcttctcat ccattggagg tggttgcgat ggatttcacc acactggagc ctgcatcaga 4440 tgggcgagag aatgtcttgg tggtgaccga tgtattcact aagttcagcc aggcattccc 4500 tactcgtgat cagaaggcag atacaacagc caagatcata cttaaagaat ggttcctcaa 4560 gtatggtgtt ccccagcgtc tccattcaga ccaagggagg aactttgaga gtgctgttgt 4620 tgcagagctc tgtaagctat atggggtaag gaagtctcac actacaccac atcacccaca 4680 gggaaatgca cagtgtgaaa ggttcaaccg aaccttgcat gatcttttac gctccctacc 4740 gcctgacaaa aaacgaaggt ggcccgagta cctacctgag ctagtgtacg cctataatgt 4800 cactcctcat tcctccaccg ggttctcccc tcattatatg ctctttggta cacaaccgca 4860 tttaccagtg gatgctttgt tgggtcagga acccatgagt gagaaggagc cagcctggtt 4920 gtcagtccat aaagagcgac tccaggaggc tcacacacgt gcacgtgagt acgctgagaa 4980 gaaggcagca gatcgagttg ctaagcgcca taatgagtta tactgtccag aagttgcagt 5040 tggtcaacat gtgtacttac ggcatcgtcc cctgggacgg aacaagatac aggatgcttg 5100 ggcacccaca gcctataagg tgatcgaggt gcatggaaca acctacactg tggagccagt 5160 gggaggtggc ccggcaagac gagtacatag gtctgacatt agaccctgtc cgggaccagt 5220 cccaatgcca cggagaaaaa ttaaatcagt ggaagaagta cctacctcca taaaaggtat 5280 gccttcattg gatgcagagt gtgtggttgt ggagaacacc cgattgcatg aaagtggctc 5340 agcaaatcca gtacgggagg agtcagtgca gagtcagctg gacttggatc cagatagtgg 5400 tgctatggat gagatggagg aagcacaaga gagaagtaag gatcgagtga ttaatgaagt 5460 aagcgcaatt gaagagcttt ctctgcctaa ccctgaaaga actggttcac cgatagaaac 5520 ggtgagtgct agaccagtgc cagttcccag aaagcctaag acagagagta ctcatgttga 5580 tgcaccttgt ccatcaccac gcagaactca gcggtcaact gctggtgttc atagtaatcc 5640 aaataaattg ccaaagtcag tttgcaatgc tgtgagcctc agccctgata ttctgtccaa 5700 ggttttagca ggtatactcc tgtacacatc tgagaaactt caagaagtag atgattaagg 5760 gggtttgaag tcatcgagga cgttgacagt tggcagggga gag 5803 // ID Gypsy133-LTR_DR repbase; DNA; ZEB; 1548 BP. XX AC chr23; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy133-I_DR; KW Gypsy133-LTR_DR; Gypsy133_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1548 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1528-1528 (2008). XX DR Genome; chr23; Positions 3087331 3085784. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1548 BP; 274 A; 300 C; 370 G; 604 T; 0 other; tgtagcagat gttttgcatg aagccttggt gtacgcccat tactaactgc gggttgttaa 60 gaggtgtata aaagcctgtc ggagaatagt ggaagttgcg tcatcgtcca gctgactccc 120 tgtggtgggt gtgtcactgc cgtgcgtttg gtatgtagtg gctgtttaag ccttattttt 180 gctttcgttt taaattgtgt tgggtttgat tcagtttttc ttgctctgga aggctggtgt 240 ctgttcggtt gctgttgcac catagagtgg ctacggtggg gcgttgtgag ctttgagaag 300 tcgtgtttcc tagagctgaa ctctccctcg tgattctgtt gacacttatg cacgttcccg 360 tgactgctgt cttcccaatc ggtaagtagg gcgctaccgc tcgtacattc tgtgcatatc 420 aaactactag gagtagaaac tgatttctgt tatattttct agttagttat tagtacggac 480 gcctggcttt gccggaggga tgactggcat ctcgtgtagc gattgcactg tggattgagc 540 gtcttcacct gcccttgaca cctgctggat gctgctgttt tgtttctgtt aattcagttt 600 tgattctgtt ttgtttcttt tgtttctgtt aattgttttt tttttttttt gctttttttt 660 gtcgtttatt tattttaggt gcttttgtat cagtgttttt ttgttgtttt gcatatgttt 720 taatgtttgt ttttttctac atttacaaaa atgggaacac ctaaaacaag tgttgtctaa 780 ctgactgctt ggctttgctt ttatacctgc attctggtgc gttgtcctgt tccaatattg 840 tgaccttgtg ctctaacttt tactacatac tatttgggat tctttttagc gggggccttt 900 cacctccgtg cccctgggcg gctggtagct gctcaagtag tctgtttttg atacccctgt 960 gagcccattt aatattgtgt ttccttaaat gttttatttc cttcctaatt ttggttttct 1020 tctctcccta ggagctgggg gtgtctagta cgggcagtca ggctgctgtg tggcgtggct 1080 cttttgatga attagtcttt tgctcacttg tgtgcattga agttcactgt gtggtgtttt 1140 tggaatctaa gagtgtgcac gtgtgaattt gtgggtcatt ctcgggccca gtgcctagcc 1200 tgcccaactt tagtaagccc cagctcctgc ggttggatga ggatttgttt ttttttttgt 1260 attttgtaac cgccttctag tgccttttta aagcacatgt gcctgtcatt ttaatatttg 1320 tagtccaatg agaagtgcat tgtaataaaa ggaactctat ttttctacat ccacgtctca 1380 ttcccttatt gtcagagata cgaacctgtg ttttgtgtcc caggtaatgt actgggtggc 1440 gtagtggagc tctaattgca ataggttgtg ttaattcggc cgcagtgtgt cagttattgt 1500 attggtctgg ctaccaacaa aaagccttat cccccccttc gcgccaca 1548 // ID I-2_DR repbase; DNA; ZEB; 5532 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 01-FEB-2010 (Rel. 15.03, Last updated, Version 2) XX DE Non-LTR retrotransposon from zebrafish - consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW I-2_DR. XX NM I-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5532 RA Jurka J.; RT "I-type retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1342-1342 (2008). XX RN [2] RP 1-5532 RA Kojima K. and Jurka J.; RT "Consensus update."; RL Direct Submission to RR (01-FEB-2010). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 460..1668 FT /product="I-2_DR_1p" FT /translation="MANKQNPKRKLSDQDLDCNFGNGISLLPKSENWQRFI FT LLESLQSDMPLSKLSPFAIQKGISGIAGTVKDCKKLRSGQILVECSKKVQA FT ENLLRANMLAGVAMKTFHHPTLNHSKGVIRTRELEDMEETEITTELMTQDV FT IHVKRITIRREDRLIKTGTYILTFSRPLPPEKIQIGYLSVNVDIFTPNPLR FT CFSCQKFGHGSISCKNRPTCVNCGEEKHGDQCKKTPKCSNCAGEHPSSSKD FT CPTWIKEKEIQKIKCTKKVSYLEARKLVENSSFFKLEKTFAAVMKPKLQSV FT GVQTDLTWINEKFEEIKSQTPVNTKCTDHQSQTVLHQTQVNHKEHSSTGQH FT SSKNKNDKKPNPQKEKTSQEYKEMKTSSSKDIEMSDDKKPRSRSTSPKTKT FT KGHSYVLPT" FT CDS 1638..5348 FT /product="I-2_DR_2p" FT /note="includes endonuclease, reverse FT transcriptase, and RNase H domains." FT /translation="NQRALVCLTNLTVESHIIQWNCRGIRANFSELQRLAC FT IYNPLAFCLQETHLTPESNISLKHFTCLNAYGPNLQRPCGGTSILLRHDVI FT HSNVDINTNLQVVAVRITLQNTITLCSVYIPPEATVSHQDLENLVEQIPPP FT FILMGDFNAHNPLWGGKQLDAKGKKMEKLINENDLCLLNDGSYTYLHPGHG FT SFSAIDLTICDATLATDFSWYVCDDLCGSDHFPLIITKTAADTQQRPQKWK FT LEKANWYSFQALCYERLDGKQQDKENPIKWFTEKIIEIADESVPKTSTKQS FT KRRNPWFDDQCKELIKARKKAVRCFQKHPTSENLIRIKICRAQARRYIREA FT KRQSWNKFVSSLTINTPSKKIWDAIRKMKGREGPQLKHIDNHGTLLTSKHD FT IANILAETFAKNSSTENYQPNFQKIRTNQETIKLNFISQNTEIYNQPFSME FT ELLNSLKCCHDTAVGPDKVHYQFLKHLPQLSLCLLLDNFNEIWKSGNIPPS FT WQEATIIPIPKPGKEHKDPNNYRPIALTSCVCKTMERMVNNRLVWKLESDH FT QISDFQCGFRRGRCTLDHLINLESYIRNGFIKKEHVVAVFFDLEKAYDTTW FT KYGILKDLYKMGFRGNLPIFISNFLSNRTFRVQVGTTMSDPQIQQQGVPQG FT SILSVTLFMVKINSVTDVIGRNMMCSLYVDDICICYRGKNMNIIERQLQLC FT INKVSNWSTENGFKFSKTKTVCMHFCLLRSLHHDPELFLDEEPIKIVKETK FT FLGLHFDCKMTFIPHIKALKNKCLKNLDLIKVLAKTKWGADSTVLMRLYRL FT LVRSRLDYGSIIYSSARKSYLQMLDPVHHQGLRLALGAFRTSPAQSLYTEA FT NEPPLHIRRLELSLQYALKIKTNQHNPAFKPIFQPQYLDLYEXRPSYIQPF FT GVRIRTHIKNLNVNLDSIHQTLICPVPPWRLNKLHVNLDLSKHKKADTFPH FT TYQQAVAGIRHFYPTHNPIYTDGSKKDGHVSAAIVMGQSHHGIRIPDQSSI FT FTAEAKALFLALEHIENAEGHNFIIFSDSKSCLQTLNSFKLEHPIIIDIFI FT KVNELQRKFYNIVFCWIPGHTGLLGNEQADKAAKTALTGKLMECKIPPSDL FT KPLIKGYILNKWQSEWDQCPENKLFEIQPEIGKKSNLFFKSRHDEIVYRRC FT RIGHTRLTHEHLLKGEEPPKCLYCNRNQTVKHILVECPIFNVIRKQYLSGE FT TLKEIFLNVNPSKVVEFLSKGNLKKLF" XX SQ Sequence 5532 BP; 1951 A; 1084 C; 986 G; 1510 T; 1 other; ccccgtgggg acccggggta gaataggtcc ccagcacccc cttgctgatc gtaagaggcg 60 acaaatgggg caacttgttt cagccgtgag ttgtgacccg tgtcagtgga aaaagagatc 120 ctggtatttg aggaatgtaa ctgttgcgac tattgcagat ccaacacagt actttggctc 180 ctatttcact tagacgggag gttggaaggg cccgatccaa tcaatcggct agtcaagaaa 240 tgccctggat tatttaactt ttttaaacat ctagcaaatt tccagaaggc tttggctgag 300 ggtcaattga gtaatgcata tttttaaata tgcttgcttg agtatatcat tcttatatcc 360 ttgatgtata cccggtggag atccgcggct ctgtgcatcc ccttgttggg ctccgaggtg 420 ggtgaggagt aagagtgatg aatctgaaat catacaaaaa tggcaaacaa acaaaaccca 480 aaaagaaaat tatcggatca ggacttggat tgtaactttg gaaatggcat cagtctacta 540 cctaaaagtg aaaactggca gagatttata cttctagaat ctctacaatc tgacatgcca 600 ctctctaaac tgtctccttt tgctattcaa aaaggcatct ctgggattgc aggaacagtg 660 aaagattgta aaaaactaag atctggacaa attcttgttg aatgctccaa aaaggttcaa 720 gcagaaaacc tactacgtgc aaatatgctt gctggtgttg ctatgaaaac ttttcaccat 780 ccaactttaa atcatagcaa aggagtaatc cgtaccagag agctagaaga catggaagag 840 actgaaatta ctacagagtt aatgactcaa gatgtaatac atgttaaaag aatcacaatc 900 aggagagaag atcgtctcat aaaaacagga acctatattt taacattcag tagacctcta 960 cctccagaaa agattcaaat tggatacttg agtgttaatg ttgacatttt tacacctaat 1020 ccactcaggt gtttcagctg ccaaaagttt ggacatggat cgatttcttg caaaaacagg 1080 cctacttgtg taaactgtgg agaggaaaaa catggtgatc aatgtaaaaa aacaccaaag 1140 tgcagcaact gtgcaggaga acatccaagc tcatccaaag actgtcctac ctggattaaa 1200 gaaaaagaaa tccagaaaat caagtgcact aagaaggtga gctatttgga agctcgaaag 1260 cttgtagaga actcctcatt ctttaaattg gaaaaaactt ttgctgctgt gatgaaacca 1320 aaactgcagt ctgtcggcgt tcagacagat ttgacgtgga taaatgaaaa atttgaagaa 1380 ataaaatctc aaactccagt aaatactaag tgtactgatc atcaaagtca gactgtactt 1440 catcaaactc aagtaaatca caaagaacat tcatctacag gacaacactc atccaaaaac 1500 aaaaatgaca aaaaaccaaa tcctcaaaaa gaaaaaacaa gccaagagta taaggagatg 1560 aagaccagca gctcaaaaga catcgaaatg tcagacgaca aaaaacctcg aagtcggagc 1620 acttctccta aaactaaaac caaagggcac tcgtatgtct taccaacttg acagtggaaa 1680 gtcatattat tcagtggaac tgccgtggca taagagccaa tttttctgaa ttacaacgtc 1740 ttgcttgcat ttataatccc ctggctttct gtctccaaga gacccatctt actccagaga 1800 gcaatatatc tctgaaacat tttacatgcc taaatgcata tggcccaaac cttcaacgtc 1860 cttgtggtgg aacatcaata ttacttagac atgatgttat tcatagtaat gttgacataa 1920 atacaaacct acaagtagtg gcagtccgta taactttgca aaatactatt actctatgtt 1980 ctgtttatat tcccccagaa gctactgtct cacatcaaga tttggaaaat ctggtggaac 2040 aaatcccccc tccatttatt ctcatgggag attttaatgc tcacaatcct ctgtggggag 2100 gaaaacaact tgatgcaaag ggaaagaaga tggaaaaact aattaatgaa aatgacctgt 2160 gtcttttaaa tgatggttca tacacatact tacacccagg acatggatcc ttctctgcca 2220 ttgacttaac aatatgtgat gcaactttag ccactgactt ctcatggtat gtctgcgatg 2280 atctctgtgg aagtgatcac tttcctttga ttataaccaa aacagccgca gatacacaac 2340 agagacccca aaaatggaaa cttgaaaagg caaattggta ctcatttcaa gccctttgtt 2400 atgagagact tgatggcaag caacaggata aagaaaatcc aatcaaatgg ttcacagaaa 2460 aaattattga aatagcagat gaatctgtac caaaaacatc gacaaaacag agcaagagaa 2520 ggaatccttg gtttgatgac caatgtaaag aactcattaa agcacggaaa aaagcagtaa 2580 gatgttttca aaagcatcca acttctgaaa accttatcag gataaaaatt tgcagagcac 2640 aggcacggag atatataaga gaagcaaaga gacagagctg gaataaattt gtgtcaagcc 2700 tgacaataaa tacaccatca aagaaaatat gggatgcaat tcggaaaatg aaaggaagag 2760 agggaccaca actaaaacac atcgataatc acggcactct gctgactagt aaacatgaca 2820 ttgctaatat acttgcagaa acctttgcaa agaattcctc tacagaaaac tatcagccaa 2880 atttccaaaa aatcagaact aaccaagaaa caattaaatt aaatttcatt tcccaaaaca 2940 cagagatcta caatcaacct ttttctatgg aggaactttt aaactctcta aagtgttgcc 3000 atgacacagc agtcggacca gacaaagtac attatcagtt tcttaaacat cttccccaat 3060 tgagtctttg cctcctcctg gataatttca atgaaatttg gaaatcagga aatatcccac 3120 catcatggca agaagccacc ataattccta taccaaaacc tgggaaggaa cataaagacc 3180 ccaacaatta cagacctata gcattgacga gctgcgtatg taaaactatg gagaggatgg 3240 taaataaccg acttgtctgg aagcttgagt ccgatcatca gataagtgat ttccaatgtg 3300 gtttcagaag aggaaggtgt acattagatc atctcatcaa tctggagtct tacatacgaa 3360 atggattcat caaaaaggag catgttgtag ctgtattttt cgaccttgaa aaagcatatg 3420 acactacatg gaaatatggc attcttaaag atctgtataa aatgggattc aggggaaatt 3480 tacccatctt catttcaaac tttttatcaa atagaacttt ccgagttcaa gtaggaacca 3540 ctatgtcaga ccctcaaata caacagcaag gagttcctca aggcagcatc ctttcagtga 3600 ctctttttat ggttaaaatt aatagtgtca cagatgtcat tggaagaaac atgatgtgca 3660 gtctttatgt ggatgacatt tgtatatgct acagagggaa aaacatgaac ataattgaaa 3720 gacagctaca gttatgcata aacaaagtga gtaattggtc tacagaaaat ggtttcaaat 3780 tttccaaaac caaaactgta tgtatgcatt tctgtctttt acgatctcta catcatgatc 3840 cagaactctt tttggatgag gaacccatca agattgttaa agaaacgaaa ttccttggac 3900 tccatttcga ttgtaaaatg accttcatcc cacatataaa agctctaaag aacaaatgct 3960 tgaagaactt ggatctaatt aaagttcttg ctaaaacaaa atggggggca gattctactg 4020 ttttaatgag gttgtataga cttttggtgc gctcacgact ggactatggg agtataattt 4080 acagctccgc tagaaaatct tatttacaaa tgctggatcc tgtacaccac caaggactta 4140 gactagctct tggagctttc agaacttcac ctgctcaaag cctatacact gaagcaaatg 4200 aaccacctct tcatattaga cgcttggaat tatctctcca gtatgcatta aaaataaaga 4260 caaatcaaca caacccagct ttcaaaccaa tctttcaacc tcaatatttg gatttgtatg 4320 aggmaagacc cagctatatt caaccattcg gtgtgcggat cagaacacac ataaaaaacc 4380 taaatgtaaa tcttgattcc atccaccaga ctctcatttg cccggtccca ccatggaggt 4440 taaataagtt gcatgtaaat ctggatctaa gcaagcataa gaaggctgat acattcccac 4500 atacatatca acaagcagtt gctggcataa gacattttta tcctacccat aatccaatat 4560 acacagatgg atcaaaaaag gacggtcatg tttctgcagc aatcgtaatg ggtcaatcac 4620 accacggcat tcgcattcct gatcaaagct caatctttac agctgaagca aaggcactct 4680 ttttagcgct ggaacacatt gaaaatgctg aaggacacaa ttttattatt ttttctgatt 4740 ctaaatcttg tctacagaca ttaaattctt ttaaactgga gcatcccatt attattgaca 4800 tttttatcaa agtgaatgaa ttacagagaa agttttataa tatagtcttc tgctggatcc 4860 caggacatac tggactactt gggaatgaac aagctgacaa agccgccaaa acagccctaa 4920 caggaaagct gatggagtgc aaaatcccac cttcggatct taagccacta ataaaaggct 4980 atattctcaa caaatggcaa tcagaatggg atcagtgccc agaaaataaa ctttttgaaa 5040 ttcaacctga aataggaaaa aaatctaatt tattttttaa gtcaagacac gatgagattg 5100 tgtatagaag atgccgcatt ggacacacga gactcacaca tgaacattta ctcaaaggag 5160 aagaaccacc aaaatgcctt tattgcaaca gaaaccaaac agttaaacac attcttgtgg 5220 aatgccccat ttttaatgtt atcagaaaac aatatttatc tggagagact ttaaaggaaa 5280 tatttttaaa cgtgaatcca tctaaagttg tagaattttt atcaaaagga aatcttaaaa 5340 aactttttta gatttttatc ttatatatta tcattattat ttttatttat atattttacc 5400 ttgtacatta tttattgctg ttgatgactt ttaattgtta gaatattttt acttgatcta 5460 ttttcttttt gccatgacat agccatagaa gctgaaatgg caataaaatt aaatctctct 5520 ctctctctct ct 5532 // ID hAT-N74_DR repbase; DNA; ZEB; 1494 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N74_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1494 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1741-1741 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1494 BP; 457 A; 284 C; 256 G; 496 T; 1 other; tagggctggg cggtatatcg agttctgggg atatatcgat atgattcccc aacgcgatgc 60 ggtattatcc aatatcgttc atatcgatat ggtttgaggc cacacgtgcg cattctgcgc 120 gaaacagacc actccaaggt agcacgcgag ctccacttgc acaaatgtgc gcatgcgcaa 180 atgaatgatt cactccatga gtcatacaaa agatttgttc aaaatgaacg aatcgttcaa 240 gaacgaccca tcactgacgc gacatggagg aacatgcagt gccggtgttg tatcgaaatc 300 tgactccccg gacaaatctg actccccctc gcgtgcacgc acgtcacaca gtgcctgcag 360 ctgtcacagc ggtttatcag aaacctttta tcgatcattt cttccgcaat tgacagcaag 420 aacgaacata gaagccttgt caaattgaca agcagcacct gaatgtgttc aaaagaatta 480 aaaacgccaa aatgggacga atttataccg catttcgaaa gttgaaacca cctcatattc 540 actgattata agttcagtat ttattacacg acggtctctt ctttacattt acgttataac 600 acatttagct ctgcgttctg ctctgttata atgactgttt acatttatat gcttaaccag 660 catttcatca cagtactctg aaacaatttc gttacttctc atatctatag tctattcgtt 720 ttcatctcct tatttaatat ttagtaaaac aagtttagtc aaaacaaaac aaaagaaaaa 780 gtgtaggatt ttttacatgg agtgaaagac ttctctttag atatcttcga tatgtgcact 840 gtttaataat tgtgcacaat acatacatgt aaatgtgtaa ctaaaaaaca tacayataca 900 gctgatttaa ttgttgttta taaatcacac aactgtggta tacaggatta cttataacaa 960 cactgtaatt gtgacataaa atactgtttc atatttacat tcaatgtggt tcttttctat 1020 cctacaatac tttttgttgt gttactattt aagtgtgttc ctttttgctc tgacaatgta 1080 gctctgtttt ctatggttct gtttactttg ttatttgcac aacagcacag tagtgctgtg 1140 tatagtgtta agcaggaaaa aaaaactgta ctgtatttat ttatcaaaga ttttttttat 1200 aatttgaaat gttgtgcaac ttatagtttt aagcaggaga aaaacctgta ctgtatttta 1260 tttatcaaag attttgtgca gctgttattt tttgtgcagc tgtttatttg taaacaggga 1320 gggaaacccc tgtgtttgaa ttatattttg ctcaaaaaaa tgtttaataa agttttaata 1380 aaggctttaa ctctcaagta accatttatg ggaaaatacc ggatatatat cgtataccgt 1440 cattcctcct aaaaataccg ggatatgatt ttttgcccat atcgcccagc ccta 1494 // ID DIRS-N1_DR repbase; DNA; ZEB; 6632 BP. XX AC . XX DT 31-OCT-2008 (Rel. 13.1, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; phage integrase; DIRS-N1_DR. XX NM DIRS-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6632 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(10), 1273-1273 (2008). XX DR [1] (Consensus) XX CC Members are 98% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 6632 BP; 1838 A; 2035 C; 956 G; 1802 T; 1 other; aagtgaagtt tattcataaa ctaatttcga gaggatcacg tgcttatgat ttatcacgtc 60 cagtctcgca tttgctaatc actaatcttc caatcatatg agccctaagg caccataaat 120 agtctaagtt ttcagttcac tttatcttcg tttggaagaa gcagcttcac cgtagctagc 180 tccgttgaag aaccaactct aaaccagcat ggacaactaa agcagcaaca agctacaatc 240 ttcaggatct tcatttggaa gaacgctctg ctctgctcat caactacaaa agctacaagt 300 gctacaagct acaaaagctt caagctacaa tctacaatat acaatctaaa agcttcaatc 360 tacaagctac aatatacaag ctacagcaac aacaacaaca actactattg ctaccatttc 420 aaacaactac aactccaaca atttcatcaa cagcttaaac aacaacacca aaaccttcca 480 cttcaaagag cctccacaac gcatctgctg tgtcttcaac cttatttccc agcaagatat 540 aagccaaaac tccaaagcta ttgcaataaa acggaaccct caccttctcc tagcagtaca 600 catgctatcc atcgtttaac agattgtttg ctggaatgtg ttcagcagag cagcagctcc 660 agccacaaac aatgttgaat ttgatcaaca aaatggccgc caggcttttt gcaccttttg 720 catggcctga ctgaaactcc ttaagccaat agctgtaaag ataagcgtct ccatccaatg 780 agctaaaaga ctggagatgg tcccgccctc tctcttgact ctgttgcaaa gaccttatga 840 atgaataggc cagacacgct tgtaaattta gttaaaaaaa aaaaaaaaaa aaaaaaaaaa 900 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaataaaa ttccatttga catttaactt 960 ctgaaaagat tattacccaa ccagtaaata ttaacaagct attgaaaaac attatgccat 1020 caagcacaga agtccaattg taatacgcca gcgactcatg tgttgattta caagcgtcat 1080 ataagattat atagcaagtc agtcggatgt gaaaacggca aagaaacaaa cttcccactt 1140 gactgttata aatacaaatt taaacttcac tatcacttca catgctcagt cttcaagggg 1200 taccagtttt ccaggaccac cagactagcc atatcccatg tcctcagctc tccattcatc 1260 tgtcatcccg ttaggagacg aacacccacc cctaccatcc ttgcacatcg tatcaatgaa 1320 gtcatcggac catgcattat caaaagcgct tccatttaca gccgcccatg cccttattaa 1380 gatgcctcgc atggatataa actaaaacaa ttttcatgtc tgccatactt gtattatttt 1440 gcaaagattt aatttctgat tctctttgtt ttttctattt tagagatcat gactgctggc 1500 tccagaagct gaccctcatc caacctatta cactccacat cctcttcaca aaggatacat 1560 tccatttttc actttatcca atctgtccta acctgcagtc tgcaacctcc gaacctgaaa 1620 ccgtcgatta tcttattaaa aagaaatcga caacaacttc atgattggaa ctccaaggct 1680 cctcctttta atatctcgcg cattagcccc attggtgtcg ctactcaaaa atgttctggc 1740 aaaaacgcct tgtagtcgac ctttcgtccc aacacaattg ctcctttcct agcattaaca 1800 gcactatccc attggacgaa tatttactta actaccacga cattgatcaa gagatctctc 1860 tcataaagat agccagtcgt aacacctggc tagccaaaga agacatttcc cctaccttta 1920 aaatcatgcc aatccaccca gacttctggc accttttggc atttattggc gaaatcaatt 1980 ctatttttcg gatgcaaaag cagcccaaaa aatatttgac ataatgtcag aagctttatg 2040 ctggattctg tctaatattt acgaaatacc atacctcatt caccttctag acgattttct 2100 tatttccccc ccgtcatctc ctccagccaa acacctagcg atcacccagc aggttttcgc 2160 tgatctcgga aggacccagt acttcaatcg aatttctggg tattaatcta gactcgcata 2220 aattccaagc atcccttcct aaagagaaaa tcaatcgaat aatttctcta tcccaaattt 2280 tccttgaaaa acagatgtgc acacaacgag aactcctacc aattctcggc catctaaatt 2340 tcgctatgcg cattattcct ttcatttccc acctccatca attatccacc acagttcatg 2400 gtttagaaga aacaattttt ctctccaaac ccagtcgcga tgaactttgc ttatggatct 2460 ctttccttaa gcaatggaac ggctgttcct ttttctacag cgacttgatt gcatccccta 2520 ttgacatcaa cacatacaca gacgctgccc ctccaatagg ttacagtggc taatataaag 2580 gacactggtt tgtctcaaca tggccaaccc aattgtaatt ccattccaaa gaccaatgtt 2640 tttcagccct cttcaaattc taccctatca tcgcagcagc catcctgtgg ggggacgaat 2700 ggtctacatc tagcattctc tttcactgcg ataacgaagc tacagtgcat tgcattaaca 2760 aagggcgctc ccactcccaa tcgcttatgc catttttaag acaccttatc tggatatctg 2820 ctaaaaaaca atttatcatg attgctgaac atgtacctgg ttgcaaaaac caaatcgctg 2880 actctctctc tctctcattt ctctttgcag atattctggc aactagcccc ggaagcagac 2940 cctcacccaa cgccggtccc tccttattca gtaacgatat ttccataaac cacccacttc 3000 ataatcttca tcaaacttct ctatctctta tcctacaagc aatagctccc agaaccctca 3060 atgcatacct cacagcatgg aattcgttca aacaattcca taccatgcta gacaacccat 3120 tacactcaaa atccttacct catgcatcta caccctctgt aaaggttaca tttcctccca 3180 tacagcccgc accttagatg ctatgttcaa tctagcattt ttttgggggg tttcttaaat 3240 gttctgaatt aacagttaca tccaaattta accctctact ccaccccacc atctcagatc 3300 tagctttgca agacagggaa actatctctt tccttatcaa acaaagcaaa acagattaat 3360 tccagagagg acactctatc ctcattttca atattccttc acctacacgc ccattccaaa 3420 ccctcttagc ctagaaaatc tcaagaggct aacccactgg ccccgctttt tactgatgac 3480 gctaaccgtc cagtatctcg attctggttt caaaaacacc ttaaagaaat ctttcgccta 3540 tcaggttttt tcccagagcc cttttccagc cactcattca ggattggcgc agccactaca 3600 acgggctctc acaccatcag atccagaccc ttggtcgctg gtcttctgaa gttttcaaat 3660 cttatatacg tctcagtaaa taccacctca agaaagccca acaggcttta accaaccccc 3720 aagcacctcc tcacacggct ccaactcaaa aaggtgtcta acagagcctc gactctcgca 3780 agagtcatcc aaaccaaacc ccgcagaagc ccaattgtcc acccatggca cccttacatg 3840 tttgccaagg actccctcaa gggccattgc agttagcatc ttctgcaatc accaatagct 3900 atttacagat gcagacgttt gtatccacat gttctttgcg catttactgc gtttgtacat 3960 atgtattata tttgtgcttg ttcccttctt ccaacttttc caaatctttt atccttataa 4020 ccctactttt cagcctctcg ttcctttctt ctctacactc cactcccaac taatctattt 4080 tacactctac ctctaatgat cttgtttctc ctgtctagac tcttcagaca cccgaagggg 4140 cccccttgag ctccaactct cgcaggattc ggccatagcg gccacagtcc tttcttcttc 4200 acgatgttac ctattctttt tccgcctaga ctctttctcc aagctccaaa ccccgcaggg 4260 gtccgctcaa gctaaaactt cttcactcta cttcaactac ctctcttctc cttatgactc 4320 accctctagc tctgaccccc acaggggtcg ctccgagccc caatctctcg caagagttat 4380 ccaaatcctc tttcccactc actgatatct atctcctcct atccgcaatc taccctcaag 4440 ctctgacctc cacaaaggtc gctcagagct ccaactctca caagagttac tctcttccac 4500 tcactgctac ctatctcccc ctatctacac tctaccccca agctctgacc tccacagagg 4560 tcgcttcaag ctccaactct cgcaagagtt actctcttcc actcattgct cctcccagct 4620 acactctacc ctcaagctca aacctccgca gaggtcactc ctatctacac cctaccctca 4680 agctcaaacc tccgcagagg tcactcctat ctacacccta ccttcaagct caaacctccg 4740 cagaggtcgc tcctagctac actctaccct caagctcaaa cctccgcaga ggtcactcct 4800 atctacactc taccctcaag ctcaaacctc cgcagaggtc actcctatct acaccctacc 4860 ctcaagctca aacctccgca gaggtcactc ctatctacac cctaccttca agctcaaacc 4920 tccgcagagg tcgctcctat ctacacccta ccctcaagct caaacctccg cagaggtcac 4980 tcctatctac accctacctt caagctcaaa cctccgcaga ggtcgctcct atctacactc 5040 taccctcaag ctctaacctc cgcagaggtc actcctatct acaccctacc ctcaagctca 5100 aacctccgca gaggtcactc ctatctacac cctaccttca agctcaaacc tccgcagagg 5160 tcgctcctat ctacwcccta ccctcaagct ctaacctccg cagaggtcac tcctatctac 5220 accctaccct caagctcaaa cctccgcaga ggtcactcct atctacaccc taccttcaag 5280 ctcaaacctc cgcagaggtc gctcctatct acaccctacc ttcaagctca aacctccgca 5340 gaggtcgctc ctatctacac cctaccttca agctcaaacc tccgcagagg tcgctcctat 5400 ctactcccta ccctcaagct ctaacctccg cagaggtcac tcctatctac accctaccct 5460 caagctctaa cctccgcaga ggccgctcta aactcaaact ctcacaaaag ctccatcttt 5520 cgtaagagct actctcctcc actctctgct ccataccttc actctactct ggcccccgca 5580 ggggtcactt agagctccaa ctcccacaag attcactcaa agttctcttt tccactgctt 5640 caaaccacct attctcccct tcttcttaac cccgaaaacc atgacccccg caggggttcc 5700 ttcaaactct aactccagca agagttatta gaatcctctt ttcaccctta ttaatcctat 5760 ctaattaatg cccatgcatt taacctcttt ccttctatta tatccagcag ccggatatag 5820 ctctaaattt cctgcctttt ggggggtttt ttcttcgaat acgcggctgc tgtcccgagc 5880 gattaatttc tgcattttgg ggagttctga gatccaccga gctcaggctc ccttcttgct 5940 ctgccaacgg gaggaagccc cgggctcgag gagcccttga gctcggggct ctctcccggg 6000 acagcatgcc aaataagctt tgtaaatcat cagctaagtg tgaactcttg aagtgaagtt 6060 tattcataaa ctaatttcga gaggatcacg tgcttatgat ttatcacgtc cagtctcgca 6120 tttgctaatc actaatcttc caatcatatg agccctaagg caccataaat agtctaagtt 6180 ttcagttcac tttatcttcg tttggaagaa aacccccctc ctcccctatt ctcctccttt 6240 acctgtaatt gggcggcacg gcggtccagt ggttagcact gtgaaccaca cagcaagaat 6300 actgccggtc ctagttcgat aggaccggtg agtgtttctg tggggagttt gtatgtcctt 6360 cccgtgtccg cgtgggtttt ccccgggctc tccggtttcc tcccaccatc caaagacatt 6420 caacatactt aacaatcaag ctggtctaat tccttacgtt cccttagcta cagcggcagg 6480 ggagttctga gatccaccga gctcaggctc ccttcttgct ctgccaacgg gaggaagccc 6540 cgggctcgag gagcccttga gctcggggct ctctcccggg acagcatgcc aaataagctt 6600 tgtaaatcat cagctaagtg tgaactcttg aa 6632 // ID Gypsy58-LTR_DR repbase; DNA; ZEB; 144 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy58-I_DR; KW Gypsy58-LTR_DR; Gypsy58_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-144 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1432-1432 (2008). XX DR Genome; chr16; Positions 11949951 11949808. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 144 BP; 48 A; 25 C; 26 G; 45 T; 0 other; tgtcatgtat tttgtatgag tgtgctaccc gtacctgagt gacgtcatca tgttgagttc 60 agttaataaa agaatactgc atggaacagc acggctactc tggtgtcatc ttatacatca 120 tttaaaaata taaaaaacat aaca 144 // ID HE1_DR1 repbase; DNA; ZEB; 402 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish HE1 SINE element - a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; DANA; KW HE1_DR1; HE1_MM; HE1_SINE. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Ogiwara I., Miya M., Ohshima K. and Okada N.; RT "Retropositional parasitism of SINEs on LINEs: identification of RT SINEs and LINEs in elasmobranchs."; RL Mol. Biol. Evol 16(9), 1238-1250 (1999). XX RN [2] RP 1-402 RA Jurka J. and Drazkiewicz A.; RT "HE1_DR1: HE1 SINE element from zebrafish."; RL Repbase Reports 2(3), 4-4 (2002). XX DR [2] (Consensus) XX CC Similar to Mustelus manazo SINE elements: HE1_SINE and HE1_MM CC reported by [1]. However, this is the first report of HE1 SINE CC elements in zebrafish. HE1 SINEs are similar to MER6 DNA CC transposons from the human genome. XX SQ Sequence 402 BP; 89 A; 76 C; 119 G; 114 T; 4 other; gggcgacacg gtggctcagt ggttagcact gtcgcctcac agcaagaagg tngctggttc 60 gagtcccggc tgggtcagtt ggcatttctg tgtggagttt gcatgttctc cccgtgttyg 120 cgtgggtttc ctccgggtgc tccggtttcc cccacagtcc aaagacatgc gctataggtg 180 aattggataa actaaattgg ccgtagtgta tgtgtgtgtg tgtgaatgtg agtgtgtatg 240 ggtgtttccc agtactgggt tgcagctgga agggcatccg ctgtgtaaaa catatgctgg 300 atwagttggc ggttcattcc gctgtggcga cccctgataa ataaaaggga ctaagccaaa 360 ggaaaatgaa tggaatgaat naatataatt tttttcaatt aa 402 // ID TE-U_1_DR repbase; DNA; ZEB; 286 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous transposable element from Zebrafish. XX KW Transposable Element; Nonautonomous; microsatellite; CA; GT; KW TE-U_1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-286 RA Bao W. and Jurka J.; RT "Transposable elements from zebrafish."; RL Repbase Reports 8(10), 1620-1620 (2008). XX DR [1] (Consensus) XX CC This transposon was inserted specifically into AC (or GT ) CC microsatellite. The size of TSD is unknown. The TIRs are CC imperfect, around 20-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 286 BP; 78 A; 69 C; 43 G; 96 T; 0 other; tgtttgtttt tgtgaaaagt ggggacatta cataggtttc cattcatttt atactgtcca 60 aaccgtatat tatattgccc tcaccccacc ccacccctaa acccaaccat cacaggagac 120 tgtgtgcagc tttactctct gattaaactc attctgtagg atttataagc attttgagaa 180 atgaggacgt caccaatgtc ctcatatttc acctcctttt tgtaatacct gtgtcatacc 240 catgtcatta tacagatttg tgtcctgata tgtcacaaaa acacgt 286 // ID Gypsy95-I_DR repbase; DNA; ZEB; 4303 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy95-I_DR; KW Gypsy95-LTR_DR; Gypsy95_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4303 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1501-1501 (2008). XX DR Genome; chr19; Positions 14949914 14945612. XX CC Positions [3205-3666] - Integrase core CC 'TCAGG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 105..1028 FT /product="Gypsy95-I_DR_2p" FT /translation="MQSMMNQTSIQPFDTETDPSSLGTWWNKWIQRFENFI FT IAMNITGDARKKALLLHLAGERVHDIYDTFAADEDEYTETKTKLNNYFSPK FT KNTQFLVYQFRKAVQQHGEKLDTYQTRLRMLAKDCEFADVDCEIKAQLIQS FT CTSSRLRRKALREPDLTLEALLDHGRTLEISEQQAAGIEQNPVAAVSAVYH FT NYKQQSKFKATNKQISNIQCRNCGGQYPHQGVCPARGKECRACGKLNHYAK FT VCRSNTTKHPQQRKYNDRLQYKRNSNPQTSKQVAATCVSNTQDDSSSSEDA FT YVFVFKQQHMPTTATD" FT CDS 1636..4098 FT /product="Gypsy95-I_DR_1p" FT /translation="MRQANIAIQRERHITPTIDDVIQELNGAKVFSKLDLR FT AGYHQLELHPDSRYITTFTTHLGLRRYKRLNFGISSAAEVFQNAIAQTLQG FT IKGVKNLSDDIIFYGASQRAHDEALQAVFQRLQDMGLTLNREKCEFNKNRL FT EFYGFIFSDSGISADPKKVAAILQASEPKDQGEIRSLLGMANYCSRFIQDF FT STICAPLRDLTKKDTPWQWGEMHKQALQQLKDALARATTVSYFAPLKDTEL FT VVDASPVGLGAILSQTQNGQKNVIAYASRALSDTEKRYSQTEREALAIVWS FT CEHFHLYIYGHPFTLVTDHKPLELIWNNPRSKPPARIERWGLRLQPYQFRI FT EYRKGIDNPADFMSRHPVPLDVECNTRASKVAEEYVNFMALQSTPKAMMID FT EIKTETKKDAVLQQVCSCIRNNSWHINTRDSLFADTLKQYRNVSSELTVSP FT NDDLILRGSRIVIPSTLEQRVLDLAHESHQGVAKTKSLLREKIWFPNIDRK FT VEVITTNCIACQANTPITHVEPLQMSEMPEAPWHNLSADFYGPLPSGEYLL FT VVIDDYTRYPVVKILHSTSATTVIPAIDDVFSMFGIPKVLKTDNGPSFNSN FT MFAQFAVYSGFHHRKITPFWPQANATAERFMRTLGKAIRIAHTRGLPWKQQ FT LNVFLREYRSTPHCTTGTSPAELLFQRKMRTKIPIATSSIISVDDVVRKRD FT KSAKDKMKALADTRKHATHINIKPGDTVLCRQQKQNKLTTPYSTKPLTVTR FT VKGTMVTAERQGHSITRNCSFFKRLNQEMPELLFEPGHGDETEENVTHTVP FT PKYPFRHNRRPPAYLSDYV" XX SQ Sequence 4303 BP; 1464 A; 958 C; 835 G; 1046 T; 0 other; atggtgacga ggatactaac gcgatctcaa cccatgattt gaagttttga agaaatgctt 60 cccgtacgtg agtgagtaaa cactatcaca cagaagctgt agccatgcag tcgatgatga 120 atcaaacatc catccagcca tttgacacag aaacagaccc cagttcgtta ggaacttggt 180 ggaataagtg gatacagcgt ttcgaaaact tcattattgc aatgaatatt accggcgacg 240 ctcggaagaa agcgctactg ctgcaccttg ctggagaacg cgtgcatgac atatatgata 300 cgtttgccgc agatgaagat gaatacacag aaacaaaaac gaaactaaat aactacttct 360 cccctaaaaa gaatacacaa tttcttgttt atcagttcag aaaagctgtt cagcaacatg 420 gagaaaaact tgacacttat caaaccagac tgaggatgct tgcaaaagac tgtgagtttg 480 ccgatgttga ctgtgaaata aaagcacagt tgatacaaag ctgcacgtcg tctaggctgc 540 gtagaaaagc actcagagaa ccagatttga ctttggaggc tttattagac catgggagaa 600 cgctggaaat atctgagcaa caagcagctg gaattgaaca aaacccagtg gccgctgtca 660 gtgctgtata tcacaattat aagcaacaaa gtaaattcaa agcaaccaac aaacagattt 720 ctaacataca atgcagaaac tgcggaggac agtatccaca tcaaggtgtt tgtcctgcta 780 gaggaaaaga atgcagagca tgtggaaaac taaatcacta tgcaaaagtg tgtcgatcta 840 acactacgaa acaccctcaa caacgaaaat ataatgatcg tctacagtac aaacgtaaca 900 gcaatccaca aacaagcaaa caagttgctg caacatgtgt cagtaataca caagatgact 960 catcaagtag tgaagatgca tatgtgtttg tttttaaaca gcaacacatg ccaacaacag 1020 ccacagactg aaataaaaat aaacggcacc agggttagag ctttgatcga ttctggagct 1080 gctgtaaata ttattagcaa agcagaattt gacacgcttg caccaaaacc acaaatcaga 1140 ccagcaacaa tcaagatttt cgcatatggt tctgatactg ctctacccat cattggagtg 1200 ttccagtgcg acgtcaaagc atcacatagt gctacagaag ctaagtttta tgtacttcag 1260 aatgatggac acactcttct aagctatgga acagcacaag agcttggatt aataaacatc 1320 agcagctccg ttaaccatac aagcaccatt cgtacagtag ctgatgaact aattgaaaat 1380 catccagatc tgttcaaagg aatagggaaa ctgaaaaact ttcaagtgaa gttgcacatt 1440 aatccagatg ttcaacctac atgtcagcca cacagacgtg tgccattcca cattcgacag 1500 aaagtggaag ctgaactaca acaactgaca aacaatgaca tcatagaaat agttacaggt 1560 ccaacgccat gggtttcccc tattgtgact ccacccaaac caaaagatcc tgattaagtc 1620 cgaatttgtg ttgacatgcg ccaagctaat atagccatac aacgtgagcg tcacataacg 1680 cccaccattg atgatgtcat acaagagcta aacggtgcaa aagtgttctc aaaacttgat 1740 cttagagctg gttaccatca gctagaactt catccagaca gccgatacat cactacattc 1800 accactcatc taggactaag gagatacaaa aggcttaatt ttggcatatc atctgctgca 1860 gaagtgttcc aaaatgccat agcacaaacg ctacaaggca taaaaggtgt aaaaaacctc 1920 agcgatgata tcatttttta tggtgcctca cagagagcac atgatgaagc gctccaagct 1980 gtttttcaga gactacaaga catgggtctt acactaaatc gtgagaaatg tgagtttaac 2040 aaaaacagac ttgaatttta tggattcatt ttctctgaca gtggcatctc agcagacccc 2100 aaaaaagttg cagccattct acaagccagt gaacccaagg atcaaggtga aattcggagc 2160 ctactaggga tggcaaacta ctgttcacga tttatacagg acttttcgac aatctgtgca 2220 cctcttcgtg atctaactaa aaaagacaca ccttggcagt ggggtgaaat gcataagcaa 2280 gctctacaac agctcaaaga tgctttagcc cgtgccacca ctgtatcata ctttgccccc 2340 ctaaaagaca ctgaacttgt tgtagatgcc agtccagttg ggcttggtgc cattttatca 2400 caaacacaaa atggtcaaaa aaatgtcatt gcatatgcca gtcgcgcact cagcgacact 2460 gaaaaacgat actcgcaaac tgaacgcgag gctctagcca ttgtatggag ctgcgagcat 2520 tttcatttgt acatttacgg tcatcctttt actttagtga ctgaccacaa gcctttagaa 2580 ctgatatgga acaacccaag atcaaagcca ccagccagaa ttgaacgatg gggcctcaga 2640 ctccagccat accaattcag aatagaatac agaaaaggaa ttgacaatcc tgctgatttc 2700 atgtccagac accctgtgcc acttgatgta gaatgtaaca ctagggcatc caaagtggca 2760 gaggagtatg tcaatttcat ggctctgcag tcaacaccga aagccatgat gatagatgag 2820 ataaaaactg agacaaagaa ggatgccgtg ctccagcagg tttgctcctg cataagaaac 2880 aactcatggc acataaatac acgagactca ctctttgctg acacactaaa acagtacaga 2940 aatgtcagtt ctgaactcac tgtgtctccg aacgatgatc tcattctcag gggctctaga 3000 atagtcatac catctactct agagcaaaga gtcttagatc ttgctcatga gagccatcaa 3060 ggggttgcta aaaccaaaag tcttctgaga gaaaagatat ggtttcccaa cattgatcgg 3120 aaagttgaag tcataactac aaattgcatt gcttgccaag caaatacccc catcacgcat 3180 gtcgagccgc ttcaaatgtc agaaatgcca gaagcaccat ggcataatct gagtgcggat 3240 ttctatgggc cattaccatc tggcgaatat ctgctagtcg ttatagatga ctacactcgc 3300 tatcctgtag tcaaaattct ccattctaca tcagcaacta ctgtgatacc agctattgat 3360 gatgtttttt ctatgtttgg aattcccaaa gtgcttaaaa ctgacaatgg cccttccttc 3420 aacagcaaca tgttcgcaca atttgctgtc tacagtgggt ttcaccatag gaaaataaca 3480 cctttctggc cacaagctaa tgctacagca gagagattta tgcgtacttt agggaaagcc 3540 attcgaattg ctcacactcg aggtctccct tggaaacaac agttaaatgt tttccttcgt 3600 gagtacagat ccacacctca ttgcacaaca gggacttctc cggctgaact gctctttcaa 3660 aggaagatgc gcacaaaaat tcccatagct acttcaagta tcatcagtgt ggatgatgtt 3720 gtccgcaaaa gagacaaaag cgcaaaggat aagatgaaag ctcttgcaga cactcgaaaa 3780 catgccacac acattaacat caaacctggg gacactgtat tgtgccggca acagaaacag 3840 aacaaactaa cgacaccata cagtacaaag ccactcacag tcaccagagt caaaggcact 3900 atggtaactg cagaacgcca aggacactct atcacaagga actgctcttt ctttaaaaga 3960 ctcaaccagg agatgcctga gcttttgttt gagccaggac atggagatga aacagaagaa 4020 aatgtcacac atacagtccc tccaaaatat cctttcaggc acaaccgtag gcctcctgcc 4080 tacctttctg attatgttta ggtaaaaaaa ataaataaat tgcattaaag ttctacaatg 4140 cacatgaaac agactgttca gatttttcaa gtattcatgt cgcataggaa ttttaattgt 4200 tgcagagttt tctatgattg gcaacagtta atacttaaaa tagtttgttg catagttatt 4260 attgcatatt aaaatgtttg taaaaaaaaa aaaaaaggag tga 4303 // ID CR1-22_DR repbase; DNA; ZEB; 1721 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.11, Created) DT 17-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-22_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1721 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1696-1696 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(2..391,410..1480) FT /product="CR1-22_DR_1p" FT /translation="DSVFLKTYKDILXGPITHLINLSITQGIYPDAWKLAI FT VTXIFKAGEQTNVQNYKPISILPTVAKVAEKWVAQQMIAYLNAGHTPLHPM FT QFGFXTNXSTESANCYLIENMKSKLDRGGVIGAVFFRFKKSIGTPFKAHSF FT NFSSEAIXWITSYLTNRKQCVRLGNSQSVYRISNCGVPQGSILGPILFSLY FT INDLPQVCPTINIQMYADDTVLYVHAKNKQLAANYLTEALVHVSDWLCESY FT LNLNTSKTVCMFFSQSLTVTHHPDVFVKGEKLRVVSDFKYLGLILDSQLTF FT QKHAKKVIKIVKFNLANFRHIRPYLTNEAAKLFMHSMIFSHITYCLTSWSH FT ASKTVLKSVESLYKQALKVFDQKSIRYHHCNIIQKHYLLSFDNSMKYLADA FT CMIFKILNGYAPPPLCEFVTHKNNNGQATRSVTRGDCAVQFRRTTFARSVF FT SVRASHFWNTLPIDLRGVTNYSTFKYKLKKWLKANQVCNHV*" XX SQ Sequence 1721 BP; 554 A; 280 C; 287 G; 588 T; 12 other; agattctgtg tttttaaaaa catacaagga tatactasmt ggscctatca cccatctaat 60 aaatctttca attacacaag gtatataccc agatgcttgg aaattagcca tagtgacacy 120 ratttttaaa gctggcgagc aaacaaatgt gcaaaactat aagccaatta gtatattgcc 180 tacagttgcc aaagtagcag aaaaatgggt agcacagcaa atgattgctt acttgaatgc 240 sggacatacc cctttacatc caatgcagtt tggatttara actaatyact caaccgaatc 300 tgctaattgt tatcttatag agaatatgaa atcyaaactt gacagaggtg gagtaatagg 360 agcagttttt tttagattta aaaagagcat ttgacacagt aaatcatgag gtactccttt 420 caaagctcac tcttttaact tttcatcaga ggcgattatn tggataactt catatctaac 480 aaataggaaa caatgtgtgc gccttggtaa tagtcaatca gtatacagaa tcagtaattg 540 tggagttcct caaggatcca ttttgggccc tatcctgttt agtctgtata taaatgacct 600 cccacaagtc tgtcctacta ttaatattca gatgtatgct gatgacacag ttctatatgt 660 gcacgcgaaa aacaagcaac tggcagcaaa ttatttaacc gaggcattag ttcatgtatc 720 tgattggttg tgtgaatcat atcttaattt gaatacaagt aaaacggttt gtatgttttt 780 ctcacaatca ttgacagtta cccatcatcc tgatgtattt gttaaaggag aaaagctcag 840 ggttgtctct gattttaaat atttaggtct aattttagat tctcaattaa cattccaaaa 900 gcatgccaag aaggttatta aaattgttaa atttaatctg gccaacttca ggcatatacg 960 accttacttg actaatgaag cagcaaaact ttttatgcac tccatgattt tttctcatat 1020 tacatattgt cttactagtt ggtcacacgc tagtaaaaca gttttaaaat cagttgaatc 1080 actttataaa caggcactta aagtatttga tcagaagtct attaggtatc atcattgtaa 1140 tataatacaa aaacattatt tattgagctt tgataatagt atgaaatatt tggctgatgc 1200 ttgtatgatt tttaaaattt taaatggata tgctccacct ccattatgtg agtttgtgac 1260 acacaagaac aataatggtc aagcaacacg atctgttacc aggggagact gtgctgtgca 1320 gtttagacgt acaacttttg cacgatcagt cttctcagtt agagccagtc atttttggaa 1380 cacacttccg attgacttaa ggggtgttac aaattattct acttttaaat acaagttaaa 1440 gaaatggcta aaagcaaatc aagtttgtaa tcatgtgtaa ttcttttgtc taacatttta 1500 agaaattctt attacatgtt tttatctctt tttgtcatat gtaactttac tgtttttgta 1560 tgtgtgatga tgtgtttttt ttattttagg ttgcctttta aaatctggca gggggcaaca 1620 gatgaanata attagccctt gtggctaact ctggcttatt tacagtttta ctgttkatta 1680 atgagcattg tccctgttaa ataaaataaa ttcaaattca a 1721 // ID Gypsy-117-I_DR repbase; DNA; ZEB; 4913 BP. XX AC . XX DT 25-NOV-2008 (Rel. 13.11, Created) DT 25-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion - a DE consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-117-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4913 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1789-1789 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 141..4880 FT /product="Gypsy-117-I_DR_1p" FT /translation="MASKFELDKFTIAPTEEQLDKCRKDDLFLIADFFDIV FT VPRNALKKEIREALYMELVKQKILPDANVQGLGVANTALSEELEAAEAEFK FT PTDVASMDPINPPSLTDPQLAIRLKELEVELSRQQYQSQLLHVRAVELETK FT RDIRLKELELELKTGQNPCPPSSLTKGSTPVQSPVNASSPLPAVPSVTPVS FT PNTDFDISRQISLVPTFRESEVDTYFTVFERIAATLKWPRNIWPLLLQCKL FT VGKAQEVCSALTLEQSLEYDSVKAAVLRAYELVPEAYRQKFRKHVKNPSQT FT FVEFAREKTTLFEKWCAASKITTFEQLKELILVEEFKNCISEKIVVYLNEQ FT KVSSLSEAAIFADEFVLTHKVAFTSPRSSRAVIDRRGSSRTAVAVSKDDRI FT AESSTGSRECYYCHEKGHLIAGCPVLLRKNQRKAQDASKTVAFASTHTSEL FT VDSIDPSFYPFVFEGVVSFSELDQEQKPVHILRDTGAAQSFILADVLPFSS FT QSSCNSDVLVQGIELGVVRVPLHTVYLRSDIVTGLVKVAVRTQLPLKGISL FT ILGNDLAGNKVSCLPEVTEVPCVPEEDVLTREFPSVFRSCVVTRAQARKLD FT KEIDLSNSFMNSDLPDVENKTEDACLQNLSLESKFDFPFDKQRLIDAQRAD FT DTLKSCFAAVIEISDLPNHPAAYFLDGGVLMRKWSPENMKGDWSSVFQVVI FT PKDYREYVLSVAHDHELSGHLGIRKTYNNLLKHFFWPGMKSTVSHYCRSCH FT ACQVAGKPNQVISPAPLKPIPVMTEPFEKLVVDCVGPLPRTKSGHSYLLTL FT MCSATRFPEAIPLRSLKAPTIVKAIVKFCTTFGLPKFIQSDQGSNFMSRIF FT RKVMKELNIQHCVSSAYHPQSQGVLERFHQTLKTLIRTYCVQREKDWDEDV FT PLLLFAIRNTVQESLGFSPAELVFGHTLRGPLKLLQEQLLSTGQSVTPPKN FT VLDHVSSFRERLHLVWKLAQQSLATSQSRMKSRYDKKSVQRSFKVGDQVLV FT LQPLPGSALQAKFTGPYVIEEKLSETDYVVQTPERRRKTRICHINMLKLYV FT SRPDCKDSVSSPVVTPAVVTSIDVVSPEYSPCLDDLHLGSAYLSGAKLQNS FT ETLESLASKLSHLPSSSQSELVHLIDKYSSIFSDVPTVTSVLMHDIDVGDH FT RPVKQNAYRVNPVKREIMKKETQYLVENGLAEPSCSPWCSPCLLVPKPDGT FT YRFCTDYRKVNQLTKSDSYPLPRMEDCIDRIGSAKFVTKLDLLKGYWQVPL FT TERASEISAFATPDSFLQYKVLAFGLKNAPATFQRLMNQVLANVRNCEAYL FT DDVVCYSDTWGNHLQTLEEVFSRLQAANLTLNLAKCEFCHATVTYLGKEVG FT HGTVRPLGAKVQAIVEFPVPKTKRDLRRFLGMAGYYRNFCKNFSDVVKPLT FT DNLRKDVPFSWNSKCEFAFECLKNLLCNAPVLVAPDFTRPFKLEVDASGTG FT AGAVLLQEDEQGIDHPVCFYSKKFNRHQVNYSTIEKEALALLLALQHFEVY FT VGSSPKPIEVFTDHNPLVFLQQMCNKNQRLMRWALICQSFNLQIKYKKGTD FT NIIADSLSRSHL*" XX SQ Sequence 4913 BP; 1376 A; 966 C; 1058 G; 1513 T; 0 other; tgaaattggg ggctcgtcca ttggttatat gctctattag ctgaatttac ttggctcatt 60 tgtttttgca tttgtttgcc tggggttttt cttttgataa gggaggaaat atagtgattg 120 gcccaactac tttgattggc atggcatcaa aatttgaact tgataaattt acaattgcac 180 caacagagga gcaattagat aaatgtcgca aagacgattt gtttttgatt gcagattttt 240 ttgatatagt cgtacctcga aatgcactga aaaaggagat cagagaggca ttgtacatgg 300 agcttgttaa acaaaagatc ttacctgatg cgaatgtgca gggtttgggt gttgcaaata 360 cagcattatc tgaagagttg gaagctgctg aggcagagtt caaacctacg gatgttgcga 420 gcatggatcc cattaatccg ccttcgctaa cagatccaca gctggcaatc cggttgaaag 480 aactggaggt ggagttaagt cgacagcagt atcagagcca gcttttacat gttcgagcag 540 ttgagcttga gacaaaacga gacatccggc tgaaagagct ggaacttgaa ttaaagactg 600 gacaaaatcc atgtccacct tcttccctga caaagggaag cactcctgtg cagagtccag 660 taaatgcctc ctctcctctt cctgcagtgc catctgtgac accggtaagt cctaacactg 720 attttgatat tagcagacaa atttcattgg ttcctacatt tagagagagt gaagttgaca 780 catatttcac tgtttttgag cgtattgcag ccacgttaaa gtggccaaga aatatttggc 840 ctttattgtt acagtgcaaa ctggttggca aagcacaaga agtatgttct gcactgacac 900 ttgagcaaag tttggagtat gattcggtta aggctgccgt gttaagagct tacgaactcg 960 ttccagaagc ctatcgacaa aagtttagga agcatgtgaa aaaccccagt caaacatttg 1020 ttgagtttgc acgtgagaaa accacactgt ttgaaaagtg gtgtgctgct agcaaaatta 1080 ccacatttga gcagctgaaa gaactcattt tagtggaaga gtttaagaat tgcatttcag 1140 aaaagattgt agtttatttg aatgaacaaa aagtttcatc actatctgag gctgcgattt 1200 ttgctgatga gtttgtgctc acacacaagg ttgcattcac gtccccacgt tcatctcgtg 1260 ctgttataga tcgcaggggt agctctagaa cagcagtagc tgtgtcaaaa gatgatcgca 1320 ttgctgaatc ttcaacaggg tcccgcgagt gttattactg ccatgaaaag gggcatttaa 1380 ttgctggttg tcctgtgtta ctgcgcaaaa atcagagaaa agctcaggat gcatctaaaa 1440 ctgtggcctt tgcgagtact cacacttctg aattagtaga ctctattgat ccctcttttt 1500 atccatttgt gtttgaaggt gtagtgtcat tttctgaatt agatcaggag caaaaaccag 1560 ttcacatcct tcgggataca ggtgctgctc agtcttttat tctggcagat gtgcttccat 1620 tctcttcaca gtccagctgt aattcagatg ttttagttca gggaattgaa cttggtgtgg 1680 ttagggttcc tttgcacact gtttatttac gctctgacat tgtgactggc ttagttaaag 1740 tggctgtacg tactcagtta ccactgaagg gaatttccct tatccttggg aatgatttag 1800 cagggaacaa agtttcttgt cttcctgagg ttactgaggt accttgtgtg ccagaagagg 1860 atgtgttgac tcgagagttt cccagtgtat ttcgttcttg tgttgtgact cgtgcacaag 1920 ctcgtaaact tgataaggaa attgatcttt ctaattcctt catgaattca gatttaccag 1980 atgttgagaa caaaactgaa gatgcttgtt tacagaatct gagtctagaa tctaaatttg 2040 actttccttt tgacaaacag cgcttaatag atgcacaacg tgcagatgat acgttgaagt 2100 catgttttgc tgctgtgatt gagatatctg acttacccaa tcatccagct gcttattttt 2160 tagatggcgg ggtacttatg cgcaagtgga gtccagaaaa catgaagggt gactggagct 2220 ctgtatttca agtagtcatt ccaaaagatt acagagaata tgtgctaagt gtggctcatg 2280 accatgaatt gtctggtcat ttaggtataa ggaagaccta taacaatctt ttgaaacatt 2340 tcttttggcc tggaatgaag tctacggtgt ctcactattg tcgatcttgt cacgcttgtc 2400 aggttgctgg caaaccaaat caggttattt ccccagctcc cttaaagcca attccagtaa 2460 tgactgagcc atttgagaaa cttgtagttg attgtgttgg acctttacca aggactaaat 2520 ctggtcactc ctacttgtta actttgatgt gttctgctac taggtttcct gaagccatcc 2580 cactgcgttc cttaaaggct ccaaccattg tgaaggcaat tgtgaagttc tgtaccacat 2640 ttggcctgcc aaagtttatt cagtcggacc aagggtcaaa tttcatgtca agaatcttcc 2700 gtaaggtcat gaaagagcta aacattcaac attgtgtctc tagtgcgtac catccgcagt 2760 cgcagggtgt tcttgaaagg tttcatcaga ccctcaaaac ccttatcaga acctactgtg 2820 tgcaacgtga gaaagattgg gatgaagatg tcccactttt attgtttgct attcgtaata 2880 cagtacagga atcacttgga ttcagcccag ctgaattagt atttggacat actcttcgcg 2940 gaccactgaa actgttacaa gaacagttac tttctacagg tcagtcagtc actcctccta 3000 agaatgtatt ggaccatgtg agttcatttc gtgaacgttt acatttagtg tggaaactag 3060 cacaacagtc gttagcaacc tctcagtcca gaatgaaaag cagatatgat aagaaatctg 3120 tccaacgctc atttaaggtt ggtgatcaag ttctggtatt gcagccttta cccggctctg 3180 ctctacaagc aaaattcaca ggcccttatg tgattgagga aaaattaagt gaaactgatt 3240 atgtcgtaca aactcctgaa agaagaagaa agactcggat atgccacatc aatatgctta 3300 aattgtatgt ctctcgccct gattgtaaag actctgtcag ttctccggtg gtaactcctg 3360 ctgttgtaac atccatagat gtggtttctc cagagtattc tccttgccta gatgatcttc 3420 atttaggaag tgcttatctt tctggtgcta aattgcaaaa ttctgaaact ttagaatctc 3480 ttgcctcaaa actttcccat ttgccaagtt catctcaaag tgagcttgta cacttaattg 3540 acaagtattc tagtattttc tcagatgtcc caactgtgac tagtgtgttg atgcatgaca 3600 ttgatgttgg tgaccatcgt ccagtaaagc agaatgctta ccgtgtgaat ccagtcaagc 3660 gtgaaataat gaaaaaagaa acccagtatc tagttgagaa tggtttggca gagccaagct 3720 gtagtccctg gtgttctcca tgtttgttgg ttccaaaacc tgatggaact taccgctttt 3780 gcacagatta ccgaaaggtg aatcaattaa ccaaatccga ttcctaccct cttcccagaa 3840 tggaggattg cattgatcgc attggaagtg ctaaatttgt gaccaagtta gacttgctta 3900 aaggatactg gcaggtgccg ctcacggagc gtgcttctga aatctcagcc tttgccaccc 3960 cagactcctt tttacaatac aaagttctag cttttggcct taaaaacgct ccagctactt 4020 ttcaacgctt gatgaatcaa gtattggcta atgtaagaaa ttgtgaagca tatctagatg 4080 atgtggtatg ctactcagat acatggggta atcacctaca aacactggag gaagtgtttt 4140 caagacttca agcagctaat cttactctta atttagctaa atgtgagttc tgtcatgcca 4200 ctgtaacata cttgggaaaa gaagttggtc atggtactgt tcgccctcta ggggctaaag 4260 ttcaggcaat tgttgagttt cctgtgccca aaactaaaag agacctcagg cggttccttg 4320 gaatggctgg ttattacaga aatttctgta aaaacttttc tgatgttgta aaacctctaa 4380 ctgataacct tcgcaaagat gtacccttca gctggaactc taagtgtgaa tttgcctttg 4440 aatgtcttaa aaatttgttg tgtaatgctc ctgttttagt tgccccagac ttcacccgac 4500 cttttaaact tgaagtggat gccagtggta cgggggctgg tgctgtcctg ctacaagaag 4560 atgaacaagg aatcgaccat cctgtgtgtt tctattctaa aaaattcaat cgtcatcagg 4620 ttaattacag taccattgaa aaagaagcac tagctctttt gttagcattg caacattttg 4680 aagtatatgt cggatccagt ccaaaaccca ttgaagtatt tactgaccac aacccacttg 4740 tgtttcttca gcaaatgtgt aacaaaaatc aaagactgat gcggtgggct ctcatttgtc 4800 aaagttttaa tttacaaatt aagtacaaga aaggtactga caatataatt gcagattcat 4860 tgtctagatc acatttgtaa acttgtaacc atgtttacac ttatgtgtgg ggg 4913 // ID BEL17-LTR_DR repbase; DNA; ZEB; 444 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL17-I_DR; KW BEL17-LTR_DR; BEL17_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-444 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1382-1382 (2008). XX DR Genome; chr13; Positions 52480739 52480296. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 444 BP; 128 A; 99 C; 94 G; 123 T; 0 other; tgtaaaaaag gctgcagctt ataagtgcaa ttctaacctc tgtcttttgt cctttatgat 60 gaggcacccc gcatacacca gaacacgtta cctgagccaa tcagcgtcca ggatctgtga 120 ctacgcaatc ttccccaccc aaacagattt ttagtttcta gtgtagcgag acacatagac 180 ggatgctaaa gtttgcgagg aatagatcac ggactgtttt catcattgtt tgtcagcggt 240 tgccacaagt gttgtgcgtg aatgcagaaa cacaacacaa gtgtcataag tcaagtctgt 300 tgagtggtga caataaatgg gttgttcaca caaacgttca tcttcgcctc atctctaatc 360 ggatagtctc tggtgggtca gcaatcagaa ctggcaatta aaaagtctat ggtgcagagc 420 tactaaaagc caaccatttt ttca 444 // ID DNA8-8_DR repbase; DNA; ZEB; 1207 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE DNA8-8_DR is a non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-8_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1207 RA Kapitonov V.V. and Jurka J.; RT "DNA8-8_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 258-258 (2004). XX DR [1] (Consensus) XX CC DNA8-8_DR is a nonautonomous DNA transposon. Its is CC characterized by 15-bp terminal inverted repeats and 8-bp CC target site duplications. It is expected to be a member of the CC hAT or P superfamilies. XX SQ Sequence 1207 BP; 336 A; 243 C; 244 G; 381 T; 3 other; cagggctatt caattagttt gtcatggggg ccggttcatg aaaagcatcc caaacgaagg 60 gccggagaga tatgacttgc tatatgagtg atgacacaac tgcatataag agcccatatg 120 ctgtattttt gctccttaag acacccacgt tggctttttc tacattaaaa tgttaatata 180 ttgtattttg aaactacata atatcagtgc attgtacaat agactttttw ttattttttt 240 tacaaacatt caaatgttgt atttcagagc acaacaaaag cagtgcattg ctgaagtagg 300 ctttttctac attcagacac attcatatgt tatgttttga actgcataac aattagggat 360 gcaacaatta tagattttgg ttgtacgatt atatagtctg aagaataatc atggtttcac 420 ggttatcacg tctaatgtaa attcaagctt tatattaggt aagtagctat ttagattaac 480 tgttgttgcc atctgcrttc atgcatacat aatcatttta ataataattt gtctaacata 540 tcttttgttt acattgcact gaaaggcacg cacaactctc accgtgagat gttttctact 600 gcgtgcgccc ggaaagacgc aagcgcgttg ctccgcttgt gcgcgcatat tgtcatatat 660 tcttacatta gaaataacaa acttgcaagc ggaaaagacg tgatatggga acggccgctg 720 ctatagttaa tccggtgtgc gtgcgtatta gcctcggtgt acatacttgg aactttaaac 780 tagcgcacag ttggcaactt ggcataactt tgtttaattg cagcagtttc attccgtgca 840 acagaaatgc ggcttagaga acctttacga ttaattaact gtgacgaatt aaagcgcagt 900 taatagtgaa atcggctaat cgttgcatcc ctattaacga tatcagtgca ttgtacaaaa 960 ggccttttct acaaacacat ccaaattgtt ttcggctgcc cgcgccactc gcttatggtg 1020 actcacgctc tgcgcagcct ttagctcrtg ctgtattgaa cagacgcacg tcacttcata 1080 actatagcgg ccgtttttcg actgaactaa atttattttt gatgtcttgg tcacactatt 1140 tggagggccg gaacaaattg cctcgggggc cgggttcggc ccgcgggccg ccaattgaat 1200 agccctg 1207 // ID BEL29-I_DR repbase; DNA; ZEB; 6439 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL29-I_DR; KW BEL29-LTR_DR; BEL29_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6439 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1401-1401 (2008). XX DR Genome; chr20; Positions 49698005 49691567. XX CC Positions [5401-5991] - Integrase core CC 'GTTGG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(226..2556,2560..6342) FT /product="BEL29-I_DR_1p" FT /translation="MDEKEESGCEMAADVDGEKVVSENVANDDGVANEDVS FT SKRVRKLTAKGVTLLFENLQKSRKQNLTQVNKIKQNVNDIISLKTTSETVL FT NVEKQMKKFVKYCEKAEDLHNAIVEMPIPQDEFDKQTKWFNAKMAENLCFK FT KEITEWLTDAGKSWSVDKACDASSAVSSDIGPDDSISNVSTNQSQRSKVSK FT GSASTSSMTSSARIKVMAEKAALLQHAAAMEKRHLLEEEEERLHIQSEKLK FT RQKEQLMLNAKIAATDAKLSVLKTSEVGSNKKAPSDDSKSKKSHSLKSDLQ FT VHLPGNSTQVAMHCTTTQQAASIQDGTAGVQSNPVSYSQLQDHSSQSHAHL FT QTDTGQSILYNIMQQQANITAQLVHRGDVTPLPPRIIPIFDGDPLQFAAFI FT QAFENGIERKTTNAQDCLFYLEQYTRGQPRELVRSCHHMSAEQGFCKAKNL FT LKEHFGNELKITAAYMERALNWPLIKPDDISGLQAYALFLRGYCNVMVNLQ FT YMDEINVSSNLKNVMMKLPYRLREKWRTKAYELQERNGCRLKFTDLVDFIE FT HQVKILTDPLFGDIKDTQAPISHKAKVKSVPFSGRGQNAATVAVVNTVEGF FT QSKVIPSQDSFSSCYACKKPHVLEKCPEIQKMTHRQKIDFLKTHGICFACL FT KFGHISKRCTNGQICSMCSQKHPTILHISAKDKPIEEKQVSNALVTLQTYA FT HIGAGEQDCSLSIVPVQVKSKLGDFTVTTYAFLDPGSTATFCTEGLMRKLK FT MIGTKKRILLRTMGHENVVNTSVLTGEISGLSDGKFMDLPEVLTQKTIPVS FT KDNVLDPEDLRKWRYLDDVKIPKLDAEVELLIGTNAPKLLEPWEIINSEGQ FT GPYAVKTLLGWVINGFVKEANDRRKIGFQSVTANRISVTKLEELLVAQYNH FT DFGEKSSEDDFKMSREDQRFMEIMEQSISIENGHYCIDLPFRKNDVIMPNN FT RCLAEQRLKGLKRKFSCNQEFQKEYTAFFADVIDKGYAEAVPQDQLLRDDG FT KVWYIPHHGVYHPKKKTIRVVFDCSAVFKGVSLNSQLLQGPNLTNTLLGVL FT TRFRQEHVTLMADIQAMFHQVKVSQKNVDFLRFLWWPNGDITKSFKEYRMK FT VHLFGAISSPTCSNFALRKLAENYKDCFPAKALNSILHNFYVDDCLLSVPT FT EREALQMVKDLTAVCSKGGFQLCKWISNSRKVLANIPEECLAKSTKDLNLD FT KDNLPVERALGLYWCIESDAFMFKIAVSERPHTRRGVLCIVSSIYDPLGFL FT APLILPSKLLLQELCKRNIGWDSEMPQTLSRQWSEWLQDLNKITMVKVDRC FT IKSKDFGMIRTAQLHHFSDASQCGYGTVSYLRLEDETKKVRLSFMLGKSRV FT APLKQITIPRLELAAAVLAVRIDLMLRKELQLDLDKSVFWTDSQTVLKYIA FT NDAKRFHTFVANRVSVIRDATDVEQWRHIGTRLNPADDASRGMSIENFLKS FT SRWIQGPDFLLEASEEWPLSSFDQTLMNDPEVKRDTVVNAAISNCPDSATH FT QLLNYFSDWTKLKIAVAWILKFKDVLLKLKQQRKCIQSSFNSGTNNSTVEQ FT NKVDEEMKRFRTGFFGQNLTPDDLSRAESAVIGFSQREAFKKEISSLQSGT FT SGVKRSSNLYKLDPVLIDGLLRVGGRLSRAAIPEETKRPVILPKEHNVSKL FT IMQHVHRQLGHAGRNHMLSTLRKRYWIINANSACRKVISECVVCRRFQGKV FT GEQKMADLPKERITSDLPPFSNTGVDYFGPVDVKRSRSVVKRYGVLFTCMA FT SRAVHLEVAYSLDTDSCINCIRRFMCRRGQVLQLRSDNGTNFIGAEKELRN FT ALKSLDHDKIQRTFLSEGIKWIFNTPAGSHHGGVWERLIRLVKKVLHSTLR FT QQCLDDEGFHTILCEAEAILNDRPIIKLSDDPNDLEALTPNHILLLRSKPC FT LPPGLFQKDDLYIRRRWRQVQYLSDLFWKRWIKEYLPLLQERQKWRKPHRN FT FNVGDIVVIMDPTAPRGSWSLGKITQTYPDKKGFVRSVQLKTKTGLLERPV FT SKICLLLEKTMD" XX SQ Sequence 6439 BP; 2082 A; 1136 C; 1444 G; 1777 T; 0 other; atagtaaaaa ctcagttccc gttggattga tgagcggctt gaatggaacg aagattaaag 60 gaaccagctt gctgtggttg agtttgcaac gctgacacgg acacacaata aggaagaacg 120 gtgggtgctg tttatgtatg tttaaaggct tcatgtaatg gaatgtgaag ttttggatca 180 aacggagaca tgctgtgctg atagtttgtg ctggttgtgt tgttcatgga tgaaaaggaa 240 gaatcaggtt gtgaaatggc tgctgatgtt gatggtgaaa aggttgtttc ggaaaacgtt 300 gcaaatgatg atggtgttgc taatgaagat gtgtcgtcta agagagtgag aaaattgact 360 gcaaaggggg tgactttgct ctttgaaaat ttgcaaaagt ctagaaaaca gaacttgact 420 caagtaaaca aaatcaaaca aaatgtgaat gatataattt ctcttaaaac cacaagtgaa 480 acagtactca atgtggaaaa gcaaatgaag aagtttgtca agtattgtga aaaggctgaa 540 gatttgcata atgcaattgt ggaaatgcca ataccccagg atgagtttga taagcaaaca 600 aaatggttca atgctaaaat ggcagaaaat ctgtgtttta agaaggaaat tactgaatgg 660 ttgactgatg ctggaaagtc atggagtgtg gataaagcct gcgatgcttc atctgctgta 720 agcagtgaca taggacctga tgacagtatt tcaaatgtgt ctacaaacca atcccaacgt 780 agtaaagtat caaaaggttc agcaagtact tcaagtatga cttcttctgc acgaataaaa 840 gtaatggcag aaaaggctgc attactacaa catgctgctg ctatggagaa gagacatctt 900 ttagaggagg aagaagaaag acttcacatt caaagtgaaa aactcaaaag acaaaaggaa 960 caactcatgc tgaatgcaaa aatagcagcc actgatgcaa agttgtctgt tttaaaaacc 1020 agtgaagttg gaagcaataa aaaagcacca tctgatgact caaagagtaa aaaatctcac 1080 tcattgaaat ctgatttaca agtacatcta cctggtaatt ctactcaagt tgcaatgcac 1140 tgtacaacca cacaacaagc tgcaagcatt caggatggca cagctggtgt tcaaagtaat 1200 ccagtctctt attctcaatt acaagatcat tcttcacaat ctcatgccca tctacaaact 1260 gatacaggac aaagtattct gtataacatc atgcagcaac aggcaaacat aacagctcag 1320 cttgtgcaca ggggggatgt gactccatta cctcctcgaa taattccaat ctttgatgga 1380 gatcccttac agtttgctgc cttcattcaa gcttttgaga atgggatcga aagaaagacc 1440 acaaatgcac aggattgctt gttttattta gaacagtaca ctagaggaca accgagggag 1500 ttggtcagaa gttgccatca tatgtcagca gaacaaggat tctgtaaagc caagaatttg 1560 ttgaaggagc actttgggaa tgaattaaaa ataactgcag cctatatgga aagggcattg 1620 aactggcctt taataaaacc tgatgacatt agtggtttgc aagcatatgc tttgtttctg 1680 agagggtact gcaatgtaat ggtaaacttg cagtatatgg atgaaattaa tgtgtcatcg 1740 aaccttaaaa atgttatgat gaaattgcct tatcgcctta gggagaaatg gagaaccaaa 1800 gcttatgaac tgcaagaacg gaatggttgt cggcttaagt ttacagattt ggtggacttc 1860 atcgaacatc aggtaaaaat tctcactgat cctctctttg gagatattaa agacacacaa 1920 gcacctatat cccacaaagc caaagtcaag tctgtacctt tcagtggcag ggggcaaaat 1980 gcagctacag ttgctgtagt caacactgtt gaagggtttc agtctaaagt catcccttca 2040 caggacagtt tttcatcctg ttatgcatgt aaaaaacctc acgttttgga gaagtgccct 2100 gagattcaaa aaatgactca cagacaaaag attgatttct tgaaaactca tggaatttgc 2160 tttgcctgtt taaaatttgg acacattagc aaacggtgta ccaatggcca gatttgtagc 2220 atgtgtagtc agaagcatcc tactatactg catatttcag ctaaggataa acctattgag 2280 gagaaacaag tgagtaatgc tctggttaca ttgcagacct atgctcatat tggggccgga 2340 gaacaggatt gttctttgtc cattgtgcct gtgcaggtga aatccaagct tggagacttt 2400 acagtaacaa catatgcttt cttggatcca gggagcactg ccactttttg taccgaaggg 2460 cttatgcgga agttaaagat gattggaaca aagaagcgga tattgcttcg aacaatgggg 2520 cacgaaaatg tggtaaatac atctgtgctc actggatagg agatctctgg attatctgac 2580 ggtaaattca tggatttgcc agaagtcttg actcagaaga caatacctgt aagtaaagat 2640 aatgtgcttg atccagaaga tttaaggaaa tggaggtatt tggatgatgt caaaattcca 2700 aaactagatg ctgaagttga attgctaatt ggcacaaatg ctccaaagct cttagaacca 2760 tgggagatca tcaatagtga aggtcaagga ccctatgccg ttaagaccct attggggtgg 2820 gtcataaacg gttttgtaaa ggaagcaaat gacagacgga agattggctt tcagtcagtc 2880 actgctaata ggatttctgt taccaagctg gaagagctat tggttgctca gtataatcat 2940 gattttgggg agaaatcttc agaggatgat ttcaaaatgt ctagagaaga tcaaaggttt 3000 atggagataa tggaacagtc aattagcatt gaaaatggtc actattgtat tgacctaccc 3060 ttcagaaaga atgatgtcat catgcctaat aataggtgcc ttgcagaaca acggttaaag 3120 ggcttaaaaa ggaagttcag ttgcaaccag gaatttcaaa aggagtacac cgcctttttt 3180 gctgatgtta tcgataaggg gtatgctgaa gcagtaccac aagatcagtt actacgagat 3240 gatggtaagg tgtggtatat cccacatcat ggggtctacc atcctaaaaa gaagaccatt 3300 agagttgtgt ttgattgcag tgcagttttt aaaggagtgt ccttgaattc acaattgcta 3360 caaggcccaa atctcaccaa cacattgttg ggtgttttga ccaggtttag acaagaacat 3420 gtgactttga tggcagacat tcaggcaatg tttcatcaag taaaggtgtc acagaagaat 3480 gtggattttc ttcgtttcct atggtggccc aacggtgata ttacgaaaag cttcaaagaa 3540 tacagaatga aagtccatct gtttggagct atttcatcac caacctgcag caattttgct 3600 ttgagaaagc ttgcagaaaa ttacaaggat tgctttccag ccaaagcgtt gaattctatc 3660 ctgcataact tttatgtaga tgattgctta ttatctgttc ccacagaaag agaggcactt 3720 caaatggtca aagacttaac tgcagtctgt tccaaaggag gattccagtt gtgcaaatgg 3780 ataagcaaca gccgtaaggt gttagcaaat atcccagaag agtgtctggc aaagtcaacg 3840 aaagatctca acctggataa agacaatcta cctgtagaaa gagcgctcgg gttgtattgg 3900 tgtatagaat ctgatgcatt catgttcaag attgctgtta gtgaaagacc acacacaaga 3960 cgaggcgttt tgtgtattgt gagctctatt tatgatccct tgggatttct tgcacctcta 4020 atactccctt ccaagttgct attacaggaa ttgtgcaaaa gaaacatcgg atgggatagc 4080 gagatgccac aaaccctttc acgccaatgg tctgagtggc tgcaagatct caacaagata 4140 accatggtga aagttgacag atgtatcaag tcaaaggatt ttggaatgat caggacagct 4200 cagttgcatc acttttcaga tgcaagtcaa tgtggttacg gcactgtttc ctacttgcgt 4260 ctggaagatg aaactaaaaa ggtgagactt tccttcatgc ttggaaaatc tagagttgct 4320 cctttgaaac aaataactat accaaggctt gaacttgcag ctgctgtttt agcagtcaga 4380 attgacctga tgcttaggaa ggaactccaa ttagacctag ataagtctgt gttttggaca 4440 gacagccaaa ctgtactgaa gtacattgcc aatgatgcca aaaggtttca tacatttgtt 4500 gcaaatagag tatcggtcat cagggatgca acagatgttg aacagtggag acatattggt 4560 acaagattga atcctgctga tgatgcctct agaggcatgt ctattgaaaa cttcttgaag 4620 tccagcagat ggatacaggg tccagatttt ctgttagaag cttcggaaga atggccactg 4680 tcctcttttg accaaacttt gatgaatgat cccgaggtaa aaagggatac agtggtaaat 4740 gctgctattt caaactgtcc tgacagtgct actcatcaac tcttgaacta tttttctgat 4800 tggacaaaac tgaaaatagc agtggcatgg attttaaagt ttaaagatgt tcttttgaag 4860 ctcaaacaac aaagaaaatg cattcagtcc tcattcaatt ctggaacgaa caattctaca 4920 gtagaacaga acaaagtgga tgaggaaatg aaaaggtttc gaactggatt ctttggtcaa 4980 aatcttacac cagatgatct ttctcgagca gaatcagcag ttattggatt ttctcaacgt 5040 gaagcattca aaaaagaaat ttcttccttg caaagtggaa catctggagt taaaaggagt 5100 agtaaccttt acaagcttga tccagtattg atcgatggtt tacttcgagt gggtggaaga 5160 cttagtagag cagcaatacc agaggagaca aaaagaccag ttattctacc gaaagaacat 5220 aatgtatcca aacttatcat gcaacatgtt catagacaac taggtcatgc agggagaaac 5280 cacatgttgt caactttacg aaagaggtac tggattatta atgcaaactc tgcttgtagg 5340 aaagtaatct ctgagtgtgt tgtgtgtcga cgatttcaag gcaaagttgg ggaacaaaaa 5400 atggctgact tgccaaagga aaggattaca tcggatcttc ctcccttttc caacactgga 5460 gtggattact ttggacctgt tgatgtgaag agaagtcgca gtgtcgttaa aagatatgga 5520 gttttgttta cgtgtatggc cagtcgagca gtacaccttg aagtagccta ttccctggat 5580 acagattctt gcataaattg cattcgtcgt tttatgtgtc gtcgtggcca agtattgcag 5640 ctaagatctg ataatggcac taattttatt ggagcagaga aagagcttag aaatgctttg 5700 aagagtttgg atcatgataa aattcagcgt acctttctat cagaaggaat aaaatggatc 5760 tttaatacac ctgcaggatc acatcatgga ggcgtatggg agaggctcat tcgtcttgta 5820 aagaaggtgt tgcactctac acttcgacaa caatgtttag atgatgaggg ctttcacact 5880 attctttgtg aagccgaagc tattcttaat gaccgtccca tcataaagct ttctgatgat 5940 ccaaatgact tggaagcact tacgccgaat cacatcttgc tgttgagatc aaagccatgt 6000 cttcctccag gacttttcca aaaagacgat ttgtatatca gacgtagatg gcgacaagtc 6060 caatacttgt ctgatttgtt ttggaaaagg tggataaagg aataccttcc actattacag 6120 gaaagacaaa aatggaggaa gccacataga aacttcaatg ttggtgacat agtggttatc 6180 atggatccaa ctgctccacg tggttcctgg tcactaggca agatcactca gacatatcct 6240 gataaaaagg gatttgtccg ctctgtacaa ctgaagacaa aaacaggatt gctggaaaga 6300 ccagtttcca aaatctgcct gttattagaa aaaacaatgg actgatccat aaagaagtag 6360 tacacataca ccttttttgg ctcctttgat aatttaagag attaattgta gttgcacacc 6420 aacaattatg gggccggcg 6439 // ID Gypsy66-LTR_DR repbase; DNA; ZEB; 313 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy66-I_DR; KW Gypsy66-LTR_DR; Gypsy66_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-313 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1448-1448 (2008). XX DR Genome; chr14; Positions 14659027 14659339. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 313 BP; 94 A; 54 C; 63 G; 102 T; 0 other; tgtaaagggt caataaatat gtaatgtgtg catatattta aatgatgaaa acaggacccc 60 cctacaaaaa tttagctgat taaatggatt tattttgaaa acattaacta taatgtattg 120 ttgggctaat cccaaagtta ttttttgtat atcctgtaga gggcgctaca tgcataaaag 180 acgttgaaca gagcagctcg ggtcagagtt cattaaagac cgtcagagct cgtattgatt 240 tgtgtcttta cttctctgca gtttacacat ttcacttacc cgtttgggtc tcttcagcct 300 gaggcgggta aca 313 // ID ERV3_DR-I repbase; DNA; ZEB; 9338 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE Internal portion of ERV1 Endogenous Retrovirus from Danio rerio. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW ERV3_DR-I. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9338 RA Smit A.F.; RT "ERV3_DR-I - ERV1 Endogenous Retrovirus from Danio rerio."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Not a consensus. Internal sequence at danRer4 CC chr6:39596928-39606262 with 99% identical LTRs (ERV3_DR; a few CC dozen copies in genome though. ORFs: gag 895-2664 (1 stop codon), CC pol 2812-6240, env?? 6278-8527. XX SQ Sequence 9338 BP; 2632 A; 1876 C; 2188 G; 2642 T; 0 other; gttttaattg gtggaggatg cgggcaaagc tacggggaca tcctaaagat caatcaacaa 60 aaacaaaaga aaggtaggaa gatttaaaat tataagacta ggtattgtct gtgaagtagt 120 gggtagtaac ggaacgttgt tggttctcat tcagacgtga atgaatgttg ttggttctca 180 tttagacgta aatgaaagtg tggatttctc attcagccgt gaatgaaagt gtggatttct 240 cattcagacg tgaatgaaag gtaccggcgg attcacagaa atttgtggta gatttgtgtg 300 gaatccctta gtgagctggt gcatttttgg cggtacacct ataagtgata agtgataaga 360 ataacagtag tggagaggtg tacctcctct tttacccaaa gttaaaatac tccgttgagg 420 ggtgtgttcc aagacagtct cggggataaa aattggactg ttcttctttt tattcagtaa 480 tggactaaaa taaattaacc cgagagtgta ataaagttcc accactgcta tctaactaca 540 tcacagacaa aaaggcatgc cataagaata gtgtaggtca gattattagg gaaagaacta 600 gggactaaat aaataaacaa attcttaagt aacaacataa agtggtatcc attagactgc 660 attaaccaag aaacaaggat acctagttat taaagggaaa ggttgattga ttttgtggat 720 ggaccgagag aacgccctac atatactgag ctaagaaaaa aaaaaaaatt taattatata 780 tatatatgta tgtatttgta catgtatatg tggatgtatt aagttaaagt atagctcctg 840 gtgtgttcag agatactttg cagttctaga ggtggaaggc agattatttg tttaatgata 900 cttaccgcaa ctaccatctc tattataccg cttctggcat ttctcggagg cttcagattg 960 ttttccagta atttctttaa attactaaca aaaagaatgg ctgacctttt ctctctgatg 1020 acccaatcca gcaaaggcat taaaatggat gagagctacc tctgtaaaca gatatgcctc 1080 gaagggttaa aatccaaaaa gtattctgag attgttaaac gaatggtcag cctgggctat 1140 gagtctaagt ggaaattgac tgagcaaaca gaatggttac agaacaccaa agaggcagac 1200 aaatataagt ttcttgcagt gtgtgcatgc tcatggatca gtcagcaggg aggaattttc 1260 gttgacaatg acggcgatat tcccgatatg agcagtgctt ggaaaaattt aatgagtaat 1320 atagacaagg tgtctgaaat ctcaggaatt gatgtgggta ctgatctgtc tggtcttgga 1380 aaggctctct ctgctattgc agagcaagct taagaatgtg gggaatgtac aaacaatttc 1440 aatgagacta aaccttctgc tcccgagtgt ttttcttcta accggcaggt ggcggtggtg 1500 tccagcccaa tccaaaaaca agagactcag tctgattcgc cttacgcaca ggtcagagag 1560 ctactaaacg tccttacgag tggagcaccg gccccgcctc cctacgcaaa ctcgggtgag 1620 tttccctctt tgcaagaacg tgtagacaga tacggtgtgg taaaaacctt tcttcaccct 1680 cattctaagc gcagtataac cttctccaaa aaaggagctt ctgctgcacc gatcaggatt 1740 gccactgatg tacacagaaa tacagaaatg ggggcgggtt atgaaattgg ggaatgggtg 1800 cggcttacac cgggtgagaa aagtcacatt cttgcaggga tggaacctct gaaattatct 1860 tacccaaaca aaacgacatg ggataaaatt gaggctgcgg cccgacaaca aaatttggga 1920 attgctgaca ttcaatcaat tgtagagagt ttggttccgg tatccaaatt accaaagatt 1980 caaacgatcc agtacgaccc gagagtacct gatgagtggg tggggtattg ccatgcctat 2040 aaagagtata aggaccaggt tcaggaagta ttggggaggg gagcttttcc ctggactagt 2100 gttacccaga tcaagcaaag accaggtgaa agtcctgttg aatatcttga tcgcttccgc 2160 acagcctttg agactcattg tgcagtaaat aactgtgaag cagaatttga caaacctttt 2220 gtgcttgaat ctgctgctaa tgggctgaca aaacagtata gagatttagc gctcactggc 2280 tgtgctgaat ttttaacttg ggatgatctg gctcgctgga gtacaatgtg ttggggcaga 2340 ctccaggagt cagagggcca tacgggcctt acccctccca ctgtgaatgt tggtgctgtt 2400 gcagctgctg acgacactaa aagattgtca ttagccgaaa ttgagtgtta tcaatgcgga 2460 aatctcggac atttgggtcg aacttgtcgc aatggcacta aaagttgtaa caaatgtgga 2520 aaaccgggtc atgcctcaaa attttgcaaa gcagacctta acccctctca gaggagaaag 2580 ggtaaaaatg aaaaacctca aaacaaacct atcctctctg agagtgccat tgccaagctg 2640 ttaaaattgg tgggtgagga ctaggggtca tcggcctcca gttcttgtga tgctgctgac 2700 cccaggccat ttgttcattc attgttagga ggccggcaat gtttagcttt ggttgacacg 2760 ggagcccaga ttagcatcac taatattcct ctcaaaacaa cttcagaata catgtacatt 2820 gatggactaa atagctctga actgttctat aaatctgtac ctgttcctct ccttcttgct 2880 gattctcaga atgtgttttg gacagaattt tgggttggga aaaacactat tgaaactgtt 2940 ataggtgttg acattctgaa taatctcgaa gctgatgttt tactttcaac agataaactc 3000 gtgctcggct gcaatgaagt aattcgctta tctcatgccg gccctcacca agagacagga 3060 tgcgctgtgg cttcagctgc tagtgagagt gagactagac ctgagtggaa atttattatt 3120 tccaggttct ctgatgtctg ggctaaagat aaatttgatt gtgggaatgc tcaaattcag 3180 ccacttagca tcccaggtcc cctgcatgaa gccaaacatc aatatccttt gaaaaatgac 3240 gctcgggagg gggctgaaca ggtagtgagg gaactacaga aacgggggat aattgtacct 3300 tgttcttctc ccactaactc ccccatgtgg ccagttaaaa aacctgatgg tagttggcgt 3360 ttgacaattg actatacagc tttaaatgca gtaactacta aattgcaccc tctggtggcc 3420 aacccagcca ctatccttaa tgaacttggc tctgagaatt gctatttcac tgctttggac 3480 atttctaatg gcttttggac ttgccctcta gctccagatg ttcaactgaa atttgctttt 3540 tcatgtgctg gacaacaatg gacatggagt cgacttccgc aaggattctg caattcaccg 3600 acactgtttc accaggtact ggcatctttt attgaccctc ttcgcaaaga aattgaaact 3660 gaaggttccg tcatcctcca atatgtggat gatattttga ttgccagccc taatcagaat 3720 aaacatttga ctgctgtaca gcttgtgttg aaagctctca gtgatggggg tttcaagatt 3780 aatttacaga aagctcagct ggcccaaact gaggtcacat acctgggaca gattgtcggg 3840 gttcacggac ggcgcatcac ccctgaacgt gtcaaagcca tcatcgagct gcccaagccc 3900 aacaccgtca ctggtctaag acaggtaatg ggattgctaa attactgtcg ccagtatgtg 3960 tctgaatata cagaactctc taagcccttg acagaagccc taaaaggagg aaagcctgga 4020 aaggagtgca tcgactggac tgaggagatt gaggaggcgt ttgtgagtat caaagagact 4080 ctttcttctt gccctgctct tagacatgtt gatgccagga aacccttcca cctgtggaca 4140 tgggtgggca ccaagtctta ctctgcagct ctggggcaac gtgtgccagg acggaagtca 4200 tatggcatga ttggatatta ctctacgccc atccccatct ccatggcagg ccaacatcct 4260 tgcttgctga tctgtgactg tgctgagtgg gctgtcaaaa caacagagga catcgtctgc 4320 tatcaaaagg tgatattgca caccaaacac cagatcctga aactgctcac aaacacccgt 4380 ttaggatcaa tttccaacca acgacgggct aaatgggaag ccacactctt gagcaagaat 4440 gttgacattc atactgatac tgacgctaca attaccccag cttccttgct gcctgatgaa 4500 gggaatgccc acaactgtgc acaagtgttg gaagaggaca ttcggagccc tttggacaca 4560 aagcctttgc ctgatgccat gaaattgttc gttgatggct ccagttttca tgaacagggt 4620 aaccgttata ctggctgggc agttgtgaat gaacattgtg aaactgtgga ttgtggctct 4680 gtttcaggtg ggggagctca ggtggcagaa ctgatagcgc tcacatgggc tctgcagctg 4740 gccaaaggaa aacgagtcaa tgtgtacaca gacagccgct acgcctatgg tgttgtacat 4800 gactttggac ccgtctgggg acgtaggggg tttctaacta ctgctggtct ccccatttct 4860 catcaaaaac aggtcatgga actgatggag gcatgcacat tgccggctgc aggagctgtc 4920 atcaaggtca ctggacactc aagtgaaatg tcactggagg ctaagggaaa tcatgctgca 4980 gatgctgctg ccagaggggc tgcaaaacaa accaacaaac ttgtgagtgt tatggcactt 5040 tctccccagg aggaggaggt gcacagcaga gacatttttg agctttacgc tgcagatgac 5100 cccgaagaac ttgaacagtg ggttaaatta ggggcacaca aagcttcaga tgggttatgg 5160 aagtttcatg ataaatttgt ctgtctatct tctgctcgta ctgaactaat gtctctgtat 5220 catggtttgg ctcatgcagg tcctgagaaa ttacatgcaa tgatctccaa aacttggtgg 5280 tggccgaagc agagagctgc ttgcgttgat ttttgtaaga ggtgtctagt atgtctcaag 5340 gtgaactctc cctctaagtt aaagattccc ttgggacatg ctcctcgtcc acagggaccc 5400 tggacacatc tccaaattga tttcattggt cctctgcccc ctagtggagg cttttcgtac 5460 attctcatga ttgtcgattt gttttctaga tgggtagaag cctttccact tagaacttgc 5520 acagcagatg caacagccaa aacattgttg actgaagttt tccccagatg gggcataccg 5580 ctacaaattg actctgacca gggtacacac ttcacaggga aggtgatgaa acatgttatg 5640 aagttaatgg gaataactca acatttttat atctcgttta gaccgcaatc tagtgggtca 5700 gtcgaacgta ccaaccgcac tttaaaagcc gcgttgagaa aaagacttct ggagtgggga 5760 aaagggtggc acgctgctgt gccaatgatt ttgtttagca tgagagcaag cccaaacaag 5820 accacacaga taagtccttt tgaaattatg actggtagat tcttgcgtct gccctgggat 5880 gcgccactgc aacatgaggg cccttctggt cctttgaaag atgctctgca aaattatctt 5940 aaggctttgg attgttgtct aaaagacaca cgagtaagtg taaacattag acaagctgag 6000 aaagataaca aagagcaact gactgtgcct gttcttcctg aaattggaga cagtgtaatg 6060 ctgcagaggg agatagtttc tcccttgggt ccaaagtttg atggcccata tcaagttctg 6120 ttaactacta atacatctgt tttgcttgac agaggaattc tgggtactgt atggaaacat 6180 tggtcccagg tgaagcctat cgagaaatgc cacatccaag aaacggcccc ccaaacctaa 6240 tcaatcatga ggtttgggtg ttaatagtta atgataaacg cgtaaatctt cactaaatct 6300 gagtttttct tttgtgaaat tgttacagat ggctggacca gatgatgtgc tgaaactgct 6360 tattctgaaa gaagagattc gcagagatcg acgtctcaag atgagacgag ccttcatccg 6420 attggtcatg ccaatttctt ttgtagtgct gattttaatc gccttatcat ttctattgtg 6480 gattcaggtt tctctctgga ctggacgctt tcacgcgttc tctactaatt gggattgcga 6540 agtggccgac tgccgtcctt cttgggttcg atacccttgc ctaaattcta ctgagggtct 6600 caatttatta cctactatgt atcaaaaagt taagaatcga tgttatcgtc ttgacggcaa 6660 cgagacagtg gcctattggc tcggcaactc cccaaattat ccagagaaca ccctcattat 6720 gcaaaaaggt cggtggtccc aggatggtga attggatgtg tttgaggaaa attatagttc 6780 tctcagttgg attccaaaga ctttagctca gatgaattac agtgattgcc caagtctggt 6840 taacaagaat cgtagatatg tggtacagca aaagtcactg ggatcaacca caaagcgaaa 6900 taagagagag cagcctaact ccaattacca ccacataatt cttgaaggta aagcagtcaa 6960 attacctcta tacccaagtg cccctgatcc gaggcatcta atgtcttact ttctgcctaa 7020 agcaaagaat caactcagct ccaaccgagt tgttaaatct gtggtcagaa gttatgcaga 7080 cagtaacatt tttgtcaggg atgatcctag ctccccttgg tatacggatg tggtgtcaga 7140 caggcgatgg aatgtgttaa aaaccaatca aggtaaaagt ggtgcgatgg gactaggtag 7200 accatatggt gcagtttgtg acaaatatgg taatatagac agaacagaca aggacaattt 7260 tgacttgaat ggaagtactg atgaattgcc taatgttgat gctctttctc ttgcatggcc 7320 caggtatact agatttgatg acccctggag cgagacgatt catgtagaga ggtgtatggg 7380 taaggaattt cagcagtggt acatcaaaac atttccaaga gacaaacctg aaatttgtaa 7440 gacagcaatt agcaaatttt taggtctgac tgatatgcag attcctctag atgtatgtaa 7500 ttcatcattt gtgaaaaagg attatgtgtg gtctcatctt actgatgttt tggtacaata 7560 tgatctggca gtaatgatgg ctaacaaatc tgtgattaac ttggaagatt tacatgacca 7620 tttagaggcc aaatgctgta gtcaggaatc tgtggctttc aatgtgacta tgtggatgca 7680 tttaaatatg tttcatttct atttggaagt tgttgcttac aggaaggcta tgctgcgagt 7740 cccagagaca gtggatgatc ttatctggaa tgagtttaag ccccatcagg tgtatatgtg 7800 gccccagata gcagatgtag caataggaga ggaatggttt gaccgagaga agcaattcag 7860 acagttcttg cgaccagtcc catcggttca ggagcttagg gatgtacatt atgccccggg 7920 atcaggtagt tgtgctttcc atcagttagt tcaccacatt ggccccgagt ggcacccagc 7980 aggaaattgt ccacacagta attgtgctga gattgaatac tgcgttgcaa acatgaccaa 8040 taccacggag tactacccat atgttaaggc tcatctggct agcggaaaat acattaatgc 8100 ttttctaaag cagattaaaa gtcctgatgc cttttctggt tatgaatatt tgaattctct 8160 ttggcttaga tcgactggat ccccttttct gtctgaacct gaggcaccct cagctttagc 8220 tgtgatgttt gcatttaccg ttggagcagt cttcactgat gtgactgaat cagtggaaga 8280 gtttgttgaa gaagcctggg gagaatttgt ggctgtatca ggaagcatag tcatgaaaat 8340 tttgaagtat gtcctgtttg gttttggagg gattataggg attttattca tcttcctaat 8400 agtctgggct ttggtccgct gttgttttgc gtatatcttc acttctgctt gcaggacact 8460 caaccccaca gacgatgaga aacatctact gaccgtgacg aaccgtcttc ggagccggat 8520 ggcttgacgg atgcaaagac gaaccccttt tgtgtgagta gaaggtggaa gagcccttag 8580 cttggggact ggaattttta ggttccagca tgtccaagaa agggtgaaga gactgtgtaa 8640 cctaacccag ggagcacggt cattacttcc actttcttta gcgacctcac tagttgaccg 8700 gaagcggatt cttcactttc ggccaaaaac gggggtgtga aggagtcgga gttctacctt 8760 ccaatccatc aacatttatt caacgtatat cctttctatt taatatcctg tttctcatct 8820 ctttgaatat atattttatt gattatctga atgagtattt aacttgaatt gatttgacta 8880 atgaataagt taaaactgct tactgcaaga atgttctgtg tccttgtttc tctctttgcc 8940 tgtctatttt aactttgttt tcaggagaca cacaaaccag gtcataggtc atggagacct 9000 ctctccctct ccctctatcg gttcctataa ctgtagggtc aatacagagg agaagaagac 9060 acctgtttca ttgtcctgat atattgccta taattttata gctgtttgat tgatctgttt 9120 ttatgtataa ttaatagtaa tgtaattcct tttcgtgtgg agatagactg tgtgtagcca 9180 agatattgca gcagactgtt gatggaaaca gtcttgaacc agcctgataa cacagctgaa 9240 cgtacaagaa acatccttga gtctttaaca cagatggtct tttgtgtcta ttgttttcct 9300 ggtttagctt tctacaggtc ggagatcagc tcgtaata 9338 // ID Gypsy44-LTR_DR repbase; DNA; ZEB; 699 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy44_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy44-I_DR; KW Gypsy44-LTR_DR; Gypsy44_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-699 RA Dib M.R. and Naveira H.F.; RT "Gypsy44_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 818-818 (2007). XX DR [1] (Consensus) XX CC Gypsy44-LTR_DR is a long terminal repeat of the Gypsy44_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy44-I_DR. XX SQ Sequence 699 BP; 141 A; 139 C; 140 G; 279 T; 0 other; tgttacattc cattgtgttt tgaatgtatt attgtgtttt ggtttttttt ctctctctct 60 ctctctctcc gttcctgcaa cctgtttctt agctaggtgt ggaggagagg aagttgattg 120 cttgcacctg ctactcgttg cccggcaagc ttaaaagcct gccagttgca agccacattg 180 aagcttgctt ccagagtgtg gtctctctcc tgcctggttg atctattgtg aaaaactatt 240 ggtgctgttg tgtttaactt gttgaagtta aaatctgttg ttaaagcctc tgaagccatt 300 tgagctctgt ttatgaaaac taactttgtg tttggaagcc gtagggaggt gggtgccatt 360 ttatttctga aacccttttt ctcttgttta tgtttagtta gggagttagt gtacatttct 420 gttgttgatt taattttact ttgtcagttt agacagtttt actccctaac tagctagagt 480 gtacttttgt ttgttgtttt ggcctttccc cctcctgaag tctatttttg cttcccagtt 540 ttgttttcct attgtacatt tgttaataaa ttatttaact tgaatttggt gtgttgtcgt 600 gaaggcaggg gactgaccca cactcacccc aaccaacatt tttttaaaac ttttcactgt 660 tactcccgct cctagacact taccatcagg gacgtaaca 699 // ID DIRS-14_DR repbase; DNA; ZEB; 6940 BP. XX AC . XX DT 07-JUN-2010 (Rel. 15.11, Created) DT 07-JUN-2010 (Rel. 15.11, Last updated, Version 2) XX DE A family of DIRS-type LTR retrotransposons - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-14_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6940 RA Kojima K.K. and Jurka J.; RT "DIRS-type LTR retrotransposons from zebrafish."; RL Repbase Reports 10(11), 1842-1842 (2010). XX DR [1] (Consensus) XX CC This consensus is generated from 6 sequences with >91% identity. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 222..1061 FT /product="DIRS-14_DR_1p" FT /translation="MENDNDSNIPESPALQEIPAAAQRQHAPLSSQVEPEA FT PTRGRRLTRSSSSKTHSLAVSPSPSTSRVHNASPASSYASAFSSLNPPKQM FT TNHELRYILSNAGIDAPRSLNKKELLELYHNATTEAGNPTHTSQAKKKTRN FT PKTRYAPYPPQELSPSIQPIQKQARRGHAPSRTPSSHHPQETQTSTHHLAS FT TLHRRIQTDHPRSPFWPPAPPSNFSSSPNPLIPPQSIPTASNPLEQINPLP FT LHPPLAPSSFPSGPPPPANFPSSSLPPSFPLCTPPPRSH" FT CDS 3390..5024 FT /product="DIRS-14_DR_3p" FT /note="tyrosine recombinase." FT /translation="LSLSFLFSEVLAVSSRGGPSSNPSASIFRPDIAIDHP FT LHKLHETSISLILQAVAPRTLQSYLTAWNSFKHFHSLFNIHFPDFSLLSIT FT SFISHLHSSKKIQASSIRSYLSGIQFFHKLIYGAPSAAILHSQTNLLIKGI FT QKTHPPPPDPRQPITLRILSKCIATLRKGYQSIHTAHTLDAMFNLAFFGFL FT RCSELTASDKFNPAIHPTISDLALLDKETLSFFIKQSKTDQSRKGHSIYIF FT DLPSPTSPFQTLLAFSHYRKRQTPNPLSPLFTDDSNQPVTRFWFQKHLKNI FT LRLSGFPPDPFSSHSFRIGAATTAAHKGLSQHQIQALGRWSSDAFNSYIRL FT SRFHIKAAQQTLISASSQRSHPTPPPTLTPPELDSHSPQESIAAPQASSTT FT FLFPATSASLSASTEPPLPFLLVSSASLPLSGESRRPPLLFCPSVECSAAS FT LASSLSAPLLCLPVTEVPRLLSTKFLRRPTFPFQRLTLRYPPLPSSTNLAS FT LYSFRIGPTHLLGPPQGYSPRAGAPKPPCRSPHRPPPFPDSCRNLAQQL" FT CDS 1429..3489 FT /product="DIRS-14_DR_2p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MAIIAELALSYGGTHFYTYHKLFSAKCAIRLTQWNQQ FT SYWGALDTDLHNRVFLGCRNLTCAVCRSSLHPTTSCPFTVSPTDSDRPAPK FT STSYVPRIPSSNIPSLLSPHANCQNFNIGRCFRQPCRYLHLCNYCGGAHAR FT TVCPISKANKKSKNYLSTPVNVSKLSFELRCHPDSNFCAFLVSGLTHGFHP FT GISALPTYNLICSNLQSAIAEPEIVDNLIEKEIKDNFMIGPFSAPPFNVFR FT ISPIGIATRKFSGKKRLIIDLSAPHNSYFPSINSTIPLEEFSLHYHDIDQA FT ISLIKTAGRNAWLAKIDITSAFKIMPIHPDFWHLFGIQWRSKFYFAVRLTF FT GCRSSPKIFDMLSEALCWILSNNYGIPYLIHLLDDFLIVSPPSAPPAKHLA FT ITQKVFADLGIPLAPEKTSGPATKIEFLGINLDSNKFQASLPKEKIDRITS FT LSAAFLEKQVCKKRELLSILGHLNFAMRIIPQGRPFITHLLQLANSVRSLE FT DPISLSNSSRDELKLWISFLQHWNGCSFFYSDLISSPVDIKLFTDAAPSVG FT FGGYYQGHWFAAPWPPQMLDTPQNLRSSALFELYPVVVAALLWGDEWSASS FT ILIHCDNEATVQCINKGRSHSTSLMPLLRRLIWTAAKKQFIITAKHVPGFK FT NQIADSLSRFCFQRFWRLAPEADPHPTPVPPYSDLILP" XX SQ Sequence 6940 BP; 1580 A; 2525 C; 1167 G; 1666 T; 2 other; agtgaagttt attcataaac taatttcgag aggatcacgt gcttatgatt gacacggctg 60 gccccgagtc aagctaatgt acgaaccacc aatcagacta ctcctaacca agtataaata 120 accaaacacc ttacctttag tcatcttcgt cttgaagaag cagcctcgcg ttaaaacgtc 180 gaatcgccat taacgttact cgtcgaagca gcaaattccg aatggaaaac gacaacgaca 240 gcaacatccc tgagagcccc gcgctccaag aaattccggc cgcggcccag cgccagcatg 300 ccccgctgag ctcccaagta gagcccgagg ctccaaccag aggccgccgc ttaacccgct 360 cttcctcctc aaaaacccac agcctcgcag tatctccttc cccatccacc tccagagtac 420 acaacgcttc accagcttct tcatacgcct cagccttctc ctccctcaac ccacccaaac 480 agatgaccaa ccatgaactc cggtacattc tgtcaaacgc cggcatagat gctccacgct 540 cacttaacaa aaaggagctg ctcgagttat accacaacgc caccaccgaa gcaggaaatc 600 ccacacatac gtcacaggca aagaagaaaa cccggaatcc caagacccga tacgccccgt 660 accctccgca agaactctct ccatcaatcc aaccaatcca gaagcaagct agaagaggcc 720 acgcccccag ccgcacacca tccagtcacc acccccagga aacccagacg tcaacccacc 780 atctagcatc cactctccac cgaagaatcc aaacagacca ccccagatct ccgttctggc 840 ctccagctcc cccttcgaac ttttcctcaa gccctaaccc acttatcccg ccccaaagca 900 tccccacggc ctcaaatcca ctagagcaaa tcaatcctct ccctcttcac cctcctctcg 960 ctcccagctc ctttcctagc ggcccgcccc ctccagctaa ctttccctct tccagcctcc 1020 ccccctcctt ccctctctgc acaccccccc cccgttctca ctagcaccgc tccaacccag 1080 caccccccac caccttctac agcacgtcct ccattcactc tccattccgc cacacccctc 1140 cctcctcccc ctaacgctcc agccctggaa cccccccccg tctcaaacac agcaaggaac 1200 caaatcttgt caggtgctga cgtagatctc ctctcacttc tttcacccat cacaccccct 1260 tcggcagagc gacaggtgga ttgcggcgaa ttttctataa ccctcaaacc acccgccaac 1320 acccaatcac gcaccttatc actcgccgaa ttcaccatcg ccttctcacg cttcaccgag 1380 ataatttgtt ccgtattccc ccataggagg cgtgagttaa acgattacat ggccattatc 1440 gcggagctcg cgctttccta tggggggact cacttttaca cctaccacaa gctcttctcc 1500 gccaaatgcg ccatccgact cacccagtgg aaccaacaat cctactgggg agcactcgac 1560 accgatcttc ataacagagt ctttttaggt tgccgcaatt taacctgcgc ggtctgccgt 1620 tccagtctgc acccaaccac ctcctgcccc ttcactgtct cccctaccga ttcagaccgt 1680 ccagcaccta aatccaccag ttacgtcccc cgtatcccta gctctaacat accctctctc 1740 ctttcccccc acgcaaactg ccaaaatttt aacattggca gatgcttcag gcagccatgc 1800 agatacctcc acctctgcaa ctactgcggg ggcgcacatg cacgcacagt ttgccctatc 1860 tccaaagcaa ataaaaaatc caaaaactac ttgtcgactc ctgtaaatgt ttcaaaattg 1920 tcctttgaat tacgctgtca tcctgattct aacttttgtg cttttctcgt ttcaggtcta 1980 acccacggat ttcaccctgg catctccgct ctcccaacct ataacctaat ttgctctaac 2040 ctacagtccg ctatcgcaga gcccgaaatc gtagataact taatcgaaaa agaaatcaaa 2100 gacaacttta tgatcgggcc cttttccgct cctccattta acgtctttcg catcagccca 2160 attggcattg ccactcgcaa attctcaggc aaaaaacgcc ttattatcga tctatcagct 2220 ccgcataatt catactttcc aagcattaac agcactattc cgctagaaga attctccctc 2280 cactaccatg acatagatca agcgatctcc ttaatcaaaa ccgccggccg taacgcatgg 2340 ctcgcaaaaa tagacattac ctcggcattt aaaatcatgc ccattcatcc cgatttttgg 2400 cacctttttg gcattcagtg gcgctccaaa ttctacttcg cagttcgtct aacgttcggc 2460 tgcagaagca gccccaaaat atttgacatg ctttccgaag ccttatgctg gatactgtca 2520 aataattatg gcatccctta cctcatccat ctcctagacg attttctcat cgtctcccct 2580 ccatccgcgc cgccggctaa acatctcgcg atcacccaaa aggtattcgc agatctcggc 2640 attcccctcg cccccgagaa aacctccggt cccgctacca aaatcgaatt cctaggcatc 2700 aatttagatt ctaataaatt ccaagcatct ctccccaaag agaaaatcga ccgtatcacc 2760 tcactctcgg ccgcctttct cgaaaaacaa gtttgcaaaa aacgcgagct cctctctatt 2820 ctcgggcatc taaatttcgc catgcgtatt attcctcaag gacgcccttt catcacgcac 2880 ttattgcagc tagccaactc agtccgtagc ctagaagacc caatttccct gtccaattcc 2940 agccgagacg aattgaaatt atggatatcc ttcctgcagc actggaacgg ctgttccttt 3000 ttctattccg atttaatctc atcgcccgtc gacattaaac tatttacaga cgccgcccct 3060 tctgtcggtt tcggggggta ctaccaaggg cactggtttg ccgccccctg gcccccccaa 3120 atgctagaca ccccccaaaa cctacgctca tccgccctct ttgaactata tcccgtcgtc 3180 gtcgccgccc tcctatgggg cgatgaatgg tccgctagca gcattctaat acactgcgat 3240 aacgaagcta ccgtgcaatg cattaataaa gggcgctcgc actccacctc gctcatgccc 3300 ttattaagac ggttaatttg gaccgcagcc aaaaagcaat ttatcataac tgctaaacat 3360 gtaccaggct ttaaaaatca aattgctgac tctctctctc gtttctgttt tcagaggttt 3420 tggcggttag ctccagaggc ggaccctcat ccaaccccag tgcctccata ttcagacctg 3480 atattgccat agaccaccct ctccacaaac tccacgaaac ctctatttct ctcatcctcc 3540 aagccgtggc tcccaggact ctgcaatctt atctaacagc ctggaactca tttaaacatt 3600 ttcactcact ttttaacatc cattttcccg atttctccct cctctccatc acctcattca 3660 tatcacacct tcactcatca aagaaaatcc aggctagctc cattagaagc tatttaagtg 3720 gcattcaatt ttttcataaa ttgatttatg gagcaccttc tgctgccatc cttcattccc 3780 aaacaaacct cctaatcaaa ggcatacaga aaacccaccc cccacccccc gatcccagac 3840 aacccataac tctccgcatt ctttctaaat gcatagccac cctccgcaaa ggctaccagt 3900 caattcacac agctcacacc ctcgatgcca tgtttaacct agcctttttt ggctttctca 3960 ggtgctctga gttaactgca tccgacaaat tcaacccagc catccacccc accatctcag 4020 atctggcctt gctggataaa gaaaccctct ccttctttat caagcaaagc aagacggatc 4080 agtccaggaa gggccactcc atatatattt tcgacctccc ctcccccaca agtccctttc 4140 aaacccttct agctttctcc cactaccgta agcgccaaac ccctaacccg ctttcccccc 4200 tcttcacaga tgattctaac cagccggtaa cccgattctg gttccaaaag catttaaaaa 4260 acatcctccg cctttctggt tttccccctg accccttctc cagccactcg tttaggatag 4320 gagcagccac tacagccgca cacaaaggcc tttcccaaca ccaaatccaa gccctaggcc 4380 ggtggtcttc agacgccttt aattcctata tccgcctcag ccgctttcac atcaaagcag 4440 cacaacaaac gcttatctcc gccagcagtc agcggagcca ccccaccccc ccccccaccc 4500 tcaccccccc cgagctcgac tcccacagcc cacaggagtc aattgcggcc ccgcaggcca 4560 gtagcactac ttttcttttt ccggcgacga gtgcctcgct ttctgccagc accgagcctc 4620 ccctaccctt ccttctggtg tcgagcgctt cgctccctct gagcggcgag tcacgccgac 4680 ccccactcct gttctgtccc agcgttgagt gctccgctgc ttctttagca tcgagcctct 4740 ccgccccact tttatgtctc ccggtgacgg aggttccacg cctgctaagc acgaagttcc 4800 tccgccgccc cacttttcca tttcagcgtt taaccctccg ctaccctccc cttccttcta 4860 gcacaaatct cgcttctcta tattccttcc gcataggccc tacccacctg cttggacccc 4920 cgcaggggta ttccccccga gcaggagccc caaaacctcc ctgcaggagt ccccaccgcc 4980 cacccccttt tcccgactcc tgcaggaact tggcccagca gctctgatcc cgagcttcga 5040 ctcccgcgga gtctgtcacc gccccccccc acccactaga taattttctg tcttccttct 5100 agcacagagc gctctgctgt ctcccagcat mgagtctctc tgcccacacc cttctttctg 5160 gtgtsgagcg ctccgcgccc tccgagcgtc ggagtctctc cacacccccc ccttctctgt 5220 cccagcgtcg agtgccccgc ttactcttag cgggagtctc tccgccccac tttatgcctg 5280 tctctcagtg ttgagtgctc cgctctctcc agcgccgagc ctctccacct ccctccgatt 5340 tttcttttcc ttttcagcac caagtctctc cgctacctcc gcttccttct agcacacctg 5400 gctcctctat gttccttccg cataggccct cactcaccag ctctgactcc cgctggagtc 5460 accccggagc acgattctgg ccacggcccc tccccgcagg agtcactatt gccccctccc 5520 gctcccgact ccagcgggga caaaaaaaaa aactggccca gcagctctga ccccgagctt 5580 cgactcccac ggagtctgtc accgcccacc cccgcccctt agaatttttc tttcttcctt 5640 ccagcatcga gtgctccgct accttctagc gccgagtctc tccacctttt cccttccttc 5700 cggtgttgag tgctccgctc ccccagagcg ctcgagtctc tccgcctcct ttctgtccca 5760 gcgtcgagta ctccgctcct tcttagcgtc gagtttctcc gccccacttt tttctctacc 5820 ctgcggcgtc gagggctccg cttcctccaa gcactgaagt ctctccgccc cctctccttt 5880 tcttattctt ttcagtgacg aataccgcca ctactccctc tagctctacc cctctatctt 5940 ccttctgctt aggccatcac tcccagctat gactccgcag agtctcccaa gcttccctgc 6000 ccagactccc acggagtcta tgcttaccca cgctcccgca ggagcccctc tctttctcct 6060 actcttccct aactgtatat ccagcagccg gatatagcat tggtatcagc ggctctgggg 6120 gggttcttca atacgcggct gctgtcccga gcataaagca ctttgggggg ttctcgagat 6180 ctacccgagc tcaaactccc ctctcgccct gcaacgggag ggagccccgg gctcgaggct 6240 ctcatgagct cggggctctc tcccgggaca gcatgccaaa ctagcttatt accaatcatc 6300 agctaagtgt gaactcttga agtgaagttt attcataaac taatttcgag aggatcacgt 6360 gcttatgatt gacacggctg gccccgagtc aagctaatgt acgaaccacc aatcagacta 6420 ctcctaacca agtataaata accaaacacc ttacctttag tcatcttcgt cttgaagaat 6480 ccccccttcc accccttcgc ctcctctttt gtctacaggg cagcacggcg gcccagtggc 6540 tagcactgca gcctcacagc aaaaatgcct ccggttcggg catctaccca aacggtcagc 6600 attttcgtgt ggagttcatg ttctccccgt gcttgcgtgg gttttccccg ggtcctccgg 6660 tttcctccca cactcccaaa aacatgacac ttaagtaaat tgactaatcc aaattagcac 6720 caaatttgat tcaattctgt cagcaacgca tcaccttagc aaccctcacg cagcaggagg 6780 gaggggggtt ctcgagatct acccgagctc aaactcccct ctcgccctgc aacgggaggg 6840 agccccgggc tcgaggctct catgagctcg gggctctctc ccgggacagc atgccaaact 6900 agcttattac caatcatcag ctaagtgtga actcttgaat 6940 // ID CR1-2_DR repbase; DNA; ZEB; 4238 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE CR1-2_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW AP endonuclease; CR1 clad; CR1-2_DR; CR1DR2; LINE_DR; ORF2; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 3422-4062 RA Okada N., Hamada M., Ogiwara I. and Ohshima K.; RT "SINEs and LINEs share common 3' sequences: a review."; RL Gene 205(1-2), 229-243 (1997). XX RN [2] RP 1111-4008 RA Jekosch K.; RT "CR1DR2: CR1-like repeat from Danio rerio."; RL Repbase Reports 2(2), 8-8 (2002). XX RN [3] RP 1-4238 RA Kapitonov V.V.; RT "CR1-2_DR, an ORF1-less semiautonomous family of CR1-like non-LTR RT retrotransposons in zebrafish."; RL Repbase Reports 2(4), 7-7 (2002). XX DR [3] (Consensus) XX CC CR1-2_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish recently. CC The consensus sequence was reconstructed based on 10 copies CC that are ~2% divergent from it. CC Approximately several hundred copies of CR1-2_DR are present CC in the zebrafish genome. CC The consensus sequence encodes one protein only, CR1-2_DR1p CC (position 1111-4005). It is a first CR1-like family without ORF1. CC A 5'-terminal 1110-bp non-coding is present in CR1-2_DR instead CC of CC a CR1-like ORF1. CC The 965-aa CR1-2_DR1p protein is composed of AP endonuclease and CC reverse transcriptase domains. CC Endonuclease/reverse transcriptase proteins encoded by CC CR1-2_DR-like CC elements were probably involved in proliferation of SINE_DR1. CC CR1-2_DR and SINE_DR1 share nearly identical 60-bp 3' tails. XX FH Key Location/Qualifiers FT CDS 1111..4005 FT /product="CR1-2_DR1p" FT /translation="MCFLIPVVTNTRKTREVRCRRNPHNLRSIHVSTISQL FT SLSVGLWNCQSAVNKADFITSIATYSDYNLMALTETWLRPEDTATHATLSA FT NFSFSHTPRQTGRGGGTGLLISKEWKFTLIPSLPTISSFEFHAVTIIHPFY FT INVVVIYRPPGKLGHFLDELDVLLSSFSNFATPLLVLGDFNIYVDKPQAAD FT FQTLLASFDLKRAPTSATHKSGNQLDLIYTRHCFTDQTIVTPLQISDHFLL FT SLNIHITPEPPHTPTLVTFRRNLRSLSPNRLSTIVSDSLPPSRKLTALDSN FT SATNTLCSTLASCLDRLCPLASRPARASPPAPWLSDALREHRSKLRAAERI FT WRKTKNPAHLLTYQTLLSSFSAEVTSAKQTYYRLKINNATNPRLLFKTFSS FT LLYPPPPPASSTLTTDDFATFFCTKTAKISAQFAAPTTNTQDTTPTPHTLT FT SFSQLSESEVSKLVLSSHATTCPLDPIPSHLLQAISPAVIPTLTHIINTSL FT DSGLFPTTFKQARVTPLLKKPNLDHTLLENYRPVSLLPFMAKILEKVVFNQ FT VLDFLTQNNLMDNKQSGFKKGHSTETALLSVVEDLRLAKADSKSSVLILLD FT LSAAFDTVNHQILLSTLESLGVAGTVIQWFRSYLSDRSFRVSWRGEVSNLQ FT HLNTGVPQGSVLGPLLFSIYTSSLGPVIQRHGFSYHCYADDTQLYLSFHPD FT DPSVPARISACLLDISHWMKDHHLQLNLAKTEMLVVSANPTLHHNFSIQMD FT GATITASKMVKSLGVTIDDQLNFSDHISRTARSCRFALYNIRKIRPFLSEH FT AAQLLVQALVLSKLDYCNSLLAGLPANSIKPLQLLQNAAARVVFNEPKRAH FT VTPLLVRLHWLPVAARIKFKALMFAYKVTSGLAPSYLLSLLQIYVPSRNLR FT SVNERRLVVPSQRGKKSLSRTLTLNLPSWWNELPNCIRTAESLAIFKKRLK FT TQLFSLHFTS" XX SQ Sequence 4238 BP; 1084 A; 1128 C; 781 G; 1244 T; 1 other; gcaggagaag cactttagca gcatctagaa cagcagcctg taagtacatt taagatttgt 60 ttcagttgtt gtgtatggtt tgaggacttg tttccagctg tttgtgtaaa gttgtaggac 120 atttaaactt gctttcagtt gtgtataata ctagaagttg tttagccact gtttccttgg 180 ttactataag agcttgtgta gcgaacgcag acgcggttcg cgtcgtccgc ctcagtttcg 240 gcccttgttt tgactcggga ggcgtgtcca aacgccaggt aaccactata tagtgagcac 300 ggtagcatta gtcggcagga gaagcacttt agcagcatct agaacagcag cctgtaagta 360 catttaagat ttgtttcagt tgttgtgtat ggtttgagga cttgtttcca gctgtttgtg 420 taaagttgta ggacatttaa acttgctttc agttgtgtat aatactagaa gttgtttagc 480 cactgtttcc ttggttacta taagagcttg tgtagcgaac gcagacgcgg ttcgcgtcgt 540 ccgcctcagt ttcggccctt gttttgactc gggaggcgtg tccaaacgcc aggtaaccac 600 tatatagtga gcacggtagc attagtcggc aggagaagca ctttagcagc atctagaaca 660 gcagcctgta agtacattta agatttgttt cagttgttgt gtatggtttg aggacttgtt 720 tccagctgtt tgtgtaaagt tgtaggacat ttaaacttgc tttcagttgt gtataatact 780 agaagttgtt tagccactgt ttccttggtt actataagag cttgtgtagc gaacgcagac 840 gcggttcgcg tcgtccgcct cagtttcggc ccttgttttg actctcgagg cgtgtccagc 900 tgaattcaat cagctagtgc tttggggtta tataaacaac tagttcaccg cggcagcggt 960 cgcggcagcc tcgtgtgaag accgacgagg gtaaagacca tcgactctac ctgcgcgact 1020 ccaccgagca aagacaccga caaagcactt gagtacttta ctgtattgtt ttactttaca 1080 cttatttttt gttgtcagtg cacttttatt atgtgttttc taattcctgt tgttactaac 1140 actcgcaaaa cacgggaggt acgctgcagg cgtaatcctc acaaccttcg ttcaatacat 1200 gtatctacta tttcacaact ctctctctcc gtgggcctct ggaattgtca atcagctgtt 1260 aacaaggctg attttattac ctccatagct acatattctg actataatct catggctcta 1320 actgagacct ggttgaggcc ggaggacact gctacacatg ctactctttc tgctaatttc 1380 tctttttccc acactcctcg tcagacaggg agagggggtg ggactggact actaatttcc 1440 aaagaatgga aatttactct gataccgtcc ctgccaacaa tcagctcctt tgaattccat 1500 gcagtcacca ttatccaccc cttctacata aatgtggttg tcatctaccg cccaccaggt 1560 aaattaggtc acttcctaga tgaactggat gttcttctct catctttttc taattttgcc 1620 actcccttat tggtgctagg tgacttcaac atttacgttg acaaaccgca agctgcagac 1680 tttcagactt tgcttgcctc ttttgaccta aaaagagcac ctacttctgc tacccacaaa 1740 tcaggtaatc agctagacct tatttacaca cgacactgct tcactgatca aacaatagta 1800 actccactac aaatatctga tcatttcctt ctgtctctca acatccacat tactcctgag 1860 ccgccacaca ctccaacact ggttaccttt cgcagaaacc tacgatctct ctcacccaat 1920 agactatcca ccattgtttc agactctctt cctccatctc gcaaactcac tgcacttgat 1980 tcgaacagtg ccactaatac actctgctcc acactagcat catgtctaga ccgattatgt 2040 cctcttgcat ccaggccagc ccgtgccagt cctcctgcac cctggctctc ggatgctctc 2100 cgtgagcatc gctcaaaact tcgggctgca gagagaattt ggcggaaaac taaaaatcct 2160 gcacatctct taacatacca aactcttctg tcctctttct cagctgaggt tacttctgca 2220 aagcagacgt attaccgtct gaaaatcaac aatgccacta atcctcgcct actttttaaa 2280 acattttcct ccctcctcta tcctcctcct ccacccgcat cctccacact tactactgat 2340 gactttgcta cattcttctg caccaaaact gcaaaaatca gtgctcaatt tgctgcacct 2400 acaacaaaca cgcaagatac aacaccaaca ccacacacac tcacctcttt ttctcagctc 2460 tctgagtctg aggtgtccaa acttgtgcta tctagccatg caaccacctg tccactcgat 2520 cccattccct ctcatctctt gcaagccatc tctcctgcag tcataccaac actgactcac 2580 ataattaaca catctcttga ctctggttta ttccccacta catttaagca ggctagggta 2640 accccactgc taaagaaacc caacctggac catacgctac ttgaaaacta cagaccagta 2700 tccctgcttc cattcatggc caagattctg gagaaagtag tgttcaatca agtcctggac 2760 tttcttactc aaaacaatct catggacaac aagcaatccg gctttaagaa aggccactca 2820 actgagactg ccctgctctc ggtcgtggag gatctcagac tggctaaagc agactctaaa 2880 tcatcagtcc tcattttgct ggacttgtca gctgcttttg acactgtcaa ccaccagatc 2940 ctgctatcta cgcttgagtc actgggcgtt gcgggcactg ttatacaatg gttcagatct 3000 tacctctctg acaggtcatt cagggtgtct tggaggggag aggtgtccaa cctacagcat 3060 ctaaacactg gggtacctca aggctctgtt cttgggccac ttctcttctc catctacaca 3120 tcatctctag gaccagtcat ccagagacat ggattctcct accactgcta tgctgatgat 3180 acccagctat acctctcttt tcatcctgat gatccctcgg ttccagctcg catctcagcc 3240 tgcctgttgg atatttcaca ctggatgaaa gatcatcatc ttcagctgaa cctcgcaaaa 3300 acggaaatgc ttgtagtttc tgccaacccg actctacacc ataacttttc aatccagatg 3360 gatggggcaa ccattactgc atccaaaatg gtgaaaagcc ttggagtaac gattgatgac 3420 caactaaact tctctgacca catttctaga actgctcgat cgtgcagatt cgcactctat 3480 aacatcagaa agatccgacc cttcttatct gaacatgcag ctcaactcct tgttcaagct 3540 cttgttctct ccaaactgga ttactgcaac tctctactag ctgggcttcc agctaactct 3600 atcaagcctc ttcaactgct ccagaatgca gcagcacgag tggtcttcaa tgaacctaaa 3660 cgagcacatg tcactccgct gctagtccgt ttgcactggc tgccagttgc tgctcgcatc 3720 aaattcaaag ctctgatgtt tgcctacaaa gtgacttctg gccttgctcc ttcttatctg 3780 ctctcacttc tgcagatcta tgtgccctcc agaaacttgc gttctgtgaa tgaacgtcgc 3840 ctcgtggttc catcccaaag agggaagaaa tcactttcgc gaacgctcac gctcaatctg 3900 cccagttggt ggaatgaact ccctaactgc atcagaacag cagagtcact cgctattttc 3960 aagaaacgac taaaaactca actatttagt ctccacttca cttcctaatc tgcaattgcc 4020 tctctgaata tcacactaac tgtacccaaa aaaaaaaaaa aaaaaaaaaa aaataaaaaw 4080 actactaata cttcccttct tagactttac agacctgaaa cttgcctata gcacttattc 4140 attgttgctc ttagttgtgt aaattgcttc cttgtcctca tttgtaagtc gctttggata 4200 aaagcgtctg ctaaatgact aaatgtaaat gtaaatgt 4238 // ID Gypsy-23-LTR_DR repbase; DNA; ZEB; 1216 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-23_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-23-I_DR; Gypsy-23-LTR_DR; Gypsy-23_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1216 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-23_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 20-20 (2005). XX DR [1] (Consensus) XX CC Gypsy-23-LTR_DR is a long terminal repeat of the Gypsy-23_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-23-I_DR. XX SQ Sequence 1216 BP; 348 A; 206 C; 286 G; 358 T; 18 other; tgtggcaatt aggatgtaaa tccataccac ctggagaaaa aatgattaca ggttaaaaag 60 aaaagagcgg aagaggaagt gataagaacg gtatgaggag agcactaaaa acagtggtcg 120 aatgagttgg ctctggaagt gtcgagaagt gtaagcgcca aaaagtgcct aaaaaagcga 180 agttgaaaag tctaccggtg gtgatcggat ccagtgaaga gtgcagttaa ggcgttttgg 240 ggagacgttt tggctgagat ggtggatttg gccgtcgccg tcgcccgaga tcatgtgcgg 300 tgtccacaac aaagggagct gagtgacgtc gctctacata gacgcgggct gtaggtgatt 360 tttcctctag ttcagcggtt tcaggtactt tacacttcsa caacaaacaa agaaaacaaa 420 acgttagcat accggcatcg ttgatgggtt gactgctaar gcgatgggkt wcagctgttg 480 acattatcca trccactcct acgctttgaa tgaactgtta gtgcgcacat acattccttg 540 yatagaggcc gctggacacm cagacacacc cacacataca cccacacata cacaaacaca 600 cacacacaca ctatcacacg cacmcacata cattgaacga actgtttaac attcagattc 660 cagytgatgc ttccagtaac gttatgttgg ttaattgaga aaamactgga ctctgatgcg 720 actgaactrt gtttccgtgr rttrattttr tatgtggttt acctttgaat ggccagagag 780 ccaatgattg tgaggggtgt aatktccatg gggtttgttc ttgtatatgg atgaaatgct 840 castgctgtc tttcacttag taaatactga ttatggttgk gattgcagtg gataaattct 900 tctgtgagtg tggtataaat tcatcatgat gttttcataa agctgtttat tttcaattgt 960 gcttttgagc gatctattgg tagtgatatt tcagtgatag aaaaataaat atttctatgc 1020 atactcaatt ggtgaatttg tgtaaaattt ttgtgtattt ttgtgtgtta tgtttagtat 1080 agtttgtagt cggtggctta ttgctatctg ggttaaagtt aaatttgtaa tatattgacg 1140 gatttattct gtctggcacc cgaacaaaaa gttaaaaatt acctacttgg gggtttaaga 1200 aagggccacc gttaca 1216 // ID Gypsy70-LTR_DR repbase; DNA; ZEB; 640 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy70-I_DR; KW Gypsy70-LTR_DR; Gypsy70_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-640 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1456-1456 (2008). XX DR Genome; chr14; Positions 3106815 3106176. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 640 BP; 119 A; 151 C; 110 G; 260 T; 0 other; tgtcagaacc cgggtttgaa ctttgccctc tttcttttct gacagcgttt tctatctttc 60 cttactgaca acattctaat ttgtctccaa attgctgtgt aactttactc agctgatcgt 120 cttttacaat tagtagcctc acatgttact gccagctgcc tttcatttct gctaattgct 180 gctcctatat ctgagggtgt ttcgcttctc cttttgtcag agtgttggat ttcttacagc 240 cttcttgtcg gtctgttgta ttatcattgt gttctctgtc cagttccctg gatccatccc 300 ggatttggtc tgccctactg gagagctgat cttttgcgtt gcctttgccc ttctcctgca 360 ttaccaggcc tggaccaaaa actaacttta ttttttgatt gaaatttagc ccactcctga 420 tttttctttt tcatgtggtg ctactaattt tgttggtatt tttcctgtac tgttgaacat 480 tgcctgtggt ccagttttca gtttattttt tactaaagtt caagatctgc tctgggatct 540 cagcaagcgc tgagtcactt tctttttgct tactttatta aaaaacccta ttgaattgaa 600 ctcctctgct tttgggtcct ttatacagac acacctgaca 640 // ID hAT-N72_DR repbase; DNA; ZEB; 890 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N72_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-890 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1739-1739 (2008). XX DR [1] (Consensus) XX CC TSD = 8 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 890 BP; 242 A; 233 C; 160 G; 255 T; 0 other; tagagatgcg cggatcagct gaaatgacac ccgaatccgc ggcttcatac taatcatcca 60 cccgccaccc gcccgcacat atagttttat ctgatatatg tatccgcacc cgatccgcac 120 ccgatcgtca ctttaataat ttaatttctt tgttttaggc atgatgatag gtcaccgtgg 180 atattaacag cagattcagt gagctataat ctgcgcatct ctgatagtaa aataatacgt 240 taatcgtaaa cgtttttaaa tgaacattta tttataaaaa caaaagtttg tctttagaca 300 acattttctt cattttcaaa tagatacaga aagtaaaaat aagacatttt agcttttaca 360 gatacaggaa caagataaca aaaagggtgc cacttcagcc taaacggcag ctatgttatg 420 tttacttggc ctttttcttt gcactgtgca aaaacagaat agcatctact gtgcctggat 480 tcgggcgatt ccgcctcgcc tccaaaacgc ggccagctgc actgaatgag cgttcgctcg 540 caccgcttgt tgtaggaatg aacaagattc tcctcgcaag gtgctgcagt cgtggaaata 600 agcgatcttg tttctcccag aacgatagca gattttcctc tgatgactcc tctaactgaa 660 actttgaaca gtaggctact cttgcacttc atccattatt ttgggcgtct catcttccca 720 ttctgcaaag cccactctct gcttttttcg ctggtccact gccatgtccc gctacatcca 780 cccgcaaccg acccgcaatt aatcataatg tattttttta ttacccgacc cgcccgaccc 840 gcggattatc cgcagcgccc gcggatataa ccgccatccg cgcatcacta 890 // ID Gypsy-167-LTR_DR repbase; DNA; ZEB; 878 BP. XX AC . XX DT 12-MAR-2009 (Rel. 14.03, Created) DT 12-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-167-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-878 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 695-695 (2009). XX DR [1] (Consensus) XX SQ Sequence 878 BP; 198 A; 165 C; 162 G; 353 T; 0 other; tgttacggct cagtatctgt gcctgtgttt gtttgtattt tttttctctc tccctccttc 60 tctacctatc attttcttta ggtccgccca gctcatctac ctattggcca agctagatgc 120 cagtcatcat taccatttgc gatgacatca tcattgtcca atcacaagca gacttcccca 180 ctttaaaagg aagccggtgc agtcgaaagc tagcttgctt tttgcttgct cactgttgtg 240 tgtttgctat gttggttgac tgtgtgtttt gctggatggc tggattatgt gacagtttct 300 tgtgctatgt gtgttataat catttgtgct gaaatcgatc tttaaaagtg ttctaattgt 360 gttttctctc tgattaccag atatacgaca tcaatgggat tcgaggaaca gggtagatgt 420 cacgtgacct acatgttgct acacgtagga gataccaatg tattatacac actagtttag 480 aagtgtgcgc cactaattgt aagttttgtt tacgtttgtg tttagaagtg tagtttagat 540 agggcgtgaa aacgctgaag acttttcttt ctctttcttt tattagatag tatagagatt 600 gtgggttcta gttagagttg tgttttgtta tattattttc ttttctttgg cgccactttt 660 gttcctcctc ccctttatta caaccattta tttattcttg ttttctttat tttgtatata 720 ctttgtttat catcagttat ccactttgta aaataaacac ttattttggc ataatttggt 780 tgtttgtctg ctccattgcc accaccattc ataaaagagt cataagtttt taaatgttgc 840 gttaacccta gcgtttccat cttaaaataa ccgtaaca 878 // ID BEL12-I_DR repbase; DNA; ZEB; 6580 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL12-I_DR; KW BEL12-LTR_DR; BEL12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6580 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1373-1373 (2008). XX DR Genome; chr14; Positions 49656857 49663436. XX CC 'GGCTC' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 465..6347 FT /product="BEL12-I_DR_1p" FT /translation="MSKSDIGSKKHTKQADIETRSENTRRSTRAQVPTEKM FT LAYQQEEQQKKKKKVISLYDKWKIEARNAREQLKSDLSDTQLAILIETLEI FT AKNKVLSAYDEFRKQTLPDVEMRRKMDSCEAVTTDVTRIVNERIAGIDGDY FT SYEREKERLSALSYKYANSIFGSTASQVSQEISDHSGHCSVLSDLEAKRVD FT AAAEVAAKQAAYNSLMAETKQKEKIKELEDQHKKALEVEFKELEQIQAQKD FT LKVAQAKLDVYSEEVEKITSSLNDCSSASGSELSQGNERKEVNRQEQNETT FT NPSVKASTSEFSALVQALQDGISASKLPVPEPPIFTGDPIHFIEFKQSFMA FT LIDKKGISSADKMFYLKKYVSGPARKALDGTFFRTDDEVYKDAWDKLNHRY FT GHPFIIQKAFRERLANWPKIHPKDAMGLQAFSDFLNACRGALPHVKGLQIL FT NDYQENQKLMQKLPDWAISRWNRQVTQSITNNHEYPTFDEFAVFVSTEADV FT ACNPVTSFHALHNSGYTDEKVNMKEVKKNRVRVLATQSNTEGDQYVIKSSN FT KTQCVFCQKNNHQLDNCSKFLSRPLEERRQYIQEKKLCYGCLNFGHSAKVC FT RYRLTCDTCNKKHPTSLHNNNFVKMVRTSSSAVQTQNDPEGTANAVTLNVA FT GEEPGVCTSMIVPVWVSAKHNPSCEQLVYALLDSQSDTTFIDKSVSDSLKA FT ASFPVKLKLTTMLGRDTVLQSERVTGLQVRAYKSSDYIDLPATYTKDCIPA FT NRHHIPTNEVAEHWKHLSVIADKIPPLQDCDVGLLIGYNCSRSMIPREVIV FT GGDDEPYAVQTDLGWSIVGCSPRSDLFNVSRLCHRIAVRELPSITPVNIIR FT ALESDFKDADEETVRVSQDDILFLEKLKSGISKNIHGHYQMPLPFKVRPKL FT PNNKRCAMVRLDHLKRKMVRDESYMKNYNQFMSQIFQRGDAEEVHSSGKEG FT ETWYLPHHGVFHPKKADKLRVVFDCSARHEGSSLNDHLLQGPDLINSLHGI FT LIRFRQHPIALTCDVEKMYHQFHVDEPDRDFLRFLWWKNGDITQPVHEFRM FT KVHLFGAASSSGCANYGLKHLATENGELYPLGSQFLLKNFYVDDGVISVEN FT VHEAIKVADEARRLCALGGLRLHKFLSNNELVLKGIPQSERMCGVRGLDLA FT FDELPLERTLGLQWDRESDCFKFKVHLKEQPASRRGILSAVASVYDPLGLI FT APVLFNGKRILQEMCKNGSGWDDPLMDGLSLRWVKWKQDLQSLQNLSIPRT FT YVPLNFGKPISIELHHFSDASNDGYGQCSYVRVKNEEGNVHCALVMAKSRV FT APLKVVTVPRLELAAAVVSVEVSNVLKRELDYPVANETFWTDSKVLLGYIS FT NEARRFHTFVANRVQRICRSTKIEQWRYVPTERNPADYASRGLTVKQLLDS FT DWFTGPKFLWETEMFIPSEPAPALLLGDPEVKSLFTLNTECQNSFNLANRL FT TKFSSWTKAVGAVARLLRRANKDRSSSLSTMLERQKAELHIIKCLQESSFK FT DELKEIKGGRGVSCGALRPLDPFVDLDGVLRVGGRLTRSSYPDSVKHPIII FT PKTHQITQMIVAHYHNKVMHQGKGFTVNEIRSNGFWIPGINKVVASFIKGC FT VVCRKLRRPVEEQKMADLPPERVDPSPPFLYTGMDCFGPFSVKSRRNVQKR FT YGLLLTCLCSRAVHIELLNDLTTDSFINALRCFIAIRGTVRQLRSDQGTNF FT VGARNELKGALCEMDPERITTFLLENQCEFIMNTPHASHAGGVWERQIRTI FT RSVLNATFALYPGKLDDSCLRAFLYEAMAIVNSRPLTVDCLNDPQSLRPIT FT PNNLLTLKPVTALPPPGKFLEEDVYARKRWRHVQYLAEQFWSRWRKEYLAN FT IVTRQKWNMPKRNLQINDIVLVKEENLPRNEWKLGRIQETISSGDGLIRKA FT KVLLGDCKLDAKGKRTTKQSIIERPVQKLVLLKEGNE" XX SQ Sequence 6580 BP; 2123 A; 1141 C; 1467 G; 1849 T; 0 other; gtaagagctt gccatcatcg ggtcgctgct taacggagtg gacggcaatt aaaaggactt 60 ctggcatcgg cgcgctcgtt gggaacggtg tgattgctcg ctacggatca agagcggact 120 ttaaacagac ggagctacaa ggctacacac agctgcacgg actgatctaa cggacttcac 180 ggcatacaac ggctttaaac tagtttgagg atcgcagtga gtactaaaac aaagaaacta 240 actgaacttt taaagggaca atttctcgat tatggtgtgt gtgtgttttt aaagggcaat 300 tgacgttgtt gtgtgtgttt aagtcacttg ttaaatgaat tgtgatctat caaagtgttt 360 aactatgcat tttaatgcct ttttgcatat ggttaaaatg agtggttaat ttagtgcctt 420 attgacctaa aattattgtg gggtattaat ttatagatcc aaaaatgtca aaatcagata 480 taggttcaaa gaagcacact aagcaagctg atatagaaac taggagtgaa aatacacgac 540 gttcaacacg tgcacaggtc cctactgaaa aaatgttagc ttatcagcag gaagagcaac 600 aaaaaaagaa gaaaaaggtt atttctttgt atgataaatg gaaaattgaa gcccgtaatg 660 cacgtgagca acttaaatcg gatctatcag acactcaatt agctatttta attgaaactt 720 tggaaattgc taaaaataag gttctcagtg cttatgatga atttaggaaa caaacattac 780 ctgatgttga gatgagacga aaaatggatt catgtgaagc agtaacaact gatgtgacca 840 gaattgttaa tgaaagaata gcaggaatag acggtgatta cagttatgag agggaaaagg 900 aacgtcttag tgctttgagt tataagtatg ctaattcaat ttttggttca acagcttcac 960 aagttagtca ggaaatcagt gatcattcag gtcattgctc tgttttatca gacttggagg 1020 cgaaacgtgt agatgctgca gcagaagtag cagccaaaca agctgcatat aattctttaa 1080 tggcagaaac taagcaaaaa gagaaaataa aagagttaga agatcagcat aaaaaagccc 1140 ttgaagttga gtttaaagaa ttagagcaaa ttcaagcgca aaaggattta aaagtagctc 1200 aagctaaact ggatgtttac agtgaagaag ttgagaaaat aaccagctct ttaaatgatt 1260 gtagtagtgc gagtggatcg gagttgtctc aaggaaatga aaggaaagaa gttaatcgac 1320 aggaacaaaa tgagaccaca aatccatcag ttaaagcatc aacttctgaa ttctctgctc 1380 ttgttcaagc acttcaagat ggcatatctg caagcaaact tccagttcca gagcctccta 1440 tttttacagg tgatcctatt cactttattg agtttaagca atcctttatg gcacttattg 1500 ataagaaagg gatttcctct gctgataaga tgttttatct caagaaatat gtgagtgggc 1560 ctgcacggaa agccttggat ggcactttct tcagaactga tgatgaagtc tacaaagatg 1620 cttgggataa gttaaatcat cggtacggcc atccatttat tattcaaaag gcgtttagag 1680 aacgtttagc aaattggcct aaaatccacc caaaggatgc catgggactt caagctttct 1740 cagattttct gaatgcatgt cgtggtgcat tacctcatgt taagggtctt cagatcttaa 1800 acgattacca agagaatcaa aaacttatgc aaaagttacc tgattgggca attagtagat 1860 ggaatcgtca agttacacag tctatcacca ataatcatga atatcccact tttgatgaat 1920 ttgctgtgtt tgtgtcaact gaagcggatg tagcatgtaa tccagttact tccttccatg 1980 ctcttcacaa ttcaggatat acagatgaga aggtaaacat gaaagaagtc aagaaaaata 2040 gagtacgggt tttggctact caatcaaaca ctgaagggga tcaatatgtg attaagtctt 2100 ctaacaaaac tcaatgtgtg ttctgtcaga aaaataatca tcaacttgac aattgttcta 2160 aatttctcag tagaccatta gaagaacgac ggcaatacat ccaagaaaag aagctttgtt 2220 atggatgcct gaattttgga cacagtgcaa aagtatgtcg atatcgtcta acatgtgata 2280 cctgtaacaa gaagcatcca actagtttgc acaataacaa ctttgtgaaa atggtaagaa 2340 cttcatcttc tgcagttcaa acgcaaaacg atcctgaagg tacggctaat gcagtaacgt 2400 taaatgtagc tggagaagaa cctggtgtgt gtacttctat gattgtccct gtttgggtat 2460 ctgcgaaaca taatccaagt tgtgaacaat tggtatatgc gctgttggat tctcagagtg 2520 acacgacttt cattgataaa agtgtgagtg attcattaaa agctgcatca tttccagtga 2580 aactaaagct gacaacaatg cttggaaggg atacggtact tcaaagtgag agagtgactg 2640 gacttcaggt gagagcttat aagtctagtg actatatcga tttaccagca acttatacta 2700 aagattgtat tccagcaaat cgccaccata ttcccacgaa tgaggttgct gaacattgga 2760 agcatctttc agtcatcgca gacaaaatac ctccactaca ggattgtgat gttggcctgt 2820 taatcggcta caactgttca agatcaatga ttcccagaga agtgatagta ggaggagacg 2880 acgagcccta tgcggttcaa acggatcttg gatggagtat agtaggttgt tcaccacgct 2940 ctgacttatt taatgttagt agattgtgtc atcgcatagc agttagggag cttccttcta 3000 tcacacctgt caacataata cgggcccttg agagtgattt taaagatgca gacgaagaaa 3060 ctgtcagagt atcccaagat gatattctct ttcttgagaa attaaaatcc ggcataagta 3120 aaaacattca cggtcactac caaatgcccc ttccttttaa agtaaggccg aaactgccta 3180 ataacaaacg atgcgctatg gttagacttg atcaccttaa aaggaagatg gtaagagatg 3240 aatcatacat gaagaattac aatcagttca tgagtcaaat ctttcagaga ggagatgctg 3300 aagaagtgca cagtagtggt aaggaaggag aaacgtggta cctacctcac catggtgttt 3360 ttcatccgaa aaaggcagac aaactgcgtg tggtcttcga ctgttctgca aggcatgaag 3420 gttcatccct aaatgatcac ttgcttcaag gtcctgatct cataaatagt ttgcatggca 3480 ttctaattcg atttcgtcaa catcccattg cacttacttg tgacgttgaa aaaatgtacc 3540 atcagtttca cgtagatgaa cccgatagag attttctgcg atttctttgg tggaagaatg 3600 gagacatcac acaacctgtt catgaatttc gtatgaaagt tcatttattt ggggctgctt 3660 catcttcagg ttgtgcgaac tacggcctaa aacatcttgc gactgaaaat ggggagttgt 3720 atccactggg ctcacagttc ttgcttaaga acttttatgt cgacgatgga gtgatcagtg 3780 tggaaaacgt gcatgaagcc attaaggttg cagacgaagc gcgacgtctc tgtgcattag 3840 gtggtttacg attacataag ttcttgtcta ataatgaact tgtattgaag ggcatccctc 3900 aatcggagag gatgtgtggt gtaaggggtc tggatcttgc ctttgatgaa ttgccattag 3960 agagaacact tggattgcag tgggataggg aatctgactg cttcaagttt aaagtacacc 4020 tgaaagaaca acctgcctct cggcgtggca tattatctgc tgtcgcttct gtttatgacc 4080 ctttaggact cattgcacct gtccttttca atggaaagcg aatcttgcaa gagatgtgca 4140 agaatggcag tggttgggat gaccctctga tggatggact gagtttaaga tgggtaaaat 4200 ggaaacaaga tttacaaagc ctgcaaaatc taagcatacc acgcacatat gtaccattga 4260 atttcgggaa accaattagt attgaattgc atcatttctc cgatgctagt aatgatggat 4320 atggtcaatg ttcttatgtc agagtaaaga atgaagaagg aaatgttcat tgcgcacttg 4380 tgatggcaaa atcaagagtt gcacctctca aagttgtaac agtccctagg ttggaattag 4440 ctgctgctgt agtctcagtg gaagtaagta atgtactcaa aagggagctg gattatcccg 4500 ttgctaatga gacattttgg actgactcta aagtgttgct ggggtatata agtaatgaag 4560 cacgcagatt tcacacattt gttgcaaacc gggtgcaaag gatatgccgc agcacaaaaa 4620 tcgaacagtg gagatatgtt ccgacggaaa gaaatcccgc tgactacgca tctaggggtc 4680 tcactgtaaa gcaactactt gactctgatt ggtttaccgg cccaaaattt ttgtgggaaa 4740 ctgaaatgtt tattccatct gagcccgctc ctgctcttct attgggcgat ccagaagtaa 4800 aaagcctttt tactttgaat acagagtgtc aaaacagctt caatttggcc aaccgcttaa 4860 caaaattttc ttcatggact aaggcagttg gagctgtagc tcgtctctta aggagagcca 4920 acaaggatag atctagtagt cttagtacca tgctagaaag acaaaaggca gaactccata 4980 taattaaatg cttgcaagag agttctttta aggatgagct gaaggaaatc aagggtggga 5040 gaggtgtgtc atgtggtgca ttacgtcctc ttgacccttt cgtggatctt gatggtgtgc 5100 ttagggtggg aggaagactt actcgatcaa gttacccaga ctctgtgaag catcccataa 5160 ttattccaaa gacacatcaa attacacaaa tgatcgttgc tcattatcac aacaaagtta 5220 tgcatcaagg aaaaggattt actgtaaatg aaattagaag taatgggttc tggatcccag 5280 gaatcaacaa agttgtggca tccttcatta aaggttgtgt tgtctgcaga aaactcagaa 5340 gaccagttga ggaacaaaaa atggcagatc ttccaccaga aagagttgat ccatcccctc 5400 cgtttctgta cactgggatg gattgttttg gtccgttttc agttaagtct aggcgaaatg 5460 tacaaaaaag gtatggcctc ctgttaacgt gtctttgttc ccgtgctgtg cacattgaac 5520 tgcttaatga cttgacaact gattctttta taaatgctct acgttgtttt attgccatta 5580 gaggcacagt tcgacagcta agatctgatc aagggaccaa ttttgtgggg gcaagaaacg 5640 agctgaaagg tgcactctgt gaaatggatc cagaaagaat aactactttc cttttggaaa 5700 atcagtgcga attcatcatg aatacacctc atgcaagtca tgctggtggc gtatgggaac 5760 gtcaaataag aacaataaga tctgttctga atgcgacctt tgctctttat ccagggaaat 5820 tagatgactc ttgtctgaga gcgttccttt atgaagcaat ggcaattgta aatagcagac 5880 cactaactgt tgattgtttg aatgatccac aaagcttaag acccattact ccgaacaatc 5940 tgcttacatt gaagccagtg actgctctgc ctccaccagg taaatttttg gaggaagatg 6000 tgtatgcaag aaagaggtgg agacacgtgc agtatttagc cgaacagttt tggagccgtt 6060 ggcgcaagga atatttagca aacatagtga ctcgtcagaa atggaatatg cctaagagaa 6120 accttcaaat taatgacatt gtattggtga aggaagaaaa cttgcctaga aatgaatgga 6180 agctaggtag gatacaggaa accattagta gtggtgacgg gcttattagg aaggctaaag 6240 tgctcttagg tgattgtaaa cttgatgcga agggaaaacg tacaaccaag caatccataa 6300 ttgaaagacc agtgcaaaaa cttgtattgc taaaggaagg aaatgaatag ttatttaaaa 6360 ttgtgtaaat aagtggatca tgaaatcatt ttaaattttc tacatattta tgggtgtgta 6420 tttattatca tgaaaaaaaa agggtattta taaaaaaaaa aaaaaaattt taaaagggag 6480 aaaaaaaaaa aaaaaaaaaa cacgtgaaga gtttgttgct aggaaattat tcgtgacttg 6540 taatttgggt ttaatacttg cactcataat ttggtgggag 6580 // ID CR1-33_DR repbase; DNA; ZEB; 1448 BP. XX AC . XX DT 12-FEB-2009 (Rel. 14.02, Created) DT 12-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-33_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1448 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 519-519 (2009). XX DR [1] (Consensus) XX CC 5'- and 3'- ends are uncompleted. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(208..858,729..1286) FT /product="CR1-33_DR_1p" FT /translation="IIRFYCPPSPQWASLEPHSAGWSLTSPADHSGLPGEE FT RYPKLISWSLEYLRDLVLGPLLFSIYTTSLGTIIQSHGFSYHCYAYDTQLY FT LSFHPDDPTVAARISHCLEDISAWMKDHHLQLNLAKTELLVFPATPHIQHN FT FSIQLGSSTITPCNSVRNLGVILDDQLSFKEHIANTARSCRFALFNIRKIR FT PFLTDHAAQLLVQALVISRLDYCAMLFHSSIMQVCTFQHQKNQTLPNRSCC FT TTFSPSSCHFKAGLLRNALLAGLPSCTLKPLQMIQNAAACLVFNEPKRAHV FT TPLFISLHWLXVEACIKFKSLMLAYRTATGTAPSYLHLLLKVYIPSRSLRS FT VSEHRLVVPSQRGYKSLSKTFSFNVPCWWNDLPIPTQTADSLVSFKRQLKY FT KPISSXIT*" XX SQ Sequence 1448 BP; 385 A; 411 C; 244 G; 402 T; 6 other; tattgccaaa acactggaaa ggttgttttc aaccaaattt ycacatttta acacaataac 60 ctgctggatg caaaccaatc aggcttcaaa tgtggacatt ccactgagac tgcactactg 120 tcactgaagc cctgcggatt gccaaagcag gttctaaatc atctattctg cttttactgg 180 acctgtctgc tgcttttgat atggtgaatc atcagattct actgtccacc ctctcctcaa 240 tgggcatcac tggaacccca ctctgctggt tggagtctta cctcaccagc agatcattca 300 gggttgcctg gagaggagag gtatccaaag ctcatcagct ggtcactgga gtacctcagg 360 gatcttgttc ttggtcccct tctattttct atttacacca catcactggg aaccatcatc 420 caatcacatg gtttctctta tcattgttat gcctatgaca cacagctcta tctttcattt 480 catccagatg acccaacggt agctgcaagg atctcgcact gcctggagga tatctcagct 540 tggatgaaag accatcacct acagcttaat ctggcaaaaa ctgaacttct tgtcttccct 600 gcaactccac acattcaaca caatttctca atccaactgg gatcatctac tattacccca 660 tgtaattcag taaggaatct tggggtaatc ctggatgatc agctgtcctt caaagagcac 720 attgctaaca cagctcgatc atgcaggttt gcacttttca acatcagaaa aatcagaccc 780 ttcctaacag atcatgctgc acaactttta gtccaagctc ttgtcatttc aaggctggac 840 tactgcgcaa tgctcttcta gctggactcc catcttgcac ccttaaacct ctacagatga 900 tccagaatgc tgcagcttgt ctggtcttca atgagcccaa gagagcccat gtaacacctc 960 tctttatctc tctgcattgg ttacyggttg aggcttgtat caagttcaaa tcattgatgc 1020 ttgcttacag aacagccact ggcactgcac cctcttacct ccacctgctc ctgaaagtct 1080 acatcccctc caggagcctc cggtcagtaa gtgagcaccg ccttgtggta ccatcacaga 1140 gaggctataa gtcactttcc aaaacctttt cattcaatgt tccwtgctgg tggaatgatc 1200 ttcccattcc cacacagact gctgattcac tagtatcatt caaacgacaa ctaaaataca 1260 aacccatctc ttcyraaatc acctgaaccc ctgtgaacaa aaccaggacc tcctcttctc 1320 cccccacacc acacaacaac ctcaaaatta tgcaattctg agtgtgcctc acacaggtga 1380 gtgggcttga caaaccacct gtagaaaaca cattctttca tctctctgac attttaaccr 1440 acacttgc 1448 // ID Gypsy50-I_DR repbase; DNA; ZEB; 5692 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy50_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy50-I_DR; KW Gypsy50-LTR_DR; Gypsy50_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5692 RA Dib M.R. and Naveira H.F.; RT "Gypsy50_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 831-831 (2007). XX DR [1] (Consensus) XX CC Gypsy50-I_DR is an internal portion of the Gypsy50_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy50-LTR_DR. CC Gypsy50_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1599-aa polyprotein CC Gypsy50_DR1p (pos. 608-5404) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 608..5404 FT /product="Gypsy50-I_DR_1p" FT /note="ORF." FT /translation="MSSSSIIEDFFAFPSEILLERCTKEELLQIAERFDIE FT VTSNDKKLKETLMKAVKSALCERGVLEVRAPIIPDESVSSPVCSVDLNAKF FT SEMSLQEKQLCVDAEKFRAERDVCALKEKELENDFEVKKMQQQLEMRKMEL FT ELQKEREEREFQLRKLELELKLKSHQAPVISAAESNVFDVHKNIRLVPPFL FT EREVEKYFQHFERVAESLKWPKTFWTLLLQCVLIGRAQDVYSALTTEQSSD FT YEIVKSAILQAYELAPEAYRQKFRTHFKSEKSTFLEFAREKENLFDRWCSS FT VKIKSKEELRELILLEEFKCCVPNAVAMYLNEHKVTKLSEAALMADEFTLT FT HQNTAHVNDMSRLKLNSMSKSKKMLSRQHVVSVPAENVSLTRQKGEIICFY FT CKKQGHKISDCAILKKKDKSVKPVGLISKVNHNVSTKQIEKINISEMEDKT FT SVDFGPFITDGSVSLPDSDEFVPVRILRDTGAGQSFLMQGLLPLSERTATG FT SHVLVRGLEMGYMEVPLHHIHLNSKLVTGDVVVGVRANLPIPGVTFILGND FT LAGGNVWEKPDGEIPPVVFPSVEGLNKPPDCSDLFPACAVTRAMTKRGVLD FT ETGSVLSDTFISSTDSSLEPGTVNTTTVDEETPVHNITSVDAHIEKPVSLS FT ESPPCDANANTCLLSPLHSVTRAELIKAQHADEAVQEMFSLVGAGGENKVS FT TSYILRDGLLCRQQPMSSDSSLDPRIQIVVPSIFRHDVLRLSHQGPAGHMG FT VRKTCDRILREFYWPRLKKDVTKYIRSCHNCQMTGKPNQRIPLAPLQPIAV FT CTTPFEHLIIDCVGPLPRSKAGHAYLLTVMCQSTRYPTAYPLRSITTKSIL FT KALTNFMSIFGIPKTIQSDQGSNFMSRQFSKVLQQLKVKHNISSAYHPQSQ FT GGLERFHQTLKSLLRSYCVELDCDWEDGLPWMLLAIREVVQESTGFSPNEL FT VFGHTVRGPIAVLADEWNITDPPENVLDYVSGFRYRLYEARAIALRKLGKS FT QEKMQRLFDRKVKLRSFQVGDKVLALLPVLSSPFQARFSGPYTIVKCYTNN FT NYLLSTPDRRKKVQVCHVNLLKPYLSPVSSLSVHVVTSDLDSSVSGASVDM FT TTDLDEKSGPSRGVVEGRLNNSEFLVNLSHHLSHLCEPEKTSIIELVNSFP FT SIFSDVPTQTHVIQHDIDVGMASPVKQHPYRVNPVKRKLLEKEVEYLLAHR FT LAEPSFSSWSSPCILVTKSDQSYRFCTDYRKLNSLTKPDCFPLPRIDDCVD FT RVGSAQFVSKFDLLKGYWQVPLTSRAKELSAFITPDSFLQYTVMPFGVRNA FT PATFQRLVNRVLSGMQGCEAYLDDVVLYSSTWSEHLAQIRELFERLAKANL FT TVNLAKCEFGKATVTYLGKVVGRGCVRPITAKVEAISSFPTPTTRRELRRF FT LGMVGYYRGFCKNFADVVAPLTDLLSPKRPFHWSVQCQCAFDNAKSLLANA FT PVLTAPNFEKPFLLAVDASAFGAGAVLLQEDDKGVEHPVSYFSKKFNCHQQ FT VYSTVEKEALALVLAVQHFEVYLSSVCGPIVIYTDHNPLTFLDRMRGKNQR FT IMRWSLILQPFSLLIKHIRGKDNLIADALSRV" XX SQ Sequence 5692 BP; 1491 A; 1038 C; 1285 G; 1878 T; 0 other; taaaattggg cgctcgtccg gataatttaa ctacacataa ggtaatgcat tttgtttgtg 60 ttcaccaatg taaatgagta gacttcaggt agttagagtg ttctcagctg ggtttgtaat 120 ccttccatcg agacactatt tgttatttta gtttgttatt ttgccttttt tatttttgaa 180 ggttattctg gatggagatt attaagtctc tgtcgggggt acgcatcggg cttttggcta 240 tgccaattgt cctcgagctt agcgtttaaa agccgccccg agtccggtac acctttgaag 300 attaggtagg ttaagttagt tatttttcgt gtaggtagga actgaagtct ctcattttca 360 ttcttttcat aaggtgacac tttgtttttt tccgtatttg ttaattacga tgccctccga 420 ttaatagagt aactatagtt tatagtaatc tcttataacg gtgagttttt gtaaactgag 480 cgcgtctgct gttattgttc ctggcgaagt ggctgatggg atttgtagtt gttaggcgct 540 agtttcacgt gcgcttcagt gtgtgcgcgc tttgttgttg ttgttagagc gtttgctgtt 600 ggcagtcatg tcgtcttcat caattattga agatttcttt gcatttcctt cggagatact 660 actggagaga tgtacgaagg aggaactgtt gcagatcgcg gagcgttttg atattgaggt 720 aacgagtaat gacaagaaac tcaaggaaac gctgatgaaa gctgtgaaga gcgcactttg 780 tgaacgcggc gttttagagg taagagctcc tattattcct gatgagtctg tatcgtctcc 840 tgtttgtagt gtcgatctaa atgcaaagtt tagtgaaatg tcgttgcaag aaaaacagct 900 gtgtgttgat gcggaaaaat ttagagccga gcgtgatgtt tgtgctttga aagagaaaga 960 gttggagaat gattttgaag taaagaaaat gcaacaacag ttagaaatgc ggaaaatgga 1020 gttggagttg cagaaagagc gagaggaaag agagtttcaa ttgcgaaaac ttgagctgga 1080 attgaaatta aaatcacatc aagctcctgt aatttcagct gctgagtcaa atgtgtttga 1140 tgtacacaaa aacattaggc tagtgccgcc gtttttggag agggaagtag agaaatactt 1200 tcagcatttt gaacgtgttg cagagtcatt aaagtggcca aagacttttt ggacattatt 1260 attgcagtgc gtgttgatag ggagagcaca ggatgtgtat tcagcgttga ctacagagca 1320 aagtagcgat tatgagattg tgaagtccgc tattttacaa gcttatgaat tagcacctga 1380 ggcttatcga caaaaattca gaactcattt taaatctgag aaaagcactt ttcttgagtt 1440 tgctagagag aaagaaaatc tgtttgaccg ctggtgctca tcagtaaaaa ttaaatctaa 1500 ggaggagctt cgagagctca ttcttcttga agagtttaag tgctgtgtcc caaatgctgt 1560 ggccatgtat cttaatgagc ataaagtgac taaactgtct gaggctgcac ttatggcaga 1620 tgaatttact ctcactcacc agaatactgc tcatgtaaat gatatgagta gattaaagct 1680 gaattctatg agtaaaagta aaaagatgct ttcacgtcag catgttgtgt cagttcctgc 1740 tgaaaatgtt tcactcacac gtcaaaaagg tgaaattatt tgtttttatt gtaaaaaaca 1800 aggacataaa atttctgact gtgctatttt aaaaaagaaa gataaatctg ttaaacctgt 1860 tggtttgatt tcaaaagtga atcacaatgt tagcactaag cagattgaga aaataaatat 1920 cagtgaaatg gaggataaga ccagtgtgga ttttgggcca ttcattactg atggtagtgt 1980 ttctttgcct gactctgatg aatttgttcc tgttcgtatt cttcgtgata caggtgctgg 2040 acagtcattt ttgatgcaag gacttttgcc tttgtctgaa cgtactgcta ctgggtctca 2100 tgtgctggtc agagggcttg aaatgggcta catggaagtt ccattgcacc acattcacct 2160 taactccaaa cttgtaacag gtgatgtagt tgtaggagtt cgtgctaacc ttcctattcc 2220 tggtgtcacc tttatacttg gaaatgactt ggctggtggt aatgtgtggg agaaacctga 2280 tggtgaaatt ccacctgtag tgtttccttc tgttgaagga ttgaataagc ctcctgactg 2340 ttctgatttg ttcccagcat gtgcagtaac tcgagcgatg actaagagag gagtgttgga 2400 tgagacggga agtgttctta gtgatacttt tatttcatct acagactctt cactagagcc 2460 tggtactgtg aacaccacta cagtagatga agagactcct gtgcacaata ttacatctgt 2520 tgatgcacat attgaaaagc ctgtttcact ttctgaaagt ccaccttgtg atgctaatgc 2580 taacacttgt ttgttatctc cacttcatag tgtgactcga gctgagctga ttaaagccca 2640 gcatgcggat gaagctgttc aggagatgtt ttccttagtc ggtgctggtg gagaaaacaa 2700 agtctccacg tcttatattc ttcgggatgg gttgctttgc agacaacagc ctatgtcttc 2760 tgactcttct cttgatccta ggatccaaat tgtggttcct tcaatttttc gtcatgatgt 2820 cttaaggctg tcacaccaag gccctgctgg tcatatgggg gtaagaaaga catgtgatag 2880 gattttgagg gaattttatt ggcctagact taagaaagat gtgactaaat acatacggtc 2940 ttgtcacaac tgtcaaatga ccgggaagcc taatcaacgt attccattgg ctcctttaca 3000 acccattgct gtttgtacta ccccttttga acatctaatc atagattgtg tgggtccttt 3060 accccgatct aaggccggac atgcttattt gcttactgta atgtgccagt ctacacgcta 3120 tcctacagct tatccattga gatcgatcac aacaaaatct attttaaagg ctttgacaaa 3180 tttcatgtct atctttggta ttccaaaaac cattcagtct gatcaaggtt caaattttat 3240 gtcccgacaa ttttcaaaag ttctacaaca actaaaagtt aaacacaata tctcaagtgc 3300 ttatcatcct cagagtcagg gaggtttaga aaggttccac cagactctta agtcactttt 3360 gagatcctat tgtgttgaac ttgattgtga ctgggaggat ggacttcctt ggatgttgtt 3420 ggctattcgt gaagttgttc aagagagcac tggttttagc cccaatgagt tggtgttcgg 3480 acacacagtc cgaggcccga ttgctgtttt agccgatgaa tggaacatca ctgatccacc 3540 tgagaatgtt ctagattatg tcagtggttt tcgctatcga ctttatgaag ctcgagctat 3600 cgctttgcga aaactaggta agtctcagga aaagatgcag aggttgtttg atcgcaaagt 3660 taaacttcga agttttcagg ttggcgataa agttttagca ttgcttcctg ttttgagtag 3720 tccgtttcag gctaggttta gtggaccata caccatagta aagtgttaca cgaataacaa 3780 ctatctttta agtacacctg atcgacgaaa gaaagtacag gtttgccatg ttaacttgct 3840 gaagccatac ttgagccctg tttcatctct ttcagtacat gtcgtgacct ccgatcttga 3900 ctccagtgtt tcaggagctt ctgtggacat gactactgat ctggatgaaa agagtggtcc 3960 ttcacgagga gttgtggaag gacgcctgaa taattcggaa tttcttgtta atctctctca 4020 tcatctgtct catttgtgcg aaccagagaa gaccagtatc attgagttgg taaattcatt 4080 cccttctatc ttttctgatg ttcctactca aactcatgtc attcaacatg acattgatgt 4140 tgggatggct tcacctgtaa aacaacatcc atatcgggtg aatcctgtta aaaggaaact 4200 ccttgagaaa gaagtggaat atttgcttgc tcaccgatta gcagaaccca gttttagttc 4260 gtggagctcc ccctgcattc tagtgactaa atcagatcaa tcttatcggt tttgtactga 4320 ttataggaag ctcaattcac ttacaaaacc tgattgtttt cctttaccca gaatcgatga 4380 ttgtgtagat cgggtaggat ctgcacagtt tgtgagtaag tttgatcttt taaaaggata 4440 ttggcaagtg ccgttaacat ctcgtgctaa ggaattgtct gcttttatta caccagacag 4500 tttcctacag tacacagtaa tgccttttgg agttcgaaat gctccagcga cttttcaacg 4560 tctggtaaat cgagtgttat cagggatgca gggatgtgaa gcctacctgg atgatgttgt 4620 gctgtacagt tctacttggt cagaacatct agcacaaatt agagaactgt ttgaacgttt 4680 agctaaagct aacttgactg ttaatcttgc caagtgtgag tttggtaagg ctacagtgac 4740 ttatttgggt aaggtggttg gcagagggtg cgttcgtcct attactgcta aagttgaagc 4800 catctctagt tttcccactc ccactactcg tcgtgagtta aggcgttttc tcggaatggt 4860 gggttactat agaggttttt gtaagaattt tgcagatgtt gttgcccctc tgacagattt 4920 attgagccca aaaagacctt tccattggtc agtgcagtgt caatgtgctt ttgataatgc 4980 taaatctctt ttagctaatg ctccggtgtt gactgcacca aactttgaga aaccttttct 5040 tctagcggtg gacgctagtg cttttggtgc tggtgcagtt cttcttcagg aagatgacaa 5100 aggagttgag catccagtca gttatttttc aaaaaaattc aattgtcatc agcaggtcta 5160 ttccacagtg gaaaaagagg ctttagctct tgtgttggct gttcaacact ttgaggtcta 5220 tttgagttca gtttgtggtc ctattgtcat ctatacagac cataatcctt taactttctt 5280 ggatcgaatg cgtggaaaga accagcggat tatgagatgg agccttatat tgcaaccatt 5340 ctctctgctg attaaacata ttcgtggcaa agacaacctg attgctgacg ctctctcaag 5400 agtgtaaaat gtcccattag gttgggcata tcttttgttt gtctttctct ctctctctct 5460 ctctctctct ctctctctct ctctctcgat tctgtctctt tctctctcat tttctatctc 5520 tagtctttac tctgatttcc tagggcccag gagtcgggga gaaaactgga gtgagattta 5580 tggagatgag tactcctgtt attgtaaatt tcaatttagt tggtttatgt tctgtttgtt 5640 ccactagttg tttttgtcgg ttctttaggg aacctccttt tggagtggga gg 5692 // ID TC1DR3 repbase; DNA; ZEB; 1225 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TC1; TDR1; TC1DR3. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1225 RA Jurka J. and Drazkiewicz A.; RT "A non-autonomous DNA transposon related to Tc1 superfamily."; RL Repbase Reports 2(3), 12-12 (2002). XX DR [1] (Consensus) XX CC Related to TDR1, TZF28 and TC1_FR (all variants, see pufrep.ref). XX SQ Sequence 1225 BP; 345 A; 274 C; 305 G; 293 T; 8 other; cactcaccgg ccactttatt aggtacacct gtccaactgc tcgttaangc aaatttctaa 60 tcagccaatc acatggcagc aactcaatgc atttaggcat gtagacatgg tcaagacgat 120 ctgctgcagt tcaaaccgag catcagaatg gggaagaaag gtgatttaag tgactttgaa 180 crtggcatgg ttgttggtgc cagacgggct ggtctgagta tttcagaaac tgctgatcta 240 ctgggatttt cacgcacaac catctctagg gtttacagag aatggtccga aaaagagaaa 300 atatccagtg agcggcagtt ctgtgggcgc aaatgccttg ttgatgccag aggtcagagg 360 agaatggcca gactggttcg agctgataga aaggcaacag taactcaaat aaccactcgt 420 tacaaccgag gtatgcagaa gagcatctct gaacrcacaa cacgtccaac cttgaggcrg 480 atgggctaca gcagcagaag accacaccgg gtgccactcc tgtcagctaa gaacaggaaa 540 ctgaggctac aattcncaca ggctcaccaa aattggacaa tagaagattg gaaaaacgtt 600 gcctggtctg atgagtctcg atttctgctg cgacattcgg atggtagggt cagaatttgg 660 catcaacaac atgaaagcat ggatccatcc tgccttgtat caacggttca ggctggtggt 720 ggtggtgtaa tggtgtgggg gggatatttt cttggcacac tttgggccca ttagtaccaa 780 ttgagcatcg tgtcaangcg ccacagccta cctgagtatt gttgctgacc atgtccatcc 840 ctttatgacc acagtgtacc catcttctga tggctacttc cagcaggata acgcaccatg 900 tcataaagcg ngaatcatct cagactggtt tcttgaacat gacaatgagt tcactgtact 960 caaatggcct ccacagtcac cagatctcaa tccaatagag cacctttggg atgtggtgga 1020 atgggagatt cgcatcatgg atgtgcagcc gacaaatctg cagcaactgc gtgcatgcta 1080 tcatgtcaat atggaccaaa atctctgagg aatatttcca gtaccttgtt gaatctatgc 1140 cacgaaggat taaggcagtt ctgaaggcaa aagggggtcc aaccyggtac tagtaaggtg 1200 tacctaataa agtggccagt gagtg 1225 // ID Kolobok-1N2_DR repbase; DNA; ZEB; 690 BP. XX AC . XX DT 19-FEB-2009 (Rel. 14.02, Created) DT 19-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous Kolobok transposons from zebrafish - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-1N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-690 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 9(2), 549-549 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 690 BP; 204 A; 130 C; 140 G; 213 T; 3 other; aggggaccta ttatgccact tttttacaag atgtaaaata agtctctgat gtccctagag 60 tgtgtatgtg aagtttcagc tcaaaatacc acacaaataa tgttttataa ctctttgaaa 120 ctgccccttt taggctttga tccaaattgt gccgttttgg tgactgtcgc tttaaattca 180 aatgagattg tgctcttttc aaaagagggc ggagctacaa atgcctatgc gtcagcatag 240 yggcagattc aaaacaggac taatgttatc tctgtgaaaa actgaaaatg tcaaaagaag 300 tgtctcagaa gaaggcagtc tatggagact acagtgttca tttgtgcatc ctaaaactcc 360 acatttataa tgcctcttag tcgctgacat tatcttcagc aggtatggtg cgaaaactct 420 aatggcggac ggctgcttct cactcagggc tgtttatgct aatgagggag agatcgtcac 480 taatgggcgg ggctttcccc ctctgatgac acgtacaaag ggagaatgtc aatcaaagtg 540 tttctgcaga ctgtttttat caagtgtgat tataaaaaat araattaatt aatttttacc 600 attagaggct ggytatattc acacactgtt gccacacaac tgtgtttaaa ccccttataa 660 aagtgatttt tgcataatag gtccccttta 690 // ID TE-X-4_DR repbase; DNA; ZEB; 13972 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 21-JAN-2009 (Rel. 13.12, Last updated, Version 2) XX DE nonautonomous transposable element from zebrafish - a consensus. XX KW Transposable Element; Nonautonomous; TE-X-4_DR. XX NM TE-X-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-13972 RA Bao W. and Jurka J.; RT "transposable elements from zebrafish."; RL Repbase Reports 8(12), 2179-2179 (2008). XX DR [1] (Consensus) XX CC This element putatively inserted preferentially into CA CC minisatellite (showing); TSD length is unknown. Sequences CC corresponding to DNA-2-21_DR, DNA8-4B_DR and TDR18 is masked out. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 13972 BP; 4775 A; 2427 C; 2148 G; 4618 T; 4 other; cacacacaca cacacacaca catacttgtt tttgtgcatt gtgggggtca cctgtcacgg 60 tggttacgta atggggaccg ccctttctga tgatgttaga gacataaaaa aaatagtttg 120 ccacacaaaa acacaaacta tttcataaaa tcacttcaga ttttggatat tccgcaaatt 180 tttttttaga cctgacacag tggtcacttg tggggacatt tcaggttttg tgtagaagtg 240 gggtccgcca catacacaac cgatttttag acaaagtggg gaccaattct acacacactt 300 tactggtttt gagacaaagt aaggaccagt tccagataca cattaccggt attgagtcaa 360 agtagggacc aggtctacac acacattact ggtattgagt caaagtaagg accagatcta 420 cacacacatt actggtattg agtcaaagta aggaccagat ctacacacac attaccggta 480 ttgagtcaaa gtaaggacca ggtctacaca cacattaccg gtattgagtc aaagtaagga 540 ccaggtctac acacacatta ctggtattga gtcaaagtaa ggaccagatc tacacacaca 600 ttaccggtat tgagtcaaag taaggaccag gtctacacac acattactgg tattgagtca 660 aagtaaggac caggtctaca cacacattac cggtattgag tcaaagtaag gaccaggtct 720 acacacacat taccggtatt gagtcaaagt aaggaccagg tctacacaca cattacyggt 780 attgagtcaa agtaaggacc aggtctacac acacattact ggtattgagt caaagtaagg 840 accagatcta cacacacatc actggttttg agtcaaagta aggaccagat ctacacacac 900 attaccgttt tttagtcaaa gtaaggacca gttctacaca cacattaccg gtttttaggc 960 aacgtgggga tcagatcagc acccttacta atattaaatt ttaaacctca atcaagtaat 1020 ttgacagtaa actgctttag aattttcatt tgaagagcag gtcagtttat aaaaaatgta 1080 atttgaatac agaaggaacc tcaagatcca aacaggaaat ggtccagaat agatttaata 1140 aatgttatat ttattataaa taatttccac aaaaccttta aacacaaaaa aaacagtaaa 1200 cacaaaatcc atgaaactct cttgcatgtc taaaagcagt tctcattggg tataacaagc 1260 ttccacataa ctcttccatg tctgcataca catttagtta agtgcaatag caaacaatat 1320 taaacggtta gttcacccca aaatgaagat tctgttatta attcctctcc ttcatattgt 1380 ttcaatccct tgctgtctat ggaggatgag gagttcttgc acttcatcta agttatcttc 1440 ctttgagtct gaagatgaat gaaggtctca ggggattaga gtgagatgag ggcaagtaat 1500 taataatata atgtacattt ttgggtaaac tcttaaaacc caccagtatg ttctatttct 1560 gtttctgtcg tattttaact atttgaaaaa aaaaaaacat taaaaatatt taaaaatata 1620 tttataaaga actgaacccc acaaatgcac agagaccttc atcatataac caagccacca 1680 aaactccaaa tagacatcat ttgttaagca ttaactctac attaatagta agcagtttat 1740 aaacacagct agatgctcta ttgttgactt ataagcatgt gtaatgtgct caatttttgt 1800 gttttcatac tttgtttata attcattttt cattactaaa tgaagtattg cattatttac 1860 aaactgtcta agaataattg gtgtttttta agatcattaa gaatgagtaa ataaatgatt 1920 aaaaaactat ataaatgaac atttatacat ttactattca ggcatataat cagggctaaa 1980 aaaaaaaaaa aagaaaaaaa agaaaatcca atcggttcac tctaagcaga atttatattt 2040 tttcggttct gttcttgcgt tgcttaatag acaaaatatt ccgaaaacag ttttctagaa 2100 aaaaaaaaaa aaaaaacttt atttaattac aaagtattgc agggctctag agtgctctgt 2160 ttttttaatg gtgcaactaa aaaatctata cgaggggtaa gattaaatga attttcaata 2220 agtatcatta aagatatatt tataatatga acttgctttt ggttattgat tgacacgctg 2280 tttaaaggtg ctacacacag ccccactcct tggtatcgag cgcaggcccg gcgctacatt 2340 tagatgttgt ataatatgaa atagcaataa aaattatgaa aattgttatg aaatagtttt 2400 tcagttagcc aamctaataa aaatatttag tacactgcac agtgcaatta ttatttaaat 2460 ttattatatc cattattatt aatattagta aaattgacaa aacacagaac tcttttgaac 2520 gtttgctttc attttgacat gctaaagcgc acttttacag ttttgtaaag aaacatccca 2580 ttcatgcagc agcagtcaag ttaggacact taaaaatgcc ttttgtgttt aaagatgaac 2640 ggcctgcgaa gtcaagttta cattgttata aattaaatag atatagccta gtgtgtaagt 2700 gaatttgctg cagatccacc tctcacatgt gctgcaggta gggtaaatga taatcttttt 2760 aaaaaattta ctttaggcta tactgctatt tgctcatgca aaattaaaca ccagaaaata 2820 tatcgtaggc ctatattaag agctaaatta cagttagcat cataattagt tcaattatgt 2880 gctgtaggct attgattgac caacaccccc tgtatacagt ttacccactg caggtctaca 2940 gttctgatgt gtattttgaa ttacttgttt taaatgacag taagacatgc tatatttagt 3000 ttttattgta aaatatacat tataatccag tacaaagtaa aataatattt attaatggcc 3060 tattggcagc gtatagccgt taagcttata gtctaagcat gtcattttat tattattaac 3120 ataaattaca aactagcatc cttatgattt tatttcgttt ctgtatattt ttatcaaacg 3180 aaaaatgcaa tgagttatac tttggtttta attttattag aaatttaata gccgtttagg 3240 caacactact tctacatgtc attggctctg attgcgtttt ttacagataa cctatttaca 3300 aagattttta ttaaacaagt gcactaccac aagatatttt tgtaggtgta aaaatctaaa 3360 aagtatcatt atcatattct tgccttattc atctttaata taaatataaa atttaacaca 3420 ttcattgatt aaactaggga tgctaacgat tactcgattg atttattgtc ggtaaccctt 3480 taaaaccgat agaccttatc gatgaccgat taagcatggg catttaataa gttatgcttt 3540 gctctttgag atgcgtccac gatttcacac attacataat caagtgtgcg cagtttaggc 3600 tacgttaccc tattttaggg agtcgtgcac tctcttgact ttgcatattg gttatgcaca 3660 tattggtcat acctattatg ctcagatctc gtttatccgc tggtgctgcc attaatccgc 3720 tggtcaagta atcaaaaagt aggctacatg acgtttaatt atgggcaggt tattaagacc 3780 aaacattggt tgtgcaaact gtattacgtt ctctaaaatt ttaaagtgtt ttaagctaca 3840 ttatcttttt atagactggc ataggcctaa atacgcatca cggccagaca tttatagttt 3900 atataaaaac gagcgtcacc gtcgcatttg ttcacatgct ttaaattaat gctttttgtc 3960 tcccaaattc atataataaa gagaacgata aaataacatt attaaccgtt taactgtaaa 4020 catttctgtt cagaacgatt aaagttgatc tttttttttc ggttttcgtt tctgttcctg 4080 aaaaatgtca tttgattctg tttttcgttt ccgttccttg aaccggtttg gagccctgca 4140 tataatacgt ttttctgcat gttaaataat gctttattaa ctcaacttca tgcagttttg 4200 tgatctaatc taaagtgagg aatattgatg ctttataaat cccttataaa tgacaattaa 4260 aggctcagaa tcattcattt attcatttat tttcattttg gttttctctc tttattaatc 4320 cggggtcgca acagcggatt gaaccaccaa cttatccagc acgtttttat gcagcggatg 4380 cccttccagc tgcaacccat ccctgggaca catccataca catacacact cattcacgct 4440 catttactac ggaccatttt aggctaccca attcacctgt accgcatgta ggagcacctg 4500 gaggaaaccc acaccaacac agggagaaca tgcaaacttt acacagaaat gccaactgac 4560 ccagccaggg ctcgaaccag cgaccttctt gctgtgaggt gacagcacta cctactgtgc 4620 cactgcgttg ccaggctcag tatcaaataa acaataagta tttgcaatcg tacctaaaca 4680 aatacaatta ctgtaaattt taaacattgc tgaataacag gagtgtcgaa atacaacatc 4740 atttgataaa acaacaatat aataatttaa caatacatta cgtgtaaagc aattttatat 4800 ttagacaaga ttgcaatgat ttacttttca tttgatacag tttcatttga gcatcttgta 4860 ataagtagaa attaatagtc atttttttcc tgtatataat gtaactgagc cttatattgt 4920 catttataat ggatttataa agcatgaata gccctcgctt tagattagat cacaaaactg 4980 catgaagttg agttcataaa gcatttatta acatactaaa caacttatta tatgcctgaa 5040 taataagaat tatcaatgtt attttgaata gtttattaat catttacttg cacattctga 5100 attatctaaa aaaaaaaaca ccaactactt ttaaatatct ggtttgttaa tgatgtattt 5160 ttcattacta aattaagtat tgcattattt accaagtatg aaaacaatca ttaagcacat 5220 tataattgtg cttataagtc aataatattt gtaatttata tttataaata taatttataa 5280 actgcttact aatgtctatt aatgtagcgt taatgcttaa caaacgttca actaactatt 5340 tgctaatgtt tcataaataa ttcattgtgt gcagttatta tagtgttacc caacatttat 5400 atatcttatt aatcaggcat atagtaatgg ttactctgtc taataataaa tgctttatga 5460 actcaacttc atgcagtttt gtgagctaat gtaaagtgag gactattgat gctttataaa 5520 tcccttataa atgacaatta aaatctcagc tatattctaa acagaaaaaa agaaataaat 5580 taaaaggggg agtgatttat agctttagag tgatttatat atatatatat atatatatat 5640 atatggtgaa gcagtggcgc ggtaggtagt gctgtcgcct cacagcaaga aggtcgctgg 5700 gtcactggtt tgaacctcgg ctcagttggc atttctgtga gaagtttgca tgttctccct 5760 gcattcgtgt gggtttcctc cgggtgcaaa ggcacagtcc aaagacatgc tacaggtgaa 5820 ttgggaaggc taaattgtcc ctagtgtatg agtgtgacag gcccatatga caggctggcc 5880 ggaaggtgta aaaaagtcgt tatcatatgg acaagtctaa aactgaagga cttgttccaa 5940 aaagagccga ccccgcaatc atacgggact aaggccaaag aagatatata tatatatata 6000 tatatttttt ttttttttca ttaatttcaa gatacacaaa tgaaactatc taatgaaaaa 6060 taaatctttg caatcttatc aaaatataaa attactgtac agtgcaaaca tcacagaaac 6120 actgaaatga atattgtcat attgttaatt gtgttgtttt gaaatgatgt tgtattttta 6180 ttttgcaatg tttaaacttt acagaatttt tttaataaga ttgcaaagat ttattgttca 6240 ttagatactg agccttaaat tgtcatttat aagggattta tcaactataa acagtcctta 6300 ggtcacaaaa ctgcatgaag ttgtgttatt atagcattta ctaacacaca tggttacaat 6360 tactatatgc ctgaatataa catgtataaa tgttcatttg aatagtttat taatcattta 6420 ataactcatt ctgaattatc ttaaaaacca ccaactactc ttatatacaa atagtttgta 6480 aataatgcaa tacataatat agtaatgaaa aatgaatcag taacaaagca tgaaaataca 6540 attaagtcaa gtcaagaata cagcgtttga agctgtattt ataaactgct tcctaaagtc 6600 tattaatgta gagttagtgc taaactaatg atgtatagtg tgtagttact atgaaagtgt 6660 tattcagatt ttttatctaa agagccgaga ttcagttttt aatataaaaa tccaaagctg 6720 accaatctct ggtttttaga gccaatggtt cagctttcac cacagcatgt acttcctgtt 6780 tgctccagga tcttctcttt atgggaccta tgggtggtta aaatataaaa aagtggtttt 6840 agttgcataa atgttcagtc atttttacaa catgtgctgt tatgttagtt taatattgtg 6900 atagaaaatg ctagaaaaat atgtcttgcc tattattaaa aatacttaag atggagatgg 6960 attaaaaaat acttcctatt aacttcaaaa ctggttaatg gggttacact ttattccata 7020 gtccagttta gacattcttc taactatgcg taactctgca actacgttaa ttatctcacc 7080 gtaaagtatt atgtattaga gtatttgcat actgttagaa tgttcaataa gactaacgct 7140 catctgatta tcagtagagc agtagagtat tagcagactg ttagctcaag tttagctcaa 7200 aacactacac tgtaacacat atctgttcat taccagttcc tgtattttgt gatttacaag 7260 tgtttttttt tttttttttg tggtggtgaa ttgcattatg ggatgttgat ctctgctctg 7320 tgcacttttg atgttgaaaa ttcaactcta cagtttaaca aagtgacttt tactgacatt 7380 ttagtagttt gaaataatat aatgtataag aaataatata tagagataaa taagtcttta 7440 gaataacaga aaatgtaccg gcagtttatt acaaggtttc tgtagcataa tatacaacaa 7500 caacccaaac aaaaagtcac ttttttttac tttttaagat taaaagttgt tggaagaaag 7560 atcaagatcc cataatgcaa ttcaaaagca gaaacaaaca gatttcataa aaacatgaat 7620 cacaaattat gaaaaactgc taatttaatt attattactt atttttatta attttagagt 7680 gtagagtatt agctgacagt taaggttacc tcaataaaac caactctcat tggtctcagt 7740 aaatagtaca gcagactgtt aggttacgct caaaattagt acaatatgac taactcttat 7800 ataaatttag tagagcatta gcagtgttga ttctaacaga cagtctaatg gctgttagtt 7860 gacaagtagt tgcagttact tgtagttagt agtagtagta gtagtaaagt aaattatcaa 7920 aataaagtgt aaccagatat gcaattagta tatttttcca aagtatttag caatattgtt 7980 taactaacat gaaatgagac atcaagtgca actgtactga aatcatcatg gaagtaatgt 8040 gcttctgtcc tgttttaccc tacatgttat gtaaatgagc ttgtgcattt ggacttgtac 8100 aataacacaa gtcagtcaca aattaacttt aatgatcata cctctctttt ttgtgaatgc 8160 tgggtctgct tcactatctt cacagacact gtaccatggt gatgatactt gaggggactc 8220 tgaaagtaaa aatcaagtct gagagttgac ttctctgcat aaagcttacc aaaatattta 8280 tttgattttg tgttgttctt tttaatgggt aaaagcgtgt gatggtagct taaaaatagt 8340 tttcaaacat agcacataca tatcatgata aaaataaaca ttctgaaaaa agaaaaattc 8400 tgatcagaaa aataaataat atttttacct tttagattgg acttcacctc ctccaaaatc 8460 acgatccact gtaactttct ctgaatgtaa aaacccacac aaaaatacat catttaaaac 8520 aatatggtat atgtttgcaa atacaaatgt gtatataaca tccagaaaat ctggcactgt 8580 attacatgtt tgtgtgtaaa cccacgattt gatcaagatt ccacccttta aaatctgctg 8640 agcgtctgtt ttcaagtccc agaaaaactg agttgtggga acagagcagt tctcttctgc 8700 agtcatacag tcataacaca cacatacata tgaagagttt ggttgcaaaa tgcgataaac 8760 ggcatttttt gacatttgga ttttattatt ggaaatcact gttcagatgt accttaatct 8820 atgtgtgaat tgctgccatt aataacattt aagaatgtat acagttgaag tcagaattat 8880 tagcccccct gaattattag acagcttgtt tattatttcc ccaatttctg tttaacggag 8940 agaagatttt tttcaacaca tttctaaaca taatagtttt aataactatt tctaataatc 9000 atttctatat atatatatat atatatatat atatatatat atatatatat aaaagtatat 9060 acagttgttt tttcttctgg agaaagtctt atttgtttta tttcagctag aataaaagca 9120 gttttaattt tttatgaacc attttaaggt caatattatt atccctccaa tgtatataat 9180 taaaaatata tattttatat ataattttat aaatatttta taaaaaatta ttttatatat 9240 ttttatatat aatttataat ttttattttg aaaaatataa tttagacagt agtactcaag 9300 ggtccagggg tctgttcttc ggacctcgct taaataatct aagaatatat tacagatcct 9360 ggatctttta atcttgataa ctcatcttag gctaatttgg ttcttcaaaa aagtttgcga 9420 atcagattaa actgatctga gaggtgttgt gactgtgtgt tgtgacagat ctatcgatcc 9480 tcaaaatcat tatcagcaat acaacgattg gctgacggca cagcagcgtg atgacatctg 9540 attaatggac agttatcaaa attacatgaa atccgtagta aacggttcgt taaatctgat 9600 atgcaataac agagttgtga gtcacgtgct ggagcagagc agttctcttc ctctgtcata 9660 tggtcatcct tcaataaaag ctgaatataa aaaaaatata tattagttag acagcagtac 9720 tcaagggtcg cggtccccac tatgtcagac cttactgtgt gtgtattcag gactaagtgt 9780 gtttcacagc ttctctaaca tgcctctggt ccccacaatg tcagtgcata tgttcaccaa 9840 ttaggacaca cactctgttt catttctata ttttgattaa gttaaataat tacaattaaa 9900 tacaccacta ctacaacaac aactactatt actactacaa ctaataatat taataataat 9960 aataataata ataataattg ttattattat tattataaac acacctcatc agagttgtgg 10020 gtctcctact ggagcagagc agttctcttc ctctgtcata tggtcatcct tccatccttc 10080 aataaaagct gaataaaaaa aaaaaaaata tatatattat atatatatat atatatatat 10140 atatatatat atatatatat tagttagaca gcagtactca agggtcccgg tccccactat 10200 gtcagacctt actgtgtgtg tattcaggac taagtgtgtt tcacagcttc tctaacatgc 10260 ctctggtccc cacaatgtta cactagtgca tacgttcata aattaggaca cacactctgt 10320 ttcatttcta tattttgatt aagttaaata attacaatta aatgcactac tactacaaca 10380 acaactacta ctactactam taataataat aataataaca acaataataa acacacctca 10440 tcagagagtt gtgggtctcg tactggagca gagcagttct cttcctctgt catatggtca 10500 tccttcaata aaagctgaat ataaaaaaat atatattagt tagacagcag tactcaaggg 10560 tcccggtccc cactatgtca gaccgtactg tgtgtgtatt caggactaag tgtgtttcac 10620 agcttctcta acatgcctct ggtccccaca atgttacact agtgcatatg ttcaccaatt 10680 aggacacaca ctctgtttca tttctagatt ttgattaagt taaataatta caattaaatg 10740 cactactact acaacaacaa ctactactac tactactaat aataataata ataacaacaa 10800 cattaaacac acctcatcag agttgtgggt cacgtactgg agcagagcag ttctcttcct 10860 ctgtcatatg gtcatccttc aatataagct gaaaataaaa aaatatatat tagttagaca 10920 gcagtactca agggttccgg tccccactat gtcagaccgt actgtgtgtg tattcaggac 10980 taagtgtgtt tcacagcttc tctaacatgc ctctggtccc cacaatgtta cactagtgca 11040 tacgttcacc aattaggaca cacactctgt ttcatttcta gattttgatt aagttaaata 11100 attacaatta aatgcactac tactacaaca acaactacta ctactactac taataataat 11160 aataattatt attattatta aatttaaaca cacctcatca gagggttgtg ggtctcctac 11220 tggagcagag cagttctctt cttctcgtca tatggtcata cttcaataaa agctgaatat 11280 aaaaaacaaa aaataataat tagacagtag tactcagggc tccyggtccc cactatgtca 11340 gaccgtactg tgtgtgtatt gaggactaag tgtgtttcac agcttctcta acatgcctct 11400 ggtccccaca atgttacact attgcatatg ttcataaatt aggacacaca ctaagttaaa 11460 taattacaat taaatgcact actacagcaa caacatctac tactactact actactacta 11520 ataataataa tactaataaa catacctcat cagagagttg tcggagcagt tttcttctgc 11580 agtcatatgg gcatccttca atgaaagcta tttaaaaaca gctttgttat ataagtcatg 11640 tcatgactat aaagctttca tgacagtctt atgaaccccc ctttaaagta aagcattacc 11700 caattaatta aactttttta cccattaaga caagaagaag atattaagag attgatttca 11760 ttatctctta tggatgatgc aaactcttta tttttgaaca gatttaccct taatctatat 11820 taacattaaa aaactcttag ctttgctgaa tgtattaagg attaatgtat taagtgatca 11880 aaatgtcaaa cctttctcca aggccaatca catgcgccat agtcaaacgt tatttcctct 11940 ccaaagaaaa tatccctgat tgcaaaaagg cacaaatgag gttttctctg aacaacgatc 12000 ttattgacag tgcagttagg gtgcagatcg tcctcatgat cggcatcaat gctgcaaaac 12060 taagagaaaa gagcaagcaa cattttaatt atgccctctc atagcaataa aaagctaaac 12120 tatggcatga atacaaacac aatgaatata atatatacat gtacatttga atggggaaaa 12180 cacttaacca cctgttcttg ttttaccata taaagtcaaa cataaagaca gtatttttat 12240 taatatctca tctatgcttt gcttccacaa gctctatcat ttgtactcaa ccacaaaact 12300 tcctttaata aatgtggaca acgcaaactc tgacctaaaa gatttaatga aacgtgtagg 12360 gttcaaaaac tacaaaaacc tcacattagc atcctctctg tgtgtgaatt ataaatacag 12420 taaacctttt gaagtggatc agaacctttc ctcaatgttg ttctaatact attgaacacc 12480 tgttcttctt tcttaggaca gttttaaaga actttttttc cacttcaatt gtttacaggt 12540 cacactttac aatcaggttt tattagttaa tgtacttact aacatgacct agtaatgaac 12600 aatgcatgta cagcatttat taatcatagt ttaacattta ctaatgcatt attcaaatct 12660 aaattcatgc ttgtaccagt aataatgcat tgcaataatt aatgcattgt gcgtgttcaa 12720 ctagtaatga acaactttat tttcattaac taaagttaac aaacatgaac aaatactgta 12780 ataaagtatt atttattgtt catgttagta aatgcattaa ccagcattaa ctaatttaac 12840 cttattgtaa aaggctacca gttgacagca aataaataaa taaaacacac acagtgaaca 12900 tttatgtact tttatatatg ttttacacta agttggagtt tattataatt atttatctaa 12960 aataaaacgt ttctacacgt catagaacaa taaattaagt acatttctag gaccagatag 13020 ccccacattt taattataac ctctatatta ctacattaac gtataaatag tatataatag 13080 tgttatttca ggattaacgt taaatgttta aagcaattaa ccatttaaaa agtacgatac 13140 aagaactatg aaaacatacc ttagtaacct tctatatatt cggtttccag tgaatgggtc 13200 ttgtcagtgg agctggtgat gtgtttctcc gcatctttta atggttttat cctccgtcga 13260 tccatttagc cgcccggtga ttggtgactg tctgaccgcg gctcatgcac tccaccccgg 13320 cctctgactc acgtagcacc gggcagttag cacgttagca cattagcgcg agtaactttt 13380 actcgtgtaa acaagttaaa ttacataccg aactgtagtt tgtgtcttat cagtgttata 13440 ttaatacgta atttgaaaca acatcgatta aaacacccag gaaattcgtt aaaacattta 13500 attaactata cctgataact gtcaaaatat gcaacataag tatttgtcat ataacctatt 13560 ttattacatt atgatggtga aaaacattct aacttaccca aaaattcgaa taatttgatg 13620 atagatctta attaagttga ttaaagtcgg attcatttat gagattgcgc atgtgcagta 13680 ggcagagtgt gtaagtgtgt gtcagggtgc agtaccgcca tccgactaca ggaccgctat 13740 agagcagtgc gccgacacac acttacacac tcaattcagg taaatggtca aggtggggat 13800 cagtgaaaaa actgcactag gcatatcagc aacgccccct gagtctttta aaggcaaaac 13860 atagtgggga ccgggcgatc ggtccccact atgtttttgg tccccacttt gtgagtgtgc 13920 tatcatgctg aggtccccac catgtaataa aaacaaacac acacacacac ac 13972 // ID DNA-TA-6_DR repbase; DNA; ZEB; 972 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 02-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD TA; KW DNA-TA-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-972 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 535-535 (2009). XX DR [1] (Consensus) XX SQ Sequence 972 BP; 357 A; 112 C; 146 G; 355 T; 2 other; cactgaaaga aatgactttt trgtgtagtc aaatctaata aattgactct cataattttt 60 aaattgtaat attaagtgtc aaggataact agaattttct aagtacatat ttaatattta 120 cacatttact tcatgtaata aatgtaaatg agtggaaaga gttaaatatt aatattcatt 180 tacttgtaat atttaaatat tttctaatcg tcgcaaacct agtaatatta agtaagtttt 240 aataatggat atgtcgcaat ggatgtgtag ttcccagcat gctttgcaag ggattgaatt 300 aaaagaaaag tattttgaaa ataaaagcaa tcttaggtgt ttaaatttga gagaaagact 360 tgttagagtt tgatcatcat tagtttgtag attaagaaga gttttgagta tgtgctttta 420 gttttaaggt taccaccttg cagaagagca gcactcatgc tttgggtctt ggcattttta 480 ttatcagcat taatgtttga cgcttcactc agaaaatcat agctgtgcta acactagckc 540 tgacatcagt aagcatttat aaagtaaatt tgctcttgct aaataaattt gcttaaaata 600 aagtgaaata acacatttca agtcagtgga aatgtgtgtc ttaattaaac caagttgcac 660 caaatgtact aatttgtaac taataagatt tatttaaatt gataaaagca atgttccttc 720 tgctaaaaca agaaaatgac attacttggt tcatttaaat aaataaagac tcaaaagtaa 780 agataatcaa ttaactttta cttaaatgtt taacctaaca ctcctaaatt aaaaattaag 840 taaattttac atgatattga caaagttaat tgtacctgat ataagattgt aaaatttact 900 tgaaatttgt ttgtgcaaat tgttacgagg atttttttta aagtaaatct taagagttat 960 tttttttcag tg 972 // ID Gypsy151-LTR_DR repbase; DNA; ZEB; 884 BP. XX AC chr14; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy151-I_DR; KW Gypsy151-LTR_DR; Gypsy151_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-884 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1564-1564 (2008). XX DR Genome; chr14; Positions 14029165 14030048. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 884 BP; 277 A; 126 C; 186 G; 295 T; 0 other; tgtgacaccc caaatattgg tttgccacaa ttgaggatat taatgatatg catgtgtaaa 60 tgtataaagg gaatggtggt tccgccagag cgatagaggg cgctaagaga gcacagtgtg 120 tgattcagcc ttttctggac tgctcgcgat agcccagtaa catggagagc tgatcacacg 180 catatatttg tcttgtgttt attccaccat ttaacaggtt agttgactgt tctaagtgtc 240 aatctgtatt gatatggaaa gtatacatgt ttgtgcatat taagtctttg gaaaagattt 300 gtgtttgatt acatttacat gtttaatgag ggtttaaaat cgcgatgttt acccgcgatt 360 taaggggatt tatatctact caaatagttt aataaaggtg aatatgcttg tatatacatg 420 tatttcagtt ccaataatgt ttagtttgct ttaaaaagtg catgtgagtg atctatatgg 480 gcaaaacatt cagagaatgt caagtttaat gcatttttgg tacagataaa agccaattaa 540 gtttatttga ttcatgtgac actaaatagc gtgtgaatac acagatgtac tacacagcta 600 ttggctgatg aatatatttg tcaaagaaat gctgtttatt taaaagaagg aggctgaaac 660 agtcaatatg tataaacagt aaatgctata gcaatgtatt ttagatattg caagtgttgc 720 catttaaatt tgtttgatgt gaacatgttg agaaaaataa atatagccca gtaacatgga 780 gagctgatca cacgcatata tttgtcttgt gtttattcca ccatttaaca gattgctggg 840 atctttaaac tgtcagctga tacaggagtc ctggacccgc taca 884 // ID DNA-TA-4_DR repbase; DNA; ZEB; 9795 BP. XX AC . XX DT 08-DEC-2008 (Rel. 13.12, Created) DT 08-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD TA; KW DNA-TA-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9795 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2156-2156 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 9795 BP; 3180 A; 1714 C; 1666 G; 3234 T; 1 other; cagtaggtga gagttcaaag tgactattac cgccgcgtct tataccgccg ccgtttgaaa 60 taactgccgc tctgttttta gtatatgaca gcagcttgtg aatgagccgc cagggggcgc 120 aaagggacgg gatgcgaacg gacagaaata gcctttacag cagctttaaa tatgctactg 180 tgcttgtaaa tgagattaaa caataagtca aagcattata attccttcat taatcattta 240 aaatttgtta acacacttta ttgtaaactt tttaagtgca aagaggacta gttttggaaa 300 atagaattga agaaataaaa gacaactaac atgaaagcac aaacattcag tacaaataag 360 tagtcttaaa taaataaata aagcagtctt ttagcctaaa tatagagaga tgttcagtgt 420 ttgctatttt gataggacta agacaagaca ttttcattta tgtttttatt tctgtttttg 480 ttttaatttt cgttgttgat tatattttac tatcttattt tatgtctcaa caattgtaca 540 taataataaa cgaataatga aaactaatta ataatgaaat taggcctaaa taaattattt 600 atttaaagat attgcggcga aacactgaga aacggtcagg ggcatggtct aaagagctta 660 agttgaatgc tgcttcatca aaagcaaata aataaataaa taaattgacc taccttaagg 720 agatcactga aagtataata aaacaaattt tgccgcagga cataacataa attaagtctc 780 gcaggtttat ctttaataat ttagtttcag ttcagttttt tttttttttt tcaaaacgtc 840 aatgttctcc tttaaagaag ctataactgt tgttttgttt tttaattaat ggctcagtat 900 gtttggtatt ttaatttcaa tttcagtcac cttgcatatg cgtgtgaaat ataatgccgc 960 ataaccgcga aaaaagaaga aaaagctgtc agttagacct aattcatcaa aatgaactgt 1020 aataaaatac agcatgtatg ttaatactgg cagaatgtga acaaactcaa ggtaaggcta 1080 agatatgtgc ttgctttttt aatgcattta aaaagatttg cggctactat ggtgtataat 1140 aatgtttttt tcttcctttt ttgtatggta aaggacaatg cacattgtta ttgatgtaaa 1200 cttagactaa aacttaccat tgataaatgt gacctagcct acttcaagca gataacttaa 1260 agtataataa aaataatttt gccacaggac ataaataaag tcccgcaagt ttagttttaa 1320 taatttggtt tcagttttgt tcagtttttt tttctcaaaa catctgtttt tctttaaagt 1380 aaaattgttg ttttttgttg ttgttgttgt tttacttgta atggctctct ctctctctct 1440 ctctctctct ctctctctct ctctctctct ctctatctat atatatatat atatatatat 1500 atatatatat atatatatat atatatatat atatatatgt aattatttaa taatatataa 1560 ttataaatat attcttaata aacttcccag ccacaaatat tgaagaacac aacttgtatt 1620 aaaactgggc ggggattaga aagaatacaa tgcttgttat ataatttaaa atgttattcc 1680 tcttgttcaa tttctgcatg cacatttttt cacccgctac tctgccagga acttcgccgc 1740 tggagagcac cttcaaaaat ctcaatctca tggctcattc ttcacgaata tagaattaaa 1800 ataatggcgc gataggttta ttgtgcatcc cgcgggtgca accaacaaga gttgtgcatt 1860 ttagtttaac tttttgagcg ttaaaagcag ttctgtgtta gtttttattt aaatatatca 1920 agccgaaact aaacagcgca ttctctccgc ctcctcgttc ataaccagcc sgacgcactg 1980 ctgaagcagg caatagagac actccgtaat tcataatctt agctgatgtt tcggagtcga 2040 aagaatatat taaagagaaa tgttctttta acagtcccga tatgtttaat cttttttttt 2100 cctgtcattt ctcttcagca aattatattt ttttaaagct gcttgattct tttaaaaccg 2160 aaagcagagc ccggcaattg gtgtttttcc ataagatacg ctttgtgaag tttttaatgt 2220 gtttttaaat agttttattc tattcaaagc agcacctata tgccaattta tcctaaaaac 2280 gcggcatagg agccatgcat gtcatgtgtc acataactgc attttcctac gcgtgcgcga 2340 tcaatttctg atccttttgc ataaaggttt ttaatcaaaa aacaggtcat ttaattactt 2400 tcgatgtgct aaaatatttg ttagacgaat gttattttaa aaaaagcata tgcacatatt 2460 acaattgtaa aatagtctaa aatgcatggt ctgatcagaa ataaaggaaa ttagttatat 2520 ttcattattt aatgcataaa ataggctagt ctttttataa attgagttga attgaattga 2580 attgatttga aacttttaat tgcattttta cgtcctgtaa ataataaaag ttgcatcaga 2640 ttgatgaaaa cgaatcttaa tacctgcaaa cagggccaca gttacttctt ttttcactga 2700 ctggcatgaa tggcaaacgc gtcagaaaaa aactgctgaa tctgcgcgaa tatttggtta 2760 acttataaaa caaatgtctg gtttagtgat gttagtaatg catagaaaaa tgcaaagcag 2820 aattctccta agctcattaa cccatgcact gatgacagcg agggaatcac tggcccacca 2880 ttttatttat taaaggtatt atttcactga cctgaacata accatttata atataattaa 2940 taacctatcg tctaaagcag gcaacactct attcagctat gttctgtgtt tttacgtaaa 3000 aatatattcg ttttggcgaa tgcttggagc gtaatataga tattttaaat aacacattaa 3060 gttcagttca aaagttgttg tgcatagaaa cattcatttt cataaggccc catatttaat 3120 gcggtttcgt ttttaaacgc ataggttttg ttaagccatc cgtcctattt ttggattttg 3180 tgtattcatg tttatgaaaa acacttaaag agtggagacc ccttccccct attcataacc 3240 aaaagcttgt ctgttaggtg ttaatgggca tcagtgaggc tgaaatcatg tcgcgttttc 3300 actttcgggc tagtagcttg cagcatgaca cgcaaaccct gcccccagaa cgtccccgga 3360 attagatgtt gcatattgta tctgtgatct taaggaaagt gttaagtaga ctatttaagc 3420 acagagcgag taataaaaca gcatatattg cgttgcgctc agcagctttt cactaataaa 3480 gctcttcatg taaatttcat ttttcaattt tagcgcgtca tataaaaaca tcaatgcttg 3540 tttggggaca cagaacacac ttatttttgt agtttcccct gatccgtttt taaagtggca 3600 ctgcacattt ggttaactga aaaaaggtcg cagttcatgc gttcactcac cccaaaaatg 3660 ctctgtttgc atgatttgat tattatcatc gtatcctaaa taaataataa ataaataaat 3720 aatattttaa accttctctg gtgtttactg tttttagaga atatataaaa tctttttttt 3780 ttaacaaagc ctttgtaaaa tggctttgtt taatatggct ttaaaggttt catttatgta 3840 ggctataaag gcctaccttg ttgcttttgc ttgttttata atggtttatt tatttcatgc 3900 gattttatta tatcagatat ttgtaattct ttaaattata tttcaatatg gcttaggcta 3960 ttttataaag atatgacact tgttttaagt ctacaaaagg gtacacgtaa agttttatgt 4020 ctttctgttt tattcatatt gtgtattatc tgcaaaataa ataacaaaaa gaaagaagct 4080 gctcgcagct taccagagct gtgaagggag cgctcatttg ctcaatctct cacacacaca 4140 tatgcaggca tataaacacg cacaaacaca ccgtttaaac ttgacaaaaa ctctcgaaaa 4200 cctttagtct gctgtgtctt taatatggtg taaagattat tatgcgttat tgaaacatcc 4260 agggggacgc agcaaagtca cttataaatt aaaattgtat tattttatgc aacagcctta 4320 catttaaaac ggaaacataa cggcgattat aatcaaaaag acctctcaac ccataaggct 4380 agatgttttc tttcccttat ttattaattt atttgttcat gtgtttggcg catgtaactc 4440 gtgtgccttc ggaaaactag tgtaatgacg gcaagccatc tttcccagac ttggggcaaa 4500 gtccgcctct cctttcactt cgccccccaa agccccagct ggccctcctg gcatcctctg 4560 cgtaaaacat atgctggata agttgacggt tcatttcgct gtggcaatta cagacttata 4620 aagggactaa gccgaaaaga aaatgaatga aagaatgaat taataataat aataataata 4680 ataataataa taataataat aataataaaa taaggtaatg atctttacac agatgccgcg 4740 tacgtggcat tttttatttt tagagatggt tcatttgaat tgcctctgcc acttccccca 4800 cctcaaacct gaaagttgca gagagaacta tagcctattt acaccttagc ctcaatagca 4860 acagttcata gcccatttac accaagacag cgcaagttta gcggaagaat gtcaaatcag 4920 aatcacgcag atgaagcaca ccagctacta ctatcatgaa actattaaaa tggcattggc 4980 ataactttat ttattttcag cagctttttt tcgctcaaca gaaatgcggc gttgggaagc 5040 atttaataaa taaaagcagg gttaatagtg acgtgattca aatgatggat ctgcgacaga 5100 gaaactatag gggttttggg aaacactcac tactatatcc tccccccccc ccaaactatg 5160 catattcagc gcaagttact gtacggcttt gtttgggaaa gcacaggtcc gctggtgccg 5220 cgcgcctctt tctctgggcc gaagcgagat gattttaaaa atattaaaga tatagattaa 5280 ttaattattc agcaaaattg acatttattt catatttcac tatgcatttc atctgatagg 5340 ctaatcttta tagcttaatt aaacacatat tgtcaaaacc cacacgtgta tgaaaatgaa 5400 aatggcattt gttattatta gactccgatt tgttatttgt tgcctaggct attttcacat 5460 ttgaggtgct ttattggcat gacaaataac tgtacattcg tattgccaaa gcagtgcagc 5520 gtcactgcgt acaaagacag tgcaaaaagg gcagtagtgc aaacaattat ggatataaca 5580 tataataaag aaaataaaat aaaaaaattg aataataaaa ataaaaatag aatcatgata 5640 aaggtaatga tagtagccta ttatcaaaag taagtccaca aataaagagt tcgagtattt 5700 gtgtgttgga gacagtggag cgggtattgt tagttgaacg cacaacgcgc tctgtgctgg 5760 actttacacc tgctttgatc tggtgcaggc ataaatgcgc atgcggtgga ggcgcactct 5820 aaactcacac tggagaaaac tggactagtg gatatcacaa agacacgcag tttaatttat 5880 ttcatgtttt gcttaactaa ttctttgaat aaaactattg agctataaag gggaaaaacg 5940 cagaagaaat ttcccgttac attaatttta gcacagctgc cattcatttt cacattagtt 6000 tttgagttag ggattaatat taaagtagcc tagtaaaaat gtctatacta taaataatat 6060 tataatttat tatatccatt attattatta ttattattat tattattatt attattatta 6120 ttttaattat tattattaag ctatacaatt gacaaaacgc agaacaaaat atttgctttc 6180 attttggcat agctaaagca cacgtttaca gttatgtaaa aaaaaaacat ccccttctac 6240 agcagcagtc aatttaggac acttaaaaat gtcttttgcg tttaaagatg atcgacctgc 6300 gaagttatgt ttacagtgtt gtaaaagaaa aaggtaggct atttagtgcg tgtggattta 6360 ccgcggatcc acctcttgcg ctgcaggctg aataatatag gtagacatgg ctttctgttt 6420 ggctacacta atgcctaaaa atgtaactaa gtaacacatt aattttagaa actattttta 6480 aatggcattt aacatttaaa tttatttttt tatttaatgt aacatttaaa tataatttaa 6540 tttaaaacat ttattttttc ctcgctgatt ttggttgagt aataaagcgt gatgactcag 6600 atatgatcta gtttagcttg catgtgtggg cgtattttga cgtctttacc taaaacttta 6660 cacatttata atgtttaaaa taaaacggaa agaaataagg agactaacat aatttaaacc 6720 acttgtttgg tgcttaaaca ttatctttct gcattaatgt aaacggtatc gttgccttca 6780 ttaataaaat cggtaatagc aaaactgcat attattatta ctataaaaag taggactata 6840 tcggtaatgc aaagtaatga attaataaat agcctaaata aaaggaacaa gaaaggacca 6900 aattcagtcg tcaagtgcaa tgggattgcg cccttgcgcc ctcagttatg atatcctgcc 6960 gccgggcctg ctatagcgaa actttattgt ttgagaagaa acaaaagcgg aaactgtgga 7020 tgaaaacctg gctgggaaga tggtgacaac atgaattttc tgttcttcaa cgagaaaaaa 7080 aaaatcgcag acaggacaca atgacgccgt ttattataca acgcaggtcg tgtttacgat 7140 cattaatatt aacaactaac attaatatca tatcaataac tcctcttgct ttagacttta 7200 ccttttagag cttgttcttg ttggcgcaat ataagtttat tgcatgtata gggtacgaat 7260 cagattatga agaatgatta agagcatgct tttccttcat tgcatccttt ccgtaaggtg 7320 ttttatttat ttttttattt aatttattta tttataattt ttttaaataa tgatgacaat 7380 atttactgtc taaacattcc tcggtatact accaacaaat tacgcccctg tacccaagta 7440 gcctacctaa gttaatttac atttcatgta aaaattaaga aataaaaaaa gaccaaagta 7500 ttatgctaaa tattcgtaca aacgaacatg tttgtttttt aactttcgtt gataaaacaa 7560 cacaatacag cctaggctag ttaaatatta aataacttta actgcttgaa acagtaaacg 7620 ttacatttta gaaacttctc aaatactaat caataataat ataaatagta acgaagaaca 7680 aaaatattag cctataggct tgtaacaaca taagtcagat gttggaaaaa aaaactgcgt 7740 ctctctgcct ttttttctat tcatatttta aattatttgc cagaaaaact aacatgttga 7800 ctttgttcgg tttaagcgga gacttatttg agattacaat gtttccagca gcactgagac 7860 tgtgtgcctg tagcccatac agatactttt tcgcaaccgt ggcaaagaga gtatcttgct 7920 ccgccgcctg catttctcgg aaaccagctc tacattgtgg atcggcgcgt tctcaactac 7980 caaatatgct gctccatttc tcatcaacgt ctactattta tcagtttctc ttttgtattt 8040 gcccattttt gaaaaggcct ctctaccata cctggctgac tagctgggca tttgcgttca 8100 gtagatgaca ccggagctgg ctccgcatgt catggctggg tgacattacc acagatggct 8160 atggaggttc gacgtgtttg tccatttaac acttattttt ctaaacaagc cctacaaata 8220 gcttcatcct aattattgat tcgccttact cgttcgccac gaatccaaaa tgtttccaga 8280 ttggcgatgt aatataagtt ttccagcaag gttcatggct gaacttgaac ttgctgtatc 8340 acgtctaagt taacgttatg ttcataaatg tataggctat ttggcaacaa aacagaaaat 8400 gtgtgttcta gagtgattgt cgtcacggtg accgtcacag cccttacata tgacattgca 8460 tacatttgca tacatgtttt aagctataaa atgaaagcct cagtttggtg cggagttatt 8520 attatacaaa aatcaagaaa atctttggat ttgtttgatt cgtagattgt aggctatttt 8580 aaaagctgat attatataag aactattaat aactgtaatt ttcaaaaaat aaaaaaataa 8640 ataaaaaagg ggtttgttgg ttgtagccca ccctaaatca tcaacagcct gcttttttta 8700 agtttatgaa gcggctccat acagcacaga ggtgatctgc atcagcgctg gttattgtca 8760 acactgtgtg aaactgtgac gagaaatatc cttctttcat tttgcttctt tggcaaaatg 8820 tgtctgtaga aacgtgtgtt tgcatgtgtt accatcccaa gagttaacaa atatttgttg 8880 tacatttagg ggccgatcac aaagcgcacg cctttacgtt gcaaaaacgc gcgacgtacc 8940 gcgctgcctt ttatggttga caagaaaaaa ataagcctct tttgtatgtc actagaaaac 9000 gactgaatta gctggtgtgc gcgagtcgag ttgctgtcga tgttataatt gtaattttaa 9060 taatattgtg atatttaaga tctataaata cagctctgga taacttgaaa cagcgaccac 9120 aagtctctcc tcagtcttta aaagtttatc gtactcacat tgacaaccca aaccccgctg 9180 tcacctcaac agaaagcctg cagtgctttc atttgactga acaatgaaac agaggtaaca 9240 cgctttttcc actcgagcac acagcgcgca acggcaaaac ttaaagcgcg ctgagcaagc 9300 ataagcagag cgtaaaacac tggcggcagt cagtaaacca ttaaacagat gccccgtatg 9360 cagccaaact ttaaaaatgt ataaagcata ttttaatcgt ttaactgata ccattcatcg 9420 gttacaaacg caccctttcg gttattgctt aatcagttat tttgagcatc cccaactaaa 9480 ccgcatgtgc aagctctatt ttttttttct ctcacggccg cacgccggag atccggcatg 9540 gtgccggaat actttaagcc ctggtaagat catgtttatc actataagaa aactaaatca 9600 aaattatatt ttaattgaac acatctgatc acataaggta cgcacagaaa agtctgttta 9660 gtgttgtgaa aaggcaaagc tgaatatctg tggttaaact gccacccagc ggtcaaaagc 9720 tgctggcgca tcaagtaccg ccgtcgccgc ggtatgaatg gcggcataag gaacacattg 9780 aagtagtaac atctg 9795 // ID Mariner-3_DR repbase; DNA; ZEB; 2647 BP. XX AC . XX DT 23-JUN-2010 (Rel. 15.11, Created) DT 23-JUN-2010 (Rel. 15.11, Last updated, Version 2) XX DE A family of Mariner/Tc1-type DNA transposons - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; DNA-2-3_DR; KW Mariner-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2647 RA Kojima K.K. and Jurka J.; RT "Distant groups of Mariner/Tc1-type DNA transposons from RT vertebrates."; RL Repbase Reports 10(11), 1845-1845 (2010). XX DR [1] (Consensus) XX CC >93% identical to consensus. 68-bp TIRs with one mismatch. This CC is specifically inserted into TTAA and appears to generate no CC TSDs. However the similarity to Mariner/Tc1-type transposase CC indicates that it makes TA TSDs and ends with CAG..CTG. This is CC highly similar to DNA-2-3_DR, and probably DNA-2-3_DR is a CC nonautonomous derivative of this transposon. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(486..728,978..1259,1390..2052) FT /product="Mariner-3_DR_1p" FT /note="Putative DDD/E transposase." FT /translation="EQYTATQRKKERKKERKKECLHEGEEDEYQDKGEEWD FT KVEGEEWDKDQGEEWGKVEGEEWGKVEGEELECVVALKEGPELGIQRLPPS FT GGRGKLLNEEQELAIVNMVIADNEIKLKEIQSRVVEDNFVFGNIAAISLTS FT ISRTLAKHRVRMKHLYKVPFERNSERIKELRHQYVQRVMELEANQVPHEII FT YVDEAGFNLAKRRRRGRNIIGKRATVTVPGQRGANITMCAAISNNGALLHK FT CQIGPYNTDRLLLFLEDLHERLVPEVERGQVGDHLPVYVITWDNVAFHHSR FT AVTAWFDANPRMMSHFLAPYSPFLNPIEEFFSAWRWKVFDHRPHDQMALLD FT AMDAACQDITTEHCQGWIRHAKRFFPRCLAREDIRCDVDENMWPNAEDRAE FT *" XX SQ Sequence 2647 BP; 838 A; 475 C; 574 G; 758 T; 2 other; cagtttttta caatggctat gacacttttt tcaatacatt taacaaattt gctaaactct 60 taacgcagtt agcacaacat ccgtcttcgt gggctataca attaacacat ttcttgttgc 120 tttaacacaa aatgcatcca gttaacacaa atttaaaatg cttaaacttc ttttacacac 180 aaacacaact aagaccaaaa caatgtaatt ttaaagtcaa atttacaaat gctttcacac 240 agtatgtcag aacagataac ttcatgttct aaacagaact tatataggct aaagtcaatt 300 gaaacacaaa tatattccct gtattttatt ccatcgccaa tgacaaacag cttattgcag 360 ttatgtaaat atataaatca cagctgattc ccatctagca accaaactca ggtgttaagg 420 tttttccaat tacatgatca tatgttataa aagaggacta tttgagctga tcttgtgtgg 480 aaaaggaaca atatacagca acacaaagaa agaaagaaag aaagaaagaa agaaagaaag 540 aatgcctgca cgagggagag gaagacgagt accaggacaa gggagaggag tgggacaagg 600 tagagggaga ggagtgggac aaggaccagg gagaggagtg gggcaaggta gagggagagg 660 agtggggcaa ggtagaagga gaggagttag aatgcgtggt ggcgctcaaa gaaggaccag 720 agctagggta actgatcaga taagagctac tattattgat catgtcataa accacgggct 780 atcatacaga gaggctggtg aacgagtaca gccaaatctc agccgggaca cagtggcatc 840 cattgtccgc attttcagag aaaccaacag gtaggatatt gcttctccta cagtaaagtg 900 taactaacca tttactgtat tagagtaact gtatgcctcc ggtctctaaa aagtatctgt 960 attttgtccc tctaaggatt caacgtttac ctccctcagg gggcagagga aagctcctga 1020 atgaagaaca ggaacttgct attgtcaaca tggtgattgc tgacaatgaa ataaaactga 1080 aggaaattca gtccagagtt gtagaggaca actttgtctt tgggaatatt gcagcaatta 1140 gcttaacatc aatttctcgg actctagcta aacacagagt cagaatgaaa catctataca 1200 aagttccttt tgaaaggaac agtgagcgca tcaaagaact ccgtcaccaa tatgtccagg 1260 taaggttatg caatacagtc attactctaa tatacacagt gtgttttgaa gtaaactact 1320 ctataaacct ggagttcatt agtacatagt gtaaatgctg tactgtacat ctatgtgtct 1380 aattactttc agagagtcat ggagttggag gccaatcaag tcccacatga aataatctat 1440 gttgatgagg ctggcttcaa cttggcgaaa aggcgtcgcc gtgggaggaa tataataggc 1500 aaaagggcca cagttaccgt gccgggccag agaggagcca acatcaccat gtgtgccgca 1560 atctccaaca atggtgcact cctgcataaa tgtcagattg gcccctacaa caccgaccgc 1620 cttctcctgt ttttagaaga cctgcatgaa agactggtgc cagaggtgga aaggggacag 1680 gtgggagacc acttgcctgt atatgttatc acgtgggaca atgtggcatt ccaccattct 1740 cgtgcagtca cagcctggtt tgacgctaat ccaaggatga tgtcccattt ccttgcccct 1800 tactcccctt tcctcaaccc tattgaggag tttttctcag cctggagatg gaaggttttt 1860 gaccatcgtc cacatgacca aatggccctc ctggatgcaa tggatgctgc atgccaagac 1920 atcacaactg aacactgcca ggggtggata aggcatgcca aaagattctt cccacggtgc 1980 cttgccagag aagatattag gtgtgatgtg gatgagaaca tgtggccaaa tgcagaagac 2040 cgggcagaat agaggattac tttttttatt ttattataat tccatttttt tctgktttat 2100 ttttgcaaat aaagtctgta ctgcagcaat tctgtttctg tatttttttt ttttttttac 2160 ataaactgta cattttataa agctgctctg agatggttgt tttttgtatt gaatttctgc 2220 agcaaaagaa acaaaatgca tgcatttact aaacccaaca acacaaaact gtaaagaggc 2280 ttcaaatata agggcagagc tgacacataa ataatgcagt ttctgtaatg tcagctgatg 2340 atagtgtttt tattgtcagt gtattctgat tgactaaatg tttctgttgg aagagaacat 2400 gtgttagtgt tttgaacaat tatgtcattt tgaaacatgt ttatagtgtt ttgttagaca 2460 tggtgtaktg tgttcgttta ttattgtatt ttgaaaagta gtttagaggt ttagtttata 2520 atgtgtgatt ttgagcatga aattaactgt tttgccaatt gtgtgtttta ggtgtgttgc 2580 tgcgttaaga gtttagcaaa cttgttaaat gtattgaaaa aagtgtcata gccattgtaa 2640 aaaactg 2647 // ID BEL33-LTR_DR repbase; DNA; ZEB; 580 BP. XX AC chr22; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL33-I_DR; KW BEL33-LTR_DR; BEL33_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-580 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1410-1410 (2008). XX DR Genome; chr22; Positions 22289441 22288862. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 580 BP; 154 A; 88 C; 111 G; 227 T; 0 other; tgttctgtct tatccagact ttttacattg agttagtgtt aatctggtat ttagcgacct 60 cttttggtta tacatgggta ttacatttta tgttatagtc cttatttcat gacatcataa 120 atgtgagcct atgcattcag aagccgcatg gaagagcaaa cggctgtagt ttcatttgct 180 cccttataca agttgtactg aaggcacatt cggtatgtag acattatttt atcataattt 240 gagttatatt tgtttatgtt tacgctataa gcatgttatt atgcatgcaa gatgtttatt 300 gtgttcttta ttttggtcaa ttgcaatatg tgactacata tatgattcag ctaagtgcgt 360 aaacagtgac ctattgtgca tagagtgcag attgtgcttg tttaatttac atattattga 420 tccagagtac ttaccattct gtattttgta tgttttgcag ttttacgtta ttccataatc 480 tacagtaaaa ggttcaaact tatgccttga gtctgttgtc attggaagac gttatctctc 540 tgtcgaatca agcaaggcgt ttgatataag gggcagaaca 580 // ID Gypsy37-LTR_DR repbase; DNA; ZEB; 372 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy37_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy37-I_DR; KW Gypsy37-LTR_DR; Gypsy37_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-372 RA Dib M.R. and Naveira H.F.; RT "Gypsy37_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 805-805 (2007). XX DR [1] (Consensus) XX CC Gypsy37-LTR_DR is a long terminal repeat of the Gypsy37_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy37-I_DR. XX SQ Sequence 372 BP; 80 A; 83 C; 73 G; 136 T; 0 other; tgttaggact gcgttgtttt gttttgtgtc tttgattttg attccagatt gggggcgtgc 60 cttcagcgca cctgatgctg gtcagcgccc ttatttaaac cccggcagag agtttgtcgg 120 ggccgctgtc cattcacttg gcagttggcc agagccgcgc tcctatttgg gcctttatgc 180 tttgtttcgc atccatacat tatcactgac aacatttaca gcatccatac atttgcattt 240 cacactgatt gaacatactg atactgatat tttgattata taatttaaat tgagctaaat 300 aaatattctt tttgattata cttctccttg tcgtctccct cattatgtta catccttgag 360 ccaggtgtaa ca 372 // ID DNA-X-8_DR repbase; DNA; ZEB; 912 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.11, Created) DT 17-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-912 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1734-1734 (2008). XX DR [1] (Consensus) XX CC Copies are ~86% identical to the consensus. TSD is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 912 BP; 258 A; 188 C; 180 G; 284 T; 2 other; aggttattgc acactgaatc cgaaattttc gtctgtaatt ttcgcacgtt aaaaaataaa 60 ttcgacctca cgttgtgtca atcgtattga cacactgcct ccgaaacttt cgtccgtcat 120 aaaaaaattt gaaccaggtt catttttttt ctrcttttcg catctgaaaa agcrctttga 180 gaggtgtttt gacagttcag agccaccgta caagcgaaaa gctaaataca gaatgccgtc 240 atttgagcta cagataactt tatgacaaac agcagtgcat aaaataaaga cagaatgtgg 300 cggacagttg agaaccccta tatgctggat tcagtgactg gcgcacctct gccctgctgc 360 tgtttggtgt gtgtgtgtgt ccatacattt gacgtactgc ccatagacat tataatagaa 420 agcatggtcc gctttcggtc tgcatggtct gctttcggtc cattgcttcg acacgtcaac 480 gcggccacca tctattataa tgtctatagg tactgccagt ctctgttcag gatttgttta 540 cgtacgtgaa tgtgtgaagt atcacaagca atgtatcaaa attccactga tttcctgaaa 600 taaactttgc atttcgggat aatcctagcg cagctctttg ataatgagct gaactctcct 660 tcctctctat gcagtattgg atgaacacaa tatgtagcct attcctcttt ctttttctac 720 tttggagaag agagaggaga acaaagtatc actgcttgaa ctgactccga aatgttttgc 780 gttttttcat aatggattaa ttaatacgca tcggactcag tgtgcaaggt ttgtttacgt 840 gataaatttt tcgcatatga atacgaaaaa aaaacgcata cgaaaatttc ggacttcagt 900 gtgcaaagac ct 912 // ID BEL32-I_DR repbase; DNA; ZEB; 6873 BP. XX AC chr15; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL32-I_DR; KW BEL32-LTR_DR; BEL32_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6873 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1407-1407 (2008). XX DR Genome; chr15; Positions 43280521 43273649. XX CC Positions [5827-6378] - Integrase core CC 'TTCAT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1098..3164 FT /product="BEL32-I_DR_2p" FT /translation="MHLEGHASDEIEEPWPDVGSVLDSTSSLSRPLSHQTK FT CSSTHSSIHSGKRNEAVAEAAACQEVLAVLEEQDREVSELQRLEAESKLRM FT AQFESEDLARKQAMQDKRRNLERLEEIKKLNAARARVRVYDQAEGDIEAAE FT LKQDALSVIDNKVPDFISSSFVHKSLPAPTHVLHASSPSFIPKSQPVVHQD FT QNHLPQRTPQGSLYSPLMAQAQNGSDLVSTLAEVMSANRLPISEPALFTGD FT PLKFKDWRLSFETLIDRKNIPKNEKLYYLRKYLGGAAKKTVEGFFLLGTEE FT AYDSAWQLLEKRFGDPFIIGKSFRDKLHGWPKISSKDGCELREFTDFLKSC FT EAAMPHLKTLEVLNDCNENQRILLKLPDWLVSRWNRKVMEFRQVYGDYPKF FT KEFVDFLSNESDLACDPISSVQALRSVEATKPKYSRNQPSQAKTFYTNTTQ FT STFPACVFCKRTGHGLAKCRRFIEKDVQDRVKFVRTEKLCFGCLQTGHHSR FT RCENKSTCEKCQKRHPTCLHDDKFKEYQRSVVKESGDSKEKADGTEIVAVA FT ITNTIRQEGPNTQTSTIIPVWVSSTKLPDQEILVYALLDTQSDTTFILDEV FT AQRLKVNMENASLQLSTMSSRSTTIRCHKVSGLQVRGHNSEKIIHLPAVFT FT REFIPVNRSHIPTSDTALKWPHLEQLANKIHQHWTVKLDF" FT CDS 3340..6771 FT /product="BEL32-I_DR_1p" FT /translation="MPTVKPSVELKQEVHFVCRNVVKEVLPTDIIKALKLD FT FTDHSADDNPVSQEDILFLSKVKEGVKHKEDGHLELPLPFKTENPSLPDNK FT QSAIHRLISLKRRLRTDKQYYEDYVHFMSDIISRGDAERVPSSELDNKSAW FT YIPHHGVYHPHKPGKIRVVFDCSARFQEASLNDHLLTGPDLTNTLVGVLCR FT FRKGPIAIMCDIERMFHQFHVTKEHQDYLRFLWWDNGNLDSKPSVYRMRVH FT LFGAASSPGCSNFGLKYLASQGHGSFSEEAIKFIQRSFYVDDGLTSVSTSA FT EAIHLVDEARALCKTGNLRLHKFVSNDMEVIATIPPEDCAQARDLDMALGE FT LHIERALGVQWCIEADEFQFRVIVKKNPMTRRGVLSTVASVYDPLGFVAPF FT LLVGKQILQVLCKDKVSWDEDLPEHILPQWESWLRDLPHLADLKIPRSYLS FT SNFDRPVSYELHSFSDASLSGYGACSYLRATSETGQISCSLVIGKARVTPT FT KQTTIPRLELSAAVTSVRNGNIVKRELEIEDLQEYYWTDSRVVLGYVNNDA FT KRFHTFVANRIQRIRSSTCPAQWRHVSSENNPADYASRGLSAAQLKESNWL FT KGPDFLWQKILLCEEEVVRDLEITDPELRKTNIHTIRTKEIDLMVTRFAKF FT SDWSSLVRAVARLKRFVREFKGIQPRRNGMTSLEERKAAENVIIKIVQDDA FT FAEDIRHIKLQKENYLNKHSNLHRLSPFLDKEGLLRVGGRLTRSSLDYDVK FT HPALLPKKSHISALLVKHHHERIYHQGRGMTMNEIRANGIWILGCGSVVSS FT HIYKCVKCRRYRGTTEIQKMADLPEERTETSPPFTCCGIDCFGPFIVKEGR FT KELKRYGLLFTCLCSRAVHIETLDDLTTDSFINALRVFVAIRGPVRQIRCD FT QGTNFMGARREFSDLLRGMDQERVRAIGCEFVVNVPSASHMGGVWERQIRT FT IRSILNAMLDGSACRLDSTTLRTFLYETMAIINSRPLSVEHLHDPTGPEPL FT TPNHILTMKSSVIVPPPGNFDKQDLYLRKRWKRVQFLANEFWQRWKREYLL FT NLQLRQKWQRTSRNSQVNDIVILQDESAQRNEWKLARVVETYSSADGIVRK FT LKLLLSDTTLVKGKPQSRSVYLERPVHKVVTLIEAS" XX SQ Sequence 6873 BP; 2229 A; 1290 C; 1513 G; 1841 T; 0 other; gtgagaactc gacgcagatc gacgcgggta tgaaagcaac gtcctcgcgg gtaacgtcga 60 gacaacgtcc agctgaagat aaggagggct caagttacag tgccgaagta acacgcagtt 120 acacacagtg atgattagac tgagttgaaa gtgcaacgtt tcagactgaa atcagaaagt 180 actctgacac agttaaggtg gtcagagaag tgattgaact tgaaagtgca gtaaggactc 240 aaatctaatt gtgttgctca gccttcagtt ttgggacaaa tttaatgtgc tgatatttat 300 ctggtattat ttaagagtgc atttaagtta ttgataagtt cattagttac tgaaattgaa 360 atcgactgat ttatttgttc aatattactg atgtaaaagt ggtttgtgaa aggattagtc 420 agtaaattta acttgagttt attttcagaa tattttaggt ttatactgca tctatatagt 480 caactaattg ctgattattt ctttaacaag ggaatttgtt acatgataat actttaattc 540 aatattgaca tacatacttt atattttatt tcttatgtag tttaacaggc gttaaggatt 600 ttatacagta aacatttaca ttttacattg agctacattt acacagtcta agttcactaa 660 ttttgtttac atacaccata catatacagt aatattaata tttatatcaa aatgtctgat 720 acacaaacag agacttgccg taaaggggaa atatcacctg cacaacctga agctacagag 780 ctgcgaagaa gtgaaagagc tagaactctt acagaaaagg gaaaagagct tcaagaagct 840 aagttaaagg gtttaaagcg acagtataga atagtttatg agaagtggag atattatgca 900 agattaagta aggaaatatt ctcggatgaa gcttctcaag agaattgaat gagcttattc 960 acaatattga aagcagctgt tctaatgtga aagccatata tgaagaacta cgtcaagttc 1020 aaattcctga acaagaccta cgacgcagga ttgatgtgtg catatcgctt tcaggattca 1080 ttgttaaaag agcacaaatg catcttgaag ggcatgcatc agatgaaatt gaagaaccct 1140 ggcctgatgt tggatcagtc ttagactcaa caagctcttt atcaagacca ctgtctcatc 1200 aaactaaatg cagttcaacc cattctagca tccactcagg caaaagaaat gaagctgtag 1260 ctgaagcagc tgcatgtcag gaagtcttgg cggtgttgga agagcaagac agggaagtat 1320 ctgaacttca aaggttagag gcagaaagta aactacgtat ggcacaattt gaatctgagg 1380 atctggccag gaaacaagca atgcaagata aacgcagaaa tcttgagcgc ctggaggaaa 1440 taaagaagtt gaatgctgca agagcacgag ttagggtcta tgatcaagcg gaaggagaca 1500 ttgaagcagc tgaattgaaa caagatgctc tatcagtcat tgataacaaa gttcctgatt 1560 tcataagctc ttcctttgtt cataagtctc tgccagcgcc aacgcatgtg ctacatgcat 1620 caagtccttc atttattcca aagtcccagc cagttgtaca ccaagatcaa aatcatctcc 1680 cacaacgcac acctcaaggt tcactatatt caccattaat ggcccaagca caaaatggtt 1740 ctgacctggt cagcacatta gcagaagtta tgagtgccaa tcgtcttcct atttcagagc 1800 ctgcattgtt tactggagat ccattaaaat tcaaggattg gcgtctttct tttgagacat 1860 tgattgacag gaaaaacatc ccaaaaaatg agaagctgta ttatttaaga aaatacttag 1920 gtggcgctgc aaagaaaact gtagaaggtt ttttcttact tggcactgaa gaagcatatg 1980 attctgcttg gcagctgtta gaaaaacgct ttggtgaccc atttataatt gggaagtcat 2040 tcagagataa gctgcatgga tggccaaaga taagctctaa agatggttgt gaactaagag 2100 aattcactga cttcctcaag agttgtgaag ctgcgatgcc tcatctgaag actttggagg 2160 ttctaaatga ctgtaacgaa aatcaaagga ttctactaaa attgccagat tggttagtgt 2220 ctagatggaa ccgtaaggtg atggaattca gacaggtata tggagattac ccaaaattca 2280 aggagtttgt agacttcttg tcaaatgagt cggatctcgc ctgcgatcct atttcttcgg 2340 tacaggcact taggagtgtg gaagctacaa aaccgaaata ctcacgaaac cagccttctc 2400 aagcaaaaac attttacaca aacacaactc agagtacttt cccagcatgt gtattttgta 2460 aaagaacagg acatggctta gcaaagtgca gaaggttcat agagaaggat gttcaggatc 2520 gtgtcaagtt tgttcgcaca gaaaaacttt gttttggttg tttgcaaact ggtcatcatt 2580 caagaagatg tgaaaataag agcacatgtg aaaagtgtca gaagagacat cctacatgtt 2640 tgcatgatga caagttcaaa gaatatcaaa ggtctgttgt gaaagaaagt ggcgattcaa 2700 aggaaaaggc tgatggaacg gagatagttg ccgttgcaat taccaataca attaggcagg 2760 aaggcccaaa cacacaaaca tccacgatca taccagtttg ggtatcatcc acaaaactgc 2820 ccgatcaaga aatccttgtc tatgcacttt tagacacgca gagtgacacc actttcatct 2880 tggatgaagt agctcaaagg cttaaagtga acatggagaa cgctagtttg cagttatcca 2940 cgatgtcctc aaggtcaacg accatacgtt gtcataaggt ttctggtctg caggtgagag 3000 ggcataattc tgagaaaatt atacatctgc cagcagtttt cacacgtgaa ttcattccag 3060 tgaacaggtc acatatacct acgtctgaca ctgctttaaa atggcctcat ctggagcagt 3120 tggcaaataa gattcaccag cactggactg tgaagttgga cttttgattg gctacaactg 3180 tcagcaagca cttctgccta aggaaatctt atttggagaa gaaaatcatc catatgcaca 3240 acgaaccgat ctcggctgga gcatagttgg atgctcatat ccagcagatg actatggtga 3300 tgcgattgga gcagtcatag aactttagtt tgcccagtga tgcctacagt gaagccatca 3360 gttgagctaa agcaagaggt gcattttgta tgtcgaaatg tagtcaagga agtacttccc 3420 acagacataa ttaaggcact caagttagac ttcacagatc atagtgctga tgacaatcca 3480 gtttctcaag aagatattct ctttttgtct aaagtaaagg aaggtgtgaa acacaaagag 3540 gatggacacc ttgaactccc acttcctttc aagacagaaa atcctagttt accagataat 3600 aaacaatctg caattcatag acttatctca ttaaaacgtc gactgaggac agacaaacag 3660 tattatgaag attacgtaca cttcatgagt gacatcataa gccgaggaga tgctgaaaga 3720 gttccaagtt ctgagcttga taacaaatca gcatggtaca ttccgcacca tggagtctac 3780 cacccccaca aaccagggaa gattcgtgtg gtgtttgatt gttcagcgcg cttccaagag 3840 gcctctctaa acgatcacct cttgactggt ccagacctca ccaacacctt agttggagtg 3900 ctgtgtcgat ttagaaaggg tccaatagcc attatgtgtg acattgaaag aatgttccat 3960 caatttcatg ttacaaagga acatcaagac tatttgagat ttctttggtg ggacaatggc 4020 aatctggatt ccaagccttc tgtgtacaga atgagagtac accttttcgg tgccgcttca 4080 tcccctggct gctcaaactt tggactcaag tacttagcat cgcaaggtca tggaagcttt 4140 agtgaagagg ctatcaagtt tatccagagg agtttttatg tcgacgatgg cttaacaagt 4200 gtaagtacat ctgcagaagc tatacatctt gtggatgaag caagagcttt gtgtaaaaca 4260 ggaaatcttc gtctgcacaa gtttgtttcc aatgatatgg aagtgatagc cacaatacca 4320 cctgaggact gtgctcaggc cagagatctg gatatggctt tgggagagct tcatatagaa 4380 cgtgcacttg gagttcaatg gtgcattgaa gcagatgaat tccaatttag agtgatagtc 4440 aagaaaaatc caatgactag aagaggtgtt ctttcaacag tagcctctgt ctacgatccg 4500 ttagggtttg tagcaccttt tttactagta ggcaagcaga ttcttcaagt attgtgcaaa 4560 gataaagtaa gctgggatga agatctgcct gagcatattc taccacaatg ggagtcatgg 4620 cttcgagatt tgccccattt ggctgactta aagattccta gaagctacct ttcatcaaat 4680 tttgacaggc ctgtttctta tgagctacac agcttttctg atgcaagtct tagtggatat 4740 ggtgcctgtt catatcttag agcaacaagt gaaacaggac aaatcagttg ctcacttgtt 4800 attgggaaag ccagagtaac ccccaccaag caaacaacta tcccgagact tgaactatca 4860 gcagctgtga cttcagtccg aaatggaaac attgtcaaaa gagagcttga gattgaagat 4920 ttgcaagaat attactggac tgactcaagg gtagttcttg gctatgtgaa caatgatgcc 4980 aaaagattcc acacattcgt agcaaatcgg attcaacgca taaggtccag cacatgtcca 5040 gcacaatggc gacatgtcag ttcagaaaac aaccctgctg attatgcctc aagagggctg 5100 agtgctgctc agctaaaaga gtccaattgg ttaaaaggac ctgactttct gtggcagaaa 5160 atccttctgt gtgaagagga agtggtgagg gatctggaaa ttactgaccc tgagctccgc 5220 aagaccaaca ttcataccat caggactaaa gagatcgatt taatggttac tcgatttgca 5280 aaattttcag actggtcaag tttggtgaga gcagtcgcca ggcttaaaag gtttgtcaga 5340 gaattcaagg gaattcaacc aagaaggaat ggaatgacta gtcttgaaga aagaaaggca 5400 gcagaaaatg tcatcattaa aattgtacaa gatgatgcct tcgccgaaga cattcgacat 5460 atcaaacttc agaaagaaaa ttatctaaac aaacacagca atttgcatcg attgagtcct 5520 ttcttggaca aggaaggtct tctcagggtg ggaggaaggc taacaagatc aagcctagat 5580 tatgatgtaa agcatcctgc gttacttcca aaaaagtctc atatatctgc cttactcgtc 5640 aaacatcacc atgagcgcat ataccaccaa ggaaggggca tgacaatgaa tgagatacgt 5700 gcgaatggaa tatggatctt aggatgtgga agtgtggtct cgtcacatat ctacaaatgt 5760 gtaaaatgta gaagatacag aggaactaca gagattcaaa aaatggcaga tttgccagaa 5820 gaaagaactg aaacatctcc cccctttaca tgctgtggaa tagattgttt tggccctttc 5880 atagtaaagg aaggaaggaa ggaactaaaa cggtatggat tgctgtttac ttgtttgtgc 5940 tctagagcag tgcacataga gactctagat gatttgacga cagattcctt catcaacgct 6000 cttagggtgt tcgtagctat taggggacct gtgcgacaaa taagatgcga tcagggaaca 6060 aatttcatgg gtgccagaag agagttttct gatctgctca gagggatgga tcaagaacgt 6120 gttcgagcta ttggttgtga atttgtagta aatgttccct cagcgagtca tatgggcgga 6180 gtgtgggagc gccagatacg gacaatacga agcatcttga atgccatgct cgatggatct 6240 gcttgcagac ttgactcaac gactcttaga acattcctct atgagacaat ggccataatc 6300 aacagcaggc cattaagtgt tgaacatctt cacgatccta ctggtccaga acccctcact 6360 cccaaccaca tactaactat gaagtcttca gtcattgtac caccaccagg aaactttgat 6420 aagcaagacc tctatctgcg taaaagatgg aaaagagtgc agttcctcgc aaatgaattc 6480 tggcagagat ggaaacgaga atatttgcta aatctccagc tacgtcagaa atggcagaga 6540 acatcaagaa attcacaagt aaatgacatt gtgatacttc aagatgagag tgctcagaga 6600 aatgagtgga agcttgctag agtagtggaa acatactcaa gtgctgatgg cattgtaaga 6660 aagttgaagt tactacttag tgacactaca ttggtcaaag ggaagccaca atctagatca 6720 gtgtatttag aaaggccagt tcataaggta gtcactttaa ttgaagccag ttaagtcagg 6780 cacatgacac aattacactc tgaaacttga tttaacttca ataattgttt atagaaaatc 6840 tcacatttca agtaaaggag attttggtgg gag 6873 // ID DNA21TA1_DR repbase; DNA; ZEB; 539 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE DNA21TA1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA21TA1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-539 RA Kapitonov V.V. and Jurka J.; RT "DNA21TA1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(4), 9-9 (2002). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed based on 17 copies CC present CC in sequenced DNA that represents ~1% of the genome. Approximately CC 1500 copies of DNA21TA1_DR are expected to populate the complete CC zebrafish genome. The element was active relatively recently; its CC copies are 5% divergent from the consensus sequence. CC This element is characterized by 21-bp terminal inverted repeats CC and CC the TA targets site duplications. CC Its classification is not certain yet, although it CC is expected to be a member of the Mariner/Tc1 superfamily. XX SQ Sequence 539 BP; 159 A; 123 C; 103 G; 154 T; 0 other; gctgcgtccc aaatggcaca ctatacacta tgcactcatg cactatgtac ttatgcactt 60 acacactcaa caggatagta tatgtatgta gtgtcgtccc aaatggcaca ctaatgtttt 120 tttactaagc ggaaattcaa accgtttccc tgatgacgtt tgacggttgc caaatcagtg 180 aaataaacga ccgaattatc aaataatacc tgccgtgagt ataaccgcat tcaccatcgg 240 gaggcgctat aatcactctc gtaggagaat tttgctttca ccatccaaaa taaataaagt 300 tatccaacat gtgcgtccga tagctccgcc ccttccgcta cgtaagcaaa cctgcggtcg 360 ttgagtgcgt gaagtgtcca tcattacaca cttcatttta gcggctgaat gagtgcatca 420 tccgggtaat taaagtgcac ttattatttt tagagttttc agtgtgaaca cactacttac 480 actatttata ctacaaaatg gcgtagaata gtgcataagt atgcgatttg ggacgcagc 539 // ID DNA-2-12_DR repbase; DNA; ZEB; 4686 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4686 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1703-1703 (2008). XX DR [1] (Consensus) XX CC TSD is 2-bp long (mostly TA). CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 4686 BP; 1632 A; 784 C; 730 G; 1539 T; 1 other; cactgtaaaa aatttgacct taaattcaca gtaaattact ggctaataat tgcattactt 60 tcacagtaaa ttactgtatg taaattcaca gtaaattatt gtgaggtagt tcacagtaat 120 ttactgtggt tttgtcacat taaattattg tgaaattaca accatatatt gtaactttac 180 agtagataat aaagtctgtt actgtgattt cacagtatta tcttgtagaa ttaactatat 240 tttactgtga atttgcaaaa cgattactgt gatttagcag taggttgcag tttaaatacc 300 tgatttaact gtaaagttac aattatatcc cggattactg taatttaaca gtttggtgct 360 gtaaaatttc aaagcaattt taggtcacac aaaaatgaaa attctatctt gatttactca 420 ccctcaggtt gttccagctc tgttaaaaat gatttttttg ttaaacacag gaataaaaat 480 atcaaatatt cctctttgtg tccaaacaac ctgaggatgc gtaaatgacg acagaatctt 540 catttttggg tgatctgaga agactctaaa atcaaaataa aaacaaacac acatacattt 600 tacagattta tttaacagct taatgacatc tgtgtaaaca tttcagaaaa ctttcaaaag 660 gtgaaggtga agcctcaaaa aatactataa gaacatattc ataaaaaaat ctgacatcat 720 atgtagcata catataaaaa gcagtgcttc agagtgcgct ctctctgtct ctcatatata 780 tatatatata tatatatata tatatatata tatatatata tatatacata tatatatata 840 tatacattat atatatacat atatacatat atacatatat atatatatat atatatatat 900 atatatatat atacatatac atatatatat acacacacat gatatatata tatatataca 960 tacataagag agagagagac aaaacttctt gctaaattag gcagtcgtct attcaaggtc 1020 aatcattttc ctgataattt tacacacttg ctggttgatg tgtcttctct gcttctttga 1080 cttcgtctgt cttccatctc caaactggtg cccaaaaagc acctataagc aaaagggcag 1140 acaaacagtt agcttcatac aacactactg ggacactact gctgcactac agagcaaaat 1200 tactgttagc tcagtttgaa catgctgaca atactgtagc tcagtcaaac tgcaatcatt 1260 taaacaaaaa catctgaaaa acatttccat cacaaacaca caatcaagaa cagtccaaca 1320 acaaatagtg gttttaatga cctaggcaga gcattttaaa acagttgatt tgatttacac 1380 cttctgttta cagactataa tctaatgctg ggttcacact tattgcaaag ttaacacatg 1440 catcacagca cttgctcttg aatactagca ggatgtttct tgcatcaagc taatttttya 1500 tgatttaaat attaggaaaa ttttgcaaaa aaaaacttga ttgaagtgat acagcacttg 1560 tgttgtattc atttcaatca agttttttgc aaatattcca aatatttttc aacgtttttt 1620 gagtttcact taatttgtgc tgcccagtaa aaatttccca ctatttatgc atttgcaatg 1680 acttgtatgt aatccactca cctgaatagg taacttcaca tatggtataa gagccccatc 1740 cttttaaaga cagcttcccc tctggtctcc agcacagaca aacaatttgt gtctgcaaca 1800 cggcccttta taacaagggg acaaggggag tcatgacttg tgctcctata cattccttgt 1860 ttaacccccc caaaacaagc cctcagttac aatatttatt gtagacatta ccttaaattc 1920 cagagtgcaa aaggcatctt cagcatactt cagtttcaga ccagattcaa agtcagttgt 1980 gaaatccagg tgaggtccca tgaccactcg accctccaag gaataagact aaaaggggga 2040 aagaaggact aaattagagg tgtttgtgaa taattaaatt tgtttgtaga aacaaagttg 2100 acattccatg ctctgcctat taaaaaccaa gcaaacaaac acacatgccc agtaaaattg 2160 ctaataaaaa taaaaactat agtttaaaaa aatctttgtc taatatttac tactactgga 2220 ggtactggtg gaacttctgg aagaagaaaa agagtttgta caagaaactg ctcacaacaa 2280 atctgtgaaa aacctgacat ttaaaaatct aataaaaatg aggaaggcat caacattacc 2340 tttaataatc aagcgagggt tgattggtag catcagtgtt tcaacatcag tggctgtggc 2400 tgcaaaaaaa tatgaatata ataagagtag aataagactc aacctaacac acgcacacac 2460 tctatatata tatatatata tatatatata tatatatata tatatatata tatatatata 2520 tatatatagg tttagattgt gataatgaat ttaggttata tttgtgtgct cagaatttac 2580 actcacctca cacttcagct attatcaggt acacctgaac accaaaaaat aaaatctaca 2640 gttagtgaca atatacaatt agttctcatt agtaaatgtt agcactgcat taacaccagt 2700 gagaaatata caatatgtac tgtgttagtg ttagtttaaa agaatacaac tgaattgtat 2760 tataagcctc atttaataaa acagttaaga cttgtttcat aatgagttta gtttatttaa 2820 ctaacttaac gttaacgtta cagtgaacaa aattaacgta catgatggtc actaaccgta 2880 acgttaactt aagtgaaggt cagagcttac cttaacgtta tagtcatttc acctttactt 2940 cagtcttgta ctgaaacaga aaatgcagtt aacaagaagt taattaaaga aaattccctt 3000 tcttttttca ggaactaacg ttaacgttat gctaatacct cagaaaacac tacaatcacc 3060 atctggcgtc gttaatttct tgctttaaaa aaaggcgcgc ttaaaccctg tacgtgcgag 3120 tacaactaaa gtactcggta aattcctgtt aaatgacatg cactatacag aaatacacat 3180 acaacaatca tttaacggac aaaagtcata ttttaacact taccgaggat gaatccgttg 3240 ggtgaagaga cgacggcgtt gtctgtggtg atggcgcttg tttgcggtcg agtcgagtca 3300 gttctgttca atgaatcggc tcctttaagt gaacagtaaa gagtcgaatt cggctccttt 3360 aagtgaacag taaagagtcg aattcgcctg aaaacgattc cgttttgtat aatattgtaa 3420 gacgcaaaca tataattcat taatatttca tcatattgct tggttcgtgt actgcttgta 3480 cacgaattgc ttgttgatga cttcgagctc atgaacacgt ctgataaaat atctgaatct 3540 gattcaatgt cacacgagtc ctgaaccgaa gcagtgcaag agtaatatat tgattgaaat 3600 ataaagttaa ttattttctt cgccattcaa aatattacat atctctgact ttaacaataa 3660 gaataaagtt cgtttggagt gtattgagaa ttattatttt ttcctcccag gattcatttc 3720 gcaccaagct gcatcaagct gtaatggtgg atgagaaggc acataatatt ataaatattg 3780 cttgtaatat gtaatgaaat aaagttttaa cactttttca tgcgagaatg tagttcttca 3840 aaactcaaat ctgtggactg ttagtaaaga ttgtgtgtaa ttttaagtgt gttttgccgg 3900 tagcggagat ctatgacctc caggcggtaa cggcgctcat tactcatgta tccgccgaga 3960 ggcgcctgtc ttcaggctag acattgggac aattgccccg tcttcgatgg ctctatatgc 4020 gtccgaaaat gaagttttaa ctgtttttca tgctagaatg tagttgttta aaactctaat 4080 ctgtggttta tttatataga taaaaaaata atatatttat atatatatat atttaattct 4140 gagattagtg acctccaggc gatgacaggt gcccaatact catgaaaccg tctaaagcag 4200 acatttaccc tgacattgaa ataattgttg gctgtttaac agtctttggt ttcattaatt 4260 aattactttt tattccagtt tattatgttt tggctaagca cacagttatg ttcattaaat 4320 attatttcct attctatgtt aactgtataa aattaaataa aggtgcagta caatcaaaaa 4380 gatgttggta accaaatcgt tttgggggct ataacattct gttattatct gttaaaagtt 4440 catttgtatt aatttctacg ttgagtttaa aaaaggtttg aatttctata actaacgtgc 4500 aaaagatata gcccttttca cagtaatttg ctgtaatcat gctacctgcc gtaatttcac 4560 aataaaatta tgaatacaac atattactgt gaatttttca gttaaatact gtgaagggat 4620 actaatgtaa tttactgtga aaaattacag taacttgttg tgaaaacctg gaaatttttt 4680 acagtg 4686 // ID BEL11-LTR_DR repbase; DNA; ZEB; 532 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL11-I_DR; KW BEL11-LTR_DR; BEL11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-532 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1372-1372 (2008). XX DR Genome; chr14; Positions 41949506 41948975. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 532 BP; 139 A; 93 C; 93 G; 207 T; 0 other; tgtgctggtt ccagtgtaat acccttttga tccactaggt ggtagtgttc tgttaatttt 60 atcattatat taagaagtga cctttcacct aaacaccagc cattttgaaa cattcagtcg 120 atgctgacag ccacacggtc tctcagacct tcatctgtct attcattttg tatttcctaa 180 tagaagcttt gtctgtaagt accaaaatat tttataagga ttttgtgtgc attacatatt 240 aatattgggg acatatttca tgctaaactt ttcttagtaa gattgcttta gatcagcagt 300 tcatgcatcc acatcattat agtgatattg tatgtgcgtt tcattcgatt gtacagtaat 360 atttctcaaa ttcttatttt gtaatgtttg ttttctctct gtatttcagt tttactctgt 420 tccacataca tttgaagtgg tggtcacaat aaatccatat tcagttcagt atctcagtgg 480 aaggagttgt ctatctgtct agttgaagaa ggggaactct tttacgaggg ca 532 // ID Gypsy41-LTR_DR repbase; DNA; ZEB; 890 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy41_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy41-I_DR; KW Gypsy41-LTR_DR; Gypsy41_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-890 RA Dib M.R. and Naveira H.F.; RT "Gypsy41_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 813-813 (2007). XX DR [1] (Consensus) XX CC Gypsy41-LTR_DR is a long terminal repeat of the Gypsy41_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy41-I_DR. XX SQ Sequence 890 BP; 199 A; 156 C; 186 G; 349 T; 0 other; tgtaacgttc ccttgcagaa cgtattttgg ataagtgtgt gcgttcgttt gtatgtgggt 60 gtgtcgtttc atgcatgtgt tacaggtacg aacggcattg gaggatccgc atttcgtgct 120 gatgacgttg gccgctgacg ctgggtttgc gccaccaatc gctggttggc gtagcgttta 180 aaagccgagc ccaaactgca atggcgatct ctctctcttt ctctgtctct ttccttctcg 240 ccacttctcg ctcttattct ttctgtgggt ctctgtgagg acagtctctc ttgcgagggc 300 tagtctgaga gaatgtgttt tgcttgttta atgctcttgt ctaaagtttg cgatttaaca 360 atgtttgtga aattgtccag ttaactgaca tgtttgagtg atgcctcgga gaggtgtagg 420 agtgagtaag tcgcgcgata tacatataca acactcagat tgatattttg tattagttca 480 ttaggttcag gagaatgtgc tgttttctgt atgttttgtt tttagttaga gcagtttagt 540 taagacgttt ggtacgttga agatgtttct tttcatgctt ttcttttcat atttagttta 600 gtttggtgaa aacagttact gagttttgtt atatttattt ctttgattta agcacattca 660 actacttcct ctccccccag gtaattcatt atttaatcat tgtaaatatt tttctttcac 720 tgtataatat attttcactg gttttcatgg gatatatggg caataaataa ttgcagtgat 780 tatttgattc agtttatttc tttctccatc cgcttgtcac acacattaaa ctaatgtatg 840 ttattgtcat ctactcctag tttaatatta acatctaaaa ggacgtaaca 890 // ID EnSpm-4_DR repbase; DNA; ZEB; 9040 BP. XX AC . XX DT 31-JUL-2008 (Rel. 13.07, Created) DT 28-SEP-2008 (Rel. 13.07, Last updated, Version 2) XX DE EnSpm-4_DR is an autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; EnSpm-4_DR. XX NM EnSpm-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9040 RA Kapitonov V.V. and Jurka J.; RT "Zebrafish En/Spm DNA transposons."; RL Repbase Reports 8(7), 752-752 (2008). XX DR [1] (Consensus) XX CC EnSpm-4_DR is a young family of autonomous En/Spm DNA CC transposons. The consensus sequence was derived based on multiple CC alignment of several copies of EnSpm-4_DR that are less then 2% CC divergent from each other. EnSpm-4_DR transposons are CC characterized by 2-bp target-site duplications and imperfect CC 11-bp terminal inverted repeats (1 mismatch). See also commentary CC on EnSpm-2_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 6136..8571 FT /product="EnSpm-4_DRp" FT /note="En/Spm transposase." FT /translation="MLKCYICRTLHDAPSSLIQHLKFFHGLYPGKKFVLVC FT AQEGCSRQFKSFKGFKMHLNTCHYTTDLDASSDVMQVPHQSDFGEQSSQHN FT SSVMDQDAINEPSTSLMSKDQAKDMCASIIAKLKGSGVANSVVLSVVESME FT EYVDEIHANLEEQVLSALPAENQIRSAVKDVFSNAFNPFSDLNTNSKMTKY FT FSEKWGVVEPIEIHLGVRFDSKRNKKSGVYEQVPVNDTFIYVPLLKTLEFI FT FKNPEVCSHINKPPATDCNLYQDFCDGKYYKHHTLYSMSQNALQIQVYYDD FT FETANPLGSKHGVHKLGCLYFTVRNLPPRLNSSLMNIHLISLFHSQDSKKY FT GIDKILCPFVEDVKVLEQHGMKVSFTEQPLYGTIAQVTGDNLGLNSILGYV FT ESFSGNYFCRMCLADKGLAQTMFSENDPRMVLRSRLTNEEHYNYLCENPRE FT TSCFGLKRNSIFNSLSYFSVSDNFVLDIMHDVLEGVAQYEIKLLFGYLNQN FT FISNENILQRVYAFNYGFMDKKNRPTRINLSSSGNSIGLNASQTLCLSRNL FT PLIFGDVVPEGDRHWHLLLLLIHIVNIIFSPSITDGMIVFLKHLIREHHQL FT FSELYPQNNLIPKHHFMIHYPECIRQIGPLVHVWSMRYEAKHKFFKSSLKN FT FKNITKSLAMKHQIAVAYHWESLFTKGIESGPVKSKKLTDVDNGHLIAEHF FT RIDMLSEVNITSWIKHEDVEFHTGLVVCTGVVEELPVFNKIVCIFLRNEAY FT FLVTEMETSFVEHLHAFEVTENMHNVSVVMPHDLRFFKPFDVQMAYGADSL FT FYVVPDCCIV" XX SQ Sequence 9040 BP; 2972 A; 1481 C; 1698 G; 2889 T; 0 other; cacagcaaat tcattagtgt taaattttga gtgttgaggt aattcagagt aaagtgttat 60 aattctagag ttagataaag ataaaataac cctctcagtg ttggaagctg atttgcctcc 120 ttgtcgcaca gagttacatg tatctaacgc taatatagta ttgactacta catccgggaa 180 tttatgcagc aatatccctg agggattttt ccatacgcca ttttcttctg ccacgaggag 240 caatgagaag cagcaactgg tgagtaaaag tacttaaaat gtttatacat agacatttga 300 cataaaaatt gttatatata cgttagtttt aaatatcata tgtaattttc catcatttat 360 ttagttctgt actatctatg ttacattaaa gaaactaaag ctgtcttaac cgctcctctt 420 ggttaaaata cgtgaaaata cggtttgtga ctgatttgaa ttgctttcct taaacattaa 480 cgcaatagtt aataaagttt tactgatgtt taacaagata acgttatatt tttattcccg 540 tgcgcgggcc cgcgcggttc ggaaaactaa cgttacctcc agctccctca cgccataccc 600 gggcactttt cacacacttt tgctggtaac agcaaaagaa aggtccacgc gtgtgaggga 660 gctgcttcgt ttatttatgt aacacagcca tacgtcgtgc tttttttgct tgcttgattc 720 aacatcaaaa gaaaaaacgc ccgttgtata gcactaacaa gtgccctttc tagataattt 780 aacacctgcc attcgccatt ttcttccatc gcgagcagga gtggtggtga ggttcacgtg 840 ctatttctgc gtgaggcgat tattctgact aaattcacga cagaaagttt tagttaatga 900 tcataattta tcaaatttag aacgaacgtt actcgtagca tgtctttgtg tgagtgtgtg 960 tgcgtgcttg cgtgtgtaat ttagcttcgt taactaacgt aaagttacat cttttttata 1020 ggaatcaaca gcttcttgag gctcaacgtt cagatatttc tccttcgggt aagtgggagt 1080 tacatgtgaa gttataaatt tagactcgag agttatgaat tctgccaatg ttgatagaaa 1140 atgccatttg cagataagct aattgctttc agaagtcatg ctagaaagtt tacgaagact 1200 agttgtatgt ttaactatct ataatttatt aaacgtgttg tcttaagttc ataggatatt 1260 ggggcaccta aaaggtacaa atataacagt taataaaaag caatttacag ttgaccagtt 1320 attttttatt actcttttgt caagcttgtc gaattgtcag tttagcgaag gtgacaagtt 1380 tatttttgct ttagtgcgtt tgttaaacag ggtagaacga tattgccgtc atcatgacaa 1440 gtgtgaaagg cagtatgtac agttagaagc ctttcaagcc ttaaaagaac gattgaatac 1500 acaacaactt cagtttcaga aggtactttt ttatggcaaa gcataagaaa gcataatata 1560 atctttctga atgttatttt gtaaattaaa ttaggtattt atcatttaga tatctaataa 1620 aaattgtact gtatgtccct tttttttcct ctgtagacgc atgttgttga tacaaaaatg 1680 ataattaagg tgcagtatga aaatcgcaag aaatacatca aattacaggc tgctgatttt 1740 gatgaattca tttctcaagg taaaattccc aataacaatt tgagtaagta ataacatgta 1800 ctaggttacg gaaccccgga agggacatag tggaggagaa aaaattggct ggttaaaaaa 1860 aaaaaaaaaa aaaatgggtg aaagaagaaa aaaaatgggt gggaggaaaa tatatattta 1920 tattataaat atatatttat atttatttaa tatatatata tatattaagt ttttgcatta 1980 tctcgcaaaa atgtttctcc acaaacactt cctgttcact tcactcacgt aaaccctccc 2040 gttttttgcc gaaattctcc catattttac cattctattc cactttcttt aatatgcaat 2100 atatatgcaa cctctgaacc agtgaatgca tgtataagcg ctgactgaca gacgcgctct 2160 gtacaataaa ctgattccag atcagcgtct gtacgtgcca tttaaaggga cacttctgtt 2220 atcaaacaag tatgtttaca tattattcag tgtatttgtc tgtttaaaca tacacccgct 2280 cctctgtaaa tggcgtgtac agaagctggt ctggcatttg ctgattcaga ggttgcatat 2340 aaaaggtgac gatcgacgca cgtattaatg tagagaaagt gggatagaat ggtaaaatac 2400 aggagaattt cagcaaatcg ggagggttta catgattgaa gtgcacagaa agtgtatgtg 2460 gagaaacagt ttgcgagata acgcaaaaca tttttgcgtg ggaacacaaa actttgtgag 2520 agaacgcaaa aacttaaaaa atatatattt tcctcctacc cattttttct ttcacccatt 2580 tttttttttt tcacccagcc aattttattt ctcctccact acgtcccttc cggggttccg 2640 tactaggtgg aggtaatttg gtaatttaaa aaaatgtaac atgtttctct ttatcatcat 2700 ttagtgagag aaaagttctc cattcctgct agcaacatta cagtggaaga tgactctggc 2760 acggaggtgg atgaaactgt gtttgcagaa ttgtctgcag tggcagggat ttgctttgtt 2820 gtgaaggacg gtttggatca cggtaaaaaa taaatagtag tggtcatttt agataattta 2880 attcactctg tattcatctt acgtccattt tagtggcata gttaaccaaa tgctaaaaaa 2940 aaaaactgcc agcaattcct cactagtgtg taaagacttt tactctgttt tgaagaaaag 3000 gtctcattag ttggcattgg ttaatatgta aattaaaata aatgttaata attaaaatat 3060 tctttattta gtctttatta taaaaagatt accagagtaa caagttacca agattacaaa 3120 tatggtaaca catgtatttg tcattattac tattggataa ttttactaat aaaattttca 3180 ataaattatt actgatattt gatatctgat atatccataa gtatactgca aataagggta 3240 attgcttttt atatgttttg ttttattcag acacatctcg gtcatcaact ccatctgcac 3300 cgctgtcata cagtgggagt tccctctctg ttttgagtag tggcagtgac agtgacttgt 3360 caagacaacc caaacgaatg aaaattgatg aagagccatt acagagtgct ttggccaaag 3420 atgtaagttt atcgtctgaa gtgtgcaatg ggtactaagt gtaaaacaaa aaactatata 3480 gggcagggcg ataattcggt atcgataatt atcacaatat gtattttttt cgataaaaca 3540 ataagtgttc gataatattt atgcagtatg cgtaaggctg cgcaggcatt ttgcagcctg 3600 catttccaga tgccacacgc agtacaagct tacagccata cagtattagt tgcacttgga 3660 ggggtaagaa aaaaataagg taaaaaaatg aagatgctaa taacattaca gacccaaaca 3720 agtgttggta agccagctca actgcccagc agagtttcat catttttgaa agcagtgcct 3780 tacgagaaaa aaaccctgga aaaaaaaaaa aaaaaaaaaa aaaaaaaata tatatatata 3840 tatataattt tacaaaatgt cttaaatcag ctgcactaag cggttggcat tctgaaaatc 3900 tttaattaat atatataaga tttgacattt atcgtgataa ttatcgatat agactgatat 3960 gaaataaatt atcgtgatat gatgattttc ccaacaagca ttactcttgt tggcttaata 4020 aaacaattaa cagtctaatt tacattttag cttataaagc agatcctcca aacaaaatca 4080 ggagggacaa aagtgttgga agaatatgac gaaactggga cattgtgtga cagtacaaga 4140 aggcaaatgg ttaacattct tgctgcctac atggttgaaa tggaagggta agtttgttgt 4200 attaatatta ggcagttaat tgctataaag ctttttttct cttaagtttt tttcgtgtac 4260 tttttcatat tgcgcaagta ttaaaatcaa tttttattta tttttgcagg agaatccctc 4320 agcggagtac taaagaaaaa tatgcattgg gaattgtcac cttgttccct gcattaaaag 4380 atcccttttc aacaaaagga tatgtaagtt atgtatttat tttaacatat tgatttaaaa 4440 tgtatttatc attaagaaaa ggtaatgaat ttgcttgaca cttttaagga ctgtcacaat 4500 aattagtgca tgtgcgtgtg ttgtatttgt taaattatgt tagtgtggat taactaatga 4560 aaaagacaac agtagcaaca ttaatttgaa acattttaat aaaatattat acataatatt 4620 tatactatga aaaagttatg actgtgggat tccttctatt tgcttataat aaagttaaac 4680 caacttatgt ctctgttttt gattaggaac atttttatga tggccaaagt ggctctggat 4740 ttcttgcgtg gaggttaaag acaattcaac gaaagactaa aattgagttc agagagttta 4800 aaatacagaa tgcaggagca ggtggtccaa cccagtaaag ggagctgcct tcggctgctg 4860 atcagttgga tgaagaacgt tgtaaagagt taatttcctt aatgaaccac accactgacc 4920 gagaaactgt cctgcaaaag atgaaggaga ccttctgcta tagacagcgc cttatctaca 4980 atcctgacga gtcgcacaac atcctcacag tgtttccaag gctgttggac acgaaagggc 5040 tggtaagtga tgaaatgatt tccgtgaact acaattcata tctgtaaagc atgagatggt 5100 cagtaatact tttttgttgt tctgttcctg cagatagatc aagattttag cctcctattt 5160 ggtgcagaaa cagctgccaa actgcttgag aagtggccta ccttctataa ggaaaaggtg 5220 aacagagaag cagagagact tactaccacc tcagtgctcc aaagcttgct gaattcagcc 5280 aggaatctgt acaatgatga gtcttccgag gatcatcgag gtatgtcaaa gaaaaattta 5340 atatctaaca gagagctcta atgtgaattt atgaaaaaca gctaataagg gaaaataata 5400 aattgttcta ttgtttctta cagagtggga cagtgatatg gcatcttttt tgctgctcct 5460 gcaccttcta ccacctcagc cttctaaaaa gaaaaaacag aagatcagtg cagctcaggc 5520 catggaacat ctagttgtgt ttcacaaggt ttgtgcagag ctacagtttg tgattaactg 5580 ggcaagataa gcttaattaa tttagtgcat aatctaagat tatcatgatg atctattttt 5640 tcctccagtc aaacaacagt tttgaagaac acttcgcaaa acaggaggga catcgccaac 5700 catacctcct tgcttcagga atgcacaaga gcgccatcag caattacttt attgcaatgg 5760 acaagatgat catcccatgc cagggaacca cctcgttggc agccattgat gaactgttta 5820 aagcacactt cgttttcagt gtaagctatg atgatgcact cagcaacatg tacacattcc 5880 tccagacaac agtctacggt gtagatgttg acaccactaa agaaagtcca aaggtgaagg 5940 agttacgagc aaagttcatg aacagaaact aaaagactat gttaaagtgc tacatttgca 6000 gaacattgca tgatgcaccc agttcattaa ttcagcacct taagtttttt catgggttat 6060 atcctggcaa aaagtttgtt cttgtttgtg cacaagaagg atgctcaagg cagtttaaaa 6120 gttttaaggg ttttaaaatg catttaaata cttgtcatta tactacagat cttgatgcaa 6180 gcagtgatgt tatgcaagta ccacatcagt cagactttgg tgaacagagc tcacaacaca 6240 actcctctgt aatggaccaa gatgcaatca acgaaccatc aacatctctt atgtcaaaag 6300 accaagcaaa agatatgtgc gcgtcaatta ttgcaaagtt aaagggcagt ggcgttgcga 6360 acagtgtagt gttatctgtt gttgaaagta tggaggagta tgttgatgaa attcatgcaa 6420 atcttgaaga acaagtgctc agtgctttac ctgctgaaaa tcaaataaga agtgcagtaa 6480 aagatgtctt tagcaatgct ttcaatccat ttagtgactt aaacacaaat tccaaaatga 6540 caaaatactt cagtgaaaaa tggggtgttg ttgagccaat tgagattcat ttaggagtga 6600 gatttgattc aaaaagaaac aaaaaatctg gagtatatga acaggttcca gtaaatgaca 6660 ctttcattta tgtacccctg ttaaaaacgc tagaatttat tttcaaaaat ccagaagtat 6720 gtagtcatat taataaacct cctgcaacag attgtaactt ataccaagac ttctgtgatg 6780 gaaaatacta caagcatcac acactgtatt ctatgtcaca aaatgctttg caaattcaag 6840 tttattatga tgactttgaa accgcaaacc ctcttgggtc aaaacatggg gttcacaagc 6900 ttggatgttt atattttaca gtccgaaatt taccaccacg tttaaattcg tctttgatga 6960 acattcacct catctctttg tttcattccc aagattccaa aaaatatggc attgacaaaa 7020 ttctttgtcc atttgttgaa gatgtaaaag tgctagaaca acatggaatg aaagtgtcat 7080 ttactgaaca acctctttat ggtacaattg ctcaagtaac aggggacaat ttaggtctga 7140 actcaatcct tggttatgtg gaatctttct ctggaaacta cttttgcaga atgtgtcttg 7200 ctgacaaagg attggctcaa acaatgttta gtgaaaatga tccacgtatg gttttgcgca 7260 gcagattgac aaatgaggag cattacaatt atctttgtga gaatccgagg gaaacgtcat 7320 gttttggctt gaaacggaac agtatattca attctttgtc atacttcagt gtttcagata 7380 attttgtttt agatatcatg cacgatgtct tagagggcgt ggcacaatat gagattaagt 7440 tgttgtttgg ttatttgaat cagaacttca tttctaatga aaacatactc cagcgtgtat 7500 atgcattcaa ttatggtttc atggacaaaa agaaccgtcc aacacgcata aacctgtcta 7560 gtagtggcaa cagtattgga cttaacgcta gtcaaacatt atgccttagt agaaacctcc 7620 cactaatctt cggtgatgtg gtcccagaag gtgacagaca ctggcattta cttctgcttt 7680 taatccacat agtaaacata atattttccc caagtattac agatggaatg attgtatttc 7740 taaaacatct tattcgagag catcaccagc tattcagtga attgtatccc caaaataatt 7800 tgataccaaa acaccatttc atgattcact accctgagtg tatacgccaa attggtcctt 7860 tagttcatgt ttggagtatg cgatatgaag caaaacacaa attttttaag tccagtttga 7920 aaaatttcaa gaacataact aagtcccttg cgatgaaaca ccagatagct gttgcatacc 7980 attgggagtc actctttaca aaagggattg aatctgggcc tgttaagtca aagaaactga 8040 ctgatgttga caatggtcat ttgattgcag aacattttcg gattgatatg ttaagtgaag 8100 taaatatcac tagttggatt aaacatgaag acgttgagtt tcacacaggt cttgttgttt 8160 gcacaggtgt tgttgaagaa ttgccagtgt tcaacaaaat tgtttgtata tttctgagga 8220 atgaagctta ttttttggta acggaaatgg agacctcatt tgtggaacat ttacatgcat 8280 ttgaagttac tgaaaacatg cataatgttt cagttgttat gcctcatgac ttaagattct 8340 ttaagccttt tgatgtacaa atggcttatg gtgcagactc tttgttttat gttgtaccag 8400 actgctgcat tgtgtagatt gcaagataat gttttaagtt ttgtttcagg agattttatg 8460 tacaaggtca ttgtacagag tgttttaagt gtaatgtaaa accatgttaa agaaaataaa 8520 atgtattcat tgcagcacat atttattgta tgagtggtga attgaaatga tacattgtaa 8580 ttatagaatt aacgacaggg cataatgtaa atacaaatgt aaaatatgtc aaatgaacac 8640 tggtgaagtg ttgaaaattt aacctaagag tgttaacaca aaacactgag tggtgttcat 8700 atgatcattt ttggtgttaa tatgttacac tataagagtt tgcaaactaa cactctcgga 8760 gtgttgacta tgttaacacc atcaaaataa cactggtaaa gtgttgaaaa atttacctaa 8820 gagtgttaaa acataacact gactggagtt cagataatcc tttgtggtgt taatatttta 8880 cactataaga gtttggtaat taacactttc tgagtgttga tgacgttaac tctttcaaaa 8940 gtgttatttc aacacttttc cagtggttcc catataaact ctgagaaagt gttaaattta 9000 actctaaggt agttaaatct acaatctaaa atttgcagtg 9040 // ID LOOPERN4B_DR repbase; DNA; ZEB; 511 BP. XX AC . XX DT 27-SEP-2008 (Rel. 7.06, Created) DT 19-FEB-2009 (Rel. 13.09, Last updated, Version 3) XX DE A a nonautonomous DNA transposon - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; LOOPERN4_DR; LOOPERN4B_DR. XX NM LOOPERN4B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-511 RA Bao W. and Jurka J.; RT "Nonautonomous Kulobok DNA transposons from zebrafish."; RL Repbase Reports 8(9), 937-937 (2008). XX DR [1] (Consensus) XX CC This is a element member from Kulobok superfamily. The terminal CC ~10-bp is similar to that of Kolubok-1_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 511 BP; 155 A; 93 C; 100 G; 163 T; 0 other; ttaaagggga cctattatgc ccctttttac aagatgtaaa ataagtctct gatgtcccta 60 gagtgtgtat gtgaagtttc agctcaaaat accacacaaa taatgtttta taactctttg 120 aaactgaccc ttttaggctt tgatcctaat tgtgccgttt tggtgactgt cgctttaaat 180 tcaaatgaga ttgtgctctt ttcaaaagag ggcggagcta caaatgcctg tgtgtcagca 240 tagtggcaga ttcaaaaaca agactaacgt cctatgctaa tgagggagag atggtcacta 300 gtgggcgggg ctttccccct ctgatgacac gtacaaaggg agaatgtcaa tcaaagtgtt 360 tctgcagact gtttttatca agtgtgatta taaaaaataa taattaaata catttttacc 420 attagaagct ggttatattc acacactgtt gccacacaac tgtgtttaaa ccccttataa 480 aagtgatttt tgcataatag gtccccttta a 511 // ID ERVN1-I_DR repbase; DNA; ZEB; 660 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE An internal portion of the ERVN1_DR nonautonomous endogenous DE retrovirus - a consensus sequence. XX KW Endogenous Retrovirus; Transposable Element; Nonautonomous; KW ERVN1-I_DR; ERVN1-LTR_DR; ERVN1_DR; KW nonautonomous LTR retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-660 RA Kapitonov V.V. and Jurka J.; RT "ERVN1_DR, a miniature nonautonomous endogenous retrovirus."; RL Repbase Reports 4(12), 311-311 (2004). XX DR [1] (Consensus) XX CC ERVN1-I_DR is an internal portion of the ERVN1_DR miniature CC nonautonomous endogenous retrovirus. Long terminal repeat of CC ERVN1_DR is deposited in Repbase as ERVN1-LTR_DR. Some copies CC of ERVN1_DR are flanked by identical LTRs. Average identity of CC ERVN1-I_DR copies to their consensus sequence is above 96%. CC The internal contains PBS complementary to the Phe-tRNA. CC ERVN1_DR is characterized by 5-bp target site duplications. CC Since there are no autonomous LTR retrotransposons similar to CC ERVN1_DR, its exact classification is not possible. XX SQ Sequence 660 BP; 119 A; 203 C; 215 G; 122 T; 1 other; gtggtgccga aacccgggag aaggagggac gcgttgtcca gatgccctcg ccgctgagag 60 agggggcgag tgttcgggag ttaacccgag tgggggagaa acccgttcgt ctgcccagag 120 tgccggggcg gcctgaggct ggcgaggaaa ggatctgcct gctgccgtct cccagcatgg 180 ggaggagcag gggacgcggg gctcaccacc ggaagggaaa gtgctgaagg gagccatccc 240 acagaacccg ggggagtcgc cgcgatccgc catgatgcca gagcctgcat ccrtccaccg 300 aagcgggagc ggagcaggag accagaaatg ccgccagcca gccaggagaa ccgccgccgg 360 acgccagaag acggacgggt ggatagattc ctccacaggg ctcccagcac gtcgtcgcat 420 ctctcagctg gtggaggacc gagtggcagc gtgtctggag aaccgaaccc caattttttt 480 tttttttttc ctctctcccc tttctcggtc tactctgtct ttcctccttt tttttccatc 540 tccttttctc tcggcccgtc tcatataggt ctcgcgggtg ctgtggcgcc aaaccccccc 600 ccccccccgg agtatgcaca gccacggggt gcccccggcc tgtgatggga gatgggggta 660 // ID Gypsy78-I_DR repbase; DNA; ZEB; 5828 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy78-I_DR; KW Gypsy78-LTR_DR; Gypsy78_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5828 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1471-1471 (2008). XX DR Genome; chr12; Positions 39484973 39490800. XX CC Positions [2583-3122] - Reverse transcriptase CC Positions [4425-4901] - Integrase core CC 'TAAT' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 475..1473 FT /product="Gypsy78-I_DR_1p" FT /translation="MSTLATLNNTNTRSYVYIPRERQIIPYCGDSSKDSQT FT IDEFIEELERVIRVRGLSDEDPEDFILSHLRGSALDEVKLCMGGEDVSPQD FT LFTYLRDAFREKRTIPQLLHAFYAKIQLEGEDIREYSHSLSTMLNAAIRQS FT PSAVPDTQLALRDQFVEGIRDPMLRRKLRKLVRDRPRVTLFEVREEALLWC FT VEERPRGTNVAKSRYLRGRNAEEGVECTHGSAMVASSSSGDLNAVLQDVVK FT VVAQQGKAISELTDAVRGLTTQRATVGFDRPQRDRVTPRYTKDGQPICLKC FT EGVGHLARQCTAPRKSVGQVAIPAETHVTGNVNPPLCRAEQ" FT CDS 2328..4319 FT /product="Gypsy78-I_DR_2p" FT /translation="MTQTHGDLQGPLDKVRMGGTPEQQAELQALLNQYADV FT FAVGDEDLGYTDRVKHDIPVTDDVPVSQPYRRIPPNQFEEVKEHISGLLRK FT GVIRESSSSYASPVVLVRKADGSLRLCVDYRKLNLKTRRDAFPLPRIDESL FT DALCKAQVFSTIDLASGYHQVAVHEKDQHKTAFVTPFGLFEYSRMPFGLCN FT APATFQRLMQAVMGDLVFQIVLVYLDDLLVYSSTFKTHLVNLEVVLRRLRE FT SGLKIKVEKCHFLQTEVRFLGHQISAHGVGPDPDKVSAEEKWPVPKTAKEL FT QSFLGFCSYYRRFIAGFAKVAGPLHDVVNEFTKKRSGVQSDQSFQAAWTSE FT CQKAFERLKTELTCAPVLGYADFSLPFVLETDASSWGLGAVLYQYQGGKKK FT VIAYASRRLRGAERNDRNYSSLKLELLALKWAVVEKFRGYLLGSWFMVLTD FT NKPLCHLNTARLGAVEQRWVAQLAVFNFEVRYRPGRCNTAADALSRRAGLE FT EEEEPSEDAEFDHCVAICNGVHCGTTVEPDLGTAGPVLGSTLEEEVWPNSD FT FGNTPTLPGYTREQLCQFQDADPFLKLVKQFWRCQKKPSSQERRGLPRQVR FT SLLKQWDKFKEDRGLLYRVVDDAFLGECQQLVLPSFLIEPVLQSVHEQMGH FT QGTERTLSLLRPRCY" XX SQ Sequence 5828 BP; 1622 A; 1153 C; 1584 G; 1469 T; 0 other; tctatttggt accaggagtg gggttaattt cctcagcagg tttccatttg tgggtagtat 60 aattagtgta tataaaaata tagataaata aataataata aaatacataa atatgaagtc 120 taagttttgt tagttaagtg aggaaaataa acaacggtag tggtgaagaa gagtcacgct 180 tcagagactg tgtagagagc tgtgtgcgac atccggacga cagggaggac gaggagtcag 240 tgactggacc tgctggattg aggaggcttt agaggactgt gttttgtttc aaacttctag 300 agacttgtgg actgagaaat cgtgctatgg aggcagaagc acctaatgta agtgtgagtg 360 gagaggaagc ctgggaagaa ggcggagcca ctgatagaga ggaggacagt gggagaccct 420 cttttgcagc tcaggtcaca aattcaggag ctgggaaaaa gacatgatca ggtgatgtca 480 acattagcta ctctgaacaa tactaatact agatcttacg tctatatacc tagagaacga 540 caaattattc cttattgtgg ggattctagt aaagacagtc agaccattga cgaatttatt 600 gaagaactgg agcgtgtaat tagagttagg gggttgagtg acgaggatcc agaagatttt 660 atcttgtcac acctgagggg atcagcccta gatgaggtaa aactctgcat ggggggggag 720 gatgtaagcc ctcaggatct gtttacttac ttgagggatg catttcggga aaagcggact 780 attccgcagc tgttgcatgc cttctatgct aaaatacagt tagaggggga ggatattaga 840 gagtattctc attctttgtc gacaatgtta aatgcagcaa tcaggcagtc accaagtgct 900 gtgcctgata cacagctagc gctgagagat caatttgtgg aggggatcag ggaccccatg 960 ctccggcgta agttgcgtaa actagtgagg gaccggcctc gtgtaacatt gtttgaagtg 1020 agagaagaag ccctcttgtg gtgcgtggaa gaacggccac gtggaacaaa tgttgcaaaa 1080 agtcgttact tgcggggtcg taatgctgag gagggtgttg aatgcaccca tggttcggcc 1140 atggttgcat catcgtcttc tggtgattta aatgcagttt tgcaagatgt tgtgaaagtt 1200 gttgcccaac agggaaaggc aatcagtgag ttaactgacg cagtccgtgg tcttaccacc 1260 caaagagcca ctgttggctt tgacaggccc cagagagata gagttacgcc cagatacact 1320 aaagatggtc agcctatttg tttaaagtgt gagggagtag gacatttagc ccgccagtgc 1380 acagcaccac gcaagtcggt aggccaggta gcgattcctg cagaaacaca cgtgacagga 1440 aacgtcaacc ctccattgtg cagagccgag caatgagagg ggtagctcat ggctcagtca 1500 gtgatactct gaccaaagaa cgattcttag agcgggcact gggtccatgc cctgaagtgg 1560 aaattactat agggggaata cctgtcaggt gtcttttaga cacgggtagt aatgtcagta 1620 cactcactga aagttttttt aggaaacact tgcatgggga ggatagagac atgcacagca 1680 cctccaagtg gttacgcatt accgctgcta atatattgcc tttgccatat ttaggatatg 1740 tagagttgga tatccaggtg atgggtgtaa ccataccggg ctgtggattt ttgattgtcc 1800 gagaccatct ggaggaagaa tctgatcttt cacctccagg tattcttgga atgaatattg 1860 cacagcgatg cagacagcta atcttagttg aatttgataa tgcgctaaac ggagcattaa 1920 actcagattg gcggtatgca ttgagtaaag ttcaggaagc gacatctgaa gggtcgaaat 1980 ccacggttcg aatagctgga aagttaaaaa cgcatgtacc tgcaggctct gttacaacag 2040 tacaagttcg agtacataaa gagttgccta agactaatgg ggtggtcata tttgaactgg 2100 ggaagatgtc attgccgggg ggtttagttc tcttgcctac gatagtgcct gctaagcgag 2160 tgtttcctgt acaggtctta aatctgggtt cttaggacct gtggctgccg ccaaaggcga 2220 gggtgggggt tttatcccaa ggccgtcagg tggaaagtga tccctgcagt gttgcttttt 2280 aacgtatttc ccaagaccat gaggtagtaa cagttagctt gaggtcaatg acacagacac 2340 atggagactt acagggtcct ttagataagg tgcgaatggg gggaactcca gagcaacagg 2400 ctgaattgca agcattgttg aaccagtatg ccgatgtgtt tgcagtaggt gacgaagatc 2460 tcgggtatac tgatcgcgta aagcatgaca tcccggtaac tgatgatgtg cccgtctccc 2520 aaccctaccg acgtattcca cctaaccagt tcgaagaagt taaggagcac atttctggac 2580 tgctcaggaa aggagtcatt cgggaaagtt ccagctcgta tgcgtcacca gtagtcttag 2640 tgaggaaggc tgatggaagc ctacggctat gtgtagacta ccgtaagctg aatcttaaga 2700 cccggcgcga tgcgtttccc ttgccaagaa ttgatgaaag cctggatgca ctgtgcaaag 2760 ctcaagtgtt ttctaccatt gacttggcaa gtggctatca tcaggtggcc gttcatgaaa 2820 aggaccaaca taagaccgcc tttgtcacac catttggttt attcgagtat tcacggatgc 2880 cctttggcct ctgtaatgca ccagcaacgt tccagagact tatgcaggct gtaatgggtg 2940 atttggtgtt ccaaatagtt cttgtttatc tggatgactt gctggtctat tcaagcacgt 3000 tcaaaactca tctggttaat ttagaagtgg tactccgccg cctgagggag tctggattaa 3060 aaataaaagt ggaaaaatgc cattttctac aaacagaggt gaggtttctt ggacaccaga 3120 tatcagctca tggtgttggt cctgatcctg acaaggtcag tgcggaagag aagtggccag 3180 tgccaaagac agcaaaggaa ctgcagtctt ttttggggtt ctgtagttac tatcggcgat 3240 ttattgcagg ttttgctaag gtggccgggc ctctgcatga tgtcgtgaat gaattcacca 3300 aaaagagaag tggggttcag agtgatcagt cctttcaggc tgcttggacg tcagaatgcc 3360 aaaaagcctt tgaacgtctt aagacagagt tgacctgtgc tccggtgcta ggatatgccg 3420 acttttcact tccattcgta ttagagacag atgcaagtag ctggggtttg ggcgctgtct 3480 tataccagta tcagggtggg aagaagaagg taatagctta cgctagtagg aggctgagag 3540 gtgctgaacg caatgatcgt aattacagta gtctgaagtt ggagctttta gccttaaagt 3600 gggctgtagt ggagaaattt aggggctatc tgctgggttc ctggtttatg gtgcttacag 3660 acaacaagcc attatgccac ctcaacacgg ctaggttggg ggcggtggaa cagagatggg 3720 tagcccagtt ggctgtgttc aattttgaag ttcgctaccg accaggacgg tgtaatacag 3780 cagcggatgc tttgtccagg agagcagggt tggaggaaga ggaggagcca tctgaagatg 3840 ccgaatttga tcactgtgtg gccatctgca atggggtaca ttgtggtacc actgtagaac 3900 cagaccttgg cacagctgga ccagtcttgg gaagcactct ggaagaggaa gtttggccaa 3960 attctgactt tggaaacacc ccaacattac cggggtatac tagggaacag ctctgtcaat 4020 ttcaggatgc cgaccccttc ctaaaacttg ttaagcagtt ttggcgttgt caaaagaaac 4080 cttcctccca agagagacga gggctacccc ggcaagtgag atcattactt aagcaatggg 4140 ataaattcaa ggaagataga ggactgttgt atcgagtggt tgacgatgct tttctgggag 4200 agtgtcaaca actggttctt ccttctttct taattgaacc agtgctgcag agcgttcacg 4260 agcaaatggg gcaccaaggg actgaaagaa cgctgagttt acttcgacca aggtgctact 4320 gagcaggaat gtatgaagcc gtagagacat gggtgaagta ctgtcaacgg tgtgtgcttg 4380 ctagaatgcc acagcctaaa attcgaccat cttggacacc tttcttggct tctagaccat 4440 tggaagtggt ggctgttgat tttactacac tcgagccagc ttctgatggt cgggaaaacg 4500 ttctagtgat caccgatgtc tttactaaat tcagccaagc attcccaacc agggaccaaa 4560 aagctgatac cacagccaaa gttcttctaa aggagtggtt tcttaaatac ggggttccag 4620 agagactaca ctctgaccag gggagaaact ttgagagtgc agttataggt gaactttgca 4680 aattgtatgg ggttcggaaa tcaaggacca ccccatatca tccgcaagga aatccccagt 4740 gtgagcaatt taatcgcacc ttgcatgact tgttacgctc gttgccacct gagaagaaaa 4800 ggagatggcc tgagcatctg tcagagctgg tatatgttta caatgtaaca ccacactctt 4860 caacaggata ctccccattt tatatgctgt tcgggacaca accacacctt ccgattgatg 4920 tgttgttggg gcaggagcct gacgaaggca cacaatcaag ttggttgtca atacacagag 4980 aacgactcca agaagcccat ctgagagcaa aggagtacgc aaagaagaga gcggaggaac 5040 gagctgctag acatgagagc aatgtctatt gccctgaagt tgaagttgga caaaaggtgt 5100 acttaagaca cgccctatgg gaagacataa aatccaggat gcatgggcac cggatgttta 5160 tatcgtgaga gatgttcaag gaactacgca cgtagtagaa cctgagggag ggggagagca 5220 aaagaggata catcggtgtc accttcgacc ttgtccatca cagcttccag taccaaagcc 5280 acgtagcaag gtacctccaa tgaagaaaag ggtgggtcaa cgtgatgagg agattcagtt 5340 gccaccagat gttgagtgcg tagtggtgga agagacatgg ccagccttgg aagaaccgac 5400 agcaatgtta atgaaagaca ttgaggagcg tagttcacag gtggaagatt tagaggaatc 5460 aaggttgaat gaacttgaaa atgaggaaat ccttgtgtct gagaattgtg gtgtcgatat 5520 cctacctccg aatcccagtc caatgacaac tagtgaatat ccagttcctg ctcctaggac 5580 tcgttcactt ccttcagtgg agcagaaagt acctccacca gtgccgcgac gttctcaaag 5640 atcaactgcc ggtcaacata ataatcctta caatttgcct aggtcagctt gcaatgcagt 5700 atcgtttagc cctgacattc tgtctcaggt tttagcaggc atagtgatgt acacatctga 5760 aaaactgaaa ggtgctgtag atacttagta agaagtcccc gaggacgttg acttggggca 5820 ggggagag 5828 // ID DNA16TA1_DR repbase; DNA; ZEB; 533 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE DNA16TA1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA16TA1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-533 RA Kapitonov V.V. and Jurka J.; RT "DNA16TA1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(5), 8-8 (2002). XX DR [1] (Consensus) XX CC This element is characterized by 16-bp terminal inverted repeats CC and CC the TA targets site duplications. CC Its classification is not certain yet, although it CC is expected to be a member of the Mariner/Tc1 superfamily. CC DNA16TA1_DR and DNA15TA1_DR share identical 9-bp termini. CC There are several hundred copies of DNA16TA1_DR in the zebrafish CC genome; they are ~91% identical with the consensus sequence. XX SQ Sequence 533 BP; 194 A; 84 C; 67 G; 186 T; 2 other; cactgtaaac cctaatgttg tctttactta aacaatcaag taaagttgac tgaacattac 60 ttaaaattta gcattttggt cctgtcactt aaaaatgagt taatttaact tatatacaat 120 gaaaatgcat aaacttaaga tttcaagtat agtaagctca aaatcacaat taatcctttg 180 aaaaaaaata ggttattttt acaaatgtag tcttgactgt aaaattctaa tatgtgcaac 240 cttttagtgc gcaaggtctt tttttaatca gaaaacaagg cttcatgtat aattattcac 300 cataatggga ataaatggct actttttttc ttttaaacat ttttttttat agtatataat 360 tacacacaca cacacacaca cactgaactk aacaatttaa gtataatcta ccaaaccggc 420 aagttaaata aacttaaata tgcaagtttt gggagactcc rttactcaat taaattgagg 480 caacgagttt gctcaattaa tttcgttcag tcaacttatt agggttttca gtg 533 // ID BEL36-LTR_DR repbase; DNA; ZEB; 594 BP. XX AC chr1; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL36-I_DR; KW BEL36-LTR_DR; BEL36_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-594 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1416-1416 (2008). XX DR Genome; chr1; Positions 43699872 43700465. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 594 BP; 173 A; 97 C; 100 G; 224 T; 0 other; tgtgttgccc tttaagaagt ttcactttaa aatattttct ttaaatattt gatatgcata 60 tttcctttgt taaaatgtga aattcataat ttatttaatt aatctacttc gtgtgtaaaa 120 aggaatacct ttattttgca ttatatcata cttcctgtgt agtcacttcc tgttagtgtt 180 cctggctcat tgtggaagcc atgtcagaca tgcgctctct cggacaatct aagccatccg 240 cttaaactca tcatagaggc aatgccgtaa gtttatgttt tgcacattag acatgttaaa 300 gactttatat tagtgattat tttgtaaaaa atacaacgtt taataacttt gtttgctagt 360 gtaattccta ctgcagggag aactgtagtc tgtttatgtt atgtatatgt gagtggaaat 420 acctgtcaaa actcaatttc tgagaattgt tttgttattt tgttgcagtt ttcaccataa 480 atgctaataa gaagagtgac agtaaatgat caaacttgct acgactctgt cttcattggc 540 tatggtttaa cttggtgttt agtcagtcac aacacactgc acgagaacac taca 594 // ID BEL33-I_DR repbase; DNA; ZEB; 6043 BP. XX AC chr22; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL33-I_DR; KW BEL33-LTR_DR; BEL33_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6043 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1409-1409 (2008). XX DR Genome; chr22; Positions 22295484 22289442. XX CC Positions [5046-5627] - Integrase core CC 'GATCA' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(87..4721,4725..5984) FT /product="BEL33-I_DR_1p" FT /translation="MSKPSSKAGSQKRAASSSRSSRHSHGSSAVITAAAMA FT RAEAEAVKAEITFAKKESELKVQQAQLEASLETLRQEKKAAAIQAKAEALE FT TAAELESRGQSSLSELPIEDATERTQAYVSEQMESDVDAEAPVPDGPFYDK FT ESIPFSEMQQPQNTVSYLQQEKQCTSYATPPRSQPSNAENIQNPAISQSTS FT VRFTISGQRKAFTQSPHNVYHPEAQSPAVQGSEERSRHRGTPLSYSQTMSG FT TPLYAGPHQSNLYADSQISDVVRFIARRDLVSTGLTQFNDQPETFRAWRAS FT FINATEDLNISPSEEMDLLVRWLGRESAEHAKRIRAVHINYPLQGLSMIWK FT RLYELYGAPEMVESSLFRRLENFPKISNRDNCRLRELSDLLMEIQAAMQDG FT DLMGLTYLDTPRGVNPIIQKLPFSLQEKWLTVGSKYKEDYRVSFPPFKFLV FT DFVCQQAKMRNDPSFSLTAIHEESHRPNKQLNKHVFPRAVVTHKTQVSSGS FT PLAASELKNKINPAKQCFLHDKPHPLSRCRGLRMKPMEERKSLLKQQGICF FT KCCSSILHMAKNCDEDIKCSECGSSTHIAALHPGPAPWTTDPKQPVAENQK FT VSEDIPVETEVSSACTEICGNSQLPRSCSKVCLVNVFPKNHPEKAVKAYVI FT LDDQSNRSLARSEFFDLFSITERSIAYTLKTCAGTIETEGRIAQGFQVESL FT DGGLTLSLPALLECNEIPDNRSEIPTPEVISKHRHLKHLARYIPEPNHHAP FT ILFLLGRDIIRVHKAHRQVNGPPNAPFAQKLDLGWVIIGDMCLGKVHKPTF FT VQAYHTNILDNGQSTHFQPCPNNYNIKERSQSVSLTKPLIRENTQRLTEWD FT VFQKTKDDEKTAPSIEDKLFLQIMDREVYKDDENSWVAPLPFKEPRQLLPN FT NRPQAVERLKSLLRSFRRKPEIKRDFVAFMEKLFENDHAEIAPSVRPNEQC FT WYLPSFGVYHPKKPSQMRVVFDSSALYKGVSLNKVLLTGPDLNNSLLGVLM FT RFRKEPVAFMVDIQQMFHCFRVRPADRNYLRFLWFHENNPENELTEYSMKV FT HVFGNSPSPAVAIYCLRCAAQEGETDFGEDARQFVEEDFYMDDGLKSLPSP FT EMAISLLKRTQDMLAGSNLKLHKLASNSKKVMQAFSSEDYANSLKELDLGV FT NSLPMQRTLGLLWNLETDCFNFQVQSDNRPYTRRGLLSVVNSLYDPLGFVA FT PITVQGKLLLRELTANATDWDSPLPASGQELWDSWKNSLQELQHIQVPRSY FT GLVSPSGAKFTELSVFSDASEKIIAAVAYLKTVDSNGNCHAGFITGKARLA FT PLPELSIPRLELCAAVLSVNMMETIIAEIAIRFDEINLYTDSRVVLSYICN FT EKRRFHVYVSNRVQRIRSRTKPEQWHYVSSDHNPADIATRPITPTKLQDTI FT WFSGPAFLHQLKTETVPKAPFPLVDPDRDVEVRTHMTLSNTSFSRRFERFS FT VWNRLIRAIGKLIHIAHSYKPDQKGHTPACKGWHCCKLPPSADTIIQSTKV FT VIRAIQQETYTDELQCLKQNKPLPKTSTPKLDPFVDQSGLLRIGGRLIKAD FT LSSEEKRPLIIPGKSHVALLLIRHFHERTHHQGRHFTEGAIRSAGYWIIGG FT KKRISSLIHTCVVCRKLRRGCETQKMADLPKDRLSAEPPFTHVGLDVFGPW FT SVCARRTRGGFAESKRWAVLFTCLNIRAIHIEVIESLTTSSFINALRRFLA FT IRGPVKQIRSDRGTNFIGACKDLQIPSNVDEKAVERFLSDHNCTWTFNPPH FT SSHMGGTWERIIGLTRKILDSMLLQTSSSRLTHEVLSTFMAEVTVIINNRP FT LIPVSTDPADPFILTPATLLTQKSSTDLVPPENCEKPDLYKQWRMVQNLAS FT TFWDRWRKQYLATLQYRRKWQHQQTNIPKGSIVLLKDSQSKRNDWPLGIIT FT ETYPSQDGRVRKIQVKIIGKDGPKLFLRLINEVVLLLQENS" XX SQ Sequence 6043 BP; 1895 A; 1416 C; 1319 G; 1413 T; 0 other; gtaaaacgaa tatcttcagc aatcagcttc tagtttatga ctccacggcg gggagcaagt 60 cagctcatga ggtgattctc tcaacaatgt ccaagccaag ctccaaagct ggatcacaga 120 agcgagcagc aagctcatca cgctcgtcac gccactcaca tggctcatca gctgtaatca 180 ccgctgctgc aatggcacga gcagaggctg aagcggttaa agcagaaata acttttgcta 240 agaaagaaag tgaactaaaa gtacaacaag cccagttgga agcatcatta gagactttaa 300 gacaggaaaa gaaagctgct gccatacaag cgaaagcaga ggctttagaa acggcggcag 360 aattagaaag cagagggcaa tcaagtttgt cagagctgcc tatagaggat gcaacagaac 420 gcacacaggc ctatgtatca gaacagatgg agagtgatgt ggatgcagag gcgccagtcc 480 cggacggacc gttttatgat aaggaaagca tacctttttc agaaatgcaa caaccacaga 540 atactgtatc gtatttgcaa caggaaaagc agtgtacttc atatgccacg ccgcccagat 600 ctcagcctag taatgcagag aacatacaga accctgccat ttcacagtct accagcgtac 660 gcttcacaat ttctggtcaa cgaaaggcat tcacccagtc acctcacaat gtttatcatc 720 cagaagccca gtctccagct gttcaaggta gtgaggaaag gagcagacat agagggacgc 780 cactgagtta ttctcagaca atgtcaggta cccccttgta cgctggacct catcaaagta 840 atttgtatgc ggacagtcaa atctcagacg tggtcaggtt tattgcaaga cgtgatttgg 900 tgtctactgg ccttacgcag tttaacgacc aaccggaaac cttcagagca tggagagcat 960 ccttcataaa tgctacagaa gaccttaaca tttcaccaag tgaagaaatg gacttgctag 1020 taagatggct ggggcgcgag tcagcagagc atgcaaaacg tattagagct gtccacatta 1080 actatccatt acaagggctc agtatgattt ggaaaagact ttacgaatta tatggcgcac 1140 cagaaatggt tgagagctca ctcttcagga gactggagaa cttccccaaa atatcaaaca 1200 gagacaattg cagactacgt gagctaagtg atttgctcat ggagatccaa gcagctatgc 1260 aggatggcga tctgatgggt cttacctatc tagacacacc acgaggagta aacccaataa 1320 tacagaaatt gccattcagc ttacaagaga agtggctcac ggtggggtct aaatacaaag 1380 aggattacag ggtttccttt ccacccttta agttcctagt agactttgta tgtcagcaag 1440 caaaaatgcg caatgatcca agtttctccc ttacagccat acacgaagag tctcacagac 1500 caaacaaaca gttaaacaaa catgtctttc ccagagctgt tgtgacacac aagactcagg 1560 tttcttctgg gagtccatta gctgcgagtg aactcaaaaa taagataaac ccagcgaaac 1620 aatgcttttt gcatgataag cctcatcccc tgtccagatg tagaggatta aggatgaaac 1680 ctatggagga aaggaaatca ttgcttaaac agcaaggaat atgcttcaaa tgttgcagtt 1740 ccattttaca catggcaaag aactgtgacg aggacatcaa atgttcagaa tgtggcagca 1800 gtacccacat agcagcactt catccaggac cagcaccctg gacgactgat ccaaaacaac 1860 ccgtagctga aaatcagaaa gtgagtgaag acattcctgt ggaaacagag gtctcctcag 1920 cgtgtacgga aatatgtgga aacagtcaac ttcctcgttc ctgctcgaag gtctgtcttg 1980 tgaatgtctt tccgaaaaac catccagaaa aagcagttaa agcatatgtc atactagatg 2040 atcaaagcaa cagatcgctc gcaagatcag aattctttga tctgttcagc attacagaac 2100 gcagcatagc ttataccctc aaaacatgcg caggtacgat agaaactgaa ggcaggatag 2160 ctcaaggctt tcaagtggaa tctctagatg gaggcttgac cttatcctta cctgctcttc 2220 tcgagtgcaa cgaaatccca gacaaccgct cagaaattcc tactccagaa gtgattagta 2280 agcacaggca cttaaaacat cttgctaggt acattccaga acccaaccat catgccccca 2340 ttctatttct gttaggcaga gacattatca gagtccacaa agcccatagg caagtgaatg 2400 gtccaccaaa tgcccccttt gcacaaaagt tagatttagg gtgggtaatt attggagaca 2460 tgtgcctcgg gaaagttcat aaacctactt tcgtccaagc ttatcacacg aacattctag 2520 acaatggtca gtcgacacac tttcagccat gtcctaacaa ctacaacata aaagaaagat 2580 cccaaagtgt ctctctcacc aagcctctga tcagggagaa cactcaaaga ctgacagagt 2640 gggacgtatt tcagaaaaca aaagacgacg agaaaacagc accatctatc gaggacaagc 2700 tatttcttca gatcatggac agagaagtct acaaagacga tgaaaattca tgggttgccc 2760 cactcccatt caaagagcca agacaacttc tgccgaacaa ccgtccacaa gcagtggaac 2820 gccttaagtc acttctgcga tctttcagaa ggaaaccaga gatcaaaagg gactttgtag 2880 cctttatgga aaagctgttt gaaaatgatc atgcagagat agctccatca gtcagaccaa 2940 atgaacagtg ttggtatttg ccttccttcg gggtatacca tcctaaaaag ccatcccaaa 3000 tgcgagtagt cttcgactca agtgctctct ataaaggtgt atcactgaat aaggtcctct 3060 tgacaggtcc ggacctcaat aacagtctcc tcggagtcct catgcggttt cgtaaagagc 3120 ccgtcgcttt catggtggac atacaacaaa tgttccactg cttcagagtt cggcctgcag 3180 acaggaacta cctcaggttt ctttggttcc atgaaaataa cccagaaaac gagctcaccg 3240 aatacagtat gaaggtacat gtgttcggca acagtccatc tccggcagtg gctatctact 3300 gtcttcgctg tgcagctcaa gaaggagaaa ccgatttcgg tgaagatgcc agacagttcg 3360 tagaggaaga cttttatatg gatgacggac tgaagtcttt accctcacca gaaatggcca 3420 taagcttgct aaaaaggaca caagacatgc tggctggatc taacttgaag ttacacaaat 3480 tggcttcaaa cagtaagaaa gtcatgcaag cgttttcatc tgaggattat gccaactcac 3540 tgaaagaact ggatttaggt gttaattcac ttcctatgca gcgcactctc ggcttactgt 3600 ggaatttgga gacagactgt ttcaacttcc aagtgcaaag tgacaataga ccatacacta 3660 gacgtggact gttatcagtg gtaaacagtt tgtatgaccc acttggattt gtggccccta 3720 taacagttca gggcaagtta ttactgagag aattgactgc aaacgctact gactgggact 3780 cacccttacc tgcgagcggt caggagctgt gggattcatg gaaaaactct ctgcaagagt 3840 tacaacatat acaagtccca aggagctatg gactcgtatc accatcaggt gccaagttca 3900 cagagctttc tgtgttttca gatgcttccg agaagataat cgccgctgtc gcttacctaa 3960 aaacagtcga ttcgaatggt aattgccatg caggattcat cacaggcaaa gcaagattag 4020 cgccacttcc agagctctcc ataccacgcc tagaactatg tgcggctgtg ttgtctgtaa 4080 acatgatgga aacgatcatt gcagaaattg ctatcaggtt tgacgagata aacctctaca 4140 cagatagtcg agttgttctc agctatatct gcaacgagaa acgtagattt catgtctacg 4200 taagcaaccg tgtacaaagg attcggagta ggactaagcc agagcagtgg cattatgtga 4260 gctcagatca caaccctgcg gacattgcga cacgaccgat aactccaact aaattgcaag 4320 acacgatttg gttcagtgga cctgcatttt tgcatcaact gaaaacggag actgtaccta 4380 aagcaccctt tccacttgtt gatcctgatc gagatgtaga agttcgcacg catatgacac 4440 tgtcaaatac ctcattctcc aggcgttttg aacgtttttc tgtctggaac cgcttaatca 4500 gagctattgg caagttgatt cacattgcac actcctataa gccagatcaa aaaggtcaca 4560 ctccagcgtg caaaggttgg cactgttgca aactgccccc gagtgccgat acaataattc 4620 agagcacaaa agttgtcatt cgtgcaatac aacaggaaac atacacggat gagttgcagt 4680 gcctaaaaca gaacaaaccc cttcctaaaa caagtactcc ttgaaaacta gacccctttg 4740 ttgatcaaag cggactgttg agaattggtg gtcgcctgat caaagcggat ctaagctctg 4800 aagaaaagag gcctctcatc ataccaggca agagccatgt agccttactc ctgatacggc 4860 actttcatga gcgaacacat catcaaggcc gccactttac agagggtgca ataagatctg 4920 cgggctactg gataatagga ggcaaaaaac gcattagcag tcttatccat acctgtgttg 4980 tatgtcgtaa actcaggagg ggctgtgaaa ctcaaaagat ggcagatctg cccaaagacc 5040 ggctcagtgc agagcctccc ttcacacatg tgggactgga tgtgtttggc ccttggtcgg 5100 tatgtgctcg aaggacaaga ggtgggtttg cagaaagcaa aagatgggca gtgctgttca 5160 cgtgtctaaa catcagagct atccacatag aagtgatcga gtccttaact acttccagct 5220 tcattaatgc cttaaggagg tttttggcaa tccgaggtcc agtaaaacag attcgctcgg 5280 atagaggaac caatttcatt ggagcttgta aagacctgca aataccatct aacgtggatg 5340 aaaaggcagt ggaacgattt ctgtcagacc ataattgcac atggacattc aacccccctc 5400 attcatcgca catgggtggc acatgggaga ggataatcgg acttacccgc aaaatcttgg 5460 attccatgct tctccagacg tcatcctcca ggcttacaca tgaggtactc tctaccttta 5520 tggctgaagt cactgtcatt ataaacaaca gacctttgat accagtgtca actgatccag 5580 cggatccctt tatcttgaca cctgctaccc tacttaccca aaagtccagt accgacttag 5640 ttccacccga gaactgtgaa aagcctgacc tatacaaaca gtggcgtatg gtgcaaaatc 5700 ttgcaagtac cttttgggac agatggcgaa aacagtacct tgcaacacta cagtataggc 5760 gaaaatggca acatcaacaa acaaacatcc ccaaaggaag catagtcctg ctcaaagact 5820 ctcagtccaa acgaaatgac tggccgctgg gaattataac agaaacttac cccagtcaag 5880 atggacgagt cagaaaaatc caagtaaaaa tcattggaaa agatggaccc aagttgttcc 5940 tcagactaat aaatgaggtc gtactccttc ttcaagagaa ctcttaatag actgctcaag 6000 acttcaagaa aggtttggca ttcaaggaat accagacggg gag 6043 // ID DNA-2-2_DR repbase; DNA; ZEB; 407 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-407 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1218-1218 (2008). XX DR [1] (Consensus) XX CC Copies are 98% identical to the consensus. The TSD is 2-bp long. CC The TIR is 186-bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 407 BP; 99 A; 109 C; 106 G; 93 T; 0 other; ccctgctgaa aaatccagct taaaccagcc taggctggtt ggctggtttt agctggtcga 60 ccaggctggt tttagagggg ttttggccac ttccaggctg gtttccagcc atttccagcc 120 tggtcttagc tggtcaggct gggagatgac cagctaaaac cagcttgacc agcctagcca 180 agctgggagt ccagccaaaa ccagctatat ccagcttaaa ccaggctggt caagctggtt 240 ttagctggat ttggctggtc attttccagc ctgaccagct aagaccaggc tggaaatggc 300 tggaaaccag cctggaaatg gccaaaaccc ctctaaaacc agcctggtca accagctaaa 360 accagccaac cagcctaggc tggtttaagc tggatttttc agcaggg 407 // ID Gypsy81-LTR_Dr repbase; DNA; ZEB; 769 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.05, Created) DT 17-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-81_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy-81-LTR_DR; KW Gypsy-81_DR; endogenous retrovirus; Gypsy-81-I_DR; KW Gypsy81-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-769 RA Dib M.R. and Naveira H.F.; RT "Gypsy81_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 957-957 (2009). XX DR [1] (Consensus) XX CC Gypsy81-LTR_DR is a long terminal repeat of the Gypsy81_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy81-I_DR. XX SQ Sequence 769 BP; 194 A; 160 C; 129 G; 286 T; 0 other; tgtaacagga ttattttgga tatattgatt tcactatatt ttcctcctgt ttacttactg 60 ttcatataaa attgttttgt gttcacaata taaatgtaaa aagagtctgt aaagtgacag 120 tggtttcacc tcttcccccc cccccccctt gggttaacgt caaggtggta tagtccaagc 180 agccatacct gttgcactat tgcttgagcg gtcttttcca ccttttcctt gaactaagga 240 ggtatgctct acctgttcca aattgaactt ggctctggcc ataattaaca cacacacaca 300 aatgcatatt taagtttaat tttaatcatt tgttatattt actgatataa aaacagtttg 360 ttatattgta gaatgttatt tttgttgaca gtgtacattt aatgtaatga gctgaagctg 420 ccgatctgtc ataataacat tctctgtgca ggtatgcatt tatgattttt attttagaca 480 atgtaatatg tggattgtca ctgttcacat gctgattttt gtaattgtga gaactagttt 540 tctttccctt ttcctgtctt cgcgtccttc cgcggcgcct ctcgctcgtt tcctccatct 600 ttcattcgcc tttctttctt ttccaccttt tccttgaact aaggagtgta catttaatgt 660 aatgagctga agctgccgat ctgtcataat aacattctct gtgcagactc aaaatggtgt 720 cctctcgtca ttcactacac aacgcagagc aaaagagagt acggttaca 769 // ID DNA2-3_DR repbase; DNA; ZEB; 1245 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 27-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW DNA2-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1245 RA Jurka J.; RT "hAT-type families from zebrafish."; RL Repbase Reports 8(9), 931-931 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1245 BP; 455 A; 163 C; 158 G; 469 T; 0 other; ggtaacactt tataataact acacactata aatcatttat taagcattag caaatagtga 60 attcattatc cgttaagcat taactctaca ttaataaacg ttagtaagca gtttataact 120 gcagctacaa atgctctatt cttgacttat aaccacattt aaaatgtgct taataattgt 180 attttcatac ttagtaaatg atttattttt cattactaaa ttaagtatcg cattatttac 240 aaaccagttg tatttaagag tagttgaggg tttttaggat cattcagaat gagttagtaa 300 atgattaata aactattgaa atcaacgttt atatgtctta ttattcaggc atatattaat 360 ggttactatg tatgttaata aatgctttat taactcaact tcacctagtt ttgtgaccta 420 atctaaagtg aggactattc atgctttata aatcctttat aaatgacaaa taaaggctca 480 gttaaattct aaacaggaaa aatgacatta ttcattccta ttcaattaaa gatacacaaa 540 tgaaactgta cttcaaacga aaaataaatc tttgcaacct gatctaaaat aaaatcactg 600 tagagtttaa acattacatt ttattacatt gtttaattat tatattgttg ttgttttatc 660 cagttttatg tcgtattttg acactcctgt tattcggcaa tgtttaaact ttacagtaat 720 tttacatttt agaaaagatt gcaaagatta tagttcattt gattctgagc ctttatttgt 780 catttataag ggatttataa agcatgaata gtcctcactt tagattaggt cacaaaactg 840 catgaagttg agttaataaa gcatttatta acatattagt taactattag tatatgcctg 900 aataataaga catataaacg ttgatttcaa tagtttatta atcatttact aactcattct 960 gaatgatcct aaaaaccctc aactactctt aaatacaact ggtttgtaaa taatgcgata 1020 cttaatttag taatgaaaaa taaatcattt actaagtatg aaaatacaat tattaagcac 1080 attttaaatg tggttataag tcaagaatag agcatttgta gctgcagtta taaactgctt 1140 actaacgttt attaatgtag agttaatgct taacagataa tgaattcact atttgctaat 1200 gcttaataaa tgatttatag tgtgtagtta ttataaagtg ttacc 1245 // ID DNA-1-9_DR repbase; DNA; ZEB; 2445 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2445 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2130-2130 (2008). XX DR [1] (Consensus) XX CC TIR is 28-bp long. The target of this transposon is CWG; TSD is CC 1-bp long (the central W). CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2445 BP; 807 A; 441 C; 437 G; 759 T; 1 other; ggggggtatt ccagaaagct gggttaactt accatctgct aaaacctaaa ctctcggttg 60 atttacccaa aacctgctta ccgcgagtat gttggttcca aaacaggtga caagagttag 120 ttaaatcaac ctcctgaaga taaccgcggg ttaatgcgcg tgcacggcac acacctgaag 180 gcattttcag tagatcgccg aattcacgag tcaccataga aagtcacggt gagaaaaaaa 240 gggcaacgca ctttacagag gcgaagctgg aggttttaaa cacagagatt acacatctgc 300 ttcaacgaag gaaagaaata tagaataaga aatagataag aaaataaaaa ataaataata 360 agaaaattta ttagttcatt aaattacaca tgccaccctc cattttattg taatgataaa 420 ttaaactact ctttaacatc ctttttgagt atgttgtgta accattcatg catttatttt 480 tcttgccatt aatttcttaa ggccacaata tgtgtattga ctaataaggc ttatagaaat 540 tgtaatcttt ctggagctag aactctgtcc aaagtcatta aacacagaaa cattatcata 600 aaaggtaata tttaattact ggagctcttt atttaacact ctagaatatt ctttctgaca 660 tgcagctaat agaaagaggc ctgtctaact gctgtgtcct ataataaagg cttttcagtg 720 gttgagtttt tgacttaaac cacacactct gtcactgttg taagtgattg ttgtcagata 780 gctttaggtt ctttttattt tgcatgcaga cgaagttcct tctgcctcaa ctaacaagca 840 gtgtaagaat tttctgcact tttccatatt tgaatattac tcttggatca caaaatcttt 900 tatctacata tttaaattct aaagtagctg tataaggtcc atcttcaaag ataaataaag 960 tcacatgagc atcaacttaa ggtgagaaat gtatatgaaa aatatgttac acgttgaaag 1020 gtgcgtgtat aaagcagtgt acaaaatcat tcattttatt tcaggaaatg aagaaatgaa 1080 cagaccaaac tatttgatta aacatagttt tttgactaat tatatatttt tttagcactc 1140 ttccatctgt tttgtctgct tgggtcactg agatttcctc tgggtcaggc tgcaggtagg 1200 gtgtcatcgt ataaggcaga aaagggtatc ttttgtcctc cagcaagtaa caatgtagtg 1260 cccagtaatg attctattgt ataatacaaa tataataaaa ataaacattg tgttaagcaa 1320 catttgcttt atagtttttt tttttttttt actatatttg tgttatacac ttgaatcatt 1380 acttcctgaa aatcagccat ttagcctcaa catatagtga tgaggtgggg tgtgtgatga 1440 gctggtaagt ggaagcagta atgagtttaa tagttgaaac accaaaagat taaggataga 1500 aaattaagtt gtacctgcac acttatactt tggacaaatt ttatattatc agtgtccttt 1560 agtgactgat ggagctttaa aatcggctgc aattaatgca cacaatagca tttagaaacc 1620 ctgaaaaaaa tattatatta aaaaatgtag gtgtgtgtgc gtgaatgtat atatatatag 1680 aacgacgtcc tacatcttaa attatagata tattttccta caaaagacaa agctgccact 1740 tcttataata ttatacagaa aaatgtgagg gtgcagaagg agagaaaaaa cacaagatct 1800 gtaagtgaga ttcgaactcg ctccgttgtg cgctccgttg ttctttatcg acgcactgtc 1860 cactacgcta ttgccgtgct ctgaatcatc tgtgtaatga tacgcagtga tttatgacca 1920 aggaactgta caaaagtgta aaaaacagtt cagtgccagg catactgtac aggtggtcct 1980 actgccacat tgtacagttg cctaaaatgc cctgacgcga atacaaagag ctgcactgaa 2040 tgttttttat ggtaagcgca gacccgcggt ttgtcacatt tgatttcaaa aaggtttgtt 2100 aaatacatta acattatgaa tttggttatg aaaaaaacta tgcacttata accctctcac 2160 aacgaaaaaa cttgtatctg tgtgtccaca tccataaata ttctcatcaa aagagcacgc 2220 aatgctcagt aaactctctg aaacagacaa actctgraac atagcaactt actctctgaa 2280 cataacctgc tccggagcag gttatcttca gagagtaagt tgctatggct acttaacata 2340 cccagaagtt acctccgatt ttggaaccaa agttgaggtt atccacctac ttactctcaa 2400 acttacccgg gtatgtcaca taacctgctt tctggaacag ccccc 2445 // ID HATN14_DR repbase; DNA; ZEB; 255 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN14_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000220; HATN14_DR; nonautonomous DNA transposon; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 5-253 RA Bao Z.; RT "Dr000220, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-255 RA Kapitonov V.V. and Jurka J.; RT "HATN14_DR, a nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 13-13 (2002). XX DR [2] (Consensus) XX CC HATN14_DR is a family of nonautonomous DNA transposons that CC belongs CC to the hAT superfamily [2]. CC Several thousand copies of HATN14_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~15% divergent from the consensus sequence. CC This element is characterized by 8-bp target site duplications CC and CC 39-bp terminal inverted repeats. XX SQ Sequence 255 BP; 93 A; 29 C; 44 G; 86 T; 3 other; ggtaacactt tattttaagt tacaattcat gctattaaca aaccattaac taagactatt 60 agcttaataa actactaatt agctgtttat taatagttag taagatagwa gttgggttta 120 ggttttgggt aggattaggg atgcagaata agatcatact ttataactac taatraacag 180 ttaatatctt aataataggc aggtaataag ccagtagtta atagcgtgaa ttgtgacyta 240 aactaaagtg ttacc 255 // ID RTEX-2_DR repbase; DNA; ZEB; 3260 BP. XX AC . XX DT 23-FEB-2009 (Rel. 14.02, Created) DT 10-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE RTE-like non-LTR retrotransposon - a consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; RTEX-2_DR. XX NM RTEX-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3260 RA Bao W. and Jurka J.; RT "RTE-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 564-564 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(2..1780,1716..1955,1861..2238,2186..2836) FT /product="RTEX-2_DR_1p" FT /translation="NGRVRGDSLGRYTYSSSLGSSTVDYMITDLDLFSFSA FT FTVKPLTPLSDHSQITLYIKRNKNTNIFSQSSKLXNIKNNYRWVQDSSEKY FT RNAIEDPQIGLLLDKFMENSYPDNGDGVNLAVENINYIFDYLATXSNIXTT FT KKTFKQKQENEKWXDFDCKTIRKNLRQLANQKHRQPENTDLRLQYXEELKK FT YKNTIRKKKEEYTQKQLKTIEESVDSNNFWDNWNLLNKKKREQLTIQNGDT FT WKNHFEKLFSKINMSTEQSLIYNKLTHTEQIFGKYQNPLDYPITEKELLDK FT IQVLPTKKASGPDGILNEXIKNINHKFQLAIMKLFNLVLSVGHFPDIWNKG FT LITPIFKSGDKSDPNNYRGICVSSNLGKLFCSIINTRLIGFLTEHNVLSKS FT QIGFLPNYRTSDHIFTLQTLIDKYVHQNKNKIFACFVDFQKAFDSIWHEGL FT LSKLLESGIGGKTYNIIKTMYSNNQCAIKIGNKRTEFFXQGRGVRQGCPLS FT PTLFNIYINQLAMLLEQADTPGLTLYDSSVKFLLFADDLVLLSPTEEGLQQ FT QLDILHSFCQTWALTVNPKKTKTLIFQKKTQMSGKETQLHPWYDKNYSRKK FT PRCQGKKHSFTLGMTKIEPAISYTYLGLKISANGKLNLAVNELKEKARRAF FT YAIKKINTNRNSNSNLAQNFPVSDKRKQEGPFTPLKKSTQIEIPIRIWLKI FT FQSVIEPIVLYGSEVWGPLLSHEFDKWDKNPVESLHAEFCRSILRVQRNTP FT SNACRAELGQYPLLMRIEKQSVKFWKHIKMSDPNSYHFKALKTPRNGHVTP FT TLIILKPSKHQEMDIEKSLLIQMVLKLQTQTNTTNMTNSRQHQDTDMLIHK FT IKPNQIINARKEEYLIYWNEAAGKQSKLQCYLDLNRDYTTATYLSAVKDSK FT LRTTMTKYRLSAHSLTVETGRYRQNWQPRESRICPHCAQAEVETEEHFLTH FT CTNYQHIRETFYTKLQSIYPQFTELDNKTQLQYLLGEKNECVLLATQYINA FT CHKKREQNQ*" XX SQ Sequence 3260 BP; 1230 A; 618 C; 572 G; 829 T; 11 other; caatggcagg gtgagaggag actctctggg cagatacacc tacagctcaa gtcttggtag 60 ctcaacagtt gactacatga tcacagattt agatctgttc tctttcagtg cattcactgt 120 taaaccccta acacctctat cagatcacag ccaaattaca ctttatataa aaaggaacaa 180 aaatactaat atattttcac aatcyagtaa attgtrtaac attaaaaata attatagatg 240 ggtacaagac agctctgaga aatacaggaa cgcaattgaa gacccccaaa taggtttact 300 tttggataaa tttatggaaa atagttatcc tgataatgga gatggtgtta atctagctgt 360 agaaaacata aattacatat ttgattattt ggcaacgatr tctaatatta amactactaa 420 gaaaacyttt aaacaaaaac aagagaatga aaaatggyat gactttgatt gtaaaacaat 480 aaggaaaaac ttaagacaac tagcaaatca aaaacacaga caaccagaga atacagattt 540 acgtcttcag tactrtgagg agcttaaaaa atataaaaac acaatcagaa aaaagaaaga 600 agagtacacc caaaaacagc tcaaaacaat tgaagaatct gttgactcaa acaatttctg 660 ggacaactgg aacctcctta ataaaaagaa acgtgaacag ctaacaatac aaaatggaga 720 cacctggaaa aaccactttg aaaaactgtt tagtaaaata aacatgagta cagaacaatc 780 gcttatatat aataaattaa cccatacaga gcaaatattt ggcaaatatc aaaatccgtt 840 agactaccca attacagaaa aagagctact agataaaata caagtcctac caacaaaaaa 900 ggcaagtggt ccagacggta tccttaatga aatkattaaa aacataaatc acaaattcca 960 attggctata atgaaactgt ttaatttggt tctgagtgtt ggtcatttcc ctgacatatg 1020 gaataaagga ttaataacgc ccatattcaa gagtggagac aaatcagacc ctaataatta 1080 cagaggcatc tgtgtgagca gtaatctggg gaagctgttc tgcagcatca tcaacaccag 1140 actcataggc ttccttacag agcacaatgt cctcagcaaa agtcagattg ggtttctgcc 1200 aaattacaga acatcagacc atatcttcac ccttcagact ctgattgaca aatacgtcca 1260 tcaaaacaaa aacaaaatat ttgcttgctt tgtagatttt cagaaagcat ttgattcaat 1320 ttggcatgaa ggtctgctgt ctaaacttct agaatcaggt attggcggta aaacatacaa 1380 cattataaaa acaatgtatt cgaacaatca atgcgcaatt aaaataggaa ataagcgaac 1440 agaattcttc astcaggggc ggggtgtgag acagggctgt ccactgtcac caaccctctt 1500 caatatttac atcaatcaat tggcaatgct cctagagcaa gcagatacac cgggtctcac 1560 actatacgac tccagtgtga agttcctgct gtttgcagat gatctggtgc tgctgtcgcc 1620 aacagaagag ggtctacagc agcagctgga tatcctgcac agcttctgtc agacctgggc 1680 cctgaccgtt aacccaaaga aaactaaaac cctaatattc cagaaaaaaa cccagatgtc 1740 agggaaagaa acacagcttc acccttggta tgacaaaaat tgaacctgcc ataagctaca 1800 cataccttgg gttgaaaata agtgctaatg gaaaactaaa tttggctgtg aatgaactga 1860 aagagaaagc aagaagggcc ttttacgcca ttaaaaaaat caacacaaat agaaattcca 1920 attcgaatct ggctcaaaat tttccagtca gtgattgaac ctatagttct atatgggagt 1980 gaagtgtggg gtcctctcct cagtcatgag tttgataagt gggataaaaa tccagttgaa 2040 agcctacatg cagagttctg taggagcatc ctcagggtac agaggaacac acccagcaat 2100 gcatgcaggg cagaattagg ccagtatcct ctactcatgc gcattgagaa acaatctgtt 2160 aaattttgga aacacattaa aatgagtgac cccaactctt atcattttaa agccctcaaa 2220 acaccaagaa atggacattg aaaaaagcct cctgattcag atggtcctga agctgcaaac 2280 acaaaccaac acaacgaata tgactaacag caggcagcat caggacacgg acatgctcat 2340 ccacaaaatt aagcccaatc aaataataaa tgcacgaaaa gaagagtatc tgatttactg 2400 gaatgaagca gcmggaaaac aaagtaaact tcaatgttat ttagacctaa atagagatta 2460 taccacagca acatatctga gtgcagtaaa ggacagtaaa ctaagaacaa caatgaccaa 2520 atacaggctg agtgctcaca gtctgactgt agagacgggc cgatacagac agaactggca 2580 acccagagag agccgcatct gcccacactg tgcccaggca gaggtggaga cggaggaaca 2640 cttcctcacc cactgcacaa actatcagca catcagagaa acattctaca ccaaactaca 2700 gagcatttac ccacaattca ctgagttaga caataaaaca cagcttcaat atttactagg 2760 ggaaaaaaat gagtgtgtcc ttctagcgac acaatacatt aatgcctgtc acaaaaagag 2820 agagcagaac caataaaaac atcagtgatg cgcctgcaaa cacacacaca cacacacaca 2880 cacatagatg attataaatg aatcattgta aatatactga ttattattgt ttttattact 2940 gttttatcct aaaatgttga ttgttaaatg tacagtttat tctgatctat ttttntattg 3000 atttgtatga ataatatgat atgttgaata ataaaaataa tatgatgtat tgttgatatt 3060 ttatttaatt attatatata aatatcatct ttttttacta ctactattta ctgtgtgtac 3120 tgttttttat tatatatgta tgttactttt tatgtaaact ttataactgc tttggcaata 3180 catttgtaac atttgtcatg ccaataaagc aattattgaa ttgaattgag agagagggag 3240 agagagagag agagagagag 3260 // ID ACROBAT1 repbase; DNA; ZEB; 1379 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE ACROBAT1 is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; ACROBAT1; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1379 RA Kapitonov V.V. and Jurka J.; RT "ACROBAT1, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(5), 1-1 (2002). XX DR [1] (Consensus) XX CC Several hundred copies of ACROBAT1 are expected to populate the CC zebrafish genome. The element was active recently, its CC copies are ~5% divergent from the consensus sequence. The genome CC is populated by multiple subfamilies related to ACROBAT1. CC This element is characterized by 19-bp terminal inverted CC repeats. Its classification is uncertain and needs additional CC data. Usually, ACROBAT1 copies are flanked by CCT and TGG. CC It's possible that they have been inserted into the CCTGG CC target site and generated duplications of T. XX SQ Sequence 1379 BP; 475 A; 228 C; 216 G; 459 T; 1 other; gggcccggtt tttcaaaagg tttaatccgg ataaaattga tccggattta gtaatcctgt 60 ttttgcgatc cgtgatcacg taatccagct tactttttga gccagttttt caaagcaaca 120 tcggattgga tcaatctgat ccggatagga acttttcagg atcactaaat ctggattacc 180 agtgctcaaa cagcatagga catcacaatg tagaactata gatatgtaaa gagcaacaag 240 tagaatagcc agtgtggggt caatataact ttatttttat ttgaaatgaa aactaagaaa 300 atttaataaa taataaaata ataataataa taaataataa aaacaagaaa aaccccaccc 360 catcctcttc cagcagtccg ctcatctgag acctacaaca ctgtacattg aggatattta 420 taataagttt caaatataga tgtaaataaa aaaaaaaaga cttaatgtca aaaactccac 480 aataatttca gacttcctca tctgatgtgg agagagcagc ttgtctgagc gtagccttta 540 ggaggcgaat ctcaagtcat tggccttggt ttgctgcagt ttggtcagag tcagttgttt 600 ctctgccaga tgaagctgtg cttctgccct ttcagtctct ttttgcaatt gttgaaagtg 660 atttcaaaaa tcagcgattg ccattctttg caatttttta tttttgttgt tattctgtaa 720 tcattgcatg catcactaat aaatattcta aaaagaaaaa tattttccat tgtgatgaat 780 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 840 atatatatat atatatatat atantagtga cagaataaaa ttgattgttt ttggtcaatt 900 ttgacagctg tttgggggat tattttatga ggccaaactc tttctacttt gctgtaggcc 960 tatactgaaa ggctaaatac gttaaagcct atactcacta tagcctgaca gatgaacaaa 1020 taaaattaaa accttatgaa taaatagtat atctcgtaag atactgcatg atccaaaaat 1080 agtattattt aataaatttt ttaagttttc attacatttt gggcatgaaa tccacgctaa 1140 atccaccctt ctgatgggat cagtttaatc caggtttttt ggatcaaagt gatccagatc 1200 ctacaaaaaa ggtctgaaaa acccaaacta aaggtttgat ccggatcaaa accaagattg 1260 gattacgtga tctaatccga ttacgtaatc cgtttttttc ttttgaaaaa cccattttca 1320 agatttgatc caatccctta tccaaaatcc tagtggatta cttttgaaaa accgggccc 1379 // ID DNA8-7_DR repbase; DNA; ZEB; 584 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE DNA8-7_DR is a non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-7_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-584 RA Kapitonov V.V. and Jurka J.; RT "DNA8-7_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 257-257 (2004). XX DR [1] (Consensus) XX CC DNA8-7_DR is a nonautonomous DNA transposon. Its is CC characterized by 16-bp terminal inverted repeats and 8-bp CC target site duplications. It is expected to be a member of the CC hAT or P superfamilies. XX SQ Sequence 584 BP; 194 A; 88 C; 89 G; 213 T; 0 other; taggcctgtc acaataatca atatatcgac ttatcgcaca acacatggac atgacctcaa 60 tcatttttgg tgatgcaata tatatcgccc atacataaaa accaattcta gcaacatttt 120 agctgattgt gcaacatctc tatctctatt ggaggtgtca gtgtcagtat ggttaaaaaa 180 cactatagta tttactataa attactatag tatattttca tgtgggtgtc tgacaactga 240 aaatagatct ggtagcagat atcgcagcct ctgttacttt ttaccttgca cttttttgtt 300 aacatttatt attatttatt taggatatgc gttttcacat tctgattcca atgcctaatt 360 attaaattgt taaaatgact ataattaaat gaaatattct taagaataaa atattatgtg 420 ttctttggaa gagtgtactt gcattgtgat gatattatat tgtcattctg gcattatata 480 atgagtggca taaaatggtc ttaaaatgac aataatatcg tttatcgcaa tatattttgg 540 tgcaatatat cgtacaacaa aaaatagata tcgtgacagg ccta 584 // ID HARBINGERN8_DR repbase; DNA; ZEB; 241 BP. XX AC . XX DT 08-OCT-2003 (Rel. 8.09, Created) DT 08-OCT-2003 (Rel. 8.09, Last updated, Version 1) XX DE HARBINGERN8_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW 25-bp TIRs; 3-bp TSD; Dr000419; Harbinger superfamily; KW HARBINGERN8_DR; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-241 RA Bao Z.; RT "Dr000419, unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-241 RA Kapitonov V.V. and Jurka J.; RT "HARBINGERN8_DR, an ancient family of Harbinger-like RT nonautonomous DNA transposons from zebrafish."; RL Repbase Reports 3(9), 181-181 (2003). XX DR [2] (Consensus) XX CC HARBINGERN8_DR copies are flanked by 3-bp target site CC duplications CC generated upon their integration in the genome. CC It has 25-bp terminal inverted repeats (3 mismatches, including CC 2-bp CC termini). CC The consensus sequence is 78% identical to the unclassified CC Dr000419 CC repeat [1]. The HARBINGERN8 family is several millions years old, CC its CC copies are ~94% identical to the consensus sequence. CC There are ~100 copies of HARBINGERN8_DR in the genome. CC HARBINGERN8_DR is classified as a putative member of the CC Harbinger CC superfamily based of the 3-bp target site duplications and the CC 5'-GGC terminus that are common for different Harbingers. XX SQ Sequence 241 BP; 81 A; 40 C; 49 G; 71 T; 0 other; ggcccggttt cacagacagg gcttagacta gccaggatta ggccatagtt caattaggac 60 atttaagtaa ttttttataa acatgcttag aaaaaaaaac attactggtg tgtatcttga 120 gacaaaacaa aggcactgat atatttaaaa atcaatcagt gcaattttat ttcagttgaa 180 acagctcaga cttacatttt agtctaggac taggcttaag ccttgtctgt gaaaccgggg 240 g 241 // ID Gypsy83-I_DR repbase; DNA; ZEB; 7028 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy83-I_DR; KW Gypsy83-LTR_DR; Gypsy83_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7028 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1477-1477 (2008). XX DR Genome; chr1; Positions 46852056 46859083. XX CC Positions [3078-3581] - Reverse transcriptase CC Positions [5076-5552] - Integrase core CC 'CGAA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 403..1836 FT /product="Gypsy83-I_DR_1p" FT /translation="MAEKFPSDFDATSPFVPRGRGALLTGVYNDVTLRNES FT FPGTEHVQTVRDNTIGVGRGHSLRNQTVGHFPGHGPFVSTPKADNTDQTLQ FT QITDMMGRLGAQIGDSIMARLAETGVVNSNCALHNSKDPDVDSVRQDVPQV FT TVHVKAEREPVMFKGDGSDKYSVRDWIDLTKSQLIKQKCPISDQADEIMSR FT LMGKARDIVKIGLRSDTSLNAARNPDIIYSILQRYFSDAASCLPLADFYST FT YPEPRENPVDYWIRLNKSADLAEEGLRRQGRFMDDISMEVTRMFVKHCPDP FT ELLCLFKCKPIDEWNARDIQRRLDDYQREKRTPLSNLNPRPRYTVASIHCT FT DPQKTTDYCQSNIQTDLSNVPPAVPSSLSEAQCIEPVIYTKQNVQKPDSEI FT FSRMMSMLEQVLEKVAQNSRVPSPQNVVYRQGSVQCDVCKANTHTTKTHCF FT RDRLCFSCFSPGHSQNVCPRRVSARRNQSGGNL" FT CDS 1788..6620 FT /product="Gypsy83-I_DR_2p" FT /translation="MSTQGVSSAQPEWGKLIDPYMVGGCAGLDDDSHENVA FT CAEAADAVSVDFGELTENVIVQNTNIISSSSSLFYTPVIVEGKLTLGGMLD FT SGSMACTMSEKAEAKLLDAGVLTKENRSNFDAVLIGCGGTRVKPKSAYDIE FT MEVYGCKIIVPTLVVQGQLDELIIGTNVIKHVLRESKRCPAYWNALSHPCL FT NDDQESERFLSMLAGMERWKGEEIPDKIGTVRCTSAVCLEPGREYLVWGKL FT NKSCVISPGSTVITEPTTSRSAPRHVLVARLVTPLWGDGWVPLKLINLSHQ FT QVLLRRNAKIADLFACVALEDLENLVPTCNDDPLFNCTQRVDSRTCLDRRE FT DLVKQLEHAGLGALDINSCEASEDCKNKLRELILQYENIFSRHHLDCGKAE FT GFVHRIHVTDTKPFRLPYRRIPPSQYQVLRKVLDEMEEREIIRKSTSEFAS FT PLVLVWKKNGDLRICTDFRWLNRRTFKDAYPLPHQADCLAALGGNCLFSTL FT DLTSGFYNMPLHDDDRKYSAFTTPVGLYEYNRLPQGLCNSPGSFMRMMTAI FT FGDQNFLSLLCYLDDLLVFAPNEKIALERLEMVFSRLSKHNLKLSPKKCCF FT LRRSVKFLGHIVDETGISTDPCKVDSIARMTYADLMEKDGVTPSQKRVRSF FT LGLINYYQHFIPKYSSIARPLFDLLSGQMKNSKGGWRRKRSLQSCKLRPEH FT WTEDHVRSFEGLKSSLINSVVLAHPKFDQPFILSTDASLDGLGAVLSQVQE FT GDTMARPIAFASKSLSRSQRNYPAHRLEFLALKWAVCDKFSHWLKGNKFTV FT CTDNNPLTHIMTKPKLDCCEQRWVAKLASYDFDIKYIPGSRNLVADALSRE FT PFVHVGHRLLHEAFDCLRNNMTEMSDVSVQNAFRQSTNSDDQFPRPDSVFH FT NGMTFSSDEVSAVFRSHNDWETGAQVRAISNLHCVPQVVDNDYGCLPAYSV FT SELRDKQRSDSILSRVVEYVERCRRPSRRERAHEPFQVVRYLKHWEKFKMV FT DGVLYRVSKDSFLKTRRFQFVVPESLKVEVMKGTHDHAGHQGQFRTLSLVR FT QRFFWLSLDRDVREYVRQCQRCVVSKTAEPSDRAPLVNIQSSRPLQIVCID FT FWSAENSQNKSVDVLVVTDHFTRLAQAFLCRNQSAKEVAKVLWDKFFCIYG FT FPERIHSDQGANFESNLISELLRVSGVRKSRTTPYHPMGNGAVERFNRTLG FT NMIRALPPGEKSDWPRRLQSLTFMYNCTVHETTGFAPFYLMFGRVPRLPID FT LMFQSVLSDSTTTDFSKYVSSLTKDLSEAMKIAQKHASKEQHRHAVLYNRK FT TKGSDISISDRVLLANRAERGKKKISDRWETAIYIVIEKFPDTNTYRIRNT FT TTGQEKVVHRNLLLLVNFLPLTDDSGSLSPSSDPGADVFSSAENSPLHDAD FT LCLDQNETVDGVPSGNCLDESINVSSDADIGTTAVGEFDSVIPIPADSVSI FT RDDLSIVEDTGTICSSVTGVGRTLEWVSQLPEDNSDRADLHSEMSCPASTT FT ASLDVVNTMSDVPLVRPEQPAGSVPLPSQPCNSSGSLRQVQDDFPRVFTRA FT TSRFGRVIRPVNRLLYTMSRQDVSHNSRGYVTSICKSIVQALRE" XX SQ Sequence 7028 BP; 1867 A; 1370 C; 1671 G; 2120 T; 0 other; ctggtgccgt gacctgtggc atcgcgagga tccgctcaac gcagatcgcc gcgtggttgc 60 tgaaatttga ggatcgctgt gtgattgctg atctttgaag actgttgcgt gattgatgaa 120 ctctgaagat cttcgcatgt ttgctgaacc ctgaagatcg tcacatgatt gctgaacttt 180 ggagaatgct gtgttttcat cactatttcc aaaactgact gttgactggt tggtcatttg 240 gttacatata tatatatatt tttcactttt tcattttttt cttttctttc aggccatcta 300 catccttgaa tttagggtaa aattgtactt tggactcaga acactatttt ttttctttat 360 atatatatat aattgtgtta ttattattcc ttacattcaa ttatggccga aaaatttcct 420 tctgattttg atgcaacaag cccatttgtt cccagaggga ggggtgcttt gcttactggg 480 gtatacaatg atgttacttt gcgtaatgag tcattccctg gtactgagca tgtacaaaca 540 gtgcgcgata acactatagg tgttggtcgt gggcacagct taagaaacca aacagtggga 600 cattttccgg ggcatggtcc ttttgtttca actccaaaag ctgataatac tgaccaaact 660 ttgcagcaaa taacagacat gatgggtcga cttggggctc aaataggcga ttcgatcatg 720 gcacggcttg ctgagactgg ggttgttaat tccaattgcg ccttacataa cagcaaagac 780 ccggatgtcg acagtgtcag acaagatgtg cctcaagtca ctgtgcacgt aaaagcagaa 840 agagaaccag taatgtttaa aggtgatggt tcggacaaat attctgtccg tgactggatt 900 gatttgacaa aatcacagtt aatcaaacaa aagtgtccaa tatctgacca agctgatgag 960 ataatgtcaa ggctgatggg caaagctagg gacattgtga aaattggatt gaggagtgat 1020 acttctctaa atgccgcacg caaccctgac attatctatt caattctgca gagatatttc 1080 agtgatgcag cttcttgtct gcccttagct gatttttatt cgacgtaccc agaaccaaga 1140 gaaaatccag tcgattactg gatcaggctt aacaaatcag cggatttggc tgaagaagga 1200 ttgcgtagac agggtagatt tatggatgac attagcatgg aagtgacgcg catgtttgtg 1260 aaacattgcc ccgaccctga attgttgtgt ctttttaaat gcaaaccaat tgatgagtgg 1320 aatgcgaggg acattcaacg aaggttggat gactatcaaa gagaaaaaag aactcctctg 1380 agcaatctta atcctcgtcc tcgatacact gtcgcttcta tacattgcac tgacccacaa 1440 aagacaactg attattgtca gtctaacatt caaacagacc tgtctaatgt tcctccagct 1500 gttccgagta gtttgtctga ggctcagtgt attgaaccag tcatctacac gaaacaaaac 1560 gtgcagaagc ctgatagtga gatcttcagt cggatgatga gtatgcttga acaagtgcta 1620 gaaaaggtgg cacaaaattc tagagtcccg agccctcaaa atgtagttta taggcaaggt 1680 agtgtacaat gtgatgtttg taaggctaac acgcacacca ccaaaactca ttgctttaga 1740 gataggcttt gcttctcttg tttttcacca ggtcactcac agaatgtatg tccacgcagg 1800 gtgtcagctc ggcgcaacca gagtggggga aacttataga cccgtatatg gtggggggat 1860 gtgcgggtct agatgatgac tcccatgaga atgttgcctg tgctgaagct gcagatgcgg 1920 tgagtgtaga ttttggtgag ttgacagaaa atgtaattgt acagaacaca aacataatta 1980 gcagtagttc tagtttgttt tacacaccag tcatcgtgga gggcaagttg actcttggcg 2040 gaatgcttga tagtgggtca atggcctgta cgatgagtga aaaagcagag gccaaactcc 2100 tcgatgctgg tgtgttgaca aaagagaata gatccaattt tgatgcagtc ctgatagggt 2160 gtggtggtac tcgagtgaag cctaaatctg cttatgatat tgagatggaa gtttatggct 2220 gcaagatcat tgtgcctacg ctagttgtcc aaggccaact agacgagcta ataattggga 2280 caaatgtgat taaacatgtg ctccgtgagt ctaaacgttg tcctgcatac tggaatgcct 2340 tgtctcatcc atgtctgaat gatgatcagg aatcagaaag gttcctttcc atgttggctg 2400 ggatggaacg ctggaagggt gaggaaatcc cagacaagat tggtacagtc agatgtacgt 2460 cagcagtgtg tttggaacct gggcgtgaat acttggtttg gggtaagttg aataagtctt 2520 gtgtcatttc acccggtagt actgtgataa ctgagcctac tacttctaga tcggccccta 2580 ggcacgtgtt agtggctagg cttgttacac cgttgtgggg tgatggttgg gtgccgctga 2640 agctgattaa tttgtctcat cagcaagttt tgttgagacg caacgcaaaa attgccgatc 2700 tgtttgcatg cgtggcactt gaggatttgg agaatcttgt acccacctgt aatgatgatc 2760 cactgtttaa ctgtactcaa agggttgact ccagaacttg cctagatcga cgggaggatt 2820 tggttaaaca gttggaacat gctggtctgg gtgcattaga tattaactca tgtgaggctt 2880 ctgaagactg taagaataag cttcgagaac ttattttaca gtacgagaat attttctctc 2940 ggcatcacct tgattgtggt aaggctgagg gatttgtaca cagaattcat gtcactgata 3000 ccaaaccatt tcgcctacct taccgaagaa tacctccgag ccagtatcag gttctccgca 3060 aagtcctaga cgagatggaa gaaagagaaa taattcgtaa atcgactagt gaatttgctt 3120 ctccactagt gctggtttgg aaaaagaacg gggacttgcg catttgcacc gatttccggt 3180 ggctaaatag aagaacgttt aaggatgcgt acccgcttcc ccatcaagct gattgcttgg 3240 cggcacttgg tggcaattgc ctgttcagca ccttagatct tacatctggg ttttacaata 3300 tgcccttgca tgatgatgac agaaaatact ccgcatttac tacaccggtg gggttgtatg 3360 aatataaccg ccttccacag gggttgtgta acagtccggg tagttttatg agaatgatga 3420 ctgcaatctt tggtgatcag aacttcttaa gcttactttg ctatctcgac gatctacttg 3480 tttttgcccc taatgagaag atcgctttgg agcgtctcga gatggtattt agtaggttga 3540 gtaaacacaa cttaaagctt tcaccaaaga aatgttgctt tctcagacga tcagtgaaat 3600 ttcttgggca tatagttgac gaaacaggca tctctactga tccatgtaaa gttgacagta 3660 ttgccagaat gacttatgct gatttgatgg agaaggatgg tgtgactcca agtcagaaac 3720 gcgtccggtc atttcttggc ttaatcaact actaccaaca ttttataccc aagtattcat 3780 ccattgctag acctctcttt gatctactct caggccagat gaagaactct aagggtggat 3840 ggaggcggaa aagatctctg cagagctgta agttacgtcc tgaacattgg acagaggatc 3900 atgttagatc ttttgagggg cttaagtcat ctttgatcaa cagcgtagtt ttggctcacc 3960 caaaattcga tcagcccttt attctatcta ctgatgcctc attggatggc ctcggagctg 4020 ttttgtctca agtgcaagag ggagatacaa tggctaggcc aattgcattc gccagtaaat 4080 cgctttctcg gtcacaaaga aattatcctg ctcacaggtt ggaatttttg gcgttaaaat 4140 gggcagtgtg cgacaagttt agccactggc taaaaggtaa caagttcacc gtctgcactg 4200 ataataatcc gctgacccat attatgacta agccaaaatt ggattgttgc gagcagcgat 4260 gggtagcaaa gttagccagc tatgatttcg atattaagta catccctggt tctcggaatc 4320 tagttgccga tgctttgagc cgcgaaccgt ttgtgcacgt gggacacagg ttgttgcatg 4380 aagcttttga ttgtctcagg aacaatatga ctgagatgtc tgatgtttct gttcaaaatg 4440 cgtttagaca gtcaacaaat agtgatgatc aatttcctcg gccagattct gttttccaca 4500 atggcatgac tttttcaagt gatgaggttt cagcagtgtt tcggtcccat aatgactggg 4560 agactggagc acaagtacgt gcaatatcaa atttacactg tgtgcctcag gtggttgata 4620 atgactatgg ttgtcttcct gcgtactcag tgagtgagct tcgtgataaa caacgcagcg 4680 atagcatttt atctcgcgtt gtggagtacg tagagcgatg ccgtaggcct tcgagaagag 4740 agagagcgca tgaacctttt caagttgtaa ggtatttgaa acattgggaa aaatttaaga 4800 tggtcgatgg agtgctttac agggtgtcga aggatagctt tttgaagaca aggcgtttcc 4860 agtttgtagt ccctgaatca cttaaagtcg aggtaatgaa aggaacacat gatcatgcag 4920 gccatcaagg tcagttcaga actctcagtc tagtgaggca acgttttttc tggttatctc 4980 tcgacagaga tgtgcgtgaa tatgtacgtc agtgccagcg ttgtgtcgtc agcaagaccg 5040 ccgaacccag tgatagagct cctctagtta acatacagtc aagcagacct ctacagattg 5100 tctgcattga cttttggtca gcagaaaact cacaaaacaa atctgttgat gttcttgttg 5160 tgaccgacca ttttactaga ctggctcagg ccttcttgtg tagaaatcag tcagctaagg 5220 aggtagccaa ggttttatgg gacaaattct tctgcatcta tggctttcct gagagaatac 5280 actccgatca aggagctaac tttgagagca atcttatcag tgaacttctc agagtgtcag 5340 gtgttaggaa atcccgtaca accccatacc acccaatggg gaatggtgct gtcgaacgat 5400 ttaacaggac gcttggtaat atgattcgtg cattgcctcc tggagagaag tctgattggc 5460 cgaggcggtt acagtctctc actttcatgt acaactgtac ggtgcatgag actaccgggt 5520 ttgcaccttt ctacctcatg tttgggaggg tgccccgtct gcctatagat ctaatgtttc 5580 aaagtgtctt gagtgactca acaactactg atttttccaa gtatgtgtct tcacttacga 5640 aggatctttc tgaggctatg aaaatagctc agaaacatgc ttctaaggag caacatcgtc 5700 atgctgtgct gtataatagg aagacaaaag ggtctgatat tagcatttct gatcgtgtac 5760 tgttggcgaa tcgtgctgag aggggaaaga aaaaaatttc agatcgttgg gagactgcaa 5820 tctacatcgt gattgaaaaa tttcctgata caaacacata cagaattcgt aatactacca 5880 ccggccagga gaaagtagtt caccgtaatt tgttattact agtcaacttt cttccactga 5940 cagatgattc tggatccttg tccccttcaa gtgatccagg agcagatgtt ttctcgtcag 6000 ctgagaattc ccctttacac gatgctgact tgtgcttaga tcagaatgag actgtagatg 6060 gtgtccctag tggtaactgt ttggatgagt ccatcaatgt gagttctgat gctgatatag 6120 gcactactgc tgttggtgag tttgatagtg ttatccctat tccagctgac tcagtgtcta 6180 tcagagatga tctgtccatt gttgaggaca ctggaacgat ctgttctagt gtaacaggtg 6240 ttggtagaac tctggaatgg gtctctcagt tacctgagga taactctgat cgggctgacc 6300 tacacagtga aatgtcttgt cctgctagca caacagccag tcttgatgtg gtgaatacaa 6360 tgagtgatgt tcctcttgtt cgtcctgagc aacctgctgg ttccgttcca ctacccagtc 6420 agccatgcaa ttcaagtggt agtctaagac aagtccagga tgactttccc cgtgtattta 6480 cccgagcgac atcacgtttt ggacgtgtca tcagaccagt aaatagacta ctctacacta 6540 tgtcaagaca agatgtaagc cataattcta ggggttatgt aacatccatt tgtaagtcaa 6600 ttgttcaagc tcttcgagag tagttgtttt tttgagacag ttaatggatg ttttgcgatt 6660 gagatttggg aactaaagaa gctctgcttc tatgtgattt ttttttcttc ctctctcttt 6720 actcttaagg ctaataatag tggtattatt tcatttattt attttctcaa atatttttat 6780 tggtgtacac actttactgt aactttgatg tattttgtcg cagagttgtg gggtgtatat 6840 tgggcatcct gcagacagtt gataagttga gactatggcg ccagtctctt ctttattcat 6900 ccttacggga tactttgata gttgttagtt tctctgtgtg ttggtcttgt attttgacag 6960 attattgtgt gtatcttatc gtttttttcc gcatgttcgc gattttggtg aaattcagct 7020 ggggtgga 7028 // ID Gypsy69-I_DR repbase; DNA; ZEB; 4245 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy69-I_DR; KW Gypsy69-LTR_DR; Gypsy69_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4245 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1453-1453 (2008). XX DR Genome; chr14; Positions 6154431 6150187. XX CC Positions [1637-2092] - Reverse transcriptase CC Positions [3110-3589] - Integrase core CC 'TATGG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 17..4246 FT /product="Gypsy69-I_DR_1p" FT /translation="MDPAEVSQMQAAFAYQSDVLKGYQEQLQQLSNINDHL FT TQYIRSLPAPQSSPVRLALPDKFDGSPEQCRGFVRQCKIFFTSQPDTYALD FT TQKCAVLMSLLTGRALDWASAVWESDAAIQTSFNHFISQLQEVFAYPEGGQ FT DISEQLLNLKQGSRTAADYAIEFRTLAAQSGWNDVALKAAFKQGLNTKIQT FT ELACKAVDLTYSEFITLAIKIDNLLRSNSRSHKQRTTQLEQAQTGLDTPEP FT MQIAATRLSTEERDRRQRENLCFYCGEEDHRTASCPHKAKKFSRTKYSVST FT IEFPTFSKHSFLLNCELSYDTTSQCVSALVDSGSAVNIISQELTDKLKIPT FT SPCVPVINITTIDNGTIGSGIKAITQPVSLSIGLFHEETITFYVIPSCKYE FT VILGHPWLTIHDPTISWNQGELTHWSTHCQQRCFSKILSLPCLSTSIESPE FT VHSQVTLPQPYREFAEVFNKSKAAQLPPHRSWDCAIELLPNMSPPKSKIYP FT LSRPETQAMETYIEEALSSGYIRPSTSPAAAGFFFIEKKDGGLRPCIDCRG FT LNNITVKYRYPLPLVPPALEQLREARIYTKLDLRSAYNLIRIREGDEWKTA FT FLTTRGHYEYLVMPYGLANSPAVFQSFINEVFRDLLNKCVIAYIDDILIYS FT PNLEQHIKDVRTVLTRLQENQLYAKLEKCEFHMSKTSFLGYIISHHGVEMN FT DTKVQAVTGWLLPKTVKELQRFLGFANFYRRFIRNYSLISAPLTSLLKGKP FT SKLKWNPETVKSFEKLKTSFTTAPILKHPNPELPFVVEVDASDYGIGAVLS FT QRHGNPGKLHPCAYFSRKLTAAERNYDVGNKELLSMKAALEEWRHWLEGAV FT HPFQIITDHKNLEYIKSARRLNPRQARWSLFFTRFNFTVTYRPGTKNHKAD FT ALSRRYDQGQLDQTPVSILPPSVVIAQISWDIMEEIQRGQQDDPPPPECPP FT NRQYVPQTLRLRIMQWVHNSLSSGHPGISRTLNLVRNAFWWPKMNQDITTF FT VKSCAVCAQSKTPRELPSGLLQPLPIPHRPWSHLSIDFVTDLPNSNNYTTI FT LVIIDRFSKACRLIPLKGLPTAMETALELFQHVFRGYGIPEDIVSDRGPQF FT TSKVWKAFCKQLDINVSLTSGYHPESNGQVERLNQEIGRYLRTYCSREQDK FT WSNFLPWAEYAQNSLTHSSTGLTPFQCILGYQPPMFPWSGEPSMVPSVDDW FT VQRSEEVWNSAHVRLQRAIRTQRINADQRRRPNPNYQPGQRVWLSTRDLRL FT RLPSRKLSPRYVGPFKILKRINNVTYRLELPADYRISPSFHVSLLKPVHPT FT SDSGTSNPEPPPPLDIEGAPAYAVNEILDSRRRRGQLQYLVDWEGYGPEER FT SWVPSKDILDPSLTQNFHQRYPNRPAPRPRGRPRRCTPGGVPRGGES" XX SQ Sequence 4245 BP; 1266 A; 1121 C; 823 G; 1035 T; 0 other; gaatacttcg cctaacatgg atccagcaga agtttctcag atgcaagctg catttgctta 60 tcaaagtgat gtactaaagg gatatcagga acaactgcag caattatcga acatcaacga 120 tcatctcact caatacatcc gttcacttcc tgctcctcaa agctctccgg taagactggc 180 tttacctgac aaatttgatg gctcgcctga gcaatgtcgt gggtttgttc gccaatgtaa 240 gatattcttt actagccaac cagacactta tgcactagac acccagaagt gtgcagtgtt 300 aatgtcgttg ctaacgggac gggccctaga ttgggcttca gcggtttggg aaagcgatgc 360 tgcaatacaa acatccttta accatttcat atcccaactg caggaagtgt ttgcttatcc 420 tgaagggggg caggatattt ctgaacaact cctaaattta aagcaaggat cacgcacggc 480 agctgattat gctattgaat ttagaacact ggctgcccag agtggatgga atgatgttgc 540 actcaaagca gcattcaaac aaggattaaa tactaaaatt caaactgaac tagcctgcaa 600 agcagtagac ttaacataca gtgaattcat aaccctggcc atcaagatag ataacttatt 660 acgtagcaac tcacgcagtc ataaacaacg aactacccag ctggagcaag cccagacagg 720 cctggacact ccagaaccca tgcaaattgc tgccactcgt ctctctacgg aggaaagaga 780 tcgacgtcag cgagaaaacc tgtgcttcta ctgtggagag gaggatcatc gcactgcttc 840 atgccctcac aaagccaaga aattttcgag aaccaaatac agtgtgagta ccattgaatt 900 tcccactttt tctaagcaca gttttcttct gaattgtgag ttaagttatg atacaacctc 960 tcaatgtgtt tcagcattag tggattctgg gtcagcagtt aatatcataa gtcaggaact 1020 cacagacaaa ttaaagatcc caacatcccc ctgtgttcca gtcatcaata taaccacaat 1080 tgacaatggt accatcggct cagggattaa agccatcaca caacctgtat ccctgagcat 1140 tggacttttc catgaggaaa ccatcacatt ctatgtgatc ccctcttgta aatatgaagt 1200 tattttgggt cacccatggc ttaccatcca tgacccgacc atttcctgga atcaaggtga 1260 actcacacac tggtccactc attgtcaaca acgatgcttt tccaaaatat tgtctctacc 1320 atgtttgtcc accagcattg aaagcccaga ggtccattca caagtgacgt taccacagcc 1380 ctatagagaa ttcgcagagg tattcaacaa atctaaagca gcccaactac cacctcatcg 1440 atcttgggat tgcgctattg aactgctacc caatatgtcc ccaccaaaaa gcaaaattta 1500 tccactgtct cgtcctgaaa ctcaagccat ggaaacatac atcgaggagg ccctatcttc 1560 tggttacatt cgaccctcta cttcccccgc agcagcagga ttttttttca tagagaaaaa 1620 ggatggaggt ctccgtccat gtatcgactg ccgaggattg aataacatca ctgtcaaata 1680 tcgttaccct cttcccttgg tgcctcctgc ccttgaacaa ctacgtgaag cccgtatcta 1740 tactaaactt gatctcagaa gtgcttataa ccttatccgt attcgggagg gagacgaatg 1800 gaaaaccgca ttcctgacca ctagggggca ctatgaatac ctcgttatgc cttatggcct 1860 tgctaactcc ccagcagtct tccagtcatt cataaatgaa gttttcagag atttgctgaa 1920 caaatgtgtc attgcttata tagatgacat tttgatttat tcccctaacc tagaacaaca 1980 catcaaagac gttaggactg ttttgacacg attacaagaa aatcaactct atgctaagct 2040 ggagaagtgt gaattccaca tgtccaagac ctccttccta ggatacatta tcagccacca 2100 cggagtggaa atgaatgaca ccaaggttca ggcagtcact ggttggcttc tccccaaaac 2160 agtgaaagaa ctacaaagat ttctagggtt tgccaatttt tatcgccgat tcataagaaa 2220 ctacagccta atttcagccc ccctcacctc actgctaaag ggtaagccat ctaagttgaa 2280 atggaatcca gaaacagtca aatcgtttga gaaacttaag accagcttca ccaccgcacc 2340 cattctgaaa caccccaacc cagagctacc ctttgttgtt gaagtggatg catccgacta 2400 tggcattggg gcagtcttat cacaacgcca tggtaaccca ggtaaactcc acccctgtgc 2460 ttatttttct cgcaaattga cggcagctga aagaaactat gacgtaggta acaaggaact 2520 actttccatg aaggcagcac tggaagaatg gcgccactgg ctagaaggag ctgtccatcc 2580 cttccagatt atcacagatc acaaaaacct tgagtatatc aaaagtgccc gacgactaaa 2640 ccctagacag gctcgctggt ctttattttt cacccgtttc aattttacag taacctatcg 2700 cccaggcacc aagaaccata aggcagatgc cctttctcgc agatatgacc aagggcaatt 2760 ggatcaaacc ccagtgtcaa ttcttccacc atctgtagtc atagcccaaa tcagctggga 2820 tatcatggaa gaaatacaaa gaggccaaca agatgatcca ccacctcctg aatgcccccc 2880 aaatcgccaa tatgtcccac aaacactcag actacgaatt atgcagtggg tacacaactc 2940 tctaagttca ggacacccag gcatctctcg tacattgaac ttggtacgca atgcattctg 3000 gtggcctaag atgaaccaag atattactac gtttgtcaag tcctgtgcag tctgcgccca 3060 gtcaaaaaca ccacgggagc tgccatccgg attacttcaa ccactcccca tccctcatcg 3120 tccatggtca cacttatcga tagatttcgt caccgatcta cccaactcaa ataactacac 3180 aacgatactc gttatcatag accgcttttc taaagcctgc agactgatcc cacttaaggg 3240 actacccaca gccatggaaa ctgccctaga actattccaa cacgtattca gagggtatgg 3300 catcccagaa gacatagtat ctgacagagg cccccagttc acgtccaaag tatggaaagc 3360 attttgcaaa caactagaca tcaacgtcag cctcacttca ggttatcatc ctgaatctaa 3420 tggacaagtg gaaagattaa accaggaaat tggacgctat ctcagaacct actgcagcag 3480 agaacaagac aaatggagca actttctgcc atgggcggaa tatgctcaaa actctctcac 3540 ccattcatcc acgggactaa ctccgtttca atgcatccta ggctaccaac ctcccatgtt 3600 cccctggtct ggtgaacctt ctatggtacc atcagtggat gactgggtcc aacgcagtga 3660 agaagtatgg aacagcgctc atgtgagact tcaaagagcc attcgaaccc aaaggattaa 3720 tgcagaccaa agacggcgtc caaatcctaa ctatcaacct ggccagagag tctggttatc 3780 aacccgagac ctcagactac gactacctag caggaagcta tcaccaaggt atgtaggacc 3840 cttcaaaatt ctgaaacgaa ttaacaatgt cacataccgt ttagagcttc ctgctgatta 3900 ccgcatctcc ccttcttttc atgtttccct cctaaaaccg gtccacccaa cgtctgactc 3960 cggcacatca aaccctgaac cccctccacc tttggacatt gagggagcac ctgcatatgc 4020 agttaacgag attttggact ctcgccgcag aaggggtcag ttacaatact tagtggactg 4080 ggaaggctat ggaccagagg agaggtcttg ggttccgtcg aaggatatac tggacccatc 4140 tcttacccag aactttcacc aacggtaccc caatcgccca gcaccacgac ccagaggtag 4200 acctagaaga tgtacacccg gaggtgttcc tagaggaggg gaatc 4245 // ID DNA-X-13_DR repbase; DNA; ZEB; 802 BP. XX AC . XX DT 13-MAR-2009 (Rel. 14.03, Created) DT 13-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-802 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(3), 689-689 (2009). XX DR [1] (Consensus) XX CC This element is very old ( each copy with ~83% identity to the CC consensus), and the TSD is not detectable. XX SQ Sequence 802 BP; 260 A; 153 C; 154 G; 229 T; 6 other; caattgatag ttttcacgtg tcgtcacacc attagggggc aaaacattac tatctcctgc 60 tgaccgccaa gtcacaggct gagtaatttc tcttttttta gccaaaagcc ggataattgt 120 tgtgcaataa gatatagcat tactaaccga cgaggagaca crcctgagct gtgcttctac 180 agaatccctc akagaaagaa cagccagaaa ggagaaattt atggatttgt gccgaatggt 240 agcatcattg acccataaca atccaactgt cagtacattt gtatgctttt tatacagttt 300 ctattctaat ttgcaggtta agtgcaatta aagtataaat tgaaatgcaa agtcaaagta 360 taaatagcag aagtgaacra atagattttc cctaccagca aatattaatc taagcaaaag 420 aaactgacac caaatataaa agcagacact tacgtgctat aacgccatca ctaatgacta 480 aatcgctgag gaatatccaa aagatagatk taaatataac cgaattggag ctctgacagg 540 gacatgatgg ttttaaattg actaaaaaat ggctgtgggt gcagtatatg gatcgcaagt 600 gcccacaatt tacagtttgt gatcatatct ctgttttgtt ctgttgatat gtgatccaaa 660 tattcrtagc ttgaaacaga tggaaatatg accggtttgt gctgcatttc tatggagatc 720 ccagtgtagc tgccattacg gcccattgtt tttttgccca ccaggtctcy gcacaggtca 780 cgtgactgaa aactatcaat tg 802 // ID TDR13B repbase; DNA; ZEB; 763 BP. XX AC . XX DT 07-JAN-2009 (Rel. 14.02, Created) DT 07-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR13; TDR13B. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-763 RA Jurka J.; RT "TDR13B: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 9(2), 565-565 (2009). XX DR [1] (Consensus) XX CC 8 bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 763 BP; 185 A; 177 C; 238 G; 161 T; 2 other; caacccaagc tcattctgaa aacgtagtcc cgcggacgtt tctggagaca gcgaaatacg 60 tcccgggagg tacgtatggc tgcatttatt tttttcaagc gaacgctacg gggcggtgtg 120 acgctgttcc ctttcgcgct tgccggctga ccgcttacct ccgtgtggag ggctttcccg 180 ctgcaaccag tttgtccagt cagctcatcg tgtacgtcgg cagacttgag atgcagagag 240 gaggagttga ccacggcgat cgaccggttt cgagtccggg gaagagcggt tccagaaatc 300 aggtaagaca aaaacagaat cccaaaaatt aagtgaacga gtttcataac agggtgagaa 360 tgtggtgaaa tccgaaaacg cggtagaaaa aaaaaatcag ggcttttctt tttttggacg 420 gcttttgtaa actgttgctt gggtttaggg aaagcgagcg ggcgggcggg tcaatcggta 480 aaattggttg ggtttaggga aggaggaggg ygggtcagcc gattggccgg tcgcgcagtc 540 aatcatccgg tcagtcggac agcggcctct ggtgggttcg cgcgagaaca gcgcgggcgc 600 gaacggcact cgcgagaggc gtttgagatg cgaaaaagcg cacaacagcg gcctctcgcg 660 gattcgcaaa aacaaaaact gcagcaatac gtacctcccg ggacgtatty cgcggtctcc 720 agaaacgtcc gcgggactac gtttccagaa tgagcctggg ttg 763 // ID HARBINGER3_DR repbase; DNA; ZEB; 3599 BP. XX AC . XX DT 06-NOV-2003 (Rel. 8.1, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE Autonomous Harbinger-like DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; KW DNA-binding protein; HARBINGER3N_DR; HARBINGER3_DR; KW Harbinger superfamily; transposase. XX NM HARBINGER3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3599 RA Kapitonov V.V. and Jurka J.; RT "HARBINGER3_DR, an autonomous Harbinger-like DNA transposon from RT zebrafish."; RL Repbase Reports 3(10), 184-184 (2003). XX DR [1] (Consensus) XX CC HARBINGER3_DR is an autonomous Harbinger-like DNA transposon. CC The consensus sequence was built based on several copies that are CC 90% identical to each other. This transposon is characterized CC by the 3-bp target site duplications and 12-bp terminal CC inverted repeats. CC Protein machinery encoded by HARBINGER3_DR was involved in CC transpositions of HARBINGER3N_DR nonautonomous elements. CC HARBINGER3_DR encodes two proteins, 343-aa HARBINGER3_DR-1p (2 CC exons, CC positions 368-1037 and 1119-1480) and 221-aa HARBINGER3_DR-2p CC (2 exons, positions 3345-2940, 1762-1503). CC HARBINGER3_DR-1p is a Harbinger DNA transposase. CC HARBINGER3_DR-2p is a DNA-binding protein that contains the CC myb/trihelix motif. XX FH Key Location/Qualifiers FT CDS join(3345..2940,1762..1503) FT /product="HARBINGER3_DR-2p" FT /note="contains the myb/trihelix motif" FT /translation="MAKANKKRNFTECELEVLLSEVDRRKTVLFASLSSGI FT NNKRKKIEWESLADAVNAVGSERRTVSELKKKWSDVKVQVKRRTAAHRQSV FT DRTGGGTGDTALTPFEERVASIVGDTLLSGVVSVSVGDTDVLEEAHEDGAG FT TSTDTDFVPPEEPEPSVSGATPRVSSASASAEARPSGRVLTQAVLESQQDI FT VRAIGDINNHLKNISNALTDISQSLKELVKK*" FT CDS join(368..1037,1119..1480) FT /product="HARBINGER3_DR-1p" FT /note="Transposase" FT /translation="MAVLALLEDIVNGRIRRERVFRDHGDFLAHDDDWLIS FT RFRFPRAILLDLCAELGPLLERETARSHALPVPLQVLTTLGFLATGSFQRE FT LADRSGLSQSSLSRAMPAVWDGIIRMSSRYIRFPYHAVDQPNIKAQFAAIA FT GFPNVIGAIDCTHIAIKAPSEDEFAYVNRKHFHSINVQIICDAQMRLTNIV FT ARWPGSTHDSFILTNSMVGMRLQGGRVRDGWLLGDRGYPLKTWLLTPLNNP FT QTDQERRYNDAHSHTRSVVERAIGQLKCRWRCLDKSGGVLLYRPNKVCRIV FT LACGVLHNVAHRHGIPNGEPVAPPDDPDPGPVCIQPNQQAIQARQRVVAAI FT *" XX SQ Sequence 3599 BP; 1011 A; 749 C; 760 G; 1079 T; 0 other; gggcctcatg tatcaacgct gcgtacgcac aaaaactttg cgtacgccag gtttcacgct 60 cagaatcgcc cacgtttgga tttactaacg atgaaatgaa cgtgggaatg tgcgcagctc 120 cacgccagct tcttggctgg cgtacgcaca ttttttgtgc gtgtctgttt tatttccatt 180 ggcgactcct agaggcaatt atgtcaaatt gcacactaca aagtatacca ctaacacatg 240 tgaaaaacat tgctattaat gtataattga taaaatgggt taattgccac aatatttttg 300 taccaattat gtgatttaga acatataaaa gcatttgcaa atgcatacaa ccctttagtc 360 tatggcaatg gcagtgttgg ccttattaga ggacattgtc aatgggcgaa tccgaaggga 420 acgcgttttt agagatcacg gtgattttct ggcccacgac gatgactggc ttattagccg 480 tttcagattt ccaagagcta ttctcttgga tctctgtgct gagttgggtc cactgttgga 540 aagagagaca gcgaggagcc atgcattacc cgttccctta caggtgctga caacgcttgg 600 tttcctggca actggttctt tccaaaggga actggcagac cgctcggggt taagccagtc 660 gtctttgagc cgtgcaatgc cagctgtatg ggacgggatc atccgcatgt ctagcaggta 720 tataaggttt ccataccatg cagttgacca gccaaacatt aaagcgcaat ttgcagcgat 780 cgccggtttt cctaatgtaa tcggagcgat cgactgcacg cacattgcta taaaggcgcc 840 atctgaagac gaatttgcat acgtgaatcg gaaacatttc cattcaataa atgtgcaaat 900 aatatgtgat gctcaaatgc gcttaacaaa tattgtggca aggtggcctg ggtcaaccca 960 tgattcattt atccttacaa acagcatggt tgggatgagg ctccaaggtg gcagggtgcg 1020 tgatgggtgg cttcttggtg agtgatgtat ttaaagatat tattccagct aagtttattt 1080 tattttattt tgagcgtaat tatttaactg catatcagga gaccgtggtt atccattaaa 1140 gacgtggctg ttaacccccc tcaacaaccc acaaactgac caagagcgca ggtacaatga 1200 tgcccattct cacactcggt cagttgtaga gcgggcgatt gggcagctga aatgccggtg 1260 gcgctgcctt gataagagcg gaggggtgct gctataccgc cctaacaaag tgtgccgcat 1320 cgtgctggcc tgtggtgtgc tgcacaatgt tgcgcacaga catggcatac ctaatggtga 1380 gccagtggca ccgccagatg acccagaccc aggaccagtg tgtatacaac ccaaccaaca 1440 agccattcaa gcccgccaac gtgtggttgc ggcaatataa aaaaggtaat cagagaccaa 1500 gtttattttt tgaccaattc ttttaatgac tggcttatgt ctgttaaagc attacttata 1560 ttttttaggt gattattaat atcaccaatg gccctaacaa tatcctgttg tgattccaga 1620 acagcctgcg tcaggacgcg gcctgatgga cgggcctcgg cactggcact agcactggag 1680 acacgggggg tagcgccgga gacactgggt tctggttcct caggagggac gaagtccgtg 1740 tcagtagacg ttcctgcacc atctaagtgt aaataaaaac aagcacgtta actgatattt 1800 gagtccatta tttgggtaca cgcatattta taaatgaaat agatagggga gcgcggggcg 1860 caaagtaacg cagggttaaa tgtaacaaag agttttaagg tttttgctca gggttaacca 1920 tggcatgctt ccgaggtatt caccatagtg tctgcagatg tctacctggc aattattgaa 1980 aaagtttgcc acaatttgga taagaaacac caatttttgc cgcataaaag tattttttat 2040 tatagtctat ctatctatct atctatctat ctatctatct atctatctat ctatctatct 2100 atttttttta tctgtgaaag tatgcacgta aaatcttaca ttgttgtaat taccgtcttc 2160 taggctaaaa gattaaatta ttttgaaaga tgtaagaaac acacagtgac tgctagccag 2220 ttttgaggaa tacatgacac agcggggtta gttgtaacag gatgttacaa ttaagccaca 2280 cactgttgga caccaactaa actaacaaaa tattttttat tttaattttg agtgtaatta 2340 cactttttat aataaaatgt tttttattag aaagtatttt taaatagaaa aacatttatt 2400 gctaataatg caaataaatg caatgtataa taatggataa atggataata aggcaaatag 2460 attcaaatgt gtttatccaa ataaataaat agactgtgct atgtagaata aaaataaatg 2520 agccaattac tgaggaaata tggctacttt tttttaaagt aaactgaatt taaattaatc 2580 gtatgaaatc tgccttttta agcatgtgtt acaactaacc ccccatctgt tgcaatctgg 2640 cccacaggtg taaatggtaa cctttttact cctggcactt ttgccaatac tgcacagaaa 2700 cgatagttcc ggcgatcata atttcagtgc tcatttgtag gagaggcttg tgtgttgttg 2760 gtaaaaaaaa tggtttgtca aaccccatta ctttgttcat tatttgacaa aaaccaaaaa 2820 gtgttacttt gtgccccact ctcccctaca taaattaatt gtactgctac atgtattgga 2880 aacttaaatg caaaccggtt atttaacaaa gttaagacat aaaatgagaa gtaacttacc 2940 ttcgtgggct tcctctaata catccgtgtc tcccacagac acggatacta ctccagacag 3000 taaagtgtca cccacaattg aggcaactct ctcctcaaag ggtgttagtg cagtatcccc 3060 tgttcccccg cctgttcggt ccacactttg acggtgcgcc gctgttctcc tcttaacctg 3120 cacctttaca tcggaccatt tcttttttaa ttcactcaca gtgcgacgtt cagaccccac 3180 tgcgttaact gcgtcagcta aactctccca ctctattttt ttccttttgt tattaattcc 3240 ggaggacaaa cttgcaaata gcacagtttt tctccggtct acctccgata ggagcacctc 3300 caattcacat tctgtaaagt ttctcttttt gtttgcttta gccattgctt tttcgtttgg 3360 ttttgccaaa gtgaagtcat taccatattt ataaggggga ggaggcaggg aggggttttg 3420 cgctcgtgca cgtgcgctca atttcacgtt aattcggatg tacaaagaga atatgcgtgg 3480 gattcggcgt acgcagtgtt tcatacatct gaatttttta ctgcgtacgc acatttacag 3540 ctttgtgcgt acgcaatgtt ttagtatgaa ttccacgcaa gtcttcgtac atgaggccc 3599 // ID Gypsy160-LTR_DR repbase; DNA; ZEB; 222 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy160-I_DR; KW Gypsy160-LTR_DR; Gypsy160_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-222 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1582-1582 (2008). XX DR Genome; chr4; Positions 35828624 35828845. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 222 BP; 59 A; 39 C; 57 G; 67 T; 0 other; tgtggtgtac tgttatatgc ccttataatg taattgcgta ttcactccac tagagggcca 60 ccttggttta acttcggggg tgctgtggag tcagtgagtg agacggaagt gcatgtgata 120 aaggcgggag ttactagtgt tctgttaata aatgtgtttc tgaattatac ctctccgtgt 180 aagctgtcta tataacacac agcacaagga ggaaacacta ca 222 // ID Gypsy96-LTR_DR repbase; DNA; ZEB; 816 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy96-I_DR; KW Gypsy96-LTR_DR; Gypsy96_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-816 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1504-1504 (2008). XX DR Genome; chr19; Positions 20989536 20988721. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 816 BP; 125 A; 247 C; 150 G; 294 T; 0 other; tgtcacatcc tcagtctgat catgctgtct gttccccatt tctgtttact tctggtgagc 60 gcacgtgaat gtttattttg atcgctcccc gtatctctcc tccctctcgt taccctattt 120 accatctatt aagattgatt ttcactcacc tgcaccctat cagtgattcc tctcgtgtcc 180 tatttattcc cctccttgtg tgtagtccag tgctagattg ttgttttgcg agcgcgccca 240 cgccgagccg ctcgcgtctg atctcgtctt gtctcctgtc gtggctctcc tgtcctttcc 300 aactctgtct tgttgtgctg ctccacgcca gaaccatctt cttgcacttt gcctctcctt 360 aacctcctct gttcccaggc cggactgctt caacctctcg gagcccatcg gcttcaactg 420 actgccggct tcctctccct cgctcacgct ccacttccac attcccctca cggcaacgta 480 ccagcacgcc ccctgccggg taacagtatc atagacgcta ttgtttgcat ttatttttgg 540 atcaccccgg ctgtcttttg ctttctcttt tttgaagatt gtgaactggg cgcccccttt 600 tggatttatt tggtgacagg acccggtctc tgttgtcaag gacttaacag cgccctcttc 660 tgggagtctt tttttatgca gtttttttgt ttttgttttt ctcctatttt ttctgcaaac 720 gcccagttgg atttgatttg caccctttta gggtaatttt ttgatcaaga ataaaaactc 780 ttaacctgca cttgtgtccg cctgtgttcc ctgaca 816 // ID Gypsy75-I_DR repbase; DNA; ZEB; 6487 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy75-I_DR; KW Gypsy75-LTR_DR; Gypsy75_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6487 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1465-1465 (2008). XX DR Genome; chr1; Positions 26255568 26262054. XX CC Positions [2851-3390] - Reverse transcriptase CC Positions [4837-5313] - Integrase core CC 'TTAA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 278..1654 FT /product="Gypsy75-I_DR_2p" FT /translation="MAFSLNADDFDETMTGLGLGRGRFFGETPVTPIGRES FT KINGKPQVLCSTRIADQVASPTADTVPLAVPDLNNPDWQGFIAQIVQQVGQ FT TMLTSQKEASSKGEENADTQSQSLGATKTTFTDIPSLNLTGVRLIMQSDAK FT EPPVYRGDSTDKLSVREWEELMETYLRKRGTPLAEQHQEILSRLMGKAKDV FT VKVTLRCNSALKPIEDPKVIMNILKQHFGDVNYSCMPLADFYGTVPLAGEN FT PIEYWVRLNKAVDLTEEALKRLGRQMQNPCQEAAMMFVKYCPDPTLSAVFR FT FKAPEKWTAQEIQEQLDRYQTDQREQMSTKPKRHSVTRHVTACIQTPHKDA FT VSVVPSEEPGECDVSITRAAQSDDNCMRTLINLLDRALSQNAQTPSQPSQS FT KLCKVCRSRDHSTVTHCRREHLCFTCYQPGHGKRECTNERFRPSQAPPRGS FT AGLNWMAHVVEGDA" FT CDS 1564..6231 FT /product="Gypsy75-I_DR_1p" FT /translation="MHERTFQAKSSATQGLGRFKLDGSRCGGGCVSLSREP FT LENVDLLQFYETACAQAPSGTKVIAQNVQTLEVFDNLFYAPVTINNKHTVK FT GMLDTGSMACTLSDIAEKRMLSENVLSTATALQQDVILVGCGGKVTKPKCM FT YEVELKVYGESCQVPILVVPGQRDDLIIGTNVIKFLAHRMKGTSDYWRLVS FT NLSGEPTTGCEQFLDVLANTCRWKGSELPDKVGTVKLQQCVTLLAKQEHLV FT WGKLPKNVPMSPGSTVVVEPTTSRSASRDIMVGRVVTPLWGDRWVPVKVTN FT FSDKPVTLKRNRKLADVFPCVAVEDFELFQGSSQAGVLNDENQHLSHSSDS FT LRQRLETVGLSDIDIQSCSVSHDTREKLVELLERYNDVFSKHALDCGEVKG FT FVHRIRLMDERPFRLPYRRVPPAHYQKLRQVLSQMEEQGIIRKSVSEYASP FT LVLVWKKDGNLRICTDFRWLNARTIKDAHPLPHQSDCLAALGGNTLFSTID FT LTSGFYNIPMAEEDKKYTAFTTPVGLHEYNRMPQGLCNSPASFMRMMVSIF FT GDLNFSSLLCYLDDLLVFAPSEPEALERLEVVFSRLRQYNLKLSPKKCHLM FT RTSVRFLGHIIDGNGVAVDPAKVDVISKLSKFDLMEDDKCTPSVRRIKSVL FT GMILYYQHFIPGCSSLAKPLFALTAGQRRKRSIKGDPKAGTYRKLKPTDWT FT VECDHSLNSLKESLLQSVVLAHPDFTRPLILAIDASLDGLGAVLSQVPAGE FT ERARPIAFTSKTLSKSQRNYPVHKLEFLALKWSVCEKFSHWLKGQTFTVWT FT DNNPLTYVMSKAKLDACEQRWVSKLAAYTFELKHIPGTKNIVADALSRDPF FT VKTVSSRLMKESYRDLLVEAKEVNGDEVRDAFRQGAQNLQICLQSCLTLPS FT DSDTIKACLNLHDQWEETSQARAVQLIQSYQDFVPTCTDVLPSITADEMCD FT MQHADLAVSGVVPFLNLKRQPSRRERAKMNPAGLSLLRHWEKLKIKEGVLY FT RVIQDPHSKLKRYQLVLPSCLKEKALRGVHDLAGHQGQARTIHLARQRFFW FT PSMGKDICDYIKCCQRCILAKTPEPSARAPLESIKTSMPMELVCLDFWTAE FT DAKKHSVDVLVLTDHFTKLAHAFPCSNQSAKQVARKLWDRVFCVYGFPGRI FT HTDQGANFESALLAELLKLSGVSKSHTTAYHPMGNGEVERFNRTLGSMLRT FT LPLKEKHKWPEQIQTLTFAYNATVHETTGFAPFQLMFGRVPRLPVDVMFGR FT VLHDPVVVDHSTYLKSLMSHLHEAAEIAQRHSEREQRKQAKGYNKRVKGIC FT LNIGDRVLVANKGERGKRKLSDKWDPVVYTVIDRNLQTHTYKIKDENGRVK FT VVHRNMVLDISFLPVESSDEERIIPVSVSESDDDTCYNENEITMQDCDSVD FT SGGVIITLEDKEIRSPVVSKFETSQAQDIDAEKGLESDQLDMNGGESSNQL FT RDLENSNSETSLDADLNCDTEEPTPTQFQTDPQNYTETQAAVPDIQGVRTR FT AGRVVKKVNRLIESMVQKPFNIQGVGNQLKKRSESFLSLF" XX SQ Sequence 6487 BP; 1913 A; 1304 C; 1563 G; 1707 T; 0 other; ttggtgccgt gacccggatt tgaagaattg cttcaggatc cacgtcagtc gggtcaccga 60 gggacgctgc attaaggttc accataagtt aatgaaggac tgtgataaaa acaaatgaac 120 atttcttatt ttgagtgttg attgatacat attttcacta gagtgactat ttttgataac 180 cagtgtttat tgcatttata atttggtctt aaaggtacat atttttattt tattattatt 240 aattcctctc tatattctat atttttctaa gttgaacatg gcatttagtt taaatgcaga 300 tgatttcgat gagacaatga ctggtttagg tcttggtagg gggagatttt ttggcgagac 360 tccagttact ccaattggga gagaaagtaa gattaatggg aagccgcaag tgttatgctc 420 cacacgtata gctgaccagg tagctagccc cacagctgac accgtgcctt tagctgttcc 480 tgatttaaat aatccagatt ggcaggggtt tattgcacag attgttcaac aagttggaca 540 gacaatgtta acatcacaaa aagaagctag tagtaaggga gaagaaaatg cagacactca 600 gtcacaaagc ttaggtgcca ccaaaaccac atttactgac ataccttcac ttaatttgac 660 gggggtcagg ttgattatgc agtcggatgc taaggaacct ccagtctatc ggggagattc 720 aactgataag ctcagtgtgc gagagtggga agaactaatg gaaacatatc tgcgaaagag 780 aggtacacca ttagctgaac agcatcaaga gattttaagc agattgatgg gtaaagcgaa 840 ggatgtcgtc aaagtaacgc tccgctgtaa ttcagctttg aaaccaattg aggacccaaa 900 agttattatg aacatcctga aacaacactt tggcgatgtt aattattcct gtatgccctt 960 ggcagacttc tatggtacag ttccattagc aggagaaaac cctattgagt attgggtgcg 1020 tcttaacaaa gcagtagatc tcacagagga ggctttgaag agacttggac ggcagatgca 1080 aaatccgtgt caggaagcag ccatgatgtt cgtgaagtac tgccctgacc ccaccctctc 1140 tgcagtgttt cgctttaaag ctccagaaaa atggactgca caggagattc aagagcagct 1200 tgatcgatac cagacagatc aaagagagca aatgtcaacc aaaccaaaac gccattctgt 1260 tacaaggcat gtaactgctt gcattcagac ccctcacaag gatgcagtga gtgttgtacc 1320 ttctgaagag ccaggtgagt gtgatgtcag catcacacgt gcagcacaga gtgatgacaa 1380 ttgtatgagg actttgatta accttcttga tcgagcactt tctcaaaatg cccaaacgcc 1440 atctcaacca tctcagagta agctttgcaa agtgtgcaga tcccgtgatc actccactgt 1500 aacacactgt cggcgtgaac atctgtgctt tacttgttat cagcctggtc atggaaaaag 1560 agaatgcacg aacgaacgtt tcaggccaag tcaagcgcca cccaggggct cggcaggttt 1620 aaactggatg gctcacgttg tggaggggga tgcgtgagcc tcagtagaga acccctagag 1680 aatgttgatc ttctccagtt ttatgaaacc gcatgtgcac aggctccatc agggaccaaa 1740 gtgatagcgc aaaatgtgca aactcttgaa gtatttgaca acttgttcta tgcaccagtc 1800 accattaata ataagcatac tgtgaagggg atgctagata ctggctcaat ggcttgcacc 1860 ctaagtgaca tagcagaaaa gaggatgtta agtgaaaatg tcttgtctac agccacagct 1920 cttcagcagg atgtcatctt agtggggtgt gggggtaaag taacaaagcc aaagtgtatg 1980 tatgaggttg aactgaaagt ttatggagag agctgtcaag tacctattct tgttgtaccc 2040 ggccagcggg atgatttaat tattggcaca aatgtgatca aattcctcgc acatcgtatg 2100 aaaggcacga gtgactattg gcgacttgtt tctaatctct caggggaacc gacgacagga 2160 tgtgagcagt tcttggatgt tttggccaac acatgccgat ggaagggttc agagttgcca 2220 gacaaagtag gaacagtcaa gttacagcag tgtgtcactc tgttagcaaa gcaagaacat 2280 ttggtatggg gaaagctgcc caaaaatgtt cccatgtcac caggaagcac agttgtggtg 2340 gaaccgacaa cttcaagatc tgcatcacga gacatcatgg tggggagagt tgtaacgcca 2400 ttatggggtg atcgatgggt ccctgtaaaa gtcacaaatt tctctgacaa accagttact 2460 ctcaaaagga atcgtaaact ggcagatgta ttcccatgtg tcgccgtgga ggattttgaa 2520 cttttccaag ggtcaagtca agcaggggtg cttaacgatg agaatcaaca tctatctcat 2580 tcgtcagata gtttaaggca gcgtttggag actgttggcc tttctgatat tgacattcaa 2640 tcttgttctg tcagccatga cactagagag aagttagtag agttactgga gagatacaat 2700 gatgtgtttt ccaagcatgc tttggactgc ggagaagtga agggttttgt ccaccgcatt 2760 cgactaatgg atgagcgccc attcagactt ccttatcgaa gggtgccgcc tgcacactat 2820 caaaagctac ggcaggtttt gtctcaaatg gaagaacaag ggataataag gaagtctgtg 2880 agtgaatacg cctcgccttt agttctagtg tggaagaaag atggcaacct gagaatatgt 2940 acagatttca gatggttaaa tgcgagaaca atcaaggatg cacatccact tccacaccag 3000 tctgattgtc tggctgcttt aggcggaaac actctcttta gcaccataga cctgacctct 3060 ggattttata atatccccat ggctgaagaa gataagaaat atactgcgtt tacgacccct 3120 gttgggctgc atgagtataa taggatgccg caagggctat gcaacagtcc ggcatccttt 3180 atgcgaatga tggtaagcat cttcggagac ttaaacttca gcagtttgct gtgttatctc 3240 gacgatctgc tagtgttcgc tccaagtgag ccagaagcat tggaaaggct agaggtcgta 3300 ttcagccgtc tgagacagta caatcttaag ttaagcccta agaaatgcca tctaatgagg 3360 acttctgtga gattcttggg tcacattata gatggaaatg gagttgcagt cgatccagcg 3420 aaagttgatg ttatctccaa actctccaag tttgatctaa tggaagacga caagtgcacg 3480 ccatcggtgc gaaggattaa atcagttctc ggcatgatcc tttattatca acacttcata 3540 cctggctgtt catccttagc aaagccactc ttcgctctta cggctgggca aagaaggaag 3600 aggagcatca agggtgatcc taaagctgga acctacagaa agctcaagcc tactgattgg 3660 actgtggagt gtgatcattc attgaatagc ctcaaggaga gtctattgca aagtgtggtg 3720 ttggcacacc ctgactttac tcgacctcta atcttggcga tcgatgcctc cctagatgga 3780 ttgggggctg tattatctca agtgcctgca ggtgaagagc gtgctcgccc tatagcattc 3840 acaagtaaga ctctaagtaa atctcaacgg aactatccag tccacaagct tgagttccta 3900 gcactcaaat ggagcgtgtg tgagaaattt agccactggc taaagggtca aacatttact 3960 gtttggacag acaacaatcc cctcacatat gttatgtcta aagccaaact tgacgcatgt 4020 gagcaacgtt gggtgtccaa attggcagct tacacctttg agctgaagca tattccaggg 4080 acaaagaaca tagtggcgga tgcgttgagt agagaccctt ttgttaagac ggtgagcagt 4140 aggctaatga aggagagtta tcgagactta cttgtggaag ctaaagaggt taacggagat 4200 gaagtcagag atgcattccg tcagggagct cagaacttac aaatttgcct ccaatcctgt 4260 cttaccttac catctgacag cgatactatc aaagcttgct taaatttgca tgatcagtgg 4320 gaagagacat cccaagcaag ggctgtgcag ttaatccagt cttaccagga tttcgttcca 4380 acctgtactg atgttcttcc tagcattact gcagatgaaa tgtgtgacat gcaacatgct 4440 gatttggccg tatcaggggt ggtgcccttt ctgaatttaa aacgtcaacc gtcaaggcgg 4500 gaaagagcca aaatgaatcc tgcaggttta tcccttctca ggcattggga aaaacttaag 4560 atcaaggagg gagtattata cagagtcata caagatccac acagcaaact caagcggtat 4620 cagttggtat taccttcttg cctcaaggaa aaagctctcc ggggtgtgca tgacttggca 4680 ggtcatcagg gacaagcgag aactatccat ctagcccgtc aacgtttctt ttggccaagt 4740 atggggaagg atatttgtga ttatatcaag tgctgtcaga gatgtatctt agccaagact 4800 ccagaaccat ctgcccgtgc ccctctcgaa agcatcaaga cctctatgcc aatggaactg 4860 gtgtgcctag atttctggac cgcagaggat gcaaagaaac actctgtaga tgtgctcgta 4920 ttgactgacc atttcacaaa gttggcacat gccttccctt gttcaaatca gtcagcaaaa 4980 caggttgcaa ggaaactatg ggaccgtgtg ttttgtgtgt acggatttcc aggtcgcatc 5040 cacaccgacc aaggggccaa ttttgaaagt gctctcttag cagagctgtt gaaactctca 5100 ggtgtctcca aatcccatac caccgcatac caccctatgg gaaatggtga agtcgaaagg 5160 ttcaaccgta ccttgggaag catgctccgt actctgcccc taaaagagaa gcacaaatgg 5220 ccagagcaaa tacaaacact gacttttgcc tacaatgcta cagtccatga gacaacaggg 5280 tttgctccct ttcaactcat gttcggccgt gtgcctaggc tacccgtgga tgttatgttt 5340 ggccgagtcc tacatgatcc agtagtcgtc gatcatagta cttacctgaa atccttgatg 5400 tcacatcttc acgaagctgc agagattgca cagagacatt ctgaaagaga acagagaaaa 5460 caagcgaagg gttacaacaa gagagttaaa ggaatttgtc tgaatattgg agaccgtgtg 5520 ttggtggcaa ataaaggaga aagagggaag aggaaactat cagacaagtg ggatcctgtg 5580 gtgtacactg tgattgacag aaacctccaa acacacactt acaagatcaa ggatgagaat 5640 gggagagtaa aggtagtgca cagaaacatg gtgttggata tcagtttcct acctgtggag 5700 tcgtctgatg aggagaggat cattccagtc tcagtgagcg agtcagatga tgacacttgt 5760 tataatgaga atgagattac catgcaagac tgtgattcag tggattcagg aggtgtgatt 5820 ataactttgg aggataagga gattagaagc ccggtcgtct caaagtttga aacctctcaa 5880 gcacaggaca ttgatgctga gaaaggcctt gaatcagatc agcttgatat gaatggagga 5940 gagtccagta atcagttacg tgacttggag aatagtaaca gtgagacatc tctggatgct 6000 gacttgaatt gtgacacaga agaacctaca ccaacacagt ttcaaactga cccccaaaat 6060 tacactgaaa ctcaggctgc tgttcctgac attcagggtg ttagaacccg ggctggtaga 6120 gttgttaaga aggtgaatag gttaattgag tcaatggttc agaagccttt taacattcag 6180 ggtgtaggaa atcaattaaa gaaaagatct gagtcatttc tgtctttgtt ttaaagagct 6240 tcctttaaat ttatttgatg caaagaacta gttttgaatt cgaatggaca ataaaaaggg 6300 ttgggtcata atatggtgta caaggcatca tattattcca agaagatttt aggcttgatc 6360 accctgatgt tttcttgact ccttagtggg taactttcaa gttgaatgaa tgaattagtt 6420 tcattgagga cctataccaa gtgatttgaa ttctaggcag attttaatga aatcaaaaag 6480 gggtgta 6487 // ID hAT-5_DR repbase; DNA; ZEB; 2496 BP. XX AC . XX DT 08-OCT-2008 (Rel. 13.1, Created) DT 08-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE hAT-5_DR is an autonomous DNA transposon - a consensus sequence. XX KW hAT; DNA transposon; Transposable Element; transposase; KW hAT superfamily; hAT-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2496 RA Bao W. and Jurka J.; RT "hAT-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1340-1340 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 473..2323 FT /product="hAT-5_DR_1p" FT /translation="MAESGKRTAKRKYEDERRTFLSEWEDLYFFVERNGKP FT FCLICQTSLSHFKASNLERHFTSLHSAVAREFPKGSELRKHKVKTLKGQAE FT KQTQLFRKFTKHSETVTLASYQLAWNIARAKKPYLEGEFVKKCLSDAVAIL FT CPENENLKRSVKDLQLSRHTVEQRISDIDNSVETHLLSDLQKCQYFSIALD FT ESCDVQDKPQLAIFVRFVSEDCTIREELLDIVPLKDRTRGIDLKETLMTVV FT EKANLQLSKLTAIVTDGAPAMLGSERGLVGLCKADDRFPAFWTFHCIIHQE FT HLVSKKLNLDHIMKPVLEIVNFVRTHALNHRQFKNLIDELDEDLPSDLLFH FT CAVRWLSRGHVLSRFFELLNPVKLFLAEKHKEYPELHDPQWISDLAFLVDV FT LHYLNGLNVDLQGKLKMLPDLVQSVFAFVNKLKLFKTHLQKRDYTHFPTLL FT KASGQEAEVVKGSTARYATLLENLEQSFEERFSNLRQKRQQITFLINPFTA FT ESGCLKAPLVEDEAASQLEMIELSEDDRLKSVLREGTVEFWKIVPVERYPN FT VKQAALKLLSMFGSTYVCESLFSTLKLVKSKHRSVLTDTHVKELLRVATTE FT YEPDLKKIVETKECQVSH*" XX SQ Sequence 2496 BP; 715 A; 531 C; 557 G; 693 T; 0 other; caggggtggc gaacccgcgg ctcgcgagcc gcttgcggct cttttactgc tcttgtgcgg 60 ctcaacctct cgcgcgattt aacatacatt aaccaattga ctaacgcccc attcacacgg 120 tgcttctgcg tctctttctg cttttgtgcg gctcaacctc ttgcgcgatt taacatataa 180 taaccaattg actaatgccc cattcacacg gggcttcagc gttgctcgct gcagaagctg 240 ggagtagctc aacttttcaa gcgcttacgg acgcgttagc caatcagatc gctgtatgca 300 aatacacctg ctagacagtg gcctattgct gactgaattt tattttctga cgcttccatg 360 acgatcgttt cagctctaac ttcagacacg ccttcagtca agggttgaca ctgaagcccc 420 gtttgattgg ggcgcaacac gagtgacgtg cagttcgtta atatcaatcg tcatggcaga 480 gtcaggaaag cgtacagcta agcgtaaata cgaagatgag cgcagaacat ttttatctga 540 atgggaggat ttatattttt ttgttgaaag aaatggaaag ccattctgct tgatatgcca 600 gacttcgttg tcacatttta aggcttcaaa tctcgagcgc catttcacct ctctccattc 660 tgccgtagcc cgggaatttc ccaaaggcag tgaacttcgc aagcacaagg taaagacact 720 gaaaggtcag gcagaaaagc aaactcagct tttccgaaaa tttactaagc attcagagac 780 ggtcacatta gcatcatatc agctagcatg gaacattgct cgtgcaaaaa agccatacct 840 tgaaggagaa tttgttaaaa agtgcctcag tgatgctgtt gctattcttt gtcccgaaaa 900 tgaaaatttg aaacgttcag taaaggacct tcaactgtca cgacacactg ttgaacagag 960 aatatcggac attgacaatt cagtagaaac acatttactg tcagatcttc agaagtgcca 1020 gtattttagc attgcgttgg atgagtcctg cgatgttcaa gacaagccac aattggcaat 1080 atttgtacgg tttgtgtcag aagactgcac aattagggaa gagctgcttg acatagtgcc 1140 gctgaaagac agaactcgtg gcatcgatct gaaagagaca ctgatgactg ttgttgagaa 1200 agcaaatctg cagttatcaa aactaacagc catagtcacg gacggagctc cagccatgct 1260 gggatctgag agaggtcttg tagggctgtg taaagctgac gaccgttttc cagcattttg 1320 gactttccac tgtattatcc atcaagagca cttggtgtct aaaaagctaa acttagacca 1380 cattatgaaa cctgtgctgg aaattgtcaa ttttgtacgc acacatgctc tcaatcacag 1440 acagttcaaa aatctcattg atgaacttga tgaagacctt ccatctgatc tgctttttca 1500 ctgtgccgtg cgatggcttt caagaggaca tgtgctttct cgtttctttg agcttttgaa 1560 tccagtgaag ctgttcctcg cagagaaaca taaggaatat cctgagctac atgatcccca 1620 gtggatttca gatttggctt ttttggttga cgtgctgcat tatctgaatg gactgaatgt 1680 ggatctgcaa gggaagttaa agatgctgcc tgacctggtg caaagtgttt ttgcatttgt 1740 caacaaactc aagctgttca agacacatct acaaaagaga gactacacac attttccaac 1800 cttgttgaag gcaagtggac aagaagcaga ggttgtaaaa gggagtactg cacggtatgc 1860 tacgctgctt gaaaatcttg agcaaagttt tgaggaaaga ttctccaacc tacgacaaaa 1920 aagacaacag ataacatttt tgatcaaccc attcactgct gaatcaggct gtttaaaagc 1980 ccctttagtg gaagatgaag cagcatctca gcttgaaatg attgaacttt ctgaggatga 2040 cagacttaaa agtgtcctga gggaaggaac cgtggagttt tggaagattg tgcccgttga 2100 gagatatccc aatgtcaaac aggctgcact caaactgttg tcaatgtttg ggtcaactta 2160 tgtctgtgaa tcactttttt cgacactgaa actagtgaaa tcaaagcatc gctctgttct 2220 tactgacaca catgtgaaag agttactacg agtagccaca actgaatacg agccagacct 2280 gaaaaagatt gtggaaacca aggaatgtca ggtgtcccac tgagtcaagt aaaatgaggt 2340 aggctaattt aaatccatga aaactattgc gttatataat taggtgtata agttattgtg 2400 acagatatac tgttgtttgc gcatctgttg tgtggctctt tgcagtgata aagttttttt 2460 tttggctcat catgccaaac aggttcgcca cccctg 2496 // ID piggyBac-N5_DR repbase; DNA; ZEB; 2136 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE piggyBac-N5_DR is a putaitve piggyBac nonautonomous DNA DE transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW TSD TTAA; piggyBac-N5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2136 RA Bao W. and Jurka J.; RT "piggyBac-type DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1800-1800 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2136 BP; 718 A; 380 C; 351 G; 683 T; 4 other; ccccctgggg tcgaagaccg cgtatacgcg ttctgacaag tttttacctc attgagctga 60 aatgaaccta aattacactt tcagttttga tcgtacagat aagatcaata catcaatcga 120 atctgttagg tgtctagttt taaatgtgta cactcacaac aaaaacaaat gtgtgtgctt 180 ttgcaaaaca aggataataa acagtgtagg cattctgtct tttctctctc cgtgaactgt 240 ttttagaaac gcgtcagtaa aatgcgctga aactccgcga aaacaaaaca cccagacatg 300 agacacatat ctttataaag cttgaagtgt ctacttttaa atgcagctag tgcatgttga 360 aaacaaatat tgtttgataa agtaatcagt atgaaaccaa cgctatgtct agtttctcag 420 tctgagctca ttagagttaa tgcgaccacg cccacgaggc tttgacatta caaatagcct 480 gcatgagaag cgcgcaacat gtagactgtg tcaggaaaaa cacgtcatta aaatgaactg 540 aaactcaayg caaatactcg acaaagaaac atgtgagatg tttctaaaga aagcttgata 600 tgtctacttt tgcattaacc aattcaagtc gaaaacaaat aatctccttt catgtaatcc 660 gtataaaagt aagggctggt actggtacgg attctcctgt cgcctcatta cagcgaacgt 720 gatcccgccc gcagtgaagg cgcagttcat atggaaatga actgaggtga gccagccagg 780 tgataatgta cacgctcccc gagacatggc atcaaacgac aagctttatt atagttactt 840 atttttgttt gaatgatggc acgctacacg ggtgaagaag cacagtacta aacagcgaca 900 aagaatttac cttctcctca gaggaagagc gcgactctga tgacgagcgc atgcattttg 960 aagaacgtct agacccagca aaggacagaa ttcctgtcga gtaagtcttt tattctaacc 1020 tttatgattt atatgaaata tgtccttaca ggccacttat agtttcattg ctatacaaag 1080 tgtatcctat aacgtatagc gtataatgat aaacataggc ctatataatg ataaaatgta 1140 atgttatatg aaataaacaa ttataataat ayatataaat atatatatat atatatatat 1200 atatatatat atatatatat atatatatat atatatatat atatatatat acacagcaac 1260 tattgaatta wtaaaacaaa ttatatgtat attatacatt attaatgtct gtttatatgm 1320 atattaatat aaaaataaat acttgtttac aatacaggtg ctgttctgtt taaaattcct 1380 atgtggtaac tgagtttatt aggttaacta gtattagtag taattttatg taatagctga 1440 ttatttttaa gtagatttgc tatgtgtaag tttatatagt ttacagagtt ttatacagtg 1500 tactgactct cacaagtgta tatcgagtta ttttgtgtga atagttaaat atattttaaa 1560 taaactgcaa acaaaaaata tatatataaa cttatttttt tctcagattc tttcttgaaa 1620 ctcttagtca catacctgtc tgatggctca ttatgcagct cattatgcag gtctttgtct 1680 tctctggtgt gactcactgc attagtaatg atagttcacg ccttctcgca taaaccattt 1740 tcacacagaa agtgacttag aaattttaat tcattgtatt gttttctgtg agcaagtgaa 1800 caagatgatt ctcacatcat tttgaagcaa aaacactaga ctacaagatc cagttctcaa 1860 aagtcttgtg cacaaatgtt ctctgtgtgt ttcacggcct tgtttcagtg acttcaaaat 1920 tttagttttt cactaaccac gcataaacgt ttctttctca aaaacacaaa cacatacaaa 1980 catgctgctt aggtattatt gtagcccaac ttgtgctgtt tacagtgtta tcacacttta 2040 gccattaata tgtttttaag atactgaaaa caacacaaat gtcagtgcat gtcaaaactt 2100 ctccagagcc ccaaaacacc ctcagacccc agaggg 2136 // ID Gypsy-33-I_DR repbase; DNA; ZEB; 4490 BP. XX AC . XX DT 01-JUN-2007 (Rel. 12.06, Created) DT 05-JUN-2007 (Rel. 12.06, Last updated, Version 1) XX DE An internal portion of the Gypsy-33_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-33-LTR_DR; KW Gypsy-33_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase; Gypsy-33-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4490 RA Dib M.R. and Naveira H.F.; RT "Gypsy-33_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(6), 188-188 (2007). XX DR [1] (Consensus) XX CC Gypsy-33-I_DR is an internal portion of the Gypsy-33_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-33-LTR_DR. CC Gypsy-33_DR is characterized by 4-bp target site duplications. CC The internal portion encodes one polyprotein the 1490-aa CC polyprotein Gypsy-33_DR1p (pos. 19-4488) composed of the gag, CC protease, reverse transcriptase, and integrase domains. Some CC insertions fairly recent, according to the high identity between CC their flanking LTRs. Consensus obtained after the alignment of at CC least five independent insertions bearing at least 85% homology CC over at least 1000bp. XX FH Key Location/Qualifiers FT CDS 19..4488 FT /product="Gypsy-33-I_DR_1p" FT /translation="MATRKTAPKGRKPKTGLRSQVKAASEEGADWNSVVTT FT EDEEGATSRPRVSSKSSSTPSKPVTSDATAGEGMYSLMRRFLDSQEEREDR FT YMQELRGLRDSILQSIRPAETSIDAESARMELPTPATKVSTKHRTAYIHDS FT PNVTEPRVPMQWTEPKMPSFQQGEDIENYLQRFERLARTWRWPKEEWSYRL FT VPLLTGQALEAYLAMDEEEAEVYADLKEALLEKFNISPETYRQRFRSSTVP FT VGESPTETYHRLKNLYKRWVQPEEHSKEEIGEVIILEQLLRVLPYDARTWV FT KEHEPKSGQAAARLAQQYLNAHRGGLRTQPHKGTVRHTLHNTGSERGRTEL FT SDNAQTQKPATKDLICFYCQQLGHKASVCPARKAKLTGFCYVPREGDCDFD FT SVGEGHNVYNVTVNGHELKALLDTGSSLSMMKSRFVHNVSYENTTSVQCVH FT GDIKQYPRAEVMVEVQEQMYLLNVAVVDNLPADMILGRDFPVLYELLQPTI FT KDSERSIATVNVSCPALTRAQARAGLQPLPDLDSSLLQGGTKGPRKSRRQR FT RLEKYLGTPASESSGEGLKVDGWKVPGNITQLQREDKTLKPLFVKAESEQT FT SILCNEVYVVENGVLYVRTNDVLRLVVPSCCRPLVLHLAHTVPWAGHLGQQ FT KTYARISSRFYWPTLYTDVQTHCKTCAVCQKTSAVSQRGRAPLQPLPVISA FT PFRRIAMDIVGPLEKSSAGHRYILVVSDYATRYPEAYPLRSITTPKIIHAL FT IQLFSRVGIPEEILTDQGTNFTSRLMGQLHKQMGITAIRTTPYHPQTDGLV FT ERFNQTLKNMLRKFVADTGRDWDKWLPFVLFAYREVPQASTGFSPFELLYG FT WQVQGPLDLLKKDWEGEPASKKEESGIVHYVLEMRDRLEKYREQAKENLLE FT KQQAQKRWYDQHARLRQFQEGQKVLLLLPTSTNKLLAKWQGPYTVIRKMGP FT VTYEIHHPDKGKTRQTYHVNLLKEWKEPPGKGPETALLVRKVEVEEDVEEE FT DMEDVKRQPSVVNLTHLEDSKREELQNLLNLFPALFCQRPGRTELTQHTIH FT LSDPTPTRQRPYRVPERLVEPLKEEIKLMKELGVIEPSTSEWSSPMVIVPK FT KDGSLRVCIDFRKLNAQSKFDAYPMPRIDDLLEKIGQAKYITTLDLCKGYW FT QVALNPESRPYTAFRTPFGLFHFTALPFGLHGAPATFQRLMDRVLQGCEDW FT AAAYLDDVVIHSNSWTEHLQHLQQTLKKIAEAGLTLNVSKCEWAKQEANYL FT GYHLGNGQLKPQIDKVEAICRSPQPKTKKEVRSFLGLVGWYRRFVPNFASI FT AAPLTNLLSKSVTNPVPWTNDCEKAFKTLKEKMCSSPVLQSPDFSKRFLVQ FT VDASATGIGAVLAQGNAEEQKPVVFLSRKLLPRETRYSAIEKEGLAIKWSL FT DSLRYYLLGREFDLETDHRALTWIQSMKDHNARVTRWYLALQPYHFKIRHR FT PGRLNVVADYLSRFPASTRLGEGEG" XX SQ Sequence 4490 BP; 1284 A; 1070 C; 1148 G; 988 T; 0 other; tctggtggca gcggtgggat ggctactcgc aagacagcgc caaaaggccg aaaaccgaag 60 acgggtttac ggtcccaagt gaaggcagca tcagaggaag gggctgattg gaattcggtg 120 gtgacaacgg aggatgagga gggagccaca agtcgtccca gagtctcctc aaagtccagc 180 tcaaccccat ccaagccagt tacttctgat gctacagctg gtgagggaat gtattccctg 240 atgcggagat ttctggactc tcaagaggag agggaggacc gatatatgca ggagctccga 300 ggtcttcgtg attccattct gcagtcaata cgtcctgcgg agacttccat tgatgcagag 360 agtgcacgaa tggagctacc gacacccgca acaaaggtat ccacaaaaca cagaacagcc 420 tatatccatg actctccaaa tgtcacagag ccaagagtgc cgatgcagtg gactgagcca 480 aagatgccaa gcttccagca gggggaagat attgagaatt atcttcagcg gttcgagcgt 540 ctggccagaa catggaggtg gcctaaggag gagtggagct accgtctggt tccgttgctg 600 acaggacaag cactggaggc atacctggca atggacgagg aagaggctga ggtctatgcg 660 gatctgaaag aagcactgct ggagaagttc aacatctctc cagagacata ccgccagcgc 720 ttcaggtcat ctacagttcc agtgggagaa tcgccgacag agacctacca ccgactaaag 780 aacctgtaca agcgatgggt ccaaccagag gagcactcca aagaggagat tggcgaggtc 840 atcatcctgg agcagctgct tcgggtgcta ccctacgacg cacgcacctg ggtaaaggag 900 cacgagccaa agtcaggaca ggcagcagca cggttggctc agcagtacct gaacgcacat 960 cgtggcgggc tccgtactca accacataaa ggtactgtac gtcacacatt acataacact 1020 ggttctgaga ggggtcgtac tgaactgtcg gacaatgcac aaacccaaaa acctgcaacc 1080 aaagacttga tttgctttta ctgccagcag ctgggtcata aagcatcggt atgtcctgct 1140 cgtaaagcta aacttacggg tttttgttat gtgccaagag aaggggactg tgattttgac 1200 tctgtggggg agggtcataa tgtatataat gttactgtga acggacatga actaaaagca 1260 ctcttagata ctggaagctc tttgtcaatg atgaaatcac gttttgtaca taatgtcagt 1320 tatgaaaaca ccacctctgt tcaatgtgta catggggaca taaagcagta tccccgagct 1380 gaggtcatgg ttgaggtaca agaacaaatg tacttgctta atgttgccgt agttgacaat 1440 ttacccgctg atatgattct aggaagagac tttccagtac tgtatgaact tttgcaaccc 1500 accatcaaag actctgaacg ttctattgct actgttaatg tgtcatgtcc agctcttaca 1560 agagcacaag ccagggcagg tctacagccc ttgccagact tggatagtag tctgttgcag 1620 ggcgggacca agggccctag gaagtcaaga cgccagcggc gtcttgaaaa gtacctggga 1680 actccagcct ctgagtcctc aggagagggt ctcaaagttg atggctggaa ggtcccaggg 1740 aatataacac aacttcaaag ggaagataaa accttgaaac ccttatttgt taaagccgag 1800 tctgaacaaa catcaatttt gtgcaatgag gtatatgtag tagagaatgg tgtgttatat 1860 gtgagaacaa atgatgtgtt gcgtttagtt gtgccatctt gttgtcgtcc tcttgttctt 1920 caccttgcac acactgttcc atgggctggt cacctaggcc aacagaaaac atatgcacgc 1980 attagctcac gattttactg gcccacactt tatactgatg tacaaacaca ctgcaaaaca 2040 tgtgccgtct gccaaaagac aagtgctgtg tcccagcggg gcagagcacc cctgcaacct 2100 ctccctgtta tctctgcacc ttttaggcgg attgccatgg acattgtggg gccattggaa 2160 aagagcagtg ctggacatcg ttacatcctg gtggtcagtg actacgccac aaggtatcct 2220 gaggcctacc cactccgttc tattacaaca cccaagatca tccatgcctt gattcagttg 2280 ttctccagag ttggcatacc tgaggagata cttaccgacc aaggaacaaa cttcacatca 2340 cgtctgatgg ggcagctaca caagcagatg ggcatcaccg ccataagaac aacgccatat 2400 cacccacaga cagatggact tgtcgagcgg ttcaatcaaa ctctcaagaa catgctgcgg 2460 aagttcgttg cagacacagg ccgagattgg gataagtggt taccgtttgt tctctttgct 2520 tacagagaag tgccccaggc atcgacgggc ttctcaccat ttgagctcct ttatggatgg 2580 caggtacaag gacccctaga cttgctgaag aaggactggg aaggagaacc tgcatcgaag 2640 aaggaggaga gcggcattgt gcactatgtg ctagagatgc gagaccgttt ggagaagtac 2700 agagaacaag ccaaagaaaa tctgctagag aaacagcaag cacagaagag atggtacgac 2760 caacatgcca gactgagaca attccaagag ggacaaaagg tgcttctact tttaccaact 2820 tcaactaaca agttactggc taaatggcag gggccataca ctgtgatccg caagatgggg 2880 ccggtaacgt atgaaattca tcatccagac aaggggaaga ccaggcagac ctaccatgtc 2940 aacctattaa aggagtggaa ggagccaccc ggtaagggac cagaaacagc cctcctggtg 3000 aggaaggtgg aggtagaaga ggatgtggaa gaagaggata tggaggatgt aaagagacag 3060 ccgtctgtgg tgaatctcac tcacctagaa gactccaaaa gggaagagct gcagaacctc 3120 ctaaatctgt ttcctgcttt gttctgtcag aggcctggac ggactgagct aacccaacac 3180 accatccacc tttctgatcc aacaccaaca cgtcaacgtc cctaccgggt gcctgagagg 3240 ctggtggaac ccttgaaaga ggagataaag ttgatgaagg agctaggagt gatcgaacca 3300 tcaacgagtg aatggagcag cccaatggtg attgttccca agaaagatgg atccttgcgt 3360 gtctgcatcg atttccgcaa gctcaatgcc cagtccaagt ttgatgcata cccgatgcca 3420 agaattgatg atctgctgga gaagattggt caagccaaat acattaccac gctagactta 3480 tgtaagggat actggcaggt agccctcaat ccggagtcca gaccatacac tgctttccga 3540 acaccatttg gattatttca cttcacagcc cttccctttg gccttcatgg ggcaccagcc 3600 accttccaaa ggctaatgga tagagtcctt cagggttgtg aagactgggc ggctgcatat 3660 ttggatgacg tggtaattca cagtaattct tggactgaac atctacaaca tcttcaacag 3720 acactcaaga agatagcaga agcaggactc accttaaatg tgtctaaatg tgagtgggca 3780 aagcaagagg ccaattacct cggataccac ctgggcaatg gacagcttaa acctcagatc 3840 gacaaggtag aggcgatttg caggagccca cagccaaaga ccaagaagga agtgagatcg 3900 ttcctaggac ttgtaggctg gtacaggaga ttcgtcccaa actttgcctc tatcgctgct 3960 cctttaacaa acttgttgtc taagtcagtc acaaatcctg ttccatggac taatgactgt 4020 gaaaaagctt tcaaaaccct gaaagagaag atgtgttcca gtccagtctt acagagtcct 4080 gacttctcaa aaaggttcct ggtacaagtg gatgcttctg caacaggaat aggagctgtc 4140 cttgcccaag gaaatgcaga agaacaaaaa cccgttgttt tcctcagccg caagctactg 4200 ccgagggaga caagatattc agcaattgaa aaagaaggtc ttgcaatcaa gtggtctctc 4260 gacagtttac gttattacct cctaggccgt gagtttgatc tggaaactga ccaccgggcc 4320 cttacctgga tccagtccat gaaagaccac aatgccagag tcacccgatg gtacctcgcc 4380 ctgcagccat accactttaa aatacgccac cgaccgggta gactcaatgt ggtagccgac 4440 tacctgtcca ggttcccggc cagtacgcgg ctgggagagg gggaaggtga 4490 // ID Gypsy-21-I_DR repbase; DNA; ZEB; 6426 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portion of the Gypsy-21_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-21-I_DR; Gypsy-21-LTR_DR; Gypsy-21_DR; KW endogenous retrovirus; gag; integrase; protease; KW reverse transcriptase. XX NM Gypsy-21-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6426 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-21_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 15-15 (2005). XX DR [1] (Consensus) XX CC Gypsy-21-I_DR is an internal portion of the Gypsy-21_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy-21-LTR_DR. The consensus sequence was reconstructed CC based on multiple alignment of five proviral copies (they are CC less than 1% divergent from the consensus sequence). CC Gypsy-21_DR retrotransposons are characterized by 4-bp CC target site duplications. The internal portion contains two CC ORFs encoding the 562-aa Gypsy-21_DR1p gag (pos. 86-1771) CC and 1585-aa Gypsy-21_DR2p pol proteins (pos. 1672-6426) CC composed of the protease, reverse transcriptase, and integrase CC domains. The second protein, including the protease domain, CC does not start from Met. Presumably, the gag-pol fusion protein CC is formed originally due to a ribosomal frame shift. XX FH Key Location/Qualifiers FT CDS 1672..6426 FT /product="Gypsy-21_DR2p" FT /translation="CKSCLGSCQKKAFKRAAVVLGGTNGVIPSHSFKIESA FT PVVGQTGAEKCSRRPKNSKFCNHHASSRSKPFILPKGLVGAKCTAEVSIAG FT QKCSVLFDTGSQVTTVSQTYYEQNLSHLEIKPLEHLEVEAANGQFVPYLGY FT VEIDVVFPKEFLGAEITLTTLALVTADTSSNVQSPVLIGTNTLDLAYEAQF FT DSEVAQPSHVLPYGFKVVMNVLRHRFKLKTNSMIGLVRAQGTKPEVVPAGQ FT TLVLEGSINSRELSSDKWVLMEAPLQSSLPGGLLCASSLVTLSQRSFQKIP FT VILKNETEHDILVPAKSVIAELYSLQSVNVKEPATESNSQGKELPPYTLDF FT GDSPIPNEWKERISQKLCAMSDVFALHDLDFGKTDKVTHRIRLHDQTPFKQ FT RARPIHPQDFDAVRKHLQELLDAGVIRESESPFSSPIVVVRKKNGDVRLCV FT DYRKLNLQTIKDAYALPNLEETFSALTGSRWFSVLDLKSGYYQIEVEESDK FT HKTAFVCPLGFWEFNRMPQGVTNAPSTFQRLMERCMSDVHLKEVVVFLDDL FT IVFSDTLEEHERRLLRVLHRLREFGLKLSLEKCRFFQTSVKYLGHIVSSSG FT VETDPDKVAALKTWPVPKNLKELRSFLGFSGYYRRFIRDYAAIVKPLNDLT FT SGYPPHRKSSKVRERRDYHNPKEPFGSRWSANCQKAFEAIIEALTTAPVLG FT FADPKLPYVLHTDASTVGLGAALYQEQEGQLKVIAYASRGLSRSEARYPAH FT KLEFLALKWAVTEKFYDYLYGNQFTVVTDSNPLTYVLTSAKLDSTSYRWLS FT ALSAFSFKLQYRAGKQNIDADSLSRLPHGALKNDVASQKEQERIRQFALNH FT LFDDVQTMSPEVIQAICDKHIVYCKSVDCGVTPMTLVESLAIHVDAIPGSF FT DHDENNFGCPVVPTFGENDLKEKQRSDPIIREVIVQLETGETIPPLVRKEL FT PMLSLLMRELSKLELQNGILYRRRHDGDTITYQLVLPESLRSMVLTCLHDD FT MGHLGVDRTLDLVRSRFYWPKMLVDIERKIRTCPRCVCRKSLPDHAAPLVN FT IQVTRPLELVCMDFLSIEPDSRNTKDVLVITDFFTKYAVAVPTCNQKSRTV FT AKALWENFIVHYGIPEKLHSDQGADFESKTIKELCELMGIHKIRTTPYHPR FT GNPVERFNRTLLNMLGTLKKCDKVHWSSFVKPLVHAYNCTKHDSTGFSPYE FT LMFGRQPRLPIDLAFDVPLNREEYKTHSQFVHDLKCRLKKTYDLAMKSTAK FT VGERNKARFDKHVVESVLDIGDRVLVKNVHLRGKQKLADKWESLVYVVVKR FT AGDLPVYTVKPEGKEGPLRTLHRDLLLPCDLLQLPEEVLALPSTRKRPPTR FT KNPCNKVQDQVDFDSDDDDSSLELMHSDLLIPTNITFTEVYEAGTGPQTEV FT PSIPIVQDTVGHISTENLPVSADSPIECLPEIENFQIDNAPTLNVLPVDDP FT DVVESDENLPAEVPVNLGDQQIPVEETVKVSEHETETKKHENDGNVIRKSD FT RVRQKPRIFTYPELGNPLVSIVQSLFQSLSTAVTDSIIENNSFTKAADAVV FT TQPVSFMHRDVHRVNGGG" FT CDS 86..1771 FT /product="Gypsy-21_DR1p" FT /translation="MEIIEQENIKVPNSLIVSGTTDTESDIDLTEHLGKYG FT CINRIVRIDSPGSPHHKNVIVEYESGSAVKILEPQLPFIFENPHQADIQYE FT IKALSSIYVETVTTSATKGYMEKLKSIAKLSGRRFEDLLQEELSKCREPVA FT QADDDTTDVASDLRSGIPQVNPAQGSQEMPMGAAFVPLVGTNVPNVNPPEI FT QRVVVEHIVKSEEATSHMHAPLKLRFFSGRSPRPANEVDYEIWHNSVELML FT QDPAVSDLVRSRKIIDSLLPPASDIVRTLGLHATPRAYLDLLDSAFGTVED FT GDELFAKFLSTMQNAGEKPSQFLQRLQVALAQAIRRGGVPSSEADRHLVRQ FT FCRGCWDDALISELQLEQKKANPPSFSELLLLLRTAEDKISTKELRMRKHL FT GASKLRTSSYSVCASSDEVVSTQTIVSDLKKQIAELQNQVEGLTKAKKQAQ FT CAPEGRVCDLQKQVVGLKGQIVEAKPSKSAKTQSATRMNKTQSVQTNLSEV FT VSSVPLKQLSKPKPWYCFHCGEDSHIASTCDRDANPALVAAKRKLLKERQS FT SWEAQMGSSHRTPLK" XX SQ Sequence 6426 BP; 1825 A; 1292 C; 1508 G; 1801 T; 0 other; gtttggcgag ccagccagga gcagtaagta gagagtgact atagtaagaa atcagacata 60 tttaaataga ctttacaacg taaagatgga aatcattgaa caggagaaca ttaaggttcc 120 aaattctctt attgtaagcg gaactacaga cacagagagt gatattgatc tgaccgagca 180 tctaggtaaa tatgggtgca taaatagaat tgtccgcatt gacagtcctg ggtcccccca 240 tcacaaaaat gtgatagttg aatatgagag tggaagtgca gtgaaaatcc tagagcccca 300 gttaccattc atttttgaaa atccacatca ggctgatatt cagtatgaga tcaaagcttt 360 gtccagtatt tatgttgaaa ccgtcactac tagtgccacc aagggctata tggaaaagtt 420 gaaaagcatt gcaaagctta gtggtaggcg ttttgaggac ctcttacaag aagagctttc 480 taaatgcagg gaacccgtag cacaggctga tgacgatact acagatgttg cctctgattt 540 gcgctcaggt attcctcaag ttaaccctgc tcaggggagc caggaaatgc ctatgggtgc 600 tgcatttgtt ccattagttg ggacgaatgt tcctaatgta aatcccccag agatacagcg 660 tgttgtagtt gagcatattg tgaagagtga agaagctact tctcacatgc atgctcctct 720 caagctaaga ttcttttctg ggcggtcccc tcgtccagcg aatgaggtag attatgagat 780 ttggcacaac agtgtagaac tcatgctaca agacccagcc gtgtctgatt tagtcagatc 840 tagaaaaatt attgacagcc ttctacctcc agcatcagac attgtgagga cgctaggtct 900 gcatgcaacg cctagagctt atcttgatct tttagattca gctttcggaa cggtggagga 960 tggtgacgaa ctctttgcta agttcttgag cacaatgcag aatgcaggag agaagccatc 1020 acagttttta cagcgattgc aagtagctct tgcacaggct attagaagag gtggtgtgcc 1080 ctctagtgag gctgatcgac acttagtaag acaattctgt aggggttgtt gggacgatgc 1140 cctcatttca gaattacaac tagaacaaaa gaaggccaat cccccttcct tctcagaatt 1200 gttattactg ttgcgaacag cagaagacaa aattagcaca aaagagctcc gtatgagaaa 1260 acatcttggt gcatccaagc tgcgaacatc ctcttactct gtgtgcgctt catcggatga 1320 ggttgtatcg actcagacta ttgtttcaga tttgaaaaag caaattgcag agcttcagaa 1380 ccaagttgag ggtttaacga aagctaaaaa gcaagctcag tgtgctccag agggcagggt 1440 ttgtgacttg caaaagcaag tagtagggtt gaagggtcaa attgttgaag ctaagccaag 1500 caagtctgca aaaactcaat ctgctactag gatgaacaaa acccagtccg tgcaaacaaa 1560 cttgagtgaa gttgtaagca gtgtccctct taagcagctg agtaaaccta agccctggta 1620 ctgctttcac tgcggtgaag acagtcatat tgcttctaca tgtgatcgtg atgcaaatcc 1680 tgccttggta gctgccaaaa gaaagctttt aaaagagcgg cagtcgtcct gggaggcaca 1740 aatggggtca tcccatcgca ctcctttaaa atagaatcag ctccagttgt gggacaaacg 1800 ggggctgaaa aatgtagcag gcgtcccaag aatagtaaat tttgcaatca ccatgctagc 1860 agtagaagca aaccttttat tttgcctaag ggcttggtag gggcaaaatg cactgctgaa 1920 gtctctatag cgggtcaaaa gtgtagtgtt ctttttgaca caggttcgca ggtaaccact 1980 gtttctcaga cctattatga acagaatttg tctcatctag agataaaacc gcttgaacat 2040 cttgaggtgg aagctgcaaa tggacagttt gttccgtatc tgggctatgt tgagattgat 2100 gtagtgttcc caaaagaatt ccttggagca gagatcacac ttaccactct tgctttggtc 2160 actgcagata ctagcagtaa tgtccagtct cctgttctta ttggtacaaa cacccttgac 2220 ttagcctatg aagcccagtt tgattctgaa gtagcccagc cttcacatgt attaccgtat 2280 ggattcaaag tcgtcatgaa tgttctcaga caccgattca agctaaaaac caatagcatg 2340 attggacttg ttcgagccca gggtacaaag cctgaagttg tccctgcagg acaaactctt 2400 gtgcttgagg gttcaataaa ttccagagag ctttcctctg ataagtgggt tctgatggag 2460 gctcctcttc agtcttcctt gcctggaggt cttttgtgtg catcttctct tgtcactctt 2520 tcccagagat catttcagaa aatcccagtg atcctgaaaa atgaaactga gcatgacatt 2580 ttagttcctg caaagagtgt cattgcagaa ttgtattcac tgcagagtgt gaatgttaaa 2640 gaacctgcca cagagtccaa cagtcagggt aaagagttac caccgtacac tttggatttt 2700 ggtgactcac caatacctaa tgagtggaag gagagaattt cacaaaagtt gtgtgcgatg 2760 tcagatgttt ttgctcttca tgatttagat ttcggcaaga ctgataaagt gacgcatcgc 2820 ataaggctac atgaccaaac cccatttaaa cagagagctc gccccattca cccgcaagat 2880 tttgatgctg tgcggaaaca tctgcaggag ttattagatg ccggtgtcat ccgggagtcg 2940 gagtcccctt tttcttcacc gatcgtagtt gttcggaaga agaatgggga tgtccgtctc 3000 tgtgttgact accgtaaact taacctgcaa acgatcaaag atgcttacgc tttgccaaat 3060 ttggaggaaa ctttttccgc tcttactggt tctcgttggt tctctgtttt ggacctcaaa 3120 tctggttatt accagatcga ggttgaggag tccgataagc acaaaaccgc ttttgtctgt 3180 ccgttaggtt tctgggagtt caatcgaatg ccgcaagggg ttacgaatgc tcccagtacc 3240 tttcaaagac tgatggaacg gtgtatgagc gatgttcatt taaaagaggt cgttgttttt 3300 ctggatgact tgatagtgtt ttctgacacc ttagaggagc acgagcgtag gttgttgaga 3360 gtgttgcatc gcttgcggga gtttgggttg aagctttctc tggaaaaatg cagatttttt 3420 cagacttctg tgaaatatct tgggcacata gtgtctagca gtggtgtaga gaccgatcca 3480 gacaaggttg cggcactgaa aacttggcct gttccgaaaa acctgaagga actcagatca 3540 tttttaggtt tttcagggta ttaccgcaga tttattcgcg attatgctgc cattgtgaag 3600 cctttaaatg atttgacgtc agggtatccg ccacatagaa agagctctaa agttagggaa 3660 cggagagatt accacaatcc caaagagccc tttgggagcc gttggtctgc aaattgtcag 3720 aaagcgtttg aagccatcat cgaagcactt actactgctc cagtgcttgg ttttgccgat 3780 cctaaacttc cgtatgtttt acatacggac gccagcactg tggggttggg agcggctttg 3840 taccaggagc aagaagggca gttaaaagta attgcttatg ccagccgtgg actttctcga 3900 agtgaagccc gttatcccgc tcataaatta gagtttttgg cccttaaatg ggcggtaact 3960 gaaaaattct acgattatct gtacggtaac caatttactg tagttaccga cagtaatcca 4020 cttacttatg tattaacctc ggctaagtta gattctacga gctacaggtg gctttccgct 4080 ctttcagcat tttcctttaa attgcagtac agagctggaa aacagaacat agatgctgac 4140 agtttgtcca gacttcctca tggtgcttta aaaaatgatg ttgcatctca gaaggaacag 4200 gaacgaattc gtcaatttgc actgaaccac ctgtttgatg atgtccagac catgtcacct 4260 gaagtcattc aggccatatg tgataagcac attgtctatt gcaagtccgt ggattgtggt 4320 gttactccta tgactttagt tgagtccctt gcaatccatg ttgatgcaat cccaggcagt 4380 tttgatcatg atgagaataa ttttggttgt ccggtagtgc caacatttgg agagaatgat 4440 ttaaaagaaa aacaaagatc tgaccccatt atccgtgaag tcattgtcca gttagaaaca 4500 ggtgagacaa tccctccttt agtacggaaa gaacttccta tgctctcttt gcttatgagg 4560 gaactcagta agttggagtt gcaaaatgga atcctctatc ggaggaggca tgatggagat 4620 accatcactt atcagctagt tcttcctgag tctttgcgta gtatggtttt gacatgtcta 4680 catgatgaca tggggcattt gggtgtagat cgtacactgg accttgtgag atctaggttt 4740 tattggccta aaatgcttgt ggatattgaa agaaagatcc ggacctgccc taggtgtgtg 4800 tgtcgtaaat ctttaccaga tcatgctgca cccttggtta atattcaagt gactcgaccc 4860 cttgagctag tgtgtatgga ttttttgtcc attgaaccag attcccgaaa caccaaagat 4920 gtgctggtta tcacagattt ctttaccaag tatgcggttg ctgttccaac ttgtaatcag 4980 aagtcgcgta cagtggctaa agctctttgg gagaatttta tagttcatta tggaattcct 5040 gaaaaattac acagtgatca aggggccgac tttgagtcaa agactataaa agaactatgc 5100 gaattgatgg gaattcataa gattaggaca accccttacc accccagggg gaaccccgtc 5160 gaacgattca accgtactct tttaaacatg ctcggtacgt tgaaaaaatg tgacaaagta 5220 cattggagta gttttgtgaa acctcttgtt cacgcgtata actgcaccaa gcatgattcc 5280 actgggttta gcccctatga attaatgttt ggacggcagc cgcggttacc tatagatttg 5340 gcatttgatg ttccgctgaa cagagaggag tacaaaactc attcgcagtt tgtgcatgat 5400 ttgaagtgta ggttgaaaaa aacgtatgat ttggcaatga aaagtactgc taaagttgga 5460 gaaagaaaca aagctcgttt cgataagcat gttgttgaat ctgttttaga cattggggat 5520 agggtcctgg taaaaaacgt gcacttgagg ggaaaacaga agttggcaga caagtgggaa 5580 tcacttgtat atgttgttgt gaaaagagcg ggtgatctcc cagtgtacac tgtcaagcca 5640 gagggtaaag aaggccctct aagaacgctt cacagagacc ttttactgcc ttgtgatttg 5700 ttacagttac ctgaggaagt acttgcgttg cccagtactc ggaaacgtcc tccaactcgt 5760 aaaaacccct gtaacaaagt acaagaccag gtagattttg actctgatga tgatgacagc 5820 agcttagagc tgatgcacag tgatctttta ataccgacga acataacgtt tactgaggtt 5880 tatgaagctg gaaccgggcc acaaactgag gttccctcaa ttccaatagt tcaagacact 5940 gtcggtcaca tttcaactga aaatctacct gtttcagcag acagtccgat cgaatgttta 6000 cctgaaatag agaattttca aatagacaat gcacctacct tgaatgttct acctgttgat 6060 gatccagatg ttgtggagtc tgatgaaaac ttacctgcag aggttcctgt caatttaggt 6120 gaccagcaaa tacctgtaga ggaaactgta aaagttagtg agcatgaaac tgaaacgaag 6180 aaacatgaaa atgatggtaa tgttatcagg aagtctgata gagttaggca gaaaccacgg 6240 atttttactt atcctgaatt gggtaacccg ttggtctcta tagtgcagtc tctttttcag 6300 agtcttagca cggctgtgac tgactccatc attgaaaaca acagctttac aaaggcagct 6360 gatgcagtag tcacacagcc tgttagtttc atgcacagag acgtgcatag agttaacggg 6420 ggaggg 6426 // ID Gypsy132-LTR_DR repbase; DNA; ZEB; 966 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy132-I_DR; KW Gypsy132-LTR_DR; Gypsy132_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-966 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1526-1526 (2008). XX DR Genome; chr3; Positions 44235468 44236433. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 966 BP; 256 A; 151 C; 191 G; 368 T; 0 other; tgtgacgggt gttaataaga gtattaattt aatgcatttc actaatagtc cctcctttta 60 attttcgtgc atttgttttg catcaatttg tttagacata tgttagaatt tagtttttac 120 tctatttgtg ttgtttaatg aaaatagttt tatatttgtt ttgcgcgcaa cgttacttcc 180 ggttgccgcg ctacgtcact tccggttgcc gcgctgggtt ctcagaccgc attgagcgcg 240 ctgaggagag gtgagctttt taatggtgct ccaaggaaaa tcgttctaaa agttgtcaga 300 actttaaata agctttaaat atgcgattgt attgtttgta agagtgttct atattaacta 360 ggaagtactt ttcttcatgt ggatggataa aattaatcac tttactgagc acagttgatt 420 gagtatgtgt gtataagtat gtatactgga cacagtgatt tcgaatgacc ttctaacact 480 ttgtaactgc tatttatttg ttaggtgggg aaatttacaa acgaaatcat atcaacttac 540 tgtgctgttt tctgattcat ccaggtattg ttttatgtac atttatgctt ctttgtgttt 600 acatttgttt cgcaattaca ttaatttcta atataaagtt agatagggga agattaatta 660 atatcacaat gtatggaaat gtaatcattt attattttta tttatttacc ttttgtactt 720 ttgtgatctg tttacccagt tttaattggg tatgatgtta ttgttatgtt tgctcagacc 780 gcattgagcg cgctgaggag aggtggggaa atttacaaac gaaatcatat caacttactg 840 tgctgttttc tgattcatcc agtttgctct gatctgacgc ttgagtgaca ataaacccac 900 catcagcagt accagtggcc tgtctcatta ttgtacacaa cgcagagagt gcgaacaccg 960 gttaca 966 // ID CR1-10_DR repbase; DNA; ZEB; 2284 BP. XX AC . XX DT 27-OCT-2008 (Rel. 13.1, Created) DT 27-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-10_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2284 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1205-1205 (2008). XX DR [1] (Consensus) XX CC The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(261..623,627..1070,989..1960) FT /product="CR1-10_DR_1p" FT /translation="MGISQITPAKLQRPLIIFINSVSNMVPLCPSKISNMC FT SLNSSQPIFSISNISESDALQAIKSLKSSRTKXTFGIDSLMLKELSSTLLT FT PITKLLNXSISEAIFPCVWKSSAVIPIXKNGDHSLSNYRPISIIPTVSKVA FT EKLIANQLIKHLNTTTFSLHPMQFGFRXQHSTETANCFLIENIKHLLDKGG FT VVGAVFLDLKKAFDTVNHKILLSKLTKFNMSPESLXWFESYLSNRSQSVRI FT KDYHSSRSLCPRVYLKVPFWDLCYFLSTRLSLFPLSLXTGVPQGSILGPLL FT FSLYINDLPSVCPNVLIQMYADDTVIYTHGKNTTQVAEKLTQAMAHVSEWL FT NQSCLKLNVXKTVAMYFSKTNIVNNLTDIYVDGDKLQIVSDFKYLGVQIDS FT KLTFKSQIKKVCNRVKFNISNFRFIRSHLSLQAAKIFMHSMILSHITYCLT FT TWSQANKTTLKPIESLYKQXIKILDKKPLRHHHCSILQKHKLLSWENVIKY FT SNICLXYKIIHGLSSPPLSQFVNIRNTTHRATRGAARGDCIVPLRKSTFGQ FT GAFSVTAAQEWNSVPGSIRDITSFNIFKKHLKIWFIENQTCHHSD*" XX SQ Sequence 2284 BP; 800 A; 416 C; 337 G; 703 T; 28 other; ggattaactc tgacatacta aaactaatga aagagagaga ttttgcgtta aaaaagtaat 60 taagagtaaa tcggctagtg atagaaacat gtttacaaca ttaagaaata gagtagtaaa 120 aaacatacga aaagccaaag ctgattattt cctaacatta atagaaaaag caaaagggga 180 ttctaaaaca atctggaacc aactaaaaaa agttgttaga catgatacaa aaaacaaaaa 240 gatgttggaa gttatgatag atgggcatat cacaaataac gcctgcaaaa ttgcagaggc 300 ctttaataat ttttattaac tcagtttcaa atatggtccc attgtgtcca tcaaaaattt 360 ctaacatgtg ytctttaaat agttcacagc caattttcag cataagtaat atctcagaat 420 ctgatgcctt acaggcaata aaatctctca aatcctccag aactaaagaw acttttggta 480 tagactctct tatgttgaaa gaacttagtt ctacactatt aacyccaata acyaaactyc 540 tcaatckttc catttctgar gcaatatttc catgtgtatg gaagtcatct gctgtcattc 600 caatttwtaa aaacggggat cattagtcac tttccaacta yaggccaatc agtattattc 660 ccactgtgtc taaagttgca gagaaactca ttgccaatca attaattaaa catctaaaca 720 ccactacatt ttctcttcat ccaatgcaat tcggtttcag akctcaacac tccactgaga 780 ctgcaaattg ttttctcata gaaaatatta aacatctact ggataaaggt ggtgtagttg 840 gtgctgtgtt tcttgacctg aaaaaagctt ttgacactgt aaatcataaa atattattgt 900 ccaaattaac taaatttaat atgtcaccag agtcattgaa ktggtttgaa tcatatttat 960 caaatcgatc acaatcagts agaattaaag attatcactc ttcccgctct ctctgyccac 1020 gggtgtacct caaggttcca ttttgggacc tttgttattt tctctctaca taaatgacct 1080 tccttcagtc tgtccaaatg tcctgattca gatgtatgca gatgatacag ttatttatac 1140 acatggtaaa aacacgaccc aagtagcaga aaaacttaca caagctatgg ctcatgtctc 1200 agaatggtta aaccagtctt gtttaaaact gaatgtaarc aaaacwgttg caatgtactt 1260 ttctaaaaca aacatagtaa ataatctaac agacatctac gtggatgggg acaaattaca 1320 aattgtatct gattttaaat atcttggggt ccaaattgac tcyaaattaa catttaaatc 1380 acaaattaaa aaggtatgta accgtgttaa attyaatata tctaatttcc ggttcataag 1440 gtcacattta tcaytgcagg cagctaaaat attcatgcat tcaatgattt tatcacatat 1500 aacctactgt ctaacaacct ggtcacaagc aaacaaaaca acactaaaac ccatagagtc 1560 actttacaaa cagwcaatta aaatactgga taaaaagcct ctaagacatc atcattgtag 1620 cattttgcaa aaacataaay tgttaagctg ggaaaatgtc attaagtact caaacatttg 1680 yctgwtgtat aaaatcattc atggtctgtc atccccacct ctcagtcagt ttgtaaacat 1740 caggaacacc acacacagag caacacgagg ggcagcccga ggagactgta ttgtcccact 1800 ccggaaaagt acctttggtc agggtgcctt ttcagttaca gctgcacagg agtggaattc 1860 tgtccctggc agcatcagag atataacctc ctttaatata ttcaaaaaac atttaaaaat 1920 ctggtttata gagaatcaga cgtgtcatca ttccgattaa aatgaaccta gcatgtttcc 1980 aattgtgccy accatcttaa tgatctgttg tttaaatgtg tttcaactat ttttatctct 2040 attacaaaat gttggtgttt tagyttaatt tgtaaatatt ctcttttaac atctayatra 2100 ccatgtctat tttagtactt tttaactatg tttgatwgat tgtataattc ttgtattgtt 2160 ttttttaggt tgttcaggga caacagataa aaaatagcct tckggctaat tctggtgcat 2220 ttgcagaaat gctaattaat gtacactgtc cctgccaaat aaacaaataa ataaataaat 2280 aaat 2284 // ID SAT-3_DR repbase; DNA; ZEB; 297 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Satellite-like DNA - consensus. XX KW Satellite; Simple Repeat; SAT-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-297 RA Jurka J.; RT "Satellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2025-2025 (2010). XX DR [1] (Consensus) XX SQ Sequence 297 BP; 113 A; 13 C; 82 G; 89 T; 0 other; cagaataaaa gtgtaaagtt tggtgtgtgt gatcagaata tagacagaat aaaagtgtaa 60 agtttggtgt gtgtgatcag aatatagaca gaataaaagt gtaaagtttg gtgtgtgtga 120 tcagaatata gacagaataa aagtgtaaag tttggtgtgt gtgatcagaa tatagacaga 180 ataaaagtgt aaagtttggt gtgtgtgatc agaatataga cagaataaaa gtgtaaagtt 240 tggtgtgtgt gatcagaata tagacagaat aaaagtgtaa agtttggtgt gtgtgat 297 // ID Gypsy149-I_DR repbase; DNA; ZEB; 6528 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy149-I_DR; KW Gypsy149-LTR_DR; Gypsy149_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6528 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1559-1559 (2008). XX DR Genome; chr4; Positions 22535537 22542064. XX CC Positions [4776-5249] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 201..6455 FT /product="Gypsy149-I_DR_1p" FT /translation="MADERKRLVWKIKKSLHKLNVHELGYIADSITSVAGL FT IPPTIAKDDEESCLNFIISYMQSDSLLKLEDEGFAQLLTLRDMINELSEKC FT VNNTASDPSVFNEHEFDAHTQASPVHRHTDSTVTITHSQLNEYQKLLSSYE FT DLGKKLSEYKITQTEQKTTPTTRQFAYPEYLPQQQRESIMCPRDFPYLPHK FT EFKIHGGQIGDSTSDIGYNNLCKQIDEGIKMNYSESEIIQAVLRIVKSGQF FT KEMLINKDDLTVPELKSFIQSHLSEKGSSELFQELMSTRQREHETPQQFLY FT RVIGLKQKVIFTSKQSKTHIEYEPRTVQNVFLRTIHQGFLPKYNDIRSELK FT PLLTDHTVTDEALLKQVTKISSEENERQRRLGQVIQRRHTQAHSAQIDSES FT NESESNSKMQKKNKTKTIQELSAQVEALKRTIDKLKQAKEPCHCSVQMGVG FT SHVIHRPMSVRRGGPYGCPTCIQENNSSCNHCFICGEKGHRAIGCGKRTRG FT TRNMYQSAELRSSTEERQPENEACEANANSAACSNMSRKTNSTQNRVAELI FT GSKCLLKCNINGYAVTALLDTGAQVSIIDQAWKRTYLPDQHLRPLSDIMGS FT KPLHVLAVNGDVLPFDGWVEATVNLPGNSDPQLAVQVPFLVGKMALERPLL FT GFNVIEQLVKGQTNGPNVLTTIATLLMGAMEIGDEQAEVIVNFIQTQKPSC FT DTETIVKVGCQDVVIPSGQLAHVKCQIPVDFDLLEPTILFESNVDNQQLER FT LDIGDGLLEVCGRQRPFINIPIGNHTSYDITLPRRMTIGSIQPIKKIVETD FT QITEPVHVRVDSAEIQGGTDSKNITVSDHWDPSVDLSHLTEKQQRIAKRML FT YEESGAFARNDEVVGCVPSLQMPINLIDNIPVQRAYCSVPKPLHKEVKEYI FT QDLLARGWIVKSKSPYSAPVVCVRKKDGTLRLCVDYRLLNKKTVPDRHPLP FT RIQDLTDTLGGYSWFSILDQGKAYHQGFIAEGSRHLTAFITPWGLYEWVRI FT PFGLTNAPAAFQRSMEEMLDSLRDECCIPYLDDILCYAKTFEEHVEGLRKV FT LQALQRHGVKLRPTKCEMFRKEVRYVGRLVSTEGVKIDPKDLDAVQALKAK FT KPTTVGEVRRILGFLSYYRAYVQDFSRIAKPIYDLLQVKRGTEPIKTKSKA FT DGKKNVQIPSRTPIEWKEEHQKILCKLVEALTTPPILAYPDFDLPFTLHTD FT ASEKGLGAVLYQRQVGKMRVIGYGSRTLTQAERNYRLHSGKLEFLALKWAI FT CEKFRDYLFYAPHFTVYTDNNPLTYILSTAKLNAVGYRWVGELSDFRFDIK FT YRPGKVNVDADILSRCPLSIDQYITECTEELSSEIVKTTWEGTKAAEQKDV FT AWVAALNQAQEEQLENDAGEVLGKLSYNELRKAQRLDSAIGEIIRMKESNA FT SLTNDIRQRASGPVKKLMHEWSRLHLDDGLLYRRTEQRRQFVLPAQYKQLV FT FQHLHNDMGHVGVERVVKLARDRFFWPFMKKDIEIYVTRKCPCIKQKRPAI FT PERAPMGSISTSSPFELVSIDYLHLERSRGGYEYILVLVDHFTRFAQAYAT FT KNKSGRTAAERIFNDFIPRFGFPTKLHHDQGREFENDLFKTLRRLSGMSHS FT RTSPYHPQGNPAERFNRTLLQMMRTLAEKEKEKWKEHLPQIVHAYNCTRHE FT ATGYSPFYLLYGRHPRLPIDLVFKVTNEEKSDDPRGYAKQWAKRMTEAYRI FT ALENSKNSSARGKLYYDKKSRGVILQPGDRVLVRNLNERGGPGKLRSYWEQ FT TVYVVKEQINDGPVYRVVAEIDGSKSRVLHRNLLHQVNDLPVEIPEEVSKQ FT RTKRNRTKQNDETRSPFESDSDENQDENYYWLRAQQTQENNQRLIINQPAD FT EPTCDPPQRQSERERKTHWRTQPIENKKPFCHPLRQQSCERPEKERLRKRT FT EIREQPIENERLTCNLPNENELTCERQMTWTETLTGQSEKEEEQVEEECSQ FT IYKDDTPKQDAFKIPTLSSEETDQTSQVRRSTRDRTPTSRLTYEMLGQPEI FT QSYPTVNTLEAYEMNQTPHWGLPSYQTTPYLTPCISTAYSFLTPQVTFPFW FT VAPMTIGY" XX SQ Sequence 6528 BP; 2272 A; 1268 C; 1441 G; 1547 T; 0 other; gattcatgga ggttccaccg agattaatat ccagagtgta aggctagtgc tgggatccag 60 acttcacggt gacgagacga cgacaggatt gcgggcgact tgaggcagct gctgacgtcc 120 aaagagaggc aaaagggacc aagctgagag gaaggcggcg catcactaga gggtcgtgag 180 aaagtctgaa gctggacaag atggcggacg agaggaaaag actcgtctgg aaaataaaga 240 agagcctaca caagttgaat gtgcatgaac tggggtatat tgctgacagc attacttcgg 300 ttgcaggttt gattccacct accatagcca aggatgatga ggaaagctgt ttgaatttca 360 tcatttctta tatgcagagt gacagtttac tgaaactgga agatgaagga tttgcacaat 420 tactaacatt aagagacatg attaatgagt tgagtgagaa gtgtgtgaac aatactgcga 480 gtgatccatc tgtttttaat gaacatgaat ttgatgcaca tacacaggca tctcctgtac 540 acagacacac agattctact gtcacaataa cacactcaca gttaaatgaa tatcagaaac 600 tgctcagtag ttatgaggat ttagggaaaa agctgtcaga gtacaagatt acacagacag 660 aacagaaaac gacaccaact acaagacagt ttgcttatcc tgaatactta ccacaacagc 720 agagagaaag cataatgtgt ccaagagact ttccatatct accacataaa gagttcaaga 780 tacatggtgg tcagataggt gatagtactt cagacatcgg ctataataac ctatgcaaac 840 aaattgatga aggcatcaag atgaattact cagagagtga aattattcag gctgttctgc 900 ggatagttaa atcgggacaa ttcaaggaaa tgctcatcaa taaagatgat ctaactgtac 960 cagagctaaa aagtttcata caatcacatt taagtgaaaa aggcagtagt gagctctttc 1020 aggaattaat gagtacaaga caacgagaac atgaaactcc acaacagttt ttatatcgtg 1080 taattggcct aaaacaaaaa gtaattttta catccaaaca gagcaaaaca cacattgaat 1140 atgaaccacg aacagttcag aatgtgttcc tgcgtaccat tcatcagggc tttcttccta 1200 agtacaatga catcagaagt gaactgaaac ctttgctcac agatcataca gtaactgatg 1260 aagcactgct gaaacaagta acaaaaatct caagcgagga aaatgaacgc caacgacgat 1320 taggtcaagt catacaacga agacatactc aagctcatag tgcacaaatt gactcagagt 1380 ccaacgagag tgaatcaaat tcaaaaatgc agaaaaaaaa caagacaaag accatacaag 1440 aactgagcgc tcaagtagaa gctttaaaaa gaacaattga caaattgaaa caagctaaag 1500 aaccatgtca ctgctctgta caaatgggag ttggctcaca tgttatacac agaccaatgt 1560 cagtgagaag aggagggcca tatgggtgcc caacgtgtat acaggagaat aacagctcat 1620 gcaatcattg ctttatctgt ggagaaaagg ggcacagggc tattggttgt ggtaaaagga 1680 caaggggtac caggaatatg taccagtctg ctgagcttag atcatcaact gaagaaaggc 1740 aaccagagaa tgaagcttgt gaagccaatg caaacagtgc agcctgctct aatatgtcta 1800 gaaagacaaa ctcaacacag aacagagtag cagaattaat tggaagcaaa tgtttgttaa 1860 agtgcaacat caatggttat gctgtaactg cccttcttga cacaggcgct caagtcagta 1920 ttattgatca agcctggaaa agaacttact tacccgatca gcacttgcga cctctgagcg 1980 acataatggg gtccaaacct ctccacgtct tggctgttaa tggagatgta ttgccatttg 2040 atgggtgggt tgaggctaca gtaaaccttc ctggtaacag tgacccacag ttagcagttc 2100 aagttccttt cctggtgggt aaaatggccc ttgagcgacc tctcctggga tttaatgtca 2160 ttgaacagtt ggtcaaagga cagacaaatg gtccaaatgt tcttacaaca attgctactc 2220 tgctaatggg agccatggag attggagatg aacaggctga agtaatcgta aattttatac 2280 aaacacaaaa gccatcttgt gacactgaga ccatagtcaa agttggatgt caggatgttg 2340 ttattccttc gggtcaactg gcacatgtaa aatgtcaaat acctgtggat tttgacttgt 2400 tagagcctac catcttgttt gagtctaatg tagacaacca acagcttgaa cgacttgaca 2460 ttggagatgg attactggaa gtctgtggga gacaaaggcc attcattaac attccaatag 2520 gaaaccatac cagttatgac attacattac cacgacgaat gacaatcggt agtatacagc 2580 ccattaaaaa aattgtggaa accgaccaaa taacagaacc agtccatgta agagttgaca 2640 gtgcagagat acaaggagga actgacagca aaaatattac agtatcagat cattgggacc 2700 catctgtaga tttaagtcac ttaaccgaga agcaacaaag gatagcaaag agaatgttat 2760 acgaagagtc tggggctttt gcacgcaatg acgaagttgt gggttgtgtc ccaagtttgc 2820 agatgcctat caatctgatt gacaacattc ctgtccagag agcatactgc tcagttccta 2880 agccgctaca taaagaagtt aaagagtaca tccaagatct cttggccaga ggttggattg 2940 tgaagtcaaa gtccccttac tctgcaccgg tagtttgcgt cagaaagaaa gatggaacac 3000 ttagactttg tgttgattat agattactta acaagaaaac tgttcctgac aggcatcctt 3060 tgcctcgcat tcaagacctc actgacacgt tgggagggta cagctggttc tcgattttag 3120 atcaaggaaa agcttatcac caaggcttta ttgcagaggg atcacgacat ttgactgctt 3180 tcattactcc ttggggcctt tatgagtggg tacgcattcc gtttggctta actaatgcac 3240 ctgctgcctt tcaacgaagt atggaagaga tgctggactc actgcgtgat gaatgttgta 3300 tcccatacct cgatgatatc ttatgctacg caaaaacttt tgaagaacat gtggaaggac 3360 tacggaaggt gttgcaagct ttacagagac atggagttaa gctaagacca accaagtgcg 3420 aaatgttcag aaaagaagta agatatgttg gacgactggt cagtactgaa ggagtcaaaa 3480 tagaccctaa agatttggac gctgtacaag ccttaaaagc aaagaaacct accacagtag 3540 gagaagtcag aaggatttta ggatttctca gctactacag agcttatgtt caggatttct 3600 ctcgcattgc caagccaata tatgatcttt tgcaagtaaa acgcgggaca gagcctatta 3660 aaacaaaatc gaaggcagat ggaaaaaaga atgttcaaat accatcacga acacctattg 3720 agtggaagga agaacatcaa aagatactgt gcaaactagt tgaagctttg actactccac 3780 ctatccttgc atatcctgat tttgacctgc ctttcacact tcatacagat gcttcagaaa 3840 agggacttgg agcggtactt tatcaacggc aagttggaaa aatgagagtc atcggatatg 3900 gatcaaggac attgacacag gccgaacgaa attacagact ccattcaggg aagcttgaat 3960 tccttgcact taagtgggca atctgtgaaa agtttcgaga ctacttattt tatgccccac 4020 acttcacagt ttatacagac aacaaccctc tgacctacat actgagtact gccaaactta 4080 acgctgtggg gtatagatgg gttggagaac tgtcagattt ccggtttgat attaagtaca 4140 gacctgggaa ggtgaatgtg gatgcagaca ttctttcccg atgcccacta agcatcgacc 4200 agtacatcac agagtgcaca gaagagttat ctagtgagat agtcaaaacc acatgggagg 4260 gcactaaggc tgctgaacaa aaggatgttg cttgggtcgc tgcacttaac caggctcaag 4320 aagaacaact tgagaatgat gctggtgaag tcttgggaaa gctaagttat aatgaactac 4380 ggaaggcaca gagactggac tcagctattg gtgagattat tagaatgaaa gagtccaatg 4440 caagtctcac taatgacata aggcaaagag ctagtggacc tgtaaaaaaa cttatgcacg 4500 aatggtcaag actacacctg gatgatggat tattgtacag gagaactgag caacgacgtc 4560 aatttgttct accagctcaa tacaagcagt tagtattcca gcacttacat aatgacatgg 4620 gtcacgtggg ggtagagaga gttgtaaaac tagcaagaga cagattcttt tggccattca 4680 tgaagaaaga cattgagata tatgttacac gaaagtgtcc ttgcatcaaa cagaaaaggc 4740 cagcaatacc cgaaagagct ccaatgggta gtataagtac tagctcccca tttgaattag 4800 tatcaataga ctatctccat ttggaacgta gcagaggtgg ttatgaatac atactagtcc 4860 tggtggatca tttcacaaga tttgctcagg catatgcaac taaaaataag tctggaagaa 4920 cagcagcaga aagaattttc aacgacttca ttccccggtt tggatttcca actaaactac 4980 atcacgatca gggtagagaa tttgaaaatg acctcttcaa aaccttgcga agactgtcag 5040 gaatgagcca ttcaagaact tctccctatc accctcaggg caatccggct gagaggttta 5100 acagaacact actccagatg atgagaacac tggcagagaa ggaaaaggaa aaatggaagg 5160 aacatttacc acagattgtg catgcgtaca attgcacacg acatgaagcg acaggatact 5220 caccatttta cttactctat ggacgacacc caagactacc cattgatctt gtgttcaaag 5280 tgactaacga agagaaaagt gatgatccaa gaggatatgc taaacagtgg gcaaagcgaa 5340 tgacagaagc ttacagaatc gctttagaga acagtaagaa ctcaagtgcg agaggaaaat 5400 tgtactatga taaaaagtcc agaggagtta tcttacagcc tggtgacaga gtactcgtga 5460 gaaacctaaa tgagagaggt ggacctggaa aattgagatc atattgggag caaactgtat 5520 acgttgtcaa agagcaaata aatgatggac cagtctaccg agtggttgca gagatagacg 5580 gctcaaaatc aagagttttg cacaggaatc tgcttcatca agttaatgac ttaccagtag 5640 aaattccaga ggaagtatct aaacagagaa ccaaaagaaa tcgaacaaaa caaaatgatg 5700 agacaagatc accctttgag tctgatagtg atgagaatca ggatgagaac tattattggt 5760 tgagggcaca acaaacacaa gagaataacc aaagactcat cattaaccaa ccagctgatg 5820 aacctacctg tgatcctcca caacgacaat ctgaaagaga aagaaaaaca cattggagaa 5880 cacaacctat agagaacaag aaacctttct gtcatccttt gcgacaacaa tcttgtgaac 5940 gacctgaaaa agaaagacta agaaaacgga cagagattcg agaacaacct atagaaaatg 6000 aaaggcttac ctgtaatctt cctaatgaaa atgaacttac ctgtgaaaga caaatgacat 6060 ggacagagac attgacagga caatctgaaa aagaagagga acaagttgaa gaagaatgta 6120 gtcagatata caaagatgat acacctaagc aagacgcttt taaaatacct accttgagtt 6180 cagaggaaac tgaccaaaca agccaagtcc gaagatccac acgtgacaga acacccacat 6240 ctagactaac ttatgaaatg ctaggtcagc cagaaattca gtcataccca acggtaaaca 6300 ctttagaggc ttatgagatg aatcagacac cacattgggg gttaccatcc taccaaacta 6360 caccttacct tacaccttgc attagtacag catattcctt tcttacgcca caagtcacat 6420 ttccattttg ggtagctcct atgacaatag ggtactaaaa gactaaggat aaaaggtgtt 6480 atgatgagag attatactgg caggagccat cttttctttg ttggggag 6528 // ID Gypsy35-LTR_DR repbase; DNA; ZEB; 900 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy35_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; KW endogenous retrovirus; Gypsy35_DR; Gypsy35-I_DR; Gypsy35-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-900 RA Dib M.R. and Naveira H.F.; RT "Gypsy35_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 801-801 (2007). XX DR [1] (Consensus) XX CC Gypsy35-LTR_DR is a long terminal repeat of the Gypsy35_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy35-I_DR. XX SQ Sequence 900 BP; 250 A; 163 C; 134 G; 353 T; 0 other; tgtcacagtg taaaactaaa agacactgtg actaacaaac acacatcaca ttgcactcat 60 acataaaata cacactttta aacatggacc tttgtgttat ttgtattgtg catgtgtgtg 120 tttgtgtaat gtgtatatgt tgtttttttt ttctgtcaag attagcattg tgtttatgta 180 aaaatttcac ttttatgtca aacaatagcc catatcaaca ctgaaataac cacagacaca 240 ttttctttca aacttcattt attaaccaaa catatagagt ttacatatac ttaccagttt 300 cctattctca gcaccaaacc tgtttaactt tcaaaacaaa gccatttata cagatttttc 360 ccgccttcac tttggagcag ccattttcca taacatcatc agggtcacaa ccccctggtc 420 agtaaataaa aatcacagtt ttgttgttta aatttgtatt ggttatttgt tgtgttgtat 480 tttgagtttt gtatgcatat tcatttactt gcattaattt atatttctgt tgaatgtctt 540 tttatgtatt tttttttgtc tacatggtaa cccactacca gtaaaatggt ttaacccatg 600 ggcggggcca atacaataaa atggccccaa acattcactt ggagtgggta gttgtaggag 660 aaggttgttt gtgtttgtat tgttaacaga ttttgttcag gtcttttact tgatttagct 720 gaacaagcct gtatatacat tgtaaatata gaatagttta tttttcttta ttgttatttt 780 gtcactttca cctgttttgc tggatttttg ttttcacttt gtgtaaataa atcactcttc 840 actttcacca gcggcttctg tgacactttt ttgcctccat ccacccggtc aatcgtaaca 900 // ID ERV1-N3-LTR_DR repbase; DNA; ZEB; 1409 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, LTR - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR; ERV1-N3-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1409 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1601-1601 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1409 BP; 462 A; 249 C; 261 G; 435 T; 2 other; tgtaagaatt gaaatattgc aaatatttct gacgtctttg acaattcaaa acatgagctg 60 aaaacagagc cactggtgac ttcacaaccc agcagggggg aatgcgggaa aaaccagttc 120 ctgctgcatt tgcatctgag tatgctgaaa ctcagcttca tccccccaac tcacacatat 180 ggtcatttca atgtttgaat gttctagaat taaccaaact gactttaact atcatgtttg 240 atatcatttg aaatgtctca gtgcgattgg tacaatggtc tcattctctc agaaatagac 300 ataatcagaa caataaatat ataacttcag gcatcttttg aaccactgtg aggggtgtga 360 ctttccacta agtgtgaatg ccttttgtta tacattcaca cccccttcct tgagggaatt 420 cttggattat ttaaggtaag gtctgagaaa gactcagggc gattctgcct aaccagatca 480 gcccataaca tggggcctag gctccatgtt atgtatattt cgaagtcagg aatgcatctt 540 tttcatcttg gatttatctt tgataatttg atgttttgta ttgatcgttt atgaattgac 600 tgattgaatt tggcataata catttacctg actaataaat tgttatgttt tgaattattc 660 aaacagagct cgtgttatta tcactgataa atgttgttgt ccatagcact acaagccact 720 gtacaggtta gtaacggact aaaacatgtt agacggagca gcgtggcacg ctatacaatg 780 ttgttagaga tccgactaac aaattggcat tawttcatgt atacttggac tgtaaaaggc 840 caactattta gaacaacaaa atattgttga aactacttta aatgtggata tttaggcctt 900 catctacaaw agtataatcc tgcgcgatta ttattaaaat actttgtctt gattaagtag 960 ataacagatc tatggggacc acataaaaat ctcctaaatt gagtaagtag tgttctgcct 1020 ttttaggaga gtggttaatc gcctctgttt aatgaccaag actgacaagc ttgtattaag 1080 agattgtata cccggccggt gcaagactcg agatgctatc ggaatctgag taaatgagaa 1140 taaggtataa taacaaatca aaagaatatt caatcataat ctaaaataat gtgaaaggaa 1200 acaaaataat cttctcaatg ttttataatt ggagtcagat aaaacgagga taaacgtatt 1260 aatattctaa accatctcat actaaaattg gagtcagata tgagttaagt ttaatataaa 1320 tcaacattgt gcactggaaa gaccgaacat gcagtgaacc ttcatccacc attggatacc 1380 gtcattggac caactggctg gaccctaca 1409 // ID BEL23-I_DR repbase; DNA; ZEB; 5940 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL23-I_DR; KW BEL23-LTR_DR; BEL23_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5940 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1393-1393 (2008). XX DR Genome; chr20; Positions 46542808 46536869. XX CC 'CAGAC' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 41..5890 FT /product="BEL23-I_DR_1p" FT /translation="MEQSGEGFWAPHTRPKRQIRQPSYLRDYEVGAVRTKV FT REPIHNSPAHHTYREEHWSPQPHVRWSGAPTEHDDYTFHQDESLGAMSLRG FT PCTPMPYTEEVIEQIVSHRESMPRQRADESLQHSYTSLPDTRRVGHPEKSS FT DRGRTVDQQELITEHWPLPPPPLSDDFALLREEECDDRLPPPPWPSSQYPP FT ASIVGEELRVVTVIDRMMGELQLMRDNATSSITAHSRLHSTPHDSGRPMPS FT RRHEQYGAQSWLQQPESFPPIINSQRRTTHVEEPVSHFQPRPHQHRPNSYQ FT GRQFTPVLTTLSGGDYKGPTPKIPLFVNKDPMEFSRLKLALTNLLPADATE FT LFKYQVLVDHLRLEEACLIADSYINSPLPYSDTMAALNEKFGQPHHVALKR FT IAAVMDSPEVRRGDTAGFERFSLHVQSLVGMLQTLGPDGEVELRCGSHVAR FT LLTKLPIEMRANFRRQMFNRHGHTHTLLDLAEWLRYESWCQGYDNPTEVRF FT QRPPSYRVEKRHSKPAATVLHGSESYMPNQAIRPESQNSCVEKGKVKAKPY FT CPYCDCTEHFLNKCPKIQNLTSEEVREWIRAKKRCWRCGRSHQAAQCDLKR FT LCSVCQGKHLKVLHDVNTLSKHEHAKAESCLVSTSTEVLYLDRPTTTSRVL FT LKVVRVFLQNKDKILDTYAILDDGSERTMLLTEAAEKLGLQKKPEDIALRT FT IRQEVQTLKGSSVSFSISSPINPRKNFRIVKAFTSPRLDLADHSYPIMSLK FT RKYKHLANIPLHPFSRVKPLVLIGADYPHLLTPIEPVRLGPSGGPAAIHTR FT LGWTLQGPTPFAHWVSDTSLCLLTSVSPETTELMRNVEKLWRMDTLPCQSD FT KQAMRSRQDKEAVELLEAETRRVNVAGVLRYATPLLRKRDIPPLKSSKEAV FT LPSLRSTERRLSRDPQRADEYGAAIRKLVETGAVKRIDPSEASSTEESWYI FT PHHLVSHNGKNRLVFNCSYQHQGRNLNDTLLPGPTLGASLLGVLLRFRQHA FT VAVSGDIKSMFHQVQLLPRDRPLLRFIWRDMMRMEPPDIYEWQVLPFGTAC FT SPCCATFALQLHARTHSKEGDGVRHSVEQCFYVDNCLQSVDTVDEARKLVD FT QIREVLATGGFEIRQWACNLPDVIKHLPAEARSDSMERWLSYDEPDLLEPA FT LGLSWHWGTDTLRYRSRPLEYGALTLRNIYKVLARQYDPLGFILPYTTRAK FT LIVQRMWDKPRDWDDPHIPQDLQEAWKEWESELHLLPHISFPRPYASTPHS FT QMIVSRQIHVFSDASEKAYGAVSYLRTEDSQGQVSLAFLAARSRVAPRRQH FT SIPRLELCGALTAAQLAKTIARELTVVIDQTILWSDSTTVLTWLKSESCRY FT KVFLGTRVAEIQELTGGSSWRYIDSLQNPADDVTRGKTLAELAGQNRWSQG FT PPFLLKGPEEWPSSPGKQPDLDTTEFRKLTSCGMIARAERQETSQEQSFAS FT WKEMVEATAQELHGAADGPPNAADYRKAEMMIFQKIQSDCFPEELCRLRTG FT KEVLQSSRLLTLSPELDSDEGIIRVGGRLRRAEGLDPALKHPIVLDPAHTA FT TKLLIQDYDARLCHPGPERVFAEMRRNLWIIRGREAIRKVQQQCQECRRWK FT SKPLVPKLADLPLARLRLHKPAFYSNGVDCFGPFQIRIGRRSEKRWGIIYK FT CLTTRAVHLDLLHNMDTDSFLMSLRRFVARRGTPAELLSDQGTNFKGGEKE FT LRDSFNSMSLDLQQLLAKQKIDFRFNPPAAPHFGGTWEREIKSVKSALHTV FT MGVQSVSEEVLYTVLLEVEAILNTKPLGYTSSSVADLDAITPNVLLMGRLD FT GALPPVVYSKEEGLSKRRWRYCQVLADHFWSRFIKYYLPSLQCRQKWHGTP FT ADLTVGTVVLMMDPQFPRAFWPIGRVIKVHHSADGHVRSADIKVKDKIYTR FT PAVRLIVLPEIPESEEDEQCPPPTSD" XX SQ Sequence 5940 BP; 1641 A; 1462 C; 1465 G; 1372 T; 0 other; attttggtcc ttcgagccgg atagcagctg tgctgggagt atggaacaga gtggtgaagg 60 attctgggcc ccacatactc ggcccaaaag gcaaataaga cagccatcct acttgagaga 120 ctatgaagtc ggtgcagtgc gaacaaaggt gcgtgagccc atacataaca gcccagcaca 180 ccatacatat agagaggaac attggagccc acagcctcat gttaggtgga gtggtgcacc 240 aactgagcat gacgattaca cctttcacca ggatgagagc cttggagcta tgagcttaag 300 agggccatgc accccaatgc cctatactga agaggttatt gaacagattg tgtcacatag 360 agagagcatg ccaagacaaa gggctgatga gagtttacag catagttata catcacttcc 420 tgatactaga agggtaggac acccagagaa atcctcagac agggggcgca cagtagatca 480 acaggaactt attacagaac actggccatt gccgccacct ccattatctg atgatttcgc 540 tctgctaagg gaagaggaat gtgatgatag actgccacca ccaccatggc cttcctcaca 600 atatccacca gcatccatag taggggagga gttaagagta gtcactgtta tagataggat 660 gatgggtgag ttgcagctca tgcgggacaa tgcaacatct agcattacag cacattcacg 720 tttgcactcc acacctcatg actcaggtcg ccctatgcct agccgacgtc atgaacaata 780 cggagcacag tcctggttac agcagccaga atctttccca cccatcatta attcacaacg 840 tcgaactact catgttgagg aacctgtttc tcacttccaa cccaggcccc accaacaccg 900 tccaaactct taccaaggta gacaattcac gccagtatta accactttaa gtggaggaga 960 ttacaaaggg ccaacaccta aaatacctct ctttgttaat aaagacccga tggagttctc 1020 taggctgaaa ttagccttaa caaacttgtt gcccgctgat gctactgagc tctttaaata 1080 tcaggttctt gtggatcatt taagacttga ggaagcttgt cttattgccg actcgtatat 1140 aaactccccc ctgccttatt cagacactat ggcggcctta aatgaaaagt ttggtcaacc 1200 gcatcatgta gctttgaaac ggattgcagc agtaatggat tcacctgagg tccgaagggg 1260 agacacagct ggctttgagc gattctccct gcacgtacag tcacttgtag gcatgctgca 1320 gaccctgggc cccgatgggg aagtggagct gagatgtggc tcacatgttg ccaggctcct 1380 cactaagctt cccatagaga tgagagcaaa ctttcgacgt cagatgttta atcgccatgg 1440 gcatacacac acactactgg atctggcaga atggctgaga tatgagtcat ggtgccaggg 1500 ctatgacaac cccacagagg ttcgcttcca aaggccacct agctaccggg ttgaaaaacg 1560 ccacagtaag ccagcagcca ctgtgctaca tggatcagag agctatatgc caaatcaagc 1620 aatcagaccc gaaagtcaga actcatgtgt ggaaaaggga aaagtcaagg cgaaacccta 1680 ctgcccatat tgcgactgta cggagcactt tttaaataag tgtcccaaga ttcagaatct 1740 cactagtgag gaagtgagag agtggataag agctaagaaa agatgttgga gatgtgggag 1800 atcacatcag gctgctcagt gtgaccttaa gaggctctgc agtgtatgtc aaggcaagca 1860 tctgaaagtt ctgcatgatg tcaacacact ttccaaacat gagcatgcaa aggcagagag 1920 ctgtcttgtt agcactagta cagaggtgct ttatcttgac aggcctacaa caacctctcg 1980 agttctgcta aaagtcgtca gagttttctt gcaaaataaa gacaagatac tcgacacgta 2040 cgctattcta gatgatgggt cggagcgtac aatgctcctc acagaagctg ccgagaagct 2100 tggcctacag aagaagcctg aagacatcgc tcttcgtacc atacggcaag aggtccaaac 2160 acttaaggga tcctctgttt cattcagtat ctcatctccc atcaatccca ggaaaaattt 2220 ccggatagta aaagcattca cttctccacg ccttgatcta gctgaccact cataccccat 2280 catgagtctg aagaggaaat ataagcactt ggcaaacatc cccctccatc cattcagccg 2340 tgtcaagcca cttgttctaa tcggagctga ttacccccac ctacttactc caattgaacc 2400 agtaagactg ggcccttcag gaggaccggc agctattcac accagacttg gatggactct 2460 ccaaggaccc acaccctttg ctcattgggt gtcggacacc agtctgtgtc ttctcacaag 2520 cgtttcacct gagacaacag aactgatgcg gaatgtggag aagctctgga ggatggacac 2580 tctaccatgt cagagtgata aacaggcaat gcgatcaagg caggataagg aagcagttga 2640 gctcctagag gctgagacaa gacgagtcaa cgttgcaggc gttcttaggt atgccacacc 2700 cctacttcgg aaacgagaca tacctcctct caaatctagc aaagaagctg ttctcccaag 2760 ccttaggagt acagagagaa ggttgagccg ggaccctcaa cgagctgatg aatatggagc 2820 agctatccgc aagctggtag aaactggagc agtgaagagg attgacccaa gtgaagcctc 2880 cagcactgaa gaatcatggt atatacccca ccacttggtg agccacaacg gaaagaaccg 2940 tttggtcttc aattgttctt accaacatca aggtcggaat ctaaatgaca ctctccttcc 3000 tgggccaaca ctgggagctt cccttttggg tgtcttactg agattcaggc aacatgcagt 3060 ggcagtcagt ggagacatca agagcatgtt ccatcaggtt cagcttttgc caagggatcg 3120 gcccctcctc aggttcattt ggcgagacat gatgcggatg gaaccaccgg atatctatga 3180 atggcaggtc ttaccatttg ggactgcatg ctcaccctgc tgcgccacct ttgccctcca 3240 actgcatgcg aggactcaca gcaaggaagg agatggagtc aggcattcag ttgaacaatg 3300 cttttatgtg gataactgcc tccaaagtgt ggatactgta gatgaggcca ggaaactagt 3360 ggaccaaatc agggaagttc tcgcaactgg agggtttgag attcgacagt gggcctgtaa 3420 cttacctgac gtcatcaagc acctgcctgc ggaagctagg tctgacagta tggagcgatg 3480 gctctcctac gatgagcctg atctgctgga acctgccctt ggactcagct ggcattgggg 3540 gacagatact ttaagatacc gaagccgacc tctggagtat ggagctttga cactgaggaa 3600 tatctacaaa gtgttagcac gacaatacga ccccttggga tttatcttgc cctacacgac 3660 tcgcgcaaag ctgatagtgc agcgcatgtg ggacaaaccg agggactggg atgacccaca 3720 tattccccaa gatctgcagg aggcatggaa agagtgggag tcagagctac acttgttgcc 3780 tcacatttca tttcctcgac cgtatgcttc aactcctcat agtcagatga ttgtctccag 3840 gcagattcac gtatttagtg atgcctccga aaaggcctat ggagctgtct cttaccttcg 3900 aactgaggac agccagggac aggtgtcttt ggcattcctg gcggccaggt ccagggtggc 3960 acctcgtcgt cagcattcaa ttccccgact tgagctctgc ggtgctctga cagctgccca 4020 actagcaaag acaattgcac gagaattaac agtggtcata gatcagacaa tactgtggag 4080 tgactctaca acagtgctaa cttggttgaa gtcagagtca tgtcgctaca aagtcttttt 4140 gggcactcgg gtcgctgaaa tccaggagct tacaggaggg agttcctggc gttatattga 4200 ctcgttgcaa aacccagcgg atgatgttac ccgcggaaag accttagcag agttagcagg 4260 acagaaccgg tggagtcaag ggccaccctt cctccttaaa gggcctgaag aatggccatc 4320 atcaccagga aagcaacctg atttagacac tactgagttc aggaagctta cttcatgtgg 4380 aatgatagca agagcggaga gacaagaaac atcacaagag cagtcttttg catcttggaa 4440 ggaaatggtt gaggccacag cccaggagct gcatggggcg gcagatggac ccccaaatgc 4500 agcagactac agaaaggccg agatgatgat cttccaaaag attcagtcag attgcttccc 4560 agaggaattg tgtcgccttc gcactggaaa ggaagtgctc caaagcagtc gtttattgac 4620 cctgtcccca gagcttgatt cggatgaggg aattatacga gtcggaggga ggcttagacg 4680 ggccgaaggg ttggatcctg ccctgaagca tcctatagtc ttggatccag cacatactgc 4740 cacaaagctt cttattcagg actacgatgc tcgtctttgt catccaggcc ctgagcgagt 4800 gttcgcagag atgcgccgca atctttggat cattcgtgga agggaggcaa tccgaaaggt 4860 gcaacagcag tgtcaagaat gccgcaggtg gaaatccaag ccattagtgc caaagttagc 4920 agacttgcct ttagcccgcc tacgtctaca caagccagca ttctactcca atggggtcga 4980 ttgcttcgga cccttccaaa ttagaatcgg acggcgttca gaaaagagat ggggaatcat 5040 ctataaatgc ttgaccacac gcgcagttca cctggatctt ctgcacaata tggatacaga 5100 ctctttcctt atgagtctta gacgattcgt tgctcgcaga gggactccag ctgaacttct 5160 ttctgatcag ggtacgaact ttaaaggagg agaaaaagag cttcgtgact ccttcaacag 5220 catgtcattg gacctccaac agctcctagc taaacagaag attgactttc gctttaatcc 5280 ccctgcagct cctcactttg gaggtacttg ggagagggag atcaagtcag tcaagtctgc 5340 tttacatacc gtgatgggtg tccaatcagt ctcagaagag gtactgtaca ctgttttact 5400 cgaggttgaa gccattttga atacgaagcc actgggttac acttcatcca gtgttgctga 5460 cttggatgct ataaccccga atgttcttct gatggggcgg ctcgatggag cactacctcc 5520 tgtagtatac agtaaagaag agggcctaag caagcggaga tggcggtact gtcaagtttt 5580 ggcagatcac ttttggtccc gattcattaa gtattaccta ccctccttgc agtgccgaca 5640 gaagtggcat ggcaccccag ctgacctcac agtgggtaca gtagtcttga tgatggaccc 5700 ccagttcccc agagctttct ggccaattgg tagagtgata aaagtgcatc atagcgcaga 5760 tggtcacgtg agatcggcag acattaaggt caaagacaag atctacaccc gtccggcagt 5820 ccgactaatt gttctcccag aaatcccaga gtctgaggag gatgagcaat gccctccacc 5880 aacttcagat taacaattga gccttttaca tgagcaaatt gcacacaatt tgggggcggc 5940 // ID piggyBac-N4_DR repbase; DNA; ZEB; 1323 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE piggyBac-N4_DR is a putaitve piggyBac nonautonomous DNA DE transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW TSD TTAA; piggyBac-N4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1323 RA Bao W. and Jurka J.; RT "piggyBac-type DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1799-1799 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1323 BP; 448 A; 243 C; 173 G; 456 T; 3 other; ccctctaccg cacgcattat gataaatcta taaaaaaata tatttgactg tttttctttt 60 cttaaaatgg cttaacttga agtgctgtaa aaaaaaaaaa ttggaaaaat tgraaaaaat 120 ttaaaaaatt ttttaaggta ctttatgata tgttgccata tggcaacaac gtccaacagt 180 ggatctaact tagaaatttc aaatttaaaa ctttccttat aattaataat tttgttgttt 240 gaaataattt aattggtgtt gcagtattat aagctttaac acagaactta taaatgacac 300 cttatacata ctttacaamc aactcatatt ccaacagctt tcatttattc cattcttttt 360 ttgctgaata ttattgaaaa caatcataat attgtattat catgtataca acatacagta 420 aatataaata ttttctccag taaatattat aacaaataaa taacaagtct agtatatcca 480 gatgcatcag aataatgtag ctactagcta acaatgttta tatattatac tatatactat 540 actacgcctg tcttatctgc ctctgagcta acttacctga actgtctcca tcactgctct 600 tatcaaatgt ccagtctgca tcattctcat ggtcactaaa acccatctca tcctcacttt 660 catctgcagg aagagccagt tctgtccttt tcttctttca cttaccatag aacatggtgg 720 tgctcgctaa tacactgttc cctgtattca tatctattca aaagtagtat tgccattaaa 780 actagatttt atccataatg caaatgccat taacatatga ctaatcaaca ttatattact 840 agcattcatg ttgttattgt tacaacttaa aagttcacag ctgaccttgc tagctagcta 900 acccattgca atattgcatg ttccactatt gcacagtgtt gccatttggc aacacataca 960 tttgatcgat ttacaaaaaa ctaatttgat aaaaaawttt ttgagttgtg aatatgtcat 1020 cataacttac tggaggtgat gtttcagatt tttttgtgga tctgacataa tttgatcctt 1080 tgtttttatt gacgtttaca tttgcattta gcaactactc acaataaacg ccagtctgtc 1140 agaaatcgcg ctacagaaaa acactgcatg tcatgtgact taaccaaagc acatcattgg 1200 ctactctaag gtcttctctt tccaacaaca aaaaaaaaat aaaatgttgg tcttaaagaa 1260 acagtggcag ggaaatgaac ataagtatca tgttgccata tggttacacc gtgcagtaga 1320 ggg 1323 // ID ERV1-3-LTR_DR repbase; DNA; ZEB; 391 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 02-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1-3-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-391 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2172-2172 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 391 BP; 127 A; 74 C; 82 G; 108 T; 0 other; tgtaatggaa attcattttt agataaacat gtgtagtgga aacttacagc agaaacagcg 60 gagcacctga tgcacgtaca gaatgggcct tattaggatt tgaccttgca agcagtacaa 120 actagataga acagtgaaca cactttgctt cagtgtctca ttctgtagaa actgtcgtca 180 gaatgggcat tattagggtt tgaccttgta agcagtaaaa accagataaa agggtgaaca 240 cacttgactt cagtgtctca ctctgcagaa actgttgttg ctgtatgact gtgaccttct 300 ttgcaaagaa taaaacttcg aaaagacatt ccgagtaaga ctttgtttct tgaccacaga 360 gagcgcctct ttaaagaaaa ttgccactac a 391 // ID Gypsy127-I_DR repbase; DNA; ZEB; 4976 BP. XX AC chr18; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy127-I_DR; KW Gypsy127-LTR_DR; Gypsy127_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4976 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1286-1286 (2008). XX DR Genome; chr18; Positions 8832380 8837355. XX CC 'CTAT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1266..4934 FT /product="Gypsy127-I_DR_1p" FT /translation="MADEFVLTHRTVFSPVVKGADRAAVQGPEMQVKGWIN FT NTAKPGRGAGFSNRRIPQNRSGDGRVCFFCLDANHLIADCEAWKKKNVSVK FT PKSVGFIRAATEETDLNSDQRGFEPFMLSGWVSHDAESVPKPVTILRDTGS FT VQSLMLRSTLNLLSDSYTGSDVILRGVGSGCVTVPLHMIYLQTELVTGNVC FT VGVCSDLPVEGVQFILGNDLAGGKVFPVPIVVDTPVIDEISDVSIQFPNAF FT PTCVVTRAQARNEGKEMDLSDSFMQMSSENTSEDTCFDSKQITNLAEILGG FT TDVHLKVGRAQLIDAQRNDPSLRECISAADKGTVVNGKVQYIWEDDLLIRK FT WEDDDCVHQIVVPVGYRSQILALAHDHVFSGHLGVRKTYDRILRYFFWPGL FT KSDVSAYCRSCHSCQLAGKPNQVIPPAPLHPIPITSEPFERILIDCVGPLP FT KSKAGHQYILTIMCAATRYPEAVPMRNIKAKAVVRELIKFCSTFGLPKVIQ FT TDQGSNFTSTLFSQVLRELTVKHQLSSAYHPESQGVLERFHQTLKSMLRTF FT CVETGRDWDDGLPMMMFAVRETVQESLGFSPAELVFAHKISGPLKLLQEQL FT LSVNPHPMSVLDYVNKMRERLHRVCEIARKNLAVKQDKMKKRFDRHSTNRS FT FQVGDSVLVFLPVPGSALQARFSGPYPVERKLSDTNYVIRTTDRRKKTRVC FT HINMLKAYVGRSSDAKNNSVVPVALVSSVESELDGWVAREEMAVPPRLSNS FT EILKGLDEHLSYLDNSHRKALVQLLTHFSSLFSDTLTQTNVLFHDIEIVNS FT SPIKQHPYRVNPWKREEMKKEVEYLLQNGLAVPSSSPWSSPCLLVPKTDGT FT FRFCTDYRKLNSVTKPDSFPLPRIEDCIDRVGAATYVTKLDLLKGYWQVPL FT SARASDVSAFVTPDNFLQYTVMAFGMRNAPATFQRLMQRILSGVTNCEAYL FT DDVVIYSNSWPSHINTLREVFSRLAEASLTVNLAKCEFGKAVITYLGKQVG FT QGQVRPVNAKVEAIREFPVPKTKRELRRFLGMIGYYRAFCKNFSTIVSPLT FT DLLSTNREFKWNSECDHAFGAAKDLLCSAPVLSAPDFTRPFQLEIDASGVG FT CGAVLTQESKLGINHPICYFSRKFSRAQRQYSTIEKEALALLLALQHFEVY FT VGGSSDPVMVYTDHNPLVFGYPSYMYNSNQRLMRWALVVQEFNLEIRHKKG FT SENIVADALSRAYSV" XX SQ Sequence 4976 BP; 1352 A; 958 C; 1203 G; 1463 T; 0 other; taaattgggg gctcgtcttt tggtttgtga gaagaagagg aaatagttta tgtattttac 60 tgtttgcaga aactggaata tatcgactct gtgtcagctg tttcaaccaa ctgtcttgac 120 tgcatgtggg tggtaagtca aacttttgtc tatctgttct cttgtggtgt gggaaaaata 180 attcggattg gattatggag tttgacttga tttcttttac actggctccg aacgtggaac 240 aattaaatcg ctgccgcaaa gatgatttaa ttaaaattgc taattttttc aaaattcccg 300 tgtcacagaa tgagtctaaa aaagcgatta aaaagtatct acatgatgaa ttggttttgc 360 aacaaattct acctgacgaa acggttgagc gtggtgcaga agcgagcgct gcttcgctgg 420 tcacggccga tgagaagcct ttatctgcac cgctctcttc cgccgagcca cgtatgttac 480 cgtcgtcagc gcaagcgcag gctaatgctg cggtgaccgc gtcagatcca atgttggcaa 540 taaagttgaa ggaactggaa ttggaagtta aacgggcaga gactgagagt aaaaggctta 600 gggtaaagga gctagagctg gaaatagaga aagaacgact tcagagagat cgtggtctgt 660 ttcagcaggt cggggtttct ccgatacgtt cacctacctc tccgtcattt gctcagctac 720 ccagtggccc tgcactagcc tcatctagtc ctgctgctgc cgtaactgat cctgtgagtt 780 tgccagcatt tgatgtgagt aaacatatta atttggttcc accatttaga gaggcagaag 840 tagacagtta ttttactgcg tttgagcgtg tggcagctac attgcagtgg ccgaaggaga 900 tgtgggctct cctgctgcaa tgtaaattaa ttggcaaagc gcaggaagtg tgctccgcgt 960 agccagtgga acagagtctt gtttatgacg ttgttaaaca gactgtcctc cgtgcatatg 1020 agctagtgcc ggaggcatac agacaaaaat ttagagctca caagaaaaca gacagacaaa 1080 catttgtgga gtttgcaagg gaaaaacgaa ctttatttaa taaatggtgt gcttcaaata 1140 aaaccacaac gtttgagcaa ttgaaagagc aagttttatt agaggacttt aagagctccg 1200 tgtctgagaa tcttgttatt catttgaatg aacaaaaagt agtgtctctg tcctcagctg 1260 ctgtaatggc agatgagttt gtattaacac acagaacagt tttctctcct gtggtcaaag 1320 gtgctgatag ggctgctgtc cagggtcctg aaatgcaggt caaaggttgg atcaataata 1380 ctgctaaacc agggcgaggg gctgggttca gcaatagaag gattcctcaa aaccgctccg 1440 gggacggtag ggtgtgtttt ttctgtttgg atgctaatca tttgatagca gactgtgagg 1500 cctggaaaaa gaaaaatgtg tcagttaaac ctaaaagtgt ggggtttata cgtgcagcta 1560 cagaggagac agatttaaat tcagaccaga gggggtttga accttttatg ctgtcagggt 1620 gggtgtcaca tgatgctgaa agtgttccta aaccagtaac tatcttgagg gacacaggtt 1680 cagttcagtc tttgatgctt cgaagcactc taaatttgtt atctgattct tatactggtt 1740 ctgatgtcat actaagagga gtgggttcag gctgtgttac agttcccctg cacatgatat 1800 atttgcaaac tgaattagtg acgggaaatg tttgtgttgg agtgtgttct gatcttccag 1860 tagagggagt tcagtttatt ctgggcaatg atctggcagg tgggaaagtt ttccctgttc 1920 ctattgtggt tgatacccca gtcattgatg aaatatcaga tgtatctatt caattcccaa 1980 atgcatttcc tacatgtgta gtgacgcggg ctcaagctcg taatgaaggg aaggagatgg 2040 atttatctga tagtttcatg cagatgtcta gtgaaaatac atctgaagat acatgttttg 2100 acagtaagca aattacaaat ctggctgaaa tactgggtgg aactgacgtt cacttaaaag 2160 ttgggcgtgc gcagttaata gacgcgcaaa gaaatgatcc ttcactgcga gagtgcattt 2220 ctgcagcaga taagggcact gttgttaatg gcaaagttca atatatctgg gaggatgatt 2280 tgctaatacg taagtgggag gacgatgatt gtgttcatca aattgttgtg cctgtgggtt 2340 atcgttcgca aattctagcg cttgcacatg accatgtttt ttctggccat ttaggagtta 2400 gaaaaaccta cgaccgaata ttgcgatatt tcttttggcc aggacttaaa tcggatgtgt 2460 ctgcatattg tcgttcatgt cattcatgcc agttagcggg caaaccaaat caggttatac 2520 ctcctgcacc cctgcatcca attccaatca ctagtgaacc gtttgagcgt atattgattg 2580 attgtgttgg cccattgcca aaatctaaag ctggtcatca atatatatta acaataatgt 2640 gtgcagctac gaggtatccg gaagcagttc cgatgcgcaa tattaaagcc aaagcagtcg 2700 tcagagaact gattaaattt tgttccactt tcggtttgcc gaaggttatc caaacagatc 2760 aaggctctaa tttcacgtcg actttattct cgcaagtgtt gcgtgaatta actgttaaac 2820 atcagctatc gagtgcttac catcctgagt ctcaaggcgt actagaacgt ttccatcaaa 2880 cgttaaaatc tatgctgcgc acattttgcg tggaaactgg acgagactgg gatgatggtt 2940 taccgatgat gatgtttgca gtacgagaaa ccgttcagga gtctttgggt ttcagcccgg 3000 ctgagttagt atttgcacac aaaatcagtg gccccttaaa actgctccaa gaacagctac 3060 taagtgtcaa tcctcaccct atgtctgttt tagactatgt gaataaaatg cgtgaacgtt 3120 tacatcgtgt gtgtgaaata gcacgaaaaa atttggctgt gaaacaagac aaaatgaaga 3180 aacgttttga cagacatagt actaaccgga gttttcaggt gggcgactcc gtgctggtat 3240 ttctgcctgt ccctgggtct gctctacaag cacggttttc tggcccgtat ccagtggaac 3300 gaaaactgag tgacaccaat tacgttattc gtaccacaga tcgtaggaag aaaactcgtg 3360 tttgccacat taacatgctt aaagcatatg taggcagatc cagtgatgct aaaaataatt 3420 ctgttgtccc agttgcactt gtgtctagtg ttgagtctga attggatgga tgggttgctc 3480 gtgaggaaat ggcagtacca cccagacttt caaattcaga aatattaaaa gggttagatg 3540 agcatttatc ttaccttgat aattcccatc gaaaagctct tgtacagttg ttaacccatt 3600 tttcatctct cttctctgac accctcactc aaaccaacgt gttgtttcac gatattgaaa 3660 ttgtgaattc ttctcccatt aaacagcatc catatcgggt taatccatgg aagagagagg 3720 agatgaaaaa agaggtggag tacctgttgc agaatggcct tgcagtacca agttcaagcc 3780 cgtggagctc gccctgtctt ttggtgccaa agacagacgg cacattcagg ttttgcacag 3840 actatcgcaa actgaacagt gttacgaaac cagattcttt tcctcttcct aggatagaag 3900 attgcattga tcgtgtgggt gctgcaacct atgtcactaa actagattta ttgaagggct 3960 attggcaagt gccgcttagt gcacgcgcct ctgacgtctc tgcatttgtg acaccggaca 4020 acttcctgca atatacagtc atggctttcg ggatgcgaaa cgctccagcc accttccaac 4080 gtctcatgca gagaatattg tctggtgtga caaattgcga ggcttatttg gatgacgtag 4140 tcatttattc aaattcatgg cctagtcaca ttaacacctt gcgtgaagtg ttttcacggc 4200 ttgccgaagc ttctttaaca gtaaatctag ctaagtgcga atttggtaag gcagtgatta 4260 cttaccttgg caagcaggtg ggtcaggggc aagtgcgccc tgttaacgct aaagttgaag 4320 cgattaggga atttcctgtg cctaaaacga agcgcgagtt gcgacgtttt ttagggatga 4380 tcgggtacta tagggctttc tgcaagaatt tctccaccat agtatccccg ttgactgatt 4440 tgttgagcac aaatagagag tttaagtgga actctgagtg tgatcatgct tttggtgcag 4500 ctaaagatct cctgtgtagt gcgcctgtac tttcagcacc tgattttacg cgtccttttc 4560 aacttgaaat tgatgcaagt ggagtaggtt gtggagctgt tttgactcag gagagtaaac 4620 ttggcattaa ccatccgata tgctatttct ccaggaagtt ttccagagct cagagacagt 4680 acagcactat tgaaaaggag gcactggcat tacttctggc gttgcagcat ttcgaggtgt 4740 atgttggagg tagttctgat cctgtgatgg tgtataccga ccataatccc cttgtttttc 4800 tcacgcggat gtataactcc aatcagaggt taatgaggtg ggcacttgta gttcaagaat 4860 ttaatcttga gatacgtcat aaaaaaggat ccgaaaacat tgttgcagat gctctttcac 4920 gtgcttattc agtttagtga tatcgtcaaa attaatttga ctcttgtgga tggggg 4976 // ID Gypsy90-I_DR repbase; DNA; ZEB; 4227 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy90-I_DR; KW Gypsy90-LTR_DR; Gypsy90_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4227 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1491-1491 (2008). XX DR Genome; chr18; Positions 17919624 17923850. XX CC Positions [2941-3363] - Reverse transcriptase CC Positions [1837-2313] - Integrase core CC 'CCTG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 145..4227 FT /product="Gypsy90-I_DR_1p" FT /translation="MDELLQRLTEVSIRQQQVAEHLAARQDQTEQELFALR FT LAAAQRVPLPDPRARATQLLPKLTPHDDIESYLQMFETTATNEGWNVDDWP FT QVLAPLLTGDAQRTYFSLPDSAATTYQDLKREILAHMGLSSTCAAQQFFEW FT TYKPHLPVRAQAAGLTRLARHWLLAEDPTPSQVAERVVIDKLLRALPRSHR FT RVVGMRNPSTTLELIEAIELADATLHREGGERAPPFPRRVFQERRAPEGPQ FT RPVSRSTVPSPREEPMPTEVSTPPSRTWLAGCIVHRDLPEGAPEADVKVNG FT KPYRALLDSGSAVSLVHARILAPRTGSKTFLPITCVHGDTRQVPARHVTIS FT AGPDTWPVEVGVVKDLPVPVLLGRDWPGLERLLAAASQPASPRGSRHRKKP FT NRRPRQRPVLLASDSGRDGESPSHNTNLFYDVFQQVTGGGSFAKEQQGDDR FT LKHCWTQVRVIEGKESQPGPHPLPYFAVQNGLLYWVAQRRGEEKLLLVVPR FT SKTEAILELAHSHPMAGHLGVANTVQRIRDRFHWPGLEADVKWFCQACPTC FT QRTSPRTPPPSPLIPLPIIEVPFERIGMDLVGPLPKSARGHEHILVIVDYA FT TRYPEAVPLRKATAKAIAQELFLLSSRVGIPAEILTDQGTPFMSRLMADLC FT RLLKVKQLRTTVYHPQTDGLVERFNQTLKQMLRRVAAEDKRDWDYMLPYIL FT FGIREVPQASTGFTPFELLFGRQPRGLLDVAKEACEQQPAAHRSVVEHVKQ FT MREKIDRVMPLVREHLITAQKAQQRHYNRAAQAREFQLGDRVMVLIPNAAC FT KFLATWQGPYTVLEKIGPVTYRVQQPGKRKADQLYHINLLKKWIGTRVQLA FT ALATSEPVVVDINPHLSAAQKGDLRHLIGQFPDVFSYLPGQTSLLQHDIRT FT PPGVIIRQRPYRVPEARRQAIEEEIKQMLKLGVIEPSSSPWSSPIVMVPKP FT DGTLRFCNDFRRLNEVSEFDSYPMPRVDELLDRLGRARYISTIDLTKGYWQ FT VPLSENAKPKTAFSTPSGHWQYRTLPFGLHGAPATFQRMMDIILRPHHAYA FT AAYLDDIVIHSETWEDHLDRLRRVLSELRRAGLTANPHKCHLALSEAKYLG FT FQVGRGLIQPQEKKVEAVRIAPRLVSKTQVRAFLGLAGYYRCFIPNFSSLA FT APLTDLTRKGQPEKVNWTPAAEEAFRSIKTALTSEPVLRAPDFGCPFLLQT FT DASDTGLGAVLSQVQEGEEHPILYISRKLTPAERKYAAVEKEALAIKWAVL FT ELRYYLLGHQFTLVTDHAPLQWMARAKDTNARVTRWFLALQDFHFNVRHRA FT GTANANADGLSRIWAAFAGLSGVTPHPPLNSPLLSHIIHRTSTPLRGGE" XX SQ Sequence 4227 BP; 1008 A; 1250 C; 1095 G; 874 T; 0 other; ctggtggaga atgcgggcag aagagtgaag aatcaccgac acccagactc ccactcatct 60 attaaaaaaa aaaaaaaatt atttttattt tttttttttc tctcttcctt ttgtgtgcag 120 gcgagcgctc actcccccta caggatggat gaacttctcc agcgactcac cgaggtcagc 180 atccgtcagc agcaagtggc cgagcatctg gcagctcgtc aggaccagac ggagcaagaa 240 ctttttgccc tgcgtctggc tgctgcacaa cgtgttccgc taccagaccc tcgtgcccga 300 gccactcaac tccttccaaa gctaacccct cacgatgata ttgaatctta tttacaaatg 360 tttgaaacca ctgctacgaa tgaagggtgg aatgtcgacg actggccgca ggtcctggct 420 cccctgctga ctggtgacgc tcagcgcacc tacttttctc tccccgatag tgcagctacc 480 acttaccagg acctgaaacg ggaaatcctg gcccatatgg gactgtcatc cacatgtgca 540 gctcaacagt tttttgagtg gacatataag ccccatctgc cggtccgtgc ccaagcagcg 600 ggtctcacac gacttgcgag acattggctc ttggcggaag acccaacgcc gagtcaggta 660 gcggaacggg tggtcatcga caaattgctc cgggctcttc cccgatccca tcgacgagta 720 gtcggaatga gaaacccctc aaccaccctg gagctcattg aggcgattga actggcggat 780 gcgaccctac atcgggaagg aggggagaga gcgccgcctt ttccccggag ggtgttccag 840 gagcgacgag cgccagaggg ccctcaacga ccggtgagca ggtcgacggt tccctctcca 900 cgggaggagc ccatgccaac agaggtttca acacccccct cacggacctg gctggcaggc 960 tgcattgtac acagagacct acccgaagga gccccagaag cggatgtcaa ggtcaacgga 1020 aaaccatacc gggccctact cgactctggt agtgcggtta gtctggtaca cgcacggatt 1080 ttggccccac ggactgggtc aaagaccttc ctgcctatca cctgtgtcca cggggacacc 1140 cgtcaggtcc ctgcccgtca tgtaaccatt tcagccggcc ctgatacctg gccagtcgag 1200 gtgggagtgg tgaaagatct gccagttcct gtgctccttg ggagagactg gccaggttta 1260 gagcgcctgt tagccgccgc ttcccaacct gccagcccca gagggagccg tcaccgtaaa 1320 aagcccaacc gaagacctcg tcagcgaccc gtccttctgg catcagacag cgggagagat 1380 ggtgagtccc cctctcataa tactaacctg ttttatgatg tctttcaaca ggtcacagga 1440 gggggctcat ttgctaagga acaacaggga gacgataggt tgaaacattg ctggacccaa 1500 gtccgagtca tagagggaaa agaaagtcag cctggtcccc accctctccc atactttgct 1560 gtccagaatg gcctgcttta ctgggtcgca cagcggaggg gggaggaaaa actactgctg 1620 gtcgtacccc gatccaagac tgaagcaata ctggaactgg cccactccca tcccatggca 1680 ggacaccttg gggtggccaa tacagtccag cggatacggg accgtttcca ctggccagga 1740 ctggaggcag atgtaaagtg gttctgccag gcctgcccca cctgccaacg aacctcacca 1800 aggacacctc ccccgagtcc tctaattcca ttaccaatta ttgaggtgcc cttcgagcgc 1860 attgggatgg acctggttgg gccgttgcct aagtctgccc ggggacatga gcacatcctg 1920 gtgattgttg actatgctac ccgttaccca gaggcggtgc cccttcggaa agccaccgcc 1980 aaagccatcg cccaggagct cttcttgctc tctagccgag tcggcatccc cgcagagatc 2040 ctgacggatc agggtacccc ctttatgtcc cggctaatgg ctgacctctg caggctgctg 2100 aaggtgaagc agttgaggac cactgtctat cacccccaga cagatggctt ggttgaacgc 2160 tttaaccaaa cactaaagca aatgctaaga cgggtcgccg cggaggacaa gcgtgactgg 2220 gactatatgc tgccctacat tctcttcggg atccgggaag ttccccaggc ctcaactggc 2280 ttcaccccct ttgagctcct cttcggacga caaccccgtg gcctcctgga tgtggcaaag 2340 gaggcctgtg agcagcagcc ggcggcccat cggtctgtgg tggaacacgt caagcagatg 2400 agagaaaaaa ttgaccgtgt catgccgtta gtccgggagc atctgattac agcacaaaag 2460 gcccaacaac gccactacaa cagagcagca caagcacggg aattccaatt aggagatcgg 2520 gtcatggtcc taattccaaa tgcagcatgt aagttcctgg ctacatggca gggcccctac 2580 acggtattgg aaaaaatcgg accagtgacc taccgggtgc aacaaccagg aaaacgaaag 2640 gcagatcagt tgtaccacat taaccttcta aagaaatgga ttggaactag ggtccaactc 2700 gctgccctcg ccacttctga acccgtggtt gttgacatca acccccatct ctcggctgct 2760 caaaaggggg atctgcggca cctgatcggt cagttcccag atgtgttctc ttatctcccc 2820 gggcaaacca gccttctcca acatgatata cgaacaccgc caggagtcat catacggcag 2880 cggccctatc gggtccctga ggctcgtcgg caggctatag aagaggagat aaaacaaatg 2940 ctaaaattag gggtgataga accctcatca agtccatggt ccagcccaat cgtgatggtc 3000 cccaaacctg atggcaccct ccggttctgc aatgactttc gtcggcttaa tgaagtctca 3060 gaatttgaca gctaccccat gcctcgtgtg gatgagctgt tggaccgcct gggaagggcc 3120 cggtacatct ctaccataga cctaaccaaa ggctactggc aagtaccact gtctgagaat 3180 gccaaaccta aaactgcctt ttccaccccc agtggtcatt ggcagtaccg gacccttccc 3240 tttggcctgc acggggcccc tgccacattt caacgtatga tggatatcat actccggccc 3300 caccacgcat atgccgcagc atacttagac gacattgtga tccactcgga gacatgggag 3360 gatcatctgg accgtctacg gagggtgctg tctgaactcc ggagggctgg actcacggcc 3420 aaccctcata agtgccatct tgctctctct gaagcgaagt acctgggttt ccaagtggga 3480 agaggactca tccaacccca agaaaaaaag gttgaagctg tccgcattgc tcccaggctg 3540 gtgtccaaaa cccaggtacg agcattctta gggttggcag ggtactatcg ctgttttatt 3600 cctaatttct cctctttagc cgcccctctg acagatctga ccaggaaggg gcaaccagag 3660 aaggtaaact ggacgccggc agccgaggaa gccttcagga gcataaagac agcactgacc 3720 tcagaaccag tacttcgagc tcctgatttt ggctgtccct tcctgctgca gacggatgcc 3780 tctgacacag gactaggagc tgttctgtca caggttcagg aaggggaaga gcatccgatc 3840 ctatacatca gcagaaagct gaccccagcc gagagaaaat acgccgccgt agaaaaagaa 3900 gcccttgcca tcaagtgggc agtcctggag ctgcggtact acctccttgg ccaccagttc 3960 accctggtta ccgatcatgc acctctgcag tggatggcca gggcaaagga cacaaatgcc 4020 agggtgacaa gatggttcct cgcactccag gacttccact tcaatgtgcg tcaccgggcc 4080 gggactgcca acgccaacgc tgatggactc tctcggatct gggcggcttt tgcaggtctg 4140 tcaggggtca ctccccaccc accccttaat tctcccctat tgtctcatat catccacagg 4200 accagtacgc cacttagggg gggggag 4227 // ID Tc1-1_DR repbase; DNA; ZEB; 1625 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE Tc1-1_DR, an ancient Tc1 transposon reconstructed from its DE defective copies present in the zebrafish genome. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Prince; TXr; KW TC1; Tc1-1_DR; Tc1/mariner superfamily; transposase. XX NM Tc1-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1625 RA Kapitonov V.V. and Jurka J.; RT "Tc1-1_DR, an ancient family of Tc1 transposons identified in RT zebrafish genome."; RL Repbase Reports 4(11), 303-303 (2004). XX DR [1] (Consensus) XX CC Tc1-1_DR is an ancient family of Tc1 transposons that was CC active in the zebrafish genome more than 10 million years ago. CC The zebrafish genome harbors several thousand copies of CC Tc1-1_DR elements that are ~85% identical to their consensus CC sequences; all copies are severely damaged by mutations. The CC Tc1-1_DR consensus sequence encodes the 340-aa Tc1-1_DRp CC transposase (pos. 376-1395). Tc1-1_DR elements are CC characterized by the TA target site duplications and 210-bp CC terminal inverted repeats. The Tc1-1_DR consensus sequence is CC ~90% identical to Tc1 elements present in frogs (TXr from CC Xenopus laevis, Prince from Rana pipiens). Presumably, CC independent multiple events of horizontal transfer were CC involved in evolution of these transposons in vertebrates. XX FH Key Location/Qualifiers FT CDS 376..1395 FT /product="Tc1-1_DRp" FT /note="Transposase" FT /translation="MPRSKEIQKQMRKKIIEIYQSGKGYKAISKALGLQRT FT TVRAIIYKWQKHGTVENLPRSGRPTKITPRAQRQLIQEVTKDPTTTSKELQ FT ASLASVKVSVHDSTIRKRLGKNGLHGRVPRRKPLLSKKNIKARLSFARKHL FT DDPQDFWENTLWTDETKVELFGRCVSHYVWRKSNTAFQKKNIIPTVKCGGG FT SVMVWGCFAASGPGRLAVINGTMNSAVYQNILKENVRPSVSDLKLKRTWVL FT QQDNDPKHTSKSTSEWLKKNKMKTLEWPSQSPDLNPIEMLWHDLKKTVHAQ FT KPSNVAELQQFCKDEWAKIPPQRCNRLIASYRKCLIAVVAAKGGPTSY" XX SQ Sequence 1625 BP; 550 A; 326 C; 326 G; 423 T; 0 other; cagtggtgtg aaaaagtgtt tgccccttac tgatttttta tttttttgca tgtttgtcac 60 actttaatgt ttcagatcat caaacaaatt taaatattag tcaaagataa cacaagtaaa 120 cacatcatgc agtttttaaa tgaaggtttt tattattaag ggaaaacaaa atccaaaact 180 acatagccct gtgtgaaaaa gtgtttgccc cctgttaaaa cataacttaa ctctggttta 240 tcacacctga gttcaatttc tctagccaca cccaggcctg attactgcca cacctgttcg 300 caatcaagaa atcacttaaa taggacctgc ctgacaaagt gaagtagacc aaaagatcct 360 caaaagctag acatcatgcc gagatccaaa gaaattcaaa aacaaatgag aaagaaaata 420 attgagatct accagtctgg aaaaggttat aaagccattt ctaaagcttt gggactgcag 480 cgaaccacag tgagagccat tatctacaaa tggcaaaaac atggaacagt ggagaacctt 540 cccaggagtg gccggccgac caaaattacc ccaagagcgc agcgacaact catccaagag 600 gtcacaaaag accccacaac aacatccaaa gaactgcagg cctcacttgc ctcagttaag 660 gtgagtgttc atgactccac cataagaaag agactgggca aaaatggttt gcatggcaga 720 gttccaagac gaaaaccact gctgagcaaa aagaacataa aggctcgtct cagttttgcc 780 agaaaacatc ttgatgatcc ccaagacttt tgggaaaata ctctgtggac tgacgagaca 840 aaagttgaac tttttggaag gtgtgtgtcc cattatgtgt ggcgtaaaag taacaccgca 900 tttcagaaaa agaacatcat accaacagta aaatgtggtg gtggtagtgt gatggtctgg 960 ggctgttttg ctgcttcagg acctggaaga cttgctgtga taaatggaac catgaattct 1020 gctgtgtacc aaaatatcct gaaggagaat gtccggccat ctgttagtga cctcaagctg 1080 aagcgaactt gggttctgca gcaggacaat gatccaaagc acaccagcaa gtccacttct 1140 gaatggctga agaaaaacaa aatgaagact ttggagtggc ctagtcaaag tcctgacctg 1200 aatccaattg agatgctgtg gcatgacctt aaaaagacag ttcatgctca aaaaccctcc 1260 aatgtggctg aattacaaca attctgcaaa gatgagtggg ccaaaattcc tccacagcgc 1320 tgtaacagac tcattgcaag ttatcgaaaa tgcttgattg cagttgttgc tgctaagggt 1380 ggcccaacca gttattaggt ttaggggcaa acactttttc acacagggct atgtagtttt 1440 gtattttgtt ttcccttaat aataaaaacc ttattttgaa aactgcatga tgtgtttact 1500 tgtgttaata tctttgaata tatgactaat atttaaatta gtttgatgat ctgaaacatt 1560 aaagtgtgac aaacatgcaa aaataagata aatcagtaag ggacaaacac tttttcacac 1620 cactg 1625 // ID Gypsy-34-LTR_Dr repbase; DNA; ZEB; 960 BP. XX AC . XX DT 27-JUN-2007 (Rel. 12.06, Created) DT 28-JUN-2007 (Rel. 12.06, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-34_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy-34-I_DR; KW Gypsy-34_DR; endogenous retrovirus; Gypsy-34-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-960 RA Dib M.R. and Naveira H.F.; RT "Gypsy-34_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(6), 183-183 (2007). XX DR [1] (Consensus) XX CC Gypsy-34-LTR_DR is a long terminal repeat of the Gypsy-34_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-34-I_DR. XX SQ Sequence 960 BP; 252 A; 175 C; 207 G; 326 T; 0 other; tgtgacaaag gagtttgtca ctggcctgcg gcagagctgt acacccacat attcacgcac 60 acacacataa cttaccttca ctccaccagc cgatcgatcg ggtgcacacg ggagattgcg 120 ctgtaataaa aacacataca cgcacaaaca cattatatac tccattaatc cttgcatatt 180 ctgacacgcc ataaatcaca ccgcatacat ttgtcttacc tgtgtgtccg tttgtgtttg 240 tgtttgttcg tctcgttgcg tttctgttgc gtctctcttc cgggttgcgc aatcgcggct 300 acgggaggca aaaggggcca aaaaggtcat gcgcagcatt ttatgcacag tgatgcgctg 360 cgcatgcgct gagtggttgg gcacccgggg ggagactatg tgttttatgt gggggatgaa 420 aagaatggga aaacaatgga aaaagcagta ttcatgacat gtgaaaatgc tgatttaata 480 attttggttt taagttatat agtaatacaa atgaagtgta tgaaagtgtg aaatacttac 540 ctgaaagatt tgtttatgtc catacttacc ttatacttac cttcaatata gtttaatcca 600 aactgggcgg agtcactcac tataaataga gtgacttgtg ctcagttcag ggtgggagtc 660 gttggagata gaaggtgtga gtagtggcga atagttggtt tttgataatt ctggagaaca 720 aatattttgt atataattat tttgttactt tattgtctct atttgtttgt tttgttgttt 780 gtttattttt tgtatatata tatattctat tttttttttc ttttattttt gggatttttg 840 gaatggaaac ttttgcactg taaataaacg caccttgcac catagtgcta cgtcggctga 900 gccatcattc attttttttt tcttgtttca cacacacccg aaaaacgcct ctgtgtgaca 960 // ID Gypsy142-I_DR repbase; DNA; ZEB; 4156 BP. XX AC chr15; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy142-I_DR; KW Gypsy142-LTR_DR; Gypsy142_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4156 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1545-1545 (2008). XX DR Genome; chr15; Positions 41201139 41205294. XX CC Positions [2991-3470] - Integrase core CC 'AACT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 36..3935 FT /product="Gypsy142-I_DR_1p" FT /translation="MMALIGSIGEFAPKAESWSAYIERLEQFFVANEISQE FT KQVATLLSVMGATTYGLLRNLVQPEKPKDKSYKEIVDTLKSHFEPKPLLIA FT ERFRFNHCNQRADETVTEYAAELKQCAVSCEFGATLDEALRDRFVSGIRNE FT ACQRRLLSESNLTFARAFEIALSMETAEKDTQQLRGHDSHSSVVHKVEVRP FT FSQREKKCYRCHGKNHSAQVCHFKDARCHNCGKIGHIKRACRGKMEVGKRK FT LYAEKNTKYVEAEEAELPMFSLRAEENSKKSFTETFDVNGKEIMMEIDTGA FT AVSIISEKEFKAVFPTEPLEVARVKLKTYTGELMPVLGQFEAKVSYDGQTE FT VLSLIVVKGEGPSLCGRNWLQKLSLNWRKIKHVSQVQQVTSIEEVLESCSE FT VFKEELGTLRGLKAKIRVNPEISPRFCKARPLPFSMKAQVDAELDRLEKAG FT IITPVKYSEWAAPIVPVVKKDKTLRLCGDYKLTVNQAVTTEIYPLPRIEEL FT MATLSGGTMFSRIDLASAYQQVLLEDESKELLTINTHRGLFVYNRLPFGVS FT SAPSIFQRIMENLMRDLDVIVYLDDLLVTGKTEQEHLQRLQAVLKRLQENG FT LRVKKSKCEFGKSQIEYLGFVLNSQGLHPSPDKVDAVKNAPVPTCVKELKA FT FLGLVNYYGRFLPNQSTMAAPLYRLLRDNVPWEWKQSEQEAFEKCKDLLTN FT EKVLIHYDPSLPLTLACDSSAYGIGAVLQHTMPTGEERPIAYASRTLSPAE FT KKYSQIEKEGLSLIFGIKKFHQYLWGRKFKMVTDHKPLLTLFGEHKSLPTM FT AAARIQRWAIILSAYDYHIEYCPSEKNSNADGLSRVPLPEMTDGGTAALSE FT HIHALIAEHFEQAPLKAEHVSRVTRRDKVLSKVLKYVMEGWPMNVDENLKA FT YHLRRLELSAERGCVLWGTRVIIPEKLRKIVLKELHVGHQGMVKMKALARK FT YVWWPKMDAELEQVCRTCEPCQMEQKAPRQVPLHPWEFPGQSWKRLHIDFA FT GPFLGHTFMIIVDAYSKWLEVFRMPNLTSQSTISRLRRLFAAYGLPEHIVT FT DNGTQFTSEEFKNFMQQNGILHSTSAPGHPATNGLAERYVQTFKGGIKKLA FT HVTMDLEDKISLFLMQYRTTPNCTTGQSPADLFLNRHVRTRLDFVHPDVTV FT AVRRKQYLQKFHHDKRSVERSFSEKDAVYLRNTTGKGNKWVPGVIVKQTGP FT VSYNVQGQDTDGTFRRHCDQLRPRVVVDDPVEQMDCSETGTETSVRPCASE FT EEEPVISETSEKDSVGNTSMAQGLRRSSRIRKTPKRYDP" XX SQ Sequence 4156 BP; 1266 A; 794 C; 1043 G; 1053 T; 0 other; gtaaaagttg gcgacgagga taaactaatg ctcggatgat ggcactcatt ggaagtatag 60 gtgaatttgc tccgaaagca gaatcttggt cggcctacat tgaacggctg gaacagttct 120 ttgtggcgaa tgagataagc caggagaagc aggttgcaac actcttgagc gtgatggggg 180 caacaaccta tggtttactg aggaacttgg tacaaccaga aaagccaaag gataagtcgt 240 acaaagaaat tgtcgacacc ttaaagagtc attttgaacc aaagccgtta ctgattgccg 300 aaagatttcg ttttaatcat tgcaatcaaa gagccgatga aacagttacg gaatatgcag 360 cggagctgaa acagtgtgct gtgagttgtg agttcggggc aacgctggat gaagcattgc 420 gtgatcgatt cgtgagtgga atccgcaacg aagcctgcca gcgacggctg ttgtcagaat 480 ctaatctgac gtttgcacga gcgttcgaaa tcgcattaag catggagaca gctgagaaag 540 acacgcagca gttgcgaggg catgactcgc actcgagtgt tgtacataag gtggaagtgc 600 gacctttcag tcaacgagag aaaaagtgct atagatgtca tgggaaaaat catagcgctc 660 aggtatgtca ttttaaggat gcaaggtgcc ataactgcgg aaaaattggg cacattaaga 720 gagcatgcag aggaaagatg gaagttggaa aaaggaaatt gtatgcagag aagaatacta 780 aatatgtgga ggcagaggaa gctgaattac ccatgttttc actacgggct gaggaaaata 840 gcaaaaagtc attcacagag acgtttgatg taaatggcaa ggaaattatg atggaaattg 900 atactggcgc ggccgtgagt atcatatctg agaaagaatt taaggcggta ttcccaactg 960 agccactaga ggtcgccaga gtcaagctaa aaacttatac aggtgagctt atgcctgtat 1020 taggacagtt tgaagcaaaa gtgagctatg atgggcaaac tgaagtgcta agtttgattg 1080 tggtgaaagg agaggggcct tcgttgtgtg gcaggaattg gctgcaaaaa ctctcattga 1140 attggaggaa aattaagcat gtcagtcaag tgcaacaagt cacgtctatt gaagaggtgc 1200 tggagtcatg ctctgaggtg tttaaagagg aattgggcac actcagaggg cttaaggcaa 1260 aaatacgagt taatcctgaa atttctccac gtttttgtaa agctcgtcca ctgccttttt 1320 caatgaaagc tcaagttgat gcagaattag accggttgga gaaggctggc ataatcacgc 1380 ctgttaaata cagtgaatgg gcagcaccca ttgtgccagt ggtaaaaaaa gacaagactc 1440 taagattgtg tggagactac aaattaactg tgaatcaagc agtcactaca gaaatttatc 1500 cactaccgcg gattgaggag ttgatggcta cactgagtgg aggtaccatg ttttccagga 1560 ttgacctggc ttccgcgtat cagcaggtcc ttcttgaaga tgagtccaaa gagttgctaa 1620 ccatcaacac gcaccgagga ttatttgtct ataatagact tcccttcggt gtcagttcag 1680 cgccctctat ttttcaaaga attatggaga atctcatgag agatcttgat gttattgttt 1740 accttgatga cctattggtc actggaaaaa cagagcagga acacctgcag agactacagg 1800 cggtgctgaa aagactccag gagaacggtt tgagggtaaa aaaatctaaa tgtgaatttg 1860 gaaaaagcca aattgagtat ttaggatttg tgctaaacag ccagggcctt catccatccc 1920 cagacaaagt ggatgcagtc aagaatgcac cagtacctac ctgcgtgaag gagctgaaag 1980 cttttttggg gttggtcaat tactatggcc gttttttgcc caaccagagt actatggctg 2040 ccccgctcta cagactgctg agagataacg tgccatggga atggaaacag tctgagcaag 2100 aagcttttga aaaatgtaaa gatttgctga caaacgaaaa ggttttaatt cactatgatc 2160 ctagtctacc acttacctta gcttgtgatt cttctgccta tgggatcggc gctgtgttgc 2220 aacacaccat gcccaccgga gaagaacgtc ccattgctta cgcctcgcgg acactgtcac 2280 ctgctgaaaa aaagtattcg cagattgaaa aggaggggtt gtcgctgatc tttggaataa 2340 aaaaatttca ccagtacctt tggggaagaa agttcaagat ggtaactgac cataaacctt 2400 tgttgacttt gtttggagag cacaaaagtc tacccaccat ggcagcagcc agaattcaaa 2460 ggtgggcaat cattctgtcc gcttacgatt accacattga gtactgtccc tcagagaaaa 2520 acagtaatgc agatggcctt tcacgtgttc ccttaccgga gatgactgat gggggcacag 2580 ctgccctatc agagcatatt catgcactaa tagcggagca cttcgaacaa gctcctttga 2640 aggcagaaca tgtgtctcgt gtaacacgca gagacaaggt gttatccaag gtgcttaaat 2700 atgtcatgga aggttggcca atgaatgtgg atgaaaatct aaaggcctat cacttgcgga 2760 ggttagaact gtcggcagag agaggatgtg tactttgggg caccagagtt ataattcctg 2820 aaaaactgag aaaaattgtg ctgaaagagt tgcatgtcgg ccatcaaggg atggtaaaga 2880 tgaaggcact tgcccgcaag tatgtgtggt ggcctaaaat ggatgcagaa ctggaacagg 2940 tttgtagaac ctgtgagcca tgtcagatgg aacaaaaagc accccgccag gttcctttac 3000 acccctggga atttcctggt cagagctgga agagattaca catagacttt gctggtcctt 3060 tcttaggaca cacatttatg attattgtgg atgcatattc aaagtggctg gaggtattta 3120 gaatgcctaa ccttacttca caatccacca tttcaaggct gagaaggctg tttgcagcat 3180 acggacttcc ggaacacata gtcacagaca atggaacaca attcacttcg gaagagttta 3240 aaaattttat gcagcaaaat ggcattcttc attccacaag tgccccaggt catcccgcta 3300 ccaatggcct tgcagaaagg tatgtgcaga catttaaagg gggaataaag aaacttgcac 3360 acgtgacaat ggacttggag gataagatct cactgttttt aatgcagtat cgcactactc 3420 cgaactgcac tacagggcaa tcaccagccg atcttttcct taacagacat gtgcgcactc 3480 ggctggactt tgttcaccca gatgtcactg ttgctgttcg caggaaacag tacctacaaa 3540 agttccatca tgacaaaagg tctgtggaga gatcattctc tgagaaggat gcggtttacc 3600 ttcgtaatac cacaggaaaa gggaacaaat gggttcctgg agtgatagta aagcaaaccg 3660 gtcctgtgtc ttataacgtt cagggacaag acacagatgg cactttcagg agacattgtg 3720 atcagttaag accacgggtt gtagtggatg atcctgttga acagatggac tgctctgaga 3780 ctgggactga aacttccgtt agaccctgtg catcagaaga ggaggaaccg gtcatttcag 3840 agacttctga aaaggacagt gtaggaaata caagcatggc acagggtcta cgtcgctcat 3900 ctagaattag aaagacacca aaacgctatg acccataatg cttaggaatt acaactgtgc 3960 aaatgcaacc aagtttaaaa tttgcaatgg tctcgtaata tactaggttt agtatttttt 4020 tttttctctc tgctgttgtg ttaatgcaag aagaatttaa cagctactgt tagttttgat 4080 agtaatgttg ttgcatgtgc agtttttttt tatgtgaagg taacagtaaa tgtactgtgt 4140 ttaaaaaggg gggtaa 4156 // ID Gypsy67-I_DR repbase; DNA; ZEB; 4036 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy67-I_DR; KW Gypsy67-LTR_DR; Gypsy67_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4036 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1449-1449 (2008). XX DR Genome; chr14; Positions 28467367 28463332. XX CC 'TTATC' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 29..3958 FT /product="Gypsy67-I_DR_1p" FT /translation="MAQIGRLDEYKPENESWSAYIERVELFMIANDVDDTK FT QVATLLSAVGASTYGLLRNLVQPLKPKDKTFREIVGILQAHFEPKPLIIAE FT RFRFQHCIQKPHETVSQYVAELKQCASKCDFGASLDEALRDRFVSGIQSEA FT CQRRLLSEDALTFARAFEIALSMETADRDTQQLRKAEDAAAVHKVKTTHHQ FT RYACYRCKGSNHKAHECYFKNSKCHQCGKFGHLKKACREKEKPYPNKSKHE FT RKSNVTAYMDAEEPELEMFSLLSDAEKQKFKVNFTVNQQDVLMEVDTGAAV FT SIISHQTYKQSFANVPLERAEVKLKTYTGESIKVLGQFRATVQYEHQINEL FT ALLVVEGNGPNLCGRNWLHKIKLNWKGIYYVNQMQQPTSIQEVLEKYNEVF FT RDELGTLKDIKATISVKSDVQPKFFKSRPLPFAMKEQVEKEIERLEKANII FT SPVTYSEWAAPVVPVVKKDKTIRLCGDYKVTVNQVANTETYPLPRIEEVLA FT TLSGGKVFSKIDLAQAYQQVLLDDDSKKYTTINTHKGLYVYNRLAFGISSA FT PSIFQRIMENLMKDLKVVVYLDDLLIVGKDEQEHLSIVSQVLQRLQDSGLR FT VKKSKCEWGQTQIEYLGHVMDGGGVYPTKDKVKAVQDAPAPLNVKELRAFL FT GLVNYYGRFVPNQSTVLALLYRLLKEQVIWYWGKKEQSAFEKCKELLTSDK FT VLVHYNPELPVKLACDASAYGIGAVLQHTMPNGEERPIAYASRTLSPAEKK FT YSQIEKEALSIIFGVKKFHQYLWGRSFKLMTDHRPLVTLFGEHKHFPMMAA FT ARIQRWAIILAAYDYHICYRKAEMHGNADGLSRVPLPEITDVGTEAMSAHI FT NTLLSNHIHEAPINASQIARATRRDLHMSKVAQYVMEGWPSEVAEELEVFQ FT TKRDELSMEQGCVLWGTRVIIPFKLRKSVLYELHAGHPGIVKMKALARKYV FT WWPKIDMDVEKCCKECEICQQEQRVPSQVPLHPWEFPGESWKRLHIDFAGP FT FLNNMFMIVVDAYSKWLEVFRMSHITSQATITRLKRLFSAYGLPEQIVTDN FT ATTFTSDEFQTFVRRNGILHTTGAPRHPATNGLAERYVQTFKVGMKKLKNE FT QMCIEDKISLFLLRYRTTPNCTTGQSPSDLFLRRHVRTRLDFLKPNIHEVV FT RKKQYLQKHLHDHKARERSFAVDESVFLRNTAGGDPKWLSGVVVQQTGPVS FT YKVCDSSSTTMYKRHGDQLRPRHSSSETDQLSSYSSLEKPSSEKELTDSSS FT QNSGLTESPAVECESGLPVSGAASPVGPASPVGPRRSKRTIKLPQRFRD" XX SQ Sequence 4036 BP; 1288 A; 692 C; 954 G; 1102 T; 0 other; aactggcgac gaggaaaatt gtgtagatat ggcccaaatc ggaagattag atgagtataa 60 accagaaaat gagtcctggt cagcatatat tgagcgagtg gagttattca tgattgctaa 120 tgatgtggat gatacaaagc aagtggcgac attactaagt gctgttggtg cttccactta 180 tgggctattg cgaaacttgg ttcaaccttt aaagccgaag gataagacat ttagagaaat 240 tgtcgggatc ctgcaagcac actttgaacc caaaccgcta ataatagctg agcgcttccg 300 atttcagcac tgcatacaaa agcctcatga aactgtgtcc cagtatgttg ccgaactcaa 360 acaatgtgcg tcaaaatgtg actttggtgc aagtttggat gaagcgttac gtgatcgttt 420 tgttagcgga atacaaagcg aagcatgcca acgaagattg ctttcagagg atgcgctcac 480 gtttgcacga gcatttgaaa ttgctctcag tatggaaaca gctgatcggg atacgcaaca 540 gctgaggaaa gcagaggatg cggcggctgt gcataaagtg aaaactacgc atcaccagag 600 gtatgcatgt tatcgatgta aaggtagcaa tcataaggca catgaatgct attttaagaa 660 ttctaaatgc catcagtgcg gaaaatttgg tcatttaaag aaagcgtgta gagaaaaaga 720 aaagccttac ccaaataaaa gtaaacatga gagaaagagc aacgtgacgg cttatatgga 780 tgccgaagag ccagaattgg aaatgttttc tctgttgagt gatgcggaaa aacagaagtt 840 taaagtgaat tttactgtga atcagcaaga tgtactgatg gaagttgata caggagctgc 900 tgtgagtata atttcccacc aaacatacaa acagtcattt gctaatgtac cactggaaag 960 agctgaagta aagttaaaaa cttatactgg tgaaagcatt aaagttctgg gacaattcag 1020 agccactgtt cagtatgaac accaaataaa tgagttggcc ttgcttgtgg ttgaaggtaa 1080 cggacctaat ttgtgtggta gaaattggtt gcataaaatt aagttgaatt ggaaaggaat 1140 ttactatgtg aatcaaatgc aacagcctac atcaattcaa gaagttcttg agaaatacaa 1200 tgaagtcttc agggatgaat tgggaacttt gaaggatatt aaagctacaa tttcagtaaa 1260 atcagatgtg cagccaaaat ttttcaagag tcgtccacta ccctttgcca tgaaagaaca 1320 ggtagaaaag gaaattgagc gattggagaa agcaaatatt atatctccag ttacatacag 1380 tgaatgggct gctccagttg tccctgtggt aaaaaaagat aaaacaattc ggctatgtgg 1440 agactataag gtcactgtaa atcaagttgc taacaccgaa acgtacccgt taccaagaat 1500 tgaagaagtt ctggccactt taagtggagg aaaagtattt tctaaaattg atctagctca 1560 ggcataccag caagttttgc ttgatgatga ctcaaagaaa tacacaacaa tcaacactca 1620 caagggactt tatgtctata atcgtctggc ttttggcatt agttcagctc cctctatttt 1680 tcaacgtatt atggaaaatt tgatgaagga cctgaaagtt gtggtatact tggatgattt 1740 gttgattgtg ggaaaagatg aacaagagca tctatcaatt gttagtcaag ttctccagcg 1800 gttacaagac tctggattaa gagtaaaaaa aagtaaatgt gagtggggcc agactcaaat 1860 tgaatatctg ggtcatgtga tggatggagg aggagtgtat ccaacgaagg acaaggtgaa 1920 ggctgtccag gatgctccag cacctttaaa tgtaaaggaa ttgcgagctt ttttaggctt 1980 ggttaattat tatggacgtt ttgtaccaaa tcagagcaca gtgttggctc tattgtatag 2040 acttttaaaa gagcaagtga tctggtattg ggggaagaaa gaacaaagtg catttgagaa 2100 gtgcaaagag ctgttgacta gtgataaagt gttagtacac tataatccag agttacctgt 2160 gaaactagca tgtgatgctt cagcttatgg aatcggagca gtgttgcagc atacaatgcc 2220 aaatggtgag gaacgcccta ttgcttatgc ttcacgcact ttgtcacccg cagaaaagaa 2280 atattctcag attgagaaag aagcattaag cattattttt ggtgtaaaaa aatttcatca 2340 atatctgtgg ggacgatcct ttaagctgat gactgatcac agacctcttg tgactttatt 2400 tggggaacat aaacatttcc caatgatggc tgctgcccgt atccagaggt gggcaataat 2460 tttagcagcc tatgattacc acatttgtta ccgcaaggca gagatgcatg gaaatgctga 2520 tggattgtca cgagtgccat tgcctgaaat tacagatgtg ggtacagaag ccatgtcagc 2580 acatattaat accctgctat ctaatcatat acatgaggcc cctataaatg catcacagat 2640 tgctagagca actagaagag atctacatat gtctaaagtt gcccagtatg taatggaggg 2700 ttggccaagt gaagtggcag aagagttgga ggtgtttcag actaaaagag acgagttatc 2760 gatggaacag ggttgtgtgc tatggggaac aagggtgatt atacctttca aacttcgaaa 2820 atcagtgttg tatgaacttc atgcaggaca ccccggcatt gttaaaatga aagccttggc 2880 tcgcaagtat gtgtggtggc cgaaaattga tatggatgtg gaaaaatgtt gcaaagagtg 2940 tgaaatttgt caacaagaac aacgagtgcc cagtcaagtt cctcttcatc catgggagtt 3000 tccgggtgag tcttggaaac gtctgcacat agattttgct ggccctttcc tgaacaatat 3060 gtttatgatt gtagtggatg cttattcaaa gtggttagaa gttttccgaa tgtcacacat 3120 tacatctcaa gctacaatta ccagattgaa gcgattgttt tcagcatatg gattaccaga 3180 acagattgtg actgacaatg caactacttt tacttcagat gaatttcaga catttgtaag 3240 aaggaatggt attttgcata ctactggggc acctagacat cccgccacca atgggttggc 3300 agagagatat gtccagacct ttaaggtggg catgaaaaaa ctgaaaaacg agcagatgtg 3360 cattgaagat aaaatatctt tatttttatt gcgctatcgc actactccaa attgtaccac 3420 tggccaatca ccatctgatc ttttcttgag gagacatgtc cgtaccaggt tggatttcct 3480 gaaaccaaat attcatgaag ttgttcgcaa gaagcaatat ttgcaaaaac acttacacga 3540 tcataaggca cgggaaaggt cctttgctgt tgatgagtct gttttcttgc gtaatacagc 3600 tggaggagat ccaaaatggt tgtctggagt ggtggtgcag cagactggac cagtatcata 3660 taaggtgtgt gattcatcat caaccactat gtataaacgg catggtgatc agttgaggcc 3720 acgtcattct tcatcagaaa ctgatcagtt gagttcgtac agctcattgg agaaaccaag 3780 ttctgaaaaa gagttgactg attcaagcag ccaaaattca gggttgacgg aatctccagc 3840 tgttgaatgt gaatctggac ttcctgtatc tggggcggca agcccagtgg gacctgcaag 3900 cccagtgggg ccacgtcgat ctaaacgaac tattaagtta ccacaacgtt tcagagacta 3960 agtggaaaca tagtaatgtt gaactgtatt ttgtaatgtt tgatttatgt gatgattatc 4020 ttaagaggga aggtga 4036 // ID Gypsy65-LTR_DR repbase; DNA; ZEB; 1129 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy65-I_DR; KW Gypsy65-LTR_DR; Gypsy65_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1129 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1446-1446 (2008). XX DR Genome; chr14; Positions 18741960 18743088. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1129 BP; 317 A; 161 C; 247 G; 404 T; 0 other; tgtaaccggg gtaatattca tttgaaattt aattaatttc cccagttata caaccatatt 60 tagttttgtt tgtttttaag cgctgcccct ttaagagttc acgcacacac aaacgcggaa 120 aaaaaggtgc acgtcagata cagagagcgc ggtgtgaagc agttacagac agggggaaag 180 aaagttgata caactgttta agagagaaag tttgcagttt tgtgttaact tgggagcagg 240 tagcagatgc aggtggtgtg agtttgtata tctgctggct taatttgctg ctcagctgat 300 cattgttcct gcatgtgaga aatctcggtg aggaaggaga attcagagca cgctagtgga 360 gattgagcgt gagtgaactt aataaatgtt ttaagagttt gataatgttc caaaatgttt 420 gataacgctg tttattctct atataagctg ttgatacaat aactgttatt ccttatgctg 480 tgttagaggg gagaagagca gtaaacaaga tcacacaggt gctaactgct aatcgcacgt 540 gtgcttcaga gaatcagaac gagtgagttt gctcatttta ttttaatgga gtgtgattat 600 gtgtaagaca tgaaagtttt gttctcagtg taacatgtaa acttatgtgt gggtcctgaa 660 cacattacgt tatgtaaaaa atgccttttt atgtgaatta tcatatgtgt gatgtattgt 720 atgcagtgga atgtgctgat tgctcattta ttctgttttt gtttgcttta ttaggcatag 780 ataagccact gccgttgttt tgcacatatt aaatagttta ttaacgtggt tttgaactac 840 tttaagtcac atagggattt taaattggat gttttgaata ctgtcaaaga atttattctt 900 atagttatat atacatttat ttttcctgtt gctgaccttt tttattttgt cataaagcca 960 actttgttta ttgattggac ctgttttaaa gcttgctgct catttcccaa acaaagtaag 1020 ttaatgctgt gtctccttga ggctcagtcc agaataaaag aatccatttt tcttatagat 1080 gttttggctg tgttatttcc ctgataaaag tagggtgtta ttggccaca 1129 // ID DNA-N4_DR repbase; DNA; ZEB; 1042 BP. XX AC . XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Mariner-N6_DR; DNA-N4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1042 RA Jurka J.; RT "Mariner-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1611-1611 (2008). XX DR [1] (Consensus) XX CC 4-bp TSD (TATA). 96% identical to consensus. Possible Piggybac. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1042 BP; 298 A; 235 C; 207 G; 302 T; 0 other; gacctttttc ttggaacgca aaattaccca ttatgctttg cgtgtatgcg caaggagaat 60 gcgaagaact tccggtcggc agctgatgcg tgtaaacagt cacatttgga cagctttgaa 120 acgatagttt taatctattt tacctgcttt tatcctcttt gaactaatac tgttgtccat 180 cagcaccatc tcctggactg atcatttaaa gaaatgcgat ctaataggat ggaaaacagc 240 tgctgtgagg cgttttcccc tcgttgtcag acggcatctt ctgcagttta aatgaagcct 300 cgtcatcgct aaagtcaaca ttttctcttt atttcaaata tataaccagc ccaaacaaat 360 gtagttcacc aaaatgtttg acacacacac gtagcctgta ctgtgccact gacacactga 420 tttaataact gtgacaaagt gaaaatatag gctagtgtca tttcgaaatg acagaaaaac 480 acaaaccgtg gtgaaaacaa cacacgaaat aaaacacaaa cggctcgcga ttagaatgaa 540 cagcaaatca gccagcagag gatcaataac agacttttat tggtcatttt tgtaaattgc 600 acgactttca tacctgttaa gtttcatgcc agtactctgg tttgctctct ctctctctct 660 ctctctctct ctctctctct ctgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtta 720 aagagccgct gagctaacag ctgacaggcc gggaacgcac acacacactg tatttctgac 780 ggcagacacg tgaactagga gaacgttttt ctcgtatttc tgacgcgctt gaaccacatg 840 tgacagatta taacggcttt aacagacaca tttctaagtt gtgtgtcaca aaaacactct 900 gttatccatt aaaaacgtta ttgaaaccca ttttcggagc gcaaattacc agttgtacac 960 gctgtaaacg gaagttttta gcattctacc ggaagacgct ggtcgctatg gcgccctcca 1020 tgcagcagcc atgaaaaagg tc 1042 // ID Gypsy145-I_DR repbase; DNA; ZEB; 4215 BP. XX AC chr10; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy145-I_DR; KW Gypsy145-LTR_DR; Gypsy145_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4215 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1551-1551 (2008). XX DR Genome; chr10; Positions 8886400 8882186. XX CC Positions [1630-2085] - Reverse transcriptase CC Positions [3103-3582] - Integrase core CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(673..1485,1489..4215) FT /product="Gypsy145-I_DR_1p" FT /translation="MRSRRSHLRENVSLYQSSETSEPEAMQIGHTRLTPAE FT RQRRLHNDLCLYCGENGHLKLSCPVKPQAPPTPTVSTYFQFPKKVCLPVQI FT SFTNHVINTSAFVDSGSAGNFIDLVFAKKHNIPLMPCVSVLTVAALDGRPL FT GNGRVQAITEEITLSTGALHTEKIHLFAIQTSDNPIILGLPWLQKHEPTIS FT WSNAQVIQWSDTCFKTCINVPKPQKIQAASVDVTSSDNHIIPPEYSDLLEA FT FSKTRASQLPPHRSCDCAIDLIPGTVPPKGVFPLSQPETEAMKSYIEEELS FT KGFIVPSTSPASAGFFFVKKKDGSLRPCIDYRGLNEVTVKFPYPLPLVPSA FT LEQLRTAKVFTKLDLRSAYNLIHIRSGDEWKTAFSTTTGHYAYRVMPFGLA FT NSPSVFQAFINDVFRDMLGRWVIVYMDDILVYSNSLESHVKHVRLVLQRLI FT AHQLYAKPEKCEFHQTSIAFLGYVIGADGVTMDDTKVQAVCKWPRPTTVKE FT MQRFLGFANFYRRFIRNFSTIAAPLTSMTKGTRRHLIWSQEAESAFSDLKQ FT RFSSAPILRHPDPDREFIVEVDASNTGIGAVLSQRHGNPAKLYPCAFFSRK FT LNSAEQNYDVGNRELLSMKAAFEEWRHWLEGAKVPFTVLTDHRNLEYLRSA FT KRLNHRQARWALFFTRFNFVVTYRPGSKNTKADALSRQFETELQPIPSEPI FT LHPTIVVALIQWDIHTEIREGLLNDPTPIECSPDKLYVPTNLRDRVFQLTH FT NLPSTGHPGINATLEIISNKYWWSSLRSDVIKYVKNCEVCNVSKPSKQLPA FT GLLQPLPIPQRPWSHIAIDFITDLPVSRGNTTILTVIDRISKGCRLIPLPK FT LPTALETAEALCHYVFRFYGIPDDIVSDRGPQFTSRLWSEFFKALNVNVSL FT TSGYHPESNGQTERLNQEINRFLRSYCHRNQDEWSNFLFWAEYAQNSLRKQ FT STGLTPFQCILGFQPPLFPWSGEPTNLPAVNDWLRQSETTWNEAHTHLQRA FT VRRQKEQADKHRRPHPNYQPGSWVWLSTRDLRLRLPSAKLSPRYVGPFKII FT RQISPVSYRLDLPAQYRISPTFHVSLLKAAGAPRGEDDLDEDRLQRPSPLI FT IDGEEAYQVQEILDSRRRGGTLQYLIDWMGYGPEERSWVNAKDILDPSLTI FT DFHRSHPNKPAPRSRGRPRTS" XX SQ Sequence 4215 BP; 1190 A; 1103 C; 786 G; 1136 T; 0 other; aaagacttcg ccgaacattg atccagcagc tgtttaccaa ctttcgtcgg agatatcttc 60 acaagctaac gtattagcaa cacatcaaca acagctggcc cggctcacct cgctcacaga 120 cgaattggtg aaagccatgc aaaacattca agtaactaat ataaatcctg ttgctccagt 180 ccctgcctca ctacccaaat caacaccagg taatcctttt atggcaagcc cgcgtctagc 240 ttacccagag aagtttgacg gcactccgga taaatgtaaa ggcttcctcc ttcaatgctc 300 gctgtttatc gcccagcagc cttcactata cccggaaacg gaaaacggtc agatcgcctt 360 catctgctca ctgctcacag gaaaagccct ataatgggct acagccgtct gggatataaa 420 tcgtccagta tttcccagtt ataagaggaa gcgggtgagc aaattttaaa attacgtcaa 480 ggaaaatcta cagctgccga tttttccatt aaatttcgca ctttagcagc tcaatctaaa 540 tggcctgatg cacctctcaa actgcatttt cgcagagcat taaatcctga gttacaaaca 600 gagcttgctt gcagagatga gagcaaaaca ctggaggagc ttattaatct caccatacgg 660 cttgacaatc tcatgcgctc acgtagatca caccttaggg aaaatgtatc tctgtaccag 720 tcttcagaaa catcagagcc tgaagctatg caaattggcc atacccgtct cactccagct 780 gaacgacaac gcagacttca taacgattta tgcctgtatt gtggagagaa tggtcatttg 840 aagcttagct gtcctgttaa accccaagct ccgcccactc caacggtaag cacttatttc 900 cagttcccta aaaaagtttg cttgcctgtg caaatatctt ttaccaacca tgtcattaat 960 acatcagcat ttgttgattc aggttcagct ggtaacttca ttgacctggt atttgctaaa 1020 aaacataaca ttcccctcat gccttgtgta tctgtgttga cagtggcggc gctagacggc 1080 agaccattag gtaatggtcg agttcaagcc ataactgaag aaataacctt atcaactgga 1140 gccctacaca ctgaaaaaat tcacctgttt gccattcaaa cctctgataa ccccattatt 1200 cttggactcc catggctcca gaaacatgaa ccaacaattt cctggtcaaa tgctcaagtc 1260 attcagtggt cagatacatg tttcaaaact tgcatcaatg tgcctaaacc acagaaaatt 1320 caagcagcat cagttgatgt aacttcttct gacaaccaca ttatccctcc tgaatactct 1380 gatctcctcg aagcattcag caaaaccaga gcctctcaac ttcctcctca tcggtcatgt 1440 gattgtgcca ttgatttaat tccaggaact gttccaccaa agggttgagt ttttcccctc 1500 tcccaacctg aaacagaagc tatgaagtct tatatcgaag aagaattatc aaaaggcttt 1560 atagtcccat caacctcccc agcttctgca ggcttctttt tcgtaaagaa gaaagatggt 1620 tcacttcgac cctgtatcga ttatagaggc ctgaacgaag taactgtgaa atttccatac 1680 ccacttcctc tagttccttc agctctggaa cagctccgta ctgcaaaagt atttactaaa 1740 ttggatcttc gcagtgcata taacctcatt catattcgta gcggcgacga atggaaaacc 1800 gccttttcca caaccactgg tcactatgcc tacagggtaa tgcccttcgg attagccaac 1860 agtccctctg tgttccaagc gttcatcaat gatgttttcc gtgacatgct tggaagatgg 1920 gtcatagttt atatggacga tatcctggtc tattctaatt ccctggagtc acatgtcaaa 1980 cacgtcaggt tagtgctcca acgtctaatt gcacatcagt tatatgctaa acctgagaaa 2040 tgtgagtttc accaaacatc cattgccttt cttggttatg taattggtgc tgatggagtc 2100 acaatggatg ataccaaagt tcaggcagtc tgcaaatggc ctagacctac tactgttaaa 2160 gaaatgcaga gatttctagg gtttgccaac ttctacagaa gatttatccg aaacttcagc 2220 actattgcag ctccgcttac ttccatgact aaggggactc gacgacacct tatctggtca 2280 caggaagcag aaagtgcatt ctccgacctt aaacaaagat tctcttcagc tcctatccta 2340 cgacacccag atcctgaccg tgagtttatt gtagaagttg atgcatccaa cacaggtatc 2400 ggggcagttc tctcacaacg tcatggtaat ccagctaaat tatacccctg tgcatttttc 2460 tctagaaaac tcaactctgc tgaacaaaat tacgatgtgg gcaaccgtga actactttca 2520 atgaaggcag cttttgaaga atggcgacac tggctagagg gggctaaagt tccctttact 2580 gtactcactg accaccgaaa tctagaatat ctgcgttcag ctaaacgatt aaatcaccgc 2640 caggctcgtt gggctctttt ctttacacgc tttaactttg ttgttactta ccgacctgga 2700 tcaaagaaca caaaagctga tgctttatcc cgccaattcg aaactgaatt acaacctatt 2760 ccctctgaac ccattcttca tccaaccatc gttgtggcac tcatacagtg ggacatccac 2820 actgagattc gggagggact acttaatgat cctactccta ttgagtgttc tccagacaaa 2880 ctctatgtac caaccaacct acgagaccgt gtctttcaac taacccataa ccttcccagc 2940 accggccatc ctggtatcaa cgccacactc gaaatcattt caaacaaata ctggtggtca 3000 tctctacgct ccgatgtaat caagtacgta aagaattgtg aagtctgcaa tgtttcaaaa 3060 ccctcaaaac aattaccagc gggtctgctg caaccactac caatcccaca gagaccttgg 3120 tcacacattg cgattgattt cattactgac ttgcctgtct ccaggggtaa cactactatt 3180 ctcacagtaa ttgatcgtat ttccaaggga tgtcgactca taccattgcc taaactaccc 3240 acagctcttg aaactgctga agcactctgt cactatgttt tcagatttta tggcatacct 3300 gatgacattg tatccgaccg tggtccccag ttcacttctc gtctgtggtc tgaatttttc 3360 aaagccctca atgtgaatgt aagcttaact tctggatacc atcctgaatc caacgggcaa 3420 actgagagac tgaaccagga aattaatcgc tttctccgat catactgtca tagaaatcag 3480 gatgaatgga gtaacttttt gttttgggct gaatatgcac aaaattctct tagaaaacaa 3540 tccacaggac tcacaccttt ccaatgcatc ctaggatttc aaccacctct gtttccctgg 3600 tctggagaac cgaccaatct acctgcagtc aatgattggc ttcgacaaag tgaaacaaca 3660 tggaatgaag cacatactca cctacaacgt gctgtcagaa gacaaaagga gcaggcggac 3720 aagcatcgtc gaccacatcc caattatcaa cccggatctt gggtttggct ttccaccaga 3780 gatcttcgtc tacgactacc cagtgccaag ctcagcccta ggtacgtagg tcctttcaaa 3840 attattcgac aaatttctcc agtttcttac cgcctagatt tacctgctca atatcgtatc 3900 tcacccactt tccatgtttc tctgctcaag gccgctggtg ctccgagagg ggaggatgac 3960 ctagacgagg atcggttaca gagaccttcc cccctcatca tcgatggaga agaggcatat 4020 caggttcagg agatcctgga ctcacggcgt cggggtggta ctcttcagta cctaattgac 4080 tggatgggtt atggacctga ggagagatct tgggtaaatg ctaaagacat tcttgaccct 4140 tctctcacca tcgacttcca cagatcccat cccaataaac cagctccacg ctcccgtggg 4200 agaccccgaa cctct 4215 // ID DNA-4-1_DR repbase; DNA; ZEB; 612 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD ATAT; KW DNA-4-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-612 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1230-1230 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. The TIR is imperfect. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 612 BP; 208 A; 107 C; 81 G; 216 T; 0 other; gagcaattcc atgaaaatgt caaccttgcc atgaaaagaa aaatgttttc accaaaatag 60 cgaaaccgtt tctacatttt ttgcgcaggc aagtgtttta cagtacttta aaaatactta 120 aaaatgtctc tgtcaatgtt ttcaaactat tatatttgat ttaccaaagt caaacaaatg 180 gcaatttcac atccgtcaca tccataacga aaaagtgtca catccataac aaagcttttc 240 cctcaaaaat gcaaaaatgt caaaataaaa taaaatcaat cagttttgtt ctataaagag 300 ccaactctta tccttttgtt gagcattatt tattttgggt tgtgcagttt aattcacaga 360 atttctacga aatatgttgt atactgtaaa ctcgcagact ttttttgtca catccataac 420 gcttgtttat tttcctcatt taaagtataa aaatgttcac agattgaatt tttttgatcc 480 cataactctg tggttagttg ttttggaaaa tataaacaga tgattcaata taatgttaac 540 gtatactttc tctgcgaatt tatgttttga atttttactg caaatgtcac atccataacg 600 ctggaatcag cc 612 // ID ACROBAT2 repbase; DNA; ZEB; 781 BP. XX AC . XX DT 09-JAN-2009 (Rel. 14.02, Created) DT 09-JAN-2009 (Rel. 14.02, Last updated, Version 1) XX DE A nonautonomous DNA transposon: consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; ACROBAT2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-781 RA Jurka J.; RT "Non-autonomous DNA transposons from zebrafish."; RL Repbase Reports 9(2), 562-562 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 781 BP; 253 A; 137 C; 120 G; 271 T; 0 other; gggcccagtt tttcaaaaaa tttaatctgg atcaaattgg tccagatttg gaaatcccat 60 gttttgctat ccaggattaa ctgatctatc ttacttttat gacagttttt taaaggaaca 120 ttgggttgga tcactgtgat ccaaatacca aatttcagga ttaccaaatc ctgtttacca 180 gagccttaaa tggaaccaac agtgtagcct actggcttag caaagaacaa caggtaggca 240 aaataccagt ggccacaaac agccattgtt tgttttaatg gtaagaaaca gaaacactgc 300 aatcaacaaa cattttacat ttttttagtt tgttctaata caatactgct ttgatattgt 360 tttgttgttg ttgtttacca tatctcatta gatgtgacat gcttcatata ggcttcaaat 420 atgaaggaat ttatatatca tcagtaacca ttaagtttgt gatcattcac ttaaaaaaat 480 aaaacttttt ttattttatt tggttaaaaa aaaaaatcac aactgaatcc acccttctga 540 tgggatcagg ataatcctgt tttttttttt tttttttttt tggatcaaat agatcccaat 600 ctgagttcaa agttttgaat aacccaaagg gcaggtttga tccaaatcaa atgtaagatt 660 ggattacatg gtctgatctt aattcaaaat ccctctgtta cttttgaaaa acccatttcc 720 aagatttgat ccaatccgtg atccaaaaat cccactggat tacttttgaa aaactgggcc 780 c 781 // ID DNA-6-N4_DR repbase; DNA; ZEB; 3877 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3877 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1716-1716 (2008). XX DR [1] (Consensus) XX CC Copies are 97% identical to the consensus. TSD is 6-bp long, and CC TIR is ~1370-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3877 BP; 1341 A; 629 C; 638 G; 1267 T; 2 other; tatggaacaa aatcaatgcc taaagtcttg aattaaaaag cttgttgaat atcacaaact 60 gaaaaaaata atacaccgta ttaacaagtt cttgcatttt aatgacttga attccagggt 120 ttgtattttt aggtcctgaa atttaacata tctgtatatt taagctttga atatttcaac 180 gaaaaatatt tcaaacacgt tttcatactt aaaaattcaa taattcatat tcaattttca 240 gatttcactt acaaaatatt caataccctt taaaaatacg atgtataata ttcaaaaggt 300 aattttcagt tcattttatt tcagttcaaa tattcgcttc cataaattca gtggatcaaa 360 ttcgactgtt aaaattcaac attacatccg ggaactgcag gaaaagcaat agattgcaga 420 ttgaagatcg ccagctgctc tagctttcac cagcaggtgg agatggaata atgtaagatt 480 tttaacccag taaacaggcg agaaaaaggc tattaaaggc acaaaagtag catttaatat 540 taatatcaat gctttggaca ttataaatga taaaaagtat gagtaaaatg agtaaatgag 600 gttttagtca tctatatttg cccttatgta acggaagcca gctggtgatc gcgtaaatct 660 cagttctcgg atccagtgac agacgttgtt ctgcaatctt atcggagctt gtctcccatg 720 ccggctcgct ccctgctcgg accggtgcag ttacttgcac ataaaccgtc ttttgatgaa 780 tatgatgtta cacattgtct atctttattg actctttctt ccatagacgc gaagtattat 840 attgacagct aaagaatata aacgtaagag aaacaggara ggaaatatta taataataca 900 aaatagataa acattgacag ggctaagaac aattacgaac aaagatattc aaaagcagag 960 agattttatg agaatgtcac acaaaacagt ttacttggaa ctgttaaaca gtgttcaggt 1020 atgatttaag gttggttatg gatggggcgt ctaatttaaa ataatttaac ataacgtgcg 1080 aatgcattaa aggagcgtct tcagttatta tttaaacgca gctatgtcag gattgggata 1140 ggttatataa tttcattatt taaattatga ttgttaatat taatatttta tgacatgtat 1200 ttgggctatt gcgcttaaat aagtcatccg tgattctaaa ttaatatttc tatttatcct 1260 ttttagctat cactttgcta tcgtctttct cttctttttt tcctattttt tgtggttctg 1320 ccacggttaa acgtttgcaa aatgtcttta gtttgtccag aatattgctt ttccactaaa 1380 atagagacga agcagaagga aaatgaatga attgatgaat gaatgaaatg attgacgttt 1440 ttatgcatgt gcggtgtggc ttgcacacga caaagacgaa cgagtggccg cgagctccgg 1500 tggccggtca gagccagcgc cgcatatcaa actggttaag agatgctgga tggctgccaa 1560 tatatgatag gaattcttcc gaagtttcag atgatatctg ggcttgtgca taataatgta 1620 aaataaaagt gaaagtcatg acatttggcc aagcataaga atagcctaca agagtgtttc 1680 gattattatc atgacatgtc atgcttggga ttatatgaat aacacctcgc tgtttaccat 1740 ttccgaaagt ggagctgacg tcggcgcgaa attgcttgta taattgtaaa aagagtgatg 1800 aatattaatg gatttcttat atctgtcatt tctgataata taattgattt aactagtgtg 1860 agagtatcca acactcattt agcaaaaatg catccagaat cctaaaccaa taactttgtc 1920 aagcaaaaat tatgatttta taggctactg attttttttt aaaacaagaa aaaagaaact 1980 attagctacc ctagcaattg tttatttaaa tatgatcatc tcaaaatata aaataaaaaa 2040 ataatggttt gactttgtga aatcataaaa cagacttgca gaaaagagtt aatagaataa 2100 atgagcaaat acattctctc cagtagatgg cgattaaagc tcagtaataa tcccatgatg 2160 atgaacagag ctgctttgca ggaacccctc aaccacgcat gtcaaaccac gcattctaaa 2220 cattaatctt atgaaactag ctgaagtgac aactatatta aatttaaacc agtttaaata 2280 agtttatatg tttaataact gcaaaaaaga tcaacattgc taatcataaa atacaaacgg 2340 ttgctgaaaa gttataaata atggttttct catttaatac tgaaaacaaa ttaaagtcag 2400 ttataactgc ctattatcaa accacaggcg actcatttta ttacttataa ttggctgcga 2460 cattgccaca tttgcgttaa tacatttttc tcgcctatag gctacataac aataaaacaa 2520 tattctggac aaactaagac attttgcaaa cgtttaaccg tggcagaacc ccaataatta 2580 gaaaaaaaag aagagaaaga caggcagagt gataaaaagg ataaatagaa atattaattt 2640 agaatcacgg atgacttatt taagcgcaat agcccaaata catgtcataa aatattaata 2700 ttaacaatca taatttaaat aatgaaatta tataacctat cccaatcctg acatagctgc 2760 gtttaaataa taactgaaga cgctccttta atgcattcgc acgttatgtt aaattatttt 2820 aaattacacg ccccatccat aaccaacctt aaatcatacc tgaacactgt ttaacagttc 2880 caagtaaact gttttgtgtg acattctcat aaaatctctc tgcttttgaa tatctttgtt 2940 cgtaattgtt cttagccctg tcaatgttta tctattttgt attattataa tatttcctyt 3000 cctgtttctc ttacgtttat attctttagc tgtcaatata atacttcgcg tctatggaag 3060 aaagagtcaa taaagataga caatgtgtaa catcatattc atcaaaagac ggtttatgtg 3120 caagtaactg caccggtccg agcagggagc gagccggcat gggagacaag ctccgataag 3180 attgcagaac aacgtctgtc actggatccg agaactgaga tttacgcgat caccagctgg 3240 cttccgttac ataagggcaa atatagatga ctaaaacctc atttactcat tttactcata 3300 ctttttatca tttataatgt ccaaagcatt gatattaata ttaaatgcta cttttgtgcc 3360 tttaatagcc tttttctcgc ctgtttactg ggttaaaaat cttacattat tccatctcca 3420 cctgctggtg aaagctagag cagctggcga tcttcaatct gcaatctatt gcttttcctg 3480 cagttcccgg atgtaatgtt gaattttaac agtcgaattt gatccactga atttatggaa 3540 gcgaatattt gaactgaaat aaaatgaact gaaaattacc ttttgaatat tatacatcgt 3600 atttttaaag ggtattgaat attttgtaag tgaaatctga aaattgaata tgaattattg 3660 aatttttaag tatgaaaacg tgtttgaaat atttttcgtt gaaatattca aagcttaaat 3720 atacagatat gttaaatttc aggacctaaa aatacaaacc ctggaattca agtcattaaa 3780 atgcaagaac ttgttaatac ggtgtattat ttttttcagt ttgtgatatt caacaagctt 3840 tttaattcaa gactttaggc attgattttg ttccata 3877 // ID BEL35-LTR_DR repbase; DNA; ZEB; 124 BP. XX AC chr16; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL35-I_DR; KW BEL35-LTR_DR; BEL35_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-124 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1414-1414 (2008). XX DR Genome; chr16; Positions 11731779 11731902. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 124 BP; 25 A; 33 C; 35 G; 31 T; 0 other; tgtgtagcga acgcagacgc ggtttcgcgt cgtccgcctc agttttggcc cttgttttga 60 ctcgggaggc gtgtccaaac gccaggtaac cactatataa tgagcacgct agcattagtc 120 ggca 124 // ID BHIKHARI-5-I_DR repbase; DNA; ZEB; 9658 BP. XX AC . XX DT 26-NOV-2008 (Rel. 13.11, Created) DT 26-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from Danio rerio: internal portion - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW internal portion; LTR; TSD 4-bp; BHIKHARI-5-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9658 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1794-1794 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 9658 BP; 2703 A; 2715 C; 1935 G; 2302 T; 3 other; tatattggtg gagagaatgc gggcagtttt ccaaacattt tctgtgaaac aaactttcca 60 attcattgtg atttattaag cgcgctatag aagaaaaaaa aaggaccgcg agaagcattt 120 ccttttcttc aaaactggga ggacgaaaag cctcctccag ccattctgtt cactaaacac 180 agcagagagc tgcccgtgta gagtgtaaca atttaaactg cagttcatat atattttgaa 240 tcgattttgc tgagacgcac gttctgcaaa tcatttaaat atattagtgt gcctgtcgga 300 ggaaaagaca cctctctcag cgcttgtaag acagtttgga aaccccaaac ctgtccgttt 360 aaaaggcctt tatagctaga aaacctctag actattttct cccgtggttt caaactgtta 420 gattagctgc ctaacgttta acctctttgc ttcgccgccg tgcgaaatta atttcagaac 480 tgctggacgg ttctaaatta atgtatttcg ggagctgcaa gcttgcctgt taaccgcttt 540 ctgacgaaag gaattctcag aaccataagc gtgtttttaa atggtaccaa aatctgatat 600 ggatgtgaat tcccatattt cagttatgtt cgtccggacc aaatctgtgt acaaacgtgc 660 accgcctcga gagacgtctc gcattttatg tttgtttgtc taattgtaat ggtcgaatat 720 tattatcaat gttacggcaa catgatatat aatgaccgct tgtgcgaatc ttagtggttc 780 gcgctttgcc ttcgtgaagt cagccgcata ccaagttatg aatgagaact cctcccattc 840 ataaatcggc tcttcgagcg ctcattgact cctcgtccac gtaactcgtg cgcgcgcacg 900 catgcgtgaa tgacgtagtc accaccccaa agcataagat ccgcctttca ggaggcagag 960 ttggtgacgc ggtttacttc acagggcaaa accactcgct agcttctaac gactagcctt 1020 aaagtaaact aacccttcgc gtcttaatat taccaaactg ttagcacaag gctaaccgga 1080 tgcgaagcgt aactcgtaac aacttgctag cggctaacgc cagctaacgt taatgcttaa 1140 gcaaaccaac ccttcacatt ttgtaacatt aacaaacttt agcacaaggc taaccgaatg 1200 tgaagtgcaa ctcggaaaca tctccccctg tctccttctg ctctcctcac atcagttccg 1260 cccaccattg tgtctcaagt ggtctgaaat ggtcaaacag tttggagtta atttgtttta 1320 atttcgatta agcattttat tttccctgtt atcattttat ccaaacaata gtttagttac 1380 ctaagtatag attattggtc tgtttttatt tcttttggta ttatgaatta ttttgttact 1440 ttcaattttt tttattaatt ttctgccttc tgtctttttc attttaatac cgttgtccag 1500 tttattttgt tattacttga tttttaatca agttcccaat aatctattct aaacttaggt 1560 catgtttaat tttaattttg ttggttttgc tatttgggga atagccttta atctctctat 1620 tattatcatt aataggcata atttttcttc ccttttgatt ttttttttac tccatctttt 1680 cctcatgcct tatttattcc ctcacctctt aggttatagg gaaagtataa ctatattagg 1740 gcaagtatta gtctgaagct aaattcagag ctatcaatct atagatccca catctgatca 1800 aaacatgttg ggctaaccca ttaatgtttg tagatgcccc tgaatactaa ggcactcttt 1860 ctactttttc taagccgtcc tgtactgctt ttacctatag gggattatta tgattattct 1920 tttagattct tttatgccga tttaattttc ctttagcctc cctccatcac agggtgataa 1980 ccttctctat ccatcagata ctacagcata ctaagattcc tactccaact ggctcgccca 2040 gaatagtaga tccatagtac tgcccgtatc cggacccgag tcctctgtgc ttgtgtggtt 2100 aagcgtacag gttattcagg ttgcagatca aaagattggg ttcatctgtc tgacataact 2160 gacccctgct gaaggatgcg ttagatacct tggcgttttg tctctccggc ttgtgtggat 2220 cacagtggca tcagcaatcg tactccaaga agcaccgtcc gtgtcaactg gcatacgatc 2280 aatcttggtt aattaccctg tcatgattcc acacacgtgc ccgcctgagt gagattggcc 2340 acctcattct acaggcctag tgcggcaccg gaatcatcag gcaaaaccca attttgaatc 2400 atgccaggcg gtcagtgact ttcttgagtg ctacgtattg tgctaaaagc ctacgctgtg 2460 tgagttcaca accgagttac attataggca agtgatctga tagcgactgc agtaggagtt 2520 ggggaaagtc tagccagata gcacgctctt gcgaccttca tctgattgcc tttcactcac 2580 caccaagtgt aaatagcatg taaaacctag tctaaaccta gcccactaca ggactataaa 2640 acgaacattt tagacatgcc cataggtttt gctattctct ctctctctct ctctctctct 2700 ccctcgtttc tcatccttgt ctgtctttca gttttatttt tattcatatt tactgatatc 2760 catattagta ttttctttca tttttctttc cctgtacact cgttcatccg agcgaggtgc 2820 ctttttgtct aacttatcgt tagtattatt catttttgtt ttttttgggt tactgttatt 2880 tttatttact taattatttt tatttacttt gacctgctca gagaccacgc cattgagaac 2940 caccagggtg atttaaataa ttgattgata atttgcacag cctggcgcat gactaaaagc 3000 cttgaggcca gtacgtagat atcaactgtt atctagccac ccaaattcga catccgaatt 3060 gtccggcccc cagtggatcc ttagaaggac caacgtctag ccggcacata ccccctagct 3120 gacaaccctg tcagactaat aatatagcac aatgctaaca tccactcgat ttggccatgt 3180 gggtctctct ctctctccct tgtgatctct gtctcaccca acagaccagc atgtcaagtg 3240 cgcccagccc gctgctcact gggaccaact ggaggcctgg ttaagtgctt aacccacaac 3300 ctcctgccca acgacgccat ggagctgcag cggccaagca gagaacagct ggacgtctgc 3360 attaatccgc tgagggcgaa cgacctagcc cggagccatg acagccaagc ccagactcaa 3420 ctctccaggg ccctgacaca cctcaccctt gcaccggcca gacccatagg aaagggtctc 3480 aaccagctgg agcatcgagc ccaggacgcc tgcaagacga cgagggaaaa ggacgccgaa 3540 cttctggagc aaatggacgg actacacgcg ccccgatgga gctacaagcg gacaacgcac 3600 acaaggagcg acgagaggag aaagccaaag atgaactgag tgtaaagcta caagctgaaa 3660 cactattgta aagagcagaa ctaaagacag tgaagctcgg gtcaaagcct ctaaagtcaa 3720 cggctgaggg cccgagcaaa attccaacac ggaacccaac aaagagactt ttaagcatga 3780 acctaccaga gttcacccag aactgtcaca ttcctacaac ccatgtgact acctgaaagg 3840 ggaagtgcca cagtcagccc caaaagccaa gactggatag accagcagag gggggtgaat 3900 ccaactcata ccgccccgcc agccagcatt ccagcaacgc ccatcagcaa cacagggcca 3960 ggactccctc caaaggggtc ccccacgaca gacaccatct gtcagccaaa gactttgaac 4020 cagctggcca gggaaatgcc aactttcacc ccaaaccctg ctggaggcga cgacacgcac 4080 ccctaattct aagacattga cttttacact tttacacaga ggccaagtgc cactctctgt 4140 gacaaactgt atctgctaag aattacttcg agccgtgagg caagaagttt cctgaatcaa 4200 cagcgccgtg ccatcaaagc agattacgag ccgatgcaag agctcttact agagtgtttt 4260 acctgacccc gtgtcagagt aaggactgat tgtcgccatg ggtctcagaa cagccaagag 4320 acctcacaag catactgcag tcgactacac caaccctact tcgagacagg aaatgagcca 4380 cggacggagg aggagttcac cttccagatc ctctctctct caaaagcctg ctttctgtgg 4440 tgagtccgca cacgggcaat gccctgcccc cacgccctga ccacccaaca gcggcagagt 4500 ttagctcata aaatctgttc aaaacaaaga ctgctttgaa agactatcaa acacctacga 4560 gctaccctgt ctcagaatct catccggagc tgactttaaa aggcacgtta ctgtttccca 4620 gccgcaagac ttccaaacag tgagttaaag cctttctccg ccagcagagg acagcaccgc 4680 caggaaggcg catgtcccaa caccagacag gatgctcaga gacacactgg gacaagccac 4740 tcaccacaag cagcccaaaa ggagcctaac agcaaccaag gaatgaccaa cctgcagtag 4800 ctaggagctc aagcaaccag catccagcta gttacacgcc agagaaagcc aaaggtgaaa 4860 acctgacacc aaacaacggc acagtggcgt catgagcagc agaactgcta aaaagcttga 4920 aggggcccta caggaagtca agccagactc cttatgacta gaggggggtc caaaaccatt 4980 taccccccct gaacacctgc tttctgctcc atccccgcag agtactgcga gtccaacaca 5040 agctaaagac tgtgtatcta actgcagcat cacgcttccc ccaagaagca gccagtgcca 5100 ccatgcttgc aacctcaggc accagaaaac actggtctgg ttgtctgttt acctcctgag 5160 ccacagccca ccaaccagcc acacctgcgt acaaacaact gctcaggccc ctgcacttca 5220 actcttgggg aacctgacca aaatggcgtg gcaacaacgc gctacttggc tgtaaccgtg 5280 gagatgaaag aaagatgaaa aacgcgcgtc agcacgggag cagacctcac aataatgtca 5340 tctttcctgt ttgagtaact caaagcaaga gctcagagac taaaacgaac acttaaatgc 5400 caaccaggca ggctgatggt gcaaatcata cagccaaact gaggttcaac tcacatacat 5460 cgctcccatt caagtaacta ctgtccccat gactgcagtg caccccgtgt acccgtcgcc 5520 aatgggcaca tacctgctgc tgaatggtaa aaatccgctg gatatgtttg aacctctttc 5580 gaactttgaa cccctttaaa atctgggccc aagtgcgaga actcgctcac agccgtcagg 5640 tcaaccccaa cagactgtca ggtcaccgag gtcgcgggaa actgctggaa cccgcgagaa 5700 cccaaactca acaacataga gggggtcaag gtcgctactc tgcagccttc aaaccagcta 5760 gcaaccccaa cgcttgctgc cccaggcctc aaagggctta atgttgaata accggactgt 5820 gaatgacatt gtacttgcac tgtgagcaga caacccagca gccatcagcc tcgcacggtg 5880 cgacactccg cctgaacaca cccccagacc tgtgttcaac taacaagcat acacactact 5940 gtccaaatgc tgaactctca cacatgacaa ctgccaacag catttgcaca ttgaacttga 6000 atcggaacgg cggatggttg acccatcccg tcctaactct cacggccccg ccgcacgatg 6060 tttacatcag agctaatagc ctgatacatc ttcaggcata tatatagaca cctttaacga 6120 ccggtgatag gcacccacac ttctcacagc caacaacgat ggctgtcacc ctcaagaagc 6180 cccactcagg ccagaccatg ccagaggcct gctctgtcat catcaaacaa gactctgtgg 6240 tgccagccta tacaaacagg tgctctatat ggctctccat acgaaaggcc gaactctaaa 6300 ccatacattg ggtttttctt ccaaccaaca acagaagggg cataaaactg ggcctcaacc 6360 tggaagccat accctaaatt gaactgtcct catgaacagt atatgtcttg tgtaatagct 6420 gcatggcaac tgacataaaa atccccaaag actacctact gggacgattg gtcagctacg 6480 agttccatga ctttgaactc ttgggccaat tagtggcctc accctacatg caacgacacc 6540 agaggggagc ttagaaaaca cagactatac tgttccattc cagctcgcca cactcacgtc 6600 tgtcttacca gtaaacaaag aaccggtagg cagaagtgag ctcacagagg actaactcct 6660 tgccgtgtgc ccagccactc gtcaccccat cactaacatg gatgaagaca gggcactttc 6720 ctcaacagaa cagacatgct gaggccacgc aggatgaggc caaccgacaa acgtcagcac 6780 atcctgtata actaccacgg tgccctcact aaagactctt tagactgcaa gtcccactga 6840 ccttcacatg gaaatatcca actccatgct ggaaaaagga catcgggcta tgcaacaaaa 6900 cctgctcggc acccatctgg cccrtcctca agcccagggt caaaagggtg cctgaccata 6960 gaaagctaaa tcagcaaatg ccattgtcac aagggccaat gacaccactg gaacaggaag 7020 tgcccagaat tcaaggacca accatctcct caacgcttga tgtggtcccc gccttctgga 7080 caatacctgt acatccaaat gaccagcata agctagcttt caccttgaac aacagacact 7140 aaacgatcat gaggtgcccc gtcggctagg ccaactcacc agccgaaagc aacaccctcc 7200 tgaacaaagc atatcccgat gcttatgtta gaggcaacct catctacgtt gatgacatcc 7260 ttaatgaaag taccaccgtg tctggccact caaaggagat agaccatggc ctacaacagt 7320 tgactgtggc tagcgcaaag actaccgcct agcaaggtct ctgaactgag cggctcccta 7380 agggggtgca actactccag acacgtcatt aagaactgtt caaagacttc tcgacccgtg 7440 accaccctgt caaaaagggc tgtccttctg tttgacggaa ggtcaacagc tggccacgat 7500 tgaagtgaga caacgtctgg gcatatgcaa gtaagacact ccttgcgcca gaaggcaaat 7560 actaagactg tgtaacaaca tcgctctgca cggtgggtgc catttcaaca gctattctcc 7620 aacttctttg gagcacaaaa rccatcatag acacctgcca ccaacctgtc atgttactga 7680 acagtcagca catccgggat ggcataaaga ctaacgccag tacggccatg cggccgatgg 7740 ccccctaggg ccactgtatt gacacttgtt acaccttaag ccacaaatcc acgctgggta 7800 acagcatagc tgtttaccaa gactgtacaa ctgacagaca atttgcaacg ctgtagagga 7860 ggaaccccag ccacctctgc caacctgtca ttcaagtact cccaagtact gggatgagaa 7920 tgcgtgccaa ggtatgccca caacctgtgt cgatggatgc tcccacaacc accgcagtat 7980 gcagaaatgg cagccaccat tgtaaccctt caaagcacag cagcccatga catcagagtg 8040 cacttagctg cagactccaa cttgctagac tctgtttcac gttccacctc ccagggggga 8100 aacaaaaggc ttcagaactg cccaccacaa gcccgtgaaa cgtcaacact ttttcttctt 8160 cccaagtagg cgaccacaac actgacaacc acatcatggt cgtctacaga agaaagataa 8220 gaacagttca aactttcagg cactgacaag gacctgaatg actggaccga tgcccttacc 8280 aagaaggtgc atcacacggt ggaccatggt caccacccaa ccaaaacccc gagatctggc 8340 agctgtaacc ggcagccaac aaccctgcct agcctagctg ccacatccaa tctcaccagc 8400 ccgacaaatg tcgacaaacg acattgcgac cacgtaggcr tctgactcct cacgtcagac 8460 cagttttatt tttttacttt tactttgtct cttgttgttt tgtttgtttt ccaccactca 8520 ccccacctac cacccgatcc ttctgtctga cctcagcttc aacacagatg tcactcaaca 8580 cctcctacat cttaaaaaaa aacaggtgcg tgtgaatcct ttttgggttg tccctgaaga 8640 ccccacgaca ccacagttcg tgctgcctca gagccagagg ggggtaatgc taatgtactg 8700 accctgcaca cagcaaaaca tgcccagggc gctgattcac cgagcaccac agcaacacag 8760 agaaaccatc ctctcatgat cagacaccca gatggatcgg gcgggacact tccatgggct 8820 cacaaggagc aacaattttt ttttttgtgt gtgtttttct ctccttcgct ttctcttccc 8880 cttgaactca tatgccagta caccaaatgg gtagagtgtc tctcagcacc gacgacacag 8940 cccagaccat ggcatatctt ctgatgatgt aaacaacttg gacagaggca cccaccctgc 9000 actaccaaca tcacacagga agggcaaaga ctcctggttg tatacgctca gctgcatagc 9060 agccaaccct caaacctacg tatcacaacc agagtaacaa acactgggcc acaagccagg 9120 aaagggggta gagaccctgc aaccggtgtc acagtggaat gcccagccaa gcacagtcat 9180 cacagacacc acctactctc tcccctgttc cctctctctc tctctctctc tctctctctc 9240 tctcttcctt tcctacacag gatacagcca cgatcctgta cctcagtctc accagaagac 9300 cactgcaacc tcacaactat actcctctat cactcatctc tggaaaactg cggaacaact 9360 accactccgt gcctcccgac aacagggaag cacacattaa caagggcctc gccattcaat 9420 aagatgcaag caccagtgaa caagtgcacc gccatactcc cacgagcctt tcagcggaaa 9480 gctgggaaag gaaaggaacg ccaaggagct gcacccacgt tgatcgaaat gaaactccgt 9540 ccagctgact caggaacgga catcgagggg ccctcacttc ctgcccaacc aacaagttaa 9600 agttagatcc caaccaaccg ggttttctcc gaaccacgtt gtttgggaag agggggac 9658 // ID DNA-TTAA0-4_DR repbase; DNA; ZEB; 1707 BP. XX AC . XX DT 04-DEC-2008 (Rel. 13.12, Created) DT 04-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 0-bp; KW target TTAA; DNA-TTAA0-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1707 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2158-2158 (2008). XX DR [1] (Consensus) XX CC This family of transposons insert specifically between 5'-TT and CC AA-3', and do not produce TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1707 BP; 543 A; 337 C; 321 G; 505 T; 1 other; acagtttttt tggattgctt acacactaaa attaaaagta gtacactatt agcaaaacct 60 tacacacaag tatcaaaaca tcagcccata tttgcacaac tataagcaca ttgtcaactt 120 cacacttgtt gcaaaactct acacacagtg atttgcaaaa cactaaacac acttaacata 180 cattacacac taaaatctat catgaagtca cttccttgca ataccaaagc actgactgtc 240 aaattactgc accgtccaac caattggttc aacacagtcg tcaggtgtgc aaacacttgt 300 ttgattaatt gtagacacac caatcaggtg tttaagcact ataaaatgcc agtttcaatg 360 aaaaccatca ggtctcaata tcaacactgg cacacaccat aaaaaaaaaa aaaaaaatat 420 atatatacat attcaaatga agcaaattta tggagtgcct ttcgagcaca atttcgaaag 480 ggtgaaaaga ctgtggcatg attatgcaga ggtatgtatt cactttagca gtgtgatctt 540 gcatactgtc tcataatctt ttactgtact gtaatactag atctacactg aactacacaa 600 ttttgcctga cactgttctt caggtagttt tacgaatgga tggagaggag atccagcatg 660 agttcgttta cgtagatgat ggtgtgttca acctgacgag aacacggagt agaagaaaca 720 tcattggcca cagggctata gtcaatgtcc cagggcaacg tgggggaaat atacagtaac 780 actctgtgca gccattacaa agaatggggt cctccactgc catgcccata tgggccctta 840 caacacagca catatactta cattcttgga ccaattgcac aacataacag cagcaaatca 900 aatcgatcat acgcaataca ttgttgtctg ggacaatgtg tctttccacc gttctgctct 960 ggttcagaac tggttttagc aacatccaca ttttaccgtc ctatatcttc caccatactc 1020 tccattcctc aaccctatag aagagttttt ctcagcatag tggtggaarg tatatgatct 1080 tcgtctccaa gctgaggtac ccctcatcca agccatggag gaggcctgtg accagatgga 1140 ggtagcagca atacaagatt ctacattcaa gacacttctt tccaaggtgt cttgctaagg 1200 acaacattgc aatgttgata attctcatgc cagatccagc taggcgaaga gacaatgtct 1260 aacttacagt tcaatacgta aagtgacatg ttacagtact ttgaatcttc atgccaacat 1320 gttggacatg ttgaggaata aatgagtttc ttcaaggtgc aacattgttc ttgagtagtg 1380 tttggtgaat ttactttata tttgtacttt gttgatggta gcctactcta atcataggga 1440 agtagaagtg ctaaaagtgt tttaggttta tcacagcagt gtgtaactcg tgcaaacagt 1500 gtatagtaaa tgtgaaaatg tgtgtttctt atggtaaaaa agtgtggttt ttaaacataa 1560 gtctatagtt tttataagag tgtttaatta tgcaaaggat ctgtagtgtt ttgctaattg 1620 ggtgtgtggt tgtgctaatt gtgtgtactg ttttgaaaac acgggccctg ttttgaaaat 1680 cgtgcttaag caatccaaaa aaactgt 1707 // ID LTR5_DR repbase; DNA; ZEB; 1841 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Putative long terminal repeat (consensus). XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW solo-LTR; LTR5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1841 RA Bao W. and Jurka J.; RT "Long terminal repeats from zebrafish."; RL Repbase Reports 8(11), 1773-1773 (2008). XX DR [1] (Consensus) XX CC TSD =4 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1841 BP; 534 A; 411 C; 313 G; 579 T; 4 other; tgtaacaaat atggccgacc agacacataa ttattttgct ttttattcaa caaatgagcc 60 aacttttgcc cagggaagac tcgatggtca gttgtaacct gattccaagg tatttatgac 120 ccaaaggaat gtgccatgca gcactgaaca aagaacggtt caggacttct ttaagctcct 180 tgcttaagac tgatgtttta atgtaaactt tactctctct ttatgtgtaa tcatgtgttt 240 tgcaacgttt cccttgctcc tgtattaata ttttggtttt aatacttctg catacaatgt 300 ttaactctac cgaagctaaa caagctcata aactgttcat gtttagastt gttttaaagc 360 taacttgctg tactaattga ttgttgacat tttatgttaa tacgagtgca acatattaac 420 attttaagag acttgaacat ttctgcctca gtttgcatca aatttgccta tgcaaattcc 480 tctgcctcgg acaaagagcc ataaaaccat gattcaggga agttagtttc taattggtca 540 actttccatg gagggtggat ccgaacccgc cctatataag cttgggacca aagcaacgcc 600 tctctctctt gctcygcaac tctctgaaac caaccttttt taaccagctc tctgctctga 660 aactctctgg aaccawgctc tcttaaaccg caacctcttc aagctctgca acctcttcaa 720 actctctgca acttcttcag aaagaactct tctctcaact ctcttctctg aaaagtaacc 780 ctcttcaact caagtcatag caacgaccta agaacgagac ctaagcagag aaacctccaa 840 gaaagcaggc ggagacccag acggaaaggg ttaatcgcac agctcggaca tttcacatct 900 cagatacctt cttttcttta ttttccatcg agagtctagt taagtttagt gcgtcgcaaa 960 ccagtggtca gcacaactgg ccccttttgc cgcttcaaac ttgaaactac cgactccacc 1020 ccacttcaac cctcgcgagr accaagggaa caaccagcca gaaccacgtg ctcaaccaag 1080 ggagcttctt cacatcacac ggacagaccg caagtatcca cttttgtcta cttcctacat 1140 ctgaactggt gtaaactaaa tgcaacccta aagagacaat agaagggtta tattgaattt 1200 gattatgttt ggttcgcttg gtttgctctt acaaactccg aggttttccc atcattacat 1260 tctctcaacc tcttacttta ccttctttac tcttctctac acttgtatga atgtgtgttt 1320 gtgttaggtg ttagtttctg ttagattagt caataaagtt taattttgta taaagaaagg 1380 tgcctgtgta tatttatgaa tctaatgtct taaacttcga tcttgttacc ctgctcaaac 1440 ttaaatgggt ttttattttt gatagcaata gccataataa actcgactgt ccactcgaac 1500 gatcgctgga cgattcgttt gttcggatct aaagagtcga ttcatttgaa gccccgttca 1560 aatgaatctt cctcccgtta agctaatccg ctatatatat aatggtgcag aagatacgcg 1620 agcactgtaa tctaaatcat aactcgcaac attataacgg tggaaaaggc gcgggaagaa 1680 taaccttgag ctaatttatt acagtaataa tggtgtagat atgcgagcac tttaatctac 1740 attataattt gttacagatt attaatgtag atatgcgtgc aaatttaatc tacattatta 1800 ttaatttgct acatgtttat tggtggagaa taaacgaagc a 1841 // ID DNA-8-31_DR repbase; DNA; ZEB; 425 BP. XX AC . XX DT 19-FEB-2009 (Rel. 14.02, Created) DT 19-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-31_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-425 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 531-531 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 425 BP; 158 A; 69 C; 50 G; 148 T; 0 other; taggcctgtc acaataatca atatatcgac ttatcgtgca atacttggag atgacctcaa 60 taatttttgc cattgcaata tatatattta caaattcaca aataatgtta aagccatttt 120 acaatgacat ttacattcta cctcaccggt gtcaaagttt ataattggac atttacctaa 180 taggacattt aatagtatat ataataggac atttaccttt ttaacatcaa attatagaat 240 ctaaataacc ttaagaatga taagtatttt aaagtgttta tggctatttg cattattatg 300 ctgttatata atcattatat aatcagtggc ataaaatgat ctcaaaatga caataatatc 360 gcttatcgca acaatttctg tgacaatata tcgcacaaca aaaattcctt atcatgacag 420 gccta 425 // ID DNA25TWA1_DR repbase; DNA; ZEB; 555 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE DNA25TWA1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA25TWA1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-555 RA Kapitonov V.V. and Jurka J.; RT "DNA25TWA1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(4), 10-10 (2002). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed from 21 copies present CC in sequenced DNA representing 1% of the genome. Approximately CC 2000 copies of DNA25TWA1_DR are expected to populate the complete CC zebrafish genome. DNA25TWA1_DR copies are ~5% divergent from the CC consensus sequence. CC This element is characterized by 25-bp terminal inverted repeats CC and the TWA targets site duplications. CC Its classification is not certain yet, although it is CC expected to be a member of the Harbinger/IS5 or Mariner/Tc1 CC superfamilies. XX SQ Sequence 555 BP; 137 A; 142 C; 158 G; 118 T; 0 other; aggctgattt atacttctgc gtcaaacgcc ggcgtatgct acggcgctga cgcatagccc 60 ttcgccgtgg ccgtcggcgt cactgacgtg cacctctcaa aaaatgtaac tacacgtcgc 120 aacgacgcgt agcgcaagct ctgtgattgg tcggcttggt agcgctgacg agtctgggcg 180 ggaccgagag ccgcgcgaat ggcgcgagcc cgatggagcg attgtttaca agtgtggagt 240 cccgtgaagg agctccggat ggaaagtttt gttttgtgtt tacctcatag ttaaagttgt 300 tgcacgtccg ccggttcctg cctcaaaatg agcgagtttg agccacttgt acatcccgga 360 agtgttcagg aaaagcaaaa cagcagcgaa gaaactcgac acagaggaac atttacacct 420 cactgccaac tagcgtttcg gaagtgttaa tgcagaccaa cagagacagc gcgcagaagt 480 ataaatgcac agccacgcgc gttgcatgcg ccgtgggtta cgccggtcac ttgacgcaga 540 agtataaacc aggct 555 // ID Copia5-I_DR repbase; DNA; ZEB; 4425 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia5-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4425 RA Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1592-1592 (2008). XX DR [1] (Consensus) XX CC 5 bp target site duplication. LTRs are 100% similar to each CC other. The closest elements to this family are from fungi and CC plants. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 831..4253 FT /product="Copia5-I_DR_1p" FT /translation="MSDAEIFMTESLGSAIIDTACTRTVCGEKWLEXYIDD FT LTQDQVNQLMQTETRSCRPFRFGDGNLVYSTKNVKLPAKIGLTKCLIDTEV FT VKVDIPLLLSKTSLKTAGTILDMEKDSAVMFKQSIPLEFTSSGHYCVDIRD FT KDNEKDQTEEKVLTVTESMSPEEKHKVLLKLHKQFGHASADRLQRLMHSSG FT NKDKECFAILQQIVDDCEICKKYKRTKPKPAVGLPMASEYNETVAVDLHEL FT EPSIWYLHIIDHFTRFSAGSIVKTKKSSEIVNSFIHTWISVHGAPQKLYSD FT NGGEFNNQEIRDMAENFNIEIRTTAGYSPWSNGLLERHNQTLTEIILKVKR FT ENGCDWHTALDWALMAKNSMLNVHGYSPYQLVFGQNPNLPSVLVDKLPALE FT GTTVSAKVGEHISALHASRKAFTEAESSERIRRALRKQLRPTDDMYVTGDK FT VFYKRADCPEWKGPGVVIGQDGAVVFIRHGGTLVRVHQSRLCKANTQNQES FT RDKQNEQANSQKVTKKGSLNSVADSSDSEVTDSDQENISDRELCTGEVAYP FT RITQAETDHVVCGDPVSFADVKLKTGQTITFNKQNSDGLHRAKVLGRAGKA FT SGKYKNWFNLQHIKHDGSDGQKESVDMSGVDNLHIESTDREADVLITKDIS FT FDAAKQAEIFNWYKNNVFEEVEDAGQKCVSTRWVCSLKETQNGIVPKARLV FT ARGFEELNIHELQKDSPTCASDSLRLLLAVICQNKWQVHSMDIKSAFLQGM FT QLSREIYVRPPPEVGKENVLWKLNKCVYGLADASLYWYNKVKEIMLSTGSK FT XSKVDPAVFYWLDEQCKVTGVLACHVDDFLWAGSQNFSTNVIPILKSALNV FT GREEHEHFCYVGMDFVTINGVVHVHQHXYIENLQPIHLQAARAVQRDTSLN FT ETEREQLRSKIGQILWVAKQTRPDIMFDVSSLASNIKHATVQSIHDTNKVI FT RKLKSEKVSLKFQHLGNSDALNFVVFSDASLGNLPDGGTQGGTLIGLMGEG FT GKFSPLCWQSKKIRRVVRSTLAGETLALSDGIDNAIFLTTLFSELTTGNAE FT LNTPPLICVTDNHSLFDALKSTKQVSEKRLRLEISSIKELMQSKKIKKVLW FT SDTKTQLADCLTKKGASALMLLKVLSEGLWCY*" XX SQ Sequence 4425 BP; 1473 A; 801 C; 1015 G; 1133 T; 3 other; tggtagcaga ggatggttga atgatggcag ccagttataa agttccgccg aagtttgacg 60 aaacaagacc gtacgagtgc tggaaaaatg aagttaatat ctggacgcgg gtcactgaac 120 tcgataaaaa gaaacaagcc ctcgctgttg cattggggct tgaaggcaga gcccgtgaga 180 cagcgatgga aatatctgcg gaggatttag acagtgatac aggtatgaca acgctgatgg 240 ctaagctgga tgatgtgttc ttaaaagagg agaaggatcg cgcgtatgaa gcgtattctt 300 attttgatgg aataactaag gacagttcgg tttccatggc ggactacata attgattttg 360 aacagcgata taatagaatg aaaaggtata atatgacgct tcctgacgct gtactagcgt 420 ttaagctatt ggatacggca tgtctcgacg acaaaaacag acagctagca ctgacagcct 480 gcacagagtt aacattttca tctatgaagt cggctctaaa acggatattc ggaggaaaag 540 cagcaggctc gtcaaacggg atacaagtga accaggatgt tgcgtttttc accgaacaaa 600 gaccacaaag aagggaaaaa cgtaatgaaa tgtggcaaag tggacattca aagcagccat 660 taccagggac taatccacta gataagtacg gtaagagatc cagatgtgct atttgccagt 720 ctacatttca ttgggcttaa gactgtccac acaagaaaac taaacatgtt aaaataactg 780 aaaatgcaaa cattgaggaa tgtaacataa cattatttac caaagcctca atgtcagatg 840 cagaaatatt catgactgaa tcactgggat cagcgattat tgacactgct tgcacccgta 900 ctgtgtgtgg ggagaaatgg ctggaaawtt atattgatga tctcactcaa gaccaagtaa 960 accagctgat gcaaacagaa actcgtagtt gcagaccatt tcggtttggt gatgggaact 1020 tggtatattc cacaaaaaat gtgaaattac cagctaaaat aggactgaca aaatgcctaa 1080 tcgacacaga agtcgtcaaa gttgacattc ccctcctgct gagtaaaacg tcactaaaaa 1140 cggcagggac cattttggac atggaaaagg acagtgcagt gatgttcaaa caatccattc 1200 ctcttgaatt tactagttct ggacattact gtgtcgatat cagagacaaa gacaatgaaa 1260 aagatcaaac tgaggagaaa gttctaacag tgacagaaag catgtctcct gaggagaaac 1320 acaaagtcct tttgaaactc cataaacagt ttggccacgc gtcggcagac aggctacaga 1380 ggctcatgca cagttcagga aataaggaca aagagtgttt tgctattcta caacaaatag 1440 tagatgactg tgaaatatgc aaaaagtaca aaaggacaaa accaaagcct gctgtgggat 1500 tgcccatggc ttctgaatat aacgagaccg tggcggtgga tttgcacgag ttagaaccaa 1560 gtatatggta ccttcacatc atcgaccact tcacacgttt cagcgccgga agtattgtta 1620 aaacaaagaa gtcttctgaa attgtcaatt ccttcattca cacttggata agtgttcatg 1680 gcgcccccca aaaactgtac agtgacaatg gtggagaatt caacaaccaa gaaatcagag 1740 acatggctga aaactttaac atcgagataa gaacaacagc tggatacagt ccttggagca 1800 acggactact agagagacat aaccaaacac tcaccgagat catcctgaag gtgaaacgag 1860 aaaacggatg tgactggcac actgcacttg actgggccct catggccaag aacagtatgt 1920 tgaatgttca tggctacagt ccataccaat tagtctttgg acagaatcct aaccttcctt 1980 ctgttttagt tgataaatta cctgctttag agggtaccac tgtaagcgct aaagtagggg 2040 aacacatttc tgctctacac gcttctagaa aagcattcac tgaagcagag tcttctgaac 2100 gtataagaag ggcactacgt aagcagctta ggcccacaga tgacatgtat gtgacaggtg 2160 acaaggtatt ttacaaaagg gcagattgtc cagagtggaa gggaccaggg gtggttattg 2220 gtcaagatgg agctgttgtt tttataagac atggtgggac tcttgtaaga gtacatcaat 2280 ccaggctctg taaggcaaat acacagaatc aggaatctcg agataagcaa aatgaacaag 2340 ctaacagtca gaaagtaacc aaaaagggta gcttaaacag tgtagcagat agctcagata 2400 gtgaagtcac agacagtgac caagaaaaca tcagtgacag ggaattgtgt acaggagaag 2460 tggcttatcc tagaataaca caagctgaaa cggatcatgt tgtgtgtggt gatcctgtct 2520 cttttgctga tgtaaagcta aagacaggac agaccataac atttaacaaa cagaatagtg 2580 atggtctaca tagagccaaa gttttaggca gagcaggcaa agcctctgga aaatataaaa 2640 actggtttaa tctgcaacat attaaacatg atgggagtga tgggcaaaag gagtcagtgg 2700 atatgtcagg tgtggacaat cttcacattg aatcaaccga tagagaggct gatgtgctaa 2760 ttacaaaaga tatttcattt gatgctgcta aacaggcgga aatatttaac tggtacaaaa 2820 acaatgtttt tgaggaagtt gaggatgcag gtcaaaagtg tgtatctacc agatgggtct 2880 gtagtcttaa agagactcag aacgggattg tgcccaaagc tagactcgta gccagaggtt 2940 ttgaagagct taacattcat gagttgcaga aagattctcc aacctgtgct tctgattctc 3000 ttaggctatt attagctgta atctgtcaaa acaagtggca agtccattct atggacatca 3060 aatctgcttt cttgcagggt atgcagctgt ccagagaaat ctatgtccgc ccccctcctg 3120 aagtaggaaa ggaaaatgtt ttgtggaaac taaacaagtg tgtttatggc cttgcagatg 3180 catcattgta ttggtacaac aaagtgaaag aaatcatgtt gagtacaggt agtaaawtgt 3240 ctaaagtaga tccagcagta ttctactggt tggatgaaca gtgtaaggtt actggagtac 3300 tcgcatgtca tgtagatgat tttctttggg caggctcaca gaacttctca actaatgtaa 3360 tacctatact aaagtctgca ttaaatgttg gacgtgagga gcatgaacat ttctgctatg 3420 taggaatgga ctttgttact attaatggtg tagttcatgt acatcaacat agktacattg 3480 aaaacctaca gcccattcac ctgcaagcag cacgtgcagt gcagagggac acctccctta 3540 atgagactga aagagagcag ctaaggtcaa aaataggaca aatactgtgg gttgcaaaac 3600 aaaccagacc agacattatg tttgatgtaa gcagtttagc atcaaatata aagcatgcta 3660 cagttcagtc tattcatgac acaaacaagg tcattagaaa gcttaaatct gaaaaggtga 3720 gtctcaagtt tcaacatttg ggaaacagtg atgctctgaa ctttgttgtt tttagtgatg 3780 cttcccttgg gaatctccct gatggtggca cacaaggggg tacactgatc ggtctcatgg 3840 gagaaggagg gaagttttct cctctctgtt ggcaatctaa gaaaattaga cgtgtagtga 3900 ggagtactct ggcgggggaa acgctggccc tgtcagatgg aatagataat gccatttttc 3960 tgactactct attttctgaa cttaccactg gaaatgctga actgaataca ccgccgttga 4020 tctgtgtaac agacaatcat tctttgtttg atgcccttaa gtcaacaaaa caggtctctg 4080 aaaaacgtct tcgactggaa ataagcagca taaaggagct tatgcagagt aaaaagatca 4140 agaaggtact ttggtcagac acaaaaactc aacttgcgga ctgtcttaca aaaaaagggg 4200 cgtcagccct tatgttattg aaagtactca gtgaaggatt atggtgctat tgaaagcacc 4260 aagttaagga ctgagaacac ttaaaaaaaa aaaaaaagaa gggtgaatgt gttcattcaa 4320 aactttcatt taaatataat actttgtttc ttttaatatg ggaactgtaa tattctatat 4380 tttttatgta ttttgttttt ttttttgttt aaagtatagg gagat 4425 // ID DNA8-3_DR repbase; DNA; ZEB; 1387 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE DNA8-3_DR is a non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-3_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1387 RA Kapitonov V.V. and Jurka J.; RT "DNA8-3_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 253-253 (2004). XX DR [1] (Consensus) XX CC DNA8-3_DR is a nonautonomous DNA transposon. Its is CC characterized by 12-bp terminal inverted repeats and 8-bp CC target site duplications. It is expected to be a member of the CC hAT or P superfamilies. Main portion of DNA8-3_DR is composed CC of a 83-bp minisatellite tandemly repeated ~15 times. XX SQ Sequence 1387 BP; 240 A; 369 C; 521 G; 253 T; 4 other; cagggccgga gcaagctgaa ctgccgctct aggcagagga cgatcacgcc gccctcaacc 60 ccaatgtgaa aacgaaagtg accggttatg aaaatgaagt gaggtgtcgc ggacctcttt 120 tctgccgccc tatgcgcgga tataactgag gacgcgcggg tgctgaggga gggagggagg 180 tgtcgcggac accgccctct ctattctgcc gccctatgcg cggatataac tgaggacgcg 240 cgggtgctga gggagggagg gaggtgtcgc ggacactgcg ctctctattc tgccgcccta 300 tgcgcggata taactgagga cgcgcgggtg ctgagggagg gagggaggtg tcgcggacac 360 cgcgctctct attctgccgc cctatgcgcg gatataactg aggacgcgcg ggtgctgagg 420 gagggaggga ggtgtcgcgg acaccgcgct ctctattctg ccgccctatg cgcggatata 480 actgaggacg cgcgggtgct gagggaggga aggaggtgtc gcggacaccg cgctctctat 540 tctgccgccc tatgcgcgga tataactgag gacgcgcggg tgctgaggga gggagggagg 600 tgtcgcggac accgcgctct ctattctgcc gccctatgcg cggatataac tgaggacgcg 660 cgggtgctga gggagggagg gaggtgtcgc ggacgccgcg ctctctattc tgccgcccta 720 tgcgcggata taactgagga cgcgcgggtg ctgagggagg gaggkacnga ggtgtcgcgg 780 acgccgccct ctctattctg ccgccctatg cgcggatata actgaggacg cgcgggtgct 840 gagggaggga gggaggtgtc gcggacaccg cgctctctat tctgccgccc tatgcgcgga 900 tataactgag gacgcgcggg tgctgaggga gggagggagg tgtcgcggac gccgccctct 960 ctattctgcc gccctatgcg cggatataac tgaggacgcg cgggtgctga gggagggaag 1020 gaggtgtcgc ggacgccgcc ctctctattc tgccgcccta tgcgcggata taactgagga 1080 cgcgcgggtg ctgagggagg grakgaggtg tcgcggacgc cgcgctctct attctgccgc 1140 cctatgcgcg gatataactg aggacgcgcg ggtgctgagg gagggaggga ggtgtcgcgg 1200 acgccgccct ctctattctg ccgccctatg cgcggatata actgaggacg cgcgggtgct 1260 gatggaggtg tcgctgacgc cgccctctct attctgtcgt ctatgcgcga atatatctga 1320 ggacgcgggt ggtgagggag gtgtcgcgga cataactgag gacgcctcta tggacgcgcc 1380 ggccctg 1387 // ID Gypsy11-I_DR repbase; DNA; ZEB; 6196 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE An internal portions of the Gypsy11_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy11-I_DR; Gypsy11-LTR_DR; Gypsy11_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6196 RA Kapitonov V.V. and Jurka J.; RT "Gypsy11_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 315-315 (2004). XX DR [1] (Consensus) XX CC Gypsy11-I_DR is an internal portion of the Gypsy11_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy11-LTR_DR. Gypsy11_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one 2006-aa CC polyprotein composed of gag, protease, reverse transcriptase, CC and integrase domains (pos. 132-6149). PBS is complementary to CC Pro-tRNA. XX SQ Sequence 6196 BP; 1944 A; 1254 C; 1429 G; 1569 T; 0 other; taaagtgggg gctcgtccgg gatctccctc tgatacatca atgataagtt cataaagaaa 60 gaagagacaa gttaaattat ctgaatccaa gcacctactg tcttctattc aaaagttcat 120 aaactgtcgc tatggctttg agaaaggacc ccagtcttca agcagagtta acaaaatggt 180 gtgaagcagc cagtattgat tctactcatg ctttaatcct gttaaatgta cctgtttaca 240 cagaagtggc agagattgaa gatgtaatgg aaactgtcaa agctctggga agggttcgag 300 tgagggacac aagagaagga cctactgccc agtctatgat ggttctatgt gagtgccgac 360 aagccactga ctccacccgt attcctcctg aagtgacaag gggggaaaag agtgagccct 420 ggtcagttat cgtgattcaa ccaagtgaaa gtgcctctaa tacagatgct gaaggattct 480 ctgaaaagct gacaaagttc ctgatggagg aagggaaaac tttcagtgac gtaaaagctt 540 taattacacc ccacagtgct ggtgatagtt ctcctgaatc aattatacgt gcaatgggtg 600 aagtactgga aaagaccata aagccaacaa gtgatggaaa tgcataccga cgacttcgga 660 cattttcagc cattgttcca accccaattg gagaggaaaa catggataac tggatagatc 720 aagcaaggct catgataact gagtgtgact gtactgaaaa agaaaaacga cgcagaatca 780 tggaaagcct taaaggacca gcaatggaaa tagtgagagc tgttcgtttc tcaaatccag 840 aagcaggtgc tttagagtat gtggaagcat tagagagttc ctttggatct tcagagtctg 900 gtgaagatct gtattttaag tttcgactta tgcgtcaaaa tgcaggggag gcattatcag 960 agtttctgag gagaattgac aaagctctaa aaaaagtggt ggaaagaaat ggcctgtctt 1020 tcagaatggc cgataaagtg cgagtagaac aactcattcg aggggcagtg aattctgatc 1080 tcatgttgct acagttgagg cttcgagaac gtacagcatg cccaccttct ttcctgagtc 1140 tgctgaagga aatacgtgag gcagaagaga gtgaagcttc tcgccacaga atgtcagcta 1200 aagccaaatc catacagtat tgtgatgaaa ccacaagcac atctgtcatt caagaactca 1260 aaggagagat tcaagagctg cgtgcacact tatgtggagt catgcccaaa actgtttcta 1320 cttcttcttt ggtggttaaa tccagagaga aacctaacca gatggaaaaa acagaagatg 1380 tagaagttca agagttaaag aaacaagttc agtacttaca acaacagttg gcagttatga 1440 gtgtcagtcc ttctcagggc tcactagagg ttacagaaca acggccaaaa ccatcacagt 1500 ccacttcatc atcatttagc cgaaagtcaa ccagaacaaa agatgattat ttctgttaca 1560 aatgtggaca agacggtcat attgcgacaa aatgtcaagc cccacaaaat tcagaccaag 1620 taatccagaa gctaattcgt tctttgcggc aagctaagag ttttaagtcc gaggctaatg 1680 aaagcagaaa aacccagaac caagtttgtt tctctaaaaa gagtcagaca gacataaaca 1740 aagccagcag tattccaaaa ggcctagttg gaccagcatc aacagtagaa gtgacaatta 1800 atggacattt atgtcaagcc ctgttagaca gtggatcaca agtgacaata gtgtttgaga 1860 gttggtactc taagaattta aattatgtac cgattcaccc cttgactgga ttatccatat 1920 ggggcctcag ttcgtccagt tatccttata agggatatat agtagtggat gtcactttcc 1980 cgttctcagt cagtggagta aaggagccgc tgtccatctt ggctctcgta tgtcctgaac 2040 ctcaagggcc ctcccagtta ccagtcatta ttggaacaaa cgccagtttc ttccagcgcc 2100 ttgctactct cagtcatgat gtaaaatcat ccagcactgc acaagcttta agaattcata 2160 ctaaccttcc tgaaatccat ctcaatcagc agacatgtaa aaatacctta atcgaccagc 2220 ctgaggggaa agtacggtgg atgggacctg aaaagtgtgt agttccatct agaggagaag 2280 tactggctgt ctgtaaaata gagactgaaa aacccttgag aaaagaaatt tttgtggtag 2340 actcacctga agaggacaat cttccagctg gtacattcat aactccattt gtgcttccct 2400 tttccaccat tcaagataac agtgtgcaag tgttagtcca taatgaaacc tcaaaggaca 2460 tttcaattcc atcgggtaca ataatggcca atgtgtacct tactgacaca cttactgttg 2520 gttctgggga aaaaagctct caagtaattg acccaagcct gtttaaattt gaagagtcat 2580 ccattcctga gttatgggaa atgcgactcc gtcagaagtt ggcagtcaga agtgatgtct 2640 tttctaccaa tgagtgggat gtgggattag cccaaggagt tgaacataac attagactga 2700 cagacaccaa acctttcaga gagcgctcca gacgcatcgc tcctgcagac atcgaggatg 2760 tgcgacgcca tatcaaagaa cttttagagg ctggaataat tacagaatca agaagccagt 2820 atgcatcgcc cattgtaatt gcaaggaaaa agagtggagc agttaggatg tgcattgact 2880 acagaactct aaatgctcgc accatccccg atcagtacac tactcccagg attgatgatg 2940 ctctggactg tttggctggc agtaagtggt tttctgtgct agatcttcgc agtggatatt 3000 atcaaatagc tatggctgag gaggacaaag aaaagacagc atttatctgt cctttaggat 3060 ttttccagtt tgaacgtatg cctcaaggga ttactggagc tccagccaca tttcaacggc 3120 taatggaaaa agctgttggt gacatgcatc tccttcaagt aatagtctat ctggatgata 3180 tcattgtgtt tgggcgcact cttgaagaac atgaagagag actattaaaa gttttagatc 3240 gactcagaga gtgtggcctc aaagtttcaa ttgataaatg ccagttttgt cagtcccaag 3300 tacgttatgt ggggcacatt gtgtcagctg ctggagtatc accagaccca gctaaagttg 3360 aggcggtaac ccgctggaaa atgcccactg atctgaaatc actgagatcc tttcttggtt 3420 tctgtggatt ttaccgacga ttcataaaag actactcagc cattgtaagg ccattaacag 3480 aactcactaa agggtaccca ccaacatctg gccaaaacaa aaagaccgca gatgtcagaa 3540 aatatcacaa aacgtctgag ccttttggag agcgttggga tgataactgt acagcagcat 3600 ttcacaagat catttattgt ctcacacatg ccccagtact tgctttcgcc gaccctaaca 3660 aatcctatgt tcttcatgtg gatgctagcc ttaatggtct gggagctgtt ttgaatcagg 3720 agcatcctgg tggacttagg ccagtagcct atgcaagccg aaagctgagt gcatctgagc 3780 agcgttaccc cattcatcag cttgagttct tggcgcttaa atgggcagtt gtggacaagt 3840 tccatgatta tctttatggt gctcagtttg tggtgaaaac agacaacaat ccacttacct 3900 atgtgctgtc cagcgccaag ttgagtgcaa caggtcaccg atggttagct gctttagcca 3960 catacaactt tagcttacaa tacaagcctg ggagtcacaa cacggatgct gatgtattgt 4020 cacgctaccc ctttgaatca gctatctcaa cagaatggaa ggaaatttca aagtcaggag 4080 taaaagctat ttgccagctg gcaagaatta atgaaagtga tgagtcatcc tccaggctag 4140 tggatcatct atgtgtctct cctcacagta tcccagaagc atttgcttgt cccacttcat 4200 tgagtttcag tcctatggaa cagttgagcc atgaggagtt aagtaaggct caagatgaag 4260 atcctgtaat tggtatagtg aagcgagaat tggagtctgg acaaatactt actgatacta 4320 ggagttcaga tgattcaatt gctctgttac ggcggcaggg acccaagttg aaaattgaga 4380 ataaacttct gtacagagtc accaacagtt cctgcagaaa ggaaaatata cagcttgtct 4440 taccctcaag gtattggtcc catgtactgc gctccttgca tgatgactct ggtcatttgg 4500 gggtagagag aacaactgaa ctactgagag atcgattcta ttggccccga atgtctagct 4560 atgtggagca gtatgttaag aattgtgggc gatgtgtaac aagaaaaaca gtacctaaga 4620 aggctgcccc actgaaccac ctaaccagca atgggccatt tgacttagta tgtattgact 4680 ttttatctat tgagcctgac tccagaggtc taagcaatgt tctggttgta accgaccact 4740 tcactcgcta cgcacaagca tttgtaacca aagatcagaa agcactaact gtggccaaag 4800 tactgtgcga caaatttttt gttcactatg gcttacccac ccgaatccac tcagatcagg 4860 gaagagactt tgagagtggt ctcataaaag aaatgcttaa catgcttggt ataagaaaat 4920 ccagaacgtc accctatcat ccacaaggag atcctcagcc agaaagattc aaccgaacac 4980 tgctgtccat gctcggtact ctgaatccta cggagaaaca cagatggagt cagcacatca 5040 accgattagt gcacgcttac aactgtacta agaatgatgc aactggatat tccccgtact 5100 atcttctatt tggaagagag gctcgtctgc cagtggatgt gtgttttgga acttcacctg 5160 atggaaaagg agctggtagc catcgacaat atgtagaaag aatgaagtca gagttacaac 5220 aagcatatca gttggcaact gaaactgcac agaaaagtca aaagagaaac aaacgattgt 5280 atgacaagca cataaaacat cagacactgg ctgtaggaga tcgtgtgctg acaagaaatc 5340 tgggtatgac tggaaagaac aaacttggtg acaagtggaa ttctgttcct tacctgatag 5400 ttgaaaagct gaataactta cctgtatatc gtttaaaacc tgaatctgga atgggaagtg 5460 tgaggactct tcacagagac catctcttac ctgttgggga tgatgtgaga ttgtctgatt 5520 caggaaaaat tcaaaataca gctgctccaa cagagatcag gacagaatct gtggggagaa 5580 aatatggaag aggattaaag gaaataaatc agaacatgac aggaagtgga ttgagaaatg 5640 actcagacag tgaagaggat gacttgtggt attactaccc gaatagggtc gatgaaactc 5700 aaaagtatgt aacacaggag gaatcttcaa taaatgaggg agcagtttac aataaggatg 5760 aattggaaac acccgctgca gagatggatg acacatcagt cactgaccag ggccacgaag 5820 cagtattgga gcagactcct gaggagcttg aagatcctgc tctagaaggg ggagatcctg 5880 gaacagccaa tactactaga gaggtagtag agcatgttca tgtgcgacct aagagaaaca 5940 caagacctat agaccgattg agctatgatt gtctggggag acccactaac aagtctttaa 6000 ccttagttca cagaggaatg gttgtaaaag tacaaggtgg ggatagttca ccaaagtttt 6060 gtaatacagt gtggtgccac cctatggcac ggtgtactca ttgtgttcaa gtaagcaatt 6120 gtgctgagcc tgagatcact attgaagttt aatgaggtaa tctcatgagg acatgagatg 6180 tttagaaggg ggagca 6196 // ID DNA-2-4_DR repbase; DNA; ZEB; 1399 BP. XX AC . XX DT 24-MAR-2005 (Rel. 10.02, Created) DT 29-MAR-2005 (Rel. 10.02, Last updated, Version 1) XX DE This is a nonautonomous DNA transposon - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-2-4_DR; KW Interspersed repeat. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1399 RA Kapitonov V.V. and Jurka J.; RT "DNA-2-4_DR, a family of nonautonomous DNA transposons from RT zebrafish."; RL Repbase Reports 5(2), 36-36 (2005). XX DR [1] (Consensus) XX SQ Sequence 1399 BP; 429 A; 265 C; 297 G; 408 T; 0 other; atgcctagtt cacactagag gattttaagc ctgatttgag cccgatttgg aagttaacga 60 gctcgccgac agatcgggct gtgatcggga agaaatctgc gggtgctcgg cgctcgccgc 120 tctttatgtg tgaactactg aacaacgcat caacgaggct cgctgacgcg tcgccgacac 180 ctcgcagacg gaaatccaac attcagcatg ctaaatattc cagagcagtc ggccgactca 240 accccacgtg tggtcagatg tagtgacgag ctgcagccaa tgagagagca tatcagcatg 300 agctgaatgg ctgtgtgttc aggagctcag cagaaacatc tgtgattggt cagaatgggc 360 atcggtgcac tcgcttcatt ctgcgttagc acagacatga gagtaggcta tattaacagt 420 ggaactagaa ttctgtaact attagaatta ccatactgct cattctgacc gttctcatat 480 aaaacgccat attgtcacaa catatttaca ttcaaagcta ttattgagat attaaaatta 540 agctttgaat gtctggcatc atatttaatg acaccattac cctaaaaata taatcttttt 600 ttggtaatac agcctataaa tatgtagtta agatactaaa acacttaaag gtcttggctt 660 tcattttcac tttcaaacag gagctatttc acagcacaga atatgctgtt ttgaaagata 720 tctgaagtaa attgatgttt ttgccatcag aaagttttgt ttatgttagc aggaagttgc 780 taggcaagaa aatgcacaca tatttaaagt cacagcgaaa agtatcagac atgcatgcag 840 tcattttgac caatatggta atttaaggca gaaatgctca gttctttact gttttgtaat 900 aaaaaaaata acaatgtcag aaagaaatac actgatagtt agaaaacggc agggtgttat 960 aaatatgtta gttttatttc aaagagttat aaaattacaa aaattaaaaa ctctctgtgt 1020 atctatccac tcgaaccttg cagatgagtg attaatccgc tgataaatca gctgatttga 1080 cgatgttttt aaatgctttc tccaaagctt gaaacgctgt cagtgatgta atcagcacac 1140 aagcagccaa tagtgttcag ctttttctga cgactcctcc ctcaaaattt cattggctgt 1200 ggtttggtag cttgaatgag ctgatgggga atgagttgtt gtcagatcat gtagtgtgtg 1260 acccccctct tgtggatcgt tcacggagtg ttgcgtagtg tgaacaccac agagattaaa 1320 agacacaaag tcaagtcatg tagtgtgaac ggcacagcga tctgctgatg tttaaaatcc 1380 tgtagtgtga actaggctt 1399 // ID Gypsy117-LTR_DR repbase; DNA; ZEB; 1028 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy117-I_DR; KW Gypsy117-LTR_DR; Gypsy117_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1028 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1522-1522 (2008). XX DR Genome; chr20; Positions 53918662 53917635. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1028 BP; 286 A; 155 C; 193 G; 394 T; 0 other; tgtagctgag ctaataagta taaaatatac caatgcaaat tattagtgaa cttcatcaaa 60 actttacttt taattacttt tgttttatat tttcttttct taatcataat gttttgtata 120 tagttttgat agagatgttt tttatgcgga accatgttgg caaatgttta tggagagcac 180 atattagtgg cgcacacagc aactcatgcg gtgtgtgggc aggcagcgcc atcttcagtg 240 ttgaattcgc cgctggctga gatggttgca tgttctctcg ctctgctaac gcgagcctgg 300 taaggatttt caagcgagtt cactgagtta ttttatgtga cagtgaatgt ataaacgtga 360 taaaactatg atgtatcgaa gatgaatgta aataatcaca ttttgagtgc ttaaagtgac 420 tttcctttct aattggcaga gtggaaaaag agagagacat ttcagcatat tattgtgctc 480 tctgtgtttt tacaggtatg tttattatgt tttcttaagt gatgtttgaa atctaattat 540 acaagttgtg taaaatgagg tctgtttgca aagcaatata attttgcgta tattcatgta 600 ttttatagtg ttaaatgttt atttgtgtat cttttctgat atttgacact atttttcatt 660 ctattgctca ctaatgtagt gaccagtgaa acattatatt ccttttacaa cttatctgat 720 tatactataa tgtttattct ttactactaa tttttgtggt ctatttattt gtatttattt 780 tcttttacct attgtttttg tcaatttgtt ggctattatt aatgcaacga gtgttgaatt 840 cgccgctggc tgagatggtt gcatgttctc tcgctctgct aacgcgagcc tgagtggaaa 900 aagagagaga catttcagca tattattgtg ctctctgtgt ttttacagaa taaaccaaac 960 ttgagacctc caacgccgct gttgtcttat tgggaaaaaa cacacacaac caaagtaaaa 1020 ctgctaca 1028 // ID ERV1-N6-I_DR repbase; DNA; ZEB; 4503 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, internal DE portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N6-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4503 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2173-2173 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 873..2522 FT /product="ERV1-N6-I_DR_1p" FT /translation="MEDMMVKYIAKHGTVGIFDGIEKVVDKPYEWAISKFE FT NFSKMPKNNKGKIANALQAVLASYKVTRKRVVELEAENERLSVEINASKYY FT QASEALWKEEKIRMQXEITRLKTHKDLLASSVETFYINLXEARATCNFFIX FT NGLTENNTSETLNANRVPKMTDIQEDDEEWXRDSQSVSSQCSECTVRFPMA FT PVHSNTTMHITEAGVESTTHNTVRGFNPNELEAVIANIGKFDPTEQDPLDF FT LKRLEEYAEVYKYTDGDACVLLRMCLPETLSGALGQKVKDKTANKSERKQA FT LLEVLGVMSVNWDKISETHMKRGEHPRVFSERLLEMFKTFSGNPDITYDDV FT SFKSALINKCDPHTHSAVTMLVTPLSDYNEIIGKMTQFYNNNANSQRQMKD FT ARYPVAAFASREYSSSSNGFHHRARKFTSVNSMGVVLCYACGKAGHIARXC FT QKSAKSQSIXCHACGKNGHIARHCLEKMRKDQEVVCFRCGIMGHKAIQCHV FT SVKRKTSYKDLHKKIARLENQLEELKGMKEGPIHTFSDTSLHENDCNSVSS FT SC*" XX SQ Sequence 4503 BP; 1603 A; 666 C; 945 G; 1271 T; 18 other; ctggcgagtc atccagcgag gacttccaaa acaacaaagg agagaaaggt aagaaaagca 60 ggttttctta ttaaatgttt gttgttagga atccccctcg ttttaaaaaa atctagaaaa 120 gaaaatttaa gacagaagag aatttgtatt gttcagtttt agcagaagga gttcttaagt 180 ctattctgta ctattctgcc aattttgttt aagattctaa ctaaactgaa ttaacccgac 240 gtagtgtggt agcagaattg tattaaattg ttgtttctaa agtaattgaa gcaaagggct 300 gaagtttagg tttctaaagt gaatgaaact gtgctgaact gtgttgattc tagtggcaga 360 gttgtattgg aaatgttgtt tctaaagtaa tttctaaaaa tatgtttcta gcataagtga 420 aactggttta taaagttttc ataattcttt ataaattcta aagtaactga attgatttga 480 tgcagtgtgg tcacagaaaa tatatttggt agggtgtctg gaactgttaa agatcatttt 540 cttgaggttt tgcctgatgt aatgtagcag tgttttgatt tttaaattac aacacaaagg 600 actgatgcag attggtttta tctgataaca tacagataaa cattgttttg agagtagaat 660 ttgaggcctg atggagagtg tctatgttca atagacactg aagtacagac attttaatgg 720 tcttgttttt tttttttttt tttttttgtt attattcatt tgcacaggag cagtgaataa 780 ttgtatcaag tattaaataa cttggtaatg ttcagtcagt aaagaactaa aatctattga 840 attactcaag tgaacttaat aagattgtga aaatggaaga tatgatggtt aaatacattg 900 ctaagcatgg cacagttgga atttttgatg gtatagagaa agtcgtagat aagccatatg 960 aatgggctat ctctaaattt gaaaattttt ccaaaatgcc aaagaacaat aaaggaaaaa 1020 ttgccaatgc acttcaagct gttcttgcat catacaaggt aactagaaag agagtagtgg 1080 agttagaagc tgaaaatgaa agactcagtg ttgagataaa tgcttcaaaa tattaccagg 1140 ctagtgaagc attatggaag gaagagaaga tcagaatgca gratgagata acacgtctga 1200 aaactcataa agacttgttg gcatcatctg tagaaacatt ctacattaat ttgraagaag 1260 caagggctac ttgtaacttc ttcataarga atggcctaac agagaataac acaagtgaga 1320 cattaaacgc aaatagagtt ccaaaaatga cagacattca ggaagatgat gaggaatggg 1380 mtagagattc tcagtcagtc agtagtcaat gctcagagtg tacagttaga tttcctatgg 1440 caccagttca ctctaatacc accatgcata ttactgaagc aggggttgaa tcaacgaccc 1500 ataacacggt tagagggttt aacccaaatg aactagaagc tgtgattgca aatattggaa 1560 aatttgatcc tactgaacaa gatcctttag atttcctaaa gaggttagag gaatatgcag 1620 aggtgtacaa atatacagat ggtgatgcat gtgtgttgct aagaatgtgc ctgcctgaaa 1680 ctttgtctgg agctttaggt cagaaagtaa aagacaaaac ggcaaacaag tcagagagga 1740 aacaggcact gcttgaagtc ttgggtgtga tgtctgttaa ctgggataaa atatctgaga 1800 cacacatgaa gagaggggaa cacccaagag tattttctga acgattgttg gagatgttta 1860 aaacatttag tggcaatcct gatattactt atgatgacgt cagtttcaag tctgccttga 1920 tcaacaagtg tgacccacac acccattctg ctgtgacaat gcttgtgaca ccactctctg 1980 attataatga aattattggt aaaatgacac agttttacaa caataatgca aattcacaga 2040 ggcaaatgaa agatgcgaga tatcctgttg ctgcctttgc aagcagagaa tattcaagct 2100 catccaatgg tttccatcac agagcaagga aatttacaag tgtgaactca atgggtgttg 2160 ttctttgtta tgcatgtgga aaagctggtc acattgcgag amattgccag aagagtgcaa 2220 aatctcaaag cattgyttgt catgcatgtg gaaaaaatgg tcacatagca agacattgtt 2280 tagagaaaat gaggaaagac caggaagttg tttgtttcag atgtggaatt atgggacaca 2340 aggcaataca gtgtcatgtt tcagtcaaaa gaaagacaag ttataaagat ttacataaga 2400 aaatagccag rctagaaaac cagctggaag agttraaagg aatgaargaa ggacccattc 2460 acaccttttc agacacatcc ctgcatgaaa atgactgtaa ctcagtctca agcagctgct 2520 gatgtgaatt ggagaaaatg tttcatgata gtacaatatt atagagatat ttttactatc 2580 aaaacggggg tgattctgta taatttctaa kaaattattt taaattatat tgattgttta 2640 aatgtaattt atgatacatg ttttaatgta attatttttt ttaaaataat attactgttg 2700 tgtgatgctt taatgtgaga agatacttst gtgggaaagg gggggcaacc tcttcttttt 2760 attctcgttc cagattgaaa ggtgtggtga aagatgatcc agtcatgggt gagacaggcc 2820 agtatgacca cgaggagatt atgcaaagaa caaggttctt caagagaaca gagagatcca 2880 gaaccccagg cccagtattg atcaraactt tyaagcctgg atgagagcca gaaaccaaaa 2940 aaaaaaaaaa aaaaacttga tcacagaaac caaacagaga tccaaagctc caggtccagc 3000 atcaattaga attgtcaaga cccagattac atctagaaac tcagagatgg tgaagatgat 3060 caagaaacca agagatgcya atgagattaa gaaccaagag atgttgggat gctgatgaag 3120 ttctagaaac ccagagacac agaaaagttg gaacagcaca aatggaatac cagtgactgc 3180 tttttcaagt gagacttgtt gtccagatgg aagagactga acagtggata agagaaagag 3240 aagatcaaac catttttaga aacttggaag ttaatgaaaa caaatggtaa aaatacggac 3300 ccacattgaa ggcctaaaag cctaaggcca cagactgaca tatgaagaaa cgaatacata 3360 catacatgca tacatacatg catacataca tacatacata catacataca tacatacatg 3420 catacataca tacatacata cagatagatt atatattgct gtacaacaaa tgcaaaagtg 3480 tggatgtcat cagtatgtcc acactgggga aaaaaaaaaa aaaaaaggtt gcatacagta 3540 atgtagagca tatattactg tccttgaatg cacacttgga taaaagatgt ccatgctgaa 3600 aaaagtggac acaatgtgtt taaaaaaaaa aaaaaaaaaa cacaggaaac ttttcacaaa 3660 caaatgtttg ttgtgywggg agtgtacatt tgttatctag tggatgcaaa tgtggtgtag 3720 ttaatatgaa tgtttaatga ttctaatgtt tgattttaat gtggattwta atgaggatgt 3780 taacactgtc gacacataaa aaaaaaaaaa aaaaaaggga aatgcttgca ctcacaaata 3840 cttgttgtgt agtaaatgtt catgtggtgt agttaatgca atgtttaatg ttyttatggt 3900 gtgattttta atgtggattt cacttggatt ttaatgagga tttgtacgct gataaaatag 3960 acacagtgag aaaaaaaaag aacaatgctt gcataaaatg gacaatacta tgcaatggtt 4020 gtacatatga aaaaaatgta catgctaaga aaatgaacga gaagaaacga tgcttgcata 4080 caagcatgca gacacacaag cacgcacgca cgcacacact cactcacaca cacacacaat 4140 tttctctatt gattgatgtt cttgctcctt tggcttttgg ctgtagtact ttttagactg 4200 atacattaga acattagatg tcatggttct catgtgctcc tgtcctaaac ccaagaaatt 4260 atttatattt tatagttcca aatttgaaga agagactatg acaacttgcc taattggatg 4320 gagcaataac cagtgtgcat gaggtgatga ccccaagacg ggtaagattc tttggtgccc 4380 agccaagaac aacctaagcc agggctttac cgccactggg cacctgccca tgaagggagg 4440 tgtaatgtgt gaaagagtgg ttgtgatgct tctttctggg ccaaagagca tcagagaggg 4500 gac 4503 // ID ERV1-2-LTR_DR repbase; DNA; ZEB; 884 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 02-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1-2-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-884 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2171-2171 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 884 BP; 247 A; 160 C; 171 G; 304 T; 2 other; tgtgatggtt gatgggtcag atgacattaa tcatgtgaga cacaaaaaaa tcactgatgt 60 attctgatat tcctatgcgt aaaagcacaa ctcttaatga tgaagtgaaa cgtaagagaa 120 aggaaaggcg wctccggcag agggcgcgcc gagccaacaa atcagctgga cgaactctct 180 gtgaacccct tgctacttca gatgaccctc ttgctagtag tcgaactctg aatgaactcc 240 caacttgtac ttttgctgaa ctctctaccg agttcacgca cctgaatttg agtgcataac 300 tatatagcac attataaatc gaggcttata ttaaaatgcc taaatgttga tttatatgat 360 acaatgtact cttgatgact atatatctgt ctttttactg ttcaagtgat ttagcatgct 420 ccttgaatgt atgatttaag tagtctgtgc acctgtgttt agggttgatt tatgaatact 480 gtcactgaga cactgagtat aaaagctgac atgtactctt cagctttgcg cttcctgact 540 tctgttcatt gaggttggtt gccctttgtc cgccaacggg ggacaaagta aggatttctt 600 gttttctgtt tgtcttttgt ttgttctact ttttcactgg attttatttt attaataaaa 660 gtgtatttta ttamttttct tcactggagt ggacattgac ttcattttcc agaacccatc 720 taagttaccc tgtgtggtaa agtatgctct aatttattct tttgcaaaat acgttcttta 780 aagacgatcc tttaactaaa ctcttgctaa gaaatatagc taagggagga tcctgatctt 840 aactttgttt aattagggtg attaaataaa gggaaatcag caca 884 // ID Gypsy4-I_DR repbase; DNA; ZEB; 6550 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.05, Created) DT 17-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-4_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW LTR; Gypsy4-LTR_DR; Gypsy4-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6550 RA Dib M.R. and Naveira H.F.; RT "Gypsy4_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 954-954 (2009). XX DR [1] (Consensus) XX CC Gypsy4-I_DR is an internal portion of the Gypsy4_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as CC Gypsy4-LTR_DR.Gypsy4_DR is characterized by 4-bp target site CC duplications.The internal portion encodes two proteins: the CC 536-aa gag Gypsy4_DR1p (pos. 261-1868) and 1546-aa polyprotein CC (pos. 1757-6394, conceptual translation) composed of the CC protease, reverse transcriptase, and integrase domains. Some CC insertions fairly recent, CC according to the hight identity between their flanking LTRs. CC Consensus obtained after the alignment of at least five CC independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 261..1868 FT /product="Gypsy4_Dr_1p" FT /note="Gag-protein." FT /translation="MATERRKLAWSIRRRLFALSNIQLFELAMNIATDNVD FT PVQFKQTDEESCMDYVTSYIQSDTLLQLEDEGLSQLLILSDEIDQIIKKSS FT STESSVSEVAEINTLATPIPHSSNMISDSPPSSDNALENIETQSIAELKRV FT YEELIEQMKQREATVPQPAATISQPSQPQVSSRTHFTQERPATITDLSYLP FT RREFKLFGGQIGDNSSDISYNSLTKQINEGLKEGFTEAEVVRGVLRIVKPG FT TFRDMLMLKDELSIFELQCFLRSHLGEKAVTEMFQELMCARQTEQESPQQF FT LYRMIGLKQKLIFQSKQANTDISYDPKTIQEVFLHTVYQGLGVKYADLRQR FT LRPLTLNNNVADEEILREVTRIISDENEHQRRLGQVTRQKHLQAQSAVLET FT KETIPAKGQNAAQTIQHLAAQVQTLTDMVASLMELKTANQHQFFPPPATQN FT PPYPNSSPFPQQQYSHTSQPPTNQHIAPSKKMFLCQKCTELNLQDCSHCFV FT CGESGHRAVGCLKRTRTQGNGNRSLSRDSQRPAQTLCPSL" FT CDS 1757..6394 FT /product="Gypsy4_Dr_2p" FT /note="polyprotein." FT /translation="VWTSSSWVSKADEDSGKREPVSVEGQPEASTDPLSQS FT VKPAPIQPAKQRTRHQKEPHIYSCQSMKTCQVPLVGKKSTLKCLISGYPVT FT VLFDSGSQVSIVDRQWAKKYITNHPVRPLRELMDDELEVYAVNGQTVPYDG FT WVELTVTLAGHEDPNLTVKAPFLVSQLPLPQPLVGANVLGVIIQRQQSDRD FT ANAVLYSLLRRAFEVDEEQVAAIVNFIQVPQQTDCTPATVRVGKDNVIIPA FT GKAVQVWCRVPPNFGITDPFVLYEPLEDNVALRHLSIGEGLLGVNMTQRPY FT VKIPISNHSKDDVLLPKRTVLGTIQHVAKLIETETLETTSSQSTPITISAT FT GVDKTTPPCDSPTDSWLPPVNLTHLSPDQQKSVEKVLVEECRAFSRDSGDI FT GCIPSLQMEIRLKDDTPVQRAYASIPKPLYREVKEYIQELLVKGWVVKSQS FT PYAAPVICVRKRDGTLRLCIDYRLLNNKTVPDKHPLPRIQDLTDSLGGYSW FT FSILDQGKAYHQGFIAEGSRYLTAFTTPWGLYEWVRIPFGLSNAPAAFQRS FT MEEMLDTLRDECCIPYLDDVLCFSKSFDEHVQVLQKVLKALQRHGVKLKPE FT KCELFRKEVRYVGRLVSAEGVKVDPKDIEAVQALKHRKPQTVGDVRQLLGF FT LSYYRTYVQDFSRIAKPLYDLLQVKTDISHTKPAKGKTKHPQQSSRAPIHW FT EREHQDTLERLIDMLTQPPVLAYPDFTCPFILHTDASQKGLGAVLYQNQDN FT KMRVIGYGSRTLTPAEQNYHLHSGKLEFLALKWAVCDKFKDYLFYAPHFTI FT FTDNNPLTYVLSTAKLNAVGHRWVGQLADFRFDIKYRPGKVNIDADVLSRC FT PLDINTFIKECSEELSQDAVNAVWEGNRRAKQGDVPWVAALSLASLDQPLK FT EPLQTISHDELVQKQMADPVIGKVLEMKVNDITLSEDNRAKADTNTKRLLR FT EWSRLYIENGLLYRKTTERKQLVLPSVYKQVALTHLHNNMGHVGVEKVLSL FT ARERFYWPFMKREIEEYVTRKCPCIKQKRPATHERAPMGSITSNAPLELVC FT IDFLHLESCKGGFEYILVLVDHFTRFAQAYPTKNKAAKTAADRLFNDFIPK FT FGYPSKLHHDQGREFENELFKSLRQLGGVGHSRTSPYHPQGNPVERLNRTL FT LQMLRTLGEKEKQNWKDHLPHVIHAYNCTKHEATGFSPHYLMYGRHPRLPV FT DLLFGLQTADDETPHGFADKWKGKMVEAYRIANTNSQQSSLKGKVHYDKRC FT KGVTLQPGDRVLVRNLNERGGPGKLRPYWEQTIYIVREQVGDNPVYKVSPE FT TGGRPIRTLHRNLLLQVNDLPVEIVQHPTAKPQRRDKRPKERSKTPDQSQS FT PENSDSEGETPLYWFPAPVSHPQIEHPQATKPTHIVPHHRPESHKNSQVEQ FT DTYMNAEKEQVISMDNVENADLPLSDDVQEEPQLHEEMEIQEYSPQPVSCL FT GPEQDLETPPRLRQSTRQRQPKRIFTYPSLGQPDYRTCPTVNTVNVQPTMY FT PLLHYHAFYPQPPPITSNPYLQMPYAVACF" XX SQ Sequence 6550 BP; 2060 A; 1590 C; 1422 G; 1478 T; 0 other; aactgtgggg gctcgtccgg gatctctcct cttcagtcgt cgaggggcag atccagtgat 60 acaaggaccg tgacaacgat tgggccacta tttcagctaa gccagccagt gggtgtccaa 120 ggagagacga cgatcaaaag tgaaaggtat cctcgcctca aaaagggaac cggtctgctg 180 taaagacaga atagtgcgca cttatcacta cagagctaca gtaaatggta aaaagaaata 240 actaaaatta tacaagcaga atggcaacag aacggagaaa attggcctgg agcatcagaa 300 ggcgcttgtt tgcactctct aacatacagc tcttcgagct agccatgaat attgctacag 360 acaatgtgga cccagtccag tttaaacaga ctgacgaaga aagctgcatg gactatgtca 420 cttcctacat tcagtctgac acactgctac agctagagga tgaggggtta agtcagctcc 480 taatattgag tgacgaaata gaccagatca ttaaaaagag ttccagcacc gagtcctcag 540 tcagtgaagt agctgaaata aatacactcg ccacgcccat tccccattcg tctaacatga 600 tttcagactc accaccatcg tctgataatg cattagaaaa catagaaaca cagtcaatag 660 cagaattgaa gagagtgtac gaggagctaa ttgagcagat gaagcagcgt gaagctacag 720 tgccccaacc cgctgcaaca atcagtcagc ccagccaacc acaagtaagc tctaggacac 780 acttcactca ggagagacca gccaccataa cagacctttc ctatctgcct cgtagagaat 840 ttaaattatt tggaggacag attggggaca acagctctga catcagctac aacagcctca 900 ctaaacagat taatgagggc ctaaaagaag gatttacaga agcggaagtt gttaggggcg 960 tactgagaat tgtcaaacct ggcaccttta gagatatgct tatgctaaaa gatgaactgt 1020 caatttttga acttcagtgc tttctgaggt ctcaccttgg agaaaaagca gtaactgaaa 1080 tgttccagga actaatgtgt gcgaggcaaa ccgaacaaga gtcaccccaa caatttctgt 1140 accgtatgat tggacttaag caaaaactca tctttcagtc caaacaggct aacacagaca 1200 tttcctatga cccaaagact atacaggaag tgttccttca cacagtctac caaggtttgg 1260 gcgttaagta tgcagacttg aggcaaagat taagacccct tactttaaac aacaatgttg 1320 ctgatgaaga aatactacga gaagtcacca gaataattag tgacgagaac gaacatcagc 1380 gtaggctagg tcaggtaact cgccaaaagc atttacaggc ccagagtgct gtacttgaaa 1440 caaaggaaac aatcccagcc aaaggacaga acgctgccca aacaattcaa cacctcgcag 1500 cccaggtaca aaccttaacc gatatggtag ccagcttaat ggagcttaaa acagcaaatc 1560 aacaccaatt cttcccgccc ccagccactc agaatccccc ctaccccaac tcaagtccat 1620 tcccccaaca acagtacagt catacctctc aacctcccac caaccaacac atagctcctt 1680 caaagaaaat gttcctctgt caaaagtgca ccgagctaaa cctccaggac tgtagccatt 1740 gctttgtgtg cggtgagtct ggacatcgag cagttgggtg tctaaagcgg acgaggactc 1800 agggaaacgg gaaccggtct ctgtcgaggg acagccagag gccagcacag accctctgtc 1860 ccagtctgta aagcctgctc caatacaacc tgcaaaacaa cgcaccagac atcagaagga 1920 accccacata tactcttgtc aaagcatgaa aacttgtcaa gtacccctag taggtaagaa 1980 gagtacctta aagtgtttga ttagtggcta ccctgtaact gtattgttcg actctggctc 2040 ccaagtaagc atagtagaca ggcagtgggc taagaagtac atcaccaacc atcctgtaag 2100 gcctctaaga gaattgatgg atgatgaact cgaggtttat gcggttaatg gccaaacagt 2160 cccatatgat ggatgggttg aactcacagt tacactcgct ggtcacgaag accccaatct 2220 cacagtaaaa gccccatttt tagtcagtca gttgccactt cctcaaccat tagtcggagc 2280 aaatgtgctt ggtgttatca tccaaagaca acaatcagat agggatgcta atgccgtcct 2340 ttacagcctt ctacgcagag cttttgaagt tgatgaggag caagtggcgg ccatagtcaa 2400 tttcatacag gtgccgcaac aaacagactg taccccagcc acagtgaggg ttggcaaaga 2460 caatgtcatt atcccagctg gaaaagcggt ccaagtatgg tgcagggtac ctccaaactt 2520 tggcatcact gacccctttg tcctgtatga accccttgaa gataatgttg ctttaagaca 2580 tctgagtata ggagagggac tattaggagt taacatgaca cagagacctt atgtcaagat 2640 ccccatatcc aatcactcaa aagatgatgt gctcctgcca aaacggacag tgctgggaac 2700 gatccaacat gttgccaaac taattgaaac agaaacacta gaaacaacct caagccaatc 2760 tacaccaatc acaatatctg ctacaggtgt agacaaaacc acccctcctt gtgattctcc 2820 aactgattct tggctgccgc cagttaacct tactcacctc agtccagacc agcaaaagtc 2880 agttgaaaag gtgttagttg aagagtgtcg agcgttttct cgtgacagtg gtgatatagg 2940 ttgtattccc tcactacaaa tggaaatcag acttaaggat gacacaccag tgcagagagc 3000 ctatgcatct atcccaaaac ctctttacag ggaagtcaag gagtacatcc aagaactgtt 3060 agttaaaggg tgggttgtaa aatctcagtc cccctatgca gcccctgtca tctgtgtgag 3120 gaagagggat ggaacgctac gattgtgcat cgactaccgc ctcctaaaca acaaaacagt 3180 cccagacaaa catccacttc cgcgaatcca ggacctcaca gactcacttg gaggatacag 3240 ctggttctca atcctggacc aaggcaaagc atatcatcaa gggttcatcg ccgaaggatc 3300 aaggtacctt actgcattta caaccccatg ggggctctat gagtgggtgc gaataccctt 3360 cggtttgtcc aatgcaccgg cggccttcca acggagtatg gaggaaatgt tagatacact 3420 gcgggatgag tgttgtattc cctaccttga tgatgtgctc tgtttttcaa agtcatttga 3480 tgaacatgtc caagtgttgc agaaagttct caaggcccta caacgccatg gagtcaaact 3540 aaaaccagaa aagtgtgagt tgttccgcaa agaggttcgg tatgttggtc gcctagtgtc 3600 tgctgaggga gtgaaagtgg accctaaaga tattgaggca gtgcaagcac tcaaacacag 3660 gaaaccccag acagtaggag atgtcagaca actcctggga ttcctaagct attatcgaac 3720 ctatgtgcaa gacttttcac gtattgcaaa acctctatat gatctgcttc aagtcaagac 3780 tgacatatcc cacaccaaac cagccaaagg caaaacaaag catccccaac agtcctctcg 3840 agctccaata cattgggaac gagagcacca agatactctt gagcgcttga ttgacatgtt 3900 gacccagcca ccagtgttag cctatcctga cttcacttgc ccctttatac ttcatacaga 3960 tgcatcgcag aaaggtctag gagcagttct gtatcagaat caggacaaca agatgagagt 4020 aataggttac ggctcacgca ctctaacacc tgccgaacaa aactaccact tgcacagcgg 4080 aaaactagag tttttagccc tcaagtgggc agtttgcgac aaattcaaag attatctgtt 4140 ctatgcaccc catttcacaa tcttcacaga taacaatccc ctaacctatg ttttaagcac 4200 agctaagcta aatgctgtcg gacaccgctg ggtcggacaa cttgccgact tccgctttga 4260 tataaagtat cgcccaggga aagtcaacat tgatgcagat gtgttgtctc gctgtccact 4320 tgacatcaat accttcataa aagagtgctc ggaggaactg tcacaagacg cagtcaatgc 4380 agtttgggaa gggaacagga gagcaaaaca aggagatgta ccatgggttg cggccctcag 4440 ccttgcttcc ttagatcagc ccctcaaaga gccactgcag actatcagtc atgatgaatt 4500 agtccaaaag caaatggcag atccggtgat tgggaaagtc ctagaaatga aggtaaatga 4560 tatcacacta tcagaagata accgagctaa agctgacact aacaccaaaa gacttttaag 4620 agaatggagt cgattataca tagagaatgg ccttctgtac agaaagacca cagaacgaaa 4680 acagttggtc ttgccctctg tttataaaca agtagctctc acacaccttc ataacaacat 4740 gggccacgta ggcgtagaga aagtcttgag tctggccagg gagcgtttct attggccttt 4800 catgaagcga gagattgagg aatatgtcac gaggaaatgc ccctgtatca aacaaaagag 4860 accggcaaca catgaaagag cacccatggg aagcataaca tccaacgccc cacttgaact 4920 agtttgtatt gactttctcc acttagagtc ttgcaaaggt ggctttgagt atattctagt 4980 tttggtggac catttcacca ggttcgccca ggcgtacccc actaaaaata aggcagctaa 5040 gacagcagca gaccgtcttt ttaatgattt cattccaaag ttcggatacc catctaaact 5100 ccaccatgat cagggccgag aatttgaaaa tgaacttttc aaatctctca gacaactcgg 5160 cggtgtgggc cactcaagga cctccccata ccatcctcaa ggcaaccccg tcgaacgact 5220 gaatcggact ctgttacaga tgcttcgtac cttaggagaa aaggagaaac aaaattggaa 5280 ggaccatttg ccccatgtca ttcacgctta taattgtact aagcatgagg ccacagggtt 5340 ttcaccacat tacctgatgt acggcagaca tccccgtctt cctgtagatc ttctttttgg 5400 cctccagaca gcagatgatg aaacaccaca cggatttgct gataagtgga aagggaagat 5460 ggtggaggca taccgtatag caaacacaaa cagtcaacag tcaagcctta agggtaaagt 5520 acactatgac aaaagatgca agggagtaac actacagcct ggagacagag tactggtgcg 5580 caatctaaac gaaagaggtg ggcctggcaa attaagacct tactgggaac agaccattta 5640 cattgtgaga gagcaagttg gagacaatcc agtgtacaaa gtaagtccgg agacaggagg 5700 acgtccaatc cgtacactac acaggaactt gttgttgcag gtaaatgact tgccagtaga 5760 aatagttcag catcccactg ccaagccaca gaggagagat aaaaggccca aagaaagatc 5820 aaaaactcct gaccaatcac aaagcccaga aaacagtgat tcggaaggtg agacacctct 5880 ctactggttt cccgctcctg ttagccatcc tcaaattgaa cacccacaag caactaagcc 5940 aacccacatt gtgccacatc acagaccaga gtcgcacaag aattcccaag tggagcagga 6000 cacatacatg aatgctgaaa aggaacaagt gataagcatg gataatgtgg aaaatgcaga 6060 cttacctctg tcagatgatg tccaggagga accccagctt catgaggaaa tggagattca 6120 agagtatagc cctcaaccag tatcatgtct tggaccggaa caggacctag agacaccacc 6180 ccgcttgaga cagtccacta ggcagagaca gccaaagagg atattcactt atccatcact 6240 aggccagcca gactatcgga cttgtcctac tgtgaacaca gtcaacgtcc agccgaccat 6300 gtatccttta ttgcattacc atgcctttta tccccaaccc cctcccatta catcgaatcc 6360 ttacttgcaa atgccatatg cagttgcttg cttttgaagt gaaagacact aagtaaaagt 6420 tcagagacac tgcataggag attgttttga gacttttgaa agttcataga tgttaattgt 6480 tgctatgtgt attgttatct gtcagttgtt taactgaagt gtcaggagcc acttttattg 6540 ctggggagcg 6550 // ID DNA-TTAA0-5_DR repbase; DNA; ZEB; 933 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.01, Created) DT 08-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 0-bp; KW target TTAA; DNA-TTAA0-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-933 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 12-12 (2009). XX DR [1] (Consensus) XX CC These transposons insert specifically between 5'-TT and AA-3', CC and do not produce TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 933 BP; 280 A; 172 C; 141 G; 339 T; 1 other; acagtttttc tcagtcactt tggtccattt ctcagatcag aattgaaatt ctcaaaactc 60 acctgttcaa tcttcacatc attgtgtcac ttgtgcacat cagaaaagca gtttctcatt 120 cctttatgaa caagttgcaa atgcttttgt acatccatgc aaatgattat gtgcaattct 180 ctgatgtttc ctacattatc aattgcttat gtcatgttga tcaaaattgt attattttgg 240 tctctgttga atagtctcac cccccacaac atttaatcgt ttgttcatgg catacattta 300 aagtctttac atgcaaaatg gctgaacaag ttgtcataat attttgtcat aatgcatttt 360 ttttctatac atttccatta gactttttkt tctaaatctg tctacaattg gtaaatttct 420 cccaggtgaa tcttgacttt ttctaatgaa ggtgcatctc aaaggagatt gtcctatgtt 480 cacatttcta cttttgtttt gtttttttgt tctgtgcagt gctgtctttt cctttctgta 540 tcacaatgac atgatctgtc aacaaattac agtacaaaca gcaaaaataa ataaataaat 600 aaataaataa ataaaataaa aatattgccc atagtttaca tcagaacacc cctccagagt 660 acactgttat attgacaaca tgactaagta atttgactgt cttatccata cacaatgaca 720 caaggacttg tcattctgac ggcactgaca cgttcattga cacagaaatt tagttttgag 780 agatgaacta aggattttga gcaagagact ggcttttgca ggtaatccgt ggtgttttgc 840 tatttgtacg cgttgttttg agaaatgcac ttactgtttt gcaaatttca attctgatct 900 gagaaatgta ccaaagcgac tgagaaaaac tgt 933 // ID DNA-8-9_DR repbase; DNA; ZEB; 3719 BP. XX AC . XX DT 13-NOV-2008 (Rel. 13.11, Created) DT 13-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE DNA-8-9_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3719 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1724-1724 (2008). XX DR [1] (Consensus) XX CC Copies are ~96% identical to the consensus. TIR is 14-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3719 BP; 1160 A; 646 C; 701 G; 1209 T; 3 other; cagtggtggg agtaacgaat tacaactact cacgttactg taattaagta catttttggk 60 gtaattgtac tttcatgagt agaattttaa gcctgtaatt ttacttgtac ttgagtacaa 120 attaagctga gtaatctact tcgttacttt taaaatcaca attcgttaca gagtactgta 180 aataaattaa tatttcagcc acgcactgac cagcatttcg gtagccaata aaaaaagctg 240 atcttaacgg accaatgaaa agcctggtac attatttgaa ccgaaaaaca agttcaggct 300 actgcagatt tacgtgaccg tcatcataca gtcacaaagt tatcaattga catatggaga 360 tttaagacag aagaaatgag gaagaaatag cagaggatga atgcccatgg ccacacccgg 420 aggagctgtt cacctacagg tacataggga agaaagggga cagmtgtgct gatgtgaata 480 gctatgttat catctaaaat gctaattaaa tgtatgccca aaactggaat atcgcataaa 540 agactgcaaa taaagcagcg tttctgtcca tcaagtcaaa gagaacaaaa ttgccacctt 600 ctgattaact ggcgacaaat atcaaaagta aaaacagaat tagcatcagt aggagcagct 660 gcatgaataa attatcttca tttaataagt gacttgcgca tcagaaggca atcctgcacg 720 cagtgaacgc gcgctggcgt ttgaaggcgc gagacacgga gtgcaggcac tcttgacaat 780 tctggaggtc attattaata taatataata ctgaaatggt tgaggcggtt tagaatgacc 840 aaaacaacac tttagatgtt ttacaatgtg cttagcctgc tggtttgtcc attcacacac 900 attttcaaaa tatatgcgca ttatagcgtt tccatcaacc gtttttatgt gcatatccaa 960 aaagcgcaaa aaataagtgg atgaaacctg ccctctgctg tgccatccgg cttacacagg 1020 gtttccgtgg ggtcttaaaa tgtctaaaat ataaacatct aaattttagg ccttaaaaag 1080 tcttaaattc gctgttctag gtctgaaata tgtttgcaca ggtcttattt ttccgatgta 1140 catgtaacgc tactcctaat gctcatttaa attatttttt ttgttgttgc ttggtttcgt 1200 ggtggtgtag ttctttattt cactagtcct attgtaattt gcagtattaa aactacaaat 1260 aagaaaagac cagcatgcaa tagctcagtg cgcgcggcga atgtgacgtc atcgccgtgt 1320 ttgcggaggt tcactttgga tgcgcgcgac atgatttcgg gtgaacagtt cgcgcggcgc 1380 cgcgcttgat ttgagttgaa tttgaatcac tttgaagaga ttttgtctgt aacagtacga 1440 ctgtacagac atcttaacta tccgaccatg cgtgatcagc atagagataa ccagatgacc 1500 aataattctt ggctagaaat tgcaacagcc atggaaagga agtgttttgt taaagtttgg 1560 aaaaacctgg gggataagtt tgttacaaca aaaaagaggc catggcaaaa gtggagaccc 1620 aggtggtttt aagattacag aatatgtttt tccaccattt ataggtttta cactgatgta 1680 tcattttgaa gttaaagcaa tgtaacaaaa attaagtaac aaattaagta tttgataact 1740 ggaaaataat atatgaaccc atcggcttac agtatactga wgcccaattt ctaggcttta 1800 ttggtgatga tggacaggat tgttggacca ttattgcctt ttcagctcat tagtagagga 1860 cagaaactag ccagacaata atgtgtcaat tgttgagtgg gcaaaaaata atatatatat 1920 ttttttgcta ttattcttgc cagaataaaa aaaataattg tagagtaacc catgttctgt 1980 tgtcattgat atttaacttt aataggtaga actgtccgag atataaacaa aagtgagtga 2040 tgcatcaaac tttgatttaa aaaatcttga atagttataa aactttataa tcattcatta 2100 attttctttt tggcttagtt cctttattaa tctggggtcg ccacagcgga atgaaccgca 2160 caatttatcc agcatatgtt ttacacagcg gatgcccttc cagctgcaac ccgttctcat 2220 ttacaacaac agacaattta gcttacctaa ttcacctaaa atgcatgtct ttggactgtg 2280 gggaaaaccg gagcatcttg agcaaaccca cgcaaacacg gggagaacat gcaaactcca 2340 cacagaaatg ccaactgacc cagccaaggc ttgaaccagc gacctttttg ccttgagtcg 2400 agagcgctac ccactgcccc accccgtcac ccaactttgt aatcataata aattataaaa 2460 ataatgaaaa ataattattt ttaaaatctt gaacaatatt aaagatatga cgtagttctg 2520 aaaacgtttt acttctctgt ttatccgtct gactttacag tgaggtttct tttgactggc 2580 ccctgatgtt ttaaagaata tcacaatgac gaacacgtct agtattaaag cctcatccat 2640 ttcgtgaggt gcgcgcacag tcagcggagg tgtgcaatgc ggagccaggc gaaagtataa 2700 atcagcctca aaatgtttgt ttctttgttt ttttgttcct gcacttcaat ggtgcatcta 2760 acctgtcttc agtactgttc tatttaaata attttatttt accacaaatt ttgtgttttt 2820 gcattttctc ttgcatgctt ttgatattgt gatataggtc ttaaatttaa tactttatgg 2880 tcttaaagag gtcttaaaaa gtcttaaatt tgacattatg atatctgcag aaaccctttt 2940 tttttttaca agacttcagt ctataatcta aaaaaaaaaa cattatggta agaagacata 3000 acttaatttg tgtgatattt aatttattaa taatatgttt aggcactagg cagtgttgtt 3060 ctgtggtttc tgacggttca gttcgcgaat taggtaatta acaaagtttc tccataattt 3120 gcaagtgttt tttttttatt tattaaatca tctattgatc gtacagataa aagaggcatc 3180 atttgaaatg ataaaaagtc gtgcatttgt gcatatttct gttatcttta tactcaaaat 3240 agcaaaacat tgtttttgta aaataatcat ggtgcgcagt gggtattgct atcgcctcaa 3300 agcaaaaagt ttgttggttc aagccctggc tgagtcagtt ggtgtttctg tgtggagttt 3360 gcatgttctc cccatgttgg cgtgggtttc ctctataggt gctctataat tataatctat 3420 aattaatgct gtttacacag ttttaaaagt gtttatagta tctttaaatg tgttatgtaa 3480 aatgtgtttt actgcctgga tcttgtctga ctccgcattt atggataatt ccaccaatac 3540 cgccaagcta aaaaagtaat ttgtaatttt gtaatttgag tagtttttac gaagagtaat 3600 ttgtactttt acttgagtac tttttaacac cagtactttt acttgtaatt gagtacaatt 3660 tcagcaatgt aacagtactt gtaattgagt acaatttttt tgtactctta ccaccactg 3719 // ID BEL5-LTR_DR repbase; DNA; ZEB; 503 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL5-I_DR; KW BEL5-LTR_DR; BEL5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-503 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1362-1362 (2008). XX DR Genome; chr1; Positions 53862527 53863029. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 503 BP; 153 A; 120 C; 102 G; 128 T; 0 other; tgtcctatcc ttattatagg agccatatta cttttcattt aattataatg tttatttgta 60 tactgattgg tttgggtttc acagtgacgt cttatgtttt ggtcacgtgg gcataattgg 120 cacctgctcg aacatataaa gcgcgtgcac aatcacacag ggtggaggag agaaggaaag 180 tggggtagcc gtattttttg ttgaccgcaa acagccacgc aaacagccac gcaaacagca 240 gcgcaaacag ccacgcaaac agcaacgcaa aactgtatca caacggcaca cgcaacaatc 300 atcacaccaa cgccataatc ggcaccatac aggtattcaa agacacaaat tgctgtattg 360 tttttacatg cttcaagtta cataaagtct ttccatataa catcttgggc tcagtgtctc 420 tgtctaaagc caccagcgcg tcacaaatat aaaggaccct cactgcctct cgagcgactt 480 atagagggca tcaggaagtc gca 503 // ID DNA-X-3_DR repbase; DNA; ZEB; 1474 BP. XX AC . XX DT 20-OCT-2008 (Rel. 13.1, Created) DT 20-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE This is a nonautonomous DNA transposon, - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1474 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1252-1252 (2008). XX DR [1] (Consensus) XX CC The consensus was built from copies which are ~90% identical to CC it. This transposon was inserted specifically into 4-bp ATAT CC target, but the TSD is not certain. The TIR of this element is CC 28-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1474 BP; 479 A; 257 C; 256 G; 482 T; 0 other; gaagagttca gatgcaaaac cctctaaatc catcagacct cttttcttgt aaatgagcat 60 tttctatcag gctcctctga ttaggttcag aagtttcatt ttatatgtaa tgataaggtt 120 attagctagt aaataaaata cctttttttt tttaaatgtc actttagaca attctttggt 180 ttttaacaag gtaaattttc ttttgcgtca aaaccagcta aatccacttc tgcttttttt 240 ttgcaaaaac ctctaaatcc acctattggg acaagacagt gtaattagtt gaaaaatcac 300 tcactaaaga tgcaattaga aattatttta ccaaacataa aacacattac acaaaccacc 360 taaaattaaa aatatataaa tctgtcaagt aagtggcggt tcattccgct gtggtaaccc 420 ctgataaacc agggaaaaag cagaaggaaa atgaatgaat gaaatctgtc aagtaagtgc 480 attaatcaat aacaattaaa actgacatta attaaatctt aacaatctgt tgtcatttct 540 gatatttggg atttagattt aatgtaagta gagcaatgta aacattgcaa acattgtaaa 600 ctaagcttgg tagattcaaa taatgtagtg taaaaacatt gtgaaacatc aatatctgtg 660 cattgtccat taatataaaa agcttatcag acgtaatagg tattatgaag taatataaat 720 aatgtatagt tttatacatt atatttatct ttttaaaaaa atagcttaaa ttagcaaata 780 aaacacaagg taggcctact gggcaaaaag gggatgcctc tcagacaatt ttagaagctg 840 tcattcattc attctcacca tacttaaggt cttggtctct tgtttatcct catagcatcc 900 atgtgggata aaagaacggc atcttaactc tgtctttcgt gaaatggagt tgccgtttaa 960 tgtatagtaa atcggactca ttcaaagtag cgtcgctgcg caaaaaaacg ttatgaatgc 1020 atgttacact tccgactgta cattgtgttt tctttttctt ataatgcaca gagttttgag 1080 gtaagggacg ttttcatttg ccattcactg acagtcggac aggctcatcc agttcatcaa 1140 actccatctt ttaaaatcga aatcgaaagc ggaataaaac agtctcctca taaaagccgg 1200 cgtatcagcg cgctgctaca ttgaaaacat atgattcgat tcgcgccttc tgattggttc 1260 tcgggatata gcggcttttg ctgtcaaagg caagtggcgt ttagcggctt ttgccaacaa 1320 actgttattt ctgaatgcgc tgacgctggg atttagatga tttgaagcaa atctatttgt 1380 tgatggtgta ctacgtgcct aaagcattca ctcaatgtca ttatctcatt tttggcggaa 1440 gtggcgttta gcggcttttg catctgaact cttc 1474 // ID L1-12_DR repbase; DNA; ZEB; 5787 BP. XX AC . XX DT 21-NOV-2008 (Rel. 13.11, Created) DT 21-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE L1-12_DR is a non-LTR retrotransposon from the L1 clad. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5787 RA Bao W. and Jurka J.; RT "L1 non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1743-1743 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 113..1171 FT /product="L1-12_DR_1p" FT /translation="MMEGEKIKKGKGKQLGKADKGSNLEDTAVAEEQGERN FT GGQEENLKLATVDDSVAAIRADIKMMALEMKSELNNFRDDLKRELVDFRKE FT IYQTLNEXTTDLKTTTDRVSEAEARIAEAEEWSTDFREALSQSLQAQEKMQ FT MKLTDLEARSRRNNVRIFGISEGAENNNIYQFIDNLIKKELDLVDIELRIQ FT RCHRALGPRPPNEAQPRSVIVYFQEFKVKEMVLHTAWKKKEIFHNNSRIYF FT DHDYPAETLTKRKAYSQIRRILKEKGIRFQTPPPAKLRVFFDGGPITYGSA FT DEATEDMKKRGFQFGRETSSEVEVLSKARTTGWQRSTNTTNRQTRQDWIKE FT RLRNFRRPR*" FT CDS 1772..5557 FT /product="L1-12_DR_2p" FT /translation="MQFQEYRVITLNVNGLYNPIKRSKVIAKMKKEKLHVI FT FMQETHLNNLEHEKLKQKGFKNVFYSSYRRGKRRGVAILISNCVNFRLLAE FT FTDKEGRYVLVKGLLDKKEVTLINVYIPPEQDNSCIREVFQLIASEASGVL FT ICGGDWNAQMQPKLDSTNALKRLTPRARVTKKLLMELGLIDVWRELHPLDK FT QFTFYSASQISYSRIDYFFVFNSDRHRLIDCKIGTRDISDHSPVYLTIHLD FT NKKNNSLWRLNTSILNDSTYKEFIQKEIKDYIDNNNNGEVSPSVLWDASKS FT VIRGKLIALTSHKKKEKCKKMFDLQENLKTLEREHIMEKKPHILNQIKNIK FT KQINTLYEEEIEKKAKFTKQRFYENGPKALKLLAWRLRKQQAERTISEIRD FT PITNRIKHNQDDIHNIFETYYRNLYSEISTTNPTEINIFLESLDLPSIGTT FT QNKSLMAEITKEEIGKAISRLKVNKTPGSDGFPVEWYKVFKDQLIPMLHEC FT FNYMLKGGEPPRTWSEAIISIIPKEGKDKKECSSFRPISVLNTDYKLYASI FT LAKRLEDIIPELVDLDQTGFIRDRQTQDNIRRVLHVIDHVNKKKTRAVVIS FT LDAEKAFDSVRWEYLYLALTRFGFNEDFIRCIKNLYSSPNAKIRINGHLTK FT SFVLERGCRQGCPLSPTLFALFIEPLAQLIRDEPGIKGITVRGSEQKICLY FT ADDVLLFVGTPELSIPNLMSVLNKFGTYSGYKLNIQKTQILSFNYFPQKDM FT QNKFNFKWNSTSFKYLGIKITKDSTQLFDSNFGPINKSIKSDIDRWSQLPL FT EMHNRIETIKINIVPRLLYLFQSLPIEISSKQFREWDKWISRFLWRGRRPR FT VKYKTLQLSKEKGGRSLPCLSDYYKAAQLRPLVICCNPEYTAKWKDLETSQ FT IDIPLQSLLGSKILHKHYLNSLNQWSKVPLRIWFKECNSPLLEKQSRLIKW FT VAFDPDFKPXKIDGRFQAWYRLGITTFSLISSNGELDSFQKISDXYGLDKC FT DFFRYLQTRTYFNSEIRCSENHPPNLIDLVIDIYKNKDNRRLVSKLYSAIR FT STKEESTNYIRLKWERESNLVISEEDWLNICSVQATSSNSGMWRDFCWKNL FT TRYFITPKLTYMQTQDKDRGQCWRKCGEDLANHFHIFWSCPAIQPYWKDVK FT QIIRNIFGDDVDLSFTTIYLGNIAANLTVKDKYLLKILLATSKKTVTRKWL FT QLEPPTKSEWLDIISNVQNMERMTFSLKLQMDKYLQYWEKWIVLMPLHSVI FT *" XX SQ Sequence 5787 BP; 1999 A; 1004 C; 1165 G; 1611 T; 8 other; ggacttccgg cggttaactg aagcgcatgg ccgcttaact ttttgtctcc cgttcggtcg 60 gagattaatt ccccaaacaa tacaattaaa acttagttac tgatcgaatt taatgatgga 120 aggggagaag atcaaaaagg gcaaaggaaa acagcttggg aaagcggata aaggtagtaa 180 tttagaggat accgccgtgg ctgaggaaca aggagaacgt aatggcggac aagaggaaaa 240 tctaaaatta gcaacagtgg atgatagcgt cgcggccatc cgcgcagaca taaagatgat 300 ggccttggag atgaagtcag agctaaataa cttccgggac gatctgaaaa gagaattagt 360 ggactttcga aaggaaattt atcagacgct aaacgaawtc acaactgatc taaaaactac 420 aactgacaga gtgagtgagg ccgaggcccg catcgccgaa gccgaggagt ggtccacgga 480 cttcagggaa gctctgagcc agtcactaca agcccaggaa aaaatgcaga tgaagttaac 540 ggacctggag gcacgctcga gacgtaataa tgttcgcatt ttcgggatct cggaaggcgc 600 cgaaaataac aacatctacc aattcatcga taatcttatt aaaaaagaac tggacctcgt 660 tgatattgaa ctcaggattc aaagatgcca cagagcgctc ggaccaagac cgcctaacga 720 ggcacagcca agatcggtga ttgtgtactt tcaagaattc aaagttaagg aaatggttct 780 gcacactgcg tggaagaaaa aggaaatctt tcacaacaac agcaggatct attttgatca 840 tgactaccct gctgaaacct tgacgaaaag gaaggcatac tctcagatca ggaggatact 900 caaagagaag ggtataagat ttcaaacacc gccgccagcg aaattgcgag tgttctttga 960 cggtgggccg atcacctatg gaagcgcaga cgaagcgacc gaggacatga agaaaagggg 1020 cttccagttc ggccgcgaga catcttcaga agttgaggtt ttgagcaaag cgcggacgac 1080 cggctggcaa cgctctacga atacgacaaa ccggcagacc cgtcaggatt ggattaaaga 1140 gaggctccgg aattttcgcc gaccaagata gatttaaagg tttatggttt gttagctgtg 1200 agtgacaaat gactgtaaac tatacagttg gtgaaggaac cagccgtttg ttatggacat 1260 tttagtgtgt gtcaggcgac agctgcgaac acccgcaacg ggatggagta agactgaatt 1320 cttttttttc tttcccctaa cgttactact ctaagactgt tatctcatgt aggctgttgc 1380 tgttatccta cttacctcaa ttatttttct gttgatctgc aacgggtaaa atcgccctta 1440 ataaataacg tagttgaccg acaaacacgg gaacatatat atatatttat tatttgcttt 1500 tgacgttcgt tagtgcacgc agaataccct aaagggcact taaggctgca ctgaagaggg 1560 gctctggtgt ttatctagac tattccccta caataagaag taagaactgt ttgttacttc 1620 gaatggaggt ttctctatgt attctgttta gttttgttat ttccgactgt aaagtcaagg 1680 gttctttttt tattgtatat agttcatatg tagttcagac taggtcacga ataacatata 1740 ccttacagaa agtgctttat aacatgatca catgcagttt caggaatata gggtgattac 1800 ccttaatgtg aatggcttat acaatccaat caaaagaagc aaggttatcg caaagatgaa 1860 aaaggaaaag ttgcatgtta tatttatgca ggagacgcac cttaataatc tagaacacga 1920 aaaattaaaa caaaaagggt ttaagaatgt attttattca tcttacagga ggggaaaacg 1980 aagaggggta gcaatattaa tctctaactg tgtgaacttc cgcctgcttg ctgaatttac 2040 agacaaagag ggtcgatatg ttcttgtcaa gggcctctta gataaaaaag aagttacgtt 2100 aattaatgtg tatatacccc ctgagcaaga caactcatgc ataagggaag tttttcaatt 2160 aatagcttct gaagcatcag gagttttgat atgtggagga gactggaatg cccagatgca 2220 acctaagctt gattccacta atgcattaaa gaggctaacc cctagagcaa gggtaactaa 2280 aaaactgttg atggaactgg gattgataga tgtatggagg gagctccatc cactagataa 2340 acagttcaca ttttattccg caagccaaat ttcctattca agaattgatt acttctttgt 2400 ttttaattca gacagacacc ggctaataga ctgcaagata ggaaccagag atatctctga 2460 ccactcacct gtttacctaa ctatacattt agataacaaa aagaacaact cactgtggcg 2520 gcttaataca agtatactga atgatagtac atacaaagaa tttattcaaa aggaaataaa 2580 ggattatatc gataacaata acaatgggga agtatcaccg agtgtcttat gggatgcatc 2640 aaaatcagtg attaggggta aactcatagc tcttacttca cataagaaaa aggaaaaatg 2700 taaaaagatg tttgacttgc aggagaacct taagacctta gagagagaac atattatgga 2760 aaagaaacct catattttaa accaaataaa aaatattaag aagcaaataa atacattata 2820 tgaggaagaa atagagaaaa aagccaaatt cactaagcaa agattttatg aaaacggccc 2880 aaaggctttg aagttattgg catggaggct tagaaaacag caggcggagc ggactatatc 2940 ggaaattaga gaccctataa ctaatagaat taaacacaac caagatgata tccataacat 3000 ttttgaaacc tactatagaa atctatatag tgaaataagt acaactaatc ccactgaaat 3060 aaatatcttc ttagaatcat tagacttacc ctctataggc acaacacaga ataaatccct 3120 aatggcagag atcacaaaag aggaaatagg gaaagcaatt tcaagattaa aagtaaataa 3180 aacacctggc tcagatggct tcccagttga gtggtacaag gtcttcaaag accagttaat 3240 ccccatgctt cacgagtgtt ttaactacat gcttaaaggt ggtgaaccac caagaacatg 3300 gagtgaggca atcatttcga ttattcctaa agagggtaaa gataagaaag agtgtagttc 3360 ttttagacca atttctgtat taaacacgga ttataaatta tatgcatcaa tactagctaa 3420 aagattggaa gatataatac cagaattggt ggacctagac caaaccggat ttataaggga 3480 taggcaaaca caagataata ttaggcgggt gctacatgtt attgatcatg tcaacaaaaa 3540 gaaaaccaga gctgtagtta ttagcttaga tgcggaaaag gcatttgact cagttcgttg 3600 ggagtattta tatttagcac tgacaagatt tggctttaat gaagacttta ttagatgtat 3660 taaaaattta tattcatcac caaatgcaaa aattaggatt aatggtcatc ttaccaaaag 3720 ttttgttctg gaaagaggct gtcgtcaagg ttgccccctt agtccaaccc tttttgcttt 3780 atttattgag cctttggcac agttaataag agacgaaccg ggaattaaag gaataacagt 3840 tagaggttca gagcaaaaga tctgtctgta tgcagatgat gttcttttgt ttgttggaac 3900 accagaactt agcattccta atctgatgtc agttttaaat aaatttggca cctactctgg 3960 ttacaagcta aatatccaaa aaacacaaat tttaagcttt aattatttcc ctcagaaaga 4020 tatgcaaaat aaatttaatt tcaaatggaa ttcaacaagt tttaagtact tgggtattaa 4080 aataactaaa gattctacac aattatttga tagtaatttt ggccctataa ataagagtat 4140 aaaatctgac atagataggt ggtcccagtt gcccttagag atgcacaata gaatagagac 4200 aataaaaatt aatatagttc ctcgtcttct atacctcttc caatccctac caatcgaaat 4260 ttcatcaaaa cagtttagag aatgggacaa gtggatttca agatttttat ggaggggcag 4320 gagacctaga gttaaatata aaacattgca gttgtctaag gagaaagggg gaagatcatt 4380 gccttgcttg tctgactatt acaaagctgc ccagttacgg cccttagtaa tctgttgtaa 4440 cccagaatac actgcaaaat ggaaagacct ggaaacttca caaattgata tcccactaca 4500 atcattattg ggtagcaaga ttttacataa acactattta aatagcttaa atcagtggtc 4560 caaagttccc cttagaatct ggtttaaaga atgtaattct ccactactcg aaaaacagtc 4620 tagactgatt aaatgggtag catttgaccc tgactttaaa ccarctaaaa tcgatgggag 4680 gtttcaagct tggtataggc tcggcatcac tacatttagc ttgatttcct caaatgggga 4740 attagatagc ttccagaaaa tctcagataw gtatggtctg gacaaatgtg atttctttag 4800 gtayttgcaa accagaacct attttaatag ygagattaga tgctcagaga accatccccc 4860 caatttaatt gacttagtta tagacatcta caagaacaaa gataatagaa gacttgtttc 4920 aaaattgtat tctgccatcc gatcgaccaa ggaagaatca acaaattata tcagactaaa 4980 atgggagaga gaatccaacc tagttatctc agaggaagat tggttgaata tctgctcagt 5040 acaggccacc tcttccaatt ctggtatgtg gagagatttt tgctggaaga atcttaccag 5100 atactttata acccccaaat taacatacat gcaaacacag gacaaggaca gggggcagtg 5160 ctggaggaaa tgtggagaag acttggcaaa ccactttcac atattttgga gttgcccagc 5220 catccagcca tattggaagg aygtaaagca aataatacgt aatatttttg gtgatgatgt 5280 ggatctttct tttacaacaa tttatctggg aaatattgca gccaacctaa cggtaaaaga 5340 caagtactta ttaaaaatac tcctagcaac cagtaagaaa actgtgactc gaaagtggct 5400 acaattggaa ccccctacaa aatctgaatg gttggatatc atatctaatg ttcaaaatat 5460 ggagaggatg actttttctt taaaactaca aatggacaaa tacctgcaat attgggagaa 5520 atggattgta cttatgcctt tacacagtgt aatctgattt atatatgtat ttgttcatgt 5580 gacctgtaag gaaatatgtg acctgtcaac cgttcattta agttctggtg ttttattgta 5640 ttatttactt aattttattw tttatttttt ttttaaattt tttttaaatt ttttttttat 5700 ttttaytttt ttttttacct actttttgtg tgtttttttt ttctctcctt aaaagaaata 5760 aaaataaagt ataaaaaaaa aaaaaaa 5787 // ID Gypsy74-I_DR repbase; DNA; ZEB; 6516 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy74-I_DR; KW Gypsy74-LTR_DR; Gypsy74_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6516 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1463-1463 (2008). XX DR Genome; chr13; Positions 21213538 21207023. XX CC Positions [2893-3432] - Reverse transcriptase CC Positions [4882-5358] - Integrase core CC 'TATG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 290..1642 FT /product="Gypsy74-I_DR_2p" FT /translation="MAGRGWSGDLITPHIGRGRGLMGVGEPAVGRPRVLVY FT DSETIPVRSSVDEVSKPTSSTPENASAENVTQQLRDLIGELGNQIGDSIVA FT RLTDQSPALPRSDVSFEKQNSQQMPSPMSLDLSKVNLIVKTDVKEPPVFRG FT DGSDKCTIMEWIEQMKVYLSKKDYDKTDRVEEILNHLSGRAKSIVKVKLKS FT SPAAAFSPDVVYEVLQRYFSESPGSCQPLADFYATQPKKSEHPVDYWVRLN FT EAAEQADSHLKRRGSKMENMSSEVAMMFIRNCPSPDLSSVFRCKPISKWSA FT EEVQEAIDEYERDCKSWKPALPVPKVVVNQAVFAEKTVGTEAAVGSESVGV FT SSALCIANTHAKANEVNDTGTLERVLKMLERVLERTALPVSEPNPRVSQWY FT RASPCEVCGDRSHSTRSHCMKERRCLACLEIGHQRKACHKVAGQTAPRAPD FT VNNQGN" FT CDS 1714..5526 FT /product="Gypsy74-I_DR_1p" FT /translation="MFIYSNHCQVESGGRIILFQNIAKVEKADSLFYTDVL FT INDAVTCRGLLDSGSMACTINEETECRLLDVCGILEPNDQHTNILLVGCGG FT VQVRPKCIYQLKMVVYGHSVSVPTLVVPGQKDQLIVGTNVNKYVLSEIKKS FT QSYWRVMSQSDSSGEPEIHRFLSMLSGHDRWEGGSIPSVVGTVKLTRAVTL FT LPQQEHLVWGKLPTSSTISVGSTVLIEPSSTQTRSKAVLIGRSLATLPGDR FT WLPVKVINTSDKLVTLRRNAKIADVFPILAVEDLDFQDGQDHAKTVSAQSQ FT SLISSTVVGGSSLQGIRNELQELGLSDLDIESCEVSPHWKEQLLQLVGRYQ FT DVFSKHKLDCGNAKEFVHKIHLSDARPFRLPYRRVPPAQYQKLRMVLSEME FT EQEIIRKSCSEWASPLFLVWKKSGDLRVCVDYRWLNARTVKDAHPLPHQAD FT CLAALGGNAVFSAMDLTSGFYNIAMAEEDKKFTAFTTPMGLYEFNRLPQGL FT CNSPASFMRLMMNIFGDQNFLSLLCYLDDLLVYAPDEGEAIKRLEMVFGRL FT RGHGLKLAPKKCHFLRRSVRFLGHVIDSSGVATDPDKVSAIGAVMESDLMM FT EDGVTPSQRKIKSFLGMVLYYQRFIQDCSAIAKPLFALTAAPRGKKSPGKG FT VAAFKKLSPDDWKEEHSRAFQQLKAALISSVILTHPDFSRPFILSTDASTD FT GLGAVLSQVAEGETKARPIAFASKSLTRAQAKYPAHRLEFLALKWSICDKF FT SHWLKGQEFVVWTDNNPLTYILTKPRLDACEQRWVSKLAPYQFSIKYIPGS FT KNVVADALSRQPFVQHRVSQRLLNEPYNVLLEEAEQLKEGTVQDVFRFGVD FT TQGTEGSLMLNACSLSSAEVSAVLDGHTDWEVGVQARPISWIAHGAHQLLP FT PGQCPIPVLSLDELRDRQQSDPCLSRVLFYVNRGRRVTRRERSGETRKVLR FT VLKQWDRLRMLDDVLYRVSKNNLTGKKQLQYVVPSSLIAQVMEGVHDEAGH FT QGQIRTMHLARQRFFWVGMEHDVRKYVKCCKRCVVSKTLDPDGQAPLESVK FT TSSPMELVCIDFWCAEGSRGQSLDVLVVTDNFSRMSHAFLCRDQSAKQVAK FT QLWDRYFCVYGFPDRIHSDQGANFESNLIRELLLVAGVKKSRTTAYHPMGN FT SSVERFNRTLGNMIRALPARSKERWSQLIPTLTFAYNCTAHESTGYAPFYL FT MFGRIPKLAVDVMFSNVERDCEVVDIDKYVKRLKDDLKLAITAAQKNVDVS FT QQRQTCTTRG" XX SQ Sequence 6516 BP; 1714 A; 1290 C; 1734 G; 1778 T; 0 other; taaatttggt gccgtgaccc ggatgtcatc tgatctgctg gagctgatcg ctgggattcg 60 caatgtgaat acacaacgct cgatattgga gtctgaagaa ctttctgtga cattctggta 120 gcagtttgga cactattaac agtttcacat cacgcattac acatacacag gactttactt 180 tgcttaattt tatttttatg tgttttactg ttgttatttt tttgcaaagt gtgaagctac 240 tgtgaattta attcattgta tataaactgc tttattaatt tttcacaaca tggctggtag 300 aggatggagt ggggatctga taacaccaca catcggcaga ggtcggggtc tgatgggggt 360 aggtgaacct gctgttggca gaccacgtgt gttggtttat gactctgaga ccattccagt 420 caggtccagt gtagatgagg ttagtaaacc aacttcctcc acccctgaaa atgctagtgc 480 tgagaatgtc acgcagcaac tgcgcgatct tattggagag ctcggaaacc agataggcga 540 ttcgattgtt gctcggttaa ctgatcaaag ccctgcatta cctcgttctg atgtgtcgtt 600 cgagaaacaa aattctcaac agatgccttc gcccatgagt cttgacttat ccaaagttaa 660 tctcatagta aagactgatg ttaaagaacc accagttttt agaggggatg gtagcgacaa 720 atgcactatc atggagtgga ttgagcaaat gaaagtgtat ttaagcaaaa aagactatga 780 caaaactgat cgcgtagagg agattttaaa tcatctttcc ggccgcgcaa aaagcatcgt 840 taaggtaaag ttgaaaagca gcccggctgc agcattcagt cctgatgttg tctatgaggt 900 gctacagcgt tattttagtg aaagccctgg ctcgtgccag ccgctcgccg atttctatgc 960 aacgcaaccc aagaaaagcg agcaccccgt ggactattgg gttagactga atgaggcagc 1020 tgagcaagct gattcacatt tgaaaagacg tggtagcaaa atggaaaaca tgagttcgga 1080 ggtggcgatg atgtttatta gaaactgccc tagtcctgat ctttccagcg tctttagatg 1140 caagcccata agcaaatggt cagctgagga ggtccaagag gccattgatg agtatgagag 1200 agattgtaag tcttggaagc cagccctgcc tgtacctaaa gttgttgtaa atcaagctgt 1260 atttgccgag aaaacagtgg gaaccgaggc agcagtagga agtgaaagtg tgggcgtttc 1320 ttcagcattg tgcatcgcaa acacccatgc taaagctaac gaagtcaatg atacaggtac 1380 actggagcgg gtgctgaaaa tgttggagag ggtactagag cgtacagcct tacctgtttc 1440 agaaccgaat cctcgtgtgt cgcagtggta tcgcgcttcc ccatgtgaag tctgtggcga 1500 tcgttcgcat tcgactcgct ctcattgtat gaaagagaga agatgcctcg cttgtcttga 1560 aattgggcat caacggaaag catgtcacaa ggttgcaggt caaactgcac cccgggcccc 1620 tgatgtaaat aatcagggaa actagcccac tcgcatcgag gaggggacga tgtgagtgag 1680 gctagtcagc ccctcacctg tgaagatgac ccaatgttca tctatagtaa tcactgtcag 1740 gttgagtcgg gtggccgtat cattttgttt cagaacatag caaaagtgga aaaagctgac 1800 agtttgttct atactgatgt tttgataaat gatgcagtca cctgccgagg ccttttagac 1860 agtggttcga tggcatgcac aattaatgaa gagaccgaat gcaggctgct ggatgtttgt 1920 gggatcctcg agcctaatga ccagcatacc aatattttac ttgtcggttg cggtggtgtt 1980 caggtgaggc ccaaatgcat ctaccagttg aagatggttg tatacggaca ttcagtgagt 2040 gtgcctacct tggttgtgcc aggacagaag gatcagttga ttgttggtac aaatgtcaac 2100 aagtatgtcc tcagtgaaat aaagaaatcc cagagttatt ggcgtgtgat gagtcagtca 2160 gattcatccg gagaaccaga gatccatcga tttttgagta tgttgtctgg tcatgatagg 2220 tgggaggggg gtagcattcc aagtgttgtt ggcaccgtta agctcactcg tgctgtcact 2280 ctcttacctc aacaggagca cctagtctgg ggtaaactgc cgaccagttc caccatttcc 2340 gtggggagta ctgttctgat cgaaccatct agcacgcaaa ctcgtagtaa agctgttctg 2400 attggacgat ccttagccac tttaccgggc gataggtggt taccggttaa ggtaatcaat 2460 acgtccgata aactggtcac actgaggcgg aatgctaaaa tagcagatgt atttcctatt 2520 ttagcagttg aggacttaga tttccaagac gggcaagatc atgcgaaaac tgtaagtgct 2580 cagagccagt ctctcatctc tagcacagta gttgggggta gttccctcca aggtatccgt 2640 aatgagttgc aggaactcgg tttgagcgat cttgacatag agtcctgtga agtaagtccg 2700 cactggaaag aacaacttct gcagttagtt gggagatacc aagatgtgtt ttctaaacat 2760 aaactagact gcggcaacgc caaagagttc gtccataaga tccatctgtc tgatgctcgc 2820 ccttttagac ttccgtatcg tcgtgttcct ccagcccagt atcagaaatt gagaatggtg 2880 ctttctgaaa tggaggagca agaaattatt cgtaagtcct gtagtgagtg ggcgtccccg 2940 ttgtttctgg tatggaaaaa gagtggtgat cttcgcgtct gcgtggatta tcgctggttg 3000 aatgccagaa cggtcaagga cgctcaccct ttacctcatc aggcggattg cctggcagcc 3060 ttgggtggga atgcagtgtt tagtgcgatg gatcttacct ccggctttta taatattgcg 3120 atggcagagg aagacaaaaa gtttacggcc ttcacaacac ctatgggtct gtatgagttc 3180 aaccggttac cacaagggtt gtgcaatagt cccgccagtt tcatgcgtct gatgatgaat 3240 atcttcggtg accaaaactt tttgtctttg ctctgttatc tagacgattt actggtgtac 3300 gccccagacg agggagaggc tattaagaga cttgagatgg tgttcggtag actcagggga 3360 catgggttga aattagcgcc gaaaaagtgt cattttctcc gaaggagtgt gcggttcttg 3420 gggcacgtca tcgacagctc tggggtagct acagatccag acaaagtgag tgctatcggg 3480 gcagtcatgg agtccgactt aatgatggag gatggggtaa ctccgtccca aaggaagatt 3540 aaatcgttcc ttgggatggt gttatactat cagagattta tccaggactg ttctgccata 3600 gcgaaaccac tttttgcttt gacggccgct ccgaggggta aaaaatctcc agggaagggt 3660 gttgctgcct ttaaaaagct gagtcctgat gattggaaag aggagcacag tagggccttt 3720 caacagttga aggctgctct gataagctct gtgatattaa cacatcctga tttcagccgc 3780 ccatttattc tgtctacaga tgcgtccact gacggcttgg gtgctgtttt atcacaagtt 3840 gcagagggtg agactaaggc ccgaccaatt gcttttgcaa gtaagtctct tacgcgtgct 3900 caagcgaaat accctgctca ccgcttagag ttcttggctt taaaatggtc catttgcgat 3960 aaattcagcc actggctgaa gggccaagag tttgttgtat ggacggacaa caatccatta 4020 acttacattc ttaccaagcc tagactcgat gcgtgtgagc agaggtgggt ttcaaaattg 4080 gctccctatc agttcagtat caagtacata ccgggcagta agaatgtagt ggccgatgcc 4140 ttaagtcgtc aaccctttgt tcaacacaga gttagccaga gactgttgaa tgagccttat 4200 aatgtgttgc ttgaagaagc ggaacagctc aaagagggga cagtgcagga tgtattcaga 4260 tttggtgtgg atactcaggg cactgaaggt tccctgatgc taaacgcttg ttcactgtct 4320 agtgctgaag tgtctgcagt gttagatggt cacactgact gggaagtggg tgtacaggcg 4380 agaccaatct cttggattgc ccatggggca catcagttgc tgcctcctgg gcaatgtcca 4440 ataccggttc tttcgctaga cgaactgcga gataggcaac agagtgatcc ttgtttgtcc 4500 agggttttgt tttatgttaa taggggtagg agagtcacac gacgtgagcg ttcaggtgag 4560 acgcgtaaag tgctgcgggt cttaaaacaa tgggacagac tgaggatgct tgatgatgtt 4620 ttgtatcgcg tgagcaagaa caatcttaca ggaaagaagc agttacagta tgttgttccg 4680 tcatcactga tcgcacaagt aatggagggt gtacacgatg aggcaggaca ccagggccag 4740 attaggacga tgcacttggc taggcagaga ttcttttggg ttgggatgga acacgacgtt 4800 cgcaaatacg ttaagtgttg caagcgctgt gttgtcagta agacattaga tccagatggc 4860 caagccccac tcgaaagtgt aaagacatca agtcctatgg agttggtctg cattgatttt 4920 tggtgtgcag aaggctccag aggtcaaagt ctggacgtgt tggtggtaac ggataatttt 4980 tcccgtatgt cgcatgcctt tctatgtcgc gatcaatcgg cgaagcaggt agcaaagcag 5040 ttgtgggatc gatatttttg tgtgtatggt ttccctgatc ggatccattc tgatcaggga 5100 gctaattttg aaagtaacct gatccgagag ctgttgctag ttgctggggt aaaaaaatct 5160 cgaactaccg cttaccatcc catgggaaac agcagtgttg aacggttcaa tagaaccttg 5220 ggaaacatga tccgtgccct tccggctagg tctaaagaaa ggtggtcaca gttaatcccg 5280 acattgactt ttgcgtataa ctgtactgcg catgaatcca ccgggtatgc cccgttctac 5340 ctaatgtttg gtaggatccc aaaactggcc gtggacgtca tgttcagcaa cgttgagaga 5400 gattgtgagg ttgtcgatat tgacaagtac gtgaaaagat tgaaggatga cttgaagtta 5460 gctataactg ctgctcagaa gaatgtggat gtcagtcagc aacgtcagac ttgtacaacc 5520 agaggatgaa gggctgtgtc attgagttgg gtgatcaagt tttacttgcc aacaagggtg 5580 aacgggggaa gagaaaatta gccgatagat gggagtctac gccttacagg gtagttgctc 5640 taaaccccca gtgtcacatc taccatatac gcaataccag aaagggctta gagaaaactg 5700 ttcaccgaaa cttactgtta caggcaaact ttcttccact ggaattggaa gattccgcat 5760 tatccggggg tagtgatggg tctgtcttag gaagcagaag cgtgtctgtg gatgatgttg 5820 ctggtcttga actggagttg tgctctgtgt ctgaacgggt tgcgagttgg gtggaaaaga 5880 ctgctgtttc tgagggttct ccacaagagt ttttaagtgg atctggtgat gctgtcttaa 5940 caagactttc cgagatgctg gagcaagcgc agactccatc agaggctgac tacatcgaag 6000 agatttgtgg tgcaagtagt gacaacttga gtcgtggttc ttgcagtcgg tgttctgaaa 6060 ttgtgctacg cgatgattgt actccatctg ctggagttgt accaccttcg gatcaggtga 6120 caattggagt tagaacgaga gtaggtagag tggtgaaacc tgtgaacaga ttaatagaga 6180 acatgactca gtccttaaaa acacttaaca cagttggtgg ggttgtaaag tctttgttga 6240 agtgaaagta cattgaatta cattatttac ttgagagtca gaacattcga gttgatgcat 6300 ctctgaatat cggttgtccc agattcaagt tatgaatggt gtataaggca ccgtttttgc 6360 aggtgactgc agcgggcctg atcaacctgc tgtcgtcatc ttcccttcta cgtggggaaa 6420 cattttgttg tactatgttc tgggtaaagt gggttatata ctgaagagtg taatatcgat 6480 ataatagtcc aaatttgaga agttcagggg gaagaa 6516 // ID Gypsy106-LTR_DR repbase; DNA; ZEB; 1379 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy106-I_DR; KW Gypsy106-LTR_DR; Gypsy106_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1379 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1516-1516 (2008). XX DR Genome; chr20; Positions 5971868 5973246. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1379 BP; 419 A; 182 C; 263 G; 515 T; 0 other; tgtgacggat agagggatat tattttatat aaggtgatta aaagaaaaag ttaaaccccc 60 ccctctcact ctttacctca tttttttgtt ttattcctgt tctagttctt atgagtaaat 120 gaaatgattc atgtagttca ccaccacacc tgtagagggc agtgtggcgc gttagtgggt 180 tacctaggag accggcagct cgaggtcata ttaaactgca ctcgagcaat gctataacgt 240 gtctgtctcc ttatcattca caggtatgga taaacactca tttaaagtgg tttgtgtgaa 300 atatttatca tctttactct taagaaatgt taagaaacta tgttagaagg tgttagttct 360 atgcattgat tcgtttttgg tttaattaga tcgtaaacat tatagccact cgtggttagc 420 aaaagcagcc gaagctaatg atgagttaat ggttaactca aatgatgtga acggagcatg 480 tagtgatgta tgcattcaga aatatgcatt agatcctttt ctaagcactt taatgttaag 540 taattgttgt aattgttaca tttgttatga atgtcagcca gattaggtta atatggggca 600 tatttttata ttgattctct tctaaaatgt ttgtgaaagc ataacgctgt atagtagtgt 660 atagtaacac aatagtaata agtgatagta caagtttaat gttttaatgc tggatttttg 720 gttttataac atcataattg gctttctcaa taacttaggg ttattacatg agtaaaagtg 780 acatttattt aatcctaaat tgtcggtatt tctcataaat atattaacaa gtgttaatat 840 aaatgctgat gtaatgtaaa ttggaaatgt tgtgaaggaa aatgttacat tgaattaatt 900 ataatgatta gcatcatttt tgcatttata ccagtagtta tagatagagt aagaaatgta 960 ttattttaat aattcttgta atattacttg agtatctcaa agtaagacca agactgtatt 1020 ctgaggagtg tattcaagtg tataaagtat ttttgttcaa tgtattaatg ccttttgttc 1080 acagagaaaa tctagttaca ttgcattgag tgtgtattga tattgtgtat tgatattgca 1140 ttgagtaata tgatgtattg atatccatga cacatttgag ttttatctta ttgatatacg 1200 tttggtttta ttctggaaca ctgtgtacaa tgtgttagct ttaaaagaag taattatctt 1260 tgatatatta aactgcactc gagcaatgct ataacgtgtc tgtctcctta tcattcacag 1320 ttttgtgcgt gatttggctt gtgccggtcc agattcccct aggtctggag ccggtaaca 1379 // ID Kolobok-N6_DR repbase; DNA; ZEB; 467 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE putative nonautonomous Kolobok transposons from zebrafish - a DE consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-N6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-467 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 8(10), 1350-1350 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 467 BP; 144 A; 80 C; 102 G; 141 T; 0 other; agtccgcatg aaccggaagc tgcgaccgtt tttttttgca tattgtgatg cagttcctag 60 agaaacggaa tattaaatga gaaaacagtg ggcgtggctt gtttttttct actgcgagct 120 gattggatgt agtaaagtag gcatttcatt cagaaagatg gggaaaaggg tttggggaga 180 gttattacaa cctaacagac tcctcctcct caccatttct gtttgttgtc aaaactgaca 240 gttggagggg cgtggttaag tatgttagcc acgcccaata cctcagacag acctaatctg 300 agaatttaac tgaaaacaaa caggaagtgc attttcagat ttcaattaaa gattacaagg 360 gcaaactatt ttttttctta atgacatgca cagatgaatt gttcaccaca aaactagcaa 420 tgtgagctaa caaaatcaat atggttagtt ttgatttcat gtgtact 467 // ID EnSpm-6N1_DR repbase; DNA; ZEB; 9236 BP. XX AC . XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 02-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE EnSpm-6N1_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW Autonomous DNA transposon; EnSpm-6_DR; EnSpm-6N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9236 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(10), 1260-1260 (2008). XX DR [1] (Consensus) XX CC EnSpm-6N1_DR contains a 39-bp TIR, whose terminal ~13 bp is CC similar to EnSpm-6-DR. In addition, as EnSpm-6-DR, the TSD is CC also 2-bp, which is different to the 3-bp of other EnSpm. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 9236 BP; 3226 A; 1538 C; 1471 G; 2997 T; 4 other; cccagacagc acacatacgt tgggccgacg tcgtgccgat tttcaaaatt ccatcggtgc 60 gatgtcgctc agggagcgtt tgctaattgg caaaatgtcg ccgcgacgtc ggcaaatgat 120 cggaaacgct gtccggccga tgttaggacg acgcaaccgg aaataccggc cgctccgcgg 180 aggcgctgtt ctcaggcgtt ttgctccata cttatggacc actagctgca gcagctgctt 240 atttataata aacacgacac accactctca ataacagttg aagatttatt aatatttttc 300 acaaacatac aaatagatta acatactttt atcacacata accatgtttt accacgttta 360 aaacaaaacc aacaagttgt actataacca tggctgaact actgtgcaat aaagatttta 420 aatacaaagt ccattgcagc atttacatta agaccatagg aatatttaca agtacttcaa 480 aacatgtaca ctaagggtga ctatagttta ctatagtaaa aacacaaata acttgtgtaa 540 gcacatgtta taatgcttat aatttcttaa atagtgttaa agaaaatgca tgtttccaag 600 actataattt taaaatacaa gataatacca gtcatgtgtt cagtatccaa aagactttct 660 tcagggagtt acaataataa aagagtacac aaacacaaaa taaataccaa gcaaacaata 720 tttacagtag taaataaaat catacaaatt aaaataattg tctgataaaa tcttattaaa 780 gtctgtagtc aatttttttt tttttttttt ttataaatgg cattcaacaa tttgcaatct 840 cctctgaact aataacacat caataatata aaataacaat actttctcag agataaccag 900 ttttttttaa gcaaactcta atgcacgaca caaaacagat tttgtcatta tatatatata 960 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 1020 tatataaaca tatatatata tatatatata tatatatata tatatatata tatatatata 1080 tatatatata tatatatata taatccctta atttctgttt aacggaaata ttttttaaca 1140 catttctaat cacaatagtt ttaataactc atttctaata actgagtaaa taatatttga 1200 ctagatattc ttcaagacac ttctatacag cttaaagtga catttaaagg cttaactagg 1260 ttaattagct taactaggca ggttagggta attaggcaag ttattgtata tcgatggttt 1320 gttctgtaga cagtcgaaaa aaattgctta aaggggctaa taatattgac cttaaaatgt 1380 tttttaaaaa attcaaaact gtttttattc tagccgaaat aaaagaaata agactttctc 1440 cagaagaaaa cataacattt tcttgctctg acctatatat atatatatat atatatatat 1500 atatatatat atatatatat atatatatat ataataataa taaaactggt tgtaaattgt 1560 gtacaatgtt ccaagaaaac agacaaaatt aaaagaaata acctgtcatt tagttttgtt 1620 ttatcttgtg gaatgttgct tacacataaa aggggatata taggtggaat aaaaagtgtg 1680 tcgccatcct ctaaattgtt cttgtatgtt gtcttcatct ttagtcttca caagtgaggc 1740 tggcccaaac cgggactgca gtagccaaat ctaatccttg tagtgggaac ccttaaataa 1800 agcaagaaac caaaaggaga aagattagca ttgcagcagt tcatattacc acaaatatat 1860 attatttgtt tcatgacata taactggagt atgtgttatg caagtgccta gctaaaatgt 1920 gtaatttaat ttaaacttaa agagtgtgtc tgagcactaa ttttcagaaa agccatttga 1980 aagttaaatt ccttgtgaaa tcataatgaa tgtttttttt ttttggtttt tagtatcatt 2040 gtgttagtat aaatgtgtat ctaaatatat gtctaatctt cagataccac attacatgca 2100 aattaaacta agtttcgtca actaaaatgt cacactgtac accacattta atatatcata 2160 gctagggcca ggaggaatct gcggatgttt tttgctattt ctgctgaaaa ttttggtgga 2220 agtctgcgga tttctgcgga attattttgg gagtatcata actaaaacct taatatatga 2280 aataaaaaaa taatatattt taaagtttta tttaatgttt aaaatgcaaa tccaattaga 2340 ttcactttat ttggtaaata aagcaagttt ttcatataac atatctagta agacagaaaa 2400 tattactgta caaacctcat tgtacataaa tcagatgaac attttcacat tagtcaataa 2460 tattactgaa attaatwaaa aaactgaata aatatagatt tacacacatt tactcaagta 2520 aataaacaga attaatgata ggctaaacat ctgcggagtt ctgcgaaaaa tccgggtggg 2580 cctaatcata gctttatttc tgctacaaaa aggtttttca gcaaacaact tctattatat 2640 gtaggctata gaggaggata tataaagaaa tgatatattc aactctctaa ttaaatgcac 2700 ctgatgcagc taatcaagtc ttttaggctt atttgaaaac ttaaatgacg tgttggacca 2760 cagttggaca acgctcagtg agaatacttt tttgcacaaa ttctttttgt tctaaagaaa 2820 catgtctgat tgctagcaat catcaaagag caaacttttt tttttttcaa ataaatacta 2880 tgctcttaaa aataaacttt tcatacatca agaatactaa aaatatgttt atttaaaatg 2940 ttttatattt ataaaaaaaa cagcttgagg agagaatcag tacattggaa tactattttt 3000 tcagaaggat catgtgacac tattaattat atacattgta ttaaattcac agtattactg 3060 cttgactgta ttaattaaat aaagctgaca ctatcagaag agaatttacc ataccaacct 3120 aaaagctatg aaacagtagt gtatgtttaa gatgttactt actagtccta ttgtactttg 3180 gagaggttgt ctgacaggag gatcccttca actcagcaga attctcagca gaccatcagc 3240 gtacaccaat ctaagaataa aataagcaat ttgttaaaaa ccaaaaatat atatatttta 3300 tatttccagg tttataggaa agaagaaagc taaagcagat ttaaaatgta gaaaaaaata 3360 aaacctattg acaatttaat ggcacaacat accccaaact aagcacgctc aaatggttaa 3420 tttacataaa tgacaactgc tgctcattat accaatatta actgttaagg attcattcat 3480 tcattttctt ttgggcttag tccctttatt aatctggggt caccacagcg gaatgaaagg 3540 ccaacttatc cagcctatgt tttacgcagc ggatgccctt ccagctgcaa ccaatacaca 3600 ctcattcaca cacacttata cactatggtc aattttagca tacccaattc acctatgtct 3660 ttggacttgg gaaaaccgga gcacccggag gaaacccacg ccaacacggg gagagcatgc 3720 aaacttcaca cagaaacgcc aactgaccca gccgaagctc gaagcagcaa ccttcttgct 3780 gtgaggcaaa cgtgctaccc tctgcgccac cgtgctgccc tgttaaggaa tcagattaat 3840 ataaatgaat tgacatgttt atttagtgtc tgacacatta aatattgcat ttttaaatat 3900 tgtttcattt taagatttat ttcagaatta actatagaac ctggatatat cgtcacattt 3960 tcatgcacag cagcatttcc aagaacatag aaaaacagaa tataggccaa ctgaaaaaaa 4020 aataaataaa aaaattctga ctgaatgctt taagtaactg gctaggaatg ggcaggtata 4080 agattctgag ggtataatct tggataaaaa tatcacagtt tcacagtatt gtgattactg 4140 ctctgaaatg tgtttttttt ttatctctgt gtaaaaaaca gctacttttt ccccattgaa 4200 cacaatatgt tttrttttaa gaaacattta aaatattttg taacagtaaa catgccaggc 4260 taaataataa tttaataaaa tagtactatc ttcattagac tcaaaaacac tgatttcttg 4320 tgaaggcata ataagctatt tgtttttcag atgtttgctg aatcgtgagc tgaccagtag 4380 cttttgatgt ggagggtctt ttgctgctgt agataatgtt ggactatttt tttaagtctt 4440 tgataacgcg ctaaaaacca tatacaatgc atccggaaag tattgatagc gctttacttt 4500 ttccacattt ttaatgtttc agtcttattt caaaatggat taaattcatt aatttcctca 4560 aaattctaca cacaatcccc ataatgtgtg ttgacccgat tttagttaat catttacaag 4620 tcaatattgc ctatatggat tgtaatttaa aaaaaaatta acatgtaaaa ttgcagtgtt 4680 aaatgcaatg ccatcaaaca ttttagacgc accagtgcaa aaagctagtc tagagcactt 4740 ggaatgctgg tgtagttcta gttaaattca gttcagttaa agctgtatat gcagcaggtc 4800 tcaagtatag gatttagtat ttaactaatc ttaacaacta aacatttacg aatctgtatt 4860 agccagctca tgtactacaa tgtaacagat actgtatatt gccaccttga tacttaagct 4920 ttctttaaaa tattttgtgt tcaaccgatt aattaattct ttcatttaat tttttcagtg 4980 caatagtata gcttacactg gtgtatttgt atggaccccc caaaatagcg gaggccactt 5040 tctggccacc caagtataaa cttctagcct agccactgtg aactacctgt ctgtcctgct 5100 ccagaactct ttgtcggctg cttgtatgtc tggatacagc tgcacaagtc ttttaacact 5160 gacaaataca gcctctgcac atcctcacaa ccaggagaac agaatgatca ggaggacctg 5220 aaaaatataa ccaactgaaa tcagttgaaa tgtattttgt cttgtgatga tgacacaaaa 5280 ataaattatt tttcacaaag gatgtgctgt tacctatgaa acggtctata atgaatgatc 5340 tttttgatcc tgtttccttg atgtcattca ctcagagatg gtttggacat ttgattgtat 5400 ctaaactgta atgaatttaa gttacaaaaa cattgatagc aggaccattg atcttgtaac 5460 ttatacaaat tgttaagtat tgaacaagtc tgcaaccaaa ctagtcaagt taataatact 5520 aatctcaact caactatgat tagttgtctt gtttctgctg cccagacggt ctctccacct 5580 gctgataaaa aaaggaaaaa gtatggtatt tttacaacaa tgaaactgca aacaacttaa 5640 ctataagggg gtaagcagtg agcatctttt attataagca atttactgaa tactttgtca 5700 ctggtattac ctaaaaacca aacgtcccaa gtttggctca tagctgtcca cacatctgcc 5760 cagtagcttg tggtaatact gaagacactg atcagctggt ccaactgagt ttggagttga 5820 actgcaatca ggcatccaat actgcaaaga aagagaaact cactttgggt taacaaataa 5880 tgtaatccca tccacacatt tcaagtaatt gtgcagttaa ttctcttaaa atgaatgaat 5940 gaattctctt attgaatttt ctgagagctt atgaatatat tttgtcttcc caaaatagca 6000 tgtactgaaa cacaacatta tataaatata atttaataag agcgtgacgt ggtggcacaa 6060 taggtagtgc tgtcgcctta cagcaaggat gctgactcga gcctcggctg gatcagctgg 6120 catttctgtg tggagtttgc aaattctccc tgcgtttgcg tgggttttct ctgggtgctc 6180 cggttttgcc tacaagtcca aagacatggg gtacaggtga attgggtagg ctaaattgtc 6240 tgtagtgtat gagtgagaat aagcgtatat ggttgtttcc cagtgatggc ttgcagctgg 6300 aagggcatcc actgcgtaaa acatatgctg gataagttga cagttcattc cgctgtggtt 6360 accccagatt aataaaggga ctaagcagaa aagaaaatga atgaatgaat gaatgaatga 6420 atgaatgaat ttaataagag ttaaactgat ctgttaacat gtctgtcagg agctgatcta 6480 ccttataatt aattttttgc ttaatttttt taacaggtaa atcaatacac acacatatat 6540 atatatatat atatatatat atatatatat atatatatat atatataaaa aatggttcca 6600 gtagtaacgc tccttaaagt taaactacat attcacattc ctcatctgtc tactgttaca 6660 ttatactaat cactgttaca tacagtggcc gagagtgcaa agagagtaat aaaacacaga 6720 caaagaagac agctttgtca ctttttctat gctatttgtg agattgtgta tgttgtcaaa 6780 ctgatgaaaa tgttttctca atttacttta acatttttgt gtttgtttat gtttttataa 6840 cttgcagccg ttgttaaatt taaaagtatt ttaaacatgg atggacatct aaatgcatat 6900 agggcacata tctcaaaaca ttatgaaata tttccagata aatcaatttt gaattaaaat 6960 gtatattaat ctccagactt gcatctccct gatcttttta aaagaacggc ccctcgtttg 7020 ctcatcactg tttgacaagt gtctggcagt cagtcaaaaa tatacttaca tataaagatg 7080 ttcatcagct acacggcact taggttaaac tttgacctcc ttttcagttg atgtggaaaa 7140 caaaagcaaa agaaagaaaa aacaaaacga ctgaatacaa gttgcatatc atgcctgtac 7200 aaatcaaaaw ttattcattt agggcttaaa ttgacttgtt tgaggatgag ttaaggtgta 7260 agggatgggt gaacagtgta aatatacatg taattcatag ctatgatgta aagctraatt 7320 ttcagcatca ttactcaagt cttcagtgtc acttgatctt tcagaaatct ttttttaaga 7380 tgttgattta gtgcttgagg ttgaactaat ataatggtga ttatcaaatg gtgctgattt 7440 gctgctcaag aaacatttcg gattgctatc aatgctaaaa acagatgtga tgacaaataa 7500 tttagtgtaa acttttttct aagatttaat aaatcaatta atcataaaaa aattaaacaa 7560 aatgaaaatc acaaatatta cttcattagc agcaaatcta tatatttatt aagttctgaa 7620 gaatcatgtg atgtgaaatt taataccact cacatgtttc aatgtcaaaa ctttcttaaa 7680 ataactgcaa taaagacaag taacagatgc ataatatata gacatgtaac atgtgcataa 7740 tgatgtgcaa tggaaattca tatgaattaa tcctctgctt aaagttacct gtctttcctg 7800 catctctgtg atcagctccc ttcagaaatt taggaatctt gagttatatt ttctgtagca 7860 ggaggttatt ctctcactcc tgaaaccaca gaacactttt ttacttaaca tttagtttaa 7920 gataccatga aatatcaaaa cacgattgaa cttttgattc acggtccttt acagtaacgt 7980 tacaatgcgt attttagact aacgttacat atggagtgca ttcaataaaa cattgacaaa 8040 acctctcatc tacacataaa gaataaataa aagccaaaac tttaacaaac aagtttaaca 8100 agagctaaca ttaagtaagt taacgtaata ttaccccgac gctaacgtta ccagtgctaa 8160 tatgctaacg gacttttacg aagacttacc actcttatat aacataatat tcaactgaat 8220 attcatcaat tgccaataat aaatgtgtga ggaacagttt tatgtagtat ttgctttgtt 8280 ttagggacta aacttacctg aaaacagtca aaacggcagc ttgagagaag tcgtgttgtg 8340 tctttcaggt ggtgtcgtgt cgtgttgtag cttcgtttcc catagcaaca tcctctccgc 8400 tccagtgttg gaagcaagac gcgcgcgcgc acattaaagt cttgcagtat tacatttata 8460 aacatgatca ctccatagac aatttaaata ataaccttgt attttatatc ttgaatttaa 8520 aatgtattta gcagttttat atacgctatt atctttaaga cgagctttat agcctactaa 8580 actgttacaa aacaaacact ttatttgtgc atcagattca ctgtcaaaaa taaataacca 8640 ataaatataa atgcaagtat atataaaaca aagatgtcac tctgtaaaaa ggtgtttatt 8700 ttaattaaac agccgaaatg cagaacatac ttgattttta cactgggaaa aatgtattag 8760 aaaaagccac aataaaaaaa agcaaacagc agttaacacc agaaaaaaac aacgttttca 8820 ttattaaagc ataacaataa aactttttat attttaatga attgaaatca tgtattgtag 8880 gcattttggt cagttttact gcattaatgt aggctattta aagctggacc gaccgacgca 8940 gttcacatga tgcatgtcgc gtttgtctaa tttattatgc acgcaaaaac ttacttgtaa 9000 accaaatatg ttgccggatt ctttcatggg catgagagat aaagtgctta tgaagtgctt 9060 cgtcggcctc tggtcgttaa ccgacgcaga gccgacgttt gttgagctgg aacaatagaa 9120 cacacgaccg aacgatcatg cgtatgtttg gccgaccata taccgatgtt agaattaaag 9180 gggccgacgt gttgaccttc ggccgacgtc ggcccaacgt atgtgtgctg tctggg 9236 // ID Gypsy94-LTR_DR repbase; DNA; ZEB; 520 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy94-I_DR; KW Gypsy94-LTR_DR; Gypsy95_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-520 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1500-1500 (2008). XX DR Genome; chr19; Positions 22462880 22462361. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 520 BP; 141 A; 73 C; 101 G; 205 T; 0 other; tgtaacaggt gttatattat ttttattaat ttcactcaaa attctccttg ttattcatca 60 attattttta tttaattgtg tattttcaaa gttgttcttt atatatctta tatttcagag 120 ttaatgaaat catagttgtg tttgttttca atgtgttggt gtattttgga aaattgcgcg 180 tggccccttt aagtgtcgtt ttccggttcc gcctgctctc actcagttcg cggcaaacgc 240 gattgaggag aggtgagttt tactgaagct ccggttaaaa tataacatat aagtgttgtt 300 tatcataata acacatgtaa tggtgttcta aagtgcatta cattgatgtt taaacatgta 360 tgagaggttg aatatatatg aggcatgtat tgtggataaa gtttataagc atgaatgtgc 420 aagctttgta gtgagaatta cccctttcac taataggttg gtgagctcct aaggaaggca 480 catatactca gctttgccat ttgtttgctt tattcataca 520 // ID hAT5-N1_DR repbase; DNA; ZEB; 1162 BP. XX AC . XX DT 05-MAR-2010 (Rel. 15.11, Created) DT 05-MAR-2010 (Rel. 15.11, Last updated, Version 2) XX DE hAT5-type DNA transposon - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT5-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1162 RA Kapitonov V.V. and Jurka J.; RT "hAT5 - a novel group of autonomous hAT DNA transposons RT characterized by 5-bp target site duplications."; RL Repbase Reports 7(12), 1188-1188 (2007). XX RN [2] RP 1-1162 RA Kojima K.K. and Jurka J.; RT "hAT5-type DNA transposons from teleost fish."; RL Repbase Reports 10(11), 1903-1903 (2010). XX DR [2] (Consensus) XX CC >96% identical to consensus. 5-bp TSDs. Both termini are similar CC to DNA-5-3_DR, DNA-5-7_DR, and DNA-5-8_DR with >70% identity. CC This transposon encodes a CCHH zinc-finger protein similar to the CC N-terminal portion of hAT transposase. These characteristics show CC that this is a nonautonomous member of hAT5 group of hAT-type DNA CC transposon, represented by hAT5-1_NV from Nematostella vectensis CC [1]. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(312..494,523..873) FT /product="hAT5-N1_DR_1p" FT /note="includes a CCHH zinc-finger motif, similar FT to the N-terminal region of hAT transposase." FT /translation="MGKCLFDDNWLEDEKYRGWLKKTASNHEGRCALSKKT FT IKLGTMGRKALDSHMKAEKHKKYVVCSHRALRPQPSTRPSTSSNAFSSFAN FT IVTFKAEILWVFQTVSRHHSYTSNEDVHLVFQAMFPDSECASTFTCGRDKT FT VYLARFGVAPYLKKKLISRANKDAFIIMFDESMNAATKT" XX SQ Sequence 1162 BP; 331 A; 238 C; 256 G; 337 T; 0 other; cagggtcccc gcgggtcctt aaaaagtctt gaaatgtctt aaataaaatt ttcgattttt 60 aaggtctgaa aatgtcttga attctgtaat tttccataag gaggtcttaa atttcatgtg 120 ggatcttaaa tttcatgtcc aactcgtctt ttttatttat ttttttcaac cgctatttga 180 ccagcgcaac atttgggggc agcacttcgt gggtgcaacc tgcatcattc cataggtgac 240 atacgagtaa gtgattgatg aatgcgttgc gttgatactt cgccaccacg gcattttgcg 300 aaatcgcaag catgggcaaa tgtttgtttg atgacaattg gttggaggac gaaaagtatc 360 gggggtggct aaagaaaaca gccagtaatc atgaagggag gtgtgcttta tctaaaaaga 420 cgataaaact ggggacaatg ggccgcaaag cgctcgattc acacatgaaa gcagagaagc 480 acaaaaagta tgtgtagtcg caggcaagta gtttgcccat aagtatgttc tcatcgagca 540 cttcgacctc agccttccac aaggccgtcc acttcaagca atgccttcag cagctttgcg 600 aatatagtta catttaaggc cgaaatactg tgggtatttc aaaccgtaag ccgccaccac 660 tcgtacacct caaatgagga cgtccaccta gtttttcaag caatgttccc tgactcagag 720 tgtgcgagta cgttcacatg tggaagagac aaaacagtat atttagcacg atttggtgtt 780 gccccatacc taaagaagaa actaatttca cgagcaaaca aggacgcttt catcataatg 840 ttcgatgagt ccatgaacgc agcaacgaag acttagactt gcactttaga cgttggtcaa 900 ctgacgagac cggtaccctg ttgtcagggt tgtaaggtca atcagcactc ggcataaaag 960 tttttgcgtc cctttgacct gtcttgagcg tggctctggt gagttgtgcc acgctaatgc 1020 ttcaccacat cattctgtgt tgacacactg aacattttcc tgtttttttt gtttaggtaa 1080 agtcttaaat tttcatttta gaggacttaa aaaggtctta aaaggtatta aatttcctgt 1140 taaaaaatgt gcagataccc tg 1162 // ID Gypsy80-I_DR repbase; DNA; ZEB; 7430 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy80-I_DR; KW Gypsy80-LTR_DR; Gypsy80_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7430 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1475-1475 (2008). XX DR Genome; chr12; Positions 29215471 29208042. XX CC Positions [3512-4051] - Reverse transcriptase CC Positions [5553-6029] - Integrase core CC 'CATG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 861..2219 FT /product="Gypsy80-I_DR_2p" FT /translation="MEKFNTTGLGRGRGVFQTNSTFSMGRGFPTHPLTSTP FT RVRSSSCEANSLNGPDENEWRSPAVERSIPVDLLTTVVKEIGKSISENIVS FT CLELRTTGCGDGSRVGHTGTGASFDAQNLKLMLQSDVKEPEYYRGDGSDKC FT TIHEWVELMSVYLRKREYSQECQAEEVLGRLMGRARDVVKIALRNYLKTDL FT STGPGPIYDILKQHFSEAAYSAMPLADFYATLPQKGESPFDYWIRLNRAMD FT VAEDCLKAQGKKLDNSSQEVSVMFVRYCPDPQLQLVFKCKPQQHWTAAEIQ FT ERLDEFQREAKYTQNQLLFLAQKQEAVRPPLHYDHNEASSLESSIVKASIP FT QVQQSSVHNSVESLERLISRLERVLEKVPGEGAKSAEIPVRTRVKTRADGR FT GECKVCRDVNHTTSHHCRADGLCFICYAPNHTRFECPTAVDKGRVKSQSTA FT YPSGQQGN" FT CDS 2372..5452 FT /product="Gypsy80-I_DR_1p" FT /translation="MHEDLFYTDVVLENTLTVRGMIDCGSMACTLSPVALS FT KLQEAGIVLMMSGGVTKPSDVVLVGCGGLRTQPSSMVDIKMTVCGCTVLVP FT TLVVDGQIDDLIVGSNVIKYLISELKQTGNMVEELSFDASNNSDHNKLIQL FT LSNVERWKGSSVPDKVGTVRLKRSVTLEPLREHLVWAKLTELKNLSAGSAI FT IAEPCRARSRPRNVLIGRTVATLWADGWVPVKVINPSTVPITLRRNAKLAD FT IYPCLALEDFEDDLTVDKEVSYSVGQNVNYAKRHDNSSVNSLSDTNVTEED FT CEFGEKLTALGLADIDLGSCQLSTDWKAKLVDLIARYECIFSKHRLDCGKA FT EGYVHRIRLSDERPFRLPCRRLPPNQYDKLRQALNEMEDREIIRKSCSEYA FT SPLVIVWKRNGDLRLCTDFRCLNARTIKDAHPLPHQADALAALGGNVFFST FT MDLTSGYYNVEVHEEDRKYTAFTSPFGFYEYNRLPQGLCNSPATFMRMMLR FT IFGDQNFLSLLCYLDDVLVFAPNEELGLKRLELVFERLKTHNLKLAPKKCH FT FMRRSVKFLGHLVSEAGVASDPDKIKVITEMTESDLMDDRTGAPSQQKIRS FT FLGMVVYYQQFIEQCSVIARPLFQLTAGPKGHKSKKSKGKIAWKRTLTADD FT WTDDCRRAFGQLKQALIDCVLLAHPDFNRPFLVSVDASSGGLGAVLSQVPE FT NGGVTRPVCFASKSLSFAQSKYPAHRLEFFALKWAICDKFSHWLRGRPFTV FT WTDNNPLTYILTKPKLDACEHRWVARLAPYQFSIRYIPGSRNVVADALSRE FT PFVKQSALHRLTRVPYDSLLKEASAVQGERVQDVFRWSAHPFVPGTELGIG FT STPGMLVVNSQSSVSRLAGECSALEVSAVMEAKRFDSVTPHCDLLLPQLIQ FT QIMPTNQTGSMSLSHEELAEMQQTDSVLARVISFVEKGKKPSSRECKQEPI FT EVRRLLRSWKKFEMRAGVLYRVAREVVCKKKSFQYVVPKSLVGTVLKGVHD FT DAGHQGQQRTLYLVRQRFFLA" FT CDS 5658..6923 FT /product="Gypsy80-I_DR_3p" FT /translation="MAHAFPCPNQSAKSVAKQLWNNFFCIYGFPKRIHSDQ FT GANFESMLISELLSVAGVEKSHTTPYHPMGNGVVERFNRTLGNMIRALPIA FT AKRRWPSLLKSLTFAYNCTIHETTGYAPFQLMFGRTPRLPVDVVFGSVLRD FT SAVTDYDEYVKVLLSDLKHAVTIASETAGKQLKRHTDLYNRKLKGAPVDVG FT DRVLLANKGERGKRKLSDRWESSLYIVIGRNSETNTFKIENSSTGQVKTVH FT RNLIMPVKFLPLPNDLVEGPESVEECVGTVEARALELLPQPAVDQRTVVWV FT SGLESSEGCDEVTNTMEDRMTCQNMTENYDSLSDSVRLVGLGEPVSEDTDR FT VIDTNSNLFEYVQPLTDSASLSDVVTVASVQSNDLYDPGLGLLTSVRSKAG FT RIIKPVVRLIETMQLQRVDNCVKLPVWV" XX SQ Sequence 7430 BP; 1961 A; 1311 C; 2013 G; 2145 T; 0 other; ttggtgccgt gatcggtctt ctaagcgatt tcggtggccc agcgggatgc ggacaattga 60 gtggcggtgc tcggcggact tcggaggccc agcgggacgc ggacaattga gtggcggtgc 120 tcggcggact tcggaggccc agcgggacgc ggacaattga gtggcggtgc tcggcggctt 180 cggaggcccg gcgggacgcg gacattcgag cggcggtgct cggcggactt cggaggcccc 240 gcgggatgcg gacaatcgag tggcggtgct cggcggattt cggaggccca gcgggacgcg 300 gacaatcgag tggcggtgct cggcggattt cggaggccca gcgggacgcg gacaatcgag 360 tggcggtgct cggcggattt cggaggccca gcgggacgcg gacattcaag tggcggtgct 420 cggcggactt cggaggccca gcgggacgcg gacaatcgag tggcggtgct cggcggattt 480 cggaggccca gcgggacgcg gacaatcgag tggcggtgct cggcggattt cggaggccca 540 gcgggacgcg gacaatcgag tggcggtgct cggcggattt cggaggccct gcgggacgcg 600 gacatttgag tggcggtgct cggagtgaca tggacacttg agttttggtc cctgcagact 660 cgccgctaag cttagaagag aaaggtggtt gtttacattt tgcacgcatt tttataggct 720 aacattacga acactgaact gagttatgtg attaaatgag tgtagactga tttactatat 780 catttaaggt gattttgaca ttattttttt catatttcat ggtttggttt cattacttta 840 acaactgact tttacagggg atggaaaaat ttaatactac tggtttgggg cgagggagag 900 gggtgtttca aacaaattca acgttttcca tgggtagggg ttttcctaca caccctttaa 960 cttccacgcc gcgggtacgc tctagttctt gtgaagcaaa ttcacttaat ggacctgatg 1020 aaaatgaatg gaggtcccca gctgttgaga gaagtattcc tgttgatttg ttaactacag 1080 tggtaaaaga aataggtaaa tctataagtg agaatatagt ttcttgtctg gaattgagga 1140 caacaggttg tggggacgga tcaagagtgg gacacactgg cactggagcg tcatttgatg 1200 ctcaaaattt gaaactgatg ttgcaatctg atgtaaaaga gccagaatac tatcgtgggg 1260 acggctcaga taaatgcact atacatgaat gggtggaatt aatgtctgta tatttgcgca 1320 agcgtgagta tagtcaggaa tgtcaggctg aggaagtttt gggtaggtta atggggagag 1380 cacgtgatgt tgttaaaatt gctctccgca actacctgaa aactgatttg agtactggtc 1440 caggacctat ttatgacatc ttgaagcagc atttcagtga agcggcatac tctgctatgc 1500 cattagctga tttttacgcc acattgcctc aaaagggaga aagcccgttt gattattgga 1560 tcaggctaaa tagggcaatg gatgttgcag aagactgttt gaaagctcaa ggaaaaaaac 1620 ttgataattc gtctcaagag gtatcagtaa tgtttgttag atactgcccg gaccctcaat 1680 tacagcttgt gttcaaatgc aaacctcaac aacattggac agctgctgaa attcaagaaa 1740 ggttggatga atttcagaga gaagccaaat atactcaaaa tcaattgtta tttttggctc 1800 agaaacagga agctgtcaga ccacctttgc attatgatca taatgaagca agtagtctcg 1860 aatcgtcaat cgttaaagcc agcattccac aagttcagca gtcttctgtt cacaattcgg 1920 ttgaatcctt agaacgcctg atctcacgtc tagaacgagt attggaaaag gttcctggtg 1980 aaggggcaaa atctgcagag atacctgtgc gtacgagggt caaaactaga gcagatggga 2040 ggggggagtg taaagtatgt agggacgtta atcacacaac ctcccaccat tgtcgagcag 2100 atggactctg ttttatatgc tatgccccaa atcacacacg ttttgagtgc ccgactgctg 2160 ttgataaggg gcgtgtcaaa tcacagtcga cagcatatcc ttctggtcag cagggaaact 2220 aggtggcccg catacagtgg ggagggatgc agggcctgat aaaattcccc agtcaattga 2280 caatgtggat ctggaatcct tatattctgc tgtccgggag gatactgatc aaaacgtgac 2340 tgtaatactt caaaatacac acaaactatc aatgcatgag gacttgttct atacagatgt 2400 cgtcttggaa aacacactga ctgttcgggg tatgattgat tgtggctcaa tggcctgcac 2460 tttgagtccc gtcgctttgt ctaaattaca ggaagctggg atagttttaa tgatgtcagg 2520 tggggtgacc aaaccgtcag atgtagtact agttggctgt ggtggattga gaacgcaacc 2580 aagtagtatg gtcgatataa aaatgactgt ttgtggatgt acggttctgg ttccgacttt 2640 agttgtagac ggccaaattg atgatctcat tgtgggcagt aatgtgataa aatatctgat 2700 cagtgagttg aaacaaacag gaaatatggt tgaagagttg tcatttgatg cgagcaacaa 2760 ttcagaccat aacaaactaa ttcagttgct ttcaaacgtc gagagatgga agggatcgag 2820 tgttccggac aaggttggga cagtcagact gaagaggtca gtaacacttg agcctttgcg 2880 ggagcattta gtgtgggcta aacttactga attgaaaaac ctgtcagcgg gtagtgcgat 2940 tatagcggag ccttgcaggg cccgctcaag acctagaaat gttctgattg gccgcacagt 3000 ggcaacattg tgggcagatg gatgggtacc tgttaaggta ataaatccct ctactgttcc 3060 tattactttg agacgtaatg caaagttggc tgacatttat ccttgtttgg cattggagga 3120 ttttgaggat gatttgacgg ttgataagga agtgtcatac tcagtgggac aaaatgttaa 3180 ctatgcaaag cggcatgaca attcaagtgt caacagtttg tcagatacca atgtgacaga 3240 ggaagactgt gagtttggag aaaagttaac tgcattgggt cttgcagata ttgatctagg 3300 ctcctgtcag ttgtctactg attggaaggc aaaactcgtt gatctgattg cacggtatga 3360 gtgtatcttt tcgaagcaca gactcgattg cggtaaagca gagggttatg tgcatcgaat 3420 aagactgagt gatgagagac catttagact accatgtaga aggcttccgc ctaatcagta 3480 cgacaagctt aggcaagctt tgaatgagat ggaagacagg gagattatta ggaagtcgtg 3540 tagtgagtat gcctccccgc ttgttattgt ttggaagaga aacggagatt taagactctg 3600 taccgatttt aggtgcctca atgctcggac gattaaagat gcacatccgt tgccacatca 3660 ggctgatgct ttagctgcac tgggtgggaa tgtttttttc tccactatgg acttgacatc 3720 tgggtattac aatgttgaag tgcatgagga agacaggaag tacactgctt tcacatctcc 3780 ctttgggttt tatgagtata accgtcttcc tcagggactt tgtaatagcc cagcgacgtt 3840 tatgcgaatg atgctcagaa tttttggtga tcaaaatttc ttaagccttc tctgttattt 3900 ggatgatgtg cttgtttttg ctccgaacga agagttgggc ttaaagagat tagagttggt 3960 ttttgagcgt ctgaaaacac ataacttgaa attggcacca aagaaatgtc actttatgcg 4020 aagatctgta aagttccttg gtcatctagt tagtgaggca ggtgtagctt cagatccaga 4080 taaaataaaa gtgattacgg aaatgactga gtcggatctc atggatgata ggactggggc 4140 accatctcaa caaaagatac gttcttttct tgggatggta gtgtattatc agcagtttat 4200 agagcagtgt tcagtgattg cgaggcctct gtttcagttg actgctggtc cgaagggtca 4260 taaaagcaaa aagagtaagg ggaagattgc ttggaagaga acgcttactg cagatgattg 4320 gactgatgat tgtagacggg catttggtca gctcaagcag gccttgattg attgtgtttt 4380 gttggctcac ccagatttta acaggccatt tctggtatca gttgatgctt ccagtggtgg 4440 cctgggagct gtgctttcgc aggttccgga aaatggggga gtcacaaggc cagtttgttt 4500 cgccagtaag tcactttcct ttgctcagtc aaaataccca gcacatcggc tggaattttt 4560 tgcattaaag tgggctatat gtgacaaatt tagtcactgg ttgagaggtc ggccctttac 4620 ggtctggacc gataataacc cgcttacata tatactaaca aaaccgaaat tggatgcttg 4680 tgagcaccgg tgggtggcta ggttggctcc gtatcagttc agtattaggt atatacctgg 4740 ttcgaggaat gtcgtagctg atgctcttag tagggagccg tttgtgaagc agagtgcttt 4800 gcatcgcctt accagagtgc cttatgactc ccttttgaaa gaagcatcgg cagttcaggg 4860 agaacgggtg caggatgttt tcagatggtc tgcgcatcct tttgttcctg gtactgagct 4920 tggtattggg tcaactcctg gcatgctggt tgtgaactct caatcatcag tgagtaggtt 4980 ggcaggtgaa tgttcagcat tggaagtgtc tgctgtaatg gaagcgaaga gatttgactc 5040 agttacacca cattgtgacc ttttgctgcc tcagttgatt cagcaaatta tgcccaccaa 5100 tcagacagga tcgatgagtt tgtcccatga ggagttggca gagatgcagc agactgacag 5160 tgtattggcc agagtgattt cgtttgtgga gaagggtaag aagccatcta gccgtgaatg 5220 taaacaggaa cctatagaag tgaggaggct acttagatcc tggaaaaagt ttgagatgag 5280 agctggggtg ctttatagag tagcgaggga ggttgtatgt aagaaaaagt catttcagta 5340 tgtagttcca aaatcattgg tggggacagt cttaaaaggt gtgcacgatg atgctggcca 5400 tcaaggtcaa cagaggactt tgtatctggt gagacagcgt ttttttttgg cataaacttg 5460 agagggatgt cagagattat gtcaaatgct gcgagaggtg tgtctatggc aaatctccag 5520 aaccagaggc gagagctcct cttgaaaata ttgttacgtc tgagcctttg gagcttgtgt 5580 gcgttgattt ttggtcagct gaggacagta aaaacaaatc agtggacatt cttgtggtca 5640 ctgatcattt tacgaagatg gcgcatgctt ttccctgccc taatcagtca gccaaatctg 5700 ttgcaaaaca gctatggaac aacttttttt gtatatatgg attccccaaa cgaattcact 5760 ccgatcaggg agcaaacttt gaaagcatgt tgatttctga gttattgtct gttgcgggag 5820 tggaaaagtc gcacacaact ccatatcacc ctatgggtaa tggggttgtg gagcgattta 5880 atagaactct tgggaacatg attcgggctc ttcctattgc agcaaaacga cggtggccta 5940 gtttgttgaa atctttgaca tttgcatata attgtacaat acatgaaacc actggatatg 6000 caccttttca actgatgttc ggcagaactc cgagacttcc tgtggatgtg gtttttggat 6060 cagttttgcg agacagtgct gttacggatt atgacgagta tgttaaggtt ttgttaagtg 6120 acctcaaaca tgctgtgaca attgcgagtg agactgctgg taagcaactg aaaagacaca 6180 ccgatttgta taacagaaaa cttaagggtg ctccggtgga tgttggggac agggttctgt 6240 tagcgaataa aggtgagcga ggaaaacgta aattgtctga ccgttgggag agcagtctat 6300 acattgtgat tggaaggaac tcagaaacca atacttttaa gattgagaat tcttccacgg 6360 gtcaagtgaa aactgttcat aggaacctga tcatgcctgt aaaattcctg cctcttccaa 6420 atgatttggt agagggacca gaaagtgtgg aggagtgtgt tgggactgtg gaagctcgag 6480 cattggaact tttgcctcaa cctgcagttg atcagagaac tgtggtatgg gtttcagggc 6540 tcgagtcttc agaaggttgt gatgaggtaa ccaacacaat ggaggacaga atgacttgtc 6600 aaaacatgac cgagaattat gactctctca gtgattcagt gagattagtt gggttaggag 6660 agccggtttc ggaggacact gatcgagtca tagacacaaa ttcaaacttg tttgagtatg 6720 ttcagccgct aacagattct gccagtctaa gtgatgtagt tactgtggcc tcagttcaat 6780 ctaatgatct atatgatcca gggttagggt tgttgacgtc tgtgagatcg aaggctggga 6840 gaatcattaa accggtggtg cggcttatag aaactatgca attgcaacgt gtagataact 6900 gtgttaagtt gcctgtatgg gtttaatatg atgatatttt ttttttttgc tgttgttttt 6960 tttaccctat tttgggtata tttctaattg catttaggat ttgtgaatgt attggacaat 7020 acatttagac cacttgaaat ggtaatatgg gatggtggta tgctgactga gggtattgaa 7080 catgtaacag gcatgttttt ctatgggttt aaaagggctt agctaaccct tcttttctct 7140 aattcttcaa aagaatggta tatgtactga gatgctcatc agcattattt tatttttgag 7200 ataaacattc catggtttgc atttattttt ttgttttgtt tttatgtttt gttttgatga 7260 ttggagtaat tttgttataa atttgaatac atttctatgt attttatttt attttattat 7320 ttattttttt tgttaaagtt gttgaatctg actgttgtta gtaataagtg atctttgctg 7380 ctttgaattt gtaattgttt aatgagcaag tggagttcag cgggggagaa 7430 // ID Gypsy88-I_DR repbase; DNA; ZEB; 4328 BP. XX AC chr10; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy88-I_DR; KW Gypsy88-LTR_DR; Gypsy88_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4328 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1487-1487 (2008). XX DR Genome; chr10; Positions 24369384 24373711. XX CC Positions [1753-2208] - Reverse transcriptase CC Positions [3223-3702] - Integrase core CC 'GTAAC' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 20..1009 FT /product="Gypsy88-I_DR_2p" FT /translation="MEPECPSSQELLATVAQHGSTVQRHENVLVQQEALMA FT KHSQVLSELMTTVRQLFDRLPTCTTPASITTPAPMSESRAPSPMMEPRLPP FT PRHFSGDPSACDGFLTQCSLTFELQPSSFPSDRAQIAYVITLLSGKALSWA FT TAVWKAKSPFCSSYMAFEQEFKRVFDHPLSDRQASKKLLTLRQGTGSVAEY FT AIQFRTVAAGSGWNDEAFMVCFQNGLSEIIQDDLATREPALDLESLIDQAI FT KLDNRLRERHLYRATVSTEGFTPTSSSPVLLHQDNPEPMQLGRTRLSPTER FT DRRMRERCCLYCGLSDHFRSTCPQLSGNAQSRTGREGL" FT CDS 1135..4287 FT /product="Gypsy88-I_DR_1p" FT /translation="MDITLAKDLNIPIDSLPEPLSVTALDGRPLAPGEVTH FT LTSHLSLSIYQHEEKLCFHLIQSPEFPVILGHPWLLQHNPHIDWVSGTLLD FT WSPTCQNTCLNLKPLNPIFESREPVDLSQVPAQYHHLKAVFSKKRATSLPP FT HRPYDCAIELLPDSCPPRGSIFSLSLPERTAMESYIEESLAAGIIRQSTSP FT AGAGFFFVGKKDGGLRPCIDYRGLNKITIRNRYPLPLMSTAFEILQEASIF FT TKLDLRNAYHLVRIKQGDEWKTAFNTPTGHYEYLVMPFGLTNAPAVFQALI FT NDVLRDMLNKFVFVYLDDILIFSSSLQEHIFHVRKVLQRLLNNHLYVKPEK FT CQFHVTQVKFLGFIIKPGQIQMDPQKIQAMVDWPSPSSVKEVQRFLGFANF FT YRKFILNFSTVAAPLSALTKENGAGFHWGPEAEEAFIKLKKRFTSAPILLI FT PNPDKPFMVEVDASDVGIGAVLSQRGEDNKLHPCAFLSHRLTPTERNYHVG FT DRELLAVKLALEEWRHWLEGAKHPFQVLTDHKNLEYVQQAKRLNPRQARWS FT LFFNRFHFTLTYRPGSKNLKPDALSRAYAKSIIEDCKKSIVPSSKIIAPIR FT WDLEGLVRKAQTKDPDPGGGPTDSLYVPKVVRSKVLQWGHSSQLTCHPGSV FT RTLEFLQRRFWWPTIKRDVTMYVKACPTCNQNKSSHCAPQGLLHPLSIPHR FT PWSHISMDFITGLPSSQGNTTILVIVDRFSKAARFIPLSKLPTAKETAELV FT INHVFRVFGIPLDVVSDRGPQFSSRFWQAFCQSIGATASLSSGFHPESNGQ FT TERLNQDLETTLRCMAANNPTAWSRFIIWAEYAHNTLRSSATGMSPFECQF FT GFPPPLFPEQEVEVAVPSALQFVRRCRQTWRKARLKLLKVSQQYKHQANRR FT RRPAPTLRPGQRVWLSTRNIPLRVDSRKLSQKFIGPFKIARKVNPVTYKLY FT LPKSLKINPTFHVSLLKPVLSSPFSVTEKPPPPRSFGGQPVYTVRRILDVR FT QVRRTRQYLVDWEGYGPEERSWVPAKDILDPKLIQDFHKSHKSSP" XX SQ Sequence 4328 BP; 1097 A; 1234 C; 877 G; 1120 T; 0 other; gaataactga gccaacaaca tggagccaga atgccccagt tctcaggagc tacttgccac 60 agtggctcaa catgggtcaa ccgtccaacg ccatgagaat gtcctagtac agcaagaggc 120 cctgatggcc aaacactctc aggtactttc tgaactaatg actactgttc gccaactttt 180 tgacagattg ccaacctgca caactccagc ctccattacc accccagcac ctatgtcaga 240 atctcgagca ccctctccca tgatggaacc tcgtctacct ccaccacgac acttttcagg 300 tgatcctagt gcgtgtgatg gtttcctaac ccaatgttct ctcacctttg aattgcagcc 360 ttcatctttt ccatcagatc gggcccaaat tgcctatgtg atcacactgt tgtccggtaa 420 ggccctttcc tgggccactg cagtatggaa ggctaaatca cccttttgct caagctacat 480 ggcgtttgaa caggagttta agcgagtctt tgaccatccc cttagtgacc ggcaagcctc 540 taaaaaactc ctcacccttc gacaagggac tggcagtgtg gcagaatatg ctattcaatt 600 tcgaacagtg gctgcaggga gtggctggaa tgatgaagct tttatggtct gctttcagaa 660 cgggttatct gagattattc aagatgattt ggctaccagg gaacctgccc tagatctcga 720 gtccctaata gaccaagcaa ttaaactgga taatcgactg agggagagac atctttatcg 780 cgccactgtt tccaccgaag ggttcactcc aacctcctct tccccagtac tgttacacca 840 agacaatccc gagccaatgc agttgggcag aactcgcctt tctcctacgg agagggaccg 900 ccgcatgaga gagcggtgtt gcctctattg tggtctgtct gatcattttc gctccacctg 960 tccccagctt tcgggaaacg ctcagtcccg tacaggcagg gaaggactgt aacgggagtt 1020 acctacagca cctcaccttc caaatgtggt ttgttcctcc ccatcaccct cacttgggca 1080 gaccaaagat cccaacttca agcctttatt gactctggtg ctgcaggaaa cttcatggac 1140 atcaccctgg caaaagacct caatattccc attgactctc ttccagaacc actgtcagtc 1200 acagcactgg atggcagacc attagctcca ggagaggtta ctcaccttac ttctcacttg 1260 tctctttcca tctaccaaca tgaagagaag ctctgtttcc acctcataca gtcacctgag 1320 tttccagtta tcctcggaca tccctggctt ttgcagcata acccccacat tgactgggta 1380 agtggaaccc ttctggactg gagtcctacc tgtcaaaaca cctgtttgaa tctgaaacct 1440 ctgaacccca ttttcgagtc tcgagaaccc gtggatctgt cccaagttcc agcccagtac 1500 caccacctca aagctgtctt tagcaagaag agagctacct ccttaccgcc ccaccgtcca 1560 tatgattgcg ccatcgagct acttccagac tcctgccccc caagaggcag tattttttcc 1620 ttgtcccttc ctgagcgtac tgccatggaa agttatatcg aggaatccct cgcagcagga 1680 atcatccgac aatccacctc tcctgcaggc gcagggtttt tctttgttgg aaaaaaggat 1740 ggtggtctgc gcccatgcat agattacagg gggcttaata aaatcaccat ccgcaaccgc 1800 tatcccttgc cactcatgtc cactgctttt gagatcctac aagaggcttc tattttcact 1860 aaacttgatc tgcgcaatgc ttaccacctt gtgcgcatca aacagggaga tgagtggaag 1920 accgctttca acacgcccac agggcactat gaatatctag tgatgccctt tggtcttacc 1980 aatgctcctg cagtgtttca agccctcatc aatgatgttc ttcgagacat gctgaataag 2040 tttgtatttg tctacctaga tgacatcctt attttctcaa gctccctcca agaacatata 2100 tttcacgtcc gcaaggtcct ccaacgtctt cttaacaacc acctatatgt taaacccgag 2160 aaatgccagt tccatgtaac ccaagtcaaa ttccttggat ttatcatcaa gcccggacaa 2220 atccagatgg atccgcaaaa gatccaagct atggtggact ggccttcccc ctcatcggtc 2280 aaagaggtgc aacgattcct tggttttgcc aatttttata ggaaattcat tctgaatttc 2340 agtactgtgg cagccccatt atccgccctc accaaggaga atggtgctgg gtttcattgg 2400 ggacccgaag ctgaagaggc ttttatcaag ctaaaaaagc gcttcacatc agcccctatc 2460 ctcctcattc cgaaccctga caaacctttc atggtggagg ttgatgcttc cgatgtgggc 2520 attggagcag tgttgtctca gagaggtgag gataacaaac ttcacccatg cgcattcctc 2580 tctcatcgtt taactcccac tgagagaaac taccacgtgg gtgataggga gttactggca 2640 gtgaagctag ctctcgagga atggagacac tggcttgagg gagccaaaca cccctttcaa 2700 gttctcactg accataaaaa tcttgaatat gttcagcagg cgaagcgact caacccccga 2760 caagccaggt ggtccttatt tttcaaccgt tttcatttta ctctcaccta ccgccctggt 2820 tcaaaaaacc taaaaccaga cgccttgtcc cgagcatatg caaaatccat tattgaagac 2880 tgcaagaaat ctattgtccc cagctctaag atcattgccc ccattaggtg ggatctggag 2940 ggtcttgtaa ggaaagccca aactaaagat cccgacccag gtggaggtcc tactgacagt 3000 ctgtatgtcc ccaaagttgt tcgctccaaa gtcctccaat ggggacattc atcacaactt 3060 acttgccacc cagggtctgt ccgcactctc gagtttctgc agagacggtt ctggtggcca 3120 accattaaaa gagatgtcac catgtacgtc aaagcatgcc ccacctgcaa ccagaacaaa 3180 agttcccatt gtgcacctca gggacttctg catccacttt ccatacccca caggccctgg 3240 tcacatatat ccatggactt tatcacagga cttccttcat cccagggaaa caccactata 3300 ttagtcatcg tggacagatt ttccaaagct gcaagattca ttcccctgtc caaactcccc 3360 actgctaaag agacagctga attagtcatc aaccatgtgt ttcgagtctt tggtattccc 3420 ctggatgttg tctctgatcg aggacctcaa ttctcctcca gattctggca agccttttgt 3480 caatccatag gggccactgc aagtctatct tcaggtttcc accctgagtc gaatggtcaa 3540 actgagagac ttaatcagga ccttgagacc acactgagat gtatggctgc taataaccct 3600 actgcatggt ctcgcttcat catatgggcc gaatatgcgc ataacaccct ccgttcttcg 3660 gcaactggta tgtccccttt cgagtgccaa tttggttttc ccccgccact atttcctgaa 3720 caggaagttg aagttgcagt tccctcagct ctccagttcg ttagacgctg tcgtcaaacc 3780 tggagaaagg cccgtcttaa gctcctcaag gtttctcaac aatacaaaca tcaggctaac 3840 cgcagacgca ggccagctcc aactcttcgt cctggccaaa gagtttggct gtcaaccagg 3900 aatattcccc ttcgagtgga ttcaagaaaa ctttcccaaa agttcatcgg cccttttaaa 3960 attgccagga aagtaaaccc agtcacttat aaactatact tacccaaatc cttaaaaata 4020 aaccctacct ttcatgtttc attacttaaa ccagttcttt cttctccttt ctctgtgaca 4080 gaaaaacctc cccctcctcg ttcttttgga ggccaaccag tctacacagt ccgccggatt 4140 ctggatgtac gccaagttcg tagaacccgg cagtatcttg tagactggga aggttacggc 4200 cctgaggagc gttcctgggt tcctgccaag gacattttgg accctaagtt aattcaagac 4260 tttcacaagt cccacaagtc aagtccctag gcaagaacgt caggagccgt tcctagaggg 4320 gggggccc 4328 // ID DNA-AT-2_DR repbase; DNA; ZEB; 1441 BP. XX AC . XX DT 28-OCT-2008 (Rel. 13.1, Created) DT 28-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE non-autonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD AT; KW DNA-AT_DR; DNA-AT-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1441 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1244-1244 (2008). XX DR [1] (Consensus) XX CC This family is characterized by AT specific TSD and ~100-bp TIR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1441 BP; 409 A; 260 C; 290 G; 480 T; 2 other; agagtggagg aactatgacg tcaatttgta tgcaaaaacc cggaagcgag ttagcatttt 60 agcacttccg gttccctcgt cccaaagtca atgggttttt tgaatggggt ttcagtaaaa 120 tcgcttaaat aaggtccgtg gctaacacaa attcaagata ttttcacgtt ttgttctacg 180 acataaaaca catcagttac agcacactct tgaattttta attcggccat gcttctttaa 240 aaagacggtt gctatcaagt tgctaaatgg gactacaggc ggtgtcggag acattatacg 300 tcatcgagct gatctggtct ggagcgcagc tcgcttgtgt tgggcatttg gatttgtctg 360 ttatttaggt attttcatga atagtatttt atagtttttt gtgtttttct aattaagaat 420 tcagattgtg tgtagataat kccttcacat gtaaaggctg tcgagacaga ttcatttgtt 480 gatcatttat tttaattaaa ctgatattat gaagatactg ctcaccagtg cagcctgtct 540 ctctcctgct ctcagtaatg caaacgtgtg aataaatgtg taaaaataca gacatattaa 600 ctgtcatttt aacgtgatca agtgattttt cgtctttttt tattcatctg aacacatttt 660 actcagtcat aactgcaata tttacactcc tccataaaac gtatagcaga tgtgactgta 720 tgggctgctt ttcgctgtac gtcgtgacat aaaaggaata ttgaatgttg ctctgtgtcc 780 atcatgatgg aacttgcagc catttgatag acttgttcct cctcacgcct catttctgtc 840 atctgttaag gtaagcaggc tgtgtgcacg cgagcgatac acttatttaa atatgtctga 900 tgaaaatact atataggcag atttatacac gcagttttaa aacttttaga acaaccgcaa 960 agcaaaacag atgtatttcc cacgtaaaaa cgatccaaaa ggcacatagg tctatgtgtt 1020 tttgcgtatt tatttgttta taatatatag cgtggattat aatataataw acagagagat 1080 taactctttg tcatggacat tatttctaaa aataagcttg ataagttgtc tgtagcaggg 1140 tggggctgac gtcacagacg agcgccccga gggctctgta gtccgtttat agcctaatgt 1200 tagcttttta actcttgcgt ttgcatttaa gatccaaaag tgctccaagt tgtattttcg 1260 tgtgaggatt atccggctgg acaaaacgtg taagtgtaat gaactgtgtt tgaacacaga 1320 gcttattatt tgcaatcttc gaaaagccta tgggaaaatc ctatagggat tttcacgagg 1380 gaaccagttt tatgctagca gccgattagc ctacaaagtg acgtcatagt tcctccactc 1440 t 1441 // ID Gypsy8-I_DR repbase; DNA; ZEB; 6655 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portions of the Gypsy8_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy8-I_DR; Gypsy8-LTR_DR; Gypsy8_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy8-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6655 RA Kapitonov V.V. and Jurka J.; RT "Gypsy8_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 321-321 (2004). XX DR [1] (Consensus) XX CC Gypsy8-I_DR is an internal portion of the Gypsy8_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as Gypsy8-LTR_DR. CC Gypsy8_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 622-aa gag-like CC Gypsy8_DR-1p (pos. 93-1958) and 1557-aa pol Gypsy8_DR-2p (pos. CC 1985-6655). The primer-binding site (PBS) is complementary to CC the Arg-tRNA. XX FH Key Location/Qualifiers FT CDS 1985..6655 FT /product="Gypsy8_DR-2p" FT /note="Pol-protein" FT /translation="MGTKRKQSPNNQQALNKHTDTRNSLKLPAGLVGMKCT FT ARVKIEEKEVNCLLDTGSQVTTVPMSFYNRYLSRHPMQPLNHLLEVEGANG FT QAVPYLGYVELTLKFPQEFLGSEAEVPTLALVVPDLTQTPQILIGTNTLDV FT LYAGHAQTVKPRVMSRYQGYQAVIRVLEARLQQAGVNVLGQVKVKGDMPEV FT IAAGSTVVLNGCVKVIGPLSETCVTIEPSVSSSLPGGLLVASSLHSLPAKR FT NVQIPIVLRNETQIDLTIPPKAVLAEVHAVQSVIEGRKPVNASVSKVTNPA FT QNRIIPDFGDSPLSNDWKEKITDLVNSMQDVFALHDLDYGHTNKVKHHIHL FT SDSTPFKQRARPIHPQDVDAVRRHLKELLDANIIRESESSFASPIVVVRKK FT NNDVRLCIDFRKLNSQTVKDAYALPNLEEAFSVLSGSKWFSVLDLKSGFYQ FT IEMEEADKSKTAFVCPLGFYEFNRMPQGVTNAPSTFQRLMERCMGDLNRKD FT VLVFIDDLIVFSKTLEEHKAKLLQVLTRLKEYGLKLSPKKCKFFQTSVQYL FT GHIVSQNGVETDPTKIEALKTWPRPKNLKELKSFLGFAGYYRRFVQDFSKI FT TRPLNDLTIGYPPLQKNQKRNPTKSTPYLDPKEQFGERWNQDCQWAFNTII FT EKLTSAPVLGYADPRLPYVLTTDASTVGLGAALYQEQEGRMRVIAFASRGL FT TKSEAKYPAHKLEFLALKWAVTAKFSDYLYGGEFTVITDSNPLTYILTSAK FT LDATSYRWLSSLSTFNFKIQYRAGKRNLDADGLSRRPHDEQIDDFASQKER FT ERIKQFTLHHLAETEQQVILPDAVKALCERHQVYQSCGDSNLPYSQLTVVE FT SLSQSVDAVPQEFQQEAGGLPVLPQMSEEELKECQRTDPVLEKVIRHLDSG FT KKPHGKVEPAEVALWLREWDRFEFKNGILFRKRQDPRGVLYQLALPKKLRG FT NVLKNLHNDMGHLGIERTLDLARARFYWPKMAITVEEKVKTCERCVKRKTP FT PERAAPLVNITTSRPLELVCIDYLSLEPDRSNTKDILVITDHFTKYAVAVP FT TRNQKAETVAKCLWDNFFIHYGFPERILSDQGPDFESRIIKELCGIAGIQK FT VRTTPYHPRGNPVERFNRTLLQMLGTLEDKQKTYWKDFVKPLVHAYNCTRN FT DTTGFSPYELMFGRQPRLPVDIAFGLPATGSSPSHSIYVRNLKDRLEESYR FT IATENASKLARRNKKRFDERVVTSFLEVGDRVLVRNVKLRGKHKLADKWEK FT EIYVVLKKAGDLPVYTVSPEGRDGPLRTLHRDLLLPCGFLQESMPEPVKPK FT PPRRPRTRANTSAREPDTMTESSDSEDDSMDHYSRRHLPKVESRILFNPRH FT VKPSRDRPIAELSSKTRVVKSKINDCPAIEIPQENLPYLPEDENCPMSEPE FT RENTPVMDSVRPDMLRNVPVVNDQELLEQRDELEILSEIDDEADQRNIHSG FT QAVIDQVERNTLRRSQRHREPPQRLQYSQLGNPLSLVIQSLLQGLSTAVTA FT SLEESDCPREASLLMQKMFPSAAVTQPKRCRGTCIDSRRGE" FT CDS 93..1958 FT /product="Gypsy8_DR-1p" FT /note="Gag-like protein" FT /translation="MDVIEREDVKIENAVIISGLTLTETDEVVESYLLRFG FT SIRQNFLIDDPQSVFHHNAVVEFSHNSAICNIEPQLPLTIVSPTDMSVIFR FT IHKLSTAYSTATSSDTAKQLKSQEDKNVESQQDIIKQNSMKMNGVTHLDDA FT SHREDCGLENLKSSTKVLTSVTNTPQRTGSFKPSALKIDNNNPAKVNMSTT FT SPHSLTEANSVSVRIPISSMHPPDIQRVIMEHVVKMSETVPQHSASFRLKA FT FSGRMPRPGNEPDFDTWRTSVDYLLNELSLSESHKMQKILDSLLPPASDVI FT KHVSPNAPASECLRLLESVYGSVEDGDELLAKFINTLQDPGEKSSAYLHRL FT YVLLCTTIRRGGIAESERDRYLLKQFCRGCWDNALIVELQLERIKADPLSF FT AELAVLLRTAEEKKTSKEERMRKHLGLGRPSPTPLKLRTITHQQSAHFNNS FT LEVDATNIPAFDNPKQKVSKQKSKTQCPETSEADALKKEIMALQSQITAIK FT TAADQEARERTEASELQQLKGQIAELKVQLATSGAQRKQFQKSPQQSNNPS FT DFGVSRERNERQKNTELRTNRPRPWYCFRCGDDGHLAIYCENAPNPLRVEE FT KRQKLREKQAEWDLRNGGATVPLN" XX SQ Sequence 6655 BP; 2188 A; 1425 C; 1423 G; 1619 T; 0 other; atcttggcga gccagccagg agcgagaaag cagcagcttt tgaggtgaat aattattgaa 60 atacgcatat atagtgaaaa tataacatta caatggatgt catagaacga gaagacgtta 120 aaatagaaaa tgcagtgatt attagtggtt taaccctaac tgaaacagat gaagtcgtag 180 aatcctacct tttgagattt ggttctatac ggcaaaactt tctgattgat gatccgcagt 240 ctgtatttca tcataatgca gttgttgagt tttcacacaa ctccgccatt tgcaacattg 300 agcctcaatt gcctttgact atcgtaagcc ctactgatat gagcgttata ttccgcatac 360 acaaattgag tactgcttac agcacagcca catctagcga cacagcaaag caactgaagt 420 cacaagagga taaaaatgta gaatcccaac aagatatcat taagcaaaat tcaatgaaaa 480 tgaatggagt gacacatttg gatgatgcat cacacagaga agattgtgga cttgaaaatt 540 taaaatccag cacaaaagtc ttgacaagtg taactaacac ccctcagcga actggaagtt 600 tcaagccatc tgctctaaaa attgataata acaacccagc caaggtgaac atgagtacta 660 catcgcccca tagtttaaca gaagccaatt ctgtatctgt gagaattcct atatcctcta 720 tgcacccccc agacattcag agagtaatta tggaacatgt tgtgaagatg agtgaaacag 780 tgccccaaca cagtgcttct ttccgcttaa aagccttttc tggacggatg cctcgtccag 840 gtaatgagcc tgacttcgac acatggcgga caagcgttga ctatttgttg aatgagttat 900 ccctttctga gtcacataaa atgcaaaaaa ttctagacag cctgttaccg cctgcttcag 960 atgtcattaa gcatgtgagc cccaatgctc cagcatcaga gtgtctgagg ttactagagt 1020 ctgtttacgg ttcagtggaa gatggagatg aattattagc aaagttcata aacactctgc 1080 aggatcctgg tgaaaaatca tctgcctatc ttcatagatt gtatgtgctc ttgtgcacta 1140 ccattaggcg tggagggatt gcggagagtg aacgagaccg ttatctcctg aaacagttct 1200 gccgtggctg ttgggacaat gccttgatag ttgaattgca gctagagaga ataaaagctg 1260 atccactctc ctttgctgag ttagcagtac tcttaagaac agctgaggag aagaaaactt 1320 caaaagaaga gagaatgaga aagcatcttg gtttgggcag accctcacca accccactca 1380 aattaagaac aataactcac caacagtctg ctcactttaa taactcactt gaagtggatg 1440 caactaatat tccagcattt gacaatccaa aacagaaagt ctctaaacag aaaagtaaaa 1500 cccagtgtcc tgaaacatct gaagctgatg ctttgaaaaa ggagattatg gctctccaaa 1560 gccaaatcac tgccattaaa acagcagctg accaggaagc aagggagaga actgaggcaa 1620 gtgagcttca gcaactaaaa ggacagatag cagagcttaa agtccaactt gccacttctg 1680 gagcacagag aaagcagttt cagaaatccc ctcaacagag taataatcct agtgattttg 1740 gtgtcagtcg agagagaaat gaaaggcaaa aaaacactga attaagaacc aatcgaccca 1800 gaccatggta ttgctttcgc tgtggagatg atggtcatct tgccatttac tgtgaaaatg 1860 caccaaaccc attaagagtt gaagaaaaga gacaaaaatt aagagagaaa caagctgagt 1920 gggatcttag aaatggagga gccacagtgc ctttaaacta aaatcagtct ctatcgcagg 1980 gcggatgggg actaagagaa aacaaagccc aaataatcaa caagcactta ataagcatac 2040 agataccagg aactccttaa agttaccagc cggactagtg ggaatgaagt gcactgctag 2100 agtcaaaatt gaagaaaagg aagtgaattg cttgctagac acagggtctc aagtcacaac 2160 agtccccatg tctttctaca accgctacct gtcacggcat cctatgcagc cgttgaatca 2220 tctgttagag gttgaagggg caaacggcca agctgttcct taccttggat atgttgaact 2280 gactctaaag tttccacaag agtttttagg atctgaggcc gaagttccaa cattggccct 2340 agttgtccca gacctgacac aaacacccca aatccttatt ggcactaaca ccctagatgt 2400 cttatatgct ggtcatgctc aaacagtcaa gcccagagtt atgtcacgtt atcaagggta 2460 tcaagctgtg ataagagttc tagaagcaag actgcagcag gctggtgtga atgtcctggg 2520 ccaagtgaaa gtaaaaggag acatgcctga agtgatagca gctggaagta ctgtagttct 2580 taatggatgc gtcaaagtta ttgggccact ctcagagact tgtgttacaa ttgaaccctc 2640 agtatcgtca tctttgcctg gtggattact tgtggcaagc agtttgcatt ctctacctgc 2700 aaaacgcaat gtccaaatac caatagtgct aagaaatgag acacaaattg atttaactat 2760 tcctccaaaa gcagtattgg ccgaagtaca tgctgtacaa agtgtgattg agggaagaaa 2820 accagtaaat gcttctgtga gcaaagttac aaatcctgca caaaatagaa tcatccctga 2880 ctttggtgac tctccgttat caaatgattg gaaggaaaag ataactgatc ttgtaaactc 2940 catgcaggat gtatttgcac ttcatgactt ggattatggc cacacaaaca aagtaaaaca 3000 ccatatccat ctcagtgata gcaccccatt caagcagcgt gctcggccta tccatcccca 3060 ggatgtcgat gctgtaagac ggcatcttaa agaactcctt gatgcaaaca tcatcagaga 3120 atctgaatcc tcttttgctt ctccaattgt agtagtaaga aagaaaaaca atgatgtacg 3180 cctctgcatt gacttcagaa agttaaactc gcaaactgta aaagatgcat atgccctgcc 3240 taatttggag gaagcctttt ctgttctatc tggctccaaa tggttttcag ttctcgactt 3300 aaaatcaggc ttttatcaaa ttgagatgga agaggctgat aaatcaaaga ctgcattcgt 3360 ctgtccttta ggattctatg aattcaacag aatgcctcaa ggcgtcacta atgcaccaag 3420 tacatttcag aggctgatgg agcgatgcat gggtgatctt aacagaaaag acgtcctagt 3480 tttcatagac gaccttattg tcttttccaa gacattagaa gagcacaaag ccaaactctt 3540 gcaagtcctg acacgactaa aagaatacgg attaaagctt tctcccaaga agtgcaagtt 3600 tttccaaaca tcagtccagt acttaggcca catagtctct cagaatggtg ttgaaacaga 3660 tccaaccaaa attgaagctc tcaaaacctg gccaagacct aaaaacctta aagaactgaa 3720 atcttttctt ggatttgcag gatactacag aaggttcgtt caagacttct caaaaatcac 3780 aagacccctt aacgacctta ctattggata tcccccactg cagaaaaatc agaaacgaaa 3840 ccccacaaaa agtacacctt acctggatcc taaagaacag tttggagagc gatggaacca 3900 ggattgtcag tgggcattta acacaatcat agagaagttg acctctgctc cagtcctagg 3960 atacgcagat cctagactcc cctatgtgtt gaccactgat gccagcactg ttggacttgg 4020 agcagctctt tatcaagagc aggaaggtcg aatgagggtg attgcctttg caagtagggg 4080 actaactaaa agtgaagcaa agtaccccgc tcacaaatta gagttccttg cactcaagtg 4140 ggcagtcaca gccaaattca gtgactatct gtatggagga gagttcactg tgattacaga 4200 cagcaaccca ctcacctaca tattaacatc tgcaaaactc gatgcaacca gttacagatg 4260 gctgtccagc ctgtcaacat tcaattttaa gatccagtat cgtgcaggca aaagaaatct 4320 agatgcagat ggactctcaa gacggcccca tgatgaacag attgatgatt tcgcctctca 4380 gaaagaacgt gaaagaatca aacaattcac tctccatcac ttagctgaga cagaacaaca 4440 agttatcttg cccgatgcag taaaagccct ctgtgaacga caccaagtat atcagagctg 4500 tggtgattct aacctaccgt attctcaact tactgtggta gaatcactgt cccaaagtgt 4560 tgatgcagta ccccaagaat tccagcagga ggcaggaggc cttccagtac ttccccaaat 4620 gtcagaggaa gagttaaaag aatgtcaaag aactgatcca gtgcttgaaa aagtaattag 4680 gcaccttgat tctggaaaga aacctcatgg gaaagtagag cctgcagaag tagctttgtg 4740 gctaagagag tgggaccgct ttgagttcaa gaatggaatc ttatttagaa agagacagga 4800 cccaagaggc gtattgtatc agttggcctt gcccaaaaag ctcagaggaa atgtattaaa 4860 aaatctgcac aatgacatgg ggcatctcgg aattgaaagg actttggacc tggctagagc 4920 ccgtttttac tggccaaaaa tggcaataac tgtggaagag aaagtaaaga cctgtgagcg 4980 atgtgtaaaa cgaaagaccc ctcctgagcg ggctgcaccc ctggtgaata tcacaaccag 5040 cagacccctt gaactagtct gcatagacta cttgtcacta gagcctgatc gaagcaacac 5100 taaagacatc cttgtaatta ctgatcattt caccaagtat gctgttgcag tgcccactag 5160 aaaccagaaa gctgaaactg tggccaaatg cttatgggac aactttttca tccattacgg 5220 gtttcctgaa agaattctga gtgatcaggg gcccgacttt gagtcaagaa taatcaaaga 5280 actgtgtggc atagcaggaa tacaaaaagt aagaactacg ccttaccatc ccagaggcaa 5340 ccctgtagaa cgtttcaaca gaacattgct ccaaatgtta gggaccctgg aggacaaaca 5400 aaagacctat tggaaagact ttgttaaacc attagtccat gcttacaatt gcactcgtaa 5460 tgacacaaca ggattctcac cttacgagct gatgtttggg agacagcctc ggctgccagt 5520 agacatagca tttggtttac ctgctactgg gtcatcccca tctcattcca tttatgtgag 5580 aaatctgaaa gatcgcttag aggaaagcta tagaatagct actgagaatg cctcaaaatt 5640 agcaagaagg aacaagaaac gatttgatga gcgagtagtc acctcattcc ttgaggttgg 5700 agatcgtgtt ctagtacgga atgtcaagct aagaggaaag cataaattgg ctgataagtg 5760 ggagaaagaa atctatgttg tgttaaagaa ggcgggagac ttacctgttt atactgttag 5820 tccagaaggc agggatggcc cacttcgcac actgcatcga gacttgctgc tgccctgtgg 5880 attcttacaa gaaagcatgc ctgagccagt aaaacctaaa ccaccccgca gacctagaac 5940 tcgagcaaac actagtgcaa gagaacctga tactatgact gagagttctg actctgagga 6000 tgattcaatg gatcactact cacgtagaca cttaccaaaa gtagagagca gaatcctctt 6060 caatccaaga catgttaagc cttcaagaga caggccgatt gcagaacttt ctagtaaaac 6120 aagagtcgtg aaaagtaaaa tcaatgactg tcctgcaata gagataccac aagagaactt 6180 accttactta cctgaagatg aaaactgtcc catgagtgaa cctgaaagag agaacacacc 6240 tgtaatggat tctgttagac cagatatgtt aagaaacgta cctgtagtaa atgatcaaga 6300 actattggaa cagagagatg aactggaaat attgagtgaa atagatgatg aagctgatca 6360 gagaaatatt cacagtggcc aggcagtgat cgaccaggtt gaaagaaata cattgagacg 6420 ttcgcaaaga caccgtgagc caccacaaag gctacagtac tctcaattag ggaatcctct 6480 ttctctggtt atacaatcct tactacaagg tctcagtaca gcagtcactg catctttaga 6540 ggagtctgat tgccccagag aagcctctct tttaatgcag aaaatgttcc cttctgctgc 6600 cgttacgcag cctaaaagat gcagagggac ctgcatagat tccaggcggg gagaa 6655 // ID DNA-5-5_DR repbase; DNA; ZEB; 1353 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 5-bp; KW DNA-5-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1353 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1237-1237 (2008). XX DR [1] (Consensus) XX CC TSD is 5-bp long. TIR is 14-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1353 BP; 399 A; 252 C; 248 G; 454 T; 0 other; cagggctcga aattgcgacc attttggtcg catatgcgcc cgaaaattaa tctatgcgac 60 ctcataatat atttgggagc attagtgcga ctgcagataa tggttgtagt gcgacctgtt 120 ttgatttttt gtaaaaacgt gctgaatcgc tcttccctgc cgctatattg gttcatatta 180 gctgtcattc actcaaggta ttccgctgtc agatgacagg gaaggagctt ttatgaccac 240 gggaaatgca aacggctgaa gagtaaaaac ttaaagcaca caggtttgca aaccccacct 300 aaagttgagg cgcagatgaa agcgatcatg acacgtcacg tggtgaataa tgatccgcca 360 gctgagatca atcgagtacg cgcttcttgg agaaggtgcc cgaatctcca tgcgttgcat 420 tcactgcgtg ttcagcgcaa atgtccgcta aaagtcaaat ctaatactgt acatatcatc 480 gccaaagaag ctcgccttta ctaagtttac actgaaactg cggctcataa caaagaccgg 540 tattgcgccg atggttagcg caagaatcct gctctgcctg ctcataaatt gggtcggctg 600 actcgcctgc ttttccacta acacagaaaa atgaagatca gctcagactg aacctttaaa 660 ttgacacaaa ctgaaaccaa aacttttaaa gagtgataga agtgagactt tacttacttt 720 cttttgtcta ttcttttttg aggtgtatat tatttttatt ttttatttat ttactgatga 780 ctgctttgca gctttcatca ttgaattgaa tgatttatta taatcttttg tttgttttgt 840 agcagaaata ttatttatta aattgacatg catataaaaa cagcagtaca aaataaatat 900 ttcttactgc aatgcttcat ttttgtttga tgcaaactat acaattattt ttcataaagt 960 aacagacttt ttatagcaga taacctacat tcatgcacat tcaaggcagt atcataatga 1020 gaaatggaat tcattgttac tattattgtc attttttatc atcattaata ttctatggcc 1080 taattattag acattcattt tggaattatt gcacacaaat atcaatgtct tcgcagcatt 1140 agttagatag ttgtttctgg tttttgtcag tcctattaat gttgttttaa aatacaataa 1200 atcccttaaa aaacaatttg cagcggtgct cattcatttt tggtgggtgc tcctaaattt 1260 tttctggtgc tcctaaaatt tttttggtgc tcctaaatat ttgaagttgg gagcaccggt 1320 gctaccaagt aaaaaagtta atttcgagcc ctg 1353 // ID hAT-N77_DR repbase; DNA; ZEB; 1112 BP. XX AC . XX DT 07-JAN-2009 (Rel. 13.11, Created) DT 15-JAN-2009 (Rel. 14.01, Last updated, Version 2) XX DE Nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N77_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1112 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(1), 19-19 (2009). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1112 BP; 358 A; 202 C; 253 G; 299 T; 0 other; cagcggttcc caaactgggg tgcgtgcacc cctaggggtg cgcagtctga ccaccagggg 60 tgcgcgagag aatgtttgta atggcggaaa attttatctt ttattatttt tatgcattta 120 tcttgggtaa acttcacatg tcatattttt gagagaaaag aaattcactg agagagaaaa 180 aaaaatcaca ttacaaattg taatttacaa cctatgcata ctcactcatc tcgcggtttc 240 acgtctgcgt cacaatagcc ccgccctttc acattaggcg ctaagtgata gtacattcgt 300 aatttttttc cgcttactat attcatcaaa atggacacct ggttaaaaac tggcacttta 360 aagaagtcaa gtgacagtag ggatgaacca tctacatcgg cctcttctat ggagatgtgc 420 agcggccttg aatatatgga ctcggaggat aacgaacgag gtgaagaggc gcaggcggga 480 acagagaaat ccagcagaga tgtactgatg aagagaatta ggcctacagc tagagagtga 540 gaaaatgtca agtaagaaac gcaaatatag cgacagttac attggtttcg ggtttacatg 600 gattggagat gcagaaaatc caaatccgca atgtgtagtc tgtggtgagg ttttagctaa 660 tagcagtctt aagccctcat atatggtccg gcacttacaa acaagacaca gccacttaca 720 agacatgcct gtaagttttt ttttcaagga aaattggagg aactgcagaa gaagaaaaaa 780 ggaggcacaa aggagcagta gctttcatcc gaaaagttgc accatcggtt tcctgggtac 840 actgcagcat ccaccaagaa gctctagcga tggaaaaaca tgccagatga acttttgtct 900 gtacttaatg acgctgtgaa aattgtcaac ttcataaaag ctcgtccatt aaattcacat 960 attttattat ttttatttta atcacgtata cagaaaatat gtgtatttag ttattgtagt 1020 taaaaatgtg gagtttaatc ggagaaaggg gtgcttgaca ggtgtcaaat attagaaagg 1080 ggtgcacaca gcaaaaagtt tgggaaccac tg 1112 // ID BEL22-LTR_DR repbase; DNA; ZEB; 704 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL22-I_DR; KW BEL22-LTR_DR; BEL22_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-704 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1392-1392 (2008). XX DR Genome; chr19; Positions 33131145 33130442. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 704 BP; 198 A; 98 C; 143 G; 265 T; 0 other; tgtgttgtta tcacaatgtt tggtagactt attttctgtt ctggagtgaa tatgtatgtt 60 tttcattatt aggacgtggc ccctttaaga atcttaggtt tcagaacaga gtcataccat 120 gtgagcactg atgaggaagt gagcaagcat gttgcacaga gcagcacagt cagacaatcc 180 tgagttaatt ctgtgacgtt tgcattcatc ttgtgtttgt tttagcatcg tgggtatgta 240 ttgactgttt taatcaatgt ctactatttc agatgcttta ggtcattttt atatagttat 300 tattagaagt ctattatttc tgtctgcttc actatagtta agggacatag tgagtgtaat 360 ctctgtctag ttcattgaaa gttcatgcaa ttcagtatag ttaaaaagtg atttaggaat 420 tgtgtgtgtg tttaacatga ttatattaca tttaagagaa tgtgtgttta agagaagaca 480 cataaagcct tgaagacatt catttattta cttctttgcg aatgtataca tcttaacaaa 540 acatgcttaa tgttgattat gccttttgtg catttcagtt ttacctttct ttggaatatc 600 aataaacctg tggacaagga gttatcgtct tcagagtttt gatgtcaagg aagagtttat 660 ctggctgtaa agttatattc agcacatata gcgaggacag caca 704 // ID Gypsy164-I_DR repbase; DNA; ZEB; 3863 BP. XX AC chr7; XX DT 13-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy164_DR; KW Gypsy164-LTR_DR; Gypsy164-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3863 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1606-1606 (2008). XX DR Genome; chr7; Positions 63698995 63702857. XX CC 'TTAAG' target site duplication CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(36..1694,1698..3848) FT /product="Gypsy164-I_DR_1p" FT /translation="MSTFAERSPNRALMALQQAERSVADYSIEFHTLAASS FT GWNAKAQWDHFLHSLAEYIKDEIFSLELPSTLDGFVDLAIRVDNRITLRAR FT HRRGGVEEFSPRNTGALDVVMPQQWGFSEEEPMQIGRARLSAMEHRFCISN FT RLCLYCREAGHQVASCPIRERRKPAKGSLTMSATNIRALSGGRASLQAWMN FT IKGSIKQVSALIDSGAEGDFMDLNLAKRLGVSMIVLDNPISAKTLCGSHQA FT NITHCTIPLTLTLSGNHVEQLQFLLIHSPTAPLVLGHTWLVKHNPQIDWTL FT SSVSSWSSYCLSHCLGSACSPLSSLSMLQEEVNLTGVPVVYHDLRAVFSKS FT RASSLPPHRPYDCAIDLLPGTSPPRGRLYSLSKPEREAMEKYIHDSLTAGI FT IRPSSSPAGPGVFFVEKKDGSLRPCIDYRGLNDITVKNRYPLPLMSSAFEL FT LQGATIFTKLDLRNAYHLVRIRQGDEWKTAFNTPTGHFENLIMPFGLSNSP FT AVFQALVNDMLRDMVNRFVFVYLDDIQIFSQNEREHVQHVRRVLQQLLENC FT LLDKVECDFHTQSVSSLGFVLSPEGVRMDPAKVKAVADWPTPDSRKAAQRF FT LGFANFYRRFIRNFSQVALPLTNLTSIHKQFEGSLQAQTAFSELKRHFISA FT PILSNPDSSRQFVVEVDASEVGVGAILSQRSSLDGRIHQCAFFSHRLTPSE FT RNYDIGNRELLAVRLALGEWRHWLEGSGVPFVVWTDHKNLEYIRSAKRLNS FT GQARWALFFGRFDFHISYRPGSKNGKADTLSHLGEGEARPTPPVAVLPPEH FT VVAGVTWGVESVVRKALRMLATPTKCPSNLLYVPESVQTKVLQWGHSSKLA FT CHPGADRTKGSIKQRFWWPSMARNTRRFVLACPICAVAKGSNRPPAGLLRP FT LPVPSRPWSHIAMDFVTGLPPSSGNSVVLTVVDRFSTAAHFIPLPKLPSAR FT ETAAIVLDHVFRTHGLPVDVVSDRGPQFVSKFWAEFCRQLGATASLSSGYH FT PQTNGQAERANQELERVLHCVASEEPLSWSSRLTMVEYAHNSLPVSSTGLS FT PFQCCLGYQPPLFSSQEPDAIVPSAQAFIQRCLQTWRLARKAIIRTGERNK FT ASADRHRSVPPLYVCGQKVWLSSKDIPLRPPSRKLGPKFIGPFTIVKVLSP FT VSVRLKLSSQFKNIHPVFHVSKIKPIFRSPLQPLTSAPPPPRLFEGALTYT FT GRRLLDVRRRGRGYQYLVDWKGYGPEERCWIPARGILDRSLIDQFHQTHGE FT SSGNARGRS" XX SQ Sequence 3863 BP; 847 A; 1038 C; 974 G; 1004 T; 0 other; ggaacagact gaccaaagca tggactcagc agaggatgtc gacattcgcc gagcgctccc 60 ccaacagggc gctaatggcg cttcaacaag cggagcgttc ggtggcggat tactccatcg 120 agttccatac cctggcggca agcagcggtt ggaatgccaa agcccaatgg gaccactttc 180 ttcacagcct ggcagagtat atcaaggacg agatcttctc gctcgagctg ccatccactc 240 tcgacggatt cgtcgatctc gccatccggg ttgacaaccg aatcactctc agagcgcgtc 300 atcgcagagg cggagtcgaa gagttttctc ccaggaacac cggggcgctt gatgtggtca 360 tgccacagca gtgggggttc tcggaggagg agccgatgca gatcggaaga gcccgcttgt 420 cggcgatgga gcatcgcttc tgcatttcca accgtctctg cctctactgc agggaggcgg 480 ggcatcaggt ggcctcttgc cctatcagag aacggcgtaa gccagctaaa ggcagtttga 540 cgatgagcgc cacgaatata cgtgcactgt ctgggggccg ggcgtctctc caagcctgga 600 tgaatattaa gggttccatc aaacaggtgt ctgctttaat cgattcagga gctgaagggg 660 actttatgga cttaaactta gccaagcggc tgggtgtttc tatgatcgtc ctggataacc 720 ccatttcagc caaaacgctt tgcggctctc accaggctaa tatcacccac tgcaccattc 780 ccctcactct tacactttca ggcaatcacg tggagcagtt acagtttctc ctcattcact 840 ctcccactgc tccccttgta ttgggtcaca catggctagt taagcacaat ccacagattg 900 attggactct tagctccgta tcctcttgga gttcctattg tttgtctcat tgtcttggct 960 ctgcttgttc tcccctttct tctctttcta tgttgcagga ggaagtgaac ctaactggcg 1020 tacctgtcgt ttaccatgat ctgagagctg tgttcagtaa gtcccgggct tcatctctcc 1080 ctccgcatcg cccgtacgac tgtgctattg atctgttacc tggcacttct ccacctaggg 1140 ggcgtttata ctctctgtct aaacctgaga gggaggccat ggagaagtac atccatgatt 1200 ctctaacagc aggtattata cgtccctctt cctctccagc tgggccgggg gttttttttg 1260 tggagaaaaa ggatggttcc ttgcgtccct gcatagatta tcgggggctg aatgacatca 1320 ctgtaaagaa tcgttatcct ttgccactga tgtcatcggc cttcgaactc ctacagggag 1380 caaccatctt tacaaaatta gacctccgca atgcctacca cctagttcgg attaggcagg 1440 gggatgaatg gaagaccgcc tttaacaccc ctactggtca ttttgaaaat ctcattatgc 1500 cctttgggct ctccaattca cctgcggttt tccaagcact cgtcaatgac atgctcaggg 1560 acatggtaaa tcgttttgtg ttcgtttacc ttgatgatat tcagattttt tcccagaatg 1620 aacgggaaca tgtccagcac gtcaggcggg tgcttcagca gctgctggag aattgtctct 1680 tggacaaggt ggagtagtgt gacttccata cacagtcggt ttcgtccctg gggtttgttc 1740 tgtcgccaga gggagtgcga atggatcctg ccaaagtgaa agcagttgca gattggccca 1800 ccccagacag tcgcaaggcg gcccagcgat ttctggggtt tgccaatttt tacagacggt 1860 tcattcgcaa ttttagtcag gttgctctac ctctgactaa tcttacctcc atccacaagc 1920 aatttgaggg gtcactccag gctcagacag ccttttctga attaaagaga catttcattt 1980 ctgcgcctat tttaagtaat ccagactcgt ctcgtcagtt tgtggtggag gtggatgcgt 2040 cagaagtagg tgtgggagct attctatccc aaaggtcctc gttggacggc agaatacacc 2100 aatgcgcttt cttttcccat cgattaacac cttcagaacg aaattacgac atcgggaaca 2160 gggagttact ggctgtcagg ttggcattgg gtgagtggcg tcactggcta gaaggatcag 2220 gggttccttt tgtggtctgg accgaccata aaaatctaga atacattaga tcagccaaac 2280 gtctcaattc tggtcaggct cgttgggcac tattctttgg ccggtttgat tttcatattt 2340 cttatcgccc ggggtctaag aatggaaaag ccgacaccct atcacatcta ggggagggcg 2400 aggccagacc cacccctcca gttgctgtcc tgccacctga acatgtggtt gcaggggtaa 2460 cttggggggt agagtctgtg gtccgtaagg ctcttcgtat gctcgccacg cccaccaaat 2520 gtcccagcaa tctgctatac gtgcctgagt cggtccagac caaggtttta cagtggggtc 2580 actcgtcgaa gctagcctgc cacccaggag cagatcgcac aaaggggtca attaagcaac 2640 ggttctggtg gccatcgatg gcgcggaaca ctcgtcggtt tgtattggct tgtccgatct 2700 gtgcagtagc taaaggctcg aatcgacccc cagcagggct tctccggcca ctgcccgtcc 2760 cttcgcgacc ctggtcacac atagcgatgg actttgtcac tgggttacca ccatccagtg 2820 gcaactcagt agtcctcact gtggtagaca gattctcgac ggcggcacac tttatacccc 2880 tccccaaatt accttcagcg agggagacag cagccattgt cttggaccac gtcttccgta 2940 ctcatggcct cccggttgac gtggtttcag acaggggtcc ccaatttgtg tcaaaatttt 3000 gggcagaatt ctgccgacag ctgggggcga ccgctagcct ttcttcagga taccatccac 3060 agactaacgg ccaggccgag cgagctaatc aggaactgga gagggtcctg cattgtgttg 3120 cgtccgagga accgttgtct tggtcttcca gattaaccat ggtcgaatat gcgcataact 3180 ccctcccagt ctcgtctacg ggtttgtctc ccttccagtg ctgcttaggc taccaacccc 3240 ctctattctc ttctcaagaa ccagatgcca ttgttccctc tgcccaggca ttcattcaga 3300 gatgtctcca aacatggagg cttgccagga aagccattat tcggactggt gaaagaaata 3360 aggcatctgc ggatcgccac cgttccgtac cccctcttta cgtctgtggt cagaaagttt 3420 ggttgtcctc taaggacatt cctctcagac ctccctcacg taaattggga cctaaattca 3480 ttggaccttt caccatcgtc aaggtcctca gtccagtgtc agttcgactc aaattaagct 3540 ctcagtttaa aaacattcac ccggttttcc atgtgtctaa aattaaaccc atttttcggt 3600 cccccctaca gcctttaacc tctgctcctc cacctcccag actatttgag ggtgcactta 3660 catacacagg gcgccggctg ctggacgtaa ggaggagggg cagagggtat caatatttgg 3720 tagactggaa ggggtatggc ccggaggaga ggtgttggat cccagctcgg ggcattctgg 3780 accgctctct cattgatcag tttcatcaga cccatggtga gtcttcaggg aacgccaggg 3840 ggcgttcttg agggagaggg tac 3863 // ID ERV1-N5-LTR_DR repbase; DNA; ZEB; 695 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, LTR - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR; ERV1-N5-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-695 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1784-1784 (2008). XX DR [1] (Consensus) XX CC The TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 695 BP; 230 A; 110 C; 145 G; 210 T; 0 other; tgaagaagat ttctattctc tctgtgtgaa gtattttcta agtgtaaagc agttaatttt 60 ctcttcacaa aatgtatgta gtgtttgcag caaacatgcc agcctgtgtt aaaaatacag 120 tgatatgtct gtgtaagatg agtatgcctg cacaggagca acactaggag aaggctgagt 180 gactgattgt gtagaataac tgaatatgta aagcactgat tatatatgtt aaatacttct 240 atgaagctat atgtgtcaga agttagaatg agtaacatat tatcaagtta ttatcctatc 300 ataacaatgt atgtaaacac ttagttcctt tgcataagtt gcatctatac tttgctgaca 360 tagacctgta acatctctgt tgacatgagc tagtgggctg agagccaaga atagcaggac 420 cccttagact agaaaaccta ttcaaaaatg gtcaagagtt aaaaagagac acttaggggt 480 gtgtcaaaaa atcctgcata aaaattgtaa caaacacact aaaggtttag aaggagcagg 540 agagacttcg gaaggtgtcc aggagagact tcagaatgct ctggggtctg ctctgctctt 600 tctcggcttt attatggaga ataaagtcta ttttctgata ttcaaaacct ccggtctgat 660 aatttgtaat tgatgagacg tcaaaattta cgaca 695 // ID Gypsy45-LTR_DR repbase; DNA; ZEB; 898 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy45_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy45-I_DR; KW Gypsy45-LTR_DR; Gypsy45_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-898 RA Dib M.R. and Naveira H.F.; RT "Gypsy45_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 820-820 (2007). XX DR [1] (Consensus) XX CC Gypsy45-LTR_DR is a long terminal repeat of the Gypsy45_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy45-I_DR. XX SQ Sequence 898 BP; 201 A; 131 C; 185 G; 381 T; 0 other; tgttacatcc cttggatgta tttatttatt tattatttat ttgtgtgttt actgtccatt 60 gggagtggtt atgttgtgtg tttcttttac tttcacggtg tctctctgcc tgctcactag 120 gtagagctga ttgggcccag gtggtgctca tcaatggagc gtgctgcagt aatgcgctgc 180 tacaaaagag cagtgcgtct caaaccacgg agaggttcac ttctcccctg ctccagactc 240 gtgattgttg tgggtgttga tctggttgtt tgtaaaaagg agttaatggg ctagtgagtt 300 caattttagt ttctgtgctt tgagaaaaac ttgcttattt ttgcattgtg taattttaca 360 ggcagtgtgt ctgttttctg agttgtgatt taaatacata tattcttttg tgtgtggtac 420 gtaaaaggta attgtattgt ttctttgtct gactttttga tttaaaaatt gaagctgtag 480 ggagaggttg ccatttttat ttttgtaatt gtttgactgt gtttattgat agtaagggag 540 ttgaggaaag cctgtgttgt ttcatttctt ttcttttacc tgcaattagg ttatttagta 600 ctcctcctga gctagacctg ttttgtttgt tgttttggtt aatccttctc ctgagattta 660 tgcttcattt attgtttatt gatttaagaa ccaaataata aaataattta agtttactct 720 tgtggttgtt tcacatttct gaggcaggat tgaagtgttc cctcaaaatt tgattattta 780 atttatacac aattcattaa tttggtcttc tttttttttt ttctcaaatt ctaaatttta 840 attttctttg ttattttcct tccccaaccc tagaccaaaa aggggaaggg acgtaaca 898 // ID BEL8-LTR_DR repbase; DNA; ZEB; 564 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL8-I_DR; KW BEL8-LTR_DR; BEL8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-564 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1366-1366 (2008). XX DR Genome; chr18; Positions 48738603 48739166. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 564 BP; 139 A; 78 C; 133 G; 214 T; 0 other; tgtagcagat gtcatgtgtg tttaagatct tatgccagtt cacctaggaa ctaattaagg 60 cggagctaat ttcaggggga actataaaaa tgatgcacca ctgtgagtag agagaccaat 120 gcattcactt ccaagcattc aacaagctaa ggaatgctta tgcttttctt gagtttctcc 180 cttccgtaac cagccaatga ggtatgtttg ttttgtatga tttgttggtt ttgtttgatt 240 tgaagtgtgg tgttgttgtt gctgtttgtt tggtgtgtgc atttaactgt ttgtttatga 300 gcggtgctat ttataaagtt aaattgtatt agttaagatt atgtgatcca gtgttatgtg 360 tgaccataat gtggtgattt ggattaattt gatatggaaa tgcactgttg taattcattg 420 ttgtatgttt tttagttctt acgtctggta cacccctgat gtcttgtggt ttgctctatg 480 ttgagaactg aagacttgct atgaaaataa accagtcagt agaaatcata gactcttgct 540 cgtttgttgc tggagggagc taca 564 // ID CR1-19_DR repbase; DNA; ZEB; 1666 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-19_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-19_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1666 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1693-1693 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(168..974,932..1210,1107..1511) FT /product="CR1-19_DR_1p" FT /translation="ADMSXKSAGPDGLSGFLLKVCAEELSPVWCSLFQKSI FT NLKVVPAIWKRSLIIPVPKIASPVDNNDFRPIALTSVVMKCFXKYMVSVLR FT ENISTKLDPLKFAYSKGVGTCDAINSLLHFVLNHLEGPDAYARVLFFDFSS FT AFNTVQPHVLIRKLKKLGVNSTIIEWYHSFLTNRTQAVKVNNSLSQVKYSS FT SGVPQGCVSSSLLFTLYTNECSSRHSQNLIFKYSDDTVILSLLNKQDPVSV FT YFSEIDSFVEWCDENFLKVNTKKKKKRKKLFKSKYKKKKKKEIIFDPKCVV FT EHLPVFVHNQEITQVQSYKYLGIYIDRSLTWDTHVNWICEKLHQRLYFLRR FT LWFYGVDKNNHVLILSSCITGSVKSFTKDCIFYEGYGFMVLIRIIMYLFYQ FT AVFESVIRYGLSTWYGNLSVNLKSKIFNIIRTATRLIGSETFTSIQTLYEQ FT CTLNEAKRILITHLMYSSHFTIFYLLEEDTELSAGNGTDLRTLLFQCQSDY FT *" XX SQ Sequence 1666 BP; 563 A; 276 C; 286 G; 532 T; 9 other; gctagaaagt tagtatccac aactgatgat aaacaaaaag caggaccaag gtttgacact 60 cacagtataa attcagayar tggaaaagaa ataaactcaw tttatattaa tgaatcaasa 120 gaaagcaatt tgctgattga ctgtgatcaa gttcarctat tatttaagca gatatgtccm 180 aaaaatcagc tggtcctgat ggtctctcgg gcttcttact aaaagtatgc gcagaagaac 240 tgtcaccagt gtggtgttct ttatttcaaa aatctattaa tttgaaggta gtcccagcaa 300 tctggaaaag atctcttata atccctgtgc ctaaaattgc cagtcctgta gataacaatg 360 attttagacc aattgctctc acgtcagttg taatgaaatg cttcraaaag tatatggtgt 420 cagtgttaag agagaatatt agtactaaac ttgatccact taagtttgca tatagcaaag 480 gagtaggtac atgtgatgcc atcaatagtc tgttgcactt tgttttaaat catttagaag 540 gccctgatgc atatgcacgt gtacttttct ttgattttag ctctgcgttt aacacagttc 600 aaccacatgt tttaattagg aaactaaaaa agttaggagt aaactctaca attattgaat 660 ggtatcactc ctttttaaca aatcgaactc aggcagtcaa agtaaataat agtttgtccc 720 aagtcaaata tagtagttct ggtgtycctc agggttgtgt gagctcatca cttttattta 780 ctctatatac taatgagtgc tcaagcaggc actcacagaa tcttattttt aaatactcag 840 acgatacggt aattttaagc ttgctgaaca aacaagatcc tgtttctgtt tatttttctg 900 aaattgattc atttgttgag tggtgtgatg aaaacttttt aaaagtaaat acaaaaaaaa 960 aaaaaaaaag gaaataatct ttgatcccaa gtgtgttgtc gagcatttac cagtgtttgt 1020 acataatcaa gaaatcactc aggtacagtc atacaagtat ttgggaattt acatagacag 1080 gtctctaaca tgggacactc acgtaaactg gatctgtgaa aagcttcacc aaagactgta 1140 ttttctacga aggttatggt tttatggtgt tgataagaat aatcatgtac ttattttatc 1200 aagctgtatt tgaaagtgtt atcaggtatg gtctatctac atggtatggc aatctttctg 1260 taaatctgaa atcaaaaatt tttaatatta tcagaactgc tacaagacta ataggatctg 1320 aaacttttac tagtatacaa acactctacg agcaatgcac tctgaatgaa gctaaaagaa 1380 ttttaataac ccatctcatg tactcttccc attttacgat cttttacctt ctggaagaag 1440 atacagaact ttccgctgga aacggaacag atttaagaac tcttttattc cagtgtcaat 1500 cagactattg aacactgaac aatagtgtgt gtaattcttt tgcatgttga cttattcttt 1560 taaattgtta ttgtgtcaga acctgctgaa gtgctgkgtg agtctacaga aaatttcccc 1620 ttgagggaca ataaagtata caaacaaaca aacaaacaaa caaaca 1666 // ID CATCH1LTR_DR repbase; DNA; ZEB; 752 BP. XX AC AL591492; XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE Long terminal repeat of the CATCH1_DR endogenous retrovirus. XX KW BEL; LTR Retrotransposon; Transposable Element; KW BELL/CATCH superfamily; CATCH1I_DR; CATCH1LTR_DR; CATCH1_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-752 RA Kapitonov V.V. and Jurka J.; RT "CATCH1, an endogenous zebrafish retrovirus from the BEL/CATCH RT superfamily."; RL Repbase Reports 2(4), 5-5 (2002). XX DR Genbank; AL591492; Positions 55360 54609. XX CC CATCH1LTR_DR is a long terminal repeat of the CATCH1_DR CC endogenous CC retrovirus. See comments for CATCH1I_DR. XX SQ Sequence 752 BP; 203 A; 149 C; 135 G; 265 T; 0 other; tgtgagaaag ccctttaacg acccctagag gaaggtttgg tgatagggag ttcagtgaca 60 aaaggcagtc atgacggctg gagagacaca ggctgccgat cggctggctg gactgtgcaa 120 ccgatttgta catgttttac accagtgtgt aattgattat gggtgagtcc cttttgtttg 180 ctatcctttg atccagataa atgttctgat tactgttatt gtcttgtaaa tcacactgta 240 tgtctcgtga ttcccacgtg cgccataacc gaaagtgaaa ccatatgtta tcaaacccaa 300 cgatctcatt tcggactttg aactggatat tcacattgat tcatagtttg atataaccta 360 ttttatgtat ctatttaata tatatcacct gattgagtca ataattcgta aaatgttcac 420 cgttgggtag atttagcctg tacaataata ctaaatggtc attccaattg cttacaacag 480 tatgtttaat tcatcattgt ctttataatt cacaattact cgatgtatgt ttatttgtat 540 cttaattaat aagtttgttg tataatttac taacattagc aattgttcct ctttgtattt 600 ttctgtctta tagaccacat gtgtgtaatg acattccagc ttgattaaac ttacaaaact 660 gggaatcctt ctcctgttct cttaccggaa ctaccctttt catcttgtgt gcacgacacc 720 tgaaccccct gagcgtccca tacgctcgta ta 752 // ID DIRS-8B_DR repbase; DNA; ZEB; 7313 BP. XX AC . XX DT 07-JUN-2010 (Rel. 15.11, Created) DT 07-JUN-2010 (Rel. 15.11, Last updated, Version 2) XX DE A family of DIRS-type LTR retrotransposons - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-8B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7313 RA Kojima K.K. and Jurka J.; RT "DIRS-type LTR retrotransposons from zebrafish."; RL Repbase Reports 10(11), 1838-1838 (2010). XX DR [1] (Consensus) XX CC This consensus is generated from 5 sequences with >94% identity. CC The sequence 4994-7307 is >90% identical to DIRS-8_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 330..3689 FT /product="DIRS-8B_DR_1p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="STATSTITLTTAHXSTIPSKDGNQINNNDNSPSNSTI FT NPVPLTFTEEPAAPELPTVPQTEPQNATRSRRSTRSSTRLRTSSTPPSTSR FT QQATSPASSYATASSFRQPTRITISELRKHLADLGIHAPRTLNKPELLKLY FT SNATSDSHPLLSPSKRPITREPSARHTPYPQPTRTQPGKRKSKTKHVSSSA FT PTGPHPQESQSPPATSHRQDLNQGLPVSQLLPPSFSWPPAPPSSSPATPPI FT PPISSNPFQPNVSAPPAIQINQPFSLPPQPSTSYSIPHNQPASSLPTYPIP FT PSFSLPLAHTSTDPTQTLPAPVSSARPSYTLFTATPLPMPPNALALDPTPI FT STATRNQILSGADIDLITLLSPISPPAAERQVDCGEFTFTLKQSSNTQSRI FT LSLAEFTMAFSRYTEVICSAFPHRRRELNDYMAIIAELALSYGGTHFYTYH FT KLFSAKCAIRVTQWNQIPYWGAVDFDIHNRVFLGCRNLACAVCRSSLHVTT FT SCPFISPSANTVQNPKSTSYVPRPANPSIPPLLPPSSIRSTPSSNICINFN FT LGRCQKNPCKYLHVCSFCGGAHAKIVCPIYRANNKKYKNYLSTPVNIPHLI FT SELKDHPDTNFSDFLISGLTHGFHPGVSASPSYNLCCPNLLSAKAEPETVD FT QLIKKEIDNDFMIGPFTAPPFTVFRISPIGVATRKFSGKKRLIIDLSAPHN FT SVFPSINSTIPIDEYSLHYHDIDQAITLIKTAGRNAWLAKVDITSAFKIMP FT IHPDFWHLFGIRWRSKFYFAVRLTFGCKSSPKIFDMLSEAVCWILSNNYGI FT PYLIHLLDDFLIISPPSFPPNNHLAITQKVFADLGIPLAEEKTSGPSTSIE FT FLGINLDSNKFQASLPKEKVDRITSLCQIFIENQSCTKRELLSILGHLNFA FT MRIIPQGRPFITHLLQLSTSVPGLEDTIFLSKPCRDEISLWISFLKQWNGC FT SFFYSDLIASPVDIKLFTDAAPSVGFGGFYQGRWFASPWPPQMLEIPQNEQ FT SSALFELYPIVAAAILWGNEWSASSILVHCDNEATVQCINKGRSHSPSLMP FT LLRRLVWTAATKQFIISAEHVPGCKNQIADSLSRFFFQKFRLLAPEADPHP FT TPVPHYSEMILP" FT CDS 3590..4675 FT /product="DIRS-8B_DR_2p" FT /note="tyrosine recombinase." FT /translation="LSVSLFLPEISAVSTRSGPSSNTCSSLFRDDIAINHP FT LHHLRESAISLILQGVAPRTLQSYLTAWSSFKNFHCLYKTPFPDFSLLSIT FT SFIAHLHLTKNLQASSIRSYLSGIQFFHKLIHGSPSEAISNSQTSLLIKGI FT QKSHPSSPDTRQPITLHILSKCIHTLRKGYISIHTARTLDAMFTLAFFGFL FT RCSELTTTSNFNPTIHPTLSDLTLLDEETLSFFIKQSKTDQLRRGHPIYIF FT DIPSPTQPFQTLKAYMHYRRAQEPNRLAPLFTDDANRPVTRFWFQFHLKEI FT LRISGFPPEPFSSHSFRIGAATTAAHNGLSQQQIQTLGRWSSNAYKSYIRL FT SQYHLREAQQILGSRHQHL" XX SQ Sequence 7313 BP; 1986 A; 2256 C; 1126 G; 1940 T; 5 other; agtgaagttt attcataaac taatttcgag aggagcacgt gattatgatt gaacacggct 60 ggttctgcat tagcatgctt gatccaccaa tcaggccatt cctaaccact ataaagagcc 120 agggttttct cactacagtc atcttcgatt tgaagaacaa agcaagcaac gcttacgaca 180 gamaacaacg aaaactaaaa caacatcaac tactacaaca acaacaacaa ctactacaac 240 aacaactact tcaacaacta ctactactac aactactacg acaacaacaa cgacaacaac 300 aacagcacgg cgaagcaagc taaagctaat caacagctac atcaacaata acccttacca 360 ctgcccatcw wtctacaata ccttcaaaag atggaaatca aatcaacaac aacgacaaca 420 gccccagtaa ttccaccatc aatccagtac cactcacctt tacagaagag ccagcagcac 480 ccgaactccc cacggtacct caaacagagc cccagaacgc cacaagaagc cgccgatcca 540 cccgatcatc caccagactt cgcaccagtt ccactcctcc atcgacatcc agacagcaag 600 ccacgtcccc agcctcatca tatgctacag cttcctcctt ccgtcagccc acaagaataa 660 ctatttcaga actacgtaaa catctcgcag atctcggcat tcacgcccca aggaccctca 720 ataaaccgga actactcaaa ctctattcaa acgccacttc agactcccac ccgttactca 780 gtcccagcaa aagacctatc accagggaac ccagcgccag acacaccccc taccctcaac 840 ccacccgcac tcaaccagga aaaagaaaat caaaaaccaa acacgtatcc agcagcgcac 900 ccacaggacc acacccccag gaatcccagt caccaccagc tacgtcacac cgtcaggact 960 tgaaccaagg actccccgtt tctcaacttt tacctccctc tttctcttgg cctccagccc 1020 ctccctccag ttctcctgct acccctccta tccctccaat ttcttctaat cctttccaac 1080 ctaacgtttc cgctcctcct gccattcaga ttaatcagcc tttctctctc cctcctcaac 1140 cttctacttc ttactctatt ccccataacc aaccagcttc ctctcttcca acctacccta 1200 ttcccccctc tttttcactc cctttagcac acactagcac tgatccaact caaactctcc 1260 ccgcaccagt ttcctcagca cggccctcct acacactttt caccgccact cccctcccta 1320 tgccgcctaa cgctctggct ctggatccca cccctatctc tacagctaca agaaaccaga 1380 ttctctcagg tgcagatata gatctcatca cactcctgtc accaatttca cctcccgcgg 1440 cagaacgtca ggtggattgc ggcgaattca ccttcacctt aaaacagtcg tccaacacac 1500 agtcacgtat cttatcctta gccgaattta ccatggcatt ctcacgttat acagaagtca 1560 tctgctccgc cttcccccat aggaggcgcg agttaaatga ctacatggcg atcatagccg 1620 aactcgcgct ctcctacggg ggcactcatt tttacaccta tcacaaatta ttttcagcta 1680 aatgtgcaat ccgggtaacc caatggaatc aaattccgta ctggggggcc gtggactttg 1740 atatccataa tcgggtcttt ttaggatgcc gtaatctagc ctgcgcggtc tgccgctcma 1800 gtcttcacgt cactacctct tgccctttca tctctccctc cgctaacaca gttcagaacc 1860 cgaaatctac tagttacgtt ccccgccccg ctaatcctag cattcctcct cttctccctc 1920 cttcttctat ccgctccact ccttccagta acatctgtat taacttcaac ttaggtagat 1980 gtcagaaaaa tccttgcaaa tatctgcatg tctgttcatt ttgcggtggg gctcacgcta 2040 aaattgtttg cccaatctac agagcaaaca ataaaaaata taaaaactac ctatcgactc 2100 ctgtgaatat tcctcacctc atttctgagt taaaagatca ccctgatact aacttctctg 2160 attttctcat ttcaggttta actcacggat tccacccagg tgtttcagcc tcaccctcat 2220 acaacctctg ctgtcctaat cttctatctg caaaagccga accagaaaca gtcgatcagc 2280 ttatcaaaaa agaaattgat aacgatttca tgatcgggcc ttttacagct ccgccgttta 2340 cagtatttcg tattagtcct attggagtgg ctactcgaaa attctccggt aaaaaacgcc 2400 tcattatcga tctctccgcg ccacataact ccgtattccc cagcataaac agcactatcc 2460 cgatagacga atactcgctc cactaccacg acatagacca agccataacc cttatcaaaa 2520 ccgccggccg taatgcctgg ttagctaaag tcgatattac atccgcattt aaaattatgc 2580 caatccaccc agatttctgg cacctttttg gcattcgctg gcgctcaaaa ttctactttg 2640 cagtccggtt aactttcggc tgcaaaagca gcccaaaaat attcgacatg ctttcagaag 2700 ccgtatgttg gatactttcc aacaattacg gaattcctta ccttatacat ctgttagacg 2760 acttcctaat catctctcct ccgtcgtttc ctccaaataa tcatttagcg atcacccaaa 2820 aagttttcgc cgacctcgga atccctctag cagaggaaaa aacatccggt ccaagcacgt 2880 ccatcgaatt cctgggcatc aacctagact cgaataaatt ccaagcatct ctcccaaaag 2940 agaaagtcga ccgcattaca tctctatgcc aaattttcat cgaaaaccag tcatgtacaa 3000 aacgcgaact cctctccatc ctcgggcact taaacttcgc catgcgcatt atcccccaag 3060 gtcgtccgtt tattacgcac ctcctacaac tctctacctc agttcctggt ttagaagata 3120 caatctttct ctctaaacct tgtcgcgacg aaatcagctt atggatatct ttccttaagc 3180 aatggaacgg atgctccttt ttttacagcg accttatagc atcgccggta gacattaaat 3240 tattcaccga cgccgcccca tcagtcggtt tcgggggttt ttaccaaggc cgttggttcg 3300 cgtccccatg gccaccccaa atgttggaaa taccccaaaa cgagcaatca tcagctctat 3360 tcgaattata tcccatcgtc gctgcggcca ttttatgggg taacgagtgg tccgcctcta 3420 gcatcctcgt acattgcgac aacgaggcta cagttcaatg cattaataaa gggcgctcgc 3480 actcaccgtc actcatgccc ttattaagac gcctcgtttg gacagctgcc acaaaacagt 3540 ttatcatatc tgctgaacat gtaccaggtt gcaaaaatca aattgctgac tctctgtctc 3600 gctttttctt ccagaaattt cggctgttag caccagaagc ggaccctcat ccaacacctg 3660 ttcctcacta ttcagagatg atattgccat aaaccatcct ctacaccacc tcagagaatc 3720 agccatttct ctcatcctgc agggcgtcgc tcctaggacc ctccaatcat atctcactgc 3780 ttggagttcc ttcaaaaact ttcactgtct gtataaaact ccttttcctg acttttctct 3840 cctttctatc acctcattca tagcacattt acaccttaca aaaaatctac aagccagttc 3900 cattcgaagc tacctaagtg ggatccaatt ttttcacaaa ttgattcacg ggtctccttc 3960 agaggctatt tcaaattctc aaacttctct ccttattaag ggtatacaaa aaagtcaccc 4020 ctcctccccc gataccaggc aacccatcac acttcacatc ctttctaaat gcatccacac 4080 cctacgcaaa ggatatattt ccatccatac agcccgcaca ctcgacgcca tgttcacatt 4140 agcttttttt ggatttttgc gatgttctga actcacaaca acatcaaatt tcaatcctac 4200 aattcaccca accttatcag atctaacctt gctagatgaa gaaactctat ctttcttcat 4260 caagcaaagc aaaacagatc aattacgcag aggacatcct atttatatat ttgacattcc 4320 ctcacccact caaccattcc aaaccctcaa agcatacatg cattacagga gggcccaaga 4380 acccaacaga ctcgctccac tttttactga tgacgctaat cgtccagtta ctcgtttctg 4440 gtttcaattt cacctcaagg aaatccttcg catttcaggt tttccccccg aacctttctc 4500 ttcgcactca tttaggatag gagcagctac tacagcagca cacaacggcc tttcccaaca 4560 acagattcaa accctcggac gttggtcttc taacgcctac aaatcataca ttcgcctcag 4620 ccaatatcac ctcagagaag cccagcagat tctcggtagc cgacatcaac acttatgaat 4680 ctctaaaaac acaaatatct gctctctcta attccatcct acaaatagga ctcttatata 4740 tatatttttt cttttttccc ccttctgttt tttctctctt cctcatttcc ccttggggtt 4800 tatatgtctg taagtatgct caaatgtgta ttcgtaggtg ttcaagcgtg tgtttatgtg 4860 tgtgtgtgtg tgtgtatgtg tgtgtgtgtg tgtgtgtgtg tgtgcgtgtg acatatttat 4920 ttatgtgtgt gcgtatgcaa gtagggaagt acgtgttata aaaataataa aaaaaaaata 4980 tttcaatccc tgcataatac agctgtaagg cttagcggga acatgcatta cttctaacct 5040 tacaattcat actctacaac gctctaaagc tttttatctc cacactcccc gcataataca 5100 gccgtaaagg cttagcggga acaaatattg cttcctcacc ataccaaatt cttacattac 5160 aacgctcgaa gctcttttta ctaacacaat ccccgcataa tacagtcgta aaggcttagc 5220 gggaacaaat attgcttcct caccatacca aattcttaca ttacaatgct cgaagctctc 5280 tttaccaaca caatccccgc ataatacagc cgtaaaggct tagcgggaac aatattgctt 5340 cctcaccata ccaaattctt acattacaat gctcaaagct ctttctacca acacaatccc 5400 cgcataatac agccgtaaag gcttagcggg aacgcacacg gctttctcac catacaatct 5460 tctacttcaa cgctctaaac tccaccaaat catatttcac ccattatcct atttcaccaa 5520 cttttcattt gacactgaat acctacgcac ccctgcaatt acagagtcgc taccaagcct 5580 catccctatc tgctacacag cagttaatca gctctccaaa gcgtacttcc ctttagatac 5640 tgacttccat cgcacctctg cagccgcaga gacgctcagc cgagcatcac tcccctctac 5700 atcactgccg cagcagtgac gctcagcgag cgcatataca cttccactta gcaatccact 5760 actgatacac tctccagcac tgctaaagca gttaaacgtc tctgcggagg cgtatttccc 5820 ttccagatac tgacttccat tgcacctctg cagcagcaga gacgctcagc cgagcatcat 5880 ttacatcccc tgcaccactg ctgcagcagt gacgctctgc tggagctcat gcacacttcc 5940 attacactta taccacttct gcactcccag cactgccaaa gcagttaaaa cgctgctgca 6000 gaagcgtatt cttcccctta gtactgacta ccaccgcacc tctgcagtcg cagagacgct 6060 cagccgagct tcatcccccc tgcaccactg ctgcagcagt gtcgctccac cggagctcac 6120 gcacactacc atttatccat accactactg acacaccccc cattggactg ccgaagcggt 6180 taatcagtta accgcttctc atctgcagct acagctccgc agaaccttac agcttcattt 6240 tccatcacta ctatactaaa cctttacgaa actcaccaca tatgttttaa acctttacac 6300 ttaccccact cactctcccg ctggtcctta caattaaccg gccccgggag cacacatagt 6360 catacataag cactttcagt taatttttac acccacacca gtctctgttg ctcctccaag 6420 ctatttctgt atcacttttc agcagccgga tatggcatta atctcctgtg ccttttgggg 6480 ggttcttcaa atacgcggct gctgtcccga gcggagcatt ttggggagtt gtcgagatct 6540 acctgagctc gaggctcccc tctcttcctc caaacgggag ggagcccagg gctcaagaac 6600 cttcgagctc agggctctct cccgggacag catgccaaac ttgcttataa tcaatcatca 6660 gctaagtgtg aactcttgaa gtgaagttta ttcataaact aatttcgaga ggagcacgtg 6720 attatgattg aacacggctg gttctgcatt agcatgcttg atccaccaat caggccattc 6780 ctaaccacta taaagagcca gggttttctc actacagtca tcttcgattt gaagaatccc 6840 cccttccacc cctacctttt cacctttccc tccatagggc agcacggtgg ctcagtgact 6900 agcactgtcg cctcacagca agaacgtcac cggttctagt tccttaacag gccggtggtc 6960 gtttctgtgt gtagtttgca tgttcttccc gtgcttgcgt gggttttccc cgggttctcc 7020 ggtttcctcc cacattccaa aaacatgcac aacaagttaa tcgttaaatc taaatttcaa 7080 tacaggtaat ctaataatgc agcatatctt ttaatagctt tcaatcttag tctttagcta 7140 ttgtmaaaag gggagttgtc gagatctacc tgagctcgag gctcccctct cttcctccaa 7200 acgggaggga gcccagggct caagaacctt cgagctcagg gctctctccc gggacagcat 7260 gccaaacttg cttataatca atcatcagct aagtgtgaac tcttgaaatg act 7313 // ID Kolobok-1_DR repbase; DNA; ZEB; 6303 BP. XX AC . XX DT 27-FEB-2007 (Rel. 12.02, Created) DT 27-FEB-2007 (Rel. 12.02, Last updated, Version 1) XX DE A family of autonomous Kolobok transposons - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Kolobok-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6303 RA Kapitonov V.V. and Jurka J.; RT "Kolobok, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(2), 113-113 (2007). XX DR [1] (Consensus) XX CC Kolobok-1_DR is a consensus sequence of a family of autonomous CC Kolobok transposons that were active in the zebrafish genome in a CC last few million years. The Kolobok-1_DR transposon is CC characterized by 15-bp imperfect terminal inverted repeats, TTAA CC target site duplications, and it encodes two proteins: (i) the CC 699-aa transposase, Kolobok-1_DR1p, composed of the THAP CC DNA-binding domain and catalytic "DDE" domain, which is conserved CC in all Kolobok transposases, and (ii) the 155-aa Kolobok-1_DR2p CC protein. The second protein is conserved in highly diverse CC Kolobok transposons present in the genomes of vertebrates (frog, CC fish), chordates (lancelet, sea urchin, sea squirt), and CC cnidarians (starlet sea anemone). See also comments in CC Kolobok-1_XT. XX FH Key Location/Qualifiers FT CDS join(6026..5932,5870..5753,5645..5394) FT /product="Kolobok-1_DR2p" FT /translation="MSERNTIQPYIYDTDSDTDQAESTVEESIIRIRYVTM FT HAPSVICLIKFANVMRYVKHRQDIKISTKINFALGDTAGKFPVYVNVAALK FT TSNEIINSITVSCNKVRRRMDQLEMQPSCMTEHPGMDPVCFNIYSLQNASN FT IYKADYGPLRLRGVQK" FT CDS join(283..561,1459..1585,1688..1931,2038..2311, FT 2416..2593,2716..2792,2963..3143,3254..3502, FT 3627..3798,3882..4030,4168..4337) FT /product="Kolobok-1_DR1p" FT /translation="MPPREHPCAILGCKAQHKSLHRLPANEQQRLKWLSFI FT FEDKVPENVPKLLYVCANHFSVDCLENAGQYNSGFASKLFLKTGSVPTIRD FT PASLQATTSTKSNKVDVACQTEPFHTSLHSVGTQLSFRTLQAKRRSTSVQT FT TVSSFELTVPSASSTAFMTSTPLKPSLKRPRLELEEEEEEEEEAEETLTES FT KIIAQDSDVTFDPAEECTSATEPTDLSIQECPVHNIAKFIVYETCLMELFS FT DCPVCQRRCDVKSQRLGTFLSVQQVCPHCEFVRKWNSQPIIGSTPAGNLHL FT SAAVYLSGASFFVVEKVFAAMKLHIFKYDSFRRHARLCIEPAIVHKWRNWQ FT SEMLEQLSRRENVIVGGDMRADSPGHSAKYGSYTMMDLATNTVVDLQLVQS FT NEVGGSYHMEKEGLKRSLDLLDARGVRLECIITDRHPQIQKYLRDRNVTQF FT YDVWHIEKGISKKLDKICQIKGCEKLRKWLRSIKNHIYWTAASSTTGPERV FT AKWTSILNHVQDKHVHEDPNFPACLHPQRISRDKNKWLSAATMPFYKLEKV FT LANKRILKDVAKLSPHHQTSTVEAFHSVILRFAPKNVVFPFLGMLCRLYLA FT ALHYNENAGRPQATSATGKPIYKLAFPKAKKGEYRVREVKTQQTFGYVEEL FT LDLIFNQVFVDPSPYVDEVLGIHIPPALSSAYDRPEMEEAISSRVTRFNQ* FT " XX SQ Sequence 6303 BP; 1979 A; 1158 C; 1177 G; 1989 T; 0 other; agggcaccta ttgtaaaaaa tcttcttttc aagctgtttg gacagatctg tgtgttggta 60 tagtgtatag accatcatac tggggtgata tgaacacacc cagtcctttt ttttcaattt 120 aactacaaaa aaaagggtcg gccaattgga gctgttttca aatcgatcgc accattacgt 180 aggtgtgcga ttccccccgc ccaccgaatt gattgacagc tgcgcgtatt aacatatcct 240 gagtttgccg tttccgccat tatcagcgtg aacgtcgaag cgatgccacc gagagaacac 300 ccttgtgcta ttttaggatg taaggctcag cacaagagcc ttcaccgtct tcctgcaaat 360 gagcaacaga ggcttaagtg gctgagtttt atctttgaag acaaggtgcc tgaaaatgta 420 ccaaaactgt tgtatgtttg cgctaaccac tttagcgtag actgcctcga gaacgcggga 480 caatacaact ccggattcgc gagtaaattg tttttgaaaa ctggatcagt tcctacaata 540 cgtgatcctg cctcacttca agtaagtatt acattttatg tacttctagc cttagcattc 600 taattatgtt aagccgatga atagcatctg cggctaaagt agcttcggat gtttgtttgc 660 acgtgtttac ctcttattgt gcttttcaaa aacagtctaa atactcgaag ctgcaaatgg 720 tagtgtgtac atgaaggggg tttagccatt agaaacagaa accgtagaaa aatagcaaca 780 ttgtaaaaag cagtaccact ggctacagct aaagtttgtt gttgttaaat gtttgttgta 840 ctgtaatggt aagccgaaat gaatagcatc taaaagttaa aagcagctaa agttgcttcg 900 gatgtttgtt tgcacgtgtt cactgctatt atgcttttct aaaacaatct aaataatcga 960 agctataagt ggtagtgtgt acaagaaggg cttttagcca ttagaaacag aaaccgtaga 1020 aaaatataaa cattgtaata aagcagtaac cagtactggc tacagctaaa gtttgttgtt 1080 tatgtgggat gtttgttgta atgtgtatac tgtaatgtta agcagaaatg aatagcatct 1140 gcggctaaag ctacttcgga tgtttgcgcg tttacagcta ttgtgctttt ctaaaacagt 1200 ctaaatactc gaagctttaa gtggtaatgt gtacataaag gggttttagc catcagaaac 1260 agaaaccgta gaaaaaatta aacattgtaa taaagcatta ccactgacta agttacagct 1320 aaagtttgtt ttgtatgtgg gatgtttgtt gtaatttttt tcactgtact tgcacgtggt 1380 ctgcagttgt ttacaataag atcggtatgt tttaatcaaa taaaggtcat tttaaaaaat 1440 gtatgttttc ttacaaaggc cacgacgtcg acaaagagca acaaggtgga tgttgcttgc 1500 cagacagaac catttcacac atctttacac agtgttggta cccagttgtc ttttaggacg 1560 cttcaggcta aacgcaggag tacaagtaag ttaaaatttc tcagtttatt ttaagttgct 1620 gcattagaaa cgtgttttat tatctttata aacgtgcaat gagattaatc ataaaattac 1680 actataggtg tccagacaac agtatccagt tttgaactga ctgtaccatc agcgtcctct 1740 actgcgttta tgacttcgac acccctcaaa ccttcactaa aaagacctcg tctggagctg 1800 gaggaggagg aggaggaaga agaagaagcc gaagagactt tgacagagtc aaaaattata 1860 gcccaagact cggatgtgac atttgaccct gcagaggaat gcacatctgc aactgaacca 1920 acagacttgt cgtgagttta tgcattatgt aaattatttg aaagattgtg tattgtcatc 1980 ataacattca tcatcatttt ttttttatta tgtacatgtg tatatttatt tttacagaat 2040 ccaagaatgc ccagttcata acattgccaa gttcattgtc tatgaaacat gcctcatgga 2100 gctcttcagt gactgcccag tgtgtcagag gagatgtgat gtaaagtcac aaaggcttgg 2160 gacatttctg agcgtacagc aagtctgccc acactgcgag tttgtaagaa aatggaacag 2220 tcagccaatt attggtagta ctccagctgg caacctgcat ctttctgctg cagtctactt 2280 gagcggtgca tcattttttg tagttgaaaa ggtaattatc tatcaaaatt tatcacattt 2340 cttttagaaa cataatataa attttgtcac agggtgcatg cacattgtac taaaaattct 2400 cctctctgtt gtcaggtatt tgctgccatg aaactacaca tttttaaata tgattcattt 2460 cgtcgtcatg caagactttg cattgaacct gctattgtcc acaagtggag aaattggcag 2520 agtgaaatgc tagaacagct cagtcgaaga gagaatgtga ttgttggagg agatatgagg 2580 gctgactccc caggtattta gtcaaaagat ttaatttcta attcaataca gtctaataaa 2640 aaaaataaat atgaaataca ttacaattat tccaaaaatg tgtttaaaat aattatgtac 2700 ggtttgttct tgaaggtcac tcggccaagt atgggagtta tacaatgatg gaccttgcaa 2760 ctaacacagt ggtcgaccta cagttagtcc aggtaataca gaaaagtgtt atgtgaacta 2820 gcataatatt ttggatggaa aatgtgtgat ttttatatat atatatatat atatatatat 2880 atatatatat atatatatat atatatatat atatatatat atatatatta tttttttttt 2940 taccttattt aaaattttat agagtaatga ggtgggcggc agttaccata tggaaaaaga 3000 aggcctaaag agaagccttg acctgttgga tgcccgcggt gttcgtctgg aatgcatcat 3060 cacagaccga catcctcaaa tacagaaata tctcagggat cgaaatgtca ctcagtttta 3120 cgatgtgtgg catatcgaga aaggtatgta atttagcagt tattaattta cttaagatta 3180 aattgctatt tacagttctt acatgtttat tattattttt gtgttgatgt cttttttttt 3240 ctttaatttc caggaatttc caaaaaactg gacaagatat gccagataaa ggggtgtgag 3300 aagttgcgta aatggttacg tagcatcaaa aaccacatct actggactgc tgcatcatcc 3360 acaacaggtc ctgaaagagt ggcaaagtgg acctctatct taaaccatgt acaagacaaa 3420 catgtacatg aagaccccaa ttttccggct tgtctgcatc cgcaacggat aagcagagac 3480 aagaacaaat ggctgtcggc tggtaagttg tatacagaca aataaaatat tctttgcgat 3540 taaaactaat tcaaaataga gcttgtttca ttgttcagat ttgtttttga tttagactct 3600 aataccttaa atttttgctt gtttagcaac gatgccattc tacaagctgg agaaagttct 3660 cgctaacaag agaatattga aggatgtggc caagctaagt cctcaccacc agacgtcaac 3720 tgttgaagct ttccacagcg tcatactgcg gtttgcaccc aaaaatgtgg tattcccatt 3780 tctggggatg ctatgcaggt aaagccaatg atcagaaatt aatatttaca atactataca 3840 gtcttgtcat ggtacgtcat ttacattttt ctttttgtta ggttgtactt ggctgcactg 3900 cattacaatg aaaatgccgg gcgtccccag gccacatcag caactggtaa acctatttac 3960 aagcttgcct ttccaaaggc aaagaaagga gagtataggg tccgagaagt gaagacacag 4020 caaactttcg gtatgtatat aaatgtttat agtattatta ataattcaat aaataaaaaa 4080 aataattgca ctttacattc acaatgaaaa ctcactactc aaaatacaaa attaagttac 4140 taaagagata actgttatat tgcacaggtt atgtagaaga gctgctggac ctcatcttca 4200 accaagtgtt tgtggaccca tcaccctatg ttgatgaagt gttgggaatt cacataccac 4260 ctgctctatc atcagcctac gatcgacctg agatggagga ggctatctcc agcagggtga 4320 cccgcttcaa tcaatagcta tcctaaaacc cacatagctg ccctgctctt caggaaactg 4380 tctccgaatc ttcagcacca cacaggcagg aatcacaaca cggactctcc gcccgaggaa 4440 accccaacac cagctcacaa agctccgata agccaaaaag cggcactgtc taacagagac 4500 ataataatga aacattgttt tagaaattgc actacaattt gaacttggga gttatcacat 4560 atacattttt attttttata aaataactag ctttaacaaa atttcagaag tcatattttg 4620 tttattcaac tgccaatgta tatttgatat gtttgtatgt gtttgtagat atactgtact 4680 ttctaatgtt catggacaat tgtatttcat tcaacaaagt gacaacttgg agtttatatt 4740 ttatatttgc acaaggatac attttatatt tgccaatcac attttaattt atacttcaat 4800 ataaagtttt tatatgacta aggaattgaa gcatcttttc tgaagggttg atgtttctat 4860 gccatactga tttttatata ttttaagagt ttatatatta atgtttagtg aatttaaaat 4920 tatttttcaa taaaaagtac atgcattact cagcctttta tgaagtcatt caaattgtca 4980 tctcagaaat agcagcaata aaaaacggta cgtatttcat ttttattgac attgaggtaa 5040 agttggtttt atattcagat attcggacat tctccttaat agtatcattt cagtaaagta 5100 ttctttgcaa attcaaaata gggaaatcgt attagcagca aatgacaaag tacaacattg 5160 tttacagtca agtagatagt gttaatgttg tttacaagtc atacttgcat gctaaatccg 5220 atcgattgtt taaactacca tggtaaacaa tacagggact tctgaatgtg cggatataaa 5280 accaacttta tcgctatttt attgactaca cagcgttaat gtgcttaaaa agaaagcgag 5340 catgcagatc atgaacaagg taacctgcta ttctttctat ttatttaaac tcacttttgc 5400 actccacgaa gtcgcagggg accatagtct gctttgtata tattcgatgc attttgcaag 5460 gaataaatat tgaagcaaac gggatccatt cctggatgct ccgtcatgca cgagggttgc 5520 atttccagtt gatccattcg tcttctaacc ttattacacg acacagttat gctgtttatt 5580 atctcattcg atgtctttag agcagcaacg ttaacataaa ctggaaattt acctgcggta 5640 tctccctgca gcatacattt tcagcctcag ttggcatttt ttcacaattg ccacacaagc 5700 acctgccaat atttgtcaac gtcaatattc atgaattacg aataaatcat accaatgcaa 5760 aatttatctt tgtggatatc tttatgtctt gtctgtgttt gacgtaacgc atgacgttcg 5820 cgaactttat caaacaaatt acggatggcg catgcatagt tacatagcgg ctgtaaaagt 5880 atacaaaaac ataaaatatt ttgtatgtac cactctgaca cattctggca cattcggatt 5940 atagattcct caacagtcga ctctgcttga tctgtgtccg agtctgtgtc atatatgtag 6000 ggttgaattg tgttcctctc actcatgatg actcttactg tctgcctctg tgacgcggtg 6060 cacaaagcag acgcgctctt ggctccgccc ccttgttacg ttgggcgggc agtcgaaact 6120 cattatcatg ttaaacaaca caccccaaaa cagcaacccg tgtacacgcc tctaaattta 6180 cactttttaa cacagcataa taaccatatc tgaattgcgt tttgatctga acctacattg 6240 gcacactcag aagaaccata atattactat taaatctcaa aaaagggcta aaatatgtgc 6300 cct 6303 // ID Gypsy41-I_DR repbase; DNA; ZEB; 5057 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy41_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy41_DR; KW Gypsy41-LTR_DR; Gypsy41-I_DR; endogenous retrovirus; gag; KW integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5057 RA Dib M.R. and Naveira H.F.; RT "Gypsy41_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 812-812 (2007). XX DR [1] (Consensus) XX CC Gypsy41-I_DR is an internal portion of the Gypsy41_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy41-LTR_DR. CC Gypsy41_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1608-aa polyprotein CC Gypsy41_DR1p (pos. 199-5022) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC nine independent insertions bearing at least 85% Homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 199..5022 FT /product="Gypsy41-I_DR_1p" FT /note="ORF." FT /translation="MSTFDLETFVANPTLEVFERCRKDDLLQIGEHYRIPV FT VRQSLKRLIKDEVLKHLIKHKILNLPDADGDKAGAEAVEHSPDNLVDAEKE FT TGEYAEVETEAKTGLPPFHPFSPTSIESGGDARLKVRLARMQIEAQERAET FT RHAEMKLRLDIRKLEIDAETQVKLRQLELDSMKVGSGRSVQLNPVSPAAND FT NSASFLSDVSSSAFDISKHIALVPHFRESEVDSYFAVFERVAAALHWPKEV FT WCLLLQCRLVGKAQEVCSALTLEESLKYDSVKSAVLRAYELVPEAYRQRFR FT NQKKNAQQTFVEFARDKGVLFDKWLAANNVRDFNALRELMLLEDFKNCLPE FT RIVTYLNEQKVTTLSQAATLADEYVLTHKSVFSVPRPDKTLSSFTAQNSAR FT PKSSSPTNKEIRECFYCHKIGHLISECLVLKRKNQQGQMKPAAFVKTLSEL FT DSSENEIDAGFKPFLMKGLISVNGNTEEQKEIKILRDTGALHSFLISDALP FT LSDQTYCGSNILVQGIEMGVLTVPLHRIHLQCDLISGFVKVGVRPSFPVKG FT VAFILGNDLAGDKVSSLPEVVDKPDCFIPADDISNTFPDVFSANVVTRAQK FT RRIGEEIILSDSFLPPLFTENPVSVDFKSFKGAEKETRSDNTPPTDASECD FT LLTEPISHERIKNAQRDDESLTKCFSAADKDFKQTKNNLSVYVIENGLLLR FT KWRPSRDVENEWEVVCQIVLPTIYREQVLNLAHDHPLSGHLGVTKTYKRIL FT KHFFWPGLKKDVVSHCRTCHVCQVTGKPNQKIPPAPLVPIPIVGEPFEHVI FT LDCVGPLPKTKAGNQFLLTIMCSATRFPEAIPLRKITAPVVIKAMIKFFST FT FGLPKIVQTDQGTNFLSKVFAQVLTSLGIKHRTSSAYHPESQGAIERFHQT FT LKSMLRKYCMSSEKDWDEGIPLMLFAIRESTQESLGFSPADLTFGHVFRGP FT LKVLKDGMLNEKSKGVNILDFVSRFRERLHNVCTLARESLSVAQEDMKGWY FT DRKAVVRDIKPGDDVLVLLPVPGSALSARFSGPYKVSKKLSETDFVINTPD FT RKRKFRTCHINMLKLYYPRKVSGNLEKEQLAAVSSDPLVAAVTAVCDSADV FT SELPSLGECHNDDDDFTFRNDLTMCGRLLNSEILDNLPDYFNTLSETQRRD FT LIKLLREYPMLFSDVPTQTNVLQHDIQVTCDHPIKQHAYRINNVKRSVMKD FT EVDYLLKNNLAEYSHSPWSSPCLLVPKADGTFRFCTDYRKVNSVTVPDSYP FT LPRMEDCIDNLGSAKFVTKLDLLKGFWQIPLSPRAAQISAFVTPDNFLQYK FT VMAFGLRNAPATFQRLVNIVLSGVPNCHAYLDDLVIYSCDWDEHLSALRTV FT FKRLSDASLTINLAKCEFGKATITYLGKEVGQGQVRPVSAKVTAIAEFPIP FT TTRRELRRFLGMAGYYRSFCRNFSTVVCPLTSLLSPKADFVWSDACQQAFE FT SIKLLLTCAPVLAAPDFSRPFKIEVDASSLGAGAVLVQEDDVGFDHPICYF FT SRKFNRHQRNYSTIEKEALALILALQFFEVYVGSSVLPVTVYTDHNPLVFL FT SRMYNHNQRLMRWSLIVQNFNLNIKHKKGSENVIADTLSRA" XX SQ Sequence 5057 BP; 1396 A; 948 C; 1143 G; 1570 T; 0 other; tataattggg ggctcgtccg ggatctattt aaattattta aaatcttaat ttagtcccgg 60 ataatagttt tgtgtgtgtg tactcggatg gtgtagagaa aaaaaggaga aaaactgtga 120 cagtgaaata gtatcatttt ctctctttgc aattttggtc ttttgttagt aagtaggcta 180 tttgtgagaa tttagacaat gagtactttt gatttagaaa catttgttgc taatccgaca 240 ttggaagttt ttgagcgatg taggaaagac gatttgttgc agataggtga gcactatcgc 300 attcctgttg tgaggcagtc tttaaagcgc cttattaaag acgaggtttt aaagcacttg 360 attaaacaca aaattttaaa tttacctgat gcggatggcg ataaggctgg tgcagaggct 420 gtagaacaca gtcctgataa tttagtggat gcggagaagg agacgggtga gtatgccgag 480 gtggagaccg aggcaaagac aggcttgccg cccttccatc cattttcccc gacttcaatt 540 gagtctgggg gtgatgcgag gctaaaggtc cgcctggctc gcatgcagat tgaagctcaa 600 gagcgagctg agacgcgcca tgccgagatg aagctgcgcc tggatatccg taaattggaa 660 atcgacgcgg agacacaggt aaaacttaga cagctcgagt tagactcgat gaaggttggt 720 tctggccgat ctgtgcagtt aaatcctgta agtcctgctg ctaatgataa ttcggcaagt 780 tttttgagtg atgtttcgtc tagtgctttt gatattagta agcatattgc tttagtgcct 840 cacttcagag aatccgaagt ggattcctat tttgctgtgt ttgagcgtgt ggctgccgca 900 cttcattggc ctaaagaagt ttggtgtctt ttgctacagt gtagattagt cggaaaggca 960 caggaagttt gttctgcttt aactttggag gagagtttga aatatgattc agttaagagc 1020 gctgttctcc gcgcatacga gttagtgcct gaagcgtatc gacaacgttt tagaaaccaa 1080 aagaaaaatg cacagcaaac attcgtagaa tttgcccgtg acaagggagt tctttttgat 1140 aaatggttgg ctgctaacaa tgtgagagat tttaatgcgc tgcgagaatt gatgcttctg 1200 gaggatttta aaaattgcct gcctgagaga attgtcacat atcttaatga acagaaagtc 1260 acgactttgt cacaagccgc aactcttgcg gatgagtatg tgttaacgca caagagcgtg 1320 ttttctgtac cccgacctga caagacattg agttcattca cggctcaaaa ttccgcacgt 1380 ccaaaaagta gttcaccaac gaataaagag attagagagt gtttttactg ccacaaaatc 1440 ggacatttaa tttccgaatg tttagtgctt aaacgaaaaa atcagcaagg gcaaatgaaa 1500 ccagcagctt ttgtgaaaac tttgtctgaa cttgactcta gtgaaaatga aattgacgct 1560 ggctttaagc catttttaat gaagggactg atttctgtaa atggaaatac tgaagagcag 1620 aaagaaatta aaatcttgag ggacactggc gcacttcact cgtttttaat ttctgatgca 1680 cttcctttat ctgatcagac ttactgtgga tccaacattc tagttcaggg aattgagatg 1740 ggtgttttaa cggtgccttt gcatcgaatt cacttgcagt gtgatttgat ttcaggtttt 1800 gtcaaagttg gtgtacgtcc ttcctttcct gtcaagggtg ttgcgtttat tctcggaaat 1860 gacttggccg gggataaagt ttcatctttg cctgaagtgg ttgataaacc tgattgtttt 1920 atccctgctg atgacatttc taacacgttt ccagacgtat tttcagcgaa tgtagtcact 1980 cgtgcccaaa aacgcagaat cggagaagaa ataatattat ctgactcatt tttgccacct 2040 ttgtttactg agaatcctgt ttcggtcgat tttaaaagtt ttaaaggtgc tgagaaagaa 2100 acgcgctccg acaatactcc gcccacagat gcgtccgaat gtgatctttt aactgaacct 2160 atttcgcacg aacgaattaa aaatgcacaa cgagatgatg agtccttaac gaaatgtttt 2220 tctgcggctg acaaagattt taaacaaacg aaaaataacc tgagtgtata tgtgattgag 2280 aatggattat tgttgcgtaa atggcgtcct agcagagatg tagagaatga atgggaggtt 2340 gtttgtcaga ttgttctgcc tacaatttat cgagaacaag tgctaaattt ggcccacgat 2400 caccctttat ctggtcattt aggtgtgaca aaaacttaca agcgcattct taaacatttc 2460 ttctggcctg gcttaaagaa ggatgttgtt tctcactgtc gcacttgtca tgtttgtcag 2520 gtcactggca aaccaaatca aaagattcca ccagcaccac ttgttccgat ccctattgtt 2580 ggcgaaccat ttgagcacgt cattttggat tgcgttggtc ctcttccaaa gacgaaagca 2640 ggtaatcagt ttttattaac tattatgtgc tctgcaaccc gctttcccga ggccatcccg 2700 cttaggaaaa tcactgcacc tgttgtgatc aaagcgatga tcaaattttt ttctactttt 2760 gggttaccaa aaatcgtaca aacagatcaa ggtacgaatt ttctgtcaaa agtgttcgcg 2820 caagttttaa cctcattagg cataaagcac cgaacatcaa gcgcgtatca tcctgaaagt 2880 cagggagcta tcgaacgttt tcatcaaaca ctaaaatcca tgttgagaaa atattgcatg 2940 agctctgaaa aagactggga tgaaggaata ccattaatgt tgtttgcaat tcgtgaatct 3000 actcaggagt cattagggtt ttctccggca gatttaacat tcggccacgt ctttcgcggg 3060 cctcttaaag tgctgaaaga tgggatgtta aatgaaaaat caaaaggagt aaatattttg 3120 gattttgtta gccgttttag agaacgtttg cacaatgttt gcactcttgc acgtgagtca 3180 ctgtccgtag ctcaggagga catgaaagga tggtatgaca gaaaagctgt tgtccgtgac 3240 attaaacctg gtgatgatgt tttagtactg ctaccggtac ccggttcagc gttgtccgct 3300 cgtttttctg gcccctataa ggtgtcgaaa aaactgagtg aaactgactt tgtgataaat 3360 acacctgatc gaaaaagaaa atttcgcacc tgtcacatta acatgctcaa gttgtattac 3420 cctaggaaag tgtctggtaa tctcgaaaag gaacaacttg ctgctgtttc ctctgatccc 3480 ttagttgctg ctgtcactgc tgtgtgtgac tctgctgatg tgagtgagct gccttctctg 3540 ggtgagtgtc acaatgatga tgatgacttt acgtttcgta atgacttgac tatgtgtggt 3600 agacttttga attcggagat ccttgataat ttgccagatt attttaatac tttgtcagaa 3660 acacagagac gtgatttgat caaactgttg cgagaatatc cgatgctgtt tagtgatgtg 3720 ccgactcaaa ctaacgtgct acagcatgat attcaagtga catgcgatca tcctattaag 3780 caacatgctt acagaataaa taatgtaaaa cgttctgtga tgaaagacga agttgattat 3840 ctgttaaaaa ataatctagc agagtatagt catagtccct ggagctctcc gtgtctgctt 3900 gtgccaaaag ctgacggtac attccgtttt tgcacggatt atcgtaaggt aaattccgta 3960 acagtgccag atagttatcc attgcccaga atggaagact gtattgacaa tttgggttcg 4020 gcgaaatttg tcacgaagtt agatctactc aaaggatttt ggcaaatacc attatctccg 4080 cgagctgcgc aaatttcagc tttcgttaca ccggataact tcctacaata caaagtaatg 4140 gctttcgggt tacgaaacgc tcccgcaact ttccagcgcc tcgtaaacat cgtattgtct 4200 ggagttccaa attgccatgc atacctcgat gatttagtaa tttattcctg tgactgggat 4260 gagcatttgt ctgcacttcg tactgttttt aaacggttat ctgacgcttc gttgactatt 4320 aatttagcca agtgtgaatt tgggaaagca acaattacct atttgggcaa agaagtaggt 4380 caaggtcaag ttcgtcctgt gagtgccaaa gtgacagcta tagctgagtt tcctattcct 4440 acaacgagac gcgagttgcg ccgcttttta ggaatggctg gttattacag gagtttctgt 4500 cgtaatttct cgactgttgt ttgtcccttg actagcttgt tgagtccaaa agctgatttt 4560 gtatggtctg atgcatgcca acaggctttt gagagtatta agttgttgct tacttgtgca 4620 cctgttcttg ccgcacctga cttctcacgt ccatttaaaa tagaagttga tgccagttct 4680 ttgggtgctg gcgctgtgct tgtccaagaa gatgatgttg gctttgatca cccaatctgt 4740 tacttttcac gtaagtttaa tagacaccaa cgtaattatt ccaccattga gaaggaagct 4800 ttggctttga ttttggcgtt gcagttcttt gaggtttacg tgggttcatc agttttgcct 4860 gttacagtat acactgacca caatcctcta gtgtttctgt ctagaatgta caatcacaat 4920 caacgcctta tgcgttggtc gttgattgta caaaatttta acttaaacat aaagcataag 4980 aaaggatctg aaaatgtcat tgctgacact ttatctagag cttaatgttg ctttatgttt 5040 agtcttaaga gggtgag 5057 // ID Gypsy123-LTR_DR repbase; DNA; ZEB; 320 BP. XX AC chr16; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy123-I_DR; KW Gypsy123-LTR_DR; Gypsy123_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-320 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1283-1283 (2008). XX DR Genome; chr16; Positions 15705031 15705350. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 320 BP; 62 A; 58 C; 95 G; 105 T; 0 other; tgacgacccc ttgtggtcgt ttgtggctgg ggtgtgggtg tcggtctgtg catctgtgtg 60 tgtgtgttgc aggtccagct gattgcgctg ggattgagaa caccggtttg tgattgagtt 120 gctctgctat aaatagcgtg ttcgtttcag acgagaggga gaggctgcac gtggaatctg 180 gtcacttacc tttcggctgg gatcggtttt ggacctgatg ggctagcaac atgaccttaa 240 taatagtaaa taaatatatt tttgtaccgc aattgtactc gacttctcct tatttgtggt 300 gaccggttta ggtcataaca 320 // ID L1-7_DR repbase; DNA; ZEB; 5559 BP. XX AC AL645691; XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 04-AUG-2002 (Rel. 7.07, Last updated, Version 1) XX DE L1-7_DR is a non-LTR retrotransposon from the L1 clade. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1 clade; KW L1-7_DR; ORF1; ORF2; endonuclease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5559 RA Kapitonov V.V. and Jurka J.; RT "L1-7_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 27-27 (2002). XX DR Genbank; AL645691; Positions 62218 56660. XX CC L1-7_DR is a family of L1-like non-LTR retrotransposon. This CC family was active recently (ORF1 and ORF2 have a few stop-codons CC only). XX SQ Sequence 5559 BP; 1619 A; 1123 C; 955 G; 1862 T; 0 other; ttttataggg gggcgctgtc gcgcgaatac gatgtagacg cgttcacagt gagctcctca 60 caggtgacat ttcatttaag aaataatcag acctagaagt taatttcaaa ctttagtgtg 120 ctttgactcg gcttatacat acgttcggac agtaaatttg tggatatcgt gtgtagaatg 180 gctgctcctg gcaacaaacg aggtggcaaa gagacacgag ctaatacggc taacgttagc 240 tcgggtgact cacgaggtac ccacgaggga gaaccacttt cagctgaaat gtttaaacaa 300 tcgctggagt ctataaaaaa tgacatttgc attaaaattg aaagtgcgat cagcggtcta 360 caatcagaca tcaacggagt gaaagaagag ttgacttcag ctgtagccac aattcaacaa 420 tcgcttggct cgcaggagga gcggctgaga gctgtcgagg attcggcgac agcaacgagc 480 aattctttaa ggtaccttga aactaccgtg gctaaacttc agactgaggt tacggctctg 540 tctttaaagt gtgaagatct tgaaagcaga tcaaggagga ataacatcag actgatcgga 600 ataagcgagg gagaagaagg caaccgtcca acggagtttg tttcaaactt tctccgcgac 660 ctgttgaaca tggaggagaa gccctgcttg atcgagctca caggtcactt caaccaaagc 720 ccaaaccggg tcaaccaccc aggcccttca tcattagagt ccacttcttc catattcgtg 780 aacagattct ccaagcggcg cgtcaagaag gaaggctgac atacaaggga aatatctggc 840 tatctggcta taatctcttt cctgatctca cggctaacga agccaagaaa agagcggctt 900 ttggggaagt gcgcaaattc atacgaaacg ttgaaggggc tcgatttggc tttcgtcatc 960 ccgccagatt ccgaattacc ttgccgggta atacagagca gctatttacg gaccccctca 1020 aagcgatgga atatgtgaaa aacacccttg cagctgacgc gattggaaca aagaccagtg 1080 ttatgcccga ataactgaaa atcacttgtc ggctaccttc tgagatgact gtccattcca 1140 gattttcttg caaggtactt tatctttgga ataaggagaa ctgattgcat tttacttgga 1200 ccagtccatt gttctactat tatttcattc tctaaaaaac agaacactgc atctgactgt 1260 ttattagcct aataaccaga tttaaaatct ttggttattg ccttaatttt aagcactatc 1320 tagacacata ctgcactttg tggtctgcgt gcttattttt ttttctcacc tgatgacgcg 1380 cttgtctttt ccgtagcgca tcctagacga caagttcatc gtctgtgctt tttgtcttat 1440 ggggttttat atacacctag ttaagggagg tgttaaagtg ggtgggggat atcaatgtgt 1500 tatagttcag aattttgttt tattttattt atttattttt ttttccctgt ttcttttttc 1560 ttgtggctac tggtatttta cacaatccat tctgtctgtt aagttttaaa atatctcaag 1620 atatcttcaa taatttattt aaaatgtata tcaataagta aaatcataaa ttttgtaagc 1680 tggaatgtaa aaggacttgg tcgtcccact aagcttaaca aagtgatcaa tcatttgcat 1740 aatttacaag ctaaaatagt tttttttaca ggaaacacat cttaatgttt ctgatcacac 1800 catgttacgt agaaaatggg tctcacaatc ttttcattcc ttatttacca gtagagcaag 1860 gggagtggct attcttattc ataaagatat tccttttaat acatcagacg tgctggctga 1920 ccgtaatggt agatatatca ttgtctctgg atcacttttt aatactccgg ttgtattggc 1980 taatatttac gcaccaaatt ttgatgatga taattttttt attaaatttc tctccctctt 2040 accaaattta cattcaagat atttgattat tgggggagat ttcaatctct gcttgaaccc 2100 tcaacttgac agagcatctc taaggggaac agctctctca aagtcagcta aaattataaa 2160 ttcctttctt aatgattatg cagttacaga cgtatggcga ttcttgaatc cacttacacg 2220 tggttattcc ttcttctctt atgctcatca gactttttct cggattgatt attttttaat 2280 tgacaacaaa ctcttaccat gtgttcagca ttgcacttac aatgcaatag tcatttcgga 2340 tcattctact ctcactttag gacttaaatt tgagtgtaaa aatgttcaac gtccactttg 2400 gaggttaaat gttcgtctat taaccactga ggattttgtc cagattattt cctcccaaat 2460 agactttttt gttcaggtta attcttctct agacatcaca catagcactc tttgggaagc 2520 atttaaagcc tacattagag gtcaaattat atcttatgta tcatttgaaa ggaacaaata 2580 taaaaaatct attgcagatc tgacagatca aatctcccaa ttagaaaatc aatatgtaaa 2640 tagcccttct ccaaacatac ttaaagaata tctcctctta aaagctgatt ttgacacact 2700 ttctactgtt acagcagagg agatattaat aaaatcaagg catgcttatt atgaatgtgg 2760 agagaaggcg catagacttt tagcccacca gcttagacag tcctctgcct cccatataat 2820 ctctactatt aaagcaaacg atatcctact gtcagatcac tgtgcaatca ataacgcatt 2880 tttagaattt tatcagtcat tgtattcttc tgaacaaatt tctgatcagg tggactttca 2940 taactttttc caagatcttt ccatacctcg cattagttct gaagatatga aatttctgga 3000 cagccctatt tctttagatg aggtttacaa tgctatatca gcaatgcaaa gtggcaaaag 3060 tcctgggcca gatggtctgc cagctgaatt tttaaagact ttttctggac aactagctcc 3120 tcttttacta aatatgtaca acgaatcatt tgatctaacc tgtcttcctc agactttgcg 3180 agaagcctca atttctttta ttttgaaaaa gggcaaagac cctcagttgt gcagttcata 3240 tcgaccaata tcacttttaa atgctgatgt aaaaatctta gccaaaattt ttgctctacg 3300 attagaaact cttcttccca aaattatttc ctctgaccaa actggctttg ttaaaaacag 3360 attttcattc tttaatatca gacgattatt caatatcatt tatcatccta tgtctttatc 3420 tgtacctgaa gtcttgattt cacttgatgc agagaaggct tttgatagag tggaatggaa 3480 ttatttaatt tatgctctta agagttttgg ctttggagca aattttatat cctggatcag 3540 attattgtac aaagccccac ttgcctctgt tcgtaccaat aacatttctt ctgcatattt 3600 tcagattaat cgaggcacaa gacagggctg ccctctctct cctcttttat ttgcagttgc 3660 aatagagcct ttggccattg ctctgcgtca aaacacaaat atagctggta taataagagg 3720 aggacaagag cataaagtgt ctctatatgc agatgacttg ctgctatacg tttctgatcc 3780 atctaaatcc ttgcctcata ttctcaaaac gtttgataca tttggcaggc tatctggcta 3840 taaattgaat ttacagaaaa gtgaaatatt tcccatcaat aatgctgccc gtgagttttc 3900 ctttacatcc ttcccattca agatatcaaa ttcatttata tacctcggca tacatttatc 3960 taacgcttat tttaagctct ttgctgaaaa cttactcccc ctactggagc gagttgaaca 4020 agatttaaag cgttgggcct ctcttccatt atcagtcgca ggtcgaataa actcaattaa 4080 aatgtctttt cctaatttca gatgttacta ctgggcctca aatattcgtc ctatacttca 4140 ctggctctac gaagatcctg gggctgatgc tttatcatgg atagccattg aatctacctc 4200 atgcttgccc tcctccctag cagcattagt ttatgctcct ctttccttcc cttatgacaa 4260 atatacaaaa aacttgatgg taagatccac acttaaaatt tggaaacaaa ttagacgtca 4320 atttggttgg caaactatat ctccaaaatc tcccattcat tctaatccgg tttttagtcc 4380 atcaataatt gataaatcat ttgcgtcctg gcatgctaga ggtgttaagg aaataaacga 4440 cttataccat gagggaattt tctcttcatt tcaacagata tgtgaaaaat ttaatattcc 4500 aaataaaacc tttttccatt atctccaaat tcgcgatttt gtgcataaaa tgttccctca 4560 ttttccagta ttgcctccta gctcctcatt tgataattta ctggagaggc ccacaatttg 4620 gaaagggatt atatcatcac tctactataa aattatggca ccaacaaata catcccattc 4680 ccaaataaaa tctgtgtggg aagatgatct gggaatcaca atttcagatg aggattggga 4740 atcagtctta tacagaattc acaagtcctc tgtctgcgca aagcatggac ttctgcagtg 4800 caaaattgtg catcgtgtac actggacaaa cctaaagctc tcaaaacact ttcagcatat 4860 agacccggcc tgcaatcgct gtaatttcac cccagccact catgcacata tgttttggtc 4920 ttgtataaag cttaattact tttgggacac tatttttaaa attatatctg tttttttggg 4980 tatatccatc tccccttgcc cactgatagc catgtttgga attttgccta caacatacaa 5040 tcttacaaaa atacaatctg atagcatagc ttttattgtg ctcctggcca aacgactaat 5100 attgcttaag tggaaggatc agaagcctcc aaccccaaca cagtggattc gtgatgtctt 5160 attttttgtt aagttggaaa aaattagact gtctagaggt tcatctaatc agtttaaaaa 5220 aacttgggac ccatttagga aatatatcca agaacatgtt ttattgtcaa ctacagacca 5280 aatccttgtt ccagcaaatt gatttatcta ttttttcttt cctaaatgcg aaattagagt 5340 gtattgtgtg cttcagattg cacaactgta gtttcattta agccacacta ttgattttta 5400 tttttctatt tatttattta ttttatttat ttattttatt ttttttatta ttattttttt 5460 ccctttttct cttattgtac gtgccaaagt tgatactgtc tatacttttt gaaatgtaat 5520 tctgttttgt cttctttttt gctgtaattt gccataaaa 5559 // ID DNA8-4B_DR repbase; DNA; ZEB; 321 BP. XX AC . XX DT 22-OCT-2008 (Rel. 13.1, Created) DT 22-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 8 bp TSD; KW DNA8-4B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-321 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Direct Submission to Repbase Update (11-NOV-2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 321 BP; 88 A; 56 C; 68 G; 109 T; 0 other; cagtggttct caaactgtgg tacgtgtacc actagtggta cgcaggcttc cttctagtgg 60 tacgcggagg aatgaaatat gtcatgtaca tgctacacac atttcaaaat ttatcaaaaa 120 tgatgtatat aatatgccat atatgacata tagcctatat ttctgaggta atctgccacg 180 ttttttttaa ctgtgcagag ttgtagctgc tttactgggc ctactacgct actgtatttc 240 aatactgctc attttggtgg tacttggaga gacaattttt ttctgaggtg gtacttgatg 300 aaaaaagttt gagaaccact g 321 // ID DNA-2-24_DR repbase; DNA; ZEB; 1252 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-24_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1252 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2133-2133 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1252 BP; 451 A; 167 C; 169 G; 465 T; 0 other; cactctaaaa aaaaaaaggt tcctagaggt tctatataga accttggggt tctatacttg 60 gccagtagaa ccttttaccc aaaaaatggt tccatttaga accctttgag tgcaaagaac 120 ctttttgatt aaaatggttc tatagttttt gattcataac ataattatct agtaaagatt 180 ataaaataat tataagctat aacacagata tacaaagctt tttgagtcaa aacctaataa 240 aatgaatgct gtggtcaaaa actttatcag tgtacagtat gatgggattt ttttcattca 300 ggtttgcaca ttagcatata acttatattt tatatatact taatatttaa catttttaac 360 aaattatttt gtgaaataaa taaaataacc agtttattta tttagactgt gcagtaccat 420 gtttttatga tttaataatt tatttaaaac taataatcat cgtcgtggac aggtggtgaa 480 gagcctcgag agtgacggtt aatttttcga aaggagaagt ccaccacagc ttcatcattc 540 acacaatata tatatatata tatatatata tatatatata tatatatata tatatatata 600 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 660 tatatatata tatatatata tatattatat atatatatat ataatatata tatataatat 720 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 780 atatatatat atatatatat atatatatat atatacttaa aggtttaata tttcacaata 840 catttagcct gcatttttgt gtcctttata aataacacct gatctttcat ctatggatta 900 ttaaaatatg tttggcttta aaaggtttta ataatatcca gggacggact ggccataggg 960 agaaccggga cctttcccgg tggcctgatg gtcaatctag cctgccaccg tgactctggc 1020 ctgccgcgcg tacacagact cggtgcgggc acgagcgctg cactgctacc taaataaaga 1080 acccttaaat ggttctaaat agaaccatct aacttgaatt cgaagaacca tttggggttc 1140 tatttaatga acccttaaaa tggttctatt tagaaccttt ttggggttcc atatatgaag 1200 cgcttgatag aaccattttt ggttctatat agaaccgttt cttttaagag tg 1252 // ID Gypsy7-LTR_DR repbase; DNA; ZEB; 804 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 07-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE Gypsy7-LTR_DR is a long terminal repeat of the Gypsy-7_DR LTR DE retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy7-I_DR; Gypsy7-LTR_DR; Gypsy7_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-804 RA Kapitonov V.V. and Jurka J.; RT "Gypsy7_DR LTR retrotransposon from zebrafish."; RL Repbase Reports 4(11), 292-292 (2004). XX DR [1] (Consensus) XX CC Gypsy7-LTR_DR is a long terminal repeat of the Gypsy7_DR CC retrotransposon whose internal portion is deposited in Repbase CC as Gypsy7-I_DR. XX SQ Sequence 804 BP; 185 A; 135 C; 164 G; 320 T; 0 other; tgaccaggcg atctctctag tattgtgtgc ggggggtctg aaattctgat gggaaaagtg 60 gacctgtgca gaacggcttc ggtgtccctg ggtcagtaac ttttgtccat ttgcatccat 120 tcatacaccc acacattcca cacacacatt acaacgttat cacataggct ttctttacat 180 tttttccctc tcttgtttcc agtcgatcag ccacacatta ttattgtttg tttgtttgtt 240 tgttttcttt tgggttatct aaacattatt attgaagtgt ttgaattgtt tgtttgatgt 300 ttccttttgt tacctaagaa gtgctaattt tgactatgat ctatgactga acagtgatta 360 aaagtatata ttttgattta gaaaaccttt ttgttgtact gcttgttaaa aatataccag 420 ttaatttaat ttaaagtgtg tgttgggtta tctttatttt cttctctttg tgtgtatggt 480 ggttaagtgt ttgccgcggc tatccagggt tttaatttaa tttttggcaa gttaatcttg 540 tctggcgccc aaagatttat taattctggt aactattaat tttttttttt tttgtttgag 600 gtccgctggc agtgggatta agggacgtgt gaaagccagc tagtgcagta cccacgtgga 660 ccagcggttg agcgttcggc tgcggtgcag gaggtcccgt gttcgaatcc cgcagagttc 720 agtttttgag atttgaatta tttttttttt ttcttattat aaactgtata taagatttat 780 ttattacaaa caaccaccgt caca 804 // ID HARBINGERN3_DR repbase; DNA; ZEB; 1064 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE HARBINGERN3_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW 16-bp TIRs; 3-bp TSD; HARBINGERN3_DR; Harbinger/PIF superfamily; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1064 RA Kapitonov V.V. and Jurka J.; RT "HARBINGERN3_DR, a nonautonomous DNA transposon from Danio RT rerio."; RL Repbase Reports 2(5), 20-20 (2002). XX DR [1] (Consensus) XX CC HARBINGERN3_DR is an ancient nonautonomous DNA transposon. CC Putatively, CC it belongs to the Harbinger/PIF superfamily. CC HARBINGERN3_DR copies are flanked by 3-bp target site CC duplications. CC The consensus sequence has perfect 21-bp terminal inverted CC repeats. CC There are approximately 1000 copies of HARBINGERN3_DR in the CC zebrafish genome; they are ~91% identical with the consensus CC sequence. This family is composed of minor subfamilies. XX SQ Sequence 1064 BP; 325 A; 185 C; 182 G; 371 T; 1 other; gggtccgttc ttcgtacctc gcttaaatga tctaagatga tttggcagat cctggatctt 60 ttaatcttga taactgatct ctcgctaatt tggttcttca aacaagttcg cgaatcagat 120 taaaatgtct ggatgaactg atctgagatc gctgcgtgtg ttgtgaagga cagatctatc 180 gatcctcgaa atcatgatca gcaatgcaac gattggctga cggcacagca gcgtaatgac 240 atcatctgat taatattcaa ttatccatgt gagcaaaatt acatcaaatt agcagtaaac 300 ggtttgttaa atatgacacg caataacctt ccacatttgt tgtgagctgc aggctttaca 360 ctttcatgtg tcaagagtat tcatcatgta tttcaatgca tatcaatgta tttagttcta 420 catttagaga atattttctt tattatagta gcctaggcct attcaagttt ttttaatcgg 480 cgtaaagaat aactggctgt ttaaattaat tttgcatcaa aaaagcattt tgatattggt 540 aaaggtgtct gcaacttttg tgaagcaatc aaaaacatgt gcatgactgc ataaatgtat 600 tatgctttat ttatttttta tttttttata cacaatttgt actaaagcgt ttgtatctaa 660 aaagttcata tcaataagtt ttctctttgc accaccaggt ggcagtcttt gtactttcat 720 ttcgagggtg cagattgcat aagttttatt aatatatctt tataacttta tatatatagt 780 taaaaatata tatatttact attttcccaa gtgtatataa ctactactgt aagaaaatat 840 cagaattcgg tacatacttt cwgtattatc tttgcttgaa ctgagccgat ctaatcctgt 900 ttatatgaat tgaacctgct cccgagcagg tttgagctac cagaactgtt gctatgacaa 960 caactctcgg atgagttttg aagaacgaaa cgatcctgga tcatgtcaaa tcgtcaatat 1020 ccaaatccag ctaattgagt aatccacgta cgaagaacgg accc 1064 // ID L1-3_DR repbase; DNA; ZEB; 5520 BP. XX AC . XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 19-FEB-2010 (Rel. 15.03, Last updated, Version 2) XX DE L1-3_DR is a non-LTR retrotransposon from the Tx clade. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clade; L1-3_DR; MutsuDr3. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5520 RA Kapitonov V.V. and Jurka J.; RT "L1-3_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 23-23 (2002). XX RN [2] RP 1-5520 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Molecular Biology and Evolution 21(2), 207-217 (2004). XX DR [2] (Consensus) XX CC L1-3_DR is a family of L1-like non-LTR retrotransposon. This CC family was active recently (no stop codons in protein sequences CC encoded by the genomic element). CC It can be an active element. It encodes two proteins: CC 449-aa L1-3_DR1p (positions 214-1560) and 1279-aa L1-3_DR2p CC (positions CC 1565-5401). These proteins are most close to corresponding CC proteins CC encoded by other L1-like elements. L1-3_DR1p is a putative CC RNA/DNA CC binding protein, and L1-3_DR2p is composed of the AP endonuclease CC (aa positions 1-200) and reverse transcriptase domains. CC This is the same as a Tx-like non-LTR retrotransposon family, CC MutsuDr3, specifically inserted into 5S ribosomal RNA genes [2]. XX FH Key Location/Qualifiers FT CDS 214..1560 FT /product="L1-3_DR1p" FT /translation="MDSQSNSTDLFDEAHGENITGRSGERQKENSTGLRGE FT RQLENNTGQEGERNMVNKERQENQQQEMQQQRSGKKYMKEATVILNVDKVK FT EVRVLEIINAVTEKCGLGKILAIRPRQDKDYELTLENEETCDKLMDGLIIK FT GETCEVKMLQNRDYVVSFMHLPAYLDDQLILDKLEGWGVFPITKIKRRVYP FT GTEIEDGTRYLKVRFPREVISLPYSTRLETAEGQQYFRVMHSHQVKTCRLC FT MSPDHMVKDCPDFKCHKCEERGHFARDCDAIKCPDCQNYLSKCECWMEEEE FT EEDEIQVSGQMHEGNSEKESNEEEQTTTQTIQTTTEEITLKEENAKKEKEK FT ETETDSQQIEHEEEVAWTPMDITSSFKNVLDVIEKEDLKEQSKGTDRETGK FT HEEEVEKDKLEKRQTRRSAKVLSKLETARKKGLKKGQLKSHNRYDSLRGLG FT EEED" FT CDS 1565..5401 FT /product="L1-3_DR2p" FT /translation="MTVFISFFLLLMVLNCVSFNARGLMDLRKFKNVKEKC FT KREEIIILQETNWKNEVMDIYKKEWDGDFYYSNGDTKAGRGVAILIKKNAL FT QMSKVIYKDKQGKCMILEIKYEGKDIILVNVHAPNEESEKKSFFNILRRFL FT KNYKQIIVCGDFNTVFSRQDITEGMVFKSDTGRKELKSLIEEKGMIDIWRE FT RNGKKKEFSRRQIVGNFVNQSRIDYVLCTRNIEIYIEKIRYDETVLSDHKF FT VFFNFNTEEIQRGPGVWTLNSEILKNEDYVKKVKEIIEKEKGNQIYNEDKR FT LWWENVKFLIKKSTLTFCRIIQKNKRHKEKTIKENLEIELDKNEKDIQKIK FT EMEGKLKEIEEKKYEGARLRSKAKYVVEGEKCTKFFFDLERQKAKAETIKI FT IQGAKGESIEGNEEILKEIKKYYEELFKTQGVDEVQMSKLLKQIKTKIDEE FT DKKECDQEIGEEEIRKAIESLNKKKSPGIDGLNSEFYVCFKEILIPILTDV FT FKEILQKEELNERMGMGLMKLIHKKGEKTLLKNYRPITMLNTDLKILTKIL FT ANRLKEVMPKIIKTTQAYAIKGRDIADITMSIKDIIEYMKEKKEEGYIISL FT DFEKAFDRVEHQFLFKVLKKFGFGEVFRKWIKILYKGILTKIKCNGFLTEC FT FKITRSIRQGCPLSALLYSLVAEPLGLAINEEEKIKGIEIEENKVNKKMFQ FT YADDTTLIVKGKESVKEAMKIVQQFCKGSGSKVNEDKTVYMKFGKETDLAE FT CTNFKEVEEIKILGVLMGKDARKARDKMWEGFLTDIERRLNYWKLRTLTLK FT GKVLILNVLMESKLWHVLYVLEMPMWIEKRLKKCFTDFLWGGKPPRIAFNT FT VVGEIDKGGLGLIDVEQRKNSLRVKRVKKYLEKENKAEWKKTMKYFLNKCG FT NFNMGDGILWMKTKAWMTENLPEFYREILGAWGNFLKQVEYSPHGRENILN FT QPLFLNNNILSQGKVLYYKKWIEVGILKVRDILYEFKEGFLTEQYVIDTME FT EAKEEYNRKEIEKNLDIIKQAIPKEWIRSIEKFEKEKETKEVYVKTGEKIC FT NFNECTVKNIYCFFRENVFKEPTANKYWIEKYKNVKANEIWGNMKGRYVET FT KVECLEFLIRHKAIFSDVILNKIGMEQSGMCKVCQKEEEGFLHMFLYCQKL FT EGFLKDCKVLIKGLLGDWDENETEWNRVVMLGWNKNNENKKIVNLCIMLMK FT NAMWERRIVAKKEKMVMDVWGIFKRKMERYVEKLYVYHKNENILSELHKIL FT TDKACQVFKEMNWKLPHF" XX SQ Sequence 5520 BP; 2426 A; 551 C; 1209 G; 1334 T; 0 other; agcacgctag agggagctag agtgcaacga ggaagtgtaa ggctgcagac aggagagaga 60 ggctctccat ttttttgtgt tatttcgagt ttatttagtg tcttttttgt gatagtggac 120 tttttttgat ttttttgtct tttaaccacc tgacagcagc acgtgtgctg gctggaaggt 180 ggttagcaca gatttttttt ttgagagaaa atgatggact cacaaagcaa cagcacagat 240 ttgtttgatg aagcacatgg ggaaaacatc acaggaagga gtggagaaag acaaaaagaa 300 aacagcactg gattacgagg ggaaagacaa ttggaaaaca acactggaca ggaaggagaa 360 agaaacatgg taaacaaaga acgacaagaa aaccaacaac aggagatgca acaacaaaga 420 agtggaaaaa agtacatgaa ggaagcaaca gtgattctaa atgtggacaa agttaaagag 480 gtgagagttt tggaaataat taatgcagta actgaaaagt gtggacttgg gaaaattctt 540 gcgataaggc cgagacagga caaagattat gaactaactt tagaaaatga agaaacatgc 600 gataaactga tggatggact aataataaaa ggagaaacct gtgaagtgaa aatgctacaa 660 aatagagact acgtggtttc cttcatgcat ttgcccgctt acctggatga tcaattaata 720 ttagataaat tagaaggctg gggagtattt cccataacaa aaattaaaag aagggtatat 780 ccgggcacag aaatagaaga tggaactcga tatctcaaag tgagattccc cagagaagtg 840 atatctctcc cttacagcac aagactggaa acggcagaag gtcagcaata ttttagggtg 900 atgcacagcc accaggttaa gacttgtagg ctgtgcatga gccctgacca tatggtcaaa 960 gattgcccag actttaaatg ccataaatgt gaggaaaggg gacattttgc cagagactgc 1020 gatgctatta agtgcccgga ctgccagaac tatttaagta agtgtgaatg ttggatggag 1080 gaagaggagg aggaggatga gatccaggtg agtgggcaaa tgcatgaagg aaacagtgaa 1140 aaggaaagta atgaagagga acaaacaacg acacaaacaa tacaaactac aacagaggaa 1200 ataacattga aggaggagaa tgcaaaaaaa gaaaaggaaa aagaaacaga aacagacagt 1260 caacaaatag agcatgaaga ggaggtagca tggacaccaa tggacataac ttctagcttc 1320 aaaaacgttt tggatgtaat tgaaaaagag gatcttaaag aacaaagcaa gggaacagac 1380 agagagactg gaaaacatga ggaggaagta gaaaaagaca aacttgagaa aagacaaaca 1440 agacgatcgg caaaagtttt atcaaaatta gaaactgcaa gaaaaaaggg tttgaaaaaa 1500 ggacaactga aaagccacaa cagatatgat tctttgagag gtttgggaga agaagaggac 1560 taagatgacg gtttttattt cctttttcct tcttttaatg gttttaaatt gcgtgtcttt 1620 taatgcaaga ggtttaatgg acttaagaaa atttaagaat gtaaaggaaa aatgtaaaag 1680 agaagaaata attattttac aagaaacaaa ttggaaaaat gaggtgatgg acatatataa 1740 aaaagaatgg gatggggact tttattatag taatggagat acgaaagctg ggagaggagt 1800 agcaatttta ataaagaaga atgcactgca aatgagtaaa gtaatatata aagacaaaca 1860 aggaaaatgt atgatattag aaataaaata tgaagggaaa gatattattt tagttaatgt 1920 gcatgcacca aatgaagaga gtgaaaagaa aagctttttt aacatattaa gaaggttttt 1980 aaaaaactat aaacaaataa tagtatgtgg ggattttaac actgttttta gcagacagga 2040 cataacagaa ggtatggttt ttaaatcaga cacgggaagg aaagaactaa aatcactaat 2100 agaggaaaaa ggaatgatag acatttggag agaaagaaat gggaagaaaa aggagttttc 2160 tagaagacaa atagtaggga attttgtaaa ccaatcgaga atagactatg tgttatgcac 2220 aagaaatata gaaatttata ttgaaaaaat aaggtacgat gaaactgttt taagtgacca 2280 taaatttgta ttttttaatt ttaatacaga agaaatacaa agaggcccag gggtatggac 2340 attaaatagt gaaattttaa aaaacgaaga ctatgttaaa aaagtaaaag aaataataga 2400 aaaagaaaaa ggaaaccaga tctataatga agacaaaaga ctatggtggg aaaatgtcaa 2460 atttctgatt aaaaaatcta cattaacatt ctgtagaata atacaaaaga ataaaagaca 2520 caaagaaaag acaatcaaag aaaacttaga aatagaacta gacaaaaatg aaaaagacat 2580 tcaaaaaata aaagaaatgg agggaaaact gaaagaaata gaagaaaaga aatatgaagg 2640 agctagacta agaagcaaag caaaatatgt agtagaggga gaaaaatgca caaagttctt 2700 ctttgattta gaaagacaaa aagcaaaagc agaaacaata aagataatac aaggggcaaa 2760 aggagaaagc atagaaggaa atgaagaaat tttaaaagaa atcaaaaaat actatgagga 2820 gttatttaaa acacagggag ttgatgaagt acaaatgtca aaattattaa aacagataaa 2880 aacaaaaata gatgaagagg ataaaaaaga atgcgaccaa gaaatagggg aagaagaaat 2940 aagaaaagca atagaaagct tgaacaaaaa gaaaagtcca ggaatcgatg gtttaaatag 3000 tgaattttat gtatgtttta aagaaatttt aattccgatt ttaactgacg tttttaaaga 3060 aatactgcaa aaagaagaac taaatgaaag aatgggaatg ggattaatga aattaataca 3120 caaaaaagga gaaaagacat tgttaaaaaa ttacagacca atcacaatgt taaacacaga 3180 tttaaagatt ttaacaaaaa ttttagcaaa tagattaaaa gaagtgatgc caaaaataat 3240 taaaacaaca caggcctatg cgataaaagg aagagacata gcagacataa caatgagcat 3300 aaaagacatc atagaatata tgaaggagaa aaaagaggaa ggatatataa taagtctgga 3360 ttttgaaaaa gcttttgata gagttgaaca ccagttttta ttcaaagtac tcaaaaagtt 3420 tggttttgga gaagttttta gaaaatggat aaagattttg tataagggta ttttaacaaa 3480 aattaaatgt aatggctttt taacagaatg ttttaaaata acaagatcga taaggcaagg 3540 atgtcctctg tcagcacttt tatattcact tgtggcagaa cccctgggct tagctatcaa 3600 cgaagaagaa aaaataaaag gaattgagat tgaagaaaat aaagtaaata aaaaaatgtt 3660 tcaatatgca gatgacacca cattaatagt aaaaggaaaa gagagtgtga aagaagccat 3720 gaaaatagta caacaatttt gtaaaggatc agggagtaaa gtaaatgaag acaaaacggt 3780 ttatatgaag tttggaaagg aaacagattt agcagaatgc accaatttca aagaagtaga 3840 agaaatcaag attttagggg ttttaatggg gaaagatgcc agaaaagcga gagataagat 3900 gtgggaaggt tttttaacag atatagaaag gaggttaaat tactggaaac taagaacact 3960 aacattaaaa ggaaaagttt tgattttaaa tgttttaatg gagtctaaat tgtggcatgt 4020 tttatatgtt ttagaaatgc caatgtggat agaaaagagg ttgaaaaaat gttttacaga 4080 ttttttatgg ggaggtaagc caccaaggat tgcttttaat acagtcgtag gggaaataga 4140 caaaggtggt ctgggtttaa tagatgtaga acaaagaaaa aatagtttaa gagtgaaaag 4200 agtaaagaag tatctagaaa aagaaaacaa agcagagtgg aaaaaaacaa tgaaatattt 4260 tttaaacaaa tgtggtaatt ttaacatggg agatgggatt ttatggatga aaacaaaagc 4320 ttggatgaca gaaaacttac ctgaatttta tagagaaatt ttaggtgcgt gggggaattt 4380 tttaaaacaa gtggaatata gtccacatgg aagggaaaac attttaaacc aacctctttt 4440 cttgaacaat aacattttaa gtcaagggaa ggttttatac tataagaaat ggatagaagt 4500 tgggatttta aaagtgcggg acattttata tgaatttaaa gaaggctttt taactgaaca 4560 atatgttata gacacaatgg aggaggcgaa agaggaatac aacagaaagg aaattgagaa 4620 aaatcttgac ataattaaac aggcaattcc aaaagaatgg ataagaagca tagaaaaatt 4680 tgaaaaagaa aaagaaacaa aagaagtgta tgtgaaaaca ggtgaaaaaa tatgcaattt 4740 taatgaatgt actgtgaaaa atatttattg tttttttaga gagaatgttt ttaaagaacc 4800 aacagcaaac aaatactgga tagagaaata caaaaatgta aaagcaaatg aaatatgggg 4860 aaacatgaaa ggaaggtatg tggaaacaaa agtcgaatgt ctagaatttt taataaggca 4920 caaagcaatc ttttctgatg tcattttaaa caagatagga atggaacaaa gtggaatgtg 4980 taaagtatgt caaaaagaag aagagggttt tttacacatg tttttatatt gtcaaaaatt 5040 ggaaggtttt ttaaaagatt gtaaagtttt aattaaagga ctacttggag actgggatga 5100 aaatgaaaca gaatggaaca gagtagtgat gttgggatgg aataagaaca atgaaaataa 5160 gaaaatagta aacctatgta taatgctgat gaaaaatgca atgtgggaaa ggagaattgt 5220 ggcaaaaaaa gagaaaatgg tgatggatgt atggggaatt tttaagagga aaatggagag 5280 atatgtggaa aaactgtatg tatatcataa aaatgagaac attttgagtg agctgcacaa 5340 aattctgacg gacaaagcat gtcaagtttt taaagaaatg aactggaaac taccacattt 5400 ttaatgtttt tgaatatgaa atgatctttt aatgtctgtt ttattatgat ttactgtaaa 5460 agattgtaat tgaaattggt tcaataaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 5520 // ID HARBINGERN7_DR repbase; DNA; ZEB; 478 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HARBINGERN7_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW 14-bp TIRs; 3-bp TSD; Dr000169; Dr000374; Harbinger superfamily; KW HARBINGERN7_DR; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-474 RA Bao Z.; RT "Dr000169 and Dr000374, unclassified and uncharacterized RT repeats."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-478 RA Kapitonov V.V. and Jurka J.; RT "HARBINGERN7_DR, an ancient family of Harbinger-like RT nonautonomous DNA transposons from zebrafish."; RL Repbase Reports 2(6), 8-8 (2002). XX DR [2] (Consensus) XX CC HARBINGERN7_DR copies are flanked by 3-bp target site CC duplications CC generated upon their integration in the genome. CC It has 14-bp terminal inverted repeats (1 mismatch) similar to CC other CC HARBINGERs. There are several thousand copies of HARBINGERN7_DR CC harbored by the zebrafish genome; they are ~13% divergent from CC the consensus sequence. XX SQ Sequence 478 BP; 149 A; 83 C; 103 G; 140 T; 3 other; gggctatatg cacagctagt gtttttcagc caatgataaa cttccggtga aagcttaatg 60 tgactatttt caatttcata aggttscttt tacagcatcg atgttgtaat gtaattacaa 120 tacattcagt taaatagact taagcattca tttagttgtt caagcgtaaa acgagatgaa 180 agaccgtgta accgctagcg ccgtcagtag cagcaggcta gcgcagaaat tccattgaaa 240 atactggggt aaaataaacg gtcatatttt aaagacatgg cagggggaaa tggaatttaa 300 tgcagtgctt cttgtccggt ctgagaccca ctttatattg tatatctatc aggcagtgaa 360 gatcactgat ttttaaagaa atcagatctt aaacgtgmca attttgtaat ggaaagacag 420 ttcaccggaa gccctggggc tgsctggctg aaattaaaag tctgtgtgca tatacccc 478 // ID Gypsy133-I_DR repbase; DNA; ZEB; 5525 BP. XX AC chr23; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy133-I_DR; KW Gypsy133-LTR_DR; Gypsy133_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5525 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1527-1527 (2008). XX DR Genome; chr23; Positions 3092856 3087332. XX CC Positions [4191-4667] - Integrase core CC 'CTAT' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1345..3360 FT /product="Gypsy133-I_DR_1p" FT /translation="MLSCPHLVVVMGGVSVPCLVDTGSMVSTITETCFREC FT FEPWDQEWLKSCQWLQLRAANRLSIPYIGYIELDVELCGEAVPKCGVLVVR FT DPPGGISAQAPGVVGMNILGRCYQELFGQYGSSLFDSPPVSSASRKVFQAL FT QHCHQARLQMPTEGIGKVRVRGPQVSRVPGGTIKMVVATCSEKFSSGVALF FT EPLETGLPEGLLASPALVQVIRGTVHVPIVNVETSDGLLFPRRVIGTLTSV FT YIVSPSPGVIETRSLAGTGPLTGTTCAISAKADSPRVPEGIDAVDLSFLSR FT EEQDKVRALLGKYQTVFAVCESDLGCTDMISHDIPLLDDIPVRQRYRRIPP FT TEYDVVRTRINQLLETQVIRESCSPYASPIVLVKKKDGSLRLCVDYRQLNA FT KTRKDAFLLPRIEESLDLLSGARWFSTMHLASGYNQVPVTEEDRYKTAFCT FT PFGLFEWNRMPFGLCNAPSTFQRLMQPLFGDQCQSLLLYLDDIIVFSSSVE FT QHLERLEVVLGRLKREGLKARLDKCAFLRPEVGYLGHVISAKGVSTDPNKT FT QAVAQWPRPSSATGLRSFLGFASYYRRFVEGFAKIAAPLHRLTAEFSDKRK FT SKHSGAGFGDAWSDQCENSFAELKVRLTTAPVLAYADFSKPFILEVDASQG FT AWGQTSLKSRRGEFALLRMRVVV" FT CDS 3381..5456 FT /product="Gypsy133-I_DR_2p" FT /translation="MSNYSSMKLEFVALKWAMTEKFREYLLGSRCVVFTDN FT NPLSHLLSAKLGAAEQRWAAQLAAFNFEIRYRSEKSNLNADALSRQNSSES FT PVMGGLVPGTVVPAPLRQALEGAPIVEVQQSMISTMPGYSLSDLGSLQAVD FT PVIAEVLPFWRRKIRPGRDERQQMSSLALTLLRQWDRLVEKDGVLYRRVHC FT PDGGEETLQLILPAVLKSEVVMQLHQQHGHQGIERTTELVQQWCYWPGLSA FT DVSRWCLECERCQAAKDTRPLAQSFMGHLLASRPNEIVAMDFTVLEPSRSG FT IENVLVVTDVFSKYTLAIPTRDQKAETVAQALVNEWFFKFGIPSRLHSDQG FT RNFESLLIRQLCDLYGVKKSRTAPYHPAGNGQCERFNRTLHNLLRALPVSK FT KRDWVACLPQVVFCYNTTPYQATGESPYFLMFGQKPQLPVDFMLGRVQEPV FT AGSVHNWVEEHQSRLSVAFQGARERLQAAADRRKLTHDRQVRDAPLIEGQL FT VFLRHCSYKGRHKIQDMWSPVVYQVVRAPRDGGAVYTIAPVEDLSKAITVH FT RSLLKNRIQKDFSGVSTDAVVELPLSPGDVEEEELDLVRVVQAQGAERLSV FT DQGSSATLTRGTPNLSGESSHLVAVGAEGSELCLETGLGSSIEPGKVSEVM FT PRRTGRTTTGCHSNLHRLPRPAGASLPSNSVSNCITAIFRPWK" XX SQ Sequence 5525 BP; 1266 A; 1181 C; 1550 G; 1528 T; 0 other; cttggcgtag tcggcagggt attttctgac attttttggg ctgtgagacg tggatttttg 60 tgttcgtaca gtgtttttgt ctgtgtttta gaagtaaaac agcagtattc aagcgtggtt 120 gctgaaccca ataggtaagg gtctttatat atattttttt ctccatcttt aaagtttatc 180 tttctctcat catggaagat gaactacaag agttgcgaga tctagtagct cagctgaaag 240 ctgataattt acgattgcag caggaacagc cccaagctag ctccccaggc cctagtgttg 300 atccctccaa tgtgtctgtt ccctcagcgc ccccatttga acttggtaat gctgatgctg 360 catcagcaga gcgatttgtt tttgtcccaa gagatagaaa atgtgccaaa tttaatggta 420 agtcagggat tggcataaat gagtgggtag aggaaatcca ggcttgtatg cgggctcgcc 480 acctatctgt ggctgatcag tctttcttta tatttgatca tttggagggt gaagcccggg 540 aagaaatacg ctatcggtcg gctcaggtac gtgaagatcc aaaacaaatt atcgctgctt 600 taaaagaatt gtatggttgt tctcaatctt atattgtgtt acaggaggct ttcttctcca 660 gaagacaaca ggagggtgag tccctgttgg agttttcact ggccctcttg agtcttctgg 720 agaaggtgaa acagcagtca ccacacgtaa tgacaaatgc tgaagttctt ttgcgagacc 780 aatttgttga acatgtgctt gatagttaat gcagcttgtg cgtcgcctac cctctagtac 840 tctgctcgat gtgtgtaaag aggctatcag gtgggagagg gagggtatgc ctgggggtgt 900 taggggccgc agtcattctg ttccattagc ctatgggttc caatatggag ttaggggtgg 960 tactcattca cctggcaatg cccaacagac tgaaataagt gatttgaggg agattttaaa 1020 atcgcagcaa gagcagctga accagcttac taggagtttc gctctgctgc atgaaccccc 1080 tcgtggtcct cggttgggtc gtaatgattc tgtaatatgt aggagatgtc aaaaacgtgg 1140 tcactttgca agggagtgtg agggggagcg tgttccttcg cgccctcctc caccagaccg 1200 aacggacacc agagttccag agaataggca gttaaaccca cgtcaggtgt cggaaaactt 1260 gctcccgccg tgttgttgag tcgcagcaca ggtgggggag atgtagactc ggtgttgtgt 1320 agggatggtt ctataccaaa cttaatgctg tcttgtcccc atttggtggt agttatgggt 1380 ggggtgtctg tgccgtgcct ggtggacact ggctcaatgg tgtctaccat cacggaaact 1440 tgtttccgtg aatgttttga gccatgggac caggaatggc ttaaatcctg tcagtggttg 1500 cagcttaggg ctgctaatag gttgtctatt ccttatatag ggtacatcga actagatgta 1560 gaactgtgtg gtgaggcagt tccaaaatgt ggtgtgttgg ttgtccggga tcctcctggt 1620 ggcatcagtg cccaagcccc tggtgtggtg ggtatgaaca ttttgggccg gtgctaccag 1680 gagctctttg gacaatatgg atcttctctt tttgattctc ccccagtgtc aagtgcttca 1740 aggaaggtgt tccaggcttt acaacactgt catcaggcaa gattacagat gccgacagag 1800 gggataggaa aggtacgggt gcgtgggcct caagtaagtc gtgttccagg gggtacgatt 1860 aagatggtag tggctacttg ttctgagaaa ttttctagcg gtgtggccct gtttgagccg 1920 ttggagactg gacttccaga agggctacta gcttctcctg ccctggttca agttatcaga 1980 ggtactgtac atgtccctat tgtaaatgtt gagacctccg atggcttgtt attccctcgc 2040 agggtgattg gtacattaac tagtgtgtac attgtttctc cctccccagg agtcattgaa 2100 actaggtctc tagcggggac tgggccctta actggcacca cgtgtgccat tagtgccaaa 2160 gcagattctc ctcgggtgcc cgaagggata gatgcggttg atttatcatt tctctccagg 2220 gaagagcagg ataaggtgag ggctttgctg gggaagtatc agactgtctt tgctgtatgc 2280 gaaagtgatt tagggtgcac tgacatgatc tcacatgata tcccgctact agatgacatc 2340 cctgtccggc agcggtacag acgcatacct cccactgaat atgatgtggt gcgaactcgt 2400 ataaaccaat tgctagagac acaggtgatt cgtgagagtt gcagtcccta tgcctctcct 2460 atagttctgg tgaagaaaaa ggatggcagt ctacgcctgt gtgtagacta ccgtcaatta 2520 aatgccaaaa cgaggaagga tgcgtttctc ctgccccgta ttgaggagtc attggatctc 2580 ctgtcggggg cccgctggtt ttccacaatg catttagcca gtggatacaa ccaagtcccg 2640 gtcaccgaag aggacaggta taagactgct ttttgcactc catttgggtt atttgaatgg 2700 aatcgtatgc cctttggact gtgcaatgct cctagtacgt tccagcggct aatgcaacct 2760 ttgttcgggg atcaatgtca gtccttattg ctatatttag atgacataat tgtgttctct 2820 tcctctgtag agcagcatct agagcggttg gaggttgttc taggtcgtct aaaaagggag 2880 gggctgaagg ctagactgga caagtgtgcg tttttgaggc cagaggttgg ttatctaggc 2940 catgtcatct cagcaaaggg tgtctccacc gacccgaaca agacccaggc agttgctcag 3000 tggcctcgtc cgtctagtgc cactggcctg cgttcttttc tcggctttgc cagttattat 3060 cgtcggttcg tagaggggtt tgcaaagatt gcagcccccc tgcaccgact gacagctgag 3120 ttttctgata agcgcaagtc aaaacattct ggggcagggt ttggggatgc ctggagtgat 3180 cagtgtgaga atagttttgc tgagttaaag gtacgactca ccactgcccc tgtactcgca 3240 tatgcagatt tttcaaagcc atttattctg gaggtggatg ccagccaggg ggcttggggg 3300 cagacctctc tcaagagcag gaggggggag ttcgccctat tgcgtatgcg agtcgtagtc 3360 tgaggcccac ggagcgcaat atgtctaact acagttctat gaagttggag tttgttgcgc 3420 tcaagtgggc catgacggag aagtttaggg agtatttgtt gggcagtcgg tgtgtcgtct 3480 ttacggataa caatccactt agtcacctgt tgagtgcgaa actcggggcg gctgagcagc 3540 ggtgggctgc acagttagct gcattcaact ttgagattcg ttatcgctct gaaaaaagta 3600 acttgaatgc agatgctctt tctcgtcaga attcttctga gagtccagtg atgggaggcc 3660 tggtaccagg aacggtagtg cccgcacctt taagacaagc cttggagggg gcgccaatag 3720 ttgaggtaca acagtctatg atatctacta tgccaggtta ttctttgtca gatttgggat 3780 cactccaggc agtggatcct gtgattgcgg aagtactgcc attttggagg cggaagatcc 3840 gcccagggcg agatgagcgc caacaaatgt catctctagc tttgacattg ctccgtcaat 3900 gggaccgctt ggtggagaag gatggggttc tctatcggcg ggtgcattgt ccggatgggg 3960 gggaagagac cctacagtta attcttccag ctgttttgaa gtctgaagtg gtcatgcagt 4020 tacaccagca acatggccat cagggcatag aacggacaac tgagcttgtc caacaatggt 4080 gctattggcc aggtctttcg gctgatgtgt ctcgctggtg cttggagtgt gagagatgcc 4140 aggctgctaa ggacactagg ccattagctc aaagttttat gggacatctt ctggcctctc 4200 gtccaaatga aatcgtggca atggatttta ctgtattgga gccgtcaaga tcaggcattg 4260 agaatgttct tgtggtaact gatgtattta gcaaatatac cttggctatc cccactagag 4320 accaaaaggc ggaaactgta gctcaggccc tggtcaacga atggttcttc aagtttggca 4380 ttccaagccg cttgcattcg gaccagggcc gtaattttga atctctgttg atccggcagc 4440 tttgtgatct gtacggtgtg aagaagtctc gcacagcacc atatcatcca gctggaaacg 4500 gccagtgtga gcgtttcaac cgtacattgc acaatctgtt acgggccttg ccagtttcca 4560 agaaaaggga ctgggtagcg tgtttaccgc aggttgtctt ttgctataac acaaccccgt 4620 atcaagcgac tggggagtca ccctactttt tgatgtttgg gcagaaacca cagctaccag 4680 tggatttcat gttggggaga gtccaggaac ccgtggcagg cagtgttcat aactgggtcg 4740 aggagcatca gagccgttta agtgtagctt ttcagggggc acgtgaacgg ttacaagcgg 4800 ctgctgatag gcggaaatta acccatgaca ggcaagttcg tgatgcccca ctcatagagg 4860 gccagctggt attccttcgt cactgtagtt ataaaggtcg ccataagatc caggacatgt 4920 ggagtccggt agtttatcaa gtggtgcgag ccccaagaga tggtggtgca gtgtatacta 4980 ttgcaccagt ggaggacctg agtaaggcga ttacggttca tcggtcattg cttaaaaatc 5040 gaatacagaa agacttttct ggtgtgtcaa ctgatgcagt tgtggaacta ccactgtcgc 5100 ctggggatgt tgaagaggaa gagcttgatt tggttagggt ggtgcaggca caaggtgcag 5160 agagattaag tgttgaccag ggatcctcgg ctaccttgac acgaggtact cctaaccttt 5220 ctggggagtc ctcacatttg gttgcagtgg gggccgaagg tagtgagctg tgccttgaaa 5280 ctggactggg ttcttcaata gagccgggaa aagtgtccga ggtgatgcca cgccggactg 5340 gtagaaccac tactggttgc cattccaatc ttcaccgcct tcctcgcccg gcaggagctt 5400 ccctgccgtc taattcagtg tccaactgca ttactgctat ctttaggccc tggaagtgac 5460 ttcggcatgt ttcctctttt tctggtgtaa tcaccgggtc ggtgattcaa aagtgggggg 5520 tagat 5525 // ID Gypsy105-LTR_Dr repbase; DNA; ZEB; 782 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.05, Created) DT 28-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-105_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW LTR; Gypsy-105-I_DR; Gypsy-105-LTR_DR; Gypsy-105_DR; KW Gypsy105-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-782 RA Dib M.R. and Naveira H.F.; RT "Gypsy105_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 947-947 (2009). XX DR [1] (Consensus) XX CC Gypsy105-LTR_DR is a long terminal repeat of the Gypsy105_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is Gypsy-deposited in Repbase as Gypsy105-I_DR. XX SQ Sequence 782 BP; 116 A; 216 C; 139 G; 311 T; 0 other; tgtcacaatc tcggtctgat caccggtttg atgtttgtgt tactttttcc ctttcactcg 60 tattgttttg atcactcata ccctgctttg tttacctctc gcttccccac ccatctgctc 120 tattactttc cattagtcta tcaaactaat ttcactcacc tggtttcctt gatttctccc 180 cctatttatc cctctgttga tcgtagttct gtgtcagatc gtagtttttc cccttccggc 240 atgtctggcg atctcctggt ctagtcctgt tccttgtata tcctgtgagt tttgtcgtgt 300 ctgatgtcat cctgttgatt ttcctaaagt gttttttctc cgtctgcctt caagcctgtt 360 cccgtctgtt tttgctttgc cgcccatctg actctccgtc cctatcctgc cggccctcaa 420 cactctctcg ccggcttcca agcctcagtc ttcctcgtgg ccattctgtc ccagtcagcg 480 gcggccggat tcatcgcccc ctgtcgggta tcagggtatt gcaccctctt gaagactgtt 540 catctaacct ctccggagta gttttattta atcgtttttg gagactttat ttctcagttt 600 tgagtttttg tgaactttgg gcttttttgc caactccatt acggaggaat ttttagtttc 660 cttttttgaa gattgttttg ttgactattt tttttgcccc ttgagtttgt gggcttcaat 720 aaactttgct gagacctgca tctgtgtctg aaccatcttc tctctcactc agacctctga 780 ca 782 // ID BEL30-I_DR repbase; DNA; ZEB; 6648 BP. XX AC chr21; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL30-I_DR; KW BEL30-LTR_DR; BEL30_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6648 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1403-1403 (2008). XX DR Genome; chr21; Positions 25876746 25883393. XX CC Positions [5501-6070] - Integrase core CC 'ATAGG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 533..6457 FT /product="BEL30-I_DR_1p" FT /translation="MSSPTENTDNLSEAPEVRSSSRERTFTEKGQQLHDQE FT ARKNEKAFIKAYDCWKLIAKEVRVKLKAFCSAEDLDKMQTDVKSKLAIVRQ FT TYEPIRRNHTTTPNIVKKMDACDTLTAEICDLVSERMENIDEAFNDQLEKE FT RVRMVLNKEEYGSVFGNTKTETVISELSQESDHNSSTTSRSSSKHADAEAD FT LAAKVEQAKAMQEIHIQQAKLDKLESEWKLKETQMLAEIQKKEAEMKVKLE FT VEKTRLQQLQADNEVKVAAARVRAYNNFDGHESLEEDTDHKTQYSCQDATL FT QTPLNPQALPFQPHTAHFKTPMPQEEVSLAQAIASSLTLSRLPVPEPTTFT FT GDPLKFIDWKISFMALIDQKPFPASEKMLYLKSYLAGEARKAVEGFFYRNS FT EEAYQGAWAVLQDRYGSPFVVQRAFRDKLMKWPRIAANDPIALREFADFLQ FT GCAEAMPHVKGLGILNDCEENHKLLKKLPEWMMRRWSRIVVEELDRSGEYP FT SFVHFTQFMQTEARIACNPIASPLLMNVKNTDERLPKRTKALNTGIQIKTS FT ISGPPENYRPKPPCLVCKDETHGVVKCPSFAAKTMDEKKSFIHENHLCFGC FT LRKGHNTKDCRRRHTCNTCGRRHPTCLHIEMDRNPLESTSKDSKATGGHSS FT KEIHKVMVHALTQHVSATSSIVPVLVSSTTNPQKEILTYALLDTQSDSTIV FT LEDLVTELNVNTQSVQLKLGTMTAVNTVIASKVACGLQLRGLNSETYIQLK FT QTYTRDFIPVEKSHIPTKATALQWPHLKHLMNKLHPLQDCEVGLLIGYDCP FT SALAPLEVVTGNENEPFAQRTTLGWSIIGSANPHLDRCGNETFVHRVTVKE FT KPLAIDVLKALESDFNERSYEEKYVSQDEVRFIQLLNEKIKQKGDGHFEIP FT LPFKGSSPPILPNNKKLATVRLQHLKKKLMNNKQYFDHYTAFMEEILKIGD FT AEPAPPFSDDETVWYIPHHGVYNPKKPDKLRIVFDCSAKFRGISLNDTLLT FT GPDLINPLLGVLCRFRREAVAVICDIERMFHQFSVSPEVRNYLRFLWWEGG FT LFEREPQEYRITVHLFGAASSPGCANVCLKYLAQQHKVNYPAASAFLENNF FT YVDDGLTSVPTVKEAKKLIVETQELCKGAGLRLHKFNSNQSEVLSCVASSE FT RAVNTDPLNLKPYPAAGAHILGIQWSSENDTFSFNTSVKDHSPTRRGLLSV FT IASLYDPLGFVAPFTLSGKHILQELCCRGIGWDDPIPENLRSRWEEWKNGL FT QRLKEIAIPRCYYPQNFGEIVRVELHHFADASNAGYGSCSYLRYQNNRNEI FT HCSFVMAKARVAPTKITSIPRLELSAAVTAARMSVMLKAELEMKIEQEFFC FT TDSQVVLAYINNEARRFHVFVANRVQLIREITDPNQWNYVDTAQNPADHAS FT RGLHASAITSSSWLSGPEFLWEQDVHPMPKPSTELLTGDPEVKSIQVLSTQ FT ISDRKDILSRLSRFSTWTMLVKVVARIKRLGSKLKHHGEIVTVTERRKAAE FT MLYQLVQQQAFPQELKALQENFHGSTILSSSPLFRLNPILDVGLLRVGGRL FT TGSTLSSELKHPIILPKDSHITNLLLAHYHAQVCHQGWSQTLMELRANGFW FT AIGANKSVSKLIHRCVKCRKLRRPTEEQQMSELPRERCEVSAPFTFCGMDC FT FGPFVTKRGRKEYKRYGLIFTCLSSRAIHLEMLDDLSTDVFINALRCFISL FT RGAVSHLRCDQGTNFVGAKNEFKEAPKQCDTEALKAFLADKQCEFTFNAPS FT ASHAGGVWERQIRTIRSVLNVTVAQCSGRLDDASLRTLFYEAMAIVNSRPL FT NVDGINDPMAPEPLTPNHLILMKSKVALPPPGKFVKEDMYAVKRWRRVQYL FT TEQFWSRWKKEYLMNESTRQKWHVPRRNLKVNDIVIIKEDSLPRNQWQLGR FT VVETTEGSDGLVRRVKVQAGERKLTQKQTHCSKPTIIERPIQKLVLLLEDE FT " XX SQ Sequence 6648 BP; 2080 A; 1427 C; 1450 G; 1691 T; 0 other; gtgaaaactc accgctgagc catccaccaa cgcggacatc tttaatcaac ggatggaaga 60 acatctaata ctatgcacac cgtccttgtg tggatgacac agctgcagtg agagactttg 120 gctgaataac tgggtaaact caaccaaaga tacagtatac tcaaagaact gtcatttgct 180 aaacaattct aaggaggatt tcagtgtgca agacaaatac tgtaaatgct agatctgtaa 240 agctgacaag taataagtta aagagtgtgg ctaaggtcat tgtcacataa caatatccta 300 aatcttgatt ctaataattc ctgagggcta aaattgtcgt aaattatcta gtgtattctc 360 gtttatgaca tttgcattgt ctgtttaata tcttggggtg ttcattagtg taaacatcta 420 atttaagtgg taatatcgtg agaaaagtta aatttaccac tcatttgttt tgagtctaga 480 tgtaaaatac cttaagacct tacagtataa acagatccac tataacaata caatgtcatc 540 acctactgag aatacagata acctttcaga agccccagag gtcaggtcga gctcccgtga 600 acgtactttt acagaaaagg gccaacaatt acatgatcag gaagctagga aaaatgaaaa 660 ggcattcatc aaagcttatg actgttggaa gcttatagca aaggaggtta gagtaaaact 720 gaaggcattc tgctcagctg aggaccttga taaaatgcaa actgatgtta aatctaaact 780 agccatagta cgccaaactt atgagcccat tcgacgcaat catactacta ctccaaatat 840 tgtaaagaaa atggatgcat gtgacactct tacagctgag atatgtgacc ttgttagtga 900 acgcatggaa aacatagacg aggctttcaa tgaccaacta gagaaagaaa gagtgagaat 960 ggtgctaaat aaggaagaat atgggtcagt ctttggaaat acaaaaacag agacagtaat 1020 ttcagagtta tcacaagaat ctgatcacaa ctcaagcact acctccaggt cttcaagcaa 1080 acatgcagat gcagaagcag acttggctgc caaagtagaa caggctaaag ctatgcaaga 1140 aatacacatt caacaagcga aactcgacaa actggagagt gaatggaaac tcaaagaaac 1200 ccagatgtta gctgaaatcc agaaaaagga ggctgaaatg aaagttaagt tggaagtgga 1260 aaaaactagg ttacaacagt tacaagcaga taatgaagtg aaggtagcag cagcacgagt 1320 aagagcgtac aacaattttg atggtcatga gagtttagaa gaagacactg accacaaaac 1380 ccagtatagc tgccaagatg ctactctcca aactccacta aatccacaag cattaccctt 1440 ccagcctcat actgctcatt ttaaaacgcc aatgcctcag gaggaggtga gtcttgctca 1500 ggcaattgca agctcactta ctttaagccg actaccagtc ccagaaccaa ctacattcac 1560 tggtgacccc ttaaagttta tagactggaa aatatctttc atggccctaa ttgatcagaa 1620 gcccttccct gctagtgaaa aaatgcttta tttaaagagt tatcttgctg gagaagcgcg 1680 caaagctgtt gaaggatttt tctacagaaa ctcagaagag gcatatcagg gtgcctgggc 1740 agttctacaa gatcgatatg gaagcccatt cgtagtccaa agagccttca gggacaagct 1800 catgaagtgg cccaggatag ccgcaaatga ccccattgca ttaagagagt ttgcagattt 1860 ccttcaaggt tgtgccgaag ccatgcctca tgttaagggt ttgggtattc taaatgattg 1920 tgaggaaaat cacaagctcc ttaaaaagct gcctgagtgg atgatgcgcc ggtggagtcg 1980 tattgttgta gaagagctag acagatctgg agaatatcca agttttgtgc atttcacaca 2040 atttatgcaa actgaggccc gcatagcgtg taaccccatt gcctccccac tcctaatgaa 2100 cgttaaaaat acagacgaga gacttcctaa gagaactaaa gcacttaaca caggcattca 2160 aataaaaact tccatctcag gtcctcctga aaattacagg ccaaaaccac cttgcttagt 2220 ttgcaaagat gaaacgcatg gtgttgttaa atgtccatcc tttgcagcaa agactatgga 2280 tgaaaagaaa tcatttattc atgagaatca tctatgcttt ggatgcttaa ggaaggggca 2340 caataccaaa gactgtagaa ggcgacacac ctgcaataca tgtggtcgac gtcatccaac 2400 ctgcttgcac atagaaatgg acagaaaccc tcttgaatcg acaagcaaag attccaaagc 2460 aactggaggt cactcaagca aggaaatcca caaagtgatg gtccatgcat taacacagca 2520 tgtttctgca acctctagca tagtaccagt tcttgtgtca tcaacaacaa atccccagaa 2580 agaaattctc acttatgcct tacttgacac tcaaagcgac tcaactattg tcttggaaga 2640 tttggtcacc gaattaaatg taaacaccca gtcagtgcaa cttaaacttg gcacaatgac 2700 ggctgtcaac acagtcatag caagtaaagt tgcctgtggt ttacaacttc ggggacttaa 2760 ctctgaaact tacatccaac taaagcagac ctacacaaga gactttattc cagttgagaa 2820 gtctcacatc cctactaaag ctacggctct ccagtggcct catttgaagc atctaatgaa 2880 caagttgcat ccacttcaag attgtgaggt tggtctgcta attggttacg attgcccatc 2940 agcactggct cctttggagg ttgtcacagg caatgaaaat gaaccttttg ctcaaaggac 3000 aacgctaggc tggagcataa tagggtcagc caatccccac ttggatagat gtggaaacga 3060 aacctttgtg catcgtgtca cggtaaagga aaagccactt gccatcgacg tgctgaaagc 3120 actggaatca gactttaatg agaggagtta tgaggagaag tatgtgtcac aggatgaggt 3180 tcgcttcata cagcttctaa atgaaaaaat taagcagaaa ggggatggcc actttgagat 3240 cccactccct ttcaagggta gtagtccacc aattctccca aacaacaaga aactggctac 3300 tgttcggttg caacacctaa agaagaagtt aatgaacaac aaacagtatt ttgatcatta 3360 cacagccttc atggaagaaa ttctcaagat aggtgatgcg gagccagcac cgccattttc 3420 tgacgacgag actgtgtggt atatcccaca tcatggtgta tacaacccca agaagccaga 3480 caaattaaga attgtgtttg attgctctgc aaagtttcgt gggatctctc tgaatgacac 3540 tctactgaca ggccctgact taataaaccc tttgctggga gttctttgtc gatttagaag 3600 ggaagcagtt gctgtgatct gtgacatcga acggatgttc caccaattca gtgtctcacc 3660 cgaagttcgc aactacttaa gattcctctg gtgggaaggc ggactgtttg aaagggaacc 3720 tcaagaatac agaataacag tccacctttt tggggctgca tcctcccctg gatgtgccaa 3780 cgtttgtctt aagtatctgg cacagcaaca caaggtgaat taccctgctg cttcagcatt 3840 tctggaaaat aacttctatg tggatgatgg gttaacaagt gtcccaactg tcaaagaagc 3900 aaagaaactc attgttgaaa cacaggaact gtgcaaagga gcagggctac gtctgcataa 3960 gttcaattca aatcaaagtg aggtcctctc ctgtgtagct tcatcagaaa gggcagtgaa 4020 cactgatcct ctcaatctta aaccctatcc tgctgcaggg gcacacatac ttggcattca 4080 gtggtcatca gaaaatgaca ccttcagctt caatactagt gtaaaggatc actccccaac 4140 acgccgtggt ctcttatctg tcatagcatc tctgtatgac ccacttgggt tcgtagctcc 4200 tttcacacta agtggcaagc acatccttca agaactgtgt tgtagaggta ttgggtggga 4260 tgatcctatt cctgagaact tgcgctcacg gtgggaggag tggaagaatg gtttgcagag 4320 actaaaggaa attgcaattc cgcgatgtta ctatccacag aactttggtg aaattgttag 4380 agtggaatta caccatttcg ctgatgccag taatgcagga tatggctcat gttcgtatct 4440 caggtaccaa aacaacagaa atgagattca ttgcagcttt gtaatggcca aagcgagagt 4500 tgcccccaca aagattacaa gcattccaag attggaactc tcagctgcag tcaccgcagc 4560 caggatgagt gttatgctga aggcggaact cgagatgaag atcgagcaag aatttttctg 4620 cacagactcc caagttgtac ttgcttacat caacaatgag gctcgacgtt tccatgtatt 4680 tgtggcgaac cgtgttcaac tgattagaga aatcactgac ccaaatcagt ggaactatgt 4740 ggatacagca caaaacccag ctgatcacgc ctctaggggt cttcatgctt cagccatcac 4800 ttcatctagc tggttatcag ggcctgagtt tctatgggaa caggatgtgc atcctatgcc 4860 taagccttct actgagctgc taacagggga tccagaagtc aaatccattc aggtgttgtc 4920 aacccaaatc agtgacagga aggacattct tagccgtctg agtcgatttt ccacctggac 4980 aatgcttgtt aaggtggttg ccaggatcaa aagacttggg tccaaactta aacatcatgg 5040 tgaaattgtg actgtcacgg aacgtagaaa ggctgcggaa atgctgtatc agctcgttca 5100 gcagcaagcc tttccccaag agctgaaggc acttcaagag aattttcatg ggagtactat 5160 tctaagttca agtcctcttt tccgcctcaa ccccattctg gatgtgggac ttctacgtgt 5220 tggtggaaga ctaacaggtt cgaccctcag ttcagaacta aaacatccca ttattctccc 5280 aaaagacagc cacatcacca atctgctctt agctcactat catgcccagg tctgccacca 5340 gggttggagt cagaccttaa tggagcttcg ggccaatgga ttttgggcca taggtgcaaa 5400 taaatcagtc agcaagttaa tacacagatg tgtgaaatgt cgtaaactca gacgaccaac 5460 tgaggagcaa cagatgtctg aactcccaag agaacgctgt gaagtctccg cccctttcac 5520 tttctgtgga atggattgct tcggcccatt tgttaccaag cgaggtcgca aagagtataa 5580 gagatatgga ctgattttta catgcctgtc ttctcgtgcc atccatctag aaatgcttga 5640 tgacttgtca acagacgtat ttattaatgc tttgaggtgt tttatcagtc ttagaggagc 5700 agttagtcat ctacgctgtg atcagggcac aaacttcgtg ggtgcgaaga atgagtttaa 5760 agaagctccc aagcaatgtg atacagaggc tttgaaagct tttctggcag ataaacagtg 5820 tgagttcact ttcaatgctc cctcagcgag tcatgcaggt ggtgtgtggg agcgacaaat 5880 tcggaccatt cgaagtgtgc tgaatgttac agttgcccag tgttcaggta gactagatga 5940 tgcctccctt agaactctgt tctacgaagc aatggctatt gtcaacagtc gtccattaaa 6000 tgttgatgga atcaatgatc ccatggcacc agaacccctg actccaaacc accttatctt 6060 gatgaagtca aaggtagcat taccacctcc aggaaagttt gtcaaggagg atatgtacgc 6120 tgtaaagaga tggcgaaggg tacaatattt aactgaacag ttctggagcc ggtggaaaaa 6180 ggaatacctc atgaacgagt ccactagaca aaagtggcat gtaccccggc gcaacttgaa 6240 agtaaatgac atagtcatca tcaaggaaga ttcacttcca aggaaccagt ggcaactagg 6300 acgagtggtc gagacaacag aagggagtga tggcttagtg cgtcgagtaa aagtacaagc 6360 tggggagcga aaattaacac agaaacagac ccattgctct aagcccacaa tcatcgaacg 6420 accaatccaa aagctagtgc tcctccttga agatgaatga tgatctgtca atcttgtaga 6480 tacacttacg tcagatatgt ttgactaaca gacatagggt gacccttatc gtatttgtga 6540 cctataccaa tagttattca ttattgccta gaaatcatgt atgagtaact gtttttcttg 6600 tttagttttt cctgctaaat tcagttatca ttacatgatt ggtgggag 6648 // ID LTR-10_DR repbase; DNA; ZEB; 875 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE Putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW solo-LTR; LTR-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-875 RA Bao W. and Jurka J.; RT "Long terminal repeats from zebrafish."; RL Repbase Reports 8(12), 2167-2167 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 875 BP; 286 A; 166 C; 170 G; 253 T; 0 other; tgttttagta tgccccaaat tttagtatgt aaaatgtttc agcataccaa aaaggtttta 60 gtatggtttg tgtttatgca agtgagacaa gtgtggtgct aatcgacaag agaaggcacc 120 aataatggtc catacaccat actaatcaag caatacaact gtttattgct ggtacacaca 180 gctgatatca attatcacaa taatattcaa taatcatgaa tcatactcat gaacatcatt 240 aatcgaaata caacaaaatt gcataaaagg cttataatgc gaaataaaga tgagattaca 300 tgaataaaag acagataatg aaaacttcaa gcaatttgat acaagtatat acaacctcag 360 agacgtgcca tcattttcct cttcaaagac tggggagccc gttgcagtca cattggaaaa 420 ccaacacgtg tgtccgagaa tgccgggcga tgcgcgctta tcccacggct gagcttccaa 480 tgtttcagca agaaaccctc acctttatag tcaaaatttg ggtgctggcc aaaggtcggc 540 ctaataggta agagagaagt gtgagttgat caggcccacc ttctctctca gctagttgca 600 cattatttct gattgcaaag cacaaatttg aggtttgtag tgtcagcatg attatctaga 660 gtcaatattc tacctcctga aatttggaga aagttaacct taaggtccgg agacccttct 720 ctggttattt ggtcatgctt aagtgtcaat attttacctc ctgaaatttg gagaaaggtg 780 tgaagttaac cttaaggtca tgagaccctt ctctgttcat ttggttgttt gttgagcgca 840 acagaagaaa aacaaaaaat atcccatttg ttaca 875 // ID Gypsy-173_DR repbase; DNA; ZEB; 929 BP. XX AC . XX DT 08-APR-2009 (Rel. 14.04, Created) DT 27-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE Solo-LTR of the Gypsy LTR retrotransposon; a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR; Interspersed repeat; Gypsy superfamily; Gypsy-35-LTR_DR; KW Gypsy-173_DR. XX NM Gypsy-35-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-929 RA Bao W. and Jurka J.; RT "a family of Solo-LTR from zebrafish."; RL Repbase Reports 9(4), 855-855 (2009). XX DR [1] (Consensus) XX CC The internal portion of this Solo-LTR is not identified. XX SQ Sequence 929 BP; 178 A; 186 C; 180 G; 385 T; 0 other; tgttacggtc cgtgggagtt tgtatttttg tgccttctcc ctctctgttt tgcttgtctc 60 ccttcgctcc cttcgcatct ccttgtctgt cattcgctct ttgtctcgcc ctatcacctt 120 ttgttccgct actttttctc tactaggtac ctgctgattg gatcccgagc ctgtcactcc 180 tcattagggc gtttcctcgt ggaccaattg gacttcgttc cgcggacttt aaattcgacc 240 ggcacgcctt gtttttcaga gtgctatgtg tgtgcgtgtt ggtttgtgtt cgctcgctgc 300 tgttcataat gtatgctttg ttctgtgttt ctgaataatt acatcgatgt aaattaagtc 360 atggttaaag agtttatcat tcctttttgg ggaaaaggga atagtttact aagtcgcgcg 420 acatacattt tgcccgtagg taacttcaat gttcagaaac actaggtaag tgggtgagcg 480 ccgcattttg tatttttgct tattatttca gggagtttag gtgtggcgtc gcaaacgctg 540 aagatttcgg tttctttttc acattttgat ttttattagt aagtttagag ggggatcttg 600 tgttctagtt taggtggctt ttggttttgt ttagttcttt gctttggcgc cactctagtt 660 cctttttccc ctaaaacttt tgattgtatt tcttcttttt ttcttcttct tcttcttctt 720 attattatta ttatatatac attgtacaaa gcgcacacat gtaagcctct cttttgttgg 780 cttcattttt cttctcctcg taagaggaat aaacgaattg aatgtaaatt ttggtgtcct 840 ggtgtatttt ccatccatct actcagtaat aatttaatat tataagtgtt acgtttattt 900 tccctagaca acttaattaa accgtaaca 929 // ID Helitron-N3_DR repbase; DNA; ZEB; 3016 BP. XX AC . XX DT 07-OCT-2008 (Rel. 13.1, Created) DT 07-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE non-autonomous Helitrons from Zebrafish - consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; TIR; Helitron-N2_DR; Helitron-N3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3016 RA Bao W. and Jurka J.; RT "Helitron transposons from zebrafish."; RL Repbase Reports 8(10), 1341-1341 (2008). XX DR [1] (Consensus) XX CC Helitron-N3_DR is basically the same with Helitron-N2, except for CC an ~600 bp additional fragment. Helitron-N2_DR or Helitron-N3_DR CC usually forms tandem repeat in the zebrafish genome. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3016 BP; 1028 A; 533 C; 438 G; 1016 T; 1 other; ttattattat tatgttggct tgagaaagcc aacatactgt aatgctactt aaacttattc 60 ttattattat tcttattatt attattccgg cttctgacca aaaaaaagca atcgcttctc 120 ctcccagggc ttagatgcta caagccccaa atttggtcaa aagctgtcta ttgcccatga 180 gatggttgct gtatgtcctc atgccgataa gatttgtagt ttttgaatta aaaatttttt 240 gaattaaaaa attgtaaata ttacaatcga gtatttgggg catataaaaa agtatacaat 300 cctcaaattt ggccaaaaga tgtgctttaa attgaagata attatcatat ttatttggtg 360 taattataaa taacagtttt cctataaata ttttttttat attaaatttg ttaatttcaa 420 acctaaagct tgtcagagtg tccacgagat ggcgccagaa gactattttt gagccctttg 480 taatttagta atggcccttt aagaggcact ttccctttaa gaggcggaga tatccgggct 540 gactgttttt gctctacaga cagaaattcg ggctcatttc agacgtgctt gttacaacat 600 ttgaagctgt ttacattaaa atttctcagc gaaaatacac aaatacaagt catacaacac 660 tatttagtgt tcctttaaga ggcagagaga tctgtggtca aaatcgaggt aagattgaac 720 ttaattctgg tcatcagtca ttcatattat gctcgcggtg aactccataa agcgtaataa 780 aactgatctg gatgcagaat catccgcgtc ggaatcgacc cgcatgcatt tctgtctgct 840 accctgtgag ttttaaaatg catgcctctg atgttggcag aaacataact ttacttcaga 900 tatttctttc aaaacagaat attctgtgcc gtgaaaaagc cttctcctct acttgtgata 960 atgaaaatga aagccatgcg aacattcacg tatattatac aataaacata tttaatataa 1020 aatcattcta actttatttt taatctgtat tttaaccctc taaaaacagc tatgtccttg 1080 ccacaaaact aataatccat cttcaaaacc tacaatttct ttccattcgg tccggtcgga 1140 cctgaacgtc tttataccgt tgtaatcacg tgtaaattct aacaaagaca atgtttgcac 1200 tttttaaatc acattttatt tattgtgcaa aaataaaaca tccaaacagc atcataacca 1260 ataaaaacac acagcaaaga aaatgtattt taaaatagtg aatatatata tatatacata 1320 caatgtatga agatttttta tatcccctta atatgctctg ctgttgctgc atcaatttct 1380 ctctgtaaca tctggattgc aatttacact gcaatgttca caatttatta ttatgggggg 1440 aaaatccaaa cagtattata atgttttaaa gcaaaaatgt taagaacaat tatttacaaa 1500 cagtgaaaat aaagtgtgtg tatatacata tatatacata cagttgaagt cagaattatt 1560 agctcccctg tttatttttt ccccaatttc tgtttaatgg agggaagatt gtttcagcac 1620 atttctaagc ataatagttt taataactca tttctaataa cttctttatt ttatctttgc 1680 catgatgaca gtaaataata tttgacttga tatttttcaa gacacttcta tacagcttaa 1740 agtgacattt aaaggcttaa ctaggttaat aaggtgaact aggcaggtta gggtaattag 1800 gcaagttatt gtataacgat ggtttgttct gtagactatc aaaaaaaaat tgcttaaagg 1860 ggcaaataat tgtgtcccaa aaatagtttt taaataatta ataactgctt ttattctagc 1920 cgaaataaaa caaataagac tttctcaaga agaaaaaata ttatcagaca tacagtgaaa 1980 atttccttgc tctgttaaac ataatttggg aaatatttaa aaaagggaaa aaaaaatcaa 2040 aagggggtta ataattctga cttcaactgt atatatgtat gtatatatat atatacatac 2100 atatatacat atataacaca tacacacatt tatggttagg gttataagga ttatttctta 2160 atcccccttt tatgttagca aaaacataag tttacttcag atatttcttt caaaacaata 2220 ttctgtgccg caaaagagcc ttctcctcac ttgtgataat gacaatgaaa gccaagcttt 2280 tggcctatgc tgtaccaaaa aaaaaaaaat aaaatcaccc aggtgatgac aactaaatat 2340 tacgattata acattgtgtt agtggccagt ggggtggaac aacgtcattg gtcagaatta 2400 accacgtgat gaagacgkcc tataatattg tatttaatgc gattttcttc ctgtaaactg 2460 tatatataac tatataacta taactatata tataatgctg caaattactg taaattgact 2520 gtactctata ttaactgcaa aaattataag actgattaaa taaaagacaa attaactaaa 2580 tgcatgtgct attgtttcat tagtatttta atttaatatt tttctttata tgatatttat 2640 tataatgtaa ttttgttcag attataatat atactataaa tactaactat actatagtga 2700 tctgcattaa acaacctgaa caaatcagtc ctctctctct ctctctctct ctctctctct 2760 ctctctctct ctcgatttga tcaaaaatgt aaatgaggcc ttccgattaa cagtccagtg 2820 gaccctagca acagcctaga aaccactcag aacaccctag caactaccta ggaatgcctt 2880 agctacacct tagcaaccgc ctagcaacca cttctcaacc gctgtgcttc ctgccaactg 2940 cctttgagtc cccggcgcag tcacgttggc tttctcaagc caacatcaaa gttcgtcaac 3000 gaactttagg ttctag 3016 // ID Gypsy-20-LTR_DR repbase; DNA; ZEB; 1446 BP. XX AC chr12; XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-20_DR LTR retrotransposon - a DE fossilized sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-20-I_DR; Gypsy-20-LTR_DR; Gypsy-20_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1446 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-20_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 14-14 (2005). XX DR Zebrafish.; chr12; Positions 34574763 34576208. XX CC Gypsy-20-LTR_DR is a long terminal repeat of the Gypsy-20_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-20-I_DR. XX SQ Sequence 1446 BP; 426 A; 222 C; 269 G; 529 T; 0 other; tgtaacccat tctttttttt ccgtgttgca gaacagcctg cattaatttg gcgcttcccc 60 tttaagaggg tgcggtaagc gcgtgcactg gaagagtgca cagatatagc tgattgcgtg 120 actgatgaga gattacgctc cacctgtgtg gatttttctt ttctttgata gagatttcgt 180 aattttgtgt gaaggatttt ctttttatat gtgcttaatt gtcatatgca taatctgagt 240 gcgagttgct gaagctgaag tgggtacttc acactgattt ctgtgttttg attcagagca 300 ttctggtgag tcgatcctgc ttgaatatga ttatgttatg tagtattttt ccattatctg 360 aaaaaaaata ccagtcagaa tgtgatttta tttaaggaat tgcagataaa ctactgagta 420 actgatgtgg gttagccaaa gagaagctaa gcacagctga tgatgctagc tttgctaacc 480 atggtgacca tgcggtgttc gttgctttaa gaattgtata taaatatata catatatatt 540 tttttcaaca tgtttagtag aaattgtagc aaatggtatg caagtgattt atgtttattt 600 gatcaatgta ttttatttaa cagttttttt cactgtgtca tatgtgtgag gaaaaaaaag 660 agtttaaagc atatttttat tccaaacttt ttgagatgta tataaaagca ccaaatgtgc 720 atacagagtg tacaaaattc attatatggt ttgttagaaa tgctttaaac gttctgatga 780 tgttgatgtc attaatttga agtgtacaat tgttgcagac tggtttattc cccccggcta 840 aatcgtggat gatgcaaatg ggactttaga tttcgagtga tcttgccatg gattggttct 900 ttcaggcaga catcttacta gctgcatcaa ctcttcaaat cttcatctca tcccgtcttt 960 caactcttca tgtcttctcc tcatccacct ttgctaactt gaatgccatt cctaagggag 1020 cggtaggact gagacattgc atttacacac atcacaagga acgttttttt attccaaaga 1080 attttgctaa attgtggact tttttttcat ctgttccaat ttccaagtga gaaaattatt 1140 gggttatgtt tatggtacat tgttgtttac attatttgtt tatgtactaa gaggaaaaaa 1200 aaaaattcaa taattgtgga aaaaaataca ttttccttgg ttatataata ctgtattgtt 1260 ttttttattc tacaaatttt catttcttaa tttgatataa aaaagggaag aaaattcagt 1320 tttttattta acgtctttgt agtcagcaca catagagaga aataaaaacc aagctcccct 1380 tcatgaacct agacaactag tactagaata tacagctctg tcataaacat taaaacagga 1440 gttaca 1446 // ID I-3_DR repbase; DNA; ZEB; 5426 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 11-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Non-LTR retrotransposon from zebrafish - consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW I-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5426 RA Jurka J.; RT "I-type retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1343-1343 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 404..1624 FT /product="I-3_DR_1p" FT /translation="MKNKYLNIMAKNQNTKKRILEQDLETELRTGFNFPSE FT NENWQRFIVLESLTDDMPLTKLSPFAIHKGITGIAGTVKDIKKLRSGQILV FT ECARKANSENLLQATTLAGITMKAFPHSTLNNSKGVIRTRELQDMDESEIA FT LELKPIGVIDVKRIIIKKDGQMIKTGTYILTFNRPQPPEKIKLGYLSVAVD FT IFIPNPLRCFKCQKFGHGSAGCKNKTTCQNCGEDDHGMNCTKPPKCKNCLG FT DHAASSKNCPIWHKEKEIQKIKHTRKMSYPEARKIVEALPAFCLKKPYSVA FT VTPKTQSISCQTDLTWLFKDKPQTISNTKNSTSQKTKNELIQTGPIQDSYL FT EQKESKVKKKTPSTTNQQPNQDKCRTHKDDNPKDIEMTETTTSRSRSLSPK FT CKNKGPVPIFPT*" FT CDS 1780..5307 FT /product="I-3_DR_2p" FT /translation="MISICGPNIQRPSGGSAILVRNDIIHSQVVLDTQLQA FT TAVQLTLHKTITICSIYIPPHEAVIPQELDKLISQLPTSFLLMGDFNSHNI FT LWGDDHCDAKGRRMETFIENNDLCLLNDGTKTYLHPGHGTYTAIDLTICSA FT DLLLDCSWRVWNDLCGSDHFPIILTFPQLDVTSRPSRWQLKKADWTTFQTX FT CCEKFMSATQDNQNSIEDFTQQLIAIADETIPKTSTKPQRKTNPWFNEQCK FT KAIETRKKAERAFNKHPTTTNFIHFKICRAQARRTLNEAKKQSWKQFVSSL FT TYYTPMKKIWNMIRKMKGKENGTRIQHIIKQGILQTKEMDIANTLAEAFEK FT NSSIENCLPAFQIIRAQQECKELNFYSDNKEVYNQPFKIKELINSLKCSHD FT TAIGPDKIHYQFLKHLPYSSLCLLLDEFNNIWSSGKIPASWKEATVIAIPK FT PGKDHTDPNNYRPIALTSCLCKTMERMINNRLVWTLEKNKQLTDYQCGFRR FT GKSTLDQLIRLETYIRDGFIRKEHVVAVFFDLEKAYDTTWRYGILKDLHQM FT GFRGKLPTFISNFLTRRNFRVQVGNSLSNTHQQELGVPQGSILSVTLFSIK FT INSIVEIIAPNMFCSIYVDDLCICYRGKNMNTIERQLQLNIHKINQWAVEN FT GFKFSKTKTVCTHFCIQRSLHHDPELFINGNMIKVVKEIKFLGLLFDNKMT FT FIPHLKFLKNKCLKAMNIIKVLAKTNWGADLTVLLRLYRVLIRSRLDYGSI FT IYDSARKSYKQILNTVHHQGLRLALGAFRTSPVQSLYIEANEPSLQNRRLK FT LSLQYAIKLKANKNNPAYTAVFEPQFVQLYESKPKTIRPFGLRINSHLQNL FT NLNLDILKQIHFNSLPPWNLKMPIINMELGHQKKGVTHPNMYKQQYLDIRT FT QYSQHIPIFTDGSKMESNVAAAMVTGEQSHGISLPKECSIFTAEARALLLA FT LEYIENAQQKKSIIFTDSKSCLQVMESSTNDHPLIDNILTKVHQLQNQFYH FT IIFCWVPGHVGLLGNERADAAAKDALKQVINKCQIPPSEMKPFINAYILNK FT WQKEWDALENNKLHEIQPEVSHRILRHFKNRFDQVVFTRCRIGHTRITHGF FT LLQGENPTQCLCCKTPITVKHILLDCPAFTDSRRTFYEMNSFKDIFDKVEP FT EKILEFLSCINTKNLI*" XX SQ Sequence 5426 BP; 1973 A; 1071 C; 927 G; 1453 T; 2 other; gcccccgtgg ggacccgggg tagaataggt ccctagcacc cccttgctgt tcgtaagagg 60 cgacaaatgg ggcaacttgt tggccgtgag ttgcgactcg tgttggtgga ggaagagatc 120 ctggtgattg aggattttga tttgctgcgg ctatcaatta tccaacacat cactttggct 180 catacttcaa ctagacgaga gattggaaag gcccattcca atcaatcggc ttgtcaggac 240 ttgcctttct cccggaaggc gctggcatag ggtcaattga gtattataga ccctgtgttt 300 atatgctcaa gtatatcatc actgtatcct tgatgtacat ccagtggatg tccgtggctc 360 tgtacatccc cttgtaggac tccgaggtgg gtgaggagca gagatgaaga acaaatatct 420 aaacattatg gcaaaaaatc aaaacactaa aaaacggata ttggaacaag acttggagac 480 tgaactaaga actggtttta atttcccatc tgaaaatgaa aactggcaaa gatttattgt 540 actggaatca ctgactgatg acatgcccct gaccaaactg tccccttttg cgatccacaa 600 aggaatcaca ggaattgcag gaacagtaaa agacattaaa aagctgcgat ctggacaaat 660 tttggtggaa tgtgctagga aagctaactc agaaaattta ctacaagcaa caactttggc 720 tggaattact atgaaagcat tccctcattc taccctgaac aacagcaaag gtgtaatccg 780 caccagagaa ttacaagata tggacgaatc tgaaatcgct ttagaattaa aaccaatagg 840 tgtaatcgac gtaaaaagaa taatcatcaa aaaggatggg cagatgatta aaacgggaac 900 atacatcctg accttcaaca gaccacaacc accagagaaa atcaaacttg gatatttaag 960 tgttgctgta gatatattca taccaaatcc actgagatgt tttaaatgcc agaaatttgg 1020 tcatggatct gcaggatgca aaaataagac aacatgtcaa aactgtggtg aggatgatca 1080 tggaatgaac tgcacaaaac ctcctaaatg taaaaactgc cttggagatc atgcagcctc 1140 atcaaagaat tgtcccattt ggcataaaga gaaagaaata caaaaaataa aacacacaag 1200 gaagatgagt tatcctgaag ctcgtaaaat agttgaagct cttcctgcct tctgtctgaa 1260 aaagccctat tctgttgcag tcacaccaaa aactcaatca ataagttgtc agacggatct 1320 aacttggcta ttcaaggaca aacctcaaac cataagcaac accaaaaact caacttccca 1380 aaaaacaaaa aatgaactca ttcaaactgg accaattcaa gactcatacc tagaacagaa 1440 agaatcaaaa gtgaaaaaga aaactccaag tacaacaaat caacaaccaa atcaagataa 1500 atgtcgaact cacaaagatg acaatcctaa agatattgaa atgacggaga caacgaccag 1560 tcgtagtcgt agtctctcac ctaaatgtaa aaataaagga cctgtaccta tatttcccac 1620 ttgagaataa acaatcacat aatacagtgg aattgtcggg gactaaaaac caactttgca 1680 gaattacagc atttaaattc tgtttttaac ccacttgcct attgtattca ggaaacccac 1740 ttatctccaa atgatgcctt aacttttaaa ggttttaata tgattagtat atgtggacca 1800 aacatacaac gaccctctgg aggatcagct atcctagtga gaaacgacat tatccacagt 1860 caagttgtat tggacactca attacaagca acagcagttc aacttactct acataaaaca 1920 attacaatat gctctatcta tatcccacct catgaagctg taatacctca agaactggac 1980 aaacttattt cacaattacc aacttcattt ttactgatgg gtgattttaa cagccacaac 2040 atcttatggg gtgatgatca ctgtgatgcc aaaggaagaa gaatggaaac ttttatagaa 2100 aataatgatt tatgtctttt aaatgatgga accaaaacat atcttcatcc aggacatggt 2160 acttatacag caatcgatct gacaatatgt agtgctgatc ttttacttga ctgttcatgg 2220 agagtatgga atgatctatg cgggagtgac cacttcccaa ttatcctcac atttccacaa 2280 ctggatgtaa caagtagacc ttcaagatgg caactcaaga aagcagactg gaccaccttt 2340 caaaccwtgt gttgtgaaaa atttatgtca gctacccaag ataatcagaa ctcaatagaa 2400 gacttcactc agcagttaat cgctatagca gatgaaacca tcccaaagac atcaacaaaa 2460 ccacaacgaa agacaaatcc ctggtttaat gaacaatgta aaaaagctat tgaaactcgg 2520 aaaaaagcag aacgtgcttt taataaacat ccaacaacta caaattttat ccattttaaa 2580 atttgtagag cccaagctcg cagaacccta aatgaagcaa agaaacaaag ctggaaacaa 2640 tttgtatcca gcttaactta ctatactccc atgaagaaga tttggaacat gatccgtaaa 2700 atgaagggaa aggaaaatgg aactcgcata caacatatta taaaacaagg aatacttcaa 2760 acaaaggaaa tggacatagc aaatacatta gctgaagcct tcgaaaagaa ttcatcaata 2820 gagaactgcc tcccagcatt tcaaatcata cgagcacagc aggaatgcaa agaactaaat 2880 ttttattctg acaacaaaga agtatataat caacctttta aaattaaaga attaataaat 2940 tctttaaaat gctcacatga tacagccatt ggaccagata aaatccatta ccagttttta 3000 aaacacttac catatagttc actatgcctc ctactggacg agttcaacaa catctggtct 3060 tcgggaaaaa ttccagcttc gtggaaagaa gcaacagtaa tagctatacc taaaccagga 3120 aaagaccaca ctgatccaaa taactacaga ccaattgctc tgaccagttg tttatgtaaa 3180 acgatggaaa gaatgattaa caaccgtctt gtctggacac tagaaaaaaa caaacaacta 3240 actgactacc agtgtggatt tagacgtgga aagagtacat tggatcagtt aatccgacta 3300 gaaacttata tcagagatgg attcatcaga aaagaacatg ttgttgctgt tttctttgat 3360 ttagaaaaag cttatgacac aacatggaga tatggaattt taaaagatct tcaccaaatg 3420 ggatttagag gaaaacttcc taccttcatt tcaaactttc tcactagaag aaacttcaga 3480 gtccaagtgg gaaattccct ttcaaatacc caccaacaag aactcggagt tcctcaagga 3540 agtatactct cggtcacact cttcagtatt aaaataaata gcattgtgga aatcattgct 3600 ccaaatatgt tctgcagcat ctacgtggat gatttatgta tctgttacag agggaagaac 3660 atgaatacaa ttgaaaggca attgcagtta aatattcaca aaattaacca gtgggctgtc 3720 gaaaatggtt ttaagttctc taaaacaaaa actgtttgta cacatttttg tatccagcga 3780 tcacttcatc atgatccaga actttttatc aatggaaata tgatcaaagt tgttaaagaa 3840 atcaagtttt taggtctcct ttttgataac aaaatgactt ttatccctca tttaaaattt 3900 ttgaaaaaca agtgcctcaa agctatgaac ataatcaaag tgttggcaaa aacaaattgg 3960 ggtgctgact taactgtact tttacgattg tacagagtac tcattcgatc tcgtttggat 4020 tatggaagca taatttatga ctctgccagg aaatcttata aacaaatcct caatacagtc 4080 caccatcaag gattacggct tgctttagga gcattcagaa catcacccgt tcagagtctg 4140 tacattgaag ctaacgaacc ctcacttcaa aatagacggc tgaaactttc attacaatat 4200 gctatcaaac taaaggcaaa taagaataac ccagcttata ctgcagtttt cgaacctcag 4260 tttgtacaat tatacgaaag taagccaaaa accatcagac catttggttt acgtatcaat 4320 tcacacttac aaaatctaaa tttaaacctg gacattttaa aacaaataca cttcaactca 4380 ttgcctccct ggaatttaaa gatgcctata atcaatatgg agctaggaca ccagaagaaa 4440 ggagttactc atcccaatat gtacaaacaa caatacctgg atatcagaac tcaatactct 4500 cagcacatac caatcttcac agatggttca aagatggaaa gtaatgtagc agcagcaatg 4560 gtgacagggg aacaaagtca tggaatcagt ctacctaaag aatgctccat attcacagct 4620 gaagcccgcg ctttactttt agcattggag tacatagaaa atgcacaaca aaagaaatcc 4680 atcattttca cagactctaa atcatgcctc caagttatgg aatcttcaac aaatgatcat 4740 cctttgattg acaatatttt aactaaagtt caccaactac aaaatcagtt ttatcacatc 4800 attttttgct gggttcctgg acatgtcggg cttttaggca atgagcgagc tgatgcagct 4860 gctaaagatg ctttgaagca agtaataaac aaatgccaaa ttcctccgtc agagatgaaa 4920 ccttttatca acgcttacat tttaaacaag tggcaaaagg aatgggatgc attagaaaac 4980 aataaactac atgaaatcca acctgaagtt tcacacagaa ttttaagaca ttttaagaat 5040 cgatttgatc aagtagtttt taccagatgc cgtattggac atacgagaat tacacatggw 5100 tttttgctcc aaggtgaaaa ccccactcag tgtttgtgct gcaaaactcc tatcactgta 5160 aaacatattt tactggactg tcctgctttt actgattcca gaaggacttt ttatgaaatg 5220 aactctttta aggacatttt tgacaaagtg gaaccagaaa agattttaga atttttatca 5280 tgtattaaca caaaaaatct tatttaattt atgttttatt ttgtttgttg atttttaaat 5340 tgtatattta ttgttgaaag attcctgcca tgaagatagc cttggttgct gacatggcac 5400 taaataaaaa aatacattac attaca 5426 // ID TDR7 repbase; DNA; ZEB; 541 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR7. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-541 RA Jurka J. and Drazkiewicz A.; RT "TDR7: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 35-35 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. Several hundred copies in the genome. XX SQ Sequence 541 BP; 187 A; 62 C; 62 G; 227 T; 3 other; tacagtgctg cagcataatt gagtacaccc cattttgaaa atgaatattt ttatccattt 60 ctcagtgaat ataggcaatg tattttggtg catttaaaca aaacagattt attaaacaga 120 tatatttatt aaaataatat atattttagt caccaaacat atttagaaat tgaaagataa 180 tacaattaaa ttcaagcaaa atattgcaaa aaaaaattac aacctacaaa atttcaacaa 240 aatttttctt ttttttttay ttatttattt tattdatttt tttttttttt tttgcttctc 300 ttgatttttc ctctttttta aaaatttgta tttaatattt ttctataaca tataaatttg 360 ggtgtactag tttttggacc gttatcgtaa gttattttgt tagataagct ccagatttgg 420 cttcagtact gactaatcta atgtatatgc acaaatataa tattgtatag cttcctatta 480 aaaatattaa tttaaaagat angatttgtg aggggtgtac ttatatatgc tgagcactgt 540 a 541 // ID Gypsy152-LTR_DR repbase; DNA; ZEB; 772 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy152-I_DR; KW Gypsy152-LTR_DR; LTR63_DR; LTR_Retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-772 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1566-1566 (2008). XX DR Genome; chr4; Positions 37018258 37017487. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 772 BP; 181 A; 128 C; 164 G; 299 T; 0 other; tgtcacgaga cacaatccta ttcaaccgtg tgatgatgac ttctctctct gccaatcgcg 60 ttcgttgcag agccctttaa aaacggaagc ggcaacgggg agagatcagc tcttgttgtt 120 ttggcgtgtt gtctttttgg tggagtccgt gtgcgttgtc gctgttactg atgtgtggtt 180 tgattgagat ctattgatgt tgactttgag tgagtaactg atgactttga gtaagatttg 240 tttaagagtt ctgacgtgtg tttctacatc tatagcccgc taatgctcca ttcaaaacct 300 tctgtgactt gacaagggag atctggaaca acatagtaag tcacgtgaca tacatatttg 360 cacgcaggat tattgatgta tagacacact agttagcgaa gtgcgctaat tttgtatttt 420 tatttattag ttagagtaag ttcagggcgc tttgcgctga agacttttct tttcatattg 480 ttattttctt atttagtgga ttggtaagga gattaaggga atgtttttgt tatttatttc 540 tttgatttcc atgcgcaaac ttcagttccc tctcctattt gtcttggttc ttttcaaatt 600 tatcttatgg tttgtaaata ttgtaaataa tgtgggagca ataaatcttt tgcggtgcta 660 atttgttgtt tggtgttttc ttttgccatt cttcaacatc agtttatact ttggtcatct 720 ttatgttact ccttttaata ccctagacaa attgccatca aggatcgtaa ca 772 // ID hAT-N65_DR repbase; DNA; ZEB; 820 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N65_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-820 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1334-1334 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 820 BP; 254 A; 155 C; 170 G; 240 T; 1 other; tagtgatggt gaaatgaagc tttctgaacc agtgaagcgc tcgactcaat tgtatcggaa 60 aaaggttcgt tactcgaagc tttctaaatc agagctcgat gaggacctct tctggtgaaa 120 gtgtgggaaa gcactgtgtt gccataatgt caaatgttca caactgacca actcattcat 180 gtagtaatac aacaacagca atataaacaa attactcaat tactgtagtt tttacacaag 240 atatattaca ttacacaact gatgattgta gtaaaatcca aggtattcaa aagtttatca 300 taatacaatt agtcctgctc caagcgggga tcgaaccagc gattcctgwt tgggaggcga 360 gtgctctagg aggacgctat gtgaatgatg cttcagggtt gcactcacca gatggtctct 420 gttaaacaca atactacgat atgcggtggt tttctttaaa gatagtcgta aaaaaaatac 480 gctaatacac tttagtatgg ttcaaaacct ttttactaaa aattactatt gtattttagt 540 ttaattactg gtgtgtggga ctggtgcagt gctttgaaac agtacaaatg tatgtaatcg 600 acgccaattc tctcgctata cactttacta tgagggtgtt taaacctttg aatcaaaatt 660 cgaagcagtc acgtgggtat aggcgaaaat gaagcttcgg acgtcactga tcacgtgatg 720 atggcaaaac gaatcaagct ccggtacact gcttcactgc gagatgtatc gtttttttga 780 tacatgcttc gaagcctcgg cgcacaacgt cacatcacta 820 // ID BEL13-LTR_DR repbase; DNA; ZEB; 315 BP. XX AC chr11; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL13-I_DR; KW BEL13-LTR_DR; BEL13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-315 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1376-1376 (2008). XX DR Genome; chr11; Positions 10713286 10712972. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 315 BP; 88 A; 52 C; 61 G; 114 T; 0 other; tgttatgact taaattgata tgtagctgca agagtgattt attaatgctc catttatcgt 60 ttaattgtca taaagtgaat ttatttacct tggcatttat atatgctgtt ttgctttaag 120 aacagttttc ctatgatgct tagcgctact gaaagcgcgc ctgtaaccat tggcaactga 180 ctcagtgtga gagagtgtgt gtgttcggag ttttgctcct gcttcagtat acaaaaactt 240 cagtaaacag ttcattttcg ttctaaacaa agttttccag tcgtcattat tgctgctaga 300 tgaaaagaca gaaca 315 // ID BHIKHARI-2-LTR_DR repbase; DNA; ZEB; 1718 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR of retrotransposon bhikhari from Danio rerio - a consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; LTR; KW BHIKHARI-2-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1718 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1774-1774 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1718 BP; 478 A; 366 C; 345 G; 522 T; 7 other; tgtaggaact catcggcatt tttcagtggt caacacaagt tgaatctgaa atcttgggca 60 tctccaatgg tctcattcat tgaactagct tgagggctta taccatcagg ataacagagc 120 gccaattgat ttcgggctcc tctgaccagt acttctccgc agatgttctg gtacccttcg 180 gctaactagc caatgattaa ttaaatcagt taaattaaaa acaatggccc ttacaggatc 240 ttcagcggaa accatttcac gagactttta attgactcct ggtcatgtgc tcgccttggg 300 cacatcctgc tagaagcaca atttcgtaga gacaaagaga tctctagaaa cattaatgca 360 tatgaataaa agacaatctc ttaaaatgtt agaactaagg cagatgtatc tttgaataat 420 atgccataat gttcctcata tgctgtttat tccaatcaac taggttaatc aatgtgttgt 480 tttaaccctt ataacagagt aaaataatct gtatgtgtct gaaatgtatg atgtctggta 540 ttgaacgaaa gctagtgaca tttgcaatac gttggtgtaa atgaaaaact agtagtacaa 600 acagtattca tcttgtaacg tttgatgtaa gatagtttca atcttgcaag agttctacca 660 cgtgaaatct acatgcagtc tgttaattat tgacctttcc tacgtcagta cggggcgtgg 720 ccccgccctt catggcaact gctcattggc agacagtgcc atggggacga accagaccag 780 gtcacttaaa aacaccctgc aacaaagaaa ctttgctttt tgcatcagct agcttgcgca 840 tgcttttgct ctctgccccc tttcgcctgc ccggacaccg cgccgccacg cgattgggtc 900 actacgaact tccacgcaaa cctcttaagt ttcgggactc ccgaagctgc atgaacttcc 960 gcaaaccgac tctcagcagt ccgtcacctc ggtaacccag gacaacagcc attcacgagc 1020 gcgtgaatcc caaagacgtc atcgaagcga gaaaccaccc aatcaggagc accgtgtttc 1080 cccaaggcca gccggcgagg gagattcgac ttcagcaaaa agggcgcaag taacacaaac 1140 taatgttgtc tcttgctgat cttgtgcaga ttgatcttaa gcagttaaag ataagccata 1200 tctctgcttt tgtggtttct caggtgttat tctaagatct tgcaagaagt attagaatta 1260 atttgggacg ttgtcaactc attttgcatc ctctgaactg cctytgtgtg tatgtgtgaa 1320 tgtgtgtgtg ygtttgtttg ttgtttgctt attacgtagt tagtttcatk tatcgtagtg 1380 tagctcaata aatctttgta ctcattttgt aagaactgtg ttcttgttta tgtgcttcta 1440 agtcattgcc tcgagctgta gatcttgtta ccttgctaaa agwtmaycca atactgtgtt 1500 atgttattat cgttggccac gtaaataata twaacaagat tggtaagata cgatgtcttc 1560 tatttgcggg acaaatggta gataaagtgt gaatcttttg atctgattca atctgactca 1620 tttgagtcaa atcaatgatt cgaatcttca ggatccgatt cattgaaatg agctaattaa 1680 tccctgatcg atattaaaac gtaggttgat cccctaca 1718 // ID Copia-7-LTR_DR repbase; DNA; ZEB; 483 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.11, Created) DT 17-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Copia-7-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-483 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1779-1779 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 483 BP; 106 A; 137 C; 78 G; 162 T; 0 other; tgttacgata accagcgatc attcactgga gatcgctggt aattcctcta tcatcattca 60 cttcacttcc gcattcccca ggactacatt ttcatacatg cacttctccc aatcacgcac 120 gcctggtcac tcatctatcc tgattacatc accagctgaa acctgttaca gagactgatt 180 acacacagta tttaagcagc acacacactc attcacattg ccgagtcttg tttactgttt 240 ggtgacatta caacgcgtta tcctggtctt gttttcccgt gttttggttc ttagccttgt 300 ttatcctagt tatcctgttt agccgcctgc ctttcgacct attgcctgtt tattgactac 360 gactctggat ttgcccacac atacctgttc attcctgtat tgaccactgc ttgcctgact 420 aaagaataaa acctgcatat tggatcctac cttctgttgt tggtgtcact ccccggcgtt 480 aca 483 // ID Gypsy158-LTR_DR repbase; DNA; ZEB; 168 BP. XX AC chr5; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy158-I_DR; KW Gypsy158-LTR_DR; Gypsy158_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-168 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1578-1578 (2008). XX DR Genome; chr5; Positions 28397116 28397283. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 168 BP; 50 A; 37 C; 38 G; 43 T; 0 other; tgaactagca ataacatgtc atgaggtagt gtgaccgcta gagggggtac ttagctaata 60 agaagccttc cactgagagt tcgacagagc agtgtgttgt cctagccaga tccacctcat 120 gtatttgtgt gcagcatctg tcaaataaac agcctcaaat tacatcca 168 // ID BHIKHARI-4-I_DR repbase; DNA; ZEB; 4940 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Internal portion of retrotransposon bhikhari from Danio rerio - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW internal portion; LTR; TSD 4-bp; BHIKHARI-4-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4940 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1793-1793 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 2443..4044 FT /product="BHIKHARI-4-I_DR_1p" FT /translation="MSPTANPTARGEDVETWLQNITDGLITKAIELLLLLV FT IFMILKRLLTQDSSQNHNHPELTKFTCLLSDAFTAQLKLSDQHLTHLQKEL FT TCAQSRIDKLEAKGQDDFKRQTTEFLKEQLEQLQTALAVAERDQHELKSSQ FT QDLIQRLQSAEQQVEEAKTSIQHKDAKINALXDQLSTYSSANDDLTQQLDD FT ISDELYRLRELIQVYQQKPEPRWAEPSSASPPLSKADSLISINSGRSERLK FT TKSSPALTTNNFPSDASRKSLHTAVKPAQEETLRDLDKLAKRITQFNPNST FT ESRNIQAYLQDIDFYLEARPHVTDRDRFYLLRSTSSPEVQSFLDRQRAHIK FT SNYQLLREALIEEFKSSESEHGVFTALETKQGRHKTPQAFYHRFRQAYFAA FT HDNPDLEEDVNFKTLFLRNLHPGLSRHLGIIACPRTMSIRQLRDLTHRAYL FT KQKMSPRRDVKSSVAPITQGPTRAQKDTLCNDDTTDYRAHSDRDSHDDDSY FT YTTWLEKSRDQPRSTRHSRHPKLFPRGNRNNSTPTVC*" XX SQ Sequence 4940 BP; 1521 A; 1193 C; 876 G; 1336 T; 14 other; tattggtgga gaacttagca gttcaatcta gataattaat ttatacaggc cactatttct 60 ttattacaaa agaaatctaa cagagatgag cataagcata ttgtgggatg aatggatgga 120 tgaatgaatg agctgcgatc tgcgagccca ccttcctgct gtaaacatct gtgaactgct 180 ttgtgaggat tttataaaag ttagaataga gtgtgttctt tttattaaat aactttttgc 240 tacgctgaac cactgtgtcc cagtgctttt cagatagcat aacaagctac gagccgagtg 300 agatctcata tcaagctaga actgagcgtg ttcctcagct tctaattacc ttaagtttta 360 aaactttcac tgcgtgtccc agtgttatgt tgatcgactt tacagttatt aatagctaga 420 gattggttgt cgctgcgtgt tccatcgtca atccaataag ctatccccct ccctgtttaa 480 gctgccacta aaaacataac tgtgtgttcg ctgaacagag ttcaggtaac awygcgtatg 540 ttttagcatg atttaaacag catctgtaaa gttgaagtct tgtgcccaac agtttaagct 600 gctgcgttca ygtgtcagaa agacacacac acttacatac acgcacgcac acacacacgg 660 aggatccaca gagagcactc gtgtgtttgc ctgcattact aaactgcttc tcaaagttta 720 agcccgctct traaggacag agtgtgaaac cccacatcat aaaaccattt atgattgctg 780 gataatgyat gattgcatgg aaatgtgccg acatttcgcg tttaatgttt gaatctgggc 840 tacgtgcaaa gattcacttt tgctaacatt ccctttgttc tggttgcata gcgacccgca 900 cagaacagga agttaccatc gcttccgagg gaagtctgaa ctgcgcatgc gcatcaaatg 960 taaacaaact tcgcgttacg tcaacagttg cattgtttac atttatacga acctagaaac 1020 attaattaat gaattagcct gcaaggctag cgattagcaa tttacaacgg tggttgaatg 1080 cagtgtctgt ttgagaacct gaaagtgatt ttaaccattt aacatctttg cattttgttg 1140 gtccctatat tctytacycc catttgtact atcttttatg cattttactt tactaacagt 1200 ttgctttaaa gcaaaacata acccactgtt cttttatgta ataactaatt acatttttac 1260 caaaaaggaa atactggttc actattatta ttattaatac aaattacmat ttgtttaaca 1320 ataataaagt gaatgttcta tttgtgtttt gtgcacaaaa tcacaatatt acattgaatt 1380 gtaattctat ttgctcacaa gcaactttga attaacattt caccagaatt ccctttttag 1440 tctaattcct ttagacatta ttaagtcttg agcttcgata gtttaagcaa gcataccact 1500 ttttacaact tcttattcac cccctttaca gaaaatctgt ttcatcaaat tcagattttc 1560 tcaatatcta attacattgt agtggttatg cactgatagc aattgcatgt ttttgtttcc 1620 ttgtattttc ccttcctcct tcaccttttc taggaaacct ctcattacga gtcttraata 1680 aattagagac aacccactct cttataattt attttgttta gttgtacgtc aactatcaat 1740 tttgtactcg tagcgagacg ttaaggcttc agagctagtt gtggaccagg tccactcagt 1800 tctccttttt cttcctgctt ctttcttgat tagttttcat ttcctttccc ttgtttcctc 1860 ctgtcttcya tctaattcat ctcatttgta ggaagcaaat ctaagcaaac caacttatca 1920 agtctatgga aagaaacaaa tttgttttgt aatcaaacca tcctttgttt tgatttactt 1980 catgcatttg tgaatgcaat taaaacttct gatttttcct tcaagatcaa ttgacagaaa 2040 cgagtagtga acagccgttc cctctaaaac atcgctggta agcggtccaa taaggtgtgt 2100 agtgtcaatc ctgtcatatt gagacaagaa aaaattcaga attataattg tagtatttcc 2160 ctcatagata taagcagtaa tattattctt ttcattttca gtgatacaaa atattataaa 2220 tattttcata actgaaagaa aaataatatt tttctgttta aatttgtttt aaccttgttc 2280 tctctctctt ttcttttcgc acacggagac caacacatag tcaccacacc acttcatttt 2340 ggttcataat cactgaaaca aacaacactt agctgctcca ctacatatag actgataaca 2400 ccacaaagat ctggatcagt ctccacctcc cagcaagcca gcatgtcacc gactgcaaac 2460 cccactgctc gaggagaaga tgtggaaacc tggctacaga acatcacgga tggcctaatc 2520 accaaggcaa ttgaactttt attactttta gtaatattca tgattctgaa gagacttttg 2580 acacaggact cgagccagaa tcacaaccat ccagagctaa ccaaattcac ctgtctcctg 2640 agtgacgcct tcacagccca gctcaagctc agcgaccagc acctcaccca tcttcagaag 2700 gagctgacrt gtgcacaaag ccgcatcgac aagctggaag cgaaaggcca agatgacttc 2760 aagcgtcaaa ccactgagtt cctgaaagaa cagcttgaac agctccaaac tgcccttgct 2820 gttgctgaac gagaccagca tgaactgaag tcatcccagc aggacctgat acagcgtctc 2880 cagtctgctg aacagcaagt agaggaagct aaaaccagca tccaacacaa agaygctaaa 2940 attaatgctc tgraggacca actgagcaca tacagcagtg caaatgatga cctgacccaa 3000 cagctagatg acatcagcga tgaactctac aggctcagag aactcataca ggtctatcaa 3060 cagaaaccag agccaagatg ggcggagccc tcatcagcat cacctccgct gagcaaagct 3120 gactccctca tctccataaa cagtggaaga agtgaacggc tgaagaccaa gtcatcacca 3180 gccctcacca ccaacaactt cccttctgat gcaagcagaa agtcactaca cacggctgtc 3240 aaacctgcac aggaagagac tcttagagac cttgacaaac tggcgaaacg catcacccag 3300 tttaacccaa actccactga gagcagaaac atccaggctt acctccagga catagacttt 3360 tatcttgaag caagacctca tgtgactgac agagacaggt tctacctcct cagatccaca 3420 tccagtcccg aggtgcaaag tttcctagac agacaacgtg ctcacataaa gtcaaactac 3480 cagctactgc gtgaagcact gattgaagag tttaaaagct ctgagtctga acatggagtg 3540 tttactgccc tggaaaccaa acaaggtcgc cacaagacac ctcaagcctt ctaccacagg 3600 ttcagacaag catactttgc tgcacatgac aacccggatc ttgaagagga tgtaaacttc 3660 aaaaccctct tcctacgaaa ccttcatcca ggactaagtc gtcatctagg catcatagcc 3720 tgtccacgca caatgtccat ccgacagttg cgggacctca cacaccgggc ctacctcaag 3780 cagaagatgt ctccaagaag agatgtgaaa tcttcagtgg cccccatcac ccaaggtcca 3840 acacgtgcac agaaagacac cctgtgtaac gacgatacca cagactacag agcacacagt 3900 gatcgtgact cccatgacga cgacagctac tacacaacct ggttggaaaa gtcacgggat 3960 cagccacgct ccacaaggca ctcaagacac ccaaaactgt tccccagagg aaacagaaat 4020 aacagcaccc ccacagtctg ttagccaaca ccgtcaatgg ctatcaccat ggataagaga 4080 tgacagccac ctgttctgca acaacacaga ttcaaaaggg ggaggtctct caccaacaat 4140 agaactcaga ggtatcagaa aaacaccact ctgtttttac ctcacactca ccaagaaaac 4200 attgaactct aaactcattt cacctgacag caaccacggt caagcacaga gaccacagtc 4260 tgtcacagcc agaaagcccc atcagaggaa tgtgacgtgg aagacaagtt tgcatatcag 4320 ctccacactg ccatgacaaa caactaccca caktctgcca aagaaacact tcctaatgga 4380 cccccatcag gtcaccaacg acccgtgacc caaggaaaag atgcatagag ccaaacttca 4440 tgtgtaccta gagacaatga atgagaagca agcgcatcaa accagcaagg aagggggagg 4500 gcccaaaggc tgagacagat ctgaccatac agtacacgca ttacatcagt aactgcacac 4560 agcaaacaca tctgacatta aaacatcatt gtttggttta actctgccta tcgtttccta 4620 tcctatcttt ctttaattag aatatcagga ttggcacacc atacctaatt ttgataccat 4680 ttaaatatgg ttgtgaacca gagagcgcta cgccagttgg tcatattatc atagactact 4740 gagaactgaa ctaacttaga aatgttgagt gtcagtttca acgatcagta cccaaacaat 4800 ttgatactct acaccagtac agtatgttga gatggaggtg ttgtcttgtt gtttgccgtg 4860 tatgtgtctg tctctccatc ttgtttctcc agcgttcgcc gatgtcttca tcccaacaca 4920 tcgcctaacg aagggggatc 4940 // ID DNA-5-7_DR repbase; DNA; ZEB; 674 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 5-bp; KW DNA-5-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-674 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1239-1239 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 674 BP; 201 A; 118 C; 147 G; 208 T; 0 other; cagggtgtcc gcggggtctt aaaaagtatt aaaagttgat aaatcaatta tgagaaaatt 60 aaggccctta aaaggtatta aaaagtctta atcgcgtttt tacgaggtct taaattttgt 120 tcaagcgttg tccaaagtgt ttgactccaa aaaagcatat tatatatatt aaatatattt 180 attttcctcc taatattaac aacggcgtgc tcgatccacg cgattggttc gggccggggc 240 gcgtccggcc aagctcctcc gtccgggtga tgtcacgtaa tttgcgacaa acgcgggaag 300 gtgatgtgac gctctccaca tgtagcgaaa ccatagagcg aaacagacgg caaaaatggg 360 aaaatgtaag tttgcgtact cctggttgga gaaagacgag tttaaacagt ggctgaagcc 420 tgtcgctgaa aacaaccgcg taatttattc tttcaagctt tgtttcttcc gagcttatct 480 gagttctgtt gcatggttgt gctccattga tttatttaac tacaactgtt tattaacaac 540 tgtttattaa gttaaaggtt tgatactgat tagaacttga agtggcgatg aggtcttaaa 600 atattctgag aaggtcttta aaaaagtctt aaaaaggtat tgaaattacc tttaggattc 660 ctgcatatac cctg 674 // ID DNA-2-32_DR repbase; DNA; ZEB; 8782 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 03-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-32_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-8782 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 528-528 (2009). XX DR [1] (Consensus) XX SQ Sequence 8782 BP; 2746 A; 1479 C; 1677 G; 2870 T; 10 other; cccttacaga aatttcacat gagtttcaca tgtgaaattc acatgtgatc acatgtgcat 60 gtgaaattgc acatgtgaaa ttttacatgt gaaacacaca aaccacatgt gaaatgcaca 120 tgggatcaca tgtgatcaca tgggaaaatg tgtgtatttg gaacgctttt gtgtgaattc 180 acatgtgatc acatgggaaa acgtgtgtat ttggaacgac tttgtgtgaa twttcatgtg 240 aaccacatgt gatcacatgg gaaaacgtgt gtatttggaa cgmtttygtg tgaattywca 300 tgtgatcaca tgggaaaacg tgtgtatttg gaacgatttc gtgtgaattt tcatgtgaac 360 cacatgtgat cacatgggaa aaggtgtgta tttggaacgc tttcgtgtga attcacatgt 420 gatcacatgg gaaaacgtgt gtatttggaa cgctttcgtg tgaattcaca tgtgatcaca 480 tgggaaaacg tgtgtatttg gaacgctttc gtgtgaattt acatgtgaat cacatgggaa 540 aacgtgtgta tttggaacga tttcgtgtga attttcatgt gaaccacatg tgatcacatg 600 ggaaaacgtg tgtatttgga acgctttcgt gtgaattcac atgtgatcac atgggaaaac 660 gtgtgtattt ggaacgcttt tgtgtgaatt cacatgtgat cacatgggaa aacgtgtgta 720 tttggaacga tttcgtgtga attttcatgt gaaccacatg tgatcacatg ggaaaacgtg 780 tgtatttgga acgctttcgt gtgaattcac atgtgatcac atgggaaaac gtgtgtattt 840 ggaacgcttt cgtgtgaatt cacatgtgat cacatgggaa aacgtgtgta tttggaacgc 900 tttcgtgtga atttcacatg tgatcacatg ggaaaacgtg tgtatttgga acgatttcgt 960 gtgaattttc atgtgaacca catgtgatca catgggaaaa cgtgtgtatt tggaacgctt 1020 tcgtgtgaat tcacatgtga tcacatggga aaacgtgtgt atttggaacg ctttcatgtg 1080 aaytcacatg tgatcacatg ggaaaacgtg tgtatttgga acgatttcgt gtgaattttc 1140 atgtgaacca catgtgatca catgggaaaa cgtgtgtatt tggaacgatt tcgtgtgaat 1200 tttcatgtga accacatgtg atcacatggg aaaacgtgtg tatttggaac gctttcgtgt 1260 gaattcacat gtgatcacat gggaaaacgt gtgtatttgg aacgcttttg tgtgaattca 1320 catgtgatca catgggaaaa cgtgtgtatt tggaacgatt tcgtgtgaat tttcatgtga 1380 accacatgtg atcacatggg aaaacgtgtg tatttggaac gctttcgtgt gaattttcat 1440 gtgaaccaca tgtgatcaca tgggaaaacg tgtgtatttg gaacgctttc gtgtgaattc 1500 acatgtgatc acatgggaaa acgtgtgtat ttggaacgat ttcgtgtgaa tttacatgtg 1560 atcacatggg aaaacgtgtg tatttggaac gctttcgtgt gaattcacat gtgatcacat 1620 gggaaaacgt gtgtatttgg aacgatttcg tgtgaatttt catgtgaacc acatgtgatc 1680 acatgggaaa acgtgtgtat ttggaacgct tttgtgtgaa ttcacatgtg atcacatggg 1740 aaaacgtgtg tatttggaac gctttcgtgt gaattttcat gtgaaccaca tgtgatcaca 1800 tgggaaaacg tgtgtatttg gaacgctttt gtgtgaattc acatgtgatc acatgggaaa 1860 acgtgtgtat ttggaacgct ttygtgtgaa ttyacatgtg atcacatggg aaaacgtgtg 1920 tatttggaac gcttttgtgt gaattcacat gtgatcacat gggaaaacgt gtgtatttgg 1980 aacgatttcg tgtgaatttt catgtgaacc acatgtgatc acatgggaaa acgtgtgtat 2040 ttggaacgct tttgtgtgaa ttttcatgtg aaccacatgt gatcacatgg gaaaacgtgt 2100 gtatttggaa cgatttcgtg tgaattttca tgtgaaccac atgtgatcac atgggaaaac 2160 ctgtgaattt taacgttttt gtgtgaattc acatgtgaac tacatgtgca ttcacaagtg 2220 atcacatggg aaaatgtttg tatttagaat tttgatggtg agcattgaaa catgcgttcc 2280 catgtaaaaa tctcttgtgg cttttctctt gtgtacttca cacmataaaa tttgtgcatt 2340 agtgtgaaca aaaattattc ttaaacttaa atataaatac aaacaattta ttgtcaatta 2400 tataaaatca caacaattat ttactgaaat accttgaggg gaaaactgtc actaatataa 2460 tgtgaataca aaaggggcca catggtagct ccgtggtcat cactgttgcc tcaaagcaag 2520 aaggtcactg gttcgagtct caggtgtgcc aggtgtggag tttgcatgtt caccccatgt 2580 ttgcgtgggt ttccttcggg tgctctggtc aattctgcat atgagagcat ctggtgaaaa 2640 aagcatgtac cacagtaatt gctagttcat tctgctgtga tgacccgtga tgaaaaggga 2700 aagaaaagcc aagggaaagt gagttttgct tttgacgagt ttgtcctttt ggttcatgtt 2760 aatgacttga tcttgtaatc tgtaagaaaa aaatacaaaa cttattttta ataaaagaaa 2820 ctaccaaatg ctcaaatgca tacacgtgaa tgcaaaatga attcatagta tttagatcac 2880 attttttcac attagcagaa ataggtagta atgaagtaca aactttgtta ctttacttaa 2940 gtagatatgt ctagtgtcag tattatactc cactgttaac gttttctgaa atatttttct 3000 tttacttctt acatttgaat acaaatctac tccttacatt tttaaatcag gctcgttact 3060 ttcgctttaa tgtgctttgt ggagcagacg tagctttagt ataatgcaaa aaatatgtag 3120 tcaaatacat ttagtttttt tattttatca aaagaaaaaa actgaaatat gttttgaaaa 3180 aaaattaggt tttaaattta agacatttat tagttataaa tatataggcc tacagaatta 3240 tattcaagag tatacaaaca agtatgacag ctgctgttct taacagctat agtgttttgt 3300 gttcagctga tgatacacag ggctcgaaat taaccttttt ttcttggtag caccggtgct 3360 cctaacttca aaaatttgga tgcaccagac aaaatttagt cgcacccacc gacaaatatg 3420 agcaccgtta ctaaacgttt tatgttaaca gattccaata tatattacca ctctaatcac 3480 aactaacaaa taaacaaaaa tctgcatgcg ctcactggcc ctgcatgcac tgcttgacgt 3540 gaatgagatg aactgaatta actataataa ctgtgctgtt cttatgggtg gtgtctgata 3600 ctgcacagat gctattgaca tataacttat tatttgcata cgtcacccta atatataacg 3660 gtcttttcat gaactgcaat attagtttca tctcagtgaa tgcatgctct ttagcgatgg 3720 tgactgcggt atcagtcgca cgattaacag cattgtgatg attaaataaa ggattaaata 3780 acggttgaac attcattcat ttttttccgg cttagtccct tttttaattc ggagtcgcca 3840 acttatccag catctgtttt acgcagcaga tgcccttcca gctgcaaccc atctctggaa 3900 aacattcata cacacccaat cacacgtata cactacggac aatttagcct gcccaattca 3960 cctatactgc atatctttgg actgtggggg aaaacggagc acccgaagga aacccacacg 4020 aatacaggga gaacaartaa acttcacgtt agaacatctc gtgcttaaaa tgtgtagatt 4080 aagtggcaaa cactgttacc taaaaactcc atcccaccag ttttacagtg aatgctataa 4140 tcattttcat agcggacttt gagcactaga tgtcttttat ccactcttcg aaaagtgtaa 4200 atgctggatt gacattctcc acataatgac taatcagatg tgccatgatt tgtaactgta 4260 gatggtttct aaaactaaat ctatcagtgt tgttttcatt ctctcttatc cagacctttg 4320 catttttgca gctgtagctc cctgtcatcg gaactgagtg attgacagct gatattaacc 4380 aatccattcg cattctgttc tagcgcaatt ggccaataaa aaaagcttga agacgggctt 4440 tgcttagtgc agcaagttga cgtgttaact gttataaact attctttagt gctgcgtttg 4500 gtgtttttag gtgtaaagat gcttgcgtga atgcctccta attccgcgca tgtggtgaac 4560 atttttctgt gaaaacaaca attttatgca ctcgcataaa tgctgccaaa tacattttga 4620 ggtcgcaaag ataaaacttt gggcgcatat gcgaccaaaa cagtcgcaat ttcaagccct 4680 gtaggatgat gttgcttggc tactttgcag ctaaaaattg gcaacaactt tttttacaaa 4740 aaaaaaaatt tgaaatgtaa gtaattttgt ataaatgtat gcattttgtg aggtgtagtt 4800 accaccatga aactaaaaca gatattagta atgaatactt tttacttaag tacatatttc 4860 aggctgtact tatttacttt tacttgagta gaaaagtcta gtccgtagtt caacttttac 4920 cagagtatct gtacttctac ttgagtaaag gatgcatgta catttgccat ctctacgcat 4980 tagagatagc aaacgctata aaagttacct gttaagacat taactttgtg cactggaagt 5040 gctccttact agcctctgta ttgggtgtat tttctctttt cagacactga ttgactgccc 5100 gttttcactc cctctcccaa agtgatgtaa acgattacct atcaggaaac atcagatgtt 5160 aaaaacagaa aaaactgtat acagggtgat tcaaaaggaa tcccacaaat ttgaaaatcg 5220 aaaactctgc aaagaagaaa aggggagcta agctctttct gtcgtcgatt taaggaagct 5280 ctgaagtttc atttggttac tcattaggtc gctaggagag ctgttgtcat catgtttctt 5340 tgattaaata cagattttca aagttgtgtt attctttttg aatcaccctg cagtcacaga 5400 ctgttaactg ctataaaaca aagtacctat aaggaatctg tagactaaga aggtacaact 5460 agtggctagt aaaagatttt gctttaataa tatataaata tatatatata tatatatata 5520 tatatatata tatatatata tatatatata tatatatata tatatatata tatatatata 5580 tatattatat atatatatat atatatatat atatattttt tttttttttt tttctttaat 5640 aagaaatacc ttctaggcaa tcaaaataca aacttagact atggtacaaa aaaaaggcaa 5700 aactgtataa ttgagacaac tgtcgacttc cttgaattgt agcattgtcc tttgaagatg 5760 gttaagtctc caatgtgtca ggcttcttct ttaaaataac catcatcttt atatttccca 5820 agctgtactt tttaaacccc tgcacatcat tgatttctgt acaaaaagca aagagaaaca 5880 atcaaccact gtgacagatt tttgtaacac caatattagt ctataatatc taatcttaaa 5940 acttcattaa tacaagtaga gagagagaaa gagagagaga gagagagaga gagagagaga 6000 gagaaagaaa gaaagagagt cagacaaaac taaaaatgat atcatatcta tgttcattaa 6060 catcttacac atattaatcc tgtatgctat aatcttcata aaaagtaaga ttaagttaac 6120 atgcaataat gaagattgaa ttaaagtttt gtatcacagg gataaatgaa gttttacaat 6180 aaattagaaa ttaactttac attgtgaaac gtattataaa tcaattttaa aaattggttt 6240 tagtgtattt taaagaaagt acatgcagcc acacgtcttt caatattgtt taaataatct 6300 taccatctaa aacttttaaa tgctagtgta tctcaataaa aagataagtt aaacaaaata 6360 taaaccaatg tttctggaaa ggatggacaa gcctttaaaa taatgacaga tctacagaac 6420 tgttattaag tacttttgaa atgaaataca taaaataaac aagtagcttc tgcttacctt 6480 aatatctatt attgtatcta ttccattatt tacatttagg acatttcaat ttaaaaccaa 6540 gcacaatagt catttagatc aggtttataa aaccctattc aattaaattt tagttaatgt 6600 gttaaaagag aattctggat taatacaaat taatacatca ctaatacagt tgcatcaatt 6660 acaacatttg cagtgtttta agcaagtgaa tgagctcgga tatcttttgg cgcgcgtttg 6720 actggtatca ctggcttggt caaatattct gaacatattc tctttttttg ttgttgtact 6780 tgcagaacca actgttaagg ttacgttaag tttctctaca catcctaaat tataaagttg 6840 tgaggacagt atatgaatat gcacttaccg ctgaattata ccgctgaaga tgataaggtc 6900 aaaatggtcc agcttctcca aaaaacgtag ccctctcttt tgtcctttga ccgtgacttt 6960 ttattttaac tgtggtacgt tagtgttcat atgtgaatgg catttacagt taatatatca 7020 acaaacacgt attttatctt catttcgcga ccgaaataag gagacaaaac ggacaaaaga 7080 ttgctaaacg accgttaaga aactgcgttt gtttgtgaat ccttgttgtc aggacaacgc 7140 gcgcgttgcc gcgaatgctc ttctgtgatg aacgcgcgcg caagttcacc ccagttgcgt 7200 aagagtgcct cgtggtcgca taattcaaaa gcaaaatcgc aaaatgtgtg agtacgcatt 7260 taaagcgagc gccttctgtc ttcttcgtgc tcgagaaaga gtagtataag ttactgtagc 7320 ttgtaactgc ttcatttaaa agggttaaaa aaaagactcg actctaggaa tcgaatcctg 7380 tcttggagtc gactccggca ctacgtccat tgagggtcga ggaatcgact cttttgaagt 7440 caactctcaa ttccctacag actgattcag aaatccattt attttttgcc attgtactca 7500 tgtacacaat attaaatctc attacatttt tactttttat aaatcatgtt ttttgaaccc 7560 aagaaacaac ataaaacttt atatttaaaa attactaaaa cagaattatc attgcaaata 7620 aactcttcaa attctattta agttacttgt ttccaaaaca tccctgaata aaagtacaaa 7680 aaaatgtagc aagtacagca cagcattgcc acagcattgg gaaattggat attagattat 7740 atatcagcac aatgcacaag tgtagtaaag ttgtacatga atatactgca aattgtactg 7800 ttgttgatgt aattttatta gaattttata gttcattatt ttgcagttgc acataatttt 7860 acctcattta tcctccttta taatgctgtt atactgcagt tttgcactta ttcacggtat 7920 agtgttggtc atcatgaaat atggttttca gatgatgatg tttttcacat gtgggttcat 7980 gtgatcacta tgctgtccac atgtggacag cataatttta catgttgtcc acatttggtc 8040 acattgactt cagatgtggt caaaggtgat cacatgtggc cctatgaact tcacatgttg 8100 tccacatgtg gccacatgaa tttcacatgt tgtccacgtg gtcacataaa cttcacatgt 8160 ggtcacatga acacatgtgg tcgtgagctt cacatgtggt cacataaact tcacatgtgg 8220 tcacatgaac acatgtaatc gtgagcttca catgtggtca cattaacttc acatgtggta 8280 acatgaactt caaatatggc cacacgaact tcacatgtgg ccacatgaat ttcacatgtt 8340 gtccacgtgg tcacataaac ttcacatgtg gtcacatgaa cacatgtggt cgtgagcttc 8400 acatgtggtc acataaactt cacatgtggt cacatgaaca catgtaatcg tgagcttcac 8460 atgtggtcac attaacttca catgtggtaa catgaacttc aaatatggcc acacgaactt 8520 cacatgtggt cacatgaact tcacatgttg tccacctgtg gtcacataaa cttcacatgt 8580 ggccatgtgt gatcacataa actttacatg tggccatgtg tgatcacata aacttcacat 8640 gtggccacat gaacttcaca tgttgtccac atgtggtcac ataaacttca catgtggtca 8700 tgtgtgatca catgttgtca catgtgatca catgtggtca catgtgatca catgtgaaat 8760 tcatgtgact tttctgtaag gg 8782 // ID Gypsy124-LTR_DR repbase; DNA; ZEB; 623 BP. XX AC chr16; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy124-I_DR; KW Gypsy124-LTR_DR; Gypsy124_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-623 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1285-1285 (2008). XX DR Genome; chr16; Positions 14966086 14966708. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 623 BP; 122 A; 117 C; 127 G; 257 T; 0 other; tgtcacggtt tattctttat ttagccttcc ttcattatta ttattattat tattattatt 60 attattctct tcgtgatttg tgttgtggtg gagattgttt tatcaattgc aatttctttt 120 ctttgaggct ggctccgcct tgcacctgcc gctaattcaa cttgtcgcct ttataagcgg 180 aagctgagtg tgtgtcgggg tggctcttga tttccagcgc gtctgactga tcctcacgat 240 cttgtctctc tttggagagt atgacccgta cttgctgaga atctctccgt atcttgccct 300 gatgaagaag gtccgaactt ctgtgcgcgt cttgtgcagc cgctgctgag gagccatgct 360 ggtgttgtgc tgtttctttg atctatgtat tttttttatc aatattttct ttatgatata 420 tgtttttgtt tgttagtcaa agcaattgct ttgacttggt tattgtttga ttaatgattt 480 atatttctgt tgccacctta attagatagg acacgtatct tctggtctct aaataaatag 540 agaattgtaa ttctggtttc tgtgtccttt tatgttgcta accccttccc tagacgagcc 600 ttaaaaaact ggggttcgta aca 623 // ID TDR18 repbase; DNA; ZEB; 567 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 21-APR-2011 (Rel. 16.04, Last updated, Version 2) XX DE Zebrafish non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; TDR18. XX NM TDR18. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-567 RA Jurka J.; RT "TDR18: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 28-28 (2002). XX DR [1] (Consensus) XX CC TA target site duplication. XX SQ Sequence 567 BP; 205 A; 80 C; 81 G; 199 T; 2 other; cagttgaagt cagaattatt agcccccctg aattattagc ccccctgttt atttttttcc 60 ccaatttctg tttaacggag agaagatttt ttttttcaac acatttctaa acataatagt 120 tttaataact catttctaat aactgattta ttttatcttt gccatgatga cagtaaataa 180 tatttkacta gatatttttc aagacacttc tatacagctt aaagtgacat ttaaaggctt 240 aactaggtta attaggttaa ctaggcaggt tagggtaatt aggcaagtta ttgtataacg 300 atggtttgtt ctgtagacta tcgaaaaaaa tatattagct tgcttaaagg ggctaataat 360 tttgacctta aaaatggttt ttaaaaaant tataaattaa aaactgcttt tattctagcc 420 gaaataaaac aaataagact ttctccagaa gaaaaaatat tatcagacat actgtgaaaa 480 tttccttgct ctgttaaaca tcatttggga aatatttaaa aaagaaaaaa aaaatcaaag 540 gggggctaat aattctgact tcaactg 567 // ID Gypsy-165-I_DR repbase; DNA; ZEB; 4366 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.12, Created) DT 28-OCT-2009 (Rel. 14.11, Last updated, Version 2) XX DE An internal portion of the Gypsy-165_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; Gypsy superfamily; integrase; Gypsy-165-I_DR; KW Gypsy-165-LTR_DR; Gypsy-165_DR. XX NM Gypsy-165-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4366 RA Dib M.R. and Naveira H.F.; RT "Gypsy-165_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2162-2162 (2008). XX DR [1] (Consensus) XX CC Gypsy-165-I_DR is an internal portion of the Gypsy-165_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-165-LTR_DR. CC Gypsy-165_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 344-aa gag CC Gypsy-165_DR1p (pos. 18-1049) and 1131-aa polyprotein (pos. CC 959-4351, conceptual translation) composed of the protease, CC reverse transcriptase, and integrase domains. XX FH Key Location/Qualifiers FT CDS 18..1049 FT /product="Gypsy-165-I_DR_1p" FT /note="Gag-protein." FT /translation="MDPEALSTKELLAVIGSHEASFQRHEEVLRRQEEVMT FT KHSELLADVTSSIRQLFQSLPGVSSPASPAAPPLSTNSPPIAPVAAAEPRL FT PPPKPFSGDPSSCQGFLTQCSLTFELQPSSFPTDRSKIAYIITLLTDKALS FT WASAAWESQPSYCQSYTAFEKEFKKVFNHPVSGQEASKRLLTLRQGPRSAA FT DFAIEFRTIAAGSGWNDEALRVCFLGGLAESIQDEMATREPAKDLESLIDM FT AIRLDIRLRERRMTRGRASHSQTPVHKPASPVHAPPVRLLPVNEQAPEPPE FT DMQLGRSRLSPSERDRRMRERRCLYCGMSGHFRSTCPELSGNEGPHRVTGG FT L" FT CDS 959..4351 FT /product="Gypsy-165-I_DR_2p" FT /note="Polyprotein." FT /translation="MPLLWYVWSFSIHLSRTIGKRGSPPSYRRTVMGKIRV FT PPANTVLALKAILSWESHQFPVQAMIDSGAAGNFIDLSLAKKLKIPTHLLP FT HPQSVTALDGRPLEPGKVTEATQSLKLTIAKHQQEETFYLIDSPEYPVILG FT HPWLHRHNPHINWSTGSILDWSPSCHFTCFTHSLSAPHPEPQDSVDLSQVP FT AVYHKFRAVFSKSRATSLPPHRPYDCAIDLLPGSSPPRGRVFSLSPPEQAA FT MNAYIQESLATGIIRASTSPAGAGFFFVGKKDGGLRPCIDYRGLNKITIRN FT RYPLPLMATAFELLQGASIFTKLDLRNAYHLVRIRQGDEWKTAFNTPTGHY FT EYLVMPFGLTNAPAVFQALINDVLRDMLNIFVFVYLDDILIFSKSMEEHEG FT HVSRVLQRLLENHLFVKPEKCEFHVSLTKFLGYIVTPGHLEMDPSKIKAVL FT NWPIPTTVKEVQRFVGFANFYRKFIRNFSSVVAPLTALTKGGGVKIEWGPK FT AAAAFQDLKDRFTSAPILSIPNPDIPFMVEVDASDVGVGAILSQRNEDGKL FT HPCAFMSRRLSNAERNYHVGDRELLAVKLALEEWRHWLEGARHPFQVLTDH FT KNLEYLQQAKQLNPRQARWSLFFNRFQFILTYRPGSKNLKPDALSRAYSPE FT THEKPVSSIIPSSRIVAPLRWDLEQVVRKAQTKEPDPGNGPLGALYVPRAV FT RAQVLQWGHESVLTCHPGSSRTLEFLRRRFWWPSIKEDVKGYVEACQVCCQ FT GKSSHQRPQGLLHPLPVPHRPWSHLSLDFITGLPLSQGNTVILVVVDRFSK FT AARFIPLPKLPSAKETAELIISHVFRVFGIPQDIVSDRGPQFLSRFWGAFC FT RLFGTTASLSSGFHPESNGQTERVNQDLETTLRCMAANNPTTWSSYIMWAE FT YAHNTLKSSATGLSPFECQFGYSPPLFPEKEVQVGVPSAQHFVRRCRRTWR FT RARSALLRTSLRYQHQANRRRRRPPTFRVGQRVWLATKNLPLRVESRKLSQ FT RFIGPFRIARKVNPVSYRLYLPRSLRINPTFHVSLLKPVLSSPFAPPRRPP FT PPPRIIDGQPAYTVHRILDSRRVQNSLQYLVDWEGYGPEERSWIPAKDILD FT PSLIREFHTQRPGCSGRNVRSRS" XX SQ Sequence 4366 BP; 1016 A; 1224 C; 988 G; 1138 T; 0 other; gaaacaccga gtcagacatg gacccagagg cactcagtac caaagaactg ttggcagtga 60 ttggcagtca tgaagcttcg ttccagcgtc atgaagaggt tctccgtcgt caagaggagg 120 tgatgacaaa acactcagag ctccttgctg atgtcacgtc gtccattcgg caactctttc 180 aaagccttcc tggggtttct tctcctgctt ctcctgctgc accaccccta agtaccaaca 240 gtcctcccat agctccagta gctgctgcag agcccagact tccacccccc aagccattct 300 ctggagatcc tagttcctgt cagggatttc tcacccagtg ctctctcact tttgagcttc 360 agccctcaag ttttcccact gaccgctcaa agatcgctta tattatcacc cttctgactg 420 acaaggcatt gtcttgggcc tctgcggcat gggagtccca gcctagttat tgccagtcat 480 acacagcctt tgagaaggaa ttcaagaagg tgttcaatca cccagttagt ggacaggagg 540 cttccaaacg tctccttact ctccgtcaag gtcctcgcag cgctgcagac ttcgccattg 600 aatttcgaac tattgcagca ggtagtggat ggaatgacga ggccttaaga gtctgctttc 660 tgggcggatt agctgaatcc attcaagatg agatggccac ccgggaacca gccaaagacc 720 tagaatccct tattgatatg gccattcgcc ttgatattcg cttgagagaa cggagaatga 780 ctcgaggcag agcatcccat tcccaaactc ctgttcacaa acctgcatct ccagttcatg 840 cgccaccagt cagacttctc ccagtcaatg aacaagctcc cgagcctcca gaagatatgc 900 agctaggtcg ttccagactc tctcccagtg aaagggacag acggatgagg gagcgacgat 960 gcctttactg tggtatgtct ggtcattttc gatccacttg tccagaacta tcgggaaacg 1020 agggtcccca ccgagttaca ggaggactgt gatggggaaa ataagagttc ctcctgccaa 1080 caccgttcta gctctcaaag ctattttgtc ctgggagagt catcagttcc cagtccaggc 1140 aatgatcgat tcaggggccg caggtaattt catagatctc tccttggcca agaaacttaa 1200 gattcctacc caccttctcc ctcatcccca gtcagtaact gctttggatg gtagacccct 1260 tgaacccggc aaagtaactg aggccactca gtccctgaag cttaccattg ctaaacatca 1320 gcaggaggag actttctacc ttattgactc tcccgagtat ccggtcattc taggtcatcc 1380 ctggttgcac agacataatc cccatatcaa ctggtctact ggttccattc tagattggag 1440 tccttcatgt cacttcacct gttttaccca tagcctctct gcccctcatc ccgagcctca 1500 agattctgta gatctgtctc aagttcccgc tgtctatcat aagtttaggg cagtattcag 1560 taagtctcga gccacctctt tgccacctca ccgcccatac gactgtgcaa ttgaccttct 1620 ccccggttcc tctcctccta gaggcagagt cttctcccta tctccccctg aacaggctgc 1680 tatgaatgct tacatccaag agtccctggc aactggcatc atccgagcct ccacttcccc 1740 tgctggtgct ggcttcttct ttgtggggaa gaaggatggg gggcttaggc cttgtatcga 1800 ttaccgaggt cttaacaaga taaccattcg gaatcgatat cccctgcctc ttatggctac 1860 tgcctttgag ctgctgcagg gagcttccat ttttaccaag ctcgaccttc gcaatgccta 1920 ccatctggtg cggatacggc aaggagatga atggaagact gcttttaaca cccccacagg 1980 ccactatgaa tacctggtga tgcctttcgg ccttaccaat gcccctgccg tgttccaggc 2040 acttatcaac gacgtcctcc gagacatgtt aaatatattt gtattcgttt atctggacga 2100 tatacttata ttttccaagt ccatggagga gcacgagggc catgtcagca gggttctcca 2160 aagactcctt gaaaaccatc tctttgtcaa gccagaaaaa tgtgagtttc atgtttccct 2220 gactaagttt cttgggtaca ttgtcacccc tggtcacctg gagatggacc ctagtaagat 2280 taaagctgtt ctcaactggc ctattccaac cacagtaaaa gaggtgcaac ggtttgtggg 2340 ctttgcaaac ttttacagga agtttatcag gaatttcagc tcagttgtgg ctcccttgac 2400 agcactgaca aagggaggag gagtcaagat tgaatggggt cctaaagcag cggctgcctt 2460 ccaggatctc aaggatcgat tcacctcagc tcccatactc tctatcccta atccagacat 2520 accctttatg gtagaggtag atgcctcaga tgtgggtgta ggagccattt tatcacagag 2580 gaatgaggat ggaaaactac acccctgtgc tttcatgtca cgtcgcctgt ctaatgccga 2640 gcgcaactac cacgtggggg accgagagct gcttgctgtt aagttggcct tggaagaatg 2700 gcgccattgg cttgagggcg ctcgacatcc tttccaggta cttacagacc ataagaacct 2760 agaatatctc cagcaggcca agcaactgaa ccctcggcag gctcgatggt ctctgttttt 2820 caacagattt cagttcatcc tgacttatag acccggttcc aagaacctta agcctgatgc 2880 cttgtcccga gcctactctc cagagacaca tgaaaaacct gtttcttcta ttattcctag 2940 ttcaaggatt gttgcccccc tcagatggga tctggaacaa gtggtccgta aggctcaaac 3000 caaagaacct gatccaggga atggaccgtt gggggctcta tatgtccctc gagcagtgcg 3060 agctcaggtc ttgcagtggg gtcatgagtc tgtattgacc tgccacccag gtagttcccg 3120 tactttagaa ttcctccgac gtcgcttttg gtggccttcc ataaaggaag atgttaaagg 3180 ttatgtggag gcctgccaag tatgttgtca gggaaaatca tcacaccagc gacctcaggg 3240 actgctccat cccttacctg ttccccacag gccttggtca cacctttccc tggatttcat 3300 tacaggactt ccactctccc agggcaacac ggtcatattg gttgtggtgg accgattttc 3360 caaggctgcc cggttcattc ctctgcccaa gttgccatcg gctaaagaga ctgctgagct 3420 cataataagc catgttttca gagtttttgg cattccccaa gacattgttt ctgaccgagg 3480 tccacaattt ctgtccagat tctggggggc tttttgcaga ctcttcggaa ccactgccag 3540 cctatcatcc gggttccatc ctgagtctaa tggtcagaca gaacgagtta accaagattt 3600 ggaaaccact ctacggtgca tggcagccaa taaccccacc acttggtcat cttacataat 3660 gtgggctgaa tatgcccaca acaccctcaa gtcctcagcc accggacttt ccccttttga 3720 atgccaattt ggttattccc ctccattgtt tcctgagaaa gaggtccagg tgggagttcc 3780 ctcagcccag cactttgttc gacgctgtcg acgaacctgg aggagagcta gaagcgctct 3840 ccttcgaacc tccctgagat accaacatca ggctaatcgc cgtcgtcgaa ggcctcctac 3900 tttccgggtt ggccagagag tctggttggc cactaagaac cttccactcc gggttgagtc 3960 gagaaaattg tcccagaggt tcatcgggcc atttagaata gccaggaaag ttaaccctgt 4020 ttcttatcgt ttgtatcttc ctcgttcact tagaattaat cccacatttc atgtctcttt 4080 attaaaacct gtcttgtctt ctccctttgc cccccctcgc agaccccctc cacctcccag 4140 gatcattgac ggccagccag cctacacggt ccaccggata ctggattcca ggagggtcca 4200 gaactcactt cagtatctgg ttgactggga gggctacggg ccagaggagc gctcctggat 4260 tcctgccaaa gacatcttgg accctagttt aatccgggag tttcataccc agaggccagg 4320 gtgttctggt aggaacgtca ggagccgttc ctaaaggagg gggtcc 4366 // ID SUSHIIDR1_LTR repbase; DNA; ZEB; 154 BP. XX AC AL591175; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE LTR of putative novel retrotransposon SUSHIIDR1. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW SUSHIIDR1_LTR; retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-154 RA Jekosch K.; RT "SUSHIIDR1: putative retrotransposon (LTR)."; RL Repbase Reports 2(2), 17-17 (2002). XX DR [1] (Consensus) XX SQ Sequence 154 BP; 19 A; 34 C; 35 G; 66 T; 0 other; tgacggcttg gattgccgtg cgcctctcgc tgaagcagtt ttattttgct ttcattttct 60 tttgcattga ctatctagtg acggcttggt ttgccgtgcg cctcctattg ggacgattgg 120 tattgcattt agttttcttt tgctttaccc cttt 154 // ID CR1-37_DR repbase; DNA; ZEB; 2717 BP. XX AC . XX DT 20-FEB-2009 (Rel. 14.02, Created) DT 20-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-37_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2717 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 523-523 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(2..391,425..748,752..1825,1797..2513) FT /product="CR1-37_DR_1p" FT /translation="YRPPSSKSGFYDNLKHFMKELDPKKELILVGDFNINW FT TEKLQRKKLQEIVSSFDLVQLIQGPTRITSSSETQIDLIFSNKPERVTKSI FT NLLTGISDHNLILVVRKLTKHRFLYQQDKRNVLNVIPKKLKNSEILSTDNL FT EDASECLMAEINKTKSKFSRNITFSKKKKKQNLPWLNEQLWKLMRQRDHAL FT KKSIKSGLAHDKRTFQQLRNKVVKELRQAKAIFXIDLINNAKGNSREVENI FT NKVQKKITINNKAIELQSQGRLIQDKYQIARLFNNFFIDSXQDLAQGFSFR FT SKDITLPNYDFTVFNISEVSESCVQKFLSQLXSSRAKDAFECDSAFFXKYA FT NVFXTPLTHLVNLSIRQSYVPSAWKSAVITPIFKTGDPTNLVNYRPISILP FT XASKVIEKVVCNQLVEHLNLDKFPLHSMQFGFRARHSTETANCYFIEQIKS FT SLDRGGVVGAVFLDFKKAFDNVNHNILLSKLSRFNFSVNALSWMESYLKSR FT KQCTRINGACSPYINCNFGVPQGSILGPILFSLYINDLPTVCPEVQVQMYA FT DDTVVYTHAKTKEHAAKKLTAALKKVSDWLYRSCLTLNVSKTVGMFFFNVK FT LWECFFSIKKNEEHCSDILVNGEVINIVDHFKYLGIIIDSNLNFKKHIKKV FT SKSARASLRIYRNIRHQLPMAAAKLFMNTMILPHLSYCATSWSHTNITTLK FT PLYVIYKQTVKILAKKPRSYHHCNCIAQQKLLTFDSFLFYADVCLMYKIFH FT DLAPPPLKQCVIVGKDNIRQTRSSVRGDCSVKLKRTAFGQSVFSVKAAERW FT NKIPVYIRESSTFNNFKTVLKIWLKQNQICSH*" XX SQ Sequence 2717 BP; 1012 A; 381 C; 444 G; 872 T; 8 other; atatagacct ccttcatcaa agagcgggtt ttatgataat ttaaaacatt ttatgaaaga 60 gctcgaccca aaaaaagaat taatattggt tggagatttt aatattaatt ggactgaaaa 120 gttacaaaga aaaaagcttc aggaaatagt aagcagtttt gacctagtgc agttaattca 180 ggggcctact agaattactt catcatctga aacccaaatt gacttgattt ttagcaataa 240 accagaaaga gtaacaaagt caataaactt attaacaggg atatcagatc ataatcttat 300 attagtggta agaaaactaa caaaacatag gtttttatat caacaagata aaagaaatgt 360 gttaaatgtt ataccaaaaa agcttaaaaa ttaatttgaa gaggaaatca aaaaattaga 420 ttgaagtgag atattatcta cagataatct agaagatgca tctgagtgtt taatggcaga 480 aattaataaa accaaaagta aatttagtag aaatatcacc ttctctaaaa aaaaaaaaaa 540 gcaaaattta ccttggttaa atgaacaatt gtggaaatta atgagacaga gggatcatgc 600 cctaaagaaa tcaattaaat cagggttagc ccatgataaa agaacctttc agcaactgcg 660 taataaagtt gtaaaggaac taagacaagc aaaagcaata tttyttattg atcttataaa 720 taatgcaaaa gggaatagta gggaagtata ggagaatatt aataaagtgc aaaaaaagat 780 aactattaat aataaagcca ttgagttaca atctcaggga cgtttaattc aagataagta 840 tcaaattgct aggttattta ataatttttt tatagattca gyccaagatt trgcacaagg 900 atttagtttt agaagtaaag acattacatt accaaattat gattttactg tttttaatat 960 ttctgaagta tctgaatctt gtgtacaaaa atttttaagt caacttarga gttctagagc 1020 taaagatgcc tttgaatgtg actcagcatt ttttwaaaag tatgcaaatg tttttartac 1080 acctttaact caccttgtta acttatcaat aagacaatct tatgttccaa gtgcctggaa 1140 gtcagcagta ataacaccca tatttaagac tggagatcct actaatctgg tgaattatag 1200 gcctatcagt attttaccaa ytgcctctaa agtaattgaa aaggttgttt gtaatcagct 1260 ggtagagcat ttaaatttag ataaatttcc attacactct atgcagtttg gatttagagc 1320 acgtcattct acggagacag caaactgtta ctttattgaa caaattaaat ccagtctaga 1380 taggggtgga gttgttgggg cggtctttct tgattttaaa aaggcctttg ataatgttaa 1440 ccacaacata cttttatcta aattatcaag gtttaatttt tctgttaacg cattatcatg 1500 gatggagtca tatttaaaat caagaaagca atgtacaaga attaatggtg catgctcacc 1560 atacatcaat tgtaattttg gagttccaca aggatcaatt cttggaccaa ttttgtttag 1620 tttatacatt aatgatctgc ctacagtgtg tccagaagtt caggttcaaa tgtatgcaga 1680 cgacacggta gtatatactc acgctaaaac caaagaacat gctgctaaaa aactwactgc 1740 tgcactcaaa aaggtttcag attggttata tcgttcatgt cttaccctca atgtaagtaa 1800 aactgtggga atgttttttt tcaattaaga aaaatgagga gcattgctca gatattttag 1860 ttaatggaga agtaataaat attgttgatc attttaaata tttaggtata attattgact 1920 ccaatttaaa ttttaaaaaa cacattaaga aggtttccaa aagtgctaga gcaagcctga 1980 ggatctatag aaacattagg catcagttac ctatggctgc tgctaaactt tttatgaata 2040 caatgatttt acctcattta tcttattgtg ctacaagctg gtcgcatacg aacattacaa 2100 cattaaaacc attgtatgtt atatataaac aaactgttaa aattttagct aaaaagccaa 2160 ggagctatca tcattgcaac tgtattgcac aacaaaaatt actaacattt gacagtttct 2220 tattttatgc tgatgtgtgt ttaatgtata aaatatttca tgatcttgca ccaccacccc 2280 tcaaacaatg tgtgattgtc ggcaaggaca atataagaca gactaggtca tctgttagag 2340 gtgattgttc agttaaactt aagagaactg cttttggaca atcagttttt tcagttaaag 2400 cagcagaaag atggaataaa ataccagtgt atatcagaga gagcagcact tttaataatt 2460 ttaaaactgt tcttaaaatc tggctaaaac aaaatcaaat atgcagccac tgatttgatt 2520 tcattttatt gacaatgatg tattgtattt attgtactgt aagtgtatat tgtatttgtt 2580 ataagtgtta aatttgttac ttcctgccca gggactacag atgtaaatta gctttatagc 2640 taactctggc acaacatgtc atgttgtaca ttgtccctgt ataaataaat aaagtaaata 2700 aataaataaa aataaaa 2717 // ID ERV1-N1-LTR_DR repbase; DNA; ZEB; 880 BP. XX AC . XX DT 22-OCT-2008 (Rel. 13.1, Created) DT 22-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, LTR - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N1-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-880 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1597-1597 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 880 BP; 243 A; 149 C; 174 G; 314 T; 0 other; tgaaggattc ggatttctgc tttccagtcc atctcatcat ggaccaacgt ttatcctttt 60 catcatcctg tttctcagta tttaactgtg aattgatttt aattaatgaa taagttaaca 120 cattgcttaa tataagaatg ttcagtgcat ttgcctgtct ctttatacct attttcagga 180 gccagcaaac taggtcagag gtcatggaga ccttgagtgg aactgagggc tgaggactgg 240 aagcaactgt ttctattgtt atttctagga gttaatactg tctaaaatgt ctaaagtgat 300 tttgctatag ttttacggtt aaatctttta aagtgattct actttttatt caagatagac 360 tttgtgtagt aagaaattat aactgcctga atgtagatta taccggtttt taaccagttt 420 tgataaagac ggctgacagt acactcagcc ttggataaga aacagattat actttaagtg 480 tgtgtgttct atttgattgt tgatccgttt cacaggtctg gagtcagctc taatcagtct 540 aaaggatgtc tgacgtttat gtttaagata ttgttcagaa ttgtacgcac gaaccccacg 600 tggaaatgtt cctagtctat ctgtgtttta accaatcagg ttagaacacc tatatgtatg 660 acgcagttaa tgacgttatg tgagagtata attgttattg attggtgatt gtaagcagag 720 cggcctagga actcattgcg agtgttgaag ctggcttctg cgaatgaaac tgcaaactat 780 cttcagtaaa cttgatttat ttatttaaat ctctgactcc ggctcttctt catctaccag 840 actggtcatt ctttgggtta aactgtaaac tatccctaca 880 // ID CR1-14_DR repbase; DNA; ZEB; 1194 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-14_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-14_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1194 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1688-1688 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1194 BP; 304 A; 226 C; 213 G; 441 T; 10 other; tgtctttcca atcgcagttt traagtcacc attgktgaac attcwtctga ggcagccttt 60 ttgtcatgtg gtgttcccca gggatctatt ttagccccta ttcttttttc cttatatatg 120 cttcccctgg gttctatttt tagaaaacat ggtctgtctt tccactgtta tgcagatgac 180 actcaaattt acctgcttta aaaccgagtt caagtggatt ggagactctt atgtcttgcc 240 tgtcggatgt gaaagcctgg ttgtctttga actttctaga ttttaatgag agtaaaactg 300 aaatrattgt gttcggcctg tcaaattctt ctaggacacc accagtagat cttggtgaat 360 ttggtttttc tgtgaagcct tgtgtaaaaa atctgggtgt tatctttgat gatggcttga 420 gatttgataa acagattaac gcagtggtca aatcctgctt ttttcagctt cgacttttaa 480 aaacagtcaa acctatttta tctagaaatt tttttraaaa aataattmat gctttcatta 540 cttctagatt ggactactgt aatgcgctat attttgggat cwgtcagaca actctatccc 600 gcctacagtt agttcaaaat gctgctgcaa ggcttttaac ckgtaccaaa aaayatgacc 660 acatcacacc agttttgcgc tctttgcact ggttgcctgt gcactatcgt gtcactttta 720 aaattcttct cctggttttt aaatctctaa atggcctcgc tccttcctac ctgtcagacc 780 tgttgactga acatcagcct ggtcggtctc ttcagtcatc taaccagaga ctattataca 840 ttccaaagtc gaggctgaaa tgcaggggtg accgggcttt cgcagtagct ggtcctacct 900 tgtggaatgc tctgcccctc tgtattaggt ctatatcatc tctgtctgtt tttaaatcta 960 ggttgaaaac ttaccttttt gatttggcat tttatcagtg agatttgttt gyttttagct 1020 ataataacta ttagtgagaa ttgtttgttt tagctataat tttatacttg ttctttgtga 1080 tttgtattgt tcagcacatt ggtcaacctt tggttgtgta taatagtgct atataaataa 1140 aattgacatt gaaataaata aataaattga cataatagct ctaaatacaa taat 1194 // ID Copia6-LTR_DR repbase; DNA; ZEB; 456 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.11, Created) DT 29-OCT-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Copia6-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-456 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1780-1780 (2008). XX DR [1] (Consensus) XX CC 5-bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 456 BP; 85 A; 107 C; 78 G; 185 T; 1 other; tgtcatgact gtgtttgtcc accagatgcc actgtagttc gccagcaact cccagcgaac 60 tacagtggct taagaactac atttccatac attctttcgc gcacacaccc ggtctgtcat 120 ctacactgat ttacaaccac agctgtctcc gttctccgtt gattacctgg actatatatt 180 cagtcattca tcctctgttc gtcagttggt cgttgtcgtc tttnttggta tgttgttcct 240 gatcttgatc tagctcttgc tcttgtatct tgtcttagtt gttaaagtct agttcgagtt 300 taatctgttt gagttcctcg tttgtttgtt tattttgtta ctttgttact ttgatttttg 360 tattttttgt cactttcgct gcactataca tcattaaaaa tctgcacttg gattctacac 420 ccgtttttgt tcctgtttcg atcccgtaac gtgaca 456 // ID EXPANDER1_DR repbase; DNA; ZEB; 3365 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 15-NOV-2010 (Rel. 15.12, Last updated, Version 3) XX DE EXPANDER1_DR is a RTE-like non-LTR retrotransposon - consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; ORF2; RTE clade; REX3; EXPANDER1_DR. XX NM EXPANDER1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3365 RA Kapitonov V.V. and Jurka J.; RT "EXPANDER1_DR, a family of RTE-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(5), 15-15 (2002). XX RN [2] RP 1-3365 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (10-NOV-2010). XX DR [2] (Consensus) XX CC EXPANDER1_DR is a family of RTE-like non-LTR retrotransposons and CC it CC was active in zebrafish a few million years ago. CC There are several thousand copies of EXPANDER1_DR fossilized in CC the CC zebrafish genome; they are ~91% identical with the consensus CC sequence. CC The 5' portion of EXPANDER1_DR is incomplete. CC The consensus encodes a 3' portion of the reverse CC transcriptase. CC EXPANDER1_DR is 87% identical to EXPANDER1 from the fugu genome. CC Such a high nucleotide identity between non-LTR retrotransposons CC from the species diverged more than 100 million years ago CC strongly indicates that horizontal transfer events were involved CC in evolution of EXPANDERs. CC [2] Consensus update and extension. The full-length sequence is CC >74% identical to RTE elements in fugu, medaka, stickleback, and CC lamprey. XX FH Key Location/Qualifiers FT CDS 162..3353 FT /product="EXPANDER1_DR_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase, N-terminal truncated." FT /translation="MSSTTSRTVTSPGMAQPGPHPGARPGVGARMRAPGGR FT VFSHGTRPGLARRSDVGPSSRRPTTCRGSRKGPVHCGTGGGRRLGPRQPDR FT QAQKLALGTWNVTSLGGKEPELVREVERYRLEIVGLTSTHSLGSGTQLLEK FT GWTFLYSGVAHGERQRAGVGLLIAPQLSRHVLEFSPVNERVASLRLRVGDR FT SLTVVCAYGPNTSAEYPAFLESLGGVLEGAQTEDSIVLLGDFNAHVGNDSD FT TWRGVIGRNGLPDLNPSGVLLLDFCANHSLSITNTMFEHKGVHQCTWHQDT FT LGRRSMIDFVVVSSDLRPYVLDTRVKRGAELSTDHHLVVSWIRWQGGKLDR FT PGRPKRIVRVLWERLAEPQVRGIFNSHLRKSFDQIPREAGDMETEWSMFSD FT SIVNAAVRSCGRKVCGACRGGNPRTRWWTSEVRDAVKLKKESYRAWLACRT FT PETADGYRRAKRAAARAVAEAKTRAWEEFGETMEEDYRSAPKRFWQTVRRL FT RRGKHLHIDAVYSGSGELLTSTGDVVGRWKEYFEDLLNPTDMSFIEEAETG FT DAGVDSPITQAEVTEVVCKLRSGKAAGVDEIHPVYLRSLDVVGLSWLTRLY FT NIAWRSGTVPLDWATGVVVPIFKKGDRRVCSNYRGITLLSLPGKVYARVLE FT RRIRPMVEPRIQEEQCGFRPGRGTLDQLYTLTRVLEGSWEYAQPVHMCFVD FT LEKAFDRVPRGILWRVLGEYGVRGDLLRAVSSLYEQSRSLVRIAGNKSDLF FT PVHVGLRQGCPLSPILFIIFMDRISRRSLGLEGVRFGDHRISSLLFADDVV FT LLASSDMDLQHALGRFAAECDAAGMRISTSKSEAMVLHRKKVVCHLQVGGK FT SLPQVEEFKYLGVLFTSEGRMEREIDRRIGAAAAVMRSMYRSVVVKKELSR FT KAKLSIYRSIYVPTLTYGHELWVMTERTRSRIQAAEMSFLRRVAGRTLMDR FT VRSSDTREELGVEPLLLHIERSQLRWLGHLFRMPPGRLPREVFQACPTGRR FT PRGRPRTRWRDYVSRLAWERLGIPPEELEEVSGDREVWGSLLRLLPPRPGP FT GKAAEDE" XX SQ Sequence 3365 BP; 701 A; 816 C; 1116 G; 732 T; 0 other; aaatctcctc catgaaccgc caccttatcg tggtggaggg gtttgagtgc ctgagtgatc 60 ctaagagcta tgttgtcggg ggctaatgcc cctggtaggg tctcccaagg caaacaggtc 120 ttaggtgaca ggtcagacta agtgcggttc aaaaacccct tatgagttca acaacatcaa 180 ggactgtgac gtcgcccggt atggcgcagc cggggcccca ccctggagcc aggcctgggg 240 ttggggctcg tatgcgagcg cctggtggcc gggttttttc ccacggaacc cggccgggct 300 tagcccgaag gagcgacgtg ggaccatcct cccgcaggcc caccacctgc agggggagcc 360 gtaaggggcc ggtgcattgt ggaacgggtg gtggtcgaag gttaggacca cgacaacccg 420 atcgtcaggc acagaaactg gctcttggga catggaatgt cacctcactg ggagggaagg 480 agcccgagct tgtgcgggag gtggaacggt accgactaga gatagttggg ctcacctcca 540 cgcacagctt gggctctgga actcaacttc ttgaaaaggg gtggactttc ctctactctg 600 gagttgctca cggcgagagg cagcgggctg gtgtgggctt gcttatagct ccccaactca 660 gccgccatgt gttggagttt tccccggtga acgagagggt cgcctccctg cgccttcggg 720 tcggggatag gtctctcact gtggtatgtg cctacgggcc aaacaccagt gcagagtacc 780 cggccttctt ggagtccttg ggaggggtgc tggaaggtgc ccagactgag gactccattg 840 ttctactggg ggacttcaat gcccacgtgg gtaatgacag tgatacctgg agaggcgtga 900 ttgggaggaa cggcctccct gatctgaacc cgagtggtgt tttgttattg gacttctgtg 960 ctaatcacag tttgtccata acgaacacca tgttcgagca taagggtgtc catcagtgca 1020 catggcacca ggacacccta gggcggaggt caatgatcga ctttgtggtt gtatcatctg 1080 atctccgacc gtatgtcttg gacactcggg taaagagagg ggcagagctg tcaactgatc 1140 accacctggt ggtgagttgg atccgctggc aagggggaaa gctggacaga cctggtaggc 1200 ccaagcgtat tgtgagggtc ctctgggaac gtcttgctga accccaagtc agagggattt 1260 tcaattctca cctccggaag agctttgacc agatcccgag ggaggctgga gacatggaga 1320 ctgagtggtc catgttctcc gactccattg ttaacgcggc cgtgaggagc tgtggtcgta 1380 aggtctgtgg tgcctgtcgg ggcggcaatc cacgaacccg gtggtggaca tcggaagtaa 1440 gggatgccgt caagctgaag aaggagtcct acagggcttg gttggcttgc aggactcccg 1500 agacagctga tgggtatcga cgggccaagc gtgctgcagc ccgggcggtt gcggaagcaa 1560 aaactcgggc ctgggaagag ttcggggaga ccatggagga agactatcgg tcggccccaa 1620 agagattctg gcaaactgtc cggcgcctca ggaggggaaa gcatctccac atcgacgctg 1680 tttacagcgg aagtggggag ctgttgacct caactgggga tgttgtcggg cggtggaagg 1740 aatactttga ggatctcctt aaccccaccg acatgtcttt catcgaggaa gcagagactg 1800 gggatgcagg ggtggactca cccatcaccc aagctgaggt cactgaggta gtttgcaagc 1860 tccgcagtgg caaagcagcg ggggtggatg agattcaccc tgtgtatctt aggtctctgg 1920 atgttgtggg gctgtcgtgg ctgacacgtc tttacaacat cgcatggagg tcggggacag 1980 tacctctgga ctgggcaact ggggtggtgg ttcccatttt taagaagggg gaccggaggg 2040 tgtgctccaa ctataggggg atcacactcc tcagcctccc ggggaaagtc tatgccaggg 2100 tactggagag gaggatccgg ccaatggttg aacctaggat ccaggaggaa caatgcggtt 2160 ttcgtccggg ccgtggtaca ctggaccagc tctataccct caccagggtg ctcgagggtt 2220 catgggagta tgcccaacca gtccacatgt gttttgtgga cttggagaag gcattcgatc 2280 gtgtccctcg cggcattctg tggagggtgc tcggggagta tggggtcaga ggcgatctgt 2340 tgagggccgt ctcgtccctg tatgaacaga gtaggagtct ggttcgcatt gccggcaata 2400 agtcagattt gtttccagtg catgttggac tccggcaggg ctgccccttg tcaccaattc 2460 tgttcataat ttttatggac agaatttcaa ggcgcagcct tgggctggag ggggtccggt 2520 tcggggacca caggatttca tctctgttat tcgcagacga tgttgttctg ttggcttcat 2580 cggacatgga ccttcagcat gcactggggc ggtttgctgc cgagtgtgac gcggctggga 2640 tgagaatcag cacctccaag tccgaggcca tggtgctcca ccggaaaaag gtggtttgcc 2700 atctccaggt tggaggaaag tccttacccc aggtggagga gtttaagtat ctcggggttt 2760 tgttcacgag tgagggaagg atggaacgtg agattgacag gcggattggt gcagcggcag 2820 cagtaatgcg gtcgatgtac cggtccgttg tggtgaagaa ggagctgagc cgaaaggcaa 2880 agctctcgat ttaccggtca atctacgttc ctactctcac ctatggtcat gagctttggg 2940 tcatgaccga aaggacaaga tctcggatac aagcggccga aatgagtttc cttcgaaggg 3000 tggcagggcg cactcttatg gatagggtga ggagctctga cacccgggag gagctcggag 3060 tagagccgct gctcctccac atcgagagaa gtcagctgag gtggctcggg catctgtttc 3120 ggatgcctcc tggacgccta cctagggagg tgttccaggc atgtcccacc gggaggaggc 3180 ctcggggaag acccaggaca cgctggaggg actatgtctc tcggctggcc tgggaacgcc 3240 tcgggatccc cccggaggag ctggaggaag tgtctgggga tagggaagtc tggggttctc 3300 tcctaagact gctgcccccg cgacccggcc ccggaaaagc ggcagaagat gaatgaatga 3360 atgaa 3365 // ID Polinton-2_DR repbase; DNA; ZEB; 16276 BP. XX AC . XX DT 14-MAR-2006 (Rel. 11.02, Created) DT 14-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of autonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; KW Interspersed repeat; Maverick; Tlr; Polinton-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-16276 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This transposon is characterized by identical 354-bp terminal CC inverted repeats and 6-bp target site duplications. The consensus CC sequence was built based on multiple alignment of several copies CC that are >90% identical to each other. It encodes a family B DNA CC polymerase (POLB-2_DR), retroviral integrase (INT-2_DR), ATPase CC (ATP-2_DR), cysteine protease (PRO-2_DR) and additional four CC unclassified proteins (PX-2_DR, PW-2_DR, PY-2_DR, and PZ-2_DR), CC conserved in Polintons from different species. XX FH Key Location/Qualifiers FT CDS 2667..3773 FT /product="INT-2_DRp" FT /translation="MSGETKKMSELLKQTYYTPSNPGSLGGKKRLKDAIFN FT ETGVRLSDKHISDWLAGEDTYTLHRTAPLKYKRNRVIVFNIDEQFQADLVD FT MSAYSKENDNKNFLLTCIDVFSKHGWVRVLKNKTGVEVVKAFESILKEGRI FT PLKLQTDKGKEFLNKHFQSMVKKYNITHFAPSTELKACVVERFNRTLKGRM FT WRYLTATNSKRYIDVLQDIVKGYNNSYHRSIKMRPIDVTKENESVVFHNLY FT GTCVGRSVKSKCKFKIGDIVRISKVRGTFTKGYEENYTHEFFTIAACIPRE FT PHVYRLCDYDGDVIDGVFYEQELQKIVVNKNKAFKVEKILDKKKQGRETLV FT LVQWLGWPAKFSSWVNQKELVDVQTP" FT CDS 4680..5147 FT /product="PW-2_DRp" FT /translation="MVYFNNRNMDAERYVSYYQNQAGNGLPGYAGGGVMYG FT AGLGGIFRGLFRMAIPLLKRGFSIAKPHLKSAAKNILTDVVSNTLSRTINN FT NTQDGSGLMVMARRHSSKPPGVRRSGHTQKKRKRAPKKTPVLKRKRKEPGK FT RTAPSSKKKHKSIKTIF" FT CDS 3824..4669 FT /product="PX-2_DRp" FT /translation="MAEDGFYVTLPCNASLSVYPENRISCYTTRLARTINL FT KGEWQVGLIEFEFPITWYTFSEQDCRFILNNGETKVSDGLRYDNYGEGKEI FT DTLINTNQNISNELYMLKYGYYDDILFLIREINANMPPQVKLGYDHVKNKV FT FLKAPPKMSLTFFGKLAVILGLKPGVSIESANHTRENHSTGTTPVTYAPYQ FT ADINAGFYSLFIYSDIIEYQSVGDYYVPLLRCVHIAGEKHKVVSVRYDKPH FT YAAVNKTSINEITIQVKDDQNQEVNFSYGKVCAKLHFRPVK" FT CDS 6468..7001 FT /product="PRO-2_DRp" FT /translation="MDTIELTNIMDKVSRNVHFIGVIPCDYLAKTPLRKLP FT AMAIINTHPSGLPGEHWLAIYINEVGVGCFFDSFGNKPNDVRFPAFIDDFL FT RLNCVVVQYSAKQVQDYSSDTCGQHCVFFLYHMAKGKCYDYILKLYSNDFV FT KNDKRVSTFVKKLKRSRCNENVFNCVQCVQLGAVFMSHA" FT CDS 9024..8332 FT /product="ATP-2_DRp" FT /translation="MGFDNVFGHGGVQDAGVFDFRFQHPFTCIISGPSFSG FT KTFFVKMLLQNCEKMFSKNVENVIFVYNSWQPLYDDLLKMYDVKFIKGIPE FT SFDDDHLFPPEKTNLLILDDLMDECSNNLEVSRVFTQYSHHKNMSCIKIVQ FT NLFIQGKFSRTISLNTNYLVLFKNPRDSGQVNVLARQMFAGNTKFFMECYH FT DATSIPHGYLLIDYKAHTPDQYRLRTAVLSDRPVVYIQKRK" FT CDS 5154..6458 FT /product="PY-2_DRp" FT /translation="MALLHTMSEECLKSELDLFTVPLTQTAIEKNAYIEVP FT PLSAISDTSPLEFFIAGNGEDYLDLNNTLVFLRLKITNPDGTDIADGAPVG FT LINYAGATIFSQVDVSLGDRLISQGSNLYPYRCIIECLINYGKDTLESLFS FT AGLFYKDTAGHMDTADPAGGNHGLTKRAAFTNASSVLELLTPLHSDIFFQE FT KLMLNGVDVRIRMTRGKDEFCLMRSDAVAYKLSILSASLFVKKVAVSPAVR FT LGHAQALLSTTAKYPIDRVCVKNFSIPAGSRVCNQENLFLGTLPKSIVIAM FT VDNDAFTGTYAKNPFAFKHYDAEFLSIYLDSLQFPSKPLQPDYANGSAVRE FT FYQLVLASGKHLKNHALSIDREDFLNGYTLYAFNLTPDEDCGQHISLIKSG FT NIRLEARFRQPLPRTINLIVYAIFDSIIEVSNRRQILVDYF" FT CDS 13192..9017 FT /product="POLB-2_DRp" FT /translation="MATKRCLDSDGNEQYRTVRFRGDSNDGHHNQQNTTET FT LRHLIQTVEDMDRPRSTTDALIELLNSVQSMSDNQPEPSFDLNVFLSSPQN FT SDVSEDNQRELCFDPDQFLNMSQNIDACDEWSTTDFDMDHFLNTSQSSVSE FT QNSKSDQDQIVLSCQTGEGSDTRPYQTFNPTAANMGLSQDEIETLVRQGGS FT VHKYTVIPRARFNGLEIHRRINLREISSTDLADYTTFLHELLSEIVSFSRL FT LAGDSGVINITLRGESLTSDVNTLLSPRNNYDLDLFINQLEKIMQSNSEVQ FT ADQALDLCVSIATCKNGGGRRKIQDLAHDEVIRRNKMNLFCPTNITNNLCF FT TICLAHFLNPQKSHSELEAIAANMQTTAGHKVQYKIAFDDIARFEQMLNIK FT IVTFYRSNTGMLEKYTTTDKPHSKTAYLYLHDGHYFLIKNLTAFIGTPYVC FT EYCHVGYTCRRNHRCEYVCDVCNYPDCHTQTKQIIQCNDCLRFCRSRYCFE FT MHKQPPPGQQFAQCDVTKYCKMCNRRYYISGSKPKPHRCEAEYCVHCGESL FT ATGGEHECFIQPYKPMEPSDRYIFYDFETRFENGKHVANFVCAITFNGEEF FT VAEGVDCVDRLIKKFRRPKYRNYTWVAHNASGFDNFILLEYFVRMGIPPKI FT TMRGCRLIFMFDRTYKQRFLDSYLFMPMRLSKTPEAFQLDNIEKGYFPHVF FT NRAENDCYIGPYPEKHFYGYETMSDKERSDFDAWYCTVSGKVFDFKKELAK FT YGKNDVVLLRKACIKYRHEFILCTDLDPFTFTTLAGCCMGVYKSKFLPKDT FT LALTRNDGYTNQNKTYSNASIEWLEYVAKSRSIEIQHALNGGEVSFGKYHV FT DGFYDDGRVKKAFDFLGCFYHGCERCYNPYDINPLSKLSYGVLRRQVNDRC FT EILQRMYGLQVEFMWECDWKAAKNSDVGVMDFMSTYKRPERLKPRDALFGG FT RVNAFKLYHKTSDDEKVFYRDITSLYPFVQSTKSYPVGHPKIIFKDFENLE FT NYFGFIKAEVIPPRGLFHPVLPYRTGGRLMFPLCRSCAEEQNQDTPCQHAD FT KERALTGCWVSVELSKAVERGYVVSEVYEVWHFSQRSDTLFSDYVKTFLQF FT KQESSGFPEDVVTDEEKESYVKDYFEKEGVKLNIDNITFNAARRSINKQIL FT NSLWGRFSLKSQLPSSELISDPEQFARHIFAKGHDINYFSFLSDSIALVQY FT TQPYGKGKSTRDTNVFLGAFTTCYGRLELYDLLEKLGERVLYCDTDSVIYT FT YREGEYDPPLGPYLGELTDELDGGDYINIFASSGPKSYGYSTAKGKVCMKA FT KGITLNTVNSQAIRLDTLIGLVDDYVSGRDNSRHVLAHTDTIVRNKTQFTL FT HNKALVKKFKVVYTKRVLLPDYSTVPYGF" FT CDS 7967..7188 FT /product="PZ-2_DRp" FT /translation="MSTLKYAEKMYLVPQHQLDKIKPTTPRESIQHAVESD FT LDTTIRNILYRTDIDPHEKAKLYTAVLQRYLTITKQGDMDATTLKLSLPQH FT DHTDKEDPQKTITIGAVEDLIVDDILKNIPQRNEKNVRYILDKMSKSKDHA FT SWSESGEFMFKGKTIPGSHMLDLVKSITAPQKIADDRRPIGWYEFLEAFAD FT LNIPYSMVPNHGVRHTINSFKNKSKSPTSVSFKKGKKLKTQQSSHMTRSSY FT LEDTFFESPTFDRSRWMAF" XX SQ Sequence 16276 BP; 5110 A; 3153 C; 3086 G; 4927 T; 0 other; agtagtctta taatacacac agtgtcaaga tgatttattg ccttatgtca ataatttatg 60 agtatgtttt ggacggtatg gtaataattt atggcatggt ttgtccaatg tgtttcatga 120 atgtaagact ggtgaccaat gacaggcagc tttccccagt tcacaggctt agggtccctg 180 cttagtggta ggatttatga cggtttgtcc caaattgaca gttgaaccaa gtgtcgggct 240 gattttggca aagatttaca gtatagttag atttgtggca atgtgtatgg tttgtcattt 300 ttacaaccta tgcagctctt ctggcattcc agctccttcc aatctgtgag tgtaagcagc 360 agggtgttgt tatttaacac catcccaccc ctcccctccc tcaccgcctg aatgcatgtg 420 gtacatatag tgtattttgg aaaatagtga ctaaagcaca gaagacactc tgcaactttt 480 ttcctgttac gatggcagca ccgcactcac ctaaaaaacc aaaaagagtg gtgtgtatgt 540 cgcctacaag aggacatgtg gaaatgccac agccgccacc atgggcaccg aagaaaaaag 600 aaggagttcg tttgacatca tttctccatg aatatgcacc cgttactaaa cgtgatttca 660 gcatgtctga acaacgcgtc gtgagatacg tgtttgacaa actgaaaagg attgtgacat 720 tgtacactgt ggatgttgaa gacgatttta catcacagac gccttccttg attttaccgg 780 ctaaggattg gaggctattt tacaaccacg tgtggagtga tttaaaagaa tgctgcgatc 840 gaccgctcta cattgccgaa tggaatggta taatgccgaa aacacggtat cgtgtggttg 900 gtgatcactc taaaaagcag cctgatgact gtgtatattt tttttgctgc aacgataaca 960 caaagctgga gaattatgtg tggcctattc ctgagacgga acaccacaca tactataaaa 1020 tgaagttttt gtttcagtgg aaggacatgt acatgatgga cagtattttt tcatgcatca 1080 acaagatgtt taaatgggct gatatgtatg acagatacaa ttctgttaag acgaagctat 1140 ttcatcctga aaagagtgac cctgaactct caaaatttga cttaactcct cccacatatt 1200 acagtcatta aaacgcctgg aggtcgactc attcattcac agtaaactct caacatgtct 1260 caagaattta catctgactg ctgctacagc aacacgcctg aaacagtacc cgcactgcct 1320 gacgacaacg cgatgccgcc ggagccaccc acacaagttt tagacaataa ctggctggat 1380 caattttgta tggatctggg ataccacaat ttaactttcc tcacaacccc atatggaaat 1440 gctggcacga tagccggagc tgcggcctat gcagggtgtg acgatacaga tggctgtata 1500 cgagctgatg cttttaaaat cataaaagca accatcatcg tcatcctgga acatctcatc 1560 aacaaaaagc tgaaggacct ctgcctcggg tgtgaagttg accatcccag ccagatcaga 1620 cacagctgtt tgtttgaacc taatgccttt ttctttgaca catacttcga tgaactgaca 1680 cgcaccctga ttaaaccagg actgaaacac gtcattgctc aagcactaaa ccgtcatggt 1740 ctaaggattc accctcagag aattcaagga tcggtggatg ccattttgca tgagctacgc 1800 gacgaagtct acattgtgga aaagctcaga gaagtcagag agaaggttct ggacactaac 1860 agcgagcaga tcgtttatga cgctgttgac tgctggaagg gaacctcaca agccacccgc 1920 gtgtaaagtc tcaaagacac ctctcaagct gaacgtttgc aaagtcagta tgggtaacat 1980 ttgttgttat gctggactgg caaaatgtct tctgaaacaa cgtgtaagat gtgaaatgac 2040 taacttgttg tataagacca tagatgacag agttgatctg gttgttgtga aaaataattg 2100 tggcgttgat accaaacgat tttctaacct caaaaaacaa atgaatgttg taagaatgct 2160 aacgaatgag tgggctgaca tgatgcatat gtgtattaat gcaggcgagc caatctgtgt 2220 aactattcga aaaatacttg acctgatgat tgaatgcggt aatggtataa gaattgccga 2280 cacattgtgt ttgtgtacat atgtgtcaga tctgtgtgtg accatgcttt caagaaatga 2340 caaagctgat gtttgtatcg ttattgaaac actggttgat tatctccttg ataaaaacat 2400 tgttatatgt ataaagtttc ttcactacct tgacgatgta tgatgtctat atatatatat 2460 tctttttaca atttgttatt gaaacactgg tttattatct ccttgattaa aaaaataaaa 2520 aacattgtac tatgtataaa agtttttttc actaccatga cagtttatga tgtcatcata 2580 tactttttct atgatttgtt aaaaataaaa aagtatgtat caccaagcta cacgtcttat 2640 tctttttttg acaccgaacc ggtaatatgt ctggtgagac gaagaaaatg tctgaactgt 2700 tgaaacagac ttattacacc ccttctaacc ctggttcttt aggcggtaaa aaacgtttaa 2760 aagatgcgat tttcaatgaa acaggcgttc gcttatctga taaacatatt tcagactggt 2820 tagcgggtga ggatacttac acacttcaca gaacagcgcc tctaaaatac aaaagaaata 2880 gggtaattgt cttcaatatt gatgagcagt ttcaggcaga tctggttgac atgtcggcat 2940 attccaagga aaatgacaac aaaaattttt tattgacatg catagatgtt tttagtaaac 3000 atggatgggt ccgtgtgtta aaaaacaaga ctggtgtaga ggtagtaaaa gcttttgaat 3060 ctatcctgaa agaagggcgc attccactca aattacaaac agataaaggg aaagagtttt 3120 taaacaaaca ttttcaaagc atggtaaaaa agtacaatat cacacatttt gcaccttcca 3180 cagaattaaa agcatgcgtt gtcgagcggt ttaatagaac cttgaaaggt agaatgtggc 3240 ggtatttaac agcaacaaac tctaaacgct acatagatgt tcttcaagac attgttaaag 3300 gatacaacaa cagctaccac agaagtatta agatgagacc aatagatgtg accaaagaaa 3360 atgaatctgt agtgtttcat aatctatatg ggacatgtgt cggtcgtagt gtgaagtcaa 3420 aatgtaaatt taaaataggt gacatagtta gaatttctaa agtaagaggt acgtttacta 3480 aagggtatga agaaaactat acacacgagt tttttacaat agcggcatgc ataccgcgtg 3540 aaccacatgt ctacagatta tgtgattatg atggtgatgt catagatggt gtcttttatg 3600 agcaggaatt acaaaaaata gttgtaaata aaaacaaagc attcaaagtg gaaaaaatat 3660 tagacaagaa aaaacagggt cgtgaaacat tagtactggt gcagtggtta ggatggccag 3720 caaaattttc ttcctgggta aaccagaaag agttggttga tgttcagaca ccataaatac 3780 aaaacgcgtg tatagaacac gtgcatttat gtaataatta accatggctg aggatggttt 3840 ttatgtcaca ctaccatgca atgcatcgct atctgtttat ccagaaaacc gtatatcgtg 3900 ttatacaaca aggctagcaa gaactataaa cttgaaagga gaatggcaag ttggactgat 3960 cgagtttgaa ttccccataa cgtggtatac attttctgag caagattgtc gtttcattct 4020 taacaacggc gaaaccaaag tcagtgacgg tttaagatat gataattacg gtgaaggcaa 4080 agagattgat actttaatca ataccaacca aaacatatcg aacgagcttt acatgctaaa 4140 gtacggatac tatgatgata tccttttttt gatcagggag ataaatgcaa acatgccccc 4200 acaggtgaaa ctcggctatg atcacgttaa aaacaaagtt tttttaaaag caccaccaaa 4260 aatgtcatta acattttttg ggaaactggc agttatttta ggattaaaac caggggtgtc 4320 tattgagtct gctaatcata cacgagaaaa tcatagcaca ggaacaacac cagtgacgta 4380 tgcgccctat caagctgaca taaatgctgg gttctattca ttgtttattt attcagacat 4440 aatcgaatac caatcagttg gtgattatta tgtccccttg ctaaggtgtg tccacatagc 4500 tggagaaaaa cacaaggttg tcagtgttag gtacgataaa ccacactacg ccgctgttaa 4560 taaaacatct atcaatgaga ttaccataca agtaaaagac gatcagaatc aagaagtgaa 4620 ttttagttac ggaaaagtgt gtgccaagct acactttcga ccggtgaaat agagcaacaa 4680 tggtgtattt taacaaccga aatatggatg ctgagcgcta cgtgtcatat taccaaaatc 4740 aggcaggcaa tggactccct ggttatgccg ggggcggtgt tatgtatggt gctggtttgg 4800 gtggcatatt caggggcctt ttcagaatgg ccataccatt gcttaaacga gggtttagca 4860 tagcaaaacc gcatcttaaa tcagccgcca aaaacatatt aacagacgtt gttagcaaca 4920 cattatcacg gactattaat aacaatactc aagacggatc agggctgatg gtaatggcac 4980 gcagacactc atctaaacca ccaggcgtgc ggaggagtgg tcatacacag aagaaaagaa 5040 aacgtgctcc caagaaaacg ccagtcttga aacgcaagcg caaggagcca ggaaaacgta 5100 cagccccatc ttcaaaaaaa aaacataaat ctatcaaaac aattttttag accatggcct 5160 tactacacac catgtcagaa gaatgtttaa aatccgaact ggatctgttc acagtgccgc 5220 tgactcaaac cgcgattgag aaaaatgcat acatcgaggt gccgccgctg tctgcaattt 5280 cagacacatc gcctctcgag ttttttatcg ccggtaatgg tgaagattat ttggacctaa 5340 acaacacgtt ggtgttttta cgcctcaaaa taacaaatcc agacgggaca gatatagcag 5400 acggtgcacc tgtaggtctt atcaactatg cgggagctac aatattttca caagtggacg 5460 tgtctttagg agatagatta atttctcaag gatcaaattt atatccctat cgatgcataa 5520 tagagtgtct catcaactat ggtaaagaca ctcttgagtc gcttttctct gcaggcctgt 5580 tttacaagga cacagcaggg cacatggaca cagcggatcc tgctggtgga aatcatggtc 5640 tgaccaaaag ggcggcattt accaatgcta gcagcgtgct ggagcttctc acacctcttc 5700 acagcgatat tttttttcaa gaaaaactga tgcttaacgg tgtggacgtc agaatacgga 5760 tgacgagggg taaagatgaa ttctgtttga tgagaagtga tgccgtagct tataaattaa 5820 gcatattatc tgcgtccttg tttgtgaaaa aggtagctgt ttcaccagcc gtgagattgg 5880 gacatgcaca ggccctgctt tcaactactg ctaaataccc cattgacagg gtgtgtgtga 5940 aaaacttttc aatccctgca ggatcgcgtg tctgcaatca ggaaaacctg tttttaggaa 6000 cactacccaa atcgattgtt atagcaatgg ttgataatga tgcttttaca ggtacttatg 6060 ctaaaaaccc attcgcattt aagcattatg atgcagaatt tctttctatt tatctggact 6120 ctcttcagtt tccatcaaaa ccactacagc ctgattatgc gaatggttct gcagtgcgtg 6180 aattctacca attagtatta gcatctggaa aacatctcaa aaaccatgcc ctatccatag 6240 atagggagga ctttttgaat ggatatacgc tatacgcatt taatctcaca ccagacgagg 6300 attgtggaca gcacatctcg cttatcaagt caggaaacat caggcttgaa gcgcgcttca 6360 gacagccttt acctcgaaca atcaacctga ttgtctatgc catattcgac agcatcatag 6420 aagtgtcaaa tcgtcgacag atcctggttg attactttta aaaaataatg gacacaatag 6480 agctcaccaa catcatggat aaggtttcac gcaacgttca ttttataggg gtgatcccgt 6540 gcgactattt agcaaaaaca ccgttaagaa aattacctgc aatggctata atcaacacac 6600 atccttcagg tcttccgggt gaacattggc tagctatata cataaacgaa gttggtgtgg 6660 ggtgtttttt tgacagcttt ggtaataagc cgaatgatgt ccgttttccc gcttttatcg 6720 atgatttttt aaggctaaat tgtgtagttg tacaatattc tgccaagcag gtgcaagatt 6780 attcatcaga cacatgcggg cagcattgtg tgttctttct ttatcatatg gcaaaaggaa 6840 aatgttatga ttacattctg aaactgtata gcaatgattt tgttaaaaat gacaaacgtg 6900 tttcaacttt tgtaaaaaaa ttaaagcgat caaggtgtaa tgaaaatgtt ttcaattgtg 6960 ttcaatgtgt acagctgggt gcagtgttta tgtctcatgc atgaaacaat taataaaaaa 7020 actaaaagaa actttaagaa tccatcatat gtctttttta ttattattat acatacatac 7080 atacatgcat acatacatac attattacat aaaaaacatt cattacaaaa ctaaaaagtc 7140 tggaaaaaga gaacggtgaa agtacataaa ttaagatact gatattaaaa agccatccac 7200 cgactcctgt caaaagtggg agattcaaaa aaggtgtctt ctagataact agaacgtgtc 7260 atatgactgg actgttgcgt ttttagtttt ttaccttttt taaaggaaac agacgtgggg 7320 gattttgatt tgtttttaaa agagttgatt gtatgtctaa ccccatgatt tggtaccatt 7380 gaatacggta tgtttaaatc agcaaaggct tctaaaaact cataccaccc tataggtctt 7440 cgatcatcgg caatcttttg cggagccgtg atacttttaa ctaaatccaa catatgtgaa 7500 ccagggatgg ttttgccttt aaacatgaat tcgcctgatt ctgaccaaga agcatgatct 7560 ttgcttttag acattttgtc taaaatgtaa cgaacatttt tttcattcct ctgaggaata 7620 ttctttagaa tatcatcaac aattaaatcc tccactgctc ctatggtaat cgtcttctgt 7680 ggatcttctt tatcggtatg atcatgttga gggagtgaca attttaaagt ggtcgcatcc 7740 atgtctccct gtttggtaat tgttaaatac ctctgcaaaa cagctgtgta cagcttggct 7800 ttttcatgcg ggtcgatatc ggtcctgtac aaaatattcc ttatagtcgt gtccaaatca 7860 ctttcaacag cgtgttgtat agactcacgg ggcgtagtcg gttttatctt atccagctgg 7920 tgctgcggaa caagatacat cttctctgca tatttcagcg tagacattat ctgttggcta 7980 tcaggctggt tataaaagga actgcaatac ttaaaagagg gagaagaaat ccaccctgct 8040 gattgataaa ttttcttttt cttctgacac caatcttttt attagcgatg agtttaatat 8100 cactctttct cctcttcagc ttctgatact gctgttctgt aagtggtatg ttcccacgta 8160 gtatattaag agcgatttca cacaatgtta caataagctc atctgtggct gcctgtaaaa 8220 taacacgtct ttgtccaggt ttagctttta atagtaactt taaaagagac aagtttctta 8280 aaagtttagc agacataatt taaatttttt ttaacccagg cgttgtcact attttctttt 8340 ctgaatatag actaccggac gatctgaaag caccgctgtt ctgagacgat actgatctgg 8400 ggtgtgggct ttataatcaa tcaagagata accgtgagga atacttgtgg catcatgata 8460 acattccata aaaaattttg tattcccagc aaacatctgt ctagcaagaa cattcacctg 8520 accgctgtct ctagggtttt taaacaacac caaataattt gtgttcagac ttatagtcct 8580 actgaacttg ccttgaataa ataggttctg cactattttt atgcaggaca tatttttatg 8640 gtgtgaatat tgtgtgaaaa cacgtgacac ttctaagtta ttactacatt catccattaa 8700 atcatccaga atcagtaaat tggttttttc tggggggaac aaatgatcat catcgaaact 8760 ctcaggaata cctttaatga atttcacatc atacattttc aacaaatcat cataaagagg 8820 ctgccatgaa ttgtacacaa aaataacatt ttcgacattt ttagaaaaca ttttttcaca 8880 attttgtaac aacattttta caaaaaacgt ctttcctgaa aatgacggtc cactaataat 8940 acaagtaaag gggtgttgaa atctgaaatc aaaaacacct gcatcttgaa cacccccatg 9000 accaaaaacg ttgtcaaaac ccataaggta ctgtcgagta atcagggaga agtactcttt 9060 tggtgtagac taccttaaat tttttgacta atgccttgtt gtgcagtgtg aactgggttt 9120 tatttctcac tatggtgtcg gtgtgagcca gtacatgcct gctgttatca cgcccactaa 9180 cataatcatc aaccagcccg atcagtgtgt ctaacctaat ggcttgggag tttacagtat 9240 ttagtgttat gcctttagcc ttcatacaca ccttaccctt agcggtacta taaccatatg 9300 acttgggacc ggaagatgca aagatattga tataatctcc accatccaac tcgtcagtaa 9360 gttcaccgag atatggtcct agcggtgggt catattcacc ttctctatat gtgtaaatga 9420 cactgtcagt gtcacagtat aagacccgct ctcccaattt ttctagaaga tcataaagtt 9480 ctaatcgccc gtaacatgta gtaaaagctc ccaagaatac gttagtgtca cgcgtcgact 9540 tgcctttacc gtatggttgt gtgtactgaa cgagtgcgat agagtctgat aaaaaggaaa 9600 aataatttat gtcatgcccc tttgcaaaaa tgtgccgagc aaactgttca ggatctgaaa 9660 ttagttctga gctaggaagc tgtgatttca agctgaagcg tccccaaagt gaattaagaa 9720 tttgcttatt tatggaacgt cgtgcggcat taaatgttat attatcgata ttgagtttaa 9780 ccccctcttt ttcaaagtaa tctttgacat aagactcttt ctcctcatca gtcactacat 9840 cctctggaaa tccggaactc tcctgtttaa actgtaaaaa agttttaaca taatcactaa 9900 atagtgtatc agagcgttga gaaaaatgcc aaacttcata cacctcagag acaacatagc 9960 ctcgctccac agcttttgac aactctacgc tgacccaaca tcctgtcagg gccctttcct 10020 tatcagcatg ctggcagggg gtgtcctgat tttgttcttc agcacatgat ctgcagagtg 10080 gaaacatcag cctaccccct gttctatagg gcaagacagg gtgaaataaa ccacggggcg 10140 gtatgacctc tgctttgatg aatccaaaat aattttcaag attttcaaaa tctttgaaaa 10200 ttatttttgg atgccctacc ggatagcttt ttgtactttg tacaaaaggg taaagagagg 10260 tgatgtctct gtaaaacacc ttttcatcgt ccgatgtttt gtggtacagc ttaaatgcgt 10320 ttaccctacc tccgaataat gcatcacgcg gttttaatct ttcagggcgt ttgtatgtgg 10380 acatgaaatc catcacaccg acatctgaat ttttagccgc tttccaatca cactcccaca 10440 tgaactctac ctgtaagcca tacatccgct gcaaaatttc acacctgtca ttaacctgtc 10500 ttctgagaac cccataagac aattttgata acgggtttat atcgtatgga ttataacaac 10560 gttcgcatcc atgataaaaa cagcctaaaa aatcaaatgc ttttttcaca cgtccgtcgt 10620 cataaaaacc atcgacatga tatttaccaa atgacacctc gccgccattt agcgcatgtt 10680 gaatttctat ggaccgcgat tttgcaacat attctagcca ttcaattgag gcatttgaat 10740 atgttttgtt ttgatttgtg tagccgtcat tacgagtcag tgccaatgta tcttttggca 10800 gaaatttgga cttgtacaca cccatgcagc aaccggcaag tgttgtaaaa gtaaacgggt 10860 cgagatctgt acacaaaatg aactcgtgac ggtactttat gcatgccttt ctcaacagca 10920 caacgtcatt tttaccatac tttgccagtt cttttttaaa atcaaagact ttgcctgaga 10980 ctgtacagta ccatgcatca aagtctgatc tctctttgtc tgacatcgtc tcatacccgt 11040 aaaaatgttt ttcggggtat gggcctatgt agcagtcgtt ttcggcacga ttgaacacat 11100 gtggaaaata gcccttctca atattatcta gctgaaatgc ctcaggggtc ttggatagac 11160 gcattggcat gaaaaggtat gaatctaaaa acctttgctt gtatgtacgg tcaaacatga 11220 agatcagacg acagccgcgc attgtgattt ttggtggtat tcccatccta acaaaatatt 11280 ccaaaagtat gaaattgtca aaaccagacg cattgtgtgc aacccatgta taattacgat 11340 acttgggtcg tctaaatttt ttaatgagac gatccacaca atcaactccc tcagccacaa 11400 attcttcccc gttaaatgta atcgcacaca caaaattggc aacgtgttta ccattttcaa 11460 atcgtgtctc aaaatcatag aaaatgtaac gatcgctcgg ctccataggt ttgtatggct 11520 gtataaagca ttcatgttct ccaccagtag ccaaactttc accacagtgg acgcaatatt 11580 cagcttcaca acgatgtggc ttgggtttgg atccactgat gtagtaacgc ctgttacaca 11640 ttttacaata ttttgtaaca tcgcattgtg caaactgttg tcctggtgga ggttgtttat 11700 gcatctcaaa acagtacctt gatctacaga agcgcaagca gtcattacac tgtattattt 11760 gttttgtttg ggtatgacag tctgggtagt tgcaaacgtc gcacacatat tcacatctgt 11820 gatttcgtct acatgtatag cctacatgac aatactcaca aacgtatggt gtacctatga 11880 atgctgttag gtttttaatc aaaaagtaat gaccgtcatg caggtataaa tatgccgttt 11940 tggaatgtgg cttgtctgtg gtagtatact tttccaacat cccagtgttc gaacgataaa 12000 aggttacaat tttgatattc agcatctgtt caaatctcgc tatgtcatca aatgctattt 12060 tgtactgaac tttgtggccg gctgttgttt gcatgtttgc tgctattgct tcaagttctg 12120 aatgcgactt ctgagggttt agaaagtgtg ctaagcaaat tgtaaaacac agattattcg 12180 tgatatttgt agggcaaaac aaattcattt tgttccttct aatcacttcg tcatgtgcta 12240 gatcttgtat cttccgacga ccgccgccat ttttacatgt cgctattgac acacacaaat 12300 ccaacgcctg atccgcctgt acttcagagt tgctttgcat tattttttca agctggttga 12360 taaagaggtc taaatcatag ttgttgcgag gcgacaatag tgtattaaca tcagacgtca 12420 gactctcccc tctcaatgta atgtttatta caccggaatc accggctaat aatctggaga 12480 atgaaactat ttcagacaac aattcatgta aaaatgttgt atagtctgct aaatctgtgg 12540 atgatatctc ccgcaggttt atgcgtctat gtatttccag accgttaaac cgcgctctag 12600 gtattacagt gtacttgtgc acactaccgc cttgtctcac tagtgtttca atttcatctt 12660 gacttagacc catgtttgct gcggtggggt taaatgtttg gtaaggacgt gtatcagacc 12720 cctcacctgt ctgacatgat aatacaatct gatcctgatc tgattttgaa ttctgttcag 12780 acacactgct ctgagatgta tttaaaaagt gatccatatc aaaatcagta gtagaccatt 12840 catcacatgc atcaatgttc tgagacatgt ttaaaaattg atctggatca aaacatagtt 12900 ctcgttgatt atcctcagat acatcactgt tctgaggaga actcaaaaac acattcagat 12960 caaaagatgg ttctggctga ttatcggaca tggattgaac tgaatttaat agttctatta 13020 acgcatcagt cgtagatcta ggtctatcca tatcctcaac cgtttgaatt aaatgacgca 13080 atgtttctgt agtgttttgt tggttgtggt gaccatcatt actatctcct ctaaatctca 13140 ctgtacgata ttgttcatta ccatctgaat ccagacaacg ttttgtggcc attttaacac 13200 aaatttattt atggtagtta aaaaaaaaaa aaataccaat agttgtacaa taagtaaatc 13260 agttacaaaa actaatcatt caaatcaatt actaaaaatc taagtatata gaaaccctaa 13320 aatgagcgtc taaaacataa tactacacat aagacatatt gtagaaaaaa gacatgcgat 13380 ttaaaagaaa aaaaaacaac ttaccgagat tgttgtttgc gtgtgttgac cagcaatgtc 13440 aataattgaa cagtcaattt attatcattc attaatatct gttgatcatt cattattttc 13500 tgttgattgt acattatttt ctgatgatta gccatacatc gtcgaactaa ggccgcatgt 13560 tcctttgcat caatttttaa ctgctttaaa tcctttgtag cagttgcgat gaaaagctcg 13620 atggtccgtt tatctctctg ttgctgcgat gcaatattgt tcaatccatt tgatagctgt 13680 aaaagaagct ccttgtactt cagaatgtca tttgtggcaa cttttgcata cgtgtttgat 13740 tgatccacaa ctggcaccgg ttttgtgttt tgcttgtttt tgtaagcctc ctggttggat 13800 ggtagtgtat cagaatccga atcttgcacg gtggtggcag cgtgaaccgg gatctcgttt 13860 gagaactcag aagcattcca aacattcacc atatcgtcgt tccatttttc aacatcgtcc 13920 tgactcgaaa caacatttgg gtcagtctgg cctgtgtgtg tgtgaataat gctaatgagt 13980 aaacatatca accagatgta catttttttt ttctatatca ttaaatagag ataacatacc 14040 ttgtgatttt cttggtcgtt ttggttcagg tgtatttggt ataacgctta tttcgtcaga 14100 tgtagcatcc acatgaacct gtgctgtagt ggtgactact ttctgcttta caccaccgac 14160 acgacggtct ctcggcacaa tattcctttt acgtcctaaa ataaatggag ggtatgatta 14220 ttaatattat tttggtaata ataaatataa caatgactga tctaacaata catactgctc 14280 gttcctgcgc gtgtacttgt tgtgtagacc tctagtagac cggtagaccg tttgtcgtcg 14340 acacggctga cgatttcact catagacgct atagttaaaa caaacacaat acaagatttt 14400 tttttttcat tattatttaa cacatgtgga aaaataaaca agaaaaaaaa aagtttttaa 14460 aaattaccta gtggtaatgt gttttcatga atactcgtca ggggtgacga cacatttata 14520 cacgctagta aaaaaaaaaa agacaagatt ttttagtgga tgtagaaaac atgaaaatag 14580 acaaaaaaaa aaaaaagttt tgctaaagaa atacctgatg gtgcatcttc atgaactgtc 14640 cgaggtgcag acgactcgtt aatacactct agtaaaaaaa gagacagact aatttagcgt 14700 gtacgcctat agaaacataa aacagacaaa aaaataaata aaacgtgaaa agcaaatttt 14760 accttgtgag ctacgatttt cagcaagatt tttaaactct aatggtgaca ataaaagaag 14820 aaaattatga catatcccat cacacaatac cgtaatgtat aaataaaaac atgtcatata 14880 tatatatata tatatataca tataacaaac caaaaaaaac ataccgtcag tattcagcgc 14940 taaaatctca gcccatatcg cctgaagtcc tggtgtctgc agagagaggg gcagttcgcc 15000 ttttaagcga tgtctgattt aataatcatg tataatatta aaaaaacgat tgctttaaac 15060 aacagtattt tgaatatttc ttatatataa acttaccatt agaggttcct gccattttta 15120 tttccggaac agtttatttt tttcccccaa aaaaattgaa agccgaagtc ttttcaacca 15180 cgttggtttt ccgaaatact gtgattttac tcttagggac gagggcttag tgtcttgtaa 15240 aatgatgtgt gaaaaaagtg ggcgtgtgtt gatagtccac acctccaatg taaaacacac 15300 attgtactgg tgtaaaaaaa caaaacaatt ttaatatacg agtaaaagta tatattgctc 15360 aaataatgaa ataaattaag atatattaaa taatacacat cttttctttt agccagtata 15420 tcgattttta aaacatctga ttgaagatct gcttttgtgt atgtgtaaga caattaagca 15480 tacaatgaat attatacatt atttatttca cgcatttata catacacatt tttatttttt 15540 ttttgtgaat ataaaatatt tggatttttc tgcacttttc attgtttctt ttttaaaaca 15600 tccttttaaa aataattttg ttgttgttgt tttttaccac aacacatggc tccgatgtga 15660 attaaagtac cacagactac gtcacagcag gaataaacga gataaggaga gaagcttttt 15720 tctgggggta cagatgtcta catagccacc attcccaaac agtaggacgg cagatagcac 15780 gacgaccagg gtagttagat aaaaccagaa ctttttaaga tcaaacaaag ttttcaaagt 15840 aagaccaaaa tcaaaggatc ctcagtcagc ggtatacgat cgattgcccg attacacccc 15900 aacaacctga aaccccccta catacactca cagattggaa ggagctggaa tgccagaaga 15960 gctgcatagg ttgtaaaaat gacaaaccat acacattgcc acaaatctaa ctatactgta 16020 aatctttgcc aaaatcagcc cgacacttgg ttcaactgtc aatttgggac aaaccgtcat 16080 aaatcctacc actaagcagg gaccctaagc ctgtgaactg gggaaagctg cctgtcattg 16140 gtcaccagtc ttacattcat gaaacacatt ggacaaacca tgccataaat tattaccata 16200 ccgtccaaaa catactcata aattattgac ataaggcaat aaatcatctt gacactgtgt 16260 gtattataag actact 16276 // ID DNA-8-35_DR repbase; DNA; ZEB; 738 BP. XX AC . XX DT 12-MAR-2009 (Rel. 14.03, Created) DT 12-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-35_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-738 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(3), 686-686 (2009). XX DR [1] (Consensus) XX SQ Sequence 738 BP; 219 A; 128 C; 170 G; 220 T; 1 other; cagtgttgcc agattgggcg gttttgaatt tgattgtgcg ggtaaaaaat ggcataggtg 60 ggttgacaaa atttgggcgg ttttgaaacg taaattttat atgcaactta tttaacaaaa 120 cataactgtg tgctcctact ccattcagtt agtagtgacc agtgcatagc gcaaaccgcg 180 tgggcccacc cgcaagagga ggtgaaggaa agttgtatca aaatctgact ccccggacaa 240 atctgactcc ccttcgcgtg cacgcgcatc acgcagcgcc tgcagttaaa ctcacagcgg 300 tttatcatga cacttttatc ratcattttt tccgcaattg acagcaagaa caaacataga 360 agcattgtct gattgacaag cagcacctaa ttatgttcaa aagaatttaa aacgccaaaa 420 tgggacgaat ttatacccca tttcgaaagt aaaacatact ctaataggta gtgtaatcta 480 cacaacatta agtgcagtgt gtgggtgaga gggggcagtg tttcgatgtg atccggttat 540 gttgtggttc tgtgactgct ggtgttggtt gctgtatggt taaaaattat gacagttaaa 600 gtaactaggt taatttcgat ttaaataaat taaaattaaa ttaaaatatt aatctaatat 660 tttgggcatt tttgtgcgtt tgggcgggtt ttggacagct tttgggctgg aaaaagtcag 720 ctatatctgg caacactg 738 // ID DNA-8-19_DR repbase; DNA; ZEB; 1647 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-19_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1647 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2142-2142 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1647 BP; 505 A; 279 C; 296 G; 565 T; 2 other; cagtaccgcc gctccctata cgcagagtac gcaggctgcg tagggcctca actccggagg 60 ggggcaccat ctcggctgct caaaaaaagt attgatattt atttactaat atcaacataa 120 actgagtgta aataaaatat atattgataa taagttgtta aatacattta aaaaatcatt 180 ttatgacgga cgcataaaca aacatgtgac gtaatttttc gtgcacgcgc atagccgcgt 240 gcgcgtggcg cggcccggtc cacttatctt tgcggctgcy agctgcatat aattataata 300 gatgcaagtt gcagctctat gcccggaaaa agacaacagc agtctggagc cgagaagagg 360 aagaaagtac aagatgaatc gcgtgcatca ctttcaggta acttccatga atctggaaaa 420 cttatttttg gttgttcaag taaattagtg aggctaacgt ttaaacttcg ccacaactgc 480 tatgcattgt gctttagaca ggacatgttc tatgactttt cgcgtttgcc cctcttcccc 540 atgttgctga taattacaca aactcaatta ttgttattaa cttgtgcaat actttacaat 600 tctgtttgca caccagctac tgtacataca catgcactgc acaattattt cactcttatg 660 tattatttat tattatagta gttattatgt atatattata tatattatat attgtatata 720 tattatatat tatatcattc ttttgcagac agcagttatc attgtttatt gttttgtcta 780 tttctataca tttcttattg ctgctgttgt tattgcttgt atcactgcaa tgacaataaa 840 gtctctttag tcttttttgt gcattacagt atgaaataac aaaactgagt aatattctgt 900 gatttctgaa agatcaataa tataaaatga gtcaaattac tgtaaatcac aggccagaaa 960 ccatttttat tgcatgaatt cattaatgat tatttttttc tgaatgttaa caaagttttt 1020 tggtttaaaa atacaccaat taattcttaa aaaacacaaa tgtctttgat ttatactgaa 1080 agcaagggtc agcttttgtc accatagatt acagtgcaaa ttaaatccac attttggttg 1140 tagtttagta aacattatga agagtgctgg actagcccat tgttttaatg gtatgtcatg 1200 agtttttctt taaatgacta atttcttctg tattgtacat ggaggtcaat gctgaaattt 1260 gtcacaaaag aagatgaggt ggcctgacca gcactgctcc aaagacacca gcagctgcac 1320 aggtacaagt acaggatgca gcactgagga ccgtccatct ctacaggtac agagagggtt 1380 ttattcctta ccttttgtaa atgtttgaaa ttgggaatgt ttctgcagaa gacatgtcag 1440 trtttatttt gatgtaaact attattttgt aatggactag ataaatattt gaatgtttaa 1500 gtataatttg agtgtttata tgcatattta tgtacataaa taaataaata aataaacatg 1560 tttaaatgtt tgtgtgattt tgttcctctg tgggggcacc acagtaaaat ttcgcatagg 1620 gcaccctttt ggccagcagc ggccctg 1647 // ID Gypsy84-I_DR repbase; DNA; ZEB; 5752 BP. XX AC chr11; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy84-I_DR; KW Gypsy84-LTR_DR; Gypsy84_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5752 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1479-1479 (2008). XX DR Genome; chr11; Positions 42653688 42647937. XX CC Positions [4493-4675] - Integrase core CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(1055..3022,3026..3970) FT /product="Gypsy84-I_DR_1p" FT /translation="MILPSVLIDLCLQRMIPLLLLYQGKIHRGDAYVVTKW FT GRFCRSPWPTESNSGSPQANVNEISVGESDTVAPTAILSQIVEGTREASKS FT PVCPTILQRLVGKCPVVNAQIGMGVDISCLLDTGSMVNTITESFFEKHFHQ FT LGVNGLKKCNWLQLKAANGLEIPYVGYFETDMKVLGQTLRGQGILVVKDPV FT DVFFHQKKEFTPGLLGMNIIGQCYNDLFEKHGPALFSVPQVRQAEQGWKDA FT LSQCHGVEDAPVPGFVGYARVQSCQPVQIPAGTMRLVPALCPKNIHSGTVV FT VLEPQNPENGGLPAGVLISSAMLQCVQGIVHVPIVNVGTEPLYLPPRIRLG FT SLLNVEIISSSQGIYFQGDRSQDPNVVVVQCHAVKSESSIDMQGVELSGLS FT EIEEEEVKRLLMKYSDVFSKHDGDLGCTNLIDHQIPLVDDTPVRQRYRRIP FT PSQFEDVKAHIRQLLDSHIIRGSSSPYASPIVLVKKKDGSLRMCIDYRQLN FT GRTRKDAYPLPRIEESLDALTGAKWFSTLDLASGYNQVPVAESDRFKTAFC FT TPFGLFEFNRMPFGLCNAPSTFQRLMERIFGDQSFQSLLLYLDDVIVFSTT FT IKQHIQQLGLVLDRLRQQNLKIKLSKCYFFRSEVRYLGHVVSSTGVSTDPE FT KISAVAKNQPQSLQELRSFLGFASYYRRFVKDFSKMAGPLNALVAELIRGQ FT KSKRPKINLGSNWTGSCERAFQALKMALTNAPVLAYADFSKPFILDIDASH FT QGLGAVLSQEHGGKHRPVAFASRGLRPPERNMQNYSYMKLEFLALKCAVTE FT KFREYLLGQKCFVYTDNNPLSHLQTAKLGALEQRWANQLADFDLEIKYKPG FT RSNVNADALSRKTPSVVELALTTPVPKELHQYGPNSCHTVASETMSVFPEQ FT PREDLGALQEADPAIGRFLVYWNRQKTPDAQERAMESSEVLELVRQWGKLV FT KMEGVLYRKFHPQEEH" XX SQ Sequence 5752 BP; 1608 A; 1113 C; 1431 G; 1600 T; 0 other; caattggcgt agtcggcagg acatttaaaa ttgatttaat ttagcagagg ctgttgtgaa 60 gtgtcttact ctattgattg tgcaacttta aatgtttttg cataagttca gtgtaagtaa 120 acggtgactg tgcataaagg atttcgcaga ctctgctttg tggaccctgg atgtcattgt 180 gctgattgaa ctgtcttcca gaattcacca acagatatgg agcaagaact aatggacttg 240 agggagcggg ttcaaagctt acaagcacag aatgaggagt tgatgagact gcgttcccca 300 tctagcagtg atgcccctag ccaacctcac ccttctgttt ctaatgaccc atcttcagtg 360 tcttttaaca gaatgcttta tgtgccccga gagaggaagt gcccacgttt ttatgggaag 420 ttggactcta acttgagtat tgaggactgg actgacgagt ctaaggtgtg tattgagggt 480 agaggttggg ctgataaaga aaaggtggtc tttctgttgg atcatcttgg ggaagaaact 540 agaatggagg taaaacttca tccacctgtt actagacaaa ctccagagtc tgtttttgaa 600 gttttaaaag atttgtatgg gggaaaacag acatttgtac aattacagca gaggttttat 660 gagcgaaaac agaaagaagg tgaatcaatg actgaatttt ctcatgcttt aatgtcattg 720 atggacttga ttcttaatag taatccaggt agtgttccca attccaatag agtgcttcgc 780 gatcagtttg ttgaatatgt ccaagatgta acgctaaaac gtgagttgaa atgacttgtg 840 agggagaagc cttctcttac gttgctcgag gttcgacgtg aggctcttag atgggtagaa 900 taggggagca gggaagcaac tctttcatat tcccagccat ggtgtaatgc cactcaaacc 960 agggtagttg accaggaacc taaggttaga ttccaggaat caacagagtt aaaagaaatg 1020 aaagaaatgt tgagacagca acaagcacaa ttaaatgatc ttacccagcg tcttgataga 1080 cttatgccta caaagaatga tccccctgct gttgctttac caaggcaaaa tacatcgcgg 1140 tgatgcctac gttgtaacaa aatggggcag attttgtcgt tcaccttggc ctacagagtc 1200 aaactccgga tcacctcagg caaatgtaaa tgagatttca gttggggaat ccgatactgt 1260 agcccctact gcaattttaa gccagattgt tgaagggact agggaagctt caaagtcacc 1320 tgtttgccct actattttac agcgactcgt gggcaaatgc cctgttgtaa atgcacagat 1380 tgggatggga gtagatattt cttgtttgtt ggatacagga tccatggtaa acacgataac 1440 agagtcattt tttgagaaac atttccatca attaggggtg aatggtctta agaaatgtaa 1500 ctggcttcaa ttaaaagcag ctaatggcct tgagatcccc tatgtgggtt attttgaaac 1560 cgacatgaaa gtgttgggac aaacccttcg aggacagggc attttggttg tgaaagatcc 1620 agttgatgtt ttttttcatc aaaagaaaga atttactcct ggcctcttgg gcatgaacat 1680 tataggacaa tgttataatg atttgtttga aaaacatggc ccagcactat tttcagttcc 1740 acaggttagg caggcagaac aaggttggaa agacgcccta tcccagtgtc acggggtgga 1800 ggatgcccct gttcctggct ttgttgggta tgccagagtg cagtcctgcc aacctgttca 1860 gattccagct ggaactatga ggttagtgcc tgctttgtgc ccaaagaata tacacagtgg 1920 gacagtagtg gttttagaac cccaaaaccc agagaacggg gggttacctg ctggagtgct 1980 aatctcatct gctatgctgc agtgtgtaca aggtatagtt catgtaccta tagttaatgt 2040 cggcacagaa cctttatatt tgccccccag aattagattg ggtagtttgc ttaatgttga 2100 aatcataagt tcatctcagg gcatttactt tcagggtgat aggtcacagg accctaatgt 2160 agtggttgtt cagtgccatg ctgttaaatc ggagtcaagc attgatatgc aaggcgtaga 2220 actgtctggg ctttcagaga ttgaggagga ggaagttaag agattgttaa tgaaatattc 2280 agacgtcttt tcaaaacatg atggcgattt gggatgtaca aatttaatag accatcaaat 2340 ccccttagta gatgatacac cagtacgcca acgctatcgt agaattcccc ccagtcagtt 2400 tgaagatgtg aaggctcaca ttagacagtt gttggacagc cacattatta gggggagcag 2460 tagcccatat gcctccccaa ttgtattggt gaaaaagaaa gatggttctt tgaggatgtg 2520 tattgactac cgccagctga atggcagaac ccgtaaggat gcctaccctt tgccaaggat 2580 agaagagtcg ctggatgcat tgactggagc aaagtggttt tcaactttgg atctggcaag 2640 tggatataat caggtaccgg tagcggaatc tgatcgattc aagactgctt tttgtacccc 2700 atttgggctg tttgaattta acaggatgcc atttggactg tgcaatgccc caagcacttt 2760 tcaaagatta atggagagaa tttttgggga ccagagtttt cagtcgctgc tgctctactt 2820 ggatgatgtg attgttttct caacaactat taagcaacat attcaacaac tggggctggt 2880 gctggatagg ctgagacaac aaaaccttaa gataaaactc agtaaatgtt actttttccg 2940 gtcagaagta cgttatttgg ggcatgtggt ctcttccact ggggttagca cagatccaga 3000 gaagatatcg gccgtagcta agtagaatca gccacaaagt cttcaagaac tcagatcttt 3060 ccttggcttt gctagctact atcgacggtt tgtaaaggac ttttcaaaaa tggctggacc 3120 cttgaatgca ttggtagcgg agctgattag ggggcaaaag agtaaacgac ctaagattaa 3180 cctggggtcc aattggactg gttcttgtga acgtgctttc caagccctta aaatggctct 3240 gactaacgct cctgttctgg cttatgctga ttttagtaag ccctttattc tggacattga 3300 tgcaagtcac caaggtctgg gcgctgtgtt gtctcaagaa catggaggta aacatcgacc 3360 agtggctttt gcaagccgtg gattgcgacc acctgaacgt aacatgcaaa attacagcta 3420 tatgaaattg gagtttttgg ccctgaaatg tgctgttaca gaaaagttca gggagtactt 3480 gttgggccag aagtgctttg tgtatactga taataatcca cttagccact tgcagactgc 3540 aaagttggga gcattggaac agcgttgggc taatcaactg gcagatttcg atctggaaat 3600 aaaatacaag cctggacgtt caaatgttaa tgctgatgct ctttctagaa aaactccatc 3660 tgtggttgaa cttgctttaa cgactccagt gccaaaggag ttacatcagt atggtcccaa 3720 cagttgtcac acagttgcca gtgaaaccat gtctgtcttt ccagaacagc ctagggaaga 3780 tcttggtgcg ttacaggaag cggatcctgc gattgggcgt tttcttgttt attggaatcg 3840 ccaaaagacc cctgatgcac aagagagagc aatggagtca tcagaggtgc tggaactcgt 3900 gcggcagtgg gggaagctgg tgaagatgga gggagtgttg taccgaaagt ttcatcctca 3960 agaggaacat tgagaaatcc gccaggttgt attaccatct gtactcaggg aaagagtctt 4020 ggccagtcta catgatgatc atgggcatca aggcattgag cggaccgcaa gtttagtgag 4080 aactcgttgt tattggccag gtatgtttaa gtttattgag gaatggtgca agagatgtca 4140 gcgttgcact ctggctaaag tggtacgacc aagagtgcga agtttcatgg ggcacctgat 4200 ggcagagaga ccattggaca tcctggctat tgacttcaca ttactggaac ctgcatcaaa 4260 tgggcttgaa aatgtactag ttatgactga tgtgttttca aagttctcac aagctatccc 4320 aactaaagat caaactgcag taacagtggc acgtgtttta gtggagcgtt ggttctactt 4380 gtttggagta ccacgacaaa tccattccga tcaaggaagg tgctttgaaa gcaagttgat 4440 ctacgagtta tgcaaactat atggcatttc aaagactcgc accacacctt aacgaccaca 4500 aggaaatggg caatgcgaaa ggtttaatag aaccatgcat gatttgcttc gtacattgcc 4560 accaaaggaa aagtgttgct ggccagacca tcttgctcag gtagtctttg cgtacaatac 4620 tacagagcat caatctactg ggtactctcc tcatgtcctt atgtttgggc aggagccaca 4680 cttgccagtc gattttctgt tgggattgga tgatggtttc cagagtgacg tagactggat 4740 agtgtaccat aaggacaact tggaaagaat attcaacaat gcccgagctc gtcttcagtg 4800 tgccacagaa catcgtgctc gaacaaatga ccagcttgtg cgacatgaga agctaaaaga 4860 aggccagctt gtgtatcgaa gagctcacaa cataaaaggc aggaacaaga ttcaagatgc 4920 ttgggattca accccttaca aagtagtgcg gtaccttgat gaacttggag ctgtgtatgc 4980 tattgttcct gtagaaggtg gcaatattag aaacctccat cgttctgaac tgcgttcagc 5040 ctataccccc ccaaatgatt ttggcctttt agatcaggca gactatcaaa aaacttcact 5100 gatgaagagg gagatattgt aatgatgtcc ctgaggcctg ggggtgtgtt atctcaagaa 5160 aaagttaatc cgcaacctag aacacaaata atagaagata ttagtccatc tttacctgta 5220 gtggttgatg tggaggacca gtctgctcca cggccggaag gagcggttgt aaggcgtacg 5280 agaagagcaa gagctggaca gcattcaaac ccataccatc ttcctcggcc tgtacatgtc 5340 agtgaagctg agttgcatag ccatgttgct cttccagtaa cggggctgtc cagcaatgtc 5400 ctaaatggtt tgtttcgccc atggttgtag tggtttaagt taatcgacga gacgtcgatt 5460 taaattttgg ggggtagatt gtaatgagac catgtctggg cactacaagt cccagcaggc 5520 ttagcgccat gcaaggaagt gttgttgttt atgtgcggac actttatttc ctgattgcgg 5580 cgtaaattaa caacaggttg taggttgctc agcgctatat aagtgagtca aagtttgttt 5640 gccattttgc tgtcgttgtt gcattctggt cgcggcacgt aaaggccgag tcaacacttc 5700 acgtgagggg tagctgatga gcaaacaaca tttaaagctg ccagatcttc at 5752 // ID LTR4_DR repbase; DNA; ZEB; 346 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative long terminal repeat (consensus). XX KW ERV3; Endogenous Retrovirus; Transposable Element; LTR4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-346 RA Jurka J. and Jurka M.G.; RT "Long terminal repeats from zebrafish."; RL Repbase Reports 8(10), 1352-1352 (2008). XX DR [1] (Consensus) XX CC 5bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 346 BP; 106 A; 59 C; 82 G; 99 T; 0 other; tgtaggagcc atacattgta tattggcgat tggccacaag gtgtcagtac gagactatat 60 aactgtggct tcactgcatg gaggaagaga gcgttggtag gaagcacaca gtttttgacc 120 gtatcgtatc gtatcgtaac tgaacgtaac tcaactcaac ttaaggtaac agagaagtgt 180 gcattataga ttaaggaaaa gttgatgaag taattctgtt gattggatgt tgattggaaa 240 ataaagacgt ttttatttgc atctattttg cctacggact ctgactttta cgcgtaaaaa 300 acttgctcgc tctaccaaca atagcggcaa tgggaaagcc gctaca 346 // ID CR1-26_DR repbase; DNA; ZEB; 3390 BP. XX AC . XX DT 25-NOV-2008 (Rel. 13.11, Created) DT 18-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW Rex1; Non-LTR Retrotransposon; Transposable Element; KW Nonautonomous; reverse transcriptase; AP endonuclease; CR1 clad; KW CR1-26_DR. XX NM CR1-26_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3390 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1700-1700 (2008). XX DR [1] (Consensus) XX CC The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1..2877 FT /product="CR1-26_DR_1p" FT /translation="SWRPSWQTCLLAPHINVVFLLFFVCFLSFPRCIHAMH FT YQHIADKLAGSCNLRQTFRIYQFYSTGLTTHCPLPQPLQRRRRKRHKRGKR FT GGLHARLKSRANRPPLPSILLANVRSLDNKLDELRARITSQREVRECCALI FT FTETWLSEKVPGTAVQLQTHSLHRGDRTTASGKAKGGGVCVFINNSWCGDV FT QTVHKHCSPDVEFLLLKCRPYYLPREFTAVFIAAVYIPPRANATAALGKLF FT NVFNAQEMAHPDAVIIAAGDFNQCNLRTVLPKYHQHVSIPTRENNTLDNVY FT SNIRGAYRAAPRPHFGQSDHISLFLYPAYRQRLKQTNPITKLVKIWNPQTE FT STLQDCFALTDWDVFKTAATKEDLSVNVQDYAEYVTGYISTCVENIIPTIQ FT VKKFPNQKPWINSKVRHMLNARSLAFTSGNENEYKSAKYGLRSAIKEAKRQ FT YKEKLDNTYSTASAGQMWRGLQHITDYKSTTNVTISASESLPDDLNTFYAR FT FESSSINTEQRHTQTQTSQPALPPPXVTPAAVHKALRKINPRKAAGPDNIP FT GQALKACASELVDVFTSIFNLSLSQSSVPTCFKTTTIVPLPKKSPLTCLND FT YRPIALTPIIAKCFERVVLPLIQSSIPDTLDPLQYAYRTNRSTSDAIAAAL FT HTSLSHLEDKDTYIRMLFIDYSSAFNTVIPHKLTHKLSELGLHPTLCDWLL FT DFLTGRPQSVRIGNKSSSTILTNIGTPQGCVLSPILYTLFTHDCVASHKNN FT TIIKFADDTAVIGCITGGDEAAYRKEVASLVTWCENNNLTLNTDKTKEMIV FT DMRKERRTHQPLFIRKLEVERVSSFKYLGVHISEDLTWTLNTTQLVKKAQQ FT RLYFLRKLRKLGLSSKILSNFYSCVVESILTNCITVWYGNATEKDRKRLQR FT VVRTAEKIIRSPLPSLQTIYHHRVHKRTASILKDPTHPQHGYSHSYPQGGG FT IGV*" XX SQ Sequence 3390 BP; 989 A; 878 C; 678 G; 844 T; 1 other; tcatggcgcc cctcgtggca gacgtgtttg cttgctccgc acataaatgt cgtgttttta 60 ttgttttttg tttgtttttt gtcttttccg cggtgcatac atgcaatgca ctatcaacat 120 atagcagaca aacttgctgg atcttgcaac ctccgccaaa cttttcgaat ttaccagttt 180 tattccaccg gacttacaac acattgtccg cttccacaac cccttcaaag gcggcgacgc 240 aaacggcaca agcgtggtaa gaggggagga cttcatgcta ggctaaagag ccgtgctaac 300 cgaccaccgc tacctagcat cttgctggct aacgtgcggt ctctggacaa caaactggat 360 gagctaagag caaggattac atcgcaacgg gaagtaagag aatgctgcgc tctgattttc 420 acagaaacgt ggctctccga gaaagtccca ggaaccgctg ttcagctaca gacccattca 480 ttacacagag gagaccggac cacagcctcc ggtaaggcta aaggaggagg tgtgtgcgtg 540 tttattaata actcgtggtg tggagacgta cagactgttc ataagcactg ctcgccagac 600 gtggagtttc tactgctgaa atgccgcccc tattatctac caagggaatt tactgccgtg 660 ttcatcgccg ctgtttacat ccctccgcgg gcgaacgcta cagcagcact cggcaaactt 720 ttcaatgttt tcaacgcaca agaaatggca catcctgatg cggttattat cgctgcgggc 780 gactttaacc agtgtaactt acggactgta cttcccaaat atcaccaaca tgtgagtatt 840 cccactcgtg aaaataacac actggacaat gtttacagta acatacgcgg tgcatacaga 900 gctgcccccc gcccccactt tggtcagtca gaccacatct ccttgttttt gtatccagct 960 tacagacaaa gactgaagca aacaaaccca atcactaaac tggttaaaat ctggaatcca 1020 cagacagaga gcacccttca ggactgtttt gctcttacag actgggatgt gtttaaaact 1080 gcagccacca aggaggactt gtctgttaat gtacaggact atgctgagta tgtgactggg 1140 tatatcagca cttgtgttga aaacatcata cccaccatac aagtcaagaa gttccccaac 1200 cagaagccct ggataaacag caaggtgcgt cacatgctga atgctcgttc tctggcattt 1260 acatcaggca atgagaatga gtacaaatct gcaaaatatg gactgagaag tgccatcaaa 1320 gaggctaaga ggcagtataa agagaaactg gataacacct actccactgc ctcagctgga 1380 caaatgtggc gaggcctgca gcacatcaca gactacaaga gcaccacaaa tgtcacaatc 1440 agtgcctcag aaagcttgcc tgacgacctc aatacatttt atgcccgctt tgagtcctcc 1500 agcatcaaca cagagcagag acacacacaa actcaaactt cccaacctgc cctcccccct 1560 cctgyagtga caccagctgc agtacacaaa gcactgagaa aaatcaaccc ccgcaaagca 1620 gccggacctg acaacatccc gggacaggcc ctcaaggctt gtgcttcaga gctggttgat 1680 gttttcacct ccatctttaa cctttccctt agtcaaagct ctgtcccaac ttgcttcaaa 1740 accacaacca tcgtccccct tcctaaaaag agccctctga cctgtctgaa tgattacagg 1800 ccaatagcac tcactccaat cattgccaaa tgttttgaga gagtggtact acccctcatt 1860 cagagcagta taccagacac tttggacccc ctgcagtatg cataccggac caataggtcc 1920 acctcagatg ccattgctgc tgcactacat acttccctct ctcacctgga agataaagac 1980 acctatatca ggatgctttt tatcgattac agttccgcat tcaacacggt tatcccccat 2040 aaactcaccc acaagctgtc tgaactcgga ttacacccca cactctgtga ctggctctta 2100 gatttcctca ctggcagacc gcaatctgtc aggattggaa ataaaagctc aagcaccatc 2160 ctcaccaaca tcggcacccc acagggatgt gttttaagcc ccatcctcta cactttattc 2220 acacatgact gtgtcgcatc tcacaagaac aacaccatca ttaagtttgc ggatgacact 2280 gcagtgatag gctgtatcac tgggggagat gaggcagctt ataggaagga ggtggccagt 2340 ctagtgacat ggtgtgaaaa caacaacctc accctcaaca cagacaagac caaggagatg 2400 atagtggaca tgaggaagga aaggagaact catcagccac tgtttattcg caaacttgaa 2460 gtggaaagag tgagcagttt taaatacctg ggggtccaca tcagtgagga cctcacctgg 2520 acactgaaca ccacccagct ggtcaagaaa gcacaacagc ggctgtactt cttaaggaag 2580 ctaaggaaac tcggtctgtc atctaagatc ctcagcaact tttacagctg tgtggttgag 2640 agcatcctga ccaactgcat tactgtatgg tatggaaacg ctactgaaaa ggaccgcaaa 2700 cgtctgcaga gagtggtgag gactgcagag aagatcatta ggtccccact gccttctctg 2760 cagactatct accatcacag agtccacaag agaactgcct ccatcctgaa agaccccact 2820 catccacaac acggttattc acactcctac cctcagggcg gaggtatagg agtgtgagat 2880 gcaggactgc cagactcaag aactctttct tcccatcagc catcagactt ctaaacagat 2940 aaccaacgca cataacagtc tatttttctg ctcagtacaa cactacatac tccatcccat 3000 tttgcactat tttattcttt attctttttt tttgcacata atccaattgc actaggcact 3060 tttttgcata ataagcacaa taaaaaaaaa aaaacaaaac aaaacaaaaa acactgtaca 3120 ctgtttacat ctgtttcatt actcaaatca ggtttacata ttgtttactt tcatagatat 3180 ttatatactt acattttaca atcaatcttc agtcacttct gtgtatatat gtgtatttat 3240 gtgtatgttg tgtatgatgt gtatgttgtg tgtatgactt cactgtggac ggcaaagtaa 3300 gaatctcatt gtacagggag acgtgtttcc ttactgtgca catgacaata aacagttgaa 3360 ttgaattgaa ttgaattgaa ttgaattgaa 3390 // ID HATN3B_DR repbase; DNA; ZEB; 979 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 10-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE HATN3B_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; hAT superfamily; HATN3_DR; KW HATN3B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-979 RA Bao W. and Jurka J.; RT "hAT-like DNA transposon from zebrafish."; RL Repbase Reports 9(2), 547-547 (2009). XX DR [1] (Consensus) XX SQ Sequence 979 BP; 282 A; 182 C; 197 G; 318 T; 0 other; caggcccgga ttggctaatc gggaggaccg ggagaattcc cggtgggccg gtccgttttt 60 ttggccgcga gggccggtgt ccctagctgc ttgcactctc agcagtcgca cttttttcat 120 ttatttattt atttgaccat agcctcactc tttttattca ttattttgcc gcagctccgc 180 tctttttatt tattattctc gcagccccgt gagcaaaatg cagcctgcag gttaatgatg 240 atgtaactat cattcgaccc caaacagcgg caccttagtg aatataagaa tttgaataat 300 taatattaat gaatattaca cctgctcaat gaaaatcaga tgattcacta cattaataaa 360 tctgacataa cttgatcaga aatctaaaaa ggggagaaaa agattaagcc atcaatagaa 420 agtaatactg tggttaaaag agtcttgcag ataacagtgc aatacttatt tttttattcc 480 aaacgcacca gcataacagc gccattgtgg tccagtggtc agcacgttgc gttacgacgc 540 cgccgacccg agttcgaacc tcacctgagt aattattatt tttttttttt tttcattttt 600 agtgttaaga catataatac tgtttgggtt acttatgtag gtgtacatat atttattttt 660 atttttttgt gtgaccaccg ttacaaagga ctggatatct ataaagaatt ttgcacagaa 720 tatatattca gatattgcag gaaaggacat tgtgatgttt taaagaatag tgttggagat 780 ttagctaccc tttaatgttc tcatatttat gaaaaacatt tgaagttcta aagcagctgt 840 ttccttgaaa aagacgtgat agtgtcatta gaaactgaat tggaaatgac gttattttaa 900 tatagtcagt cgtgaactga ggtgggccgg tctaaggctt gaaactccag ggctgaaaag 960 gagtcccact ccggccctg 979 // ID Harbinger-N12_DR repbase; DNA; ZEB; 702 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Harbinger-N12_DR is a non-autonomous DNA transposon that belongs DE to the Harbinger superfamily. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; Harbinger superfamily; KW Harbinger-N12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-702 RA Bao W. and Jurka J.; RT "Harbinger DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1290-1290 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 702 BP; 171 A; 189 C; 164 G; 178 T; 0 other; gggtgctttc acacttggtt catttgcctg gaccgtaccc aagttcgatt gtctcccctt 60 gccacctcct cggttggttt gtgttcacac tgtctttttt ccttctgaac cccggtacgc 120 ttgcgtcatc gagctgctgt tttgtttacg gccgttgcta ggtgacggtc acgaaagcaa 180 gcgccgaaat ggaaagactt ccgcacatcg cggtcattct gcttttactg aagcttttgg 240 ttttggacat acagaaagcg actcgcgtcc atctgccgca aaaatattat aaacattcag 300 aacatcacac agtgtctgca gaagctgttt ttggaggaga caagcagaca tcactgtgta 360 tcactgtgtt tgccctggct cagtcccgcg cgtgtcacca aggtacgaaa cgtgacacac 420 acacacacac acacacgaac gaacgaacgt tatgaaaacg atagtttgtt gtacagttgg 480 cggttcactt ccgttatttg gtacgattgc attcacatca gaagtgaacc gtaccagagt 540 tcgcatgaac cgtaccccag accacctctt tcaggcggac tcgggtacgg ttcacgggtg 600 cgcacccgag ttcagaagac acgttcacac tagctaaacg taccgtacta tgacgtcaaa 660 cgaacccggg tgcggaccaa aagtgctagt gtgaaagcac cc 702 // ID Gypsy-171-I_DR repbase; DNA; ZEB; 4869 BP. XX AC . XX DT 13-APR-2009 (Rel. 14.04, Created) DT 13-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-171-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4869 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(4), 865-865 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 104..4753 FT /product="Gypsy-171-I_DR_1p" FT /translation="MDFDLIKFSLLPTLEEFDRCRKKDLILIAEFFNLTVS FT KEQKKQVIKDELYVKLVEAGVLPSRAVEGSEQEQDLASDGEATSGSESGSI FT TVQKIDPMVTIRLKELELELKKQENETQLLRIKELEITADRDIKLRKLELE FT AQKIQSTKPTPLPRSSRPPSSSSEAGQQPDFDVGKYIKLVPPFRETEVDSY FT FVAFERVAAKLKWPKEMWALLLQCNLIGKAQEICAALPIEDSLDYDKVKLA FT VLRAYELVPEAYRQKFRACSKTAKQTFVEFAREKRALFEKWCVSNKTVTIE FT DLQELILVEEFKNCLPDNVVVYLNEQKVSKLSEAAILADEFVLTHKTVFPA FT VRPPRIVQSDVREKGNYSFRPAAEINTPKPSDKKRVCFYCLDPTHLISNCK FT AWKQKASNKPKGVAFTSLTSSSVQSTPRSDPTVFKSFLQTGTVALPNSHDI FT KSVVMLRDTGSAQSLIRENTLPFSSKSYTGTNVLIRGIEMGCTSVPLHNIH FT LQSDLITGPVSIGVYSQLPVEGVDVILGNDLAGTKVFPFPIVSAEPNNSSS FT DDALALSHFNSLFSSCAVTRAQRKKFEDVLDLSESFLSLSPESTEHKLLIQ FT PEPVQADAGVNEGDAPLKVGREQLITLQKSDVTLNKCVSAAADRTKISDAP FT VGYFWDDGVLMRWWQPSDRERDGAYQIVLPVGYRTQIMKLAHDHICSGHLG FT VTKTYDRIARHFFWPNMKSSVSAFVRSCHICQLVGKPNQTVPLAPLQPIPA FT VGEPFERLILDVVGPLPRAKSGHQYILTIMCTATRYPEAVPLRSITSKSVI FT RELVKFCSMFGLPKIIQTDRGTNFTSHLFKQLVKELQVEHELSSAYHPESQ FT GALERFHQTLKTMLRAYCVETSKDWVDGLPLLMLAIRSTVQESLGFSPADL FT VFGHAPRTPIKLISEQFLSKSPSHVPILDYVSTFREHLHKVWEVARKHLSA FT TQAKMKSRYDKKSVARSFQPGDSVLVLLPVPSSPMHARFSGPYVIEKKLSD FT TNYAVLTPDRRRKSRVCHINMLKAYVDRDRFDAHPATKVPVVKIALATDYV FT PQTDGLSDKNMQTGVTLTNSTVLTTLEKNLSYLNDDHRSDVIRLIKMYPTL FT FNDVPSQTNVLTHDIDVGQSMPIKQHAYRVNPNKRQVMREEVNYLVRNGFA FT VASQSPWSSPCILVPKSDGSLRFCTDYRKVNNVTKGDSFPLPRLEDCVDRV FT GSSRFVTKLDLLKGYWQIKLTERASEISAFVTPDAFMQYTVMAFGMRNAPA FT TFQRLMQKVLSGVQNCEVYLDDVVVYSMTWEEHMVTLNSVLERLAKASLTL FT NLSKCEFAKAVVTYLGKVVGQGQVKPVNAKVESIVEFPAPSNKRELRRFLG FT MTGYYRGFCKNFATVVAPLTDLLSTERKFVWDDKCESAFCSAKDLLCNAPV FT LSAPNFTLPFSLQVDASARGAGAVLMQADDAGIEHPVSYFSKKFTRCQQNY FT SVIEKEALALLLALQHFEVYLSSGNTIVVYTDHNPLTFLARMSNSNQRLMR FT WALIVQEFDLDIRYKKGAENVIADALSRIYIGD*" XX SQ Sequence 4869 BP; 1362 A; 1015 C; 1118 G; 1374 T; 0 other; attgggggct cgtccgctcg ttaatacgat atagtttatt ctttcaaatt tgcttggctg 60 tttttgaact ttttggtagg ggaggagttt aagtcaactt aaaatggatt ttgatctaat 120 taagttttct ctcttgccta cgctagagga atttgatagg tgtagaaaaa aagatttaat 180 tctaatagcg gagtttttca acttaacagt gtcaaaagaa caaaaaaagc aggttataaa 240 agatgagttg tacgtcaagt tggtggaggc cggcgttttg cctagtcgcg ctgtagaggg 300 gtccgagcaa gagcaggatt tagcatctga tggcgaggcg acatctggtt cagagtcagg 360 gagtataacg gttcagaaga tcgatccgat ggtaactatt cgtttaaaag aattagaatt 420 agaattaaaa aagcaagaaa atgagactca actcctgcga ataaaagaac ttgaaataac 480 cgctgaccgt gatatcaaat taagaaaact tgagctggaa gcgcaaaaga ttcagagtac 540 taaaccaaca ccattaccac gcagcagccg ccctccttcc agtagctctg aggctgggca 600 gcaacccgac ttcgacgttg gtaagtacat caaacttgta cctccatttc gtgaaaccga 660 agtagattcg tactttgtgg cttttgaacg agtagcggct aaattaaaat ggcccaagga 720 aatgtgggca ttgttgttac agtgcaatct tattggtaaa gcccaggaga tttgtgcggc 780 tttaccaatc gaagattcgc tggattatga taaagtgaaa ttggctgttt tacgagctta 840 tgaattagta ccagaggcgt atcgacagaa attcagggct tgttcaaaga cagccaagca 900 aacgtttgtg gagtttgcgc gtgaaaagag agcgttattc gagaaatggt gcgtctctaa 960 caaaaccgta acaatcgagg atttgcagga actgatactg gttgaagaat ttaaaaactg 1020 cttaccagat aatgttgtag tgtacctgaa cgagcagaag gtctctaaat tgtccgaagc 1080 tgcaatttta gcagatgaat tcgtgctcac gcacaaaacg gtttttcctg ccgtgcgccc 1140 tccaagaatt gttcaatcag atgtgaggga aaaaggaaat tattctttcc gaccggcggc 1200 tgaaattaat actccgaaac cgtctgataa aaaaagggtt tgcttttact gcctggatcc 1260 tacccatttg atctcaaatt gtaaagcgtg gaagcaaaaa gctagtaata aacccaaggg 1320 ggtggcgttc acttctctta cctctagttc ggtacagtca actccgcgat ctgatcccac 1380 tgtgtttaaa tcatttttac aaaccggtac tgtggcactg cctaatagcc atgatattaa 1440 atcagtggta atgttaagag atactggcag cgctcagtcg ttaatacggg aaaatacctt 1500 acccttttcg tctaaatctt atacgggtac taatgttctc attcgaggaa tcgagatggg 1560 ctgtacatct gtcccactgc acaatattca tttacaatct gaccttatta ctggtcctgt 1620 gagtattggt gtctattcac aactaccagt ggaaggcgta gacgtaattt tgggtaatga 1680 tttggctgga acaaaagtat ttccctttcc aattgtgtcg gcagagccaa ataatagcag 1740 ttcagatgat gcgctcgcgc tgtcgcactt caactctctt ttttcgtctt gcgccgtgac 1800 ccgtgcacag agaaaaaaat ttgaagacgt acttgatttg tctgagtcgt ttttatcttt 1860 atcgccagaa agcactgaac ataaattgtt gattcagccg gaaccagtgc aagctgatgc 1920 aggcgttaac gaaggcgatg ctcccttaaa agtaggtagg gaacaattaa tcactttgca 1980 gaagtctgat gtaacgttga ataaatgcgt cagtgctgct gcagaccgaa cgaaaatatc 2040 agacgcgcca gttggttatt tttgggatga tggtgtactg atgcgctggt ggcagccctc 2100 tgatagagag cgggacggag cttatcagat cgttcttcct gtcggctacc gaactcaaat 2160 aatgaagtta gcccacgatc acatctgctc tggacacctt ggtgttacaa aaacgtatga 2220 tcgcattgct cggcatttct tttggccgaa tatgaagtca tccgtgtctg cattcgttcg 2280 gtcttgtcac atctgtcaac tagtgggtaa acccaaccag acagttccgc tagcccctct 2340 ccaacccatt cctgcggtag gagagccttt cgagagactg atccttgacg ttgttggtcc 2400 tttacctcga gcgaagtctg gtcaccagta cattcttacc ataatgtgta cagccacacg 2460 ctacccagaa gccgtgccac tgcgctcaat aacatccaaa tcggtgattc gagagctggt 2520 taaattttgt tcaatgttcg ggttaccgaa aataattcaa acagacagag gaacaaactt 2580 tacatcgcat ttatttaagc agttggtaaa agagttgcag gtggagcacg aactgtcaag 2640 cgcatatcac cccgagtcgc aaggtgcgct cgagcggttt catcagacgc tcaagacaat 2700 gctgcgcgcg tattgcgtcg aaacgagcaa agattgggta gatggattac ctctgttaat 2760 gttggccatc aggagtactg tacaagagtc gttaggattt agccctgctg accttgtatt 2820 tgggcatgcg cctagaacac caataaagtt gattagtgag caattccttt ctaaaagtcc 2880 ctctcatgtg ccaatcctgg attatgtgag tacatttcgt gagcatctcc ataaggtttg 2940 ggaggttgct agaaaacatc tatctgctac gcaagctaaa atgaagtcca gatacgacaa 3000 gaaaagtgtt gctcgtagtt ttcagccagg cgattctgtc cttgtattgt tgcccgtgcc 3060 tagttctcca atgcatgccc gcttttctgg tccttacgtc attgaaaaga aattgagtga 3120 taccaactat gctgtactga ctcccgatcg gcggcgcaaa agccgagtct gccacattaa 3180 catgctaaaa gcatatgtgg atagagatcg ttttgatgcg caccctgcca ctaaagtgcc 3240 tgtagtaaag attgctttag ctacagatta tgttccccaa acagatggtc taagtgacaa 3300 aaacatgcaa acgggtgtaa cacttaccaa ttcaactgtt ctgaccactt tagagaaaaa 3360 tctctcgtac ctgaatgatg atcaccgcag tgatgttatc agactgatta aaatgtatcc 3420 cactctgttt aacgatgtac catctcagac caacgtgcta acccatgaca tcgatgttgg 3480 tcaaagcatg cccataaagc agcatgctta cagagtaaac ccgaacaagc gtcaagtgat 3540 gagggaggag gttaattatt tagtgcgtaa tggttttgcc gtcgccagtc agagtccgtg 3600 gagttcgccc tgcatcttgg tgcccaaatc tgacggctcg ctacggtttt gcaccgatta 3660 ccgtaaggtg aataatgtaa ccaaaggcga ttccttccct cttccacgct tggaagattg 3720 tgttgatcga gtcggctcct cacgtttcgt aaccaagctt gacctcctga aaggttactg 3780 gcaaattaaa ttgacggagc gcgcatctga gatctccgcg tttgtcactc cggacgcatt 3840 catgcagtat accgtaatgg ctttcgggat gcgcaacgca cctgccacat tccagcggtt 3900 gatgcagaaa gttttatcgg gtgtacagaa ttgtgaagtc tatctcgatg acgtagtggt 3960 ttattccatg acctgggaag aacatatggt cacacttaat tcagttctag agcgcttggc 4020 aaaagcttca ttgacactaa atctcagcaa atgtgaattc gccaaagctg ttgttacgta 4080 ccttgggaaa gtagttggcc aaggccaagt taaacctgtg aacgctaaag tggaaagtat 4140 tgttgaattt ccagcaccct ctaataagcg agagttgaga cgctttttgg ggatgaccgg 4200 atattaccgg ggattctgta agaattttgc cacagtggtt gcgccgctta ctgatctcct 4260 gagtaccgaa cgaaaatttg tgtgggatga taaatgtgag tccgcatttt gctcagcaaa 4320 agatttgctt tgtaatgccc cagtcttatc agctccaaat ttcacactgc ctttctctct 4380 tcaagtggat gcgagcgctc gtggcgcggg tgcagtatta atgcaagctg atgacgctgg 4440 cattgaacac cctgtcagtt atttttcaaa gaaatttaca aggtgtcagc aaaactatag 4500 cgtcatcgaa aaggaagcat tagccttact tctagctctc caacattttg aagtgtacct 4560 aagctctggg aatacgatcg tggtgtacac cgaccacaac cctttgacat ttctcgcccg 4620 tatgtcaaac tcaaaccagc gcttgatgag atgggcgctt attgttcaag aatttgattt 4680 ggacattcgt tataagaagg gtgcggaaaa tgtcatcgct gatgcattgt ctcgaattta 4740 tattggtgac tgaatgaatt caaattttca aaattgtttg agtatttgaa ttaatgttgt 4800 aactttgttt ttttttttac aaaagcaatt ttttattaca aaagtgaatt ttgtaactaa 4860 tggtagggg 4869 // ID Gypsy55-LTR_DR repbase; DNA; ZEB; 240 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy55-I_DR; KW Gypsy55-LTR_DR; Gypsy55_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-240 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1426-1426 (2008). XX DR Genome; chr16; Positions 28706945 28706706. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 240 BP; 71 A; 37 C; 49 G; 83 T; 0 other; tgtagatcat tgtgacagat gatctgtctt ataaatataa gtgatttgtg tgtttataaa 60 gtgttacact ttaatgtgtc ggctccttta agaatgcact cctcattagt tatgcatgag 120 gaacatacca cgaggttgta tgttatagca tgttccggtt aacaagtgcg cttacaataa 180 agtacaatct ttacatgaag atggacgtgt gttaattcct tggagtatca gttcacaaca 240 // ID ERV2-I_DR repbase; DNA; ZEB; 3091 BP. XX AC AL591210; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 20-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE ERV2-I_DR is an internal portion of the ERV2_DR endogenous DE retrovirus. XX KW ERV2; Endogenous Retrovirus; Transposable Element; ERV2-I_DR; KW ERV2-LTR_DR; ERV2_DR; LTR retrotransposon; class I ERV; gag. XX NM ERV2-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3091 RA Kapitonov V.V. and Jurka J.; RT "The ERV2_DR endogenous retrovirus from zebrafish."; RL Repbase Reports 2(5), 13-13 (2002). XX DR Genbank; AL591210; Positions 31851 28761. XX CC ERV2-I_DR is an internal portion of the ERV2_DR endogenous CC retrovirus. It is flanked by 99% identical ERV2-LTR_DR. CC ERV2_DR belongs to the class I of endogenous retroviruses (4-bp CC TSD). CC It encodes ERV2p, a 455-aa gag-like protein that is closely CC related CC to this from ERV1_DR. The C-terminal portion of this protein CC includes CC the zinc knuckle motif present in gag proteins of many CC retroviruses. CC ERV2-I_DR is 72% identical with ERV1-I_DR. XX FH Key Location/Qualifiers FT CDS 718..2085 FT /product="ERV2-I_DR_1p" FT /note="Gag-like protein" FT /translation="MFRQNTRLINTTEKGSLTTPSLSDQDFKSLLRQHMDS FT ITESVRLNSTKKFSIDPDDVWTLDECKKVLDACIRKNNVKVIICCYREFNL FT SRAKQQAQAKSELEKQIQELHAKITSLTKQLNLKKLKTDIEEVSQPDNKHP FT DLQIVSEKDISIQSDSDLPIGSVEVCGATRSKRTESSHNNTSVVQIQTVSR FT TLGPKEIDRLSQSLPSARTNFSEFSRALISKMRLYDMSLTEVTQLMSQILT FT ESEFNSFEHTVTSKLQHASKDDLREGVLKALKNIVGPKIDWSKVTCCVQRK FT DESVNEFTERFCQSAITYSGLADNSDSVLDDKGPLVRIWSDGLAAEYRKAL FT PFLNITWSSTTLRNNMNSLALWERDSYIKDRVRFEAAATKVNTEKTSNHKC FT LRKNICCHYCGKKGHWMRECRKNKKNFKEMNRVNQLLNSLLNALNIPLPSS FT LNPLSSPQRC*" XX SQ Sequence 3091 BP; 1048 A; 623 C; 547 G; 873 T; 0 other; acttggtgcc gtgagccgga tagagactga aattatcaaa ttctcaacaa tttgctacaa 60 aagactgaaa tctacctgaa attcaacttg aaccatcaag atcaaggaaa cttcttcaga 120 cggaatcttc agctcaacgg caatttgaag tcactcagcg gacgtcacaa attctcaact 180 atttgctgac tacaaccctg atgaacatta aagctacctg aatacaaaga tcagatcact 240 ccagactgaa tcttttcagc tcaacctcga tttggtgagt ttcactctat caaaagaacc 300 atacagacca gtctatacac atggttattc tctgctccat cagagaaaag ttaacaaaca 360 gctgcagggt ttgtttaaca aaaagctatc ctctatttca atagagaagt aacagtttat 420 cctctgttta gacagagaag taaagcctta cgtgctattt tcttttaaaa agttatcctc 480 tcttcagaga agttaagcct tatgcgctat tttttttctt ttatttcgtt ttccctcttt 540 tcaaccagag aagtttaagt ttagctatac aagctaattg atgttatctc ttgttcaggc 600 aggaagtttt agcaacatgt gctaatattg ttaccctctg tttcgccaga gaagaaatta 660 ctacactttc atctgaaaag ttttagtatt tttactttgt gataataagc tcacaaaatg 720 tttagacaga atacaagact gatcaataca actgaaaaag gtagtttaac aactccttca 780 ttgtcagatc aggatttcaa atcgctgcta cgacagcaca tggactcaat tacagagtca 840 gtcagactaa attcgacaaa gaaatttagc attgacccag atgatgtttg gacattagat 900 gaatgcaaga aagtgcttga tgcatgcatt cgcaaaaaca atgtaaaagt cattatctgc 960 tgctacagag aattcaattt atctcgagct aaacaacaag ctcaagctaa atcagaatta 1020 gaaaaacaga tccaagagct tcatgctaag ataacctcat taacaaaaca actgaaccta 1080 aagaaattaa aaactgacat tgaggaggtt agtcagcctg ataataagca tcctgactta 1140 caaattgtct ctgaaaaaga catttccatc cagtcagata gcgatttacc tataggctca 1200 gtagaggttt gtggtgcaac aagatctaag agaactgaga gttcacacaa caacacttct 1260 gttgttcaaa ttcaaacagt ttccagaaca ctaggaccta aagaaataga cagactgtct 1320 caaagcttac catcagcacg cacaaatttt tcagagttta gcagagcact aatcagcaaa 1380 atgcgtcttt atgacatgtc attaacagaa gtcacccagc taatgtctca aattctcact 1440 gaatctgaat tcaacagttt tgagcatact gtgacctcta aattacaaca tgccagtaag 1500 gatgatttga gagagggtgt tttgaaagct ctaaagaaca ttgttggccc aaagattgac 1560 tggtcaaaag tgacttgttg tgtgcaaagg aaagatgaat ctgtgaatga attcactgag 1620 agattttgtc aatccgccat aacttacagt ggattagctg ataattcgga cagtgtgcta 1680 gatgataagg gacccctagt ccgcatctgg tcagatggcc ttgcagctga atacagaaaa 1740 gctttgccat ttcttaacat cacctggtct tccaccactc tcagaaataa tatgaacagt 1800 ttagctttgt gggaaagaga ctcttacatc aaagacagag tcagatttga agcagccgcc 1860 actaaggtca acacagaaaa gacatcaaat cacaaatgtc ttaggaaaaa tatctgttgt 1920 cattactgtg gtaaaaaagg acattggatg agggagtgta gaaagaataa aaagaacttt 1980 aaagagatga acagggtaaa tcaacttctg aacagtttgc taaatgccct caacataccc 2040 ctcccctcat cactaaatcc attgagcagt ccacaaagat gctaaccatt ttttggctgt 2100 aagtgcttaa tgtcccagtt catttacaat taagatgaaa gactctttgt aaaaagacac 2160 aaaggaaact tcttcctttt tattggggaa ctctgcttaa gtccacttac agccttgaat 2220 gtgtttaaca cattttacaa tcactaccca ttcaatgtta tgatgtctga taggactgat 2280 atttattgaa tgatagtttg tgtcacacat tttaaagttt gcccaccaca ctcagggaca 2340 gaacaggaca cacaaaccag tgaggagccc agggcagaac tgaaagcaac aggcttcagg 2400 ggccgcaccc tgtggtgaag actccctttt cttttttgtt tttcttcccc catcctcacc 2460 tcactgtttt ataggtgtca taattttaag attaggagac agaaaataca caacacaata 2520 cgatcaccaa cttagcacaa ccactataca atcacctgaa taactagaca catgaactgt 2580 atgatcatac aagtctatgc atgaatactg ctggtctgct gtttccgtgt tttcttgtgt 2640 gcaggtaaac acaaacactg catcttctca gaaacactct gaaccagagc atcactacgg 2700 agatccatga atgacaggaa gtcatcagat caactctgtg ctggagaaga accacaagaa 2760 acacacctca caaaagactg aacactattc acaagccatg gactttcaga agatctgcag 2820 tttaactaca tgatttttgt gtcaccatgt aggatttatg acattacaac tgattgtcat 2880 aaacgtgtga acagttttga aatctacact tacatctacg tagatattga aatgacacag 2940 aataagttaa atgtgcctgt aacttttcaa gttacatgac tagaagatgg ggcttctgtt 3000 atcaaacaga agccattgaa tggttttagg aaagcttaca aatttatata tttttttgtg 3060 agagttttta aataactctc aaagggggaa c 3091 // ID MOSAT_DR repbase; DNA; ZEB; 2087 BP. XX AC . XX DT 22-OCT-2004 (Rel. 9.09, Created) DT 22-OCT-2004 (Rel. 9.09, Last updated, Version 1) XX DE Zebrafish mosaic satellite repeat - a consensus. XX KW SAT; Satellite; Simple Repeat; MOSAT_DR; mosaic repeat; KW tandem satellite repeat. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Gentles A., Kohany O. and Jurka J.; RT "MOSAT_DR: Zebrafish mosaic satellite repeat - a consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [1] (Consensus) XX CC Average similarity to consensus 94%. Approximately 2.1 kb unit CC which occurs in tandem repeats, and inverted repeats, and CC mixtures of both. XX SQ Sequence 2087 BP; 684 A; 476 C; 301 G; 626 T; 0 other; atcatgctag caacatgcta attcatgcta gaatcatgct agtaacatgc taattcatgc 60 tagaatcatg ctagtaacat gctaattcat gctagaatca tgctagtaac atgctaattc 120 atgctagaat catgctaaca acatgctaat tcatgctaga atcatgctag taacatgcta 180 attcatgcta gaatcatgct aacaacatgc taattcatgc tagaatcatg ctagtaacat 240 gctaattcat gctagaatca tgctaacaac atgctaattc atgctagaat catgctagta 300 acatgctaat tcatgctaga atcatgctaa caacatgcta attcatgcta gaatcatgct 360 agtaacatgc taattcatgc tagaatcatg ctaacaacat gctaattcat gctagaatca 420 tgctagtaac atgctaattc atgctagaat catgctaaac atgctaattc atgctagaat 480 catgctagta acatgctaat tcatgctaga atcatgctaa caacatgcta attcatgcta 540 gaatcatgct aataacatgc taattcatgc tagaatcatg ctagtaacat gctaattcat 600 gctagaatca tgctaacaac atgctaattc atgctagaat catgctagta acatgctaat 660 tcatgctaga atcatgctaa caacatgcta attcatgcta gaatcatgct agtaacatgc 720 taattcatgc tagaatcatg ctagtaacat gctaattcat gctagaatca tgctaacaac 780 atgctaattc atgctagaat catgctaata acatgctaat tcatgctaga atcatgctag 840 taacatgcta attcatgcta gaatcatgct agtaacatgc taattcatgc tagaatcatg 900 ctaacaacat gctaattcat gctagaatca tgctaacaac atgctaattc atgctagaat 960 catgctaata acatgctaat tcatgctaga atcatgctaa caacatgcta attcatgcta 1020 aaatcatgct agtaacatgc taattcatgc tagaatcatg ctaacaacat gctaattcac 1080 gctagaatca tgctaataac atgctaatct atctaatcta tctatctatc tatctatcta 1140 tctatctatc tatctatcta tctatctatc tatctatcta tctatctatc tatctatctt 1200 ttcatacttt ttaaaactgt ttaaactaaa taaactatta ccgtcaggct ttcacaagcc 1260 agcctcaaag tttgttctca aactttaaga atctagttat attattcttc ttctgagact 1320 aaaaattaaa ctctatctcg tcctaggcct ttcaagctat gaccatcaaa ctcgggtcag 1380 acctccgaac tattctgact cgagttgcta tatcttttcc gactgatccg actttcggtt 1440 ttccgaaaaa cgtcccggga ccgtcggaaa aatcccatag acgtaacatt ggatcaaact 1500 ttgtgacctc ataactccgc atcagaatgt cacacagact tctaactggg ctcatttaac 1560 tcagactatc aaactgccaa tgactgatca cctttaaact ttctggccac gccctagcaa 1620 ccacttttgg accctagaaa ccgtcccata gacttccatt gcaaaagact ctcattgact 1680 ttacatggga tcaaactttg tgagctcata actctgcatc agactgtcct acagactaat 1740 ggctgagctc atttaactca gtctagccag cagccaatca ctgatggcct tttaactttc 1800 tagccacacc ctaacaacca gatactgcac cctagcaact gtcccataga ctgccattaa 1860 agaggttcag actgatattg tcatgctgca gtgtgataga gacatggggg ttgtttttaa 1920 ctgttcatac tggcaagaag ctttgataca tcatcagtct aagctgtcaa tcaactccat 1980 agcaacaaaa cagactaacc tagcaacgat ataacaccag ctatatctca gcatcagaac 2040 atcgtagaga ggcgggggtt ggcttgtttg actcatgctc gcacacc 2087 // ID CATCH3LTR_DR repbase; DNA; ZEB; 343 BP. XX AC AL590152; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE Long terminal repeat of the CATCH3_DR endogenous retrovirus. XX KW BEL; LTR Retrotransposon; Transposable Element; KW BELL/CATCH superfamily; CATCH3I_DR; CATCH3LTR_DR; CATCH3_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-343 RA Kapitonov V.V. and Jurka J.; RT "CATCH3, an endogenous zebrafish retrovirus from the BEL/CATCH RT superfamily."; RL Repbase Reports 2(5), 5-5 (2002). XX DR Genbank; AL590152; Positions 52554 52896. XX CC CATCH3LTR_DR is a long terminal repeat of the CATCH3_DR CC endogenous CC retrovirus. See comments for CATCH3I_DR. XX SQ Sequence 343 BP; 108 A; 49 C; 81 G; 105 T; 0 other; tgtaagagcc attatgtttt gtaatgctgt ttttgaccac tgggtggagt gtgggtttgg 60 aggagtatat aggtagacaa agacttagaa caacaggtgt ataattagag ggaagcacaa 120 cattttttga ccgtgtttga aagagcttta ttgaggcatt caatgtgatt ttatgagcat 180 ttatatatca gactgtgagt ggaaattgaa ccataagaac tgtctgtgca aaaaataaat 240 ctgataagca gtatttggag gatgattcgt ttgttgagct ctcacgcgtc agaaacaacc 300 ctacaacacc tcaagtgaca aaatggtcat ttgtgtcact aca 343 // ID hAT-3_DR repbase; DNA; ZEB; 2920 BP. XX AC scaffold151; XX DT 30-SEP-2005 (Rel. 10.09, Created) DT 30-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE hAT-3_DR is a autonomous DNA transposon - a fossilized copy. XX KW hAT; DNA transposon; Transposable Element; Interspersed repeat; KW autonomous; transposase; hAT-3N1A_DR; hAT-3N1B_DR; hAT-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2920 RA Kapitonov V.V. and Jurka J.; RT "hAT-3_DR, a family of autonomous hAT DNA transposons from RT zebrafish."; RL Repbase Reports 5(9), 257-257 (2005). XX DR Zv5 assembly; scaffold151; Positions 20917 23836. XX CC hAT-3_DR is a autonomous DNA transposon that belongs to the hAT CC superfamily. The genome harbors only one copy of hAT-3_DR and CC several CC hundred copies of nonautonomous elements derived from hAT-3_DR CC (hAT-3N1A_DR and hAT-3N1B_DR subfamilies). XX FH Key Location/Qualifiers FT CDS 314..2248 FT /product="hAT-3_DRp" FT /translation="MDARGKKRKITEENRGFNVSWTESFAFVANAEGLPEC FT LLCSEKLSNNKKSNVERHFQGRHATFAAEYPVGSERKSAIALLVEKLEERK FT NKFKKWIASPNSTTAASFVATREIIKRGKPFTDGDYMKESFINISEHLFSD FT FKNKTEIIQKIKDMPLSAKTVKERAIKMAGNITEQQIKDINSAPAYSIACD FT ESCDVTDIEQTALLCRYVNSDGPQEEIIKLIPLKGQTRGEDICEAVLKCLN FT ENGINTNHLISVATDGAPSMRGSKRGFVTLLQKALDRNLLAFHCILHQEAL FT CAQTFPSECMVVMNLVIEMVNKIIAKALNHRQFRALLDEVDSEYSDLLLHN FT KVRWLSKDEVLRRFVACLEHVKTFLKSKDLIYPQLEDTEWLEKLHFMVDMT FT SHLNKLNESLQVRGNTALQMLEAVLSFERKLTVFARDVQRGTLSHFPSLRE FT FKESHQDHTINGDYLHNAIVDMQTAFESRFCEFRKEKMTLSFPVTPLEIDP FT SLLSTFPGIIQADLEMEMADISDKDLWVSKFKRLTAELEDVTRQKAQLAQS FT HKWSEMEGLPVPEKLIYDTWNALPDCYKNMKTYAFGVLSIFGSTYLCEQIF FT SNMNYIKSKYRTRLTHESLQSCVKIKVTSYMPDVEKLSSDVRKQKSH" XX SQ Sequence 2920 BP; 904 A; 551 C; 644 G; 821 T; 0 other; caggggtcgg caacccgcgg ctctagagcc gcatgcggct ctttagcgct gccctagtgg 60 ctccctggag ctttttcaat aaattttgaa aatgaaaaaa gatgtgggag gtaaatatag 120 tttttgtttt taatatggtt tctataggag gaaaagtttt ccaatgctgt aaaagtgtgt 180 agaataaata tttaatttca acatttctgt caacgaagat ttgcgtcatg gtctgcgaca 240 cacgtttcca tcagcagggc gggatgccag acaggtggct attgtaaata aaccggcggc 300 tgtgtgatgg acgatggatg caagaggaaa aaagagaaaa ataactgagg agaacagagg 360 atttaacgtt tcttggaccg aatcatttgc attcgttgcc aatgcggaag gtttgcctga 420 atgtttgctc tgtagtgaga agttgtcaaa taacaagaag agtaatgtgg aaagacattt 480 ccagggaagg catgctacat ttgcagccga gtacccagtt gggagtgaga gaaaaagtgc 540 gattgcatta cttgtggaga aattagagga gcgcaaaaat aaatttaaaa agtggattgc 600 atctccaaac tccactactg ctgcaagttt tgttgcaacc cgggagataa taaaacgtgg 660 aaaaccgttc acagatggtg actacatgaa ggagtcattc atcaacatat cagaacacct 720 tttttcagac ttcaaaaata aaaccgaaat aatacaaaaa attaaagaca tgcctctctc 780 cgctaagaca gtgaaggaaa gggccattaa aatggctggt aacatcaccg aacagcaaat 840 caaggacatt aattcggcgc cagcatactc aatagcatgt gatgagtcct gtgatgtaac 900 cgatattgaa cagactgcgc ttttatgcag gtatgtgaac tccgatgggc cgcaagaaga 960 aataattaaa ttaataccac tgaaaggcca aacacggggg gaggacatat gtgaggctgt 1020 gctgaaatgt ttaaatgaaa atggaataaa caccaaccac ctgatttcag tggctacaga 1080 tggggcaccc agcatgagag gatcaaaaag ggggtttgtg actttgctac agaaagcatt 1140 ggatcgaaat ctgcttgcat ttcactgtat cttgcatcaa gaggcactgt gtgcacaaac 1200 attcccatct gagtgtatgg tggtgatgaa ccttgtcatc gagatggtga acaagataat 1260 tgcaaaagca ttgaaccacc gtcagttccg tgcattgctt gatgaagttg acagcgaata 1320 ttccgatctc ctgctacaca acaaagtccg atggctctcc aaggatgagg ttttacgtcg 1380 ctttgtggct tgtttagaac atgtgaaaac attcctgaaa agcaaagacc tgatctaccc 1440 gcaattggaa gataccgagt ggctcgaaaa actgcacttt atggtggata tgacaagcca 1500 cctaaacaag ctgaatgaaa gtctccaggt gcggggaaac actgcactgc aaatgctgga 1560 agctgttttg tcattcgagc gcaaactgac tgtctttgcc agagatgtac agcgaggcac 1620 gctctctcac tttccctccc tgagagaatt taaagaatcc catcaagatc acacaataaa 1680 cggcgattat ttacataatg cgatcgttga tatgcaaact gcatttgaga gcagattttg 1740 cgagttcaga aaggaaaaaa tgacactgtc tttccctgtc acacctctgg agattgaccc 1800 ttccctgtta agcacattcc caggaattat tcaagctgac cttgaaatgg aaatggctga 1860 tatatccgac aaagatttat gggtgtctaa attcaaacgt ctgacagccg agcttgaaga 1920 cgtcactcgc cagaaagccc agcttgccca aagccacaaa tggagcgaaa tggaagggct 1980 cccggtaccc gagaaactta tctatgatac atggaatgct cttcctgatt gttataagaa 2040 catgaagaca tatgcatttg gagtattatc catctttgga tcaacatact tgtgcgagca 2100 gatattctca aacatgaact acatcaaatc aaaatatcgc acccgcctca cacatgagag 2160 cttgcagtcc tgcgtcaaga tcaaagttac atcttacatg cccgatgttg agaagctgtc 2220 cagtgatgtc cgaaaacaga agtcacatta aaagggtaag aatcctgtca taggcacgta 2280 tttagtggta gttataaagt gatttcttat ttttgtcagt atatatttgg aatgacactt 2340 aagggatatg attgtgtttt gataggtccg aggtagctgc gttggttgcg tgtcaaaaga 2400 gaagaggatg ctggatttca cccttgcact aacttgcttg gcatttgcac agaaatcaaa 2460 cttaaactat attaatttga ttttatatac tttttttaaa aggctgctgt tttatttttt 2520 tataaatgca gtctgcgttt tattgcaccg tttgttgctc tgataagggg cacgttatgc 2580 acgttccatt ttgatgttgt ttttgaaatc cttaaaataa aaattttatc aaaaatctaa 2640 tttctttatt acatttattt tattttatca aagcaatgaa acataatccc ttaatattgt 2700 aatgaagtta aacttgaggc tgcatcgtac aacagagtag tcacgtggtg cgtcattcta 2760 caggatgcac tgcagggaaa ataatagata catatggcat ttgttgcctt cattataagg 2820 cttaaataag gcttttaatt ttctgcggct ccagaaatat ttgttttttg tttttttggt 2880 ccaatatggc tctttcaaca ttttgggttg ccgacccctg 2920 // ID Gypsy156-I_DR repbase; DNA; ZEB; 6411 BP. XX AC chr5; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy156-I_DR; KW Gypsy156-LTR_DR; Gypsy156_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6411 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1573-1573 (2008). XX DR Genome; chr5; Positions 18361077 18354667. XX CC Positions [2960-3385] - Reverse transcriptase CC Positions [4928-5398] - Integrase core CC 'GAAA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 123..1814 FT /product="Gypsy156-I_DR_2p" FT /translation="MDLCREHDFDPEKSVLLTRVDPGTTLNDVRKAFGHID FT DVKKIKWIGDKFPGELLCEFQEVATPFVVEEEQLNGSEQYVWGVVPLDSRQ FT FKDVQHPPSISTPIQSGSVQTLDDMADQLLVQIENLAQMHCVNLSDLSQQV FT VNKLAHKTDNVDQGPITSTPAAKLFVDARPRVMSAEASMPVSKVSFANQDD FT SPPIKKNIGNETPVTFTIPQDVQRVVVEHVVKHDPEYQGHHLSPPKLRVFS FT GHCPKPNGEVDFTTWQLYVKQLLPDTSLTERHKRRHILDSLLPPALTVALG FT AGTNAPPEVYVLELEKAYGSVTGGDELYIQFIETHQNRGEKPSDYLRRLHA FT LLLEVIEKKGVIGKQSDTLLLKQFIRGCWDEALITQLRLKELLTEQTPSEL FT SYSSILFKLRSYEHEKQLKENRMNLHLNAPPKRAQSHVHVAVNQGSIASSE FT GVQNNEVLMRRIKELEAEVVKAKNQTSATVNQQGFDQLQASATKKRGKEVK FT LSRPVAQTQPKQLPISGFCYRCGEDSHYLQHCTNPVNAALVQQKLVQRHEQ FT RGSPNKMMMPTTHLNLA" FT CDS 2189..6361 FT /product="Gypsy156-I_DR_1p" FT /translation="MPLLVGTNVLRYLVRDFRMSQGKEYLQNMSASVNWMT FT AYHQCDKVLTPGKQPQYSTPVRLSGRKSVTISKEEKCEVTGVYRLKEFGLN FT KTMIIDEPLTHQIPGGLVVECKLIQAKLSARNKVKVVIRNVSDHSVTLQPK FT GVLAECSVVDWIKSVPLFDGTGNSQAKTPLMANILVKTEDPVTVDFGDSPI FT SEELKAHIMGRINREVSSAFAKHDLDVGHVSGATHRIELMEHVPFKERTRR FT VSPADFDDLKKHLLDLLASEIIEESNSPYASPVVLVRKKNGDLRMVVDYRK FT LNKLTKRDAYPLPRIEETFTLLSGSKWFSVLDLKSGYYQLEVEESDRPKTA FT FTTPFGNWQFRRLPQGLTNSPAMFQRTMEKVMAGLNLQEVIAFLDDLIIFS FT DTLEQHEERLMKVLQRIATFGLKLAPSKCKIFQTSVKYLGHVISAQGIHPD FT PDKISAVKEWPIPKTVRDLRSFLGFAGYYRRFVEGYSKIVKPLNVLLQGEF FT STRRSSSTSYRVRSKLQSLAGRWDDECQIAFDTVIQKLTTAPTLGFADWKL FT PYIVHTDASVNGLGAALYQVQDGKTKVIAYASRGLSRSEKNYPVHKLEYLA FT LKWAVCEKFHDFLYGAKFTVLTDNNPLVYVLTTAKLDAAGHRWLAALSMYD FT FEIKYRAGKTNVDADGLSRRPQEPLGDDAETIMIDERIESLLSKAECAAVE FT FEKFGQEEIKAVCMRQSVFCGMFSSQPGEGDLVEARATSTPAVEMLLCDED FT VVPDDLIDPECLPGQTALPSMTRTDWYQLQREDVALRKVIGLLESGNNLSH FT VERLNESREVNLMLRERHKLRLVDGVLHRMVCDQYGQKYGQLVVPSSFRDR FT ALEGIHDETGHMGYERTLELARARFYWPKMAEYVERKCRTCERCIRRKARA FT QRGAELVNIKVYAPLELVCIDYLSLEPDSSDTRNILVITDYFTKFSWAFPT FT KDQTAKTVASVLWENLICHFGFPKRIHSDQGANFESELVKELCTLAGVKSR FT TTPYHPRGNPVERFNRTLLDLLGTLSDKKKEHWRKYVRPLVHAYNCTRNDA FT TGESPFLLMFGRQPRLPIDLCFGINPKGYNSKTHTHYVTELKQRLRYAYKL FT AVQNAEKRQLMNKARWDKKVTAASVEVGDRVLIKNVNIRRKHKIADRWEST FT VYVVVKQPNADIPVYVVRPENGDGSERILHRDLLLPCGFLPVSPTVADEIL FT TVPENCSSRVNTDDLRGDAGVDNFGEILEAGSSKSLNPHAPEFNSVLQNVP FT ELSRTLSVSNRQDDEMNTLSVTSVENDIENSCETEIQQDVSSLEEDDDVAV FT QPDELSVEERDCVVGLRRSKREKKPPNKLNDYVGWKAQCKRQGVTLENQSS FT QLSQVLEMMQQQMRVQLTQSELLFSLIG" XX SQ Sequence 6411 BP; 1892 A; 1196 C; 1579 G; 1744 T; 0 other; gagaagagag agaaaaattt ggcgcccaac gtggatggca ttgattttct attaatctta 60 atcctaactg ctctggatta ttattttttt tcatattgag tgattttttc tccacattta 120 aaatggatct atgcagggaa catgattttg atcctgaaaa gtctgtactg cttacacgag 180 tagatccagg cacaacttta aatgacgtca gaaaagcctt tggccacatt gatgatgtga 240 agaaaatcaa gtggataggt gacaaattcc caggggagtt gctttgtgaa tttcaggagg 300 tcgctactcc attcgttgtt gaagaggaac aattgaatgg tagtgagcag tatgtatggg 360 gtgttgttcc actggatagc agacagttta aagatgtgca acatccacct agtatttcaa 420 caccaatcca gagtggttca gtgcagactt tagatgatat ggctgaccag ttgttggtgc 480 aaatagagaa tttggcccaa atgcattgcg taaatttgag tgatttgagc caacaggtgg 540 tgaataagct tgcgcataag actgataatg ttgatcaagg tcctatcact tcgacgcctg 600 cagcaaaact gtttgttgat gctaggccac gtgtaatgtc cgctgaagct tcaatgccgg 660 tttcaaaagt gtcatttgca aaccaagacg attcaccgcc tataaaaaag aacattggta 720 acgaaactcc agtaactttt actataccac aagacgtaca gcgtgtggta gtggaacatg 780 tggtgaaaca tgatccagaa tatcagggac atcacctctc gccccccaaa ttacgagtgt 840 tttctggaca ttgccccaag cctaatgggg aagttgattt caccacatgg caattgtatg 900 ttaaacaatt actcccagac acttctctga cagaaaggca taagcgacga cacattctgg 960 atagtctcct ccctcctgct cttactgttg ctcttggggc tggtacaaat gcaccaccag 1020 aagtgtatgt gctggagctt gagaaagctt atggaagtgt cactggtggt gatgagttgt 1080 atattcagtt catagagaca caccagaaca ggggtgaaaa gccatctgat tatttgagac 1140 gacttcatgc attgctgctt gaggtgattg aaaagaaggg agtgattggc aaacaatctg 1200 acactttact tctcaaacag ttcattagag ggtgctggga tgaagccctt ataacccagt 1260 tgcgtcttaa ggagttgtta acagaacaaa caccatctga attgagttat tcgagcatac 1320 tcttcaaact aaggtcatac gaacatgaaa agcagctcaa ggagaaccgt atgaatctgc 1380 atttgaatgc gcctccaaaa agagctcaga gtcacgtgca tgttgccgta aaccaaggaa 1440 gtatagctag ttcggagggg gtacaaaata atgaggtgtt gatgagacgc ataaaagagc 1500 ttgaagctga ggtagttaaa gctaaaaacc agacgtctgc tactgtgaac caacaggggt 1560 ttgatcaact tcaggctagt gcaactaaga aaagaggtaa agaagtgaag ctctcacgtc 1620 ctgtagcaca aacacaacct aagcaattgc ctatcagtgg attttgttac cgctgtggtg 1680 aggactctca ttatttacaa cactgcacca atcctgtgaa tgctgctttg gtacagcaaa 1740 aattagtgca gaggcatgaa cagaggggtt ctcccaacaa aatgatgatg cccacaacac 1800 atttaaactt ggcatagctt ctgttgtgga acggacagaa gctgagttgg agaaaagttc 1860 cacagaaata agtaaccacg ttccagaggg tctaattggc gaaccatggg aaagtgaagc 1920 atggttggat ggagttcagt gtctttgttt agtagataca ggatcccagg tgacctgtgt 1980 agctgaatca ttctacaagc agtacttgcg gcatcgaaac ctttctccca tagggaatct 2040 acttagagtt gagggagctg ctggacaaat agttccgtac attgggtaca ttgaaataga 2100 catacagttt ccagaatctg cttgtggaac agatgccgtt gttacagcat tagcattggt 2160 ttgccctgat caggtttata atgcacgaat gcccttgcta gtggggacaa atgtactccg 2220 ctatcttgtt cgagatttta ggatgtcaca gggaaaggag tacctacaga acatgtctgc 2280 cagtgtcaat tggatgactg cttatcacca gtgtgacaaa gtattgacac ccggcaaaca 2340 gccacaatat tctactccag ttagactgtc tggtagaaag tctgtaacaa tcagtaaaga 2400 ggaaaagtgt gaagtaacag gagtatatag attgaaggaa tttggtttga acaagacaat 2460 gattattgac gagcctttga cgcatcaaat tcctggtggt ttggtggttg agtgcaaatt 2520 gatacaagca aagctcagtg ctcggaataa agttaaagta gtcattcgta atgtttcaga 2580 tcacagtgtg actttacagc caaagggagt gctagctgag tgttcagtgg tagactggat 2640 taaatctgtg cccctttttg atggtacagg caattcacaa gccaagacac ccctgatggc 2700 aaatatctta gtgaagacag aagatcctgt tactgtggat tttggggatt ctcctatatc 2760 tgaggagtta aaagctcaca tcatggggag aataaacaga gaagtgtcaa gtgcatttgc 2820 gaaacatgat cttgatgtag ggcatgtttc gggtgcaacc caccgaatag agttgatgga 2880 acatgttcct ttcaaagagc gtactcgtcg agtctcacct gctgactttg atgatttgaa 2940 gaagcatttg ctagatttgc tagctagtga gattatagag gaatcaaaca gtccttatgc 3000 ttccccagtt gtactggtga gaaagaaaaa tggagattta agaatggtgg ttgactatcg 3060 caagctaaat aaactgacaa aaagagatgc gtatcctctc ccaaggatcg aggagacgtt 3120 tactttgttg tcaggctcaa agtggttctc ggttttagat ttaaagagtg gatactacca 3180 gttggaggta gaggaaagtg accgtcccaa aacagcattc actacgccat ttggtaactg 3240 gcagttcagg cgattaccac aagggctgac aaattctcct gcaatgtttc aaaggactat 3300 ggagaaagtc atggcaggtt taaatttgca ggaagttatt gcttttctcg atgacttaat 3360 aattttttca gatacactgg aacagcatga agaacggctc atgaaagtcc ttcagcggat 3420 tgcaactttt ggattaaagt tggcaccttc caagtgtaaa atttttcaaa cctccgtcaa 3480 gtacttaggg catgttattt ctgctcaagg tatacaccca gatccagaca agatctctgc 3540 cgtgaaggaa tggcccattc cgaaaacagt cagggatttg aggtcattcc ttggttttgc 3600 tggatattac agacgctttg tcgagggtta ttccaagata gttaagcctc tgaatgtgtt 3660 attgcaaggt gaattctcca ccaggaggtc atcatccaca tcttaccgtg ttaggagtaa 3720 gttgcaatct ttagctggaa gatgggatga tgagtgtcaa attgcatttg acactgttat 3780 tcagaagctt acaactgcac cgactctggg ctttgcagat tggaagttac cctatatcgt 3840 tcacacagat gccagcgtaa atggtctggg tgctgcctta tatcaagtac aagatggtaa 3900 gaccaaagtg atcgcctatg ccagtcgggg tttatcgagg agtgagaaaa actaccctgt 3960 acacaaacta gagtatttag ctcttaaatg ggctgtgtgc gaaaaatttc atgacttttt 4020 gtacggtgca aagtttactg tattaactga caacaacccg ctagtgtatg tgcttactac 4080 agctaaattg gacgcagctg gtcataggtg gctcgcagct ctgtcaatgt atgatttcga 4140 aatcaagtat cgagcaggca aaactaatgt tgacgcagac ggactttcac ggcgtccaca 4200 agagccactc ggtgatgatg ccgaaaccat catgatagat gaacgaattg aaagcttgct 4260 gagcaaagca gaatgtgccg ctgttgagtt tgagaagttt ggacaggagg agattaaggc 4320 tgtgtgtatg agacaatctg tattttgtgg aatgttttct tcacagccag gcgaagggga 4380 cctggtggaa gccagggcta cctcaacccc tgctgtggag atgctactgt gtgacgagga 4440 tgtagtgcca gatgacctga tagatccaga atgcttgcct ggacagactg ctctaccgag 4500 tatgactcgg acagattggt atcagttaca acgagaggat gttgccctca gaaaggtgat 4560 tggtcttttg gagtctggga acaatctgtc tcatgtggaa agactcaatg agtctaggga 4620 agttaattta atgttaaggg aaaggcacaa actgaggtta gtagatgggg tcttgcaccg 4680 tatggtatgt gatcagtatg ggcaaaagta tgggcagctg gtagtcccct ctagtttcag 4740 agacagagct cttgaaggca ttcatgatga aactgggcat atggggtatg aaagaacttt 4800 ggagcttgcc cgagccagat tttattggcc taagatggca gagtatgtgg aaagaaagtg 4860 cagaacttgc gagcgttgta tcaggcgaaa ggcccgtgcc caaaggggtg ctgagctagt 4920 aaacatcaag gtttatgctc ctttagagtt agtctgtatt gattaccttt cactagagcc 4980 agactccagt gacacccgta acatcttggt gatcacagat tattttacta agttttcgtg 5040 ggcatttccc actaaagatc aaactgccaa gactgtggct tctgtgttgt gggaaaattt 5100 aatttgtcat tttggtttcc ctaaaagaat acacagtgat cagggagcca attttgagtc 5160 tgaactggta aaagaattgt gtactctagc tggtgtcaaa tcccgcacaa caccttatca 5220 ccctagggga aacccagtag aaaggtttaa ccgaaccctt ctggatttat tgggaacctt 5280 gagtgataag aagaaagaac attggagaaa atatgtacgt ccactggtgc atgcgtacaa 5340 ctgtacgaga aatgacgcga ctggtgagag tccttttctt ttaatgtttg gacgtcaacc 5400 tcgtcttcca atagacctct gttttgggat taaccccaag gggtataatt ccaaaactca 5460 cactcattat gtcactgaac ttaagcaaag actacggtat gcttataaat tagcggtcca 5520 aaatgcagag aaaaggcaat tgatgaataa agctcgatgg gataagaaag tcacagccgc 5580 ttcagttgaa gtgggcgatc gtgttctcat taagaacgtc aacattcgaa gaaagcataa 5640 aatagcagat cgctgggagt ccactgtata tgtggttgtc aagcagccaa atgcagacat 5700 tccggtgtat gtggttcggc ccgagaatgg tgatggctct gagcgtatcc tccacagaga 5760 tcttttgctc ccttgtgggt ttcttcccgt cagtcctact gtagctgatg aaatactcac 5820 tgtgccagaa aattgctctt ctagagtgaa tacggatgat ttgagagggg atgcaggggt 5880 cgataatttt ggtgaaattt tggaggctgg aagttcaaag tctttgaatc ctcatgctcc 5940 agaattcaat tcagtcttgc aaaacgttcc tgagctttct aggactctta gtgtttcaaa 6000 caggcaagat gatgaaatga acacattgag tgttacatct gttgagaacg acattgagaa 6060 tagttgcgag actgaaattc agcaagatgt ttcctccttg gaggaagatg atgatgtggc 6120 agtacagcct gatgagttga gtgttgaaga gcgtgattgt gttgttggct taagaaggtc 6180 taagagagaa aagaaacccc caaataaatt aaatgattac gttgggtgga aagcacaatg 6240 caagaggcag ggtgttactc tggaaaatca atccagtcaa ttaagtcaag tacttgagat 6300 gatgcaacaa cagatgcgag tgcagctaac acaaagcgaa ttgttgttta gccttatagg 6360 ttagcttatt aagtgcctga tgaggacatc agagttttga gcaggggagg a 6411 // ID Tc1-7_DR repbase; DNA; ZEB; 1646 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 27-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc1-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1646 RA Jurka J.; RT "Mariner-type families from zebrafish."; RL Repbase Reports 8(9), 936-936 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1646 BP; 551 A; 318 C; 366 G; 411 T; 0 other; cagttaggtc cataaatatt gggacattga cacaattcta acatttttgg ctctatacac 60 caacacaatg gatttgaaac gaaatgaaca agatgtgctt taactgcaga ctgtcagctt 120 taatttgagg gtatttacat ccaaatcagg tgaacggtgt aggaattaca acagtttgca 180 tatgtgcctc ccacttgtta agggtccaaa agtaattgga caattggctt ctaagctgtt 240 ccatggccag gtgtgtgtta ttccctcatt atcccaatta caatgagcag ataaaaggtc 300 cagagttcat ttcaagtgtg ctatttgcat ttggaatctg ttactgtcaa ctctcaagat 360 gagatccaaa gagctgtcac tatcagtcaa gcaagccatc attaggctga aaaaaaaaaa 420 caaaaaaaca aacccatcag agagatagca aaaacattag gcatggccaa aacaactgtt 480 tggaacattc ttaaaaagaa agaacgcacc agtgagctca gcaacaccaa aagacccgga 540 agaccacgga aaacaactgt ggtggatgac cgaagaattc tttccctggt gaagaaaaca 600 cccttcacaa cagttggcca gatcaagaac actctccagg aggtaggtgt atgtgtgtca 660 gagtcaacaa tcaagagaag acttacacca gagtgaatac agagggttca ccacaagatg 720 taaaccattg gtgagcctca aaaacaggaa ggccagatta gagttctaaa aaagccttca 780 cagtgctgga acaacatcct atggacagat gagatcaaga tcaacttgta ccagagtgat 840 gggaagagaa gagtatggag aaggaaagga actgctcatg atcctaagca gtgaagcatg 900 gtggtagtag tgacatggcg tgggcatgta tggctgtcaa tggaactggt tctcttgtat 960 ttattgatga tgtgactgct gacaaaagca gcaggatgaa ttctgaagtg tttcaggcaa 1020 tattttctgc tcatattaag ccaaacgctt cagaactcat tggacgagcg cttcacagtg 1080 cagatggaga atgacccaaa gcatactgca aaagcaacca aagagttttt gaagggaaag 1140 aagtggaatg ttatgcaatg gccaagtcaa tcacctgacc tgaatccgat tgagcatgca 1200 tttcacttgc tgaagacaaa actgaaggga aaatgcccca agatgcccca agaacaagca 1260 ggaactgaag acagttgctg tagaggcctg gcagagcacc accaggaatg aaacccagcg 1320 tctggtgatg tctatgcgtt ccagacttca ggctgtaatt gactgcaaag gatttgcaac 1380 caagtattaa aaagtgaaag tttgatttat gattattatt ctgtccaatt acttttggtc 1440 ccttaacaag tgggaggcac atatgcaaac tgttgtaatt cctacagcgt tcacctgatt 1500 tggatgtaaa taccctcaaa ttaaagctga cagtctgcag ttaaagcaca tcttggtatt 1560 cgtttcattt caaatccatt gtgttggtgt atagagccaa aaatgttaga attgtgtcga 1620 tgtccaaata tttatggacc taactg 1646 // ID hAT-N54_DR repbase; DNA; ZEB; 1689 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N54_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1689 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1323-1323 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1689 BP; 482 A; 346 C; 309 G; 551 T; 1 other; tagggatgca cgatatatcg gcggccatat cgttatcggc cgataaatgc tatttttaag 60 attatcgtta tcgatccgat gtctaaatta ggccgatatc tttaagccga taaattacgt 120 aattgtacag acctgcctgc cgcgcacgcg tgggcggaac atgttcagac ttgtccagtg 180 cactataatg gatctctcct cacactgttt attccttcaa agtccgtcct ttcttcatgt 240 ggtattgctg tgcgttaata actcagtaag taaccatgtt ccttatcatt gtagatatgt 300 ttgattgagt gtcattgtgt attttgcttt aaatcgcctc aggaaaaaat attacatgtg 360 taaacctttc ctgctctgtc tctttgttgt gttagagata gcaggggaga ttcacttatt 420 actccagggc cgaataatat tagtttcggg gtggacacat gtaacttggc ccaatcacca 480 aagagggaac atcactaaaa tggtctcatt gcgaaaaaaa acgctcctct tttctgtctc 540 tccctcttcg cttcacacac acacacacaa acacacacca aactgaccag cacacatgac 600 gtgctaaaca aaaacttgtc gggttgtttg taatctaata tgattatttg ttattatcaa 660 cgcgttgctt gccatgtgtt gttgatgtaa gattgtattc agtttgatcg gaatattcat 720 attgattata acgagcgcgc gtgagagctg tcaccgcaca cacacacaca cacagatctg 780 acagccttga gccttcacag atcaacaggg gagagcctag gtgcaccggt cactgaatcc 840 agcatacggg ccacacattc tgtctttata ttctgctctg tgtttgtcat aaagtcatct 900 gagctcaatg ctcaggtttt gtatcaaaat ctgactccga gggcaaatct gcgccctcag 960 ttgagtgaaa ctcacagcgg tttatcataa acttttcatc gatcattttc cccgcgattg 1020 acagcaagaa cgaacacaga agcgttgtct gattgacaga scagcaggac ctacatgtgt 1080 tcaaaagaat ctaaaacgcc aaataggatg aatctatgcc gcattttcta aaataataca 1140 ttctgtatct agcttttcgc ttgtacggtg gctctgaact gtcaaaacac ctctaaaaaa 1200 ggctttttcg gacatgacat atttgtacac gtgtacatca ctgtaacacg ttttattcaa 1260 gaacatttga gctggtttca gtccttagct ctttcatttt catcttattt aaaatatatt 1320 taacttgctt gttgattaaa tcccattttg ttatttaacc tatttaattt tatgcaacag 1380 tcaagttgca tgttaatttg ctatgaagaa aaggaaataa tgtatttttt gcatgctgat 1440 ttatgtgaaa tcgtgaatat aggtctatat aatcaccttg caattttgaa attatagctt 1500 ttttgctttg agtatagact attcagttca taagagcagt tcaatctttt atgaagtttg 1560 ctgtataaaa atataacatt ttgaaatatt tataattatc ggcctatata tatcggctat 1620 cggcctccaa gtctgaagaa ttatcggtta tcggtatcgg ccaaaaaaat ccatatcggt 1680 gcatcccta 1689 // ID hAT-N39_DR repbase; DNA; ZEB; 1594 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N39_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1594 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1309-1309 (2008). XX DR [1] (Consensus) XX CC Copies are ~97% identical to the consensus. TSD is 8 bp long, and CC TIR is 37 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1594 BP; 466 A; 338 C; 242 G; 547 T; 1 other; caggcccgga ttaagaactc aggggcccct gggcacattg tcttgtaagg ccccctacct 60 ggccgcaccc ctatagttga atttaaaaaa taagaataat ataataattt ttaaataaaa 120 tgaatctata atttgttgcc cacatattta aataaatgat actatatagt acatatgtat 180 ttatagtcta caaagattta aattattttt caaagcatta aataaaaata ctaataaaac 240 tacaaaaaaa tatatatata tatatatata tatatattat atatatatat atatatatat 300 atatatatat atatatatat atatatatat atatatatat atatatwtat tttttttttt 360 ttttaagggt attttaatgt attgctttta caaacttata aagcatatta ttgccatggc 420 atataacgca caatgcttct ccaatccagt tctatgaatc tgagttctat gaatctgagt 480 ctttcataat acaaacactg atttactgtc tctctcctcc tctccctgta ttttctctac 540 tcttctcgtg atgaagactt gattataaac ctaaaagtat ccgaaatcac agacactata 600 cctcttctct cccgtctctc tcctcttctg ggcctttctc tctctctccc tctctctctc 660 cctctctctc tccttttctg tctgtctctc acacgatgaa tccagtccgc gctgcgctgc 720 cgttagcctc ctccgcctgg ttaatgctag cttctcatca tttaaagtta ccataacgtc 780 ttcttctgta ccctcattct cctgatcatg ggtctgttta aagatggtgt gtatggaaaa 840 tgcctcaata aagatgcctc ctctcctctt tctctcgcgt ttgctaaatc cacttgtcca 900 ctcatttttg atctatgaca gatataacgt tacactaaag tccgctcagt ttagtgcggg 960 ttattacaaa actgacgttt ccgcgcttga ccaattacag cattctgttt agttaaagaa 1020 agaaaggcaa tttaaatata ataataatat taatatgtga tcgcgatttt tttgctcaga 1080 ggaaatacag ttgtctgagc aatatagttt ttgcagagag ggaggcgcgg caggcgccta 1140 taataaggct aaacattatt aaatacccac gagacacttg tgactttata tcacatttgc 1200 attatttttt aaaaaaatgt ttctttcctg ttaaaaataa atatatttcc aatctgatat 1260 gtaatgggga gaaaaaaaaa gtagcacgag ttcagctgat ggtggattcg aaccgatgtt 1320 atgattaatc gcgtcaaaag atgatgctgt gtgccttaca aggtgcgcca ctgcggctgt 1380 attatttcac agctctttac tcatcttgtc tctatcaatc aggcatcgat gggcgtaaac 1440 cctgaatagc ttattccaac tagatgcgct atttgcaatt agccaaaaaa gacactccga 1500 aattgtcttt tttttctccc ccctcgcagg ggcccctagt gcgcacgggc ccctgggcct 1560 gtgcccataa ggcccattgg ttaatccggc cctg 1594 // ID Gypsy-28-LTR_DR repbase; DNA; ZEB; 605 BP. XX AC . XX DT 31-MAY-2005 (Rel. 10.05, Created) DT 31-MAY-2005 (Rel. 10.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-28_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; endogenous retrovirus; Gypsy-28_DR; KW Gypsy-28-I_DR; Gypsy-28-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-605 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-28_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(5), 119-119 (2005). XX DR [1] (Consensus) XX CC Gypsy-28-LTR_DR is a long terminal repeat of the Gypsy-28_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-28-I_DR. XX SQ Sequence 605 BP; 92 A; 137 C; 143 G; 233 T; 0 other; tgtcacggtt gtggatctat ctgctcttca tttgaagtgt ctgcgttgtg tgacagttca 60 ttatttatgg tatgtgtttc taagtgtgtg tgcttctctc tctgtgcttg tgtgggactc 120 gcgtgcctgt cagtgggtgt ttgttttcgt ggcatcagct gagcggtcac atgatccttg 180 ttaccggaac tattgcgtgt caccgctcgg ctgttaatca gcgcgctaca ggtgtagcgc 240 atttttgtgc ctataaatgt tcagcgttcc tgtctcactt tgtcagttcg ttgtatgctc 300 acggtgtttt tgtcttttgt caggatcctc tcgctgattc tcgctggtgg tgagtggctc 360 ggttgcttcc tcgtttcact ttcggatgct tccggttcct ttgatgatga ctgtttgatt 420 cactctgttg acgagtgtca tgaactttct atcggcgctg ccgaccaaga tcgctactgt 480 tattcctgct aaaagactgt gtacttccag ccactcacca ttttctctgt ctttttcttc 540 tgtgatcttc attaaataac ttttacttgc aattgggttc cacgtcctga ttattgagcg 600 tcaca 605 // ID ERV1-N4-I_DR repbase; DNA; ZEB; 3793 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, internal DE portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N4-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3793 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1781-1781 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 301..1761 FT /product="ERV1-N4-I_DR_1p" FT /translation="MSSXMEWNTYNEYDLFTLWTENVEPVLYKLTRKGLDL FT DATSEDISIWWKTVGIKQKKCKHKEIIEALNYLNASSIRNNLKKVFANLKE FT KTEQLVTADSEAQKSQEKIEHLQSQLDKLLEDQXALAKRCEKYMDIIEDLK FT NKLIVQKLMPVRNTKANKHMTSESEXADDISLQKVPVSALKVMKHPNGTET FT NIQRALTVHECNSFKAEIGPFPLKGPFQPWWRRVVRHKYAFQLEPKDIWQI FT VXTSIPNVLIYKAPQVLIEGTIIQPIHPNESESEILNRLKLCLCDLRGKSP FT AEWELIYNRKQNNAESFETYAENLFELFEEYSGIDDVTRNNPILLGLLVEH FT AGPHVQKALMAGASPPKNTFEDIVNWGTKVESRRQMHSELAASGCHEDVKA FT IRQSSKIPCKKQDRFCCSCLKMGHTEQKCWSLNRGRPPPYFLRRRQSQSKD FT TEQHSRSSQNSTSSKTITELTNMAMQGLQLLASFASGLTAQ*" XX SQ Sequence 3793 BP; 1202 A; 744 C; 641 G; 1192 T; 14 other; atttggcgaa cccgacgcga tataattttg gcgagtttta atcgcaagat actttttgta 60 tcaacatcta ctttgcttat cctggaaatg gattatttta ctcttggaca cacgatggtc 120 atcagaatga gaactcttct gagtggatta aagctgaact tctaaccaca gagtgggtaa 180 gtctgttttc attgcttcat ggcwtggtag ggaaagtctc catttgcatg aattgtrtga 240 aacaggaatt agaagtawca gaataatcca agtaatcatc tcactgtgaa atccaagtaa 300 atgtcctcca awatggagtg gaatacttat aacgaatatg atctgtttac attatggact 360 gaaaatgttg agcctgtact gtacaaactc actagaaaag gccttgattt ggacgctact 420 agtgaggaca tttccatatg gtggaaaaca gtgggcatca aacaaaagaa atgtaaacac 480 aaagaaatca ttgaggcttt aaattacctg aatgcctcta gcatacgcaa taacctaaaa 540 aaggtgtttg caaatttaaa ggagaaaact gaacaattgg tcacagcaga ttctgaagca 600 caaaagtctc aggaaaaaat tgaacatctc caatcacagc tagacaaact actagaagat 660 caaktagcmt tggctaaaag atgtgagaaa tacatggaca taattgaaga tttgaaaaac 720 aaattgattg tgcaaaaact aatgcctgtc agaaacacaa aagctaacaa gcatatgact 780 tcagaatcag agraagctga tgatatatct ctgcaaaagg tgcctgtttc agctttaaag 840 gttatgaaac acccaaacgg cacagagaca aacattcaac gtgctttaac agtacatgaa 900 tgtaactcgt ttaaagcaga gattggacca ttccctctta aagggccctt ccaaccatgg 960 tggaggcgag ttgttaggca caaatatgca tttcaactcg aaccaaaaga catttggcaa 1020 attgttttma cctcaattcc aaatgttctg atctataagg ccccacaagt cttaattgaa 1080 ggcacaatta tacagccaat acaccccaat gagtctgaga gtgagatctt gaacagacta 1140 aaactctgcc tttgtgacct gcgtggtaaa agccctgctg aatgggagtt gatttacaac 1200 cggaaacaga ataatgcaga atcgtttgaa acatatgcag aaaatttgtt tgaacttttt 1260 gaagaatatt ctggcataga tgatgttaca cgcaacaatc caattctttt gggattactg 1320 gttgaacatg cagggcccca cgttcagaaa gcgctcatgg ctggtgctag tcctccaaaa 1380 aacacatttg aagacattgt gaactgggga acaaaggtcg agtcacgcag acaaatgcat 1440 agtgaattag cagcatcagg gtgtcatgaa gatgttaaag ctattagaca atctagcaaa 1500 ataccttgca agaaacaaga tcgattctgt tgttcctgtt taaaaatggg ccacactgaa 1560 cagaagtgtt ggtctctgaa cagaggcaga cctccgccct atttcctgag acgtagacag 1620 tctcaatcta aggacacaga acaacactct agaagcagcc aaaattcaac ttcaagtaaa 1680 actattactg agttgacaaa tatggctatg caaggtctcc agctattagc gtcatttgct 1740 agtgggctaa cagctcaata gaaataacct aataatatta caccctgact gcactgatcc 1800 tttttgcatt ttcaaacctg tgattgtgtt aaacatgcag ccttcctcat aaacacacaa 1860 tagttgtttg tactactaca tgtatacata gactactaga attgttaaac aatatctagt 1920 gaactgtgtt aatgttggat ttgattgtct cctataagtg gaccgctctg ctattttcca 1980 tatagctgca gctgaaccat aaaacaggag cacaaaactt gccacctggc tacagtgtta 2040 ttcacaattg cattaacata aaaatgcttt ccttccagca ggattcagta tcartgcact 2100 tgtccttttt accatgctct aataacaaac caaggtgtgg atctttgagc ccgatcacaa 2160 ccccatctta catttcaagg gcttcctgtt aaaacaacag gaccaaattt gcatcacaat 2220 tgtcaatagt aacttttgtt acctttggtc ggagcatggt cacaccaaca cattcacaca 2280 gttaaataca atcacataaa caayaccatt tgtctctcta cacatagtca amttaaactc 2340 ttctttttaa ctaaaatatt ttcatattgg aaagccctac atcagctgtg tgctttccaa 2400 taaggaaaat ttccattcac tttttgttca tctaaaccat tttgcttttt acctttttga 2460 atctttttta tgctctcaca gcaatttttg ttttcaataa ctggactttc attctggaca 2520 ataatggact ttcatgttat ttcttgattt gagacatttt aaccttttta gatcactttt 2580 tatttttaca gttaagtcat caccaaggca attttttgaa atttctgaat ttgatcacat 2640 taattatatt atactcttac tataacagca agtcctgaaa tcaaccaaac tttttataaa 2700 gacaaatatt ttcagaaaca aattaacggt gtcttttatg ttttttagat gatgctccta 2760 ctatcagaga ggggtttgaa ttcattactg gtggactttt tcaaagacac atacattttt 2820 ccagaaacat taaaggtccc atttttgttt ttacaccata ttcatttata ttgttggtgc 2880 tcctactatc agagaggggt ttggaatcaa catatattac aatattttaa tctttccaaa 2940 ggtctcattt atttgtcttg tttttcacaa ctttttcast tactcttaaa tcctgtatat 3000 caaagcatat actttatctt tttaacataa accgatttaa tttggccact tcataatcaa 3060 ccaaattgtt tttattttat ttttatttgc tttctttcta aacaccagaa agattataat 3120 ccctatcatg ttgttacttt cccttcatta tattctgagc aaacattaca ttatatgtca 3180 tttattttgt cttcggacat tgagttttca tttgaaatgc cattatattg atatttgaac 3240 tactacatgc ttatattgcc atataacctc tatgacatat gccagttttg cagatatgtt 3300 ttttcttctc attgccatgg ttattctttt gctgttctgt caccagttat acatcgtcaa 3360 tgaatgcctc tttcagtttt catgggttgc tctcctttgc aaatcatttt ctgtgcatga 3420 aggtaaaggc agcttcagtg tggaacacaa acactggaga attgaagacg ccctccacaa 3480 gatatactaa atcccacaga taacagagwt trcttggctt gctaagccag agacggacat 3540 acatcttgtg aaatggacat taggatccag acaagacatc ctgacaagtg tccccatgtg 3600 atcatctgac tatgcctaca cctgacagga aaaggtgttc tgtgtgcata tcttctgagc 3660 gtcattgaca atgatcatca tcagtctgca ctactctgcc aaaggagaca ctggattact 3720 ctactcagct ttaatttgag gtcacatgtg gaccttttat aatttctgag gtccaatgaa 3780 cctgagaggg gtt 3793 // ID Helitron-N2_DR repbase; DNA; ZEB; 2463 BP. XX AC . XX DT 01-OCT-2008 (Rel. 13.09, Created) DT 01-OCT-2008 (Rel. 13.09, Last updated, Version 1) XX DE non-autonomous Helitrons from Zebrafish - consensus. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Helitron-N1_DR; TIR; Helitron-N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2463 RA Bao W. and Jurka J.; RT "Helitron transposons from zebrafish."; RL Direct Submission to Repbase Update (01-OCT-2008). XX DR [1] (Consensus) XX CC contain a 22-bp internal TIR: 3-24 and 2412-2433. 3'-end hairpin CC position: 2435-2454. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2463 BP; 855 A; 408 C; 346 G; 853 T; 1 other; ttatgttggc ttgagaaagc caacatactg taatgctact taaacttatt cttattatta 60 ttattattat tattattctt attattattc cggcttctga ccaaaaaaaa gcaatcgctt 120 ctcctcccag ggcttagatg ctacaagccc caaatttggt caaaagctgt ctattgccca 180 tgagatggtt gctgtatgtc ctcatgccga taagatttgt agtttttgaa ttaaaaattt 240 tttgaattaa aaaattgtaa atattacaat cgagtatttg gggcatataa aaaagtatac 300 aatcctcaaa tttggccaaa agatgtgctt taaattgaag ataattatca tatttatttg 360 gtgtaattat aaataacagt tttcctataa atattttttt tatattaaat ttgttaattt 420 caaacctaaa gcttgtcaga gtgtccacga gatggcgcca gaagactatt tttgagccct 480 ttggaattga gtttactgca cctttgcacc ttttacaatt tctttccatt cggtccggtc 540 ggacctgaat gtctttatac cgttgtaatc acgtgtaaat tctaacaaag acaatgtttg 600 cactttttaa atcacatttt atttattgtg caaaaataaa acatccaaac agcatcataa 660 ccaataaaaa cgcacagcaa agaaaatgta ttttaaaata gtgaatatat atatatacat 720 acaatgtatg aagatttttt atatcccctt aatatgctct gctgttgctg catcaatttc 780 tctccgtaac atctggattg caatttacac tgcaatgttc acaatttatt attatggggg 840 gaaaatccaa acagtattat aatgttttaa agcaaaaatg ttaagaacaa ttatttacaa 900 acagtkaaaa taaagtgtgt gtatatacgt atatatacat acagttgatg tcagaattat 960 tagctcccct gtttattttt tccccaattt ctgtttaatg gagggaagat tgtttcagca 1020 catttctaag cataatagtt ttaataactc atttctaata acttctttat tttatctttg 1080 ccatgatgac agtaaataat atttgacttg atatttttca agacacttct atacagctta 1140 aagtgacatt taaaggctta actaggttaa taaggtgaac taggcaggtt agggtaatta 1200 ggcaagttat tgtataacga tggtttgttc tgtagactat caaaaaaaaa attgcttaaa 1260 ggggcaaata attgtgtccc aaaaatagtt tttaaataat taataactgc ttttattcta 1320 gccgaaataa aacaaataag actttctcaa gaagaaaaaa tattatcaga catacagtga 1380 aaatttcctt gctctgttaa acataatttg ggaaatattt aaaaaaggga aaaaaaaaaa 1440 atcaaaaggg ggttaataat tctgacttca actgtatata tgtatgtata tatatatata 1500 catatatata tatatatata tatatatata tatatatata tatatacata catatataca 1560 tatataacac atacacacat ttatggttag ggttataagg attatttctt aatccccctt 1620 ttatgttggc aaaaacataa gtttacttca gatatttctt tcaaaacaga atattctgtg 1680 ccgcaaaaga gccttctcct cacttgtgat aatgacaatg aaagccaagc ttttggccta 1740 tgctgtacca aaaaaaaaaa taaaatcacc caggtgatga caattaaata ttacgattat 1800 aacattgtgt tagtggccag tggggtggaa caacgtcatt ggtcagaatt aaccacgtga 1860 tgaagacggc ctataatatt gtatttaatg cgattttctt cctgtaaact gtatatataa 1920 ctatataact atataactat atatataatg ctgcaaatta ctgtaaattg actgtactct 1980 atattaactg caaaaattat aagactgatt aaataaaaga caaattaact aaatgtatgt 2040 gctattgttt cattagtatt ttaatttaat atttttcttc gtatgatatt tattataatg 2100 taattttgtt cagattataa tatatactat aaatactaac tatactatag tgatctgcat 2160 taaacaacct gaacaaatca gtcctctctc tctctctctc tctctctctc tctctctctc 2220 gatttgatca aaaatgtaaa tgaggccttc cgattaacag tccagtggac cctagcaaca 2280 gcctagaaac cactcagaac accctagcaa ctacctagga atgccttagc aacatcttag 2340 caaccgccta gcaaccactt ctcaaccgct gtgcttcctg ccaactgcct ttgagtcccc 2400 ggcgcagtca cgttggcttt ctcaagccaa catcaaagtt cgtcaacgaa ctttaggttc 2460 tag 2463 // ID TDR15 repbase; DNA; ZEB; 549 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR15. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-549 RA Jurka J. and Drazkiewicz A.; RT "TDR15: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 25-25 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. XX SQ Sequence 549 BP; 152 A; 113 C; 111 G; 158 T; 15 other; tagtgttgtc anaagtattg anttyggtac caatcgatac tgaaatttta aaaacgtcca 60 tttcccgcta acatttgagc actgttgagc gcgttcttaa acaccgctga ttkgccattg 120 tgttcacrtg ctcaacagaa atgactgtga ttggccgtga aggtcatcag ttnaccactg 180 tttaccgagt gcaaacacag atacagagac actggagcat tttaaagcca tgtcgatcag 240 ctgatttgtc tatcagctga tnnwtngwmn atcagctgat aaatcagctg atctacgtgg 300 ctttaaaacg ctccagtgtc cctgtatctg tgtttacact cagtaaacag tggtgaagtt 360 cggtaaacct gatgaccttc acggccaatc acagtcattt ctgttgagca tgtgaacaca 420 atggcaaatc agcgctgttt aagaatgggc tgctcaacag cgctcaaatg ttagcaggaa 480 atggacgnac nttttttaaa tttatttcag tatcgattgg taccgaattc catacttgtg 540 acaacacta 549 // ID ERV1-N2-LTR_DR repbase; DNA; ZEB; 794 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, LTR - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N2-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-794 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1599-1599 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 794 BP; 198 A; 166 C; 128 G; 302 T; 0 other; tgtaagaggc taaattatca tatgcatttg tttatatgac aaaacatgtc tttttgactc 60 aagaaggtcc atactgactc caaaggcaga tactgataag atcagggcct ggcgtgtgga 120 ctttttgggg gttttacgat gtggtcacat gttgattggt cagtttgaat gtctcagcat 180 ttgggcttta gtcacatggt caagaaagat acaaaatagg tggtaaactc tgctctgtct 240 cttttcctgg cctcttttct gtctctgcgc tttcctgctc tgctcctctc ctcctctgga 300 gtctatcttc tctgactcta ctgcaatcag gctaacttct gggcctgctc tctttgtctc 360 tctctctccc tccccctgtg tctctccttc tacctctggt aactttaatt ttacatttaa 420 gctgggtaca atttatatca tatgttttat catttcatat tttatcattt tatacagtta 480 ttttgtaatt aattcagttg taaccataga gaattattgt cattaaatca tctcattttc 540 aagtatttgt gaattacttt tgatttaaca tcaacattta attgttccat attgcaatac 600 aatacaatca cataatatca acgctctccc acatttttag ctattaatac tgcccttatt 660 tccgtttttg gtatgagatt gcagaagaat ggtttgacta gaaatgtaca gttttttacc 720 atcaatgctg ataacttttt agtcactcgg cagaactaca gttcgaaccg ctgtggttaa 780 aacccattct taca 794 // ID HARBINGER1_DR repbase; DNA; ZEB; 4016 BP. XX AC AL591210; XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE Autonomous Harbinger-like DNA transposon. XX KW Harbinger; DNA transposon; Transposable Element; KW DNA-binding protein; HARBINGER1_DR; HARBINGER2_DR; KW Harbinger superfamily; TDR; transposase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4016 RA Kapitonov V.V. and Jurka J.; RT "HARBINGER1_DR, an autonomous Harbinger-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(4), 12-12 (2002). XX DR Genbank; AL591210; Positions 80099 84115. XX CC HARBINGER1_DR is an autonomous Harbinger-like DNA transposon. CC Given well preserved 16-bp TIRs, the TAA target site duplication, CC and encoded protein sequences, HARBINGER1_DR was active in the CC zebrafish genome recently. The zebrafish genome harbors multiple CC families of nonautonomous DNA transposons, for example CC HARBINGERN1_DR, that are derivates of autonomous CC HARBINGER1_DR-like CC elements. CC HARBINGER1_DR encodes two proteins, 424-aa HARBDR1p1 (2 exons, CC positions 190-419 and 1084-2128) and 250-aa HARBDR1p2 (positions CC 4001-3249). CC HARBDR1p1 is a Harbinger/IS5-like DNA transposase. CC HARB1DRp1: CC MESIALLSLWVFFLSRLLLSGAQPSHNITMSRSIVFVARKLSRNRAARIRKFARWQLKCS CC EDFAALLSDYSSCTSIRTTWIRQRSDVWWQHVLTSWTDCEWKDNFRMGRAAFLQLCNRLQ CC PHIQRQTTTFRKPVPVEQRVAICIWRLATNVEFRTISHLFGIGQSTAVSITNCVASAIVK CC NLLSIFIRTPSEQEFESIIQGFRDKWGFPQCGGAIDGTHIGILAPPVSSADYYNRKGFYS CC VILQGVVDHRLMFWDINVGWPGKVHDARVFANSSLFDRGQGNSLFPPNIERFGDVDVPVM CC LLGDAAYPLMPWLMKPYPENQLTTPAQSTFNNRLSKARMTVERAFGPLKGRWRCLMKRCD CC CHIDNINSIISACCVLHNYCQLYGEDCDEIATDDENVNSDWNSCHSATRSNATRDALCSY CC FSNL CC HARB1DRp2 is a DNA-binding protein with the trihelix motif. CC It is related to multiple DNA binding proteins and transcription CC factors identified in plants (GT-1 proteins), insects (GenBank, CC g7294624) and mammals (GenBank, g12844693). Presumably, HARB1DRp2 CC is a former host gene captured by Harbinger in the past. CC HARB1DRp2: CC MPSVPCPGPFPGSFEKCECAESGPSTVHLAGPGPVGRGVPERGSLGLWRGTLVVRVQACQ CC SPKLKTLRHLLRHFKTGKNHVTSLSATLLTPKSNRKRTSDAFSATIPKTTVAWKGSLWST CC DEVQCLLDVWADESIQEQLDKTHKNSEIFRKIKDHLLGRGYQRTTEQCRDKMKKLRAQYL CC KVRDALRRSGSKPEEKDKFKWYDAVDNIIGSKPSSQPNVLESHSQSMGDSTPSTPSETED CC TGSERSQRGE. XX SQ Sequence 4016 BP; 1195 A; 786 C; 817 G; 1218 T; 0 other; ggccgtactc acactaggta cagttgcctc gaaccgggcc aaagcacgct tgtcccccct 60 cccctctccc ccgacggccc acactcacac tatgttatag catccgtacc caagcccgtt 120 tacgtcacta tagtgcgtca ttgtagtgcg acgtactcaa aataaacagt aggagatgcg 180 ctaagcacaa tggagtccat tgctttgtta agtttgtggg tatttttttt gagtcgtttg 240 ctcctcagcg gtgctcagcc ctctcacaac ataacgatga gccggtctat cgtctttgtg 300 gctcgaaaac tttcacgaaa ccgtgctgct cgaatcagga agtttgcgag gtggcagctg 360 aaatgcagcg aagactttgc agctttgttg agcgactaca gttcatgcac atctataagg 420 tgagaaaaat tcgaagtaat aagaacaaaa ggaaggtgta ttattaaatt tcaaacacag 480 tatgtttttt aatgataact gtagcgtagt atgtctaata atggtgtaac gccttagact 540 attagggata tgtaatagat aatgttcgat aacaatagtt aaaagctgcg aatattgcat 600 attgtacaat caagtgtcca acacgtaacg tcacttgttt gtaactacat cagttatata 660 caggttatgt aattataaat ttgttattta attataaact gtaacactgt aacacttcca 720 tctacattgt gattaaattg tttagcattt actttgttac aatatgttgt agtttttagt 780 ggaaatgcaa aagaaatcat gtttttaatc agtgtttatt ttggttattc acaactaaat 840 gtacacagtc ttatagaaaa tagaacagct atactttagg tatgtaataa gtattttgtg 900 tgcattgcca gcacctaagc ttttcactca tcatagcaca cgtgctgctg gtgatgtaac 960 aataaaagtg gtttgatttg tcatagtggt ttcttgtttg ttgatttcca cttcaaagag 1020 ctggaacgtt attgtgtatg caaaaactat acaattatta atattgtttc cattctgctg 1080 caggactaca tggattcgtc agagatctga tgtgtggtgg cagcatgttc tcaccagttg 1140 gacagattgt gagtggaagg ataatttcag aatgggaaga gctgcattct tacagttgtg 1200 caaccgcctg cagccacaca ttcaaagaca aaccactaca ttcaggaaac cagtgcctgt 1260 ggaacaacgt gtggcaatat gcatatggag attggccaca aatgttgagt tcagaacaat 1320 ctcccattta tttggtattg gtcagtcaac tgcagtaagc ataacaaact gtgttgcctc 1380 agcaattgtg aaaaatttgc tttcaatttt catccgcaca ccctctgaac aggaatttga 1440 aagcataatt cagggtttca gagacaagtg ggggttccca caatgtggag gagccattga 1500 tggcactcat attgggattt tagctccacc agtcagctct gcagactact ataaccggaa 1560 aggtttctat tctgttattc tgcaaggtgt ggttgatcat agacttatgt tctgggacat 1620 taatgtggga tggccaggca aggtacatga tgcaagggtt tttgctaatt catccttgtt 1680 tgataggggc cagggtaatt ccctatttcc tcctaacata gagaggtttg gggatgtgga 1740 tgtgcctgta atgcttttag gggatgctgc atatccgttg atgccatggt tgatgaaacc 1800 ataccctgaa aaccaactga caacccctgc acaaagtacc ttcaacaacc gcctcagcaa 1860 ggcacggatg actgtggaaa gggcatttgg acccttaaaa ggaaggtgga gatgcttaat 1920 gaagaggtgt gattgtcaca tagacaacat taactctatt atcagtgcct gttgcgtgct 1980 tcacaattat tgtcaattgt atggtgagga ctgtgatgag attgctactg atgacgaaaa 2040 tgtgaacagt gattggaatt cttgccattc agccaccagg tcaaatgcca ctagagatgc 2100 actgtgctca tatttctcta atctgtaagg acagattttg cagaacaatg gttttgatgg 2160 catactttgt gtaatgttgt atagttattt accctgttag tgttggtata ctacttctgt 2220 gattgttgtc aataaagaca tacagtattg cttattttca gaactgctct aattatgtaa 2280 taacacacaa aaacattaga taatatccaa aaatgttatt taaaatataa gaagcacatt 2340 ctgcagtact aagagtactt tacaatagat gaccgtgtgc agtaatggga cataaaagaa 2400 aatcaaaaag aaagtaaatc aaatatagag tttgttgaat actgaggtat aacaataaag 2460 tgcagttttt agagatatac attgtggttg aataactatc aaaaaatata taaaatatgt 2520 atatatacat atataaatat aatatatata aatatataag tacaacaact cttatataaa 2580 gaaaagaaaa acaacaaact ataatatata aactataaac tattatataa agaaaaaact 2640 atataaaaga acaacattaa gaaagcaaaa tcaaaactaa agtttgttgc atactgaggt 2700 acaacaacaa agtgtggttt tttggcagca tatattgtct ttaaagagcg gttaccaagt 2760 gtttatttaa aaaaaaaaaa aaaaacacac acacacacac atacatacat ctcacataca 2820 cacacatttc tctttcttgc tggtaacgca gttgttcccc ctccataaat tcgtcatact 2880 gccgtttctg caggtttaca aaggcagcca tagtgtcgtt tgcctccacc tttctcttcc 2940 tcttcctttg ttccctggaa ggtgttgttt ctagtgtctc aacactactg tttgaggtgc 3000 caggtgagtc tgaaccgggg cacataactg tcgaattaca agaaaagaaa acacaaatat 3060 aaacaattta ataatgcaca gttaacaaac atgccttatg atcattttac ttttttccat 3120 ttgttgtaaa acatgtttgc tattacaata ttggtaaatt tgtggaaaca gttacctttg 3180 tttcttagtt atatataaat aataaaacat gtcatttacg tctgggacac gcatatgctg 3240 aagcaacgtt actcaccacg ctgggatctc tccgatcctg tgtcctccgt ctcagatggt 3300 gtagatggtg tagaatcacc catgctttgg gaatgggatt ccaacacatt cggctggctg 3360 gatggtttgc tgccgataat gttgtcgaca gcatcatacc atttaaattt atctttttct 3420 tctggtttgc tcccagatct acggagggcg tcacgcactt ttaagtactg tgcccgtagt 3480 tttttcattt tgtcccggca ttgttcggtg gttctttggt aaccacggcc gagaagatga 3540 tccttaattt tccgaaatat ttcggagttc ttgtgcgttt tgtccagctg ctcttgaatg 3600 ctctcgtcag cccaaacatc caacaaacac tgcacctcat cagttgacca aagggatcct 3660 ttccacgcca cggttgtttt tggaatagtc gcagaaaatg cgtcactcgt ccttttccgg 3720 ttgcttttgg gagttagaag cgtcgcagaa agtgacgtaa catgattttt tcctgttttg 3780 aagtgtcgca gcaaatgacg tagcgtcttt agtttcgggc tctggcacgc ttgcactctc 3840 actacaagtg taccgcgcca aagcccaagt gaaccgcgct caggcacacc tctcccaact 3900 gggccagggc cggccaagtg aaccgtgctc gggcccgatt cagcgcactc acacttctca 3960 aacgatccgg gaaacgggcc tgggcacggt acggatggca tagtgtgagt aggccc 4016 // ID Gypsy-19-LTR_DR repbase; DNA; ZEB; 1625 BP. XX AC chr13; XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-19_DR LTR retrotransposon - a DE fossilized sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-19-I_DR; Gypsy-19-LTR_DR; Gypsy-19_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1625 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-19_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 12-12 (2005). XX DR Zebrafish.; chr13; Positions 7932458 7930834. XX CC Gypsy-19-LTR_DR is a long terminal repeat of the Gypsy-19_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-19-I_DR. XX SQ Sequence 1625 BP; 461 A; 246 C; 294 G; 624 T; 0 other; tgtaacccag ctcactgact aggtcactaa aacaaattat aaatagttta ttttaaatta 60 taacatttta ttttatgatt gtgaggagaa atattaaata gttcatttat attatataca 120 taatgtttaa aagttaataa taaatagaga ataaagtaag ttttgactcg gtggaggtcg 180 gaggttagct ccgccccctt gcgtgtgtgt gagagctgcc ggtgaaggag agtcactttg 240 attaatcagc gcgagcagat tgaacgacat tagacactgt gaaacagaag tttagataat 300 ccaaacgatc taaaaatcat attgtttcgg tgattgactg agaaaagccc taaagtttct 360 ggttttaagg atagaaagct gacaacaagt gatagaaacg agagaagtgc gagcggagca 420 tattaatgcg atcgagagaa cataaaactt atatttaaat gagacacatt cttctttatt 480 ctccctgttc atctttatct ttccttggtg agtaagagat cattattatt agctgttaat 540 tggttttgac atgtttttgt agctgtttag aatagaaagc aaggatgatt tatttaaaaa 600 tgtttatctg gagtttattt tattaccaca taattgtaaa tattgtttat gcacttcgaa 660 aatgtgcatt ggttgaattt gcatttattt ttctttgttt ttcattgaaa atgtattgca 720 tgtgacgcta gctttgctag ccgtgtacat tgactttaat gtattcagtg agaacatgtg 780 actgactagc atgtatgtta tctatgctga aatgagtgac acagacaatg tgttttattt 840 aaacatgttt attgagttat taatccttgt ttctcttttg gctttgtgtt ttatcaaggc 900 cgggtttttt tgtgtgtgtg cgtgtgtttc ccctttcgtt tctgctacaa tatcccgatc 960 tgtccctgcc gtggatgctt tctcggcggt ccggacatgg cgagccgctc ttttgaaaga 1020 tcgtgcctgg aagtaaatta atctgttgct cattgaactg gtaggacaaa acattcttgc 1080 actttcactg cgctcatatt tttgttttta ttcaattttg gattttttct ctatctggac 1140 tgattttttg gatttttcac atcgaactgg agctaaaaga acttttgagt gttgaacatt 1200 gaacttggac tctgaattac tacttttatt caatcttttc cttcgttttt tttatttttc 1260 ttcctcattg tatacgattt gtgaaactga atttgtctga agtgagatta tcatctgaag 1320 tgtggtgatt tcattatatc ttttattgtt ttattctagt attcaccaaa ctgaactgtt 1380 ttgacattta tttatttgtt ttattttgtt ttcctcctcc acttgtttaa acaatatact 1440 tacctcagtt attcaacttt atttttgagt ctgacttatt tgactatcta aaactggttg 1500 ttatttctat acactcttgc gagcgaatta aaatcttctg acatctggtc ctaataagaa 1560 aatagcaaat actgtaaaaa taacacaata gaaaacgcat caataacaca tagataaggg 1620 ttaca 1625 // ID ERV1-N3B-LTR_DR repbase; DNA; ZEB; 1498 BP. XX AC . XX DT 19-FEB-2009 (Rel. 14.02, Created) DT 19-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, LTR - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR; ERV1-N3B-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1498 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 556-556 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1498 BP; 459 A; 276 C; 300 G; 463 T; 0 other; tgtaagaatt gaaatattac aaatattttt gatgtctttg acaattcaaa acatgagctg 60 aaaacagaga caattgactc cacaacccag caggtgggaa tgcgggaaaa accagttcct 120 gctgcatttg catctgagtc tgctgaaact cggcttcatc caccccaact cacacatatg 180 gtcatttcaa tgtttgaatg ttctagatta accaaactga ctttaactat catgtttgat 240 atcatttgaa atgtctcagt gcgactggta caatggtctc attctctcag aaatagacat 300 aatcaaaaca ataaatatat aacttcaggc atcttttgaa ccactgtgaa gggtgtgact 360 tttccacagg cagcactaaa tgtgaatgcc ttttgttatg cacacacccc ctttccttga 420 gggaattctt ggattattta aggtaaggtt tgagaaaggc tcagggcgat tctgcctaac 480 cagatcagcc cataacatgg agcttacgct ccatgttatg tataatttgg aagtcaggta 540 tgcatctttt catcttggat ttatctttga taatttgatg ttttgtattg atcatttatg 600 aattgactga ttgaattggc ataatacatt tacctgacta ataaattgta atgtttgacc 660 atattttgct cactctgtaa ttattaattc aagtagatta cgctgtatcc ttgtttccgc 720 acgtcttgcg tcaggtcagt agacggacta aaatccagtt gagatggagc attgggcaca 780 ctaattgtgt tttagggatc cgactgactc ttgatattga ttcaagtgta cttaggtgga 840 aagggcgtta acttccgtct aggtcaacaa ggtgttgcac gactaaccta gattgggtac 900 ccgacctttg tttgaaggta atattattct aaattaagtt catatgggaa accatgggct 960 tggtgaaaac caaagttact atagagtcca gtagtcgggt acatttatat taatgcccta 1020 acctctaatt agaaatgatg attgtcttaa ctcaagggtg tacatctaag cgggactaac 1080 taaaagtatt ttagaacgag cgcgctggta gcggccatct aaagtattag tcaattacct 1140 ctgtttaata ttcaagactg accggagtgt gaccgtgagg gacgccttcg gccgaggcga 1200 tttctctgag cgacatgagc acccgaatga acggatgtaa tagttactag tgaagacaaa 1260 aggttcaaac atttatctaa attacatatt gatataatct tcttaatgtt ttatgattgg 1320 agtcagataa aactaaagat aaatatatta atattctaaa ccacctcata ctaaaattgg 1380 agtcagatat gagttaaatt taaggtaaat caacattgtg cactggaaag accgaacagg 1440 cattgaacct tcatccacca gtggacaccg gcattgggcc aactgggtgg accttaca 1498 // ID Gypsy8-LTR_DR repbase; DNA; ZEB; 1793 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy8_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy8-I_DR; Gypsy8-LTR_DR; Gypsy8_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1793 RA Kapitonov V.V. and Jurka J.; RT "Gypsy8_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 322-322 (2004). XX DR [1] (Consensus) XX CC Gypsy8-LTR_DR is a long terminal repeat of the Gypsy8_DR LTR CC retrotransposon. Its internal portion is deposited in Repbase CC as Gypsy8-I_DR. XX SQ Sequence 1793 BP; 543 A; 272 C; 324 G; 653 T; 1 other; tgtaacccgg ataacccagg ttactagttg ttagtacaca gcaatagaaa aaatagaaat 60 acccttataa atatgagaaa aaacactggc agaatactcg attaaataat atacattaaa 120 attatttatt tatataaata atttataaaa gagtattata cagcattcaa gggagtattg 180 ctttagaatg acatcatcgt gccttctgct atagcagcag aacaacttta caaactataa 240 caaatctctc ttgtctgtta gtttgtgttt actgttgcta tggtgattca gtgtttgcaa 300 gggagtgaga gaagtaagtt gttattgtat tatcagtaga aaaytattag atttatatct 360 tttaatgttc taaattcgta ttaactgtgc aattttaccg ttaataagat gctgcaatga 420 tttatataat atatttttga tgaaggattg agagaaaaag cgtcatttgt atatgagagt 480 gaaatctgca cacgagtgcc acctggagtt ttcctgggtt ttcgctttga ttttgagtga 540 tttccttggt gagtttgtgt ttatttttgt tttcatttta acatgatttc aagtagtgtc 600 aaaaatgtgt gtatatatgt taagtaaatg tgcatatgaa gcgtgtacat atgagattta 660 ttgttaaaga cattctatta attgatattt gccttgtgtt ttgcaaggct gggttttttg 720 tgtgtgtcga agcatctgtg atgagtccct aattcacgtt ggactcgagg agaaacttac 780 ggcgagccac ccacgcaacc gactctcctg ttagactgtg tgtttgtgcc accggactgg 840 taggagagtt cttttgattg tttggagaaa cgagactcgg actcatcgcg accttccgct 900 tttccttctt acttttccta aaatcctcaa cggaaaacct ttactggaac tgagctaaga 960 gaatcaacag ttctctttca cgaaatcctt ggtattgaac tgtgaacggc aaccggtgga 1020 acgaatcatc tgaaagcgaa acaaatcctt tggacagcaa aacgaatctt ttgagtagcg 1080 aacgattcat ttgaacagag ttttgagtct ttgaacagtg aggtgattca ttttaacaga 1140 gtagcgatct tttgaacagg gaggtgattc atttgaaccg tgtattgatt ctttaaacag 1200 agaggtgatt tatttgagca cattaatgat ttattgagtt aaaagtgtga atattgaatt 1260 atagcctgca tttgagtaga gttggtatat ttattattga tttatttctt tattttcttt 1320 tatttattac atttgagaat acagagtatt ttttatcaca aatttattgc cttctttact 1380 ttgttttata caaagggtaa cacagtgttg tagctttcta agatttttat tttcattttg 1440 tacaagtgtt tgaaatttct caactgtctt gatcacaact ttgttttaac tgtatattga 1500 cttcacgttc atctgtgtaa agaaaaaaag agaaaattat ttacttttta ttattataat 1560 ttataatcga acctatatat acataaatac ttatgcttat tggacagaaa ttcacaaact 1620 cataacatat ttctaattat ataaactata ttgaaggaaa aaactgactg tgtctgatta 1680 ttcactcacc tgctgtgctg cttttctctc tccagtagcc gttctttctc ttctgaccaa 1740 ttggcccaca ttgaaaaata cgctatcgcg acaaatcaat cgtacttgct aca 1793 // ID DNA-TA-2_DR repbase; DNA; ZEB; 196 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DNA transposon from zebrafish. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD TA; KW DNA-TA-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-196 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1246-1246 (2008). XX DR [1] (Consensus) XX CC The TSD is mostly TA dinucleotide. This consensus is CC characterized by ~65 bp TIR (mismatch from 12-30). CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 196 BP; 76 A; 25 C; 25 G; 70 T; 0 other; cactgaaaaa aagtgttgca tgcagaactg ttgcaaacaa tttatttgtg ttgaatttaa 60 acaaacaaat taagtttaat aatgttcaac ttaatttgtt tgtttaaatt cagcccaaat 120 aaattgttta caaccactta acgtaaaaaa atttgagtaa atccaaggaa tcatctttga 180 ataatttttt tcagtg 196 // ID DNA-6-N1_DR repbase; DNA; ZEB; 708 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-708 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1240-1240 (2008). XX DR [1] (Consensus) XX CC The elements of this family are ~86% identical to the consensus, CC and the TIR is ~300-bp. The TSD is characterized by 6-bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 708 BP; 221 A; 145 C; 138 G; 204 T; 0 other; tacggtggcc gagagagctt aacatgctgc aatttaagaa aacacatgca attacaaaaa 60 acgccagcaa attgagaaaa gatcttcatc tgtttgacag cacacgtgct gcaaatcctc 120 acaacgcaaa cacattttta aaaacgcgct gcattttctc acaacgcaaa caaattaata 180 aaacgtgctg cattttctca caacgcaaac aatttatgaa aacgcgctgc attttctcac 240 aacactttca aatagcacgg aacacaacgg aaatgtttca aggggaccct aaaacgtgac 300 agacccggct atttaggtta tggttattta ggctaataaa atacaaaaga caagggaatc 360 aggatgttaa aataaacaaa caaaaaactc acctttcaaa gatcacctac aacttcactg 420 agcaatagtc acggcacagc ggcgtccgtc acgtttttgg gtccccttga aacatttccg 480 ttgtgttccg ttgtttttgc aagtgttgtg agaaaatgca gcgcgtttta ttaatttgtt 540 tgcgttgtga gaaaatgcag cgcgtttttt aaatgtgttt gcgttgtgag gatttgcagc 600 acgtgtgctg tcaaacggat gaagatcttt tcttaatttg ctggcgtttt tttgtaattg 660 catgtgtttt cttaaattgc agcatgttaa gctctctcgg ccaccgta 708 // ID Gypsy56-LTR_DR repbase; DNA; ZEB; 188 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy56-I_DR; KW Gypsy56-LTR_DR; Gypsy56_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-188 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1428-1428 (2008). XX DR Genome; chr16; Positions 35714307 35714120. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 188 BP; 54 A; 29 C; 41 G; 64 T; 0 other; tgttatgtag gtgatattgc cattgtcccc agtgccatta ggtggcaccc acacgttact 60 gcttatgttt gtgtgctctt gtaaataaaa tagtgaagaa gataagtact cccgttgtag 120 ctgtgttatt tagcaacata gtgcattcaa gagtgtattt taaatgttat gatagtgaga 180 acataaca 188 // ID CR1-12_DR repbase; DNA; ZEB; 1740 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-12_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1740 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1207-1207 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(2..283,270..683,628..1503) FT /product="CR1-12_DR_1p" FT /translation="SQSSQLFNFSPIVTSEVCKALKNLDVSKSPGPDKLEP FT YFFKLAADFIAPSLTHIFNLSLITNEVPVIWKSTFVLPWETPHKQIITVPF FT LSCLPGAVFQAKVLESLVNNQLKEFLEINNIMSKFQSGFRKQHSTITAVLK FT VLNDFIDSVDTQKHCAALFIDLSKAFDTVDHTLLLQRLKSIGLSDHAIGWF FT KNYLTNRNQCVQVDGLVSSTLNVTKGVPQGSVLGPLLFTIYKLKVFHRVQY FT WGLFFLLYINNIDQNIKNANFHYYADDTVIYTAASTPNLALSQLQIAFNVL FT QQNLSELKLVLNANKTKLMLFSNSKQAKQKLGPVTTHGAEIELVSQYKYLG FT ILIDDTLSFSTHIQQLVKKLKVILGFYFRIKNCLSFGAKKKLVEATFMSTL FT DYGDVIYMHASSQCLNALDTVYHGALRFITNFKSLTHHCVLYTRVGWSALS FT IRRLKHFHIVVYKSILGLLPSYLQNYICKKPVLNYGLRSQDIYLLSVPKVR FT TEMGKRLLSTQHLLLGIICKHSCSSKT*" XX SQ Sequence 1740 BP; 527 A; 304 C; 292 G; 617 T; 0 other; aagccagtca agccagcttt ttaatttttc accaattgta acctcagagg tgtgtaaagc 60 ccttaagaat ttagatgttt ctaaatcgcc cggtcctgat aaattagaac cttatttttt 120 taagttagca gctgatttta ttgctccttc tttgactcat atttttaatc tttcacttat 180 aacaaatgaa gtcccagtga tatggaaatc aacgtttgtt ctcccctggg agacgccaca 240 caagcaaata attaccgtcc catttctaag ctgtcttcca ggctaaagtc cttgaatcac 300 ttgtaaacaa tcaattaaag gaatttcttg aaataaataa tattatgtca aaatttcaat 360 caggtttcag aaagcaacac agcactatta cagcggtttt aaaagtatta aatgatttta 420 ttgattctgt ggacactcaa aaacattgtg cagccctgtt tattgacctc tccaaagctt 480 ttgatacagt ggatcatact ttgttgttac aacgcctcaa aagtattggt ttatctgatc 540 atgctattgg ttggtttaaa aactatctta caaatcgaaa tcaatgtgtg caagtggatg 600 gccttgtttc tagtactttg aatgtaacta aaggtgttcc acagggttca gtattggggc 660 ctcttctttt tactatatat aaataatatt gatcaaaata tcaaaaatgc caattttcat 720 tattatgcgg atgatacagt tatatacacg gctgcatcta cacctaatct ggccctttct 780 caactgcaga tcgctttcaa tgtacttcaa caaaatctat ctgagttaaa attagtttta 840 aatgccaata aaactaaact tatgttattt tcaaattcaa aacaagccaa gcaaaaacta 900 ggtcctgtta ctactcatgg tgcagagatt gaattagtgt cccagtacaa atatcttgga 960 attttaattg atgatacact ttcttttagt actcacattc agcaattggt gaaaaaatta 1020 aaagtgattt taggcttcta ttttagaatt aagaactgcc tttcttttgg tgctaaaaag 1080 aagctagttg aggcaacatt catgtcgaca ttggactacg gtgatgtaat ctatatgcat 1140 gcttcgtctc aatgtctgaa tgcattggat actgtttacc atggtgcact aagatttata 1200 actaatttta aatcccttac tcatcattgt gtgctgtata ctagggtagg ttggtctgct 1260 ttgtccattc gtaggctcaa gcattttcat attgttgttt acaaatctat actgggttta 1320 ctcccatcat atctccagaa ttatatttgt aaaaagcctg ttcttaacta tggtcttcgc 1380 tcacaggata tttatttact ctctgtccct aaagtgcgga ctgagatggg aaaaaggctt 1440 ttaagtacgc agcacctttt gcttggaata atctgcaaac acagttgcag ttcaaagact 1500 tgatttcact acatgctttt aaaaagacta taaaagtttt agaagttgaa acacaggttt 1560 gtagatgttt tgaatagttg tcagtgtgat tcatgttaac tgtaaattgt tttttgtttg 1620 ttgtctgttg gactcatgtg acttgtgtac tgcctaatct tggccaggac gctcttgtaa 1680 aagagatttt taatctcaat gtgtctttcc tggtaaaata aaggttataa taataataat 1740 // ID Nimb-3_DR repbase; DNA; ZEB; 5257 BP. XX AC . XX DT 14-APR-2010 (Rel. 15.11, Created) DT 14-APR-2010 (Rel. 15.11, Last updated, Version 3) XX DE A family of Nimb non-LTR retrotransposons - consensus. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; Nimb-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5257 RA Jurka J.; RT "Non-LTR retrotransposons from vertebrates."; RL Repbase Reports 10(11), 2018-2018 (2010). XX DR [1] (Consensus) XX CC >99% identical to consensus. Low-copy number. Probably active. XX FH Key Location/Qualifiers FT CDS 38..1231 FT /product="Nimb-3_DR_1p" FT /translation="MAEGSVDSGSDPEMEEERVDGDSGGNASNEWSVVENR FT KRKKSWAQISESDSEKGSHQARRRREEYKVMLKFASDSVNTINPLKLTKTL FT KEMLGTIESVKTLRDGKMIIYCKDHKQQKMALAMKTLLGHKVICSIPEEKK FT WIRGVITGIPTDVTVEKIKRNITGAAVKEVKRLKCVRNNEKVDSLSVMIHF FT DENKLPERVYLGYLSYNVRLYVPPPIRCYKCQKFGHVAAVCRGKQRCARCG FT GDHEYGKCGQEVKPRCCNCGGEHSAGYGGCQVRKNAVQVQNVRMSEGISYA FT EALKKVKQTSREPETRVTESQQKATMKEQNKDNEVCIDKVSFVTFIAEVVN FT CSAQTESRTERIKIIIRAASKYLELEGVTVDMINDRLKIQSGTSQTVYSQP FT CGSI" FT CDS 1219..4992 FT /product="Nimb-3_DR_2p" FT /translation="MWKYIMVLTILQWNARSLIANGQEFKKFIVDHENSPD FT IICVQETWLKSHLDFVINGYTSVRKDRDKSNGGGVATFIKQSIGYRAVEIT FT GEQEAVAVEICDGTQKVRIINYYNPCDKLSKDNLEHIQGNGNNKIVWCGDF FT NAHNTLWGSKKTDYNGLIIEDMLDWGGLVCINDGGYTRVELIKGKYSNLDL FT TLVSESLAGRCDWKVLKQNTIGSDHYPILSSIGIQIIRTVVERMPRWKFRT FT ADWENFKELCNNRLSEIDRCEDDVEVINSKLCEVLKSTAEEVIGRKKANGK FT KKAVPWWNEECSEVVSKRNNALKKVRKTLNYNDFINYKRAQAIVRKIIRRA FT KKNYWREYCDSIGEDVNVSDIWGMIRKMAGKQRNNNIPTLTENNRLVISDM FT EKAEVLAKNFARIHSNANLSEEIRRNREQILIRNPWLLENKGPSESTLDRE FT FTLFELKKALAESKKTSPGKDEICNEMIKHLSDNSLCIILKLFNKVWESGI FT LPAEWKHGVIIPIAKPGKDQSQPNNYRPIALTSNICKLMERMVMSRLVYAI FT EKENFFAAYQSGFRKGRNTMDSVICLESEIRKAQVNKEVLVGVFFDIEKAY FT DMMWREGLLIKLEKMEINGKMYNWIKNFLLKRTIQVRVGSAFSQIYKVENG FT TPQGSVSSPILFNIMINDIFSKMELGIGRSLYADDGALWKRGRNVVHVERC FT LQNAVRTVQDWADEWGFRFSTDKTQVICFSKKKVNPNININIYRQKIEQVS FT VIRYLGMWMDVKLNFNMHIQKIIDKCKKGINVMRCLAGAEWGACRFSLKRI FT YNALVRSTIDYGCVVYSSAAKTQLLKIDAIQSQALRICCGALRTTPIIALQ FT VEMGEMPLKIRRIKLKMRYWVSIKSQEERHPVKTVLKECWEYGHKKIDSFG FT WTVKDEAQNMGIRDINLAPAVPTSAIPPWLFHKPVVDLLLHEEKHKNNMLN FT EKEIQQYINQSYFNYLQIYTDGSKDPKDEKTAVAVYIPKFNIKISKRITDR FT LSVYTTEIVAILLALQWIEDVKPLRSVICSDSLSVLNNLITGTSKARQDIM FT NEIMQNLFRIRQGGLFVSFLWIPGHMGVVGNEEADQLAKEALQHTQIEIKI FT KISKSEVKNIIAKETGRMWQREWDNGEKGRHMHNIQGCVGGTRNKFGNRRN FT DVVITRLRAGHCLLNQYKFRIGKHETGCCDKCGKIETLEHVITECVAYDNQ FT RYQLVQGLSIYGIKNLSFQTLLGNSSNQVEIIMQLINYLKETKLINRI" XX SQ Sequence 5257 BP; 2019 A; 683 C; 1186 G; 1368 T; 1 other; ggttggtgag acgagctgga ttgtgactgg tgcgctaatg gcggaaggta gtgtggattc 60 cggaagtgat ccggaaatgg aggaggagag ggtagacgga gatagtggag gtaatgctag 120 taatgaatgg agcgttgtag aaaatcgtaa aagaaagaaa agttgggcac aaatatctga 180 atcggacagc gaaaagggaa gtcaccaggc gagaaggaga agagaagagt ataaggtaat 240 gctgaagttt gctagcgact ctgtgaatac catcaatcct ttgaagctta caaaaactct 300 taaagaaatg ttaggaacga ttgagagtgt taagaccttg agagacggga aaatgattat 360 atactgtaaa gatcataagc aacagaagat ggccttggct atgaagacat tattagggca 420 taaagtaata tgttctatcc cagaagaaaa aaaatggatt agaggagtaa taacaggaat 480 ccctactgat gttacagttg agaaaataaa acgaaatatc actggggcag cagtaaaaga 540 agtcaaacga ctaaaatgcg ttagaaataa cgaaaaagta gacagtctgt ctgtaatgat 600 tcattttgat gaaaacaaac taccggaaag ggtgtactta ggatatttga gctacaacgt 660 taggctatat gtccccccgc caatacgctg ttacaaatgc caaaagttcg gacacgtagc 720 ggcagtttgt agaggtaaac agagatgtgc gagatgtggg ggagaccatg agtatggtaa 780 atgtgggcag gaagtaaagc caaggtgttg taattgtggt ggggaacaca gtgcgggcta 840 cggaggatgt caagttagga aaaatgctgt acaagtccaa aatgtccgaa tgtctgaggg 900 aatctcgtat gctgaagctc taaagaaagt taaacagaca tcaagagaac cagaaaccag 960 agtaacagaa tctcaacaaa aagcaacgat gaaagaacaa aataaagata atgaagtatg 1020 tattgacaaa gtatcgttcg tcacatttat agcagaagta gttaattgct ctgcgcagac 1080 ggagagcagg accgagagaa ttaaaattat catcagagct gcttcaaaat atttggaatt 1140 agagggtgtt acagtagata tgataaacga tagactcaag atacaatcag gaactagtca 1200 gacagtatat agtcaaccat gtggaagtat ataatggtat taacaatact gcagtggaat 1260 gcaagaagcc taattgcaaa cggacaagaa tttaaaaaat tcatagtaga tcatgaaaat 1320 agccctgaca tcatatgtgt tcaggaaact tggcttaagt cacatttaga ttttgtaatt 1380 aatggataca cttcggtaag aaaagataga gacaagtcta atggtggtgg ggtagctaca 1440 tttattaaac agagtattgg gtatagggca gttgagatca ctggggaaca agaagctgtg 1500 gcagttgaaa tatgcgatgg aacacagaaa gttagaataa taaactatta taatccatgt 1560 gataaattaa gtaaggacaa tctagaacat atccaaggga atggaaataa taaaatcgtg 1620 tggtgcgggg attttaatgc tcacaacact ttatggggga gtaaaaaaac agattacaat 1680 ggactaatta tagaagatat gctggattgg ggaggattag tatgtatcaa tgatggaggt 1740 tacacaagag tagagttaat aaaaggaaaa tactcaaatt tagatttaac actagtttct 1800 gaaagcctag ctggaagatg cgattggaaa gtattaaagc aaaacacaat tggtagtgat 1860 cattatccaa ttctaagttc aattggaata cagataattc gaactgtggt agaaagaatg 1920 ccaaggtgga aattcagaac agccgactgg gaaaatttta aggaattatg taataataga 1980 ttgtcggaaa tagatagatg tgaggatgat gtggaagtta ttaattctaa actttgtgaa 2040 gttcttaaga gcactgcgga agaagtaata ggaagaaaaa aagctaatgg taagaagaaa 2100 gcagttcctt ggtggaatga agagtgcagc gaagtagtaa gtaaaaggaa taatgcatta 2160 aaaaaagtta gaaaaacatt aaattataat gactttataa attataaaag agctcaggca 2220 atagtgagga aaataattag aagggcaaaa aagaattatt ggagagagta ctgtgacagt 2280 attggggagg atgtcaatgt atctgacata tggggtatga taaggaaaat ggcaggaaaa 2340 caaagaaata ataatattcc tacattgaca gaaaataaca gattagtaat ttctgatatg 2400 gagaaagctg aagttctagc aaaaaatttt gccagaattc atagtaacgc taacctatct 2460 gaagaaatta gaagaaatag ggaacagatc ttgatcagga atccgtggtt gttggaaaac 2520 aaagggccat ctgaaagtac attagatagg gaatttacat tatttgagtt aaaaaaagca 2580 ttggcagaaa gtaagaaaac ctcccctgga aaagatgaaa tttgtaacga aatgattaag 2640 catttatcag ataattcatt atgtataata ctaaaattat ttaataaagt atgggagtca 2700 gggattttac cagcagagtg gaaacacggt gtcataatac caattgcaaa accaggtaag 2760 gatcagtcac aaccaaataa ctacagaccc atagcattaa cttctaatat atgtaaatta 2820 atggaacgca tggtaatgag cagactggtt tatgctatcg aaaaagaaaa cttttttgcc 2880 gcataccaaa gtggtttcag gaagggacgt aatacaatgg attcagtgat ttgtttggaa 2940 tcagaaataa gaaaggcaca agtaaataag gaagtgttgg tgggagtatt tttcgatatt 3000 gaaaaagcat acgacatgat gtggagagaa ggacttttaa ttaaattaga aaaaatggaa 3060 attaatggaa aaatgtataa ctggatcaag aattttttgt taaaaagaac aatacaagtt 3120 agagtgggtt ctgcattctc ccaaatatat aaggtagaga acggaacacc acaagggagt 3180 gttagcagcc ctatcctatt taatatcatg ataaatgaca tcttttcaaa aatggagttg 3240 gggatcggga gatctttata tgcagatgat ggtgcgctat ggaagagggg aagaaatgta 3300 gttcatgtag aaaggtgttt acagaatgca gtgaggacag tacaagattg ggcggatgag 3360 tgggggttta gattttctac tgataaaact caggttattt gtttttcaaa aaagaaagtt 3420 aatccaaata taaatattaa tatatacaga caaaaaatag aacaggtatc agtaattaga 3480 tatttaggaa tgtggatgga tgtaaaactg aattttaata tgcatattca aaaaataata 3540 gataaatgta aaaaaggaat taatgtaatg aggtgtttgg caggagctga atggggtgca 3600 tgtcgtttct cgcttaaaag aatctataat gcactggtaa gatctactat agattatgga 3660 tgcgtagtat atagttccgc agctaaaact cagttgttga aaatagatgc gattcaatct 3720 caggcactaa gaatatgctg tggagcacta agaactacac ccattatagc attacaagta 3780 gagatggggg aaatgcctct aaaaattaga agaattaagt taaaaatgag atattgggtt 3840 agcattaaaa gccaagaaga gaggcaccca gtaaagacag tactaaagga atgttgggaa 3900 tatggacata aaaagataga tagttttggt tggacagtaa aagacgaggc tcaaaatatg 3960 ggcataagag acatcaattt agcccctgct gtgccaacct ctgcgattcc gccttggctt 4020 ttccataagc cagtagtcga tttattacta catgaggaaa aacacaaaaa taatatgtta 4080 aatgaaaaag aaatacaaca atacataaat cagtcatatt tcaattattt acagatatat 4140 acagatgggt caaaagatcc taaagatgaa aaaacggcag tagcagttta catcccaaaa 4200 ttcaatatta aaatatcaaa aagaataacg gaccgacttt ctgtgtacac tacagaaatt 4260 gtagctatac ttcttgctct tcagtggata gaagacgtga aaccattaag atcagtaata 4320 tgctcngatt ctctgtcagt actaaacaac ttaattactg gaacatctaa ggcaagacag 4380 gacataatga atgagataat gcaaaattta ttcagaatca gacagggtgg actttttgta 4440 agcttcttgt ggattcctgg tcatatggga gtagtaggta atgaagaggc agaccaactt 4500 gcaaaggaag cattacaaca tactcagata gaaataaaaa taaaaataag taagtcagag 4560 gtaaaaaaca taattgcaaa agaaacaggt agaatgtggc aaagggagtg ggataatgga 4620 gaaaaaggca gacatatgca taatattcaa ggatgtgtgg gtggtactag aaataagttt 4680 ggtaatagaa ggaatgatgt ggtaataact aggttgcgtg ctggtcattg tttgctaaac 4740 cagtataagt tcagaatagg gaagcacgag acaggatgtt gtgataaatg cgggaaaata 4800 gaaacattag aacatgtaat tacagaatgt gtggcatatg acaatcagag atatcagtta 4860 gttcaaggat taagtattta tgggattaaa aatttgtcat ttcagaccct attaggaaac 4920 agttcaaatc aagttgaaat aataatgcaa ttaattaatt acttaaagga aacaaaattg 4980 attaatagaa tttaagaata gcttagcaaa ataaggaaat aaaatttact atgattttta 5040 actagtattt tttttctttc tttctctcct ctcctcctac tgttattact gttctctctt 5100 ccctatggta atctcatccc tccattcgta tcatatatct ttagtactca tgctttttcc 5160 aacattatgc gctccacact cctatccagt aggtggcgga aatgcaccat ttaagttggt 5220 ctgccaaccg ccattaaccc cataagaaga agaagaa 5257 // ID Gypsy-169-LTR_DR repbase; DNA; ZEB; 306 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-169-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-306 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 699-699 (2009). XX DR [1] (Consensus) XX SQ Sequence 306 BP; 95 A; 72 C; 77 G; 61 T; 1 other; tgtggagagg taagcgtgat cgaacaccca atcagggggg agagcatgct aggagcccgg 60 cggctaatca gcgggtcaat cagaaatgat aaataacacc tgtctattct agtgattggg 120 ccggagagac tcccttctta aggagaacgg gaggagcagt cggggagagg agagagcaca 180 cacacacata cacccctaca cgtgcagttt gctagtgctg tggaaaaaca aaaattaaaa 240 tatattgctt acctgatatc gccgaccctg tcttcttctt cccacacaac aacgaacwat 300 actaca 306 // ID Gypsy101-LTR_DR repbase; DNA; ZEB; 524 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy101-I_DR; KW Gypsy101-LTR_DR; Gypsy101_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-524 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1512-1512 (2008). XX DR Genome; chr19; Positions 23441414 23440891. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 524 BP; 108 A; 173 C; 92 G; 151 T; 0 other; tgtcaccgac tcggtcccag tcattcccct cgctgaccag caaaggtcac catctccgga 60 cttctaacca ttacgtcatc cttctcactc agatcccagc acacctgttc tccatctcac 120 ttatgaccca cgtaccacat ataagcagct cacatacaca ctccattgcg aagtcttgtt 180 tagccccggc cagcatttct gagcgttcta tcctgcctga tctcccgttt accacttcag 240 cccgttcctc gactctgctc tgtcttctgc ctgcccttga cctaaagcct gattacacgg 300 acactgattc tcgctgcctg cccctgacta aagcctgtat tacggactct gaaccacgct 360 gcctgccctt gactcaagcc tggtaatcac tctgcctctg ttatctgcta atcatcgttg 420 agttgtatgt tgtatgttcg cacaaccgtg cgaggtgttg atgtttaagt gtgacttaat 480 aaatactgca aaatggatcc ctccgtgtca gtctccccgt taca 524 // ID BEL24-LTR_DR repbase; DNA; ZEB; 524 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL24-I_DR; KW BEL24-LTR_DR; BEL24_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-524 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1396-1396 (2008). XX DR Genome; chr20; Positions 53247859 53248382. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 524 BP; 145 A; 88 C; 107 G; 184 T; 0 other; tgtaagtgac cactaaatca ttatatgcct ttaaataatt ttagagcatc agagcatgtc 60 acgtgtttcg gtagcctagg tgactaatag gttaatcagg ctagcaggag cgaagagaag 120 acacgttgct gtgcgattta tggaaaaagg gaggcttaat cctgcacaat ttatttaata 180 gaaccctttg ggaaagcgga gcgaggtatg tgaagttaat atttaagcca catgagttca 240 tttgtatgta tatgttaatc gtgagctctg ccttcttatt agttgtttca aactgtcgtt 300 atatacttca ttgtttcact ttcatatgtc gctctgtaca ttatatcttt atttgtcgta 360 ttattctgta tacaagttgt ttttattgat ttttgttgtt tctttgtgta gtttcacggt 420 gctacgcaac gaaagtttaa tggctctcgt cttgagcaaa ataaagaatc aagcacccgt 480 cagaaactct ctgtctcgtt catggaaacc aaagggcatc taca 524 // ID HATN1_DR repbase; DNA; ZEB; 427 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE HATN1_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HATN1_DR; nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-427 RA Kapitonov V.V.; RT "HATN1_DR, ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(4), 14-14 (2002). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed based on 40 copies CC present CC in sequenced DNA that represents ~1% of the genome. Several CC thousand CC copies of HATN1_DR are expected to populate the complete CC zebrafish genome. The element was active long time ago, its CC copies are ~10% divergent from the consensus sequence. The genome CC is populated by multiple subfamilies related to HATN1_DR. CC This element is characterized by 21-bp terminal inverted repeats CC and CC the 8-bp target site duplications. XX SQ Sequence 427 BP; 143 A; 62 C; 58 G; 163 T; 1 other; taggcccaca cggaatctgc gcgcgcagaa ttccgcagat ttttagccca tcattaattc 60 tgtttattta cttgtgtaaa tgtgtgtaaa tctatattta ttcagttttt aaattaattt 120 cagtaatatt attgactaat atgaaaatgt tcatctgatt tatttacaat acagtttgta 180 aagtaatatt ttctgtcttt tagtagatat attatatgag agacttgctt tgtttaccaa 240 ataaagtgaa tctaattgga tttgcatttt aaacattaaa taaaagttaa aaagatatta 300 ttttttattt catatattaa ggttttagtt atgatactcc caaaataatt ccgcakaaat 360 ccgcagattt ttaccaaaat tctcagcaga aatagcaaaa aatgtccgca gattccgtct 420 ggcccta 427 // ID Gypsy80-LTR_DR repbase; DNA; ZEB; 289 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy80-I_DR; KW Gypsy80-LTR_DR; Gypsy80_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-289 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1476-1476 (2008). XX DR Genome; chr12; Positions 29208041 29207753. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 289 BP; 101 A; 45 C; 55 G; 88 T; 0 other; tgtaacaggg ttatattgta tctctccact tgcatacatt aaaatactaa tgagaaggaa 60 ggggttaaca tttattcatt gttttatatg tagagcgccc cctctgttac ctaaaaagaa 120 aggaaattta ttgtagtatg aaggaggaag ccattgcacg aggctggtag attagatctg 180 cgctcatttt tctgttctaa ctgtttccac aataataaaa tcgataaaag atatatcaga 240 gtgaatattt atttacacaa cgcagaagag acgcaaaaga ctagttaca 289 // ID CR1-3_DR repbase; DNA; ZEB; 5068 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE CR1-3_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW AP endonuclease; CR1 clad; CR1-3_DR; ORF1; ORF2; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5068 RA Kapitonov V.V. and Jurka J.; RT "CR1-3_DR, a family of CR1-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(4), 8-8 (2002). XX DR [1] (Consensus) XX CC CR1-3_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish recently. The consensus sequence encodes CC two proteins, CR1-3_DR1p (position 254-1801) and CR1-3_DR2p CC (positions 1805-4717). CC The 971-aa CR1-3_DR2p protein is composed of AP endonuclease CC (aa positions 72-282) and reverse transcriptase (aa positions CC 543-788). The 516-aa CR1-3_DR1p protein is distantly related to CC proteins encoded by ORF1 in known CR1-like elements from chicken, CC pufferfish and turtle. CR1-3_DR copies are lesser than ~5% CC divergent CC from the consensus sequence. Approximately several hundred copies CC of CR1-3_DR are present in the zebrafish genome. CC CR1-3_DRp1: CC MALDHIDLESHCCTNCHKLLQKIAVLETKLLAIQPELPNYTGPVHRGASQHSAGKPYKSTPATVLRTEKQ CC IETVENQRELWHKQGARPKGTRGVRSQRSYAAALAFSTPNTARTQIQLQNRYEVLSNMGEESPNAVRQRS CC HDSAINSAWNRGSRPTRQRHSAPIATEIRTLIVGDSIIKNFRSRSTMTYCFPHATVSDVNKELENILRKH CC ETAKRIIIHVGKNDIRKEQSELLKRDFCKLLETVERLKIQPFISGPLPARGTNMFSRLLGLNAWLQKTCN CC KKGLNFIDNFNLFWNQRQLFTSDGLHPNKLGAKVLKDNILFSLHHPSAVYATELNCTHTPSKCPDDHRTS CC NQLLSGLVADESPKDRDNTTQPQQPPITDKLPSAPCTLSSTQADCDASEQHQVSALIDDPQENSQVNISQ CC QPETPDPQPLSPDTFSLSSCSPLLDFSKKMEELVCAGTKLSHSIAASPQLATKSQQATRPSLSPPRPTPR CC KGLRSLRQRQVPKNTSSLAGELKNCH CC CR1-3_DRp2: CC CVSGACLDNSVFIRCLQNKQEPSVSVAFSTQICVLLRDRKPKTLSDRIANHSNLVSIKCISETSVVKTTK CC TVKLALLNIRSLNNKSLLVNDFINTNCLDFMLLNETWLDDSCSAAVLNEAAPLNFDFLSVCRANRRGGGI CC AALFKDVYECKQVSFGDYLSFEYLSIALKGSPRILLIIIYRPPKYSPAFIEDFTELLSIVTSEYDYFSIA CC GDFNIHIDNPEINAVKELMTVFNTFDLTQHVQGPTHNRGHTLDLLITKGLHISSTVVKDVALSDHFCIFF CC DILITPAIKDRSVSVRKRCINENTSEQFMKAISLAPSISADSVDSLLDLFNSKIENVINDIAPVKVRKRT CC GKERVPWRNSRAVQMKKRQCRKAERMWRKTKLVVHYNIYKDSLQAFNVELKTARQTFFSNLINNNVNNAR CC TLFATVERLTNPPSRIPSELLSVSKCNEFANFFTDKINNIRKAISSSNQPSCVDIKQTQPQLEKSDIMSD CC FMAINGKILEEMVQIIKTSTCSLDTLPTSFFKTVFTCSEMDLLKVVNDSLLSGIFPNSLKTAVVKPLLKK CC SNLDNTLLSNYRPISNLPFIGKIIEKVVFNQVNKFLNFKGCLDNFQSGFRAHHSTESALIKIINDIRLNT CC DSGKITVLVLLDLSAAFDTVDHSILLDRLENWVGLSGTVLKWFRSYLEGRGYYVSIGDHRSRWTPMTCGV CC PQGSILAPLLFNLYMLPLSQIMRKNQISYHSYADDTQIYLALLPNDYSPIDTLCQCIDEIKNWMCQNFLQ CC LNKEKTEVIAFGNRDEVLKVNAYLGSKGQTTKNKVKNLGVTLESDLSFNSHVKAVSKSAYYHLKNIARIR CC FFVSSEDLEKLVHAFISSRVDYCNGLLTGLPKKTVRHLQLIQNAAARILTRTRKSEHITAVLRSLHWLPV CC TFRIDFKVLLLVYKSLNGQGPQYITDMLTEYKPNRSLRSLGSYKLEVPRVQSKQGESAFSH. XX SQ Sequence 5068 BP; 1585 A; 1119 C; 949 G; 1415 T; 0 other; cttcctgctt tgctactgtt cggacgtctt tgcctactgc tccgctcctt gcactcctgc 60 ttttttattt tataatctaa ttttaacttg agcctatcag cacagtgctc aaacaaaaca 120 gcttgaagat caacatcgaa tctacaaact ttctggagta caactttact tttatttaga 180 ggaattatcg tctaaacaat cctcccgcta ccagctgctt tcattcctga gacgggaaaa 240 cacggctgtg aaaatggctc tggatcatat tgatttggaa tctcactgct gcaccaactg 300 ccacaaactt ttacaaaaga ttgcagttct tgaaacaaag ttacttgcga tccaaccaga 360 actgccgaat tacacaggac cggttcatcg tggagcctca cagcattcag ccggtaagcc 420 ctataaatct actccagcta ctgttttgag aacagaaaaa cagattgaaa ctgttgaaaa 480 tcagcgtgaa ctatggcata aacaaggtgc gagaccaaag ggtactcgtg gggtcagatc 540 acaaaggtca tacgctgctg cattagcgtt ctctacccca aatacagctc ggactcaaat 600 acagcttcag aacagatatg aagttttgag taatatgggt gaagaatccc caaatgcagt 660 cagacagaga tcgcatgact cagcaattaa ctctgcatgg aacagaggct caaggccgac 720 tagacagcgg cattctgctc cgatcgcaac agagataaga acactgatcg ttggagactc 780 aataatcaag aatttcagga gcagatctac catgacatac tgctttcctc atgctacagt 840 ttctgatgta aacaaagaac ttgagaacat attaaggaaa catgagactg caaaacggat 900 tatcatccat gtggggaaga atgatattcg gaaggagcag tcagaactgc tcaagagaga 960 tttctgtaag cttttggaaa cagttgaaag attgaaaatt cagccgttca tcagtggacc 1020 actccctgca agagggacaa atatgttttc aaggctcctt ggtctaaatg catggctgca 1080 gaaaacatgt aacaagaaag gactgaattt catcgacaac ttcaatctct tctggaacca 1140 aagacagctg tttacatcag acggccttca tccaaacaag cttggtgcaa aagtgctgaa 1200 agacaacatc ctcttctccc tccatcatcc atcagctgtg tatgccactg aactgaattg 1260 cacacacaca cctagcaaat gtccggatga ccacaggact tcaaatcagc tcctgagtgg 1320 acttgtggct gacgaatcac ccaaggacag agataacacc acgcagccac aacaaccgcc 1380 gataacggac aaactgccgt ctgcaccctg cacactgagc tcaacacagg cagactgtga 1440 tgcatcagaa cagcatcaag tctcagcact cattgatgat ccccaggaaa acagccaggt 1500 aaacatatct cagcagccag aaaccccaga cccacagcct ctgtcaccgg acacgttctc 1560 attgtcatct tgctcgcctc tcttggattt ttcaaaaaag atggaggaac tggtgtgtgc 1620 tggaactaaa ctctcacact ccattgctgc aagcccccag ttagcaacca aaagtcagca 1680 agccacacgc ccatctctaa gcccacctcg ccctacacct aggaaaggcc tcaggtcttt 1740 gcgacagcgc caggtcccaa aaaacacctc atctttagct ggtgaactta aaaactgcca 1800 ttgatgtgtc tcgggggcct gcttagataa cagtgtcttt atcagatgtt tacagaacaa 1860 gcaggaaccc agtgtgtctg tagctttctc tacgcagata tgtgtcttat tacgtgacag 1920 aaaaccaaag actctttcag accgtatagc aaatcattca aatctggtgt ctattaaatg 1980 tatatctgag acttctgtag taaaaacgac taaaactgtt aagttagcac ttctaaacat 2040 cagatcactc aataataagt cacttttagt caatgatttt atcaacacaa attgccttga 2100 ttttatgctt ttaaatgaaa cttggctaga tgacagttgt agcgcagcag ttctgaatga 2160 agcagctcct ttaaactttg actttttgag tgtttgcaga gctaatagga gaggtggagg 2220 catcgctgcc ctgtttaaag atgtgtatga atgtaaacaa gtgtcatttg gtgactattt 2280 gtcttttgaa tatctgagta tagcactaaa agggtctcca cgcatcttac taatcattat 2340 ctacagacct ccaaaatatt ctccagcttt tattgaggat tttacagagc tgttatcaat 2400 agtaacatct gaatatgatt attttagtat tgctggggat tttaatattc acattgataa 2460 tccagaaatc aatgctgtaa aagaactgat gactgttttt aacacttttg atctgactca 2520 gcatgttcaa ggacccacac acaatcgtgg acacactctt gatctactta taactaaggg 2580 tttacacatt tcatcgactg ttgttaagga tgtggcacta tctgatcatt tctgtatttt 2640 ctttgatata ttgatcactc cagctattaa agacagatct gtctctgtca gaaagagatg 2700 cataaatgag aacactagtg agcagtttat gaaggccata tcgctagcac caagtatatc 2760 tgcagattct gttgattctc tccttgattt gtttaattct aaaattgaaa atgtcataaa 2820 tgacatcgct cctgttaaag ttaggaagag aactggcaaa gagagagtcc cttggagaaa 2880 ctcaagagca gtccaaatga agaaaagaca atgcaggaaa gctgagcgta tgtggagaaa 2940 gacgaaacta gtagtccatt ataatatcta taaagacagt cttcaggctt ttaatgtgga 3000 actaaaaact gctagacaga ctttcttttc aaaccttata aacaacaatg taaataatgc 3060 tcgcacactc tttgcaacag tagagaggct cacaaacccc cccagtcgga ttcccagtga 3120 gctcctctct gtaagcaaat gtaatgaatt tgctaatttc tttactgata agatcaataa 3180 tatcagaaag gcaatcagct catccaatca gccaagttgt gtcgatatca aacaaactca 3240 accacaactt gagaagtcag acattatgtc tgatttcatg gcaattaacg gtaaaatctt 3300 agaagagatg gtgcaaatta taaaaacatc aacctgcagt cttgacacgc tccccacatc 3360 attcttcaaa acggtgttta cctgttcaga aatggatctc ctaaaagtgg taaatgattc 3420 acttctctca gggatttttc caaactcact taaaactgca gttgttaaac ccctcttgaa 3480 gaagagcaac ctggataaca ccctattgag caattacagg ccaatctcaa atctcccttt 3540 cattggcaaa atcattgaaa aagttgtttt taaccaggtt aacaagttct taaacttcaa 3600 ggggtgttta gataattttc aatctggttt cagagcacat cacagtacag agagcgctct 3660 tataaagata atcaatgata tacgtctaaa cacagattca ggtaaaataa cagtgttggt 3720 attgctcgat ctcagtgctg catttgacac tgtcgatcac agcatacttc tggataggct 3780 ggaaaactgg gttgggctgt ctgggacagt cctcaaatgg ttcagatctt accttgaagg 3840 gagaggttac tatgtcagta taggtgatca taggtcgagg tggacaccca tgacatgtgg 3900 agtcccacaa ggctcgattc tggcacctct cctgttcaac ctttacatgc tccctctgag 3960 ccaaataatg agaaagaatc aaatctccta ccacagctat gctgatgaca ctcagatcta 4020 cctagcctta ctgcctaatg actacagccc cattgacacc ctctgccaat gtattgatga 4080 aattaaaaat tggatgtgcc aaaactttct tcagttaaac aaagagaaaa ctgaagtcat 4140 tgcgtttggg aacagagatg aggttctcaa ggtgaatgcg tacctaggct ctaaaggtca 4200 aacgacaaaa aataaggtca agaatcttgg tgtgactctt gagtcagatc tgagtttcaa 4260 cagtcatgtc aaagcagtca gtaaatcagc atactatcat ctcaaaaaca tagcaagaat 4320 cagattcttt gtttctagtg aagatttaga gaaacttgtt catgctttta tcagcagcag 4380 ggtggattac tgtaatggac tcctcactgg ccttcccaaa aagacagtca gacacttgca 4440 gctcattcag aatgctgcag ccagaattct gaccagaacc agaaaatcag agcacatcac 4500 agctgtcctc aggtctttac actggctgcc agttacattc agaatagatt ttaaagtatt 4560 attactggtc tataaatcac taaatggcca aggacctcaa tacattacag atatgctcac 4620 tgaatacaaa cctaacagat cactcagatc tttaggatca tataaactag aagttccaag 4680 agttcagtct aagcagggtg aatctgcctt cagccactaa cagcgcccct tgctgctgga 4740 atcagcttcc agaaatgatc agatgtgctc caacattagg cacattcaaa tcaagactga 4800 aaacacatct gtttagctgt gcctttactg aatgagcact gtgctgcgtc cgacagatcg 4860 cactattatg tttttcgttt tctttttcat tcttttataa cctattttaa ctaattttaa 4920 tttgttttta tctgttttta ataattttta ttgtctgcat tttattttcc taaacttgtc 4980 ttttttattc ctgtttatgt aaagcacttt gaattgccac tgtgtatgaa atgtgctata 5040 taaataaact tgccttgcct tgccttgc 5068 // ID L1-14_DR repbase; DNA; ZEB; 3454 BP. XX AC . XX DT 20-FEB-2009 (Rel. 14.02, Created) DT 20-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE L1-14_DR is a non-LTR retrotransposon from the L1 clad. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-14_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3454 RA Bao W. and Jurka J.; RT "L1 non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 552-552 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1..588 FT /product="L1-14_DR_1p" FT /translation="KIIKGPCRWRFHPRWLHDPKFMEFIGTQIDLFFKTNL FT KETSASIRWEAFKAYIRGQMISYTCSVNRKRKMELQNLEERIRKLEEDIYS FT DLSNSNILKQELLILRAKYNEISATKAEASLIRLKQTFYEYGEKASRLLSW FT QIKQLQSSKAIHNIYTTAGNLTSDPDEINETFFQILPNAIQYRIIRMQHGP FT NFIFR*" FT CDS 678..3065 FT /product="L1-14_DR_2p" FT /translation="MKSGKTAGLDGIPIDIYKAFKDKLAVPLLEMYEESML FT EGCLPPSLRAALITLIPKPHKPLDRCESYRPISLMNSDAKIIAKILALRLE FT AHLPSLIGMDQNGFVRNRQAFHNIRRVLNIVHEMDGSKDTGILSLDAEKAF FT DRVEWSYLFDILSRFGCGQNFCQWVKTLYQSPTAEVITNNLISKPFNLYRG FT TRQGCPLSPLLFVMTLEPLAIAIRTHPSIKGIEIAGVEHRISLYADDIVLF FT LTKLKDSIPNLNKLINKFGKFSGYKVNYSKSSILFLNEQERQNPVIQHPYE FT TSVDGFGYLGIKVTPQVKNLISANYTPLITQVNESLNRWMSLPLTTIGRVN FT TIKMNILPKFLYLFQSIPLTPPPSFFSNMKKAFIKFIWNNRRSRLRLLLLY FT LPYERGGLRLPNLFWYFKAAQLRSACYWFAGSPSIPWIQIERKCTGNLSLD FT TYLYSDSLKKLKQNTSNPFVRATILSWHESQVELGETPVLSKFSPIWGNAL FT FVPGKSDKGFKLWFQKGLQKISDLYEDNILMSFETLKERYKIPQSHFFKYL FT QIRNFIRSKNHSYQCPTLSSLEEIAVKDPSAKYKISTLYSKFVNASKESTN FT HKRLAWIEDFQIDITENEWEESCLQAQTLSINTRFKLIQYNWVMRTYITPE FT MLNKFNRNILDICGKCLTYKGTLFHCVWQCEHLQFFWKEIIGVISKTIGKE FT IPVCPKICILGLFPPALSLKSYEQKFINLCLLHAKRLIAIHWKSTHAPSLS FT LWIKEMSSCIAMERLTYSIKQKMHIFYKIWDFFLXFLNNSALTV*" XX SQ Sequence 3454 BP; 1171 A; 585 C; 564 G; 1133 T; 1 other; aaaattataa aagggccttg tagatggaga tttcatccaa gatggcttca tgatccaaaa 60 tttatggagt tcattggtac tcaaattgat ttatttttta aaactaatct gaaagaaaca 120 tctgcatcta taagatggga agcctttaag gcttatataa ggggccaaat gataagctac 180 acatgctctg tcaatagaaa aaggaaaatg gaattacaaa atttagaaga aagaattcgg 240 aaattagaag aagatatata ttcagatttg tcaaattcca acatacttaa acaggaactg 300 cttattctaa gagcaaaata taatgaaata tctgcaacaa aagctgaagc cagtttaatc 360 cgtcttaaac aaacatttta tgagtatgga gaaaaggcaa gtagactttt gtcttggcag 420 ataaaacaat tacaatcaag taaagctata cataatattt atacaacagc tggtaatctg 480 acttcagacc cagatgagat taatgaaact tttttccaaa ttcttccaaa tgctatacaa 540 tacagaatca tcagaatgca gcatggacca aactttattt ttagatgatc ttttgattcc 600 ttctataagc attgaagcca aaaaaagaac tggaggaacc tctaactagt aaagaaattt 660 taaatgctat tgtgaatatg aaaagtggta aaacagctgg actagacggg ataccaattg 720 atatatataa agcctttaaa gataaattag cagttccttt gttagaaatg tatgaagagt 780 cgatgttgga gggctgtctt cctccttctc taagagctgc attaataaca ctcattccta 840 aaccccataa acccttagac agatgtgaat catataggcc aatttcgtta atgaactccg 900 atgccaaaat tatagccaag attttagccc ttcgattgga agctcatcta ccgtccctaa 960 taggaatgga ccaaaatggc tttgttcgaa acagacaggc ctttcataac atcaggcgag 1020 tactaaatat cgttcatgag atggatggct ctaaagatac aggtatttta tcgttagatg 1080 cagaaaaagc atttgacaga gtcgaatggt cctatttatt tgacatactc agccgttttg 1140 gatgtggaca gaatttttgc cagtgggtga agaccttgta tcaaagtcca actgcagaag 1200 taattaccaa taacttaatc tccaaaccat ttaatctata tcgtggcaca cgtcaaggct 1260 gtccactttc tcctttatta tttgtaatga ctttggaacc tttggcaata gcaattagaa 1320 ctcacccatc tataaaggga attgaaattg caggtgtaga gcatcggata tctctctacg 1380 cagacgatat tgttttattt ttaacaaaac ttaaagattc aataccaaat cttaacaaat 1440 tgattaataa atttggaaaa ttttcaggtt acaaagttaa ctattcaaag tcatctatcc 1500 tctttcttaa tgaacaggag agacaaaatc cagtaataca acacccttat gaaacttctg 1560 tagacggatt tggatatttg ggaatcaaag ttactccaca agtaaaaaat cttatctcag 1620 ccaattatac tcctctgata actcaggtaa acgaaagttt gaacagatgg atgtctcttc 1680 ctctcactac tattggaagg gtgaatacca ttaaaatgaa tattctccct aaatttctat 1740 atttgtttca gtcaatcccc ctgacccctc ctccttcctt tttctcaaat atgaaaaagg 1800 catttattaa atttatttgg aataatagac gctcaaggct tcgcttatta ctcctgtatc 1860 ttccttatga acgaggcggg ctgcgactcc ctaatctttt ttggtatttt aaggcagctc 1920 agctaagatc tgcatgttat tggtttgctg gctcacctag cattccatgg atacagattg 1980 aaagaaaatg tacaggcaat ttgagtcttg acacatattt atactctgat tctttaaaga 2040 aactcaaaca aaatacatct aacccatttg tgagagccac tattttatca tggcatgagt 2100 cacaggttga gctgggagaa accccggtac tttctaaatt ctctcccatt tggggtaatg 2160 cattattcgt accagggaaa tcagacaagg gttttaagct gtggttccag aaaggcttgc 2220 aaaagatatc agatctttat gaagataata tattgatgtc ctttgaaacc ttaaaggaaa 2280 gatataaaat tcctcagtct cattttttta agtatcttca gatccgtaat tttataagat 2340 ctaaaaatca ttcatatcag tgtccgaccc taagtagttt agaagaaatt gcagttaaag 2400 atcccagtgc caaatacaaa atttctacac tatatagcaa atttgttaat gcttccaaag 2460 aatctacaaa tcataaaaga ctggcttgga ttgaagactt tcaaatagat attacagaaa 2520 atgaatggga agaaagttgt cttcaagcgc aaacattatc aatcaatacc cgttttaaat 2580 taatccaata taattgggta atgagaacat atataacacc agaaatgtta aataaattta 2640 atagaaatat tctggacatt tgtggaaaat gtttgactta caagggcaca ctgttccatt 2700 gtgtttggca atgtgagcac ttgcaattct tttggaaaga aataattggt gtaatctcta 2760 aaactattgg taaagaaatc cctgtttgcc ctaaaatctg tatccttggt cttttcccac 2820 ctgccttatc gctgaagtca tatgaacaga agtttataaa tctgtgtcta ttacatgcga 2880 agcgcctgat tgccattcac tggaaaagta ctcatgctcc atctttgtca ctctggatta 2940 aggaaatgtc ttcctgtata gcaatggaaa ggttgactta ttcaatcaaa caaaaaatgc 3000 acatattcta caagatttgg gacttcttcc ttyatttttt aaataacagt gcactgactg 3060 tttagtaatc ccaaacaatg tgactgttaa aatacactgt gtttgtaatg actaacttta 3120 atatactttc actggaatgt atatgttgaa atgtcaattt tttaatggta tttatttttt 3180 atatattttt aatttttaac aaaattgccc tttttatttt tatctttaag tttattttag 3240 ttaatcttta ttttctattt atttgttttt gtattacttt cctcgcatga gaatgcattt 3300 ttttaattct attcctttct ttttttaatt tttaattttt attttgaacg tcgggatggg 3360 gcgggagttc tttgattggt atttggttac taatacagta agaatgtatc acacctttct 3420 gaaacaataa aagtgttaaa aaaaaaaaaa aaaa 3454 // ID hAT-3N1B_DR repbase; DNA; ZEB; 910 BP. XX AC . XX DT 30-SEP-2005 (Rel. 10.09, Created) DT 30-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE hAT-3N1B_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; hAT-3_DR; hAT-3N1A_DR; hAT-3N1B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-910 RA Kapitonov V.V. and Jurka J.; RT "hAT-3N1_DR, a family of nonautonomous hAT DNA transposons from RT zebrafish."; RL Repbase Reports 5(9), 256-256 (2005). XX DR [1] (Consensus) XX CC hAT-3N1B_DR is a nonautonomous DNA transposon that belongs to the CC hAT superfamily. It is a subfamily of the hAT-3N1_DR family. This CC element is a deletion derivative of the hAT-3_DR autonomous CC transposon. XX SQ Sequence 910 BP; 285 A; 170 C; 185 G; 270 T; 0 other; caggggtcag caacccgcgg ctctagagcc gcatgcggct ctttagcgct gcccttgtgg 60 ctccctggag ctttttcaaa aatgtttgaa aatgcaaaaa gatgggggag gttaatatat 120 ttttttgttt taatatgatt tctataggag gacaaacatt tttaacgttt tcaaatgctg 180 taaaagtgtg tagaattaat atttaatttc aacatttctg tcaacgaaga tttgcgtcat 240 agcctgcgac acacgtttct atcagcaggg cgggatgcca ggcaggtggc tgttgtaaac 300 aaaccgggag ataataaagc atggaaaacc gttcacagat ggtgactaca tgaaggagtc 360 attcatcaac atattagaac acctatttgc agacttcaaa aacaaaacca agataataca 420 aaaaattaaa gatatacctc tctccgctaa gacagtgaag gaaagggcta ttaaaatggc 480 aggtaacatc accgatcagc aaaccaagga cattaattca gtgccagcat actcaattgc 540 ctgtgatgtg accgatattc aatgcgcttt tatgcaggta tgtgaactct gatgggccgc 600 aagaagaaat gatcaaatta ataccactga aagaccagag gctgtgctga agtgtttaaa 660 tgacaacgaa ataaacacca accacctgat ttcagtggct atttaatcat gaaggctcat 720 tatgtatttg tagccaactt agtcattttt atagtaggct aatatagcta atatagatac 780 atacagcatg tgtttccttc attataaggc ttatataagg ctttacattt tttgcggctc 840 cagacatatt tgttttttgt tttttttggt ccaatatggc tctttcaaca ttttgggttg 900 ccgacccctg 910 // ID Helitron-1_DR repbase; DNA; ZEB; 26300 BP. XX AC CAAK03016851; XX DT 30-NOV-2007 (Rel. 12.11, Created) DT 30-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE A family of autonomous Helitrons - a fossilized copy. XX KW Helitron; DNA transposon; Transposable Element; Helitron-1N1_DR; KW Helitron-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-26300 RA Kapitonov V.V. and Jurka J.; RT "Helitrons on a roll: eukaryotic rolling-circle transposons."; RL Trends Genet 23(10), 522-529 (2007). XX RN [2] RP 1-26300 RA Kapitonov V.V. and Jurka J.; RT "RPA70-encoding Helitrons in zebrafish."; RL Repbase Reports 7(11), 1178-1178 (2007). XX DR Genbank; CAAK03016851; Positions 27552 1253. XX CC This is a young family of autonomous Helitrons. It encodes a CC 2505-aa RepHel protein (pos. 14055-21569) and a ssDNA-binding CC RPA70-like protein (3 exons at pos. 3828-7844). The RepHel CC protein is composed of the OTU cysteine protease (pos. 232-347), CC Rep, Hel, and AP endonuclease (pos. 2291-2441) domains. CC Helitron-1_DR contains also copies of several different CC transposable elements (they appear old). This transposon is CC inserted into a long array of (TAA)n microsatellite. CC Helitron-1N1_DR is a nonautonomous deletion derivate of CC Helitron-1_DR that was transposed by the Helitron-1_DR-encoded CC proteins. The genome contains several families of Helitrons CC coding for both RPA70 and RepHel proteins. XX FH Key Location/Qualifiers FT CDS 14055..21569 FT /product="Helitron-1_DR_1p" FT /note="RepHel." FT /translation="MSIEEALQRALVQSDACLLNIRDNICAVIKVESEFAV FT VDSHSRNGKGMAEYDNGRSIVVYCPDIHMLFAHVVNLAMSLNAQGTPFEVT FT GLKATIISKSTCAARGYKPGQQCIITHVSPAESDVQCCILDSVKQRDLNKP FT AFLEKVIQMQSDSSKKLQQIESKLTHVSKFPGNILMSDVIIGNTENTQFNF FT KPLTWEEQNALCQRLHLINERVHVQVANMCSLGVPCKTKSIKGDGNCLFRS FT IADALCGNQEKHLIIRRAVVKHLEANSSQFLRYIGEEYRSVKDYLTQSKMY FT YSGSWGSHVELFCAANLLKINLMTFNVDRWNAYVPNNELFTENSIYLKHCN FT GNHYEVVTCVKNGSQENVCAEACNIGVFHQSITCSLRKRKLNYTPAQCKRQ FT KEHDRYHSDSCFRQKKIENLKFKYCEDSNYKQRKIQNTKTKYCEDSDYRQR FT RIDNSKIRYCEDSDYRQKKIDNSKIKYCEDSDYRQRKIDKSKIKYNEDSDY FT KEKKLHNLQCKYRNDTKLQQLKCENSKRKYSRNVSFQTFVRQYSQTKYKSN FT KVFRESVCEYSKNKYHRNPTFKASVCEYSKNKYHKNPSFKARVCDYSKNKY FT HKNPSFKARVCDYSKNKYHKNPSFKARVCDYSKNKYHKNPSFKARVCDYSK FT NKYHKNPSFKARVCDYSKNKYHKNPLFKARVCEYSKNKYHKNPSFKARVCE FT YSKNKYHLNPSFKARVCEYSKNKYCRNMNFKAKVCEYSKKKYHEDEMFRMR FT KIKKRSEDYVKQRERQKDINFAINNFCHEVSSGPEFVCCVCHRLLFRKQVL FT ECRKDCYDIRGQQISDLAETCITTKYLHKCSSECENLCHLSGYAASKLWIC FT YTCHRKILGGKLPEESIANNMHLVDIPKELKGLNSLEGHLIALNIPFMKLL FT CLPRGKQKGCHGPVVCVPVNTTDVSNILPRNECDDHMIRIKLKRKLTYKGH FT YEYKYVNTDHVRHALSYLVRHNKWYKDVEFNEQWVNSMNETDDLDKDANDF FT VVDEHHETDKNEDDKEEDITYIKEQSGLLSDTCLQPVDLGSEIIDQHFHDV FT LNIAPSEGNSPVRLLSDKANEAKCFPILYPTGGPTFHDERQEKITLSRYLN FT ARILNADGRFAQSTDFIFYAQYLSEVDQIVSNVSIAMRKGSEQDCLNVTSN FT MLTNCDSLQKILHYDEGYKFLRPVRGTPPYWMSTQKDLFALIRQLGIPTFF FT ASFSSADLRWPEMINTILKQEGKQINADDLDWSEKCGLLRRNPVTAARMFD FT HRWHCFLKNVIMSPANPIGKIKDYFYRVEFQQRGSPHVHCLFWVENAPKLN FT DNDSNNDHVVADFIDTYITCETPTESDNVLYETVNSVQKHSTRHSKTCRKK FT NTVCRFNFPRPPSSRTFITRTYNADNLKGKEGESLANKIIKKVKGGLNSGV FT NFDSVDAFFSSIGINQTMFEQAYNKCSKKKTIVLKRNPKDVWVNQYNRDLL FT QAWQGNMDIQYITDAYSVVVYILTYITKAEQEMGLLLQRAQNEALNGNLEA FT KTSLKRLGTIYLHNREISAQESVYRLTGMHLKECSRKVQFIPVGQNPVKMS FT LPLHLLRNQAENVQSDDKSSFWMRNVIDRYKNRPQKEPLKDLCLASFCSEY FT RILTKSEVMSQNKKAENKIIKLNNQNGYVKRRTRTEPAVVRYPRFSPTKDA FT EKYYHSLLQLFLPHYNDSDLKPIKYDTYEEFYNTGIIQYSSEMKEVKYIVD FT ENRELFEKESDKIDKAKQLLDQGVDLEDAWAEICPETERQRHECNELMKDK FT ILPDEDDHQELFPDLIGNPQTGCTIESNHTTMPRNDALLLLRSLNEQQSAV FT FYKVRNWCLQKILGKNPDPFRIFLTGGAGTGKSHLIKAIHYETTRLLSQLC FT ENPDDIVVLLTASTGVAAYNIGAATIHNTFSIGSNVRLPYQPLSDDKINTL FT RTKLGSLQILIIDEVSMVDHRLLSYIHGRLRQIKQTGEYLFGKVSLFAVGD FT FYQLPAVKGTSLYADTKGVNLWESNFEVAELSKVVRQQDPTFAEMLNRLRV FT RKKNESISPTDIATLRNCETGDTCNNLHIYATNAEVDKYNIERLHECCQQI FT ISINAQDYIKNAKTGRMERKVGFHTKVFNSCLSACVSLGIGARVMLKKNID FT VADGLVNGAFGTVVHITQSQKNDDDFPTAIHVEFDNPNVGKIQRSKTRQRF FT SANSTLIKVEEDQVTNDGGIRRQFPLRLAYACTVHKVQGLTVDKAVVSLKK FT IFTAGQAYVALSRVRSLSGLIIEDFQESVIFCNEKIELTMKSMQKLVLDKY FT NSTKSVGTFRIALLNIQSLRAHFQDLLAHTVLMNADCICLTETWLYENDEL FT ELQIPEFVFNHNPRSKCYDNKCVVSAELKHQRGGGVGMYCSEKIECHIFTP FT ELWNLECLYFTVPHVNLKAAVLYRPSSYKIDVFRQHILHIIEELEKHSGQK FT IIMGDFNDDIFTSSMTMKLMEQHGYTQHVQRPTTEKGTLIDHVYTKQMEHI FT IVDVVQTYYSFHQAILISFL" FT CDS join(3828..4015,4677..4872,7107..7844) FT /product="Helitron-1_DR_2p" FT /note="RPA70." FT /translation="MADSKRKCLEVEGYVHDVSEVLQGKSSNVRYFTALLQ FT EANQNSRVVVFDVQKHNSFQCAARDRSPVKLSFVEHSPSKQHNNTNDLLVK FT SGTKVSVLRRLEFAREDGLTSRAPVRSLSDIQRDVREYQRVTVKVKVLRML FT QDKESVVRGQKLRRKSVVVADSSDLILLSLWGDHVMTIGEWYILTNVSVRV FT FKGMTSLSTTMQTSLSVVENSGPAKEFVEEDVTSVVGEIVKAEVKVDHFCP FT KHHLLENINVATHMTRCTKCDSFCKTSKSRVEMRGHIGITVDFVQRRIPLD FT DAEIRELLNLNGRENLDSHALAAKLLVHDDLKVEMWREFITRVLFVKKDDA FT VDAPKDGSDEAAAAVSNGEKDEFDNLDSLFD" XX SQ Sequence 26300 BP; 8667 A; 4063 C; 4959 G; 8611 T; 0 other; atatagctgc aagcagcaat taaggggcca agcactattg gccataaggt ggcgctgctc 60 cagacgtttt tgagtacttt cagggcatgg ggttgaagat gcataccatg ttttgtaatg 120 atatgacaat gtgttggtaa aatatagcat ttttggccaa aaatcgaaat gggcgacgcc 180 caaaatggct ggcctgtgaa aatcagacat cattcgactc gacatgctct gccggatgta 240 atgagaccag tttcatgagt ttcggacgaa aggttgagac gttataagcc aaaaagcaag 300 ttttttgtat ctccggacca ctagggggca gtgtgccgaa actctgcatg taacctcaga 360 ccctagttgt cataacacac accaagtttg gtgtgaatcg ctgaatgcat tacggagata 420 tcgtctcaag tccattttcg caagtccttt gttaaatttg atcgcaagtt aaacgaaaac 480 ggttggtcaa atcaacttga attccataac ttttggttgg catggtctgt agatcatgtg 540 attcaatttt ggtgaaaatc ggaaaaacgg cctaggacga gttcgatcaa ataggttttt 600 caaggtggtg ctgctccaga agtttttgag tactttcagg gcatggggtt gaagatgcat 660 accatatttt gtaatcatat gtgaatgtgt tggtaaaata tagcattttt aaccaaaaat 720 caaaatgggt gacggccaaa atggctaacc tgtgaaattc agacatcatt tgattggaca 780 tgctctgtca tagcttacag tatcacaaat aaaaagagac cacaacgaaa taaaaataag 840 taaaacctaa acaaagctga tgatttgact gcatgtctca gtcattccta tcaatcaggc 900 acttcccaga agttttactg acagagaatt tagataatgg ctgcgtggtt ttcatctcac 960 tgcacacttt tttcagttac tggctgttat tggctcataa aattgtcaat catcactcca 1020 tgcacctacc ctttgcacgg agtcaatata cagctaataa caacttaagt taacgcagag 1080 cgcggcttgt ttgcaaaaag taaacaaagg agaatatttc tgaggagagc tgatacttgc 1140 atcgcgtgga tgaagcttta ctttgggttt ttctaacggt gagtaactta ttatactgtc 1200 attgcttgtt gtttacgtct caggtcttct gtgtttagcc tgcttgtgcc ttaaagtgtg 1260 aatgtgtctc tgttagccac aatctttcag taatggttgt cattattatg cagcatgttg 1320 gccatttatc tgaatggctg ctgttagatc tgtaagctat gtgtataaat taacatttgt 1380 ataaatgggc atgagagtaa atcctgtttg ctactcaaag ggaaaacaca attgtttact 1440 atagaatcct aattctgttg ttttttaggt ttttttgtca tgtagctgaa tctctgcacg 1500 agtgaagaag tgatgtgaag ttggagatgg agtgaagaat gttctgttag tgactgagag 1560 cagaagaaca acttgtataa gacttttttg ttgcttatgc actagactta ttttatacaa 1620 tgtattactt gggctcaaat aaatcattca tgctgtacgt tttagcttat tatttcacaa 1680 taaaaaaaat tagtgtagtg ttaatccgtc agtgtatgtg tggtcatcag actgcttttc 1740 cacaaacaag gagagggaaa aaaaaatgtc aagacatggt ctcacaaaaa taattccaga 1800 aatttcattg tgcatgatat tggcatcaaa tgtaaaccat atactcagga gatgtgagct 1860 tttgagcaat tacatttttt tgtgttgatt caggactgat ttcacatatt tgtttatgaa 1920 ataataaaat gtttagtgtg gggctatttt tttgcattac ttaagcatcc ataacatttt 1980 ttcttttaga gcattataaa cattggatga tggataataa agggcagagt gtcttctttc 2040 aatgatacat aaactgaatt tagaccaaat tattcagcca ctgtaaatgt tttagctttg 2100 ggatgacatg tttaggagaa agtgcctctc acggtgaggg agagaaggac atcgacatag 2160 cctcagaaat ttttttacag aaatctcatt ttgcatgcta ttgtcatcaa attaaaaata 2220 taaacttatg aaatacttgg ctttcaagtc atgtttttgt aggacattac attaatgttt 2280 ttctgtgttt gtatatgaaa aagtaggtaa tactgaacac aatatttttt tttcctcata 2340 atttcataat gtataacatt ttgtattttt ttaacatgta acttctcctt ttactacagt 2400 aaaactcaaa gggttttctt tccaatgata cataatttgt gtttgtagcc tactggagtc 2460 aggatctgtt actatttaaa gttaggtagg tgtaaaaccc caaaattgag tgctggctaa 2520 caggttaaga cgatttaatt ttgaatcaaa tcgagaaatc gcgatttaaa aaaaataaaa 2580 aaaataaaat atatatatat atatatatat atatatatat atatatatat atatatatat 2640 atatatatat atatatatat atataatata ttataatggc accgatacgc ttggggtcac 2700 tctccgtatg aagcaggaag cacaccagaa gcgcctctca gtgccgtgcc gcacagcacc 2760 acgccatgga ttttagaatt ctaaacggtt tctatacaca caccgacgct gcacagcaac 2820 gattcaagaa acagttctca tttctgacgg gccacaaagc gccatctgat ttgtttaatg 2880 ttaaatatca tgtgaatgtg tgcgtttggt gtgtgatagt ttaaactgtc atgtccgcgt 2940 cgtgtcgtgg cgcttctggt gtgtgacttg cttgactgcc ttttaaagcg tgaagtcttg 3000 taggatttta cttgttgcat gtcagtgtgg attgtcaaga catatttcct catcaagtcg 3060 ataaactcga tctcaacatg tatgtggaaa actggtcgat cttttggttt gaacacgagt 3120 agaagtgagg gaccgtagca gtgaaaatga aagtacctgc ccccatgacg tcattaagcg 3180 gacctagttg aaaaccagac agatcaggca gcatacagag agtggagcaa agcgcgtcaa 3240 atgatcggta tgcgcactgc gcagacgcgg acgtcgacgc gtatatgtgg tgagccaata 3300 ggagggcgcg attgtggatt tgaacgatac cgggacaggt ataagtgttt ggagttgcgg 3360 ttggcgttgg actttcgttc gtcattggac ttgaggagaa gacgaagcaa tttcaacagc 3420 ggcaacaata ggagatctta gtacggtatg tttttatact ataagtattg tttcggtggt 3480 ggcaaaatgt cgtgtaatgt ttatttgaaa taacaatgtt tgagaccctg tagtgaacgt 3540 gtaattaatg gatgttgccg tctttgttgt agacgatgga tatactgttt tgcgagatgt 3600 gtagctaagt taagtgaaat tttaaacagc tggtttatcc aactgttgta taggagcggc 3660 gtacagtgta aaatggcgta ttgttgtata gtacagattt atttgtaatg gtgcatgttg 3720 tagtttagct gaagaagcct ttgcaaatgg cgacctataa gaataatgtg ctggatatag 3780 aaggatatgg gcaagatatg atgtgttttc gtgtgttcca cagagtaatg gctgactcaa 3840 agaggaagtg tttggaggta gagggttacg tacatgacgt gtctgaagtc ctacaaggga 3900 aatcaagcaa tgtgcgttat tttacggcct tgctgcagga ggccaaccag aacagcagag 3960 tcgttgtgtt cgacgtgcaa aagcacaaca gttttcagtg tgctgcaaga gaccggtgag 4020 tagcttttaa aggatcattc agtaaaacag aaaattgtag taaagtttta tgtatttatt 4080 tgttattgta taattacact taagtggttc taaatcttta ttaatgtttc tgttaaagag 4140 aaaataatat ttaaaatgtt agaatcatat agccatttac tttcgtagtt aaaggtaaaa 4200 aaagtgaatg tttaccatct ggtaaatgtt tttttttttt ttttttttcc tttgcttaag 4260 ttaaataggt ttgtgccaag gggatatgta accgtttact gtgtagtttc gtttttttag 4320 ggcaattgtt ttcaggtttg tgttgtgtat ttatgtacaa ttgttttttt gtaatttttg 4380 catttagtaa aaaaattgca ttattttaat tattgtttag tggtacatgc aagtctttaa 4440 gcatagaaaa aatgcattgt gtctctttct ttgcgtataa acacttgaag ttcagtaaaa 4500 gtttttgata tttttacaaa atatttgtaa ttgaaaaagt gttgttcaat gggtttaaaa 4560 gtgataaaat gttaatgttt ggttatcctg tattaaagtg ctaattctta tcgagtaaag 4620 agggtataca gtgcattttc agcaagaata ataactgaaa atgttgtttt ctgcagctct 4680 ccagtcaagt tgtcttttgt ggagcattct ccctccaaac agcacaacaa cactaatgat 4740 ctgttggtca aatctgggac aaaggtgtct gttctccggc gactggaatt tgcacgggaa 4800 gatggcttga cttcaagagc ccctgtgagg agtttgagtg acatacagcg ggatgtcaga 4860 gagtaccaga gggtaggtta aaaaaaaaaa aagttttaaa ggatgtggta gtttctgtat 4920 cgtctaaggc cccatttaca ctgatgcgta gtagttttaa aacgattctt agaatgaaaa 4980 tattctatag gtgtttcatt ttgcatccct gaataggtct ctgtgcaaac tatatgatgg 5040 aaaatgtaca ttatgtggcc aaacctgcac tagcattaaa gttggttgag tgtattctga 5100 catctgagct gcgggctgtt ggctagtgca tctgtgatct gctgctgaat gtttgacgtg 5160 atgtttagtt tgtatgtgta agagtagagc tgggcgatat ggcaaaaaaa ttatcatgat 5220 gattttttca tattagtcta tcgatattta tctcaaatgt aaaatcttta tatttatcaa 5280 ttatataaga ttttcagcat gccaatggct ttgtgcaact gatttaacac atgtagtgaa 5340 atgtgttcgc tgtcaacaat ataaataata aacaaaagaa caatattaat tcagcatttg 5400 catgtaagtt ttcaacaacc aaagacatta tgtacagatt attggatatt gtttatcatc 5460 agaaaaaaaa aataatgata ataaaaaccc gatttggaac aacgtgagtg aatagtcaca 5520 gaatattgag ttttgaatga actttccttt taacggtgag tggtgaattt agggtgaaat 5580 acttttaatg tattttattt agggatgcta atgattactt gatttattgg tggtcaacaa 5640 cgcttttaat tgacagacct tatcgatgac taagcatgga catgtaaagg gttagtgatg 5700 ctttgtgctg agactcatcc acactttaca taatcatgtg tgtgcagttt accttatgcg 5760 gtcaccctct tgaaatctca tattgtggat cgcataaaac cttatgtgca tatctatgat 5820 tatatctgtg cgtgttaacc aaatttgtct aacagaatag catgaatagg tgtatgcata 5880 tttcatttta caaatcgcac atcagaacaa aaacatgcgg ggagccagcc tttaaatata 5940 taaaggttct tttattgtcg atgagagaca atagtggtat tacagatttc aaaacaaata 6000 ttttaatatt aaaaaggaaa cataagcttg acatcaaagc atgttattgt cttaaatacc 6060 cttcaaatac gcatagtgcg taaatattat cgaaccgtca ttgttttatt taaaaatttt 6120 atatcgtgat cattatctac atccagttat tgcccagccc tatgtaagat gttgtgtttt 6180 tttttttttt ttttttttta ttattagtat tattattgaa tgattgttct tttgtattat 6240 tcttaggtaa tatgttttgc ctgagggcaa aatattatgc aatattaaat tataaccata 6300 acatacactg catataaggg atttgcttgg cttaatttcc tatattgtat aaagtatact 6360 tttataatgg ccattattga tttattaaaa ctgattttct gcaaaagaga cggctgtatt 6420 caccatgcat tttctcagat aatatttatt gtaggcaact agtgttcatt gttaagatag 6480 tgataagcta gtgtgatgca aatgacattc agtagcctac aatgtgtttg tagatatact 6540 ctgcagtgtt tctctgtaag ccttgcaaag tgtgaactct gaaaaggatg ctagtttaaa 6600 ccttgtgtac ttgatattaa gaaactgcct gaattaggta gagacatgca catgttttca 6660 ggtgtgctcg ttttcctaca tccacagtga cgtgtagtac cagcattttt aaatgtacac 6720 tgcctcttca gtgtaaaaat atatttcagg ttgaaaactc ttgtagtgtg gatggaaagt 6780 gtaaccattg caaaagttat gtgttttaaa agtaaagcgt attcatgtaa atggtgtaaa 6840 aattgagatc cttaacaaac aaagaaactt tattgctttg ttggtaacaa gtcttctaag 6900 aaatagctgt acaaaagagt tggcatcaga cagtgtagaa tattgtgtag agatgaatac 6960 aattgattca ttttgtttta agtgtttgta actgttgcca tttacggtgt attgcacctg 7020 agattccact tttgaaatga tgctagaaaa ttttaaaggc agaaagtgtg tataatattg 7080 agtgaatttt tttttgtgtt tttcaggtca ctgtgaaagt caaagtgtta cgaatgctgc 7140 aggacaaaga aagtgtggtg cggggacaga aactgcgcag gaagagtgtt gttgtggctg 7200 acagcagtga ccttattttg ctgagtcttt ggggtgatca tgtaatgaca ataggtgaat 7260 ggtatatttt gacgaatgtg tctgtaagag tgtttaaggg gatgacatca ttaagtacaa 7320 cgatgcaaac tagcttaagt gtggttgaaa attctgggcc cgcaaaagaa tttgttgagg 7380 aagatgtaac ttcagtcgtc ggagagatag ttaaggcaga agtcaaagta gaccattttt 7440 gtccaaaaca ccacctgttg gaaaacatta atgttgccac acacatgacc cgctgcacaa 7500 aatgtgattc tttttgtaag acttcgaaat ctagagttga gatgcgtggt cacataggaa 7560 tcactgtgga ttttgtgcag cgcagaattc ctttggatga tgcagaaatt cgggaacttt 7620 taaatttgaa tggacgggaa aatctggatt cacatgcatt agctgcaaag ctgcttgtgc 7680 atgatgatct aaaggtggaa atgtggaggg agtttatcac cagggttttg tttgtgaaga 7740 aagatgatgc tgtagatgct cctaaagatg ggagtgatga agcagcagca gcagtctcta 7800 atggggaaaa ggatgagttt gacaatttgg attcattgtt tgactgactt gagttgcgtt 7860 gaacgttaag tgggacttta acaatgtgca atatttgaaa aagactgaaa aaatgcatcg 7920 taagttttaa gaagtttttc caagaaatac agatgatgac agtctgttct tagttaaatt 7980 tttttagcgt gtattttatt tatttttatt gtaaaacatt gtacaagacg tgttatgtaa 8040 attattagat tggattcatt gtttgactgg agttgtgttg aacgttaagt gggacttttt 8100 aacaatgtgc aatatttgaa aaagactggg ggaaaaaatg tagcttaagt tttaagaagt 8160 ttttccaaga aatacagatg ataacagtct gttcttggta aaaaaaaaaa aaaaaaagtg 8220 tgtattttat ttattttatt tttattgtaa aacattatgt acaagacgtg ttatgtaaat 8280 tattagatgt gcaataaatg tgtgcctgtg tgtgaacagt tgtggtttga ttcatatagt 8340 aatttttatt ttttttgtga catttgagaa atatgaagtg taaaattaaa aaaaaataca 8400 atatgtgaat gtgaaagcag ttagtcaagg aagaaaaaag caaatgtgtt tttcaatatg 8460 cagtatgaaa aatattgaag ctgtttgaaa gtaatataat ctcagtgcga ttaatagaaa 8520 tctaaaacct tttgtgactt aaagtggata tttcttcaca aaatactttt taatgattaa 8580 gtcatacgtc ctcaaaaaaa attgttctgt tgtggttttt tttttttttt ttttttttaa 8640 atgtgcgcaa acatcaaagt tgttgcatat gtaaaaaaaa aaagccatgg gtaaaagaca 8700 gtagtatttt taatgtaaaa gagttgttgt attaaactag aattcatgtg aaagcgtaat 8760 caacaaaata ggtacttttg taaccttgaa gattgtatta tgcctgctgt ttataaaaaa 8820 aaaatcagtt ctagacaatg ggagactgtt tgctgatgta atggatttca attaaaaatg 8880 tgtaacatac aaatgaagtg gtataatatt taatttataa agcacatttt taatcaacct 8940 tacttgccaa aatgttgtac aaactttaga aaaaaaaaga aaagtatatt aaaacaaaat 9000 acataaggtt acacacaatc aaaagcacaa agaaatgcgt ttcaccaaag atttaaattt 9060 tggtagtgaa gaagccgttt taataggtaa taacagattg ttccataatg taatattaat 9120 ttgtacctaa taattttaca tttccgtttt cctgtaaaat cacgtgttaa agaatgaatg 9180 attgaaaaaa tccgaggatt cggtacacaa acgaatcttt tgtagcttaa tgtttatagg 9240 ctactgcgca tgcgggaaag aatcactacc tggcaggaga atcaaattaa agatttgttc 9300 taaataaacg aatccttcag gaacgagtct tcagccattt tgtgttgtga agcggtaaga 9360 caaatttctt gctttcttat taatcagtgt tgtatatctt cgtatcactt atattttctg 9420 gtatttaaac tgtttgtctt ttgactgata cgtttgtaga gttaactggg gtgttatttc 9480 ttcaaatgac gcctatgtta agcgtctttg taaagacggt aacgaggaga tcatttacat 9540 tatttgagag tttttaatga taaaaaaaag tattgcataa gcgtcaagtc tctgtcggtt 9600 gtatttatgg gcttcaaaaa atgttttgcg aacgtaaatt ggccttcatg aacattgaca 9660 tgaaaatacg tgtccaaagt aagcgtaacc gcgccattct gtgcaccgtt aaaatgatca 9720 ctaagcattt aagtgttcgg tgggccttgt tgaacttgca caaatgaatg gtgatgattg 9780 tttgtggaac agttttttac gtttagtgat ttatgcctta tgcaaacgaa tataggtaat 9840 ttgtatgttt atattttcac tatagcatcg cgtgatgatt aattatggga ttagtgtttg 9900 attatatgcg tgttatcgtg ttaatcgcgt taggatattg tttaactgtt actgtgtaat 9960 tgaaataaag tacgtaagcc tatttaacag ttactaagtt acatagttgc attgcattgt 10020 tattttagac tgaggaagta tgtgatgaca aaataatttg taatgatatt ttttattgat 10080 tgtgcccata atatttacta tacacatatc gtattacgta agcattgatt taacaagatg 10140 atgacttgta gtgaaagatg acaatttatt tgattgttag aatagctgct ttacatgttt 10200 gaagttaact aacattgtgt aagattttat caattcaaca aatgttgaaa tgtctctttt 10260 aattagtagt atgcattata gttgactaat ttatgtttct tttgcttttt tggaaatatg 10320 gaatgactgc agcagagatc aggtaagtaa agtttcatgt gatatgtttt agtaaatact 10380 gcatcaaaga aattctgtgg ctgaccatgg caaggcttgt tctgagaaca acctgtcatc 10440 ttaaagcgct ggatgttgtt gatcactgtg ctatagcacg gtttcagagt gttggctatc 10500 tttttatagt ttaggccatt tttatgtaaa gcaacaattt gtttgttttt tttccaggtg 10560 ttgtgagaat ggtttgccat gatgtgccat gtagaacttt cagttattgt caaactcatt 10620 tctgcagtga tgtcctatta aaagatatac ataaatgttt gctgaaatgt gagggtgtac 10680 tcacttttgt gacatactgt atataaaagg aattccatgc aaatgtcaac cttatcctga 10740 aaatgttttt accaaaatag tgaaaaaata tctatgttgt ttccttaggt ttgtttagta 10800 cagtactgta aaaatcttca aacatgtttt tgtcaatgtt ttcaaacaat tacatgtaat 10860 ttatcaaagt ttcataggtg ccagtttcac gtcacatcca taaccaagcg ttttcttcat 10920 aaatccaaaa atgttaaata aaataaaaat cattcatttt gttctatagt aagccaactg 10980 ttatccattt gattagcata atttcttttg gattgtgcag ttaaatttgc agaattttta 11040 caaactatgt tgtatactgt aaactttcat acttttttgg tgacattcat aatgcatgtt 11100 tgtttttttt ctataaatta tgaaaaatgt ttatagattg atatttttgt aattccatta 11160 actttgtggt tagttgcttt ggaaaatata aacagatgtt ttaatatata aaaaaaaaca 11220 tacttaacat atactcaaca tatacttttc ctgtgattgc aaatgtatga gcaattccag 11280 ctttatggat gtgacatttg gagtaaaatc ttaaagcatg cttttacaaa gaaagttaac 11340 attatattga aacatctgtt atcaaaagta taagagtagc atggtgatat tttagcagat 11400 ttactatgct gctatcaaaa acggaacaaa caaattatac aatcttaatg taaaaataac 11460 ataaaaatta agtaaaaaaa acatgacaaa taatgtacat tttatttcga ttttaaaaac 11520 acatttgatt attactagtt ttgacataca gttgtattta tttaagctag taaaataaaa 11580 ctttgttgcc ctgaagttac ttgatatatt attattagca gttcactgca attgtatata 11640 atgtattgaa taggatcttt ttgttctttt cagaagactt tctaaacttt caaaagaaag 11700 actttcagaa gactttctta aggtaagcat gcaaaactta attgttctag ttaaaaatgc 11760 aggccttgta cattttagag gaagcagtga taattgtaga agctttgatt tgacagaatt 11820 atgtaaatgt caattcaata ctaagttcac tgtttgtgtt caggtgtgtg caattttgac 11880 ttttaatgtt cttaattatt cagcagtgta actaatgttt ttaaattgta actaatgttg 11940 ttttttcctg aatagtcaag atgtatagta aattgtgttc tttttagggt gacaaatatt 12000 catgtgatta taaatgtgat caaaatgtac atcaatacga tgagtaaatt tatagtttta 12060 tatgagtaat ttaggagtct gtttaatggc tgcttatttt ttgagtgctt ttaaaaacat 12120 ttttgtaaat gtacaaatgt atatgcatgc ttaaaataga ttaaactgat tataatatat 12180 aacattataa tagaataaac tataatatta attctaatca attcattttt attatgaaaa 12240 aactgtcaaa gtattgagat aaaatatata tatttttcat attctacatc cctaatgtac 12300 atgcaattgt tttatttttt tttattcagg tattaaaatg gttttcattt caggtgttcc 12360 aaaatgtaca tcaatacaat aagatttgtg aatttatagt tttagattag taatttagga 12420 gtttgtttaa tggctgctta ttgtttgagt gctttaaaaa catttttgta atgcaaattt 12480 ttgaatgaat tacacacaat gtgttttatg acatttttgg ttgctgtgaa tttttgctgc 12540 attatatatt aaatatattt gacaggatat ttattaaaat gcaatgtagt aaacactgta 12600 ttaaaatagt gtctctgtgt gtgttttcat gtaatataga tgccacgtgg aaaaggatac 12660 cgccgttccc aggcagcaaa gaggaggaat gctgagcgga ttacactcac ttccctcttg 12720 gacctggcag aggttaagcc tcctctcaaa aaaaaggctg agttcaacga gagctcatca 12780 aacaatccag agatctacca gactgcagat acttctttaa actgctgcag ggatctaaat 12840 acttctccga actgctgcaa ggctccaaat acttctccaa actgccgcaa ggcttcaaat 12900 acttctccaa tctgccgcaa ggctccaaat acttctccaa tctgccgcaa ggctccaaat 12960 acttctccaa tctgctgcaa ggctccaaat acttctccaa actgccgcaa ggcttcaaat 13020 acttctccaa tctgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13080 acttctccaa actgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13140 acttctccaa actgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13200 acttctccaa actgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13260 acttctccaa actgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13320 acttctccaa actgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13380 acttctccaa actgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13440 acttctccaa actgccgcaa ggctccaaat acttctccaa tctgccgcaa ggctccaaat 13500 acttctccaa actgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13560 acttctccaa actgccgcaa ggctccaaat acttctccaa actgccgcaa ggctccaaat 13620 acttctccaa tctgccgcaa ggctccaaat gtttccaatg atgttaatca atgtttcaat 13680 gttaataact ctgattattt tccacagaaa ttcatacagg gttctttcca tcaaggtgat 13740 gagcgatttg gtttcaacag aaacagacaa tgtgcagtga acagcattac agcagttatg 13800 atgagtgtgt tgaagcatgt gcagacatgg acaacagatg acttaaatgc tgttttgttg 13860 catggtgata agttgtacac atctatgtgc cagcagggaa gattaatgac catttacagc 13920 gaggatatgt ttctgttgca gaactaccaa cagtgcacac actatatgat acaaatttgt 13980 caataaagta ttctgaatct ctgagtggtg ttgttggtgt tgatgtgtat gatgactcat 14040 tgcgtgatgt ttgcatgtca attgaagaag cactacaaag agctttggtt cagtctgatg 14100 cctgtttgtt aaacattcga gacaacattt gtgctgtcat aaaagtggag tcagaatttg 14160 ctgttgttga ttcacattct cgtaatggta aaggtatggc agaatatgat aatggaagaa 14220 gcatagtagt ttattgtcct gacatacaca tgttgtttgc tcatgttgtt aaccttgcca 14280 tgtcactgaa tgctcaggga acaccttttg aagtgaccgg tctgaaagcc actataatca 14340 gtaagtctac atgtgctgca agaggttata agcctggtca acagtgtatc atcacacatg 14400 tttcgcctgc agagtccgat gtacaatgtt gtattttgga tagcgtaaaa caaagagatt 14460 taaacaaacc agcatttctt gaaaaagtga ttcaaatgca gtcagattct agtaagaaat 14520 tacagcagat tgagtcaaaa cttacacatg tgagcaagtt tcctggtaat attttgatga 14580 gtgatgtgat cattggcaat acagaaaata ctcagttcaa tttcaaacca ttaacatggg 14640 aagagcaaaa tgctttgtgt caacgtttac accttattaa tgaaagagtt catgttcaag 14700 tagcaaacat gtgctcttta ggtgttccat gcaaaacaaa atctattaaa ggagatggca 14760 attgcttgtt caggtcaatt gcagatgcat tgtgcggtaa tcaggagaag caccttataa 14820 ttaggcgtgc tgttgtgaaa caccttgaag ctaatagttc acaatttttg agatacatag 14880 gagaagagta cagatcagtg aaagattatt tgacacagtc aaagatgtat tattctggat 14940 cgtggggtag tcatgtggaa ttgttttgtg cagccaatct tttaaaaata aatttaatga 15000 cctttaatgt tgacaggtgg aatgcatatg tccccaataa tgagttgttt acagagaatt 15060 ccatctattt aaagcactgc aatggtaatc actatgaagt tgttacatgt gttaaaaatg 15120 gaagtcaaga aaatgtttgt gcagaagcat gcaacattgg tgtttttcat cagagcataa 15180 cctgctcttt gcggaaaagg aaattaaatt acactccagc tcaatgtaaa cgacagaaag 15240 aacacgaccg gtaccacagc gacagctgct tcagacaaaa aaaaatagag aacttaaaat 15300 ttaaatattg tgaagacagt aattacaaac agagaaagat tcaaaacacc aaaacaaaat 15360 attgtgaaga cagtgattac aggcagagac ggattgacaa ttctaaaata agatattgtg 15420 aagacagtga ttacaggcag aaaaagattg acaattctaa aataaaatat tgtgaagaca 15480 gtgattacag gcagagaaag attgacaagt ctaaaattaa atacaatgaa gacagtgatt 15540 ataaagagaa aaaactacac aatctacaat gtaaatatag gaatgatact aaattacaac 15600 agctaaaatg tgaaaattca aagagaaagt attcaagaaa tgttagtttt caaacgtttg 15660 ttcggcagta ttcccagacc aagtataaaa gcaataaagt ctttagagaa agtgtatgtg 15720 aatattcaaa gaataaatat cacagaaatc ccacttttaa agccagtgta tgtgagtatt 15780 caaagaataa atatcacaaa aatccttcat ttaaagccag agtatgtgac tattcgaaga 15840 ataaatatca caaaaatcct tcatttaaag ccagagtatg tgactattcg aagaataaat 15900 atcacaaaaa tccttcattt aaagccagag tatgtgacta ttcgaagaat aaatatcaca 15960 aaaatccttc atttaaagcc agagtatgtg actattcgaa gaataaatat cacaaaaatc 16020 cttcatttaa agccagagta tgtgactatt cgaagaataa atatcacaaa aatcctttat 16080 ttaaagccag agtatgtgaa tattcgaaga ataaatatca caaaaatcct tcatttaaag 16140 ccagagtatg cgaatattcg aagaataaat atcacctaaa tccttcattt aaagccagag 16200 tatgtgagta ctcaaagaat aagtattgca gaaacatgaa tttcaaagca aaggtttgtg 16260 aatactctaa aaagaaatac catgaggatg aaatgtttcg tatgagaaaa ataaagaaaa 16320 gatctgaaga ttatgtaaaa caaagggaaa ggcaaaaaga cattaatttt gccattaata 16380 acttttgcca tgaagttagt tcaggtccag aatttgtttg ctgtgtttgt catcgtttgt 16440 tgttcagaaa acaagttctt gaatgcagaa aagactgcta tgatataaga ggtcaacaga 16500 taagtgattt ggctgaaaca tgcattacca caaagtattt acataaatgc agcagtgaat 16560 gtgaaaatct atgccattta tcaggttatg ctgcttctaa attgtggatt tgctatacat 16620 gtcatcgtaa aatacttggc ggaaaattgc ctgaagagag tattgccaat aacatgcatt 16680 tggtagatat cccaaaagaa ctaaaaggtc tgaactcatt ggaaggacat cttattgcac 16740 tcaacattcc atttatgaaa ctgctttgtc ttccccgagg aaaacaaaaa ggctgtcatg 16800 gtcctgttgt ttgtgttccg gttaacacca cagatgtgtc aaatatactt ccacgtaatg 16860 aatgtgatga ccacatgata agaattaaac tgaaaagaaa attaacatat aaaggccatt 16920 atgaatataa atatgtcaac actgatcatg ttaggcatgc actatcttat ttggtgagac 16980 acaataagtg gtataaggat gtggagttta atgagcagtg ggtaaactct atgaatgaaa 17040 ctgatgattt agacaaggat gctaatgact ttgtagttga tgaacaccat gaaacagaca 17100 aaaatgaaga tgacaaagag gaggatataa catatattaa agaacaaagt ggtcttttgt 17160 cagacacatg cctgcaacct gtagatctgg gttcagaaat aattgatcaa cactttcatg 17220 atgtactcaa catagcacca agtgaaggta acagtccagt tagattgctg tctgataaag 17280 ctaatgaggc aaagtgcttc cctattctct atcctactgg tggtccaaca tttcatgatg 17340 aaagacaaga aaaaatcaca ctgtcacgat acctgaatgc acgaatacta aatgcagatg 17400 gacgttttgc acaaagcaca gattttatat tttatgcaca atatttatca gaagttgatc 17460 aaattgtttc taatgtctca attgcaatga ggaaaggttc tgagcaagat tgcttaaatg 17520 taacatcaaa catgcttaca aattgtgatt ccttacagaa aatattacat tatgatgaag 17580 gatataagtt cttaagacca gtcagaggaa cccctccata ttggatgtct acgcaaaaag 17640 atctgtttgc ccttattcga cagcttggaa tacccacttt ctttgcatct tttagttctg 17700 ctgatctaag atggcctgaa atgattaaca ctattcttaa acaagaaggg aaacaaataa 17760 atgctgatga ccttgattgg tctgagaaat gtggactctt aagacgaaat cctgtcactg 17820 cagcaaggat gtttgatcac agatggcatt gctttcttaa aaatgttatt atgtcaccgg 17880 caaaccctat tggcaagata aaagactact tttatcgggt tgaatttcaa cagcgtggtt 17940 ctcctcatgt tcactgtctg ttttgggtgg aaaatgcacc aaagctcaat gataatgatt 18000 caaataatga ccatgtggtt gctgacttta ttgatactta catcacttgt gagacaccta 18060 cagaaagtga caatgtacta tatgaaacag ttaacagtgt tcagaaacac agtacaagac 18120 attcaaaaac atgtcggaag aaaaatacag tctgcagatt caattttcca agaccaccat 18180 ctagtcgtac ttttattaca agaacctata atgcagataa tctaaaaggg aaggaaggag 18240 aatcattggc aaataaaata ataaagaaag tcaaaggtgg tttgaattct ggtgtaaatt 18300 ttgattcagt tgatgcattt ttttcatcta ttggaattaa tcaaacaatg tttgaacaag 18360 catacaacaa atgttccaaa aagaaaacta tcgttttgaa aagaaatcca aaagatgttt 18420 gggtaaacca gtacaacaga gatttactgc aagcttggca agggaatatg gatattcagt 18480 atatcacaga tgcttattct gtagtggtat atatacttac gtacatcaca aaagctgaac 18540 aggaaatggg actgctcctt cagcgtgcac aaaatgaagc actaaatgga aatctcgaag 18600 ctaaaacgtc attaaaacgg ttgggaacta tctatctaca caatagagaa atttcagctc 18660 aagagtcagt gtacagacta acaggaatgc atttaaagga atgttcccgt aaagtacaat 18720 ttattccagt tgggcaaaat cctgtaaaga tgagtttgcc tttgcatctt cttcgaaatc 18780 aagcagaaaa tgtccagtct gatgataaaa gcagtttctg gatgagaaat gtcattgata 18840 gatataaaaa tagaccacag aaagagccac tgaaagatct atgtcttgcc agtttttgtt 18900 cagagtacag aattctgact aaatcagagg ttatgtcaca aaacaaaaaa gctgaaaata 18960 aaataattaa acttaataac caaaatggtt atgtaaaacg aaggacacgt actgaacctg 19020 cagtagtaag gtatccaaga ttttcaccaa caaaagatgc agaaaaatat tatcattcat 19080 tattacagct tttcttacct cattataatg actctgattt aaagccgatc aagtatgaca 19140 catatgagga gttctacaac actggtatca tacaatatag ttctgaaatg aaagaagtca 19200 agtacattgt ggatgaaaac agagaattat ttgaaaagga aagtgataaa attgacaaag 19260 ctaaacagct tttagaccaa ggtgttgatt tagaagatgc ctgggctgag atatgtcctg 19320 aaactgaaag acagcgtcat gaatgcaatg aactgatgaa ggataaaata ttacctgatg 19380 aagatgatca tcaagaactt tttccagatc tcataggaaa ccctcaaaca gggtgtacca 19440 ttgaatcaaa ccacactaca atgcctagaa atgatgcatt acttttacta aggtccttaa 19500 atgagcaaca gtcggctgtg ttctataaag tcagaaactg gtgtctacag aaaatacttg 19560 gaaaaaaccc agatccattt agaatatttc ttactggtgg tgctggcaca ggaaaaagtc 19620 acttaattaa ggccatacac tatgaaacta ccagactgtt atctcagctg tgtgaaaatc 19680 ctgatgacat tgttgtactt ttgacagcat caacaggggt ggctgcctat aacattggtg 19740 ctgcaactat tcataataca ttttcaattg gttcaaatgt cagactacca tatcagcctc 19800 taagtgatga caaaataaat acattacgca caaaactggg tagtttgcaa attttgatta 19860 ttgatgaagt gtctatggtt gaccatcgtc tcttatcata tattcatgga aggctgcgac 19920 aaataaaaca aactggtgaa tatctttttg gcaaagtaag cctgttcgct gtgggtgatt 19980 tttatcagct tccagcagtt aaaggaacat ctctctatgc agatacaaaa ggagtaaacc 20040 tatgggaaag taactttgaa gttgctgagt taagcaaagt tgttcgacaa caagatccaa 20100 cttttgcaga aatgttgaat cgcctgagag tacgcaaaaa aaatgaaagt attagtccca 20160 ctgatattgc tacattaaga aattgtgaaa caggagacac atgtaataat cttcacattt 20220 atgctacaaa tgctgaagtt gataaataca acattgaaag actacatgaa tgttgccaac 20280 agataattag tataaatgct caagattata taaaaaatgc aaaaactggt cgaatggaac 20340 gaaaagttgg atttcataca aaagtgttta actcatgttt gtctgcatgt gtttctcttg 20400 gtattggagc aagggtgatg ttaaagaaaa atattgatgt tgctgatggt cttgtcaatg 20460 gagcttttgg tactgttgtc cacataacac aaagtcaaaa aaatgatgat gatttcccga 20520 cagctattca tgtagaattt gataatccaa atgtcggaaa aattcaaaga tccaaaacac 20580 gtcagagatt ttctgcaaat tcaacattaa ttaaagtaga ggaagaccaa gtcacaaatg 20640 atggtggtat caggcgtcaa tttcctctta gactggctta tgcatgtact gttcataaag 20700 ttcaaggtct gacagtggat aaagcggttg tttcacttaa gaaaatcttt acagcaggac 20760 aagcatatgt tgctttgagt cgtgtgagat ctctcagtgg gttaataatt gaggactttc 20820 aagaatctgt catattttgc aatgaaaaaa ttgagttgac catgaaatca atgcagaaat 20880 tagttttgga taaatataat tcaactaaat cagttggtac attcagaata gcactgctta 20940 atatacaaag tctgagagct cactttcagg atcttcttgc acatacagtt ctaatgaatg 21000 ctgattgtat ttgtttgaca gaaacatggc tgtatgaaaa tgatgaactg gaactgcaaa 21060 taccagaatt tgtgttcaat cacaacccac gatctaagtg ttatgacaac aaatgtgttg 21120 tttctgctga attaaaacac cagagaggag gtggagttgg aatgtattgt tctgagaaaa 21180 ttgagtgcca tatctttact cctgagctct ggaatttgga atgtttatac tttacagttc 21240 cacatgttaa tttaaaagct gcagttcttt ataggcccag ttcatataaa attgatgtgt 21300 ttcgacagca tatattacat attattgaag aactagaaaa gcattctgga cagaaaataa 21360 taatgggtga cttcaatgat gatatcttta cttcatctat gactatgaaa ttaatggaac 21420 aacatggata tactcaacat gtacaaagac caaccacaga aaaaggtaca ttaatagatc 21480 atgtgtatac aaaacaaatg gaacatataa ttgttgatgt tgtgcaaaca tactatagtt 21540 ttcatcaggc aatacttatt tcatttttgt gataaaaatg ttatttatga catcagatga 21600 gacggctctc tggggatggg aagggaaggg aagggaaggg aaagacagga attggagaaa 21660 aggtagcctg tttgtttgtt ttttttaatg tttttttgtt ctccaaattt tgatatgttt 21720 tggtatgtat ttgtcaccca attcataaac acttcactgt ggcctggtat ccctttgtat 21780 gtggtgcctc gttctttttt gcctctgcca tggactatta tcacccatac tctacaatgg 21840 acttatgggt taatgactgg agtgtctcat ctttttgaac ttttaattaa tgatctacat 21900 gtacataaga tacacctctt tcacaccctt tggatctacc tcggctttca agtcctttgc 21960 aatcaaactg gttcagcagt catcagcatg tccacctgtt ctgcaccaca aaggtctcca 22020 cttagtctcc tcttcagaca atcaatgtgc gtattttcat catttataat ttatttataa 22080 atgttatttt ggttttgttt tcatatgcat atcatgtgta gttgtctttt aaattatata 22140 acaatataaa tattgttgtt atttattttg tttagattat ataacaatta acaatataaa 22200 tattcttgtc tacaatagaa atacctgttg attgttttta taaaattgta ttgtgtttta 22260 ggaagatggg aacatcattt gtcatgcagt taaatgcaag caattatgcc tgaactctcc 22320 aatcattctg ttgtgttgga atcgacattt ctacctctac aagaacatat ataatatata 22380 gttcttgtaa gtatatttta ctttactgat tttctaaatt tttcttatgg atataattgt 22440 atatgttttc taaattgcag aaccatacaa taaacagtac tatatatttt atattctttt 22500 gggaataaaa tgcactacat acaatctgta atattcatgt ttgaaataac tctgccctcc 22560 ttgtcaaatc ttttagcttt actggagact tgcatccttg cactggctag cttcatcttg 22620 gtctaccccg ctgtacatat gcgtctctat ttgttcctgt ttatcaagac tgtttgctga 22680 acacatatta taacagaaag ttgttgtgag tatatttatc tctcgtatac actgttttat 22740 atatttctat aaatacacta tatataatat ataataattg tttttgtagg acacaaggaa 22800 actctcagca acattcagct gtacaagaaa tcttcagtcc tttacctatg atacctttat 22860 tagtttcacc agctccaact tgatccatac aacttttttc accagaatcc cattgcaaat 22920 ttatcacact ttgccacatt taccataagc accctccaca atttctgcat aactgtcttc 22980 ctctattttt gtcttttttt tttttttttt attggtcagt ttgattatgt ttctttcact 23040 aaatgacttt tatgttcatt tgtggttggt ttgaatcttt gttagttcta tatatttcac 23100 actttcttta ttcactatac gtgttttctg tttgtacccc attttgttgt tgttaatact 23160 gttacttcta aaattgttct ttgtattatc ttgttagttc tgaaattgtt ctttgtttat 23220 caagagtaca atgcccttct caccaggatc ttgcaatatt tgaaaaaaat aaaaataaaa 23280 tacatactta ttttgaatac atatacatat tctaaagaaa tgtctgtatg taattgttga 23340 atgaaataac tttaatgtat agtgatattt atgtctttgg actttcagcc atatagtgtt 23400 cttgctgaaa tgcataacaa cacagatgtc tatattgtct tctcatgtaa gtttcaaata 23460 aaatattaca tatgctatgt acatgtttga tattatttat ataaaaattg atttgctgcc 23520 agtaacagaa atgtagtatt ttagtagtaa ttagtattaa ttgggttaca attacaaaag 23580 gtaagttaag gtttttattt aaacttctaa tgaaaaatcc taatttatca gcctgaccat 23640 aatccgactg aaatatagta atatttcaac aaacagtcta tgagaagttt gcataagaat 23700 cttttgaaca aaatcaatat gatgtacaga aagttttgcc aaatatggca tattaaatat 23760 caatgttgat tcaaggaata agttgatata cattttacca caataaacat gcagtcaaaa 23820 cttactagca attctaactt tcttacttcc tttctgattc attattgata gttagtgatt 23880 ggtttataat tcgtgaccac taagttgcac tgtaaccaag tttatgtgtt tgcgtcatag 23940 tgagttacca atgacacata aagttcagtg tcaatatgtt taagcattgc agatacagcc 24000 ttaaatgaac tttgacgttt atcgatatga tttagataag cgtctaggag gagtttgcaa 24060 aataaggttt ttgaacaaaa tcaatatgat tgaaacgaat ttcacaaaac ttgcttgtaa 24120 ttttctgttt attatccgaa tcaaatttgc tgacaattta gctaccagtc taagagaagt 24180 ttaaataaga attctttttt acaaaatcaa tatggcatac aggaatttta gccaaaaatg 24240 gcatatttaa tatcaatgtg cttgggatga tccgaggaat aagttaaaat attttattac 24300 cacaataaac atatgcagtc aaagattatt agtacattta tgattcattg ttgatggtta 24360 tgattagttt ataactcgtc accactaggt tgcactgttt tcaaatttat gtgttgtgaa 24420 ttgtacatta caaatgacac acctacattt cagtgacaat atgtttataa accaaaaaag 24480 cacgtttttt tgtatctccg gaccactaga ccactagagt ttcattttga gacttcatgt 24540 ttggtctgat taattttaag actgtcctct atcattgtgc cagatgcaca ccatgttttg 24600 tagtgatatg tgaatatgat ggtaaaatat agaatttgaa atgggcgaca gccaaaatgg 24660 ctgacttgtg aaaatcagac atcattcgac tcgacatgct ctgccaaatg taatgagacc 24720 agttttatga gtttcagacg aaaagttgag atgttataag ccaaaaagca agtttttttt 24780 atctccggac cactaggggg cagtgcgccg aaactccgca tataacctca aactctagtt 24840 gtcataaaac acaccaagtt tgatgtgaat tggtgaatgc gttacggaga tatcgtctca 24900 agtccatttt tgcaaattct acattaaatt tgattgcaag ttaaacaaaa acggttggtc 24960 aaatcaactt gaattccata acttttggtt gacatggact gtagatcatg tgattcaatt 25020 ttggtgataa tcggagaaac ggcctaggac gagttcaatc aaataggctt tttgaagaat 25080 ttaaaatagc gccaaaaaat taatgacgga caatgatgtc atagggtgcg tttgaatcgg 25140 actgagccaa ggaatcaggg gaaaaaagaa ttttgattgt agcccattcg gttcaaaagt 25200 taatatgata aacgtacatg aaagtttgga caagtggtgg cgctagagag cttgattttg 25260 agacttcata tttggtatga ttaatgttaa tactgtcctc tatcattgtg ccagatgcat 25320 accatgtttt gtaatgatat gtgaaaatgt tggtaaaata tagcattttt ggccaaaaat 25380 cgtaatgggc gacgcccaaa atggctgacc tgtgaaaatc agacatcatt caactcggca 25440 tgctctgccg gatgtaatga gaccagtttt atgagtttcg gacaaaaggt tgagatgttt 25500 taagccaaaa agcaagtttt ttgtatatcc ggaccactag ggggcagtgc gccgaaactc 25560 tgcatataac ctcaaaccct agttgtcata acacccacca agtttggtgt gaatcggtga 25620 atgcgttatg gagacatctt ctcaaatcca ttttcgcaag ttcttcgtta aatttgatcg 25680 caagttaaac aaaaacggtt ggtcaaatca acttgaattc cataactttt ggttggcata 25740 gtctgtagat catgtgtttc aattttggtg aaaatcggag aaacggccta ggacgagttt 25800 gatcaaatag gtttttcgaa aaatttaaaa tagcgcgaaa aaattcatga cggaaaatga 25860 cgtcatatgg tgcgtttgaa tcggactgag ccaaggaatc agagaaaaaa agaattttga 25920 ttgtagccca ttcggttcaa aagttatgat gataaacgta tgtgaaagtt tggacaagtg 25980 gtggcgctag agagtttgat tttgagactt catgtttggt ctgattaatg ttaatactgt 26040 cctctatcat tgtgccaaat tatacaactt tcccgcatac gcctatatgg gctgccattc 26100 aaatccggcg gaagaaacgg aagaagaaga agaagaagaa gaataataat aataataata 26160 ataataataa taataataag aacactaacg aaagcaatag gtgcctacgc acctacggtg 26220 cttggcccct aataataata ataataagaa cactaacgaa agcaataggt gcctacgcac 26280 ctacggtgct tggcccctaa 26300 // ID DNA-2-26_DR repbase; DNA; ZEB; 500 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 08-JAN-2009 (Rel. 13.12, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-2-26_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-500 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2135-2135 (2008). XX DR [1] (Consensus) XX CC 2bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 500 BP; 140 A; 151 C; 100 G; 109 T; 0 other; cccacatagc aaatgttttc tggcccagct ccgggccaca caataacttt tccctcggcc 60 caagtaccgc aaagaatgac ggtcctgaag tggcccataa ctggattaaa gacaagggcc 120 acacatgggc cataaccgtg gcccgaatct cagccaattt atcaccctta actggccctg 180 aactgggcca aaacaggttt attattggta ccgattctca gctataagtt tatactttaa 240 ccacaaccac caggaaaccc caaaaactga gccataaccc agcctaatgt tacctaaaca 300 atgtgaacaa tcgctacact gatctgggcc acacacctcc catccacaac tggcccaaat 360 gtagtttgcc gtcatgcatg cagtgccatc attgccatgc atggcccaca ttcggctgac 420 attatgcatg ccagtgccgg caacacgcca gcagtgccat aatgaggcca catttgggcc 480 aagtccgttt gctatgtggg 500 // ID Gypsy143-LTR_DR repbase; DNA; ZEB; 148 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy143-I_DR; KW Gypsy143-LTR_DR; Gypsy143_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-148 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1548-1548 (2008). XX DR Genome; chr3; Positions 1443400 1443547. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 148 BP; 45 A; 30 C; 29 G; 44 T; 0 other; tgctctgttc ctaaacccgc tactgcgcat gtacaacggt gatatggcag aaagagaaag 60 ccatcatgta aagttatgta aactgttttg ctgtgcataa gtaaaagctt ataaacgtta 120 ttatggtctc ctggctcatc atctaaca 148 // ID Gypsy164-LTR_DR repbase; DNA; ZEB; 381 BP. XX AC chr7; XX DT 13-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy164_DR; KW Gypsy164-I_DR; Gypsy164-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-381 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1607-1607 (2008). XX DR Genome; chr7; Positions 63698614 63698994. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 381 BP; 63 A; 99 C; 68 G; 151 T; 0 other; tgtcaccgta ccttgtgttc tcactctctt cagttctatg ttacagatta ccttcagtca 60 cctgcttttc tagtattcac aggtcatcgg agtgatttgc tccacctgtt tctcatccct 120 ttgttactat ttatagcttg tcatttccct tcatgtttgt cagtctgttg ttgttggtgt 180 gtgtgtataa cggtctgtta ctcaccccat atttgtgtcc acggctgcca gtgaacgagt 240 ctcccgttgt ttctggaatg ttttccacgg cactcccctg gcactcaccc gattgggttt 300 tttgtttatt tgcaaagact ttctccttaa taaacacttt gtgttctgca cttgagttca 360 ctccttcttc tacccgtgac a 381 // ID Gypsy57-LTR_DR repbase; DNA; ZEB; 1225 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy57-I_DR; KW Gypsy57-LTR_DR; Gypsy57_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1225 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1430-1430 (2008). XX DR Genome; chr16; Positions 14952580 14951356. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1225 BP; 374 A; 174 C; 245 G; 432 T; 0 other; tgtaaccagg ttaaaaaaag gtattttgtt aaacagaaaa aggttattga caccctgggg 60 ttgacacaca gcataacatg agcaaaatca cacacttctc ataatgtgag taatatagcg 120 tgctggccct ttaaggccca tatgtgaact gagcgatgtc acgcgagagc tctagatgca 180 gcgcgagttt ttctccatgc agtatgaacg tctgttggga gactgagaaa ggccaataaa 240 gtctgtttac tcatgaaata ttgtgcaata catgcatgca gtgacttaaa aagtcttttt 300 catagtgatt aatattcgga tggctgtgga ttttgttgtt ttgatcatga tttccaacat 360 cgtgaaggaa agctaagcta aagcgtaagt gttgacttat aatctttaat ttggtaatat 420 taagtatgat attggtgttt atccttattt aagccatgta attgaaactg gattattttg 480 tttagagggg agaagagagt tttaagggtc cagagtgtga cggaaagaaa acatagtgag 540 tgttgtacaa ctttattgaa ttcaagtgtt tgagaatgta ttttcatatt tccttgaaga 600 tgttatgttc aaaatgtcat gtgtttacaa agattttatt gtattttgtt cttcatgatt 660 tgcagaggaa aaagtcattt gcagctgtta atgctgtgtg tgtatatatc attgagagct 720 gacgagctgt aagtttgagt catttaattc attatatttg aattctaatg aacacttatt 780 cactgattta aaactgagtt ttaaacgttg tttaattgta ggagaatatt tgtaacatgt 840 tcacatggca acatctcatt caatgtattt tgtaattgta aagtgagaga ttctattcaa 900 tctgactttg atggtgaatg acttacactt ggttgtttgt tttctcagcg tgactgctgc 960 taccgttgtt tatgtactat ttcagttttg aaaagttatt atcagtcatt gtaattacaa 1020 gaaagaaaag agtacaacac atttggatat aaaaagtcaa atttattgtt ttcagtggct 1080 gtacacaatc attaactttg ctattaaaat tgtccaaaat ttgcatactt aacgctgcgc 1140 tgttccaagt acagtctgga ataaaagtcc ccaatatttt caatcctgtg tccggtgtca 1200 ttacacattt aattaccggg ccaca 1225 // ID DIRS1a_DR repbase; DNA; ZEB; 5979 BP. XX AC . XX DT 11-FEB-2002 (Rel. 7.01, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 3) XX DE DIRS1a_DR is a nonautonomous DIRS-like LTR retrotransposon - a DE consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW nonautonomous LTR retrotransposon; Gypsy; DIRSDR1; KW DIRS superfamily; reverse transcriptase RNase H; phage integrase; KW DIRS1_DR; DIRS1a_DR. XX NM DIRS1a_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5979 RA Kapitonov V.V. and Jurka J.; RT "DIRS1_DR, a family of DIRS-like elements in zebrafish."; RL Repbase Reports 3(1), 2-2 (2003). XX DR [1] (Consensus) XX CC DIRS1a_DR is a subfamily of DIRS1_DR retrotransposons. There are CC ~100 copies of DIRS1a_DR in the genome, they are ~0.7% divergent CC from the consensus sequence. Therefore, this family CC retrotransposed in the zebrafish genome very recently. There is a CC 11% divergence between the DIRS1a_DR and DIRS1_DR consensus CC sequences. Presumably, DIRS1a_DR is a nonautonomous family CC because ORF1-ORF3, that are intact in DIRS1_DR, are corrupted by CC stop codons. The DIRS1a_DR was reconstructed based on multiple CC alignment of 7 copies. XX SQ Sequence 5979 BP; 1115 A; 1821 C; 1643 G; 1400 T; 0 other; gttccccttc ggggggaact tcagcactat aagtggattt gattgtaaaa tccacgcatt 60 gggaggttcg gttcagaagc tactcgtctg aaagagtatt gaacgggcca attaagaatg 120 aattggcagc gcaagcctgc gcaggtgagc ggcataagca atcaactgag tatataagct 180 cacctggcgc cagcagacgc tatccttttc gcttcagaga ctttctgatc gagtcgatga 240 gggttcctcc tgctgtgacc agcgattctg agcgaacgag agcattctcc cggtccagag 300 tgtgtacacg cagtggcaga cggtcgagct gggtttctcc cttgcctggc gttctttggg 360 tccggtcctc cagagcggtg cgtataaagt tgcaaacttc acagaaagag caacacagtc 420 gtgcagcacg tccttatcag gacggcgctc cgactgtgcg tttctggatg cggtggtttc 480 ctgtcctcgg atgatgggcg cgggcactgc gttacatgtc tgggggtcca gcatgttaat 540 gcgctgctcg cgggcagttc atgtcgtcat tgcgatgcca tgactgttgc gcaagatcgc 600 ggttagcctt tgcaaaaggg cgaaccaccc cagttgtccc ctgctctgca gcgggcactc 660 gggcagatct gagggtttca gcgagagata atccgtcgcc cacgggcccg cggacctccc 720 gctcctctaa gcgctccatc caagcttcgg gcgggggaag cgatccgtct aacagatggt 780 agcccttacg cccgatgaca ccggagacca gatgtccacc gcggcatcgg agggtgggct 840 ttcattgtcc gatgatgatc cagacccgct cgcccccttc gggctggtga gcgctgtcac 900 tacggatcct gaaacggaca tgttagccgt gctttcccgg gctgcttcgg ccgtgggttg 960 gagatggttt atcccccagc tccgcggccg gaccgactag aggggcgttc ccttcttccc 1020 ggaggtgcac agtaggctca cgcggtcttg taaagcactt ttttctgctc gtgctgcgtg 1080 ttcctccacc ctaaccactc ttgacggtga aacagccagg gggtatgtgg cgattcctca 1140 ggttgagtgc gcgatggcgg taaatccgcg cggcgcctct tcttggcggg gtccgcctcg 1200 tctcccatcc aaagcctgta agttatctac ctccctcgga gctagagctt acatagctgc 1260 gggccaggct gcttccgcct tgcatgcgat agccacctac cagcgctacc aagcgcaggc 1320 gctggccgag ctgcacgagg gtgggtccaa cccaggctta cgagcttcgc accgccgccg 1380 acttagctct tcggactact gagtccgctg cgtgtgcgtt ggggaggacg atgtccacat 1440 tagtggtcca ggagcgccac ccctggctga tatgcgcaaa gtcgacaaag tccgctttct 1500 tgacttcccc atatccccag ccaggttcgg cgacactgtc tgtgaattca cccaggaatt 1560 caaggcggtg agagagcagt tggtgggtga tgtcttatcg gcggtccgta gcccgctccg 1620 cccgccgtgc cattcatacc tgctcctcgc cgaaggcgcc cgcctacgag agctgctccg 1680 cccccgcaca cgcctccggc gaagcgagcg cgtcgggcac ctcggaagca ggcagccccc 1740 ctgcccagaa cgccgctaag tccggcaacg gaccgcgaag cgcccctggg acaggccatc 1800 tggagaagag ggaacttgct ctttccccgc tggagggcgg ggccccattt ccaacggcac 1860 tttttactgc catgaaaaca ttatgaaagg gcacttttca cttccccaga tgtgacagcc 1920 cgaaatctgc cagtctggga cgctatgctt tctagcttgc agattcggtg cgtttcgcca 1980 gtggctcacg agcgctggga ggacggtctc ctttctccca cccctcgagc ctcccctccg 2040 gagctcgggt ttggagtgag agcaaatatc tcacctccag cttttccgtg ggacccgcga 2100 gcttcccgga tcagcacacc cactccgcgc tgccccactg ctggtacgtc agcgattgta 2160 gcgatgagtc cattagcgag agctctgcct gcctggttag cgcgggccag ctcttcgcgg 2220 tggctcatac gcacaatcag actcggctat gcgattcagt tcgcgaaacg gccccccaag 2280 tcacgggtgt gtattcacca gggtcagccc cctgtccgcc cctgtcttgc gagaggagat 2340 tgctgtcctc ctggcgaagg atgcaatcga gccgctccct ccagccgaga tggagagcgg 2400 gttttacagc ccacgcttca tcgtgcccaa aaagagcggt gggtcacggc caatcctaga 2460 tctgcgcgtt ttgaaccgct gcctgcacaa gctgccgttc agaatgctca cgcagaagcg 2520 catcctccgg tgcgttcgtc ctctgggttg gtctgcagca ttagacctga tggacgcgta 2580 tttccatgtc tccactcttc ctcgccaccg acagtttctg cggtttgcgt ttgaaggtcg 2640 agcttggcaa tacaaagccc tccccttcgg gctctctctg tctccgcggg tcctcaccaa 2700 gcccgcggag ggtgcctcag cgccccttcg gctcgcgggc atctgcatac tcaatttctt 2760 tacgactggc tgatttttgc cctctctcgg agcagttgat tatgcacaga gacaaagtgc 2820 tctggcactt ccacctgtgg gggtttcagg tcaaccgaga aaagagcaaa ctcgcccccg 2880 tgcagaggat ctcttctctc gggctggagc tggactcggt caccatggca gcgcgcctct 2940 ccggagagcg cgctcagctg atgctgtact gtctgagaga gctcgacagt aaaatagtgg 3000 tcccactgaa actatttcag aggctcctgg ggcatatggc atccgcagcc gcttcatgcc 3060 gctcggatta ttctatatga gaccacttca gcactggctt cacgatcgag tccccagacg 3120 cgcatggcac gcgcgcgcac accgagtctc tgttactgcg ctgtgtcgcc gcgccctcag 3180 cccttggagc gacccctcgt tcctacaggc ctgggtgtct ctagaacagg cgtccagtct 3240 tgttgtcgtt tcagcagaca cttccaacac gggctggggg gctgtgcgtt gcgggcatgc 3300 ggctgcggac ctgtggaaag gtacccagtt gcattggcat atcgcctgga gctgttggca 3360 gtgttcctcg ctctccaccg tttttttccg gtgctggagc ggcaacacgt gctggtcagg 3420 acggacagta cggcggcggt ggcgtatatc agccgtatag ggggtatgcg ctctcgccgc 3480 atgtctcagc tcgcccgccg tctgctcctc tggagtcatc cgcggctgaa atcgctgcac 3540 gccattcata ttcaggcaag ctcaaccgtg cagccgatgc gctctcacgg cagccttgcg 3600 tcctggagaa tggagactcc accccgagtc tgttcagctg atatgggcgc gattcgggga 3660 agcccagatc gatctgttgc ttcccccgag atcgctcatt gccagttgtt ctttccctga 3720 ccgagtgctc tcggcacgga tgcactggct tacagctggc ctcggggcac gcgcaaatac 3780 gcgtttcccc cagtgagcct gctcgcgcag ttactgtgca aggtcaggga ggacgaggaa 3840 caggttgctg gttgcgcccc tctggctcaa ccggacctgg atgtcagagc tctccctcgc 3900 gatagccctc ccctggcagt cccttcgaga gagcacctac tctctcaggg acagggcacc 3960 acctggcacc ctcgccgatc tttgaagaga tttttagacg cgaggaagac ttaggtaacc 4020 tccgattgcg gtggctaata ccgtcactcg ggctagagcc ccctccccga gcgcgcctat 4080 gccctgacgt ggagtctatt cactgaatgg tgtgtctctc gctgagaaga cccccgtaat 4140 ttgccagatc agcgttgtgc tttctttacg ccgagagaag ttggagagca ggctgtcgcc 4200 ctccacactc aaggttacgt ggctgccatc tccgctctca taacgcggtg gctggcagca 4260 ccgtgggaac gcataacctc atcatccggt tcctcagggg cgttaagcga attaatccac 4320 cccgccccct ctcatgccct cttaggatct cgccctcgct acacaagccg cgtcagatcc 4380 cttcgatcct cgactcagta tctttctgtc cctgaagaca gctctgctgg tcgcgttgat 4440 atcgattaga gggtcgggga cccggaggca tttttcggtc agtgactcgt gcctgtaatt 4500 gggctggctt ctctcacgtc ctgagacccc gcccgcgata tgtgcccaag gttcctacca 4560 ctccgtttta atacgaggta gtgagcctgc aagcgctgcc ctcggaggag gcagacccag 4620 cccttcttta ttgtccagtt cgcgttttgc gtattatccg gaccgcactc agagtttaga 4680 tcatctgagc agctcttcgt ctgttatagc ggtcggcagc agggaagtgc cgtaccgaaa 4740 taagttccca ctagattgtg gatgcctttc tttcactatc agagccgaga tgagccgcgt 4800 cccccgagag cgcgtgcgca ctccactcgg agcttcgcat cctctcgagc gcgcgcacgc 4860 ggcgcccctc taacagacat ctgtagagct gcgggctggg tgacacccaa cacatttgca 4920 aggttttaca atctgcgagt ggagccggtt tcctcaaggg tattaggtaa cccttggtga 4980 ttgaggaaac aattcggtag ggtgttgaaa cacgcttgct gcgccatttt ccctaacacg 5040 gagatacgtg cgccttttta tctgtcagta aagttccccg tcaggtgagc cctgcagatt 5100 cctccgtggc ccccagcact gactcagcgg aggagtcact tgctggccca ctacgttgta 5160 ggtctgcccg ctggtcagcc cgcgttttgg gtaaaggtgc ctgctatgcg tggtccccac 5220 taggcgatcc catatgctta ttccgccacg gttaagtccc ccccctgggc ggacccgtgt 5280 cttccctccc cgctaaccac tcttttgcta tgcgtactcc ccctttttag ggctagtcca 5340 tatgtaaatt ctgccatcta tccccccttg ggtaacggat ggcctccgca gcgtcctccc 5400 tatcgggatt gcacgcttcc caacgtactg tcgtattttc ctagaattat ctagatgctc 5460 acgacttccc aaaaaatata tataaatccg taaaacttct gttgaagtag gataaattag 5520 ggccagggac acgttggagg accgcgcccc ccatgatgtg ggtgcgtcac gcttgcttga 5580 ctatctcctc atcgggggtg ttggtaaggt gcagtcatta tggcgctttc aatgggctcc 5640 caatgcgtgg attttacaat caaatccact tatagtgctg aagttccccc cgaaggggaa 5700 cgttcgaggt tactaaagta acccttcgtt ccccgaggag gacggaagca ctatactccg 5760 tcgccataat gactgtccct tagctgttga aagtctcttc agcttaaaaa ggatagcgtc 5820 tgctgcgcca ggtgagctta tatactcagt tgattgctta tgccgctcac ctgcgcaggc 5880 ttgcgctgcc aattcattct taattggccc gttcaatact ctttcagacg agtagccctc 5940 ctcggggaac gaagggttac tttagtaacc tcgaacgtt 5979 // ID Gypsy68-I_DR repbase; DNA; ZEB; 4152 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy68-I_DR; KW Gypsy68-LTR_DR; Gypsy68_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4152 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1451-1451 (2008). XX DR Genome; chr14; Positions 16063276 16059125. XX CC Positions [3061-3546] - Integrase core CC 'AACCC' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 613..3987 FT /product="Gypsy68-I_DR_1p" FT /translation="MTIEGQKAEEVHTVQRIKQNQGAKQTHTKCKFCGKQH FT EWKKLSCPAYGKQCKKCGKLNHFAATCKTKETKRKGIHSVTEIEEEPFQEI FT LSLGSQMREDTQEKQLFATMLIGERPVKFQLDCGASCNVIPIQQLNPDTVM FT EKTEQILVMYNKSTLKPVGKCKIKIRNPRNRKLYRMEFIVVEDTSSVPLLG FT NKAVQAMDLVRIQHENIMAIDDIVTTERCRGKEPWDKSDIHREYADVFEGD FT GCLEGEYNLEVDPAVKSVRLPKRRVPVALLKHLKDELGSLVERGIIAQVEK FT STEWISSMVVVKKPSGKLRICIDPRPLNKALRRQHFPLPTIDDVLPDLTKA FT RVFTVCDVKDGFWHIRLSEESSYLTTFATPCGRYRWIRMPMGISPAPEVFQ FT HRLTQALEGLPGIRIIADDILICGEGDNDEAAEKDHDTKLRQLLDRCRDRN FT IKLNFNKLKLRQKEVPYIGHRLTSEGLKIDPEKVRAILEMPRPSDVKGLQR FT LLGMVNYLTHKDVIWEWTDVQEDAFNKLKQMIASAPVLKYYNPEDDLVLQC FT DSSETGLGAALLQAGQPVPYSSRALTATEKGYAQIEKECLAILFGMEKFHQ FT YTYGRKVEVHSDHKPLETIVKKPLLNAPKRLQRMLLRLQRYDINVVYVPGR FT LMHLADTLSRAYLPECAAEGSVETEIETINMLQYLPISEGILQKIQRETAK FT DEKFQMLQQVIAQGWPEQKTQLKEEVRQFFSIREELSEQGGVIFRGEKALI FT PAKLRTEIMERIHASHLGIESCLRRARDCVYWPGMSAAMREYIGKCAVCRT FT VDARQQKETLCAHDILIRPWAKVGTDLFSFNNRAYLITVDYFSNFWEIDYL FT TDTSSSTVIHKLKAHFARHGIPDTAISDNGPQYSSQEFKKFSIAWEFRHVT FT SSPAYPQSNGKAESAVKTAKHLMEKAKRAKADPYLAILEHRNTPSQGFNAS FT PAQRLMSRRTKTLLPTRDSLLQPGVVNTEHAVEINQRKQAYYYNRGAKDLR FT PLQEGEQVRVQLDTKRKAHWTPAKVERMTNNRSYTVKLENGTVVRRNRKHL FT RTVPGRPQRADTGNTVKLAQTNHPDEGHTKAVPETCITRTHHTDTSNSVVI FT TRSGRIVKRPTHLRDFV" XX SQ Sequence 4152 BP; 1490 A; 794 C; 982 G; 886 T; 0 other; tggtggcagc ggtaaattac caagaagaaa agtgaaagac cgaggtaaat cactacggag 60 aaagtaaata agagagacgt gcgccgatca tggagggtgc tttcggattg aagccgccag 120 cgaattttga ctggaatgcg acgaatatac caaaagcttg gaaagcatgg gaagaagagt 180 tttccctgta cattgattta tcgcttacag atgcggatga caagacgaag gtaaaagttt 240 tccagtattt aatcggagaa accgggagag aactgagtaa gacgctaggc gccgccctcc 300 cggctgacgg tgaactgaca cttgaatggc tgataaccgc gttcagaaac cactgcaatc 360 cagcaccaaa cgaaacagta gaaagataca aatttttctc cagaaatcaa aactcaggcg 420 aaatgattga taagtatatt actgatttga aagtggaact ttggatcagc taaagattct 480 ctgattagag accaaattgt ctgcggcata ctgaactcac atgtgagaga acgtctatta 540 agagagcctg aactgacgct tgagaaatgc gtaaggattt gtagagcatc agaactgtca 600 aaagagaatg taatgaccat agaaggacag aaagctgaag aagtgcacac agtacagaga 660 attaaacaaa atcagggtgc aaaacaaact catacaaagt gtaaattttg tggaaaacag 720 catgagtgga aaaaactgag ttgcccagct tatggaaagc aatgcaaaaa atgtggcaaa 780 ttaaaccatt ttgctgccac gtgtaaaaca aaagaaacta agagaaaagg aatacacagt 840 gtgacagaaa ttgaagagga accatttcag gaaatcctaa gccttggttc gcaaatgaga 900 gaggacacac aggaaaaaca actgtttgca accatgttga taggagaacg accagtaaaa 960 tttcaattag actgtggagc cagttgtaac gtaatcccca ttcagcagtt aaaccctgac 1020 accgtgatgg aaaagactga acaaatcctg gtcatgtata ataagagcac cctgaaacca 1080 gtaggaaaat gcaaaataaa aataagaaac ccgagaaata gaaaactgta ccggatggag 1140 tttattgttg tggaggacac ctcatcagtg cccctactgg gaaacaaggc agtacaggca 1200 atggatctgg taagaataca gcatgaaaat attatggcaa ttgatgatat tgttaccaca 1260 gaaagatgca gaggaaaaga accatgggac aaaagtgaca tacataggga gtatgctgac 1320 gtatttgaag gagatggatg cctggaagga gaatataacc tagaagtgga tcctgcagtg 1380 aaatcagtaa ggcttccaaa gcgaagagtt cctgtggcac tattgaaaca cctaaaagat 1440 gagctgggaa gcctggtaga gagaggcatt atagctcagg tggaaaagag cacagagtgg 1500 attagcagca tggtggtggt gaaaaaaccg tctggaaaac tcagaatttg tattgaccca 1560 agaccactaa acaaagcatt gagacgtcaa cattttccct tacccaccat agatgatgtt 1620 ctccccgatc tgaccaaagc gagagtcttt acagtgtgtg acgtgaaaga tgggttctgg 1680 cacataaggc tttcagaaga gtcaagctac ctcactacct ttgcaacgcc atgtggcagg 1740 tacagatgga ttagaatgcc tatgggaatc agcccggctc cagaagtgtt ccagcacagg 1800 ttgactcagg cattagaggg gcttccagga atacggataa tagcggatga catccttatc 1860 tgtggtgaag gtgacaatga tgaggcagct gagaaagatc atgacacaaa gctaagacaa 1920 ctcctggatc gttgtagaga cagaaacatt aaactaaatt tcaacaaact caagctaaga 1980 cagaaagagg tgccatatat tggacacaga ctgacatctg agggactaaa gattgaccca 2040 gagaaggtga gagctatatt ggagatgccc aggccatctg atgtgaaggg actccagaga 2100 ttattaggta tggtgaacta tttgacacac aaagatgtga tttgggagtg gactgatgta 2160 caggaagatg cattcaacaa actgaaacaa atgattgcct cagcacctgt tttgaagtac 2220 tacaatccag aggacgacct agtgttgcaa tgtgactcct ctgagacagg actgggcgca 2280 gcacttttgc aggcaggaca accagtccct tatagcagca gagccctcac agccacagaa 2340 aaagggtatg cacaaataga aaaagaatgc ctagcaattc tttttggaat ggagaaattt 2400 caccagtaca catatggcag gaaagtagaa gtacactcag accacaaacc cctagaaacc 2460 atagtaaaaa agccattgct taatgcgccc aaaagattac agagaatgct tttgagactg 2520 caacgatatg acattaacgt tgtctatgtg ccaggtcgac tgatgcactt agcagatact 2580 ctcagcagag catacctacc agaatgtgct gcagaaggat cagtagagac agaaattgag 2640 acaatcaaca tgttgcagta cctgcccata tcagaaggga ttctacaaaa gatacagaga 2700 gaaacagcta aggatgagaa atttcagatg ttgcagcaag tgatagctca gggatggcca 2760 gagcagaaaa ctcagctgaa agaagaggtg agacaattct tttctataag agaagagctt 2820 agtgagcaag gtggagtgat ttttagaggt gagaaagctc taataccagc aaagctgaga 2880 acagagatca tggagagaat acatgcatca catctaggca tagaaagctg tttacgaaga 2940 gcaagagact gtgtatactg gcctggaatg agtgccgcta tgagagaata catagggaaa 3000 tgtgctgttt gcagaactgt agatgcgcga cagcagaaag agacattgtg cgcacatgac 3060 attctgataa ggccttgggc aaaagtaggc acagatttat tttcattcaa caacagagcg 3120 tatctcatca cagttgacta tttttctaac ttctgggaaa ttgattatct tactgatacc 3180 agctcaagca cagtgataca caaactaaag gcccattttg cacgacacgg gatcccagac 3240 acagctatct ctgataacgg gccacaatac agctctcagg aatttaaaaa attcagcata 3300 gcttgggaat ttaggcatgt gacgtcatct ccagcatatc cgcaaagcaa cggcaaagcc 3360 gagtcagcag ttaaaaccgc aaaacatctc atggaaaaag ctaagagagc aaaagctgat 3420 ccatacctgg ctatactaga gcacaggaac actccatctc aaggatttaa tgccagtcct 3480 gcccaaagac tcatgagcag aagaactaaa acattactgc caacacggga cagcctcctg 3540 cagccaggtg ttgtaaacac agagcacgct gtagaaatca accaacgcaa acaagcctac 3600 tactacaaca gaggggcgaa agatcttaga ccactgcagg agggagaaca agtaagagtc 3660 cagttggaca caaaacgtaa agcacactgg acaccggcaa aggtggaaag aatgaccaat 3720 aaccgatcat acactgttaa gctggaaaat ggaactgttg tacggagaaa tcgaaaacat 3780 ctaaggactg tgcctggaag accacagaga gcagacacag ggaatactgt caagcttgct 3840 cagactaatc atccagatga gggacataca aaggcagttc cagagacatg cattacacgc 3900 acacaccata cagacaccag caacagtgtt gtgataacta gaagtggcag aatagttaaa 3960 agaccaacac acttgagaga ttttgtttaa acatctgtca ttgttttatg acactgaagc 4020 gaaattaaga gcatagtgtt atgctgttgt gcacatgttg taatgttatg ttgtaatgtt 4080 atgttgttcc ttgcatacag tgcacataat actgtatata aattgttaaa aaaaaaaaaa 4140 aaaaaggaaa ga 4152 // ID BEL20-I_DR repbase; DNA; ZEB; 5825 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL20-I_DR; KW BEL20-LTR_DR; BEL20_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5825 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1387-1387 (2008). XX DR Genome; chr12; Positions 35884125 35878301. XX CC Positions [4851-5411] - Integrase core CC 'GATAC' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 42..5765 FT /product="BEL20-I_DR_1p" FT /translation="MISNSEPYPPDVRPRRQIRQPAYLADYEVHGSGFVKR FT APVSNRLSEYANEAVAPMSANVSRSSSPTSQPSYTDDLILQDVWPEPSGYE FT SPGYDKQHFQVQQTSDIKSMCMQMKRDNDELRSSILPEILSALQGLKAENA FT TLRQEIQQRSPSIQTPIPAPRTHLPPSVISHRSSRAHSGEVSTPQPARYEQ FT LTEQMGDMTLNSRMSHVVDGHGVHSQRAHTYSAQPSGTGHYENVPLGPRYH FT YLQTEREFSPASRPELVPRMEKLHTQEQIYRGPKPTIPSLTAADPRQFSRL FT RMALENILPADASERFKYQILTDHLELEEALLVADSYCNSVRPYTDTMHAL FT VKMYGQPHKLVLQSIAEVLESPNLKSGDVKSFKLFALRVRSLVSMLEQLGP FT EGVAELDCGSHVSRLQSKLPHELQISFKRYIHPLRITVPTLLHFADWLEYE FT LQVQDDTAKAVMYAPPLPFKRREGRHEHKPSGRSTNILLGAGKPTAPTECR FT IPTLDSKPSSTKYREKSHAYCPYCDNSSHFLNGCLNFKELTKEQKEAWIRK FT NNRCWRCGRNHHASRCTLKALCKTCNRKHLLVLHDLNERVVSTSEETEPKE FT NSCLVNTTKGTLYADRSVGSRRVLLKVSKVIIANGDASMETFAVLDDGSER FT TILLHAAAQQLKLQGQPEDLILRTVRQDQQVLHGTTVSFTVSPVSHPHKKY FT HIQHAFTAERLSLAEHTYPVAFLQKKYRHLAGLPLQQMDKVQPVLLIGSDY FT PHLLTPVEPVRLGPPGGPAAVKTRLGWTLQGPTQEVRRELSTHQCFFTSVL FT PNTDLFAHVERLWQMDVIPYRNNKIVTRSKLDKEAISLLQENTVRVEVQGT FT MRYATPLLRMQSMPCLNMPKEAVLPQLRSVERRLLKNPEQASAYQAEITRL FT KEAGYVAKLEPREADCSKESWYIPHHMVQHNNKNRVVYNCSFQFEGHSLND FT FLLPGPTLSPSLLAVLLRFREHPVAISSDIRGMFHQVRLLPGDMPLLRFLW FT RDLKPDLPPDVYQWQVLPFGTTCSPCCASYALQTHVINHSQPGDRFREVIE FT KSFYVDNCLHSLHSPKEAKDLVDGLCALLASGGFELRQWASNCPSVITHLP FT PESRSSTCELWLSQDQQDVHESTLGLQWHCQTDTLRYKHRARSSAPVTMRH FT IYRVLASQYDPLGYIIPYTTRAKILVQRLWDKKRDWDDPHLPEDLLALWHE FT WEKELSGLEEISLARCYSLSEVDHASCKYEIHIFCDASEQAYGSVAYLRIE FT HKEGEVEVAFVAARSRVAPRKQQSIPRLELCAALSGAQLSQLLNTELTVPI FT HSTTLWSDSTTVLMWLTSSSCRYKVFVGTRVAEIQEMTASAVWRYVRSSDN FT PADDITRGKSLQNLSRDSRWSQGPSFLKLPSESWPEQPHLAAEGPNSELRK FT SSFSVLVTTTAPSAKLQHFNTLTECLNAYGQELHGAAYTSSAELQKDVQRA FT VLRQAQAESFPEELRLLMSGKSVSSKSRLLTLSPELDAATGLIRVGGRLRH FT CEVLEADTIHPVVLDPRHPVTLLIIRDYDERLHHPGTERLFAEIRRTYWIL FT RGREAIRRYQHQCTECKKWRGRPEVPLMADLPLTRQRYFRPVFYSTGMDCF FT GPFVIKIGRRNEKRWGIVFKCMTTRAVHLDLLSSIDSDSFLMALRRFIARR FT GNPHELLCDQGTNFKGGERELNEAFAALQSELQNHLAVQQIKFTYNPPSAP FT HFGGCWEREIRSIKAALKVTIGAQTVTEEVLRTVLIEVEGILNSKPLGYTS FT SDVADLDPITPFCFLIGRRDVSLPQVVYEDSEILSRRRWRHSQLLAEHFWR FT HFLKYYLPDLQARQKWKTEKKTLEIGDVVMIVDPQLPRALWPVGRITQVFP FT GADGRVRTANVEVKGKTYTRIVARLIQLPALPEDE" XX SQ Sequence 5825 BP; 1607 A; 1444 C; 1370 G; 1404 T; 0 other; agtggtcctt cgagccggat tgagtgaatc acagtagaga gatgatatct aattctgagc 60 catatcctcc tgatgtcaga cctagacgac aaataagaca acctgcatat ctagcagatt 120 atgaagttca tggatcaggc tttgtgaaga gagctccagt atccaatcgc ctgagtgaat 180 acgccaatga agcagtcgca cccatgtcag cgaatgtatc acgatcatct tctccgacga 240 gtcagccctc atacacagat gatctaatac tccaagatgt atggccagag ccatctggat 300 atgaatcgcc agggtatgat aagcaacact ttcaagtcca gcagacctct gatataaagt 360 ctatgtgcat gcagatgaag agggacaatg acgagctccg tagctccatt ctaccagaaa 420 tcctttcagc gcttcaaggg ctcaaggcgg agaatgctac attaagacag gagatccagc 480 agcgtagccc cagcattcag acaccaatac cagctccccg cactcacttg ccaccctcag 540 tcattagtca tagaagctcc agagcgcata gtggtgaagt ttctactcct caaccagccc 600 gttatgagca gctgacagag caaatgggag atatgacttt aaattctagg atgtcacatg 660 tagttgatgg tcatggtgta cattctcaga gggcacacac atattcagcc cagccttcag 720 gaacaggaca ttatgaaaat gtgccccttg gtccaagata ccattatctt cagacagaaa 780 gagaattcag tccagcctca cgcccagagt tagtgcctag gatggagaag ctgcataccc 840 aagagcagat ttatcgaggc cctaagccta caattcccag cttgacagca gctgacccaa 900 gacagttttc cagactacgt atggcactgg agaatattct gccagcagat gcgagtgaac 960 gttttaagta tcagattctg acagatcatt tggagcttga agaggctctc ctggtggcag 1020 actcctactg taattctgtg agaccctaca cagacacgat gcacgctctg gttaagatgt 1080 acggtcagcc ccacaagctc gttctgcaga gtatcgcaga agtgctggaa agtcccaacc 1140 ttaagtcagg agatgttaaa tcttttaagc tctttgctct tcgagtgcga tcattagtga 1200 gtatgttgga acaattgggg ccagagggag tcgccgaatt ggactgtggc tcacatgtgt 1260 ctcgtctcca gagcaagctt ccccatgagc tgcaaatcag ttttaagaga tatatccatc 1320 ccttaaggat cacagtccct acacttctac actttgctga ctggttagaa tacgagctcc 1380 aagtacaaga tgacacagcc aaagcagtca tgtatgcccc acccttacct ttcaagcgga 1440 gagaaggtcg tcacgaacac aagccttcag ggagatcaac aaatattcta ctaggtgctg 1500 gaaagcccac tgcccccact gaatgccgaa tacctacgtt agattccaag cccagttcta 1560 ctaagtatag agagaagtca cacgcctact gtccttactg tgacaactcc agccacttcc 1620 tcaatgggtg tcttaacttc aaagagctaa caaaggaaca gaaagaggca tggattcgta 1680 agaataatag gtgctggcgg tgtggacgta atcatcatgc atccagatgc acattaaagg 1740 ctctgtgtaa gacgtgcaac agaaaacacc tgcttgtatt gcatgatcta aatgagagag 1800 ttgtcagtac tagtgaagag actgaaccta aggagaactc gtgtcttgtg aataccacaa 1860 agggcaccct atatgcagat cgttcagttg gcagccgcag agtgctcttg aaagtgagta 1920 aagtaatcat tgcaaatgga gatgcatcaa tggagacttt tgctgtgtta gatgatgggt 1980 cagaaaggac cattctactt catgctgcag ctcagcagct gaaactccag ggccagccag 2040 aagacctcat actgcgcact gtaagacagg atcagcaggt gcttcatggg acaacagttt 2100 ccttcacagt ctcaccggtg tctcaccccc acaagaagta ccacattcag catgcattta 2160 cagccgaaag actgagcctt gctgaacaca cttatccagt ggctttctta cagaagaaat 2220 atagacatct agctggtctc ccattgcaac agatggataa agtgcagcct gtattgttga 2280 ttgggtcaga ttacccacat ctcctcacgc ctgttgaacc agtacgtcta ggcccaccag 2340 ggggcccagc agcagttaaa acccgcttgg gatggactct tcaaggtcct acacaagagg 2400 tccgtcgtga gcttagcaca caccagtgct tctttacctc tgtactacct aacacagacc 2460 tgtttgctca tgttgagcga ttgtggcaga tggatgtgat accctatcgc aacaacaaga 2520 tagtgacacg atccaagcta gacaaggaag caatcagtct gttgcaagag aacactgtga 2580 gagtagaagt acaggggaca atgcggtatg ccacaccgct cctccgaatg cagagtatgc 2640 cttgtttgaa catgcccaag gaagccgttc tccctcagct taggagtgtt gagaggaggc 2700 tgctaaaaaa cccagagcaa gcatctgcct accaggctga gattactaga ttgaaggaag 2760 caggctatgt ggctaagcta gaaccaagag aagcggattg ctctaaagag tcttggtaca 2820 tcccccacca tatggtgcag cacaataata agaatagagt cgtctacaac tgctcctttc 2880 aatttgaagg ccacagcctt aatgactttc tcctgccagg ccccacatta agcccatcac 2940 tgttggcagt gctgcttcga ttccgagagc accctgttgc tataagtagc gacattcgcg 3000 gtatgtttca ccaagtcagg ctgctccctg gtgacatgcc cttgcttcga tttttgtggc 3060 gagacttaaa gcctgatcta ccccctgatg tgtaccagtg gcaggtactt ccctttggca 3120 ccacttgcag tccctgctgt gcctcatatg ccctgcagac tcacgttata aatcacagtc 3180 agccgggaga tcgatttaga gaggtaattg agaaatcatt ctatgtcgac aattgcctcc 3240 acagtcttca ttctcctaaa gaggctaaag accttgttga cggactttgt gctctcctag 3300 ccagtggagg atttgagctt cgacaatggg ctagcaattg tccatctgtg attactcacc 3360 ttcctccaga atccagatct agtacctgcg aactgtggct cagccaagac caacaagatg 3420 tccatgagtc cacgcttgga ctgcagtggc attgccaaac tgatacactg cgctacaaac 3480 acagagctag gagctctgcg ccagtgacta tgcgccacat atacagagta ctggcaagtc 3540 aatacgatcc tctcggatat atcatcccat acactacccg ggctaaaata ttagtacagc 3600 gtttatggga caagaaacgg gactgggacg acccacacct gccagaggat ctactcgccc 3660 tatggcatga atgggagaaa gaactgagtg gactagagga gatatcctta gccaggtgct 3720 actctctgtc agaagtagat cacgcatctt gtaaatatga gatccacata ttctgtgatg 3780 cttcagagca ggcgtatggg tcagtggcct atctgcgaat agagcacaag gagggtgaag 3840 tagaagttgc ctttgttgct gccaggtccc gtgttgcccc ccgaaagcaa caatctatac 3900 caagactaga gttgtgtgct gctttaagtg gagcacagct atcccaactc ctgaacactg 3960 agcttacagt tcctatccac tccactacgc tctggtctga ctctaccact gtgcttatgt 4020 ggttgacatc tagctcctgc cgttacaagg tatttgtggg gacaagagta gctgagatcc 4080 aagagatgac agcatccgct gtctggcgct atgttcgatc atcagacaac ccggccgatg 4140 acatcactag aggcaaatct ctccaaaacc tctctagaga cagcaggtgg agtcaaggac 4200 catctttcct taagctccct tctgagagct ggccagaaca gccgcattta gcagcagaag 4260 ggccgaactc tgaactaaga aagtcatcct tctctgtttt agtaaccact actgctccct 4320 cagctaaact gcagcacttc aacaccctga cagagtgcct gaatgcttat ggacaggagc 4380 ttcatggggc ggcctacacc tccagtgctg aactgcaaaa ggatgttcaa cgtgcagttc 4440 tccgtcaagc acaagccgag tcctttccag aagagctgag actattgatg tctggaaaat 4500 cagtatcatc gaagagcaga ctactcactc tttccccaga acttgatgct gctactggac 4560 ttattagagt aggaggccgg ctacggcatt gtgaggtctt ggaagcagat actatacacc 4620 cagttgtcct tgatcctcgg cacccagtca cactgttaat cataagagat tatgatgaga 4680 gactgcatca tccaggtacg gagcggttgt ttgcagaaat cagaaggaca tactggatac 4740 tgagaggtcg tgaagccatc cgacgatacc aacaccaatg tacagagtgt aagaagtgga 4800 gaggacgccc agaggtccca ttgatggcag acttacccct tacaagacaa agatacttcc 4860 gtcctgtctt ttactcgact gggatggact gttttggccc cttcgttatt aagattggtc 4920 gacgtaatga gaagaggtgg ggtatagtct tcaagtgtat gaccacaagg gcagtccatc 4980 ttgatcttct ctctagtatt gactctgact cattcttgat ggctctaaga aggttcatag 5040 ccagaagagg caacccccat gagttactgt gtgaccaagg cacaaatttt aagggaggag 5100 aacgtgaact gaatgaggcc tttgctgctc tccaaagtga gctacagaac catctcgctg 5160 ttcagcaaat taaattcacc tacaacccac caagtgcccc tcactttggt ggatgctggg 5220 agagggagat ccgttcgatc aaggcagccc ttaaagtgac aattggagcc cagacagtta 5280 ctgaggaagt gttaaggacc gtcctgattg aggtagaagg tatcctcaac tctaagcctt 5340 taggatatac ctcctctgac gtcgctgatc tcgacccaat taccccgttc tgcttcttaa 5400 ttggtcgcag agatgtgtca cttccccaag tcgtctatga ggactcagaa atcctaagtc 5460 gacggcggtg gcgtcacagc caactacttg cagagcactt ctggaggcac tttctgaaat 5520 actacctacc tgaccttcag gcacggcaga agtggaaaac cgagaagaag acattagaaa 5580 ttggggatgt agtcatgata gttgatcctc aactccctcg tgcactttgg ccagtgggca 5640 gaatcacaca agttttccct ggtgccgatg gtcgagtgcg gactgctaat gtggaggtta 5700 agggcaagac ttatacccgt attgtggctc gacttataca gcttccagct ctacctgagg 5760 acgaataatg catgttatgg atcttttgtc agatgagcga attcagttct gaattcgggg 5820 gcggc 5825 // ID DNA-X-10_DR repbase; DNA; ZEB; 1507 BP. XX AC . XX DT 25-NOV-2008 (Rel. 13.11, Created) DT 25-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1507 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1736-1736 (2008). XX DR [1] (Consensus) XX CC TIR is ~ 114-bp long. The TSD is uncharacterized. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1507 BP; 490 A; 323 C; 296 G; 398 T; 0 other; gagaagggtc tggctgcaga gttgcacttg catcttcagg cttgcactct cgggcttgca 60 cgctcaggca cccacgcttc ctatgcaagt gacgtcattt acaatcgaca cctgcacctg 120 tcacgtcact tgaaatcaac acaaataaac caaaagtaag tttccagttc gtttgcgttg 180 ccagtgacga cgttgtcaaa taattctaag tagcctactt aagttgtaca ccatgacaga 240 ccccacctac cccccacccc ccgaaaagca gttagtatat tcatcatatg attatattgc 300 tttgatgtaa atttaccaaa gcaggctact attaacattc ataccagcaa agcttattat 360 ttatttatat taactatcag gtctaaatat acacacaagg tattcttcca cagttatgaa 420 taaaataata aaaaaacacc aggcaaattt gaattttagt ttttatttta aacatttcaa 480 ggagttgtag tattacaact aacagataca cattatggac ttacagcaat gattcttgaa 540 cgacttggga aactattctt tattcagcag gtcccaggac gattgcaaac aagaatgcag 600 atgtcgatgc aagtgacgtg gcaggagagt gcagggctcg tttgcatgaa agtccttaac 660 caaaaaaatc cctcagttta aggttaaggg tacccttagt aaaagttggt ttgcaaggag 720 aaaaaaaaaa aaaaagatca acataaagaa tttaaggctg ccattaagta tttcccctca 780 gtccttagta accatttcct cttaaaacat ttaagggacc taatcagctc ccttgaacaa 840 atgacactca aaatgcaagc attctcaccc caaaaaaatg ttttagataa gactcacagg 900 actgtaaata aataatttaa taaaaccatt taataaacta atatacgtgt tatgataagt 960 tagcacgtct gcgcaggagt gcatccatat gaaatcacag ttgcagtgca gcttcaacag 1020 tctctacatg atcccagccc agttttatag gtcctccaaa cataatgtcg agatcgactc 1080 gataagggaa cgtctcgact cgcttaacgg aaagaaagcg gcacagatga tgtgtaacat 1140 tactgcagag ctgttgtaag gcaagcattc gtggttcgtt acgatctgtt tacgtaacta 1200 tagcaactgt ggcgcccgtc gccgtatgtt gccagaaact acagtgaaac gcttagaaaa 1260 aaggcacttg agtaagggtg acagataaga agtttgttgc aacgatctta atgaaatccg 1320 ttacctcaga gatttttccg cctcagggac acccttaagt taaattaccg aaggactttc 1380 atgcaagcgg ccacaggtgg cgattgtaaa tgacgtcact tgcagaggaa gcgcggttgc 1440 ctgagcgtgc aagtccgagc gtgcaagcct gaaggtgcaa gtgcaaatct gcagccagac 1500 tctattg 1507 // ID DNA-2-23_DR repbase; DNA; ZEB; 800 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-23_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-800 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2132-2132 (2008). XX DR [1] (Consensus) XX CC Copies are ~92% identical to the consensus. TIR is 14-bp long. CC TSD is 2-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 800 BP; 297 A; 93 C; 108 G; 302 T; 0 other; cactctaaaa aataactcaa gaggtctgtt accactacat agtttaatac attattgtaa 60 cttaaaaatt ctaaaatgct gattagatta aaacttttaa gttgccagaa taattttatt 120 aagttttcaa aacaggaaaa tattgataat tacatcaatt taaaaatgcg gctaatttaa 180 aggtaaatat atctgtcagt tactttaaaa caatatttaa cttgtagcaa cccaaagaaa 240 ttgggttgat aactcaattt ttccacacta atggagttgt tttgtatttt taagtccaca 300 cactgatgtt ttttgttatt ttttatactg ataatatggg acagtacatg tttttgtttt 360 gtaaaacaat gaaaacctgt ggtttaatac aaataaaaaa ttaccgaaaa acaaaaatgt 420 tgttgttaaa tgtatttctt acaagcatga acagtaaatt tgaaaacaaa caaaaaatta 480 aacaatttca gctacttaat ttatttgagg ttctaaactt gaaaatgtgc atgtaattaa 540 gtacataaca ctggatttgt tttcaatttt aatttaaaaa actaattgaa acaacttatt 600 tttttgagta ctcaacttaa aaaatttgag ttttttgcta tgtaaaaatg cctctgattg 660 aagaggaaaa gatttggcta acataataaa gtttgttgtc tgaacctcat tttgttagat 720 gttgtcgtaa cttgaatata ttaatgcaaa ctgttgcgtt aattttgttt tgttttgtct 780 aaacaatatt ttttagagtg 800 // ID BEL9-I_DR repbase; DNA; ZEB; 6001 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL9-I_DR; KW BEL9-LTR_DR; BEL9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6001 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1367-1367 (2008). XX DR Genome; chr18; Positions 26979601 26985601. XX CC Positions [5026-5586] - Integrase core CC 'CTCGC' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 34..5952 FT /product="BEL9-I_DR_1p" FT /translation="MSVSEANSSSPVQRSVRLRTLPAHLENYILDINLPSC FT LPSSTFAANVVGDILASSTLAPMSKSEQIQSNKELSLRELMENLSAAEQEE FT SNEIAQLTSRLSLYEKRQQRRQDMMLRINAFLHEEEADSLIVQGADAAIQK FT GAHSQESVPHISSTAEHSSKIPAIYLASQPALMSPEQKRSDLPPISSAPVT FT PNTPYVVQSTNIAPPASQTLPLHNVSSIHSMPSSYFMSPRLHADPDAAQIQ FT IRTPVQYGHQSPPKLIRSSGLEDAPRLPQQQWAINYPVTVPHMQPSVAHPG FT NHYAYSAQPDYRSAYHQPYAVHSNEWPQPIDYRAAPPVHMPLYGAPKPSIP FT DFNADNERDFANLKLALDNLLDSHPELSEKYKYHVLLEHLKLPEAQMIGQS FT CRHEAHPYTSAMQALQLHYGQPHQLAQSEIAAILNAPDVKVGDASSFQSFA FT LRVHLLVSMLLSLEGSRGIELSCCSHVDRLLSKLPKYLRDNFIEYLQIQGK FT LNVSSLNPYNLRDFSGWLQGKAQQQRLSNKLVERYQKEKFIPQAHGRTSTR FT PRGQSTLVYHGVESQSNRAPKPESQPPQWRKSKVHCLFCHSKDHYITQCSD FT IKEQSADNLMQWIVEGKRCWKCGRSHSPESCNLRKPCGMCGEIHLQVLHGA FT AEKRFKAKQTTALSSRIYLTPSNSSGRVLLKVVPVLLHNEDRTIETFAILD FT DGAERTMLLPTAAQKLQLEGKEETLALRTIRHDVTFLTGSIVTFDVSSMLN FT SGTRYRIQNAFTAAGLDLVEQSYPIKALQHKYAHLRNLPLQPFKNVQPLLL FT IGSDNSTLITAEKTVQASNQMGPVAVRTKLGWALQGPEGLLNKGTTMTQSY FT FFTASAAHDTLYRDVERLWQLDSLPYRSEKVVIRSRQDQESIKILDTKTCR FT LLVNDVERYATPLLRPAEAPKLRAPKESVLANLRNTEKRLRDQPEKAAAYK FT GEMNRLIQAGYVKQITAEEAENSEESWYLPHHLVYHNDKARLVFNCSHVYK FT GTSLNDQLLPGPNLGPSLIGVLLRFRQHQVAISGDIRGMFHQVHLLPEDTP FT LLRFLWRDLQQEASPEIYEWQVLPFGTTSSPCCAIYALQRHVREHPRSTPG FT LIESVERSFYVDNCLESHPNGAAAQARVDDLRSLLADGGFDIRQWASNRLS FT VVAHLPPEARSKDMELWLEHNRSDQREPALGLCWNCSSDTLGYRYRPIEYS FT TLTMRTIYKILASQYDPLGFLTPYLTRAKVLVQQLWAKRRDWDDPDLPLDL FT CESWKQWESELPNISAISIPRSYLPDGMDQPNTEHHLHVFCDASEKAYGAV FT AYLSSTQGGATNVAFVMARSKVAPKRQQTMPRLELCAALAGAQLANLIQTE FT LSVTLQRTTLWTDSTTVLEWLQSESCRFKVFVGARVSEIQELTDLRAWRYV FT DTLSNPADDLTRGKKLLDLATPNRWSKGPSFLLQAPDCWPEKPHTAPLLQS FT DELRNITLCNLVSVQQQDSVPDASQFTSWKDLVEAVRLQCQGTGEPAQTQP FT SNSYRAAELTLLRACQESSFPEELTSLKTGQPVHINSRLRHLAPELDEATN FT LIRVGGRLRRMQPISDFEIHPIVLDSRHPATALLIKDYDERLLHSGSERVF FT AEIRRQYWILKGRQAIKKHQLQCTECQRWRARPRIPQMADLPSARLRIFSP FT PFHSTGIDCYGPFIAKIGRRHEKRWGVIFKCLTTRAVHLDLLNSMDTDAFL FT LALRRFIARRGRPVEILSDRGTNFRGADAELRAAFEEMETQLQQQLASYQI FT EFKFNPPNAPHFGGVWEREIRSIKNALQVAVGTQPLPEDVLHTILVEVEGI FT INSKPLGYVSADIADPDPITPNMLLMGRRDASLPQAVYAPETIGRRRWRHC FT QNIIEHFWIHFLRDYLPALQSRYKWQQQKEAIALGTVVMIVDPNLPRADWP FT VGRVLKLFPSQDGKVRAVEIQVKDKVYTRPVARLIPLPGLPDSAKDQSA" XX SQ Sequence 6001 BP; 1704 A; 1490 C; 1371 G; 1436 T; 0 other; tggtccttcg agccggagct aactactaca gacatgtcag tatctgaagc aaactcttcc 60 agtccagttc agaggtcagt gaggctacgt acacttcctg cgcatctgga aaactatatt 120 ttggatatca atcttcctag ttgtcttcca tcaagcactt ttgcagcgaa tgtagttgga 180 gatatattgg catcatctac gctcgctccc atgagtaaat cagagcaaat tcagtcaaat 240 aaagaacttt cattgagaga actgatggag aacctctccg cagcggagca ggaggaatca 300 aatgagatcg cacagctcac aagcagattg tctctttatg agaaaaggca gcagcgtcga 360 caggatatga tgctgcgaat taatgcattc ctacatgaag aggaagccga cagccttata 420 gtacaaggtg cagacgcagc catacagaag ggagcacatt cccaggagtc tgttcctcac 480 atttcctcga ctgctgaaca ctcatcaaag ataccagcta tttatttagc atctcagcct 540 gcattaatga gcccagaaca gaagagatct gacctccctc ctatctcatc agcaccggta 600 actcctaata caccttatgt agtccagtct actaacatag ccccgcctgc atcgcagacg 660 cttccattac acaatgtgag ttccattcac tctatgccat catcatattt catgtctcca 720 cgactgcacg ctgacccaga cgcagcacag atacaaatac gtacacctgt acaatatgga 780 catcagtcac cccccaaact tataaggtcc tcaggcctgg aggatgctcc tcgactgccc 840 caacagcaat gggccataaa ttatcccgtt acagtgcctc acatgcagcc ttctgtagcc 900 catccaggca atcattatgc ctattcagca cagcctgatt accgtagtgc ctatcaccag 960 ccgtatgcag tgcactcaaa tgaatggcca cagcctattg actatcgtgc agcaccccca 1020 gttcatatgc ccctgtatgg agcaccgaag ccttcaattc cagattttaa tgcagacaat 1080 gaaagggatt tcgccaatct aaaactggca ttagacaatc tactagattc acatccagaa 1140 ctatcagaaa aatacaagta tcacgtgctg ctagagcatc tcaaactccc agaggctcaa 1200 atgatcggcc aatcatgccg acatgaagct catccttata catcagcaat gcaagctctt 1260 cagttacatt atggacagcc tcatcagtta gctcagagcg aaattgcagc tattctgaat 1320 gctccagatg ttaaagttgg tgatgctagt agcttccagt catttgcttt aagagttcac 1380 cttcttgtga gtatgctgtt atctctggaa ggatccagag gtattgaact tagctgctgt 1440 tcccatgtag ataggctcct cagcaaactc cctaagtatc tcagagacaa cttcattgaa 1500 tatttgcaga tacaaggtaa gctgaatgtc tccagtctca acccatataa cctcagagac 1560 ttctctggct ggcttcaagg taaagctcag cagcagcgcc tgtccaataa gctagttgaa 1620 cgttatcaga aagaaaagtt tatcccccaa gcacatggga gaacctcaac gcgacctaga 1680 ggtcagagta ctttagtcta ccatggggta gaatcccaaa gcaatagagc tcctaagccg 1740 gagtctcagc cccctcagtg gagaaaatcc aaagtgcatt gtctgttctg ccacagtaaa 1800 gatcactaca taacacagtg ctcagatatt aaggagcagt cagcagataa tctcatgcag 1860 tggatagtag agggaaaacg atgttggaag tgtggtcgtt ctcattctcc agagagctgt 1920 aatctgagaa aaccctgcgg tatgtgtgga gaaatacacc ttcaagtgct ccatggagct 1980 gcagagaagc gattcaaagc taagcagact actgcactga gtagcagaat ctatctcacc 2040 ccttctaact cctcaggtcg agtcttgctg aaagtggtgc cagtgctgct gcataacgag 2100 gacaggacca tcgagacatt cgctatactt gatgatggag cagaacgaac catgctgctg 2160 cccacagcag ctcaaaagct gcagctggag ggtaaggaag aaaccttagc cttacgtact 2220 attcgccatg atgttacatt ccttactggt tcgattgtta cctttgatgt gtcctccatg 2280 ctcaattctg gaaccagata taggattcaa aatgcattta ctgctgctgg tttagatctg 2340 gtggagcaga gctaccctat aaaggctctc caacacaaat atgcacactt aaggaatctt 2400 ccactgcagc ctttcaagaa tgttcaaccc ttgttgttga ttgggtcaga taatagcact 2460 ctgatcacag cggagaagac agttcaagcc agcaatcaga tgggaccagt tgcagttcgc 2520 acaaagctgg gctgggctct tcaaggccca gaaggcctcc taaacaaagg taccacaatg 2580 actcagagct atttctttac tgcctcagca gcacacgaca cactttatag agatgtggaa 2640 agactgtggc aattagactc attaccatac cgcagcgaga aggttgtaat tcgctctaga 2700 caagaccaag agtccatcaa gattcttgat actaaaacgt gcagattatt ggttaatgat 2760 gtggaacgtt atgccacccc actcttacgc ccagcagaag ctcctaagct gagagctcct 2820 aaagagtcgg tactagccaa tctacgtaac acagaaaaga gactccgaga tcagccggag 2880 aaagcagctg cttataaggg agaaatgaat aggctgattc aggctggcta tgtgaagcag 2940 attactgcag aggaagctga aaactcagag gagtcctggt atctgccaca tcatctcgtt 3000 tatcataatg ataaagcacg gctagtgttc aattgttccc atgtatacaa gggcacctcc 3060 ctaaacgatc agttactgcc aggtccgaac ctggggcctt ctctaattgg ggtcctactg 3120 cgattccggc aacatcaagt ggcaatcagt ggggatataa gaggaatgtt ccaccaggtt 3180 cacctgttgc ctgaggacac tccgttgcta cggttccttt ggagggactt acagcaggaa 3240 gcctcaccag agatatatga atggcaggtg ctaccttttg ggacaacaag tagtccctgc 3300 tgcgctatct acgcattaca gaggcatgtc agagaacacc cacgcagcac tccaggtctc 3360 attgagtcag tcgagcgcag cttctatgtt gataattgtt tggagagtca ccccaatggt 3420 gcagcagccc aagctagagt tgatgacttg cgtagtttac tagctgatgg tggatttgat 3480 atcagacaat gggctagcaa tcgattatca gttgtggctc acctaccccc tgaagccagg 3540 tctaaggaca tggagctgtg gctagagcat aatcggagtg atcaaagaga gcctgcatta 3600 ggactctgct ggaactgtag ttcagataca ctgggctaca gatatcgacc cattgagtac 3660 tcaactctga ccatgcgcac aatttataag atactggcca gtcagtacga cccgctaggg 3720 ttccttacgc catacctaac cagagctaaa gttcttgtgc agcagctgtg ggcaaagagg 3780 agagactggg atgatccaga tctccctcta gatctatgtg aatcctggaa gcaatgggag 3840 agtgaattgc caaatatcag tgccatctca atccctcgta gttatctccc agatggaatg 3900 gatcagccca acactgaaca ccaccttcat gttttctgtg acgcctctga gaaggcgtac 3960 ggagcagtag cctacctctc atccacacaa ggtggtgcta ccaatgtagc attcgtcatg 4020 gccagatcaa aggttgcccc taaacgacaa cagactatgc cacgactgga gttgtgtgca 4080 gcgctggcag gagcgcagct tgccaatctg atccagacag aattgtcagt cacactgcaa 4140 aggactacac tgtggacaga ctccacgaca gttctagagt ggttgcaatc agaatcatgt 4200 agatttaagg tattcgttgg tgctcgtgta tcagagatac aagaacttac agatctgcga 4260 gcctggcgtt atgtggatac gctgagtaat ccagcggatg acttgacaag aggaaagaaa 4320 cttctagatc tagctactcc caacaggtgg agtaaagggc catcctttct gttgcaagct 4380 cctgactgtt ggccagagaa accacatact gctccattac tccagtcaga tgaactgaga 4440 aatataacac tctgcaactt agtatcggta caacagcagg acagtgtgcc agatgcaagt 4500 cagttcacat catggaagga cctagtcgaa gctgtccgcc tccaatgtca gggtacagga 4560 gagcctgctc aaactcagcc aagcaatagt taccgtgcag ctgaactcac ccttctaaga 4620 gcatgccaag aaagcagctt tccagaggag ctcacctcac ttaagacagg ccagcccgtt 4680 catatcaaca gcaggctcag gcacttagca ccagagctgg acgaggcaac taaccttatc 4740 agagtaggag ggaggctacg acggatgcag ccgatcagtg actttgagat ccatcctata 4800 gtgctcgact cacgtcatcc tgccactgcc cttcttatca aagactatga cgagcgcctg 4860 ctgcatagtg gatctgagag agtcttcgca gagatcagaa ggcagtattg gatcttgaag 4920 gggcggcagg ctataaagaa acatcagctt caatgcacag agtgccaacg atggagagct 4980 cgccctagaa ttcctcaaat ggcagaccta ccatcagctc gcctccgtat cttttctcca 5040 ccatttcact caacgggcat tgactgctat gggccattca ttgctaagat tggcaggcgc 5100 catgagaaga gatggggagt gattttcaag tgtttaacaa cccgtgcagt acacttggat 5160 ctgctcaatt caatggatac ggatgctttc ttacttgccc tccgtcgatt cattgcacgg 5220 cgaggcagac cagtggagat tctctcagat cgaggcacta attttcgagg ggcagatgcg 5280 gaattgagag cagcctttga ggaaatggaa acacagcttc aacagcagct ggcaagttac 5340 cagattgagt ttaaattcaa cccacctaat gctccacatt tcggaggtgt atgggagcgg 5400 gagattcgct ccatcaagaa tgcactccag gttgctgtgg gaactcaacc tctaccagag 5460 gatgtgctac acaccatcct tgttgaagta gaaggtatca ttaactccaa gcctcttgga 5520 tatgtttcag ctgacatcgc tgaccctgat cctataaccc caaacatgct cctcatgggg 5580 cggcgggatg cttccctgcc gcaagccgtt tatgctcctg aaactatagg acgacgaaga 5640 tggaggcatt gtcaaaatat catagagcat ttctggattc atttcttgcg agactacctg 5700 ccagcacttc aatctcgcta caaatggcag caacaaaagg aagccattgc attagggact 5760 gtggtaatga tagtagatcc taatctacca cgtgcagact ggcctgtggg acgagtgctg 5820 aagctctttc cgagtcaaga cggcaaagta cgagcagtcg aaattcaagt aaaggataaa 5880 gtctacactc gtcctgtggc acgacttatt cccctaccag ggctgccaga cagtgctaaa 5940 gatcagtcag cataaagaat tctcttgata tacacatttg ctactcaaat gtgggggcgg 6000 c 6001 // ID TC1DR1 repbase; DNA; ZEB; 1006 BP. XX AC AL603743; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE TC1 Danio rerio 1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; TC1; TC1DR1; KW transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1006 RA Jekosch K.; RT "TC1DR1: Tc1-like transposon from D. rerio."; RL Repbase Reports 2(2), 18-18 (2002). XX DR [1] (Consensus) XX CC Putative novel DNA transposon similar to worm (C. elegans) TC1 CC transposons, one reading frame (pos. 1-1006) with several frame CC shifts CC and stops. XX SQ Sequence 1006 BP; 315 A; 219 C; 241 G; 231 T; 0 other; atggcacctc atggcaaaca actctcagaa gacctgaaaa aacaaattgt tgccttacat 60 aaagatggcc taggctatga gaagatagac aacactctga aactgtgctg cagcacagtg 120 gccaagacct tccaccgctt taagaggaca ggttcttccc agaacaggcc tcgccatggt 180 cagccaaaga agctgagtgg acgttctcag catcacatcc aaagattgtc tttgaaaaac 240 aaatggatga gtgcttccag cattgccgca aagattaaaa atatgggagg tcagcctgtc 300 agttctcaga ccgtatccct cacactgcat taaattagta tgcatggttg tggtcctaga 360 agaaaacctc tttaaagacg atgcaagaaa gctcgcaaac aatttacagc agacatgcaa 420 actaaggacc aggattactt gaaccatgcc ctgtggtcta agaccaagat aaacccattt 480 ggttcagata gtgtcaagtg tgtgtggcag caaccaggtg gggggtacaa agacaagtgt 540 gtcttgccca caattaaaca tggtgttggg agtgtcatgt tctggagctg catgagtgct 600 gctggcattg gggagctaca tttcattgag ggaagcatga attctaacat gtactttgac 660 atactgaagc agttccctca ggaaactgga ctgcagagca gtgttccagt atgataatga 720 cctccaatac acctccagga tgaccactgg cttgctaaag aggctgaggg taaaggtgat 780 ggactggcca agtatgtctc catacataaa ccccatataa cacctctggg ccatcctgaa 840 aaggaaggtg gagaagcaca aggtgtcaaa tgtccacgac ctccatgatg ttgtgatgga 900 gtagtggaag agaatttcag tggcaacctg tgatgctctg gtcaactcca tgcccaaaag 960 aattaaaacc atgctggaaa atgatgatgg ccacacaaaa tattga 1006 // ID DNA-8-30_DR repbase; DNA; ZEB; 1022 BP. XX AC . XX DT 12-FEB-2009 (Rel. 14.02, Created) DT 12-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-30_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1022 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 530-530 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1022 BP; 338 A; 181 C; 189 G; 313 T; 1 other; tagggatgca acgattatag attttggttg tacgattata gtctgaggaa taatcacggt 60 ttcacggtta tcacgattat tatgcattca ttaatttcaa aacactacta gtttagaaaa 120 atcacatgaa aactccttat attttaaagt gttatttatt gctgctcagt aaccaaataa 180 tacagcacaa aataataata taatattaaa aacaaacaat agtccatttc tctttggact 240 taaaaataag tgaaaaaagg ttttggccat ttaaacataa gtaataattt tacactgaaa 300 aataaatgac agaacaatga ataaataaat aaataaataa ataagaataa ataaaaaatg 360 tattttaaat ttaatgtcta atgtaaatct aagctacata ttagctaagt atttcccttt 420 taaagagaac aaatgtagtc aaaggctgct gaacctctgt ctgaaaggca cttttgatca 480 actatattac aaaattgttt ggtattttcc ttttgtattt ttctctttgg agtagaaatg 540 tgtatattcc atacaaagta tgaagctgat cggtccagtt cttgtcacgt gactcgcggt 600 gcgctcgcgg cattctttca actaggtgcg cgtrcacctg gaaagcgcat gagcgtgtct 660 ggccacttgc actcggtgtt caatgcgtgc acgtgcgtcg ctgccaatat gcacacgcaa 720 gccgtgcgcc tccattggaa ataacataac gaacttgcgc gcacaaaaga cgcgatatgt 780 gaacggccct tagttaatag cctcagccat agttaaaatc agcctttaat ccagtgtgcg 840 tgggtattag cctcggtgta caagcttgga actttaaact agcgcacagt tggcataact 900 ttgtttagtt gcagcagttt tgttccgtgc agaaacgcgg tgttgagaag cctttacgat 960 taattaacca tgacgaatta aagcgcagtt aatagtgaaa tcggttaatc gttgcatccc 1020 ta 1022 // ID Copia1-LTR_DR repbase; DNA; ZEB; 187 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia1_DR; KW Copia1-I_DR; Copia1-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-187 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1585-1585 (2008). XX DR Genome; chr1; Positions 26847429 26847615. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 187 BP; 53 A; 38 C; 39 G; 57 T; 0 other; tgttagcgtt actattggat gtgccctcag actgtcatgt catgtgttac ctgcttccca 60 tatttggaca tatgcaaata aatgtattcc ggttcagcag aaaaccgagg agagatgtga 120 gttattttac ctccgtgtaa ttaagattat ataccggtta ccatacacga aaatggctgc 180 gccaaca 187 // ID hAT-7_DR repbase; DNA; ZEB; 3826 BP. XX AC . XX DT 22-MAR-2010 (Rel. 15.09, Created) DT 22-MAR-2010 (Rel. 15.09, Last updated, Version 1) XX DE DrAc2 has 11 bp. TIRs flanked by 8bp. TSDs. XX KW hAT; DNA transposon; Transposable Element; Ac; hAT-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3826 RA Arensburger P. and Atkinson P. W.; RT "haT-7_DR."; RL Repbase Reports 10(9), 1228-1228 (2010). XX DR [1] (Consensus) XX CC There are at least two copies of this element in the current D. CC rerio assembly. XX FH Key Location/Qualifiers FT CDS join(256..1221,2150..3031) FT /product="hAT-7_DR_1p" FT /translation="MELVGKKNATSPIWQYFGFTPDEKGEPADVTQAVCRL FT CSKVVPVKGSQTTNMYVHLRSRHPAEAAKLAPREGTISKDRSQPSIIGAFS FT KSTKYKRNSERWAQCTAAVSKYIAKEMVSYHTVEKASFKDLLQTFDKQYEL FT PGRKYFTETAIPNLYNETRDIIAKDLKAANFVALTTDMWSSINMTPYMSVT FT VHYISEDWKLEAKCLETTFISENHTAEVLAEALSDAMQDWDIEPSKISCIT FT TDNGANIVAAIRRLGWPWLNCFGHNLNVAVNYSLQKEKTKTDRALGICRNI FT NGAFSHSWKRRQELRKAQEQLNLPQRMLITDCTTRWGSTYAMVVRVLEQLP FT AIKRVFADDKSRRGTLPNVTWQDIAVLEAVRDGLKPVAEFTDILSAENYVT FT VSSLLPMLQLTKDILKEEETDVEITAGIKRRILEKLDSKYDDSTLQLMRKS FT TLLDPRYKGDHIDAAQLDVIKSQLEVEMVEYWNMPPVPIRVEEEEEEDAAG FT TSSSENQPAKRIKTLGRLLGRAKPSVSTSTPVDQRAKSEITSYLQEEVIDG FT DDKPLDWWKENRRRFPLMANLAKKYLCVTATSTPSERVFSAAGNIITPLRS FT LLKPEKVNQLVFLAKNL*" XX SQ Sequence 3826 BP; 1281 A; 674 C; 733 G; 1138 T; 0 other; tagggttgcg cgataatgcc cattgtcatg tcgtcctatc gtcagcctgt aagatcgccg 60 atacacgata gtatcggggg gaggggcaat tttttgatta tttatttgtt tgtttattta 120 taagaccatt tatttactta ttagtctact ttttcttttt aatttacact tttattttaa 180 tattaacgta attattaatt atcacgcgaa atgacacatg ggaaatgagg accgtctatc 240 attctgacaa ggaagatgga gttggtggga aagaaaaatg ccacatcgcc tatttggcaa 300 tatttcgggt ttacaccaga tgaaaaaggt gaacccgccg atgtcacgca ggcagtttgc 360 agactttgct cgaaggttgt gccggtaaaa ggatcacaaa ccacaaacat gtacgtacat 420 ttgcgtagcc gccaccctgc tgaggctgct aaactagcgc caagagaagg cacgattagc 480 aaagatcgaa gtcagccatc aatcatcgga gctttttcaa aaagcacaaa gtataaaaga 540 aacagtgaaa ggtgggctca gtgcactgca gctgtatcga agtacattgc gaaggaaatg 600 gtctcctatc atacagtcga gaaagcaagc tttaaagacc tgctccaaac ctttgataaa 660 cagtacgagc tgcctgggcg caagtacttc accgaaacag ctatcccaaa cttgtacaat 720 gaaactaggg acatcattgc taaagatctg aaagctgcta actttgttgc tttgaccaca 780 gacatgtggt caagcataaa catgaccccc tatatgtctg tgacagtcca ttatatttcg 840 gaagattgga agctcgaagc aaaatgcctg gaaacaacat ttatttcgga gaatcacact 900 gcagaggtac tggctgaggc gctgtcagat gccatgcaag attgggacat agaaccgagt 960 aaaatttcat gcattacaac tgacaacggg gccaatattg ttgcggccat acgaaggctc 1020 ggatggccat ggctgaactg ttttggccat aacttgaatg tggcagtgaa ttattctcta 1080 caaaaggaga aaacaaaaac tgatcgcgct cttggaatct gtcgcaatat taacggggca 1140 ttttctcaca gctggaagag gcgacaagag ctccgcaagg cgcaggaaca gctaaattta 1200 cctcaacgta tgctgataac ggtacagaaa ttattatatt ttatttgctt aatatttatc 1260 ttgtggcaca accatcttag tgaaaaataa gaaaaaatgt attttttgtc agactagaaa 1320 taatgcttgt aaatcaaata taatattatt taattaggct atacactgta aaaagatttc 1380 tgttgatcta ggctaactaa aaaaattaag gtaatgttca catctatatt ttatcatttt 1440 agtcaatgaa aaataagtaa cttgacctaa acaatatttt ataaatctaa gaaaactatt 1500 ttatataact tggcagttgg cacaatgtag gcctatacat ttttcttgtt gaatgaagta 1560 aaaatgttta aattgtgttt ttgataaaaa caaaaagaaa tagcctagat gtaatcaaag 1620 caaaaaattc taatttagaa atagcctatg tattagattt ttttcaatta aaaattggat 1680 tttttttgta aataattatt atttgtaaat ttgtaaataa ttattatttg accaaatcat 1740 acaaaaagac atcaatagtt ttgattacat cgatatttat taaaaactta aactttactt 1800 ccacacatcc acacaagtga ccagtgcgtc gtctgcgatt tgaccacgga ccattgactg 1860 gctggtggtg atgagtgtgt ataatgtgca tgcgcgtaca aaatatccaa cctaaaaatt 1920 gtatattcgt caaaacaaaa aatgtattaa ttgagttctt aataaaaaat atttagattg 1980 agcaactttt taaactaaaa atatattact tgaaaaataa aacaaataat tacaaattta 2040 attgaagaca aattattaat tgggtgaggg actgcttgaa aaacttttta cagtgtaata 2100 agtgatcaca ttaacaatcg acgcataatt gtcatttaat atattttagg attgtacaac 2160 tcgttgggga tcaacatatg ccatggtagt aagagttctg gaacagcttc ctgccatcaa 2220 aagggttttt gctgatgaca agagccgtcg tggaacactc ccaaacgtga cctggcagga 2280 cattgctgta ctagaggctg tcagggatgg gttgaagcct gtcgcagaat ttactgatat 2340 tttatcagct gaaaattacg tgacagtctc atctctgctg cccatgctgc agctaacaaa 2400 ggatatcctg aaggaggagg agactgatgt ggagataacg gcaggaatta aacggcggat 2460 tttggagaaa ctcgactcga aatatgacga cagtaccctc caactaatgc gcaaatctac 2520 cctccttgac ccacgctata agggcgacca catcgatgcc gcacaacttg atgtgatcaa 2580 atcacagctt gaggttgaga tggtggaata ttggaatatg ccacctgtac ccatcagagt 2640 ggaggaggaa gaggaagaag atgcggcagg tacgtcatca agtgagaatc agccggctaa 2700 gaggatcaaa acactgggtc gtcttcttgg gagagcaaag ccgtctgtgt ccacatccac 2760 acctgtagac cagcgcgcaa aatctgaaat cacgtcttac ttacaagagg aagtaattga 2820 tggtgatgac aagccactgg actggtggaa agaaaataga cgccgttttc ctttaatggc 2880 aaacctggca aaaaaatacc tgtgtgttac cgcaactagc accccttcag agcgcgtgtt 2940 tagcgcagcc ggtaacataa tcactccact tcgcagccta ctcaaaccag aaaaagttaa 3000 ccaacttgtt tttttggcaa aaaatctgta gacggaagca ttactcttgc atcgtaaaca 3060 gtcatattgt tttaaatctg caagcaatgt aactattgca ttattttagt tcttggttat 3120 ttaaattttt tttttgcatt ttgtttattg aaacgtgttt ggctattagc aacaagataa 3180 acattatttt atgttaaata tgttaaacaa gttagagttt gaaataaacc ttttcatttt 3240 catagccgac tgattattgt gcatttattt tgaatacgca tcatacattt gcggcttgca 3300 tgcatatgac gatttttaaa agagaatcac tcctctttga aaaacatatt aagcgacttt 3360 aaaactaaat ttccttcttt cacaaattat gtcaagcctg cttgttataa agagacattg 3420 tacagttatt ttaaccatgt taaaaaaacg gcaaaagtgc attattttcg ttttgtttta 3480 gtatttttgt tatagcctac aacaaaaata gcataactat acaataacaa aaattatata 3540 ttattacata attttgtata ttagcctaac actgataaat ttatgaatga tttttaaaat 3600 atattcctct ggtgtcaggt gggtgaaggt tacgcgacga tgacagtgtg aaatatgcaa 3660 gggccaatga attgtacttt gataatatta actgcctaat aataattttg aaaaaaaaaa 3720 aacaacaact agaaaataag ccgtattatc gtcattttta ttagctatcg tccatatggc 3780 tgactatcgt cgatagacga tagtatcgtc tatcggcaca acccta 3826 // ID ERV1-N5-I_DR repbase; DNA; ZEB; 6136 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, internal DE portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N5-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6136 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1783-1783 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 6136 BP; 2500 A; 722 C; 1241 G; 1665 T; 8 other; aatggcgccc aaaatcgtgg ggctggacag agacgaacgg gtgacactgg acatgctggg 60 gcttggagga aaaacacaaa tatgtggcca cgataaaaaa cggtaagcaa ttttacgctt 120 atacgtttgt gttttttcta ccaaggacct gcttgtaacg gtaaaaagtc acttgaactg 180 tttctccagc tataacgaac tacaagtaaa actaaaaaag taaaagtaaa gatataaaag 240 gaaaacaatg gggttttgaa taccagaaga aaataacaat ttgcatgcaa atgatctgtg 300 ttttcctaag agggccttga cggataggat aaacgttaac tagtaactgt tactcatgtt 360 tgggaaattt ttaatattat gaagaatatg catagagcat ttttaagttt taaaaggaag 420 ttcttaaagt gtggtgacag ttaagagatt aaagcagatt aataaagtat agagagaagc 480 gcgcaaagac ctcggagtac ccgttctgtt ttaataatga taaatattat tattcaggag 540 gctgggggaa aaaaggcaaa aatcctgtga aatgctattt ttactgaaga acagtaaaga 600 cgggagcaga agtaacaggt cactcaagag gagtgtagtg ragaattctg tggttacagc 660 tctaaggtta gaggctgttc ggacaggtca ctcaagagga gtgaggtgta tttatgtaat 720 tgtaagtatt tgtgtatttg tctgtgttgc tggggtgctt agaagctatc acctttgatc 780 tagagagact agaatagagg tgaggttcag gttacacagt gtaagagatg aggttaggat 840 gagtgagaag tatgattgtg tctagaagta tgtttgcaat aagagtgagt ttaaaagaat 900 tatcgtttta tattataata taaaagaaag actgcatatg ttgtgtgatt taagaacaat 960 aagatagaga gtgatgcagt ttagggaaga gattgtgata cagaggaggt ttaagcctca 1020 taaaattaat tgagatcata ttgcatttga aagacacaga ctgaaaagaa gaatatatac 1080 tagttaaaca tttttctgaa taatctctgt gtgtttgaag tgtttttaag tatgaactag 1140 gcagagaggt gactgaaatt cctgacatgt cgacagtctg gaagaaggtc tgcttcagga 1200 gtggtaagaa tctgtctatt ttaagaacaa aagtttcaca ggacagatag aagctaaaga 1260 gtaaagatat attataaggt attaaataaa tagaaatatt ataaagtaat gtaataagat 1320 aaatgaaata agaaaatata catgatagaa taagatttaa aaaaaataaa taaaaagtgt 1380 taataaaaat gcaactggaa gctatactaa taaatattaa gaaattaaaa aattgaagca 1440 tagaaataac tacatatgga gataaattgt taaattgata aggaaataaa tttactaaat 1500 cagtatgaat caaaaataga taagttaata atttgagtaa akaaaraatt agagaagaga 1560 aaaactttaa tgaatattga ttaattgaag agttaatgat caaatgttta aggaaaataa 1620 gtttaaagaa aaatgaatta ggaaaatcta agagacaaaa atgctaattt ttgaaaagga 1680 aaagaagcat aggaaaatgg agaaaaaaga aaggtccaaa cattgtttaa tagaattaaa 1740 agagaagatg tttaaaaata tatattawaa agaaatataa gataaaaatt agaaaaaaaa 1800 acagtaacat tttgaagaaa ataagatttg gctttaaaag ataaaattat gtttaagtgc 1860 aataaaaaat tgttgtggag taagttgact gttgtcatct gttttaaagt cagctacagg 1920 ctgtcatctg agccaacagc tatttgtagt aaaagcagtg agttagatgt aaagcataga 1980 cactgaggaa aaatgaaaga acatcatagt gagttcaaaa attatatata gttaatattt 2040 tatgtactta aaaattaatt gaaaattaaa aattgcattt cagattggaa aaatgcttaa 2100 agtaaacaaa catgtagagt taaaatagta ctatgaactt gaagctattg cagtaaggat 2160 tttgaacagg tatattagac attaggccac catatagtac tgttgaatta aaacaggtat 2220 ggaaatgtta gagaaattaa aatartgtaa gatgtaatgt aactagtgag gaagcactag 2280 aataaattaa ggaaaaataa gtgaagcaca atctgaagcc aaagaaatta attataagaa 2340 catttaacaa atagaaaatt aagcatattt acatatgata aaaatgattg agaattagaa 2400 aaaataagtt gttaaacagc aaacaaaaat agataaataa ataaattaat taataaaaaa 2460 aatggataaa aggagtcttt atttgatgaa ggaaaaaatg gggaaaaaaa aaaaakttat 2520 tgagtcatga gctaaattct atatatatat taattaagag gacataggga tgaactttgg 2580 gggactgtag tcttgtacct accagacatc tctgagtaca ctggcaaatg gaaaggtgtt 2640 tgaaaaaaca cacagagcaa aatggctgtg gacacttgtg acatgctaca gcagtgatgc 2700 aggaactttt tgcccaggat ttactgagga ctgaaaaagc tacatttgtt cagagacatt 2760 caaaagagac actaacaaga gactgatgaa ctgatcaaaa agagaaaaaa acgctccacc 2820 agaactttac atcagctgag tcaaagaaag aagccagccc aaagagagag ggggaaacac 2880 tgtgccctga tatgaaggaa tttcatcagg acaacacaga aaagactgtt gcaccaatct 2940 gtgagctata gttgtctcag gttgcgaggg gggatagtct gacagtccat ctcaaagtag 3000 ctcctaaagt ccaaattaag ttcaagcatt acagaaggaa aattaagaac aaaagtttag 3060 ctacccattc ttgtatcaaa tcagactcct gtcaatttga gagagaaatt attatcatgc 3120 aaatttttaa gtaaaaaaag aaaaagtgtt ataaggtaat ttcaaggtga aactttaaat 3180 gatgaaatga tatattatga tgaatgttaa ataatttagg tttaagtttt aataggttta 3240 atcataaaag ttccttaaac attgaaggat aaattacaat gcagcagatt gactaaagaa 3300 aaagagaaaa agcttttgat aaggcaataa ataatttgga gaataaataa gattggaatt 3360 aagtatttaa gttaaaatta agtaaaggaa atatcttaac ataatcttat tgtgaatggc 3420 ctaaatcaag attggatatt gaaaactgga aagagattca aatttagaaa aatgattaaa 3480 aaaaaaatga tacagttaaa gggaagattg aaataacaat tgaaataaat tcataagaca 3540 tggtaaggtc aaaaagtaac agttctaaat tactataaaa caaattgaaa gagaaaagtg 3600 tttaagaaaa agatattgtt tcacatgaaa acatatgtga agaataatta gaagaaaatt 3660 awtaaggaat aatagttaaa acagagcaaa gagaaattta caaaacatac agtaagtttt 3720 agtcagctga taaaaaaaag tgcatcaaaa ttgtataaac aactagttta atgggagtgc 3780 cttaggaaat aattatgagc caaaagaaaa taactcaaat taggacaaaa aaggaattaa 3840 tatatagatc agaaaataaa ttagyaaaag aaagaagtgt caaattccaa ctaaatgatt 3900 gtcattacag gactgagatg cagataaaat cagccatgca gcacagtgag ggttaaaatt 3960 ggatcagatg cagatctgcc cagaaagaaa accactaaat gctgaaatag gtataaaact 4020 ctattgaaaa gtgtaaaaat ataagagcac taatagaaac atctagataa tgtaatatgc 4080 aatctgtcct agacatttct atttaaattt aaatttaaag tagtaataaa tgtataagta 4140 actaataata tttaattgaa gttaatttga taattaaaat aaatggttaa tcaagacaga 4200 taaaagaggc ttaatgttga taatatgatt taagagcaat caataggaga agattttcca 4260 gctgaagttc atattgtatg tgaagatatg ttcttttgga tggtaaatat attaagtgct 4320 tagtttctat tgagtttatt ttaggtaact accagaggag ggtagaaact tgtttgtact 4380 tgtgtagctt gatgaatagc aaatactgta tacaccagaa tgcataaagg gttaaaatta 4440 gggagaagat cttaggatgg acactgtcac aatatgtgga tgattgaatt atctacagct 4500 tcacagagct ctagttcaat taaaagactt gactggagac cattgctcca aacagtgaga 4560 tatgatgact gtttttggtg ttgactggat agagaattat tctgtaaaca agcacagttg 4620 agaaactatg atgagtagat gattagaggt agcatcagga gcagattatg aagcagggat 4680 agttattaca tctgtgacaa gagaagtact tcctaaattt tgaaatgcct tcagaaataa 4740 gttcataatg ggtttgcatt cataaagtat tcaaaattga agttaataag tgatcaataa 4800 ataaaaacat ctaaaggagt atgaaaaatt atgaataaag caaaagagag aataattgta 4860 ttatgctcaa ttaccagaaa tggctgaaag agatgaagat gatccatcaa gaccaacgct 4920 caatgaaata tgtgaaaata ataaacttaa tgggattaat ttgctaacaa gtgcattggt 4980 gagtgatggc atggaaactg acagaaacag gcatctaaca ctgcatcgca tgaactgcta 5040 ttgggtcgac ccttgaaaga aaccttgaaa gaaaccttgc aagaaacagc atgacccttg 5100 caagaaacca catgaaactt tgcataaaac ctggcatata atctcaccta taaccttgca 5160 cataacctct tggaatggtt tgcaaagatg tctaatgcag tagggacttt aatgaagaaa 5220 caaataattg ccatgtgcaa agcgatatct gtgcaggaaa ccagaaagga gccatgtgag 5280 agagcagtga tacaaggtcc aattggtcta ggtgattgag tctatctgag gatgctctgg 5340 agagctcagg aggaaaggac catatctgga tgagggcaaa tccaacagct gccaaagtgg 5400 atgccattga tgaggtacca atgcctcaaa ggagcgaaag gaaacaactc cagcaaggga 5460 aagctgaccg gaaacagagg agcctgaata ttccaaagct ctaaactact aagtgcaacg 5520 tgtggaagca agagtttatc agagcaaagt aaagagtgat tctgataata tgcctgagta 5580 ccacatacag aagtgccaca agtcaaggga tgatgaatgc acaactgcag atttcagaac 5640 tcagactctc aaccacacat ctccatcaac atcttttcaa agtctaatga agatgctgat 5700 agcttagaag aagagaaaca aggatgagca aagatcaaga tcaacatttg atgcttttta 5760 cgtttgcata ttgattttgt ttgcatattg ctttcatggt acctatatat aatatcactt 5820 cagtatattc tacatgagtg tgaaaactag cagctagcac tgaaccaact gaatacttat 5880 gcttaaaact ttattgtctt tgatgtgtta agagaaaaac agagtacagt cttttacttc 5940 cttcaaatca aaagtgacag ggaagatgtt tggctggact tctgtcaaag tacagtggca 6000 taatctggtc aattgataac agctatgtgg ctaatatcag tcactagata gaggtaatta 6060 agagaactgg attatgaaat tgcactttgg ggttaaaaga caataatatt aagactaaag 6120 tcttaagagg agggat 6136 // ID DIRS-3_DR repbase; DNA; ZEB; 5342 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.1, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW reverse transcriptase RNase H; phage integrase; DIRS-3_DR. XX NM DIRS-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5342 RA Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(10), 1268-1268 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(308..1777,1752..3764) FT /product="DIRS-3_DR_1p" FT /translation="MEKTFRSCVPPCPRFLTEQDTHDLCVQCLGQEHAQSA FT LEGGGCVHCDSLPGRVLRSRRALFDKEPAAGVPRGTGPARAEAERRLMSWG FT SQLDLAEGMETSSRASSPPSTVRSRSSQRELEARSSASSSRADGRLLRLPA FT SEGVESMEGLRGQNDVPPLCPEYDELAEVLANVTAKYNINWEVERQEVRQK FT AGLCDERILPSRDRPLRGGLPFNKDLQKEIIYTWKNPYTARVISQQGSIYS FT AVAGLNEHGCRTMPKMEEHLARHLLPQASSAKTPALPSKPXKLTSMLVGKA FT YAAAGQSVGCLHTMSLLQAYQADLLKEALDNGGAGPDTVREXLRASDLSLR FT ATKEVARSLGRSMAAMVVTERHLWLNQADIKEEDKRIFLDAPISPAGLFGD FT SVDRVAKSFERAKKRSGSYGSFLRPKGHPSGAMQRDQPQPSTSSSARYRAE FT QKESVASRQPPKRKWGGKRSQEAKTVTYGGPRTIVVKRSGPQKKSNGLGLR FT RSPSXGTSSGSPQRGAKATSSLPTGGLSTNPPTVGASGRGDPRRYPQTFHP FT KSGDVSGLAPSTEGPRAVKSRASCGEFASVPRVSYSGEHRDQSRETGTPAE FT IFGRVETAAECFAVGPAYHREGLQNSVCFTPLSLQRGAPYPSEARAGSSDG FT TRSRISSEEESHRTDPSSRHRVRVLQPLLHCSKEGWRAASDIRSKTAKSLR FT TNTEVQNVDYQHHRVTNTVRGLVCHDRSEGRILPHIHPSKSQEVPQIRLRG FT QGLSVQGSSVRPSTLTPHIYQSRRRGFGSAAATRDSNSQLPRRLAHSSPIE FT GVSGSTSRCCSRSHREIGVAAKPKEKCACASSDDHFFRCAMGLHDDVCVSV FT PHPNRVDSGSRKEDQTGPSHHXQTVPEAVGSTSGSLQHNPIGSATHETITV FT VAQNQGVFPEGKSVPHNQSIEALPTSVIDMEKALVSVPGPYTRSNISAPGS FT DIRCLTQGLGSDPEGPSRIGTVEGPSSPHAYQLLGDVSSVSGSKALLPSSE FT RSPCVSEDRQHIGGLLHQPSRGFEFAPSVQVGESNPSVGSGEAALSEGSLH FT PRXDECGSGPPVETGVGAWGMATTPKSGGGHLAEIRQSRRRSIRLSXNNTL FT PIVVLPHKHIQHLWGWMPWCRRGRGFVCTLFPQSLCSRESWRGSVKGGTTY FT CW" XX SQ Sequence 5342 BP; 1235 A; 1398 C; 1430 G; 1270 T; 9 other; tggttccctt tcgggaactc tacgctgcgt ctctttagga gacacaatgg gataccatcc 60 ctctcttata accttctgaa gcacaggtgt aatcaatcca atgtaattgg cgggacgcaa 120 cgtgcgtgtg gcgtcagacc gaaggtataa aagccccaca ctcacagtga acttcagctt 180 tttctctctt cactctgcga atgagttcgt tgattttgaa gcactcacag cactgataag 240 gcactacaaa aaccatttaa actgaatcaa agcacacact ttcattttct atttaaaaat 300 tagcaaaatg gaaaagacat ttagatcatg tgtacckcca tgtcctcgtt tcctcacaga 360 acaggacacg catgatttgt gtgtgcagtg cctcggtcag gagcacgcac agtcagccct 420 tgaggggggt ggctgtgtgc attgcgattc acttcctggy cgagtgctgc gaagtcgccg 480 cgctctgttt gataaagagc cggcggcagg tgtgcctcgc ggcactggtc ccgcccgtgc 540 tgaggccgag cggcgattaa tgtcgtgggg atcgcagctc gatctggcag aggggatgga 600 gacaagctcg agggcttctt ctcctccttc tactgtgaga tcgaggtcca gtcagcgcga 660 gctggaagcc cgttcatcgg cttcttcctc tcgcgctgat gggcggctgc tccgcctccc 720 ggcttctgag ggagtcgaga gtatggaggg ccttcggggc caaaatgacg ttcccccatt 780 atgccccgaa tatgatgagc tggcagaggt cctggccaat gtaacagcta aatataatat 840 taactgggaa gtagagaggc aggaagtgcg ccaaaaagct ggtttgtgtg atgaacgcat 900 cctgccatca cgagacaggc ctctgagggg gggtcttcca tttaacaaag atctccaaaa 960 agagatcatt tacacttgga aaaatccata cactgcccgt gttataagcc agcagggatc 1020 aatatattca gccgttgctg gtctgaatga gcacggatgc cgcacaatgc cgaaaatgga 1080 ggagcatttg gcgcgccatt tgctacctca ggcatcgtcg gcgaagaccc ccgccttgcc 1140 gtcaaaacca rtaaaattaa catcgatgtt agtcggcaag gcatatgcgg cagcaggcca 1200 gtctgttggg tgtctgcaca caatgtcact tttgcaggca taccaggcgg acctgctgaa 1260 ggaagccctt gataatggtg gggcggggcc cgatacagta agagaartac ttcgggcctc 1320 ggacctgtct ctccgtgcca ccaaagaagt agctagatct ttaggccgct ctatggcggc 1380 catggtggta acggagagac atttatggct gaatcaagcg gacatcaagg aggaagataa 1440 acgwattttc cttgatgctc ccatttcgcc cgcgggtttg ttcggcgact ctgtagatcg 1500 ggtcgccaaa tcgtttgagc gggcgaaaaa gaggtcgggc tcctacggga gcttccttcg 1560 accaaaaggg catccttctg gggctatgca gcgggaccag ccccagccgt caaccagctc 1620 atctgcccgg tatagagccg agcaaaaaga gagtgtggct tcccgtcagc ccccaaaaag 1680 gaaatggggt gggaaacgct ctcaagaggc gaagacggtg acatacggtg gtccgaggac 1740 catcgttgtg aaacggtctg ggcctcagaa gaagtcctag caytgggact tcttcgggca 1800 gcccccaacg aggagcaaag gccacctcct cgttgcccac agggggtcta tccaccaacc 1860 ctcccaccgt tggtgcctca gggcgtggag acccccggcg atatccacaa acgtttcatc 1920 ccaaaagtgg tgacgttagt ggactcgccc cctctactga gggtccaaga gcagttaaat 1980 caagggcctc ctgtggtgag ttcgcctcag tgcccagagt tagctactca ggggaacata 2040 gagaccagtc tcgagagact ggtacccctg cagaaatttt tggcagagtg gaaacggctg 2100 ccgaatgttt cgcagtgggt cctgcttacc atagagaagg gctacagaat tcagtttgct 2160 tcacgcccct ctcgcttcaa cggggtgctc cataccctag tgaagccaga gcaggctcta 2220 gtgatggaac aagaagtaga atctcttctg aggaagagag ccatagaaca gatccctcct 2280 ctagacatcg agtcagggtt ttacagccgt tacttcattg ttccaaagaa ggatggaggg 2340 ctgcgtccga tattagatct aagacagcta aatcgctccg tacaaacact gaagttcaaa 2400 atgttgacta tcagcaccat cgtgtcacaa atacagtccg aggactggtt tgtcacgata 2460 gatctgaagg acgcatactt ccacatatcc atccttccaa gtcacaggaa gttcctcaga 2520 ttcgccttcg ggggcaaggc ttatcagtac agggttcttc cgttcggcct agcactctca 2580 ccccgcacat ttaccaaagt cgtcgacgcg gctttggctc cgctgcggct acaagggatt 2640 cgaattctca attacctcga cgattggctc attctagccc gatcgaggga gttagcggtt 2700 caacatcgag gtgttgttct cgctcacata gagaaattgg ggttgcggct aaaccaaaag 2760 aaaagtgtgc ttgtgccagc tcagacgacc acttttttag gtgtgctatg ggactccacg 2820 acgatgtttg cgtgtctgtc ccccacccga atcgagtcga ttcgggcagc cgcaaagagg 2880 atcagactgg gccaagccat cacwgtcaaa cagttccaga agctgttggg tctactagcg 2940 gcagcctcca acataatccc attgggtctg ctacacatga gaccattaca gtggtggctc 3000 aaaaccaggg ggttttcccc gaggggaaat ccgttccgca caatcaaagc atcgaggcgt 3060 tgcctacgag cgttatcgat atggaaaaag ccctggtttc tgtcccaggg ccctacacta 3120 ggagtaatat ctcagcgcct ggctctgaca tcagatgcct cacgcaaggg ctggggagcg 3180 accctgaggg gccttcccgc atcgggacag tggagggacc atcatctcca catgcatatc 3240 aactgcttgg agatgttagc agtgtttcag gctctaaggc acttcttccc tcaagtgaga 3300 ggtcaccatg tgttagtgaa gaccgacaac acatcggtgg tctcttacat caaccatcaa 3360 gggggtttga attcgcgccc tctgtgcagg ttggcgaatc aaatccatct gtgggctcag 3420 gggaggctgc tctctctgaa ggcagcttac atcccaggyc cgatgaatgt gggagcggac 3480 ctcctgtcga gacaggggtt ggagcctggg ggatggcgac tacacccaaa agtggtggcg 3540 gccatttggc agagattcgg cagagccgac gtcgatctat tcgcctgtca raaaacaaca 3600 cattgcccat tgtggttctc cctcacaaac acatccagca cctttggggc tggatgccat 3660 ggtgcagacg tggccgaggc ttcgtctgta cgcttttccc ccaatcgctc tgctcccggg 3720 agtcctggag agggtccgtc aaggggggta caacctattg ctggtagccc cttattggcc 3780 cacacgagtg tggttctcgg acctagtgtc tctcctcgac ggtctcccat gggagattcc 3840 cgtccagaga gacctcctgt cccaggcgga gggaatgata gtacaccccc gcccggacct 3900 ctggaaactg tgggtgtggc ctctgagggg gcccaccttg tagatcttgg tttgtcaact 3960 gaggttgttc aaaccatact aagctccaga gctccctcca cgaggagatt gtatgccacc 4020 aagtggaaac tttttacttc gtggtgtaca gaccaccacc tggatccagt ccactgccct 4080 gcggggtcag tgctgcaatt tctccaggag cgttttgaat tcggtttgac tccgtcaacg 4140 cttaaaggtt atgtagcggc aatgtccgca taccgtactg atggtcttgg caaagaccct 4200 ctggtggtca gattcctccg tggaacgagg aggttgaggc ctgcctgcgc caataggttt 4260 cctacttggg atttgtcgat agtgcttgag ggcctgtcga cagccccctt tgaaccaatt 4320 gaggatgtgt cagaaaagtt tctgaccctt aaaacgtttt ttttttgttg gccattacat 4380 ccatcaaaag agtaggagat ttacaagcat tgtctgtagc tccctcttgt ctggaattct 4440 cacctggtat ggtgagagca tttctgtacc ccagagcagg gtacgttcct aaggtcccca 4500 ctgaggtggt gcgacctact gtgctgcagg cctttaatcc tccaccattt atgacgccag 4560 atcaggagcg cttgaatctg ctttgcccag tgcgggcact agatgcatac gtacatcgta 4620 cgtctgcttg gcgtacaaca caacagttgt ttgttttgta tggctcaccc aaaattgggg 4680 cgccagcatc taagcagtct ctgagccggt ggatagtcga ggctatttca ctagcatatg 4740 aagccctgca tcggcctcta cctgaggcga tcagggctca ctcaaccagg agtatggcgg 4800 cttcgaaagc ctttcgttct ggccagtccc tgactggcat ctgcaatgca gctgggtggt 4860 ctaccccaca tacctttgta aggttttatc agttggacct ggaccctact ccgggttcca 4920 gtgtcctagc aatataggag aactcatccc cggcatgtaa aactatggcg tgtttgggat 4980 agcgttccca ttgtgtctcc taaagagacg cagcgtagag ttcccgaaag ggaacgcgtc 5040 aggttacgaa tgtaaccatg gttccctaag ggaacgagac gctgcgtcgc gttgccatac 5100 tttttcatac ctggtgcgct ccgttcgaga ggtataagct gaagttcact gtgagtgtgg 5160 ggcttttata ccttcggtct gacgccacac gcacgttgcg tcccgccaat tacattggat 5220 tgattacacc tgtgcttcag aaggttataa gagagggatg gtatcccatt gtgtctccta 5280 aagagacgca gcgtctcgtt cccttaggga accatggtta cattcgtaac ctgacgcgtt 5340 tt 5342 // ID Gypsy47-LTR_DR repbase; DNA; ZEB; 969 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy47_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy47-I_DR; KW Gypsy47-LTR_DR; Gypsy47_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-969 RA Dib M.R. and Naveira H.F.; RT "Gypsy47_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 824-824 (2007). XX DR [1] (Consensus) XX CC Gypsy47-LTR_DR is a long terminal repeat of the Gypsy47_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy47-I_DR. XX SQ Sequence 969 BP; 178 A; 208 C; 177 G; 406 T; 0 other; tgttatgtat ttgtgttttt tgtgctctgt tttctctgtg ttttttctct ctctctctct 60 ctctctctct cgctctgttc tcccgtatct gctctcatgt attctcaggt tctgttcatt 120 ggtccattca gctgtcagtc attcctcccc ggttacgtca ctcttacgtc acatccaatg 180 agcaaagacc acccgtctta aaagcgcgcg ccagaccact cgctctttct tgttttcttg 240 ttttcttgct ttcttgtttt cttgttctct tgcttgcatt tgtctctttc gctgtttcgt 300 cgcttttgtt tgtatttacc gcgtgcgttt agaaattacc ccgtctaact gtgttttgtt 360 gttgcttgtt tctgtgtgca cgttatccgt ccgttattcg tacattgttc ttagtttatc 420 gttgtttacg aggcttggac gaagcggaca ggtaagaagg tcacgtgaca tacatttcgc 480 aacactcagg actgctctgc tgtctagtac actaggttag gggcgagcgc cactccttgt 540 actttttgaa tagatagata gagtagatat agttaggact cggaagatct tcgtgtgtta 600 attatactgt ttttcacata gttagttagc tagatgcttc tgggaagtta gatttagttt 660 gggttttgtt cttttcattt ctttagcgcc gcccgcgtcc cttctgccag ctctcttgtt 720 tttccccgtc tttatttgtt tcagtgttgt acatattgta aatagtatat ttttgcttta 780 cttacttttt ctttattttt gattaattcg ttgttgttgt tcactttggg attgtaaata 840 aaagcacaca ttttttggca ttactttggt tgtctttcca ctcactcatt cactgtttat 900 ataaatatat tttaattaaa taaatgtcaa aaccccacac cctagactaa catcaggggg 960 tttgtaaca 969 // ID Gypsy108-LTR_Dr repbase; DNA; ZEB; 534 BP. XX AC . XX DT 29-APR-2009 (Rel. 14.05, Created) DT 29-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-108_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW LTR; Gypsy-108-I_DR; Gypsy-108-LTR_DR; Gypsy-108_DR; KW Gypsy108-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-534 RA Dib M.R. and Naveira H.F.; RT "Gypsy108_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 951-951 (2009). XX DR [1] (Consensus) XX CC Gypsy108-LTR_DR is a long terminal repeat of the Gypsy-108_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is Gypsy-deposited in Repbase as CC Gypsy-108-I_DR. XX SQ Sequence 534 BP; 108 A; 173 C; 106 G; 147 T; 0 other; tgtcaccgac tcggtcccag tcattcccct cgctggccag cagaggccgc catccccgga 60 cttctagcat tacatcatcc acatagactg attgtgcaca cacctgaact gaatcacggt 120 aatgacccac gccacctata taagccacac tcaaaccact gttcagtgtg aagtcttgtt 180 tagccccggc cagcattact gaacgttctt tcctgcctga tctcctgtgc ataaccccgg 240 actgtttctg actctgagtt gccttctgcc tccccacgac ccttgcttga tacacggact 300 ctgaaccacg ctgcctgccc tcgacccacg cctgtcttaa ggattctgaa ccacgccgcc 360 tgccactgat ctatgcctgg taaatcactc tgtgtctgtc agccgccagc cccacgacct 420 ttattgatta ctgttgatgt gtgttcgcac tttagtgcgt gttggatgtt tgtgtttgac 480 tgtgtctaat aaatactgca aaatggatcc ctccgtgtca gtctccccgt taca 534 // ID Gypsy145-LTR_DR repbase; DNA; ZEB; 311 BP. XX AC chr10; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy145-I_DR; KW Gypsy145-LTR_DR; Gypsy145_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-311 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1552-1552 (2008). XX DR Genome; chr10; Positions 8882185 8881875. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 311 BP; 50 A; 90 C; 58 G; 113 T; 0 other; tgttttgccc cgcatacaat tctaagcgtt tttctctgtt agtgatctga ttctgtgttt 60 tgaccccaga ctgtttttcc cgtttccttg attgttgatg ccgctgcctt ttgtgaccct 120 ctgctcagtt taacggaaca ttctctctgc tgtatgccgc ctgtctcgac caatcgcctg 180 cccacggact ctgagttaag tttctccttc gccatccttg tttgttgttg tctgatcctg 240 cctgtacgac catcctgttg tgattcaaca aagctgcaaa tggatcctca atctcccgac 300 gctcctttac a 311 // ID Looper-N8_DR repbase; DNA; ZEB; 1107 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 02-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE This is a nonautonomous DNA transposon that belongs to the DE Kolobok superfamily. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; piggyBac superfamily; KW HARBINGERN7_DR; Looper-N8_DR. XX NM Looper-N8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1107 RA Kapitonov V.V. and Jurka J.; RT "Looper-N8_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 284-284 (2004). XX DR [1] (Consensus) XX CC Looper-N8_DR is a nonautonomous DNA transposon that belongs to CC the Kolobok superfamily (Previously, it is incorrectly CC classified into piggyBac superfamily.) Its is characterized by CC 16-bp terminal inverted repeats and the TTAA target site CC duplications. This is a composite transposon, it contains a copy CC of HARBINGERN7_DR (pos. 483-368). XX SQ Sequence 1107 BP; 323 A; 259 C; 231 G; 293 T; 1 other; aggtgcagta ggtgatctgc caaaatgcta accgcttagc atattatctt tggacggcgg 60 ggagggactg tgattcaaag ccacaccccc tgaaatcgcg agcgcgcgca ccgcgtcaca 120 acagccgaca gacaacccac tagttcatgt cattcgccag ttagttagtg ccagcgccgt 180 gcaggaatta catgtattac tttgccacac ttacatgcta tttcagagcg aatattcaga 240 gtagcatggt aaacagtata ggaaggctgt cattgttcaa aaatgaccca atgtgaatat 300 aaaagcaact tcagctcaat aaagcaggtt aggcggaata gcgctattta ctggtgtttt 360 gttgttaaac taaatataaa tctacattat agatgctgtc aaagaacctt atgaaactga 420 aaataatcac atcaatcttt caccgggaga tttcagtggc tgaacaacac gtctgtgcat 480 ataagtcatt cataacacaa cacaatctaa cataaaatta gcctgactaa aatagtttca 540 aaacagaaca ttacctgtct aacagtaata cttcagccat ggtgtcgtcc ttcctccagc 600 gtgctaaagt aactccaata ttgattcagg tttttaaaag tttcaattca gcatttgatt 660 tctcccggtc cgtctcgtga ccgcgcggcc gctttaaggg cgaattatac ccgcgcctgg 720 ctttacagta atcggcccga gctcggctgt ccttcggcgc ctccggctcc gactcggcat 780 ctgcgagcgg cccgcagctc gctcgcgcgc atgcgtttgt gatgcagacg ggcacgcgct 840 aatggcggat ctgcgtgaac agatgcgcag aagtcggaat ctacatttgc tgacagacag 900 tctgacctgc ytatcggaat taagggagat gacggtccga ctctatttaa ttggatgaac 960 attttttagt tttatgcttt acccagaata taaaaataca tataaacaca tttagatcat 1020 ttactgtaat cattactatt ggactgtgaa gagactttca accagcacaa caaaaaatgt 1080 ttctgaagac aatcacctac tgcacct 1107 // ID KenoDr1 repbase; DNA; ZEB; 5793 BP. XX AC AB097144; XX DT 02-JUN-2009 (Rel. 14.06, Created) DT 02-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE Danio rerio retrotransposon KenoDr1 DNA, complete sequence. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; KenoDr1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5793 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol. Biol. Evol 21(2), 207-217 (2004). XX DR EMBL/GenBank/DDBJ; AB097144; Positions 1 5793. XX FH Key Location/Qualifiers FT CDS 169..1515 FT /product="KenoDr1_1p" FT /translation="CFSVFSCVGASTMSAAXAGVRRHHRRRFLLKAPNGKL FT LEMSRMDFSRKVRQNPLNFRLEDLNCIVTLPLGKGFDVSFRAASLLNDFWQ FT RFDAVKIQFSMFTVEKLSDNTLKTVIVRMFNHTVPGDDICVWLGRYCTVRP FT QPVKVLDEDGIWNCSWRIPIKQWEDAGGYQGLRHLPSMIVLGENRGYIHYQ FT GMPKLCRKCGKNGHLAEACQELICGKCREVGHSFEQCTNGRRCNLCGEENH FT LFRDCPKSFANKLKNNKMAAPQASMVEQREEAVPVVLAGHSLPATGTGQEE FT ENGAAIGAESPILFEQREEAPPQQSVTQNVTANEEEETASSLLTVSEASVD FT ASQSETQCSLPYAQMQKRAARSPLEMEEKRLRASERLDSSKSEDLDRSWPL FT GSPNGVSFLHVKLRSSSPKELQEVFSVASEVVSISSPDPTPVGIGKEQMQQ FT DIT*" FT CDS 1574..5020 FT /product="KenoDr1_2p" FT /note="apurinic endonuclease and reverse FT transcriptase." FT /translation="MFISLFHLLMALTISTINVRSVRSPIRAQSVLSFLKL FT LKSDVFLLQECALPFISNYRKWEDVWTPGPSLWSGSNQNKNDGVAILIKNP FT NILVKGTTVVRDGRAFLVTLTFLGRDFNVLNIYGHTDKNERYELLEDLQPH FT MLGKPPLVVAGDFNCILTQKDRKKADFKLDKTSTLLNNLVKDFKLVDCFRQ FT VHPREEGFTWFSSDSTSASRIDYIFTRDFPPVDATLTPTFFSDHTILSCTL FT SLPPGVTIGEGWWKLNCSLLEEEEVVKGYREQFSQWQTLQDFYDTRAQWWE FT MVKGRTKTYFKKIGKQKKNKRKKRMLGLQKRLQRYFNLLNMGFDFNNEIKE FT VKKEMLVLAEENSRGVILRAKEKKIEENEKCTRYFFKKIITKGGGISKIKN FT DEGTILNTTEGGLKEVEKFYQNLYGEKIVHEDALNEVLNFLDKKVQKEKHS FT VLTQDFKKEDLDKALKSFKKGKSPGMDGLPLEFYQCFWDILAGVLLTVFNE FT FENFNKLPADFRLGIVSLLFKKDDKTILKNWRPITLLNTDYKFFTKILATR FT MSTVLEDVIHPDQACAVPGRKITDSLLLIRDAICYARDRNIRLAVVNLDFE FT KAFDRVSHQYLLQVLEKMGFPGRFLAWVGLLYNDLVSKIIVNGHLTKAVNI FT CSGVRQGCPLSPLLYVVCIEPLAQLLRRDKWITGFKIPGGLIASCVLYMDD FT VTLLTTDLTSARRAMDLTDFYGRASGARLNRKKSEAQLFGPWGNVVTEGLE FT LTFKENDFKILGVKFDKEGGGGQNWNDILTKVNQKLGFWGLQKLTMEGKIL FT IFKSVILPLILLVSSVFSPPRRFIRKLERAVFYFLWGSKWERLRRTELKKK FT PEHGGKGLPDPHLFLGSRFTALHFSYALTPSRENKTAAMTRFWMGSYLRTL FT KVLAVDLKYPVSFNLPREYDFIKKFLKKYVLETNDVTILTNHKALISLVQD FT RETVSRVPGLTLCEAQQVWRNAAHPALQNGHKDLLWMVAHEILPVRAVMHS FT RGMARNPVCPRAGCGCPETVRHLLWECGAARDLWARTGPLYHQYLPAGRAQ FT MDYHLIIHGVGQGVKDMTQQDFTSLWLTLTAIKDAIWVTRNLLVGKQVTVP FT LQGTKGLITTTLQGYRTLTRGQGARGHTEEVPAATVPGRP*" XX SQ Sequence 5793 BP; 1693 A; 1129 C; 1358 G; 1612 T; 1 other; attatacttg ggcttgcctc gcgggcagtg acagcgatcg cctcttggag gtttcctggc 60 tcacaattac tcctgtgttg tgagctctca acccttgaga ggtaaaggaa ttgcactttt 120 taaggtaaga cgttttaagt atttattggc tatttattgt ggttttagtg ttttagtgtg 180 tttagctgtg ttggtgcctc caccatgtcg gctgctcntg ccggcgtgag gaggcaccac 240 aggcgacgct ttttattaaa agcaccaaat ggtaagctcc tggagatgtc cagaatggac 300 ttctccagga aggtgagaca aaacccttta aattttagac ttgaagactt gaattgtata 360 gtgactctgc ctctaggaaa aggatttgat gttagtttta gagctgcttc cctattaaat 420 gacttctggc aaaggtttga tgcggttaag atccagtttt ccatgtttac tgtggaaaaa 480 ctttctgaca acacacttaa aactgttatt gtcagaatgt ttaatcacac agtccctgga 540 gatgacattt gtgtgtggct gggcagatat tgcactgtcc gcccccagcc agtcaaggtg 600 ttagatgagg atggcatctg gaattgctcc tggcgaattc ccattaaaca atgggaagat 660 gccgggggct accagggctt gagacatctg ccctcaatga ttgtgcttgg agagaacagg 720 ggctacattc attatcaggg gatgcccaaa ttgtgcagaa agtgtgggaa gaatgggcat 780 ctcgctgaag catgccagga gttaatttgt gggaagtgca gagaagttgg tcacagtttt 840 gagcagtgca ccaatggcag gcggtgcaat ctgtgtggag aagaaaacca tctcttcaga 900 gattgcccaa aaagctttgc caacaagttg aaaaacaaca aaatggccgc cccacaagca 960 agcatggttg aacaaaggga ggaggcggtc cctgtggttt tggcgggcca ttccctgcca 1020 gccacaggaa ccggccagga agaagaaaat ggggcggcca taggggcgga gtcacccatt 1080 ttatttgaac agagggagga agcccctccc caacaaagtg tgacacaaaa tgtaacagcc 1140 aatgaggagg aggaaacggc ttcctctctc ctcaccgtct ctgaggcaag tgttgacgct 1200 agtcaaagtg aaactcagtg ttcacttcct tatgctcaga tgcagaaacg ggctgcgaga 1260 tctcctctgg aaatggagga gaaaagacta agggcttcag agcggctaga tagttccaaa 1320 tcggaagatc tagaccgctc atggccctta ggttctccaa atggagtttc atttctgcat 1380 gtaaagctaa gatcatcatc accgaaggag ctacaagagg tcttctctgt ggcttctgaa 1440 gtggttagca tctcctcccc tgacccaaca cctgttggga taggtaaaga gcagatgcaa 1500 caagacataa cctgattttt atcttgtttt taaaaccatt gatatctttg taaactaact 1560 gtaactcttt taaatgttca tatccctttt tcatctgctc atggccctta ctatttccac 1620 tattaatgtg agaagtgtga ggagccccat tcgagcacaa agcgttttat cttttttaaa 1680 actcttaaag tctgatgtgt ttttattaca agaatgtgcc ttacctttta tctctaatta 1740 cagaaagtgg gaagacgtgt ggaccccagg tccctcccta tggagcggat cgaaccaaaa 1800 caaaaatgat ggggtcgcaa ttttaataaa aaatcccaac attctggtga agggtaccac 1860 tgtggtgaga gatgggcggg cttttttagt aacactgact tttttaggaa gagattttaa 1920 tgttttaaat atatacggac acactgataa aaacgagaga tatgaacttt tagaagactt 1980 gcagccccac atgcttggta aacctcccct agtggtggca ggggatttta attgtattct 2040 aacacagaaa gacaggaaaa aggcagattt taaattggac aaaacatcaa ctcttttaaa 2100 caatttagtt aaagatttta aattagttga ctgttttaga caagtgcatc caagagagga 2160 aggcttcact tggttcagta gtgacagcac cagtgcctct cgaatagact atatttttac 2220 aagagatttc ccgcccgtgg atgctacatt gacccccacc tttttttctg atcacacaat 2280 actatcctgc actctttcac ttccccctgg tgtgacgata ggcgagggtt ggtggaagct 2340 gaactgctcc ctgttggaag aggaggaagt ggttaaagga tatagggagc agttcagcca 2400 atggcagacc ctccaggact tttatgatac acgagcacag tggtgggaaa tggttaaagg 2460 gaggacaaaa acatatttta aaaaaatagg aaaacagaaa aaaaacaaaa ggaaaaaacg 2520 catgttgggg ctgcaaaaac gtttacaaag atattttaac cttttaaaca tgggctttga 2580 ttttaacaat gaaattaaag aagttaaaaa agagatgttg gttttagcag aggaaaatag 2640 caggggagtg attttaagag caaaagagaa aaaaattgaa gaaaatgaaa agtgtacaag 2700 atattttttt aaaaaaataa tcacaaaggg aggggggata tcaaagataa aaaacgacga 2760 ggggactatt ttaaatacca cagagggggg tttaaaagaa gtcgaaaagt tttaccaaaa 2820 cttatatggg gaaaaaatag tacacgagga cgctttaaat gaggttttaa attttttaga 2880 caaaaaagta caaaaagaaa aacattcagt tttaacccaa gattttaaga aagaagattt 2940 agataaagct cttaaaagtt ttaaaaaggg gaagtctcca ggaatggacg gtcttcccct 3000 ggagttttat caatgcttct gggacatttt agctggggtt ttactgactg tttttaatga 3060 attcgaaaat tttaacaaac tacctgcgga ttttagatta ggaatagttt ctctgctttt 3120 taaaaaagat gacaagacaa ttttaaaaaa ttggcgacca attactcttt taaacactga 3180 ttataaattt tttacaaaga ttttagcgac aaggatgtcg acagttttag aggatgtaat 3240 ccacccggat caagcctgtg ctgtgcccgg gaggaagatc acggacagcc tattgctgat 3300 ccgagatgcc atctgttacg cgagagacag aaacattcgt ctagccgtgg taaatttaga 3360 ttttgaaaaa gcatttgaca gagtctcgca ccagtacttg ttacaagtac tggaaaaaat 3420 ggggttcccg ggaagatttt tagcatgggt gggactgctg tacaatgacc ttgtcagcaa 3480 aatcattgta aatgggcatt taacaaaagc agtaaatatc tgcagtggcg tccgtcaagg 3540 gtgtccgtta tccccactcc tgtatgtggt ttgtattgaa ccactggcac agctcttgag 3600 aagggataaa tggattacag gttttaaaat tccggggggt ttaatagcat cttgtgtttt 3660 atatatggac gacgttaccc ttttaacgac agacctcacc tcggcccgaa gagctatgga 3720 tttgaccgat ttctacggtc gggcctcggg agccagatta aaccgaaaaa agtccgaggc 3780 ccagctcttc gggccgtggg gtaacgtcgt cacagaggga ctagagctga cttttaagga 3840 aaatgatttt aaaattttag gggtaaaatt tgacaaagag ggtgggggag ggcaaaattg 3900 gaacgacatt ttaacaaagg taaaccaaaa gctgggattc tgggggttac aaaaattaac 3960 aatggaaggt aaaattttga tttttaaatc cgtaatttta ccactcattt tacttgtttc 4020 ttctgttttt agtccaccaa gacgttttat caggaagttg gaaagggcgg tgttttattt 4080 cctgtgggga tccaagtggg agcgcctaag aagaactgag ttaaaaaaaa agcccgaaca 4140 tggtggaaaa ggcctcccgg atccccacct gtttttaggc agccgcttca ccgccctgca 4200 ttttagttac gcgttgaccc catccagaga aaacaagacg gctgccatga ctcgcttctg 4260 gatggggtcc tacctgagaa ccctgaaagt tttagcagtg gaccttaagt acccagtgtc 4320 ttttaattta cctagggagt atgattttat aaagaaattt ttaaagaagt atgttttaga 4380 gactaacgat gtcaccattt taactaacca caaggctctc atttctcttg tgcaggatcg 4440 agagacggtg agtcgggtcc cgggcctcac attatgtgag gcccaacaag tgtggaggaa 4500 cgcggcccac ccagctctcc agaacggaca caaggacctg ttgtggatgg tggcacatga 4560 gatcctcccg gtcagggcgg ttatgcactc cagaggcatg gccagaaacc ccgtctgccc 4620 acgggctggg tgtggatgcc cagagaccgt gcggcacctg ctctgggagt gcggcgctgc 4680 gcgggacctg tgggccagga ccggccccct gtatcatcag tacctgccgg cggggagggc 4740 ccagatggac taccacctta tcatccatgg ggtgggccag ggcgtgaaag acatgacaca 4800 acaggacttt acatcgctct ggctcaccct gaccgccatc aaggatgcca tctgggtcac 4860 aagaaacctg ctggtgggga agcaggtcac ggtacccctg caagggacaa agggactgat 4920 aacaaccacg ctacaggggt accggacctt gacaagaggc caaggggccc ggggtcacac 4980 ggaggaggtc ccggccgcca ccgtacctgg ccgcccgtag atgcaccctc accactctgg 5040 ctacgggaac agcgggccag caggccagag gagggggatc tccgctatgc ccttgaaaca 5100 caggggctgt gaccgtcctg gagagttggt gtgaatggac tttttttttg gtatgactca 5160 ccctcgatcc tggacaaaga aatgatttta tcttcttatt gttactattg cttttcatgc 5220 attttttaac ggactgggtt ttatcacctt ttttatcaac atctacgcca aggacttttt 5280 ttttcataac acacgcacac acaaaagact ctcgcacaac atacactttt atttcctcca 5340 caaaccctga ttttaaagtg cttttgttaa atgccatatg tcttttaatg tgtattgaaa 5400 tgttttaaaa tgtgtgtttt atgtttcaat tcatttttac taagtcctgc gttgttaaaa 5460 tggtgactag accgtgattc ttttttagac ccgttttaat ttttattgaa cagcttgttt 5520 tttagtcttt catttgtttg ccttttttaa aaaaaaaaac atgtcctcta tacacttgac 5580 attttatgtt catttaaagt acttttgcaa gtatttattt acttatttat ttatttattt 5640 atttatttat ttatgtatat atttattcat tttttattta tttactgtta agtactcaaa 5700 tgaaactaag tgaaaagata tgtttttaaa tatgattgtg atgtaagaaa caaatgggaa 5760 acgctgtcaa taaatttttt tcgaaagaaa aaa 5793 // ID Gypsy-30-I_DR repbase; DNA; ZEB; 5597 BP. XX AC . XX DT 28-FEB-2007 (Rel. 12.02, Created) DT 02-MAR-2007 (Rel. 12.02, Last updated, Version 1) XX DE An internal portion of the Gypsy-30_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-30-I_DR; KW Gypsy-30-LTR_DR; Gypsy-30_DR; endogenous retrovirus; gag; KW integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5597 RA Dib M.R. and Naveira H.F.; RT "Gypsy-30_DR, a family of LTR retrotransposons from zebrafish."; RL Diect Submission to repbase Update (28-FEB-2007). XX DR [1] (Consensus) XX CC Gypsy-30-I_DR is an internal portion of the Gypsy-30_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-30-LTR_DR. CC Consensus obtained after the alignment of at least three CC independent insertions bearing at least 85% Homology over at CC least 1000bp. The internal portion contains two ORFs encoding the CC 407-aa Gypsy-30_DR1p gag (pos. 204-1424) and 1363-aa CC Gypsy-30_DR2p pol proteins (pos. 1388-5476) composed of the CC protease, reverse transcriptase, and integrase domains. Some CC insertions fairly recent, according to the high identity between CC their flanking LTRs. XX FH Key Location/Qualifiers FT CDS 204..1424 FT /product="Gypsy-30-I_DR1p" FT /note="ORF1(gag-like)." FT /translation="MSETESAAAAAAASSSANSSSSSSSASGSVNQQPLAP FT FVAGNLHIELPAAFKGDGTESFSSWARRFEVAVRAITPQGTDLMNMLSSVL FT PTRLPSDAAFLYWDSLPTPVKTNYNTVTERLKHVFGLQQSLPFFQTHVNAR FT PRKIGESLEVFSADITRLVLEAFPDYDIIAQEGEKFRRFVAGLDPALQAKI FT HEQGATDMDQALIVACRCERARAAMQLHAGSTQSTQNQNQVAAVQSHHTEG FT KLLQAVEQLTITVNELKNDVRQLQEKNAHLTHCLEFWERNKIPSVKQQPRV FT TRSPSPTYHNPQHQSDNAHSSKSWRKDAAVSTDWRRTMTGDSDNSWGGDKV FT RPYYSSERASNFSDRRYRNDEYRRTRSPSPVRRWSRDESPQSRRVVRFQSP FT DSQHPYKQQGNFH" FT CDS 1388..5476 FT /product="Gypsy-30-I_DR2p" FT /note="ORF(pol-like)." FT /translation="QPTPIQTTGKLPLADVEGQTSASTSLPHTDFTVSKNY FT EHIQSPNKVSANILAIVEGVEVSALVDTGSTISIVGEDFRNSHPMLRKRPI FT KTSMVLARSVTGQCLDTLGVLVLGMRLGTELLQHDFQVVRGAHHPIILGWD FT FLQKHHALIDVTNEKMSLWNFELPLLSTGHEAAACCNVSVLASTKLPPWSE FT TVITACVAGATAVSPVPTAYTGVLEPNTASNVAVAHTLSEVRNGLTTVRVL FT NTTEEDIEIHAGQHLGEFQFLSALDSTVLEGECCNILSDPSAATPSVSIDN FT SDLSASQVESLTELLTKYCDIFSKDSDDRGRTGMIKHRIRTGEALPIKQRP FT YRVTPEQRQEIQQQVTKLLKADVIEESYSPWAAPVVLVRKRDGTWRFCLDY FT RKLNSVTVKDSHPLPRVDDALDALSGSAWFSTIDLQHGYWQVELAKEDREK FT TAFTTGDGLYHFKVMPMGLTNAPATFQRLMEMVLRGLPWKTCLVYLDDVLI FT FSKTFNDHLQQLEEVFTRFKSCGLKLNPSKCSLAKREVQFLGHIVSKDGIK FT PDPRNVQSVQDWPVPRSPTEVRSFLGLCSYYRKFIKDFAQHAVPLHKLTEK FT NARFSWTAECNEAFTYLKHALSHPPIVSFPDFHQPFLLYTDASASAIGAVL FT AQEKGTQETVIAYASHVLTKAERKWSTYDRELWAIVWAVRHFRHYLYKQHF FT FIITDHKPLMGLRKIPIDSDRTGRRARWALELDPFEWTVIHRKGLKHANAD FT ALSRRQASDSAMEIPASSSGEANVGSPVSKSDSGYLSAAQGSKVLGPSVSI FT THVSPTLSACVNQEMERTPSLMLQLATSESSFIEHQKQDPVLKEVISWKLK FT GLRPPYRKISKRSQEERIFWKEFTRLTLHNGLLCREIXNSSTKSVTCQVIV FT PQTLRKQVLQSLHGNPIAGHLSAEKVVKQAQQLCFWPFMSRDIYEWCKKCV FT PCDARRTPIPRQKAPMKTITATAPFQKVAADILELPVTSRGNRYVLVVQDY FT FSKYVNLYAISDQRATTVAKCLFENFVCEHGIPEVLHTDQGRQFESELIQN FT LCQLVGIQKTRTSPYHPQCDGMVERFNRTLIDQLTKILQQQQGEWDDYLRQ FT VALAYNTSPHSSTGFTPFFLTHGREARLPANLLLPKSSPQSPTKSVPEYAV FT DMISKLEDVFTAVAQNSEHAYHQQKRHYDRNLNFTPYQEGDLVWLHDPTTV FT RQKLTPHWKGPFEIVECLESERDVGVTYKIHYLLDDPEKCQIVHYNRLRPY FT CAPVPQKPQQTTLSASYPSPSLTALSGALPFKPPQYARCTPNLTSEDQILP FT QAPLPLQVPSSSVDPASVTSTPNQSSSSAPMTCLGRRPVKLPGYLRDYLLT FT " XX SQ Sequence 5597 BP; 1594 A; 1259 C; 1223 G; 1520 T; 1 other; tttggtggca gctgggttat ttttgcattt ttggcgcttt tcgcggagtt ttattaactt 60 gactgccgcg aaattggcac catttggact ttgctctccg attgtgtcac cggatttatg 120 aagtgataat tgtgtaaata ttcggaagaa caaccacacg acttgccaca caagggtgag 180 atacaaacaa atgaacagaa agtatgtctg agactgagtc tgctgctgct gctgctgctg 240 ccagctcttc tgccaattca tcctctagtt catccagtgc tagtggcagc gtaaaccagc 300 agccactggc accttttgtg gcaggcaatt tacacattga acttccagct gcatttaaag 360 gggatggcac tgaatcattt tctagttggg ctcgtcgttt tgaagtggct gttagagcaa 420 ttacaccaca aggaacagat ttgatgaaca tgttatcatc cgtgttgcca acacggctgc 480 catctgatgc tgcctttttg tattgggaca gccttccaac tcctgtaaag actaactaca 540 acacagtgac agagagacta aagcatgttt ttgggcttca acagtcattg ccatttttcc 600 aaactcatgt gaatgctcgc cctcggaaaa ttggtgaaag tttagaagta ttcagcgctg 660 atattactcg tcttgtccta gaggcttttc ctgactatga tattattgcc caggaaggtg 720 aaaagtttag acgttttgtg gctggattgg acccagcttt gcaagcaaaa attcatgaac 780 aaggtgcaac agatatggat caagccctta tagttgcatg tcgctgtgaa cgggcccgtg 840 cagctatgca acttcatgca gggtcaactc agtctacaca aaaccaaaat caagtggccg 900 ctgtgcagtc tcatcatact gagggaaagt tacttcaggc tgtggagcaa ttaacaataa 960 ctgtgaatga actgaaaaat gatgtacgcc agctacaaga gaaaaatgct catcttacac 1020 actgtttgga gttctgggag agaaacaaaa ttccttcagt aaagcaacag cctagagtga 1080 cacgtagccc ttcaccaaca tatcataatc cacaacatca gtctgataat gctcattcat 1140 ctaagagctg gcgaaaggat gctgcagtgt caactgattg gcgtcgcaca atgacagggg 1200 acagtgacaa ctcatgggga ggtgacaaag ttagacctta ttactcatca gaaagggctt 1260 caaacttttc agaccgcagg tatcgtaatg acgaatatag acggactcgc agccctagcc 1320 cagttcgtcg gtggagtaga gatgagagcc cccagagccg acgcgttgta cgctttcagt 1380 cccctgacag ccaacaccca tacaaacaac agggaaactt ccactagctg atgttgaggg 1440 gcaaacatca gcttcaacct ccctgcctca caccgatttt acagttagta aaaactatga 1500 gcacatacaa agtcccaaca aagtgtcagc aaatattttg gcaattgtgg aaggagttga 1560 ggtctctgct cttgtggaca ctggctctac tatttctatt gttggggaag actttcgaaa 1620 ttctcatccc atgttgagaa aacgccccat aaaaacttcc atggtgcttg cacgctctgt 1680 cactggacaa tgtctggaca ctttaggagt tttagttttg ggcatgagac ttggcacaga 1740 gttactgcag catgatttcc aggttgttag gggtgcacac catcctatta ttttgggctg 1800 ggacttctta caaaagcatc atgctctgat tgatgtgaca aatgagaaaa tgtccttatg 1860 gaattttgag cttcccttgc tctctacagg tcatgaagct gcagcctgtt gtaatgtttc 1920 tgtgcttgca tctaccaaac taccgccatg gagtgaaacg gttatcacag catgtgttgc 1980 gggagcaaca gctgtttctc ctgtgccaac agcctacaca ggtgtcctgg aacccaacac 2040 ggctagtaat gtggctgttg cacacaccct cagcgaggtt cgaaatggtc tgactacagt 2100 gcgggttttg aataccacag aggaagacat cgaaatccat gcaggccagc acctgggtga 2160 gtttcaattt ctttctgctc ttgactcaac agttttggag ggagaatgtt gtaatattct 2220 atctgatcct tcggccgcta ccccctctgt atccattgat aattcagatc tgtcggcatc 2280 ccaagttgaa tccctaacag agctgttgac taagtactgt gatattttta gtaaggattc 2340 tgatgacaga gggagaacag gcatgattaa acatcgaatt cggacggggg aagccctgcc 2400 aatcaaacaa aggccttacc gtgttactcc tgaacaaaga caagaaatac agcaacaggt 2460 gacaaaactt ttaaaagcag atgtcattga agaaagctat agcccatggg cagcgcctgt 2520 agtgttggtg cgtaaaaggg atggaacatg gcgattttgc ttagattaca ggaagttaaa 2580 ttcagtaacc gtgaaagact ctcatcctct acctagagtt gatgatgctt tggacgcttt 2640 gtcaggatct gcttggttct ccacaataga ccttcagcat ggttattggc aagtggaact 2700 ggcaaaagaa gatcgtgaaa aaaccgcatt caccacaggt gatgggctgt accattttaa 2760 agttatgccg atgggtttga caaatgctcc agcaaccttt caaaggttga tggaaatggt 2820 gcttcgtggt ctgccatgga aaacatgctt ggtgtatttg gacgatgtcc ttattttcag 2880 taagacattt aatgatcacc tccagcaatt ggaggaagtg ttcaccagat tcaagtcctg 2940 tgggttaaaa ctcaatccct ccaaatgctc tcttgccaaa agggaggttc agttcttggg 3000 ccacattgta tctaaagacg ggataaaacc tgatccaagg aatgttcaga gtgtccaaga 3060 ttggcctgta ccacgctcac ctactgaagt gaggtcattt ttgggattat gctcttacta 3120 taggaaattt ataaaagatt ttgctcagca tgcagtccct ctccataaat tgacagaaaa 3180 aaatgctaga ttcagttgga cagctgaatg taatgaggca tttacctatt tgaaacatgc 3240 cctatctcac ccacctattg tttctttccc agattttcat caaccgtttc ttctgtacac 3300 agatgcctct gcatctgcta ttggagctgt tttagcccag gaaaagggga ctcaagaaac 3360 agtgatcgca tatgccagtc atgtcctcac aaaagctgaa aggaagtggt ccacgtatga 3420 tagggaactc tgggccattg tgtgggctgt gagacatttc cgccactatc tgtataagca 3480 gcatttcttc atcattacgg accacaaacc tctcatgggg ttaaggaaaa taccaattga 3540 cagcgatcgc actgggagaa gagcccgctg ggcccttgaa ttggatccat ttgaatggac 3600 ggtcatacac cgaaaaggac taaaacatgc aaatgctgac gccctatcac gtcgtcaagc 3660 atcggattca gctatggaaa tcccagcctc gtcttcaggt gaggctaatg ttggctctcc 3720 agtatctaaa tctgacagtg gttacttatc tgctgcacag ggatccaaag tcttgggtcc 3780 atcagtcagt ataacccatg tttcaccaac cttatctgcc tgtgtgaatc aagagatgga 3840 gagaactcca tctcttatgc tacaactggc tacttctgaa tcaagtttta tagagcacca 3900 aaaacaggat ccagttttaa aggaagtaat aagttggaaa ctaaaggggt taaggccacc 3960 atacagaaag ataagcaaac ggtcacagga ggagaggatt ttctggaaag aattcacaag 4020 gctcacgcta cataatggtc ttctctgtcg agaaatcvtc aattcaagta cgaagtctgt 4080 gacttgtcag gtcattgtac ctcaaacttt aaggaaacaa gtccttcaat ctctccacgg 4140 caatcccatt gctggacatc tgtcagcaga aaaagttgtg aaacaggcac aacagctctg 4200 cttctggcct ttcatgtccc gggacatata tgagtggtgt aagaaatgcg ttccttgtga 4260 tgcccgacgc accccaattc ctcgtcaaaa agcacctatg aaaacaataa cagcaactgc 4320 accttttcag aaagtggctg cggacattct tgaacttcct gttacaagtc gcggaaatcg 4380 atatgttctt gttgtgcaag actatttttc gaaatatgtt aatctttatg ctatttctga 4440 ccagagggct accacagtgg caaagtgtct gtttgaaaat tttgtctgtg aacatggaat 4500 tccagaggtc ttacacacag atcaagggcg acaatttgaa tctgagctaa ttcaaaatct 4560 ctgtcaactg gtgggcattc agaagaccag gactagcccc taccatccac aatgtgatgg 4620 tatggttgag cgtttcaata gaactcttat tgaccagttg acgaagattt tgcagcaaca 4680 gcaaggagaa tgggatgatt atctcagaca ggtagcatta gcatataata ccagtcccca 4740 ttcatcgacc ggctttactc cctttttcct gacacatggt cgggaagcca gattgccagc 4800 taatttactg ttgccaaaga gttcaccgca atctccaacc aagtcagtcc ctgagtatgc 4860 tgtagatatg atcagtaaac ttgaagatgt atttacagct gttgcccaaa acagtgaaca 4920 tgcctatcat caacagaagc gacattatga taggaatctg aattttaccc catatcaaga 4980 gggagatctt gtttggctgc atgacccaac cacagtaaga cagaaactta cacctcactg 5040 gaaaggaccc tttgaaattg tggaatgttt ggagtcagag agggatgtgg gtgtcacata 5100 caaaatccac tatcttctgg atgatccaga aaaatgtcaa attgttcatt acaaccgact 5160 gagaccatat tgtgcaccag ttcctcaaaa accgcagcaa actaccctct ctgcttcgta 5220 tccttcccca tccttgaccg ctctatcagg tgcgttacct ttcaaaccac cacaatatgc 5280 acggtgcaca cctaacttaa cgtcagaaga ccaaatcctc ccacaggcac cccttcctct 5340 tcaagtgcct tcttcctcag tggacccagc cagtgttacg tccacaccta atcagagttc 5400 aagttcagct ccaatgacat gtctaggccg gcgtccagtt aaactacctg gatatttaag 5460 agattatctt ttgacataag ggactctaaa aagctacttt aacaggtagt ttctttcttt 5520 tttcttttct tctgtttttc tttaattttg taactctaaa gaaacgagga cgtttcttaa 5580 gtgggtggaa aaagaaa 5597 // ID hAT-N23_DR repbase; DNA; ZEB; 1738 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N23_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1738 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1293-1293 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1738 BP; 526 A; 332 C; 337 G; 542 T; 1 other; gggatgcacc gatcccgata cttggatcgg atatcggttc gataccgcat atttttgctg 60 gatcgggtat cggccagctg gtaccgatcc aaatccgatc ttgcgcgtgc gctatattct 120 gtgtaattta taagccaaca aaagccacga aggtagccta ttataaggca ctagaaagtt 180 gtcatgtagg cctactatat cccaaatgtt ctgaagccat tctatagttt aatgtgaagt 240 acagatgaat attcaagtgg ttaaattcat gttcagttca gcgcttcccg ccgctgcgcc 300 tgtcaatcat tctccattca ttcacaattc aaactgcgtg tcgcgttcat gacaacacga 360 aaaactttct ccaagactat ttgttcctgt tartttaaat aatcagtgga gaaatgcatt 420 tagttttgca ttaaatgggt tcattttgac ctgcaacatg gagttcattg ctgtttctaa 480 atcatgttgc gctgtgtctg ttagatcagc agatcgcatt cgcaacactg gagtagagag 540 ggttattacc tgctcttcac acacaaagta ggcctacctg aaactttaaa ttagaaaagg 600 ctcaataata cattggacag atcctcaacg cactgatttc ttccctttgt gctgctgttt 660 tggaagtgtc cgcagatacc ggagggctgc gcgctgtgtc tcagtgatgc atcaagcgct 720 tcattcagcg caccggctgc gcagacgcga catgcgccgc tccgtaaaat tattctcaca 780 ataagtttct gttctctcat ccttccgact gagtttattc ttccgagggg aatactttaa 840 gtgctgcgaa tagtttgtaa agcctggctc attgtggtca aagtagttga ttaaattaat 900 aaaagtgaaa ctgacgggcg caatatggat tgtcattaac agaaaattat ttagcctaat 960 ataggctact ctgaaactgt gaatatttca ctgtaatttt atttatatta atattttatt 1020 taacagacct gtttgtcagt ttgtgcgctg aagcatataa gcggaccttg ttttgaactt 1080 cacctcaaat attcttgttt attttgtaga taactgacca acaaaaatac attaaaataa 1140 ccaacaaatt ataccaaacg aaattcgttt caaaaagata atttttcaga caaaaaataa 1200 tttttcagac atgctttcag tttctccatc tcactcgcgg cgagtttagg cgagggaatg 1260 tttattaagt aaaactggcc tcagacggtt tagtttagtc tgcttcaaac ctgcaggaaa 1320 atatcaggct ttgctaaaag gctggttatt ttaccaacga ttagggtcaa tagtggtagc 1380 ctattacaga tttcaaagaa aaatcttaat attaaaaaag ggaaacataa gctggaccac 1440 aacagatcat atcaatgtca taatgaatgc tcaaataatg tagcctagtt tatagcaagc 1500 atcatgtttt tagttagatt gtgccgtctg tcatatacag taggcctata ggtctgttat 1560 ttttactaaa gaagatatat tatttgagtt tatcaccaga actatagaaa ctgtattatg 1620 tttaaaaaaa ataaaaaaaa aaaaggcaca gtatcgggat cggatctgta tcggtcgata 1680 ctcagaattt tggtatcgag atcggatcgg ttccaaaaaa atggtatcgg tgcatccc 1738 // ID CR1-41_DR repbase; DNA; ZEB; 1865 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 04-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-41_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1865 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 526-526 (2009). XX DR [1] (Consensus) XX CC The 5'-end is incomplete. XX FH Key Location/Qualifiers FT CDS join(89..667,627..1262,1009..1734) FT /product="CR1-41_DR_1p" FT /translation="YSPASWFTSFPSITSELPQFEILFSSFQLPTTSEISN FT LISKSKSSSSHLDPLPTVLVKACLPSLIPLISAIIHSSLSXGTVPASFKTA FT AITPILKKPGSDPSNFNNLRPISNLPFISKILEKIVAIQLQSHLSQNNLYE FT QFQSGFRPLHSTETALIKITNDLLMAADSGLISILILLDLSAAFDTISSQH FT PHQVRPLTLFRHSTLINRLSTLGITNTPLDWFKSYLSGRTQFIQLKSFTSQ FT HFTVSSGVPQGSVLGPLLFIIYLLPLGNILRKYNIQFHCYADDTQLYLSGK FT PSSCFPPTSLTVCLAEIKSWLSLNFLKLNSSKTEVLLVGTKSTLSKTDSFS FT LSIDDCSIYPSKQVKSLGVILDSTLSFDSHINNITRSAYFHLRNINRLRPS FT LTPSCHCCSRSIVQKLRFCLLVPNQHSPKLTAFLSQLTTVPFIPLNRLRVW FT VSSLXAPYLLILTSITSPGLLTSTYATLTVSALLLPPHATAVLVHSLVTSR FT LDYCNSLLFGLTHKSLRKLQLVQNSAARIITRTPSSHHITPVLQQLHWLPV FT HFRIDFKILMLTFKAIHNLAPPYLSALIHISVPSRTLRSSSSLHLSVPSSR FT LSTMGNRAFSRSAPRLWNSLPPAIRNIDSLTLFKSXLKTHLFKMAFNT*" XX SQ Sequence 1865 BP; 452 A; 548 C; 220 G; 637 T; 8 other; actaaccccc cagactcact accatctcac cttcactcaa cagctttctg taattctatt 60 atgacctttt taatgaaaaa atctttaata ttcaccagca tcttggttca catcctttcc 120 gagcattaca tctgaacttc cacaatttga gattttattc tcctcttttc aactccctac 180 aacttcagaa atttctaatc ttatttccaa atctaagtca tcttcctctc atttagatcc 240 tctccccact gttctrgtta aagcytgtct cccttctctc atccctctca tttcagccat 300 tattcactct tctctttctw ctggaactgt tcctgcttcc ttyaaaactg cagcaattac 360 tccaatactc aaaaaacctg gttcagaccc atctaatttt aataatttac gacctatttc 420 taatctacca tttatttcta aaatccttga aaaaatagta gcaatacaac tccaatccca 480 cttatctcag aacaacctgt atgaacagtt ccagtctggt ttccgcccac tccacagcac 540 agaaactgcc ctgattaaaa tcaccaatga ccttcttatg gcagctgatt ctggactaat 600 atctattctc attctcctag acctgagtgc ggcctttgac actatttcgt cacagcaccc 660 tcatcaatag actatccact cttggcatca ccaatacccc acttgactgg tttaaatctt 720 atctctctgg ccgcactcag ttcattcaac tcaaatcatt cacttctcag cattttacag 780 tttcttcggg tgttccccag ggctcagtcc tggggcccct actttttatc atctatctcc 840 ttccactygg taatattctc cgtaaatata acatacagtt tcactgctat gcggatgaca 900 cccagctcta tctctctggc aaaccctctt cttgttttcc acctacatcc cttacagtct 960 gcttagcaga aatcaagtcc tggctctctc ttaacttcct caaactaaat agttcaaaaa 1020 ctgaggtttt gcttgttggt accaaatcaa cactctccaa aactgacagc ttttctctct 1080 caattgacga ctgttccatt tatccctcta aacaggttaa gagtctgggt gtcatccttg 1140 ayagcaccct atcttttgat tctcacatca ataacatcac ccggtctgct tacttccacc 1200 tacgcaacat taaccgtctc cgcccttctc ttaccccctc atgccactgc tgttctcgtt 1260 catagccttg tcacatctcg tttagattac tgcaactctc ttctctttgg tctcactcat 1320 aaatctctcc gtaagcttca gttggtccag aactcagctg cccgtatcat caccagaacc 1380 ccttcttctc atcacatcac ccctgtcctg cagcagcttc attggttacc tgttcatttt 1440 cgaattgact ttaaaatctt gatgttaaca ttcaaagcca tccacaatct cgcccctcca 1500 tatttgtctg ccctcatcca catttctgtt ccttctcgca ccctgcgatc ttcttcctcc 1560 ctccacttgt ctgttccctc ttctcgcctt tcaactatgg gcaacagagc attyagccgc 1620 tctgctccgc ggctttggaa ctcactacca cctgccatca gaaacattga ctcccttaca 1680 ctgttcaaat caaractcaa aacccactta tttaagatgg cttttaatac ttaattttat 1740 ctgcactatt gctgtgtata ttttattatt tctcttgttg ttttattgct cttgttgatt 1800 gtacggtgtc cttgagttgc tagaaaggcg cctttaaata aaatgtatta ttattattat 1860 tatta 1865 // ID DNA2-5_DR repbase; DNA; ZEB; 1822 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 29-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW DNA2-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1822 RA Jurka J.; RT "hAT-type families from zebrafish."; RL Repbase Reports 8(9), 933-933 (2008). XX DR [1] (Consensus) XX CC 8 bp TSD. Preliminary classification: hAT element. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1822 BP; 517 A; 451 C; 346 G; 508 T; 0 other; taggcttggg cggtatccaa attttgatac cgtcaaaccg cctccatatt ttaccccggt 60 atccggtatt accggcgtaa taaaaaaaaa ttatgacgta aggctcagac agcgtcacca 120 aactgttggc ttaagcctaa accattcaga aactgaatac attctatgcg accttaggct 180 cgcggtcaga gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga 240 gagagagaga gagagagaga gcgcaaagcg acgcacagca cctctctctc tctctctctc 300 tctctctctc ttttacacac gcacacacac acacacacac acacacacac acacacacac 360 acacatctct cccgcgattt attcagaaat ttactcccac accgtaagaa atctggccgg 420 ctcccgcgga acagcagcgg gaatgcagag agctgtaatc gggcaataaa aattataccc 480 gataggtccc gagtccgaca ggtacatttt gattgacagc tttatacagc ccaaacccgt 540 tcacagcccg acattattca aatgtgctca tgcacacagc tctaatgcat ttttttcaag 600 aatgagtcat ttatatgttt taacatcatt tattcgtaac aagcgtagac tataggccac 660 ttgaaagttg gaacaaagaa ataaaataag tcctacagct gtaacatcgt aacatctcag 720 cactctatag gtctaggtac atgcacttag cctttaaatt agcctacaca ccaatgaaaa 780 gaaatcttgc ttaacaaatc aaattaaaat aaactctaaa tgatgacggg tattttggca 840 ataacgaaat taagataaag gctctgcaag atttgtttct ccacttctct tcacaatctg 900 gcattaaggc ggcggtgaag ctgaataata aaagaataat gccaccatta gtctgtatac 960 tctgtctaca ttatgattat atggtttaat taatatttat ttataattta aaaacccttt 1020 actcaccaag attaggctac gtgtgtttgt ggaggaacat tattaaatcc actggcttta 1080 gcgcagtctc tcactcatgg tgcgtccagc agctgaacac cctttcctgc ggctacaggt 1140 cgagatgcag agttctgatc cggctgattt tgcaagtttt gggtaggatt gtttgttcat 1200 cttctaccag tcaggaacat gcatttaatt ttccatgaca gcagtttcac tttagcgaat 1260 ggcctcttct ttttacctgt cagcttgcgg tttagggctc taccgtaata cgcagtgtat 1320 gagcgaccgc gaaatacctg cggctggaat aaccgctcct tctgagctag tgactggctt 1380 gaccgccgtc aaaattcgct ttttttacta cttcgtcatc atttgtttca cctttgcagg 1440 gatggatttt aatgcaggga tttggatttt ttcgggtctc gccgggttcg ggcaaagatc 1500 tttagctcta atgcagacct ctcgttcata ttaaccacgt ctcccttctg ttcagtcgaa 1560 actcccaaac tgcaggacac tttgaagcgc gactcgtcac gtcacctctc cccctccctc 1620 tcccccttct ctccctccct ctcccccttc ctttccctct ctcctccaca ctgtcccgtg 1680 atgacaacca cacatacgtt tttgtaggca ttaaataaag gatatttaat attttatgac 1740 ggtataacgg tattgaaact gacaccgttg ctatttttag atcccgcggt ataccataat 1800 accgtaatac cgcccaagcc ta 1822 // ID DNA-5-4_DR repbase; DNA; ZEB; 1072 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 5-bp; KW DNA-5-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1072 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1236-1236 (2008). XX DR [1] (Consensus) XX CC The consensus was built from copies which are ~15% divergent from CC it, and it is characterized by 5-bp TSD and 14-bp TIR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1072 BP; 368 A; 174 C; 200 G; 328 T; 2 other; cagtgcttcc cacaggtttg aaatatactt gcggtggtag ccggattgaa acacaccttt 60 tacccacagc acatggttaa ctacatgcga caaaaaaagt gatttttaac aatattttta 120 tttattatga cactgtttct tttcttggtt ttatttcagg ttatttttaa aagactgttg 180 aaataaaaaa agaaatccac tatttctytt acttttatgc tattcgggag ccatgtgcac 240 ccactgacag gtaaaatctg ctctctctct ctctctatct ttgctttatt ggcatgacat 300 tttgtacagt attgccaaag cattttagat atgtataaaa tatgtaatat attaacatca 360 ttaaattaat aaaatataca gcaaatgaga aataaataac agaaaaaagg aataaaaaca 420 tctatgaaat aaatatctct aaaaawtata ataattacaa gaataaaaag tgtagattat 480 ttaaataagg caaattagtt gattaaccat ttctaatggt gtacaaatat tttgcagcca 540 gtttagatgt aattttgttc tctctgagta tatagtaaag tttatctgag tcgtttatat 600 taaataatta attatttaat gtttctctcg ctcttgcggc tgtgcacgcg ctgtcagctg 660 cgaaaccaag taaaagtaaa ccaggcttaa aataaaaata acgacaactt tagaaagcga 720 aagcaaaagc cgaatcccgg acattttagg agatttagaa actacggtcg gatgcatttt 780 taagtcccaa aaaggcgcgt ctctgtgggt ctgcagagca cgtgagactg tctgtgggag 840 tgttgacagt tctcatgcac acacaacaag cagtaggaaa cgtgcggagc gagagaagat 900 tttccactgg ttaagttaat cgatcgcgga tgtttggcta tcttgggcgg atacaaaaaa 960 agtgtaaaag gatgataata ataaaaaata aatgtgtcac caaatatact tgggcggccg 1020 ttaatatacc tgggcggccc gcccaagtaa agtctatgtg tgggaagcac tg 1072 // ID DNA-2-16_DR repbase; DNA; ZEB; 1712 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-16_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1712 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1707-1707 (2008). XX DR [1] (Consensus) XX CC TSD is 2-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1712 BP; 453 A; 400 C; 322 G; 537 T; 0 other; cccagctagc aaaattcatg tggctcaaat ccggcccaca ccagacactt acatctggcc 60 cacataccgc atggaatgat ggcacttggg cggtccgctc ctgtttgcca gatctgggcc 120 acaattaagc catagcaata tcacatatca gccagaattc aaccaaatga accagaactg 180 accctattct gggccacagt ttgcttttat tctggcccag atccggccca caccagacac 240 ttacatctgg accacatact gaatggaatg atggctctag ggcggtccgc tcctgtttgt 300 cagatctggg ccacaaataa gccaaagcaa taccacatat tagccagaat tcaaccaaat 360 gaaccagaac tgaccctatt ctgggccaca gtttgctttt attctggccc agatctggcc 420 tacaccttac acttgctggg tgtggcaaag tacaaaaatc ccagcatgca ttgcagtatg 480 aatacattct atagcttatt gatgctccag ttttcattat ttgctgttga ttctgctgtt 540 tatgttgacg ttgttgattt tgtcactttt agtatcctgt ttgtgatgtt tgtgagtttt 600 gttcattttg ttaattgtca ccattgttga ggttcataca ctgattattg atgtctctgt 660 gagcagaagt tgaaccatag aatttttttt gagcttataa accttcataa ttctgtggca 720 tgtgcaagca tgttattacg ttacttctaa taatggatta aacattggtt aacatcagtg 780 aattatatta tttcatcaca ggttgtgcct cacttaattt tattattttg ggtatttttt 840 tactttactt ggcttgtttg ctgtaatgaa taaacttttc aagcaaagtt cttaaaagtt 900 acagcagccc aaaagaaaat ttatattaag aggttcaggg ctatgagccc aactaaagca 960 aacacttttc tccatgatgg tgactttagt ctatgtggtc cacatatgtt ttaagataac 1020 tgggccacat ttgccatatc ttctctgggc cgctttaggc tcacagccac attagccaga 1080 gctaactgtg ccaaatattt gccaaaagtg gcccacattt gttttaagat aactgggcca 1140 catttgccat atcttatcct ctgggccact ataggctaac agctgcatta gccagagttt 1200 actgtgccga atatttgcca aaagtggccc acatatgttt taaaataact gggccacatt 1260 tgccatattt tctctgggcc acattaggct cacagccgca atagccagag cttactgtgc 1320 caaatatttg ccaaaagtgg cccacatatg tcttaagata actgggccac atttgcactt 1380 acacgtgtga gccactttag gtttagaccc agattaccct taaatgacta tgccacaatt 1440 ttgccaactg tggcccacat ttgtcctcca tcatttgggc caaatttatc attttccaca 1500 tgggccacat taggctcaca ttcagattac attttgccat aagtgccaaa tctttgcctt 1560 aaatggccca tatatgaatt tgaatctttg gccccccttt gccattgtac aggtgggcca 1620 cttcaggctc acattcattt tgtctgggcc gaaggaatac caccagtgcc gcataactgc 1680 ctaaagtggc ccacattcgt atgctatctg gg 1712 // ID LTR12_DR repbase; DNA; ZEB; 493 BP. XX AC . XX DT 10-JAN-2010 (Rel. 15.11, Created) DT 10-JAN-2010 (Rel. 15.11, Last updated, Version 2) XX DE Long terminal repeat of an ERV1-type endogenous retrovirus - DE consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-493 RA Bao W. and Jurka J.; RT "Long terminal repeats from zebrafish."; RL Repbase Reports 10(11), 1978-1978 (2010). XX DR [1] (Consensus) XX CC >85% similar to consensus. 4bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 493 BP; 127 A; 102 C; 99 G; 165 T; 0 other; tgtgggaata taaatttata tatctatata taactgaaaa gttgttttta gggtgtgaca 60 gaccttaaaa tgtcttggtt tcctgagccc tgactgtctc caaagtaatg caaattgtca 120 gtttcacagc acggctgtgt ctattgtctt ctcaagccgg tcaactggtc agacggttct 180 gtctccagaa catcatttgc acactgtttt catgctaaaa tgtatcgtga atgggtactt 240 atatgctgat gagatcgggc ctggaaaggc aatttaccct gccgacaaca acacttcatt 300 tgcatgcttt gttttagact gtccccacct acatgtatca agcgaaaaat gtatttaaac 360 tgagtgtctg ctgtactagt tagagttctt tcgatgacgc cacagcacgc tgagttcttc 420 ttattaaagg attctgcttt gaagataccc gaaagttctc cctggttttt tggctcgtct 480 tgatttcaca aca 493 // ID DNA-3-4_DR repbase; DNA; ZEB; 241 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 3-bp; KW DNA-3-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-241 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1227-1227 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 241 BP; 82 A; 44 C; 48 G; 67 T; 0 other; ggcccagttt cacaaacagg gcttagatta agccaggact aggccttagt taaattaggc 60 tatttaagac acatttataa aaatggcctt agaaaaatat attactggtg tgcacctgga 120 gacaaaaaaa atgacactga cgtattttaa gacatctcac cgcaagttat tttgagttga 180 gacagctcaa acatgcaatt taatcttgac tagccttaaa ccttgtttgt gaaaccgggg 240 g 241 // ID DNA8-2_DR repbase; DNA; ZEB; 824 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE DNA8-2_DR is a non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-1_DR; KW DNA8-2_DR; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-824 RA Kapitonov V.V. and Jurka J.; RT "DNA8-2_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 252-252 (2004). XX DR [1] (Consensus) XX CC DNA8-2_DR is a nonautonomous DNA transposon. Its is CC characterized by 13-bp terminal inverted repeats and 8-bp CC target site duplications. It is expected to be a member of the CC hAT or P superfamilies. XX SQ Sequence 824 BP; 218 A; 169 C; 162 G; 275 T; 0 other; cagtgttgcc agacgtacga taattatcgt atttgtacga taatttcgac ctctgtacga 60 tgtacgatca ataatgagaa aaatcctata atgtacgata atttcagaat tttatggcat 120 ttaaaagtaa cttcattgta atcttcgggc ttctttactt attgatctag ctgcatcttt 180 tctgtactgt ctgtgaggag taggtggagt taggcacgtt ttctatctca tcttacgttg 240 cctcttctca gccaatcaac gtggagaatg gctctctctc acgagttaac gctcctgctg 300 cgcgtgcccg tggttaggtc agtagttttc agtttgaggt cgatctttta agcaataatg 360 ttaaaacaat tcgtgctgta aaggaaggtg agaaggacgg gagttttccc ttgatgttct 420 cgtttttccc cattcaaatg cgtcttgtga acgtgttttt ttccaaagtt agtctcatca 480 aaacaaagca tgcacacctc agagtgtgca ctgctttcaa gccaaccagc actattttgc 540 agtccattac ggcatccaat ttgtacggaa aaataaatcc ctcaacatct ggcaacatgc 600 aacattcaat gacagcaact cagaggtgga gtttgaacct tcccactaag gtacctagtt 660 tttttttaac gtattgtggt aatttgcacg ttgtgtcaaa acgctatgtg tctagataat 720 agctgtatac tctcagtttg actcgtgtac gataattttc ctccaaatac gataattttg 780 aggttctggt acgatacttt gacatttcca atctggcaac actg 824 // ID BEL20-LTR_DR repbase; DNA; ZEB; 844 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL20-I_DR; KW BEL20-LTR_DR; BEL20_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-844 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1388-1388 (2008). XX DR Genome; chr12; Positions 35878300 35877457. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 844 BP; 239 A; 182 C; 162 G; 261 T; 0 other; tgtgcaaaag gaccgctgat gtctacgtcc cggatgacac caaccaatca gtgcacaatg 60 caacgcagta tcatcgaggt gggctgagct aagaactagt tgagagcgac agtgcacacg 120 ggagataacg ctggcgataa cacctgctat cgaactcact cggcaaaaaa gagacaaaac 180 agtcactact ggtatactgt gctacagtgt gcgtgtctta aaagtaagtg atcactattt 240 ataaagttat tattgacaga agatcagctg ttgttcacgt accacctgta gcagcttgtt 300 agcacagtac tgcttatatt accagcttaa actcgggaat taccaagtct ttaagtacaa 360 gttgtagtta tatgttcata tgtacacatt ttagtatgtt actgatgtat gtttatagta 420 taacacgtat gtgcgcgtta tgcgacgata attggctacc tgcggtgatc attcaagttg 480 gacatgccca gacaaagacc ctgccatcat gcggcattca tgtgctcttg gcgccctctt 540 gtggacactc ggagaagtgc acctttatta tctactgggc cattattttc ttactatact 600 tgttgtatgc tgttcagtta gtaatgttta ttgtgaaaca tctgtattga ttattgttta 660 ttgtttattt gtttcagaaa ccacacatat gttccattgt ataccaagta ctttagtaaa 720 tgaaaatcac atctaatacc ctgagtgcct ctcttcaatt aagtaaccat ctcactcagt 780 tctgaccgac tgcctattgt gtttcacacc caaatttaat acatccacga acccctccat 840 ttca 844 // ID DNA8-4_DR repbase; DNA; ZEB; 2395 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE DNA8-4_DR is a non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-4_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2395 RA Kapitonov V.V. and Jurka J.; RT "DNA8-4_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 254-254 (2004). XX DR [1] (Consensus) XX CC DNA8-4_DR is a nonautonomous DNA transposon. Its is CC characterized by 16-bp terminal inverted repeats and 8-bp CC target site duplications. It is expected to be a member of the CC hAT or P superfamilies. XX SQ Sequence 2395 BP; 762 A; 360 C; 418 G; 855 T; 0 other; cagtggttct caaactgtgg tacgcgtacc actagtggta cgcgggcttc ctcctagtgg 60 tacgcggagg aatcgctgaa taatgaaaga aacaattaac acttttaatc ctttaatgcg 120 taatataaca ttgatgtgat cagcattggc tgttttgtga agcgtacgat cacaacgctg 180 tgcttagagt gtttatttta gtgcattgtg tcataagctg ctgaaaatca gtttccgaag 240 tttaagaatt gattctgaag cgtgatttga ctgacatgaa aagtggccaa tcacagtcga 300 ccttgtctag ttgcgtgaaa cgtaagggcg ggacaaaggc tttctgtgtt gcagagacag 360 aaaaaacaac ttaaaaacaa atgttttgtt ggaataatgc ttctcggatg ttttcactac 420 agagatgtct agcagagtaa ttaaaccgcg gacatatttc ctgcctattt gatgatcttc 480 atacacgttt cgctctccga cagccagtcg actcgcctct gacctgtcac tcctctaaat 540 acattctgaa tggcggcgcg ggaatgaagg tattgcgcaa ttactcattt ctgcaaatgc 600 ggcgagtgct ttattttaac acgagagcgc attcatgtac gcaaatcgct aaaacagata 660 tgcgagctct taaataggct tttttcaaat gaactgcaag tgctctcatg atcgcgtgtg 720 tgctcagata gaatacaatc gcgatctttc cactattaaa gtagacatta tttatctttg 780 ctccttaact aaatttagtc aaaagtattc aaagttatca aatatttaga aatagattga 840 tacatgatcg atgacaatgc taatggtctt cttatttggc tgtgttgtga agtgaaactt 900 acttttagca aaggtggaac attttttatt tctttacgac aaaataatta tgctggaaat 960 gtttctggat gaggtatttg tgtgatttat acatttagct gtattctgat ctgtcctaaa 1020 gcattacagg tcaaagcaat ccattgttag cctacttatt gtttatttat cacaatttac 1080 aataaggctc ttaaagcact gctttgataa atgtagtgtc gtcatatttc aaatacctca 1140 agtaaaatga agtcatttgt tttatttttt tactgacagt gtacttgtat ttttttacgt 1200 ttctatttta gattaagatc aagtgtaaaa taaaatctac atgtgatgta cagctattta 1260 agaaacaaat ttgccatatt gtaaagaaaa tagtgtaaaa tatgtaaata aaaggctaaa 1320 tttaaatctt tttaccatat actttaattt aacctttcaa agcttaaaaa tatatatttt 1380 taaaaatggg caaaaagtgt ctctatggcc ttaaagatta tttggtagtt ttaccagtgc 1440 tattgaaaaa gattaattgt catgaaagga aaaattgact atacttcctt tgttagtgca 1500 aattatacat gtagctttgg gatgtaagca atcaaaccat taaatcaggt ttacacttaa 1560 aaaaaagtga tttcataatg aaaatcagat tttttttcaa aagtaaccaa ctagtacttt 1620 ttaaaagagt aatttgtact tttacctaat attagtatta tttactaaaa tttttgttat 1680 tttagcagtg taatagtatt tttacttgag tacaaaaaag atttcacaag cacttagtgc 1740 aggtttatta tattattaat atatgattaa taaatattat tattaatggt ttttatttgt 1800 atctattcct atgcaatgtt ttatgggttg ctgagaatac atttcaggtc ttcaatacat 1860 agcatgtcag tcttgaaaaa aaaataggtg ctggaattga ttttcattag gctgtgcctg 1920 tatgaaccct gtcatgtaca tttaacattt atttcaagat ttgtctgaaa gtgtatgaat 1980 atgacatata gcctatattt ataaggtcca agcagcattt ccagcttttg atgaataggc 2040 tactatgaat actttagatt taagtacagc agttttcttt ttacttttta agaacagcgc 2100 cattttttta tgaactttta aaagcacatt ttaactaaaa cttttttttt ttttatctgc 2160 aagcacagtt aatgttcaga ctatttcaaa tgtttaaagt gactgacaat aatacatatt 2220 cttactaata ggaagtaatc tgcatcgttt taaaactgtg cacacctgta gctgcttaat 2280 taggcttacg acgctatttt ttaatactgg ttaaaaaata gcgtcgtaag gtggtacttg 2340 gagagacgat ttttttctaa ggtggtactt ggtgaaaaaa gtttgagaac cactg 2395 // ID Gypsy10-LTR_DR repbase; DNA; ZEB; 1641 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy10_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy10-I_DR; Gypsy10-LTR_DR; Gypsy10_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1641 RA Kapitonov V.V. and Jurka J.; RT "Gypsy10_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 314-314 (2004). XX DR [1] (Consensus) XX CC Gypsy10-LTR_DR is a long terminal repeat of the Gypsy10_DR LTR CC retrotransposon. Its internal portion is deposited in Repbase CC as Gypsy10-I_DR. XX SQ Sequence 1641 BP; 477 A; 249 C; 363 G; 552 T; 0 other; tgtagccctt attaaaaaag agcttactta aacccacttg cacctaaaag cagagcataa 60 tatattttta tacatagaaa tatacagaga ttataattat atttttcata ttatggtaat 120 ctaatagata gataatatta acagaccact attctgttaa gaaaatgcat tttagtatgc 180 atactattat aggagttata tggtatgttt ccctttaagt aactcgttcg ctacatttca 240 gtgtttggtt gatttgcact tgggtgaaaa tgtgcttttt gctatcaagt ttgtgtgtta 300 atacctgatt gggtctatac caattagtcc ccgcctctct tataggtgaa taagctatta 360 ttggtttatc gtgagagctg aggagaagat cgagagaacg agaccgcaga ggcttgtctc 420 tgagagaaac ttaagtgtta taactgcaaa tcgtgagtta tatacagtct atattgattt 480 aaagtgttct aaagctgttt aagaacctta gtgtatgcat taagtgttat aattgcgagt 540 atatggagac cgtttgcgag ctgaagacat tgacgctcgt gtgagaccat atcctcgtgt 600 tttagctgtc tgctacacta gactgacgca gtgttttagc gagtgaactg ttaagagagt 660 gaagctcgtt caggaggaca gattcattgc atcggcggga gtgttcatgc tgtgtgctga 720 cgctgcatcg gactgacggt tcgcggtggg gaacatcgat cgcgggacag acggcatcgc 780 gcggaacgga cggacatcga tcgcgggacg gacgacgatc gtgacggttg ccggaacgcc 840 aggtacattt acagtgtttt ctttttgctc ttttaagagt gaagaggcca ttttgcagtt 900 tcaaaggcca ctacgcaccc gagcaatata acaggcctgc atcgagggtg ggtgagactg 960 tgtgtgtgtg tgtgtgtgtg agagtgggcc cactttcaat aatatttcag tgttgcaaaa 1020 gtgtttattt gcggttaacg cttacaaaag agtgaaaaag gtgttatttc tatatttttt 1080 ttagagaata tttgctttga tattctataa tctaagatac gaatgcacta aataattgtg 1140 agtaatctaa atcattcaag ggtgaattta cagttttagt aagggagcgt ctaaaaagtg 1200 catcaagtca caaaagtgta aaatgcttat tgcttaagtg attggggtcc agttgttgga 1260 tactcatact gtgtgtttat tgaaggttaa taactgtttc tctttgcaat tagtgtttta 1320 tctattgtgt attgatataa ggttgtatta ataattcata tcttattata aatagtatta 1380 taattatatt tatacctata atatcaccta tataaataac aaatacatct atactcatct 1440 ttattttatt ttgtttataa ataaatttct atttgtttat aacttaattt attttgctta 1500 cttattcatt gaagttctct aaaagtggta tctgattagg gggagtgaaa taatagtcga 1560 gaggctaaac tgtaggaatc cggtgctcca ccctatagag cttaggtcag gacaaagcct 1620 aagagggcag ggggcgctac a 1641 // ID DNA-1-5_DR repbase; DNA; ZEB; 748 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-748 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1214-1214 (2008). XX DR [1] (Consensus) XX CC The element is characterized by 38-bp TIR and 1-bp TSD. The CC copies are ~98% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 748 BP; 185 A; 192 C; 161 G; 209 T; 1 other; accgtacgtt cacaccgaaa gcggcgagag cgtccaaggt cgctctggcc gccctggcga 60 caacgctgtc tgccttcagc tccggcggcg agagcgtcaa aactcgctac attgatctcg 120 tactttaaag gagccgttgc agcattatca gttacattcc tgcataagac atgtttttag 180 cgtgaaaatg ttgcgcactt cttatcaaat tcatataaca atggaagatc aagttgcrtg 240 tggtgtggct ttgctctatt tatccaatat gtgtccatat gtctgaaata tcctgaagca 300 gcagtactgt tttgtactgt cacatcgcgt gagattcccg ctttttaaag ataaatttat 360 ataacattaa tgaacatcag taactcgcca gtaacttatt taatgtatgt tcctgtaaga 420 aaacattgta aataattgga aatccggcga ccgcaataat caaacccttg ggaacaactg 480 tggtcggaac caaagttcac aggtctgtgt tccctgaact gctatcaagc ggtggacgtc 540 ttcattctga ttgcttgccg ccgaaccgcg tcatagctca ttaccataaa gttgacttca 600 tttcaactct cctcgacgct cacgccggcg aagacgcgcc gcgctgctcc tcgccgctta 660 tcgtcgccta tcgccgccgg ctctcattga aaatgaatga cttccggcta ctttgacgct 720 ctcgccgctt tcggtgtgaa cgtacggt 748 // ID MOSAT-2_DR repbase; DNA; ZEB; 1603 BP. XX AC . XX DT 20-FEB-2009 (Rel. 14.02, Created) DT 20-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Zebrafish mosaic satellite repeat - a consensus. XX KW SAT; Satellite; Simple Repeat; Nonautonomous; mosaic repeat; KW tandem satellite repeat; MOSAT-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1603 RA Bao W. and Jurka J.; RT "Zebrafish satellite repeat."; RL Repbase Reports 9(2), 568-568 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1603 BP; 526 A; 388 C; 324 G; 365 T; 0 other; ggtcttttag agtctggatg atcatgtctc aagcctgacc atgtcatcca aacccagagt 60 cagctcggaa aatggaaatc aggacaaaca tatgctatcg gccattgaaa gagtttcaga 120 tcagaaagcc actctcccat aatctgaggg aaaggtgcag tttggatttg aacctagcac 180 ctgctgttcc caagacaggt gctctggcca acttagtctt aacactaaaa gaaaaaacct 240 ctggatacga tacttccaaa cgcagtttca tctaagacag tgggggggaa gcaaaattta 300 ctgtgctttc tactcagtta ctttgcacat gcataaagag gctcttgtac gacaaatcca 360 aaaggcgtga tctcaacaat aggtggcagt tacttttccc caaaaccaca ggttgccgac 420 gagtcattcc tcaagcccac taggccctac accgttttct aatctccctt atttaaaaca 480 gctcaattaa gtcatcggct ttattgtaga gtacccctag tatctgagcc tgcactgcgc 540 aatgaaagaa agaagcaaaa cgggatggac agatgcctaa aatgttagaa acgcaccaca 600 ccttaagtca gtgaagaaaa agagcaaagc ttgcgactct ggtgggactc gaacccacaa 660 cctttgaatg gctcagctca cagcctagaa gtccaatgcg ctatccattg caccacagag 720 cctaagtgct ctgcttcctc tagaactgtt ggacagaaag cactttttag tactctctgg 780 cccatgaaag aacagccaac tgcggacaag gcactgctgg aattcaaagc caggatctcc 840 tgtttactag acaggcactt tgaccaacta agccacagcg ccaaacgacc ttgacacagg 900 ctaaatgttt aattccaaaa aaaaaaaaaa aaaactaaca acggtagcgc tgcatgtggg 960 aatgtctgga cttgaaatat taaagtccaa ttggttaggc attaccacac tatgctatat 1020 aaaggcatcg tccaaagccc tcttgaacga gcagcgttct taaggatttg agttttgtta 1080 agaaaccgac tgctaaacat cccgctgcgt aaacactcgt tactcttaac aaaagcaaac 1140 acgaaacctg tacattaggc actgctggga tttgaaccca ggatctcctg tttactagac 1200 aggcgctttg accaactaag ccacagcgcc aacgctgaat aggaggcaag acgtttttgc 1260 acacagagac taaaaaaaga cagcaagtct taaaatggca agcaaagagg ttgtttaact 1320 ctatttgtcg cactattcat tacgcctcaa agctatacaa ttttataatt cattccggtg 1380 agctctataa aacaagcaca cgcaccacct ggactaaggc gctgctggga tttgagccca 1440 ggatctcctg tttacaagac aggcgctttg accagctaag ccacagcgcc acagaaattc 1500 acagtaagag acatttgccc gataacaaaa atcgaagaag aaaaaataat acacaacaaa 1560 aaccagcatg tcaggatgca gaaaatgctg acaagccgga cgc 1603 // ID TDR8B repbase; DNA; ZEB; 963 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE A non-autonomous hAT DNA transposon from zebrafish - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; TDR8; KW TDR8B. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-963 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1256-1256 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 963 BP; 261 A; 190 C; 249 G; 262 T; 1 other; tagggttggg cgatgtcgac caatttggca tcgtacgatg tctaatgtga aacatcgcga 60 tggacgatgg catcgtcgtc gtaggcggcg gtgaattaat tatttatgaa taattaatta 120 attcattaat tcataacgaa ttaattattt gtagcctacc gtttcaacta cctgacccgc 180 atggtctttg ttttacccat aaccaaatca taaataaata aagataagtt acacacaaat 240 taccacctgt caatcacttt ttccgcggga ctctggcatg aataggcaga gtgatctgtg 300 tcgttataat ggcgtcgaca aacttggttg gtaaaaaagg tgcacaacca acaggaacca 360 accaacagta tctgaggttt tcrctaaaat tactaagtac aagcgtgaaa gcgaaagatt 420 gaagcagtgt actgatgctg tgacacgtta cctgatagat tcaaaagaca gaagagtcgt 480 tagatagaga caagattaat taaatatcac gtttaacaac tatagtgaga cgcgatccag 540 cggtacatcc ttgataaact gtccgacgtg cactgctctc tggggttttg tgctcaaagc 600 acccgctgac tgcctggagc tcagacgcgc gcgtacgctg cagcgcatga cagagtgtgt 660 gtgtgtgtgt gtgtgtgtgt gtgtggtcac gtgatgtgcg ttttcagcgg tatagtgtgg 720 aaggagggct tttcagaaat gctagatgaa acgccagtgt ggacgtggat tgttcgtttt 780 cgttctaaaa tgccatttta aaactaagac gtattagtgt aaacggggcc taagtgtgta 840 tttttcgcga gcgggttgct gctgcgacgg gggcggggcg gaggatcgcg atgccggctc 900 agcatcgtga tgtctatcgg ccatcggcga tggacgatgg catcgtctat cgacccaacc 960 cta 963 // ID Gypsy-165-LTR_DR repbase; DNA; ZEB; 541 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.12, Created) DT 17-NOV-2008 (Rel. 13.12, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy165_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW endogenous retrovirus; Interspersed repeat; Gypsy superfamily; KW Gypsy-165-I_DR; Gypsy-165-LTR_DR; Gypsy-165_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-541 RA Dib M.R. and Naveira H.F.; RT "Gypsy-165_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2163-2163 (2008). XX DR [1] (Consensus) XX CC Gypsy-165-LTR_DR is a long terminal repeat of the Gypsy-165_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-165-I_DR. XX SQ Sequence 541 BP; 97 A; 123 C; 100 G; 221 T; 0 other; tgtcaccatc ctgcagtttg tctccattcc cctagaggtc cccgtgtttc ccctctgtat 60 ttgttcttcc tcatgtgttc ttgtttactt aatttagatc acctgtgact tgttccagcc 120 agtgtattta agttgtcact ttgtctgtgt tcctcactcg gtgtttgagt catctctggt 180 gaacttgcct tgcgtttccc ttgatctcct gagctacatc tagtaagagt gttttatttt 240 ccttgttgct gactgcatta attatctgtt tgttttctcc cctcgtggag ttttctttac 300 atgtttatta tactgttaga gtatttttgt atacctcatt ctgagaagaa cctaagttgt 360 ctgttttgct acctcatttt tgagaagaac ctaagttgat attttttttt aataaatcct 420 tgttttccgg attccctctt ggagtgtttt cgtttgggtc caacacaact tcctctgtgg 480 cttagtggta tacacatcgg tttccacacc agagacccgg gttcaagacc cggtcgtgac 540 a 541 // ID DNA14-5-1_DR repbase; DNA; ZEB; 693 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE DNA14-5-1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA14-5-1_DR; KW Dr000016; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-691 RA Bao Z.; RT "Dr000016, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-693 RA Kapitonov V.V. and Jurka J.; RT "DNA14-5-1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(6), 4-4 (2002). XX DR [2] (Consensus) XX CC This element is characterized by imperfect 14-bp terminal CC inverted CC repeats and 5-bp targets site duplications. CC Its classification is not certain yet, although it CC is expected to be a member of the TRANSIB superfamily (see CC description of TRANSIB in drorep.ref). CC There are ~1000 copies of DNA14-5-1_DR present in the zebrafish CC genome; they are ~90% identical with the consensus sequence. CC Unclassified version of DNA14-5-1_DR was identified as CC the Dr000016 repeat. XX SQ Sequence 693 BP; 189 A; 153 C; 151 G; 200 T; 0 other; cagtgtttcc tctaggattt tttccagctg tggcggcagg ccttttttac acagatctac 60 caactacctg tggcgttatt tcaatgacaa atgtcgtgag cgcagtatta caagtcgaga 120 tcgcatttat gtaatagcct acgagcatgc gaatctcttt gcttgcgcgc cgatttcctc 180 tgctcgtgca caaaacttct tgcacgccct caaatatacg ctgctcaagc gcagatcttc 240 ttgtgcgctc tcaaataaac gctgctgaag tgcgatttag tgcgtttatg taacgagtat 300 gtctccaaca tttattagat ttgctaggaa tatttatgaa tgtctccaat agacctacag 360 agcgacatta atgcgtcctg aagtaaagtg aaacggctat acgtcgtgtt tgctggcctc 420 acgcatcacg cacctgtcag tcagacagtc agtcagtcag cacgtaacct taaagggtta 480 aacaaatgac gcacagcact actacggtta cagaaaagtt tgcgctgtta taattcactt 540 accttttaat acgttttggt gcgattatta cccgctatta aaaaaataaa caaacgactg 600 aatgttttga atgagaagct gtaatgtagc cgtggcggga tgaattttgg tgtggcgccc 660 cgccatggaa gaatgaatgt agcggaaacc atg 693 // ID TZF28C repbase; DNA; ZEB; 1502 BP. XX AC U51227; XX DT 08-JAN-2009 (Rel. 14.02, Created) DT 08-JAN-2009 (Rel. 14.02, Last updated, Version 2) XX DE Mariner/Tc1 type DNA transposon. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TZF28C. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1502 RA Jurka J.; RT "Mariner-type DNA transposons from zebrafish."; RL Repbase Reports 9(2), 560-560 (2009). XX DR [1] (Consensus) XX CC ~86% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1502 BP; 463 A; 316 C; 342 G; 378 T; 3 other; cagccatttg ctaaaatcat ttaagttaat ttatttcctc attaatgtac acacagcacc 60 ccatattgac agaaaaaaag aattgttgac atttttgcag atttattaaa aataaaaaaa 120 ctgaaatatc acatggtcct aagtattcag accctttgct gtgacactca tatatttaac 180 tcaggtgctg tccatttctt ctgatcatcc ttgagatggt tctacacctt catttgagtc 240 cagctgtgtt tgattatact gattggactt gattaggaaa gccacacacc tgtctatata 300 agaccttaca gctcacaatg catgtcagag caaatgagaa tcatgaggtc aaaggaactg 360 cctgaagagc tcagagacag aattgtggca aggcacagat ctggccaagg ttacaaaaaa 420 atttctgctg cacttaaggt tcctaagagc acagtggcct ccataatcct taaatggaag 480 acgtttggga tgaccagaac ccttcctaga gctggccgtc cggccaaact gagctatcgg 540 gggagaagag ccttggtgag agaggtaaag aagaacccaa agatcactgt ggctgagctc 600 cagagatgca gtcgggagat gggagaaagt tgtagaaagt caaccatcac tgcagccctc 660 caccagtcgg ggctttatgg cagagtggcc cgacggaagc ctctcctcca tggagtttcm 720 taaaaaacac ctgaaggact ccaagatggt gagaaataag attctctggt ctgatgagac 780 caagatagaa ctttttggcc ttaattctaa gcggtatgtg tggagaaaac caggcactgc 840 tcatcacctg tccaatacag tcccaacagt gaagcatggt ggtggcagca tcatgctgtg 900 ggggtgtttt tcagctgcag ggacaggacg actggttgca atcgagggaa agatgaatgc 960 ggccaagtac agggatatcc tggacgaaaa ccttctccag agtgctcagg acctcagact 1020 gggctgaagg tttaccttcc aacaagacaa tgaccctaag cacacagcta aaataacaaw 1080 ggagtggctt cacaacaact ccgtgactgt tcttgaatgg cccagccaga gccctgactt 1140 aaacccaatt gagcatctct ggagagacct aaaaatggct gtccaccaac gtttaccatc 1200 caacctgaca gaactggaga ggatctgcaa ggaggaatgg cagaggatcc ccaaatccag 1260 gtgtgaaaaa cttgttgcat ctttcccaaa aagactcgtg gctgtattag atcaaaaggg 1320 tgcttctact aaatactgag caaagggtct gaatacttag gaccatgtga tatttcagtt 1380 tttcaatttt ttaataaatc tgcaaaaatg tcaacaattc tgtgtttttt ctgtcaatat 1440 ggggtgctgt gtgtacatta atgaggaaaa aaatgaactt aawtgatttt agcaaatggc 1500 tg 1502 // ID Gypsy147-LTR_DR repbase; DNA; ZEB; 703 BP. XX AC chr15; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy147-I_DR; KW Gypsy147-LTR_DR; Gypsy147_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-703 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1556-1556 (2008). XX DR Genome; chr15; Positions 37963421 37964123. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 703 BP; 195 A; 121 C; 123 G; 264 T; 0 other; tgtaacaggt atgaaaatta taattcgctt taagtataga attacacaaa atatgtgttt 60 tatctaatgt gttagtaagt atacctgagg tgtttgttag tgtgtgtctt ctcctttaaa 120 tttctaaggg agctcacgct gtttaatttc ctgttttgcg ccccgcctct tccctttctg 180 cactgactat gcatgggaga ggtgggtgtc tgctaagctc caattaatct ttaatcatct 240 ttactttata aataactttt atagctagtt ttcatgttta aataatattt atacctttat 300 tttcacatat atttcattat ttttgtgtta tcacttaaat tgaggttttg attaacttta 360 gcctaactga agcatgtgaa tgctaaatgc attgcacttt acttttaaca gaaaagctgt 420 aaagggcgaa gacacactgt aaaagctgtg ctttttgatt tattctcagg tatgttttgt 480 cactttgtta atattgtttg gagttttatt ctatctttct ctccatctct tccctttctg 540 cactgactat gcatgggaga gaaaagctgt aaagggcgaa gacacactgt aaaagctgtg 600 ctttttgatt tattctcagt gagtaaagag ccgtcacaac tgcagatctc cgactggtta 660 ttccttaata aagcaaagaa aacacaacgc agaaccggtt aca 703 // ID Nimb-1_DR repbase; DNA; ZEB; 5939 BP. XX AC . XX DT 01-FEB-2010 (Rel. 15.03, Created) DT 01-FEB-2010 (Rel. 15.03, Last updated, Version 2) XX DE A family of Nimb non-LTR retrotransposons - consensus. XX KW Nimb; Non-LTR Retrotransposon; Transposable Element; Nimb-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5939 RA Kojima K. and Jurka J.; RT "Non-LTR retrotransposons from vertebrates."; RL Repbase Reports 10(3), 486-486 (2010). XX DR [1] (Consensus) XX CC Only 9 sequences in the zebrafish genome are >97% identical to CC consensus. The 3' termini are composed by (GAA)n microsatellite. XX FH Key Location/Qualifiers FT CDS 1892..5662 FT /product="Nimb-1_DR_1p" FT /note="includes endonuclease, reverse FT transcriptase, and RNase H domains and a CCHC FT zinc-finger motif." FT /translation="YRKKLILRYDGINNTVTSLEFKKWIDDFKIKPHVICV FT QESWLKPNLSYVVQGYEIVRKDRKKGNGGGVITLVKAGVAYREIKVNKKYE FT SIVIDVRAENRNIRVINFYNPCKRLNXDLTKNIGIEGHDNIMWCGDFNAHS FT TLWGCVNSDQNGQVIEEILDSNDLVCVNDGNITRVDVLRGMNSVIDLTIVS FT NNLARQCEWETLYHSTLGSDNFLIKCSIGIEVDECHEEKMTRWRFKGANWK FT KYKEICENKMLSTKFNFEEIEDTNNKINAVLCSAAQEVXGKNKGGKIKKAV FT PWWTXKCNEVIQIRNKALKKVKKTYSFDEFIKYKRTQAEVRKVIRTTKRNY FT WRKFCDEIGADIDINDMWNMIKKMGGKQINKNIPALIKEGKYIMTNEEKAE FT ELVEVFVKVHSNNNLSDSMRIKREQIIKENPNVLTQKAPTGGTLDADFTIH FT ELKQVIQNLKCTSPGKDEICYEMIKHLSDTSLIRLLKFFNKIWEVGKLPMS FT WKHGVIIPIGKPGKEHTXATNYRPIALTSNLCKIMEKLIVSRLNYVLESRN FT LLNHYQSGFRKGRSTMDPVLCLESEIRKAMMNKEVLIGIFFDIEKAYDMLW FT KEGVLTKLDNMGIKGKMYNWIMDFLFNRTIQVRVGTSYSSIHSTDNGTPQG FT SVCSPVLFNIMINDIFETVGQEIGKSLYADDGALWKRGRNXEYVKEKMQKA FT VNEVENWVNKWSFKLSVEKTKVMCFSKKKRKSPNIIINLYGQELEQVQEIR FT FLGVWIDSKLTFTTHIKKIQDKCKKRLNILRCLAGVEWGATRHALKRIYCA FT MIRPILDYGSIVYCSASKSQLQKISVIQSQALRIGSGAFRTSPIPAIQVEM FT AEQPIKYRTLKLKMAYWSTVRGHADTHPVKAIIKNCWEHEYSKNRSFGWNA FT ELEARKLGLDQIKIGPTLTIPDTPXWMFQVPQVDLQIHSIIKDKNRSLSKD FT NLVRRYLKQKYLNYTQIYTDGSKNPDNGYTAAALYIPNNKQNVSMRLSNHI FT SVFSTELMAISMAIQWVEQNQPLRVVVCSDSYSALHCLNMGTSETRQDLLN FT EIFQNINKINQIGVTVKFLWIPAHVGVEGNEIVDKMAKTALGKDEVDIEIP FT LSKTEIKHIINKSVNNLWQEDWDSESKGRHLYNIQNKVGKERKVYGNRKED FT TVISRLRIGHSNLNSSLFKIGKRETECCRHCGQIETIEHVLIKCTKYRRER FT DQFKIKLRDLDMTDITITNIFKNAQKETKVYDYLLKFLRSTGLFYRI" XX SQ Sequence 5939 BP; 2305 A; 829 C; 1264 G; 1534 T; 7 other; aaaagtcaca agtcaaataa tgggagaaaa gaaatggacg agaggagtca ttacgggaat 60 acaaacaaat gtagctgtgg aaacaattaa gaaaagcatt aggggagcaa aagtgataga 120 tgcaaggaga ctaaaaataa cccgaaataa tgagaggagt gacagtctgt cgataatgat 180 acactttgaa gaggaaaaac tgctggaaag agtgtttgtt ggattaatga gctatgcagt 240 aagactgttc atccccccca ccgctaagat gttacaagtg ccaaaaattt ggacatgttg 300 cagcagtgtg caaaggaaga caacgatgtg caaaatgtgg aggggaacat gactacggaa 360 aatgtgaaga aggtgtgaaa ccaaaatgct gtaattgtgg gggagaacat agcgcgggat 420 atggaggttg tcaggtgagg aaaagtgctg taaaagttca gaatgtcaag atctctgaag 480 gaataacata cgcagaagcc atcaagaagg ttaagacgga agaaaagaaa gaaacagaag 540 ttaaggttgt agacacaaaa atgagaaaag agtgtttggt tcaggaaata gacgattcca 600 tgattattca aaataagatg gaatttttaa cctttatggt agaagtggta aattgctcgg 660 cccaaactaa aagtagaaca gaaagaacaa gaataattct aagagcagcg gaaaagtact 720 tgaatgtggt aggaataaca gttgaacaga taaatgagaa actaaagatg aatgtagcaa 780 actctcagtc aacatgtgga gaagtataat ggtgttttta atcttgcagt ggaatgcaag 840 aagtctgata tcaaatggat tagaatttaa tagggatgta aaggtatcag aatttcacgg 900 tacggtattt attgaatcat ttacaggaaa aaacgtatga aacttatgaa acttatgaaa 960 atactccaaa aaagtgccaa aagtgtcaat gacatacaaa ttagccatct acctgtaagc 1020 tttgaaacag gaacttcaat tttaataaca aaaaaaatta aatcatgtaa aaaaattaag 1080 tttcaattta gtattgttga aaactcatca cattcaacat ttaatcactc actcacttag 1140 atagagatgg gtttaaagga aaattatcat ataaatataa tctggtaaaa gctggtatct 1200 ctgggtattt acaatgtccc ctgcaacaga aaaaaaccct ctcatttggg actgaggttt 1260 ctgggacaag agagatatga cttggcccag gttgacagca gtgggtaacg ttgtgcattg 1320 tctttccccg acttgagagg acaaaccatg agtgagatag aggtctcatg tctgcatcaa 1380 tctgaacagt gtgctgtgtt tctgcagtcc ccatgactga ttattagtcg tattccccaa 1440 cttgcaggca cgtgcacaca cagggctcaa cctgtgcagt gcacatgccc tttttagtct 1500 tggatagaaa gttcccttcc aaaatgatca aaagtgcccc cgcgacgcga caaaccctcc 1560 gtcccgcttt acagtacgcg cgcgacaaca cagctgatca gaacgcgcgc gacaacacag 1620 ctgataagaa ctcgcgcgac aacaccgcta ttcagtacgc gcgcggcgac aacacccgct 1680 ttagggtttt ccagctcttt ttcatccccg cttctagcag cacactccat ttccgcatta 1740 ctggatctgt agcgacaaca gaccgcaagg gatgatggtc aagcatgggc tgatgggaat 1800 tgtagttttc gctacctccc gttcgcttca ttcgcctgag caaattttct cagaagacct 1860 acagttttac agagtcatgc gactacggta atatcgaaaa aaattaatat tgcggtatga 1920 cggtattaac aataccgtta catccctaga atttaagaaa tggatagatg attttaagat 1980 aaaaccacat gtaatatgtg tacaagaatc ttggctaaaa cccaacttaa gttatgtggt 2040 ccaaggatat gaaatagtaa gaaaagatag gaaaaaagga aatgggggag gagtgattac 2100 tttagttaag gcaggggttg catacaggga gataaaagta aacaagaaat atgagtcaat 2160 agtgatagat gtaagggcag aaaatcgtaa cataagagtg ataaatttct acaacccctg 2220 taagagattg aatakcgatc taactaaaaa tataggaata gagggacatg ataacataat 2280 gtggtgtgga gattttaatg cacatagtac cctgtgggga tgtgtaaata gtgatcaaaa 2340 tgggcaggta atagaagaaa tattagattc caatgattta gtatgcgtaa atgatggaaa 2400 tataacaaga gtggatgtat taagaggaat gaattcagtc atagatttaa caatagtatc 2460 caataactta gcaaggcaat gtgaatggga aaccttatat catagtactt taggaagtga 2520 taattttctt ataaaatgtt caattgggat agaagtagat gaatgtcacg aggagaaaat 2580 gacaagatgg agatttaaag gtgctaattg gaagaaatat aaagagattt gtgaaaataa 2640 aatgcttagt acaaaattta actttgagga aatagaagat acaaataaca aaattaatgc 2700 agttttatgt agtgcagcac aagaggttaw agggaaaaat aaagggggga aaattaaaaa 2760 ggctgtacca tggtggacag awaaatgtaa tgaggtaatt caaatcagaa ataaagctct 2820 aaaaaaagtt aaaaagacat actcttttga cgaatttata aagtataaga gaacccaagc 2880 agaagtaagg aaggtaataa ggacaacaaa aagaaattat tggagaaagt tctgtgatga 2940 aattggcgca gatatagata taaatgatat gtggaatatg atcaagaaaa tgggtggaaa 3000 gcaaataaat aaaaatattc cagctttaat aaaagaaggt aaatacataa tgaccaatga 3060 agaaaaagca gaggagttgg tagaagtctt tgttaaggtt cacagtaata ataatttatc 3120 agacagtatg agaataaaaa gggaacaaat cataaaagaa aatccaaatg tattaacaca 3180 aaaagcacca actgggggca cattggatgc agattttact atacatgaac tgaagcaggt 3240 aatacagaat ctgaaatgta catctccagg taaagatgaa atttgttatg aaatgataaa 3300 acatctttca gatacatcac taattagact tctaaagttc ttcaataaaa tatgggaagt 3360 gggaaaactt cctatgtctt ggaaacatgg agtgataatt ccaattggaa aaccgggcaa 3420 agagcataca gwagcaacaa actatagacc aattgcgttg acatctaatt tatgtaagat 3480 aatggagaag ttaatagtat ctcggctaaa ttatgtgtta gaaagtagaa atcttttaaa 3540 tcattatcag agtggcttta gaaaaggacg aagtacwatg gatccagttt tatgtttaga 3600 atctgaaatt aggaaagcaa tgatgaataa ggaggtgtta atagggatat tcttcgatat 3660 cgaaaaggca tatgatatgc tttggaaaga gggagtatta actaaattag acaatatggg 3720 aatcaaaggg aaaatgtaca attggattat ggactttttg tttaacagaa ctattcaggt 3780 aagggtagga acaagttatt ctagcataca ttcgacagat aatgggaccc cacagggtag 3840 tgtatgcagc ccggtgttat ttaacattat gattaatgat atatttgaaa cggttggaca 3900 agaaataggc aagtctttat atgcagatga cggtgctttg tggaaaaggg gaagaaackt 3960 agaatatgtt aaagaaaaaa tgcaaaaagc agttaatgaa gtggaaaatt gggtaaataa 4020 atggagcttt aaattgtctg tagaaaaaac caaagtgatg tgtttctcaa aaaaaaaaag 4080 gaaatcccca aatatcataa taaatctata tgggcaagag ttagagcagg ttcaagaaat 4140 caggttccta ggagtatgga tagactcaaa acttacattt acaacccata taaagaaaat 4200 acaagataaa tgcaaaaaac gattaaatat tctaagatgt ttggcaggag tagagtgggg 4260 tgctaccaga catgcattaa aaagaatata ctgtgctatg ataaggccaa ttttagatta 4320 tggcagtata gtatattgct cagcatccaa atctcagcta caaaaaatta gtgttattca 4380 atctcaagca ttaagaatag gaagtggagc atttcgaaca tcgcccattc cagcgattca 4440 ggtggaaatg gcagaacaac caataaagta tagaacactt aagctgaaaa tggcatattg 4500 gagcactgta agagggcatg cagatacaca tccagtaaaa gcaataatca aaaactgttg 4560 ggaacatgaa tatagcaaga acaggagttt cgggtggaat gcagaattgg aggccaggaa 4620 attaggattg gatcaaataa aaataggacc tacattaact ataccagata cacctcmctg 4680 gatgtttcaa gtgccacaag ttgatttaca gatacacagt ataattaagg ataagaacag 4740 gagtctgtct aaagataatc tagtaagacg atatctaaaa caaaaatatt taaattatac 4800 acagatatat acagatgggt ctaaaaatcc agacaatggt tatacagcag cagctctata 4860 tattcctaac aataaacaaa atgtttcaat gaggttaagt aatcatatat cagttttctc 4920 aacagaatta atggcaatat caatggcaat acagtgggtg gaacaaaatc agccattaag 4980 agtagtggtg tgctctgatt catactcagc actacattgt ttgaatatgg ggacctcaga 5040 aacacgacag gatctactaa atgaaatatt tcaaaacata aataaaataa accaaatagg 5100 agtaacagta aaatttttat ggatccctgc tcatgtagga gtagaaggaa atgaaatagt 5160 agataagatg gcaaaaacag cattggggaa agatgaggtg gatattgaga tcccgctatc 5220 taaaacagaa ataaaacata ttataaataa atcagtaaat aatctttggc aagaggactg 5280 ggattcagaa tcaaaaggaa gacatctgta taatatacaa aataaagtgg ggaaagaaag 5340 gaaagtatat gggaacagga aggaagacac agtcatatcc agacttcgta taggtcattc 5400 caatttaaac agttcactat tcaagatagg gaaaagggaa acagaatgct gcagacattg 5460 tggacagatc gaaaccatag aacatgtatt aatcaagtgc acaaaatata gaagagagag 5520 ggaccagttt aaaataaaac taagagattt ggacatgact gacattacaa tcactaacat 5580 ttttaaaaat gctcagaaag aaactaaggt atatgattat ctgctaaaat tcttgaggag 5640 tacaggtttg ttttacagga tttaatcgga aagaattata atttggttat ttagttagtt 5700 ttatttagat ttttattatg aatattgttt attattattt aattactatg taagggttta 5760 aaataggttt ttggtttgtg ttttatgaaa aaaaaaaatt ttttcttttt tttttttttt 5820 tttagagaat aagtagtaat ccttccactc attgaagtct cacactccaa cacagtaggt 5880 ggcggtaatg cacctcaagc tggtttgcca cccgccatta aacgaaaaag aagaagaag 5939 // ID TDR4 repbase; DNA; ZEB; 301 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR4. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-301 RA Jurka J. and Drazkiewicz A.; RT "TDR4: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 32-32 (2002). XX DR [1] (Consensus) XX CC Contains ~30 bp imperfect TIRs. XX SQ Sequence 301 BP; 66 A; 86 C; 91 G; 56 T; 2 other; ggtcgcacac cagawgcgcc gctcagcgcc gcgacacggc gcgcacatga cagtttaaag 60 tatcgcacac cagacgcgca cattcgcatg atatttaaaa tgaaactaat cagatggcgc 120 tctgtggcgc ggcagaaata tgaacagygt cctgagtcgt ggctgggcgc cgccgacagc 180 cgccgacttg cggcgccggt gtgtgtaccc tgatagaaaa cctatgttta gaattctaaa 240 atgcatggcg ctgcgtggcg cgtcgccgag cgtcgccgag cggcgcttct ggtgtgcgac 300 c 301 // ID hAT-N56_DR repbase; DNA; ZEB; 1154 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N56_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1154 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1325-1325 (2008). XX DR [1] (Consensus) XX CC TIR is 12-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1154 BP; 344 A; 230 C; 242 G; 338 T; 0 other; tagggatgcc acggttctca atataatatt gaaccgtacg gtacgacctc cacggttcaa 60 tacgcgcttg tgaattgcgg ttttctcggt tttgcgttta aataatttat gtgcgtttta 120 tatctccccg aattgactgt gtgtgcctgt aagtccatga gctgagatga ggaaactcct 180 ccccgcgagt aaatatccaa tcactatgct ctaaagttag ggggcgcttg accatcattc 240 cacactttaa catggcgagc ggcggtgaag tgaggctgag gcaacagtac aaggaagcgc 300 cggcgaagtg aggcaacagt acaagaaagg agaaagcgcc ggcgtctttc aaatctgcgg 360 tgtgaggaca tttcggtttt gccggtaagt ataatgccaa taaaagaaaa aacggtgaac 420 aaaaaataac tgtgtgcaag cattgtttta cacgtataac catgactgca catctgcagc 480 gccatcaccc agcaatatca ctgtctgaag gcaggatagc agagaaggta ataaagtcct 540 ccaaatagca acaatccctc gctgaatctt tcaagcaaac atacccaact ggttccgaga 600 gacacataaa aataacaaag gcagtgcggg tgtttattgc taaagatttg cagccgtttt 660 cggtgattgg agatgcgggc ttttgtcatc ttataaaagc actcgatccg cgttacagac 720 tacagtctcg cattttttca gcaccgagat aattaagatg gtttatttat gtttacttaa 780 gtacaaatat gtttatttga aatggccaat ttatctttat taacttcaca tgtatgttta 840 ttttcaaagt gtaggatttt atgtgttccc cagtttgtac atgggctacc agcagctgtg 900 agatttcagt gttcattttt tttatacaat acactaggaa ctagtgtact tttcttggct 960 tttaagtaaa acaaaatgag tattgcaata aatcgttttt atttttttct ccttaacctc 1020 ttactgttcc tattgcctat agaataaaaa aaaacacacg tgtcaagcca tgagaaccga 1080 accgaaccga aaaccgtgtt aaaaaaaccg aggtatgtat tgaaccgtgg gctaactgta 1140 ttgttgcatc ccta 1154 // ID hAT-N68_DR repbase; DNA; ZEB; 3032 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N68_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3032 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1337-1337 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3032 BP; 987 A; 538 C; 553 G; 952 T; 2 other; tagggctgca cgatatatcg tttcagcatc gaaatcgcga tgtgcgcatc cgcgatagtc 60 acatcgcagg agggtgcgat gcaaaaaaaa aaaaaaaaaa attatatata tattatgggt 120 ggagccgaac cggaatacgg tattcggaaa agcaggaata gcgtgttttt acgaatactt 180 atttcaaaca atacttgata aattatttgt atttgggaac aagaaaaaca ttatatcaaa 240 aagctgcgtt tcctcatgag accgcastgc atgcccgcgt gagtgagtga gttagtgagt 300 gagacgttca ggagtcggtg ggcggggtgg ggccggggta aggagactga gcaggctgct 360 tctctataca gcaacagaga aggcttggca gagcgaaaaa tacttttctg cattactatt 420 tttgtttatt ttatcatact aaaataacac gtgacagaag ttcactcaaa agtgaaagtt 480 ctttctgtta ttaattacta acactcccgt cattcctact actttcagaa cataaatttt 540 gatattttac catatgaaaa tgcaatgcag ttatgaaaat tgagaaatta aagcaataca 600 gagatctggt ccccgcgtca ggtcagggtg cacctgtaac cttaaaaacg cagacgcggc 660 ggaggcgcac taaactcaca cgggaaagaa ctgggcattc aattcatgtt ttagtgaaat 720 aattctttaa atacagctct tttgatctct aaaggggaaa aaagcaaaag aaatttgtcg 780 ttaaattaat tttagcaaag acgcaactgc ctgtcatttt cacattcgtt tttcagttag 840 ggatttggtc tattagagta acctaataaa attgttttgt tgaacactgt agtctacaca 900 ataatcgata ttataattta ttatatacat tagctattat taaaactaaa ctatacaatt 960 gataaaacaa aacccttttt aatgtttgct ttcattttga caccgctaac agttgtaata 1020 agaaacatca cattatgcag aagcatgcag tctcaagtta aaaacacctt ttgagtttaa 1080 agatgatcag cctgccaagt caagtttaca gtgtcataaa tgaaatgggt agcatgagta 1140 aaggggattt gcccagcaca caacctcatg cgctgcaggt aggctgaatg ataatcttta 1200 aaaatatata ggctatgcaa tacaaaactt taaacattta tttatttttt aatgaaaaga 1260 aaagaaatca ggagagtaaa atataattta aaccactggt ctggcgctga tactttattg 1320 tttgtatcca gaagaagggg aaaaaagcag atgaaaacct ggctggggag acggcgataa 1380 catatcagtt cttcagcaaa gatcacgaca tcattgtgat gcgtttatca tacaacgcag 1440 ttcgtgttta taatcattaa tattaactag tattaattct ctctctagac ttagttttag 1500 ggcttgttct tgtcggcgca atatagcctc ggcttattgc acgtacgaat ccgattgtga 1560 agaatgatta agcgcttgca ttttttattg catccttttc gcaagttatt catttgtttt 1620 gttttttaag tggtgcgaac attatttact ggcaaacatg ttcctcgatc tcgtacatca 1680 acaaataatg tagcctactt aagttaattt accttttatg taaaaattaa ggaataaccg 1740 aatttaatat taataaataa aacatccgta taaacgtaca tgtttatttt tgaactttcg 1800 ttaataaaac aacacaaaac tgtttataaa tattgtagtt aaatattaaa ttactttaac 1860 tgcttgaaac tgtataataa taacaattaa acgttacatt tttaagaact taacaaatag 1920 tctaaatggc actttactaa tcagtgtaat aataatataa atattaataa ttataagtaa 1980 aaagaaaata ttaacctagt aacaatataa gtcagatgtt taaaaaaaac agcgtagttt 2040 ctctccacct tttgtagtca tattttaaat tatttgccag aaaaactaac atgttgactt 2100 tgtccggttt aagtgaagaa tggggttata atgtttccag caatacactg tcagacggtg 2160 tgcttgtagc gaaaatgcac agacaggcta ctttttcaca accatgcgag agagggcttc 2220 tccttctgtt ccgccgcccg catttctcgc aacgcaggtt tctctactct gcggagcggc 2280 gcgttcttaa gtagttacca aatagcctat gctcaatttc tcgtcaatgt cttccattta 2340 tcactttctc ttttgtatct ggccattttt gcaaatgcct ctgccatgcc tggctgagct 2400 gggcttctgc gtttagaaga caggagttga ttaagcctac atggcttttt ttccctcgcc 2460 gtcaatcgcc actagcttgc atggttcggg acccgtggat ctgcgtatcg atggatttgc 2520 tcctcagtgt ttggactctc agcactcagc agtgaatatt aaaacacact ttacttaaaa 2580 cccccataca aaaagcttaa tccaaattga ttcgttcacc tttctcggtc ggcacgaatc 2640 caaaatgttc ccagatgtaa gagttcgttt tctaacaaga ttcatggttg aacttgaact 2700 tgctgaatcg cgcctaatgt tatattctaa cagatatgta atacttggca acataacagt 2760 aatttgtgtt ttaaagtgcg tgacktcacg tactacggcc ggtggcaatg gacaaccgag 2820 gtggcaatga ccaccgcggt gacgcggttg tcatggtgac cgtcacagcc ctagttttaa 2880 gtatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 2940 atatatatat atatatataa aaatatcgca atatatatcg caggggaaaa aaatatcgca 3000 atgtcaattt tttccaatat cgtgcagccc ta 3032 // ID Gypsy159-I_DR repbase; DNA; ZEB; 4323 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy159-I_DR; KW Gypsy159-LTR_DR; Gypsy159_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4323 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1579-1579 (2008). XX DR Genome; chr3; Positions 34991002 34995324. XX CC Positions [1706-2161] - Reverse transcriptase CC Positions [3179-3658] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(2..1852,1856..4324) FT /product="Gypsy159-I_DR_1p" FT /translation="MDPEMPDESLQKTSPQGDPATLLQLSSELSTQGAQLA FT LHRQQLVRLTTLTEELVQAVQRLHVAPVNPSAAPSPPPPSPLSMPLDPLPQ FT SAVNPRLAFPDKFDGNPSICKGFLLQCSLFVDQQPFLYPTDSSRIAFVCAL FT LSGKALEWATAIWKPEGNTFISFKDFLTQFRAVFEHPADGKSPGEQLLTLV FT QGRRTAAEYALAFRTLAAQTNWGEDPLKLFFRKGLSAELQSELACREEGKT FT LSELIEITIRLDNLIKACRVPQLRRLPDLEPTQNEPMQLGYTQLTPEERER FT RLQNHLCLYCGESGHRCINCTVRPARSVSLSSSNISSLTIRIRMSQADWGI FT DCTALIDSGAAGNFMSWEFARRHKLTLLPCESVLAVEALDGRPIGNGRIQH FT VTKSLNLQIGALHKERINFYIIQTSRHPVILGLPWLRRHDPHISWQKGQIL FT QWGSTCLHQCLDEITKLPVQTINIQSLQDQEFATLPTVYSDLTEVFSQQSA FT THLPPHRPYDCAIELLPGTIPPKGRIFPLSQPKSEAMNSYIQEELEKGFIR FT PSTSPAAAGFFFVKKKDGNLRPCIDYRGLNEITVKYRYPLPLVPAALEQLC FT QAKIYTKLDLRSAYNLIRIQGDEWKTGFSTTRGHYEYTVMPFGLANSPSVF FT QAFMNDVFRDMLDQWVIVYIDDILIYSNTVEEHIQHVRAVLQRLIHHHLYA FT KFEKCELHLTSVSFLGYIISAEGVSMDERKVTAVQEWPLPQTLKQLQRFLG FT FANFYRRFIRNFSTLAAPLTSMTKRSHAKLIWQPEAIQAFSVLKERFTSAP FT VLRHPNPELPFVVEVDASNTGVGAVLSQRQGIPEKMYPCAFFSRKLNSAER FT NYDVGNRELLAIKLALEEWRHWLEGAIFPFTVLTDHKNLEYLRTAKRLNPR FT QARWTLFFTRFNFTVTYRPGSKNTKADALSRMEERGNEVEQADAIIPDHLL FT LAPVQWDLITEITQFNHDNPPPANCPAGLTFVPPEFRQEVLKQVHDNPSAG FT HPGITATKNLVFNRFWWPALWKDVMGYVKNCITCQTTKSSHQVPAGLLQPL FT PVPERPWSHIALDFITDLPVSQGNTVVLTVIDRFSKACRLIPLPKLPTALE FT TAEAMCNWVFRFYGLPDDIVSDRGPQFTSRLWAAFCKNLNINVSLTSGYHP FT ESNGQTERLNQEITRFLRSYCHQHPSDWNRYLLWAEYAQNSLVKPSTGITP FT FKCILGYQPPLFPWSEEPTTLPAVDDWFRRSEEIWNRTHTHLLRAVRRQEV FT QANRHQRTGHRYMPGQWVWLSTRDLRNRNSCRKLNPRYVGPFKIIRQVTPV FT SFRLELPNHYCISPTFHVSLLKPAGGPREEIPLHDEQALPITIEGEMAYRV FT RELLDSRRRGGSLQYLVDWEGYGSEEQSWVNARDILDPTLTLEFHQRHPDK FT PAPRPRGRPRRQQITRLRSCSQGEGS" XX SQ Sequence 4323 BP; 1144 A; 1139 C; 960 G; 1080 T; 0 other; aatggatcca gaaatgcctg atgaatcatt acagaaaact tcgccacaag gggatccagc 60 aactctacta cagttatcgt ccgaattgtc cactcaagga gctcagttag cccttcatcg 120 acaacaactg gtacgactga ccactttaac cgaggagctt gttcaagctg tacaacggct 180 acatgtcgca cctgttaacc cctcggccgc tccttctcct cctcctccgt cgcccctcag 240 tatgcccctc gatcccctgc cacagtctgc agtaaatccc agattggctt tccccgataa 300 atttgacggc aatccctcca tatgtaaggg atttttactg caatgttcat tatttgttga 360 tcaacaaccg tttctttacc ccactgattc cagccgaatc gcctttgtat gcgctctgct 420 ttctggaaaa gccctggaat gggctacagc tatatggaag ccggagggta acacttttat 480 atcctttaag gatttcctga cacaatttcg agcagtgttc gaacaccccg ccgatgggaa 540 gagccccgga gagcagctgt tgacacttgt tcagggaagg cgaacagcag ctgagtatgc 600 actcgccttt cgtacactgg cagcccaaac aaactggggt gaggaccctc ttaagctgtt 660 ttttcgtaag ggactgtctg ctgaattaca gtcagaactg gcttgtcgcg aagaggggaa 720 aactcttagt gaattgattg aaatcacaat ccgactggat aatcttatca aagcatgccg 780 agtaccgcag ctgcgacgcc tccctgatct cgaaccgacg cagaacgaac ccatgcaact 840 cggttacact caattaactc cggaggagcg tgaacgcaga ttgcagaatc atctctgtct 900 gtattgcgga gaatctgggc atcgatgcat taattgcacg gtccgtccag ctcgatcggt 960 gagtctatca tcttctaaca tatcaagttt aactattcga ataagaatgt ctcaggcgga 1020 ttggggtatt gattgtacag ccctcattga ttcaggtgcc gctggtaatt ttatgtcatg 1080 ggaatttgct agacgtcata agttaacgtt gctaccatgt gaatctgtgc ttgcagtgga 1140 ggcgctagac ggacgtccta ttggaaatgg aaggatccag catgtcacaa aatcccttaa 1200 tttacaaatc ggggcgctgc ataaggaaag gatcaatttc tatataattc agacatctag 1260 acatcctgta attctcggtt taccctggct gcgccggcac gaccctcaca tttcctggca 1320 gaaggggcag atactgcagt ggggctcaac ttgtttacat caatgccttg atgagatcac 1380 taagcttcct gtccagacaa taaatataca gtcccttcaa gatcaagagt ttgccacatt 1440 accaacagta tattctgatt tgaccgaggt gttcagccag caaagcgcta ctcacttacc 1500 acctcatcgc ccctatgact gtgcaattga gctgctgcca ggcacgatac ctcccaaggg 1560 aagaatattt ccactgtctc agcccaagtc agaagccatg aactcctaca tccaggagga 1620 actagaaaaa ggattcattc gcccatctac ttcaccggcg gcggccggct tctttttcgt 1680 taagaaaaaa gatggtaatc tgcgtccatg cattgattat cgtggtctca atgagatcac 1740 agtcaaatat cgttatcccc ttcctctggt accagcagcc ctagaacaac tatgtcaggc 1800 taaaatctat accaaactcg acttgcgaag tgcctataat ctaatacgga tatagcaggg 1860 agacgagtgg aagaccgggt tttccaccac cagggggcac tatgaataca ccgtgatgcc 1920 cttcgggttg gccaacagtc catccgtttt tcaggcattt atgaatgatg tcttccgtga 1980 catgctggat cagtgggtaa tcgtctatat agacgacatc ctgatctact ctaatactgt 2040 ggaggaacac atccaacatg taagagcagt attacagagg cttattcacc atcaccttta 2100 tgcaaaattc gagaaatgtg agttacatct gacaagtgtg tctttcctgg gctacattat 2160 cagcgcggaa ggagtttcca tggatgagag aaaggtaacc gcagtgcaag agtggccact 2220 cccccagact ttaaagcagc tacaacgctt cctaggcttc gcaaacttct atagaagatt 2280 tattagaaac ttcagtaccc tagcagctcc cctcacctcc atgactaaac gcagtcatgc 2340 taaattaatc tggcaacccg aagctataca ggccttctca gtgctaaagg agagattcac 2400 ctccgctcca gtattacgtc atcctaatcc cgaactacct tttgtggtgg aagtggacgc 2460 ctccaacaca ggagtagggg cggtactctc tcagcgccag ggcatccccg agaagatgta 2520 tccatgcgcc ttcttctcac gcaagttgaa cagtgcagaa cgtaactatg acgtcggtaa 2580 cagggaactc ctagctatca agcttgcact ggaggaatgg cgacattggt tagagggagc 2640 gatttttcca tttaccgtac taaccgacca caaaaatctt gagtacttgc gcaccgccaa 2700 gcgcctcaat ccccgacagg ctcgctggac tttgttcttt actcgtttta actttactgt 2760 aacttatcgt ccgggcagca agaacaccaa ggcggacgca ctttcacgaa tggaggagag 2820 gggtaacgag gttgagcagg ctgatgccat tatacccgat catttactac tggctcccgt 2880 tcaatgggac ttaatcacgg agattactca gtttaatcat gacaatcccc cacctgcaaa 2940 ctgtcctgct ggtttgacat ttgtaccacc ggagtttcga caggaggtgt taaagcaagt 3000 gcacgacaac cccagcgcag gtcacccagg tataacagcc acaaagaatc tagttttcaa 3060 tcgtttctgg tggccagcgt tatggaagga tgtcatgggc tacgtgaaga actgcatcac 3120 ttgccagacc acaaaatctt cacatcaagt accagccgga ttattacaac cactcccagt 3180 tcccgaacgt ccctggtccc atatcgctct agatttcata accgatcttc cggtatctca 3240 gggcaacacg gtggtgctca ctgtcattga tcggttctct aaggcatgtc gtctcattcc 3300 tctgcctaaa ctgcccaccg cgctagagac agcggaggcg atgtgtaact gggtatttcg 3360 cttttatggg ctccctgatg atatcgtctc tgatcgggga ccccaattta catcacgatt 3420 atgggctgct ttttgtaaaa acctaaatat caacgtcagc ctcacctctg gatatcaccc 3480 tgagtcaaat ggacagaccg aacgactaaa tcaggaaatt acccgcttcc ttcgatccta 3540 ttgtcatcag catccgtcag attggaatcg ctatctcttg tgggccgagt acgcacaaaa 3600 ttcccttgtc aaaccctcta ccggaataac cccattcaaa tgcattctcg gatatcaacc 3660 gcctctattt ccatggtccg aagaacccac tacattacct gcagttgatg attggttccg 3720 aagaagtgag gagatctgga acagaactca tacacaccta ttaagagcgg tgagaagaca 3780 ggaggtgcaa gccaaccggc atcagcgaac aggtcatcga tacatgccag gacagtgggt 3840 gtggttatcg accagagatc tacgcaaccg aaattcctgt agaaaactta atcccaggta 3900 tgttggtcct tttaaaatta tccgtcaagt taccccagta tcatttcgtc ttgaactccc 3960 taatcattat tgcatttctc ccacatttca tgtttccttg ctgaaacccg ctggtggtcc 4020 acgagaggaa ataccactac acgatgaaca ggcgctcccc atcaccatag agggcgagat 4080 ggcttatcga gttcgagaat tactcgattc gaggcgtcgg ggtggctccc tccaatatct 4140 ggtggactgg gagggttacg gttcagagga acaaagttgg gtcaacgcga gggacatcct 4200 cgatcccaca ctcaccctag aatttcatca gagacatccg gacaagccgg ccccccgacc 4260 gcgcggaaga ccccggcgcc aacaaattac tcgcctccgg agctgctcgc agggagaggg 4320 ttc 4323 // ID Kolobok-N1_DR repbase; DNA; ZEB; 799 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous Kolobok transposons from zebrafish- a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW hairpin; Kolobok-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-799 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 8(10), 1345-1345 (2008). XX DR [1] (Consensus) XX CC This is a ancient transposon, built from copies which are ~90% CC identical to the consensus. The TIR is half of the whole length CC (~400 bp), making a perfect hairpin structure. The TSD is CC putatively TTAA tetranucleotide. Its termini are more similar to CC that of most Koloboks rather than to PiggyBacs. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 799 BP; 244 A; 152 C; 146 G; 251 T; 6 other; aggaacactc cacttttttt ggaaataggc tcattttaca actcccctag agttaaacag 60 ttgagtttta ccatttttta atccattcag ccratctccg ggtctggcgg gagcactttt 120 agcttagctt agcataaatc attgaatcgg attagaccat tagcatctca ctcaaaaawt 180 tcaaaaaaga attttrrtaa tttttctatt taaagcttga ctcttctgta gttacattgt 240 gtactaagac tgacrgaaaa ttaaaagttg ctattttcta ggttgatatg gctaggaact 300 atactctcat tctggcgtaa taatcaagga actttgctgc cgtaccatgg ctgcagcagg 360 cgcaatgata ttatgcagca ctgttaccta gctacctagc tggggactat tttcaggtac 420 tgcgtaatat cattgcgcct gctgcagcca tggtacagca gcaaagttcc ttgattatta 480 cgccagaatg agagtatagt tcctagccat atcagcctag aaaatagcaa cttttcattt 540 tctgtcagtc ttagtacaca aatgtaacta cagaagagtc aagctttaaa taggaaaatt 600 atcaaaactc tttgatcatt tttaatgaga tgctaatggt ctaatccgat tcaatgattt 660 atgctaagct aagctaaaag tgctcctgcc agacccagag atcggctgaa tggattcaaa 720 aatggtaaaa ctcaactgtt taactctagg ggagttgtaa aatgagccta ttttcmtaaa 780 taaaaagtgg agtgttcct 799 // ID hAT-N28_DR repbase; DNA; ZEB; 741 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N28_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-741 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1298-1298 (2008). XX DR [1] (Consensus) XX CC putative hAT transposon, TSD=8 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 741 BP; 234 A; 125 C; 128 G; 254 T; 0 other; ggtcccactt tatatagtgt cgcttatacc tgtgaactta cacggtaact agatgtgtaa 60 gtagtatgta actacagtgt atgtacgtac acactggtac atagtattta cttgtgtaat 120 actggtgtaa ctacacacat gtaacaacac actgaatagt atgtgtaagt tcaaatgtgt 180 aacaggacat taataacaca atctttggta aactacccct ttaatgtgaa ttactgatgt 240 gattctacaa tttaccttaa aacaacatat aaacataagg catttgtata tttctatata 300 cacttatact tttttgtttt ttttttttgg cagaagtttg ctgttgttaa acagttgcac 360 aaaggcttag gttttatata ttacacagta atttgtgacc attttacact ttatattgag 420 tggacagttc ctgaggattt accctgtaat tacactggta tcacaaggtg aaaccttcta 480 atgcagcgtt agttctgcca ttcttattct aatacagtgt tcagaagtac atggtatgca 540 aacaggtgtg gaagtgggaa aacacctgca agtatattgt tgttacacaa tgcacttatg 600 taattcttgt tacagagtgt tacaccagta ttacacaagt aaatactatg taccagtgtg 660 tacatacact gtagttacat actacttaca catctagtta ccatgtaagt tcacaggtat 720 aagcgactat aaagtgggac c 741 // ID CR1-7_DR repbase; DNA; ZEB; 2681 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-7_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2681 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1202-1202 (2008). XX DR [1] (Consensus) XX CC Consensus is built from 3 copies, which are ~9% divergent from CC the consensus. The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(186..1007,1607..1927,1931..2479) FT /product="CR1-7_DR_1p" FT /translation="FNIHFDNTTNTLYNXFKLILDSADLIQYVNFPTHSKG FT NILDLVCCSGVAPNNISISDHLISDHKSVLFNVSLSLGRVKPLRSVSFRNI FT KNIDHNILDEMINYSVNHCISTSTSDLMHFYNNSLSQILDITHTASYTNPS FT PWFTPELRHQKALGRRLERLGKSGLEIHKQMYSDHLAAYKTAIQGAKSAYY FT SSLINTGHSNSRTLFYTVGKLFRSHDSLSLDPTQCGPYSSFLCAKIDQIHQ FT GLLTSASSSYDNHYDHIPVDFSVSTVSNLTAHGCSLSHSWCSSGICIGSSF FT IIYILPLGHIFRSYGINFHFLAEDTQVYMSTKNAQSHPPSNLTKCLRDINM FT XMPSNFLKLYASKTEALLVGSKSVLSKAQSFTLSIILQSLQLVKSLGVILD FT GMLSFNSHISNISRSAFFHLRNIARLRPSLCQQSAEVLAHALVTSRIDYCN FT SLLSDIPFNQLHRLQLIQNSAARVVTRSRASEHITPLLFHLHWLPVFYRIQ FT YKILLLTYKALHNLAPDYLSDLLHWYTPRRSLRSSAAELLSVPNVKMKSFG FT AGLLAAQHCSYGIPCR*" XX SQ Sequence 2681 BP; 716 A; 614 C; 407 G; 934 T; 10 other; ttacagatag cagagcctac atcctatgaa gttttggttg cattacttaa acgttcgact 60 cctactattt tggttgtaat ttatcgacca ccaaaactat ctgctatgaa tatatttttt 120 agtgagcttg ctaatctatg aactcatatt tgtactctat cttcaaatat tttgttgatg 180 ggtgatttaa tattcatttt gataacacca ccaacaccct gtataatrat tttaaattga 240 tcctggacag tgccgacctc atccaatatg ttaattttcc tacacattcc aaggggaata 300 ttcttgacct tgtatgctgt tctggtgttg ccccaaacaa tatttccata tctgatcatc 360 taatctcaga tcataagtct gtacttttta atgtctctct ctctctggga agagtaaaac 420 ctttacgctc agtctctttc cgcaatatca agaacattga tcataatatt cttgatgaaa 480 tgattaacta ttctgtcaat cattgtatct ccacttctac ttctgacttg atgcactttt 540 ataataatag tctgtctcaa atattggata taactcacac tgcatcctat actaacccct 600 ctccctggtt tacacctgaa ctgcgacatc aaaaagcttt agggcgtcgc ctggaaagac 660 taggaaaatc tggtcttgaa atccataaac agatgtactc tgaccatctt gccgcttaca 720 agactgcaat acaaggagca aaatcagcgt actactcgag tcttattaac actggacact 780 cgaactctag gaccctgttt tatactgttg gtaaactgtt caggtcacat gactctcttt 840 cacttgaccc aactcagtgt ggtccttact ccagctttct ctgtgccaag attgatcaaa 900 ttcatcaagg attgcttaca tctgcttcat cctcttatga taatcattat gatcatattc 960 ctgttgattt ttcagtaagc actgtttcta atttaactgc ccacggttga ttttattact 1020 aaggagatca tgaaatctaa gacttccacc tgttcacttg atcccttaca gactatttta 1080 gttaagtcct gtattaattc actctcatcc ttcataacat ctattgtctg ttcttcacta 1140 tccactgcta ccrttcctty atctttaaaa atagcttctg tcactccaat actttaaaag 1200 actgrtgcag atcccaatga tctaaactat tttaggccaa ttgctaacct cccctttgtt 1260 tccgaaatac tggaatgaac agtagcagct caacttcact cacacctgct tactaataac 1320 atctatgagc actttcagtc tggattcygc cccaaacaca gcacagaaac agccctcact 1380 aaggttgtta atgatcttct cttagctgct gattctggct gtttgtstat acttattctt 1440 ttagatatca ccgctgcttt cgataccata aatcattcaa tattactgga tccactggcc 1500 tcaattggaa tcgctgataa tgccctctgc ggctctgtca caatcttact taagtgacag 1560 aaaacagttt gaggtaaaga atccaggtaa agaacagtcg ctctgatgca gtctcagtca 1620 ctcatggtgt tcctcaggga tctgtattgg gtcctcattt attatttaca ttttaccttt 1680 aggtcatata tttcgttcct atggcataaa ttttcacttt cttgctgaag atacgcaagt 1740 ttatatgtct actaagaatg ctcaatctca ccctccctcc aatcttacta aatgtctgcg 1800 agatattaat atgtgkatgc ccagtaactt tttaaagtta tatgccagta aaactgaagc 1860 ccttctggtt ggctctaaat ctgttctttc taaagctcaa tcctttactc tatcaataat 1920 tctccagtaa agtttacagc tcgttaaaag tctcggggtt atactagatg gcatgctttc 1980 gtttaattca cacatcagta atatttcacg atctgcattt tttcacttac gtaacattgc 2040 aagacttcgc ccttcactat gccaacagag tgctgaagtg cttgctcatg ctctggttac 2100 atcaaggatt gattattgca attctctcct atctgatatc ccttttaacc aacttcacag 2160 attacagtta atccagaact cagcagctag ggtagtcacc cgctcgcgtg cttctgaaca 2220 cataactcca cttctctttc acttgcattg gcttcctgtt ttttatcgta tacaatataa 2280 aattctttta ttaacttaca aagctcttca caacctggca cctgattacc tctctgatct 2340 tcttcactgg tacactcctc ggcgctcact gaggtcttct gcngctgaac tattgtcagt 2400 tccaaatgtt aagatgaagt catttggtgc ggggctttta gctgcacagc actgcagtta 2460 tggcattccc tgccgttaga tattcgtcag gtggactcta tctccaattt taaatcccag 2520 gtaaaaacct ttctgttcgg gattgctttt aatgatttta tggattaaca ctatgattta 2580 ttactcttct gtcttgtttt ttataatgtg cctgtgacat actgtaargt gtccttgagt 2640 gctctgaaag gcrcctataa ataaagggta ttattattat a 2681 // ID BEL32-LTR_DR repbase; DNA; ZEB; 606 BP. XX AC chr15; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL32-I_DR; KW BEL32-LTR_DR; BEL32_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-606 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1408-1408 (2008). XX DR Genome; chr15; Positions 43273648 43273043. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 606 BP; 177 A; 88 C; 124 G; 217 T; 0 other; tgtaagtgct gttattaagc atttttgcat aattgcatat ttattttaca tatttgtttc 60 atatttcatc ctttagtaat ttaaggtcat aatattttgt attaatttag ctgaggtgta 120 tttataaatg taaacgtgtt gaaaggttca aatgtgttat tatttcatat gttgtacttt 180 aagacgttta ttatggaaac gtgtcacttt gaggaaatga cgtgtaatgt gagtgacttg 240 tttatggatc cccggataga ctgacgatga tcagacgcat gtcagtgctg tgtaaagtta 300 catgtgttta agaagcataa agttgctgct gtcataattc aaaggacagt gaagaagact 360 ttattaatcc aaggaccctc ttttcatgac aagcgaccaa cgaggtattt gttttgctta 420 ataatgtatt agataatcgt agtttgtata attgcttata cgtttgtact gtgttcattt 480 actaatacag ttctcacggc gtcacggcgt cacggcaaca gccgcaaaga acattatgga 540 cgaccttaaa taaacgcccg ttctggagaa acaacctgtg tctgtgttgt tgtgaagaga 600 gttaca 606 // ID TE-2-1_DR repbase; DNA; ZEB; 4035 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 03-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous transposable element from zebrafish - a consensus. XX KW Transposable Element; Nonautonomous; 2-bp TSD; TG-CA; KW minisatellite TSD; TA minisatellite; TE-2-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4035 RA Bao W. and Jurka J.; RT "transposable elements from zebrafish."; RL Repbase Reports 9(2), 566-566 (2009). XX DR [1] (Consensus) XX CC This transposable element is characterized by 2-bp TSD. In CC majority cases, it inserts into TA minisatellite site; in a few CC cases, the TSD is revealed only 2-bp long. It lack any detectable CC TIRs, but the termini are 5-TG and CA-3'. XX SQ Sequence 4035 BP; 1341 A; 737 C; 623 G; 1332 T; 2 other; tgagcaattc catgaaaatg tcaaccttat catgaaaaaa taaacttttg accaaaataa 60 caaaacctat ttcaatttgc ttattaaggt ctatatttta ttgtaatgat gtgctcttgt 120 tgaattttta aggaaattgg tttgtttatc catgaaatat gaggattgtg ccaatggtaa 180 atttcattat catccaacca cttttcgtgc tttcaaaaaa ggggttaaag actcaccttt 240 tccatgcttt atgttaacag taatcattct gcagaaggat tggagtctgt agtataaata 300 attaccctta ctttttcata acattattgc ccttttgatt tatgagctac atatttaaga 360 catcacgcag cttcggtcac gtgtcacgtc cgtaacgatt taatgaaaaa gtttatattg 420 tggaataaat attataacaa ataaaatgaa actttgtata caaagccaaa atatgtccat 480 actaattatg atcgccaact catctcaggt ctttgctcta aacaaccaca attttgcgtt 540 acggacgtga ccatttttga actccttatt aatataagcc aatgactagc gcatatttgt 600 tcattatgat gctgcattac aattactgac ttacattaca taacatccat atgttatatc 660 aagaactgaa catttcattt aactataaac cactttagta acatttcaga aataaatgtt 720 taataaagaa aacaacattt tacatattgc cataatcttg gcgcataatc ttggtgtttt 780 ggcgatgcag caccttgtgt tgaagcactt cttgtttctt ttgtagatag ggtaactgac 840 cgtcaaataa tagatcaaat agttaagata caatttatgc caaactagga atgctgattt 900 cgattcattt tgccaaccaa caaccgctgg ttaascgaat attaaccgtt aacagattaa 960 tatcaaaata ttatttaata aaaaaaattg acgtgcctat tttgtctgac acattaaata 1020 ttgcatttta aaatactgat ttaggttatt ttaacatgcg aacagcatac agaacgagca 1080 tcttcacgca cctgcagtgt tttttagcag catagaaaaa ctgaataaac taacaaaaaa 1140 taaataaata aataaatcct gctctagatc tttttcactt aaatgacaaa ttaagattac 1200 aaaacgaaaa cattaactga atccttttta agaaccggca taggctgttt gtccaatcgt 1260 gttttttttt ttccgtcaat taaatattag gctacctcaa atagattgct ttacgaraaa 1320 aaatgcattc aattaatgct ccgctgttat aatattacaa aacctgtcaa catttttaat 1380 agaagtcatt agtttaaaga ttatgtctta attatgatta tgacttaacc tatttcctcc 1440 gtcttacttg cggttcatgt gcgcacgcgc tgcctatagt gaaagataat gacaatgaac 1500 cataaggctt tgaacctata cattttgtaa agtataggct aagcatataa caattttgtt 1560 atttacatgt gatactgcat taatttgcat acatgtttta taagctgtaa aatgaaagca 1620 tcagtttggt gcgaagttat cataaaatat ttgtctttgt ttgatttgta gattaggcta 1680 tttaaaatgt atataagaac tattaataac tgttattttc caaatggaaa aaagggtggt 1740 ttgttgattg tagtctgcac taaatcatga atatgcctag ttgagtttat gacgcggctc 1800 caggcagcac agacgaccag gatcagcgct ggttcgtcaa ctcctgtgtc aaactgtggc 1860 cagaatatcc ttctttcatt ttgctttttt ggcaaaaaac atctgtaagc acgtgtgttt 1920 gcacgtgtta ctgtcatgcg agttaacaaa tatgtgttgc tcatagaagc cgatcacaac 1980 gaacgcgctt cttcattcca gagacgtgag gcgctccgca atgcatttta tgttgacaag 2040 aagaaaagga gcacggtgct cttttgaatg tcactataga taacaactga atcagctggt 2100 ctattgtgcg cgagtgttgc tgtcaatgtt ttcataattt aattttttaa ataacattat 2160 gaaattcaag atttgtaaag tcatacaact ttggatgact taaaaacagc gaccacaagt 2220 ctctactccg tcattaaaaa gttcgccgta gtcatcttga cactgcactt tgtcacctcc 2280 aaggaaaccc cgcctctgct ttcgtttgat tgaagaatga aagacgccac tgaggtaaca 2340 tgcttttctt ctggagcgca caaagcgcgc aatgacaaaa cgcgaggcgc gcatgcaagg 2400 cgcataatca ataacctcca tgcatttggt cttttcccac tgctcaaaat gcgcacgccg 2460 cgggtcaggc tgcaggcact tgcacaggcc acagcggagt tttcatttca tgcagagcat 2520 aaatatgttt ttttatttca ttaattaaat aactatttat aaagataaat tatcaaagcg 2580 acacaattca ttttccaagc cctttatcca aaatctaagc actttcaaag cttgaaaatt 2640 ggctatagcc tacattaaaa ttaaaacatt ttcaaagatt tccagcaccc ggcaaatgcg 2700 cttgagtgct tattaaaatt attttacagg cataagaata agaataaaat aagaataaaa 2760 tcttgatatt gatctaagct ttaactacat tatatgagta ttaaatttgt tagaaaatcg 2820 ataggacctg cagctgtaaa tatttaattc aatattttac acaatcggtt atgaactata 2880 gcctaaaaat ataaacatta tagtcatcca tcgtacatgc ctacaaccaa tgactatttt 2940 tctatgcttg tttaacatag cctacacaaa acctttctta agttttaaag ttttctctct 3000 caccctggtg ctctgtatga actttttgca acctgtcctg aagtgactgc cgacgagaaa 3060 aagaaagcaa gcgcgagtgg agaaattaaa ttatttattc catcaactaa cacattaact 3120 agaataaact aaaataaact ttgtcaaata taatattgcc atatctgtga caaaactaga 3180 ttttaaatat tctccacgta gacacggtct tgcgcactaa cagtcgggca atgatttaat 3240 tcaagtttgt ctggaaacac taactttagg agcctgctaa acataggccc attatagctg 3300 attatgttta ttatcaattc tatattattc agagtataaa agcaccatgt tagcatgcaa 3360 cattgttacc gaattgagtt ttgaacattc gaacgtgcgt taacctttaa caacacacat 3420 gccataatac gaattaaata taacaaggtg atatctattt aagtttgatt aattgtgcaa 3480 ccttaccatt cagatgtctc agaggttaaa gtttattggt ttacacaccc aacagatatc 3540 cacaaatagg tctttggtca agttttgaaa gcatttatct gattttcata atgtgtcaac 3600 tgtcaatttc agagcagtca cgtccgtaac gaagacaagt cacgtccgta acgatagttt 3660 ttcctcatta atgcaaaaat gaaaaactaa ataaaattaa ttatatatgt cccatgcaga 3720 gccaaccctt atccttttga ctgttatata tttttttggc ttctgcattt tcattcacag 3780 aattttcaca aagtatgtca actcacaaaa ctcatgtctt ttttttgtca cgtccgtaac 3840 gcattcattt tgccgtcatt ttaaatataa aacacctgca tagactgata tttttttgat 3900 gcctgaactc catggttaag gggtaagaaa aatataaaca tatatttcaa tatactgtta 3960 acatatacct tctctgagaa tatacgtttc atttttttac tgcaaatgtc acgtccataa 4020 cgctggaatc agcca 4035 // ID Gypsy121-LTR_DR repbase; DNA; ZEB; 327 BP. XX AC chr19; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy121-I_DR; KW Gypsy121-LTR_DR; Gypsy121_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-327 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1279-1279 (2008). XX DR Genome; chr19; Positions 39637367 39637041. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 327 BP; 54 A; 75 C; 92 G; 106 T; 0 other; tgtgacgcct gctgcgctgt catttgtctt ccgtgtctgt ctgtgtgtct gttttgctcc 60 ctctacagat gcgctgtaac attgcggggc ataatcaggt ggatggggaa cacctgagtc 120 taagtgagcg ctcctttatc agcgtcctgc cactgtttgg gagtatgatt cttccggcta 180 agggttggga cggagaccag cggcgtgcga gttggcgggg cgttcttgcc taatgtttga 240 tcacagtggg cttaaataaa acttttgtaa cgtgattttt ggcttccccc tcttattttt 300 tacttgtttc gccccgaggg cgtaaca 327 // ID L1-16_DR repbase; DNA; ZEB; 4805 BP. XX AC . XX DT 11-MAR-2009 (Rel. 14.03, Created) DT 11-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE a non-LTR retrotransposon from the L1 clad - consensus. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-16_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4805 RA Bao W. and Jurka J.; RT "L1 non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 691-691 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(717..1178,1168..1575,1579..4500) FT /product="L1-16_DR_1p" FT /translation="MTTNNQTMNIVSWNVRGLGHVIKRSRVFAHLKSLEAD FT IMFLQETHLNAANQCRLRANWISQVYQAPFNAKARGVAILFRKTVVFRLSS FT MVADPQGRFIMVIGHINSLPITLLNIYGPNFDDPNFFRKIFSIIPDTNTTN FT LIIGGDFNCYLDPYGTHMDRLSTKPPPTITSVQMLHNLIKSRNLVDIWRLQ FT HPTDRDYSFYSHVHKSYSRIDYFLVASELISNIGKTKYNNIIISDHSPVTL FT QLENILPKRNYCWRFNPMLIQDPTFLSYLNDKMDEYFSTNDNGEVSDSVWD FT TLKAVMRGHIISYESSKKRALNSRLQEIENILPNYEQAYKKSLSNSDYNKM FT LKLKYEYNSILNRQISNLLLKLKQKHFELGDKPGKLLARQLRGEQANKVIH FT MVKTKTGDLITNPDEINGCFREFYSDLYTSKANHNDPNYDNFFSKLDMPKL FT SEVSKLELESEILIEDILKAIHAFPSGKAPGPDGFGCEFFKTFSKKLAPYM FT LRMLQDSIKKNTLPDSLYDANICVILKKGKPETDPANYRPIALLNFDQKVL FT TKILADRLARHISTIIHPDQTGFIPGRFSFCNVRLLLNILYAKDKLTNKPA FT AILSLDAQKAFDQIEWPYIFETLRQFGFGDKFINWIKIVYLRPSSSVLTNG FT IRSNPFVLQRGVRQGDPLSPLLFNIALEPLAVGVRTNPHIQGISLGNAESL FT ISLYADDVLLYISDPVSTIPKILNYIDSFSKLSGYSINWEKSEFMPLTNNL FT SDTFLNNLPFKIVNNHFTYLGLNIPKNPKHLFKLNFLDMINRIKANIERWK FT LLPLSLIGRVNVIKMVVLPRFLYLFQNLPIYLPMSFFKQIDSIILSFIWNY FT KIPRIAKAHLQKNTDSGGLGLPIFKHYYWAANARALTFWQRGVLEDVTYET FT SPLWVYAEAISVPESSLPTLLFTECKLIKKQGSCNFVVRNSLRILNQINKF FT YKLPKVSVNTPISHNPSFKPSLIDSTFTAWKRKGLSRIGDLYIDNQFASFT FT QLKDKFSLPQSHFFRYLQIRNYVKENISQFINKPESHIIQKLLSSNPNTRH FT LISSFVRVFEAPICTKHIKIAWAMDLNEEVADEIWEEALSRIKKASINSRH FT KLIQFKVLHRLHFSKTRLSKIFPSVSPKCDRCGRAEGTLAHLFWFCPVLYE FT FWSSIFELFSSAYGTTIQPDYNLAIFGYSNIMDTFPSTQQQALMVGMITAK FT KMILLNWKSPKAPCFKRWLNELLYVIQMERLRLETSQKKAETTWGPILRLL FT GI*" XX SQ Sequence 4805 BP; 1523 A; 931 C; 853 G; 1498 T; 0 other; atggcgcgct caaaccaaga ccgtcggagg gagaaaggcc ccgcatcatc atcacccgtg 60 tccatttcta tcaagaaaaa gaactgatcc tctggctccg tcgcgatcgc actctagcgt 120 acaacggtca caagatattc atatttccgg attataccgc agaagtgatg gagctacgga 180 gatcatttgg tgatgtgatg ggagagctca gagagttaaa agtgtcacac agtctccgtt 240 tttctgcgcg cctcagcttt cagcacaact aacagctcgt gaccttctca gaccctgcag 300 aggcggctaa atatgtgaca tccaggctaa gatagaggat catacacaac gctgatatca 360 caatcaaggt actttttaat gctgcagagt aagtcttgtt tcggtttaac taagatgacg 420 tacattatac attaatgggg acggcctcat tattaccaga aacgtggtaa tggagcggaa 480 cggaaccacc ggagctggga tatatcgtca cctccatgcg aatggtgttc tccagtccaa 540 gaaaggatgt agtgcttcat gttagggaag cctgttacgg gggagggttg ttggacttta 600 agtcatgaaa agttccactt gttcagggtt tcggtatgtt tttttttttt ttttcttcgt 660 agagggagag gggtcgtctc acctcagtgt ttcattttct actgtttaaa gggtagatga 720 ctactaataa tcaaaccatg aatatagtct cttggaatgt gagagggctc ggtcatgtaa 780 ttaaaaggag cagggttttt gcacatctga aatcacttga ggccgatata atgtttctgc 840 aagaaaccca tcttaatgct gccaaccaat gtagactcag agctaactgg atatctcagg 900 tatatcaggc cccgtttaat gctaaagcta gaggtgtagc tattctcttt cgcaaaactg 960 tggtatttcg cctctcatca atggttgctg atcctcaggg aaggtttata atggtgatcg 1020 gtcatatcaa ctcgctcccc ataactttac tcaacatcta cggtcccaat tttgatgacc 1080 caaatttttt tcggaaaata tttagcataa ttccagacac aaataccact aaccttataa 1140 taggtggcga ttttaattgc tacttagacc catatggata gactttctac aaaaccacca 1200 cccacgatta cttcagtgca aatgcttcat aacctaatta aatctagaaa tctagttgat 1260 atctggaggc ttcaacatcc aacggataga gactactcgt tttactcaca cgtgcataaa 1320 tcatattctc gaatcgatta ctttcttgta gcatctgaac tgatttctaa cattggcaaa 1380 accaaatata ataacataat aatatctgac catagccctg tcacgctcca acttgaaaat 1440 attctgccaa agcgtaatta ttgctggcgt ttcaacccta tgttaatcca agatccaact 1500 tttttgtcat acttaaatga taaaatggat gagtatttct caactaatga taatggtgaa 1560 gtctcagact cagtctaatg ggatacttta aaagctgtaa tgcggggaca cattatttca 1620 tacgaatcat ctaaaaagcg tgctttgaat agccgactgc aagaaattga aaatatactc 1680 ccaaactatg aacaagcata caaaaagtca ctatccaatt ctgattataa taaaatgtta 1740 aaattaaaat atgagtataa ttctatccta aatagacaga taagtaacct acttctcaaa 1800 cttaaacaga aacacttcga gctgggagac aaacctggaa aactacttgc aagacaacta 1860 agaggagaac aagccaacaa agtaattcat atggtaaaaa caaaaacagg tgatttaata 1920 acaaatcctg atgaaataaa tggttgtttt agggaatttt attcagattt atacacttct 1980 aaagctaatc acaatgaccc aaattacgac aactttttct caaaattgga catgccaaag 2040 ctcagtgagg tctctaaact agaactggaa tccgaaatat taatcgaaga tatattaaaa 2100 gcaattcatg cctttccctc agggaaagcc ccaggcccag atggctttgg gtgtgagttt 2160 tttaaaacct ttagcaagaa actagccccc tatatgctga gaatgctgca agactcaatt 2220 aagaagaata cacttccaga ttcattgtat gatgcaaata tctgtgtaat tttgaaaaaa 2280 gggaaacctg agacagatcc agccaattac aggcccattg cccttttaaa ttttgatcaa 2340 aaagtgttga ctaaaatatt agctgataga ctggcacgac acatttctac aattatccat 2400 ccagatcaga caggctttat tccaggaagg ttctcattct gtaatgtccg ccttttactt 2460 aatattttat atgcaaaaga taaattaacc aacaaaccag ctgcaattct ctcattagac 2520 gcgcagaaag cctttgatca gattgaatgg ccgtatatat ttgaaacgtt aaggcagttt 2580 ggatttggag acaaatttat taactggatc aaaatagttt atctacgtcc atcttcatct 2640 gttttgacaa atggcatcag atctaacccc ttcgtattgc agcggggagt acgtcagggt 2700 gaccctctat cccctcttct ttttaacatt gcacttgagc cattagcagt aggagttaga 2760 actaatccac atatccaggg tatttctctt ggcaatgccg aaagcttgat cagtctatat 2820 gctgacgacg ttttacttta catctctgat ccagtttcta ctatccctaa gattctcaac 2880 tatattgata gtttcagtaa gttgtcaggt tattcaatta attgggaaaa aagtgaattt 2940 atgcctttaa caaataattt aagtgacact tttttaaaca atttaccttt taaaatagta 3000 aacaatcatt ttacttatct cggcctgaat attccaaaga accccaaaca cttattcaaa 3060 ttaaatttct tagatatgat taacagaatc aaagctaaca ttgaaagatg gaagttgctc 3120 ccattgtcac taattggccg ggtgaatgtg ataaagatgg ttgttctgcc aaggttctta 3180 tatctttttc aaaatctccc gatatattta cccatgtctt tttttaaaca aattgattca 3240 attattctct cattcatttg gaactataaa atacctcgca tagccaaagc ccatttacaa 3300 aaaaatacag atagcggagg tcttggcctt ccaattttca agcactatta ctgggcagca 3360 aatgccagag ctctcacatt ctggcaaagg ggtgttctgg aggatgtgac atatgagact 3420 tccccattat gggtttatgc cgaggccata tctgtgccag agtcctctct cccaacattg 3480 ctttttacag agtgtaaatt aattaaaaaa caaggaagct gtaattttgt ggtaaggaat 3540 tcgttacgga ttttaaatca gataaataaa ttttacaaat tacctaaggt ttcagtcaat 3600 acgccgattt cacacaatcc ttcatttaaa ccttccctga ttgacagtac ttttacagca 3660 tggaaaagaa aagggctatc tcgcatagga gatctttaca tagacaatca atttgcatcc 3720 tttacacagc tgaaggataa gttttctctc ccgcagtctc atttctttcg ctatctgcaa 3780 attagaaact atgtaaagga aaatatctca caatttataa acaaaccaga aagtcatatt 3840 attcagaaac ttctatcctc aaaccctaat actagacacc taatatcaag ctttgttcgt 3900 gtttttgagg ctccaatatg cactaagcac attaagatag cttgggccat ggatttaaat 3960 gaagaagtgg ctgatgagat ctgggaagaa gccctgtcaa gaataaaaaa agcttcaata 4020 aattcgagac ataaattgat tcaatttaaa gttttacatc gccttcattt ctctaaaact 4080 agattaagta aaatctttcc ctctgtatct cctaaatgtg acagatgtgg tagagcagag 4140 gggactttgg cacatttatt ttggttttgt cctgttcttt atgagttttg gtccagcatc 4200 tttgagttgt tttccagtgc ttatggaacc actatccaac ctgactacaa tttggctatt 4260 tttggatatt caaacataat ggacactttt ccttcaaccc aacagcaagc cctgatggtt 4320 ggcatgatca cagctaaaaa gatgatctta cttaattgga agtctccaaa ggctccctgt 4380 tttaagagat ggttgaatga gctgctttat gtcattcaaa tggagcgact acgtcttgaa 4440 acttcacaga aaaaagctga aaccacatgg ggacctattt taagactctt gggaatctag 4500 cattttacct ttggaaattt tcatctctaa ctcatatatt catgacactg actttgtaga 4560 aatgtgaata ctgtcatctt tttaaaaaat tgtttatgct acatatgtca tgtgtatact 4620 tatgtatatt tgataactga gacgattcct cccttttttt ttgttttttt tttgaggggg 4680 tgggatgggg ctaatgtcaa ttattttgta ttgtatttat tattattttt gtattattat 4740 ttgtattgta ctgtgtttgt taaaaatctt ataaaaatgg taaataaaac aaaatattac 4800 aaaat 4805 // ID SINE_DR1 repbase; DNA; ZEB; 257 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish non-LTR retroelement (SINE) - a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; SINE_DR1; KW SINE_SM. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-251 RA Jurka J. and Drazkiewicz A.; RT "SINE_DR1: a SINE retroelement from zebrafish."; RL Repbase Reports 2(3), 10-10 (2002). XX DR [1] (Consensus) XX CC The closest related element is SINE_SM from Prosopium CC spilonotus (see references under SINE_SM). XX SQ Sequence 257 BP; 76 A; 55 C; 40 G; 84 T; 2 other; ttatgaagtg tgcttcacac aggtgagtgg gcttgacaaa ccacctgtag aaacactctt 60 ctttctctnt aaataaaaaa aataaaaaac wctccccttc ttactctagc acttaattct 120 ctgagcacta acagttcctt tgtataatta gcacttcttg tgtgtattgc ctcttcttgt 180 tgaatcgctg aatgcctcct caattgtaag tcgctttgga caaaagcgtc tgctaaatga 240 ctaaatgtaa atgtaaa 257 // ID hAT-N36C_DR repbase; DNA; ZEB; 1917 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 04-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N36_DR; hAT-N36B_DR; hAT-N36C_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1917 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(2), 543-543 (2009). XX DR [1] (Consensus) XX SQ Sequence 1917 BP; 653 A; 325 C; 344 G; 595 T; 0 other; taagggtgga gccgaacccg aatacggtat tcggaaaggc acgaatagcg tgtttttacg 60 aatacttgat tcgaacaaat acttgaaaaa ttatttgtat tcgggagcaa gaaaaacact 120 atatcaaaaa gcagcgtttc ctcatgagac cacagtgcat gcccgcgtga gtgagtgagt 180 gagagagaga gagagagaga gagagagaga gagagagaga gagagagaag agagagagag 240 agagaggcaa aacgcgccaa agcccgaaac tgaaagcgag acgtgacttt taaggggttg 300 tttcatatgg atttattaat cattcttact gttcagtgat cgcaaactgc cgtagtttat 360 taaagacgca aacctctcac tgcacgtcag ctgcgcgcct tcagcagacc tcctcattcc 420 tgcagcacga gagctttatg attgtttatg cgcgccaaaa gtggcggatc tgtccggtaa 480 aatatctgac tgcgtgtcac cgcatcccta aggactgttt ggcgaaatat ttgactgcat 540 gtcactgcat aacaaacgac tgaaaggata taactagaga actctccact gtgctactgg 600 gtgagagcgt atggcaagcc gttttagcat ttaaaccttg ttcagactat ccataaatat 660 atttagagat atctctaatt atattttgac ttgtcataat tataattcga ctagtcagat 720 tggaaatatc tgcaaatata ttatgataga ctagtcatat tcctccattg acttccattg 780 aaaaatattt gcagatatct ctaaataagt tttgactagt cacaattgta attagagata 840 tctccaaatg aattttgact agtcaaaaat ccaattcaag atatctacaa ctgtaattag 900 acttttagta aattaattag agatatcttc aaatatattt gtaaatatct ctaaatatga 960 tgaattaaag atatctctaa atgtattatg actagtaaaa actcagagat atcaattaca 1020 attgttacta gtcaaaattc atctgatttc gtactagtac aattgtaatt gcagatatgt 1080 ctaattgtca ttctgagtag tggaattata attatgacta gtcaaaatat aattagagat 1140 atctctaata tatttatgga gtcagaacaa agatttaaat gctaaaaagg cttgccatag 1200 agagcgcttc tcactgaaca gagcaccgcg caaccttcta tgtaatgtat tatcacatgc 1260 actaaaagca tcctaaaagc atcaacacag ccacattctg ccccagcaag tcagtttcaa 1320 acataaaaaa aaacaaacaa acaaactttg tgtctttttt tgtggctggc agatgataat 1380 agcagggctg tatcttttag tattatatag aatacttttg ttctgccaga tctcccgggc 1440 agggttttat ttagatttat ttagttaata ttagtttttg gaattctgtc cattggaaag 1500 aagttctttc agaagaagaa cagttttttg aagtattatt tgtttttatt ctgtctattc 1560 agtgtccttc aacaagaacg gtggtggtgg ggttcataat attcacaatg gtattttatt 1620 agttgttggt ttaaccatgg atgagataat ggcttctatg ttattttctt ttcaatgaca 1680 tttcttatca catgttcaaa aacaacagcc aatattgcat atctataatt tatataatgg 1740 gtataattaa acaatacttt cactataacg taaattagaa gtgtaataga aaaaatatat 1800 atttttgcgc cattttgaat tttactcgaa tacaaataca aatacaaata caaatacttt 1860 tcccccctca acaaatacaa atacaaatac aaataccggc tgctccgcac atcccta 1917 // ID DNA-8-3_DR repbase; DNA; ZEB; 1163 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DNA-8-3_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; 2-bp TSD; GTNNNNAC; TSD 8-bp; KW DNA-8-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1163 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1242-1242 (2008). XX DR [1] (Consensus) XX CC The elements of this family are ~90% identical to the consensus. CC The TSD is 8-bp, showing a GTNNNNAC pattern. The TIR is not CC obvious. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1163 BP; 372 A; 232 C; 247 G; 309 T; 3 other; catagactgt aaaagatatg gacgtagtat ccgtgacgtc acccataggt ttctgaagag 60 cgcaaaagaa gctacaagta ggcgtggcca accgtcgcca ttttgttcgc gcgtcatcgc 120 accgaccggg ggataccaaa caagggcaaa gaggcggagc gtgagcggag ctacagacgc 180 ctgctagcat tttgcttaga cctggcttaa gacaggcttt tctttgggag aaacgcttaa 240 tacttcatta cctgcgactc gtttgtgttc tgaccacatg tgcttggttg tacactatat 300 caataaagtg tttagacttt taaaaacact gttgtaatac attgagccac taaacattgt 360 tcttatgacg tttttctaca ggaggaaaac gcgaattact tccaaatact tcaaatatag 420 tctgtgttag taaatgcaag gctattgatg aaatccaggc ataacactgt atgacaacag 480 tttcagatga ctgttctaga gcctacagct aatcaatctg tcagattctg gagtgcatta 540 cagctctaaa gaaaattata aatgataaat gatcttaaat aaaacaaata catttataga 600 gatggtatat aagtatataa tttcactcac ctgggaaatg gaggccacgt gaatggtttg 660 tgagcacaat taagtgcaca cagcatgcca tatcatctga taattgtagg aaataattcc 720 aaaaggcaat tgactgtgta aagccacata aaacaaaaca aaaatacgat gtatatgccg 780 agttcagcgg ctaatcagcc ggaatcagct gaggtgacgt gacagcgacc agcgagacct 840 agctgtcact caagtggcca cgcccttaat tatgcagact taatataact taataaaaac 900 gaaacggatg agttataaaa aaaaaatctc ccccccctca cagttgtcat gaagggtaat 960 attagctata tgcaccaaaa ccattttttg taccaggctg taaacatgtt tttatcagct 1020 gtaaagttgg ccaatttaac attgcrgtca atagaaatca kctggatttc ctggagccag 1080 cccccaaagg sgagtttcga tgaattgcag tttcagttac ttccgtattg gcttcacgag 1140 ggagagcggg aggttgccgc ttg 1163 // ID RTEX-1_DR repbase; DNA; ZEB; 4732 BP. XX AC . XX DT 23-FEB-2009 (Rel. 14.02, Created) DT 10-NOV-2010 (Rel. 15.12, Last updated, Version 2) XX DE RTE-like non-LTR retrotransposon - a consensus. XX KW RTEX; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; RTEX-1_DR. XX NM RTEX-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4732 RA Bao W. and Jurka J.; RT "RTE-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 563-563 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1..792 FT /product="RTEX-1_DR_1p" FT /translation="SFGTPSHIIIHTGTNDLRAQQERVGQLVCRVAEKAAE FT TCPNAKITISTLLPRRDIHPDTINRVNADISRGCALLPNVHLTHHTSITVR FT DLYDHVHIKKDKVNVFAKALKDTAWGRQTSAHTTNRLSPPYHMENMKQPPP FT GHHWGPPPHTRHFQPTQAQQQRPRPSSGHSQKTQTMPGPPQRETPTAGHHQ FT RPTPSNRLNPRQVNNSRRSPQHPTTSTTAADSTPRPALLPQPRNYAQALKG FT PAKPLEMGEIRQLLQYISAQLT*" FT CDS 906..4364 FT /product="RTEX-1_DR_2p" FT /translation="MTLSISLWNIQGLKSSAFGLKSLNTEFQANIKNKDII FT ILQETWSKTNTTTHCPPNYREIILPSQKLNTTRQGRDSGGQIIWYNSKLHK FT YIDTVKTGKYHAWLKIHKDLLSSQKDCFLCAIYIPPSESPYYSEDMFDTLE FT KETSHFQAQGHVLICGDMNARTGQQPDFTNAQGSKYINSNLPGIQTSFSHL FT HRNNHDHIVNKSGKELLQICRSLGLYIVNGRIRGDRLGRFTFCSPLGNSTV FT DYMITDIDPSSLRAFTVRELTPLSDHSQITLYLKKTITNPCTQPNKLFNIR FT KPYRWAENSAEDYQNAVNSPKTQQILDNFLVNAYAHTKQGVNMAVKDINNI FT FESTAKQAKLKVKTRQNNPPKNDKNWFDQECLSIRKHLRNLSNQKHRDPNN FT AEIRLLYCETLKQYKQTLRTKKAQYTQKQLTTIENSINSNQFWDNWKNLIK FT NDHEELPIQNSEIWETHFQTLFNKVETDTNPKQNQITKTLATLESTIKDNQ FT NPIDFPITIMELKDKIKSVKPKKASGPDGILNEMIKQTSPKFQSAILKLFN FT LVLSVGHFPDIWNQGLITPIFKNGDKFDPNNYRGICVSSNLGKLFCSLINA FT RLLDFITTHNVLSRSQIGFLPKYRTSDHIYTLHSLIEKHTVQNKGKIYACF FT IDFKKAFDSIWHQGLLYKLIESGIGGKTYDLIKSMYTESKCGIKISTKRTK FT YLSQERGVRQGCCLSPTLFNIYINELALSLERSTAPGLDLHDSQIKCLLYA FT DDLLLLSPTEQGLQQNLQLLDQYCQTWALTVNLNKTKIITFQKRARAQGTQ FT HTFTLGTNQITHTTQYNYLGLNITSTGNFNPAVNELRDKARRAFYAIKRQC FT PIDIPVQIWLKILESVIEPIALYGSEVWGPLTNPEQDLAKWDKHPIETLHT FT ELCKNILHVHRHTTNNGCRAELGKYPLIIKIQKRAVKFWKHLKLSDPDSYH FT YKALQDRELSRRADPLSQLAQSFRVSETSPEELNTLLPLTQITHQIKNSYT FT HHWDTQTQLQSKMQCYLALKRQYTLADYLHTVTDKGLRNTLSRYRLSGHQL FT AIETGRHRQTWLPVEERLCPHCPQQPIETELHFLTECTKYSEIREKFYPKL FT THTHKNFESLSNNEKLPILLGECVCCCVLAAQFVHSCHRLRNPQ*" XX SQ Sequence 4732 BP; 1675 A; 1247 C; 831 G; 978 T; 1 other; agttttggca caccctcaca catcatcatt cacacgggca ccaacgacct gagagcccag 60 caggaaagag tcgggcagct agtctgcaga gtagcagaga aagctgcaga gacctgcccc 120 aatgcaaaaa tcaccatctc caccctcctg cctcgcagag acatccaccc cgacaccatc 180 aacagagtca acgctgacat ctccagagga tgtgctctac tgcccaacgt gcacctgact 240 catcacacca gcatcacagt aagagacctc tatgatcacg tacacataaa aaaggacaaa 300 gtcaatgtgt tcgcaaaagc actgaaagac acagcatggg gcagacagac atcagctcac 360 acaacaaaca gactctcacc accttaccac atggaaaaca tgaaacaacc tccacctgga 420 catcactggg gacccccacc tcacacgaga cactttcagc ctacacaagc acaacagcag 480 agaccgcggc ccagctctgg acacagccaa aaaacccaga ccatgccagg acccccccaa 540 agagaaacac ctacagctgg tcatcaccaa agacctactc catcaaacag actcaatcca 600 agacaagtga acaacagcag gaggagtcca cagcatccaa caaccagcac aaccgctgct 660 gacagcacac ccagaccagc actgctgcca caacccagaa actacgccca ggctctcaaa 720 ggaccggcga aacctctgga gatgggtgaa atcagacagc tgctacaata catcagcgcc 780 cagctgacgt gaacagcccc ccaatttaca atatacaacc accatgtata tatatatata 840 tataactaat atattagtaa aggtttatct caaacttaaa aaacaagaac tttacctatc 900 tctaaatgac cctgtcaata tcattgtgga atatacaagg cctaaaatca tcagcctttg 960 gactaaaaag cttaaacaca gaattccaag caaacataaa aaataaagac attattattc 1020 tccaggagac atggagcaag acaaacacta ccacacattg cccacccaac tacagggaaa 1080 ttattcttcc ctcacagaaa ctcaacacaa ctcgacaagg gagagactca ggaggacaaa 1140 tcatctggta caactcaaaa ctccacaaat acatygacac agttaaaacc ggaaaatatc 1200 acgcatggct caaaatccac aaggatctac tgtcgtccca aaaagactgt ttcttatgtg 1260 ccatatacat cccaccatca gaatccccct actacagtga agacatgttt gacactctgg 1320 agaaagagac gagccacttc caggcccaag gacacgtgct catctgtgga gacatgaacg 1380 ccagaacagg acaacagccg gacttcacca acgcacaggg aagcaaatac atcaacagca 1440 acctaccagg tatacagacc agcttctccc accttcacag aaacaaccac gatcatatag 1500 tcaacaaaag tggaaaagag ctcttgcaga tctgcaggag tctgggactg tacattgtca 1560 acggtcggat aagaggggac agactcggga gattcacatt ctgctcacct cttggcaata 1620 gcacagtaga ctatatgata acagatatag acccttcatc tctcagagca ttcactgtta 1680 gagaactcac cccactttct gaccatagcc aaattacttt atacctcaaa aagacaataa 1740 caaacccttg cacacagccc aataaactat ttaacatcag aaagccatac agatgggctg 1800 agaacagtgc agaagactac caaaatgcag taaacagccc aaaaactcaa caaatcctag 1860 ataacttcct ggttaacgca tatgcccaca ccaaacaagg agttaatatg gcggtaaaag 1920 acataaacaa tatattcgaa agtacagcta aacaggcaaa attaaaagtt aaaaccagac 1980 aaaataatcc acccaaaaat gacaaaaact ggtttgatca agaatgcctg tcaattagga 2040 aacacctcag aaacctgtca aatcagaaac acagagaccc aaataatgca gagattcggc 2100 ttctctattg tgaaacacta aaacaataca aacaaacact cagaaccaaa aaggcacaat 2160 acacccaaaa acaactgaca acaatagaga actccattaa ctcaaatcaa ttctgggaca 2220 actggaaaaa cttgatcaaa aacgatcatg aagagctacc gatccaaaat tcagaaatat 2280 gggagaccca tttccaaaca ctattcaata aagtagaaac agacacaaat cctaaacaaa 2340 atcaaataac aaaaacactg gcaacactag aatcgactat caaggataat caaaacccaa 2400 tagacttccc catcactata atggagctta aagacaaaat taaatctgtt aaacctaaaa 2460 aagcatctgg acctgacgga atattaaacg aaatgataaa acaaaccagc cctaaatttc 2520 aatcagccat cctaaaatta tttaatctag ttctgagtgt tggtcacttc cctgatatct 2580 ggaatcaagg attgataaca cccatcttta aaaatggaga taaatttgac cccaataatt 2640 acagagggat ttgtgtgagc agcaacctgg gaaagttatt ctgtagttta ataaatgccc 2700 gactactgga cttcatcacg acacataatg tcttaagcag aagtcaaatt ggatttttac 2760 caaaataccg cacatctgac cacatttaca cactgcactc gctaattgaa aaacacactg 2820 tccaaaataa aggtaaaata tacgcatgct tcattgactt taaaaaagct ttcgactcaa 2880 tttggcacca aggcttactt tataagctga ttgaaagtgg cataggagga aaaacatatg 2940 accttattaa atcaatgtac accgaaagca aatgtggcat caaaattagc acaaaaagaa 3000 caaaatatct ttcccaggag cgtggagtga gacaaggctg ctgcctgagc ccaacactat 3060 tcaacatcta cataaacgag ctggcgctca gtctggagcg atccaccgct ccgggtctcg 3120 atctccacga ctctcagatc aaatgcctgc tgtacgcaga cgacctgctg ctactatcgc 3180 caaccgaaca gggccttcag cagaacctgc agctgctgga ccagtactgc cagacctggg 3240 ccctgaccgt caacctaaac aaaaccaaaa tcatcacctt ccaaaaaaga gccagagccc 3300 agggaacaca acacacattc acactaggta ccaatcagat aacacacaca acacagtata 3360 attatttagg cctgaacatc acctccactg gaaactttaa tcctgcagtg aatgagctga 3420 gagataaagc ccgcagagcc ttctacgcca tcaagcgtca atgtcccata gacatccctg 3480 ttcagatctg gctgaagatc ctagagtccg tcatcgagcc catcgccctc tacggcagtg 3540 aggtgtgggg cccactgaca aaccctgaac aagatctggc caaatgggac aaacacccta 3600 tagaaaccct gcacacagag ctgtgtaaga acatcctaca cgtccaccgg cacacaacaa 3660 acaacggatg cagagcagaa ttaggcaaat accctctgat aataaagata cagaaaagag 3720 ctgtcaaatt ctggaagcac ttaaaactca gcgacccgga ctcataccac tataaagccc 3780 tgcaggacag agaactgagc agaagagcag acccactctc ccagctggcc cagagcttca 3840 gggtctctga gacgtctcct gaggagctga acacactcct gccactgact cagatcacac 3900 accagatcaa gaacagctac acacaccact gggacactca aacacaactg caaagcaaaa 3960 tgcagtgtta tctggccctg aagagacaat atactctggc agactatctt cacacagtga 4020 cagataaagg tctgaggaac accctgagca gatacagact cagcggacac cagctggcta 4080 tagagacggg ccggcacaga caaacatggc tgccggtgga ggagcggctg tgcccacact 4140 gccctcagca gccgattgaa acagaactgc acttccttac ggagtgcaca aaatactcag 4200 agatccggga gaagttctac ccgaaactca cacacacaca caaaaacttt gagtccctgt 4260 caaacaatga gaaactgccc atcctactgg gggagtgtgt gtgctgctgt gtgttagcag 4320 ctcagtttgt gcactcctgc caccgtctga ggaacccaca atgatccgct ccactgtctc 4380 caacacacaa cacaggaact ctttactcgt ggacttactg ttgatattat tgtcaccact 4440 tttatcctgt ttcacaaatt accattaatc tatttttatt tttattaata tttttagttt 4500 tactgatata ttgttatttt tattaatcta tttttatttt ttacatttct tttctttatt 4560 tctatattat attttatgtt tatatttgtt tgcactactg ccctttttgc actgtcttgt 4620 ctgtacgcag cgacgctgca ctgctttggc aatacgaatg tacagctatt tgtcatgcca 4680 ataaagcacc taaaatgtga aatgtgaaaa tgtgagagag agagagagag ag 4732 // ID DNA2-2B_DR repbase; DNA; ZEB; 929 BP. XX AC . XX DT 13-MAR-2009 (Rel. 14.03, Created) DT 13-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE A family of nonautonomous DNA transposons - consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TA target site duplication; KW DNA2-2B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-929 RA Bao W. and Jurka J.; RT "DNA transposons from Zebrafish."; RL Repbase Reports 9(3), 690-690 (2009). XX DR [1] (Consensus) XX SQ Sequence 929 BP; 277 A; 200 C; 195 G; 257 T; 0 other; gcccctttca cacagtgata ccggtaaata tccggaaaat ttccggaacg actttaccgg 60 taaattcata aaaacgctgt tcacacaggc aaggacgttc cggaattttt ccggaaaaga 120 ccattcacac atccattcca aaataccggt aaattctgac atcattaacc agaaatgagc 180 tctaaacggc tgcgcttgta tttgtaaaca tagaaggggt cgggcttttg ttgatggttt 240 cacttttatt taaagcttta gcattcatgc agctgctttg tcccagagac atccaaaggc 300 agaagcgcga accgcagcta aatgtttaca catttgacta cattacaaat tctgtggatg 360 gataagtatt gtgaacaact tcgatgaaaa catatggagg aacactttcg catgtcgaga 420 tgttcataat atgtgtgtgt gctggcgctc accggagcac tccttcacgt gcacacgcgt 480 caagcaactg aagcagagct tgaaggtaaa caaacagcgg tttatcataa gcattttatc 540 gataattttt tacacagttg gcattaagaa ggaacataga aacgttatct gactaacatc 600 tagcagctaa atgtgtctgg aaaaatattc aaaggctttt attttcataa acagcgcgga 660 tgtgaatgcg tctgagtgtt ctgattggct aaagtagacg tctcacgtca gcacgttcta 720 aacgtgaacg cgctctttcc ggcaatcttc cttctgcgtt cacacagcgc agcattccgg 780 caaattaccg gtaatgttac aacttctctt tccggaaaat tgccagaacg aatttaccgg 840 tattttcaaa aagggcctgt tcacacatac agacctttcc ggaaaattgc cggtaatttt 900 ccggaaaggt ctgtatgtgt gaaaggggc 929 // ID Gypsy91-I_DR repbase; DNA; ZEB; 4259 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy91-I_DR; KW Gypsy91-LTR_DR; Gypsy91_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4259 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1493-1493 (2008). XX DR Genome; chr18; Positions 25145529 25149787. XX CC Positions [1949-2425] - Integrase core CC 'CACAT' target site duplication CC LTRs are 94% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 44..3586 FT /product="Gypsy91-I_DR_1p" FT /translation="MTTPSATPFAEVIQALVVLHREQHEALVEIRADQDQR FT FRALVEAQREDRELVRRLLDQGIRPASTSAAHPPISLQKMGPQDDPEVFLD FT LFEKMAEACGWPRAEWPVRVIPLLSGEAQIAAQQLPAQNLLDYAHLKRAIL FT QRAGCNPEEQRQRFRSLELSESGRPFVFAQQLRDACRRWLIPDGQNIDQVV FT DSVVLEQFIARLPSRTSEWVQCHRSTDLETAIQLAEDHLVARSMVGDTTSL FT SSLSLSSSLSLPSPRPRQQGPPVPAPRKRGSGADTSRSMPRWGGYPERGGE FT ARPLVVSPATTGFSPAQSVGPTGPGGAVVRSGPTCWRCGGVDHSPENCSAM FT EVGALIRVPDAPAVAPGRNGLYQIPVSIKGDKYQALVDSGCNQTSIHPCLM FT QDTALDKSRTVRVRCVHGDEIHYPLAAVEIQFRGQKHSVEVAVNPHLKHPL FT ILGTNWPDFNKLLGVLTAGASWRKGSTARGRIAQLGESPVTSSHSDSEGGL FT GISRCKDFPLEQSRDDTLRNAIEQVKVIDGHVLQPERPLNYPYFAIIKDRV FT YRVTQDTQSKEDTTQLLVPKSRREMLFQAAHANPMAGHLGQAATLNRLMAR FT FFWPGIHGDVSRWCAACPECQLVNPPATPKAPLRPLPLMEIPFERIGMDLI FT GPLERSACGHRFALVLVDYATRYPEAVALRNISAKSVAEALFRFISRVGIP FT KEILTDQGTAFMSCTLRELYELLGIKSIRTSVYDPQTDGLVERFNRTLKAM FT IRKFVHKDAKNWDKWLEPLLFAVREVPQASTGFSPFELLYGRQPRGVLDVV FT REAWEDEPSNSKNEIQYILDLRAKLHALGRLSMENLLKAQNEQRRRYDKGT FT KLRTFSPGDKVLVLLPSSSSKLLAKWQGPFEVTRRVNDLDYEVVRKDRSGA FT RQIYHINLLKLWREPEDVALATVITGEDDLGPECVDCNRSSALVAGGDHLS FT LAQLADVQLLQNEFKDVFSPLPGRTNMIQHHIETEPGVVVRTRPYRLPEHK FT KKVVQEELGKMLEMGVVEESHSDWASPIVLVPKTDGTVRFCVDYRKLNAVS FT KFDAYPMPRIDELLDRLGAARFYSTLDLTKGYWQIPLSPISREKTAFTTPF FT GLHQFVTLPYGLFGAPATFQRLMDKILALHSAYAAAYLDDIIIFSNDWQRH FT MQHLRAVLSALRRAGLTANPRK" XX SQ Sequence 4259 BP; 963 A; 1132 C; 1236 G; 928 T; 0 other; gtggtgccga aacccgtgaa gaagaggaga cttcgctgcc aagatgacca ctccgtcagc 60 tactccattc gcggaagtga tccaggcgct cgtggtcctc catcgcgaac aacatgaggc 120 gctggtggag atccgggcgg accaggacca gcggttccgt gctttggtgg aggcccagcg 180 cgaggaccgt gagctcgtcc ggagactgct ggaccagggg atccggcccg cgtcgacatc 240 tgccgcccac ccccccatct cccttcagaa aatggggcct caggacgacc cggaggtatt 300 tctagacctc tttgagaaaa tggcggaggc gtgtggctgg ccacgggccg agtggccggt 360 gagagtcatc ccgctgctgt caggcgaagc ccagatcgcg gcgcagcagc taccggccca 420 gaatctcctg gactatgctc acctgaagcg agccattctc caacgggccg gctgcaaccc 480 ggaagaacaa cgccagcggt tccggtccct ggagctgagt gagagcggcc ggcccttcgt 540 gttcgcccag cagctccgtg acgcctgccg cagatggctg atcccggatg gccagaacat 600 cgaccaggtg gtggactcag tggtgctgga gcaattcatc gcccgcctcc cctcccgaac 660 atcggagtgg gtccagtgcc accggtccac cgatcttgag acggccatcc agttggcgga 720 ggaccacctg gtagcgaggt ccatggtcgg cgatacaacc tctctctcct ctctctctct 780 ttcttcttct ctttctctcc cctctcccag acccagacag cagggacctc ctgtgccagc 840 gccgaggaag cgggggtctg gggcggatac ctctcgctca atgcctcggt gggggggata 900 tccggaaagg ggtggggaag cacgccctct agtggtttct ccagccacaa caggtttctc 960 tccggctcaa tcggttggac caactggccc cgggggtgca gtggtgaggt ctgggccgac 1020 gtgttggcgg tgcgggggcg tggatcatag ccccgagaat tgctccgcga tggaggtggg 1080 ggctttgatc cgcgtacccg acgccccggc tgtcgccccc ggtcgcaatg ggctatatca 1140 gataccggtg agtataaagg gggataaata tcaggccttg gtggattcag ggtgtaacca 1200 aacctctatc cacccatgcc tgatgcaaga cacggcattg gataaaagcc gcacggttag 1260 ggtgaggtgt gtgcacggtg atgaaattca ctacccgcta gcggccgtag agatccaatt 1320 tcggggtcaa aaacatagcg tagaggtagc ggttaacccg cacctcaagc atccgctgat 1380 tttgggaaca aattggcctg attttaataa actattaggg gttttgacag cgggtgcctc 1440 ttggaggaaa ggatcgacgg ctagggggcg gatcgctcag ctgggggaat ccccggtgac 1500 gtcatcccac tctgactcag agggaggcct gggaatttcc cgctgtaagg acttccctct 1560 ggagcagtcg cgtgatgaca cgctgagaaa tgccatcgag caggtaaaag tgatcgatgg 1620 gcatgtcctt cagcctgagc gaccactcaa ctatccctat tttgcgatca ttaaagacag 1680 ggtgtatcga gtgacccaag acactcaatc aaaagaagat accacccaat tattggttcc 1740 taaaagccgt cgggaaatgc ttttccaggc ggctcatgct aaccctatgg ccggacattt 1800 aggtcaagcg gccacactaa atcgcctcat ggcccgtttc ttttggccgg gcattcacgg 1860 agacgtcagc agatggtgcg ctgcgtgtcc tgaatgtcag ctggtaaatc cgccggccac 1920 cccaaaagcg ccattgcgcc cgttaccatt aatggagatc cccttcgaaa gaattggtat 1980 ggatctcatc gggccattag agcgatccgc atgcggacac cgatttgcat tagtcctagt 2040 ggattatgca acacggtatc cggaagctgt agctctccgc aacatctccg ctaagagtgt 2100 tgcggaggct ctcttccgct tcatctcccg ggtggggatt cccaaggaga tcctcaccga 2160 ccaaggcacc gcgtttatgt catgcacttt gcgcgaatta tacgaattat tgggcattaa 2220 atcgattcgg actagtgtct atgacccaca aacagacgga ctggtggaac ggtttaatcg 2280 cacacttaaa gcaatgattc ggaaattcgt tcacaaagac gcgaaaaatt gggataaatg 2340 gttagagccc ttgttgttcg ctgtgcggga ggttccccaa gcctctactg ggttttcccc 2400 attcgagctc ctctacggca gacagccccg aggggtgttg gatgtcgtca gagaggcttg 2460 ggaggacgag ccttctaaca gcaaaaatga aattcagtac attctggacc tgcgagcaaa 2520 actccatgca ctggggcggc tatccatgga gaatttgctc aaggctcaga atgaacaacg 2580 ccggcgttat gataagggca ctaaactacg aacattttca ccgggagata aagtgcttgt 2640 actgctgcct tcttctagct ctaaattact cgccaagtgg caagggccat ttgaggtcac 2700 acggcgagtg aacgatctcg attatgaggt ggtacgaaaa gacaggagtg gggcacgtca 2760 aatttatcat attaatttat taaaactatg gagggagccg gaggacgtgg cgttagcaac 2820 ggtgattaca ggcgaggatg acctgggacc ggagtgcgta gactgtaatc ggtcgagtgc 2880 tctggtcgcc gggggtgatc atctctcgct cgcccagctc gctgacgtcc agctcctaca 2940 aaatgaattt aaagacgtgt tctcacccct gcccggtcgc actaatatga ttcagcacca 3000 catcgagacc gagccaggcg tggtcgttag aacccgcccg taccgtctgc ctgaacacaa 3060 aaaaaaagtg gttcaggaag aattgggaaa aatgctagag atgggggtag tagaagaatc 3120 acacagcgac tgggcgagcc cgatcgtctt agttcctaag acagacggca cggtccggtt 3180 ctgtgtagat tatcgtaaat taaatgccgt atcaaaattt gatgcgtatc caatgccgcg 3240 gattgacgag ttgctcgacc ggttaggcgc tgctcgcttt tattcgacat tggatttaac 3300 gaaagggtat tggcaaattc ccctttctcc aatatcccga gaaaaaactg ccttcacaac 3360 gccgtttgga ttacaccaat tcgtgacact tccatacggg ctgttcgggg ctccggcgac 3420 gtttcagcgc ctgatggaca aaatacttgc cctccactca gcatatgccg ctgcgtattt 3480 ggatgatatc attatcttca gtaatgactg gcagcggcat atgcagcacc tgcgggcagt 3540 attatcagcg ctgagacggg caggactcac ggccaaccct aggaagtgag caattgggcg 3600 agtggaggta aggtatctgg gcttccactt gggtcatggg caggtgcggc cacaaattga 3660 taaaaccgca gctattgcaa cctgcccgag gcccaagacc aaaaaggagg tgagacagtt 3720 tttggggctg gcgggatatt atagacggtt tgtccctgag tattcggccc tcgtcaccaa 3780 agttaaagct gcactttgcg gggggccgct attgcatgct cctaattttg ctctcccctt 3840 tattttacag acggatgcgt ccgatcgggg tttgggggcg gtgctcgccc aggaggtgga 3900 gggtgaggaa cggccggtgc tgtacattag ccgcaaatta tctaacaggg aggcgaagta 3960 cagcaccata gaaaaagagt gtttggcgat caggtgggcc gtcctcactc tccactatta 4020 tctcttgggc aaggagttcg tcctctgttc ggatcatgcc cccctccagt ggctccaccg 4080 catgaaggat acgaacgcgc ggatcacccg ttggtatctc gctttacagc ctttcaaatt 4140 caaagtgatc cacaggccgg gcgagcagat ggttgtagcg gacttcctct cccgggcggg 4200 gggagggggg agagctgcag gccggacggc tgcccggcct gagacgggcg gtgggggta 4259 // ID BHIKHARI_I repbase; DNA; ZEB; 5277 BP. XX AC AJ011117; XX DT 19-JUN-2000 (Rel. 5.05, Created) DT 06-MAY-2009 (Rel. 14.06, Last updated, Version 2) XX DE Internal portion of retrotransposon bhikhari from Danio rerio. XX KW BHIKHARI_I. XX NM BHIKHARI_I. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Vogel M.A. and Gerster T.; RT "Promoter activity of the zebrafish bhikhari retroelement RT requires an intact activin signaling pathway."; RL Mech. Dev 85(1-2), 133-146 (1999). XX RN [2] RA Vogel M.A.; RT "BHIKHARI_I."; RL Direct Submission to Genbank (29-SEP-1998)Vogel A.M., Abteilung RL Zellbiologie, Biozentrum der Universitaet, Klingelbergstr. 70, RL Basel, 4056, SWITZERLAND. XX RN [3] RP 1-5277 RA Bao W. and Juka J.; RT "BHIKHARI_I from zebrafish."; RL Direct Submission to Repbase Update (06-MAY-2009)the complete RL internal sequence.. XX DR [3] (Consensus) XX FH Key Location/Qualifiers FT CDS 2585..4438 FT /product="BHIKHARI_I_1p" FT /translation="MAHSKDPVGHWKDLETWLSVVTGSLLPKAAETLQPLT FT QNQLDENIDSIMKQDPSQSFNHKELAKITGTLSHTLIATLKLSDRHASQLQ FT HKLTRLQARIEQLELEAQERLEQPNEVDEGTTEEIDKLQEALTAITEEREQ FT ARADHADVANKLDYAEQLLKEAKVDLRDKKARIKALETHLSEARHEIDRLM FT QEVDDIKEESASELRHAYALRYEPPKTRCAPASPLPSRTGSPVPELSPIER FT GEKPCRRSSPTPSEEPYLTPQRREPVIASHRSSYSLDLKDLDKLARNIGKF FT TPSVSGGLEVHAYLQDIDFHLEMRPNVSDKERLYLLRATSSTEVRSFLDRQ FT PARVKNDYRLLQEALIREFANPESDQGLLSALETKQARNESPHAYYNRLRQ FT AFFGTRNEPNMEEDLNFKILFLRNLHPAVSQHLGVLACPRTMSIQQLRDLT FT QKAHDKQKMVLEKNTKTATVFDFNTQNPELALEGAQRPNSVRPPSPAWNAS FT SSNRQRNSYADTRFKQRNSHWDGPRGQRRSPEHHRERNQWGSNKSWSPSKG FT RHQNPGSSSPRSQRRYSKNFHPDNAQTQSQQEENAPPGFDPQELVKLMMKE FT FLKCIEEDRKREKEKADSA*" XX SQ Sequence 5277 BP; 1602 A; 1237 C; 995 G; 1443 T; 0 other; tagtggtgga gaatgattcg ggcaagcaga tctaactttt aaagcacaaa acccagaaat 60 cacgttgtta atcttttcta tttttgtccg gaaagaatag attagaggga acaaaagttt 120 tgctactgca caatttcatt gtggtaccat ttgttttatt gcttgactgc atttagtgtt 180 gttaagaaag taaaacttca tttaagttgt ccgccgagac aaactgtaag ccgctctaaa 240 gcgcatgtag agaaagacaa agcgtattcc tttgtgctta tctcataacg ttatataagg 300 ccttcacagt atgatttcgt acgccgcgat tcccggatgg aaactcagct gtgttttctc 360 ctggtaactg atttagcata gctaactaaa ttcttgttgc cctgtaattt gctaatactt 420 agtaaattat ttaaaacgcg gagctcagag agagctgcct agcgtagttt ataggagaag 480 ttttgaaatt gatcaattga aattgtggtt tgaatccacg gatgtgtttg cagtctaaac 540 agacttaaca cagagtgctg agatccccca tatctctgtc acgatttgta ttaatctcag 600 aaatcctgtt gattaatttt atctcacttt cataagttga gaatcgttca gtactgttaa 660 gatataaaat ctaactagtg ctttttgctt gaacagtgtg agatgcgccg tcatctcaag 720 tgatactgtg aaatttgcta ttattccact ttgtggtcgt tgccatagca gcaaccttgg 780 tccggaagta cagagagact tgccggaatt acattcaagc gcgcatgtgc agtccgccag 840 cgccagcttg aattgtaaac aaagagcata gcgtgaagct aagctaaact aacgctaacg 900 cgtttgcatt gaagtctatg catttccagt agcagcgaag gctaatagat agcacgtagc 960 atttagtggt cttaaatgca ttgtatggct gatgtttgaa ataactcttg catctcagtt 1020 gtttctcttt actcaccttg gaaaagttga caaatgcaaa gttaataaaa gtgtgcacga 1080 tctaactgct gagtttaatc actattattt tattttattt ttgtttttcc tgctgtacct 1140 ggcctgttga atttgatcag tttgaactca cataagctgt tggttttatt agtgatgaat 1200 tatagttttg ttttcatgac tataaattaa tgctctctga cctgttcaag tataaatact 1260 ttttccattc atcaacatct aggaacaccc atttcctata attgttgatt tatacggata 1320 agactcactg attaaataaa ttataattgt ttaatttaat gataaactgc attaaatgtt 1380 actctaacca acttcattgt tttgaaattt taatttcatt aatggttcat ttatgcatgg 1440 ctttatcatt ttagcaataa ttaataatta aatttagtca taagtgttct tatcaatctt 1500 aatgataact tataaaagtt attaccacgt tacagattta aagttaagtt caattcaaaa 1560 ttgaatttag tctattaaat ttagtttatt ttaagcctgc ttacagacag tgctccccct 1620 gctgcttcat tgatgcaatt ggacagtcca gcccagtgag ctgtaatggt accaacctaa 1680 gccatcattg agcctagtct tagagcagta gccctccaat tagccaaact tcaaaaggaa 1740 aattcagatt tttctactac gtaggctaag tggttactat ataacacaat aaaaccaaca 1800 cttcttgtag tgtcaactca acagcgccac cgctttgaca tccttaaata cacctaatgc 1860 taagagaaaa ctaactcctt tgttatctct ttctactaac tatcttcttt ttactagact 1920 attgatgctg gtttaaggat tgtgcttgtg agagtcaaaa agcagagaat tttagaaaca 1980 tttaatagat gtttgggaca cttgaacact tttaattgtt aaaattgtct gatctcaccc 2040 ctcctttctc actcatcagg tcaccaagaa gataggcgtc gttagatcca attctaaaca 2100 cgcatccaga aacaatttat ttggtgtacg gagtcaatcc tattcagaag ccaatactgt 2160 tattgttgtc ctactttctg tcacttcaga agaaagttaa tagcaacttc agtcctggta 2220 tttagaaatc ttatttttat atttaacaaa agctaaagga accctctctc taggaaagtg 2280 gaaactacag attttcaccc tactccttag tctccttatc acttttgttt ttctttgttc 2340 tctcactgtg tctgtgctac tgactctttc cacagagact accagttaga cacgcgtacc 2400 caccagaggg ccacagttcc agagaaccac acttcactca acattagaac acacttaaca 2460 ttgcacatta acatttctcc ttttctgttt gcattgttct accttgtttt ttttctttct 2520 cctctctatc tctcttcaag gtctgtgtca gttaaaatcc taagtcctcc ccccgccgca 2580 aaacatggct cactccaaag atcctgttgg ccactggaag gacctggaaa catggctgag 2640 cgttgtaaca ggcagtctcc tccctaaagc tgccgaaaca ctgcagcccc tgacgcaaaa 2700 ccaattggat gagaacatag acagcatcat gaagcaagac ccaagtcaaa gcttcaacca 2760 caaggagctg gccaaaatca ctggtacttt gagtcacaca ctcatagcca ccctcaaatt 2820 gagtgacaga cacgcctccc aactccaaca caagctgaca cgcctgcaag cccgcatcga 2880 gcagctagag ctagaggctc aggaacgtct ggaacaacca aatgaggtgg atgaaggtac 2940 cacagaggag atcgacaaac tacaagaagc cctaacagcc atcacagaag aaagagaaca 3000 agccagagca gaccacgctg acgtcgctaa caagctagat tacgctgaac agctactgaa 3060 ggaagcgaag gtggacttaa gagacaagaa ggccagaatc aaagcccttg aaactcacct 3120 gagcgaagca agacatgaga tcgacagact aatgcaggaa gtggatgaca tcaaagagga 3180 gtccgccagt gaactcaggc atgcctatgc actgcgctat gaacctccaa agacaagatg 3240 tgcaccagcc tcgcccctgc caagcaggac aggatcccct gtccctgagc tctcacctat 3300 tgaaagaggt gagaaaccat gccgaagatc ttcgccaaca ccttctgaag agccttacct 3360 taccccacag cgacgagagc ctgtgatagc cagtcacaga tcgtcataca gtctggacct 3420 taaagacctt gacaagctgg ccagaaacat tggcaagttt actccaagtg tgtcaggtgg 3480 tttggaggtc cacgcttatt tgcaagacat tgatttccac ctggaaatga gacccaatgt 3540 ctctgataaa gaaagactgt atttgcttcg agccacatcc agcactgagg tgcgcagctt 3600 cctggaccga caaccggctc gggtaaagaa cgattaccgc ttgctccaag aagccctcat 3660 cagagagttt gcaaaccctg aatcagatca aggactgtta agtgctctgg agacaaaaca 3720 agctcgcaat gagtccccac atgcttacta caaccgactc aggcaagcct ttttcggaac 3780 tcgcaacgaa ccgaatatgg aagaagacct gaactttaag attctcttcc tgagaaacct 3840 ccatcctgca gtaagccaac atctcggagt actcgcatgc ccacgaacaa tgagcatcca 3900 gcaattgcga gatttgacac agaaagccca cgacaaacaa aagatggtgt tagagaaaaa 3960 cactaaaact gcaacagttt ttgactttaa cacccagaat ccagaactgg cactagaggg 4020 tgcccaacgc ccaaacagcg tgagaccacc atccccagcg tggaatgcgt cttcgtccaa 4080 tagacaacgg aactcctacg ctgacacaag atttaaacag aggaacagtc actgggatgg 4140 accgcgtgga caacgacgct cacctgaaca ccaccgggaa agaaaccaat gggggtcaaa 4200 caaaagctgg tcaccatcta agggaagaca tcaaaaccct ggatcatcaa gtccaaggag 4260 tcaacgaagg tactccaaaa acttccaccc tgacaacgct cagactcaat cccagcaaga 4320 ggaaaatgcc ccaccgggat ttgaccctca agaactggtg aaattgatga tgaaagagtt 4380 tctcaaatgc atagaggagg acaggaaacg ggaaaaggaa aaagcagatt cagcctgact 4440 agtcgccgag cggagaagca acaaccacct gaaccaacaa acacttgcag gtgaaagccc 4500 tcaggtttcc aaaagtgccg ttcgggtcgt ctgccactcc tccgacaaac ctcagttcag 4560 tcctgaaacc cttcctgacc catgccaaac tccggtacct ccattactgg atgaacattt 4620 gactgctcca tcatcattga acaacaaatg ggggaggacc acaacttgac acagatcaac 4680 tgttctaccc tctacatact aacataacct ttgacattct agggaaacac ctagttaaac 4740 acaacactaa tactttctcc tctacagcac tgctagaaca acctctagta agagaggatc 4800 cactccacat atgcaacact gttattgctg gtttttaacc taccttgttt tgttgcatca 4860 ctgtgcctct tcatttcctt ctttctctct ctctccctaa ctaatttgca gttccagtcc 4920 cctctacagg gaactcctgc aaattcatgc acagggccta aaagtaggat tgactccttc 4980 tcccaaatta gtgtcaattg ctgcagaatt attgcttatt taagttgaag ctagcgccac 5040 tccatactcg caatgaagta aactagctaa ataactaaat ttgaattatt tcaggcaatg 5100 ttacactaca ccttgtgttc ttaacccgaa caatttaagc taaaacctgc accagaactc 5160 agtctatgga atggaaatgt ggtttgtgtc ttgtgtgtct gttctctctc tacctatgcc 5220 tcttgttcca gtgtctgccg atgttcatgc caggaacatc accatccaaa tggggga 5277 // ID DNA-8-13_DR repbase; DNA; ZEB; 5541 BP. XX AC . XX DT 20-NOV-2008 (Rel. 13.11, Created) DT 20-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5541 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1728-1728 (2008). XX DR [1] (Consensus) XX CC TSD is 8-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5541 BP; 1765 A; 1000 C; 1069 G; 1707 T; 0 other; caggcccggc gctacggggg ggtaaagggg ggcattgccc cctcagaaag agtttgtgcc 60 cccccagttt ttataaagga ttgttcactc aaactgaaaa ttctttctgt cattaattat 120 taacaccccc gtcgttccaa ccttcagaac acacatttaa atattttaaa atatatgaca 180 tagcaattga attatgagaa tggctatatc atgatgaaca gtgctttata ataacaataa 240 taattcctta catttctata gcacttttct tggcactcaa agcgctttac acataggggg 300 aatcttctca tctacttttt catttttttt tttatgcaag aaagaacatt acaaagatat 360 aatatgaaca aagagacaaa gagggttaca taaatagaat tccagcaaat taaacaagtt 420 taaatgattt acactaagaa agcgttttaa atgcagtgta attgttagag tcttttaaag 480 agttcacata aatttcaaag tcctttctga aaataaaaaa gtaagggtgt ttctttgaaa 540 atttacattt atggatgtaa aattttgttt aaaaaaagca ataaatcaat aataaattta 600 caatgagaag gatctacact cagagagtaa ccaaagaata caatttttgg ggaaatacag 660 aagtttggaa ataagaattt tttaatgaaa ttggaaaagt caacccaaaa cactttacag 720 taagtacact cccaaaataa atggtttaga gtttcatcat attgagtaca aaatgagcat 780 ttaggggaaa tattgttatt aaatttagct agtttggtat ttattgggta gcaatctcct 840 catctaccac cagtgtagca tccacctgga tgaagcgaca gcagccattt tccaccagac 900 cgcacaccac agagagagaa gacagagtaa tgaagccaat tatgatatgg ggatggttag 960 gaggccatga tgatggacag aggccagtgg gtaaatttgg ccaggatgcc ggggttaaac 1020 ccgtactctt tttcgaaaga cattctggga ttttagggta tatgctgaag catgctaata 1080 ggacttttaa tttgccagta acctgggtta agcgcttccg gcgaactgta tgccaaagga 1140 aaaatctgca cgtttttaag gtctgttttc tttaaaaatc agcgatctcc actggctgag 1200 tgatatccga tatgaagtgg gtctcagatt gtactagaag cactgcatta aattacattt 1260 tccccgccat gtctttataa tatgagcatt tattttaccc cagtatatat tcaatggagc 1320 gtctgcgcta tcctgccgat tctgacgggc gagtgcgagt aaaaggcctt ttgtctcgtt 1380 ttacgcttga acaactaaat gaatgccaaa gtttatttaa ctgaatgtat tttaattaca 1440 ttacaacatg aatgctgtaa aaggaaccgt ataaaatgga aaaaaactca caataaccgg 1500 tcaccggaag tttataagta tatgggaaca acactagctc ggcatatacc caattatatt 1560 catctgtctc catattctac tgcatcccag aatattacac agcaccaaaa atgtaaaaaa 1620 tatatatatt gaatatattt aatatatttt tgtgtacggc agacggcaac tcgccgtgac 1680 gcagattccc gagtctgtgt gctccaatca ggtgagccct cttcagcgct cgccatgaca 1740 acgagacagc agtgacgtgc cttgtaaaaa tttgcatctg cattcctata aagctcgtgt 1800 gctctctgtc agcgttgtca gttcgctcgg tgttgctgtt tgtacttgac caaggtgagt 1860 tcaataccga cggtacgagt ggagccacgc caatctcacg aagtccgctt cgtgaagctt 1920 cactcgggcc cgtctgtgct gtagttcctt gtggtttgcg tacataaacg taacatcgca 1980 aggaatatta cgtgtttgtt gctcgaaatg tgccgccgag caacatccag tgaatactgg 2040 cacggcctgg cgcccggtct tctcgtgcac tccctggggc gaggcggcct ttgagcccta 2100 ggcagggaac aaacaacccc cgaggaccct cagggaaaca cggggggccg gtagtgggac 2160 ctcgctgcga gggttatggt gggtgggagt agcgaggccc actcatcgtc tattttaaac 2220 tatagtttca aactttagtt tactatagtt tgaaatagac accatggtaa ctgttatgat 2280 ttaactttat gcaaagagct tagaatgacc aagagcaaca gccccggatt ccgccatttt 2340 gaagtgaaag cgatcggccg tccattggat ctcattgctg tcgcgatgga aagcagctgc 2400 tttgttttta tttaatttat tcaaaaagcg cgttattgtt tcccaaattg tagagtgtcg 2460 cctcttggtg attctatgct ctttgctctt tgtctgactg ttgtaatgtt ttgctttgtt 2520 tttcagctaa gctgttgagc aaagaaggac tattgaagac aacagttgaa gacaaccgcg 2580 ggttatattc acgagcgcgc gcgttataat gtttccgggt tgttttctga aagtttgtct 2640 tgtttgcatg tcaaactaaa taaactatac tctttctaaa cacctgtctc gtgttttatt 2700 tgtcctctac agcaatgcac tgtcatgtaa gtctgttgaa tgcatttgta tgcaacatgg 2760 tttgacaata cagtagtcta acaaataaat ggttgttata aatgtacctt cattagtcaa 2820 catttggtgt aggtaataaa aagttaatca aagatgtcct aagacataag ggttatgtta 2880 tggttgtttc ataactatat ttattattat tctgtatgtt taaaaccagc aaattatact 2940 atatatttaa catgtcatga acttactgtt attttaaaat ttgcttaata aaatgtccta 3000 agacaagaat gggtgttgtt ttggttttat gtcaacctac agtaggattt aggttgacta 3060 cagcaggtat aggctacata catcataggg agttttttta atgacccaaa tcaagtaaaa 3120 cgtctcttac tcaagtactt ttagcatgca aaagtcaaat ttaagtatat aaaatacagt 3180 ttccaacaac aaaattttgt cttgtgaaaa agaccagtgt ggtggtgagg aaaaaaaagt 3240 aatatcaagt tatatatatt tataaaagaa gttgaacaag ttgtcataat atttagtcat 3300 tagattatgt gaccgctaga tggcgctcaa ttaaaaggag cgttcatttc aaaggagcgc 3360 taccctgtac taaaatggcg gctctattga cgcattcctt ccaatagaca acatcagggt 3420 aggcgacatc caatgtatat ttctatggga gctaactagt attcacgtca tatcatgctc 3480 aactcttatt actcttttac gctctttgag agtcttggcg ggatttcaga atgatattta 3540 ggcaactatg aagtagcagt accgtctgtg taccatagtt acacaatgca gcatttagct 3600 cgctcgtaaa catcgctgtc tgagcagaag tttcatcatt agccctgact gtcagatgct 3660 cttcatctca ggagctgtca gactttggcc aacattgccc cctcagaaag agtttgtgcc 3720 ccccagtttt tataaaggat tgttcactca aactgaaaat tctttctgtc attaattatt 3780 aacacccccg tcgttccaac cttcagaaca cacatttaaa tattttaaaa tatatgacat 3840 agcaattgaa ttatgaaaat ggtgagatta aagctagtcc tatagtccag agatccggtc 3900 cccgcgtcag gtcaggtgcg cccgtaaata gcctttaaaa cgcgcacacg gcgatggagg 3960 cgcactcaac tcacaaggga gaaaagtggg tgacatgata ggagcagctt aattcattac 4020 atgttttggt taactaatta tttaaacaca actctcttga gctctaaaag gaaaaaacac 4080 ataagaaatt tgctgttaaa ttaattttag cagagacgca actgccagtc atttattagt 4140 tattcgggaa gatgtagcct aataaaaatg ttttaataaa aataaatagt ttactgcaca 4200 gtcccacagt aggctacact ataaataata ttttaattta tttattaata ttattattaa 4260 tattattaca attaagctaa aattgacaaa acgcagaacc ctttgaatgt ttgctttcat 4320 tttgctaaaa agcgcacttt tacagttgtg tagagaaaca tcccattatg cagcagcagt 4380 caaggtataa cacataataa atgccttttg cgtttaaaga tgatcggcct gcgaagtcaa 4440 gtttgcagtg ttataaatgg aataggtata tagtgcttaa gtggaattac cgcggatcaa 4500 ccgagaaacc tctcacgtgc gctgcaggta ggcagaatga taatctttta aaaatatgta 4560 atttaggcta tactgctatt tgctcatgca aaattaaaca acataaaata tattgtagat 4620 ctattttaag agctaaaata caacatatcg agtttgtttt gacttgtagc tcatggccct 4680 aaaatgaaaa taacctgaaa acaagctatt aaaaaaggct taataaataa aataattaat 4740 aaataaaatg acgggtaggc ctagtaacct aatggtttat gatggtttac aatatcacga 4800 aaaaaacaca cttatgatag cctaagataa atctttgttt ttaacattta aaatgatgaa 4860 aaaaagttat atagcatatt gagttgcgtg ttatccgtgc caaaatttaa ccaattttca 4920 cggtaataac atgtgaattt tggtatcttt tttaggtatt tttctggcta cactaatgcc 4980 ttaaaatgta actaagtaat tgttttaaat gacattagat tgtaacattt agcttttccc 5040 taacactgat gttggttgga taataggcta cgtgaagcgg gatgactcgg ttaagctgat 5100 ttcactatat ttatttaccg gttaaatgaa tagtctgatc gtgggtcagc ttgcatgtgt 5160 gggcataatt tgacgtcttt acctaaaact ttaaacattc ataggcctgt tatttgaaat 5220 aaaacggaaa taaggagaat aacatatgat ttaaagcact cgtttggcac tgaagccatg 5280 atccgtctgc attaataagc ggtatgcatt accatcgtta ataaaattcg taatagcgaa 5340 actgcatatt gttattacta ttaaaacgta gcctatatta ctgccttagg tcagtaactt 5400 aggcagtaat gcaaagtaat aataaatgtg aataaatagc ctaaatgaaa gaaacaagag 5460 accaaaatca gtcaccaggt ctagtgcccc cctatggatt taaagtgccc cctcagtttt 5520 gacttcctgg cgccgggcct g 5541 // ID Gypsy140-LTR_DR repbase; DNA; ZEB; 994 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy140-I_DR; KW Gypsy140-LTR_DR; Gypsy140_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-994 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1542-1542 (2008). XX DR Genome; chr4; Positions 36204451 36203458. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 994 BP; 263 A; 140 C; 186 G; 405 T; 0 other; tgtaacagag ttacaattgc actctgttaa aattataaca tttttcaaat tatattgtga 60 tttttattta tttaaatatt ttatttcata tttgtgaatt ttatcatttt aagaatgatt 120 ttggagtcat tttcatttat ttactatggc agtgtttaac ctgcacaccg cccctttttg 180 tacctcttgt gttgccgtag tttgtttttt tcccctccca ccctcacaat gctgctgtgt 240 attgctgagg gtaggttgtg tagtgcgttg ctcctgattg tatttaatat ttctatcgtt 300 ttaaagcaag tgtttataat ttttatggaa actatataac ttattatata ttatattatg 360 tattatatta caaatatgct atattttcta tagattgaca gaagtgctaa ccgcatggca 420 gatcgccatt ttgtctgcat gttgcttcag ttctcagtga gaaaactcta ttattttgtg 480 taaatagggg tcggatgtgc agtgcaggca gaagacaaca taattgtgtt taattgtctt 540 ctccaggtac gttttactgt ctcattgttt tgtttgcctt ttgttttact caacgcacat 600 tttttgttgt ccaccaggtg gctctgttgt gcgtagtttt tgcacatgca tttttgtttg 660 tagttgagtt tggtttctaa tgataaagaa atgttcattt tcttgaaatt gtagaaaaaa 720 aagtgttttc ttgcacattt taaattgata atttctaaaa gtgtaattgt gtataaattg 780 tgaaagttat ctaatgtaat tacatatatt gttaagaaaa caattcattt gaccagtgac 840 aatgctgctg tgtgttgctg agggggtcgg atgtgcagtg caggcagaag acaacataat 900 tgtgtttaat tgtcttctcc aggactatat aaagtgaagg aaagcagctc ttggtgtcgt 960 ctcaatttca ttgagtgaaa gtagatctgt taca 994 // ID Gypsy136-I_DR repbase; DNA; ZEB; 6588 BP. XX AC chr13; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy136-I_DR; KW Gypsy136-LTR_DR; Gypsy136_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6588 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1533-1533 (2008). XX DR Genome; chr13; Positions 43719068 43712481. XX CC Positions [2837-3376] - Reverse transcriptase CC Positions [4853-5329] - Integrase core CC 'CTTG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 241..1608 FT /product="Gypsy136-I_DR_1p" FT /translation="MSNINRYLADKHTSVPVGRGRGVLGLPIPFPMESPSA FT DNNNNNRPGYTNGVHSTQERNAADMQFNASHRFAEASTSTPHAGPDMLGQM FT TSIVNQIGQQLADNIMSHLSPYSLDTVTPIRHKSDDNKLDSPSGLLNVSQN FT QVIQHRKVKEPPSFRGDSSDTIVIEEWEDLMRTFVKKSNMSANEHVEEILT FT HLRGKAKDVVKFWIRNCDTATTVCPNSVYGLLRKHFSSTHYSPVPLADFYT FT TLPEEGENPYDYWLRLNKAADVASECLREQGKTLDKPSVEIVHMFIRHCPS FT KDLALTFRSKPIDKWSAHEVQAVLNDYQSDLSFKATSTVYRRQSEKVSVNK FT IDINTVPTPALSVCEQQQKSADASALEKVIDMLEKVLLTNTNRNRPRPKPQ FT PYSRLPRIEGFDALPCAICNDAAHSALTHCREHRLCFQCQSPDHSRRFCPR FT WGQSPQPNQEN" FT CDS 2042..6283 FT /product="Gypsy136-I_DR_2p" FT /translation="MILGSNAIKSLLTILKNTDEYWRLISLPSGNAESESL FT EFISLLSNTQRWRGSHIPDKVGTVKLKHCVTLQPHCEHLVWGKLPAGSTTS FT VGSTVLVEPTQSRCSPKRIMVGRVITPLWGDGWIPVKIINPTDSVLTLRRN FT AKIADVSPCTAVEDLPESDPISSHVQHLNSDIVSQESEDEMAKILNEMGLS FT DLDLMSCEVSASWKIKLLHLIKKYDSIFSRNKMDCGEAKDFVHRIRLVDEK FT PFRLPYRRVPPSHYEKLRVALDEMEEKGIIRKSNSEYASPLVLVWKKNGDL FT RICTDFRWLNARTVKDAHPLPHQSDALAALGGNAFFSTMDLTSGFYNVPLH FT EDDRKYTAFSSPFGLHEYNRMPQGLCNSPATFMRMMMSVFGDKNFTSLLCY FT LDDLMVFAPNEQVALERLEMVFSRLKAHNLKLSPSKCHFLRKSVKFLGHVI FT CADGVQTDPGKIKAITDVKAVDLMGADGVTPCHKKIRSFLGMVLYYQHFIE FT RCSAKAKPLFDLISKTDTKGCRGRRKKLNLKNVIKLTPTDWTDKCQNAFET FT LKEDLLTSVTLAHPDFNREFILAIDASFDGLGAVLSQIPEGSTRARPVAFA FT SKTLSRSQINYPAHRLEFLALKWAICDKFHYWLKGRHFTAWTDNNPLTYIL FT TKPRLDACEQRWVAKLASFSFDLKYVPGTKNIVADALSREPFVQSSVTHRL FT VTEPYHSLLNQVNGVVDQVVQDSFRLTTNYQVFSPADVPPDGTGVSSDDQM FT MAGTLSSQDVTAVLDAHCSGGVGEMMGLESIPDIINVDQTTALNASTLSAA FT QEHDNVVRRAIFYVQRHKRPNKHERASESRPVLRLLKNWNKLTIRDGMLFR FT VKRDNHMNKKLYQFIVPDSLKGQVLHGLHDAAGHQGRSRTVSLARQRFFWT FT GMERDIVNHVKTCMRCIVGKTPEPKDRAPLENIHTTEPLELICIDFWTAEQ FT SDNKSVDVLVATDHFTKLAYAFPCKNQSAKQVAYHLWKGIFCVYGFPKRVH FT SDQGPNFESKLIKELLEMAGVKKSHTTPYHPMGNGITERFNRTLGNMIRAL FT PPQSKAKWPQSLQMLTFCYNCTVHETTGFAPFYLMFGRVPRLPIDVMFQHV FT LRNDTVVKYSDFVSRLKCDLSEAAKIAQKHTSTEQARHARIYNRKVKGSPL FT IVGDRVLLANRGERGKRKVADKWESNLYEVMSVRSPINVYRIRDTETNREK FT TVHRNLLLPVNFLYNDDESDLPSQSPSVESTSDLPDDPDSISRTAHWIVQS FT DQHSCPDIDSDSIILSEPQCSLVVQHSDSKADSDTCDSSVSAQQDCEMISN FT IPHLLDEVLVPDVDLDQSPSQHSLTRPISPLSKVVNNPVPDTVSPVICTQP FT EHIQTDMIRTRAGRTVKPPQRLICEMNNQFVLDSSPSVISLLDVVKNLFRG FT " XX SQ Sequence 6588 BP; 1869 A; 1395 C; 1443 G; 1881 T; 0 other; ttggtgccgt gaagttgtga tgccgctgtg ttgctatggg gtgtcttcaa gcaagctgat 60 cactgcttct aaaggactgg aagcatctga catcagttga gcagttcgtc atctacatac 120 agtgctttgc gaataaagga tctcatagac ttttcacaca tttcacacta caggactttc 180 tcacactgct ttttgaaatc aagtttattt atggacaatt ggtgaatcac attgatcaaa 240 atgagtaaca tcaatagata cttagctgat aaacatactt ctgttccggt gggtagaggc 300 agaggggttt taggcctccc aattccattt ccaatggaaa gccccagtgc tgataataat 360 aataataata gacctggtta tactaatggg gtgcactcca cacaagagcg caatgcagct 420 gacatgcagt ttaatgccag tcatagattt gcagaggcta gcacctcaac acctcatgct 480 ggacctgata tgctagggca gatgacttca attgttaatc agattggcca acaattggca 540 gacaatatca tgtctcatct tagcccctat agtttggaca ctgtcacacc tattagacat 600 aaatctgatg ataacaaatt agatagccct tcaggcttgt taaatgtgtc gcaaaaccaa 660 gtgattcaac acagaaaagt caaggagccg ccttccttca gaggggatag ttctgacaca 720 attgtgattg aggaatggga agatctcatg agaacatttg ttaaaaagag caatatgagt 780 gccaacgaac atgtggagga aatccttaca caccttcgtg gtaaagcaaa agatgtagtc 840 aaattctgga tcagaaattg tgatactgcc accactgttt gcccaaattc tgtctacggc 900 cttctgcgca agcatttcag ttctactcat tactcacctg tccctcttgc tgacttttac 960 actactttgc ctgaggaagg agaaaaccct tatgactact ggctgagact taacaaagct 1020 gctgatgtcg cctctgaatg cctgcgagaa caaggtaaaa cacttgacaa gcctagtgtg 1080 gaaatagttc acatgtttat cagacactgc ccgagtaagg acttggcctt aaccttcaga 1140 tccaaaccaa ttgataaatg gtcagctcat gaagttcagg ctgtgcttaa tgactatcaa 1200 tctgatttat cattcaaagc caccagcaca gtgtaccgca gacaaagtga gaaagtttca 1260 gttaacaaaa ttgacattaa tacagttcca acgcctgctc tttctgtttg tgagcaacaa 1320 cagaaaagcg cagatgcctc tgctctagaa aaagtcatcg acatgcttga aaaagtcctg 1380 ctgaccaata ctaacagaaa tcggcctcgt cctaaaccac agccttacag cagacttcct 1440 aggattgaag gtttcgatgc tttgccatgt gctatctgca atgatgctgc tcactctgct 1500 ttaacacatt gtcgtgagca tcggctatgt tttcagtgtc agtctcctga tcattccaga 1560 cggttctgtc ccaggtgggg acaatcacct cagcccaacc aggaaaacta atgggtctgt 1620 gcgcaaggga ggttagtaca gaccacaatg ataatcctcc catatttgct acacaagatt 1680 accagtctat ttttgagcaa agtaaatcta gtgtctcacc tgaaaaaacg gctgtttttc 1740 aaggtataca gcgagtccct agaaatgatt ctttatttta caccactgtt tctgttgaaa 1800 atggaccatc tttcgttgca ttactcgaca gtggttctat ggcttgcact cttagtgagg 1860 cagctgaagc taagttgttg gtgagtactc ccacactgtc aaagcattct gctgatgatg 1920 ttgtaattat tggatgtggc ggtcatcgag tggcaccaac ggcgatgtat gatctgtcca 1980 tgtgtgttta tggatgtaaa atgataatac tgtgctggtt gttccaggcc aaacagatga 2040 gatgatattg ggcagtaacg ccatcaaaag cctgctcact atcctgaaaa atacagatga 2100 gtactggaga ttaatctcac tgccaagcgg taatgcagaa agcgagagtc tggagttcat 2160 ttctttgctt tccaataccc agagatggag aggcagtcac atacctgaca aagtgggtac 2220 tgttaagctc aaacattgtg tgacacttca acctcattgt gaacatcttg tgtggggcaa 2280 gcttcctgct ggttctacaa catctgtggg gagcacggtg cttgtggaac ctacacaatc 2340 cagatgtagc cctaagcgga tcatggttgg tcgagtcatt acacctttat ggggtgatgg 2400 ctggatacct gtgaaaatta tcaaccccac agacagtgtg ttaacactgc gtagaaatgc 2460 aaagattgca gatgtttcac catgcactgc tgttgaggat cttccagaga gtgaccctat 2520 tagctcccat gtccaacatt taaatagtga cattgtttcc caggaatctg aggatgagat 2580 ggcaaagatc ttaaatgaaa tgggattgag tgacttggat ctgatgtctt gtgaagtttc 2640 tgcttcatgg aaaattaaat tactccatct catcaagaaa tatgactcca tattctcaag 2700 gaataaaatg gactgtggtg aagccaaaga ctttgtacat cggattcgtc tagtagatga 2760 gaagcccttt cgtctcccct atcgtcgtgt tcctccaagc cattatgaga aactgagagt 2820 agcgctggat gaaatggaag aaaagggcat aatcagaaaa tctaacagtg agtatgcttc 2880 cccccttgtc ctggtgtgga agaaaaatgg agatctgagg atctgtactg attttcgatg 2940 gctaaatgcc agaactgtaa aagatgctca tcctcttcca caccagtcag atgctttggc 3000 tgctcttgga gggaatgctt tcttctcaac tatggatctc acctcagggt tctataatgt 3060 tccgttacat gaggatgata gaaaatatac tgctttctcg tcaccatttg ggctgcatga 3120 atataacagg atgccgcagg gcctctgcaa tagtccagca acgttcatgc gcatgatgat 3180 gtcagtcttt ggtgacaaga acttcacaag tctattatgc taccttgacg atctcatggt 3240 ctttgctccc aatgagcaag ttgcacttga aaggctagag atggtgtttt caagactcaa 3300 ggctcacaat ctcaagcttt ctccaagtaa atgtcatttc ttgagaaaat ctgttaagtt 3360 tcttggccat gtcatttgtg cagatggtgt ccaaacagac cctgggaaaa taaaggccat 3420 cacagatgtt aaagcagttg atttgatggg tgctgatggt gtcactccat gtcataagaa 3480 aatcagatct ttcttaggta tggtattata ttaccaacat tttatagagc gatgttctgc 3540 taaagccaaa cctttgtttg atctgatttc aaaaacagac acaaaaggct gtagaggcag 3600 gaggaagaaa cttaacttga aaaatgttat caagcttaca cctacagact ggactgacaa 3660 atgccagaat gcatttgaga cattgaagga ggaccttctt acaagcgtaa cgttagcaca 3720 ccctgacttt aaccgtgaat tcattctagc aatagatgcc tcatttgacg gtttaggtgc 3780 tgttctctca caaattcctg agggtagcac tagggccaga cctgtggcat ttgctagcaa 3840 gacattgtca cgttctcaga tcaattaccc agctcacaga ctagaatttc ttgcgcttaa 3900 atgggcgata tgtgacaagt tccattattg gttgaagggc cgccatttta cagcttggac 3960 ggataataat cccctcacct acatacttac caagccaaga ctcgatgcct gcgagcaacg 4020 ctgggttgct aaattagcct cgttcagctt tgacctgaaa tatgtgcctg gcacaaagaa 4080 cattgttgct gacgcgttga gcagggagcc ttttgttcag tcaagtgtga ctcaccgtct 4140 tgtaacggaa ccttaccact ctctgctaaa tcaagtgaat ggtgttgtgg accaagttgt 4200 gcaggattct tttcgactta ccactaatta tcaagtgttc agtcctgctg atgttcctcc 4260 tgatgggact ggagttagca gcgacgacca gatgatggca ggtactttga gctcacaaga 4320 tgtcactgca gtattggatg ctcattgttc tggtggtgtt ggagagatga tgggcttgga 4380 atccattcca gacataatca atgtagatca gactactgca ctcaatgcaa gcactttgtc 4440 agctgcacaa gaacatgata atgtagtgag gagagcaatc ttttatgttc agcgacataa 4500 gcgtcctaac aaacatgaac gtgctagtga gtctcgtcct gtcttaaggc tgcttaagaa 4560 ctggaataaa cttaccatca gggacggcat gcttttcaga gtcaagagag acaatcacat 4620 gaacaaaaag ctatatcaat ttattgtccc agattccctg aaaggacagg ttcttcatgg 4680 tctgcatgat gcggctggcc atcaaggtcg ctccaggaca gtttcgttgg ccagacaaag 4740 atttttctgg acaggaatgg aacgcgacat agtgaatcat gttaagacct gtatgcgatg 4800 tatcgtggga aagacccctg aaccaaaaga tcgtgcccct cttgagaaca tacacaccac 4860 ggagccatta gagctcatct gcattgattt ctggacagca gagcagagtg ataacaagag 4920 tgtggatgtt ttggtggcaa cggaccattt cactaaatta gcctacgctt tcccttgcaa 4980 gaaccagtct gctaagcagg ttgcttatca tttgtggaag ggaattttct gtgtgtatgg 5040 attccctaag cgtgtgcatt cagatcaggg tcctaatttt gagagcaaac tgattaagga 5100 gctactagaa atggcaggtg tcaagaaatc acacaccact ccttaccacc caatggggaa 5160 cggaatcact gaacgcttca acaggacatt aggaaacatg ataagagctt tacctcccca 5220 gtcaaaagcc aagtggccac agtctcttca gatgctaaca ttctgttata attgcactgt 5280 tcacgagacc acaggctttg cgcctttcta tttgatgttt gggagagtcc cacgtcttcc 5340 catcgatgtg atgttccaac atgttctccg aaatgacact gttgttaagt attctgattt 5400 cgtatctaga cttaaatgtg accttagtga ggctgcaaaa attgcacaaa agcacacttc 5460 tactgagcaa gctcggcatg caaggattta caatcggaaa gtgaagggat cgcccttgat 5520 tgttggtgac agagtccttt tggcgaatcg tggtgagcgt ggcaaacgga aagttgcaga 5580 caagtgggag tccaatctct acgaagtgat gtcagtaagg tcaccaatca atgtatatcg 5640 gataagagac actgaaacaa acagagaaaa gactgtccac cgaaatcttt tgctccctgt 5700 taactttctg tacaatgatg atgagtcaga tttaccatct cagtctccct cagttgaatc 5760 aacctcagac ttacctgatg atcctgacag tatttcaagg accgctcact ggattgtgca 5820 gtctgatcag catagctgcc ctgacattga ctctgactca ataattttgt ctgaacctca 5880 gtgttcttta gtcgttcagc attctgattc aaaggcagat tctgatacct gtgatagctc 5940 agtatccgct caacaggact gtgaaatgat aagtaacatt ccacatttgt tggatgaagt 6000 actagttcca gatgtagact tagaccagag tccctcacag cactctttga ctaggccaat 6060 ttcacctctt tcaaaagtag tcaataaccc tgttccggat acagttagcc ctgtaatttg 6120 cactcaacca gaacacattc aaacagacat gattagaacg agggctggta ggaccgttaa 6180 acctccacaa agattgatct gtgagatgaa caatcagttt gtacttgatt catcaccatc 6240 tgtgatttct cttcttgatg tagtgaaaaa tctctttcgt gggtaaatga gttggtttca 6300 ttttgagctg gaattttctc tttagtatag tctgaattta attcaatcct tgaattgcgt 6360 atctaaaggg aggtgtttaa ggcatctctt tagtctgtgt ttagtctgga ttttggtacc 6420 ggtccattct tgcatacctc taatggagga attttgtaat tgatatttac gataaaaagt 6480 gtcaaggata ttcccgtttt tttgtttatt tttttctagt tcttgtgttg ttaagcttta 6540 ttactgtaca ggcactgccc cagtttagca taatttaagg ggggtgta 6588 // ID HARBINGER2_DR repbase; DNA; ZEB; 3727 BP. XX AC Contig:ctg25784.2; XX DT 08-OCT-2003 (Rel. 8.09, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE Autonomous Harbinger-like DNA transposon. XX KW Harbinger; DNA transposon; Transposable Element; KW DNA-binding protein; HARBINGER2N_DR; HARBINGER2_DR; KW Harbinger superfamily; TDR; transposase. XX NM HARBINGER2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3727 RA Kapitonov V.V. and Jurka J.; RT "HARBINGER2_DR, an autonomous Harbinger-like DNA transposon from RT zebrafish."; RL Repbase Reports 3(9), 180-180 (2003). XX DR Genbank; Contig:ctg25784.2; Positions 85260 81534. XX CC HARBINGER2_DR is an autonomous Harbinger-like DNA transposon. CC It is characterized by 13-bp TIRs and the CAG target site CC duplication. The zebrafish genome harbors ~100 HARBINGER2N_DR CC nonautonomous elements derived from HARBINGER2_DR. The CC HARBINGER2N_DR consensus sequence is nearly identical to CC HARBINGER2_DR (only 4 indels, it matches positions 1-603 and CC 3311-3727). CC HARBINGER2_DR encodes two proteins, the 276-aa HARBINGER2_DR-1p CC (2 exons, positions 642-1437 and 1558-1592) and the 368-aa CC HARBINGER2_DR-2p transposase (positions 2140-3246). CC HARBINGER2_DR-1p is similar to Myb-like proteins present CC in different eukaryotes. XX FH Key Location/Qualifiers FT CDS 2140..3246 FT /product="HARBNGER2_DR-2p" FT /note="Transposase" FT /translation="MAALQRVFQLRVRQRERQRQRQRRPRSTLCTINAFIR FT QHQNPLDMLDDMAVIHRYRLPRGEIVQLLNVIGPQLMRATRRNFALSPDVQ FT LLAAVRYYATGSFLQVLGDGLGLSKPSVSRAVQAVTYALLPLAAEHIKFPA FT SRQAMSDIQEYFLTHYHIPQVIGVIDGTLIPISTPSVDGHTYICRKGYPAI FT NCQVICDHNCLITDIVARWPGSTHDSYIFTNSSVGQEAQNSNGHWRLLGDS FT GYPLRPYLFTPVANPVSNSEAHFNEAHRVARSTVERTLGRWKLRFRAIHKS FT SGGLLFVPQKCCAVITVTAMLHNIAVRARVPLDIREEDEEVEEENEVEMRI FT HDDQPRHVQYMAGFGARQQVIDTFF*" FT CDS join(642..1437,1558..1592) FT /product="HARBNGER2_DR-1p" FT /note="similar to Myb-like proteins present in FT different eukaryotes" FT /translation="MAETKKQRKVIFQKEEINIILEEVELQKHIIFSRFKG FT SHTNKEKQKMWDDIATKLTATRGIKRSGNEVRKKWQDFSSLAKRKRALQRT FT TINKTGGGPNDAPILTAEEEKALSILGTTASDGICGGIDLHGGEGLRQPEP FT ESGPPCSQEQSGSPPIDRQLPSPSPPNSPTDQPPSSIVQATATTPRFENIC FT GCSQDLVQLEREKLDVLKDIRQSLKEANEMNYNFQREIIELKKAKMALEER FT RLSLEEMSFARPSISVPIILPDESDPGEQTTNLNQ*" XX SQ Sequence 3727 BP; 1202 A; 697 C; 734 G; 1094 T; 0 other; ggctgcgttt cccgataacg ttgatcttag cacttaagag cgttttctac gagtcatttt 60 gcgaacgttc gttattgttt cacgtgcgtt tcccaaaaat gcacttaaca caattgcacg 120 tagcccagct ttaagtgcaa cttaggagtc gctatccgtt tgttaagtgc tgaaatgtca 180 cgctatagaa tggctcgtta ttgttgcaca tgttatagca atccatataa ttcttcttct 240 acttgtgtga atgtatattc aactcgaata acataaaaaa aaaatatttt tgagccagtt 300 taaaaacata aattaactgg aaatgtcgta ctgtaaaaac actgtcttgc tccaatctcg 360 cataaaacta attctaacag tccttgccgg aaatgacatc agcatcattt tcgatattta 420 tatgaaacct taattaagta gaattattta tcaatttctt tatcaaaaat tgcttatctc 480 acatcaaaat aaataaataa gtaaataaat aaataaataa ataaataagc tcttacattt 540 atttttgaaa agtttagcct aattatcccc acctgatgtg catttgctaa taggacaaaa 600 atataaataa gcacatacct aggggtggag acactagcaa aatggctgaa accaaaaaac 660 aaagaaaagt catttttcaa aaagaggaaa ttaatattat tttagaggag gtggagctac 720 aaaaacatat aattttcagc aggtttaagg gcagtcacac aaataaagaa aaacaaaaaa 780 tgtgggacga tattgctaca aaacttactg caacaagggg gattaaaaga tcagggaatg 840 aggtcaggaa gaagtggcag gacttttcaa gcctagctaa aagaaaaagg gcactgcaga 900 ggacaacaat taataaaaca ggtggtgggc ctaatgacgc ccccattctt acagcagaag 960 aagaaaaggc actgtcaatt cttggaacaa ctgcctcaga tgggatttgt ggtgggattg 1020 acctccatgg aggagaaggg ttgcgtcaac ctgaacctga gtcaggtcca ccatgttcac 1080 aagagcaatc aggttcacct cctattgaca gacagctacc atctcccagt cctcccaaca 1140 gtccaacaga ccagcctcca tcttcaattg ttcaggctac agccacaaca ccgcgatttg 1200 agaatatctg tggttgtagt caggacttgg tacagctgga gcgagagaaa ttagatgttc 1260 taaaagatat cagacaatct cttaaagagg ccaatgagat gaattacaac tttcagaggg 1320 aaatcattga acttaagaag gccaagatgg ctttggaaga gaggagactt tcactggagg 1380 agatgagttt tgcaaggccc tccatttcag tgccaattat cttgccagat gagtcaggta 1440 agattgttca atgttgttta atcatgaata attatatgtt ttaataaaat atgttgtgca 1500 acatgaaata ttcttttcca tatgtacact catcacattt tgccatgtct tttgcagatc 1560 ctggggaaca aactacaaac ttgaatcaat aaaaattgtt taaatgcaat gttttgatgt 1620 ttgttaatcc actcaaagtc taaggctcaa tgaaatacta taaagcagaa gaattcttat 1680 tatattatca tatcataagt gtttcaaatt taaatatata cttgtagtct ctttaaaaaa 1740 agtaataagc atgcaagtcc tatatatttt actacagctc gagtcttgat gtgtgtggca 1800 atatagtatt agggcctatc tattagccta tccatatagg tgcagaaaaa ttcttaatgt 1860 gcacattctt tattagcact gttttatata aatgtaaatc tttgaccatt atgaacctaa 1920 caagagaata aattcaggga agaggtaatt aatgcaattt ctaaaatcaa tgattcagtt 1980 tcttcaagtg gggcctactg caattagtga aatagatgac aatcagtgtg tgacatgtga 2040 atattatcac agcctgaaaa actttaatca tatggttcct gtgagcaatt ttaggatagt 2100 atataaagtt cacacttcag tagtcttgca gagtaaatca tggctgcact acagcgagtt 2160 ttccagctga gagtgaggca aagagaaagg caaagacaaa gacagagacg accacggagt 2220 acactatgta ctataaatgc cttcatcagg cagcatcaaa atcctctgga tatgcttgat 2280 gatatggctg tcattcacag gtaccgtctg ccacgaggag aaatagtcca gctgctcaat 2340 gtcattgggc ctcagctgat gcgtgctaca agaaggaatt ttgccttgtc ccctgatgtg 2400 caactcttag ctgctgtgag atattatgca acaggcagtt ttcttcaggt acttggagat 2460 ggacttggac taagtaaacc atctgtgtcc agagctgtac aagcagtcac ctatgcactg 2520 cttccacttg cagctgaaca catcaaattt ccagcatcaa gacaggccat gtcagacatt 2580 caggagtatt ttctaacaca ttaccatata ccacaagtca ttggagtcat tgatggtact 2640 ttaattccca tcagtacgcc ttctgtggat ggccatacat atatatgccg caaaggttat 2700 ccagcaatca actgccaagt gatctgtgac cataactgtt taatcacaga cattgttgca 2760 aggtggcctg ggagcacaca tgactcctat atctttacca actcatctgt gggtcaagag 2820 gcccaaaact caaacggaca ttggaggctg cttggtgaca gcggatatcc attgaggcca 2880 tatctgttca cccctgttgc taatcctgtc agtaacagtg aggcacattt caacgaggct 2940 caccgtgttg cccgaagtac tgtggagcgc actttaggaa ggtggaagct acgctttcgg 3000 gctattcaca agtccagtgg tggtcttctc tttgtgccac aaaagtgctg tgctgtaata 3060 acagtaacag ctatgttgca caacattgca gtgagggcaa gagtgccctt ggacatcagg 3120 gaggaagatg aagaggtgga ggaagagaat gaagttgaga tgagaattca tgatgatcag 3180 ccaagacatg ttcaatacat ggctgggttt ggggcacgcc agcaagtcat tgacacattt 3240 ttttgagttc ctcctttatt tgtgtcatct ttaaatcaca ttaaatattt tctgttaaat 3300 acaacatatg tagcctacta ccctcgaaat gtccatatac aacagtgtaa tttttttttt 3360 cattctaaca tttattatgt aatttttatt tttttcttta ttcattatta tttgtatatt 3420 gaattgtgta atgtgtagaa aatataaatg aatagagata tacgtacaaa taaaatgaat 3480 aatgactttg aagagaagta aattgtaggt gcaatttgcc ggttgtccag caggtgccct 3540 cataactctg tctccttacg atgcacttaa ggctttacga ttactccaga gcactcgtag 3600 tccactaaga ttttcaagtg ctacttaagt tacgatgctt ttgggaaaca gaccgtaata 3660 ttaagatcag tcgtacgatc atttctacga acttcttagg cttacgatgc ttttgggaaa 3720 cgcagcc 3727 // ID Gypsy141-LTR_DR repbase; DNA; ZEB; 197 BP. XX AC chr16; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy141-I_DR; KW Gypsy141-LTR_DR; Gypsy141_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-197 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1544-1544 (2008). XX DR Genome; chr16; Positions 22754594 22754398. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 197 BP; 68 A; 37 C; 37 G; 55 T; 0 other; tgtaatattc atgtaagcac tgaatatttg cgacactaat ttacgactag tagaagagtt 60 tactagtgat ggttaacctg taatacggca acacgttgtg gaaaaaaagc tggaagttgt 120 tggtactcac tgtccgtctc cttatttaat tactgcacgc cccgtgcaac aaataaagtt 180 aaacacaaaa caagaca 197 // ID Gypsy-170-LTR_DR repbase; DNA; ZEB; 898 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy-170-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-898 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 701-701 (2009). XX DR [1] (Consensus) XX SQ Sequence 898 BP; 134 A; 244 C; 172 G; 348 T; 0 other; tgtcatggtc tgtcacctgc atgttgtcac agtcattcgt gtttgttttc tcgtagcaca 60 tggcattgtt ttgacagcgc gccatgtgct ccactgcctc actccaccca cttatcaggt 120 tactcattag atcattattt cattattcgc acctgtctct agtcttagcc ttatgagtct 180 ccctatttat tctcctctct gtcattgtct tgtaccgggt cgttagtgtg tttacctcac 240 gcggtggcct cttagatcgt gtcctgtctt gtatccattt ttgactccag tgccagtttt 300 ttttgatttc gattttgtgg tttggactgt ttcctgtttt ttcccctttt atcagatcgc 360 cctcgctgct atcagttgtg gcagacgctt ccctctccac ttcctgcttc agccttttct 420 atcccgccca ccagcgatct tctccctctc tggccgcctt gtatcccgcc cgggcgcttt 480 cacttcggcg gctgatactg ctgtgtttcc cgtttgactg gagagttgga ccaccgtgcg 540 cctcctgctg gaagcacttt atattgccct tcacctgagt tacccatttc ctgttttttg 600 actgagagtt ggactgccgt gcgcctcctg ctggaagcac ttagtattgc ctgtaattat 660 accttccgtt tttgcccctt tttgactggc gactttggat cgctgagcgc cctctgttga 720 actttcttta tttatttttt gccagtattt tgtgcttttt ctgtggcatt tacagcaccc 780 tgtgctgaga attttttgtt gtataatatc ccctctgcag gccactttag taataaataa 840 ttctgttttt ccctgcattt gattcctcct gattttttat actgacactt gtatgaca 898 // ID Gypsy47-I_DR repbase; DNA; ZEB; 5069 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy47_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy47-I_DR; KW Gypsy47-LTR_DR; Gypsy47_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5069 RA Dib M.R. and Naveira H.F.; RT "Gypsy47_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 825-825 (2007). XX DR [1] (Consensus) XX CC Gypsy47-I_DR is an internal portion of the Gypsy47_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy47-LTR_DR. CC Gypsy47_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1579-aa polyprotein CC Gypsy47_DR1p (pos. 215-4951) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 215..4951 FT /product="Gypsy47-I_DR_1p" FT /note="ORF." FT /translation="MALFDLQTFINNPTVEQFDRCRKDDLIQIADHFKISV FT VKQALKKNIKSALYERLVELKVLIMPVVEEEETEFLGAGAQSSSADEPDEG FT RNESGAAAALTEGKPTLSPFDPFSPVYSGSKEDGRLKVRLARLQFEAQERA FT QARQAEMELRLEIRRLEVTAEKEVKLRQLELEALKTPPKVAAQSTRAETSV FT APVGDPTSTTFDISKYVSLVPIFRESEVDTYFSAFERIAASFRWPKDSWSL FT LLQCKLVGKAQEVFATLSLEDSLNYDTAKSAILRAYELVPEAYRQKFRNHK FT KLSSQTFVEYAREKGVLFDKWCNASKATDFNSLRELILLEEFKDRLPERVV FT VYLNEQKVMSLSQAAVLADEFVLTHKNVFYPVRSDKNPTVQSVNFPRIPTA FT TSRPKEAIECYYCHKKGHKLSECSTLKRKNPQQPSKSVGFAKSITTNSNDE FT STDPSYTPFLLRGLISLSGNPEDQCEIVLLRDTGASHSFMVENALSLPEHS FT SCGSNILVQGIEMGFVNVPLYRVHLQCDLFTGFVSVGIRPSLPVKGVSFIL FT GNDLAGGKVMPILEVTDHPNLTYCPDELSVNHPDTFPACVMTRAMSRKDDS FT VDLSDSFIAPVFSENVVSPPDKTATQKQQEKSEYFNFDGDALSLPVTRQQI FT IAAQKDDDSLVKCFDSVVDSDQIDKKQIAYFVENGLLMRKWCSNIETDADW FT NNVYQIVVPSAYRQHVLCLAHDHVMSGHLGITKTYNRILKHFFWPCLKKDV FT VQFCRTCHTCQCMGKPNQVIPPAPLFPIPVIGEPFEHVIIDCVGPLPKTKT FT GNQFLLTIMCTATRFPEAIPLRRITAPLIIKSLIKFFSTFGLPKIIQTDQG FT SNFMSKMFAQVLKSLSITHRVSSAYHPQSQGALERFHQTLKSMLKKYCADT FT CKDWDEGVPLVLLAVRETIQESLGFSPAQLVFGHDLRTPLKLLKENILEIK FT ESPKMNVLSYVSRFRERLHDACKLAKESLKAAQNVMKRQYDTRAVARSFQP FT GDQVLVLLPMPGSALSARFSGPYVVQKKLSETDYVIETPDRKRKSRVCHVN FT MLKTYHVKEKTQITTPVLPAVAVVSDVTHSEADDDGLVLRNAPQQCARLTN FT SEMLKNLNSHLTYLSDSQRQDIIHLIQKFPNLFSDTPTQTSVIKHDINIQN FT SSPIKQHPYRVNAHKRAIMRKEVEYLMENGLAVPSCSPWSSPCVLVPKPDG FT TFRFCTDYRKVNAVTVPDCYPLPRMEDCIDNLGSASFVTKLDLLKGYWQIK FT LTPQASAISAFVTPDHFMQYSVMAFGLRNAPATFQRLINTVLFGVPNCNAY FT LDDLVVYSSEWSEHISLLHTVFEKLSNATLTLNLAKCEFARATVTYLGKEV FT GQGQVRPVAAKVSAIAEFPVPKTRRELRRFLGMSGYYRSFCRNFSSVVSPL FT TNLLSPSQQFVWSAECQHAFESIKALLCCAPVLAAPDFTTPFKLEVDASAL FT GAGAVLIQEDRDGIEHPVCYFSRKFNKHQINYSTIEKEALALLLALQYFEV FT YLGSSSFPIMVYTDHNPLVFLSRMYNQNQRLMRWALIVQNYNLEIRHKKGT FT DNIVADALSRCV" XX SQ Sequence 5069 BP; 1419 A; 1097 C; 1061 G; 1492 T; 0 other; ttttgggggc tcgtccggga tcatttaacc catttttaaa caattaaaat tgtttgtatt 60 tatagtttgt ttgagtgtga aagcaacctg ctaatttaca aaggcccact gtaggcccga 120 cccgcggaac gcgctgctca ggtaaacacg cgcacacaca cacacacaca cacatttctc 180 tctgtgttgt ttctggtata agtaaataaa caggatggca ttgtttgatt tacagacatt 240 tattaataat ccaaccgtag aacaattcga ccgctgtaga aaagatgatt taatacagat 300 agcggaccat ttcaaaattt cagtggtaaa acaagctctt aagaaaaata ttaagagtgc 360 cttatacgaa aggttggttg aattaaaggt tctcataatg cctgttgtgg aagaggaaga 420 aaccgagttt cttggtgctg gcgcgcaaag ctcaagtgca gatgagccgg atgaagggcg 480 aaatgaaagc ggtgctgcgg ctgcgctcac tgaaggcaag ccaacactgt cacctttcga 540 tccgttttcc cccgtttatt ctgggtctaa agaggatgga cgtttaaaag tccgcctggc 600 ccgtttacag tttgaggcac aggaaagggc acaagcgcgt caggctgaga tggaactgag 660 actcgaaatc cgcaggcttg aagttacggc cgaaaaagag gtgaagctgc gtcagctgga 720 gttggaggct ttaaaaactc ctcctaaagt ggctgcgcag tccacccgtg ctgagacctc 780 tgttgctccg gtgggcgatc caacctcaac caccttcgat ataagcaaat atgtatctct 840 cgtaccaata tttagggaat ccgaagtcga tacatatttt agcgctttcg agcgcatagc 900 ggcttctttt cgctggccta aagattcatg gtcccttctt ctccagtgca aattggttgg 960 aaaggcacag gaggtttttg ctacactgtc cttagaggac agccttaatt atgatactgc 1020 taaatcagca atcctccgtg cttatgaact cgtacctgaa gcatacagac agaaatttag 1080 aaatcacaaa aagctttcta gccagacttt tgttgagtac gcgcgagaaa aaggcgttct 1140 ttttgataaa tggtgtaatg caagtaaggc aactgacttt aattcgttaa gagaattaat 1200 tttgcttgaa gaatttaaag atagattgcc tgaacgtgta gtggtatacc ttaacgaaca 1260 aaaagttatg tcgttgtcac aggcagctgt gttggcagat gagtttgtgc ttacgcacaa 1320 aaatgtgttc tatccagtcc gatcggataa aaatcccact gttcagtctg tgaattttcc 1380 acgtattccg actgctactt cccgacctaa agaggcaata gagtgttact actgccacaa 1440 gaaagggcac aaattatctg aatgttccac gctgaagcga aaaaatccgc aacagccatc 1500 taaaagtgtt ggttttgcaa aatctattac aaccaacagc aatgatgaga gcactgatcc 1560 gagctacaca ccgttcttgc tcaggggtct aatttcatta tctggcaacc ctgaagatca 1620 gtgtgagatt gtactgctgc gtgatacagg cgcttctcac tcgttcatgg tagaaaatgc 1680 attatctctg cctgaacact catcttgtgg atctaatatt ttagtccaag gcattgagat 1740 gggttttgta aatgtgccac tatatcgcgt tcatttacag tgcgatttat tcactgggtt 1800 tgtgtcagtc ggcattcgtc cttctcttcc tgttaaagga gtttcattta ttctaggtaa 1860 tgatttggcc ggtggtaaag taatgcctat tttagaagtt actgatcacc ccaatcttac 1920 ctattgtcca gacgaattat cagtaaatca cccagatact ttcccagcgt gcgtaatgac 1980 gcgcgccatg tcacggaaag atgattcagt tgacttatca gactctttta ttgcgccagt 2040 attttctgag aatgtagttt ctcctccaga caaaacagct acacagaaac aacaggaaaa 2100 atcagaatat tttaattttg atggtgatgc tctgagtttg cccgttactc gtcagcagat 2160 tattgctgcg cagaaagatg atgattcttt ggtaaagtgc ttcgactccg ttgttgattc 2220 agatcagatt gataaaaagc aaattgcgta tttcgttgaa aatggcttgt tgatgcgcaa 2280 gtggtgctct aatattgaga ctgatgctga ttggaataat gtctatcaaa ttgtcgttcc 2340 ctccgcttac cgccaacatg tattatgtct agcacacgat catgtgatgt caggtcattt 2400 aggtatcacg aaaacataca atcggatact taaacatttc ttttggcctt gtttaaagaa 2460 ggacgtcgtg cagttttgtc gcacctgcca tacgtgccaa tgcatgggga agccaaacca 2520 ggtaattcca cctgctcctc tttttcccat cccagtcata ggagagccat ttgaacatgt 2580 aattattgat tgtgtgggtc cattacctaa aaccaaaaca ggtaatcagt ttttactaac 2640 aataatgtgt actgctactc gatttcccga ggcaataccg ctccgcagaa ttacagctcc 2700 tcttataatc aaatcattga ttaaattttt ctccacattc ggcttaccaa aaatcataca 2760 aactgaccaa ggatcaaatt ttatgtcaaa aatgtttgct caagttctaa aatcattgtc 2820 tatcacccac cgagtatcaa gtgcttatca cccacaaagt caaggtgctc tcgagcgatt 2880 tcaccagacg ctcaagtcta tgttaaagaa atactgtgcg gatacctgca aagactggga 2940 tgagggcgtt ccattggtac tattagcggt gagagaaact atccaagagt ctctcgggtt 3000 tagccccgct cagcttgtat tcggtcacga tttgcgaact cctcttaaac ttctgaaaga 3060 gaacattttg gaaattaagg aaagtccaaa aatgaatgtt cttagttatg taagccgttt 3120 cagagaacgt ttacatgatg cttgtaaatt ggcaaaagaa tcacttaaag ctgcgcaaaa 3180 tgtcatgaag cgtcagtatg acacccgagc ggtcgcgcgc tcctttcagc ccggcgatca 3240 ggtgctcgta ctgttgccaa tgcctggttc tgctctctct gcccgctttt ctggacccta 3300 tgtggtgcag aaaaagttaa gcgagacaga ttacgtaatt gaaacacctg acagaaaacg 3360 caaatcgcgc gtatgtcatg ttaacatgtt aaaaacgtac catgttaaag agaaaactca 3420 gatcaccaca cctgttcttc ctgctgttgc agttgtgagc gatgttacgc attctgaagc 3480 tgatgatgat ggcctggtgt tacgcaatgc tcctcagcag tgcgctcgtt taactaattc 3540 agagatgctg aaaaatctga attcacatct cacgtattta tcagattcac aaaggcaaga 3600 tatcattcat ttaattcaga aattccccaa tcttttcagt gacactccaa cgcaaacatc 3660 cgtaattaaa catgacatca atattcaaaa ttcttcccca ataaagcaac atccttaccg 3720 tgtgaacgca cacaagcgtg ctattatgag gaaggaggtt gaatatttga tggaaaatgg 3780 tcttgcagtt cctagttgca gtccatggag ttcaccgtgc gttctggtac ccaaaccaga 3840 cggaacattt agattctgca cagattaccg taaggttaat gctgtcactg tcccagattg 3900 ctacccttta ccccgtatgg aagattgtat tgacaacctt ggttcagcta gttttgtcac 3960 taaacttgat ttgttaaagg gctactggca aattaaatta actccgcaag cttctgctat 4020 ctcagccttc gttacaccag atcacttcat gcaatattca gttatggcct tcggattgcg 4080 aaacgcacct gccacatttc aaagattaat taatacggtt ttattcggcg ttccaaattg 4140 taacgcatat ttggatgact tagtggttta ttcatctgaa tggtctgaac acatttcttt 4200 gctgcacact gtttttgaaa aactgtccaa cgccacatta acattaaatt tggcaaaatg 4260 tgaatttgct agggcgaccg tcacatacct tggtaaggaa gttggtcaag gtcaagttcg 4320 gccagtcgct gcgaaagtct ccgccatagc cgaatttccc gtacctaaaa ccaggcgaga 4380 gttgcgccgg tttttgggta tgtcggggta ctatcggagt ttctgccgaa acttttccag 4440 tgtggtttca cccctaacga atttacttag tccttctcag caatttgttt ggtcagctga 4500 gtgtcagcat gcttttgaga gcataaaagc tttgctctgc tgcgctcctg tactcgcggc 4560 cccagatttc accactcctt ttaaactgga agtagacgcg agtgctttag gtgctggcgc 4620 ggtacttatt caggaggacc gtgatggcat cgagcaccca gtctgttatt tttccagaaa 4680 attcaacaaa caccaaatta attactccac tatcgaaaaa gaagcacttg cattactatt 4740 ggcattacaa tactttgaag tttaccttgg ttccagttcg tttccaatca tggtttacac 4800 cgatcacaat ccgttagttt ttctctcccg aatgtacaat caaaatcagc gcctcatgcg 4860 ctgggctttg attgtacaga attacaacct ggaaatccgt cacaagaaag gcaccgacaa 4920 tatagtggcc gatgctcttt ccaggtgtgt gtagttggtc gtaattaaac cgcatgcatg 4980 gtttgttttg cgagtttttt tttcttacta tttcacaatt tttttttttt aaacaaacaa 5040 tattatatgt ttgttctcat gggtgggag 5069 // ID DNA-1-2_DR repbase; DNA; ZEB; 1645 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1645 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1210-1210 (2008). XX DR [1] (Consensus) XX CC The element is characterized by 17-bp TIR. Like DNA-1-1_DR, the CC elements insert specifically into CWG, and duplicate the W only. CC Thus, the TSD is one base pair. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1645 BP; 534 A; 306 C; 270 G; 534 T; 1 other; gggggctatt ccataaacca agcttacaga aaaagccagg cttattttgg taagtcaggt 60 ttattaatgg cggattcggt ttcataaatc aaatttacga tagttcaaac ttagttaccc 120 tggcaactta tgctgggaaa ctagcctggt ccggggcagg ctaactctca ggctaagtct 180 taagttcaag cttaatcaaa gtaatgttaa cattcacctg ccataatctg atgccatttt 240 aattcactta acctcttaat aatgattaaa actttatagt cacttaatag caaaaaaaaa 300 ataaaataaa tatataaata tatatatata tatatatata tatatatata tatatatata 360 tatatatata tatatatata tatatatata tatatatata tatatatata ttccaaactt 420 tgcattggta ctgaataatc taatgtatat gcacaaatat attattgtaa agcatcctgt 480 agaaaatatt aatttaaaag agaaatctgt gaggggggtt cttagctctc tctctctctc 540 tctctatctc tctctctata tatatatata tatatatata tatatatata tatatatata 600 tatatatata tatacatttt tttttttttt cggcttagtg cctttattaa tctggggtta 660 ccacagcgca atgaacggcc aacttataca gcatatgatt tatgcagcgg acgccttcac 720 tgggaaacat ccatacacac tcattcacat tcctacacta tggccaattt agcttattca 780 attcacctat agcgcatgtc tttggatttg ggggaaaccg gagcacctgg aggaaaccca 840 cgcgagcacg ggggtaagaa agacactgac taatttaaag aagaagacta catttcttaa 900 ataatgactt taaagctrta aaatgagtgg aataatcagg agatgatccg tgctgtgcag 960 cagcgtcact ttaatgtaaa gatattttcc cacgggccgc gggaaacaag tggcaaaggg 1020 aaagcaacga tttgcagcag attcgggagc aaattaaaaa tcccaccgct tgtgcgtgca 1080 cggctctttg tttattttaa acctatgtgc ttacatttag aacattttgt atgttacagt 1140 gagcctgcaa tgcattaaaa cttttccttg caagcaacaa ttggcagtaa gcgcaaatcc 1200 tctctctctc acgcacatgc gcgcggtcaa gcacaattac tatgttttat aacaaatatt 1260 taatttcatg ttaataggcc taaataatag cctatattta aaaatatatt gctactaatg 1320 tagcctattg aacaatctgc agtttcgcgg ctcactcgct gtgaacagct gtgtctcatt 1380 tcagagctct ttcagttctc gaagttgcca tggtagcaca caacataatc gatgtatcta 1440 tgctcgactt ttaaggtttc acgacagaaa cgtgcacgag cagttatcta gattatttaa 1500 actgaactca gactagtcag atcaaatgat cttcaacttg ctgttatgga accaatttaa 1560 gcgtggacgt ttagttcagc tcattgaagt caggcttttg actttaatcc ccgcttttct 1620 taaccacttt tatgaaatag ccccc 1645 // ID TDR13 repbase; DNA; ZEB; 772 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR13. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-772 RA Jurka J. and Drazkiewicz A.; RT "TDR13: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 23-23 (2002). XX DR [1] (Consensus) XX CC Several hundred copies in the genome. CC Approximately 48 bp TIR. XX SQ Sequence 772 BP; 221 A; 148 C; 213 G; 185 T; 5 other; caacacaggc tcattctgaa aacgtagccc tatatacatt tctggagatc gcgaattatg 60 tagccagaag tatgtatggc tgcatttcgt ctttaaaaac gaacgctacg gggtggtatg 120 atgccgttcc ttttcgcgct taccagctga ccgcttacct ccatatggac ggctttcccg 180 ctgttaccag tttgtccagt agctcgccat gtacgtcggc agacttgaga cgcagagagg 240 aggagttgac cacgacaacg gggttcgagt ccggtgaaga acggttccag aaagcagnar 300 rcagtaagac aaaaacagaa gccaaaaaat aaaataaaca agtaaataac agggtgagaa 360 tgtggtaaaa tctgaaaatg tggtaaaaat caggcaaggg cttttntctt tttctggatt 420 gcttttgaaa acactgtcgg ttgggtttag ggaagtgggt gggtgggtca atcngtgctt 480 ttgaaaacac tattggttgg gtttagggaa ggaggagggt gggtcagtca gtcggtcagt 540 cagtcagtca gtcagtcagt cagtcagaca gcggcctctg gtggatttac gcgagaacag 600 caggcgtgaa tggcactcgc gagagaaatt tgagatctga aaaagcatac acagcggcct 660 ctggtggatt cgcgaaaaca aaaactgcaa aaaaaaaacg tacctcctgg gacgtatttg 720 gcgctctcca gaaatgtata taggggtacg ttttcagaat gagcctgggt tg 772 // ID DNA-AT-3_DR repbase; DNA; ZEB; 2015 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 02-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE non-autonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD AT; KW DNA-AT-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2015 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2154-2154 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2015 BP; 635 A; 350 C; 352 G; 678 T; 0 other; aggccttttt cacacttcct ggttccggta agcgaagcag tgtaccacaa catccggttt 60 caatgcgacg gagcaagaaa gaatggcagg gtcatctcat cgctgtgatt gctgcgttcc 120 cagctgctcc aactcaaaaa cgacactctt actttaatga ccacgatctc ccgaaagacg 180 aaggacaggg aaaatcatgg atgacaatat taaagggggt tgaatgccgt ttcaagttat 240 atgcgggagt acattcgtat gctgtatgca cttcaaagcg gagattatta tgtttcatta 300 aagactggtg gtctatggtt tattttgatg agtaattata atatagacaa aactccagtt 360 aagctatctg ccttccacaa acaagtgctc atcatggtca ttaatctata agcatgattt 420 atccctacat tgttgttatt ttggaataat agacagttca tacaaatgca aatctctttt 480 tttttaaatt ggtttgacag caaaattatt tgtttagtca gttatataat aaagatggta 540 tacttttctt acaatgaagt cctttattac aaaatctcaa tattacaaaa cccagcaagt 600 tttcaactgt tattggttca attcctgcag agacatgtta ttcattcatg ttatttagag 660 gtattaataa gaccatcctt ctgatgcatc gtcttttgat attactaaat cttatgtcgg 720 caaattttgt tttactaaat accaagagta atgatagagc tattcatact cttttccaaa 780 aagaaatcat tactgtacct tatattcaat tttattatat gtctttatta aagggtgaat 840 tgaagaaaag tgtggctgtt accacagaaa tatcatgtta gaaatagaat taaagaaatg 900 tcctataaac tcatccacag gttttatcca gcaaaacatt gtttgcaaaa atttaaaagt 960 gggattgatg taaattgtat ttttgtaacg aatatgagga gacagtttta cacttatttt 1020 ggcattgtac atagtcagct tatggggtga gttgcaaaat gtattaataa cgattctttt 1080 gtgttcaaaa cctctttcat tatggataaa tgtacatttg ggctttactt gattatccta 1140 atctagaggg aaaatagttt tatttattta atttgcttat gttgttaaca aaattgtaca 1200 ttcataaatg taaatttact aatatgaaac cagattttaa ggtattgatg actgaaatat 1260 gcttgtataa taccagtatc aactctctaa acaagaaact cgacccactt taaaatgtgt 1320 gaactctaca atgtatttca ttattacata tgttatatta cattttgtat atttgtattt 1380 tatttataat taccccctgg tatgtacatt tttgtattgt aaaatgttat ttgctgttct 1440 ttaaataaat aataaaaatg aatattatcc acgagtagga gctaacgtgt ttccagactt 1500 gagaaaacct cacgagcatt aaacaacctg taatgtatta accgatagca actcgcctgt 1560 acagctaact tacctcgcca ttcactatat ctgactctga ataaaaagga aggagggggt 1620 tagcacagtt tacagacggg actactgaca taaaactcat cctgctggat gttaaatagc 1680 caggttaatc tagtttacga atggcttgaa tgagcagatt ttatgctgtt ggctgtgcac 1740 cttactggaa cctgacgttg caaacgctgt gtaagacacg ttttctgcac aaactctgca 1800 aatcggtatc atcttttaaa tgctgtttta ttaaggcagt tttacacatt caccgaacaa 1860 catcttcctg tccaaattat ggcaaagttg accgtgagaa agtcatccgc tgtcccgagg 1920 ctgtcatgtc tctgcgactc ctagcgaacc cggaaatatc gatacactgc ttcgaagggc 1980 gcttccggtg ttcaattccg ctgtgaaaaa ggtct 2015 // ID DNA9NNN1_DR repbase; DNA; ZEB; 1163 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE DNA9NNN1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA9NNN1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1163 RA Kapitonov V.V. and Jurka J.; RT "DNA9NNN1_DR, an ancient nonautonomous DNA transposon from RT zebrafish."; RL Repbase Reports 2(5), 10-10 (2002). XX DR [1] (Consensus) XX CC Several thousand copies of DNA9NNN1_DR are expected to populate CC the CC zebrafish genome. They are ~85% divergent from the CC consensus sequence. CC This element is characterized by 9-bp terminal inverted repeats CC and 3-bp target site duplications. CC Its classification is not certain yet, although it is CC expected to be a member of the Harbinger/IS5 superfamily. XX SQ Sequence 1163 BP; 361 A; 231 C; 222 G; 343 T; 6 other; gggctctatt ttaacgatct aggcgcaaag tctaaagcgc atggcgcaaa agcattaagg 60 gcgtgtccga atccactttt gctattttaa ggatggaaaa atatgctttg cgccctggcg 120 catggtctaa cagggttgtg cttattctct taatgagtta tgggtgtgtt ttgagcataa 180 cgtgcattaa accaatcaga gtctcatctc ccattccctt taagagtcag ttgcgtcgcg 240 ccatggcgca tttgctattt acatggcgga ctttgtaagt ggaaaaactg aacgcttcac 300 tagcgagaaa acagttaaac agaccatctg cagcgtgagg ataaagaacg agcctcctcc 360 attcggcctc tttactttca ctttctcttt ctctctttcg tggataagga aacgtgttgt 420 acgcactcca ctgaagacat ccattagcct acatatttaa ttttgtttgt taagcgcaaa 480 gatttgtttc aaaactattt ctaaattcag ttctaatttc cagcaaacga ataaatgaac 540 aataataacg aagtgtggtc aaaaaactga gttatatcca aacacacatg ctgccccata 600 tggtccaaaa cctgacaggt ggacaaatct aagcttgttt ttawtaaaac aaatataaat 660 atgcatataa taaataatac tactaataat aataacatta tacaaaagca aattgtcatg 720 aataaactga aaaaagcccc ccgagatgaa ggcatgaagg cagtggtttt tatatttatg 780 tagaaaataa tttttgtaat attttaatcc tttaattatt tttcatatgt aaagatattt 840 gtgtattgct gtacatcctg tgtgtattaa gcaatgtgta agcgaggcac aactaacgcg 900 ctctgcgctg tagaccggct ttgagttggt ctattgcaca gtctatttta gtttctcaaa 960 atagcaacgc gccaacaatg cgccttaaca cacctctttt ttagaccaga acgcctatga 1020 gtccacaaag tagcgcaaat ggatttgcta tttaaacarc gtggcgcaaa acstgaaaat 1080 tackgttgcg ctgagctgaa aatagcaaca aattacggca aacacgyctt gcgccntatt 1140 gcgccgggtg tatgataggg ccc 1163 // ID DNA-3-2B_DR repbase; DNA; ZEB; 832 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 3-bp; KW DNA-3-2B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-832 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1224-1224 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 832 BP; 274 A; 173 C; 162 G; 223 T; 0 other; ggccgcattt acactgcact gttcaagtga ctcaattccg atttttttct ctcccatgtg 60 gcacagatcg gatatggccc atgtacgtgt aagcaggaac aaatcacatg gattccgatt 120 tactcaaatc agattcaggc cttgttcata tgtggaaatt tatccgatat gaatcggatc 180 tgtgttctcg tgtctgcagt gtaagcaggt agatcggatt ttcacctgtc aatgcgagtc 240 gcgcgtcatt aaaaaccata gcgaatgacg tcaagtctga cactttcatt tcagaacaga 300 cttcagcaga gtcccaaacc ttaaatctca tacacgagga cttaaacagc ttttatattg 360 tcatatagca caggtattta agcatgttaa cgagagcgag agagcgcaat gtccgccacg 420 taaccaatat aaactaatat aaaactagtg catagctaca tcacgtgcat aaatcacgcc 480 atggacatta cattaagatg cctaaaggtt taaaaaaagc ctatatatca aaagaagatg 540 gacaaatgag aacctttctg tcataaacta tagtaaaaca gcttgtcaaa agctgcgaac 600 agattacata caggaataga gaaaagagca ctctttaatt atcagctgtt agtcagcgcc 660 cgctcttttt tttcctggtc attgcgcctt tgccgtgtgc tgtgtaaatg cagacgatcg 720 gatacgagtc acttttaaaa gatgatgtaa gcaggtcatc aaaaaaataa aaaaatcgga 780 tacagtcaca aaatcggaat tgaccatcaa gatctgcagt gtaaatgcag cc 832 // ID CR1-27_DR repbase; DNA; ZEB; 1926 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-27_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1926 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2124-2124 (2008). XX DR [1] (Consensus) XX CC The 5'- part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(3..470,573..764,855..1562) FT /product="CR1-27_DR_1p" FT /translation="VVDKKEIIGIVKKCKNKSSMDWNNIDMNIIKKVIEDI FT VEPLTYICNRSFQSGKFPNKMKIAKVIPLYKSGDKQYFTNYRPVSILSQFS FT KILEKLFVDRLNNFIEMHNLLADEQYGFRKNRSTSLALTEIIEEITNKIDK FT KEKVVGIXIEFXLYKKFKNRQQFVEIGNHKSSYLNVNCGIPQGSVLGPILF FT IIYINDMCKISELFKFILFADRYKYXLFWGVKQRLCYFGKLKSSHNIDLHI FT DNEVIERVYAIKFLGVVLXHKISWKXQINNVISKLVKTVAIVYKARFILDN FT KSMYILYNTLMLPYMSYCVEIWGNTYKSNLQSLCTMQKKVIRLIEHVGYLE FT HTNPLFFKLKILKFNDLVKFKTAQFMFKARNKLLPKCIQKFFLKKDKGVSY FT NLREKQNFRRLKTRTTLKSLCMSVCGVKLWNSLXQTLKQCQDINQFKKXYK FT YILLKEYSDE*" XX SQ Sequence 1926 BP; 776 A; 172 C; 325 G; 635 T; 18 other; aagtagtaga taaaaaggaa ataataggta ttgtaaagaa atgtaaaaat aaatcgtcca 60 tggattggaa taatattgat atgaatataa ttaagaaagt gattgaagac attgtagaac 120 cattaaccta tatttgtaat agatctttcc aatctggtaa atttccaaat aaaatgaaaa 180 ttgcaaaggt aattccactt tataaatctg gagataaaca atactttaca aattataggc 240 cagtttctat actatctcag ttttcaaaaa tattagaaaa actctttgtt gacaggctta 300 ataatttcat agaaatgcat aaccttttag ctgatgarca atatggtttc aggaaaaata 360 gatcaacatc tctggcatta acagaaatta ttgaggaaat tactaataag atagataaaa 420 argagaaggt agttggaatt tktatagaat ttwwtttgta taaaaaattt taaagaaggc 480 atttgataca ataaatcata atatattggt aaataaatta gaaaggtatg gtattagagg 540 agtagccttg gactggataa gcagttattt aaaaaaatag gcaacagttt gtggaaattg 600 gcaaccataa atcttcatat ctaaatgtaa attgtggcat accacaagga tcagttttag 660 gtccaatatt gtttataatt tacattaatg atatgtgtaa gatatcagag ttgtttaagt 720 ttattttatt tgcagataga tacaagtaty ttttgttctg gggataattt accaaaactt 780 atggagttga ttaaaacacg aaattaataa attgaaatta tggtttgacg taaacaaatt 840 gttattaaat ataagtaaaa caaaggttat gttattttgg gaaattaaaa agtagtcata 900 atatagattt acatattgat aatgaagtta tagaaagagt atatgcaatt aagtttcttg 960 gtgttgtatt gratcataaa attagttgga aaycgcaaat aaataatgtg atatcaaaat 1020 tagtaaaaac tgtagcaata gtgtataaag caagatttat cctggacaat aaatcaatgt 1080 atatattata taacacactt atgttgccat atatgagtta ttgtgttgaa atttggggga 1140 atacatataa atctaattta caatccttat gtacaatgca gaaaaaagta attagactaa 1200 tagaacatgt ggggtatttg gagcatacaa atcctttatt ttttaaattr aagatactta 1260 agttcaatga tttggttaaa tttaaaacag cacaatttat gtttaaagct aggaataaat 1320 tacttccgaa atgcatacaa aagttttttt tgaagaaaga caagggggta tcatataatc 1380 tgagagaaaa acaaaatttt aggaggttga aaacaaggac aactttaaaa agtttgtgta 1440 tgtctgtatg tggtgtaaaa ttatggaaca gtctayatca gactctcaaa caatgtcagg 1500 atattaatca atttaaaaag ntgtataaat atatattatt aaaggaatat agtgatgaat 1560 agaggtgatt gaaaaaggat aaaatgagaa aggtatgatg aaatttawtt aatggctatt 1620 tgcagtacta tgtwtttttt atttttgggt ctatgttaca aaatgtaaag gtacaaatgg 1680 aaycaaacat ataatatgtc aaagatgcca aaaggatmaa atatgtctca tgtaaaaaag 1740 aagagatrag taaaaagaag taataagttt aaktaataga cgaatgatgt atgtggtaat 1800 ggggtggraa aataataagc ttgtgcttca tcccactcca ttttcgacca tgttgaaatg 1860 tattttttgt taatgatatt ttatgtatga tattttgttg aaaataaata aatcaaatca 1920 aatcaa 1926 // ID Copia4-LTR_DR repbase; DNA; ZEB; 265 BP. XX AC chr15; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia4_DR; KW Copia4-I_DR; Copia4-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-265 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1591-1591 (2008). XX DR Genome; chr15; Positions 7901070 7900806. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 265 BP; 72 A; 47 C; 58 G; 88 T; 0 other; tgttaaagta ctggcactct tgctcattgt attaatataa tcactgaagg gttcataatt 60 atgtttcagt gtattacggt actgtagcga tgacgtcatg ttgtttacgt gctttacgtc 120 accacgttgt actgttgttg tacggaagtt ttaccgtctc gtgttcatgc tgtacagaga 180 gagcatgtaa aagtgagtaa taaaagacgc tgataaaaac gagtcgttct gtttagtgct 240 taagctacgc actccaagct caaca 265 // ID ERV1-N6-LTR_DR repbase; DNA; ZEB; 439 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, long terminal DE repeat - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW LTR; ERV1-N6-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-439 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2174-2174 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 439 BP; 139 A; 82 C; 95 G; 123 T; 0 other; tgtaagaaat gagtctgcct cgacccaact tccactctaa atggtgaatt gatgctcatt 60 tcccaaaagg gctgacttat gagagaaaag agaagtcaac aaggtcttgt ctctgcattg 120 tttgaggtga ctacatctca accaaatggt aacaattgta gaatggggat gggtgtggtg 180 cacaaagaac catctcatca atggtgttga aactgcatta acatggagag atagggagaa 240 cagctgggtc aacagccaag gagcaacttc attaacatct aaaggcaaca tcaaaaacag 300 tataaaggtt tgagtttaag aatgttcagg gttcattctg actccggtga acccaaggta 360 ctgccatgta ttttgagact gctatgctga ataaacatct tcattgagat tttcttcgtc 420 atcatcattt tttcttaca 439 // ID Gypsy-167-I_DR repbase; DNA; ZEB; 4990 BP. XX AC . XX DT 12-MAR-2009 (Rel. 14.03, Created) DT 12-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-167-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4990 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 694-694 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 175..1107 FT /product="Gypsy-167-I_DR_1p" FT /translation="MEVFSVQNFIKNPTVEQISRCRKQDLLLVADHYQIEV FT SKHSRKAELRDCILENLYEKNVLPVEKVSENVDKVILDDVAVEHLSVGAEE FT DLQGQAKVDAEAKAGLPPFDPFSPMSSVSRSDARLKVRLARLQLEAHDKEQ FT ARQAEVNLRLEIRKLEIEAETQIKLRELDLEAMKISSCSAPPSTSTPATAL FT PPTATDVSTPTDTAPHAFDIGKHIVLVPQFRESEVDSYFTAFERIATSLHW FT PKDVWSLLLQCKLLGKAQEVCSTLSLEDSLKYDCVKSAILRAYELVPEAYR FT QRFRGCRKNGTQTFVEFA*" FT CDS join(1108..2910,2867..4942) FT /product="Gypsy-167-I_DR_2p" FT /translation="EKRLLFDKWCTASKVDDMESLQELMLLEEFKNCLPDR FT VVVYLNEQKVNSLSHAAVLADEFILTHKNVFAPVRSDKSQIAQSQPSRSKK FT STVPEERACFYCRKQGHVISDCLVLKRKQQSQVKSVGFLNTRTAFKIPDNN FT KNNDIDDKYKPFILEGFISLSENCDERVKVKILRDTGALQSFIVADKQLSG FT DTYCGSNVIVQGIEMTCITAPLHRIHLTSDLCTGFVRVAVRSSLPVKGIDF FT ILGNDLAGGKVTPVLEVLDKPDVSIQLDSCSDPEIFPACAVTRAQSRKMGN FT VVELADSFFSPFLAGENFTIEDSLENQAGKANVDVTDNNLLQLPVSRERIT FT MAQKEDQSLVSLINGAVSGVENKKAKSTFFMDKELLMRKWSLSADDDLEWD FT AVYQIAVPTPYRQHVLYLAHDHQFAGHLGITKTYDRILRHFFWPGLKKDVV FT KYCRACHTCQVVGKPNQIIPPAPLIPIPVVSEPFQHILVDCVGPLPKTRSG FT NQFLLTIMCVATRFPEAIPLRKITAPVIIKALIKLFSTFGLPQVIQTDQGT FT NFLSKLFEQTLQSLAIKYRVSSAYHPESQGALERFHQTLKSMLRKITEYNE FT RLGSLCCEKLLSTTKDWDEGVPLMLFATRETVQESLGFSPAELVFAHQVRG FT PLKVLKERMQSVDSSAKTNILDYVSKFRERLHTACTLAKDALTTAQKGMKR FT KYDKKVVVRSFTAGDKVLVLLPVSGSSLSARFSGPYIVQKKISETDYIILT FT PDRKRKTRVCHINMLKAYHAKETLGKDSSVSNTEPAVPTITVTQSSFEENR FT DADDDDVVLRHTQQCVRLKNSDIFSDLNSYLKHLSCDQRKDIIALIFEFNG FT VFHDVPTQTNVLKHDIDVGTAHPIKQHAYRSNMNKRSLMRKEVNYLLENNL FT AKPSHSPWSSPCLLVPNTDGTYRFCTDYRKVNSVIVADSYPLPRMEDCVDN FT VGTARFVTKLDMLKGYWQIPLTDKASDISAFVTPDSFAQYTSMAFGLRNAP FT ATFQHLVNLVLGDLPNCSAYLDDVVVYSMTWCEHISLLKAVFTRLLKASLT FT LNLAKCEFGQATVTYLGKEVGQGKVRPVEAKVVAILQFPVPTTRRELRRFL FT GMAGYYRNFCKNFSTVVNPLTSLLSPSTPYAWSDACQNAFDCVKTLLCSTP FT VLSAPDFAAPFKLEIDASAVGTGAVLIQEDGNGIDHPISYFSRKFNKHQLN FT YSTIEKETLALLFALQHFEVYVGSSNLPVLVYTDYNPLVFLSRMYNHNQRL FT MRWSLLVQNYNLEICHKKGIDNVMADALSRV*" XX SQ Sequence 4990 BP; 1449 A; 931 C; 1099 G; 1511 T; 0 other; atttaattgg gggctcatcc gggatatcta aatactaatt tgacatgctt tagagaattt 60 tagagaccgg aagttatgat tatttgtgtt tggtggtgac aattgtgcta gagtgagtag 120 tgcgtatatt tcaggcagag aaatatcgta atactagttt gccttatcta ggtaatggag 180 gtgtttagtg tacaaaactt tataaaaaac ccaacagtag agcagattag cagatgcaga 240 aagcaggact tattgctggt ggctgaccac taccagattg aagtaagtaa acacagtcgt 300 aaggcagaac taagggattg catacttgag aatttgtatg aaaaaaatgt gctgccagtg 360 gaaaaggtga gtgagaatgt tgataaagta attcttgatg acgtggcggt cgaacatctt 420 agtgtgggtg ctgaggagga tctgcagggt caagctaagg ttgatgcgga ggcaaaggct 480 ggattgcctc cctttgatcc attctcccca atgtcctctg tttcaaggag tgatgcgaga 540 ctgaaagtcc gtttggcacg cttgcaactg gaagctcatg ataaagagca agctcggcag 600 gctgaagtta atttgcgttt ggaaatacgc aaacttgaga ttgaggctga aacgcagatc 660 aagctacggg aactagatct ggaggcgatg aagatttcct cctgttctgc tccaccgtct 720 acctcgaccc ctgccactgc cttgcccccc actgccactg atgtttcaac tccgactgat 780 actgctccac atgcctttga cattggtaag cacattgttt tggtaccaca attcagagaa 840 tcagaagtag attcatactt tactgcattt gagagaatag ctacttcttt gcactggcct 900 aaggatgttt ggtcattgtt attgcaatgt aaattgcttg gtaaagctca ggaggtatgc 960 tcaactttat ctttagaaga tagtttaaag tatgattgtg tgaagtctgc aattctaaga 1020 gcttatgagc tagttccaga agcttacaga cagcgtttta gaggttgtag aaagaatggt 1080 acacagactt ttgttgaatt tgcgtgagag aaaaggctac tatttgataa atggtgcaca 1140 gctagcaaag tggatgatat ggagtcactg caggaactaa tgttgttaga ggagtttaaa 1200 aattgcttgc ccgaccgtgt tgttgtgtac ttgaatgaac agaaagttaa ttctctttca 1260 catgcagctg ttttggctga tgagttcatt ttaacacaca aaaatgtctt tgcaccagta 1320 cgttcagaca aatcacagat tgctcaatct caacctagtc gttctaaaaa gagcaccgtt 1380 cctgaggaac gcgcctgttt ttattgtcga aaacaaggtc atgtcatttc cgattgcttg 1440 gttttaaaaa gaaaacaaca atcacaagtt aaaagcgtag ggtttttaaa cactcgcact 1500 gcttttaaga ttcctgacaa taacaaaaac aatgacattg atgacaagta caaacctttc 1560 attttagaag gattcatttc cttatcagaa aattgtgatg aaagggtgaa agtgaaaata 1620 cttagggaca ctggtgcgct gcagtctttt attgtagcag acaaacagtt gtctggcgac 1680 acctactgtg gatctaatgt aatagttcaa ggaattgaga tgacttgtat tacagcaccg 1740 ctgcatcgta tacacttaac aagtgatttg tgtacaggat ttgttcgagt tgcagttcgg 1800 tcgtcactac ctgtaaaagg cattgacttc atccttggca atgatttggc tggaggaaag 1860 gtaacgcctg tgttggaggt tctagataag ccagatgtaa gtatccaatt agactcctgt 1920 tcagatccag aaatttttcc cgcatgcgca gtcacgcgcg cacagtcacg caagatggga 1980 aatgtggttg aattggctga ttcttttttc tctccttttt tggctggtga aaatttcact 2040 attgaggatt cattggaaaa tcaggcagga aaggctaatg ttgatgtgac tgacaataat 2100 ctgttacagt tacctgtgtc tcgtgagaga attactatgg ctcaaaaaga agaccagtct 2160 cttgtgtcac ttattaatgg tgctgtttca ggtgttgaga acaaaaaggc aaagagtaca 2220 tttttcatgg acaaagagtt attgatgcga aaatggtctc tgagtgcgga tgatgatctg 2280 gagtgggatg cagtgtatca aattgctgtt ccaactcctt accgccagca tgtgttatat 2340 cttgcacatg atcatcagtt tgctggtcat ttagggatca caaaaactta cgataggatt 2400 ctaaggcact tcttttggcc tggtttgaag aaagatgtgg tgaagtattg tcgcgcttgc 2460 cacacatgcc aagtagtagg gaagccgaat caaataattc cacctgcacc cttgattcct 2520 attccagtag tgagtgaacc atttcagcac atacttgtgg attgtgttgg accgttgcct 2580 aagacaaggt caggtaatca gtttttgctt acaataatgt gtgttgcaac tagatttcca 2640 gaggctatac ctcttagaaa aatcacagca cctgtaatca tcaaagcact aatcaagctt 2700 ttctctacct ttggtttacc tcaggttatc cagactgatc agggaacaaa ttttctctcc 2760 aaactttttg aacagacttt acagtcactt gcgatcaaat atcgagtgtc aagtgcatac 2820 catcctgaaa gccagggtgc acttgaacgc ttccatcaga cgttaaagtc tatgttgcga 2880 aaaattactg agtacaacga aagattggga tgaaggtgtt cctctgatgt tgtttgctac 2940 tcgagaaacg gtacaagaat ctcttggatt cagtcctgca gaactggttt ttgctcatca 3000 ggtaagaggt cctttaaaag tcttgaaaga acgaatgcaa tcggttgatt ccagtgcaaa 3060 aacgaacata ctagattatg tcagtaaatt tcgtgaacgc ttacacactg cgtgtaccct 3120 tgctaaagat gcactaacta ctgcacagaa aggcatgaaa cgtaaatatg ataagaaagt 3180 ggttgtacga tcattcactg cgggtgataa agtgttagta ctcctaccgg tgtctggctc 3240 ttcactttcg gcacgttttt caggtcccta catagttcaa aagaaaatca gcgaaacaga 3300 ttacattatt cttacgcctg atcggaaacg taaaactcgt gtgtgtcaca taaatatgct 3360 taaagcgtat catgcaaaag agacactcgg aaaagactcg tccgtgtcaa acactgaacc 3420 tgctgttcct accattacag tcacgcagtc atcttttgaa gagaatcggg atgctgatga 3480 tgatgatgta gtacttcgtc acactcaaca atgtgtgagg ctgaaaaatt cagacatttt 3540 cagtgatctc aattcatatc tgaagcatct gtcttgtgac cagagaaaag atattattgc 3600 tttaatattt gaattcaatg gtgtgtttca cgatgttcca acacaaacaa atgtgttgaa 3660 acatgatatt gatgtgggta cagcccatcc tattaagcag catgcttatc gttcaaacat 3720 gaataaacga tcattgatgc gcaaggaagt aaactatttg ttagaaaata atttggcaaa 3780 acctagtcat agtccgtgga gttcgccatg tctgttagta ccaaatactg atggtacgta 3840 cagattttgt acagattata gaaaggttaa ctctgttatt gttgcggata gttacccgtt 3900 accacgaatg gaagattgtg tagacaatgt tggtacagca cgttttgtta caaaattgga 3960 catgctcaaa ggctactggc aaattcctct tactgacaaa gcttcagata tttctgcatt 4020 tgttacacct gactcgtttg cacaatatac atctatggcc tttggacttc gaaatgcacc 4080 cgcaacattt caacatcttg taaacttagt tctaggtgat ttaccaaatt gtagtgctta 4140 ccttgatgat gtggtagtct attctatgac ttggtgtgaa cacatttcgt tgttaaaagc 4200 agtttttaca agactgttaa aggcatctct tacgttgaac ttggccaagt gcgaatttgg 4260 tcaagccaca gtcacatatc ttggaaagga ggttggacag ggaaaagtac gtccagtaga 4320 agctaaagtg gttgctatat tgcaatttcc agttcctacc actcgccgag aactgcgacg 4380 gttcttggga atggctgggt actatcgaaa tttctgcaag aatttttcta cagtagttaa 4440 ccctttaacg tctttgctta gtccctctac accatacgct tggtctgatg cttgtcaaaa 4500 tgcttttgac tgtgtgaaaa ctctgttatg tagcacacct gttctttctg ctccagattt 4560 cgctgcacca tttaaactag aaattgatgc tagtgctgtt ggtactggtg ctgtactgat 4620 ccaagaggat ggaaatggca tagatcatcc gataagttac ttttctcgaa agtttaataa 4680 gcatcagctt aattactcta ctattgagaa agaaactctt gctttgttgt tcgccttaca 4740 acattttgaa gtttatgttg gctccagtaa cttgcctgtg cttgtatata cagattacaa 4800 tccccttgtg ttcttatcga ggatgtacaa tcacaatcaa cgtttgatga gatggtcttt 4860 attagtgcag aactataact tggagatttg tcataagaaa ggcattgaca acgtcatggc 4920 agacgccctt tctagagtgt aaagtctcga actttacgtt acaaacgggg tttgttttta 4980 agtgccaggg 4990 // ID HERODr repbase; DNA; ZEB; 3388 BP. XX AC . XX DT 26-MAY-2009 (Rel. 14.06, Created) DT 26-MAY-2009 (Rel. 14.06, Last updated, Version 1) XX DE HERODr is a family of HERO non-LTR retrotransposons - a DE consensus. XX KW Hero; Non-LTR Retrotransposon; Transposable Element; HERODr. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bouneau L., Fischer C., Ozouf-Costaz C., Froschauer A., RA Jaillon O., Coutanceau J.P., Korting C., Weissenbach J. et al.; RT "An active non-LTR retrotransposon with tandem structure in the RT compact genome of the pufferfish Tetraodon nigroviridis."; RL Genome Res 13(7), 1686-1695 (2003). XX RN [2] RP 112-1444 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol Biol Evol 21(2), 207-217 (2004). XX RN [3] RP 1-3388 RA Kapitonov V.V. and Jurka J.; RT "non-LTR retrotransposons HERODr - a consensus sequence."; RL Direct Submission to Repbase Update (26-MAY-2009). XX DR [3] (Consensus) XX CC This is a young family of HERO non-LTR retrotransposons. The CC consensus sequence was built based on multiple alignment of CC several copies less than 2% divergent from each other. XX FH Key Location/Qualifiers FT CDS 13..3234 FT /product="HERODr_1p" FT /note="Contains the RT and REL domains." FT /translation="MTTHRAEVTTSGKTQEEPGPEATHSAQSLLVSPTPAA FT GRSPATQSCPQVTAAHNSPQSPQSQQVAVTRSDCVPLAQPRIQWPQSSKKA FT EWLQFDKDVNQILEVTGKGGVDQRLSTMTTLIVNIAAERFGTVTPKPTPST FT YTPSHRVKEIKRLRKELKLLKRQYKAAGEVERAGLEDLRGILRKQLVNLCR FT AEYHRKRRRERARKRAAFLANPFKLTKQLLGQKRTGKLTCSKEAINNHLKA FT TYSDPNREQPLGPCGALLTPPEPTSEFNMKEPCRSEVEEVVRRARSSSAPG FT PSGVPYKVYKNCPKLLHRLWKALKVIWRRGKIAQPWRYAEGVYIPKEEKSE FT NIDQFRVISLLSVESKIFFSIVAKRLSNFLLSNKYIDTSMQKGGIPGVPGC FT LEHTGVVTQLIREAREGRGDLAVLWLDLTNAYGSIPHKLVEVALEKHHVPQ FT KVKDLIIDYYSKFSLRVSSGQLTSDWHQLEVGIITGCTISVTLFALAMNMM FT VKAAETECRGPLSKSGVRQPPIRAFMDDLTVTTTSVPGARWILQGLERLVA FT WARMSFKPAKSRSLVLRKGKVRDEFRFRLGQHQIPSVTERPVKSLGKAFNC FT SLNDRDSIRETSTAMEAWLKAVDKSGLPGRFKAWVYQHGILPRLLWPLLIY FT EVPMTVVEGFEQKVSSYLRRWLGLPRSLSNIALYGNTNKLKLPFGSVREEF FT IVARTREHLQYSGSRDAKVSGAGIVIRTGRKWRAAEAVEQAETRLKHKAIL FT GAVAQGRAGLGSLAATRYDSASGRERQRLVQEEVRASVEEERTSRAVAMRQ FT QGAWMKWEQAMERNVTWKDIWTWNPLRIRFLIQGVYDVLPSPSNLYIWGRV FT ETPACPLCSKPGTLEHILSSCSKALGEGRYRWRHDQVLKSIAEAISKGIKD FT SRYRQATAKVIQFIKEGQRPERTAKNCSAGLLSTARDWVMTVDLERQLKIP FT PHITQSTLRPDIILVSEATKQLILLELTVPWEERMEEAQERKRGKYQELVE FT QCRANGWRTRCMPVEVGSRGFASYTLSKAYGTLGITGTNRRRALSNNVEAA FT EKASRWLWLKRGEQWGQ" XX SQ Sequence 3388 BP; 977 A; 819 C; 984 G; 608 T; 0 other; aaagcagtag agatgacgac acatcgcgca gaagttacaa cttctggtaa gacgcaggag 60 gagccaggcc cggaggcaac ccacagtgcc cagagcctcc tagtgtcgcc aacacctgct 120 gccggccgct cgcctgctac tcaaagctgc cctcaagtga cagcagctca taacagtcca 180 caaagccccc aaagtcagca agtggcagtt acaagatctg actgtgttcc cttggcacag 240 ccaagaatcc agtggcccca atcctcaaag aaagctgagt ggctccagtt cgacaaggac 300 gtgaatcaga tcctggaagt gacaggcaag gggggtgtgg accagcgact gtcaacaatg 360 accacgctca tagtgaacat tgcagctgag cgattcggaa ctgtgacacc caaacccact 420 ccatcgacat atactccaag ccacagagta aaggaaatca aacgtctcag gaaagaactt 480 aagctactaa agaggcagta caaggcagca ggggaagtag aaagagcggg cctagaagat 540 ctgagaggaa tcctgaggaa acagctcgtg aacctatgta gggcagagta tcacaggaag 600 aggcggagag agagagcaag gaaaagggca gcatttttgg ccaacccttt caagttgacc 660 aagcagctcc ttggccaaaa gaggactggc aaactcacct gctccaagga ggctatcaac 720 aatcacctca aggccactta ttctgacccg aatagagaac aacccctggg gccttgcggt 780 gcactgctga caccacctga gcccacatca gagttcaaca tgaaggaacc ctgccggagt 840 gaagtagagg aagtggtgag gagagcaagg tcaagctcag caccaggccc aagcggagtg 900 ccttacaagg tatataagaa ctgcccaaag cttctacaca ggctctggaa ggccctgaaa 960 gtcatatgga gaagagggaa gattgcccag ccatggaggt atgcggaggg agtgtacatc 1020 ccaaaagagg agaagtcgga gaacatcgac cagtttcgag tcatctcctt gctcagtgtg 1080 gagagcaaaa tattcttcag cattgtggcc aaaagactct ccaacttcct attgagcaat 1140 aaatacatcg acacgtctat gcagaaggga ggcataccag gagtcccagg ctgcctggaa 1200 cacacaggcg tggtaactca gctcattagg gaggcaagag aaggcagggg ggacctggct 1260 gtgttgtggt tggatctcac caatgcctat ggctcaatac cccacaagct ggtggaggtc 1320 gcactggaga aacatcatgt accccagaag gtgaaagacc tcatcatcga ctattacagc 1380 aagttcagct tgagagtctc ctctggccag ttaacatcag attggcacca gcttgaggta 1440 ggaataatca ctggttgcac catctcagtg accctctttg cactggcaat gaacatgatg 1500 gtcaaagcag ctgagacaga gtgcagaggc cccctcagca agtccggagt aaggcaacct 1560 cccatcagag ccttcatgga cgacctcaca gtgacaacaa cgtcggtacc aggagcaaga 1620 tggatcctcc aagggttgga gaggctcgtg gcatgggcac gcatgagctt caaacctgca 1680 aaatccagat ccttggtgct taggaaaggc aaagtcagag atgagttccg cttcaggctg 1740 ggacaacacc aaatcccatc agtcactgag agaccagtaa agagtctcgg gaaggccttt 1800 aactgtagcc tcaatgacag agactccatc agggaaacca gcactgccat ggaggcttgg 1860 ttgaaagcag tggataaatc agggctccct ggaagattta aggcttgggt ttaccaacat 1920 ggaatccttc caagactcct ctggcccttg ctaatctatg aggtccccat gactgtggtt 1980 gaaggttttg aacaaaaggt gagcagctat ctacgcagat ggctgggatt gccacgcagc 2040 ctaagtaaca tcgctctgta tgggaacacc aacaagctca aacttccttt tggctcagtc 2100 agggaggagt tcattgtggc acggacacga gaacatctgc agtactctgg atccagagat 2160 gcgaaagtgt ccggggcagg gattgtcatc aggacaggga gaaagtggag ggcagcagag 2220 gcagtcgaac aagcggaaac ccggctgaag cacaaggcca tcctgggggc agtagcacaa 2280 ggcagagctg gacttgggag cctagcagca acccgatacg actcggccag tgggagggag 2340 aggcagaggc tggtgcagga ggaggtgcgt gcttcagttg aggaggagag aaccagcaga 2400 gcagtggcca tgcggcaaca aggtgcctgg atgaagtggg agcaggcgat ggagcggaat 2460 gtcacctgga aggacatctg gacatggaac cccctgagaa tcaggttctt gatccaaggg 2520 gtctacgacg ttcttcccag cccatcgaac ctgtacatat ggggcagagt agagacacct 2580 gcatgcccgc tgtgttccaa gccagggaca ctagaacata ttttgagcag ctgttccaag 2640 gcactaggtg aaggtcggta tcgatggaga cacgatcagg tccttaaatc cattgctgag 2700 gcaatcagca aggggatcaa ggacagtcga taccgccaag ccacggccaa ggtcattcag 2760 ttcatcaagg aaggacaaag gccagagaga acagcaaaga actgctctgc tgggttgctc 2820 tccacggccc gagactgggt gatgacagtt gatcttgaga ggcagctaaa gattccacca 2880 cacatcaccc agtctacgtt gagacctgac ataatcttgg tctctgaggc cacaaagcaa 2940 ttaatcctgc tggagctgac ggtgccctgg gaggagagga tggaggaggc tcaggagaga 3000 aagaggggaa aatatcagga gctagtggag caatgtaggg cgaatggatg gaggaccagg 3060 tgcatgccag tggaagtggg cagtagggga tttgccagct acaccctgag caaggcctat 3120 ggtacactgg gaataacagg cacaaaccga agaagagccc taagcaacaa cgtggaagca 3180 gcggaaaaag catccagatg gctctggttg aagagggggg aacagtgggg gcagtagcat 3240 gccacttgga cacaggccgg ggtctgatca gcctcggtcg ggtcgcctgg aggagggtgt 3300 ctgttgcaag acccgaaaca ccctgtgagc ccaggaaaca acactgatga tgtgtccaag 3360 gttgtgcatc aggagatgtt tctgtaac 3388 // ID Gypsy-14-LTR_DR repbase; DNA; ZEB; 1295 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-14_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-14-I_DR; Gypsy-14-LTR_DR; Gypsy-14_DR; KW LTR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1295 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-14_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 2-2 (2005). XX DR [1] (Consensus) XX CC Gypsy-14-LTR_DR is a long terminal repeat of the Gypsy-14_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-14-I_DR. XX SQ Sequence 1295 BP; 402 A; 171 C; 250 G; 471 T; 1 other; tgtaacccag attactgaag ggttatattt gtacatttat aaaataatat aaatgatcag 60 gggataaaac aacatataac tgtggattca ttgttctaaa atgttcttaa taagaaataa 120 attgtttaaa aataataatg tgaaatattc acatgttatt ggtttagcca atcagtgatg 180 tgtgggaggg atttgcgctt gctgacgcag tgggaagaga tcgccagact gtggaggaag 240 agagagtgag atcgctagat ttgatgattt aaactgtgaa atcagattat aactttgaaa 300 ctagtttgag aaacttacaa aaagctttca tagagaagta tttgaagagg attcagtcgt 360 taaattgttc ctgtgacgat cgatgggaaa attgcatcat tgtgttgttg angagtgaat 420 atatggacat ggtgagttta attactttgt aaacactctg aatttaatgt taaatgacaa 480 acaaacacag tttgtagaga aacaaaagtt tattttgctg ttgtgtgata tatactgtat 540 ttagttgtct ccacgtgtta ttatttccct tcacgtgtat gttcacatgc atatcgcctt 600 gtatacatgc ctaaaaaaaa gagatacagg ttaattgtaa tgattataat gttatttgct 660 tgtatgtgga aatacttacc tgatataatg atatcattga ctaaatgtag tcatgctttt 720 ggccttttgt acaggtcagg tttttttctt tggtgttttt gattgaagac tgattcacat 780 tgatggcggg ccaacctatt cggacactag aaaaggaaat tggaaattca ccctactgaa 840 ccaggagagt actttaatac tttttatttt tccagctcaa ggtaaaagac tgtttttttc 900 ccgtttctgg atagagactg ttcgtgtttt tttccctgag aactgattaa gaaaaaagac 960 taataccagc gcacggacac ttggaaatca aactattttc cttttgaagt gaaatgagaa 1020 atactgagtt tgttgattct gaacaagtgg ttttctttac attttcaatg taatttcaat 1080 tgaactttat tttgttttat tgtgttttaa ttgttttatt gtattttgtt aattttattt 1140 tgggggaaag atctgcagaa aaatatatct ataaacattt caaatactta cctatatagt 1200 ggtttctgtc atttttcagc aaatgctcat tacctccagt agtcgaatcc taggcttctg 1260 atgtccagat tgatttgata gtgtaacctg ttaca 1295 // ID Gypsy119-I_DR repbase; DNA; ZEB; 4466 BP. XX AC chr19; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy119-I_DR; KW Gypsy119-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4466 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1274-1274 (2008). XX DR Genome; chr19; Positions 42973770 42978235. XX CC Positions [2080-2313] - Integrase core CC 'CCAGC' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 3523..4467 FT /product="Gypsy119-I_DR_3p" FT /translation="MGLYHFKVLPFGLHGAPATFQRLMDQVLRGYEEWSAA FT YLDDVVIYSSSWEDYLQHLKQTLDKIQAAGLTLNVSKCEWVKAQTNYLGYR FT LGNGELRPQVDKVEAVQHSPRPTTRKEVRSFLGLVGWYRRFLPDFSTTAAP FT LTNLLCKQAKNPIQWTQDCEKAFTTLKRQLCSEPVLKSPDFTQPFTVQVDA FT SAVGVGAVLMQEDAGQERPVAYISRKLLPRETRYSTVEKEGLAIKWALESL FT RYYLLGREFTLETDHRALTWIHSMRDHNARVTRWYLSLQPYQFKVVHRPGF FT QNRTADYLSRYLNGSRLGEGKGG" FT CDS join(19..2313,2317..3399) FT /product="Gypsy119-I_DR_1p" FT /translation="MPTRKTSTRGMKPKAGLLSQTGALSKEDGAWASAEST FT ETELEEELPPVRPKSTSPQVTKPVSSPEAKTSSTGGGMYALMRDFLDSQAE FT REQRYLTELQSFKESILQAVRPAEGSSETESPRLGLPTPTAQRPPSFQRDR FT DSPTDLRYGSQPKMRTEPKLPVFQQGEDKENFLRRFERLAKTWRWPKEEWS FT FRLVPLLTGQALEAYLAMDEDQAEVYEDLREALLAKFNISPETYRQRFRAM FT STPAGESPTKSYHRLRTLYRRWVKPDSHSKEEIGETIVLEQLLRVLPHEVQ FT MWVKEHEPTTGLEAAKLAQQYINAHRGIQRSQPLKGTVKNFVGNRTEPVPA FT QNTDTGKGLVCFHCQQPGHKALVCPVRKGKLTGFCYVPREGDSDNITVHKV FT QYVKASVNGHNVNAMLDTGSSMSLIKRSHVTHVPYSTMVNVQCVHGDVKSY FT PQIEVNVGVQDQLYLLNVAVVDDLPADMILGRDIPVLSELLNMNCDTKLPI FT NVALSCPVVTRAQTKTGPQPLPDFHDSLLQGGTKGLKKTRRQKRLEKGLGT FT PAPETQTEGLETHSWEIPKDIASLQRADKTLTGLFVKANEDKMDLCGEKYV FT IINDVLYMQTSESTRLVVPTCCRHVVLSVAHNIPWAGHLAFQKTYARISSR FT FIWPSMYTDVQSYCTTCPTCQKTSTVRHQGKAPLLPLPVISVPFQRIAMDI FT VGPLERSTSGHQYILVISDYATRYPEAFPLRSITTAKVINALIQLFSRVGI FT PNEILTDQGTNFTSLMKMMHRQLGITAIKTTPYHPQTDGLVERFNQTLKNM FT LRKFVSETGKDWDKWLPFLMFAYREVPQASTGFSPFELLYGWRVQGPLDLL FT RKNWETTSSDEVTEKAKGIVQYVLEMRDRLETYREKAQENLKKAQKTQKLW FT YDQNARTRELLPGQKVLLLLPTTTNQLLVKWQGPYTVLRRLGPVTYEVHHP FT DKGKEKQTYHINLLKEWKEPPNKETSLLIRKVEEVEEEDLLEQRGAEPAQT FT TGVGLNHLEESKQAELHHLMNQYSGLFRQRPGRTNLVEHRIQLVNQIPSRQ FT QPYRDPESLVKPLKTEIETMIELGVIEPSTSEWSSPVVIVPKKDGTLRICI FT DFRKLNAQSRFDAYPMPR" XX SQ Sequence 4466 BP; 1300 A; 1011 C; 1148 G; 1007 T; 0 other; tttggtgtca gaagtgagat gccaactcgc aagacgtcta cgagaggcat gaagcctaaa 60 gcggggcttc tttcacagac tggagccctt agcaaggagg atggggcatg ggcgtcagct 120 gagtctacgg agacggaatt ggaggaggaa cttcctcccg ttcgacccaa gtccacatca 180 ccgcaagtca caaaacctgt ttcttctcct gaagcaaaga cttcatctac tggaggtggg 240 atgtatgcat taatgaggga ctttcttgac tcacaagcag agcgggagca gcgttaccta 300 acggagctac agagctttaa agagtcaatt ctgcaagcgg tccggccagc tgaggggtcg 360 agtgaaacag agagtccacg attggggttg ccaacgccaa cagcacaacg acccccatct 420 ttccagagag atcgagactc gcctactgac ctgaggtatg gcagccagcc aaaaatgcga 480 acagagccga agttaccagt cttccagcaa ggggaggaca aagagaactt cctgcgtcgg 540 ttcgaacgac tagccaaaac ctggaggtgg ccaaaggagg agtggagttt ccgcttggtg 600 ccactgttga ctgggcaggc acttgaggcg tacttagcca tggacgagga ccaagctgag 660 gtatatgagg acctgaggga ggcgttgtta gcaaagttca acatctcacc tgaaacctac 720 cgtcagcggt tccgggctat gtctactcca gcaggagagt cgcctactaa aagttatcat 780 cggctgcgaa ccctttaccg gcgttgggtg aaacctgaca gtcattccaa agaggaaatt 840 ggtgagacga ttgtcttgga gcagttactc cgggtcctac cacatgaggt gcagatgtgg 900 gtaaaggaac acgagccaac aacaggactt gaggctgcaa aactggcaca gcaatatatt 960 aatgctcacc gaggcattca gcgttctcaa ccacttaaag gtactgttaa aaactttgtt 1020 ggtaatcgca ctgaacctgt tccagcacag aacactgaca ctgggaaggg gttggtgtgc 1080 tttcactgcc agcagccagg gcacaaagct cttgtgtgtc ctgtgcgtaa aggtaaactg 1140 actggatttt gttatgttcc aagggaaggg gacagtgaca atattacagt tcacaaagtg 1200 cagtatgtga aagcttctgt gaatggacat aatgtaaatg ccatgttgga cactggtagc 1260 tcaatgtcat tgataaagcg gagtcatgtc acacatgtgc cttattcaac tatggtgaat 1320 gtacaatgcg ttcacggcga tgtaaaatcg tatccacaaa ttgaagtgaa tgtgggggta 1380 caagaccaat tgtatttact aaatgttgca gttgtcgatg acctacctgc tgatatgatt 1440 ttgggaagag atataccagt gttaagtgaa ctgttaaaca tgaactgtga tacaaaattg 1500 cctataaatg tcgcactgtc atgtccggtg gtaacacggg ctcaaacaaa aacaggtccg 1560 cagccattgc ctgacttcca tgatagtctg ttgcaaggcg ggaccaaagg cctcaaaaaa 1620 acacgccgtc aaaagcggct ggaaaaaggg cttggtactc cagccccaga aacacaaact 1680 gagggtctgg aaactcatag ttgggaaatc ccaaaggaca ttgctagcct gcagagggct 1740 gataaaacat tgactgggtt atttgttaag gccaatgagg acaaaatgga cttatgtggg 1800 gagaagtatg ttataattaa tgatgttttg tacatgcaaa cctctgaatc cacacgtttg 1860 gttgtcccta catgttgtcg tcatgtggtt ctgtctgtag cacataacat accgtgggca 1920 gggcaccttg cattccaaaa gacatatgca cgcattagtt cacgtttcat atggccttcc 1980 atgtatactg atgtacagag ttattgcacc acatgcccca cctgtcaaaa gacaagcact 2040 gtgcgtcacc agggtaaggc cccattactt cctttgcctg taatctctgt gccgtttcag 2100 cgcatcgcga tggatatcgt gggcccactg gagaggagta catctggaca tcagtacatt 2160 ctggtcatca gtgactatgc caccaggtac cctgaggctt tccctctgcg gtccatcacc 2220 accgccaagg taatcaatgc cctaatacag ctgttctcta gagtaggaat acccaatgaa 2280 atacttacag accaaggaac caactttact tcctgactga tgaagatgat gcatcgtcaa 2340 ctgggtatca ctgcaatcaa aactacccca taccacccac agacagacgg attagtagag 2400 cggttcaatc aaacactgaa gaacatgctt cgcaagtttg tgtcagagac tgggaaggac 2460 tgggataaat ggctaccgtt tctgatgttt gcctacaggg aggtgcctca agcttcgacg 2520 ggattttctc catttgagct cctgtatgga tggcgggttc aaggaccact ggatctcctt 2580 cgaaagaact gggagaccac ttcgtctgat gaggtgacgg agaaggcaaa aggtattgtg 2640 caatacgtcc ttgagatgag agaccgtctg gagacctaca gagagaaggc ccaagagaac 2700 ctcaagaagg cccagaagac acagaaactg tggtacgacc agaatgcccg aactcgtgag 2760 ctgctgccag gacagaaggt tttgttactg cttcccacaa cgacgaatca actactggtg 2820 aagtggcaag ggccctatac agtacttcga agactgggac cagtaacata tgaagttcat 2880 caccctgata aaggtaaaga aaagcaaacg tatcacatta atctattgaa ggagtggaag 2940 gagccaccta acaaggagac atcgttgctg atcaggaaag tggaggaggt agaggaggag 3000 gatctactgg agcagagagg tgcagagccg gcacaaacca caggagtggg gctaaatcat 3060 ctggaggaat ccaagcaagc tgagctacat cacttgatga accagtattc tgggttattt 3120 cgtcaaaggc ctggacgaac gaacctggtg gagcatcgaa tccaacttgt taatcaaatc 3180 ccctcaagac agcaaccata cagggatcct gaaagtctgg taaaaccact caagacagaa 3240 attgagacga tgatcgagct gggagtgatt gagccctcaa caagtgaatg gagtagccca 3300 gtcgtcatcg ttccaaaaaa ggatggtacc ctgcggatat gtatcgactt ccggaaactg 3360 aatgcacagt caaggtttga tgcgtatcca atgccacgtt aagatgattt gcttgagagg 3420 attggccaag caaaatatct taccactctg gacttgtgta aaggctactg gcaagtgccc 3480 ttggaagaaa agtccagaca gtatacagca ttctgtacac ccatgggatt ataccacttt 3540 aaggttctac cttttgggct tcatggggca ccagccactt tccagcggct aatggatcaa 3600 gtactaaggg gatatgaaga gtggtctgca gcctacctcg atgatgtagt catctacagc 3660 tcgtcctggg aagattatct gcaacaccta aaacagaccc tagacaaaat tcaagcagca 3720 ggtttgaccc tcaatgtcag caagtgtgag tgggtgaagg cacaaaccaa ttacctggga 3780 tatcgcctag gaaacggaga gttgcgtcct caagtggata aggtggaagc agtacaacac 3840 agtccaaggc caacaacacg gaaggaagtt cggtcatttc ttggtttggt gggttggtat 3900 cggaggttcc tccctgactt ttcaaccact gcagccccct taaccaactt gctgtgcaaa 3960 caagctaaga atccaattca gtggacacaa gattgtgaaa aggcattcac aactttaaag 4020 agacaattat gttcagaacc tgtactcaag agtccagact ttacccagcc atttaccgtc 4080 caagtggatg cttctgcagt aggtgtcggg gcagtcctca tgcaggaaga tgcaggacaa 4140 gaaagaccgg tggcatatat tagcaggaag ctacttccac gagaaacccg gtattccacc 4200 gtggagaaag aaggactggc aataaagtgg gcattggaaa gtctccggta ctacttgttg 4260 ggccgagaat ttacattaga gacagaccac agagcactaa cctggattca cagcatgaga 4320 gaccataatg ccagagtaac caggtggtat ctgtctctac agccatacca gttcaaggtt 4380 gtccatcgac caggattcca gaacaggaca gccgactacc tttcgaggta cctgaatggc 4440 agccggcttg gagaggggaa aggtgg 4466 // ID LOOPERN2_DR repbase; DNA; ZEB; 1375 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE LOOPERN2_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; LOOPERN2_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1375 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN2_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(4), 20-20 (2002). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed based on 20 copies CC present CC in sequenced DNA that represents ~1% of the genome. Approximately CC 2000 copies of LOOPERN2_DR are expected to populate the complete CC zebrafish genome. LOOPERN2_DR copies are ~6% divergent from the CC consensus sequence. CC This element is characterized by 14-bp terminal inverted repeats CC and CC putative TTAA targets site duplications (or, less likely, CC TTTAAA). CC Its classification is not very certain yet, although it is CC expected to be a member of the piggyBac/Looper superfamily. XX SQ Sequence 1375 BP; 366 A; 284 C; 331 G; 393 T; 1 other; agagcccata ttatgggttt ttgaaaattc ccctccatgt agtgtgtaac acagctctaa 60 gtgaagtgaa gtatccagct aaggcttaaa tctgtaagag tacagtgttt aaaactgttg 120 attcatctat aaaagagtcg actcatagtg cttcaaacga gtcgccttga taccgagtca 180 ttaggtgttt cgccatgacg tacgaacgaa accaagttat tcacgtgcac gcgcaaaccc 240 gggagatttc aaacctgagg ccccgccctc tgacgcagaa acccagacac acacacacac 300 acacacccac aaacacacgc acacccacaa acacacgcac acaaacatgc cggtcgattg 360 aagtcacact gcagatggat atattgagtc tctacccaaa gatgaaacct cagcattata 420 accaagcagt tggaaactac tggaaacttc tggaagctac atgctacaaa gaatacttca 480 tctgcgtttg ttaaaggaag gatcagtaaa gagtaactta ctgatggacg tcaggatggg 540 tttcttcctc catttctcaa gtgtaagtac gtgcgattaa agttgttgcc tcgttgactc 600 tagcttgcaa atgtatttag ttgtgatttg ttacttgtaa ccgcgtgtac tgtatcaggt 660 taactcgcta tattctctta tcgcgtgcaa agtcacgtta aaaacgcgac gcgtgctgct 720 ttgtttacgg agctccgtgg acgaggagta gtgtgtgtgt ctgtgtgtgc gtgtgcgcgc 780 gctgtcgtcy ggaggtgtgt gtgtttgtgt gtgtgtgtgt gcgcgcgcgc gttgtcgtcc 840 ggagcagggt taagtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg caatgtgtgt 900 gtgtgtctgt gaaaagagca gagtgagaca agctaggaga tctcctcaac tgtttttgga 960 gtttttgctc aataaaatag ttagtcgtct gaattttcaa gtccatcgtc tgtatttaca 1020 ttgacccact ggcagctaaa atccacgcct acactatcga gcgtgtatga actgtgatta 1080 ctttttatat tgctgattag ctgttgggca tttcactctc tgactgaagg cagtcgacca 1140 atcgcaacag actgtcatcg gtccaatcag cgcagattag cttcgcgcta aggaggggtt 1200 tgggaacaaa tgaatcactg gacgattcat acaggagtcg ctgggataat taggtaaaaa 1260 taaatgcaga ttataagacc atgaaagtgt tttttgacct tgcatgcata ttagactgtt 1320 gttggagacc cttacaacca agatatgacc ctatttcatg tataatatgg gctct 1375 // ID SAT-2_DR repbase; DNA; ZEB; 184 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Satellite-like DNA - consensus. XX KW Satellite; Simple Repeat; SAT-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-184 RA Jurka J.; RT "Satellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2024-2024 (2010). XX DR [1] (Consensus) XX SQ Sequence 184 BP; 52 A; 44 C; 38 G; 50 T; 0 other; tcactgattc aaatgatccg ttcagagtga gtctccagtt aacgactcac tgattcaaat 60 gatccggtca gagtgagtct ccagttaacg actcactgat tcaaatgatc cgttcagagt 120 gagtctccag ttaacgactc actgattcaa atgatccggt cagagtgagt ctccagttaa 180 cgac 184 // ID DIRS-1B-LTR_DR repbase; DNA; ZEB; 623 BP. XX AC . XX DT 21-JAN-2009 (Rel. 14.01, Created) DT 21-JAN-2009 (Rel. 14.01, Last updated, Version 3) XX DE A solo-LTR derived from DIRS retrotransposon - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR; MER6; DANA; SINE_DR2; DIRS-1-LTR_DR; DIRS-1B-LTR_DR; KW DIRS-1C-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-623 RA Jurka J.; RT "DIRS LTR subfamilies from zebrafish."; RL Repbase Reports 9(1), 2-2 (2009). XX DR [1] (Consensus) XX CC See comments in DIRS-1-LTR_DR. >96% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 623 BP; 154 A; 192 C; 129 G; 148 T; 0 other; ttaagtgaag tttattcata aactaatttc gagaggatca cgtgcttatg attgacacgg 60 ctggccccga gtcaagctaa tgtacgaacc accaatcaga ctactcctaa ccaagtataa 120 ataaccaaac accttacctt tagtcatctt cgtcttgaag aatcccccct tccacccctt 180 cgcctcctcc tttgtctaca gggcagcacg gcggcccagt ggctagcact gcggcctcac 240 agcaaaatgc ctccggttcg ggcatctacc caaacggtca gcattttcgt gtggagttca 300 tgttctcccc gtgcttgcgt gggttttccc cgggtcctcc ggtttcctcc cacactccca 360 aaaacatgac acttaagtaa attgactaaa ccaaattagc accaaattcg attcaattct 420 gtcagcaacg catcacctta gcaaccctca cgcagcagga gggagggggg ttctcgagat 480 ctacccgagc tcaaactccc ctctcgccct gcaacgggag ggagccccgg gctcgaggct 540 ctcatgagct cggggctctc tcccgggaca gcatgccaaa ctagcttatt accaatcatc 600 agctaagtgt gaactcttga aat 623 // ID EnSpm-4N2_DR repbase; DNA; ZEB; 5399 BP. XX AC . XX DT 07-APR-2009 (Rel. 14.04, Created) DT 07-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE EnSpm-4N2_DR is an nonautonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW Autonomous DNA transposon; EnSpm-4_DR; EnSpm-4N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5399 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 9(4), 860-860 (2009). XX DR [1] (Consensus) XX SQ Sequence 5399 BP; 1700 A; 871 C; 993 G; 1835 T; 0 other; cacagcaaaa tttccagtgt tgtttggagt tgtttttaca gtgtaaattt ttttgagtta 60 aaaagtgtta tttgtggtgt taatttacgt gacataaaca cctgcagtgt acccttgtgg 120 tcagagtaag ggttaatact gcgagttaac atagttaaca ctataaacaa gttgtttcca 180 tttccggtcg cttcacctta cttcctcttt gattaatggc ggatggcaca tggagagcat 240 tcaaacaggt aactgtcata ttacttattt aaccacaaat attcaataca tttttacatg 300 ctttatgttt ctcaatgtta aattaatatt tgtaaatatt ctaaccgaga tttcaagttc 360 taaccacttg tcaatttata cacgacagac atctggagcc acgagacaag ctgctttagc 420 gggaaaacaa aggtaacttt aaacatttta cagttaacag ttttaaggag ttcctatatt 480 tgaatcgggt tcagttcgag cctattatga gaaataggca gtaatgactc atgcaaccgt 540 ttgaattgtt taaactgttt atttgtttta aatatctgca aagcagcagc tgtcagtact 600 taacgttagc tgaaccgtaa cctagctttg tgtgtaatct aacgttatct ataatgttca 660 ctgttcagtg cgattgattt gtggcgcgag cagcgaggag tcgcgggctg cctgttcacg 720 ctttcatcgg tgcttttaaa cggtgaataa cgttatacga ttcaagtgta caggtacaga 780 cctgcagacc ataatacgtt aaaaacagac gcaaaataca ttcgctgtaa aagttagccg 840 tgcttgttga catgaaggga taacgttagc cagttcagaa atgattcttc tttaaccggt 900 tcttttcagt gaatcaatcg aaccagttca ctatattgaa ctgaatcatt tgaaattgtt 960 cgcttctccg gtaagcacca atccttaaaa tggcttactt tataacctgt ctgacactca 1020 gttccactca aaaataccgt atgttgagtt tcgacttatt cagtaactgt aacaatacac 1080 tgacggaact gctgtgaaaa gaactgataa tgatgagcgc gaatcgagta gtgacgtgtt 1140 ccctataagt ggaagctgag ctccactgac aacgaccgaa aaatgctaag ctcagaagag 1200 acgaacccaa gcagactgtt attgccattg tttacctagc aaatttcatt gttgactcta 1260 aggtagtagt cgaatagtaa ctttatttat ttaatattga agtgctgttc aatataatat 1320 tgcattgcat gaagagctgg catttttaga tagaactgta cgtactctta acttttgaag 1380 atctttcatt aatgtgtaat gtgaaaaaaa atgttttttt aatctgtaat gtgcatttat 1440 acaagtataa ttttcctata gcctacgtct tgctaacaat aatgtctagt tagaaattgg 1500 tatgtagaat ccttcataat aaaaaaacaa aaagaacgta aatatagcta tttttagaaa 1560 aagaaaaaca taatgtgctt ttttatttat atgtactatt cataataaca tcggcttctg 1620 tatcacctta acatgaaaat gaataaatta acttgattta attttactaa tgtcaagttt 1680 gtgagataga ttcttgttaa taataaagtg atatattgtg tatttttcaa cagcttccat 1740 catgttggtg aaagttctat attaaggagt gaaaaaatat gtaaaattgc ttactgattt 1800 tacatttcaa tgttttcgca gtgaagataa gtttaagatt gtgctggtca gagttgtgct 1860 cactaaagtc ataaccattt cctgagtcta tttttttttt tttttttcag tgaaaagcaa 1920 atttggatta tcaacaactg cagatcctca agttttcaat gacacggacc ctaatgtaga 1980 agaggagatc ttccacgagt taatggacag cagtccacat ttatgcttga ctgttagatg 2040 tttggaagaa aatatttctg ctggcctgcc tgagggtaat tattcaataa atatttggac 2100 agttttaaaa tgcctctgtt ttacagtagt ataagtcaaa acagttagaa ttaagtaaaa 2160 agacttaaaa gagaactgag aaattcatag aagagcttac tgaacagtgg agtgtgtata 2220 gaaataaaga gtttttatat gtttcttgac cattgccagg tgttcgttta gcttaagttc 2280 atgtgatgcc cagtgatgaa aggttgatca aaatagtctg tcaaaaccac agtgtagaaa 2340 aacattgagg atgatggtgt atagtaaaag ctattagtac agcattgttt taaattaagt 2400 tgaaatataa gataaattta tttaaggtaa actaaagaaa atattacgta aaggttaata 2460 ataatgtttt ccatttctcc atcagtgatt tcacccacaa gatcatcaac agccagcaca 2520 tgtactgata cagtttcact ctcttcaacg gatcatgaag acccagagcg aacaatgcag 2580 aatcaattga caaggactaa cagctgcttg atagtagatg gtgaaagggc aaaacaggta 2640 tatgcatatg ttgtttgtta tggttaattc aatcatatac taattcagtt ttattatttt 2700 acaagtataa ccatttttct ggtcacttta tcagattgtt caagatgcat tgggaaagca 2760 atctggtggt gatgaagtac ttgacgaata caaggtcaca aaaactttag ggccacagca 2820 ctagacggca gctggttaat attgtcgtca gtcacatgaa agagattcat gggtaagatt 2880 tcattaaata tatgcaagaa aagcatggat ttttttttta aaccaaaaaa aaaaaataaa 2940 atgtaaaaag ttaagtactg tccttaatca aatattttac tgattgttag gagaatcccc 3000 acgtgcaagc aacgggagac ttatgctttg ggcattattt ctctttttcc cagcctgaga 3060 gacccgtttt ccgcaaaagg ctatgtaaga aattgtctta ttgtacttaa cacaaacttg 3120 ttcatccgtt ttaattattt aaaggtttgt ggcttttttt atttagtgtt ttatgtcgca 3180 gaggtgaatg ttgtattata tattaagatt atacacatcc caaacctttt ttcttgtctc 3240 ttacaggaac atttctacgg tcctggagag ccagtgttct gcagagttta gctccttaaa 3300 actgtaatta ggaagaacac aagacgtgtt ctcttggagg tatctgaagt tggaggacca 3360 actagcaaac gaaggataga aactgggcaa caacttgaag gggatgcatt caggcaggca 3420 atttcttttc ttgtccacac agttgaagca agtgaagttt tgctaaaaat ggagctaaac 3480 tctaagcata gacactggcc catgatccta aggaatgagt tgacatcctc gacacccctg 3540 gtttaaaggt ttgtgttttt tatttttatt tgtatcatac agtcagtctt tttaatgcta 3600 actggcattt ttaaaatatt tctatctata taacaggaac atttttacgt ttatccagaa 3660 aagggaaccg tttgtaatct aaagtgtgac gagtcagctt ctaatctaat atgcttcttt 3720 cttagattat tcaagacttt agtctcctgt tcaatgctga aacatccaac aagttgcttg 3780 agaggtggga gaccgcattt aagcacagga tcatcaatga ggcgaaatct ctgacttcaa 3840 ctgccagact tcattgtctt attaattcag cagaaggaca agaatctgaa aatggtacga 3900 tttctaagga ttttgtagtt tgaaatttgc tttggtttct tttgtatttg tgagctataa 3960 gtgtgaatgt gggcttgctt cattttaata ttaatacgta taactttatc atcattcatt 4020 cagactggga taatgatatg tcatctgtcc tactccttct gcatctccta cctccttctt 4080 cagagagaaa gaagacttaa gatcagttca acagaggcct ttgaaagact tgtgcatttc 4140 cataaggtaa aactcacaat attttcgaaa gtcttttcat tgttattatt aaggttgtta 4200 agaccattat gcgatgctgt aaaatgtacc ttttgaaact tttttttaaa agccaataca 4260 attcttcagt tgaatgctgt tgaagaatgt atgttacatc ttggatgaaa tgagggttta 4320 tcaaaaatat atataaacat atttttgtgg taattctttt ccctgtcaca ctacacaatg 4380 aatgacatga acacttattt tatttttgca ctttgggtaa agtttttttt gtggttttgt 4440 tttatttttt gtgttgtttt cattaacaaa attgtaagtt tcctaattgg tttcctgcat 4500 ttaatgacat atttttggct gtttttttgc agtcttgcac cagcattgaa gaccatctca 4560 gtgagaggaa gggccaccag ccatatcgcc taagatcagg aaggatcaag gggaggattg 4620 accatttcta tgtggttatg gacaaacatc tcatctcttg tgcagcaacc aggtccttga 4680 gtgctgttga tgaactcttt aaagtccact atgtgttcaa cttgacgtac gaggaagccc 4740 ttgtctacat ctttaccttt ctgcagacta cagtctacaa catcgatgtt ggacctacca 4800 gtgagtcacc cagggtaaag aaactgcgtg caaagctttt acactaaatg tttcttttac 4860 aataatgtgt tggtgttgtc aaagcttgca agcaagcagc acctctttaa gactctcaca 4920 gtttttgata ctgaaaatgc tgtacatttt atgttggttt aactgtgcgt ttaagatgca 4980 ttgttaaaaa taaaagttta aactggtaaa ctgttctgat ttttttttta actgatcaat 5040 gcaaactgtc ctgtaatgca gtaaacaatc tatataacta aaaatacatt taaatatcaa 5100 cctaggtatg tatgagcact atattattag tgttaagatg aacactctta gtgttaagct 5160 attaacactg cgagtgtagt tttaacatat ttactttatg caaataggga aagtgagtgg 5220 aaaacagaga gttaaagtca acactagcag tgcttttttt taacactatg caagtgtaaa 5280 ataatttact ctttttcaag tgtaaattta actcgattta gtgtggacct atataaacac 5340 taaagaagtg ttaattttca cactttgaga gtgaaattaa cacttcaaaa tttgctgtg 5399 // ID Copia3-I_DR repbase; DNA; ZEB; 4164 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Copia; LTR Retrotransposon; Transposable Element; Copia3_DR; KW Copia3-LTR_DR; Copia3-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4164 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1588-1588 (2008). XX DR Genome; chr16; Positions 8436039 8440202. XX CC Positions [1473-1967] - Integrase core CC 'TCACA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 873..3509 FT /product="Copia3-I_DR_1p" FT /translation="MVDCGATSHILTEKNNFTRFDKSFDPKSHYMELADGA FT RMNNVALMRGDAEVVLLDVEGKCVRITLKKALFIPSYPQSIISVQAATTDG FT AKVIFQEGQNELISKDGVVFRIEEHERLYYLKTVNNNKQCVDVSVTDMMSM FT DKVSLTCDVKTWHEIMGHCNVSDVLKLPEVVEGMKITGNTKLDCNVCTEGK FT FINNRNRKTDTKAGEALELVHTDLAGPIEPTSQDGYKYAVSFTDDFSGAVS FT VYFLKNKSDTILATEKFLADSAPYGRVKCIRSDNGTEYTSDAFQSLLRKKG FT IRHDTSSPYSPHQNGTAERQWRTIFEMGRCLLIEKGLPKVLWPYAVQTATH FT IRNRCYNDRFKNTPYFMLTGRKPDLSKMWVFGSECYAYKHNHKKLDPRCEK FT GIFVGYSKNSPAYLVYNPQTEKVSKHRLVKFIRKSSAEQQTQTDEEYLEIQ FT YRDRTQDSGNSGDAPKSEESTENRVLDENMESIDTDNTEVSEHDGTTDKVD FT TEDNKDSTVATVKASQKHYHSNREKRAPKYLQDYLTYSKDDETNVSVDYCY FT RAVCGVPQTYKEALMSPEAPGWEHAMKEEMDSLKENDTFELTTLPEGRKTV FT RGRWVYALKENAETGKIFKARYVAKGYSQTEGIDYHETFAPTANLTSVRAL FT MQIAAQNNFSVHQMDVKTAYLHAPIEEDIYLEQPEGFEETSDKGDKLVCKF FT KKSLYGLKQSGRNWYKLLNDHLEQNNFERNQSDYCVYRKQIENETIIVIIW FT VDDLIIAASSDDQLNSFKEIMKSKFNMKDLGKISYFLGIQFEQKEGTIKMN FT QKRYILKMLERFGMSNCKPKATPSELKVECNINEEENNNEIENPKEYRELV FT GSLIYAMTCTRPDISWIVSKLSQTLA" XX SQ Sequence 4164 BP; 1516 A; 746 C; 929 G; 973 T; 0 other; gggtatgggc ccaggcgcgc cagtgctaca ggaggaagat ggcagaggct agttttcgac 60 ggagacgaaa acaactacga actatgggaa gtaaaatttc ttggacattt gagaataatg 120 ggtttaaaag aaaccatact gtctaccggg gatatagacg agaagacaaa cgaggagtgc 180 tacgctaaat taattcaatt tctcgatgat aaaagtttgt cactggtgat gcgggaggcg 240 actgatgatg gcaggaaagc tctgcaaata cttcgaaatc actacgctag tcaaggtaaa 300 cctagaatta tcaccctata cactgaacta acttcgttaa agaaggaatc tgacgagacg 360 ttaacagact acattatccg agctgagaaa gcggtgactt cactaagaaa cgcaaaagaa 420 gtaataagtg acggtctgat aatagctatg atcctaaagg gactaccaga ttcttacaaa 480 cctttcgcta ttcatactac acagagtagt gaagagttga cttttatcca gttcaaaagc 540 aaactaagaa gctatgaaga gacagaaaaa tttgatgata aacccaaatc tgacaatgtg 600 atgaaagtac acatagcatc cgtgacctgc tacggatgtg gaaatcgcgg tcatatagcg 660 cgcgattgcc gccaaaaagg cgtaccaaag tggtgcaact accatagaag ttcaacacac 720 agtgatgaga catgccgccg gagaaaagat ggtcacaaag atgaagcaaa acaggcaaag 780 cagacaaaca agaggagcat gagaaagaac aaacatttgt gttcaaagtc agtcaaacat 840 ttcttccaga caacattaca caaaatggac taatggtaga ctgtggagca acatcccaca 900 tcttaacaga gaaaaacaat ttcacgagat ttgacaagag cttcgaccca aaatcacact 960 atatggagct agcggatgga gccaggatga acaatgttgc attgatgcgg ggcgatgcag 1020 aggtggtact actggacgta gaagggaaat gcgtcaggat cactctaaag aaggccttgt 1080 tcataccatc atacccacag agcatcatct ccgttcaagc tgctacaaca gatggggcca 1140 aggtgatctt ccaggaagga cagaatgaac tgataagcaa agatggagtt gtgttccgca 1200 tagaagagca cgagagacta tactatctga aaacggtaaa taacaacaaa caatgtgttg 1260 atgtatcagt tactgatatg atgtccatgg acaaagtgag tctaacttgt gatgttaaaa 1320 catggcatga gatcatggga cattgtaatg ttagtgatgt cttgaaacta cctgaagtgg 1380 ttgagggtat gaaaatcaca ggaaatacca aattagactg taatgtttgt actgagggga 1440 agtttatcaa caacagaaat aggaaaactg acacaaaagc tggtgaggcc ctagagttgg 1500 tacacactga cttagcaggt cccattgaac caacttctca agatggatat aagtatgcag 1560 tttcattcac agatgatttt tcaggggcag tgtctgttta cttccttaag aacaagagtg 1620 acactattct agcgacagag aagttccttg cagacagtgc accatacggt agagtcaaat 1680 gcataagatc agacaatggt acagaataca caagcgatgc atttcaatca ctcttacgaa 1740 aaaagggtat aaggcatgat acgtcttccc cttactctcc acatcaaaat ggtacagctg 1800 agagacaatg gcgaaccatc tttgaaatgg gaaggtgtct gctaatagag aagggactac 1860 caaaagttct gtggccttat gctgtccaaa ctgcaactca catccgaaac agatgttaca 1920 atgacagatt taagaatact ccatacttta tgttgacagg aagaaagcca gacctttcta 1980 aaatgtgggt ttttggatca gaatgctatg cgtacaaaca taatcataag aaattggatc 2040 ctagatgtga aaagggaata tttgtggggt acagtaaaaa tagcccagct tatctggtat 2100 acaatccaca gacagaaaag gtgtcaaaac atagattggt gaaattcatt aggaagagca 2160 gtgctgaaca acagacacaa accgatgaag aatacttaga aatacaatac agagatagga 2220 cacaggatag tgggaacagt ggtgatgccc caaaaagtga ggaaagcaca gagaacaggg 2280 tcctagatga gaacatggag agtattgata cagacaatac agaggtgtca gaacatgatg 2340 gaactacaga taaagtagac acagaagaca acaaagacag tactgttgca actgtcaagg 2400 ctagccaaaa acattaccac tcaaatagag aaaagagagc cccaaagtat ttacaagatt 2460 atctgacata ctccaaagat gacgaaacca atgtgagtgt tgattattgt tacagggcag 2520 tgtgtggagt cccccaaacc tataaagagg cccttatgtc acctgaggct cccggatggg 2580 agcatgctat gaaagaagag atggactcac ttaaagaaaa tgacacattt gaattgacaa 2640 ctctaccaga gggcagaaag acagtaaggg gaagatgggt atatgccctt aaagaaaatg 2700 cagaaacagg aaaaatcttc aaagctaggt atgttgcaaa gggatacagc caaactgaag 2760 gcatagatta ccatgagaca tttgctccta cagcaaatct cacctcagta cgggcattaa 2820 tgcagatagc tgctcagaac aatttctcag ttcatcagat ggatgtcaag acagcatact 2880 tacatgctcc tatagaggaa gacatatact tggaacaacc agagggtttt gaagaaacat 2940 cagacaaagg agataagtta gtatgcaaat ttaaaaaatc cctatatggc ctaaagcagt 3000 ctggaagaaa ctggtacaaa cttttaaatg atcatttgga acaaaacaac tttgagagaa 3060 atcagtctga ttactgtgtg tatagaaaac aaattgaaaa tgagacaatc atagtgatca 3120 tctgggtaga tgatttgata attgcagcaa gcagtgacga tcagctcaac agttttaaag 3180 aaataatgaa gtccaaattt aacatgaagg atttagggaa aatatcttat tttttgggga 3240 ttcagtttga acaaaaagaa ggaacaatta aaatgaatca gaaaaggtac atcctcaaaa 3300 tgcttgaaag gtttggaatg tcaaactgta aaccaaaagc taccccaagt gagttaaaag 3360 ttgagtgtaa tataaatgag gaagaaaaca acaacgaaat tgagaacccc aaagaatatc 3420 gtgaacttgt tggaagcttg atctatgcaa tgacctgtac cagaccagac attagctgga 3480 tagtcagtaa actatcacaa acccttgcgt agcctaaaac acatgactta gtagcagcta 3540 aacatgttct gagataccta aaaggtacag ctgactacga gttgtgttac aagaaaacag 3600 acaagacttt aattctaata gcattcagtg attctgactg ggcatcttct gtagaagata 3660 ggcgtagcac ttcaggatac tgtttcagcc tgacagaaca aggccctgct atttcatgga 3720 agtctaagaa acagccgaca gtggcactgt cgacctgcga agctgaatac gttggtctag 3780 caaacactac tcaagaaagc atgtacctaa ctctacatgt acctgctaaa tgacatagac 3840 atgtacctgc taaatgacat agacagtagt gtttacactt gcactacagt gtatggtgat 3900 aatcaagggg ccattgcact gagtagaaat ccagtgaata gatctaggtc caaacacatt 3960 gatgtgaaat atcactttat tcgtgatgct gtcagtaagg gaaaaataca tattgtgtat 4020 tgcccaacag aagacatggt ggcagatatt ttgacaaaat ctgtttcaaa gatcaaaatt 4080 ctaaagttca aaggatttct gtttgggaat taaatttttg tgatgacatt ctacatggct 4140 tgagatgatg agaacaagtg gggg 4164 // ID Kolobok-1N1_DR repbase; DNA; ZEB; 1423 BP. XX AC . XX DT 13-FEB-2009 (Rel. 14.02, Created) DT 13-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE putative nonautonomous Kolobok transposons from zebrafish - a DE consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-1N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1423 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 9(2), 548-548 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1423 BP; 378 A; 336 C; 301 G; 408 T; 0 other; agggcaccta tggtgaaaaa tctacttttc aagctgtttg gacagacatg tgtgtaggta 60 tagtgtatag accgtcatat tggggtgata taaacacacc cagtcctttt tttttttttt 120 ttcaatttaa caacataaaa acggtggacc aattggagcg gttttgagac cgaccgcaac 180 ttgacgtagg agtgcggtcc ccccgcccac caatattgat tgacaggcgc gcattaccat 240 atcctcagtt tgttgtttca cgtccgccat tttcagcgtg tgagtcaaag cgatgtcacc 300 aaaggaacac ccttgctcta tttttggatg taaggctcat tgggctcaac acaagagcct 360 tcatcgtctt cctgcgaatg agcaacagag attttacatt tagcggccat ttgcgctgaa 420 cacgcagtga atgcaacgta tcgagattcg ggccattaaa aacagctgat ccaccgagtc 480 tttcagcact ctctccgaaa acactcgatt gatctcggct ggcggatcaa cattcaccac 540 atgatcgctc tcatcggcgc cattgtttgt aactttaggt gggtttgcaa acctgtgtac 600 ttttcattgc gtcttcctta aacttcagcc gtttgcattt cccgcggtca cagaagctcc 660 ctgtcatctc aactaggtgc ggctggaaag tgtgagcgcg ttgcctcgcg tgcgcgtccc 720 tccattggaa ataacgaact tgccttagtt atagcctcag tcaaagttaa tccagtgtgc 780 gtggttatta gtctcggtgt acaagctcgg aactttaaac tagcacacag ttggcataac 840 tttgtttagt tgccgcagtt ttgttccgtg cagaaacacg gtgttgagaa gcctttatga 900 ttaattaacc gtgacgaatt aaagcgcggt taatagtgaa accggttaat cattacatcc 960 ctatacggat gacgcatgca tcgttacata gcggctgtaa aactatacaa agacacaaat 1020 gattttgtaa ctaccactct gacacattct ggcacattag gaatattgat tcctcaacag 1080 tagtctctgc ttggtctgtg tccgagtcgt acatgtacgg ctgaatgctg gtcctctcac 1140 tcatgttgct tctctctgtc tgcctgtctg ttgccataca caaagcgggg gagctcttgg 1200 ctccgccccc ttgttaagtt gggcgggaag tcgaaactaa ttttcatgtg aagcaacaca 1260 cccctaaaac aacgacctgt gtacacgccc ccaaaatgac actttttaac acattataat 1320 aaaaaaaatc tgaattgtgt tttgaactga acctaaactg gcacactcag aagaaccata 1380 atattaatat taaatcttaa aaaaggggta aactaggtgc cct 1423 // ID Gypsy157-LTR_DR repbase; DNA; ZEB; 683 BP. XX AC chr6; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy157-I_DR; KW Gypsy157-LTR_DR; Gypsy157_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-683 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1576-1576 (2008). XX DR Genome; chr6; Positions 41149138 41149820. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 683 BP; 231 A; 128 C; 155 G; 169 T; 0 other; tgtgacgatg tcataaataa aattataatc aaagtttgga gtaagtcagc aatctcagta 60 atacttttat tttggatgat gtttgttcct taaaagaagg cggggcttgg cgtgagagag 120 aagaaaagca agagagagaa ggaagtggtc actggcagtc atggaagacg cacgctgcgg 180 ttggttgttc atcatgactt tatcagacag cgttgtttaa acctgttgag atggcaagaa 240 gtgaatgaat gaagtgaaaa ccgaaagcat ctactgcatt tcacgtgtca aatgaagtga 300 gaaccgaata gcatctactg cattcaacgt gtcaactgaa gtgagaaccg aatagcatct 360 actgcattca acgcgtcaac tgaagtgaga aacgaatagc atctactgca tttcatgtgt 420 caactgaagt gagaaccgaa agcatgtact gcattcaacg tgtcaactga agtgagaacc 480 gaatagcatc tactgcattc atcgtgccaa atgaagtgaa aaccgaaagc atgtactgca 540 ttcaacgtgt caactgaagt gagaagcgaa tagcatctac tgcatttcag tgcatctact 600 gaactgagta ctgaacagca tttactgcat ttcaacgagt caacttaaat gaaaaccaca 660 aggcctctac tgcctgcaat aca 683 // ID CR1-34_DR repbase; DNA; ZEB; 3558 BP. XX AC . XX DT 13-FEB-2009 (Rel. 14.02, Created) DT 13-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-34_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3558 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 520-520 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 3..3398 FT /product="CR1-34_DR_1p" FT /translation="MKMAPVFLSACRRLVLVYGSFILLFFLLNGVSSLHKY FT DREFLLSIRNFVTIRGYDSNGRRPPDPTFQTRLPVNIGCIPDCLLNRKKRY FT RKRGKRGGVSVRLRRLTLPGASTSHLELSAVEGVGCELLSSRRHWPVSCEV FT RSSFLRYIIPDYSETMLHSRTVPLKFRNGGINALNLKALEYVPNPTHSSDT FT LSNPLNVESLRMALINARSISNKTFLLNDFISTRDLDILCITETWLSAGDS FT DCLLEVSPTDFNFLSTPRLSGRGGGVATIFKNIMKCRSLAVQDYSSFELQL FT VKIDGCVPIVCALIYRPPKYNKMFIQEFSEFLTTIVPRFEKILILGDFNLH FT ICCPKEPMVKEFLHIIDSFNLSQSVSKPTHKFGHTLDLVLSLGISITNIDI FT EEGYLSDHNPVVFNIQIPSPFLNSKPPDRRARCLNPSTASDFTKAYSLLPS FT LPFGLEINESMSLFNSTCNEIMELCAPLKILKPKAKQQPWLNSDTRSIRQL FT WRKAERIWKKTKLQVHYEILKDSMIKYQKAVKAAKAEYFANLIKKAPNNPR FT VLFKTINLALNTETTSGLAICQTTCEKFLQFFISKIEALRAGILRQDAFLD FT STIPPSLAHFLDEFKPITLHDLVDLVLHMKPSHCSLDIIPPHLFIQVLEVV FT GLDLLSIVNSSLLSGVVPSCFKHAVVQPLLKKPNLDSSDLKNYRPISKLSF FT QSKILERVVFQQLSSHLIGNDILDKFQSGFRTRHSTETALLKVSNDIFLTV FT DSGKCALLILLDLTAAFDTIHHNILIERLRLWAGIQGNVLNWFSSYLSGRT FT FSIELGNFSSSSARLECGVPQGSILGPALFSIYMLPLAAICKKHNVSYHLY FT ADDTQLYLPIKMGENSVLQSFFTCFNEIKEWLAGNFLHLNESKTEVVVFGP FT PSVCESTINKLGSIKPKVHSHVKNLGVTFDTELKFDKQINSVVKSGFYQLR FT NIAKLKPFLSFKELETVINVFIMSRLDYCNSLYSGVSQASIARLQLVQNAA FT ARLLTGTKKFSSISPVLLSLHWLPVQFRIQYKLLLLVFKSLNGLAPTYISE FT LLHWHTSLRPIRSENTFQLVVPKTRLKSRGDRAFSAAAPRLWNGLPLQIRS FT CSSLGAFKSHLKNHLLSLAFDCN*" XX SQ Sequence 3558 BP; 989 A; 717 C; 679 G; 1173 T; 0 other; ggatgaagat ggcgcctgtg tttctgtcgg catgccgtcg ccttgttctt gtctacggct 60 cttttatcct gcttttcttc cttttaaacg gcgtttcatc actgcataag tatgatcgcg 120 aatttctgct tagcatcagg aactttgtta cgatacgcgg ttatgattca aatggacgga 180 ggccaccgga tcctactttt caaacccgcc taccggtaaa catcggttgt atacctgatt 240 gcctgctaaa tcggaagaag cgatatagaa agcggggaaa gagaggtggt gtttcagtga 300 gacttagaag attaacttta cccggagctt cgacctcgca cctggagctg tcagcggtgg 360 agggcgtcgg ctgtgagttg ttgtcgagta ggcgccactg gccggtctcc tgtgaagtaa 420 ggtcctcttt tcttcgttat atcataccgg attattcgga gacaatgctt cattctagaa 480 ctgtgccctt gaaatttcgg aacggtggaa taaatgccct taatcttaaa gcactggagt 540 acgttcccaa tccgacacac tcttctgaca ctctctcaaa ccctttaaat gttgaaagtt 600 tgaggatggc gctaataaac gccagatcta tttcaaacaa gacattttta ttaaatgatt 660 ttatttccac acgagatctg gacattttgt gcataactga aacttggctt agtgctggtg 720 actctgactg cctattggaa gtttctccaa cagactttaa ctttttaagc actccccgtc 780 tctctggcag gggtggggga gtcgctacta tattcaaaaa tatcatgaaa tgtcgctcgc 840 tggccgttca ggactactcc agttttgagc tgcagctcgt taaaatagat ggctgtgttc 900 cgattgtttg tgctctcata tacagacctc caaaatacaa caaaatgttt attcaggagt 960 tttcagagtt tttaactacc attgtaccaa ggtttgagaa aatcttaata cttggtgatt 1020 ttaatttgca tatttgctgt cccaaagagc ctatggtgaa ggaattttta catattattg 1080 attcttttaa tctgtctcaa tctgtctcaa agcctacgca taaatttgga catactctag 1140 atttggtact atctttaggg atatctatca caaatatcga cattgaagag ggttatttgt 1200 ctgatcacaa tccagttgtt tttaatattc aaattcccag ccccttttta aatagcaaac 1260 cacctgaccg tcgtgctcgt tgcttgaatc cttcaacggc cagtgatttt acgaaggctt 1320 attcactact tccatctctg ccttttgggt tggaaataaa tgagtcaatg tcactattta 1380 attccacttg taatgaaatc atggagctgt gtgctccttt gaaaatttta aaacctaaag 1440 ccaaacaaca accgtggcta aatagtgata cccgtagcat tcgtcagcta tggagaaaag 1500 cggagagaat atggaagaaa actaaactgc aggtgcatta tgaaatttta aaagactcta 1560 tgattaaata tcagaaggca gttaaagccg caaaagcaga atactttgct aatctcataa 1620 aaaaggctcc taataaccct agagttcttt ttaaaacaat taacttagcc ttaaacactg 1680 agactacgtc tggtcttgct atttgccaga caacatgcga aaagttttta cagtttttta 1740 ttagtaaaat tgaagccctt agagctggga ttctgcgtca agacgctttc cttgacagta 1800 ccatcccacc ttcactagca cactttttgg atgaatttaa accaatcacg ttacatgatt 1860 tggtggactt ggtcctccat atgaagccct cccattgttc actggatatt atccctccac 1920 atcttttcat tcaggtattg gaggtggttg gattagacct tctatcaata gtgaattcta 1980 gtttgctgag tggagtcgta ccttcttgtt ttaagcatgc tgtagtacaa ccccttttaa 2040 aaaaacccaa tctggattca tcagatttaa aaaactatcg accaatttca aaattgtctt 2100 ttcagtctaa aatcttggag agagttgtct ttcaacagct ttctagtcac ttaattggaa 2160 atgacattct tgacaaattt caatctggtt ttaggacacg gcatagcact gagacagctc 2220 tcctaaaagt gtctaatgat atttttctga cggttgattc tggaaaatgc gcacttctaa 2280 ttttactaga cctgactgcg gcatttgaca caatacatca caatatcttg attgagcgtt 2340 taagattgtg ggctgggatt caaggaaatg ttttaaattg gtttagttct tatctttcag 2400 ggaggacctt ttctattgaa ttgggtaact tttcttcctc atctgcacgt ttggagtgtg 2460 gtgtccccca aggttcaatt ttgggcccag ctcttttctc tatctacatg cttcccttgg 2520 cagcaatttg taagaagcat aatgtaagct accacctcta tgctgatgat actcagcttt 2580 acttgcctat aaaaatggga gaaaattctg ttttacagtc ctttttcacc tgttttaacg 2640 agattaagga atggctggct ggtaatttct tacacttgaa tgaatcaaaa actgaagtag 2700 tagtgtttgg tcctccgagt gtctgtgaat ctacaattaa taaattaggt tcgatcaagc 2760 ctaaggtcca cagtcacgta aaaaatcttg gagttacttt tgacactgag ttaaagtttg 2820 acaagcaaat aaactcagtc gtcaaaagtg gtttctatca attaaggaac attgccaaat 2880 tgaagccttt tctttctttt aaagagctgg aaactgtcat taatgttttt attatgtcaa 2940 gactggacta ctgtaactct ttgtattctg gagtttcaca ggcatcaata gcacgcctgc 3000 agctggttca aaatgcagct gctaggcttt taactgggac aaaaaaattc tctagtatct 3060 caccagtttt attgtctctt cactggctcc cagtccaatt tagaattcaa tataaattat 3120 tgcttctagt ttttaaaagt ttgaatggtc tggcaccgac ttacatttct gagcttcttc 3180 actggcacac ttcattgagg ccaatcaggt ctgaaaacac atttcagcta gttgtcccca 3240 aaacgcggtt gaaatcaaga ggtgacaggg ctttttctgc tgcagccccc cgactgtgga 3300 atggtctgcc tttacagatc cggtcatgtt cttcacttgg tgcttttaag tctcatctaa 3360 aaaatcacct tctgtctctt gcctttgatt gtaattaatt ttaatctgtt ttatatattt 3420 tgcaaatttt ttatatatag gtatagattt tgtatgctgg tcgttttgta aagcactttg 3480 gtcaacatct gttgttttta aatgtgctct ataaataaag caaacaaaca aacaaacaaa 3540 caaacagaca tatcaaaa 3558 // ID Gypsy-166-LTR_DR repbase; DNA; ZEB; 681 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.12, Created) DT 17-NOV-2008 (Rel. 13.12, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy166_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW endogenous retrovirus; Interspersed repeat; Gypsy superfamily; KW Gypsy-166_DR; Gypsy-166-LTR_DR; Gypsy-166-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-681 RA Dib M.R. and Naveira H.F.; RT "Gypsy-166_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2165-2165 (2008). XX DR [1] (Consensus) XX CC Gypsy-166-LTR_DR is a long terminal repeat of the Gypsy-166_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-166-I_DR. XX SQ Sequence 681 BP; 106 A; 190 C; 127 G; 258 T; 0 other; tgtgaggagt gtgttagacc tacctgtttg tgattttgtt ttcttgcgtg tcgccttcat 60 gtttcgtcac ggagcacatg ctctttgttt tgacagttcg ccatgtgctc ttccgtcagc 120 gtttcctttt ccccacctcc ttgttattca cttattaaac aattattgtc acctgtagct 180 ccttgatctt ctcctctatt taattcccgc ttgcctcctt gtctttgtca gaccgtcttg 240 tgtgtttcgc gagttcactc gctctagacc cgcgtttatc ctgtcttgtc ttgtgtaacg 300 ttatctatct ggctccatag agattagttt cggttttcgt tcgccctctc tgctcatctc 360 tcttctgccc tgtcaggcta tcttcggtgt cattgcttgg ggcgctcgga ctccatcgcc 420 tgccgacatc cggccacctg cagctgctcg tcttccggct gctccgccca ctaagcctgg 480 atactcacca cctccacgag tccgccacct tgtggacatt caattgaact acagtccagt 540 actagttact gctgcctgaa ttaccctttt tattcagtgc tttttctttt atatttttga 600 tcctctgtct gaggcatttt tgatttttga agttaaaata aatcttcacc tgcatattga 660 atctgtggtt tcttcctgac a 681 // ID BEL34-I_DR repbase; DNA; ZEB; 6028 BP. XX AC chr6; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL34-I_DR; KW BEL34-LTR_DR; BEL34_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6028 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1411-1411 (2008). XX DR Genome; chr6; Positions 17783689 17777662. XX CC Positions [5052-5627] - Integrase core CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 77..1921 FT /product="BEL34-I_DR_1p" FT /translation="MDKSQPSKNVEEANQALEIESRSGRSHRSTRSSASAA FT ATTARAKAAACKVKASYAEKEAIMMREQAQLEEHQQRTLAETARRKAEVEA FT ELYVLKLQKEAVAASKEADVYAAAAIGEEELGEDLEKLSCISSRIERTREY FT VQTQSQLPNTPNLISNLQHEGENSQYVTRSYNEGSLSSHSPTKKEESKSEI FT IPIKKELTVQHSLQCSPSKVKIDPYSGDDEERHQIYPVRVQRESDPLSQEN FT AGCSTPAPHVSDFTTYLLRKEMVSSGLFQFDECPENYWAWKTSFQAVTSEL FT NLTSREEIDLLIRWLGPDSSNHAKRIKSVYVSNPTSGVRVIWRRLEECYGC FT PEVIEQAMLKRLDVFPRITNRDIHRLRDLGDLLLELQSAKESGRLPGLAYL FT DTARGVNPIVEKLPFGLQDKWITQSSRYKEDHKVYFPPFEFFVEFVCRQAR FT TRNDRSFAFASSSSLSSSKADGAARISSRASVFVKKTEIKTAQSNQEGTLE FT NNVEPNKVCTMHNKPHPLHKCRTFRSKHLDERKAHLKEKSICFKCCGSTKH FT VAKDCIALIRCRECNSDRHVAAMHPGPAPWSVTTWSIETPESKTEQIKEKT FT DEDTAEVTNKCRDLRRC" FT CDS 1773..5984 FT /product="BEL34-I_DR_2p" FT /translation="MWRPCILDLHHGLLLHGLLRHPSQRQNKSKKKQMRTQ FT LRSPTSAEICGDVDKPKSCSKICLVKVYPSNQCEKAKRIYAVLDEQSNRSL FT VKSQFFDLFNIDSSSSPYTLKTCSGIVSTAGRKITGFSVESLDGKTVVALP FT PLIECDSLPDDRSEIPTPEVAVHFPHLTVVSDKIPPLDPNAPILLLLGRDV FT LSVHKVREQRNGPHNTPYAQRLDLGWVIVGEVCLDGIHRQSSVNAYKTNVL FT LNGRTSYFPSCTKGINVVEKAQSPLLSYIPNPPCSLKSNESYSINLKEDVF FT FRSPDDDKATLSVEDKVFLEMMDKEVYLDEDRHWVAPLPFRSPRIQLPNNR FT EQAMQRLSSLQRTLSKKPKMKTHFFDFMQKEIEHKQAELAPPLHSEEECWY FT LPIFGVYHPHKPDKIRVVFDSSAQFRGVSLNDVLLSGPDLNNTLLGVLLRF FT RREKIAVMADIEQMFYCFKVKEPHRNYLRFLWDKDNCPEKEIIDYRMTVHV FT FGNSPSPAVAIYALRQAAEHGKVECGTDAKNFVLRNFYVDDGITSVPTERE FT AIDLLKRTQTMLSKSSLNLHKVASNSASVMEAFPSSERAKDLKDLDFSKDT FT IPLQHSLGISWNLKTDCFMFKASQECKPFTRRGILSTVNSLYDPLGFVCPI FT TMQGRALVRELSTIQQDWDTILPLEKESSWKAWISSLAELDQLQIPQAYVP FT TSVCRAPLCELCIFSDASTLAIAAVAYLRVIDSNGQPHVGFVMGRSKLAPF FT PAHTVPRLELCAAVLAVELMEFIKGEIDLELHDIHFYTDSRIVLGYIHNVT FT RRFYMYVANRVARIRKTTEPNQWHYICSEQNPADHATRFVPAANLPLTNWY FT SGPEFLRKCGPVRCNVDESYGLVQVEEDVEIRPQVKVLATSITEQSLDSSR FT FQRFSRWKSLVRAVTTLTRITKSFSQSSPNSPCRKWHLCTKTSDVETSQAK FT STIIKTVQREVFREEFESLTKNGKISQCSTLLRLDPFVDKEGLLRVGGRIQ FT CADISDPEKHPLILPSSHHVTDLLIQHYHDQVAHQGWHFTEGAIRRAGLWI FT VSGKRRISNIIHKCVLCKKLRGKMESQKMSALPPDRLSVDPPFTYTGLDVF FT GPFTVVTRKTRGHNIENKRWAVIFSCLSSRAVHLEVMESLSASSFICALRR FT FLAVRGPVKHFRSDRGTNFVGAVKELQIDSGDSELKGFLQNQSCTWTFNAP FT HSSHMGGVWERMIGIARRILEALLIKSATRLTHEVLTTLMAEVMAIMNSRP FT LTPISTDASMPQVLSPAMLLTQKASVAPTPPGNFELGHLHKSQWRQVQMLA FT DSFWKRWKLEYLSTLQSRRKWTEERDNIQEGDVVLLKDGETKRSEWPIGLV FT TKTVASSDGKVRKVMVKTAKQGAVREYLRPICDVALLLSSD" XX SQ Sequence 6028 BP; 1854 A; 1308 C; 1354 G; 1512 T; 0 other; gaatagtaaa aagtcgtcta cacggcggga aacgtcaagc aagcacaagc ttactgagtt 60 agactgtgat atcaagatgg acaagtcaca gcccagcaaa aatgttgaag aagctaacca 120 agcactagaa atagaatcac ggtcaggcag atcacatcgg tctactagat cttcagccag 180 tgcagctgcg accacagcaa gagctaaagc agcagcatgc aaagtaaaag cctcatatgc 240 agaaaaggaa gctatcatga tgagagaaca agctcaacta gaagagcatc aacaaagaac 300 attagcagaa acagcacgta gaaaagcaga ggttgaggct gaactgtatg tattgaaact 360 gcagaaagaa gctgtagcag catctaaaga ggcagatgtt tatgccgctg cagctatcgg 420 tgaagaggaa ctaggtgaag acctagaaaa actttcatgc atttccagca ggattgaacg 480 caccagagag tatgtgcaaa cacaatctca gttaccaaat actcccaatc tgatttctaa 540 tttgcaacat gagggtgaga attctcagta tgtaacacgg agttataatg aaggctcact 600 tagctcccat tcacccacaa agaaggagga atctaaatca gagatcattc ctataaaaaa 660 ggaactaaca gttcaacaca gccttcagtg ctctccttcg aaagtaaaga ttgatccata 720 tagtggagat gatgaggaga ggcatcaaat ctatccagta agagtacagc gtgaatccga 780 tccgctctca caagaaaatg ctgggtgctc tacaccagcg ccacatgtca gtgacttcac 840 tacatacctg ctgagaaaag aaatggtgag ttcaggtttg tttcagtttg acgaatgccc 900 tgaaaattac tgggcgtgga aaacatcttt tcaagctgta accagtgaac ttaatctaac 960 tagtagggag gagattgacc ttctcataag atggctcggt cctgattctt ctaatcatgc 1020 caaacgaatc aaatcagttt atgtcagtaa ccctacctca ggagttagag tgatatggcg 1080 tcgcctagaa gaatgctatg gttgtccaga agtcattgag caggcaatgt taaaaagact 1140 tgatgttttc cctcgcatca caaacagaga catccatcgt ctgagagacc tcggtgattt 1200 gctgcttgag ctgcagtctg caaaagagag tggtcgtttg cctggactag cttacctcga 1260 cacagcacgt ggagtaaacc ccattgtcga aaaactccca tttggtctcc aagacaaatg 1320 gattacgcag agttccagat acaaagaaga tcataaagta tattttcctc cttttgagtt 1380 ctttgttgag tttgtttgtc gacaagcaag aaccagaaac gatcgtagtt ttgcatttgc 1440 ttcatccagc agtctgagca gctcaaaagc cgatggagct gccaggataa gtagcagagc 1500 ttctgttttt gtcaaaaaaa cagaaataaa gactgcacaa tcaaaccagg aaggaacgct 1560 ggaaaataat gtagaaccaa acaaggtctg caccatgcac aataagcctc atccactcca 1620 taagtgtcga acattcagga gcaaacatct ggatgaacga aaagcacact tgaaggaaaa 1680 gtccatttgt tttaaatgtt gtggttccac caagcatgtt gcaaaagact gtatagcatt 1740 gatcaggtgc agggaatgca atagtgatcg acatgtggcg gccatgcatc ctggacctgc 1800 accatggtct gttactacat ggtctattga gacacccgag tcaaagacag aacaaatcaa 1860 agaaaaaaca gatgaggaca cagctgaggt caccaacaag tgcagagatt tgcggagatg 1920 ttgataagcc taaatcatgt tcaaagatct gtctcgtgaa agtttatcca tctaatcagt 1980 gtgaaaaggc caagcgcatt tatgctgtac tagatgagca gagtaataga tctttggtga 2040 agtcacagtt ctttgatctg ttcaacatcg atagtagttc ttccccgtac acattgaaga 2100 cgtgttcagg catagtgtct acggcgggga ggaaaataac tggattttct gttgaatcac 2160 ttgatggaaa aacagtagtt gcacttccac ccttgataga atgcgacagc ttaccagacg 2220 atagatctga aattcctacg cctgaggtag ctgtacattt tccccacctc actgtggtgt 2280 cagataaaat cccaccttta gatcctaacg ctcctatcct tctcctctta ggacgtgatg 2340 ttcttagtgt gcacaaagta cgtgaacaac ggaatggacc ccataacaca ccatatgcac 2400 agcgcttaga cctaggctgg gtaatagtgg gagaagtttg tcttgatggc attcatcgtc 2460 aatcaagtgt gaacgcttat aagaccaacg tacttctcaa tggacgaact tcatatttcc 2520 cttcatgcac aaagggcatc aacgtcgtag aaaaagctca gtcacctctc ttgtcataca 2580 ttcccaaccc accatgttct ctcaaatcta acgagagcta ttcaattaat ttaaaggagg 2640 atgtgttctt cagatctcca gatgacgata aagccactct atctgtggag gataaagtct 2700 ttctagaaat gatggataaa gaagtgtacc ttgatgagga tagacactgg gtggcgccac 2760 ttccattccg ctccccaagg attcagctcc caaataatag agagcaagct atgcagaggc 2820 taagctcttt acagcgcact ctttcaaaga aaccaaaaat gaaaacgcat ttctttgact 2880 tcatgcagaa ggagatcgag cataaacaag cagaactggc tccaccctta cattcggagg 2940 aggaatgttg gtatttgccg attttcggtg tttatcaccc ccacaaacct gacaaaataa 3000 gagtggtttt cgattcaagt gcccagttca ggggagtttc cctaaatgat gttttgctga 3060 gcggtcctga cctcaacaac actctcttgg gtgtactgtt gcgctttcgc agggagaaaa 3120 tagctgtgat ggcagacata gaacaaatgt tctattgctt caaagtaaaa gagccgcatc 3180 gcaactatct tcgcttcctt tgggacaaag acaactgccc agaaaaggag ataattgatt 3240 acagaatgac tgtacacgtc tttggcaata gtccgtcacc tgcagtcgca atatatgccc 3300 tgagacaagc agcagaacac ggcaaagtgg agtgcggaac agatgcgaag aactttgtat 3360 tgcgcaactt ctatgtggac gatggtatca cgtctgttcc aactgaaagg gaagccattg 3420 atctcttaaa acgcacacag acaatgctat ctaagtcaag cttgaacctg cataaagtgg 3480 cctccaacag tgcttcggtg atggaagcat ttccttctag tgaaagagcc aaagatctca 3540 aagacctaga tttcagtaaa gatacaatac ctcttcaaca cagccttggt atcagttgga 3600 acctcaagac agattgcttt atgtttaaag cttctcaaga atgcaaaccc ttcaccagaa 3660 gagggatcct ttccactgtc aatagtttat atgatcccct cgggttcgtc tgtccaatca 3720 ctatgcaagg cagagctcta gtgagggaac tttcaacaat ccaacaggat tgggacacca 3780 ttcttccatt ggaaaaggaa agttcatgga aggcctggat ttcatcatta gcagagcttg 3840 atcaattgca aatacctcaa gcttatgttc caacctcagt gtgcagagct ccactttgtg 3900 agctgtgcat tttctcggat gcttctactt tagctattgc tgcggtagca tatcttcgag 3960 tgattgactc aaatggacag cctcatgtgg ggtttgtgat gggaagatcc aagttagccc 4020 cctttccagc acatactgtt ccgcgcctcg agttatgcgc tgcagtgcta gctgtggagt 4080 tgatggaatt cataaaagga gaaattgacc tggagcttca tgacatccac ttttatacag 4140 atagccgcat agtgctcgga tacattcaca atgtcacacg cagattctat atgtatgtgg 4200 ctaacagagt tgcacgcata aggaaaacca cagaacctaa tcagtggcac tacatctgtt 4260 ccgaacagaa tccagcggac catgccacta ggttcgtgcc agcagccaac ttacctctaa 4320 ccaactggta ttcaggtcca gagttcctta ggaaatgtgg tcctgtgaga tgtaatgtgg 4380 atgaatccta tggacttgtt caagtagaag aggatgttga aattcgccct caagtgaaag 4440 ttctggcaac gagcattact gaacagtctc tagactcaag cagattccaa cgtttctcga 4500 gatggaagtc tctggtgaga gctgtcacaa ctcttactcg catcacaaaa tccttctcac 4560 aatcttcacc taacagtcct tgtcgaaagt ggcatttgtg tacaaagacc tcagatgttg 4620 agacctcaca agctaagtcg accataatta aaactgtaca gcgagaagtg ttccgagaag 4680 aatttgagag tctgaccaag aatggtaaaa tctcacaatg tagcacactc ctaagactcg 4740 acccctttgt ggacaaagaa ggtctgttaa gagtcggggg tcgcattcag tgcgctgaca 4800 tctcagatcc agagaaacat cccttgatac ttccatctag ccatcatgtg actgatcttt 4860 taattcagca ttatcatgat caagtggctc accaaggatg gcattttacg gagggcgcta 4920 tacgtcgtgc tggattgtgg attgtcagcg gcaaaagacg catttcaaat atcatccaca 4980 aatgtgtcct atgcaaaaag ctaagaggta aaatggaaag tcagaagatg tcggctctgc 5040 ccccagaccg gctttctgta gatccacctt tcacatacac aggtcttgat gtttttggac 5100 ctttcactgt ggtgacacgc aaaacaagag ggcacaacat tgagaacaag cgatgggctg 5160 tcatattcag ttgcttgagt agtagagcag ttcacttgga agttatggaa tctttatcag 5220 cgtcaagctt tatttgtgct ttgagacgtt ttttggctgt gagagggcca gtaaagcact 5280 ttcgatcaga cagagggacg aatttcgttg gagcagtgaa agaactccaa atcgacagtg 5340 gcgactctga gctaaaaggg ttcttgcaga atcaaagctg cacttggaca ttcaatgctc 5400 cacactcctc tcatatgggg ggtgtgtggg aaagaatgat cgggatagcc agacgtattc 5460 tggaagcgct cttaataaaa tctgctacaa gactcacaca tgaagtctta acaaccttaa 5520 tggcagaagt catggccatt atgaattccc gacctttaac cccaatttct acagatgcaa 5580 gtatgcctca agtgctctca ccagcaatgc tcttaactca gaaagcaagt gttgcaccaa 5640 ctccaccagg aaactttgag ttaggacact tgcacaaaag tcagtggcgt caagtccaga 5700 tgttagcaga ttcattctgg aagcgatgga agctggagta cttgtccacc ttgcaatcca 5760 gaagaaaatg gactgaggag agagataaca ttcaggaagg agacgttgtt ctgttaaagg 5820 atggggaaac taagcgcagt gagtggccaa ttggcctcgt aacaaaaacc gtagcctcat 5880 ctgatgggaa agtccgaaag gttatggtaa agactgctaa gcaaggggct gtcagggagt 5940 acttaaggcc tatatgtgac gtagctttac ttttatctag tgattaaaag catagtggta 6000 tatccaatac accaggcggg gagtgtac 6028 // ID Gypsy92-I_DR repbase; DNA; ZEB; 4223 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy92-I_DR; KW Gypsy92-LTR_DR; Gypsy92_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4223 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1495-1495 (2008). XX DR Genome; chr18; Positions 35961687 35957465. XX CC 'ATATA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 125..4060 FT /product="Gypsy92-I_DR_1p" FT /translation="MAACIGQMDAFESTVEDWATYVERVEQYCLANDIGNE FT RKVAVLLSVMGAKTYNLLRSLSAPVKPAEKTFKEIVEVLQTHLNPKPLTIA FT ERFRFHKRNQHKTESVSEYMAELRRLAEHCQFGDGLSDALRDRLVCGIHSE FT STQKRLLTERELTLARALEIAVSMETAAKDALELQRKAISECHINKFTSKK FT SENSLCFRCGKKSHSPTECWFKEKECRQCHRKGHIQKMCKTKATENKRKSF FT GKRHTKMHELNESDTENSDVASLSYIELHALKETDREIIWVSPIIEGVKLK FT MELDTGSALSIISYKDYKDHFEKFKLKHTSVILKTYTGEKISPMGKLKVRV FT KCEKQRRVLNLYVLKKGGVPLFGREWLRSIRLNWQSLKAIQVTPETTLQQD FT KLEEVLVKHAQVFQDGIGTLKHIKARLKVESDVQPKFHKARPVPYSVRPKV FT EAELKRLEDQGILSKVNWTDWATPIVPIVKKSGDVRICGDFKVTINPVLQA FT DQYPLPLIDDIFASLSGGQRFSKIDLAQAYLQMEMEESSKKYLTINTHKGL FT FRYNRLVFGVSSAPAVWQRAMDQVLQGVAGTQCYLDDIIVTGHDDKTHLAN FT LNAVLERLAEYGLRANRSKCEFFKDSIEYCGHRIDKAGLHKSPDKINAVLN FT APRPENVNQLRSFLGLVNYYHRFLPNLSTVIHPLNALLQKTSKWVWSKDCD FT KAFKMAKQLITSETVLTHYDPSLPLRVACDASPYGIGAVLSHKMTDGSERP FT IAFASRSLTVSERKYAQIDREGLSLVWGVKKFNQYLYGKHFTLITDHQPLV FT AIFSPNKCVPVMAAARLQRWALFLGGHDYSIEFKGTKHHGNADGLSRLPHE FT NADEGTDETNMFHIVQMEPLPVTSAQIQNETRRDPVLSKVFELTMKGWPAK FT GDPQLLDYSNRREQLSVTQGCVMWGTRVIVPHKLQARVLTALHEGHLGVVK FT MKSLARSYVWWPGLDRQIEDLAKACSGCQQIQRQPKAAPLHVWEFPTSPWQ FT RVHIDYAGPIFDKMLLVVVDAYSKWPEVFPIKNATSTMTVELLRTLFSRTG FT LPEQLVSDNGTQFTSEEFQSFVKSNGIRHTTSVPYHPATNGLAERFVQSLK FT QSLKAMGKEKVSLQEKIANFLLAYRNAAHATTGQSPATLFMGRSLRTRLDL FT LKPDLRRHVQAKQCQKNTKQSVLRTLHVGQNVLARDYRQTHQKWQPGKILS FT KTGPLTYTVQVGANMVWRRHIDQLLDATSQVDSSQNLTENVETSPCPDEYA FT LITPEFPSTDTGVTESETVQVSSAQSSSQNAMPQAKRYPERVHKPPDRLNL FT " XX SQ Sequence 4223 BP; 1331 A; 890 C; 987 G; 1015 T; 0 other; gaactggcga cgaggattgt aaagacccag ttgtgcagca agtttcatca tcaggcttaa 60 cgtgcagtgg gtaatgtaca gtatatacat atattaacga accgaaaggg aagacgaaac 120 agaaatggct gcgtgcatcg gacaaatgga tgcttttgaa agcacggtgg aagactgggc 180 aacatatgtg gagagagttg agcaatattg cttagccaat gacatcggga acgagagaaa 240 agttgccgtt ctcttgagtg taatgggggc aaaaacatac aacctcttac gcagtttatc 300 agcaccagtt aagccagccg agaaaacttt caaagagatt gtggaagtgc tgcaaacgca 360 tttaaatccg aaaccattga cgatcgcgga gcgctttcgc ttccataagc gaaatcaaca 420 taagacagag tctgtatcgg agtacatggc cgagctccgg agactcgcag agcactgcca 480 attcggtgat ggtttgtctg acgcgctcag ggataggctt gtttgcggca ttcacagcga 540 gagcacgcaa aagagactcc ttactgaaag agagctcacg ttggcgcgag cgcttgaaat 600 tgccgtttct atggagacag ccgcaaaaga tgcgctggaa ttacagagaa aggcgatctc 660 ggaatgtcac ataaacaagt tcacgtcaaa gaaaagtgaa aattcattgt gtttcagatg 720 tgggaaaaag tcacatagcc ctactgaatg ctggtttaaa gagaaggagt gcagacaatg 780 tcatcgcaag ggacacatac agaaaatgtg taaaacaaaa gcaactgaaa ataaaagaaa 840 gtcatttgga aaaagacaca ctaagatgca tgagctgaac gaatctgata cagaaaactc 900 tgatgtggca agcctctcat acattgaact acacgccctt aaagagacag atcgcgaaat 960 catctgggtg agtcctatca ttgagggagt aaaactgaaa atggagcttg atacaggatc 1020 ggcactgtca ataatatcgt acaaggatta caaagaccat tttgaaaagt tcaaactaaa 1080 gcacacatct gtaatactca agacatatac aggagagaaa atctcaccca tgggaaaact 1140 gaaggtgaga gtaaaatgtg aaaaacaaag gcgagtcttg aatctttatg tgctgaaaaa 1200 ggggggagta cccttatttg gacgagaatg gctgaggagc attcggctta actggcagtc 1260 cttaaaagct atacaagtta ccccggaaac aacacttcag caagacaaat tagaggaggt 1320 gctagtgaaa catgcacaag ttttccaaga tggaattgga actctaaaac acattaaagc 1380 aaggctcaag gttgaaagtg atgtgcaacc aaaatttcac aaagcacgcc ctgtaccata 1440 ttcagtacgc ccaaaagtgg aggcggagct aaagcgacta gaagaccaag gaatcctgtc 1500 taaggtaaac tggactgact gggctacacc tatagtccca attgtcaaga agagtggtga 1560 tgtaagaatt tgtggcgact tcaaagtcac gattaatcca gtcttacagg cagaccagta 1620 cccattgcct ctcattgatg acatttttgc gtcactgtca ggtgggcagc gtttttcaaa 1680 aattgatctt gcccaggcgt atttgcagat ggagatggaa gaatcgtcta aaaagtacct 1740 gacaataaat acgcataagg gtctctttcg ctacaacaga cttgtatttg gggtctcgtc 1800 agcaccagct gtgtggcaaa gagccatgga tcaagtcttg cagggtgtcg ctggcacaca 1860 gtgctattta gatgacatta tagtcacggg ccatgacgat aaaacccatc tggcaaactt 1920 gaatgctgtg ctggaaagac tggctgagta tggtctgaga gcaaatagga gcaaatgtga 1980 atttttcaaa gattcaattg aatactgtgg acaccggatt gacaaggcag gccttcacaa 2040 gtctccagac aaaatcaatg ccgtacttaa tgcaccaaga ccagagaacg tgaaccagct 2100 tcgttcgttt ctcggactgg tgaactatta tcacagattt ttaccaaatc tctccacagt 2160 tatacatcca ttaaatgctt tgcttcaaaa gacaagtaaa tgggtgtggt caaaggactg 2220 tgacaaggcg ttcaagatgg ctaagcaact gatcacgtca gagactgtgt taacacatta 2280 tgaccccagt ctgcccctgc gtgttgcatg tgacgcttca ccatacggca taggtgcagt 2340 tttgtctcac aaaatgactg atggatctga gcggccaatt gcatttgcgt ctagatcctt 2400 gactgtgtca gaacgcaaat atgcacagat tgatcgagaa ggcttaagct tagtatgggg 2460 agtgaagaag tttaatcaat atttgtatgg aaaacacttc actctgatca cagatcatca 2520 gccattagtg gccattttca gccccaacaa atgtgtccca gtaatggctg cagcacgcct 2580 acaacgatgg gcgttgtttt taggtggtca tgactattcg attgagttca agggcactaa 2640 gcaccatgga aacgcagacg gactgtcgcg ccttcctcat gagaatgcag atgaggggac 2700 agatgaaact aacatgtttc acattgttca aatggaacct ttaccagtca caagtgctca 2760 aattcaaaat gagactcgtc gagacccggt actgtccaag gtgtttgagc tgaccatgaa 2820 aggatggcca gctaaagggg atcctcaact gttggactac tcaaatcgtc gtgaacagct 2880 ttctgtaaca caaggatgtg tcatgtgggg tactcgagtg atagttccac ataagctaca 2940 agccagagta ctcactgcct tacatgaagg acatttaggt gttgtgaaaa tgaaaagcct 3000 ggccaggagc tatgtttggt ggccagggct agatcgccaa attgaagatc ttgctaaggc 3060 atgctctgga tgccagcaaa ttcagcgaca gccaaaggca gcacctttac atgtgtggga 3120 atttcctacc agcccctggc agcgagttca cattgactat gcaggaccca tttttgacaa 3180 aatgcttctg gtggtagtgg atgcatattc aaaatggcca gaggtatttc ccatcaaaaa 3240 tgcaacatcc acgatgactg tagaactgct ccgcacattg ttctctcgca ctggcttgcc 3300 agagcagtta gtcagtgata acggtacgca gtttacttcg gaggaattcc agtcatttgt 3360 caaaagcaat ggaataaggc acacaacatc tgttccttac catcctgcta cgaacggttt 3420 agcggaacgc tttgttcagt ccctaaaaca gtcactaaag gccatgggaa aggaaaaagt 3480 gtctctacaa gaaaagattg caaacttcct gctggcctac agaaatgctg cacatgcgac 3540 tacaggtcaa tctccagcca cactttttat gggcagaagt cttcgtacgc gcttagactt 3600 acttaagcct gatctacgcc gacatgtgca agccaaacag tgtcagaaaa acacaaagca 3660 atccgtactc aggacgttac atgttggaca aaatgtactg gctagagact acagacaaac 3720 acatcaaaag tggcaaccag gcaagatatt gtccaagact gggccactta cttacactgt 3780 acaagttggt gccaacatgg tctggcgcag acacattgac cagcttttgg atgcaacctc 3840 acaggttgat tcttctcaaa acttaacaga gaatgtggaa acttccccat gccctgatga 3900 gtatgcactc ataacaccag agtttccctc tacagacact ggtgtaaccg aatcagaaac 3960 tgttcaggtg tccagtgctc agtcctcatc tcaaaacgca atgcctcaag cgaaacgtta 4020 tcctgagcgt gttcacaaac cacctgacag actaaactta tagttattca tatatacata 4080 tgataactag ctgaaataac tgttacaact atttgcatag ttgaataagt tctgcattgt 4140 tgtgtttgga gtgcttgtat acttactggt aaatataatg taatgcagct tgaagggata 4200 catcaatgct ggaggggagg aat 4223 // ID DNA-8-32_DR repbase; DNA; ZEB; 1449 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 02-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-32_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1449 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 532-532 (2009). XX DR [1] (Consensus) XX SQ Sequence 1449 BP; 420 A; 271 C; 249 G; 507 T; 2 other; cagagatggg aagtaacgaa gtacaaatac tttgttactg tacttaagta gatttttctg 60 gtatcagtac tttactccac tacttatttt tttgacaact ttttactttt acttcttaca 120 tttttacaca aatatctgta ctttctactc cttacatttt ttaaaccacg gtcgttactt 180 tcgttttcat gtgtttggtg gcgcgatcta tattatttct tgtcattgcg caataatatg 240 aacacctttt tgatgcagtc agtctatttt tcctcattgc gcgcctgcgt gctgcactgc 300 acggcctttt caaggctatc gcgcaccgta cgtgggctag tttatgaaga ttactatgag 360 aaaggctagg atgactacgc agatgtgaca gagggagacg gcgaatttaa catgactgat 420 gctgccctgt ttacacggta ataagacacc ttttgctcgt tcggatcaca ggtaaacaag 480 agagacacat tccagttcaa aaacatgccg tttaaatgct taagcctttt gtccacttct 540 gatccatact gcacgggctt tctctggtct ttcaatctaa taccgtgaaa aatagtgctt 600 ttttgtaaag atttgtacat ttagtactaa aacaatcaaa tactttataa aaaaaatgta 660 tgcattgcgg tttatttcct ttatttctgg gttatgtata agttctgctc tttgaagaaa 720 taagcagctt tagcctaatg caaaaaaaag gttgcagtca aatttgataa gaattttttt 780 tagctaaaga aaaaaaatct atatattttg gacagaattc agtgtaaaat gtaagacatt 840 atttataaag atagacatgg cctacagaat tatttttaga gtgaatcaac atgaatggca 900 gctgatgttt ttgatctaac agccaaaaaa ctacatgctg aacgctgtga gtttgcgtca 960 ggatgatgat aaatgggccg catttttaag caatgttgtt aggttatatt gcagagtttg 1020 atgctgcttg gctactttca taaatttggc aacaaacatt ttacataaat ttattttata 1080 atttatcagt ggtcgccaca gcagaatgaa ccwccaatta ttccagcata tgttttacac 1140 agcggatacc cttccagctg caacccagta ctgggaaaca cccatacaca ctcacattca 1200 cacacacact tatacactac tgccarttta gtttacctat tattcaccta taccgcatgt 1260 ctttggactg tggttgaaac aggtactttt tacttttttt tacttttgta catttttgag 1320 gccgtacttc tttactttta cttaagtaca aaagtgtagt cagtacttca acttttacca 1380 gagtcgtttt aaacatgagt atctgtactt ctacttaagt aaaggatgtg tgtacttttg 1440 ccatctctg 1449 // ID Gypsy7-I_DR repbase; DNA; ZEB; 5822 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE Gypsy7-I_DR is an internal portion of the Gypsy7_DR LTR DE retrotransposon - a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy7-I_DR; Gypsy7-LTR_DR; Gypsy7_DR; KW endogenous retrovirus; gag; integrase; protease; KW reverse transcriptase. XX NM Gypsy7-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5822 RA Kapitonov V.V. and Jurka J.; RT "Gypsy7_DR LTR retrotransposon from zebrafish."; RL Repbase Reports 4(11), 291-291 (2004). XX DR [1] (Consensus) XX CC Gypsy7-I_DR is a consensus sequence of the internal portion of CC Gypsy7_DR LTR retrotransposons. Its long terminal repeat is CC deposited in Repbase as Gypsy7-LTR_DR. The internal portion CC encodes the 1658-aa Gypsy7_DRp polyprotein (pos. 476-5449) CC composed of gag, protease, reverse transcriptase, and CC integrase domains. Given that some Gypsy7_DR genomic copies CC are flanked by 100% identical LTRs, it is possible that CC Gypsy7_DR elements are still transpositionally active. The gag CC domain is similar to the Arc protein important for long-term CC spatial memory in vertebrates (mammals, birds). Presumably, CC Arc was derived some 300-400 million years ago from a CC Gypsy-encoded gag protein. XX FH Key Location/Qualifiers FT CDS 476..5449 FT /product="Gypsy7-I_DRp" FT /note="Polyprotein" FT /translation="MANVNPSPSTSVDIDPPDIATPVWPPVQQRQFSSPSN FT IPTYHSTPTQLDPYGRTQVHFHTTTPGVTSTVQPDPMQLCTSASTVESPPS FT TATQHALPGYLPTPGREIHQLTAHVQGNWDRVFDCLKRQDKAVKELTEKSS FT KSFSLHEAKLAKMESTHQQLLNTLTAQRKDDTETADQLTKAVKVMVTQEIQ FT RSESTLISEIRFMVEQAQLELQKDIQATKEHSDKNFERLSSDLNHCSTEIN FT AIKNQLDNLQTEISDVIPPIKQVSDPPSSAPVSVSTQSSSSVTAPMPFQTP FT VIKSDHLKLTFPTFGRPSDDADPLLYVTRCKDFLALHPLDDPDILATFRTV FT LYGTARDWWEVARSAISTWSEFETAFLSAFLSEDYEDELAERVRTRTQAEK FT ESIRDFAFTYRAMCKRWKPTLTESELVKMILKNIKPHLASQLRSRVHTVDE FT LVKLGLQLEKDYVQQLHYVEHVTQPSPQRIAPNRVEKPPVLCWRCKGLHPP FT GSCPHYSSSVQTTQSSSHPPPTGNKRYFQTQKHGGNPSNNAMSVTLPSKSL FT PKSTVTKSVVIPQQLIVPIYIGAWRGKAILDTGASYTLLHESLWKEIDPQA FT SLHPWTLGPLYLANGEAEVPLGWTNFEIILHDKVFPTQAAILTPKALAYSV FT VLGLDFIYSSGLQINVVDQTYSFKSNPNEEYPFQPGHASVPVGRSQHLNKN FT AQTQHSSKTLSLLSSIPPPLPFPVVSQLAPSSDDQALIEMAVAEAHLPLES FT KPQLLHLLQSNPKVCTLQLGRTTVLQHCIYTTHPVPVKQRPYRLTPGKQAI FT VEEQIEEMLKAGVIEQSCSPWASPVVLVPKKDNSLRFCVDYRKLNAMTESD FT AYPIPNITEILESLSGASTFSSLDLNCGFWQVPMDDKSKLMTAFITSRGLY FT HFNVMPFGLKNAPATFQRLMEIVLRDLLGKICYVYIDDIVIYSPTLTQHLH FT DIQTILERLEKAGLTLNLKKCSFCLPEITFLGHVVSHQGVAADPKKVEVIH FT AYPVPQNLKDVQRFLGLAGWYHRFVPNFSRIAEPLNNLKKKGRQFKWDSLC FT QQAFDNLKFCLTTPPILGHPDLNIPFTVYTDASDSGLGAVLTQRKEQGGEE FT VIAYASRTLTKAEVNYSTTEKECLAVVWALDKWQHYLEPRMFTVVTDHSAL FT QWVMNSTKPASRLMRWALRLQRYDFVIEYRKGRLNVAPDALSRMYSMPGCN FT LYTTEKDLPDFPVTPQTIWEEQHQDTDIMKIFQALAKNEQQEQAQYTVLED FT KLYHITHLADETVHYKVVIPSTLRPTVLEWYHDTPLSGHLGIYKTYKRIQD FT VAYWPGMWTDIKKYVKNCAKCQVTKWDNRKPAGKLQQVTTSRPNEMWGVDI FT MGPMPKSGKQNEYLLVFVDYFSKWVELFPMRHATAQTIATILRQEMLTRWG FT VPDFILSDRGAQFVSSLFTELCGKWNITPKLTTAYHPQTNMTERVNRTLKS FT MIAGFVEDNHKTWDTYLPELRFALNSAIQESIGMTPAELHLGRKIHSPMDK FT LLHRRDLSPTKPAYDMVHKITQLQRQAKENYTKAQKRQLRSYDKNRRDVFF FT RERERVWVRNFPISSAQHHFSAKLAPKWKGPYRIIQQLGPVNYQVSLEDTG FT EDVRNVHVCNLKPCFPTAEELEAREKNCTKILPQQDQKRF" XX SQ Sequence 5822 BP; 1796 A; 1330 C; 1183 G; 1513 T; 0 other; taagtggcgc ccgaacaggg accctgaaca cttaaaaaaa aaaaaaaaaa aaaaaaaaaa 60 aaaactgaac actttaaaag acattgaaca ccattgaaac cttactttat tttgggaaat 120 tttgaacttt gaaactcatt tgaattgttt gactgatttc agttgacaac aacctttttg 180 actttttgaa ctgttttgct ttactgacaa agactttgga aaaaacaatt ttgttaaaaa 240 aaaaaaaaaa aaaaaaagtt gttatagaac atttgtgtca ttgtacagga attgtactga 300 ttttgatttt gtataccttg tgaatttggt acttgtggta cttttgacat ctctctctct 360 atatatatat tttttgtgac cacattgcct aaaaagtcaa cattttcttt tttgattttt 420 attttttcaa aacccttata ttgttccttg acttacactc atacacacta taaacatggc 480 caacgtcaat ccttcccctt caacttccgt ggatattgat ccaccagata tagccactcc 540 agtttggcca ccagtgcagc aaagacagtt ctcatcacct tccaacattc ccacatacca 600 ctccacaccc actcagttag acccttatgg aaggactcaa gtgcatttcc acaccaccac 660 tccaggtgtt acttctacag ttcaacctga cccaatgcaa ctgtgcacaa gtgcatccac 720 ggtagaatca ccgccctcca cagcaaccca gcatgctctt cctggatacc tccctacacc 780 tggaagagaa attcatcaac ttactgctca tgtacaagga aactgggatc gtgtatttga 840 ctgtctgaaa cggcaagata aagctgtgaa ggaactcacc gaaaaatcgt ctaaatcttt 900 ttccctgcat gaagcaaagc ttgcaaaaat ggaatccact catcagcaac tcctgaacac 960 cttaactgca caacgaaaag atgacacaga gacagcggat caactcacta aagctgtgaa 1020 ggtgatggtg acacaagaaa tccaaaggag tgaaagtacc ttaatttcag agattcgctt 1080 catggtggaa caagctcagt tggaattgca gaaggatatt caagctacca aggaacactc 1140 tgacaagaat tttgaacgcc tttccagtga tctaaatcac tgcagcactg aaattaatgc 1200 cataaaaaac caacttgaca atcttcaaac agaaataagt gatgtcatcc cacctataaa 1260 gcaagtgtct gatcctccaa gcagtgcacc tgtatccgtt tcaacacagt cttcatcttc 1320 agtgactgct ccaatgcctt ttcaaacacc tgttataaaa agtgatcatt taaagttaac 1380 ttttccaacg tttggaagac cttcggatga tgctgatcca ctgctatatg taacacgctg 1440 caaagatttc ctggccttac accctctaga tgatccagac atcctagcta ccttccgcac 1500 tgtcctgtac ggtacagccc gggattggtg ggaagtggct cgctctgcta tttccacatg 1560 gagtgagttt gaaactgctt ttctctcagc tttcctttca gaagactatg aggatgagct 1620 ggcagagagg gttagaacta gaacacaagc agagaaagag tcaattagag actttgcttt 1680 tacatacaga gcaatgtgta aacgatggaa gcccacatta actgagagtg aattagtaaa 1740 aatgattcta aaaaacataa aacctcacct agccagccaa cttcgaagcc gtgtccatac 1800 agtggatgag ttggttaaac tgggccttca gcttgagaag gattatgttc agcagttaca 1860 ttatgtagaa catgtgactc aaccctcacc acaaagaatt gcccccaacc gagttgagaa 1920 acctccagtt ttgtgttgga gatgcaaagg tctgcatcca ccaggtagtt gtcctcacta 1980 ttcctcctct gtgcaaacca ctcaatcatc tagtcaccct cctcctactg gaaataaacg 2040 ttattttcag acccaaaagc acggaggtaa tccatctaac aatgccatgt ctgttacact 2100 tccttcaaag tcattaccca agtcgactgt tactaaatct gtggtcatac cacaacagct 2160 gatagttcca atttacattg gggcttggag aggaaaagcc atattggata cgggtgccag 2220 ttacacttta ctccatgaga gtttgtggaa ggagatcgat ccccaagcca gcctccatcc 2280 ctggacactt ggcccactct atctggccaa tggagaagcc gaagttcctt taggatggac 2340 gaattttgaa atcatattgc atgacaaagt ttttcctact caagctgcca ttctcactcc 2400 aaaagccttg gcttactctg tagtcttggg tttagatttc atttattcaa gtggtctaca 2460 gattaatgta gttgaccaga catactcttt taagtccaac cctaatgaag agtacccttt 2520 tcaacctgga catgctagtg ttcctgtggg aagatcccaa catttgaaca aaaatgcaca 2580 aacccaacat tcaagtaaga cactatctct gctcagctct attcctccac cattaccgtt 2640 tccagtagta tcccaacttg cacccagtag tgatgatcaa gctctgattg agatggctgt 2700 tgccgaagca cacttaccac tagaaagtaa gccacagtta cttcatcttc tccagtcaaa 2760 cccaaaagtc tgtactcttc agcttggaag aaccactgtt cttcaacatt gcatttacac 2820 cactcaccca gtacccgtta agcaacgtcc ttatcggttg acacctggaa aacaagccat 2880 agtagaggaa cagattgaag agatgctaaa ggctggtgtc atcgaacagt cttgttctcc 2940 atgggcatct ccagtagttc ttgttcctaa gaaagacaac agtcttaggt tctgtgtgga 3000 ctacagaaaa ttgaatgcga tgacagaaag tgatgcttat ccaataccta acatcacaga 3060 gattttagag tctctttctg gagcatccac attctcatcc ttggacctca actgtggatt 3120 ttggcaggta ccaatggatg acaaaagcaa gttgatgact gcattcatca cctctagagg 3180 gttatatcat ttcaatgtta tgccctttgg actgaaaaat gctcctgcta ccttccaacg 3240 tttgatggaa atcgtcctga gagatttact tgggaaaatt tgctacgtct atattgacga 3300 cattgtcatt tactcaccca ccttgaccca acatcttcac gacatccaga ccatcttgga 3360 gagactggaa aaagcaggtc taaccctaaa cctaaaaaaa tgttcctttt gcctacctga 3420 aattaccttt ctaggacacg tagtgagtca ccaaggagtt gcagctgacc ccaagaaggt 3480 agaggtcatt cacgcttacc cagtcccaca aaaccttaag gatgttcagc gattcttagg 3540 actggcagga tggtatcacc gttttgtacc aaatttttca cgcattgctg aaccactgaa 3600 taatctgaaa aagaaaggac gacaattcaa gtgggattca ctatgccagc aagcatttga 3660 caatctaaag ttctgtctta ccacacctcc catcctgggc catccagatc ttaacatacc 3720 ttttactgtg tatactgatg ccagtgactc aggactaggg gctgttttga cccagcgtaa 3780 agagcagggt ggcgaagaag taattgctta tgccagtaga accttgacta aggcagaagt 3840 gaattactcc accacggaga aagagtgtct ggctgtggtg tgggctttag acaagtggca 3900 acactacctg gaacctagaa tgtttacagt ggttacagac cattccgctc tgcaatgggt 3960 catgaattcc accaaaccag ccagtcgact catgagatgg gccttgcgct tgcaacgcta 4020 tgattttgtg atcgagtaca gaaaaggacg gctgaatgtt gctcctgatg cgttgtcccg 4080 tatgtattcc atgccaggct gtaacttgta caccacggaa aaggatctgc ctgatttccc 4140 tgtcacccca caaaccatct gggaggaaca acatcaagac acagacatta tgaagatctt 4200 tcaagctctg gccaaaaatg agcaacagga acaagcccag tacactgtgt tggaagacaa 4260 gctgtatcac atcacccacc tagcagatga aactgttcac tacaaagtag tcattccatc 4320 tactcttaga ccaacagtac tagaatggta ccatgatact cccttaagcg gacacttggg 4380 aatttacaag acgtacaagc gaatacaaga tgttgcttat tggccaggaa tgtggacaga 4440 cataaaaaaa tatgtcaaaa attgtgccaa atgtcaagtc accaaatggg acaaccggaa 4500 acctgctggc aagttacaac aagttacaac atcacgacca aatgagatgt ggggagtgga 4560 tataatgggt ccaatgccga agtctggaaa acaaaatgag tacttactcg tatttgtcga 4620 ctatttctcc aaatgggttg aactgtttcc catgcggcat gccacagcac agaccattgc 4680 caccatacta agacaagaaa tgttgactcg gtggggagtc cctgacttca tattgtcaga 4740 cagaggagcg cagtttgttt cttctttatt cacagagctg tgtggaaaat ggaacatcac 4800 tccaaaactt accactgctt atcacccaca gaccaacatg acagaaagag tgaatcgcac 4860 tttgaagtct atgattgcag ggtttgtgga ggacaaccac aagacctggg atacatactt 4920 accagagtta cgttttgctt taaattctgc aatacaggaa tccattggga tgacgcctgc 4980 cgaacttcac ctaggtcgga aaatccacag tcccatggat aaactgctgc acagacgtga 5040 tctctcacca accaagcctg catacgacat ggtacacaaa ataacacagt tacaaaggca 5100 agccaaagaa aattacacaa aggctcaaaa acggcagtta aggagctatg acaagaacag 5160 aagagatgtg ttcttcagag aaagagagcg tgtatgggtc cgtaattttc ccatctctag 5220 tgcacaacat cacttcagtg ctaaactagc tccaaagtgg aaaggaccat accgcattat 5280 ccagcaacta ggtcctgtga actaccaggt atctcttgaa gacactggtg aggatgtgag 5340 aaatgttcat gtgtgtaatc ttaaaccatg tttccccacg gcagaggagc tggaagcaag 5400 ggagaaaaat tgcacaaaga tcctcccaca gcaggatcaa aaaagatttt aaaaatgtaa 5460 atcctcgtga gcattgaaca acatgggttg ttctcacgaa gggggggaga gtgtgacgag 5520 atggatgctt taatgtttat ttttctcaag tgccactagg gggcgctgct ccgaccggtc 5580 cttcctatcg cttccagaca atacttccgg gggtcggaag gaagcggaag ggcaggtaaa 5640 caatcggaga ttataaaaga gggagaaaag gccagagaag gggcttcttg ttgttttggt 5700 gtcgggtatt tggtggcaag aattggagaa ggaggagaag aacggtgtaa ggtggagagc 5760 ttgttggttt caatgactgt gtgaggaaaa aactgagggg tggaagcaat tcgtggacgt 5820 gg 5822 // ID Gypsy-25-LTR_DR repbase; DNA; ZEB; 771 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-25_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-25-I_DR; Gypsy-25-LTR_DR; Gypsy-25_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-771 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-25_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 24-24 (2005). XX DR [1] (Consensus) XX CC Gypsy-25-LTR_DR is a long terminal repeat of the Gypsy-25_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-25-I_DR. XX SQ Sequence 771 BP; 208 A; 135 C; 149 G; 279 T; 0 other; tgtagcgctg ccacataatt gaggtacgct ttctgttatg ttcattaaat gtgtgaatag 60 gcctggcagt aaagcatact cacggttctt gaggctccga atatacaaat gagtaatgga 120 cttttatttt agattcttta ttttgaaggg atgaggtgag gtgaatgtat cggtgttcta 180 tttcctgttc tgttgatctt tttcttatat atacacattc acataacata ctcaccacaa 240 gggggagata gagggcacct ggacattact cacagtgtca tctgaagaag tcagaggagc 300 acatggagta agagtttgcc aaggtctacc ttttttcttg ttttcgtgtg cgaaggacaa 360 ttcgtttgtg tggatttttt gccctttttg gacttgttag tctttcacgt ctgaagtaaa 420 gactatccta agaggacact tgcaacgtaa gatggtacaa ttatctggac tcaaactttt 480 ctggtgagaa caatccattt tgcatatact ctcatccacg gactttggac tttgcacata 540 tacatacata catacacaca caccctcact caccattttt gtattatgtt attgtataaa 600 tgttgtttgt gtttttgttt cattgttaag acttgttcag tgacatttgt ggttctatta 660 catagactat ctctcaatcc atgataaaat ttgaatacag aaactagaga attgaaccat 720 atcttttatt tttttcttgt gtatcccgtt gatttaaaga cgagtgttac a 771 // ID Gypsy-23-I_DR repbase; DNA; ZEB; 5793 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-23_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-23-I_DR; Gypsy-23-LTR_DR; Gypsy-23_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5793 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-23_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 19-19 (2005). XX DR [1] (Consensus) XX CC Gypsy-23-I_DR is an internal portion of the Gypsy-23_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-23-LTR_DR. Gypsy-23_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein composed of the gag, protease, reverse CC transcriptase, and integrase domains. Likely, based on CC structure of related elements, including Gypsy7_DR and CC Gypsy-22_DR, the polyprotein was encoded in active elements by CC one ORF. It contains a few stop-codons in the consensus CC sequence that was built from four copies less than 4% CC divergent from each other. XX SQ Sequence 5793 BP; 1766 A; 1247 C; 1202 G; 1578 T; 0 other; gagtggcacc caaacaggga cctgaatttg atttgaactt tgttaatcac tgaaatatca 60 ctgtaaagtt ttttttttgt gtgtgtgtgg gtgtgatgac tagaatttag actggtagtg 120 tgtgcattgt atatgtaaac agacgtgatg tgtattgact tgctgaaagt gtttgaaagt 180 ttggatttgg agggtgttaa tgacatttgg tgatatctgt acatgattgt atagtgaaga 240 tttggtagaa aaaaaaaaaa aaagtcaaac aaaaaataac acgcacactg atatatttct 300 cacatttaat gttttttttt tttacctagc agttcattct aaagcaaaat ggctgctaca 360 ccaacttccc cttccacttt tgtagatatg gagcctcttc ctgatatttc aactccactt 420 tggacctcaa cagtgccacc tgtgagtaca tcatctctcc ctagtgtctc tgacaggtat 480 aacacgcata ttcattttac cccagtgcat catgaacccc ctagtccaat ggcttttcct 540 attcttgctg gtgttaatcc agtgaatgca caatctgggg aatcacacct ggagaatatg 600 cagatgtgtt ccacaagtga agaggaaact ccatctgcag atactcaaca tgatcttcca 660 cctgttttac caactcctgg aagagaaatg caatatatta ccactcaagt tcaaggaaac 720 tggtcaagta tggttgactg catgaagaaa ctagaggaaa acaccagtca tgtcatgagt 780 ctctcaacaa ctgtacaaca aaaagaagaa tctgaaactg accacctgac gaaagcagtg 840 aaacaaatga ttagtaatga atttcaaaag gtggaaaaaa ctgttatatc tgaaatgcag 900 ttcatggtac gccaactcca ggcagaactt caacaagacc aaaaaactgc tcagcacata 960 tttcaaaaag agcataatta tctactttca gaacttcaaa agctgcagaa aagcaccaac 1020 caaatggaca aaattcccac ttgtgtgaat gaattgagag aggaaatgga gaaacaattt 1080 caatatatta tcaagagaat tgatgtacag aaaccacctg ctgcaagtac catcatgccg 1140 ttgacttcat ctgatcatcc tcaaccctct tcgtcatcat cgccatcact tccttcacct 1200 ttaattaaga gtgaccatat taagctgact tttcccactt ttggacagat atccgacgat 1260 tcagatcctc ttttatatgt aacaaaatgt caagactttc tggctctgca tcctctcaat 1320 gatgtagact tactagccac ctttcgtaca gtccttcatg gaacagctag ggattggtgg 1380 gaggtcagac gttccagcat acttacatgg actgatttca aggctgcatt tctgtctgca 1440 tttctatctg aggactatga agacgaatta gctgaaagag tgagaactag agtacaagga 1500 gaaaaagaat ctatcagaga ttttgcattt acctatagag ctctctgcaa gaggtggaag 1560 acggatctaa ctgaaaatga aattgttaaa ttaatactca aaaatattaa gccttacctg 1620 gcaagccagc tgcgcagtcg ggtgcaagct acacactcat tcatgaaagt atcatacacc 1680 agctcaattc atccgatcca cttgttccat ggtccgaggg acccctttat ctggcaaatg 1740 gggaagcaga aactccgttg ggttggaaaa acatcacaat tcaactccat gaccaagagt 1800 ttatcttacc tgctgccatt ttatcttctc aagctcttgc ttacgctgtt gttttggggt 1860 tggatttcat tttctttagc ggattgcagt tgaatgtcac ggaccagaaa tattcattta 1920 agtgtaacgc tgcaaaggag tatctttttc agccaggtag ggcaactcta ccaatggaaa 1980 gtgtacaccc agaaaaccgg ctacaggaga aaacaataca aaacttgtcc ttattgagtg 2040 ctattcctcc tctacctcat ccaataccca tgatccctcc tgacggcatg aacgaacaaa 2100 tgttgatctg cagtgtagtg gatgaagctt gtttacctca agagagtaag caggagttgc 2160 tccatatcct ggagagcaat cctcagtgtg cactctcagg actggacgta ctgatgttct 2220 acaacaccgc atatacactt cctgccaagt acccattaag caaaggccct accgattatc 2280 cccactcaaa caacgggctt tggaagaaca gctggaagaa atgctgagac aaggaattgt 2340 tgagccatcc cactctgggt ggtcatcacc ggtagtgctt gtacccaaaa aagatggtaa 2400 gctgcgtttt tgtgtggatt acagaaaggt aaactctgtg acagagagtg atgcttatcc 2460 tattccaaac atcactgaaa tccttgagtc tttgtctgga gcagcaatat tctccagtat 2520 tgaccttaac actggctatt ggcaagttag catggatcct gatagtaaag ctaagactgc 2580 ctttatcact tctgctggtc tgtatcagtt taatgtcatg ccttttggtc taaaaaatgc 2640 tcccgctact ttccagaggc ttatggagat ggtcctgggg gatctacgaa aaaagatttg 2700 tttagtctat attgatgaca ttataattta ctccccttca atcacccaac atttccatga 2760 tcttcaaatg gttcttcaca gattggaggc tgccggcctc actataaacc tgaaaaagag 2820 caaattctgc ttgcaagaac ttgtgttcct gggacatgtt gtgagtgtta gaggcattgc 2880 agcagaccca tgtaagacag aggccattca ttcgtatcct gtaccaaaaa acttaaagga 2940 ggtgcagagg ttccttggtc tagctggatg gtaccatcga tttgttccaa atttttccag 3000 gatagctgaa cccctaaatt ccctgaaaaa gaaaggaaga ctatttcagt ggacaaaaca 3060 ctgccaacaa gcctttgatg agctaaaaac ttgtcttact tccccaccga tactgggcca 3120 tcctgatcta cagtttccct ttacagttta cacggatgcc agtgactctg ggttgggtgc 3180 catcctggca cagcgtaaga atgctggcca agagatagtc attgcctatg ccagcagggt 3240 tttatctgga gcggaggtca actatacagc tacagaaaag gaatgtctgg cagtcgtatg 3300 ggccctggaa aaatggcaat attatttgga acataaactc ttcacagttg tcaccgacca 3360 cgctgctctc caatgggtaa tgggatccac taaaaccaac agccgtctca ttcgatgggt 3420 cctacggcta cagaaatttg atttcatcat cgaatacagg aaaggcaaac ttaatgtagc 3480 cccagatgca ctctccaggt tttctcaacc ttctagttgt tacctgtatt tgggggatct 3540 acgaaaaaag atttgtttag tctatattga tgacattata atttactccc cttcaatcac 3600 ccaacatttc catgatcttc aaatggttct tcacagattg gaggctgccg gcctcactat 3660 aaacctgaaa aagagcaaat tctgcttgca agaacttgtg ttcgtgggac atgttgtgag 3720 tgttagaggc attgcagcag acccatgtaa gacagaggcc attcattagt atcctgtacc 3780 aaaaaactta aaggaggtgc agaggttcct tggtctagct ggatggtacc atcgatttgt 3840 tccagatttt tcacagagat agcgtgaagc cccgtaaatt ctcctgaaaa agaaaggaag 3900 actatttcag tggacaaaac actgccaaca agcctttgat gagctaaaaa cttgtcttac 3960 ttccccaccg atactgggcc atcctgatcc acagtttccc tttacagttt acacggatgc 4020 cagtgactct gggttgggtg ccatcctggc acagcgtaag aatgctggcc aagagatagt 4080 catcgcctac gccagcaggg ttttatctgg agcggaggtc aactatacag ctacagaaaa 4140 ggatgtctgg cagtcgtatg ggccctggaa aaatggcaat attatttgga acataaactc 4200 ttcacagttg tcaccgacca cgctgctctc caatgggtaa tgggatccac taaaaccaac 4260 agccgtctca tccgatgggt cctacggcta cagaagtttg attttatcat ggaatacagg 4320 aaaggcaaac ttaatgtagc cccagatgca ctctccaggt tttctcaacc ttctagttgt 4380 tacctgtact ccaaccttaa agatccaggt ttgccagtat cagatgatat cttgtgggaa 4440 gagcagcaca aagatgcaga cattgtggag atactgaaag caggtgcaga atcgaatgca 4500 acttcgtttg atcaatacga ggttgtagaa gacaaattgt accagaaaac tcacctgcca 4560 aataatcagc tgcactacag aatatatgta ccgaagagcc tccgcccatc tctattacag 4620 tattatcatt ctgctccagt aagtggacat ggaggcatct ttaaaacata caagagacta 4680 caggaagtag ctttttggcc tggcatgtgg tctgatgtta agcggcatgt aaaaagttgt 4740 acaaaatgcc aaactgtaaa acaggataat cagaaaccag ccgggatgtt acaacagatt 4800 accaccactc gcacaaacca gatgcttgga gttgacatca tgggtcctct gccaagaagt 4860 tcccaacaaa acgaatacct gcttgttttc gtggattatt tctcacggtg ggttgagtta 4920 ttccccattc gtcaggctac tgctcagtgt gttgccacaa ttttcagaaa agaaatcctg 4980 actagatggg gagtaccaga tcacattctc tctgaccgtg gggttcagtt tatctcatcc 5040 gtgttcaagg aattgtgtga gaaatggaat atcacccccc gactcaccac tgcatatcat 5100 ccacaaacta acctcacaga aagaataaat cggacactca agaacatgat ggctgcatat 5160 gttgaagaaa accacaaaaa gtgggaccag ttcttaccgg agtttcgatt tgcattgaat 5220 tcagctgttc atgaaactac tggtctatct cctgcagaac ttcaacttgg cagaaaactc 5280 caaggaccaa tggataagat gctgtctggc actaacctgc ttcctgatgc tgactcatat 5340 ggtgtagtcc aacacattca tcaactgcaa actcaagcac aagaacacag taaaaaacca 5400 aaaatgagac aaatgaggaa ttacaataaa aacagaagag aagtaatctt caagaacaaa 5460 gatcgggtat ggttgcgaaa ctttccacag tcaagtgcaa aacataattt tagtgcaaaa 5520 ttggccccaa aatggaaagg accataccgt gtcttaaagc agctgggtcc tctaaattat 5580 aaaatttctt tggaagatag tggagaggat gttaggactg cacatgtgtg taatttgaaa 5640 ttgtgttttc caactgcgga agaattagag atgcaagaaa aacagaaaat ggtggagatg 5700 tttcaggagg tctctgatga agaagacttc ctaggatttt agtctgtgta tttttcaaca 5760 accatgggtt gttttttttc caggggggga gag 5793 // ID Mariner-2_DR repbase; DNA; ZEB; 1481 BP. XX AC . XX DT 19-JAN-2010 (Rel. 15.11, Created) DT 19-JAN-2010 (Rel. 15.11, Last updated, Version 2) XX DE Mariner-type DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1481 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 10(11), 1869-1869 (2010). XX DR [1] (Consensus) XX CC ~88% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(307..642,663..1283) FT /product="Mariner-2_DR_1p" FT /translation="MGKTADLTVVQKTIIGTLHKEEKSQKAIAKEAGCSQS FT AVSKYINSKRKGKCGWKVRKGKKIKCLEKVHRYDRSLERIVKKGTEAGVRA FT SRTTTYRRLHDMGYSVEFHVSSHSKCLCWAKEKKYWSVAQWSQVLFSDESK FT FCISFGNQDPRVWRKSRDTKVKLLKSSVKFPQSVMVWRAMSYAGVGPLSFI FT KSTVNAALYQDVLEHFMLPAADKLFGDDDFIFQQDLGPAHSAKATSTWFRT FT MVSLFLIGQETHLTLNPIENLWAIVKRKMREQRPRNELKADIKATWATITP FT QLCQRLIASMPRRLDAVISAKGGPTKY" XX SQ Sequence 1481 BP; 483 A; 283 C; 318 G; 397 T; 0 other; tacaggtatg tgccaaaaaa tagaatatcg agggaaagtc catttatttc aataatttgt 60 ttcaaaaagt gaaacttgtg tgttatatta gttcactaca cacaaagtga aatatttcaa 120 gccttcattt gtttcaattt taatgattat ggattaaaga tgaaaaaaaa ctcaaatcca 180 gtatttcaca aaattagaat atcatgacaa agttcaacat tgtaggctcc ctgtgtctca 240 atctagtccg ctaattaatg caaagccttg aaattgtctc agtctggctt agtaggcttc 300 agaatcatgg ggaagactgc tgacttgacg gttgtgcaga agaccatcat tggaaccctc 360 cataaggagg aaaagtctca aaaggcaatt gcaaaagaag ctggatgctc acagagtgca 420 gtatctaagt atattaatag taagaggaag ggaaagtgtg gctggaaagt aagaaaagga 480 aaaaagataa aatgtctgga aaaggtgcac aggtatgacc gcagccttga gaggattgtc 540 aagaagggga ctgaggctgg tgtcagagca tcaagaacca ccacatacag acgtctgcat 600 gacatgggct acagcgtaga attccatgtg tcaagccact cctgaaccaa aaaaaatgtt 660 agaagtgtct gtgctgggct aaagagaaaa agtactggtc tgttgcccag tggtcccaag 720 tcctcttttc agatgaaagc aaattttgta tttcatttgg aaatcaagat cccagagtct 780 ggaggaagag cagagacaca aaagtcaagc tgctaaagtc cagtgtgaag tttccacagt 840 cagtgatggt ttggagagcc atgtcatatg ctggtgtggg tccattgtct tttatcaagt 900 ccacagtcaa cgcagctcta taccaggacg ttttagagca cttcatgctt cctgctgccg 960 acaagctttt tggagatgat gattttattt tccagcagga cttgggacct gcacacagtg 1020 ccaaagctac cagtacctgg tttaggacca tggtatccct gttcttaatt ggccaggaaa 1080 ctcacctgac cttaaacccc attgaaaatc tatgggctat tgtcaagagg aagatgaggg 1140 aacagagacc ccgaaatgag ctgaaggccg atatcaaagc aacttgggct accataacac 1200 ctcaactgtg tcaaaggctg atcgcctcca tgccacgccg ccttgatgct gtaattagtg 1260 caaaaggagg cccaacaaag tactgaggac ataatacagt acattaacac actttttttc 1320 atctttaatc cataatcatt caaattgaaa caaatgaagg cttgaaatat ttcactttgt 1380 gtgtggtgaa ctaatataac acacaagttt cactttttga aacaaattat tgaaataaat 1440 ggactttccc tcgatatttt aatttttggc acatacctgt a 1481 // ID Gypsy119-LTR_DR repbase; DNA; ZEB; 990 BP. XX AC chr19; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy119-I_DR; KW Gypsy119-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-990 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1275-1275 (2008). XX DR Genome; chr19; Positions 42972780 42973769. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 990 BP; 229 A; 201 C; 202 G; 358 T; 0 other; tgtgatgagg tgacttgtcc ctggctgaca aacagtgctc aacattcaaa cataattcac 60 ttaccaaaat tatccagaaa agcttgaggc gattcgggtg cacacgtccg ccatcagctc 120 attggttttc acacacacac aaacgcatcc acaaacgcat ttcttcatcg ccatccataa 180 cgcacgctat agtcacaaat acacgcatat acacatgttg tctgagtgtt taatgttttt 240 ggtttgtccg tatctaagtg ttttcagtta ttgtttgttt aatttttacc ggttgtttgg 300 tttccgtgtt tctccttcat ctgtcttcct ggttgctctt ccatcggccg gtcaaacaga 360 gcgcctccca gcggcctggc taaatagtgc cgggttcact gtggattgct ccagagcaac 420 cggaagcaga ctcgtccatt gtcctctaga gaaggggggg ggaatgggtt tatacttttc 480 tttatgtgtt taaatgttca tgtttatatc tttatatctt tatatcttga tctattttag 540 ttaacattga ttaattgtga ctgtttgggg tttaaaggtt tttaaatact aaattggaga 600 acatgtactg atgtttatac ttacctttca attagggtat gttggtctgt gttaaggggg 660 tggggcttca cctttaaaaa gcagccccaa acaacaattc ggggtcagtt cacctggcga 720 gcgagcgagc tactgtggtg ccgtaagcga tctgtgactt gtatatttta attgtgataa 780 acatatatcg ttttgtttgt tgtttatttg ttcttttatt tttgtttgtt tgctactttg 840 ctgggttttg ggtttcactc tcgtacactg taaataaacc atcactttac acgttatatc 900 agcgagttgt gccatcattt tgaggttttc tcattttttt tatatccttt gagaacccac 960 tacgagcctt cctggctcgt aaacctcaca 990 // ID Tc1-6_DR repbase; DNA; ZEB; 1203 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 27-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Tc1-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1203 RA Jurka J.; RT "Mariner-type families from zebrafish."; RL Repbase Reports 8(9), 935-935 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1203 BP; 336 A; 256 C; 258 G; 353 T; 0 other; cagggtgggc catttatatg gatacacctt aataaaatgg gaatggttgg tgatattaac 60 gtcctgtttg tggcacatta gtatatgtga gggggcaaac ttttcaagat gggtggtgac 120 catggtggcc attttgaagt cggccatctt ggatccaact tttgtttttt ttcaatagga 180 agagggtcat ttgacacatc aaacttattg ggaatttcac aagaaaaaca atggtgtgct 240 tggttttaac gtaactttat tctttcatga gttatttaca agtttctgac cacttataaa 300 atgtgttcaa tgtgctgccc attgtgttgg attgtcaatg caaccctctt ctcccactct 360 tcacacactg atagcaacac cgcaggagaa atgccagcac aggcttccag tatccgtagt 420 ttcaggtgct gcacatcttg tatcttcaca ccatagacaa caaagatagt ggggccattc 480 ttcatcaatg gaaacctcaa ggccactgga tatttgaaat tgctacatga tgatgtgttt 540 ccctctttat gcactgaagc tggcacgttc cctgagtttt tccagcaaga tggtgcacca 600 ccacattatg ggtgtcaggt ccgagcattc ctagatgaac agtttcctga aaagtggatt 660 ggtcgtcgtg ggccagttga atggccccca aggtctcccg atctgacccc cttagacttt 720 tatctttggg gtcatctgaa ggcaattgtc tatggtgtga agatacaaga tgtgcagcac 780 ctgaaactat ggatactgga agcctgtgct ggcatttctc ctgcggtgtt gctatcagtg 840 tgtgaagagt gggagaagag ggttgcattg acaatccaac acaatgggca gcacattgaa 900 cacattttat aagtggtcag aaacttgtaa ataactcatg aaagaataaa gttacgttaa 960 aaccaagcac accattgttt ttcttgtgaa attcccaata agtttgatgt gtcaaatgac 1020 cctcttccta ttgaaaaaac aaaagttgga tccaagatgg ccgacttcaa aatggccacc 1080 atggtcacca cccatcttga aaagtttgcc ccctcacata tactaatgtg ccacaaacag 1140 gacgttaata tcaccaacca ttcccatttt attaaggtgt atccatataa atggcccacc 1200 ctg 1203 // ID Gypsy-115-LTR_DR repbase; DNA; ZEB; 378 BP. XX AC . XX DT 16-NOV-2008 (Rel. 13.11, Created) DT 16-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-115-I_DR; KW Gypsy-115-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-378 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1786-1786 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 378 BP; 51 A; 82 C; 88 G; 156 T; 1 other; tgtcacggtt tatggatctg tgtgttcctc ctgtatgtta tgtatcatgc cttgtgtgtt 60 gtcatgtgtg agggtgtgtg tttgtttacc ttttgtgttg acagcgtgct aggatcagct 120 gatgtcatca gctgacttat catcctccag ctgaacctca tttggctggc tatataattc 180 ctcctgtggt atgtgctggt tatcagtccg ttgttgtcct tgtttgtcct tgttcctgac 240 tgttgtctcc cagtatcctc cgctgtctgt ttgatgtcwt cgttgtggcc tgtggctctc 300 ttttgttttg aactttcgtg tgtcctctgt tcattaaaac actcgcattt ggatcctcct 360 tgtctaagat tcgtgaca 378 // ID Gypsy13-LTR_DR repbase; DNA; ZEB; 1162 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy13_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy13-I_DR; Gypsy13-LTR_DR; Gypsy13_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1162 RA Kapitonov V.V. and Jurka J.; RT "Gypsy13_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 320-320 (2004). XX DR [1] (Consensus) XX CC Gypsy13-LTR_DR is a long terminal repeat of the Gypsy13_DR LTR CC retrotransposon. Its internal portion is deposited in Repbase CC as Gypsy13-I_DR. XX SQ Sequence 1162 BP; 389 A; 174 C; 218 G; 381 T; 0 other; tgtaaccccc ataaaataat gaatgggagt agataaaaca gataagatag ataaaataga 60 taagacaaat aaaattagat aaaacagata aaataataaa taaatataca cctcatcatc 120 attattaaag tgttaaagtg ttgatgttaa aataagtatt aaaataaaat gtataaaaaa 180 tgttaaaaca tgtatatgtg agttataatt ctctgattgg gtgtcaggct gtcagcagtc 240 ttaagtcagc aaatgagagc gctggatgct catctccgcc cccttgtggt tgcggatatt 300 actaaagcat cggaagagac acacagcgtg acagaaagga aaaccgagtg attataaagt 360 aaatatagca aaaatacata agttaaaact tttaaaacca atataatgtt aaaaatataa 420 gtctatagca cgaggatggc gagggattga gtagatatat gctgatttga gcgcgatgga 480 tgcattttga ctcgctcaca gacatcactg tggtcacgtg tacctgatag gaattaaaaa 540 aagaatacat tagaagaaca ggccaggttt ttttgttgct ggattttgga attcttgcga 600 ttctcggttg atggcgagcc tcccaagtgc agtttcagtg gacctacagg aaaggactga 660 cttctaggga aaaaggacac ttgctttttc actatttccc ttcactcaag taagaggatt 720 tgatccttaa gactgtttgt cttttttttt ccaagaggaa gacattttgt gagtacccaa 780 aagacagttg gaagtaaata attcatttat tttattttga tcttcctgga tttgaaactt 840 gatatatttg gaaagatttt atttggttac atgtaacaaa acaatttgga aattgctgat 900 ttatttgaac tctcatctat ttgttaaatt gtgaatttaa caaaatttac agcgtcatat 960 aaagggtgca cccggggtat atttttactt agtgtttata ttattgttat ttcttttgtg 1020 tgtgtatata atacagaaca acgttgtgca acttaccctt tcgtctctgt gagttactca 1080 gcatcaccac tacctccttt cccgttctta gtaatcttct gattttaatt gtttggtcct 1140 ttaaggttta atacccgtta ca 1162 // ID Mariner-N5_DR repbase; DNA; ZEB; 8969 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 06-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE Putative Mariner-type non-autonomous DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TSD TA; Mariner-N5_DR. XX NM Mariner-N5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-8969 RA Bao W. and Jurka J.; RT "Mariner-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1614-1614 (2008). XX DR [1] (Consensus) XX CC The TSD is highly (>90%) TA specific. Position 5803-6198 is 3'- CC partial sequence from RTEX-1_DR. This sequence was derived from CC sequence data generated by the Danio rerio Sequencing Group at CC the Sanger Institute. XX SQ Sequence 8969 BP; 2983 A; 1473 C; 1512 G; 2999 T; 2 other; cagtaggtga gagttcaaag tggctattaa cgccgcggct ggtaccgccg ccgtttgaaa 60 taggtgccgc tctgttttta gtgtatgacc gcagtttgtg aatgagccgc cagggggcgc 120 aaagggacgg gatgcgaacg gacagaaata gatcatacag ctactgtgct tgtaaatgat 180 attaaacaat aagtcaaagc attataattc cttcattaat catttaaaat ttgttaacac 240 gctttattgt aaacttttta agtgcaaaga ggattcgttt tggaaaatag aattgaagaa 300 gtaaaagaca actaaaatga aagcacaaac atttagtaca aataagtagt cttaaataaa 360 taaataaata aataaataaa taaataaata aataaataaa taaataaata aagcagccta 420 aatatagaaa gatgttcagt gtttgctatt ttgataggac taagacaaga cattttcatt 480 tatgttttat ttctgttttt atttccattt tcgttgttga ttatatttta ctatcttatt 540 tatgtctcaa caattgtaca taaacgaata ataaacgaat aatgaaaata aatgaataat 600 gaaaataggc ctaaataaat tatttattta aagacattgc cgtacagtac gtgaaacact 660 gagaaaaata aattgaccta ccttaagcag atcactaaaa gtataataaa attatttttg 720 ccgcaggaca taaattaagt cttgcaggtt tatttttaat aatttagttt cagttctgtt 780 catttttttt ttcaaaacgt caatgttctc ctttaaagta gctataactg ttgttttgtt 840 tttttgtttt ttttttttac ttatggctca gtatgttttg tattttaact tcatattcag 900 tcaccttgca aatgtgtgtg aaatataatg ccgcataacc gcggaaaaag aagaaaaaac 960 tgtcgaaggt tagagctaat tcatcaaaat aagctatatt aaaatacaac atttatgtta 1020 atacaggcag agtgtgaaca aactcaaggc taagatatgc gcttgccttt taatgcattt 1080 gcggcctttt tataggctac aggcaactat gtataataat gttttttttt ttcttgtttt 1140 tttttttgta tggtaaagga caatacacat tatgttattg atgtaaactt aggctaaaac 1200 ttaccattga taaacgtgac ctacctcaag cagatcactt aaagtataaa aaaataaagt 1260 aggctatata tatatattct taataaactt cacagccaca aatataaaaa caacttgtat 1320 taaaactggg cgaggattag aaagaatacg atgcttgtta tatataattt aaattttgat 1380 tccaatttct gcaagcacat tttttcactc gctactctgc cagaaacttc gccgccggag 1440 agcacctcaa taatgtaaac atttgtttaa agatggagcg caagatgtgc cgagtggcaa 1500 catgtgaaga acgatggcaa agctaagtgt gattattgta ataaactaag ttataaagca 1560 gagtcccgac caacaggact aagcatatgc ggttggctca ttcttcacga atatagaatt 1620 aaaataatgg cgcgttaggt tcactaagca gcgcattctc tccgcctcat cgtttaacca 1680 gcgggacacg ctgctgaagc aggagagaca ctccgtcatt cataatctta gctgatgtat 1740 cggagtcgaa agaatatatc aaagaggaat tttcttttaa cagtcccgat atgtttaatc 1800 tttttatttt attttttttt ttcgtgtcat ttctcttcag caaattagat attttttaaa 1860 gctgcttgat tcttttaaaa ccgaaagcag ggcccatagt gtttttccat aagatactcc 1920 tttatgaatg ttttaatttt tttattaaag tttttaatgt gctttttgat agttttattt 1980 tatttcaagc agcatctaaa tgcgaattta tcctcaaaac ttggcatgag agcgatgcaa 2040 gtcatgtgtg gcataattgc cttttttccc tgcgcgtgcg agatcaattt ctgatctttt 2100 tgcaactaaa atgaaagcat aaacattcag tacaaataag tagtcttaaa taaataaata 2160 aataaataaa taaataaata aataaataaa taaataaata aagcagtctt ttaacctaaa 2220 tatagagaga tgttcagtgt ttgctatttt gacaggacta agacaagaca ttttcattta 2280 tgtttttatt tctgttttta cttamatttt cgttgttgat tatagtttac tatctcattt 2340 tatgtctcaa caattataga taataataaa cgaataatga aaattaatga ataatgaaaa 2400 taggcctaga tatattattt atttaaagac attgccgaca ttgagcttaa gttgaatgca 2460 gcttcatcaa aagcagaaaa aaataaaata atttgaccta ttttaagcag attataataa 2520 aaatattttt gccgcaggac ataaattaag tctagcaggt ttgttttttt aatactttag 2580 tttcagttca gttttttttt ttcaaaacgt caatgttctc ctttaaagta gctataactg 2640 ttgttttgtt ttgttttttt acttaatggc tcagtatgtt ttgtatttga atttgacgcg 2700 tcagaaaaaa aaaaccgctg aatctctgca aatatttggt taacttataa aacaaatgta 2760 tggtttagtg atgttagtaa tgcatagaaa aatgcaaagc agaattatta aagttattat 2820 ttcattgatc tgaacataac catttataat ataattacta acccgtcgtc taaaatatta 2880 aaataataaa atagtctatt attttattca tggctagagc ttgattaaat tgattaacaa 2940 gccatgtgct tttgtttgtt tgttctcggt actgggtata cgcaaaacat tttctaaaat 3000 gtcaagcagt atttgtctat agagcagcat tttgctaatt ataatagtcc tttatattta 3060 tatacatgtg tttttgcata ttcgttattt cacttgatgc attcgttatt taatgtttga 3120 atactgaaat aaaaagccat cgtttaaaag gttatttcac catcaaaatg tagcctgtta 3180 tatgtatcat taacgtttta tataaaactg ctgtgcagtg taaatgggac cttaatacag 3240 tttaaagcga agttgtctgt aaggatgatg ggcatggcat gtctattgtt gtggctcgcg 3300 ttgtgaatat tagaaaataa ctgtcgggtg aaggctttat ggtcgtgcgg aattgtctgc 3360 tgctgtcatg gctcagcagg caacacccta ttcagctagg ctatgttcgg tgtttgtatg 3420 tacaaattcg ttttggcgaa tgctagagcg taatagagat attttaaata acaagtatag 3480 ttcagtccag aagttgctgt gcatagaaac attcattttc ataaggcccc atttgcggtt 3540 tggttttaaa acgcataggt tttgctacgg ttacgccatc cgtcctcggg agttttggat 3600 tttgtgtaac catgtttgtg gaaaacactt gagggtggag acatacccct tcccccgtct 3660 cataaccaaa agcttgtctt tcaggtgtta atgggcatcg agaccgaagt catgtcgcat 3720 ttccactgtc ggcctatagc tcgcagcgcg taacgcaaac ccagccccca gaacgtcccc 3780 cgagtgttgg catcgtgcat ccgttaataa ctcggtagaa aatgtatttt ataacatcct 3840 catttagaca cagtcaaaca ttcagcccaa atgcactgga gagacctgaa acgtgttccc 3900 aaaaatgctc tgtttgcacg atttgattat tatcatcata tccaaatact ttataaataa 3960 tattcttaac cttctctggt gtttattgtt tttagaaaat atctaaaatc ttttaaaaga 4020 aaatatcttg taaaataaaa gtttgatttg gctttaacgg ttttatttat gtataggcct 4080 acctaaactg ttatagcttg cttttgcttg ttttatattg gtttatttat ttaatgcgat 4140 tttattatat ccgataatgt aattctttaa attatatttc aatgtggctt aggcttatca 4200 agatatgaca cttgttttga gtctacaaaa ggtaatttgt aaagtgttat gtctttctgt 4260 tgtattcata tggtgtatta tctgcaaaat aaataacaaa aagaaagaag ccgctagcag 4320 catccacaga gctgtgaaga gagcgctctt ttgctcggtc tctcacacac aacctttatg 4380 tctgctgtgt ctttaatatg gtgtgtagat tattatgcgt tattgaaaca tcaaggggga 4440 cgcagcaaaa tcacttataa attaaaattg tattattcta tgcaacagcc ttacatttaa 4500 aacggaaaca taagggcgat tttaagcaaa aagacctcag cctataagtc tagatgtttt 4560 cttttccttt ttatttattt gtttatttgt ttggcgcatg tgtgcgatag gaaaactata 4620 gtgtaattac ggcaagccat ctttcccaga ctcggggcaa attcctttca ctccgtcccg 4680 aagccccagc tggccctcct gccatccact gcgtaaaaca tatgctggat aagttgacgg 4740 ttcattccgc tgtggcgatc acagactaat aaagggacta aaccgaaaag aaaatgaatg 4800 aagaatgaat gaataaatta ataataataa aataaaaata aggtaataat aaacacaaat 4860 gccgcgtaag cggccttttt tttagagatg gtacatttga attgcctctt cccacttccc 4920 ccacctcgaa attgaaagtt gcaaagagaa ctcaatagca aagatcatag cccattaaca 4980 catcaacaca tagtccattt acacatcaac acagtgcaag tttagcggaa gaatgtcaaa 5040 tcagaatcac gcagatgaag cacaccagct actactatga aactattaaa attgttttgt 5100 cataaatttg cttatttgca gcagcttttt ttcgcgcaac agaaaagcgg cgttgggaag 5160 ccgtcaatga ataaaaacag ggtcaatagt gaaaatcggc ttttcattgt atccctatta 5220 cctatatcaa tgcattgaag gccttttatg gtgtcatctc gatagaatag cctaagttat 5280 aaataattta aagaacaaag ttataaaaac tcacattaca taaataaaca attaaaaata 5340 tgtttaggtt gtctatttaa ttgccctcgt gatgatggtg tgctttatca ttatgtgcgt 5400 tttacttata actaacctga tttaaatgat ggatctgcga cagagaaact aggggttttg 5460 ggggacatgc atcactatat cctttttttc cccaaactgt gcatagttca gctgcaagtt 5520 actgtacgtc tttttttgga aaagcacccc atgtccgcta gtgccgcgcg cctctttctc 5580 tgggcaaaaa tgagttgatt tttaaaataa taaatatata gattaattaa attttcagca 5640 aaagagacat ttatttcata tttcactatg catttcatct gatctttata gcttaattaa 5700 acacatattg tcaaaaccca cacataggcc tattgtatga aaatgaaaag ggcatttttt 5760 gttattatta cactactatt tgttatttgt tgcctaggct attttcacat ttgaggtgct 5820 ttattggcat gacaagtaac tgtacattcg ttttgccaaa gcagtgcgcg tctcacaaac 5880 aagacagtgc aaaaagggca gtagtgcaaa caaatatgaa aataacatat aggctaataa 5940 aaaaatgatt aataaaaata aaaatagaat aaaaaataaa aatataataa taaaaataaa 6000 aatagatttt tgataacatt aaacaggata aaggtaataa tagtagccta ttatcaaaag 6060 taagtccaca taaagagttc cagtatttgt gtgttggaga caggagcggg tattgttggg 6120 tgaacacaca gcagcacaca cactccccca gtaagatggg ttctgttcga cagggactca 6180 aagttgtttt gtatgtgttg gattatcggg tagaatttct cccggatctc ggagtattta 6240 gtgcactcgg gatatgcagc tccgtctcaa tcagctgctg agggcacagc cgcttctcca 6300 ccggcagcca tgtttttttt tgtgcgggcc cgtctttacg gctagctgat gtccgctgcc 6360 cgcttctaac atgttagaaa atactgcaaa tcaagctgtc ggtatctccc cgcggttgaa 6420 attgttcgtt attccgaaaa aaagaagaag aaatatcagc gcgcagtagt tgctttctag 6480 attatttagg cctatattca gctgttttaa tcttgcaatt ctgataatta gagataatgt 6540 ctgttcaact tgtctgtcat ttatttctca tttgctactt attttattaa tttgttgatg 6600 ttaatatact acatagtctt gtatatgcat atctaaaatc ctttggcatt caaatttgct 6660 ttgaacttga aagagagaaa gaatcggatt ttacctgtca gagcacattg catatgcttt 6720 ttttgaaata acatttattt aacaaatatt ttagcacatc gaaagtaatt aaattacctg 6780 tcttttgatt aaaaccttta tgcaaaaaga tcagaaaaaa aggcaatatg cgacacatga 6840 catgcatcgc tctcatgcca cgttttgagg ataaattcgc ttttaggtgc tgctttgaat 6900 aaaataaaac tattaaaaag cacattaaag aatactcata aaataaaacc ttgatagagg 6960 agtatcttat gaaaaaacac caattgacgg gctctgcttt cggttttaaa agaatcaagc 7020 agctttaaaa atataatttg ctgaagagaa atgacaggga aaaaaagatt gaacatatcg 7080 ggactgtcaa aagaacattc ctctttgata tattctttcg actccgacac atcagcaaag 7140 attatgaacg acggagtgtc ctccactgct tcagtggcgt gtcccgctgg ttataaacga 7200 ggaggcggag agaatgcgct gcttagtttc ggctcgatat atttaaataa aaactaacaa 7260 cgagctgctt ataacgctca aaaagttaaa ctcaaatgca caacccttgt tggttgcacc 7320 tgcaggatgc acaatgaacc taacgcgcca taattttaat tctatattcg tgaagaatga 7380 gccaaccgca tatgcttagt cagtcctgtt ggtcgggact ctgctttata acttattagt 7440 ttattacaat aatcacactt agctttgcca ttgttcttca catatagcca ctcaacacat 7500 cttgcggtcc atctttaatg tttagattat tgaaggtgct attgacgttt tgagaaaaaa 7560 attaatagta ctgaaactaa attattaaaa ataaacttgc gggacttcat ttatgtcctg 7620 cggcaacatt atttttatta cactttaagt gatctgcttg aggtaggtca cgtttatcaa 7680 tggtaagttt tagcctaagt ttacatcaat aacataatgt gcattgtcct ttaccataca 7740 aaaaaacaac accagttatg tatatagcaa atatttttaa atgcattaaa aggcaagcac 7800 atatcttagc cgtttgttca cactctgcct gtattaacat aaatgctgta ttttaatata 7860 gcttattttg attaattagc tttaactgac agccttttct tctttttccg cggttatgcg 7920 gcattatatt tcacacgcat atgcaaggtg actgaatatg aaaataaaat acaaaacata 7980 ctgagccatt aagtaaaaaa acaaaacaaa acaacagtta tactttaaag cagaacattg 8040 acgttttgaa acaaaaaaac ggaactgaaa ctaaattatt aaaaataaac ctgccagact 8100 taatttatgt aatattttca ttattcattt attttcatta ttcgtttatt attatgtact 8160 ataattgaaa cataaaatga gatcgtaaat ataatcaaca acgaaaatgt aaataaaaac 8220 gtaaattaaa aatataaatg aacatgtcta gtcttagttc tatcaaaata gcaaacacta 8280 catctctcta tatttagact aaaagactgc twatttagct ttatttattt atttatttat 8340 ttatttaaga ctaattattt gtactgaatg tttgtgcttt cattttagtt gtctttcatt 8400 tcttcaattc tattttccaa aacgaatcct ctttgcactt aataagttta caataaacca 8460 ggtaaaaaaa aaattaaatg taagttgagg tgatcgcaag ttcacaactg tgaggtgatg 8520 tgctctatcg gcaggataat ctatattatt ttgctgtttt tcatatgcgc acgtttaaga 8580 tttgagatta taacattgcc atgatagtca aagaaatcaa aacattattt tcaccccaac 8640 gcaatttaat cgggcactga aggctaataa cttaattctt atattagctt tttgacttct 8700 caccatgtat cttttctaca gcatagtata gtgtaagagc atgtctatta ccagaaaaac 8760 taaattaaaa ttatatttta attgaacaca tcacatcaca tacagtaagc tacgcctaca 8820 gaacagtctg tttagcattg tgaaaaggca aagctgaata tctgtggtta aagtgccacc 8880 cagcggtcaa atgctgctgg cgcatcaagc accgccgtcg ccgcggcatg aatggcggta 8940 caaggaacac attgaagtag taacatctg 8969 // ID BEL36-I_DR repbase; DNA; ZEB; 5833 BP. XX AC chr1; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL36-I_DR; KW BEL36-LTR_DR; BEL36_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5833 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1415-1415 (2008). XX DR Genome; chr1; Positions 43700466 43706298. XX CC Positions [4850-5416] - Integrase core CC 'ATTAT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 35..5770 FT /product="BEL36-I_DR_1p" FT /translation="MSASSKSSVRSARSGKSSASSESVTRARARAEAAKVR FT ASYASQEAKLKIEKATREAQNQLETVKLDTELEVLTLQREADVAAVEAQVL FT EDAEIEMHAAAEKGESEEKVKIERTSEYVNSQLNLSRHSPSARFSALPNAP FT ASHADSHNSFITWSPPLKDPSHTQSINDKPKYQDADFVHLPAANLSSPSTG FT IIKSEVDRTNPFMNIHAKPYIPQCIPSTSTSPQAEPLAQYLARRDLVCSGL FT YQFDDKPENYRSWCSSFTSAAREVNLTATQELDLMTKWLGRESGEMVRRIR FT SVHVSNPNLALYKAWERLRECYAAPEIIERSLFQRLDSFPRITAKDHIKLR FT ELGDLLMEIQGAKEDGYLTSLLYLDTSRGIAPIVDKLPYGLQDKWVTSGSW FT YKESHGRFPPFDYFCSFVCHEAKKRNDPSFMYQSSAMTTVKSDKFTGKSFH FT TNKPISVHKTDVCAVKEDPNKICPLHNKPHPLKKCRTFRNKLLDDRKAFLK FT EKGICFKCCSSISHHAKECKSFVKCFECNSTNHDTAMHPGPSPQVVKSLPP FT SQEDGGEGEDNSDMTSVNTSCTEVCGPGQWGRSCSKICLVKLYPKGNKDMA FT IKAYVILDDQSNRSLARPEFFELFNLESEQFTYNLRTCSGTVETSGRKAEG FT FQIESLDGRVLISLPPLIECHEIMNNRFEIPTPNAVLHQPHLRHIAKYIPD FT LEPEAEILLLLGRDVIRAHKVRQQVNGPNNAPFAQRLDLGWVVIGEVCLGN FT VHKPTVNTFKTSVLETGRHSIFQPCTSFMQVKEKQQSSVFKKATEMTLGQT FT VFCKTEHDNKPAPSVEDTTFLKIMDTNVFRNESNSWVAPLPFKEPRQRLPN FT NKEQAINRFSSLQRTLKRKPEMQQQYVEFMEKIFTNGHAEVAPPLKEGDEC FT WYLPTFGVYHPQKPNQIRVVFDSSACYSGVSLNDVLLTGPDLNNSLVGVLL FT RFRKERVAILADIQQMFHCFLVRDDHRNFLRFLWYKDNDVNKEIIEYRMNV FT HVFGNRPSPAVAIYGLRRAIREGAKDHGEDTIKFVERHFYVDDGVVSVPSE FT AEAIDLLQRTQASLAESNLRLHKFASNSQIVMKAFPPEDCAPVIKDLDLSG FT ETSPTQRSLGLLWEISTDTFTYSTSTTTKPFTRRGVLSTVNSVFDPLGLLA FT PVTIQGRALLRELTSESADWDTSLPEVKLNKWVAWRSSLQELEKLHVSRTY FT TKTSLTEAVHTELCVFSDASTKAIGAVAYLKVLQKDGQVEVGFVMGKAKLA FT PLSEPTIPRLELCAAVLAVEMADLIKDELDLELNDIRFFTDSKVVLGYIYN FT ESKRFYVYVHNRVQRIRQSSRPEQWHYVRTEENPADHASRSLPASCLAQTS FT WFTGPSFLRQPPAEKTKSSKMFELIEPENDAEIRPQIQTCATYLEEAGLIS FT TRFQRFSTFTSLVRGVALLVHVAKSFKHSNQNSNCKGWHRCDLPRTPDELD FT QARNVVLKATQKAAFAKELSALLADQPVSKNSRLRKLSPIMEGNFICVGGR FT LKHSEISTAEKNPIILPKESHTSLLLIRHHHEQVKHQGRHLTEGAIRAAGL FT WILGAKSLINSVLHKCITCRKLRGKFEEQHMADLPSERLKICPPFTYVGLD FT VFGPWSVTTRRTRGGQAESKRWAIMFGCMSSRAVHIEVIESMDASSCINAL FT RRFFALRGPAKQLRSDCGTNFVGACKELGMDKMVQRYLSEQGCSWEFNPPH FT SSHMGGSWERLIGIARRILDSMFLQLKARLTHEVLCTLLAEVTAIINARPL FT LPVSADPQQPFILSPSVLLTQKTGVPPPPGDFSDKDLYTKQWRQVQALANQ FT FWTRWSREYLPCLQHRPKWTVPRRNLQVGDLVLLRDKQIARNCWPMARVSA FT VFPGKDGYVRKVEVSTTDQGNIKTFLRPISEVVLLLPKD" XX SQ Sequence 5833 BP; 1738 A; 1352 C; 1310 G; 1433 T; 0 other; gtgaaaaggc agctttgatg tgaaagtggt caagatgtcc gcaagcagca agtcgtctgt 60 gcgcagtgct agaagtggca agtcatctgc ttcatcagag agtgtaaccc gtgctcgggc 120 cagagccgaa gctgctaaag tcagagcatc atacgctagc caagaggcta aactgaagat 180 agaaaaggcc acaagagaag cacaaaatca gctagaaaca gtaaaattag acactgaact 240 ggaagtgtta acactacaac gagaagctga tgtagctgct gttgaggcac aagtgctaga 300 agatgctgaa atagaaatgc atgctgcagc tgaaaaagga gaatccgaag agaaagtaaa 360 aattgaacgc acaagtgaat atgtaaattc tcaattaaac ctttcgagac attcaccctc 420 tgcacgcttt tcagccttgc ctaatgctcc agcatctcat gctgactcac acaacagctt 480 cataacatgg agtcctccat taaaagaccc ttcacataca cagtctatta atgacaagcc 540 aaagtatcag gatgctgatt ttgtacactt acctgctgca aacctgtcta gtccatctac 600 gggtattata aaatctgaag ttgacaggac aaaccccttt atgaacatac atgcgaagcc 660 atatattcct cagtgcattc cttcaactag cacgtcaccc caggcagagc ctttggcgca 720 gtatttagca cgacgtgatc ttgtttgttc aggactgtac caatttgatg ataagcctga 780 gaattatcga tcatggtgtt cctcattcac aagtgcagct cgtgaagtta acctcacagc 840 aacccaggaa ctggacctca tgactaaatg gcttgggagg gaatctggtg aaatggtgag 900 gcgcatccgt tcagtgcatg tcagcaatcc caatcttgca ttatataagg catgggaaag 960 actacgtgag tgctatgctg cacccgaaat tattgaaagg tcactatttc agcgtttgga 1020 cagttttcct aggatcacag ctaaagacca catcaaatta cgtgaactcg gggatttgct 1080 tatggaaatc caaggcgcta aagaagatgg ttatctcaca agcttattgt accttgacac 1140 ctcacgtggg attgcaccaa ttgtagataa acttccgtat gggcttcagg acaaatgggt 1200 gacttcgggg tcatggtata aagagagtca tggccgtttt cctccatttg actatttttg 1260 cagctttgtg tgtcatgagg cgaagaagcg aaatgaccct agctttatgt atcaaagcag 1320 tgctatgaca actgttaaat cagacaaatt cactgggaaa agttttcaca ccaacaaacc 1380 catctctgtt cacaaaacag atgtgtgtgc agtcaaggaa gaccctaaca aaatctgtcc 1440 attacacaat aaaccacacc cattgaaaaa atgcagaaca ttcagaaaca aactccttga 1500 tgacagaaaa gccttcctta aggagaaagg aatatgtttt aaatgctgtt cctcaatttc 1560 tcatcacgcc aaggaatgta agtcttttgt gaaatgtttt gaatgcaaca gcactaatca 1620 tgatacagcc atgcaccctg gcccgtcgcc tcaagtagtc aaatctcttc caccatcaca 1680 agaggacggc ggggagggag aggataattc tgatatgacg tctgtcaata caagctgcac 1740 ggaggtttgt ggtcctggtc agtggggtcg ctcatgttca aaaatctgcc ttgtaaagct 1800 gtacccaaag ggtaacaagg acatggccat aaaagcctat gtaattctgg atgaccagag 1860 caaccgttca ttagccagac cagaattctt tgagctgttc aatttggaga gtgagcagtt 1920 cacatataat cttagaactt gctctggaac tgtagaaaca tctggcagga aagcagaagg 1980 attccagatt gagtctctgg atggcagagt tctcatatct cttccaccac tcattgagtg 2040 tcatgaaatc atgaacaatc ggtttgaaat tccaactcca aatgcagttc ttcaccagcc 2100 tcatctacga cacattgcca agtacattcc agacctagaa ccagaagcag aaatactttt 2160 gctgctagga agagatgtga ttagagcaca caaggtcagg cagcaggtca atggaccaaa 2220 caacgccccc tttgctcaac gtctggacct gggatgggtg gtaataggag aggtgtgcct 2280 gggtaatgta cacaagccca cagttaacac atttaaaacc agtgtcctgg aaactggtcg 2340 tcactcaatt tttcaaccct gtacaagctt tatgcaggtc aaagagaaac aacagagcag 2400 tgtgtttaaa aaggcaactg agatgacact tggacagaca gtgttttgca aaaccgagca 2460 tgacaataaa cctgcccctt ctgtggagga cacaaccttc ttgaaaatta tggacacaaa 2520 tgtcttcaga aatgaatcta acagctgggt agctcctcta ccattcaaag aaccacgcca 2580 acgcttgcca aacaacaagg agcaagctat taatcgattc tcaagcctgc aacgaaccct 2640 caagagaaaa cctgagatgc agcaacagta tgtggaattc atggaaaaga tctttaccaa 2700 cggacatgct gaggtagcac cacctctaaa agaaggggat gagtgctggt atcttcccac 2760 atttggggtc taccaccccc aaaaacccaa tcagatcaga gtggttttcg attccagtgc 2820 ttgttactct ggcgtctctc tcaatgatgt gctcctcact ggccctgacc tgaataattc 2880 ccttgttggt gtcctgctac gttttcggaa agagagggtt gcaatcctag ctgacattca 2940 gcaaatgttt cattgttttt tggtgcgtga tgaccaccgc aactttctcc gtttcctatg 3000 gtacaaggac aacgatgtca acaaggaaat tattgagtat cgaatgaatg tccacgtctt 3060 cggcaatcgt ccatcacctg ccgtggcaat ttatggacta cggagagcca tcagagaagg 3120 tgcaaaagat catggtgaag ataccattaa gtttgtggaa agacatttct acgtcgatga 3180 tggtgtggta tctgtaccat ctgaagcaga ggctatcgat ttgctccaac gaacacaggc 3240 ttcattagct gagtcaaacc tacgtttgca caagtttgct tcaaactctc agattgttat 3300 gaaagctttt cctcctgaag actgtgctcc agtgattaag gacttagatc tgagtggaga 3360 aacgtctccc acacaacgga gtttaggtct gttatgggag atctcaactg acacattcac 3420 ctactccaca tccaccacta ccaaaccatt cacccgtcgt ggagttctct ccactgtcaa 3480 cagtgttttt gaccctctgg gtctactggc acctgtcacg atccagggaa gagccctcct 3540 tagagaactt acctctgaaa gtgcagactg ggacacatcc cttccggagg tcaaactaaa 3600 caaatgggtg gcctggagaa gttctcttca agagttagaa aagcttcacg tctcacgaac 3660 atacactaaa acctcactaa ctgaagcagt gcatacagaa ttgtgtgtgt tctctgatgc 3720 atcaaccaag gccataggtg ctgtggcata cctgaaagtg cttcagaaag atgggcaagt 3780 tgaagtagga tttgttatgg gcaaggcgaa actagcaccc ctgtccgaac ctacaattcc 3840 caggcttgag ctgtgtgcgg ctgtcttagc agttgagatg gcagatctca ttaaggatga 3900 gctagatctg gaattaaatg acataaggtt ctttacagac agcaaagtgg tccttggcta 3960 catttataat gagtcaaagc gattctatgt gtatgttcac aatagagtcc agcgcatccg 4020 acagtcctca agacctgagc aatggcacta tgtacgtacc gaggaaaatc cagcagatca 4080 tgcgtcgcga tccttacctg catcatgcct ggcgcagact tcctggttca ccggtccctc 4140 cttccttcga cagccacctg cagaaaaaac aaaatcaagt aaaatgtttg agctaattga 4200 gcctgagaat gatgcagaaa ttcgtcctca aatccaaaca tgtgctactt acctggagga 4260 agctggactt atatctacca gatttcaaag gttctccacc ttcacctctt tagtaagagg 4320 agtagcgctt ctggttcatg tcgcaaaatc cttcaaacac tcaaaccaaa acagcaactg 4380 caagggttgg cacagatgcg acttaccacg tactccagat gagctggacc aagcaaggaa 4440 tgttgtgctt aaagcaacac aaaaggctgc atttgcaaaa gagctgtcag cactgctagc 4500 tgaccaacct gtatccaaga acagccgctt gcgtaaactc agtccaatca tggagggcaa 4560 tttcatttgt gttggaggaa ggttaaagca ctctgagata tcaactgccg aaaagaaccc 4620 gataattctc cctaaagaaa gccacacatc tttgctgctc attcgacatc atcatgagca 4680 agtaaaacat caaggccgtc atctgacaga gggagcaatc agggcagcag gactgtggat 4740 cttaggagcc aagtcgctaa taaactcagt acttcacaaa tgtataacct gtcgcaagct 4800 gcgtggaaag tttgaagagc agcacatggc ggacctgcca tctgaacgtc tcaaaatctg 4860 ccctcccttt acatacgtgg ggcttgatgt atttggaccc tggtctgtca ctaccagacg 4920 caccagagga ggacaggcag agagcaaaag gtgggcaatc atgtttggct gtatgagctc 4980 gagagctgtg cacatcgagg tcattgaatc tatggatgca tcaagttgca taaatgctct 5040 caggcggttc tttgcattaa gaggtcctgc aaaacagctc cgctccgatt gcggtactaa 5100 tttcgttggg gcatgcaagg aacttggaat ggacaagatg gtacagaggt acctcagcga 5160 gcaaggatgc agctgggagt tcaatccacc tcacagctcg cacatgggag gctcatggga 5220 gcgcctgatt ggcattgcaa gaagaatttt agattcaatg tttctgcagc taaaagctcg 5280 cttaacccat gaagttcttt gcacactact agcagaagtt actgctatta ttaatgcacg 5340 gccactctta cctgtgtctg cagacccgca acaaccattc atactttcac cttcagtgct 5400 ccttacacag aagacaggag ttccgcctcc ccctggagat ttctctgaca aggacctgta 5460 tacaaagcaa tggagacaag ttcaggcgct tgcaaaccaa ttctggacac gctggagccg 5520 tgaatactta ccttgcttgc aacacagacc gaagtggaca gtacctcgca gaaaccttca 5580 agtgggagat ctagttctgc tcagggataa gcagatagcc cgcaactgtt ggcccatggc 5640 cagagtctct gctgtattcc caggaaagga tggatatgtg agaaaggttg aggtttcaac 5700 aactgaccaa gggaatataa aaacctttct aagaccaatt tcagaggttg ttctgcttct 5760 acccaaagac tgatctggaa attagtgtat ggcccagaag ttcatagtga ccttacatag 5820 gtcaggcggg gag 5833 // ID Gypsy156-LTR_DR repbase; DNA; ZEB; 1388 BP. XX AC chr5; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy156-I_DR; KW Gypsy156-LTR_DR; Gypsy156_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1388 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1574-1574 (2008). XX DR Genome; chr5; Positions 18354666 18353279. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1388 BP; 402 A; 208 C; 251 G; 527 T; 0 other; tgtaacccac tcaaattttt gagtattaaa tattgtttta ttttgacatg taattaagag 60 tttcttaatt aattttttta tttgtcagaa tatatgtatt tatgttttat tttcttacct 120 tacctaatgc acaacaattg ggcatttaaa aaaaatttat agatgtataa atttacattc 180 agaagttaat ttctaaaaac cggaattgag gtcaagggtg actgacgtca cttgtacgcc 240 agcttgtggt gaagacagga gagacacgtt ttggctgtct tctaatttaa tccatcctgc 300 caattctgca taaaattaat gtcccaaatt gttttgctgc attaggtaag ttatatacat 360 tcggatttcc tgcatattta ttttactttt cgatttgttt agctatattg tatagatgat 420 gcatactgtt catggatcgc tgtaaatgta caagtttacc gtgttcaatg tgttttactc 480 tgttttatat gtatatattt tgaacatatt gtgaactatt ttgtgaaaat gctcttagtt 540 gaagttagca atgttatgtg aagttgtgaa gttatatgaa gagggtgact gacgtcactt 600 gtacgccagc ttgtggtgaa gacaggagag acacgttttg gctgtcttct aatttaatcc 660 atcctgccaa ttctgcataa aattaatgtc ccaaattgtt ttgctgcatt agctaactga 720 cactggactg ttccatcgcg ccgcggccgc attggactcg tgcaggcaag gggagtggct 780 cgttacattc actccgtgag gatttctgcc aatttggatg gacgcatgtc atcactcaat 840 cgtcgaggat tttcgtagga aacacataca ctgattcaca gtcattgaac tacgttttac 900 attacattca ggactttgaa gtttcatata tataaatata tgttgaaatt gggacgttta 960 gacaactgaa actgtatttt cctacttctg tatagtttta tttttttttc aaaaagggct 1020 gaattgctgt caacatattg tattttattt atttaatttt gtttgttttg aatacttgag 1080 tatgttttta taagggctga aaagcagttc agatacagac cagaaaattt gcaatacagc 1140 aaactcaaaa gtattttctt gtgtttgggt ttttttttta tttttatttg agcagttgga 1200 gattatagta ttaataatat gtaaaatttg aacacaccat tcataaatat attgtgtcgt 1260 taagttgaga tatcccgaac ccgagatttc aagtaaattg tgttgaaatt tcatatatac 1320 tttagaaagc tcagacccag tagataattt tccccttttc agtaaatcaa tttcataggt 1380 gggttaca 1388 // ID Harbinger-N13_DR repbase; DNA; ZEB; 324 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Harbinger-N13_DR is a non-autonomous DNA transposon that DE putatively belongs to the Harbinger superfamily. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; Harbinger superfamily; KW Harbinger-N13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-324 RA Bao W. and Jurka J.; RT "Harbinger DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1291-1291 (2008). XX DR [1] (Consensus) XX CC The TSD is 3-bp, TWA specific. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 324 BP; 69 A; 95 C; 84 G; 76 T; 0 other; cgccctattc acacggggcg tcagcgtcaa cgcttcccat tcactttgaa tgggtgacgt 60 caggcgttgc cgaactgcat tgtggatccg tcggcgccgc ttcagaagcg ttcctcgctg 120 cagaagttgg gaccagctca acttttcaag cgccgacgga agcgtcagcc aatcagatcg 180 ctgtatgcaa atacaccagc tcagacagtg gcctattgct gactgaattt cattggctga 240 cgcttctatg acgatcgttt cagctccaac ttcagacacg ccctctgtca agcgttgacg 300 ctgaagcccc gtgtgaatag ggcg 324 // ID Gypsy106-I_DR repbase; DNA; ZEB; 6431 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy106-I_DR; KW Gypsy106-LTR_DR; Gypsy106_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6431 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1515-1515 (2008). XX DR Genome; chr20; Positions 5973247 5979677. XX CC Positions [4721-5194] - Integrase core CC 'TCTG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 141..1691 FT /product="Gypsy106-I_DR_2p" FT /translation="MATVTPRRMLIFQIHKKLPVLSTSQLQLVARAIEDGS FT ESTEGLSEPELYDFIIYYTRSERLKTMEDEGMAQLLSLQELLQELTSIDSG FT SGEANVPAVPIDNALTHTHRQVSASAEDIHTYAMDKHKHAQPSHTDFHAPP FT SLSHDSPALTSPIDRTIYTPGHLGDTGTRDTQQGRASLSSSVTDQVVRLSD FT VTALLPRRECKFHGGQISDTGSDISFSGLCKQIDTALQEGFSEVEIVRAVL FT KITKSGTFREMLTNHDGLTVNGLKRLLRAHVRDKGVTELFHELSNAKQHDK FT ESPHQFLYRLIGLKQRVLFESQQPGVEFNYDKRLVQGTFLHSLYQGLNDKN FT SYVRQDIKPFLADTQVSDDSLLEQLTKSTMEEEARLKRLGAVAKSRSVTVS FT VAHSDITVPTDVTKQNPIDAELQANREAIRELTAQVSSLTQHMKLVNQPVD FT RPPTGSSSTPVKHPPSQTSEKKGRCPDCIMKEIINCPHCFACGQAGHRAIG FT CLQRRMSVNGKRSLEKGSQRP" FT CDS 1628..6214 FT /product="Gypsy106-I_DR_1p" FT /translation="MFAETDVGKREEVAGEGQPATVNSSKPTDIATMSATP FT CHSDPKKRVAQLIGNRCTVSCNINGVPIEMLLDSGAQVTMVSKTWVDKASP FT HVKIQPLHSLLSSQSLEISAANGTEIPLEGWAEVDLQISSRHHGSITISVP FT LLISQNCHCPLLGSNVIAEIIKEHGNTVDVTAILTDSLSIPKSSIGTLVSL FT LQIPTSVEPTPCVVRSSKKGVTIPAGKIYEVRCRVKGAPGNCTMLFQPLTV FT STCPVGLDLFPSLVDVPLGSSKCVRIPIQNSTQHTIYLTQRTVLGTLEEIA FT DLKPVLNSKHKNIESIQTCGAQLCTNSQSYHNKELDREAVSKVRDKWHPPV FT DVSHLAEEEQRIVRQMLYEESMAFAKDEADIGCIPELQLRINLRDDTPIQK FT SYNAIPKPLYREVKDYVQNLLNRGWIRKSTSAYSSPVVCVRKKDQSLRLCV FT DFRGLNNKTIPDRHPLPRIQDLLDNLGGNSWFSILDQGSAYHQGFVEESCR FT HMTAFSTPWGFYEWVRLPFGLTNAPAAFQRCMEGVLVNLRDECCFPYLDDV FT LCFSKTFQDHVEDLRKVLCRLQECGVKLRPTKCELFKRQVRYLGRLVTNEG FT VQVDPKDLEAVLKLKEKKPKTVGEVRALLGFLGYYRSFIQDFSRIAQPLFK FT LQEHPEGSRQRPSLVKSTKRGKGGDHLPSKTPICWSSEHSAVVSRLVDMLV FT SPPILAYPNFDLPFVLHTDASNEGLGAVLYQEQSGKLRVIAYGSRTLTPAE FT KNYHLHSSKLEFLALKWAVCDKFRDYLYYAPTFTVYTDNNPLTYVLSTAKL FT SAVGHRWVGELADFHFTIKYRPGKTNIDADTLSRFPVRLTDHLREYTEIVH FT PEVISTIWQGNKAEREEQVPQAVALQIRVEEDLSAACIPVFTPEDIRDAQN FT RDDTICEVVKLKKNGWTPKDKGQSLVSVGAQRLIREWNRLKLREGILYRES FT GPNQQLVLPTSLTSMVLKHLHDDMGHVGADKVIHLARQRFYWPYMQREIED FT YVIRQCPCIKQKRPTVPEKAPMGSIVTTAPFELLSVDYLHLEPSKGGYEYI FT LVLVDHFTRFAQAYPTRNKSGKTAAERIFSDFIPRFGYPEKLHHDQGREFE FT NTLFQRLQQLAGISHSRTTPYHPQGNPAERLNRTLLQMLRTLQEEKKAEWK FT DHLPNLVHAYNCTRHEATGYSPFFLLFGRHPRLPIDLIFNLNSDQHVRTKQ FT QYAQTWALRMQEAYRIAAENSKRSSEKGKRYYDQHVRGAVLQPGDRVLVRN FT LSERGGPGKLRQYWEQKVHRVIERIKDSPVYRIQAETGDRTFRVLHRNLLL FT PVSDLLVSSEEEDKGVGKKPPRQKCHLRRRETAESDISISDDEENHSYHLR FT PLPVYERKTVRHHLPDPKHHCGLRAIAPEYQPLSQTSECNDATQIDLRDSL FT KTSEIRPTPTYATPPPLVESQEVHSDNVVEDGIFDPLATDPTTVLAPVREM FT EQQLRRSTRQVKPRNLLMYDQPGAPSYQPWRLGANAMCCGFYPMPILPVSP FT EVSYYPHPVAWVY" XX SQ Sequence 6431 BP; 1856 A; 1463 C; 1497 G; 1615 T; 0 other; attgggggct cgtccgggat tctgcatcgt cgagtgggga gagctgtcac gatacgggta 60 gaggttggcg atcgagcttc aaggaaggct gctgtgccca gcgtcatcca gagtgtgagc 120 gtcactggtg aaaaatcgcg atggctaccg taaccccacg acggatgttg attttccaga 180 ttcacaaaaa gcttcctgtt ctgagtacca gtcaactgca gttggttgcc agggctattg 240 aagatggcag cgagagtaca gaaggcctca gtgaacctga gctttatgat ttcatcatct 300 actacaccag aagtgaaagg ttgaaaacaa tggaagatga aggtatggct caacttctct 360 cacttcagga actgttacag gaactgacat caatcgattc tggatctggt gaagccaatg 420 ttcctgccgt tcccatagac aatgcactta cccatacaca cagacaagtt tctgcttcag 480 cagaggacat acacacatat gcaatggaca aacacaaaca tgcacaacca tcacatacag 540 actttcatgc accaccgtcc ctcagtcatg attcgcctgc cctgacatca cctatagaca 600 gaactattta cacaccaggc cacttaggtg acactggtac ccgtgacacc caacagggta 660 gagcaagtct atcttcatct gtcactgatc aggttgtgag actgtcggat gtcacagctt 720 tgcttcctcg tcgggaatgc aaatttcatg ggggtcaaat atctgatact gggtctgaca 780 tttcttttag tggtctctgt aagcagattg acactgctct acaagagggg tttagtgagg 840 ttgaaattgt tagagcagtg ctcaagataa ctaaatctgg tacgttcaga gaaatgctca 900 ctaatcatga cggtttaaca gtaaatgggt taaagagact tctccgtgca catgtgagag 960 ataaaggtgt aaccgagctc tttcatgagc tcagtaatgc taaacaacat gacaaagaga 1020 gtccacacca gtttctctat agacttatag ggctaaaaca acgagttctg tttgagtcac 1080 agcagcctgg tgtagagttt aactatgata agcgactggt tcaaggtact tttctgcact 1140 ccttgtatca aggtttaaat gacaagaaca gttatgttag gcaggacatt aaacctttcc 1200 ttgctgacac acaagttagt gatgactcac tgttggaaca gctgactaag tcaaccatgg 1260 aggaagaagc taggttaaag cgccttggtg ctgtggccaa gtctaggtca gtgactgtga 1320 gtgtagctca ttctgacata actgtaccga ctgacgtgac aaaacaaaac ccaattgatg 1380 ctgagttgca ggctaaccgt gaagccatca gagagctgac ggctcaagtg tcctcactga 1440 ctcaacatat gaaactggtg aatcagcctg ttgaccgtcc accgactgga agttctagta 1500 cacctgttaa gcacccgcca tcccaaacaa gtgagaaaaa gggtagatgc cctgactgca 1560 taatgaaaga gataattaac tgtccccatt gctttgcgtg tggtcaggca ggtcacagag 1620 caattggatg tttgcagaga cggatgtcgg taaacgggaa gaggtcgctg gagaagggca 1680 gccagcgacc gtgaacagtt ctaagcccac agacatagct accatgtcag ccacaccttg 1740 ccattcagac ccaaagaaac gggtggccca gctcattgga aatcgttgca ctgtttcctg 1800 caatatcaat ggtgtgccaa tagaaatgtt attagattct ggagcacagg tgacaatggt 1860 tagcaaaact tgggtagata aagcatcacc ccatgtcaaa atccagccac ttcattcact 1920 tcttagcagt cagtcacttg aaatttcagc agctaatggc actgaaatac cccttgaggg 1980 atgggcagaa gttgacctac agatttcaag cagacaccat gggtctataa caataagtgt 2040 acccctttta atcagtcaga actgccactg tcccttgtta ggtagcaacg taatcgctga 2100 aattattaaa gagcatggca acactgtgga tgtgaccgct atcctgacag attctttaag 2160 tataccaaag agctccatcg gaacactagt ctctctcctt cagataccaa cttctgttga 2220 accaactcct tgtgttgtac gcagcagtaa aaaaggtgta accatccctg ctggcaaaat 2280 ctatgaagtt cggtgtcgag ttaagggggc acccgggaac tgtaccatgc tgtttcaacc 2340 tcttactgtg agtacttgcc cggtgggttt agatctgttt ccatcactgg tagacgttcc 2400 cttggggtca tcaaaatgtg ttagaatccc cattcagaac tcaacccaac acactattta 2460 cctgactcaa agaacagtcc tgggtacgct tgaagagatt gctgatttga agcctgtact 2520 gaactccaag cacaagaata ttgagtccat tcagacttgt ggagcccagt tatgcactaa 2580 ctctcagagc tatcacaaca aagagcttga tagagaggca gtatcaaagg tcagagacaa 2640 atggcaccct ccagttgatg tatcccacct agcagaggag gagcaaagaa ttgtcaggca 2700 aatgttatat gaggagtcta tggcctttgc caaagatgag gcagatattg ggtgcatccc 2760 agagttgcaa ctgagaatca atcttaggga tgacactcca atacaaaagt catataatgc 2820 cataccaaaa cccttatatc gtgaagtcaa agattatgta cagaatttac ttaaccgtgg 2880 ctggattcga aagtcaacat cagcttactc atctccagtc gtgtgcgtaa gaaaaaagga 2940 ccagagttta cgtctctgtg ttgatttccg gggactgaac aataaaacta ttcctgaccg 3000 tcacccactg ccacgcattc aagacctgtt ggataactta ggaggcaact cctggttctc 3060 gatactagac caaggtagtg cataccatca aggttttgtt gaggaaagtt gccgtcacat 3120 gaccgctttt agcacacctt gggggttcta tgaatgggta cgtctcccct ttggccttac 3180 caatgcacct gcagcattcc agcgctgcat ggaaggcgta ctagttaatc tcagggatga 3240 atgttgtttc ccttatctgg atgacgtgct ctgtttctcc aaaacattcc aagaccatgt 3300 tgaggacctg agaaaagtgc tgtgtcgtct tcaggagtgt ggtgtgaaac tgaggccgac 3360 taaatgcgaa cttttcaaac gtcaagtgcg gtatttggga aggttggtga ctaatgaagg 3420 tgttcaggtg gatcctaagg atctggaggc agtgttgaaa ctaaaagaga agaagccaaa 3480 aacagtcggt gaggttcgag ctttgcttgg gttcttgggt tattatcgct cctttatcca 3540 agatttctcc cgtatagcac agcctttgtt taagctccaa gaacatcctg aagggtctcg 3600 ccaacgtcct agtcttgtaa aatctaccaa aaggggaaag ggtggtgacc acttaccatc 3660 taagacacct atctgttggt cctctgagca tagtgctgtg gtgtctcgcc tggtagacat 3720 gttggttagt cctcccatct tggcctaccc caactttgac ttaccctttg ttctacacac 3780 agatgcatca aatgaagggt tgggggctgt gttataccag gaacagagtg gtaaactacg 3840 tgttatcgca tacggatcaa ggacactcac gcctgctgaa aagaattacc acctccattc 3900 tagtaagctg gagttccttg ccttgaagtg ggccgtatgt gataagtttc gggattatct 3960 gtactacgcc cccactttca ctgtatatac agataataac cccctgactt atgtcctttc 4020 caccgccaag cttagtgctg taggacatcg ctgggttggt gaattagcag atttccattt 4080 caccattaaa tacaggccag gcaagactaa cattgatgca gacactctgt cccgatttcc 4140 agtaagactc accgaccacc tgagagagta cactgaaata gtacatcctg aggtgatatc 4200 taccatctgg cagggaaata aggctgagag ggaagagcag gtaccacagg ctgtggctct 4260 tcagatacgg gtagaggagg atctctctgc tgcatgtata cctgtattta cccctgagga 4320 tatcagagat gctcagaata gggatgacac catttgtgaa gttgtaaaat tgaaaaaaaa 4380 tggatggact cccaaggaca agggtcaaag tttggtgtct gttggagcac agcggctaat 4440 aagggaatgg aaccgactaa agctgagaga ggggatcttg tacagagagt ccgggcccaa 4500 tcaacaactg gtccttccaa catcacttac atcgatggtg ctgaaacatc tacatgatga 4560 catgggacat gttggtgctg ataaagttat tcatctcgcc agacaaagat tttattggcc 4620 ttatatgcag cgggaaattg aggactatgt tatacgccag tgtccttgca ttaagcaaaa 4680 acgacctacg gtccctgaaa aagcaccaat gggttccatt gtcactaccg caccttttga 4740 gttgctctct gtcgattact tacacctcga gcctagtaag ggtggctatg agtacatact 4800 tgtgctcgta gaccacttta ctcggttcgc acaagcttac cccactcgaa ataagtctgg 4860 gaagacagcc gctgaacgga ttttctctga tttcattcct cgatttggat atccagaaaa 4920 actccatcat gatcagggtc gggagtttga gaataccttg ttccaaagat tacagcaatt 4980 ggcaggcatc tctcactccc gtaccactcc ctaccaccca caggggaacc ctgctgaacg 5040 cctgaacagg acactgctcc agatgttaag aacgcttcaa gaggagaaaa aggcagagtg 5100 gaaagaccat cttccgaatc ttgtgcatgc ctacaactgc acaagacacg aggcaacagg 5160 gtactcgcca tttttcctcc tctttgggag acatcctcga ttgcccattg atctaatctt 5220 caaccttaac tcagaccagc atgtgcggac aaagcaacag tatgcacaga cctgggcttt 5280 acgaatgcag gaagcatatc gaatagccgc agagaacagt aagagatctt ctgaaaaagg 5340 gaaaaggtat tatgatcaac atgtccgagg agctgtacta caacctggag atcgtgtgtt 5400 ggtcagaaac ctgtctgaga gagggggtcc tggcaaactc aggcagtact gggaacaaaa 5460 ggtccacaga gtcatcgaga ggataaagga tagtcctgtg tatcggattc aggctgaaac 5520 aggtgacaga acttttcgag tgttgcaccg caacctgctt ctcccagtta gtgacttact 5580 cgtcagttct gaggaagaag ataagggtgt tgggaagaag ccacccagac aaaaatgtca 5640 ccttcgacgt agagaaacag ctgaatcaga tatcagtatt tccgatgatg aggaaaacca 5700 ttcctaccac cttagaccct tgcctgtgta cgaaagaaaa actgtcagac accatctacc 5760 agacccaaaa catcattgtg gattaagggc aatagctccc gaatatcagc cattaagtca 5820 aacttcagag tgtaacgatg ccacacaaat tgatctgagg gattcactca aaacatcaga 5880 gataagaccc acaccaacct atgctacacc accacccttg gtggagagtc aagaggtgca 5940 tagtgataat gtggtggagg atggaatctt tgacccactg gcaacagatc ccacaactgt 6000 tctagcacca gtgagggaaa tggaacaaca attaaggaga tccacacggc aagtcaagcc 6060 ccgaaatctc ttgatgtatg accagccagg agcaccttct taccagcctt ggagattagg 6120 agcaaatgct atgtgctgcg gcttttaccc aatgcctatc ctaccagtat ctcctgaagt 6180 gagctattac ccccacccag tggcatgggt ttattgaaag attcctgtgg atgagactaa 6240 gtcagatatt gacattgtag actgacttta tatgctgatt atgactttat gcagattgac 6300 acgagttaag agtgtttact gacttgataa acagttcaga ctacagactg attctcattt 6360 aatttacttg acaatattat agttaatgtg gttttggatg ccaggagaca tcccattaaa 6420 gccggggaga g 6431 // ID Gypsy36-I_DR repbase; DNA; ZEB; 5367 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy36_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy36-I_DR; KW Gypsy36-LTR_DR; Gypsy36_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5367 RA Dib M.R. and Naveira H.F.; RT "Gypsy36_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 803-803 (2007). XX DR [1] (Consensus) XX CC Gypsy36-I_DR is an internal portion of the Gypsy36_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy36-LTR_DR. CC Gypsy36_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1555-aa polyprotein CC Gypsy36_DR1p (pos. 419-5083) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 419..5083 FT /product="Gypsy36-I_DR_1p" FT /note="ORF." FT /translation="MISLEDFVSSPSEEKLNLCSKLQLWDIVDNYKLTGID FT KRLRKNELRALIKSALVDCGVLIPVSETADEGMGKFDFSESNLSFEQRKEL FT LELRQAHEKEMYKQATERERELKEKEVELARLKAEELVKQREIEYEKLKHD FT QKLELDRQARDYQLQMERLKIMADGRPLADGVGERNRAGDLVSNLKLLPKF FT NEKDPEVFFSLFESVADERGWPSADRTIMLQSVLVGRAQEAYTALSVEDRR FT NYEKVKSAVLKAFELVPEAYRIRFRTWRKNDRQTHVEVLRELATHFDRWCS FT TSNVRTFEGLRELILTEQLKNIIPENIATYINENKPATASEAAVLADDFVV FT THKRWVSDYRMNRGRNYPIDEPRSNIRKPEFKAEKMLHVPSRPIPDSVCRY FT CFEEGHWKKECPVLKNKHMRKGSKSQNSSSVLLADSVYPELRFEGDSFFVE FT SCKRPELNPSYIPFVTEGFVSLPGGSENVPVKILRDTGASESFILNSVLPF FT STESGLGKNVLVQGITLTCTAVPLHRVVLRSELVNGEVTMGVRPSLPVGGV FT DVILGNNLAGERVWPDASPLPIVITPPVSDDFVAGDSHIFPSCAVTRSKLK FT LDESVQGDFNATPLKKCNVPGLSLLPTFSSAELIAAQQEDVTLSPLFDAIL FT SPEEAHSAAAGYFVQDGMLLRKWMSSSEICGGELVVQIVVPTKMRTAVLEV FT AHGPVGGHLGVKKTYNRILRHFYWPLLKRDVSKFIKTCHVCQMTGKPNQSL FT KPAPLCPIPVMGQPFEHLLIDCVGPLPPSRSGSKFLLTVMCQNTRYPAAFP FT LRKITTKAVVKALSQFIAIFGIPRIIQSDQGTNFSSKMFAEILKQLRVRHQ FT QSSAYHPQSQGALERFHQSLKSLLRAYCTELNRDWEEGLPWLLLAAREVIQ FT SSLGFSPNDLVFAHKIRGPLALLKDCLKGEEPPRSLLDYVNGFRRRLVLAG FT QTARENLEKKQEKMKQLFDRNSEVREFSPGDQVLALLPLPASPFRAKFSGP FT YIVTRRISELNYFISTPDRKKSEQLCHVNLLKPYYSRDQTDCKTSAVLTVS FT QTDEIVPEMEEDEAPTDCVLQPRLSNTETLLNLENLMRHLTISQKGELIKL FT IHEFPELFGDVPSRTHLIEHDVDVGDATPVRQRFYRAPRVKQQALEAEVQY FT LLDNNLAKPSCSSWASPCLLVKKSDNSYRFCTDYRKINKITKPDAFPLPRV FT EDCVDQVGAAKFVSKFDLLKGYYQIPLTPRAQEITAFVTSSGLYSYSVMSF FT GLRNAPATFQRLMNRVVSGLSGCAVYLDDVVVCSDTWEQHLERVRALFLRL FT LEAKLTVNLAKCEFAKATVFYLGREVGHGYVRPLKEKVRAIEGFPIPSTKK FT DLMRFLGMAGYYRGFCPNFSSVVAPLTNLLKASVPFVWSSGCQQAFNNVKL FT LLSSNPVLVAPQFDKPFMIQVDASQIGAGAVLLQTGLDGVDHPVCYFSRKF FT NSYQVNYSVIEKETLALIWALQYFEVYVGGGSQVVVFSDHNPLIFLHTLKS FT PSQRLMRWVLFLQPYNLLIKHIRGTDNVFADCLSRAHSE" XX SQ Sequence 5367 BP; 1444 A; 924 C; 1341 G; 1658 T; 0 other; tatgggggct cgtcttattg agaggcatgc ggaccggtaa ttgttaagga gaatcaggtg 60 catgttttgg cagagcagag cgttcgtttg gcttcggaca ggtaagttcg gctcataata 120 tgttttggag atccccggat aggttattgt tttttttatt ttttggtgtt ttcctgggtt 180 aagtgcacgg ccgtggttgc atttctgagc cgacagttca gctgtgcacc agcggtatta 240 gctgcttaaa gaggcttgac tcttgggaaa acacagaaga gttaggtaag tttagttagt 300 gggaaatgtg ggttagttag ggggatattt gattatgagc cgattgctta tgtccgaagt 360 ttagatgtaa gtcgtaagta tttattagta ctgcatttgg gtaagtttat tctttgggat 420 gatttcgttg gaagattttg ttagttctcc atcagaggag aagttaaatt tgtgttcgaa 480 gctacagctg tgggacattg ttgataacta taaattaacg ggaattgata agcggttgcg 540 aaaaaatgaa ttgagagctt tgattaagag cgcattagtt gattgtggcg tgttaattcc 600 ggtatctgag actgctgatg agggaatggg taagtttgat ttttcggaga gtaatttgtc 660 gtttgaacag cgaaaagaat tattagagtt gagacaggct catgaaaaag aaatgtacaa 720 gcaggcaacg gagcgcgagc gagaactaaa agaaaaagag gttgaattgg cgagattaaa 780 ggcagaggag ttggtaaaac agcgtgaaat cgaatatgag aaacttaaac atgatcaaaa 840 attagaatta gatcgacaag ctagagatta tcagttacaa atggaacgat taaaaatcat 900 ggctgatggg agacccttag cagacggcgt cggggaacga aatcgagcag gtgacttggt 960 aagcaattta aagttgctac caaaatttaa tgaaaaagat cctgaagttt tcttttcttt 1020 attcgaaagt gtagctgatg aacgaggttg gccaagcgct gaccgtacaa ttatgttgca 1080 gtcagtactt gtgggaagag ctcaagaggc ttatactgcg ctttcagtgg aagatcgtag 1140 aaattatgaa aaggtaaaat ctgcagttct taaagcgttt gaattagttc ctgaggcgta 1200 tcgcatacgt tttcgtactt ggagaaaaaa tgacagacaa actcatgttg aagtccttcg 1260 tgaattggct acacattttg atcgctggtg ttccacctct aatgtgcgca cttttgaggg 1320 gttgcgggaa ttgatcctta ctgaacagct caagaatatt attccagaaa acattgccac 1380 ttatattaat gagaataagc cagcaactgc gagtgaggct gccgtgttag ccgatgattt 1440 tgtggtaacg cataaaagat gggtgagtga ttatcgaatg aataggggta ggaattatcc 1500 tattgatgaa ccgagatcaa atattcgaaa acctgagttt aaagcagaga aaatgttaca 1560 cgtaccatca agacccattc ctgattctgt gtgcaggtat tgttttgagg aggggcattg 1620 gaaaaaagaa tgccctgtcc taaaaaataa gcatatgcgt aaaggttcaa agtcgcaaaa 1680 tagctcatct gttttacttg ctgattctgt ttaccctgaa cttcgttttg aaggggattc 1740 attttttgtt gaatcatgta aaaggcctga gttaaatccg agttatattc cgtttgttac 1800 agagggattc gtttcattgc cgggagggag tgaaaatgta cccgttaaga tcttgcgtga 1860 cacgggagct tctgaatctt ttattttaaa ttctgttttg cctttttcca cagagtctgg 1920 gttggggaag aatgtgctgg tgcaaggcat cactcttact tgtacggctg ttcctttaca 1980 cagagtggta ctgcgttcgg agcttgtgaa tggtgaggtc actatgggtg tgcgtccatc 2040 acttccagtg ggtggagtag acgtgatctt gggtaacaac ctagcgggag agcgcgtgtg 2100 gccagacgca tctcctttac ccattgtcat tacgccgcct gtatctgatg atttcgttgc 2160 gggggattcc catattttcc catcatgtgc cgtgactcgt tctaaattaa aattagatga 2220 atctgtgcag ggcgacttta atgcaactcc tctgaaaaag tgtaatgtgc ctggtctgtc 2280 tcttcttcct accttttcta gtgcagagct tatagctgcg cagcaggagg atgttacgtt 2340 gtctccactg ttcgatgcta ttttgtcccc tgaagaggcg cacagtgcag cagctgggta 2400 tttcgtacaa gacgggatgt tactgcgtaa atggatgtct tctagtgaga tttgtggagg 2460 tgagctagta gtacagattg tggtgcccac aaaaatgcgc acggcggttc tagaagtggc 2520 gcatggtccc gttggaggtc atctaggtgt taaaaagact tataatcgaa tactgcgaca 2580 tttttattgg cctttgttaa agagggatgt ctctaaattc attaaaacgt gccatgtatg 2640 tcaaatgaca ggaaagccca atcagtcact taaacctgct ccgctgtgtc ccataccagt 2700 gatggggcag ccttttgagc atttattaat tgactgtgtg ggacctttac caccttcaag 2760 atcgggaagc aagtttttat tgacggttat gtgtcaaaat acccggtacc ctgctgcgtt 2820 tccgttacga aagattacta cgaaagctgt agttaaagct ctctcccagt tcattgccat 2880 ttttggtata ccgcgaatca ttcaaagtga tcaaggaaca aatttttcct ctaaaatgtt 2940 tgctgagatc ttaaaacaac tgcgagtgcg tcatcagcag tcgagtgcgt accatcccca 3000 aagtcagggc gcgctggagc gtttccatca gtcattaaag agtttgctgc gtgcgtattg 3060 cactgaattg aatcgcgact gggaggaggg attaccgtgg cttcttttgg cagctagaga 3120 ggtaatccag tctagtttag gcttcagccc aaatgatttg gtatttgcac ataaaattcg 3180 gggcccgtta gcactgttaa aagattgttt aaagggggag gagccgccga gaagtctttt 3240 ggactacgtt aatgggttta ggaggagatt ggtccttgcg ggtcagacag cgcgagagaa 3300 tttagagaag aaacaggaaa aaatgaagca gttatttgat cgtaattctg aagtccgtga 3360 gttttctcct ggggatcagg tgcttgcgct gttgcctttg cctgcttcac cttttcgagc 3420 aaaattttca ggcccttata tagttactcg ccgcatatca gagttaaatt attttatttc 3480 aaccccagat cgaaagaaaa gtgagcagtt gtgccatgta aatttgttga aaccatatta 3540 ttcccgtgat cagactgatt gtaaaacatc tgctgtgtta acggtttctc agacagatga 3600 aattgtaccg gagatggagg aagatgaggc tccaactgat tgtgtgctgc aaccgaggtt 3660 atccaataca gagacgcttc ttaatttaga gaatttaatg cgccatctaa ctatttctca 3720 gaagggagaa ttaataaagc tgatacacga atttccagag ttgtttggtg atgttccatc 3780 tcgcacccat ttgatcgaac atgatgtgga tgttggtgat gcaacacctg ttcgtcagcg 3840 tttttaccga gcgccacggg taaagcaaca ggcgttagag gctgaagttc agtatttgct 3900 tgataataac ttggcaaaac catcttgttc tagttgggct tcaccttgtt tattggtaaa 3960 aaaatctgat aattcatacc gattttgtac agattatcga aagataaata agatcactaa 4020 acctgatgct tttccacttc ccagagtcga agattgcgtg gatcaagtag gggctgctaa 4080 attcgtaagt aaattcgacc tgttaaaggg ttattatcaa attccgttaa cacctcgagc 4140 gcaagagatc actgcgtttg tgacttcttc cggattatat tcctattctg taatgagctt 4200 cggattgcga aacgctccgg caacgtttca aagattaatg aatcgggttg tgtcaggact 4260 ttcgggttgt gcagtctatt tagacgacgt agttgtttgt agcgatacat gggaacaaca 4320 tctggagcgt gttcgtgctt tatttttgcg gttattggag gcgaaactca cagtaaattt 4380 ggcaaagtgc gagtttgcca aggctactgt gttctatttg ggaagggaag tcggacatgg 4440 ttacgtgcgt cccttgaaag agaaggtgcg ggcaattgag gggttcccca tcccatctac 4500 aaaaaaggat ttaatgcgtt ttttgggaat ggcgggctat tatcgagggt tttgcccaaa 4560 tttttcttct gtagtagccc cattaacaaa tctattaaaa gcgtctgttc cattcgtttg 4620 gtcctctggt tgccagcaag catttaacaa tgttaaactt ctgctttcct caaatccagt 4680 tcttgttgca ccgcagtttg acaagccgtt tatgattcag gtggatgcga gtcagattgg 4740 ggcaggtgca gttcttttgc aaactgggtt ggatggtgtg gaccacccag tttgttattt 4800 ctcaaggaaa tttaactcct atcaggtaaa ctactctgtt attgagaaag agacacttgc 4860 tttaatatgg gcactccaat attttgaagt ttatgtggga ggtggttctc aagttgttgt 4920 tttctcagat cataatccgt taatttttct acatacactg aaaagtccca gtcagagatt 4980 gatgcggtgg gttttgtttt tacaacccta caatttgcta attaaacaca taaggggtac 5040 ggacaatgtg tttgctgact gtttgtctcg agctcattct gagtgatgtg actgagtggg 5100 taagtttctt gtttggtggt ggtcttgttg tgtctcttct ttttcttaaa tgctttcagg 5160 atccggtttg ctggaggaga ggagatgggg caggagtata ccaaactgta gctttacttt 5220 acataccgtt tcatcctaag aatttaagca gtactaatgt ttaaagttta gagaaaacaa 5280 aatagggaaa aaatagtata aaaattgaag agaaattatt ggtatttgtg gcatttatat 5340 attttttctt ttttttaggg ggggaga 5367 // ID Gypsy42-I_DR repbase; DNA; ZEB; 4975 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy42_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy42_DR; KW Gypsy42-LTR_DR; Gypsy42-I_DR; endogenous retrovirus; gag; KW integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4975 RA Dib M.R. and Naveira H.F.; RT "Gypsy42_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 814-814 (2007). XX DR [1] (Consensus) XX CC Gypsy42-I_DR is an internal portion of the Gypsy42_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy42-LTR_DR. CC Gypsy42_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1295-aa polyprotein CC Gypsy42_DR1p (pos. 170-4054) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC three independent insertions bearing at least 85% Homology over CC at least 1000bp. XX FH Key Location/Qualifiers FT CDS 170..4054 FT /product="Gypsy42-I_Dr_1p" FT /note="ORF." FT /translation="MAEFDLEHFINAPTIEQLDKCRKDDLIRVAEHFKIKV FT SKQQLKREIKSVVVHNLGELGVLQLAAECQQSLHGDDVCTLHSGGEEASEA FT AAEEASKSKAGLPPFEPFSPTLVESGGDARFKLRLAKVQMEERDRVAQRHA FT EMELRLEIRRLEIEADKEVKLRELEIKAAKDAPVSPGQVPPVQATTFTTGS FT GFSSSTFDVSKYIPLVSQFRESKVDSYFNVFERVASALHWSKEIWPLLLQC FT KLSGKAQEVCAALSLEDSLNYDVVKAAVLRAYELVPEAYRQRFRNHKKNAN FT QTFVEFTREKCVLFDKWCTSSKVSNFQTLRELILLEEFKSCVPDRVVVYLN FT EQKVTSLSEASVLADEFALTHKNVFTPARVEKTLVTSISKSPTHFKNPTVK FT TGEERACFYCHKIGHLISECVLLKRKQQSAVPKSVAFVKTIDVGETPAVSD FT RLDADPGYKPFLLEGFVSVDKSTSPVKITILRDTGAMISFIAADVLPLSDE FT TFCGSHILVRGIEMGVTKVPLHEIYLQSDLFTGSVKVGVRESLPVPGVHFI FT LGNDLAGGKFMPVLEVCEKPVMSDHSDELKVDYPDVFPACAVTRAQARKFA FT DADDLTSTFIAPSFENDVLMTENKTDDRCDVQDTKRDLKLQVTREKIIAAQ FT KEDGSLGNIFSLVVPLEVANRRKIAFYIDNGLLMRKWCPDITNEWAVVNQI FT VIPLCYRQMVLSLAHDHDLSGRLGIKKTYQRILKHFFWPRLKSDVAKFCRT FT CKACQFSGKPNQVIPRAPLVPIPVIGEPFSHVIVDCVGPLPKTKAGNQYLL FT TIMCTATRFPEAIPLRKITAPVVVKALVKFFSTFGLPRVVQTDQGTNFLSK FT LFAQVLKTLNISHRTSSAYHPESQGALERFHQTLKSMLRKYCMDTEKDWDE FT GTPLVLFAIREAVQDSLGFSPAQLIFSPSVRGPLKVLEENILSPVTSAKTN FT VLDYVSKFRERLHQACSFAKESLVNAQITMKKKFDRASVLRNFNEGDQVLV FT LLPVVGSAFSACFAGPYTVIKKLSDTDYVIDTPDRRKKSRVCHINMLKAYH FT SRESIRTDQKENENDFSTTALVSEVLYSESNSSEDDGVIVRHDYQQCARLS FT NSEVLADLDSHLHLPDSQSNDIKELISSFPMIFSDVPSCTSVLQHDIDVGN FT ARPIKQNAYRVSAAKRAVMKAEVDYLLNHGLAKPSCSPWSSPCLLVTKSDG FT TARFCTDYRKLNAVTVPDCFPLPRMEDCVDNIGSAKFVSKLDLLKGFWQIP FT LTARASDLSAFVTPDNFTQYCVMAFGMRNAPATI" XX SQ Sequence 4975 BP; 1385 A; 932 C; 1153 G; 1505 T; 0 other; ttcaatggtg gcagcggtgg gatgttcctt tattggaata tccttaattt tggatgagca 60 ttgtgagatt tattgttgct ggtgaatagg agaaactttt gtgaacagga aggtttgatt 120 aaagagcagg tgtctgcgtt gttgtgagtc agataaacat atcacgagta tggctgaatt 180 tgatttggaa cattttatta acgcaccgac tattgaacaa ttagataaat gcagaaagga 240 tgatttaatt cgcgtcgcgg agcactttaa aattaaggtt tctaaacagc agttaaaacg 300 agaaattaag agcgttgtgg tgcacaattt gggtgaactg ggtgtgttgc agttggctgc 360 tgagtgtcag caaagtttac acggcgatga cgtctgcact cttcattctg gtggtgagga 420 ggcgagcgaa gctgctgcgg aggaggcttc taaatccaaa gcaggtttgc cacctttcga 480 acccttttct ccgactttgg ttgagtctgg aggtgacgcg cgttttaaat tgcgtttggc 540 aaaagtccaa atggaagagc gcgatagagt ggcacagcgt cacgctgaga tggagttgcg 600 ccttgaaatt cgtcgccttg agatcgaggc tgacaaagaa gtcaagctgc gtgaacttga 660 gataaaggct gcaaaagacg caccagtttc accggggcaa gtaccacctg tgcaagctac 720 cacttttacg actggatcag gattctcgag tagtacattt gacgtgagta aatatattcc 780 tcttgtttct caatttagag agtctaaggt cgactcatat ttcaatgttt ttgaacgagt 840 tgcgtctgcg ttgcactggt ctaaagaaat ctggcctttg ttgttacaat gcaaattatc 900 aggaaaagct caagaagtgt gtgctgctct gtccttagag gacagtttaa attatgacgt 960 tgttaaagct gcagttttaa gagcttatga gcttgtaccc gaagcatatc gtcagcgttt 1020 cagaaatcat aaaaaaaacg ctaatcagac attcgtagag ttcacaagag aaaaatgtgt 1080 gctatttgat aaatggtgca catctagtaa agtgtcgaat ttccaaacat tgcgtgaatt 1140 aatcttgttg gaagaattta aaagttgtgt gcccgatcgt gttgtcgttt atttaaacga 1200 acagaaagta acttcattat ctgaagcttc agttctcgct gatgaatttg cactgactca 1260 caagaatgtg ttcacccctg ctcgtgttga aaaaacactc gtaacttcca tttcgaaaag 1320 tccgacccat ttcaaaaatc cgacggtaaa aaccggtgaa gaaagagcct gtttctactg 1380 ccataaaata ggacatttaa taagtgaatg tgttttatta aagcgtaagc aacaaagtgc 1440 tgttcctaaa agtgttgctt ttgtaaagac aattgatgtt ggtgaaaccc cggcagtttc 1500 tgataggctg gatgcagatc caggttataa gccttttctt ctggaaggtt ttgtgtccgt 1560 ggataaatcc actagtccgg ttaaaataac catcctccgt gataccggcg cgatgatttc 1620 gtttattgct gcggacgttt taccactttc ggatgagaca ttttgtggaa gccatatatt 1680 agtgcgaggg attgagatgg gtgttacgaa agttccactt catgaaattt acctgcaatc 1740 tgatttgttt actggctctg tgaaagttgg tgtacgtgaa agtttgcctg tgcccggtgt 1800 acattttatt cttggcaatg atcttgctgg cggtaaattc atgccagttt tagaggtgtg 1860 tgagaaacct gtgatgtctg atcattcgga tgaattgaaa gtagattatc ctgatgtctt 1920 tcctgcatgc gccgttacgc gagcgcaagc ccgtaaattc gctgatgcgg atgatctaac 1980 ttcgactttt atagcaccta gctttgagaa cgatgtgttg atgactgaga acaaaactga 2040 tgatagatgc gatgttcaag ataccaagcg tgatctaaaa ttgcaagtga ctagagaaaa 2100 gattattgca gcacaaaaag aagacggctc tttgggcaat atcttttctt tagttgttcc 2160 gttggaagtt gccaatagaa gaaaaatagc tttctatatt gacaatggtt tactgatgcg 2220 taagtggtgt cccgacatta ctaatgaatg ggcagtcgta aatcaaattg tcattcctct 2280 atgttatagg caaatggtac tctccctggc tcatgatcac gatctttctg gacgtttagg 2340 cattaagaaa acttaccaac gaattctgaa gcatttcttt tggccgagat taaagtcgga 2400 tgtagcgaaa ttttgtagaa catgcaaggc atgtcagttt tcaggtaagc cgaaccaagt 2460 aattccacga gcaccgctag ttccaatacc agtcataggt gaaccttttt cacatgtcat 2520 agttgattgt gttggtcctc tgcctaagac aaaggctgga aaccaatatt tattaactat 2580 catgtgtacg gcgacgcggt ttcccgaagc tattccgctg cggaaaatca ctgcgcctgt 2640 ggtggtaaaa gcactagtga agttcttttc aacttttggt ttacctcgag tagttcaaac 2700 tgaccaaggt acgaattttc tttcaaaact gtttgcacaa gtactaaaga ctttaaacat 2760 ctctcataga acatccagtg cgtatcatcc agagagccaa ggcgctttag aacggttcca 2820 ccagacgttg aagtcaatgc ttcgtaaata ctgcatggac acagagaaag attgggacga 2880 gggtacacct ttagttttgt tcgcgattag agaagcagtt caggacagtc ttggtttcag 2940 cccggcacag ttaatatttt ctccctccgt tcgaggaccg ctcaaggtgc tcgaagaaaa 3000 tatactatca cctgtgacaa gtgctaaaac gaatgtttta gattatgtga gcaaatttcg 3060 tgaaagacta caccaagctt gttcatttgc aaaagagtcg ttagtgaatg ctcagatcac 3120 tatgaaaaag aaattcgacc gtgcatctgt tttgcgtaat ttcaacgaag gtgaccaggt 3180 attagtatta ttgcctgttg ttggttctgc tttttctgca tgttttgctg gcccttacac 3240 agtgattaaa aaattgagtg ataccgacta tgtgattgac acgcccgacc gccgaaaaaa 3300 gtcccgtgtc tgtcacataa atatgttgaa agcttatcac tctagagagt ctatccgtac 3360 agatcaaaaa gaaaatgaaa atgatttttc cactactgct ctggtatctg aagttcttta 3420 ttctgagtct aactcatctg aagatgatgg agtgattgtg cgtcacgatt atcagcagtg 3480 cgcccgactt tctaattctg aagttcttgc tgaccttgat tctcatttac atctacccga 3540 ttcacagagt aatgatatta aagaattaat cagttcgttt ccaatgattt ttagtgatgt 3600 gccatcctgt acttcagtct tgcaacatga catcgatgtg ggaaatgcta gaccgattaa 3660 acagaatgcg tatagagtga gtgctgcgaa acgcgccgtg atgaaagccg aggttgatta 3720 tttgctgaac catggtttgg caaaaccgag ttgtagtcca tggagttcac catgtttact 3780 ggtcacgaaa agtgacggta ccgctagatt ttgcactgat tacagaaaac ttaatgctgt 3840 gactgttcct gactgttttc ctcttccgcg catggaagat tgtgtggata atattggttc 3900 cgcgaaattt gtcagtaaat tggatttatt aaaagggttc tggcaaatac cactcacagc 3960 tcgtgcatct gatttatctg cttttgtgac tccagataat tttacgcaat attgcgtaat 4020 ggctttcggc atgcgtaacg cgccagccac aatttagaga cttataaatt gtgtgttagc 4080 tggagttgaa aactgtaatg catacttgga tgatgtcgtg atttattccc atagctgggc 4140 gaatcatatt tccactctca gggaagtatt catgcgtttc gagaaggcat ctctcacggt 4200 gaacttggct aaatgtgagt tcggacaagc tactatcacg tatttaggta aggaggtggg 4260 tcatggtaaa gtgaaaccgg tgaacgcgaa aataaatgcc attgcagaat tccccattcc 4320 tactacgcgt aaagaactgc gtcgattttt aggaatggca ggatattatc gcaatttctg 4380 taagaatttc tcgaccgttg ctaatccgtt aacatcactg cttagtccat cctgcaaatt 4440 tgcatggtct gatgattgtc agcaggcatt tgagaatata aaagctctct tatgtagcgc 4500 acctgtgttt gccgcaccag atgccacgaa accttttacg ttggagattg atgcgagcac 4560 tgttggagca ggtgcagtgc tcattcaaga agacgcccgt ggcattgagc atccagtatg 4620 ttatttttct cgcaagttta acaagcatca agtaaactat tcgaccattg agaaggaaac 4680 cctggcattg ttaatgtctt tgcagttttt tgatgtgtat gtgagttcaa gtggttttcc 4740 agtcactgtg tttactgacc acaaccctct agtgttttta ggcaaaatgt acaaccataa 4800 ccagagactg atgcgttggt ccttggtgac ccagggatat aacctggtta tcaaacacaa 4860 gagggggtcc gagaacatca ttgctgatgc gctgtccaga gctggtattg ttggagactt 4920 ggagtgagta aattatatgc attgcaatgt gtataatttt ttttgaggag ggaag 4975 // ID DNA-10-1_DR repbase; DNA; ZEB; 3496 BP. XX AC . XX DT 24-NOV-2008 (Rel. 13.11, Created) DT 24-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 10-bp; KW DNA-10-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3496 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1731-1731 (2008). XX DR [1] (Consensus) XX CC Copies are ~92% identical to the consensus. TSD is 10-bp long, CC and TIR is ~31-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3496 BP; 1000 A; 701 C; 555 G; 1236 T; 4 other; ggcaggcact tttttccagg ggtttcagta cagcgtaacc gctactacgt catcgggtgg 60 gcgtggccta tctgtgaatt tgcataggtc acggatcatg cgcagattgt gaaaacagcc 120 gtttgattca gatacctgta cgtattcgca ggggtttcgt gtccaatgcc ttttatatgc 180 agtactaaat agctgtcctg atcgcaggtg tttcatgtga agtcagtgta tgttatatta 240 aacaaattaa cttaagatac ctctcagatt acattaatac tgttataact taagatacct 300 gtcccgatcc casggctttt gtgtcaaata ctttatatat catatacagt aacatcagat 360 acctgtcctg atcccagggg tttcatgtat gatcactttt atatacatta acatcagata 420 cctgtcctga tcccaggggt ttcatgtatg atcactttta tatacattaa catcagatac 480 ctgtcctgat cccaggggtt tcatgtatga tcacttttat atacattaac atcagatacc 540 tgtcctgatc ccaggggttt catgtatgat cacttttata tacattaaca tcagatacct 600 gtcctgatct caggtgtttc atgtatgatc acttttatat acattaacat cagatacctg 660 tcctgatctc aggtgtttca tgtatgatca cttttatata tatcagtaac atcagatacc 720 tgtcctgatc tcaggggttt catgtatgat cacttttata ttatatacat taacatcaga 780 tacctgtcct gatctcaggg gtttcatgta tgatcacttt tatatacagt aacatcagat 840 acctgtcctg atcccaggkg tttcatgtat gatcactttt atatacatta acatcagata 900 cctgtcctga tctcaggggt ttcatgtatg atcactttta tatacagtaa catcagatac 960 ctgtcctgat cccaggggtt tcatgtatga tcacttttat atacagtaac atcagatacc 1020 tgtcctgatc ccaggggttt catgtatgat cacttttata tacagtaaca tcagatacct 1080 gtcctgatcc caggtgtttc atgtatgatc acttttatat tatatacatt aacatcagat 1140 acctgtcctg atcccagggg tttcatgtat gatcactttt atattatata cagtaacatc 1200 agatacctgt cctgatccca ggggtttcat gtatgatcac ttttatatta tatacattaa 1260 catcagatac ctgtcctgat cccaggggtt tcatgtatga tcacttttat atacattaac 1320 atcagatacc tgtcctgatc tcaggggttt catgtatgat cacttttata ttatatacag 1380 taacatcaga tacctgtcct gatcccaggg gtttcatgta tgatcacttt tatattatat 1440 acagtaacat cagatacctg tcctgatccc aggggtttca tgtatgatca cttttatatt 1500 atatacatta acatcagata cctgtcctga tctcaggggt ttcatgtatg atcactttta 1560 tatacattaa catcagatac ctgtcctgat ctcaggggtt tcatgtatga tcacttttat 1620 attatataca gtaacatcag atacctgtcc tgatctcagg ggtttcatgt atgatcactt 1680 ttatattata tacagtaaca tcagatacct gtcctgatcc caggtgtttc atgtatgatc 1740 acttttatat tatatacagt aacatcagat acctgtcctg atcccagggg tttcatgtat 1800 gatcactttt atattatata cagtaacatc agatacctgt cctgatccca ggggtttcat 1860 gtatgatcac ttttatatta tatacagtaa catcagatac ctgtcctgat cccaggggtt 1920 tcatgtatga tcacttttat attatataca ttaacatcag atacctgtcc tgatcccagg 1980 ggtttcatgt atgatcactt ttatatacat taacatcaga tacctgtcct gatcccaggg 2040 gtttcatgta tgatcacttt tatattatat acagtaacat cagatacctg tcctgatccc 2100 aggggtttca tgtatgatca cttttatatt atatacagta acatcagata cctgtcctga 2160 tctcaggggt ttcatgtatg atcactttta tattatatac attaacatca gatacctgtc 2220 ctgatcccag gggtttcatg tatgatcact tttatattat atacagtaac atcagatacc 2280 tgtcctgatc ccaggtgttt catgtatgat cacttttata ttatatacag taacatcaga 2340 tacctgtcct gatcccaggg gtttcatgta tgatcacttt tatattatat acattaacat 2400 cagatacctg tcctgatccc aggtgtttca tgtatgatca cttttatatt atatacagta 2460 acatcagata cctgtcctga tcccaggkgt ttcatgtatg atcactttta tattatatac 2520 agtaacatca gatacctgtc ctgatcccag gggtttcatg tatgatcact tttatattat 2580 atacagtaac atcagatacc tgtcctgatc ccaggtgttt catgtatgat cacttttata 2640 tacattaaca tcagatacct gtcctgatcc caggggtttc atgtatgatc acttttatat 2700 tatatacagt aacatcagat acctgtcctg atcccagggg tttcatgtat gatcactttt 2760 atattaatta acatcagata cctgtcctga tcccaggtgt ttcatgtatg atcactttta 2820 tatacagtaa catcagatac ctgtcctgat cccaggggtt tcatgtatga tcacttttat 2880 atamattaac atcagatacc tgtcctgatc ccaggggttt catgtatgat cacttttata 2940 ttagttaaca tcagatacct gtcctgatcc cagttgtttt gtgtatgatg actttaaata 3000 aaaatagaaa ataaatatga gttatgttaa ttaactaaga tcgagaaaga aaaattaacc 3060 acgatttatt acagaaaaat gtggtattta tttgtttaaa catgactgaa taccatattt 3120 taaactacaa aaacacggtt aatcttacca caaccattat tatttggttt aaaatggaag 3180 tacaaaagcc cagagacagt aaaacggagt aaaacgaggt cagccgtttt tgctaatgaa 3240 atctttaaaa tataacatta taaaaacaaa aaaataacct gtatgattgt ttttttcctt 3300 cgttatccac cgcggtaggt tctttatcca taatcatcct gtttgaaatg accgcgctaa 3360 aaaagaacct gtactgcgca tgatcggtga cctctcagta ttgttttttt tttttaaggg 3420 ggcgtttccc aaacaccgac cgagccacgc ccattttacg tcgtggtaat gaaacccctg 3480 gaatttagtg catgcc 3496 // ID LOOPERN4_DR repbase; DNA; ZEB; 917 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 19-FEB-2009 (Rel. 14.03, Last updated, Version 2) XX DE LOOPERN4_DR is a nonautonomous DNA transposon - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; Dr000048; LOOPERN4_DR. XX NM LOOPERN4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 917-64 RA Bao Z.; RT "Dr000048, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-917 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN4_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(6), 19-19 (2002). XX DR [2] (Consensus) XX CC About 1000 copies of LOOPERN4_DR are expected to populate the CC zebrafish genome. LOOPERN4_DR copies are ~15% divergent from the CC consensus sequence. CC This element is characterized by 11-bp terminal inverted repeats CC and putative TTAA targets site duplications. CC It belong to the Kolobok superfamily. XX SQ Sequence 917 BP; 284 A; 174 C; 185 G; 274 T; 0 other; atggcaccta tgatgaaaat caacttttgt aagctgtttg aacagaactg tgtgtaggtt 60 tatgtgtgtc cacagtcata ttggagtgat ataaacccaa caagtatctt tattaaaatt 120 tcctgacgtt aaaataggat ccaaatccca gtgattttga ggcccaccgc aacgtgacca 180 ttaggagtgc ggttttcccc gcccaccgaa ttgattgaca ggcgccatgt ctctataata 240 acatgtatac acatgtccac agaacatttt ttgcaaagaa actgggatta aaacatctgt 300 tacaactctc tgtgatctgc tccttaataa ttagttttat aagttttaaa acgtgttttt 360 aaaacagtgc atgtttgtaa taaagacagt aaaattgcta tgtaattctt aaccgctata 420 atcaccacgg ccgcatggtg tcagtaaatg cgcataagtt tgtaaagtta aataaatgtg 480 tgtgtgtgtg tgtgtagtgc atgacaaact gtgtgtactg caaacggcat ttgtgtgtga 540 ctcatcattt cagaaaggct tgaataaact ccaccacaaa tacatcaaat aaacttactt 600 ggtatttttg actaatgagc tgtatttcag cttcatccgt gagtctgtct ctgtcactga 660 ctgctgttta tctgacgtaa cgcatgatga gaagcagaca tgcacgtggg aacggtgggc 720 ggggagaagc agctcatttg catttaaagc cacaggctac aaaaacagct acactgtcct 780 cagacaccaa aatgggcaga ttctgcaggc tataataaat aatctgatgg gtattttgag 840 ctgaaacttt acagacacat tctggagaca ccaaagtctt atcttacatc ttgtaaaaga 900 ggtaaaatag gtgccct 917 // ID LTR2_DR repbase; DNA; ZEB; 1656 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE Long terminal repeat from some endogenous retrovirus - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; endogenous retrovirus; KW LTR2_DR; class I ERV. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1656 RA Kapitonov V.V. and Jurka J.; RT "LTR2_DR, a family of zebrafish long terminal repeats."; RL Repbase Reports 2(5), 28-28 (2002). XX DR [1] (Consensus) XX CC LTR2_DR is a long terminal repeat of unidentified endogenous CC retrovirus that belongs to the class I ERV. CC LTR2_DR solo-copies are flanked by 4-bp target site duplications. CC LTR2_DR copies are over 90% identical with the consensus CC sequence. XX SQ Sequence 1656 BP; 492 A; 318 C; 287 G; 512 T; 47 other; tgtaacatat tgtaacctgc attctggaga taaacattcc atttaacatt ccagaaaaca 60 ntctgggtga atcgccagga caataaaatc acttggcgac gaaaagtctg gcttrgtaat 120 ccstggtgac atgttaacaa gtcatgggag acttyacaga accactctct gcctacttca 180 aaagaacntg acagataaga taagaaanaa aaaaaacact ctttagccaa cattctgctc 240 acagacaaga cattcctcaa aattaaacaa agacatttan aacccaaatc tcttgtttaa 300 tttcatttna cttttstnan ntgtanaanc cctnatatan nctntannca tncgtccctt 360 naggcctttt taaaagcatn aatnatgtaa ggatatgnna antatgttga tgtacnttta 420 atgtgatgna cctgtanagt gtttggtnnc natnggntnc nnnngacatt ttacaacaca 480 tgatgcatag ancnagacca taactcgcat tgctttattc tttaaagccc tgtattaaat 540 gcctgtccgc atgctttagg cggacactca aatttgcata cgagcagccc cgagacaaag 600 gaaggttcgc gcgcaaactt tcccctgaaa tcattggtca gaatgctgaa cgggttgtca 660 cgtgacccgc aggtataach cagacttctc ncccaaacat ctgggcagaa agagaacgtc 720 csaaaagaag aacatcattg acggtggcct tcgggccaca cgtggctttc taagccgcat 780 ggactttttc ccgccacgct ttctttattt tccggcaaag taaactcagt ttaaagtttg 840 cgatcgtgtt cgtccgaact gcattacaaa ctccacgcat caaaggacat caaaagtagt 900 tttcatcacg acggaagaga sacgcataaa agagacctcc agacatgaaa gaccaggtga 960 ttttagttat gcgattaagc tgtgcccctt ttatcaaaaa ggtgtttttt ttwtataacc 1020 ttggtggtcc ttaatggtgc gtgtggaact gaaagttttg tcactcttta atctgaaatc 1080 tatattttct agctttacta tttctctttg ctgttacacg ttctatagac ccgatatctc 1140 cacgtggttt acctttgttt tgtgtttaat gtatgtttgt gcgtttgttt gttcgttacg 1200 tctgactagt caataaattc cattataatc aaatgaattg tattgtttgc ctcacgagaa 1260 aatgtcactg aaatacagat tcccctgcct agctattata agttgtttaa aacttttgaa 1320 tgattaatct acttcacaag aagtagcaat taaattccct ttttctaaat gaatagatta 1380 ccgtgtccgc tcggaagagc ggcggctctg cttgtgttcg tttggtcaaa ttaacaataa 1440 attgatccgg aatattaatg attaataata attcgttatt gttgataatt aatattcata 1500 atctgggaat ccgctcggtc gcgcgagcat gatcatttac aatcatatgt ttgtttgacc 1560 aaattgactg aagcttaagc cggaatatta ataattaaga ataaaccgtt attgttgatt 1620 attaatattc ataaaatgag ctaatttcat gttaca 1656 // ID Gypsy82-LTR_Dr repbase; DNA; ZEB; 1345 BP. XX AC . XX DT 20-APR-2009 (Rel. 14.05, Created) DT 20-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-82_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; KW endogenous retrovirus; Gypsy-82_DR; Gypsy-82-I_DR; KW Gypsy82-LTR_Dr.; Gypsy82-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1345 RA Dib M.R. and Naveira H.F.; RT "Gypsy82_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 959-959 (2009). XX DR [1] (Consensus) XX CC Gypsy82-LTR_DR is a long terminal repeat of the Gypsy82_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-82-I_DR. XX SQ Sequence 1345 BP; 275 A; 201 C; 364 G; 505 T; 0 other; tgtaatgggg cgatccccaa aggggagggg tttcccactc gtatgcgtgt agtgggggat 60 ccccaagggg gaggggtttc ctcactatgt ttaggtgaat ggagttagaa attaagctcg 120 ccaccccttt aaaagtgtgg aggttttgct tccgttatct ccctggcatg gaatagtggc 180 agtgtgttga gtgcgtgcaa acttttgatg tgcgttaagc attgagcgtt gagcagtaag 240 cctgcagtgt ttagaaaaac ctgtaattct gtttctttgg cctgcggaag tgcaatcgca 300 tcagcaagta acggtatgtc tctgtttatg tctttggcta tttctggcga gttttagagc 360 tgttaattaa taataaataa ttcttttagc tacatccatg tatgattggt ggattgctca 420 ccatttgcgt taagagccag ttgactgggc tgcgggtaag tttctatttg agagtctgta 480 aatgttttat ctagaaatat gattttaaaa cgttaacatg cgaatgaata ttgactggtt 540 gttttataga gagcccgttg ctgttttgct gctgtctggc gacccgtttg tgctacggag 600 gggcgatttc actttccttc tcggcacctg attgtgtgtg actcggttgc gtcaaggcag 660 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtt 720 tgtgtgtgtg tgtgggtgtg agcgcgccct tattgtgcga gggtgcgcta aattctttcg 780 acgttggcgc ggttgtgagt gtgcgtgtat gtgtgcagtg ttgtgattta actatttggt 840 gggtttgtat tttgtttgtg taaattgcta agggctgtta ttgttcaact tacatttcaa 900 tactaacagc aaggattacc ttatatgggg agggttgtgt tattttggct ttgtttgtta 960 tgcgggtggt ttattttgag tgtgggttag tttctcttta ttaatgagtg ttttgttttg 1020 aaggtttcgt tgcactggag tgtggcattt tcctcttttt ccttttgatc tttcccaatg 1080 gtatatcctg tgtaatattg agtgtgagag cagtggccac tgacaaccat tatacaaacc 1140 attgtattgt tttgtgactt attgtaattg actatctgtt ttatatgatt acattttaaa 1200 gaaaaactct ctttaagtgt ctaataaaag attgtttgta cacatctctg ctttatttcc 1260 atttgtttgg agaaaacctg aagttgtggg tagtaaagtg aacctctatt tggttatatt 1320 cagtacagta cgggccccgg tcaca 1345 // ID Gypsy66-I_DR repbase; DNA; ZEB; 6226 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy66-I_DR; KW Gypsy66-LTR_DR; Gypsy66_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6226 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1447-1447 (2008). XX DR Genome; chr14; Positions 14659340 14665565. XX CC Positions [4651-5124] - Integrase core CC 'TCAG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 122..1546 FT /product="Gypsy66-I_DR_2p" FT /translation="MGTLRELRREIERQLHLLVGPSNCEALFKVAQCCAEE FT EGVMIPGMEATEEELYDFIVDFMRREQMHSLEDQGMARLLTLHDLVRELTT FT TDQGSASNVNENEETVGLLTSASTSTAVRGQVAQTTGLTSHVTAEQTVPLS FT RVSEQMTGLIKFSDVASFLPRREFKIYGGQISDTNSDLSFNSICKQIDEGI FT AENFTESEIIRTVLKVIRPGTFKEMLITKDSLTVTELKRFLRAHLRDKSST FT ELFQELSNAKQQEKESPQQFVYRLMGLKQRVLFASEQTESEFQYDRKLVQG FT VFLHSLYQGLNEKCTFVRRDIKTVIATQTATDDQILELITQSVSEETERQK FT RIGHTYKSRVTAASVTQQDEVDGKLASLSHAEVKANREAIHELTAQVTALT FT KSLEKVMSIRPAVQQQADLAAATAQPPSKSETKGKCKQCIAQNVDVCTHCF FT LCGQAGHRAVGCLMKSRSPKERRSLGGDHQ" FT CDS 1459..6132 FT /product="Gypsy66-I_DR_1p" FT /translation="MWPSRASSCGLSHEKSFAKREEVTGRGPPVTEKDLVS FT QPFSNASRGEKPVTQNTFKRAIVEKRVAQLIGGRCMITCNLNGVKTQMLLD FT SGAQVSIVGQAWLKKTLPTIEIQPLESLLADNQLSVTAANGTAVPFNGWVE FT VLLEISNDKQNNITIQVPLLVSQKCMDCPLLGFNVIGEIIRENNDCVSSVN FT LIDLLSETLQMQKDTAEILVSTVNSNQIKEYTASTKVKTGKLRVIIPAGKI FT LEVRCRVKAGHKSGIMMFEPASESCLPEGLELFPAVVNVSPYASKTVRIPI FT QNCTNHDIHLPQRLVLGGIESISEMRPVCPSSSVEQSDKQISSALLCSSQV FT PSVDDVSSKEEHQNQTMDEKWHPPVDLGHLNKQQQEVVRKMLFEESDVFAR FT EEGDIGCIPDLQLKIRLTDNTPVQRCYNAIPKPLYKEVKDYVQNLLNRGWI FT RKSSSSYSSPVVCVRKKDNSLRLCVDFRDLNRKTVPDRHPLPRIQDLLDSL FT GGNSWFSILDQGSAYHQGFVSEDSRHATAFNTPWGLYEWVRIPFGLTNAPA FT AFQRCMEGVLEGFRDECCAPYLDDVLCFSKSFNDHIDHLRNVFCQMRAHGI FT KLRPQKCELFKRQVRYIGRLVSGEGVQIDPADLEAVRALKNKVPRTVGEVR FT ALLGFLSYYRSYIQDFSRLAKPLFELLQRPPETSESRQNPKVHSRSGAIRL FT KEKGQLHSRTPVVWTVEHGSIVSKFVDVLTSPPVLAYPNFDLPFVLHTDAS FT NEGLGAVLYQHQGNKLRVIGYGSRTLTPAEKNYHLHSGKLEFLALKWAICD FT KFRDYLYYAPTFTVYTDNNPLTYVLSTARLNAVGQRWVGELADFHFDIKYR FT PGRTNVDADTLSRHPLKLTDIMEEHINTLSPETVSAVWQGSKASTERDVPW FT IAALELNTPSKEMLFNEENFATVTPEKVKAAQRNDPAIGEVVKLKEQGWTP FT NWKDKNSMKKETRKLLFEWMKLEMKDGILYRKAGQHWQLVLPLLFRALVLK FT KLHNEMGHVGVEKVLHLARERFYWPSMRKEIEEYITTKCACIKQKHPHVHQ FT RAPMGSITSSSPFELVSVDYLHLEQSRGGYEYILLLVDHFTRFAQAYPTKN FT KSGKTAAEKIFQDFIPRFGYPEKLHHDQGKEFENHLFQKLQQLSGVAHSRT FT TPYHPQCNPVERLNRTILQMLRTLEEEKKSNWKDYLPHVVHAYNCTRHEST FT GYSPFFLLYGRSPRLPVDLLFNLKNKTESSNQHTFAEQWEDRMRQAYQIAK FT ENSQRSSAKGKKYYDRAVRGAILQPGDRVLVRNLSERGGPGKLRAYWEKDV FT HRVVERIGDGPVYKIQPEIGSKTLRVLHRNLLLAVNDLPLDESMARTKETK FT KVKQNKQMRNQPEDESESDTSDEETHTYHYDLRKNIPCYQFVSIPEKKREP FT VNPMAQSHSRLRPSATEFCPDKNPKSLENQIQEFDQEHGETQRMVENPLPV FT TQDEDDEELRNEAETIRNVESEMAEEENVRRTQRVGKPRKLLTYEALGEPS FT YQPWRAEANTLWLTQQYMPQENAFYYQSMPTHPCCHWGYNRF" XX SQ Sequence 6226 BP; 1989 A; 1260 C; 1455 G; 1522 T; 0 other; acttgggggc tcgtccggga tcagcgccac ctggtggtca ggtgctgatc attaaagaac 60 aacagtggtt aaacttcacc caggaacagc gagttgagtt gcagacatca gcaatccagt 120 catgggtacc cttcgagaac tgcggcggga gatcgagagg cagctccacc ttctggttgg 180 cccttctaac tgcgaagcac tcttcaaggt agcccagtgt tgtgctgagg aggaaggagt 240 gatgatccca ggcatggagg ccacagaaga ggaactctat gacttcattg ttgatttcat 300 gcgtagagaa caaatgcata gtctagagga ccaaggaatg gctcgcttgc tgacccttca 360 tgaccttgtt cgtgagttga ctacaacaga tcaaggttct gcttctaacg tcaacgaaaa 420 tgaagagact gtgggactat tgacatctgc atcaacatca acagctgtca gggggcaggt 480 agcacaaact acaggtctta catcacatgt aacagccgaa caaactgttc cactatccag 540 agtatcagaa caaatgactg ggttaatcaa attctctgat gttgcatcct ttttgccacg 600 tagagagttc aaaatttatg gtggccaaat ttctgataca aattcagact tgagttttaa 660 cagcatttgt aaacaaattg atgaaggtat tgcagagaat ttcacagaat cagaaataat 720 ccgaacagtg ttaaaggtaa tcagacctgg tacattcaaa gagatgctga ttacaaaaga 780 tagccttaca gtaactgagt taaagcggtt tcttagagct caccttaggg acaaaagtag 840 cactgaatta tttcaggaat tgagtaacgc aaaacaacag gaaaaagaat ctcctcaaca 900 gtttgtatac aggttgatgg ggcttaagca gagggtgttg tttgcgtcag aacaaacaga 960 gtctgagttt cagtatgacc gtaaattggt ccaaggtgtt ttcctacact cactttatca 1020 gggcctgaat gaaaaatgca catttgtccg gagggacatc aaaacagtta tagcaacaca 1080 aacagctaca gatgatcaga ttttggaact gattacacag tctgttagtg aggaaacgga 1140 aaggcaaaaa cgaataggtc atacatataa atccagggtt accgctgcaa gtgtcacaca 1200 gcaggatgag gtagatggta aattggcatc cctgagtcat gcagaagtta aagctaatcg 1260 tgaggccatt catgagctta ctgcacaagt aactgcactg accaagagcc tagaaaaagt 1320 tatgtccatt cgccctgctg ttcaacagca agctgatcta gccgcagcaa ctgcacaacc 1380 accttcaaaa tcagaaacaa agggaaaatg caagcaatgc atagctcaaa atgttgatgt 1440 gtgtacccac tgtttcttat gtggccaagc agggcatcga gctgtgggct gtctcatgaa 1500 aagtcgttcg ccaaaagaga ggaggtcact gggaggggac caccagtgac cgaaaaagat 1560 cttgtgtccc aacctttttc taatgcttct agaggtgaga aacctgtaac acaaaacaca 1620 ttcaaaagag ctattgtgga gaaacgtgtt gcacagctga ttggtggtcg ctgcatgata 1680 acttgcaatt taaatggtgt caagactcaa atgcttctgg acagtggcgc ccaggtcagt 1740 attgttgggc aggcctggct gaagaaaacc ttaccaacca ttgaaatcca acccttagag 1800 tctttattag ccgacaatca gctttctgtc actgcagcaa atgggacagc tgtacccttt 1860 aatggatggg ttgaagtttt gttagaaatc tcaaatgata aacagaacaa catcaccatc 1920 caagtacccc ttttggtgag tcagaagtgc atggattgtc ccctacttgg attcaatgta 1980 attggagaga tcataagaga gaataatgat tgtgtaagta gcgtcaacct cattgacttg 2040 ctttcagaga ccttgcagat gcaaaaggac acagctgaaa tcttagtttc cacagtaaat 2100 tctaaccaaa tcaaagaata tactgcaagc accaaagtga agactgggaa gcttagagtt 2160 atcattccgg caggtaaaat tctagaagtg aggtgtcgtg taaaagcagg gcacaagagt 2220 ggaatcatga tgtttgaacc agcatcagaa agttgcctcc ctgaggggct ggagttgttt 2280 ccagctgtgg taaatgtgtc cccttatgcc tcaaagacag tgagaatacc tattcaaaat 2340 tgtacaaatc atgatattca cctacctcag aggctggtgt taggtgggat agaatccatc 2400 agtgaaatga gaccagtatg cccatcttcc agtgttgaac agtcagataa gcagatcagc 2460 agtgcacttc tgtgttcatc ccaagttcca tcagtagatg atgtcagtag caaagaagaa 2520 catcaaaatc agacaatgga tgaaaaatgg cacccacctg ttgatctggg acatctcaat 2580 aaacagcaac aagaggtggt gagaaaaatg ttgtttgagg aatcggacgt gtttgctcgg 2640 gaggagggag atattggatg tattcctgat cttcagctga aaattcgcct aactgataac 2700 acaccagttc aaagatgtta taatgccatc cccaaacccc tttataaaga ggttaaagac 2760 tatgtgcaaa acctcttgaa tagaggatgg atcagaaagt cgtcatcttc atattcatca 2820 cctgtggtat gcgttcgaaa aaaagacaac agcctgcggt tgtgcgttga ttttagagac 2880 cttaaccgta agaccgtacc tgatcgacat cctcttccga ggattcaaga tcttttagac 2940 agcttaggag gaaattcatg gttttctatt ttggatcagg ggagtgcata ccaccaggga 3000 tttgtgagtg aggactctcg acatgccact gcctttaaca ctccatgggg attatacgag 3060 tgggttcgta ttccttttgg attaacgaat gccccagcag catttcaacg atgcatggag 3120 ggagtgttgg agggatttcg tgatgaatgc tgtgctcctt acttagatga tgtactttgt 3180 ttctcaaagt ctttcaatga ccacatagac catctccgaa atgtgttttg ccaaatgcga 3240 gcgcatggaa ttaaactccg tccacaaaaa tgtgagctgt ttaaaagaca ggtgcgatac 3300 ataggtcgtt tggtatcagg agagggagtt caaatagatc cagcagattt ggaagctgta 3360 agggccctaa aaaacaaagt accacgcacg gtgggagaag tgcgtgcact gcttggattt 3420 ctaagctatt ataggtctta tattcaagat ttttctagac tagccaaacc actctttgag 3480 cttctccaaa gacctcctga gacaagtgag agtagacaaa acccaaaggt tcacagtcgt 3540 tcaggtgcaa taagattaaa agaaaaggga cagcttcact ccagaacccc agttgtttgg 3600 acagtagagc atgggagtat tgtttccaaa ttcgtggatg tgttgacaag tccgccagtg 3660 ctggcttacc caaattttga cctacctttt gtgttacaca cggatgcttc caatgaaggt 3720 ctgggagctg tgctttatca acatcagggc aacaaactcc gtgtcattgg gtatggatcc 3780 cgcactttaa caccagcgga aaagaactat catctacatt cgggtaaact ggaatttttg 3840 gcacttaaat gggccatctg tgataagttt agagactatc tttactatgc tccaacattc 3900 acagtttaca ccgacaacaa tccactgact tatgttctca gcacagcaag gttgaatgcg 3960 gtagggcagc gctgggtggg tgagttagcg gattttcact ttgacatcaa atacagaccg 4020 ggaagaacca atgttgatgc cgacacattg tcaaggcacc ctctgaaact gacagacatc 4080 atggaagaac acatcaacac attgtcacct gaaactgtgt ctgctgtttg gcagggtagt 4140 aaagcatcaa ctgaaaggga cgtaccttgg attgctgctt tagagcttaa cactccaagt 4200 aaagagatgc ttttcaatga ggaaaatttc gccactgtca cccctgaaaa agttaaagca 4260 gcacaacgaa atgatccagc cataggtgag gtagtcaaat taaaggaaca aggctggaca 4320 ccaaactgga aagacaagaa ttccatgaaa aaggaaacaa ggaaactact ctttgaatgg 4380 atgaagttgg agatgaagga tggaattttg tacaggaaag caggacaaca ctggcagtta 4440 gttcttcctc tgctgtttag agcattagtg cttaaaaagt tgcacaatga aatgggacat 4500 gtaggagttg agaaagtgct tcatcttgca cgtgaacgct tctattggcc ttcaatgaga 4560 aaggagattg aagaatacat aaccacaaag tgtgcatgta taaaacaaaa acatccgcat 4620 gtccatcaaa gagcacccat gggttctatc acatccagtt ccccatttga gttggtgagt 4680 gtggattact tgcatcttga acaaagtagg ggtggttatg agtatattct cctgctggta 4740 gatcacttta cacgctttgc acaggcctat ccaacaaaaa ataagtctgg caaaactgct 4800 gcagagaaga tcttccagga cttcattcct cggtttgggt atcctgagaa actacaccac 4860 gaccaaggga aagaatttga gaatcacctg tttcaaaagt tacaacagct gtcaggagtt 4920 gcccactcgc gaaccacacc atatcaccct cagtgtaacc ctgtggagcg actaaatcgt 4980 acaattctgc aaatgctcag gactctagag gaggagaaaa agagtaattg gaaagactac 5040 ttgcctcacg ttgtgcatgc ctataattgc acaaggcatg aatcgactgg ttactctccc 5100 ttctttctct tatatggcag atcaccacgg ttgcctgttg acttgctgtt taacctgaaa 5160 aacaaaacag aaagctcaaa ccaacataca tttgcagaac aatgggaaga cagaatgaga 5220 caagcatatc agattgcaaa agaaaatagc caaaggtctt cggcaaaggg caagaaatac 5280 tacgatcgag cagtaagagg tgccatcctt caaccaggcg accgagtgtt ggtcagaaac 5340 ctctcggaga gaggtggacc gggaaagttg cgtgcatact gggagaaaga tgtacatcga 5400 gtggtagagc gcattggcga tgggccggtg tataagatcc aacctgagat tggctcgaag 5460 acacttcgtg tgctacaccg caacctcctg ctggctgtca atgacttacc tttggatgaa 5520 tcaatggcta gaacaaaaga gacaaagaaa gtaaaacaaa acaaacagat gagaaatcaa 5580 ccagaggatg aatctgaatc agatacctcg gatgaggaga cacatactta ccattatgac 5640 cttcgtaaga acataccttg ctaccaattt gtgtctattc cagaaaagaa aagagaaccg 5700 gtaaatccaa tggcccaaag tcattctcgt ctacgtcctt ccgcaactga attttgtcct 5760 gacaaaaacc ctaagtcctt ggaaaaccag attcaagagt ttgaccagga acatggagag 5820 acacaaagaa tggttgaaaa cccactgccg gtgacacaag atgaagatga tgaggagctg 5880 aggaacgaag cagagacaat cagaaatgta gaaagtgaga tggctgagga agaaaatgtg 5940 agaaggacac aaagagtggg aaaaccacgg aaattgttaa cgtatgaagc tctgggtgaa 6000 ccatcttatc aaccgtggag ggctgaagcg aacacacttt ggttgacaca gcaatacatg 6060 cctcaagaaa atgcatttta ctaccagagt atgcccactc atccttgctg tcattggggt 6120 tataatagat tttgaggtta gaaaaaaatt aggaagagat caaagcaaaa taaagggtaa 6180 agaggttatt aagatgtacg gagccatcta agaaaatagg ggagag 6226 // ID DNA-2-13_DR repbase; DNA; ZEB; 830 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-830 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1704-1704 (2008). XX DR [1] (Consensus) XX CC Copies are ~89% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 830 BP; 284 A; 115 C; 123 G; 308 T; 0 other; cccacatgga aagaacctat atgaggatat atgtgcatat atgaaaccta tatgcagtat 60 atatgcacat atatgataat atatatgctg tatatatgtg tatatatgcc acatataggc 120 aaaattgagg tgcatatatg tgcatataca ggccaaatag gtctcctgta ttgcttcttc 180 atatccacat ataagccctt tatgttcata catgatatgt ctcctatata gctcatatct 240 cactttggca actgtcatac atgatgcctt gactaggtca tattttctat tatcaaggca 300 ataactaaga actaacttaa aaaagtgcaa aaaaaaacgt atgtatgtgc gaacacttaa 360 atttggtatc acttgtaatt ggcatgttat ggattttaac tatggcaaat aagtgagagg 420 agatggaaag ctatgatgtc tacacgtttt cctgaaacat ttacaaggtc aattctttct 480 tgtatgaaga tagtggaaaa gaaagaaata tataacacaa tggtttgtct ttttttttct 540 ttattttttt tgttgtttat tctcttattt ttgttcttgt actatattta tttcttgtac 600 tatttgatta agtaaataaa aaaaaaaatt tcttttaaat aatgttgtct agcagctgta 660 ttttaataat tttactaaca tataggtgaa tatatatgta cccatatatg tgcatatatg 720 tacatataat ataggaaaac ggccaatttt atatatgtca catatatcag aaacctatat 780 caaaacctat attaaaacat atatgtttat ataggttctt tccatgtggg 830 // ID DNA-TTAA-2_DR repbase; DNA; ZEB; 1669 BP. XX AC . XX DT 06-FEB-2009 (Rel. 14.02, Created) DT 06-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; target TTAA; KW TSD 4-bp; DNA-TTAA-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1669 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 536-536 (2009). XX DR [1] (Consensus) XX CC TSD is TTAA-specific. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1669 BP; 475 A; 339 C; 344 G; 510 T; 1 other; agggtcacga aacaccaaaa cacatttttt gagatgttga cagtcatata tgtgtcccac 60 gctgctaaaa acactattag gacacatata tttcacaaaa aagtgaaaat tggttgtttt 120 tgcgttattt cgagcaaatt cgttcttccg gtttgaaaag aaattttgaa gctacgtcac 180 ggccatgaga tccttgtgta aattccagcg tggagactgg atgtctgtac cggcgggtac 240 aacgtgacgt ctttgagttt tcagcattaa ttcatgagaa agacttggtt cgaaccaatc 300 agcgcgctct attgtgaatg aggtgcaact tcattaatat gcatgatata gcttcgaaga 360 ctgtttttac ctgttacagt gttcagacgg cagagagacg ccacgttgtg ttgccaaccg 420 taattaaaac aagtggaaga agaagaattg tttggagaca tgggtcgtca gtgttgcgtt 480 tataaatgtg ctgcaacgaa ggttttgttt tcatttcccc gctatgagag cgcagctggc 540 gtgtggactc acatgtgtgg attacagtgg tctgctaaca ygcgacaaaa atatgtttgt 600 aaggaacatt tttttacccg agagcttttc gcatctggaa atggtgaaat ccggatttgc 660 cgctcgtctt cttttaaaaa aagacgcggt tccagttggt gttgattgtc ctgtctctac 720 agatttggta agtgtgtgat caatgttctt tgtttatgtt tattcagatg gcaaagttag 780 ttagtatcgt cagcagtact ctttcagttt ttaaagacat accttagtca aaatgattta 840 gttactaagt ttacagtacg ttaatatcac tacaatatta tggactgaaa gatccgctgt 900 aaatgctgct ctccgaatgt aaacatgtaa acaccttaat caaactatta ccgtcgtgta 960 ggattttcgc cgcatatgat ttttgccgca ttttgtgaca ggatagtcta tacacacacg 1020 gctgtttgac gctattctct gcacctaccg agtctctgca tctaagttac cgagaaatgc 1080 agaggtttta tttttttctc ccatacggcg tgcggtatca aacattccat taaaacaaca 1140 ctcttccagc agttcctcgc atccaatatc tcgtttgtca cggggggcat gcatgaaatg 1200 ttcctgaatg aaagtgaaag tgccaaactg cagttaaagt cgacaaatta agaatgaaac 1260 acccaaaatt acatgaaact ccagagatga tggatagcgt ggtgacgcaa tgacgttaat 1320 cgaattatgt gctataacat gtaaaacggg atcatgaaag gaacattcaa aaagcaactc 1380 atgtaaacac cttaatcata ttattgtctt attcagatta aggcaaataa tttgattact 1440 gatgtccatg taaacgtagt cactgtctat ctcccctgcg tctgtgtttg tgttgcctct 1500 gtgaaaatca gctgtgcaca tactgaaact ccccttttca tgcaaaccct tccctctttc 1560 gccactcgac actcccacct aaacaaagct ggactcaccc actttcctga cttttttcaa 1620 actagaggtg tgaaaacacc ctgctgagac aggggggttt catggccct 1669 // ID hAT-N66B_DR repbase; DNA; ZEB; 195 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.01, Created) DT 08-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N66B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-195 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(1), 17-17 (2009). XX DR [1] (Consensus) XX CC ~94% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 195 BP; 45 A; 43 C; 50 G; 57 T; 0 other; cagtgtttct caaccacgtt cctggaggac caccaacact gcatgttttg gatgtctcct 60 ttgtctgtca cacccattac aggtctttca gtctctgcta atgagctgat gatctgaatc 120 aggtgtgttt ggttaaggag acatggaaaa tgtgcagagc tggtggtcct ccaggaacgt 180 ggttgagaaa cactg 195 // ID MSAT-3_DR repbase; DNA; ZEB; 165 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Minisatellite-like DNA - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-165 RA Jurka J.; RT "Minisatellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2010-2010 (2010). XX DR [1] (Consensus) XX SQ Sequence 165 BP; 66 A; 22 C; 44 G; 33 T; 0 other; gagttgacga taacagagtt gacgataaca gagttgacga taacagagtt gacgataaca 60 gagttgacga taacagagtt gacgataaca gagttgacga taacagagtt gacgataaca 120 gagttgacga taacagagtt gacgataaca gagttgacga taaca 165 // ID Gypsy103-LTR_Dr repbase; DNA; ZEB; 819 BP. XX AC . XX DT 21-APR-2009 (Rel. 14.05, Created) DT 21-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-103_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW LTR; Gypsy-103-I_DR; Gypsy-103-LTR_DR; Gypsy-103_DR; KW Gypsy103-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-819 RA Dib M.R. and Naveira H.F.; RT "Gypsy103_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 943-943 (2009). XX DR [1] (Consensus) XX CC Gypsy103-LTR_DR is a long terminal repeat of the Gypsy103_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is Gypsy-deposited in Repbase as Gypsy103-I_DR. XX SQ Sequence 819 BP; 125 A; 222 C; 154 G; 318 T; 0 other; tgtcacaccc ctggtttgat cacgccgcct ttgtttatgt tgtcatgtca ttagcacgcg 60 tggatgtttt gttttggtct gtcaccttgt gctcaacgta atattcctcc ctcccccttg 120 ttaccttatt ttactattat catgatttaa ttaatctcac ctggattcag ttatgctcct 180 tcccctattt agttccctct cgtgtgtcat cctgtgtcag atcgttgtgt gtcagtccgt 240 gtaaacagtc ttccgagttg tctctccagt cctgtcgtgt tggttggtgt tatcccttcc 300 agttgtgtct cgcctggccg tcatcttact tctagcgctc tgtccttctt gtgtgtgttc 360 cattttggct catatccctc tgttctcagg caccgtcatc ccctccgcac tgagttctcc 420 gacatcacct gccgtccgac ttcccctctc cctcgctcgc tctcgtcggt cacatctaaa 480 acagccgggc catagtgcgc cctctaccgg aggagtttgt aaggacattc ttttctcatt 540 atatttttgc tgtcttaacc ccggcctttt tgttgttttt tctccttgtg tgcagcgcgc 600 cactggccga gagactctgt tttttgaccc tcggaaattt ttatttttct cttttcaaac 660 ccggtaacgg aggaggtcct aaagactatt tatttgatct ttttggattt ttgtgttatt 720 ttttggctgt tattttgtgg ccctgactac agtctttgct gttcattaaa atctttaatt 780 tgaacctgca tttgtgtcct tccagtccca ttcctgaca 819 // ID HARBINGER3N_DR repbase; DNA; ZEB; 1144 BP. XX AC . XX DT 06-NOV-2003 (Rel. 8.1, Created) DT 06-NOV-2003 (Rel. 8.1, Last updated, Version 1) XX DE Nonautonomous Harbinger-like DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW HARBINGER3N_DR; HARBINGER3_DR; Harbinger superfamily; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1144 RA Kapitonov V.V. and Jurka J.; RT "HARBINGER3N_DR, a nonautonomous Harbinger-like DNA transposon RT from zebrafish."; RL Repbase Reports 3(10), 183-183 (2003). XX DR [1] (Consensus) XX CC HARBINGER3N_DR is a nonautonomous Harbinger-like DNA transposon, CC a deletion derivate of HARBINGER3_DR. CC The consensus sequence was built based on 100 copies that are CC ~98% identical to each other. This transposon is characterized CC by the CWG target site duplications and 12-bp terminal CC inverted repeats. CC Protein machinery encoded by HARBINGER3_DR was involved in CC transpositions of HARBINGER3N_DR nonautonomous elements. CC The genome harbors ~1000 copies of HARBINGER3N_DR that are CC ~96% identical to the consensus. XX SQ Sequence 1144 BP; 296 A; 254 C; 236 G; 358 T; 0 other; gggcctcatg tatcaacgct gcgtacgcac aaaaactttg cgtacgccag gtttcacgct 60 cagaatcgct cacgtttgga tttactaaca atgaactgaa cgtgggaatg tgcgcagctt 120 cacggcagct ttctggctgg cgtacgcaca ttttttgtgc gtgtctgttt tatttccatt 180 ggcgactcct agaggcagtt gtgttaaatt cctctctaca aagtgtctga gccttgcaat 240 ggcagctgta tgagacgggt tcatctagca ggtatataag gtttccatac catacagttg 300 accagctaaa cattaaagca caatttgcag ccgtcgcctg ttttcccaat gtaatctgag 360 cgatctactg cacgcacatt gctataaaga cactatctga agatgaattt gcatgcgtga 420 atcagaaaca tttccattca ataaatgtgc aaataaaata tgatgcacaa acttattaat 480 gattcctact tgtctttctc gtgataaata gttggcaaaa tctgatatgt agcgggggaa 540 aaaagaagaa agagttcatc agacgctgga ttcgaaccga gttcatgctc gaacgtgtca 600 aaacatgttc acatgcgtct tacgagctgc gccactgaga ctgttaaggg tactgcaaca 660 ttttacacct ataaatcaca ctatttcttt tttaattcac tcagtgcgat gttcagaccc 720 aactgtgtta accgcatcag ctaaactctc ccactctatt tttttctttt gttgttaatt 780 ccggagaaca aacttgcaaa taacaccgct tttctccggt ctacctccga aagcagcacc 840 tccaattcac attctgttca aagtttctct ttttgcttgc ttttgccatt gctttttcgt 900 tgggttttgc cattagcata gtcattagca tattcatacg ggggaggagg cagggagggg 960 ttttgtgctc gtgcatgttg cgctcagttt cacgttcatt cggatgtaca aaagaatatg 1020 cgtgagattc ggcgtacgca gtgtttcata catctgaatt tttttctgcg tacgcacatt 1080 tacagctttg tgcgtacgca atgttttagt aagatttcca cgcaagtctt cgtacatgag 1140 gccc 1144 // ID hAT-N45_DR repbase; DNA; ZEB; 936 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N45_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-936 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1314-1314 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 936 BP; 255 A; 194 C; 256 G; 231 T; 0 other; cagtggcgta gcgcaaaatg cggaggcccc cctgcaggga tcgctgacgg ggccccctga 60 tgaagagggg gggggggggg ggatacgtca tacgtcaatt tgcatatcac tgacgtcatc 120 acgttctacc gtttctgttt gtttaacagc ctatggttaa taaaggggta tttataattg 180 cactacactt tatataagca tgtttattta actaaattta ttgctgtttg aatacagtat 240 gtcattatag atgtgtagtg aatgtgcagt aatctctcat atgtaataaa ctcagacaag 300 ttcagaaact gtcactaaaa tatctgtgat tgtgaacaag aaaagaataa acggtccaat 360 taaattgtta aaataaagta gaagtagatc ggtttgactg tacaagggaa ataccttcac 420 actgcctgtg ctaaatcatt tgtcgtcggt acgcagaggg gtgatctgct ctcgggctgc 480 aggtgggaga ggctgctgta cgaggactga ctgggccgcc tgtcagtgct gtgaggcggg 540 ggaggggccg gcggcatcac acgcagctcg ttgagaagag tagggacggt acagtatgga 600 caaatgacag tctaaaaaaa atctccaata acacacaaaa aaagtcgcta gatttgtcgc 660 taggggggtc tgaaaagtcg ctaaatctag tgacaaagtc gctaagttgg caacattggc 720 ggcaagcaat cagaatgaag tagtccaccg cttgagaggt gttcagagaa cacagacctg 780 tgaactttgg ttccgaccac agttgttccc aagagtttga ttattgcggt tgccggtgga 840 ttttcaatta ttgaaaatcg cgacgacgcg gggcccccta atcacgcggg gcccccccgc 900 ggtgcgtgcc ctgcgggccc gtccgctacg ccactg 936 // ID Gypsy-22-I_DR repbase; DNA; ZEB; 5713 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portion of the Gypsy-22_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-22-I_DR; Gypsy-22-LTR_DR; Gypsy-22_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy-22-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5713 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-22_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 17-17 (2005). XX DR [1] (Consensus) XX CC Gypsy-22-I_DR is an internal portion of the Gypsy-22_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-22-LTR_DR. Gypsy-22_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1686-aa Gypsy-22_DR1p (pos. 605-5662) CC composed of the gag, protease, reverse transcriptase, and CC integrase domains. The consensus sequence was built from five CC copies less than 2% diverged from the consensus sequence. XX FH Key Location/Qualifiers FT CDS 605..5662 FT /product="Gypsy-22_DR1p" FT /translation="MAQFFSHSTSTYVDIDAPDAPTISTPVWSPPVQTQTL FT SGLPPHDQIMHNISPVHSFPSHATPVQTLPTALGGHILMQPLSLSQEHVSM FT QLCTSADVSSPSTMEQHDLPGNLPTPRREIQQVSSYVQGNLDNLMVTMKKQ FT EKCLHELTQKLKTSSSQHVNQITTLTAKIESNKQEIVTILTGAKQQEAADA FT DQLVKAVQLMLATEFQKFESTLTSAVVDKVEKLRRDVHHDLKSIQQTLQGS FT LDQLTTNLQQCEEKISKCQTCVTQLKKDLQVHNVKDVEPQTETKAAPVTST FT LSTETVSTLPNTMVKSDHLKLTFPTFGRHTDDTDPLLYLTKCQDFLALHPL FT TDADLLATFRTVLYGTARDWWEVSRSNIATWKEFESAFLSAFLSEDYEDEL FT AERVRTRVQGDRESIRDFAFTYRALCKRWKSTLTETEIVKMILKNIKPYLA FT SQLRSRVNTVEDLVKLGYLLERDYEEQRRYESRMAHKQASSQKSFSNRPVE FT KQPIQCWRCSGPHPPGNCPMYLTPPSQQSSTQHHPNHGKSFHAAKSGGRPT FT NIIVAASETPQSTKEVPNVFLPSTTMSSLAIPQQLVVPISIGSWFGKAILD FT TGASYTLIHESLMQHFDTSAQLQNWSSGPLYLANGKAEIPLGWLNITIQIH FT GKSFVVPAVVLPSQALAYAIILGLDFIFFSGLKIHVSERKYSFTSDPTEEH FT PFQPGYASEPLVKMTPMTEKKTLRKNKLNLTLLSAVPPPQTSLGMLQTDHV FT DDATQIWNAVSEAQLPKEEKQQLLQILQNNPRVCTQRTGKTKLLQHRIYTT FT SQVPIKQKPYRLSPVKQQVMEEQLEQMLREGIVEPSHSSWASPVVLVPKKN FT GKLRFCVDYRKVNAITENDAYPLPNITEILESLSGSTIFSTIDLNSGYWQV FT MMDPDSKAKTAFIVSDGLYQFNVMPFGLKNAPATFQRLMETVLGELRRKIC FT LVYIDDIIVYSPSVTQHFCDLQTILHRLEAAGLTINLEKCKFFLPEITFLG FT HVVNAKGITADPSKVEAILSFPTPNNLKEVQRFLGLAGWYHRFVQNFSKIA FT EPLNALKKKGQVFKWTAQCQQSFDQLRSCLTSPPILGHPDLKIPFIVYTDA FT SDTGLGAILTQRKDPGSEEVIAYASRTLTGAEVNYTATEKECLAVVWALEK FT WQHYLEYKLFTVVTDHSALQWVMGSTKTNSRLIRWVLRLQKFNFIIEYRKG FT KLNVAPDALSRSPLTTISPVTAVYTKQQTDQHTELPVSDVVLWEEQHSDEE FT TTKLLQAVAEEPNQLEQYEVIEDKLYHKTYLKNDQVHYRVYVPNRLRPTLL FT HHYHSHPLSGHHGIYKTYKRIQAVAFWPGLWTDVKRHVKECVKCQTIKYDN FT QKPAGKLQSTITSRPNQMLGVDIMGPLPRSTQQNEYLLVFVDYYSKWVEFF FT PMRQANAQSVAVIFRREILTRWGVPDFILSDRGTQFISSVFKNVCEKWGVT FT QKLTTAYHPQTNMTERVNRTVKSMIASYVDDNHSKWDQFLPEMRFAMNTAI FT QETTGVTPAELQIGRKLHGPMDKILHGQNLIPDNTSYDVVCHIQQLKSQVQ FT ENCRRAQQRQLRNYNKKRREAGFKNKDRVWLRNFPQSSAQHKFSAKLAPKW FT KGPYRVLKQLGPLNYRIALEETGEDVRTVHVCNLKECFPTAEELEVQEKKR FT LRELFEETSEEEEFFGF" XX SQ Sequence 5713 BP; 1818 A; 1219 C; 1139 G; 1537 T; 0 other; gaaatggcac ccgaacaggg acatgaacac tttaagggac atgaacactt taagggacat 60 tgaacacttt aagggacttg aataattgaa actttcattg attgacttga ataactgaaa 120 atttctttga ctgactgact gacctgaatt actggacagg gttttgtttg tgaatgttta 180 atagtttaaa attgtttttt tcttgctgaa tgtgagaact attgacaaaa atcatcaaat 240 aacagaaatt gtgtaaaaaa aaaaaaaaaa aaaaacttgt catgtatgtg gtttttggta 300 taatgatttg agtttattgt gaaaatttgc ttgagcaacc agtgtaagct gaatttattt 360 tttgtgtcaa agtgtgtggg aacttgaact agaaaatgtg atcgtcatct gtcttgtatt 420 tttgtacatg catttgaaag gtttctctta cattgataat atggacatga tgtaattgat 480 tgcagagaac agtttctttg tgagtatttt ttgttttatt ggggtttttt tttatttatt 540 tatttttttt cctgctgtcc caaacacaca caaatcctaa tcacacccaa caccaacaca 600 agtcatggcc cagttctttt cccattcaac gtcaacatat gttgatattg atgcacctga 660 tgccccaaca atttccacgc cagtgtggtc acctccagtg caaactcaaa cactttctgg 720 acttccacct catgaccaaa ttatgcataa tatttcaccg gtccatagtt tcccaagcca 780 tgcaactcca gtacagacac ttcctactgc tctgggaggc catattctaa tgcaacctct 840 ttctttgtcc caggaacatg ttagcatgca actgtgcacg tctgcagatg tttcatcacc 900 ttctacaatg gaacagcatg atctgcctgg gaaccttcca acccctagga gagaaattca 960 gcaggtcagt tcttatgtgc aaggtaattt ggacaatctg atggtaacaa tgaaaaaaca 1020 agagaaatgt ttgcatgaac ttactcaaaa gctgaaaact tcatcatccc agcatgtgaa 1080 tcaaatcacc acccttacag ccaaaataga gtccaataag caagagattg tcaccatact 1140 cactggagct aagcaacaag aggctgcaga tgctgaccag ttggtcaaag ctgtacaatt 1200 gatgcttgca actgagtttc agaaatttga atcaaccctt acctcagcag ttgttgacaa 1260 agtggaaaaa ctccggagag atgtccacca tgatctcaaa tcaatccagc aaaccctcca 1320 gggaagtttg gatcagctta ctacaaatct tcagcaatgt gaagaaaaaa ttagtaaatg 1380 tcagacctgt gttacacagt tgaaaaagga tttacaagtg cacaatgtga aagatgttga 1440 acctcaaaca gaaacaaaag ctgctccagt tacaagtaca ctctccactg agactgtttc 1500 aactttacca aataccatgg tcaaaagtga tcaccttaaa ttaacttttc ccacctttgg 1560 acgacataca gatgatactg accctttatt gtacctaaca aaatgtcaag atttcctggc 1620 acttcatcct ttgacagatg cagatctttt ggctaccttt cgcactgtct tatatggcac 1680 agctcgggac tggtgggaag tgagtcgctc caatattgcg acttggaagg aatttgagtc 1740 tgcatttctt tctgcattcc tgtctgaaga ctacgaagat gagcttgctg agcgtgttcg 1800 tactagagtt caaggagaca gagagtcaat tcgagatttt gcatttactt atcgagcact 1860 ttgtaaaaga tggaaatcca ctctaacaga gactgaaatt gtaaaaatga tcctaaaaaa 1920 tatcaaacca taccttgcca gccaactgcg cagtagagtg aacaccgtgg aggatctagt 1980 taaattggga tatctattgg aacgagacta tgaagaacaa agacggtatg aaagtcgaat 2040 ggctcacaaa caagcaagtt cacaaaaatc tttctccaat cgacctgttg agaaacagcc 2100 tattcagtgt tggaggtgca gtggtccaca tccaccggga aattgcccaa tgtatttaac 2160 tccaccttcc cagcaatctt ccacgcaaca tcacccaaac catggaaaga gttttcatgc 2220 tgcaaagtca ggaggtcgac ctacgaacat cattgtagca gcctcagaaa caccccaatc 2280 aacaaaagaa gttccaaatg ttttccttcc atctacaact atgtcatctc tggccattcc 2340 acaacaatta gttgtcccaa ttagtattgg atcatggttt ggaaaagcca tactggacac 2400 tggagcaagc tacacgctaa tacatgaaag tctaatgcag cattttgata cctctgccca 2460 gctacaaaac tggtcgagtg gacctcttta cttggctaat ggaaaagcgg agataccctt 2520 aggatggtta aacatcacta ttcaaataca tggtaaatcc tttgtagtac ctgctgttgt 2580 cctcccatct caagctcttg catatgccat catcttgggt ttggacttca tattctttag 2640 tggtctgaaa attcatgtta gtgaacgcaa gtattctttt acgtctgatc ctactgaaga 2700 acacccattt caacctggat atgcaagtga acctctagtt aaaatgacac ccatgacaga 2760 aaaaaagacg ctcagaaaga acaaactcaa tctcaccttg ttaagtgctg tccctccacc 2820 tcaaacctca ttgggtatgc tacaaactga tcatgtcgat gatgcaacac agatctggaa 2880 tgctgtaagt gaagcacagc ttcccaaaga agaaaagcaa cagttactac agatcctgca 2940 gaataacccc agggtatgta ctcaaagaac tgggaaaacc aagttacttc aacaccgtat 3000 ctacaccacc agtcaggtac ctatcaaaca aaagccatat cgtttgtctc ctgtaaaaca 3060 acaggtgatg gaggagcaat tggaacaaat gttgagagaa ggtattgtag aaccatcaca 3120 ttcttcttgg gcttcaccgg tggtgttggt tcctaagaaa aatggcaagt taagattctg 3180 tgtggactac cgcaaagtaa atgcaataac ggaaaatgat gcttaccctc ttccaaacat 3240 cacagagata ctagaatctc tctctggatc aacaattttt tctaccatcg atttaaacag 3300 tgggtattgg caagtgatga tggatcctga cagcaaagca aagactgctt ttattgtgtc 3360 tgatgggcta tatcaattta atgttatgcc ttttggatta aaaaatgcac ctgctacatt 3420 ccagaggtta atggagaccg tactagggga actaagacga aagatatgcc ttgtctacat 3480 tgatgacata attgtgtatt ctccctcagt gacccaacac ttctgtgatc tgcaaaccat 3540 cctccacagg ctagaagctg ctggactgac catcaacctg gaaaaatgca agtttttcct 3600 accagagatc acgtttctag ggcatgtggt aaatgctaaa ggtatcacgg cagatccgag 3660 caaagttgag gccattctct cttttcctac acccaacaat ctgaaggaag ttcagcgatt 3720 cctgggacta gccggctggt accaccggtt tgtgcaaaac ttttcaaaaa ttgctgagcc 3780 cctcaatgcc ctgaagaaaa aaggacaggt gtttaaatgg acagcacagt gtcaacaatc 3840 atttgaccag ttaagatcat gccttacctc acctcccatt cttggccacc ctgacctcaa 3900 aatacctttc attgtataca ctgatgccag cgatacagga ctgggtgcta ttctcactca 3960 acgaaaggat ccaggtagtg aagaggtcat tgcctatgcc agtcgcacct taactggggc 4020 tgaggtaaat tacactgcaa cagaaaaaga atgtttagct gttgtttggg cattggagaa 4080 gtggcaacat tacctagagt acaagctctt cacagtagta acggaccatt ctgctcttca 4140 gtgggtgatg ggatccacaa aaactaacag ccgccttatt cgatgggtct tgagactgca 4200 gaaattcaac ttcatcattg aataccgaaa agggaagtta aatgtagctc ctgatgcttt 4260 gtccagatca cctctcacta caatttctcc tgttacagca gtttacacga agcagcaaac 4320 agaccagcac acagagcttc cagtttctga tgttgtccta tgggaggaac aacattcaga 4380 tgaagaaaca acaaaactac tacaggctgt agcagaagaa ccaaatcaat tggaacaata 4440 tgaagtgata gaggacaagc tgtaccataa aacttacctg aagaatgacc aagtacatta 4500 tcgtgtgtat gttccaaacc gtcttcggcc gacactactt catcactatc attcacaccc 4560 gctaagtggc catcatggaa tatacaaaac ttataaacga atccaagcag ttgctttttg 4620 gccaggctta tggactgatg tgaagagaca cgttaaagaa tgtgtgaagt gtcaaacgat 4680 caagtatgat aatcagaaac cagcaggaaa acttcaatca accatcacct ctcgacctaa 4740 tcaaatgctt ggagttgata ttatgggacc cctacctcgg agtacacagc aaaacgagta 4800 tttgttggtg tttgttgact attactccaa atgggttgag ttctttccca tgcgtcaggc 4860 aaatgctcag agtgttgctg tgatttttag aagagaaatc ttaacccgtt ggggggttcc 4920 agacttcatt ctctctgatc gaggaacaca gttcatctca tctgtgttta agaatgtatg 4980 tgaaaagtgg ggagttacac agaagctaac aacggcttat caccctcaga ccaatatgac 5040 agaacgggtg aatagaactg tcaaaagtat gattgcgtca tatgtggatg acaaccacag 5100 caaatgggat cagttcttac ctgagatgag attcgcaatg aatactgcca tccaggaaac 5160 cacgggagta actccagcag aactgcagat tggaaggaaa ctgcatggac caatggataa 5220 aatacttcat ggtcagaatc ttatccccga taatacctct tatgatgtag tctgtcacat 5280 acaacaactg aagtcacaag ttcaagagaa ttgccgaagg gcacaacaac gccaactccg 5340 aaactacaac aaaaagagaa gagaggccgg tttcaaaaat aaagacagag tatggttgcg 5400 caatttccct cagtccagtg cacagcacaa gttcagtgcc aagctggcac caaaatggaa 5460 aggaccttac cgagttttaa aacaactggg acctttaaat tatcgtatag ccttggagga 5520 aactggagaa gatgtgcgca cagtacatgt gtgtaatctt aaagaatgtt tcccaacagc 5580 tgaggagttg gaagtccaag aaaaaaaacg acttcgggaa ttgtttgaag aaacctctga 5640 agaggaggaa ttttttggat tttgattgta tttcttttaa caaccatggg ttgttttgtc 5700 aaggggggga gaa 5713 // ID REX1-1_DR repbase; DNA; ZEB; 3590 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 16-NOV-2010 (Rel. 15.12, Last updated, Version 3) XX DE REX1-1_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW Rex1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; CR1 clade; AP endonuclease; KW REX1 subclade ORF2; REX1-1_DR. XX NM REX1-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3590 RA Kapitonov V.V. and Jurka J.; RT "REX1-1_DR, a family of CR1-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(5), 29-29 (2002). XX RN [2] RP 1-3590 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (16-NOV-2010). XX DR [1] (Consensus) XX CC REX1-1_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish a few million years ago. The consensus CC sequence encodes one protein, REX1-1_DR1p (position 723-3311). CC The 863-aa REX1-1_DR1p protein is composed of the AP endonuclease CC (positions 1-200) and reverse transcriptase. REX1-1_DR copies are CC ~9% divergent from the consensus sequence. Approximately 1000 CC copies of REX1-1_DR are present in the zebrafish genome. CC [2] Consensus update. XX FH Key Location/Qualifiers FT CDS 11..2971 FT /product="REX1-1_DR1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase." FT /translation="MAARAQREAQGLSSFCIFAVLFLLISGLFVQNSGAFT FT SYTRQELLDIGLCIPDSFISNLRLIHEIARTPRAGRPARPGGSARRPRRER FT KQRRGKRGGLRAKLKLTPHRLSLPSIFLANVRSLVNKMDEIRLRINHSKRL FT WNCNVMIFTETWLNSGIPDNAVSLTEHHTFRADRTADDSGKTRGGGLCIYI FT NKAWCTNSVVVGRHCSANLEFLMVKCRPFYLPREFTSTIISAAYIPPDADA FT KLAMNELHAAISKQQTAHPEAAFIVAGDFNHSNLKTVLPKFHQNILCHTRG FT NKTLDHVYTNIAEAYAVTPLPHLGQSDHLSLFLTPKYSPLIDRVKPSVRTI FT KVWPAGVDSTLQDRFQHTDWGMFASQATHGSHTDIDSYTSSVLEYINTTID FT SVTTQKQITTYPNQKPWMNKEVRLLLKARNTAFRSGDAQAYSTSRANLKRG FT IKKAKHCYKLKLEEHFSNSDPRRMWQGIQAITNYKPSQSTPTATDVSFLNE FT LNDFYARFDRDNKEPYTRITSSTDHSPITLTSSEVHTALSRINGRKAAGPD FT GIPGRVLKVCAEQLAGVFTDIFNLSLNLAAVPTCFKTTSIVPVPKHSSPTC FT LNDYRPVALTPIIMKCFERLVLAHLKDSLPSTLDPHQFAYRGNRSTDDAVS FT IALHSVLTHLDNKNTYARMLFVDFSSAFNTVIPSKLLIKLRDLDIDTSLCN FT WIMDFLTNRPQNVRSGHICSTTVTLNTGVPQGCVLSPFLYSLFTVDCRPVN FT RSNTIIKFADDTTVIGLISNNDETAYREEIQHLATWCTDNNLLLNTNKTKE FT LIVDFRKGRTGSHDPIHINGMAVEPVSSFKFLGTHISKDLSWTTNTSSLIK FT KAHQRLFFLRQLKKNQLSSAVLVNFYRCTIESILTNCVTVWYGSCSVAERK FT ALQRVVKTAQRITGTTLPAIEDIQKKHCLRRARSILKDTFHPAHRQFSLLP FT SGRRFRLPRTKTSRLRNSFFPRAVSLLNSAPH" XX SQ Sequence 3590 BP; 993 A; 1017 C; 698 G; 882 T; 0 other; tcgatccaag atggcggcgc gtgcacaacg cgaggctcag ggtctctcca gtttctgcat 60 ttttgcagta ttattcctgc tcatttcggg tctgttcgtg cagaacagtg gtgcctttac 120 atcgtacacc cgacaggagc ttttggatat tggtttgtgc attccggaca gttttattag 180 caatcttcga ctcatccatg agatcgccag aacaccccgg gctgggcgcc ctgcccggcc 240 gggcggaagt gctcgcagac cgcgcagaga acgtaaacaa agacggggga agcgcggcgg 300 gctaagagct aagctaaagc taacaccaca ccggctctct ttacccagca tctttctcgc 360 caatgtacgg tcactggtga acaaaatgga tgagattcga ctgcgcatca accacagcaa 420 aagattatgg aactgtaacg tcatgatttt cacagaaaca tggctaaaca gcgggatacc 480 agacaacgct gtatcgctaa cggagcacca cacattccga gcggacagaa cggcggatga 540 ctccggtaag accagaggcg gaggattatg catttatatt aacaaagctt ggtgcacaaa 600 ctctgtcgtc gttgggagac attgctctgc taacctggaa ttcctaatgg ttaaatgtag 660 acctttttac ctaccacggg agttcacatc caccataata tctgctgctt atattcctcc 720 cgacgctgat gcaaagctgg ctatgaacga acttcatgca gccatcagca aacaacagac 780 tgctcacccg gaggctgctt ttattgttgc gggggatttt aatcactcaa acttaaagac 840 agtgctccct aaattccatc aaaacattct ctgccacaca aggggaaaca aaacattgga 900 ccatgtttac acaaacatag ctgaagccta tgctgtgacc cccctccccc acctgggtca 960 gtcagatcac ctttctttgt tccttacccc caaatactca cccctcatcg accgtgtgaa 1020 gccatcagtg aggaccatca aagtgtggcc agcgggggta gactccacac tccaggacag 1080 gtttcaacac acagactggg gtatgtttgc ttcccaggcc acacatggct ctcacacaga 1140 cattgacagt tacacttcct ctgttctgga atatatcaac accaccatag acagtgttac 1200 aacccagaaa caaatcacca catacccgaa tcagaagcca tggatgaaca aggaggtgcg 1260 cctcctgctg aaggcacgca acactgcctt cagatcaggg gatgcacagg cctacagcac 1320 ttccagggct aatctgaaga ggggcatcaa aaaggccaag cactgctaca agcttaagct 1380 agaggagcac ttttccaact ctgatcctcg gcgcatgtgg cagggcatcc aggccatcac 1440 caactacaaa cccagccagt ccacacccac agccacagac gtctccttcc tgaacgagct 1500 aaatgacttt tatgcccgct ttgatagaga caataaagaa ccctacacca ggatcacttc 1560 ctccaccgac cactcaccta tcacactcac ctcctcagaa gtacacaccg cactgagtcg 1620 tatcaatggg cgtaaggctg ctggaccaga cggtattcct gggcgtgtcc tcaaagtatg 1680 tgcagaacag ctcgctgggg tattcacaga cattttcaac ctgtcgctca acctagcagc 1740 tgtgccaaca tgctttaaaa ccacctctat tgtgccagtg cccaaacact ccagcccaac 1800 atgcctgaat gactaccgcc ctgtagcact cacacccatc atcatgaagt gcttcgagcg 1860 gttggtcctg gcacatctaa aagactctct gccatccaca ctggacccac atcagtttgc 1920 ctaccgtggc aacaggagca cagatgatgc agtctctata gcactgcact ctgtactcac 1980 acacctggac aataaaaaca cttatgcacg aatgctgttt gtggacttca gctcagcatt 2040 caacactgtc ataccctcca agttactgat caaactaagg gacctggata tcgacacatc 2100 actctgcaac tggattatgg actttctgac taacagacct cagaatgtta gatcaggcca 2160 catctgctcc accaccgtca cactcaacac tggtgtacca cagggctgtg tgctgagccc 2220 cttcctctac tcccttttta ccgttgactg taggcctgtt aatagatcca acaccatcat 2280 caaatttgca gatgacacca cagtgattgg tctaatcagc aacaatgatg agacggccta 2340 cagggaggag atacagcatc tggccacttg gtgcaccgac aataatctgc tccttaacac 2400 caacaagacc aaggagctca ttgtggactt cagaaaggga cgaacaggct cacatgatcc 2460 catccacatc aatgggatgg ccgttgagcc tgtctcatcc tttaagttcc tggggaccca 2520 catctcaaag gacctttcct ggaccaccaa cacctccagc ctgatcaaga aggctcacca 2580 gcgcctattc ttcttaaggc aactgaagaa gaaccagctt tcatcagccg tcttggtgaa 2640 cttctaccgc tgcacaatag agagcatcct gacaaactgt gtcacagtct ggtatggaag 2700 ctgctctgtt gctgagcgta aggcactgca gcgggtggtg aaaactgccc aacgcatcac 2760 agggaccaca ctgccagcca tagaggacat ccagaagaaa cactgtttgc gccgagcacg 2820 cagtattctg aaggacacct ttcaccccgc tcacagacag ttttctctcc tgccctccgg 2880 caggcgcttc aggctccccc ggacaaaaac cagcagactg aggaacagct ttttccccag 2940 agctgtctcc ctcctgaact ctgccccaca ctgactcttt tgccccccca atacaccccc 3000 cactctcctc taacttaaac tcctcgcaat aactgcactg tttaacattt gcacatttaa 3060 aatttgcaca ttcactgcac cacattgaac tgtttactta ttgaactgta catacccact 3120 gcatatgggc atttgtaatt atgtacacac ccactataca tatacacttg taatcatgct 3180 tatttatctg catactactg attattaata gcaacctgta catatattca tatattgtaa 3240 catatacact tgtaatcatg tttatctatc cctgtacata tattcatata ttgtaacata 3300 tacacttgta atcatgttta tctatctgca cactactgat tattaatagc aacctgcaca 3360 tatattcata tattgtaaat ctgttcatag cttatccaac ctgtatataa tgttcatagt 3420 acatccatct gtaaatatca ccatagtttt ctataactgc actttataac ttattcctgt 3480 atcctgcact tgctgctatt gcactgctgg ttagacctaa actgcatttc gttgcattgt 3540 acttgtacat gtgtaatgac aataaagttg aatctaatct aatctaatct 3590 // ID DNA-AT-4_DR repbase; DNA; ZEB; 951 BP. XX AC . XX DT 11-MAR-2009 (Rel. 14.03, Created) DT 11-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE non-autonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD AT; KW DNA-AT-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-951 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(3), 687-687 (2009). XX DR [1] (Consensus) XX SQ Sequence 951 BP; 275 A; 204 C; 188 G; 284 T; 0 other; agacctttta cgcacttccg catttttcga acggaaatac gtcaatgtag tgcattagaa 60 cttcctgtta tcatagcggc agatacgaaa aatggcagag tcgaagaatc gcagtttctg 120 ttgtgttcca ggctgttcgt cttctaacca aaaacagccc taccttttat ttcattcttt 180 tcctgtggat ccaaacctga aacccaaatg gatacaggcg atctgaagag aggaagggca 240 tagttttaat gtaaaaacag gtagcaccta tgtctgcagc cggcacttcg ctccggatga 300 ttacagtggt ggctgcgtcg ttcgtcgcct gaagagtggt gttgtcccga gcctattccc 360 ttggaacaac tttacagctc ctttgagaag ggagtcagta tatgacagaa cctacaaacg 420 tcagtctaaa cagctatgct gtgaagatag cgacatggtg gctaaggcag caaggatgga 480 tcacgactac gtgacacacc cacctgcatt tatcccacct gcagttaaga ttgttcatag 540 tgtatttagt taacgctagt cacctgtgtg taccgttatg catatctacc tgaacaaatt 600 aatgttaata atataagctg ttcacgaaac tcaactaata aaataaaata tatatttttt 660 ttaataatga aacaatttat tgctgtgaca tctagtaaat acacaaagtt ttatcatact 720 agcatgtgtg gaagtattta aaacaagtgt gttacaactg gccctgtgtt aggttctgcc 780 ccgctctccc ctattttaat tgcatgtatt taattactta tttaattaac ttcggaagac 840 aactcaggct cgccgaaaac actctagctg ccttttctcc gccaaaaaaa ctcataccgg 900 aactgggttt acacgttgtt gacgtacttc cggtttttgc gtaaaaggtc t 951 // ID hAT-N66_DR repbase; DNA; ZEB; 195 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N66_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-195 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1335-1335 (2008). XX DR [1] (Consensus) XX CC TIR is 72-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 195 BP; 43 A; 37 C; 49 G; 66 T; 0 other; cagtgtttcc caaccctgtt cctggaggca caccaacagt acatattttg gatgtctccc 60 ttatctgacc cattcatttc aggttttgga gtctcttcta atgttctgat gagttgattc 120 aggtgtgttt gattaggaag aggttgaaaa tgtgtactgt tggtgtgcct tcaggaacag 180 ggttgggaaa cactg 195 // ID DNA-6-N8_DR repbase; DNA; ZEB; 2308 BP. XX AC . XX DT 15-DEC-2008 (Rel. 13.12, Created) DT 15-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2308 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2138-2138 (2008). XX DR [1] (Consensus) XX CC Copies are ~83% identical to the consensus. TIR is 150-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2308 BP; 1114 A; 470 C; 101 G; 602 T; 21 other; tatggagcca atcagtctca aaaataatac atttacwaaa tataattata catccacaat 60 gcaattcaca tttacaaaat ccaattcgta aattcaaaac acaattcgta aatacacaaa 120 wccaattcrt aaattcaaaa cayaattcat aaatacacaa atccaattca taaattcaaa 180 acacaattca aaaatacaca aatccaattc ataaattcaa aacacrattc ataaatacac 240 aaacccaatt cgtaaattca aaacacgatt cawaaataca aatccaattc gtaaattcaa 300 aacatgatts aaaaataaac aaatccaatt cataaattca aaacacgatt cataaataca 360 caaatccaat tcgtaaattc aaaacacgat tcataaatac acaaatccaa ttcataaatt 420 caaaacacaa ttcataaatr cacaaatcca attcgtaaat tcaaaacaca attwataaat 480 acacaaatcc aattcataaa ttcaaaacac aattcataaa tacrcaaatc caattcataa 540 attcaaaaca taattcataa atacrcaaat ccaattcrta aattcaaaac acaattcata 600 aatacacaaa tccaattcgt aaattcaaaa cacrattaaa atataaacaa atccaattca 660 taaattcaaa acacgattca taaatacaca aatccaattc gtaaattcaa aacacaattc 720 ataaatamac aaatccaatt cgtaaattca aaacacaatt cataaataca caaatccaat 780 tcgtaaattc aaaacacaat tcataaataa acaaatccaa ttcgtaaatt caaaacacaa 840 ttcataaata cacaaatcca attcataaat tcaaaacaca attcataaat acacaaatcc 900 aattcgtaaa ttcaaaacac aattcataaa tacacaaatc caattcgtaa attcaaaaca 960 caattcataa atacacaaat ccaattcgta aattcaaaac acgattcata aatacacaaa 1020 tccaattcgt aaattcaaaa cacrattcat aaatacacaa atccaattcg taaattcaaa 1080 acacrattca waaatacaca aatccaattc gtaaattcaa aacacgattc ataaatacac 1140 aaatccaatt cgtaaattca aaacacgatt cataaataca caaatccaat tcgtaaattc 1200 aaaacacgat tcataaatac acaaatccaa ttcgtaaatt caaaacacga ttcataaata 1260 cacaaatcca attcgtaaat tcaaaacacr attcataaat acacaaatcc aattcgtaaa 1320 ttcaaaacac gattcataaa tacacaaatc caattcgtaa attcaaaaca caattcataa 1380 aatacacaaa tccaattcgt aaattcaaaa cacgattcat aaatacacaa atccaattcg 1440 taaattcaaa acacgattca taaatacaca aatccaattc gtaaattcaa aacacgattc 1500 ataaatacac aaatccaatt cgtaaattca aaacacgatt cataaataca caaatccaat 1560 tcgtaaattc aaaacacgat tcataaatac acaaatccaa ttcgtaaatt caaaacacga 1620 ttcataaata cacaaatcca attcgtaaat tcaaaacaca attcataaat acacaaatcc 1680 aattcgtaaa ttcaaaacac gattcataaa tacacaaatc caattcgtaa attcaaaaca 1740 cgattcataa atacacaaat ccaattcgta aattcaaaac acaattcata aatacacaaa 1800 tccaattcgt aaattcaaaa cacaattcrt aaatacacaa atccaattcg taaattcaaa 1860 acacgattcr taaatacaca aatccaattc gtaaattcaa aacacgattc ataaatacac 1920 aaatccaatt cgtaaattca aaacacgatt cataaataca caaatccaat tcgtaaattc 1980 aaaacacaat tcataaatac acaaatccaa ttcgtaaatt caaaacacga ttcataaata 2040 cacaaatcca attcgtaaat tcaaaacacr attcataaat acacaaatcc aattcgtaaa 2100 ttcaaaacac aattcataat atacacaaat ccaattcatt tggcgaaaaa tatttatgat 2160 ttttacttgt gaattcacaa attgtgtgtt gtatttatga attttgaatt tacgaattgg 2220 atttgtattt tgtaaatgtg aattgtgctg tgcatgtatg aattttattt tgtaaatgta 2280 ttatttttga gactgatctg gctccata 2308 // ID Gypsy60-I_DR repbase; DNA; ZEB; 6772 BP. XX AC chr15; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy60-I_DR; KW Gypsy60-LTR_DR; Gypsy60_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6772 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1435-1435 (2008). XX DR Genome; chr15; Positions 21148163 21141392. XX CC Positions [2902-3441] - Reverse transcriptase CC Positions [4924-5400] - Integrase core CC 'TAAG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 248..1672 FT /product="Gypsy60-I_DR_2p" FT /translation="MGEKFVDRFNISSPISPFIPRGRGVITNSACKDGPTI FT NRMGLSHVTSDYVQGVGRGLFVHDPTAQHSVGNTSTTHNTSCDRQILDCLT FT DMFGRLGEQISDAVAAKIIENSTMSVNRESHVNRGVNSDSFTKQDVSRVTV FT HVTSDRDPVIFRGDGSDKFPVHEWIEMTKTLLFKQKCAVENQADEIMSRLM FT GRAGDIVRISLRSDTSLNVKSDPEIIYSILLKYFSIAPSCLPLADFYSTLP FT KVGESAVDYWIRVNKAADLADEGLRRQGRVMDDISAEVARMFVKYCPDPVL FT SDFFKHKPISEWTAKEVQGRLDEYQREQRSSLTSASSLDMRNVDVLPCDNL FT LPMSRSQETNASFAHYTSSPPVSDGVQDRVHSTCCPERAVRQSHDQGGNSN FT EHILNRMMGMLEQVVERVHQSSNVQPERNLIPRYRGRVRGCGVCGNTSHST FT KSHCLRDRLCFSCFSPGHAQSACPQSQSGN" FT CDS 1687..6357 FT /product="Gypsy60-I_DR_1p" FT /translation="MEGGCAGLDSNSQNVVDVANAEKIFMSAKESCCSENV FT VFQNTNAICSSDSLFYTSVMVDDKVALNGMLDSGSMACTMNESAEQKLLEA FT GIINDQNGDSPDVLLIGCGGSQVKPKRTVTVQLEIYGCKMLVPTLVVSGQH FT DEFIVGTNVIKHIIRCSKQCDLFWRTVSTPLCCNDPMAETYLSMLAGLNRW FT MGDEIPDKIGTVRCNSAVCLEPGREYLVWGRLPKHVVVSPGSAVVTEPTSS FT RSAPRGILVARLVTHLWGDRWVPLKLINTWNKPVMLKRNAKIADLFPGVAL FT EDLDDGALYSQIQKTQPVDSVSLGSSCIGEKLKSVGLEGLDLDSCDVSDSC FT RKKLAEMVIQYNDVFSRHHLDCGNATEFVHRIHLSDTKPFRLPYRRVAPSE FT YQVLRRVLTEMEEKEIIRKSTSEFASPLVLVWKKNGDLRVCTDFRWLNKRT FT LKDAHPLPHQADCLAALGGNSLFSTMDLTSGFYNMPLHEDDRKYSAFTTPM FT GLYEYNRLPQGLCNSPGSFMRMMTSIFGDQNFLSLLCYLDDLIVFGPDEIT FT ALDRLEMVFSRLRKHNLKLAPKKCNFLRKSVQFLGHVVDEFGVSTDPSKVD FT SISKMTGLDLMEADGVTPSQKRIRSFLGMVNYYQHFIPNYSSVAKPLFDLL FT KGQKQKIKRGRANKRVCVPRKLSVTDWTSEVQLAFEHLKLAMVQAVVLAHP FT DFSRPFVLSTDASSDGIGAVLSQVQPGESRARPIAFASKSLSPSQRNYPAH FT RLEFLALKWAVCEKFSHWLKGHVFTVWTDNNPLTHILSKPKLDCCEQRWVA FT KLAAYNFDIKYIPGSQNVVADALSRVPFARSNIGSRLLSESYDSLLNEVQG FT LSAGLVQDAFRWSCGSEGKPNTPCGGLVHVNSNSVAGTCLDKEEVVAVLDS FT QEAWDEGARLRALSILEHLPQIVSEPESQLIYSERDLREKQLSDSVLSRVI FT FYVERHRKPSRRERAKESVGVVRYLKHWEKFVMKDGILYRISHDQLSKGKR FT FQYVVPQSLQLEVLRGIHDQAGHQGQSRTLGLARRRFFWLSLDRDVREHVR FT HCQRCVVSKTPEPEGRAPLESIVTSRPQQLVCIDFWSAEDHNNRSVDVLVI FT TDHFTRLAQAYQCNDQSARQVARVLWDKYFCIFGFPERIHSDQGACFEGQL FT ISELLRVSGVQKSRTTPYHPMGNGSVERFNRTLGGMIRALAPEAKRNWPRQ FT LQSLTFLYNCTVHETTGYAPFYLMFGRVPRLPVDLIFRSVLEDHTVTCYDK FT YIGSLLKDLEVALSIAQAHAKKEQHRHTVLYNRRVKGQDIEIGDRVLLANK FT AARGKRKLADKWASTIFLVVDKNVDTHIFKIRDVNTDQVKVVHRNLLLPVN FT FLPLDKAEDEGSDSVPSLVVADSCEDADNSLTGVIVDRRDPELYVRDKIAQ FT SSEISVSTGNNDGLSLYGNVSSASADSHGRTSDWVSGLPKSCSQLGSTVSL FT ISLSVSQENDCSDKLSTIGTTDFAAASPAHSVISIPLSDVSNDDDGLSVSD FT SRTFIVPRSRFGRILKPVNRLLCTMSGQNVVSMQNGRVLTV" XX SQ Sequence 6772 BP; 1754 A; 1268 C; 1671 G; 2079 T; 0 other; ttggtgccgt gactcgtcag atcacctgat caacttgatg aagatcgccg agaggttgct 60 gctttagtcc ctgtctgaca taatttgatt tatgaacatt tggatgcatc tttttgcaat 120 ttaagacttt ctaaaaaaat caaaaaaaaa aaaaaacaag ggaaatttgc aacaattttg 180 ttttattttg tatctactat tgttcttttt ttttgcgaac tgacattgtt tcattaactt 240 taaaataatg ggtgagaaat ttgtagatcg tttcaatatt agttctccta ttagcccttt 300 tattccacgt gggaggggtg tcattactaa cagtgcttgc aaagatggac ctactatcaa 360 taggatgggc ctttcccatg tgacatctga ttatgttcaa ggtgtgggtc gtgggctttt 420 cgttcatgat ccgactgcgc agcacagtgt tggaaatacg tctacgacac acaataccag 480 ttgtgacaga caaattttag attgtctaac tgatatgttt gggcgactgg gtgaacaaat 540 tagtgatgct gtcgctgcca aaataataga gaacagtaca atgtctgtga atcgagagtc 600 tcatgtcaac agaggggtaa attctgacag ctttaccaag caagatgttt cccgtgtaac 660 agttcatgtc acttctgata gagaccctgt gatttttaga ggagacggaa gtgataaatt 720 cccggtgcat gaatggattg aaatgaccaa aacactttta ttcaagcaga agtgtgctgt 780 tgaaaatcag gcggatgaaa taatgtcacg cttaatgggc agagcgggag atattgtgcg 840 gattagttta aggagcgaca catctcttaa cgtgaagagt gaccctgaaa taatttactc 900 tatcctgctg aaatatttta gtattgcacc atcttgcctc cctttggcag acttttattc 960 aactctgcct aaggtgggtg agtctgcagt ggattactgg atcagagtca ataaagcggc 1020 agatttagca gatgaaggct tgcgtagaca gggtcgtgtt atggatgata tcagtgcaga 1080 ggtggctcga atgtttgtaa aatactgccc tgatccagta ttatctgatt tttttaagca 1140 caagccgatc agtgaatgga ctgcaaaaga agttcagggg agattggatg agtatcagag 1200 agagcagcgc tcgtctctga catctgccag ctctcttgac atgagaaatg ttgatgtgtt 1260 accatgtgac aatcttttgc cgatgtctcg ctcgcaggaa acgaatgcta gttttgcgca 1320 ctacacgagt tcccctcctg tgtctgatgg tgtgcaagac cgggtgcaca gtacttgctg 1380 tccagaaaga gcagttcgac agtcacatga ccagggggga aattcgaatg aacatatcct 1440 gaatcgtatg atgggtatgc tggaacaggt ggtggaaaga gtgcaccaga gtagcaacgt 1500 ccaacctgaa aggaatttaa ttcccaggta cagaggtagg gtaaggggct gtggagtttg 1560 cggtaatact tcgcattcca ctaagtctca ctgcctgaga gacaggttgt gttttagctg 1620 cttttcacct ggccatgcac agtcagcctg tccgcagtct cagtcgggaa actgaccaac 1680 ctgtatatgg agggaggttg tgcaggttta gatagtaact cccaaaatgt cgtagatgtt 1740 gcaaatgctg aaaaaatttt catgtctgcc aaagagtctt gttgtagtga gaatgtagtg 1800 tttcagaaca ccaatgctat ctgcagcagt gacagtttgt tctacacttc agtcatggtg 1860 gatgataagg tagccttgaa tggtatgctt gacagtggtt ctatggcgtg taccatgaat 1920 gaaagtgcag agcaaaaact gttggaagct ggtataatca atgatcaaaa tggagatagc 1980 ccagatgtgc tattaatagg ttgtggtggt agccaggtaa aacctaaacg tactgtgaca 2040 gttcaacttg agatttatgg atgtaaaatg ctggtaccta ccctggttgt cagtgggcag 2100 cacgacgaat tcatcgttgg tacgaatgtc attaagcaca ttatacggtg ttctaagcaa 2160 tgtgacttgt tttggaggac agtttcaaca cctctttgtt gtaatgatcc tatggctgag 2220 acttaccttt ctatgcttgc aggcttgaat cgttggatgg gtgacgaaat tcctgacaaa 2280 ataggaactg tgagatgtaa ttcagcagtt tgtttagagc ctggccgtga gtaccttgta 2340 tggggaagac tgcctaagca tgtagtggtg tcgccgggta gtgctgttgt gactgagccc 2400 acaagttctc gttctgcacc taggggcatt ctggtagcca gacttgtcac acatctatgg 2460 ggagacaggt gggttccatt gaagttaatc aatacatgga ataaacctgt tatgctaaag 2520 cgtaatgcca agatagctga tttgtttcca ggtgttgcac tggaggattt ggatgatgga 2580 gcactatatt cacagattca gaaaactcag ccagtggata gtgtctcctt aggttcgagc 2640 tgtattggag agaaactgaa atctgttggt ctggaagggc ttgacctaga ttcctgtgat 2700 gtgtctgatt cctgtaggaa gaagttggct gagatggtca tacagtacaa tgatgtgttt 2760 tcacgccatc atttagattg tggtaatgct actgaatttg ttcacaggat ccatctttca 2820 gacacaaaac ctttccgtct tccatatcgg cgcgttgcac ccagcgaata tcaagtgttg 2880 cgtcgcgtgt tgactgaaat ggaggagaag gagattattc gcaagtcgac cagtgagttt 2940 gcatctcctt tagtcctcgt ctggaaaaag aatggggatt tacgtgtctg tacagacttt 3000 cgatggctaa acaagagaac actgaaagat gctcatccac ttccccacca ggccgattgt 3060 ttagctgcat taggggggaa ttcattgttt agtacaatgg acctgacctc tggcttttac 3120 aacatgccgt tgcatgagga tgaccgaaag tactctgcct ttacaacacc tatgggttta 3180 tatgagtaca atcggcttcc tcaaggcctt tgtaacagcc caggaagttt tatgcgcatg 3240 atgacaagca ttttcggtga ccaaaacttc ttgagtttgt tatgttattt ggatgatttg 3300 atagtgtttg ggcctgatga aattacagca ctggatcgtc ttgagatggt tttcagccga 3360 ttgaggaaac ataacttaaa acttgctccc aagaaatgta acttcttaag gaaatctgtg 3420 cagtttttag ggcatgttgt tgatgagttt ggagtgtcca cagatccaag taaggtggac 3480 agtatttcca aaatgaccgg attggactta atggaggctg atggagttac accgtcccag 3540 aagcgcataa ggtcgtttct tgggatggtc aactattatc agcatttcat acccaattac 3600 tcctctgtgg ccaagccgct ttttgatctt ttaaagggtc aaaaacagaa gatcaagcgg 3660 ggtcgtgcaa ataaacgagt ctgtgtacca cggaagttga gtgtgactga ttggacatct 3720 gaagtacagt tggcatttga acatcttaag ttggccatgg tacaggcggt tgtcttggct 3780 catcctgact tcagccgccc ttttgttctt tcgactgatg cttcttcaga cggcattggg 3840 gccgttctct cccaggtcca gccaggagaa tctagagcta gacctatagc gtttgctagt 3900 aagtcacttt ccccatctca gcgaaattac ccagcacatc gcttagagtt tttagcatta 3960 aagtgggcag tctgcgagaa attcagtcat tggcttaagg gtcatgtttt cacggtatgg 4020 actgataaca acccgctcac acatattcta tccaagccca agcttgattg ttgtgagcag 4080 cgttgggtcg ccaaacttgc agcatacaat ttcgacatca aatacatacc gggctcacag 4140 aatgtcgtgg ctgatgcttt gagtagagtt ccatttgcga gatctaatat tggttctaga 4200 cttttaagcg aatcgtatga cagcctttta aatgaggtgc aaggactctc tgctggtttg 4260 gttcaggatg cttttaggtg gtcttgtgga tctgaaggga agccaaacac tccttgtggt 4320 ggtcttgtgc atgttaattc taatagtgtt gctggaacat gcctggataa ggaggaggtt 4380 gtggctgtgc ttgactcaca agaggcatgg gatgaaggtg cgagacttcg tgctttgtca 4440 atcctagagc atcttcctca gatagtttct gagccagaga gtcaactaat ttactcagag 4500 cgtgatttac gtgagaaaca gctctcagat agtgtactgt cgagggtaat tttttatgtg 4560 gagcgtcatc gcaaaccatc cagacgagag agagctaagg aatctgttgg ggtggtcaga 4620 taccttaagc attgggagaa atttgttatg aaggacggca tcctctacag aatctcccat 4680 gatcagctgt ctaagggtaa gcgttttcaa tatgtggtcc cacagtcgct tcagttagag 4740 gtgttgagag gcattcatga tcaggcaggc catcaggggc agtccagaac tttgggactt 4800 gcgagacgtc gttttttttg gctgagtctg gatagagatg ttagagagca tgtacgtcat 4860 tgtcaaaggt gtgtagtcag caaaactcct gagcctgagg gccgagcccc cttagagagt 4920 atcgtgacca gcagaccaca acaattggtg tgcattgact tttggtctgc tgaagatcat 4980 aataacagat ctgtcgatgt cctagtgatt actgatcact ttactaggtt ggctcaagcc 5040 tatcagtgta atgatcagtc tgctagacag gtggcacgtg tcttgtggga taaatacttt 5100 tgtatttttg gatttcctga gcgcatacat agcgatcaag gtgcttgttt cgagggccag 5160 ctgattagtg agcttctacg ggtgtctggg gtgcaaaaat cgcgcacaac tccttaccat 5220 cctatgggaa acgggagtgt tgagcggttt aacagaactc taggtggtat gattcgtgct 5280 ttagcccctg aagcgaaacg taattggccg agacagttgc agtctttgac ttttctgtat 5340 aactgtactg ttcatgagac tacgggttat gcaccgtttt atcttatgtt cggtagagtt 5400 cctcgccttc cagtagacct aattttccgc tctgttttgg aagatcacac agtcacttgc 5460 tatgacaaat acataggttc tcttttgaaa gatctggaag tggccttgtc catagctcag 5520 gctcacgcta aaaaggaaca gcacagacat acagttctct ataataggag agtcaagggt 5580 caagacattg aaattggtga tcgtgtgctg ttagcgaaca aggctgcaag agggaagagg 5640 aagttagctg acaagtgggc atctactatt tttcttgtag tagataagaa tgtcgatact 5700 cacattttta agataaggga tgtgaatacc gatcaagtga aagtggtgca tcgtaacttg 5760 ttgctacctg tgaacttttt gccactagac aaagcagaag atgaaggatc tgatagtgtt 5820 ccttccttgg ttgtagctga ctcatgtgag gatgctgata attccttgac aggcgtaatt 5880 gttgaccgtc gagatcctga gttgtatgtc agagataaga ttgcacagtc atctgagatt 5940 tcagtgtcta caggcaataa tgatggtttg tctctttacg gcaatgtttc atctgcttct 6000 gcagacagtc atggtcgtac atctgactgg gtttcaggtc tacccaagag ttgttcgcag 6060 ttgggatcaa ctgtctcctt gatcagttta tcagtaagtc aagaaaatga ctgctctgac 6120 aaattgagta ctatcggaac tactgatttt gcagctgcaa gtcctgctca ttccgtcatt 6180 tcaattcctt tgtctgatgt ttccaatgat gatgatgggt tgtctgtttc agactccaga 6240 acgtttatcg tccctcggtc tcgttttgga cgtatactga aacctgtgaa tagacttttg 6300 tgtaccatgt ctggacaaaa tgttgtgtca atgcagaatg gaagagtttt gactgtttga 6360 acatctgttt tctttttcta gattatggat aagacaatac atgttgttct caggggtttt 6420 tatttattgt actttatata gtttttcttt tctctttttt gcttccaatt gtaatttttt 6480 ttttatgaca tacctggata tggtccagtt gtttttgttt tgatttgtgt ttacttatat 6540 atataccgtg atatggaatt gtgagatgta tgtaggcatc ttactgccag ttgttggttt 6600 gagagattgg cgctcctctc attccacttc atctttgtca gataaccttg tgtttggcat 6660 acctgtttat ggtatctctg tatcggtaca gtgtatagaa gagtactgtt gttttttttt 6720 tctggttcac aactttcgct gccatcattc agcagaattc agtgggggtg aa 6772 // ID hAT-N19_DR repbase; DNA; ZEB; 660 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 07-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE hAT-N19_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW CHAROLIE3; nonautonomous DNA transposon; hAT superfamily; hAT-N18; KW hAT-N19_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-660 RA Kapitonov V.V. and Jurka J.; RT "hAT-N19_DR, a family of nonautonomous DNA transposons from RT zebrafish."; RL Repbase Reports 4(11), 307-307 (2004). XX DR [1] (Consensus) XX CC hAT-N19_DR is a family of nonautonomous DNA transposons that CC belongs to the hAT superfamily. The genome harbors over one CC hundred copies of hAT-N19_DR that are ~94% identical to their CC consensus sequence. This transposable element is characterized CC by 8-bp target site duplications and 15-bp terminal inverted CC repeats. XX SQ Sequence 660 BP; 188 A; 146 C; 150 G; 175 T; 1 other; cagtggtccc caaccaccgg gccgcggacc ggtaccggtc cgtggaacaa ttggtaccgg 60 gccacataag aaatcataaa ttatttctgt agaaaatatt ccccccgcga cttggtttct 120 tccactcacc cccgccatct cacgtgaaat ggactatgcc aaaatccacc acggagcggg 180 aagaacgata gaatatagct cagtggtggt tgttgatagt ttaaagttat aaaacaggta 240 ttatagagtc caaatcagca gagcatttaa ggattgtgca tatatgtgtg cgtgcatcgc 300 taagtaagta cccgaaatca ttgcaatatg tatctatatg gagtttcaca taatcatact 360 cattagggct gtaacatgcc aatgcataca tttaattcgg ctatacggtt gtattattgt 420 taatatgttc gtcagttgaa atgtttggca cagatgttat gtatttgatg catataaacc 480 ttgattgaaa cataatcaga ccgcgatagt gcaactagcg catgcgcact gagttgttca 540 tgattaacct ggatgcacga gcgcagcagc tgtgttcacm gccccccggt ccacaccggt 600 ccgttgtaaa attgacaacc gttgaccggt ccgcggtgaa aaaaaggttg gggaccactg 660 // ID ZFERV-2-I_DR repbase; DNA; ZEB; 10564 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ZFERV-2_DR; KW ZFERV-2-LTR_DR; ZFERV-2-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-10564 RA Shen C.H. and Steiner L.A.; RT "Genome structure and thymic expression of an endogenous RT retrovirus in zebrafish."; RL J Virol 78(2), 899-911 (2004). XX RN [2] RP 1-10564 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1608-1608 (2008). XX DR Genome; chr14; Positions 54408559 54419122. XX CC Positions [6400-6669] - Integrase core CC 'GTGC' target site duplication CC LTRs are 94% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 7913..9397 FT /product="ZFERV-2-I_DR_5p" FT /translation="MRVKLWENWRLLGIIGVEILIIMGTILGIEKFQNRNE FT NSAVKSTRMSRHKRSTDSDCFKYYEGKLELDYVKGTTTSFTFDLCDAIDCR FT GSDASWRGYDVYLCAPNSVVMKCENDKGPERADVYWPCVGGWNTVMKWTGL FT WTPQPSLGNEQKPFWQGVNLQRDYDNTKNPLTLSLTNLQGSSNGKDVAYFF FT IGVDITGKDYHTHIVKINFKEKPTKPTRTSQIRDHSKEIRGQKVVIEMDYT FT KLTPEDIMEKATGYGETNLWLEWMISTAKQTKLTDCVACAKARPTLKIEPA FT PLIRSDPIGFGCMISLTREARPKGCGTLAEIFPPINNQTKTGPFSADPGNF FT TCFNFTPSGPVLVRVGHINPLWCKEIIHNAGDIIGPWARGGLYYYCGEKRL FT FVRIKRTTTEGICAMVRLAAPVILIGRKQEGKGGTTIKRRKRGATFDLTTD FT SPTYIDSIGVPRGVPDKYKLADQIAAGFENIPIISAIFPVTPNKNIDRIN" FT CDS join(4360..6669,6673..7755) FT /product="ZFERV-2-I_DR_3p" FT /translation="MVHDLREINSRVTTPIVAVPNPYTALSMVDNSHKWFS FT CIDMANAFFCVPIADECKDCLAFTFEGAQYSYNRLPQGFILSPGIFNQVLK FT QQLEDCQLPDGVVTIMYVDDILIAASSPESCLKATEIVLQRLWESGLKVSR FT DKLQICRQTVTFLGRVVGVSGNGVSPTHKRDILTHPKPQTVKEMLSFLGLT FT GFSRHFVPQYVALTSPLRDMVKQCGMRNLTAQLHWTMEAERAFQKTKAELA FT HCSNLCTPDYKVPFQLDVSETTTSAHGVLYQKQSGNRRILLYVSVILDQVE FT ARQPPCARFAAGLAKIIQKTSHIVMGHPLLILTSHSVVSFVNSSSFTFSAL FT RQRRMIKILTAPNITYNHNGINMADLMTDGEPHDCAPLTDRLSVVREGLLA FT SPLPETDNPITLFTDGSCHRADDGNLKAGFAVVKLQNDQFVTIQSEPLIGK FT QSAQKAELLAIINALRHSEGQVVNIYSDSAYAVSAVHVELPVWIRCGFTTS FT SGRPITHEKEAKELFEAIKLPSKVAIIKYTGHSKGTDLISRGNEAADLAAK FT RASGYLQGQFVVTSEPDVTSLLPEFSREFLSEQQDLAAPEEKSVWKNKGAI FT RHADGLWTAPDGRALLPASIAPSVLKEAHTPAHCSDKQMSRALQSWWHPFM FT PHLISGYITSCNICQEHNIKPAMKPSQGSFPLANGPGDEVIIDFTDMIDRV FT QGKRYLLVMIDAYTGWPEAFPVGKEDSTAVIKCLINHYIPHHGFPRRVRSD FT NGSHFKNEHLAEVEKALGLHHFGAVYHPQSQGKVERLNLTLKNKLAKICAQ FT TKLTWLAALPLALMSVRSSVNRATGYTPFELLTGRQFPGPHAPLQLEPIQP FT MSHKLYYDKLVALITSFDKVLPTDFRGHEYPAPTPDRTWDWVRLRVYRRKW FT KEPRWSIPYKVTSRTSHCVRLAGKGDTWYHLSQCSPSRSLDDTRVDLASQV FT QEREDTAEGAEGDTQTHTHKDPEENFHQSQSSLQALQVVYKTGDLFTAPKG FT DAVAHCVSADCAFGAGIAKKFREKYGVDKVKAQRKTRGDCAVTHEADRIVF FT HLITKTFARDLPTYEDFESSLIRMRTWCNRLDITRLSIPRLGYGLDKLEFY FT RVLHIIKKTFRGQNITISIYTLD" FT CDS join(2152..3123,3127..4236) FT /product="ZFERV-2-I_DR_1p" FT /translation="MSRVLSAYQITEVENMANTMGTSNSTPLVTAGGQSPL FT CQAIRDKFPTPETMFENMKFKYRAGENGAAYINRCCQEWEEACDENPMSNP FT VTNRIFRAAVLKGAPAGVTQAMEQSPEVPGSDDAKWKRYAVHFIDKAEERA FT EKEGSELLKLQKELLKLQLDEKKSAKNKGLKQMTQQQHTHPPQPPPQPQVE FT SGHETFEPYSNHASQYQPQQQQWSRGRGNQRSRGHRGYLGNCHTCGGTGHW FT ANSCPTGNKGCFTCGGMDHLARDCYHQPTGQPMSRPTRGRGNQRGGGRTQR FT GGWGSRPVPPNQPSYQQPLCPEWDGADDSYYARSPQRTQERAMAEPLLTIT FT VNDRDIKALVDTGATYSTVTKDIDQKKLSNKCVEVMGFSGDSEQWPLTKPL FT PVTIADQQLTHSFLYSTNSPVPLLGRDLLIKTGASILCSPEGVIVSFPTGQ FT KVNCSLQGGVGHNQWLLAPSPTPDNCVDIYWALLSPSQTGAYSLFQEWKAW FT IYTLDVFIPPPDDLHCTMFYDRQEDHVYQEAFTDLEGSQWSLKGTGLLIGK FT EGVVAPVCLTKEQETWYKMRQTAAPHISLALHPTHEARELGSMTKRLLAVT FT DWVPTDNTELYYSPSTDSYKVVRDTTDLGTLEHKWISRCHGRDKSDGEGAE FT ELINSMPQELWSTGPTDVGYCDITPVTFDISGGPPIWVPQYKNKP" XX SQ Sequence 10564 BP; 3489 A; 2044 C; 2369 G; 2662 T; 0 other; attttggtga cccacgacgt gatgggaaaa ggaggaattt tgctgaacag agggatgaac 60 tgaattcaac ataaaggcgc gcctaaaaaa taaggtaagc agaaaccttt taaataaatt 120 ctgcgattga atactgcaaa ttaagtgttg aattcagcat aatcctgatg taagtaaaat 180 acagagaaac caaaagtaat attttacttt tttttttttt ttttttttac attttttgaa 240 gaatatcagg gtaaattgtt agaatttaaa ggaaaggcct tgtccctgca gggtaatttt 300 agggtgaatt actagaattc aaataaaatc cttgtcccta caaggtataa gggcgaattt 360 tcgcgcaaga gaattcgaag gaaatcctct tcccttagga ggtaaaaaat taaaagtgac 420 cctactcttg acgaaggtag aaagctttta ataccaggga caaagacaga aagtcatgtc 480 tttgtaagga agtccccctc cctggggagg catttttgag agtcactctt gtcctgacga 540 gaaccagata ctttcaaaat aaaaattgag aagagaaaat aatttcctga gaaagactga 600 ttgcagtatc aggaatagat agagtaggga acataaaata tatatatata tttaaataat 660 aataataata ataataataa taaataaata aataaaaaag ggagtgtgtg tatgtatata 720 tatatatatg tatatatata tatatatata tatatattta tatatatata tatatatata 780 tatatatata tatatatata tatatatata tgtatatgtg tatatatata tatatatata 840 tatatatata tatatatata tatatatata tatatatata tattgtcaac tgtaacaaat 900 tagtgcagag agggaacaat ggagggtgaa tttgacagag aatgtgagga gtatgggtgt 960 ccaaagttag gagttcaatg gaagaaacta actgaaataa tgataagtca aacagaacca 1020 aaagacctga aaacacgcat gaaagaattg gacatgtgca aaacagaatt gattgagcat 1080 tttggtactg attgcatgga gagaaaggga tctctgtttc agaaagtagg aataaaagaa 1140 caaagagcac aagaagctgt cttgaggcac agaaaagagg aagtagggtt gttgggtaaa 1200 cgagctcaca aaaagaagga gaagcagcta gagggaaatt tgagaagatg atctagagtc 1260 actctcacag catctttttt gagaccaaag aaaaaggaag tagacagcaa agacacagaa 1320 accaaactga gcgaaaaaca ggaagctgta gctgagatga aggttgagtc tgtagcccca 1380 cctaccgccc ctatggtgaa gaaactttac ccagacctac cgcccccata caacccagct 1440 actagtcaga aaccaaagca gctaatgaat caggaaacag gaagacaaat ggtactttat 1500 actgttaatg ggggtacatt agacttagat agagctgatg atcaagataa tggggcaatg 1560 acacaggaag aagttagtgg actgttagaa aaaatggtga gaagaagaga agatagcata 1620 agactgctgg aagacatagt aagagaagga gaagacagtc ttttgagacg tgaggggcag 1680 gagcaaaagc atagatcaag tgacatggat gggagtgtag aatccacaga gagggaggag 1740 gaagtagttg tgagaaggaa gaagttaaca aaaaatggac aaggtaacca agaagggaaa 1800 gtatcagttc gtattgaggg ccaggtcaca ttagaagata tcaaaaaagg ggacaatgaa 1860 agttcaggaa gtgatactga aacccaacat gaaccattca cagttaggcg aagtgacaga 1920 ataaagataa aacaacaatc aaaaccacac ttgtccagta taggacaagt gataaatatg 1980 cccctagtgg ctggatccca tggtcctaca tatcagccct ggtcactgac agatcttgag 2040 accctggtaa agtaactacc tgccataact ggagggggag gcagatggtt gaataaactt 2100 tttacattat cacatggtca agtactctca gtgggagact taagacagat aatgagcagg 2160 gtcctgtcag cctaccagat aacagaagtg gaaaacatgg ctaacaccat gggtacttct 2220 aacagtacac cattggtcac tgcagggggt caatcacccc tatgtcaggc catcagggat 2280 aaattcccaa cgcccgaaac catgttcgaa aacatgaagt ttaaatacag ggctggtgaa 2340 aatggggctg catatataaa tagatgttgt caggaatggg aagaagcgtg tgatgagaac 2400 ccaatgtcta accctgtcac taatagaatt ttcagggcag cagtgttaaa aggagctcca 2460 gctggagtga ctcaggctat ggagcagtct cctgaggtac caggcagtga cgatgctaaa 2520 tggaaaaggt atgcagtaca ttttatagat aaagcagaag agagagcaga gaaagaggga 2580 agtgagctgt taaagctaca aaaggagctt ctgaagctgc aacttgatga aaaaaagtca 2640 gctaagaata agggtttgaa acaaatgacc caacaacaac acacccatcc accccagcca 2700 cccccacagc ctcaggttga gtcaggacac gagacttttg agccatattc aaatcacgcc 2760 tctcaatatc aaccacagca gcagcagtgg tctaggggaa ggggtaatca gaggagtagg 2820 ggtcacaggg gatacctggg caattgtcac acatgtggag gcacaggtca ctgggcaaac 2880 tcctgcccaa caggcaataa aggctgtttt acatgtgggg gcatggatca cctagctagg 2940 gactgttacc atcagccaac aggccagccc atgagtcgac ctactagagg tcggggcaat 3000 caaagaggtg gaggccgcac acagagagga ggttggggta gcaggccagt accacctaat 3060 cagccaagct accaacaacc actctgccca gagtgggatg gagctgatga tagttattat 3120 gcatgaaggt ccccacagag aacccaagag agggccatgg ccgagcctct cttaactatc 3180 actgtcaacg atagggatat taaggcccta gtggacactg gagctactta ctctactgtc 3240 actaaagaca ttgaccaaaa gaaattatca aataaatgtg tagaagttat gggtttctct 3300 ggggacagtg aacaatggcc attgactaaa ccacttcctg tcaccatcgc agatcagcag 3360 cttactcact cattcctgta ttctactaac tcccctgttc ctttacttgg cagagatctt 3420 ttaattaaaa caggggcaag catactatgc tctccagaag gagtcattgt atcttttcct 3480 acaggacaga aagttaactg ttcactgcaa ggtggagtag gccacaatca gtggctcctt 3540 gcaccatccc ccaccccaga caattgtgta gacatttact gggccttgtt gtcaccatca 3600 cagacaggag catattcact tttccaagaa tggaaggcat ggatatatac tttagacgtt 3660 ttcattccac ctcctgatga cttacactgc acaatgtttt atgacaggca ggaagaccat 3720 gtgtaccagg aagccttcac agatctggaa ggttcgcagt ggagcttaaa aggaacaggt 3780 ctgttgatag ggaaagaagg tgtggtggct ccagtttgtt taacaaaaga acaagagacc 3840 tggtacaaaa tgagacaaac tgctgcccca cacatatcac tggcattgca ccccacacat 3900 gaggctaggg agctaggctc aatgacaaaa aggttacttg cagtcactga ctgggtacct 3960 acagataaca cagagttgta ctactccccg tcaacagata gctacaaagt ggtgagagat 4020 accactgact tgggcactct ggaacacaaa tggatctcta ggtgtcatgg cagagataaa 4080 tctgatgggg agggggcaga ggaattaatt aactccatgc cacaggaact atggtccaca 4140 ggaccaactg atgtgggata ttgtgacata acacctgtca catttgatat ctctggaggc 4200 ccaccaatat gggtcccaca gtataaaaat aagccttaag ctgtagctgg cattaagcca 4260 acgattgagg ggctactgaa ctctggagta ttgacccctc accagtcaca ttggaacaca 4320 cccatattcc ccgtccctaa acagggaggc aagtacagaa tggtacatga ccttagagaa 4380 atcaacagca gggtcactac cccgatagtt gctgtcccca acccgtacac tgccctttct 4440 atggtggata actcacacaa atggttttca tgcatagata tggcaaatgc atttttctgt 4500 gtaccaattg cggatgaatg taaagattgt ttggccttta cttttgaggg ggcccaatat 4560 tcttataaca gactacccca agggtttatt cttagtccag gcatctttaa tcaggttcta 4620 aaacaacagt tggaagactg ccagcttccc gacggagtgg taacaataat gtatgttgat 4680 gacatcctca tagctgcttc atcacctgaa tcttgtttga aagctactga aattgttttg 4740 cagagattgt gggaatcagg actgaaggtc agtagggata agttacagat ttgtagacag 4800 acagtcacct tcctagggag agtagtgggg gttagtggaa atggtgtatc accaacacat 4860 aagagagaca tactaacaca cccaaagcca cagactgtaa aagaaatgct ctctttctta 4920 ggactaacag gattcagcag acactttgta ccacaatatg tggctctgac tagccctctc 4980 agggacatgg taaaacaatg tggcatgaga aatctgacgg cacagcttca ttggacgatg 5040 gaggccgaaa gagcattcca gaaaacaaaa gcagaactag ctcattgtag taacttatgt 5100 accccagact acaaagtacc gttccagtta gacgtgtcag aaacaaccac aagcgcacat 5160 ggagttttgt accaaaaaca atcaggtaac agaagaattt tgttgtatgt cagtgtaatc 5220 ctggaccagg tagaagccag acaaccccca tgtgccagat ttgccgcagg tctagctaaa 5280 ataatacaga aaacatcgca catcgtcatg ggtcatccat tgttgatctt aacctctcac 5340 tcagtggtca gttttgtcaa ttcaagctcg ttcaccttct ctgccttgag acagagaaga 5400 atgataaaaa ttttaacagc tcctaacatt acctacaacc acaatgggat caacatggct 5460 gacctaatga ccgatggaga acctcatgac tgtgcaccat tgacagacag attatcagta 5520 gtcagagagg gcctattggc ctcaccactt cctgaaacag ataacccaat cacactgttc 5580 actgatggta gctgccatag ggcagatgat ggaaatttga aagcaggttt tgctgtggtt 5640 aaattacaaa atgaccaatt tgttacaata caatctgaac cactaattgg caagcaatct 5700 gcacaaaaag cagaactatt ggccataatc aatgcattga gacactcaga agggcaagtt 5760 gttaacatct actctgattc tgcatatgca gtgtccgcag tccacgtgga gctgcctgtg 5820 tggatcagat gtggattcac aacttcctct ggccggccta ttactcatga gaaagaagca 5880 aaagagctct tcgaagccat aaaattgcca agcaaagtgg ctatcattaa gtacactggt 5940 cattctaaag gaactgacct aatatcaaga gggaatgagg ctgcagactt ggctgcaaaa 6000 cgtgcctcag gctacctaca aggtcagttt gtggtcactt ctgagccaga cgtgacatcc 6060 cttctgccag aattctcaag agaattcttg tctgagcagc aagacctggc tgctccagaa 6120 gaaaagtctg tatggaaaaa caaaggggca atcagacatg ctgatggcct gtggacagca 6180 cctgatggta gagctctgct ccctgcatct atagctccaa gtgtattgaa ggaagcccat 6240 acacctgcac actgcagtga caaacagatg tcacgagcat tgcagtcatg gtggcatcct 6300 ttcatgccac acctaatctc tggttacata acgtcatgca acatctgcca ggaacacaat 6360 attaaaccag ctatgaaacc atctcagggg tcatttccac ttgcaaatgg ccctggagat 6420 gaagttatca ttgatttcac tgacatgatt gacagagtgc aagggaaaag gtacctgttg 6480 gtcatgattg atgcatatac tggttggcct gaagccttcc cagttggcaa ggaggacagc 6540 actgctgtca ttaaatgttt gataaaccac tatatcccac accacggttt tccaagaaga 6600 gttaggtctg acaatggctc acatttcaag aatgagcatc tagctgaggt agaaaaagca 6660 ttaggcttgt agcatcactt tggtgcagtc tatcaccctc aatcacaagg gaaggttgaa 6720 cggttgaacc taactctgaa aaacaaattg gctaaaatct gtgcacagac caaattgacc 6780 tggttggctg cccttccact agccttgatg tcagtacgca gctctgtgaa cagggcaact 6840 gggtacacac cttttgagtt attaactggt cgacaatttc ctggcccaca tgccccactg 6900 cagcttgaac ctatacaacc catgtcacac aaactgtact atgacaaact agttgccttg 6960 attacatcat ttgataaagt cctgccaaca gattttaggg gacatgagta tcctgctccc 7020 accccggacc gaacctggga ttgggtgagg ttgagagtct acagacgtaa atggaaagag 7080 cccaggtggt ctattcctta taaggtaaca tcacgcactt ctcactgtgt tcgtcttgca 7140 ggaaaaggtg acacttggta ccacctgagc cagtgttcac cgtcaagatc gctggacgac 7200 accagagtgg accttgcttc gcaggtccaa gagagggagg ataccgcaga aggtgcagaa 7260 ggtgacacac aaacacatac acataaagac ccagaagaaa atttccatca atcacaaagt 7320 agtctacaag ctttgcaggt ggtctacaaa actggggatt tattcacagc cccaaaaggt 7380 gatgcagtag ctcattgtgt tagtgcagat tgtgcatttg gagcaggcat agcaaaaaaa 7440 ttcagagaaa aatacggggt tgacaaagta aaggcccaga gaaaaacgag aggtgactgt 7500 gctgtaactc atgaggcaga cagaatagtt tttcatttga taacaaaaac gtttgccaga 7560 gacctaccca cttatgaaga ctttgagagc agtttaatcc gcatgagaac gtggtgcaac 7620 aggttagaca ttacaagatt atccataccc cggctggggt atggtttgga taagttggag 7680 ttttacagag ttctccacat tatcaagaaa actttcagag gccagaatat aacaatctcc 7740 atttatactc ttgattaagc tcatatacat aaaaaaatta agctgaatta actgatcaag 7800 atactgcttg ctataagcgt aattacgctg tggaaaaaaa ttaaataaat aaataaaaaa 7860 taaatgaaat aaaacattct agaactatag ttcatctgga agagaataaa acatgagggt 7920 caaattgtgg gaaaactgga gactccttgg gataataggg gtagaaatat tgataataat 7980 gggaacaatt ttgggaatag aaaaattcca gaacagaaat gaaaactcag ctgtaaaatc 8040 aacaagaatg agcagacaca aaagatcgac agatagtgac tgttttaaat attatgaggg 8100 taagctagaa ctggactatg tgaaaggaac tactacatcg ttcacatttg atttatgcga 8160 tgccattgat tgtcgtggta gtgatgcatc ctggagaggg tatgatgtgt atctatgtgc 8220 tccaaattca gtggtaatga agtgtgagaa tgataaaggt ccagaaaggg cagatgtata 8280 ttggccatgc gtaggtggat ggaatactgt tatgaaatgg actggactgt ggacacccca 8340 accaagtttg ggaaatgagc agaaaccatt ctggcagggc gtgaatcttc aaagagatta 8400 tgataataca aaaaatcctc tgacgctatc actaacaaac ttgcagggat catcaaatgg 8460 gaaagatgta gcatatttct ttataggggt ggacataact ggaaaagatt accatactca 8520 tattgtaaaa attaacttta aagaaaaacc aacaaaacca actagaacat ctcaaattag 8580 ggatcatagt aaggaaatta gaggtcagaa agtagtaatt gaaatggatt acactaaatt 8640 gacccctgag gatataatgg aaaaagctac aggatacggg gaaacaaatt tgtggctaga 8700 atggatgatt tccacggcta aacaaacaaa attaacagat tgtgttgcat gtgcaaaagc 8760 tagacccact cttaaaatag aaccagcccc tttgataaga agtgacccca taggatttgg 8820 atgtatgata agtctcaccc gagaagctag accaaaagga tgtggcaccc tagcagaaat 8880 attcccccct ataaataacc agacaaaaac aggcccattc tctgcagacc caggaaattt 8940 tacctgcttt aattttactc cctccggacc agtgttagta agggtagggc atattaaccc 9000 tttatggtgc aaggagatta tccacaatgc gggggatata ataggaccat gggcgcgggg 9060 aggactatat tattactgtg gagaaaaaag attgtttgta agaataaaaa ggacaactac 9120 tgaaggcata tgtgcgatgg tcaggttggc agcaccagta atactgattg gcaggaaaca 9180 agagggaaag ggtgggacta caatcaagag aagaaaaagg ggtgcaacgt ttgacttaac 9240 cacagattct ccaacctaca tagactcaat tggggtcccc cgaggggtcc cagacaaata 9300 taaattagca gatcaaattg ctgcaggttt tgaaaacatc ccaattatct cagctatttt 9360 cccagtcact cctaacaaaa atatagatag aataaattaa gtacattaca atgtcatgag 9420 attggctaat ctaaccagag atgcggtaga aggcctgtca gaacaattag cacctacttc 9480 attgatggct atccagaaca gaattgccct agacatgcta cttgccgaga aatcgggcgt 9540 gtgtgctttt ttccaagacc actgctgtac ctttattccc aacaatactg ctcctgatgg 9600 atcagtaaca agagccctag aaggactaaa aactctctcc aaaactatgg cagaacactc 9660 tggtgtagat aacccatttg atgattggct gaccagtgtg tttggtaaat acaaaaccat 9720 gttcacatct atcctagtct ctattgccac ttttgtggct ataatagtaa catgcggatg 9780 ctgctgcgtc ccttgcatca ggacattgtg taacagggtg atagtaaccg caattgaaaa 9840 gagggatgga gccccacccc cttacacaat gcccctgtta atatcccctg aaggtgatga 9900 ggaggaagag gagtgtcatt tataaatgtt gtttttctct attaactgtg ttttcttgca 9960 gatgctgatt gattatactt tttgtgttat aagaaatttc tctctattaa ttgtgcttcc 10020 ttacagatgc tgattcatca tatattgtgt tataagaaat gttaagtaga tattttgagt 10080 ctaaaaggaa tctatcgaaa tgtttagctt agacattttt ttgttttatg ttttgtgtga 10140 ttgactcagc tgacttgttg tggtcgccca catgtcaact aggggaccga ctgggtgtct 10200 ttccccacag gggttttcgg cccagtcctg tctgagcaca ctggtttata gggcatttat 10260 agggaatttt tttttttttt ttttatgtgt gtgaaatgtt gatttacaat agcagtctga 10320 tatgatgcct cagatgaact tgcggtggct gcccatacgt caattagggg gtcacctggg 10380 tgtctttcct cagagaggtt ttcgactcag gctcatctgg acatattggg ctgaaaatat 10440 atatattttt ttttcttcct tttttccttt atttttattc ttacacatag acttagttat 10500 ggtagtttga agatttattg aatataataa agtgtggcct ctttcagagg ccaagagagg 10560 gatt 10564 // ID Gypsy77-I_DR repbase; DNA; ZEB; 5658 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy77-I_DR; KW Gypsy77-LTR_DR; Gypsy77_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5658 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1469-1469 (2008). XX DR Genome; chr12; Positions 11516305 11510648. XX CC Positions [3051-3590] - Reverse transcriptase CC Positions [4668-5144] - Integrase core CC 'GTCT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 480..5606 FT /product="Gypsy77-I_DR_1p" FT /translation="MAHTPPSPFACVEIDPPDIATPNWPHVQQMIQQPHSQ FT PQQVSTFHSTPTQFHQTPLQTQVHFGSPPPSITDPLSSSATQFHQTPLQTQ FT VRFGSPPPSITAPLSSSPHQDSMQLCSVAKATPSSASTQHDLPVFLPTPGR FT EIYQLSTQVQGNWDYILNCLKQQDEDLKTLTQEVATTSSQFQSKFETLTTK FT IDSNHQHLQTVFATQKQPESENSGQLIKAFKVMLEDNLKQCELALAAEFRF FT MVEQLQLEVQRDIVNLEKNSQKNYDQLSQSISQCNSQVNALIQSIKELKSD FT VVNIKPFAQPVTALHVMTQTPLLSTDPEDDAVLPSTDTLPLHLPHPTSTIP FT KSDHLKLTFPTFGRPSDDGDPLSYLNRCYDFLAVHPLNDTEILATFRSVLY FT GTARDWWEVARANIKTWTEFETAFLSAFLAEDYEDELAERVRTRVQGTKET FT IRDFAFSYRALCKKWKPSLSENDLVKMILKNINPYLASQLRSRVKTVDELV FT KLGHQLEKDLAQQLLYEGRISSNPIAPKPTSNRPLERSSVQCWRCKGQHAP FT GNCPQYSSASVSTQRSNQPVGDKRNIPAQHQRSAPSNNSISVKTATPITEN FT PRHPPTYEFIPQQLIVPITIGTWSGKAIIDTGASYTLLHESLWSALTSLDG FT LHKWPHGPLYLANGETAVPLGWTNADISLHNMVFPTPVANLPSTALTYSVV FT LGLDFIFTSGLQINVAGGNYTFTTAPNQVYPFQPGQASIPEKTYLHQKKKV FT HRKHASQVLSLFTSIPPPQRLFQLPDNSDDKTLIQALVDNSHLHPEEKYQL FT HQLLLKNPQVCTSQLGRTTVLQHRLYTVHPVPIRQRAYRLTPEKQAIVEEQ FT VQEMLMNGLIEPSHSAWASPIVLVPKKDGSMRFCVDYRKVNAITERDAYPL FT PNITEILESLSGAAIFSTIDLNSGYWQVTMEPGSESKTAFIAPSGLYHFKV FT MPFGLKNAPATFQRLMESVLGELRGKICFVYIDDIIIYSPSITQHLNDLQS FT VLTRLYRAGLTINLKKSKFCQHELSFLGHVVNTQGISADSTKVEAIKSYPV FT PKNIKDVQRFLGLAGWYHRFVPNFSQIAEPINSLKKKGRSFQWTPQCQQAF FT EKLKACLTSPPILGHPDLRLPFVVYTDASDCGLGAVLTQRKVQGQEEVIAY FT GSRTLTKPEVNYSATEKECLAILWAIEKWRHYLEPKLFTIVTDHSALQWVL FT NSTKTTSRLIRWALRLQKFDFVVEYRKGKLNEAPDALSRIHPLPGCHVYSA FT QQEPEFPISTAEIWEEQHKDPDIIHIFKTLADKDSSMHEQFEVIDDKLYHR FT TLLKEGHKHYRIYVPATLIPNLLHHYHAHPLSGHLGIYKTYQRIHHVAFWP FT AMWTSIKTYVKRCVRCQTLKNESRKPAGKLQQTTTTRPNEMLGVDIMGPFP FT CSPRRSQYLLVFVDYYSRWVELFPMRDATAQTVAKFFREEILTRWGVPDFI FT LSDRGTQFTSAIFSEVCKKWKIKQKMTTAYHPQTNLTERVNRTLKQMISSY FT VDDNHKKWDQYLPEFRFAINSAVQETIGMTPAELQLGRKIQGPVDKLMHGH FT NLSPDSPAYEVAEQLSTLKMKASECSKKAMARQLRNYNKTRREVSYKEKDR FT VWVRNFPQSSALYHFSAKLANKWKGPYRIIHQLGPLNYRVAQENTGEDVRT FT VNVCNLKPCFPTAEELEKQEKEKLKAIFLQPLEEEEFFGF" XX SQ Sequence 5658 BP; 1760 A; 1371 C; 1039 G; 1488 T; 0 other; taaatggcgc ccaacgtggg gcgtggccca acttgaactg aactttgaac attgaactct 60 gaactttgaa ctctcattca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa acgcacacac 120 aactttacaa aaagacattt attgtgaaca tttgcatact aattgtgaaa agtttttttt 180 actgcttact ttgtgaaaag agggttttcc ttcctgtgaa aaaaagggat ttttctttgt 240 gaaaattgat agttttcttt gtgggaaaaa acaaccttgt gttcttagtg aaagggttac 300 atctttattg acaattgtga atattttgta agtttttctt atctgttgta attgttattt 360 cttggtgctc ctgtctactt ttcaccaaac attgaccata cccagttttc atttaggatt 420 ttttattttc ttgccatttt ttcacctgtg ccctttagag atattacctt ccgtcaaaca 480 tggcccatac tcccccatca ccatttgcct gtgtggaaat tgatcctcca gacattgcaa 540 cccctaactg gccacatgtg caacaaatga tacaacagcc tcattctcaa ccacagcaag 600 tttccacctt tcactcaact cctacccaat tccaccaaac ccctctccaa actcaggtgc 660 actttggatc tccacctcca agcattactg accctctttc ttcatctgct acccaattcc 720 accaaacccc tctccaaact caagtgcgct ttggatctcc acctccaagc attactgctc 780 ctctctcttc atctcctcac caggattcca tgcagttgtg ctctgttgct aaagccacac 840 catcatctgc aagtactcag catgacttgc ctgtatttct cccaactcct ggacgagaaa 900 tatatcagtt atcaacacag gtacaaggaa actgggatta tattcttaat tgtctgaagc 960 agcaagatga agatttaaaa acacttactc aagaagttgc aacaacctct tcacagtttc 1020 agtcaaagtt tgaaactttg accaccaaaa ttgactccaa ccatcaacat cttcaaactg 1080 tattcgctac tcagaaacag cctgagtcag aaaacagtgg acagttaatt aaagctttta 1140 aagtgatgct ggaagataat ctgaagcaat gcgaactggc tttagcggct gaattcagat 1200 tcatggttga gcaattgcag ctggaggtac aaagagacat tgtaaacctt gaaaaaaatt 1260 cacagaaaaa ttatgatcaa ctttcacaat ccataagtca atgcaacagc caagttaatg 1320 ctctaattca gtccatcaaa gagcttaagt ctgatgttgt taatatcaaa ccctttgcac 1380 aaccagtgac tgctctacat gtgatgactc aaacaccttt actttccacc gatccagaag 1440 atgacgctgt tctcccaagc acagatactt tacctttaca cctacctcat cccacaagta 1500 ccatacccaa aagtgatcat ttgaaattga cctttcccac ttttggtaga ccttctgatg 1560 atggtgatcc tctgtcatat ttaaaccgtt gttatgactt tctagcagta catcccctga 1620 atgatactga aatcttggcc acctttcgca gtgttttata tggtaccgca agagactggt 1680 gggaagtcgc cagagctaat ataaaaacct ggactgagtt tgagaccgca ttcctatcag 1740 catttctcgc tgaagattat gaagacgagt tggctgaacg agtccgcacc agagtccaag 1800 ggacaaagga aactattaga gactttgctt tctcttatag agctctttgc aaaaaatgga 1860 aaccttcact gagtgagaat gacctggtga agatgatctt aaaaaatatt aatccttacc 1920 tggccagtca actgcgcagc agagtaaaga ctgtagatga attggtaaaa cttggtcacc 1980 aacttgagaa agaccttgca cagcagttac tgtatgaggg aagaatctca agtaacccta 2040 tagcacccaa acctacctcc aatcgtccat tagagagatc ttctgttcaa tgctggagat 2100 gtaaaggtca gcatgcccca ggaaattgtc cgcaatattc ttctgcatct gtttccacgc 2160 aacgcagcaa ccagcctgta ggagataaaa gaaatatacc tgcacaacat caaagaagtg 2220 caccttccaa caatagcatt tctgttaaaa ctgcaactcc aattactgaa aacccacgtc 2280 atcctccgac ctatgagttt ataccacaac aacttattgt gccaatcacc atcggaacat 2340 ggtctggcaa ggccataatc gatactggag ccagctacac cctacttcat gaaagtttgt 2400 ggtcagcact tacctctctt gatggtcttc ataaatggcc ccatggtccc ctctacctgg 2460 ctaatggtga aacagccgtt cctttaggat ggacaaacgc agatatcagt cttcacaata 2520 tggtctttcc tacccctgtt gctaacctcc cttccacagc actcacttat tcagtggtcc 2580 taggattgga tttcatattc actagtggac tacagatcaa tgtggctggc ggaaattaca 2640 cctttacaac agccccaaat caagtgtatc cttttcagcc tgggcaggcc agcattcccg 2700 aaaaaacata cctgcatcag aagaaaaaag ttcacagaaa acatgcaagc caagttcttt 2760 cattgttcac gtcaatcccc ccaccacaaa gactgttcca attaccagac aattcagatg 2820 acaagacgct aatacaagca ttagttgata attctcacct gcatccagag gaaaagtatc 2880 agttacacca gctacttctt aagaatcctc aagtctgtac ttcccagctc ggaagaacaa 2940 ctgtgttaca acaccgtttg tatactgtcc accctgtgcc aatcagacaa cgtgcatacc 3000 gtctaacccc agaaaaacaa gcaattgtag aagagcaagt ccaagaaatg ctcatgaatg 3060 gactcataga accctcacac tcagcgtggg cctcacccat tgtcctcgtc cccaaaaagg 3120 atggtagtat gaggttctgt gtagattatc ggaaagtgaa cgctatcaca gaaagagatg 3180 cctacccctt acccaacatc acggaaatat tggaatccct ctccggtgct gccattttct 3240 ccaccattga cctcaacagt gggtactggc aggtgaccat ggaacctgga agtgaatcca 3300 aaactgcctt cattgctccc tcgggcctat atcacttcaa agttatgccc tttggactta 3360 aaaacgctcc cgcaacattt cagcggttaa tggagtcggt cctaggagaa ttgcgtggaa 3420 aaatatgttt cgtgtacatt gacgacatca ttatatactc accctcaatc acccaacacc 3480 tcaatgatct gcaaagtgtc ctcactcggt tgtacagagc aggcctaacc attaacctga 3540 agaaaagcaa attttgtcaa catgaattat cgtttctggg acatgtggtg aacacccagg 3600 gtatctctgc tgattccacc aaagtagaag caataaaatc ctacccggtc cctaaaaaca 3660 tcaaagatgt tcaaagattt cttggattgg caggatggta tcataggttt gtgcccaact 3720 tctctcaaat tgcagagcct atcaactcct taaagaaaaa agggcgaagt tttcaatgga 3780 caccacaatg tcaacaagcc tttgaaaaat taaaggcttg tctcacttca ccacccattc 3840 taggacatcc tgacctccga ctcccatttg tagtttacac tgatgccagt gattgtggct 3900 taggtgctgt gttgacacaa cgcaaggtgc aagggcagga ggaggttatt gcctatggaa 3960 gccgaactct gaccaaacca gaggtaaatt attcagccac agagaaagaa tgcttagcca 4020 tactatgggc tatcgaaaaa tggagacatt acctggagcc gaaacttttc actatagtca 4080 ctgatcattc cgctctgcag tgggtcctga actctacaaa gaccacctca cgtctaatcc 4140 gctgggccct ccgtcttcaa aaatttgatt ttgtggttga atacagaaag ggaaaactga 4200 atgaagcacc agatgcttta tctcgcattc atcctctacc tggttgtcat gtgtattccg 4260 ctcagcaaga accagaattt cccatctcca ctgcagaaat ctgggaagag caacacaaag 4320 acccagatat catccatata ttcaaaaccc tggcagacaa agattctagc atgcatgaac 4380 agttcgaggt gattgatgac aagttatacc atcgtaccct attaaaagaa ggacacaagc 4440 actacaggat ttatgtacct gccaccctta ttccaaatct cctccaccat taccacgcac 4500 acccgttaag tggacacttg ggtatataca aaacttacca gagaatacat cacgttgctt 4560 tctggcctgc aatgtggaca tctatcaaaa cgtatgtgaa aaggtgtgtg agatgtcaaa 4620 cactaaaaaa cgaaagtcga aaacctgcag ggaaacttca acagactact acaactcggc 4680 caaatgaaat gttaggtgtc gacatcatgg gaccgttccc ttgcagccct cgccgatcac 4740 agtatctgtt ggtttttgtg gattattact cgcgttgggt tgaattattt cccatgcgag 4800 atgccactgc tcagacagta gctaagttct tcagagaaga aatacttact cgatggggag 4860 tacccgactt cattttatct gaccgaggca cccaattcac atccgcaatt ttctctgaag 4920 tctgtaaaaa atggaaaatc aaacagaaaa tgacaacagc gtaccacccc cagaccaatc 4980 tcacggaacg agtaaacaga accttaaaac aaatgatttc ctcatatgtc gatgataacc 5040 ataaaaagtg ggatcaatac ttgccggaat ttcgctttgc tataaactcc gcagtccagg 5100 aaacaatcgg catgacccca gcagaacttc aactcgggag gaaaatccaa ggtcctgtag 5160 acaagttaat gcatggacat aacctgtctc ctgattctcc ggcctatgaa gttgcagaac 5220 aactatccac tttgaaaatg aaagccagtg agtgcagcaa aaaagctatg gcacgacaac 5280 tccgcaatta caataaaaca cgacgagagg tatcctataa agaaaaagac cgtgtatggg 5340 taagaaattt tcctcagtca agtgcattgt atcatttcag tgcgaaactt gcaaataaat 5400 ggaagggacc ctaccgcata attcaccagc taggacctct gaattatcgg gttgcacagg 5460 aaaatactgg agaagatgta cgaactgtaa atgtctgtaa ccttaagcct tgttttccaa 5520 cagctgagga attggagaaa caggagaaag agaaacttaa ggccattttc ctccaaccgc 5580 tagaagaaga agaatttttt gggttctgat agctcttttt caaacaacca ggggttgttt 5640 tcttcaaggg ggggagag 5658 // ID ENSPM-6_DR repbase; DNA; ZEB; 32616 BP. XX AC . XX DT 11-AUG-2008 (Rel. 13.08, Created) DT 11-AUG-2008 (Rel. 13.08, Last updated, Version 1) XX DE EnSpm-6_DR is an autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; EnSpm-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-32616 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(8), 823-823 (2008). XX DR [1] (Consensus) XX CC The consensus elements are constructed from several copies, which CC are less than 2% divergent from the consensus. The transposase CC encoded by EnSpm-6_DR is far distant to most known EnSpm CC transposases. EnSpm-6_DR contains 29 imperfect TIR, and is CC characterized by 2-bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 15512..17500 FT /product="ENSPM-6_DR_1p" FT /translation="MSYWTKRRKTLATVQEIEQEILNEYKIQTENNPESTA FT QVNASQNFENRDELPSCNIVSSLATDHDNPCWLDSDSESLSDSGSISDTDQ FT PNAESDSSSNDSVNQLQKKLQTWAASFCIPLIALTALLAILRSEFPDLPKD FT ARTLLGTQTKVPVTKICNGEYYHFGLVSCLLKTLDHIASLPKITTLALQFN FT IDGIPLFKSSKTQFWPILGTIDCDKTRSPFIIGLFCGNSKPSSVWEYLRDF FT IAELRIVLREGIVRNGTRFKVVVSSFICDAPARAFVKHVKSHNGYYGCDKC FT SQTGVWSNKMTYPETDAPLRTDDDFEKMKNEEHHLGNSLGPLTGVVKMITQ FT FPIDYMHTCCLGVTRRLIQMWMKGKQLRTRLSSQQIQKISDRLVALRPFMP FT KEFARKPRSLRDVDRWKATEFRQFMIYTGPIVLKGFVENEIYDNFMLFSAG FT MFLLLSPGLPGSMIEIANKILSSFVKHYSQLYGTDEVVYNVHQVIHLAEEY FT KAFGPLDNISAFPYENYLGTLKRLVRKPDQPLQQVVRRLSEMPTKQSHKSN FT QESHLSSSTKDPKLVHFSDCVLSPNQGDNCIQIDTGIAVIKNIMKQGETVH FT LVCQTFKKMESFYTYPCDSSANGCHRVSVLQEKTDLVPLSSVKQKYVLLPE FT NENSFFALPLIHCR" XX SQ Sequence 32616 BP; 11127 A; 5329 C; 5084 G; 11075 T; 1 other; cccagatagc aaaccgatgt tgattcaacg tcgaatattg gttataatag tcagtatcag 60 tcggcgatca aatttcaaca ttgaaccaac gttcattatt tggtcggtac atcggaagga 120 ttttacgttg aaaaaaagat attgattcaa cgtcgaaatt tcgacattga atcaacgttc 180 attatttcaa tatttaaata tgaatcaata ttcattattt ggtcggtaaa atgccgtttg 240 catcagtgta agtagagaaa tacaacagcg agtgccctta atcttgcctt atactatgta 300 ttttaagctg ttttgggaag acaaatatgc aagttagtta tttaggctaa tgtcaagaat 360 ttgatttcac ttttattttg ttttacttta attatcttgt tttccttcgg aatattattc 420 aagtttctta atcccgtgat atatggcagt aggactgttt acatttattt tgcaataaaa 480 atgcattttg atatttcata agtacatcag tggtgtaggc taaactggat atttgaaatt 540 cattttattc agttaatgtg atctcttccc ctatggttta ggcctaatag atgttcacca 600 gtaatctgtt ttaccacgac aatcactata ttattacaaa aataactgca cacacaaact 660 gaatagattt ctactcatta ataccggcac atatttatca gcgaacatat aaaacatgca 720 agatgattaa ttagattttt aagtagttct aaataattta gtctgaattt aaactattta 780 agttgaaaat aggctacatt taatttgttt taattcaacc catataaatt gtttgtaaca 840 actttgtaga aacatttttt agtatacatt ttatcagata taggccagca tttgtaacta 900 aaaactagta taagctaaat taaattacaa ccaattaatt ttatgtgtga acggttacaa 960 tttttttcgg tgacatcttt ttctttcata ttaggctatt tctcgaacca gaaattattt 1020 aaagagctta tttttatcct gcgtgtctaa aacgtgacaa tttttattag aaaataatta 1080 acactttaag tttatttgtt tggtaaatca aatttgccat atttaaaacg gcttgtatta 1140 tacatttaaa aggtgtacat taaaattttt attgtttatt tatagctttc tgttgaaaaa 1200 tataattgct ttagcagcaa tttttatcgt gccaaaatat ggataattgc aacatgttcc 1260 cacatcatga cattaattta aatgtttgtc aacattacat gaaattacat gaaaatgtaa 1320 tactttatgg tttgggtaaa taaagaattc atttcagtat ttcttcgaga aatagcgttt 1380 aggctacttt tgtcacacct tagacacata cacagcatgc gaaagacaat atatttcaac 1440 attcaattag acacattaac agacatagtc aaaaaaaaaa aaaaaaacat tttatagaat 1500 aacccaataa acaccttctc tctctctcac acacacacaa acataattca acaaaacagt 1560 ttaatagatt taagcatgtt ttaataagtg ttatgactct cctgttgcgg agtcagatgc 1620 ataaaaagca ttgtgctcag ttaaacagac aaaatgttgc catccattaa caatttgact 1680 tatttatttg gcatctttat attcttgttt gttacatgtt gttgttcaac aaatgacatt 1740 tgattataaa ttgtagtaat atttaatgga aattctttgg cagtttggcg ttacattgtt 1800 ccgccataag atagcaacag agatcaatat gatcagttct taatggtgca aatatttttc 1860 ctcaatggat atttaaatct acagctaaac aaaatcatca caaaacaggt aaatctcacc 1920 aagtccatct atctctttta tatttttatt gttttaataa taatatagta atgtagtggt 1980 gtttgctttg atttttgcca actctgtttg tccagttcta gttatctaaa ctcattagct 2040 ttgatgtctt ctgtggtgtt caccaggtag gctggatcca gactggaatt ggtacaattt 2100 ctagaatcct ttgaatggac cttctaggac agaaacagga aagcataagt acacaaatta 2160 acatagattc catttatata ttttttcaga caaaagatga gtttttttat tatttattag 2220 ttataactgt agaataaaag taaggacaaa aatgtataac ttaagagtct caaatattat 2280 atatagtaca atgtctttga tacattcatt ttgacattgc aactgtatat attcccaaat 2340 catgtttaac aatactgtat atttttccaa agtattttct ataatattct ttcttcagga 2400 aaaagtcttt tgtttaattt cagctagaat aaaaacagtt aatttttttt aaccatttca 2460 aggtcaacat tatttaataa taatattttt attaagtaat atttttgatt gtctacaaaa 2520 caatccatca ttatacaata atttgcctaa ttaccctcaa tgtcacttta agcagaatac 2580 tagtatcttg aaaaatatat agtcaaattt ttactattta ctgtcatcat ggcaaagata 2640 atataaatca gttagggata agtaaacaga aatggaggaa aaaattaaac agggtggcaa 2700 ataattcagg gggctaataa ttctgacttc aactgtatat gtatgtacgt atgtatgtat 2760 gtatgtatgt atgtatgcat atataaaatc catcattcca aggaacattc aaacagggtc 2820 gcaaacttgt atgcttgcaa ctatacctct agagctgtag tcaaaaacaa agttcctaac 2880 tgtgctctat tcccacttat tccccctatg ccctattaat tttcatttag aacattctaa 2940 catttaaact tggaatgatt tttaaaaaag cattaaggtc atctctctta ggtgtaattt 3000 gctttcaaag tattcgtaca gttcagtttt accgatcttc atgtttataa ttgtgctccc 3060 ttcacagtgc attttacagt tatgcctcat gacgaaagct ataggggacc tattatttaa 3120 aaagctgaat ttacgttacg tctccaaagc ttgtgaaacc aaaaaaaaaa aagaaaacac 3180 atagcatacg ttaattattt tcatttatag tgggttattt attaagtatc tgtactgtat 3240 aacacatgtg cctgtgggtc agaacatttc tgcagtggtt tgcatgtcaa attataaaag 3300 tagacttttc caaaaaattt aaatttataa gcaatatcct aaataatcat catcatcatt 3360 attattatta atattaatgt aggatttttt caaatttcct aataatccta tatgttccat 3420 ttacatacat ttcaattaat atggaaagcg agtggatgtt tttaccataa ctaaaattga 3480 atttttaaca ggttttgtgt aaaacaatat aatttgcaca aagaaattat ggggttcttc 3540 tctgaagaag ttgattatgg ctgtttagag cgtcataatg gacgttttac gctataactg 3600 acatggttta gacgatggat ctgtgactga gaaactgcgg gttttgggaa aacactcgtg 3660 actacatcgt ttttttccca aacaatgcat cgtactatga ttgtgcagct gcaagttatg 3720 tcattgtttg ttttttcctt gtttgtcgtt gcaccccaga tctgttaatg aaatgttatt 3780 tgaaatgtta aaaaaaaata caaataaaaa aacattttaa aacttatttt aatagtttta 3840 atattttggt gcttttttat ttcattacaa aagctctaag agaaaagatt ttcaagaacc 3900 ttgttcacat gtttctgcat tattacagat gtactatgtt attagtcact tattcactta 3960 tctacaatta aaaatagtga caatgtggca taactctgtt aaatgcttag tttctgttac 4020 tggtaaatat aaatatttag atgatcaata ctacctttgc tactgcctga tacgatttcg 4080 ttatctaaca tatctgtctg taaggtaaca ttggtctcac actgctctgt tctcctcaca 4140 ttacaacaca attactcttc tctaactgct gttatgtctg ttggtctgtc cacaggctca 4200 taattgtatg gctgtgatgc atcgtttgtg tgaatgtctc agaatatctt attttggcat 4260 ccaaacgtgc accgcgatca gttgtcttca tactgtttat atgtgcgatc ttcctagttc 4320 aacgtcactt acaatctggt gatttttcac atgcggaaga acatttatgt gactaaaacg 4380 actacagatg gcttaaaaat aaactttatg gtgtatttta aagaaaatgt agttcctact 4440 ttactctcat gcacattgac tcactggggt gtctaaccta caatatgctc tttaaggcat 4500 gtgctgttta tgaactagct ttaaaaacaa catacgagtc acatgagttc atatgtgaat 4560 atcactacct tagttagttg ttagttcatt gtttgctaat gtaatacaat tttcaactac 4620 tcttgaactc ctgtttaaac tgtttatttt gatggaaacg gaacacattt ctactgcttt 4680 tcagtgtaaa cacgtattag atcaaccatt acatttaaac acaggagttc cagaataatt 4740 agaaatgagt taaggataat gtgccccaag taaaatcatc ctgacaaaga aaacaagttt 4800 atgttagtca catttagctt atgtaaaaac aaacatgaac taacaagtaa ttaaactaga 4860 ttcattgtat ttagatctat tggcatttac tgttaataga tcatggtaag aaaatatgct 4920 atttgcactt aatgtaaata gccattgcca ttttcttatt ctgttggcta atattttcca 4980 aataaacgtt tttcccatgg catactataa aaataaatag cctattagtt cattaacata 5040 ccattctgct ttctcctgct atattataaa agtgataagt attaaacgtt tattgtctca 5100 attgcaggtg tctttttgtt gggttgcttt tttcaaaatg aatcgtaata ttggtgcccc 5160 attcagggta tatataggca gaactgtata aacccagatt taaaaaaaac aaggattttt 5220 aaacactttt tccagtactt atttttttta ttcaagactg acatgcagcg tactgaacac 5280 ctgaaatgta ttctcagcaa tgcactttaa attcaataag aactctccaa tttattcaaa 5340 taaaatacat tagtaatgta gtaattaata atatattcat aaataaaaat ataataaacc 5400 tacaataaat gtgcttgtaa aataaaaaat aaagtaatgt atttgtttat gttctctcaa 5460 agatggaaag agaactgaaa atttgtaccc aagtaaaaat actactacat tggtaaagta 5520 atactactag taactaatac taatattaag tgtacaagta aaagtaaaaa agtactcttt 5580 ttttaaaaaa atacttaaaa taagcaatta gttagtttaa taaaaaaact gattttcatt 5640 aaggcccttt tcaagtgtaa acttgataga atggttcaag tgtttaatca caaggctata 5700 tatatgggaa tttgcactac caaaggaagt gtggttgctt tttccttcac cataatgaat 5760 ttttcaatac cactggcaga actataaggc catagatata tttgatgtcc attttattaa 5820 acaatatttt ctactttgaa atgtttaatt aaagtatatg aagaaaagat tgaaatagtc 5880 ttttatttac acattttaca ctattttact tatttgctca ctgctttgca atatggcaaa 5940 tttgttgaat atatagcttt acagcacatg aaaattgttt tacacttgat tttaaactaa 6000 aatgatttta caaataagta cactgtcagt aatgaaataa aaaaaaacaa atgacttctg 6060 tatttgaaat atgacaacac taaaattttt gaagcagtgt tttaagagcc ttgttgttta 6120 cttgcaacat ttgcagaaat gattgaatgc acaatacctg cattccctcg tcaaaattca 6180 aattgtattt agaagagtga ctgacaggag gagagtcagt tgctgttaga caaaaaaagt 6240 ctgaaatgtg attcggtcag aatggccgga ataagaaaac aataacagac atctctatgg 6300 taaaaacatt tgaaaagcat tatgccaatt aatcatttgt tttttaaaaa tcattttttt 6360 aagtccttac aacagagaaa ggcatttgcc cgtccttaca tttcacataa aactacacaa 6420 gactgtgata aactaacact attcagttga tctgaaaccc caaatacgtt tcccttctga 6480 gttcaagatt aaactcagag ttggttgaac cacctttctg aaacagaccc cagtttgagc 6540 atcaatgtgc tgcagctctt tttagaaata aggggcagag acgagcaacg gtttatgttt 6600 gaactttaaa cacaaacaca ttgggctcta ttttaacgat ctaggcgtaa agttcaaagc 6660 acagggcaca aaagcattaa gggcgtgtct gaatccactt ttgctatttt aaggacgaaa 6720 aaatccactc tgcctcatag cgcatgatct aacagggttg agcttattct cttaatgagt 6780 tataggtgtg ttttaagaat aaaccaatca gagtttcatc tcctattcac tttaagagac 6840 tgttgtgtcg cgccttggcg cattagctat ttacatggcg gactttaaag gtgaaaaact 6900 aaacgcttca ttaccaagaa aacagttaaa tatctgcagt gtgaaaatga gagaatgagc 6960 ctcctcattt tttactttca ctatcttttg tagataagaa aacatgttgt acacacaaat 7020 caattagcct ataccgaact aatttagttt gattgcaaag atttgttctc tatctatttc 7080 taaatctatt tctaaattca aaatctattt ctaaattcaa ttctaatttc cagcaaacga 7140 ataagtgaac aattataagg aagtgtgctc aaacaactga gttatttcca aacacgtatg 7200 ctatgcccca tatggtccaa aacctgacaa gtggacaaat ctaagcttgt ttaaaaaaaa 7260 aaacaaatat acacatgcaa atggatttac tatttaaaca acatggtgga aacggcaaaa 7320 ctagtgttgc actggtctaa aaatagaaac gcgtcttgag ccttattggg ctgagtgtat 7380 gatagggccc attgtattac atataataaa acaaaactgt ttgttgtgtt tttcaatttt 7440 cattatgtaa atgtaataag ctcattctgt tattatgaac atatggttga cagacctacc 7500 tagtggttaa tcccccattc atgcagaatc aaggcatcta gtgtatataa tcatccaact 7560 gactacaggc atcaggccat cctggaaaac aagaaaaata tcttcactaa cattgtaaaa 7620 gctgcacata tactttaatt ttatagattt aaaaagtgat tgttcaccaa aaagtgaaat 7680 ttactcacta tttactctcc ctctaatgca ggggtgtcca aactcgggtc accctaatca 7740 aacacacctg aagcagctaa tcaagttctt actagatata ctagaaaccc cagggcaggt 7800 gtgttgaagc aagttagagc taaactcagc accacagcgg ccctcaagaa accttcaagg 7860 gtttcgttaa tctgttggac aaaaaaaata acagtaacca cttatcattt atggaagtca 7920 atgggtaaag gtttttaaca tttttctaaa taaatttatt tgtttggaac aaaaacaaac 7980 ttaaacaggt ttgtaaccat ttaaggatga gtaaatgaag gcagaattta agattttggt 8040 aaaaaaaaaa aaagccaaat ataaaacaag catcttatta aacgtatgga acaggcaata 8100 aatatgctga tgattcagtc tttatgagcc tcttttatgg cagaaaagag aaacatggcc 8160 attcttggat ttttttcata aagtgatgta atgattctaa acgtgttgaa taaaaaaaca 8220 taaagaaatg tgtatagact tgagaaaaaa ataaaatata atataatata atatattgca 8280 cctacattca ttaatggaga gcatacagaa aagttttttg acctggtttt agagatagga 8340 ctcgtgggtt aagtgtatta acactcaagc taagcagaga cattgttttt tgatgtcctt 8400 ttgaatcagc aataaaatgc ttaagatatt ttactctgct tttattaaaa gtatattgac 8460 ttgtttgcag tatttgctgg tatggcaatg ctactgagag tcaagaaaag tgtttaagga 8520 aaatagtcac cacttctagt aaactggaat acatttacaa agatggagtg gtcataaagg 8580 caatcagtat tgtggctgat tcaatgcacc catttagttc ctcatctgac tatttgccat 8640 ctgagcagag atactactgt ctccctgtgg tgtcaaaaac agactagtca gctggactgc 8700 ttattgtgtt ttattctatt tatttatttt atatactgtt gaatgtgaac atgttgtctt 8760 tggcatgttg tttggtgtga tgcctgctaa ataattctgc aaatcaataa caactaaaca 8820 aatatacaaa ctagacaaat aaccctacct acctactatc taacagcata agcgatgatt 8880 ttcgcctgca ccttgtagta cactcctttt tgagagccat cttgaaaccc tgagtgaatc 8940 caagtagaca gacaaacagc ttatggaaaa ttcaaattct acataagctc atatgcgagg 9000 gccacaaaga tgaaacactg ccttcattct ggttgaagat aacggatgta cctcaaacta 9060 caaaacaaaa tgatgaaaca ggtaagatat tacatattaa taataaaaat gccagttatc 9120 atcatcatca ccatcagtat tatacattta aaaaacaaaa agaacatatt tacatgaaat 9180 cacacatttt cgaaggtcaa tcatcattaa ttaatccatc ataatctgtt ctttcctttc 9240 ccaatgtatg atctcttaaa atataaagac gtttcctgga tacacattta ttaatactag 9300 cataaaatgt ttagatgtaa tttatgtgag gcctaaaaga aaagacttaa gcaaatttaa 9360 ataaaattac ataagatgaa taaatacagg acctatgcta cttggtaaaa ataggtattg 9420 gcaatgaaaa tatatatttt ttataataat tacaacattt tagagttggc tagtttttaa 9480 aagaaaagtt ttaaaagatt tgtttgacct ttgagtttct aaaattcact ctatggggaa 9540 aatattaact tgagagcatt ttaaaagata taggcaccat taaatcaaag aacataattg 9600 aatatataaa tataatcaat atacatgtat attattttaa ctgtaatatt cacattaatt 9660 tattaattac ttaacaatta taatacagtt attatttttt ttaaacatca ttgttttaat 9720 tatttattag gaattgaata taggaattga gtgaataggc ctatatatga ataatttaga 9780 caaatacaaa aatgcgatta aataataaaa tcaataataa tttaataacc ctacacagca 9840 ctttttattc atatctaatg acgattataa cacaaacaat aacacatttt aatcttcaat 9900 tattcattaa taaacctttt aatttgtcac tttttactcc gtgactagcc cgcgaaacca 9960 ggcggtgggc attatataag ttgaggtctg tgcccatcgc ggacacagcc tgggtcattt 10020 aagtggacat tcctaacgaa cataactgcc agttttcctg aggtaaacgt cttttttaac 10080 tttatcgtta tattttatta taaattggtt ataacttgtt catacaagta cctacgaata 10140 aaatggtgtt ttaaatatca agaataatgg ttattatatc gagtacgtta ttcattaata 10200 aagatgaggg gtatttggcg cttgttttcg tgttaataac aataacgtta tatacaatgt 10260 cagaaaaatg tcgcttctag taaatgtact acatttacta ttcaaaaaca gtattctcgt 10320 attttttccg tgttttctct acaatcgcta tatgtaggct atcttacgcc gctaacggtt 10380 taaacggtaa aagccgttta gcaagtatga agcattttac ggtaactaac gttagtggta 10440 acacaaatcc gactgagact ttaatttaga gtccgtcata ttatttgtta gattaaaaaa 10500 gcaaaacaat actacaaaca aacagcaggt tgtcccacat ttggtttgac tgaatcaatg 10560 tgcgtgtatt acagtgcctg tgcgtctgtg tgtgtaaaaa gagcgccagc tcattcattc 10620 attacagcca ggctatgaga ccaagaaacg ctaacgttac acttttaaat taatataata 10680 agctgtaagg tctcagtcgg atagagtgca ctctgcgttt tgcttatctt ataagtgcaa 10740 ttttaggcta tgttttcagt ctttattaac tgtgtaatct gttttctttt cagtgccgac 10800 tgcccctgct tcttgtaagt accattgatt ttatacactt taatcaatta aaaaaaaaac 10860 atttaaaact acaaatactt taaatttaga catttgcaat aaaacctaat tttgtcttgt 10920 tttttccctt attttgttaa catcagctac cccaaaggct cttttaaggg ccgaaaaata 10980 tctttggaaa tgtaagtttt aactgacaat atttacacgt ctcaatattt aggtttagac 11040 aatttagacc cgttttgacc cgggaccttt aattattatt atcatcaaat taaaattata 11100 atcttatata tttttataat ttttaaaacg tttatcatta ttataaatta taaattgaac 11160 tgattatgaa ttgaaatcgt tttttaaatc aagataagta ggcaaaaccc gggaaggaca 11220 ttgaaaggtt tgtggtgaga cactgtttag ttgcttggta tcattttgaa ataattagtt 11280 gtcaagtgaa tttgttgatt catccagtat ttatccgaca ggcttagtag cagcaaagtt 11340 ggactatgtg tttttcacat ttttataaca ttttaagtta acacatcttt gtatattttt 11400 aatagtctgt ctctaaatgt aatgtatctc tgcgtagaac ctagcctggg atcacttttg 11460 catatgcaga aaacatttat ttaaattaaa aaccgaaaat atccttaact gtttaattaa 11520 tgcattaacc cagtaagtgt tttacaatgt catgcagtta tgacacctgt cgaagtttaa 11580 tgtagcttga acacacgaag aacattatgt gacataagtg acgcatcgcg acatagtcag 11640 tgacactgaa ccgtggggtg tgcaggttac catcaggtta ccaaagctta aaccatctcc 11700 ttcatgtaaa tatcatttat ataatatgca ctaaagtctg gcctatattt gacgcttcag 11760 tcgacgttat caagttaatt ttaagttgaa aaccgtttag cggaaatgaa ttatgtgatc 11820 acgtgtaaaa agaaggtttt cacaaatcag agaactattt gatgagagaa aacgcagaaa 11880 gtttgacaaa tgtgggtttt ctaagtttat gtatagttaa cgttacatga agccatgtaa 11940 agcatttttt cccactttta tagaatgtcc ctaaataaaa ctttcacctg tccgctaatg 12000 cgagcgcaat ttgcgtcagg ctgattattt gtcaccattt acacaccaaa caatgcaata 12060 tttatttatc accttatgat ttatttctct tttaatctat aatggcaagc gcatgaccga 12120 cgcctcacgg aagccacagt tcagtgtccc acctaacgtt acttcatcat ctgacaaact 12180 acaacgcaac gctaaattta tttactaacg ttaacgtttc cttacgtttt atcactcgct 12240 tttaacccta atacccagcg catggacatg tagaaactga cattgctaat tcctatgcaa 12300 gaatgaagta aaatattttc gccaacgtta gcctacacag acttcatatg atgtccatat 12360 tttcttgatt tgtaacacat tagaatatag taatgacaat aaaatactta cctagtcagc 12420 ctttaggtgt ttgaaaacat taacaatgcc cacaacaaat gcaactacga tactgcgaat 12480 ccccgaagca tgcgagaatc tcggcgggaa ttttttcttt agcacgtttt tggcgaacga 12540 cacactgcta gtttataccg ccgcctaacg gaaaaatggt aatttactcc aagaccaaaa 12600 gtgtccctag gaaataacta aataaacact tcctaataat gataatccaa taaaatttta 12660 tttgtatagc acttttcata attactccaa aacaacttca caaaaggtgc aaatattagt 12720 tttcacatct taaccaggat tgttaataat tatggagggc actgcaaatg tcgatgatgc 12780 tgttggcccg ctaaaaatca gcatacaaga aaaaagtcat aggtgaaatt aaacatcttt 12840 aagaataaaa atgtaaagtc tactcatggc ttcttgtttg ttactggatt ggaaggtctt 12900 atcactggca ttgtgtttca cattctttgc taatatcttt atgcaatttc agatgttatt 12960 ttacatatta acatataaca caaaagattt gtttaagaac tgtgtttaat tgttaaatgc 13020 ctattttcgc agtcatcctg catctctgac ttgtcataaa tattaattat attaaaatta 13080 aataaatata tttttaattt gcagacatgt aatatctagt gcattcagtg atgtttttct 13140 ttttttcagc agaacacaga acatctgagc tttgtaagta tgttttaact gtataatact 13200 gttagattaa ttaaagctac ataatgttgt atgtacaaaa aaccggcaac catcatccca 13260 gtctcatgtt ctgactttaa caacgcattt aacaaatgcc tggctggtcc atgatggtct 13320 tatatatata tatatatata tatatatata tatatatata tatatatata tataactttt 13380 ggtagttaaa cacgtcacac agttttaaat acatggtaca taaaggcaat gataaaagct 13440 taaaagttaa actgattgaa agatgtttgc atcaagtctt ctatcacttt atgttgaaca 13500 taaaatgtaa aaattcttag atgacataaa agtagtgttg gatagagtac actttaggaa 13560 acacagatat aatcaaaata tcactctagc taaattaaat ataacttttg taccttcagg 13620 aaaatactca tgtgcaccta taagctttgg caaaacatgc ctttatcagt atcaatgtca 13680 taatgcatac acatgcacat gcacatgagt ttgcttttta atatattagc catgtgttag 13740 ccatatatta gttaataaat attattaact gtcatcacgg caaagataaa ataaatcagt 13800 tattaaaaaa gagttatgtc ttgaaaaata tctattagaa tattatgtgc tgtcatcatg 13860 acaaaataaa tcagttataa aataaatcag ttattagaaa atagatatta caactattat 13920 gtttagaaat gtgttgaaaa tgttcttata cacacacaca cacacacaca cacacacaca 13980 cacacacaca cacacacaca cacacacaca cacacagttg aagtcagaat tattagcccc 14040 cctaaattat ttgccgcctg ttcatttctt ccccaatttc tggttaacgg agaaaatatt 14100 ttttcagcac atttgttatc ataatagttt taatacctca tttaactgat ttattttatc 14160 tttgtcatga tgagagtaaa aaatattaga ctagatattc ttcaagacac ttctatacag 14220 cttaaagtta catataaagg cttaactagg taattacgtt aacaggttag ggtaattagg 14280 caagctattg tataacgatg attataatga tgtataacga cgaactgtag aatatcgaaa 14340 aaaaaaaaga tatatatata gcttaaagaa gctaataatt ttgtccttaa aatgtttttt 14400 ttaatagaaa tgtcttttat tctagccaaa ataaaacaaa taagactttc tccagaagaa 14460 aaagaaaaca ttatcagaca tacagtaaaa attcccttgc tttgttaaac aacgctggta 14520 aatatttaaa aaataataaa aataattctg acatctacta tatattttta aataaaatta 14580 ttagccatca ttatacaatg acttgtctaa ttaccctaac ctgcatatag ttaacctaat 14640 taacctagtt aatcctgtaa atgtcacttt aagctgtata gaagtgtctt aaaatatatc 14700 tagtctaata ttatttactg tcatcatagc aaagataaaa taaatcagtt attagaaatg 14760 agttattaac actattatga ttagaaatgt gttagaaaat tcttctctct gttaaacaga 14820 aattgggggg aaaaataaac tgggggttac actgtgtgtg tgtgtgtgtg tgtgtgtgtg 14880 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgttttag tatgtatgta 14940 tgtggttgtc ctattgaaca taaaataaga taattttagg agattgtccc caaaaatttg 15000 tttcttttgc taaacacaaa agaagatatt ttgaaaattt gaaaatccag aagccattga 15060 cttctatagg actttaagtt tcaactggat atcaatggct actggtttaa attttttaaa 15120 aatattttca tttgtgaaaa aaaaatacaa aggggtgtgg aaccccttta gggtgtgtaa 15180 atgaagaggt ttttgttgtt gttgaattct gaaataaagt aaaaaaggtt gcatgcgatt 15240 agtaaactaa ttattcaggg agataataga ttaaaggcaa ttacatatat tttctatgtg 15300 gttttaattc ttcttttatt catacatttt tttgtctttc aggagaattt ccaaagcaac 15360 aacatcacaa gatccaagta ataatacata cacaatacat tagaacctaa acatgtatag 15420 acctagctcg tatagaaata caaatatagc ttaatacatg tgcagtacat gtacaaaata 15480 ttgacaatag cacacattta cacacatcac aatgtcctac tggacaaaaa gaagaaagac 15540 tttggccact gtgcaagaga tagaacagga gattttgaat gaatacaaaa tacaaacaga 15600 aaacaatccc gaatctactg cacaagttaa cgcatctcaa aactttgaaa atagagatga 15660 attgccttct tgcaatatag tttcctctct tgcgaccgac cacgataacc catgttggct 15720 tgattctgat tctgaatctc tttctgacag tggtagtatc tctgatacag accaaccaaa 15780 tgcagaatct gacagcagta gcaacgattc agttaatcag ttgcaaaaaa aactgcaaac 15840 atgggcagca tctttttgca tacctctcat agctctaaca gcactactgg caatcttgag 15900 gtccgaattt cctgatttgc ccaaggatgc aaggactctt cttggcacac agacaaaggt 15960 acctgtaact aagatatgta atggggaata ttaccatttt ggcctggtca gttgtctact 16020 gaagacattg gaccatattg ccagtttgcc aaaaataaca actcttgctt tgcaatttaa 16080 cattgatgga attccccttt tcaaaagctc caaaactcaa ttttggccaa ttctcggaac 16140 gattgactgt gacaaaacac gatcgccatt tataattggt ttattttgtg gcaatagcaa 16200 accaagctca gtctgggaat acttgagaga cttcattgca gaactgagga tagttctgcg 16260 tgaaggtatt gtgaggaatg gaactcggtt taaggttgta gtctcctcct tcatatgtga 16320 tgcccctgca agggcatttg tcaaacatgt caagtctcac aatggctact atggctgtga 16380 caagtgttcc cagactggag tttggtccaa taaaatgacg tatcctgaaa ctgacgcacc 16440 tttacgtacg gacgatgact ttgaaaaaat gaaaaatgaa gaacaccatt tgggtaacag 16500 tttaggcccc ttgactggag tagtaaaaat gataacccag tttccaattg attacatgca 16560 cacctgctgt ctcggagtga caagaagact gattcaaatg tggatgaaag gcaaacagct 16620 cagaacaagg ttgtcttcac agcaaattca aaagatctca gacaggcttg ttgctctccg 16680 tccttttatg cctaaagaat ttgccagaaa gcctcgttcg ttgagagatg tggatagatg 16740 gaaggccaca gagttccgtc aatttatgat ctatactggg ccgattgttt taaaaggatt 16800 tgtcgaaaat gaaatttacg acaattttat gctcttttct gctggcatgt ttctgctcct 16860 cagccctggc cttcctggtt caatgataga aattgccaac aagattctgt catcatttgt 16920 aaaacattac agccagcttt acggcacaga tgaagttgtt tacaatgttc accaagtgat 16980 tcatcttgca gaagagtata aagcatttgg ccctcttgac aacatatctg catttccata 17040 tgagaactac ttaggaacac tgaaacggtt ggttagaaaa ccagatcaac ctttgcagca 17100 ggttgtcaga agactatcag aaatgcccac aaaacaatct cacaaatcaa atcaagaatc 17160 acatctctct tcaagtacaa aagacccaaa gctggtgcat ttctctgatt gtgttttatc 17220 accaaaccaa ggagacaact gcattcaaat tgacacaggc attgctgtaa ttaagaacat 17280 aatgaaacag ggtgaaactg tccatttagt gtgtcagacg ttcaagaaaa tggaatcttt 17340 ttacacttac ccgtgcgact cctctgcaaa tggctgtcac agagtctctg tcttgcaaga 17400 aaagactgat ttggtacctt tgtctagtgt aaaacaaaaa tatgtccttc ttccagaaaa 17460 tgaaaacagt ttttttgcgc tacccctcat ccattgtaga taaagttccc taagcataga 17520 aaacacagta aatacttaca tgtagtgtca gcttcttttg ggattttttt aataatgcta 17580 accacagact aattgcatca aaagtaacag tctgtattgc gtaatttatg tgtgtgcaca 17640 ttataataca catatcatac atgtgcctat atatatatat atatatatat acacacacct 17700 gtatgtatgt ctgtatgtat attttgcact attacaatat tttttatcaa ttgaatttat 17760 tacaatacta aacatacaat acatatttta tatataaata gttatgttta ttatactaaa 17820 tatatacaca tacacagatc ttatataata tttataccct cagaccttaa aaggtactgc 17880 cccagtgaca gctttcatac ctttatttct aaaagtgtat gtatgtaaac agtttttttt 17940 ctattcctgt tattttccca tactgacatt acataaagat aaatgcattt acacatccct 18000 tttacatatc acacatcaca ggtttgtaat gggtatggga gctgaacacg cagcaaccca 18060 taacaaacca cacatattga ttcacactga gatctagaga taagagatct tttagttact 18120 ttttttcctg tcctaaacag tgatggataa ccagtgttat gcactggtag aattccctga 18180 caagggaaca gatgttgtcc ctgaaaattg gcttttgggt caaaaatgtt attggccaaa 18240 ttacaatcct acaaaaataa agacagctgt gaagaaaaaa gaagctgtca aagacggatg 18300 gaaactattt gagcctgttc gtctacttaa aaagtgtggt aagagttgta tgcaaatatt 18360 aaattattta ttatctccat gaacttgtta ctattcaaga ttcactgaag atttttattt 18420 attttaaata aatattattt attgtctata tattattaga taccttagag aaagcacaga 18480 agatgctaaa aatgtatgtg gagtccaatc tccctacatc tgaattaact tctgatgaag 18540 agggaagtgg aaacaaaata aaaagaaaga aaaggttaac tttatatact aaaaaaacta 18600 ttaaagtgat gtcttattga taactaattt gagtgtttca cttcacaggc ctaatccacg 18660 atttattgac ttctcagaaa acgattctga tcaggacata acatcacata aattgaggct 18720 tgctgcagca cccccaatag attttacagg taatgaactt ttttattgta actattaatt 18780 gttacttttt tcagaagtaa caatatcttg ttaagggtga taaagtaatt ataatagaaa 18840 aagttacatt aaagttacat tgctagtaat aatatagtag catttctgtt ttagtctttt 18900 taacatcctg ctcattctga catttttgta gctatgaatg cttcagcaaa tctggcttca 18960 ggacagactg attcaggctg ccatggtaaa aatgcacatt ccccccagta aattcactaa 19020 ttttctgaat actccacaag ttattcccca gtttaggcat cttttcctct ttaagctgca 19080 gtcacattgg gcttttcctc gcatagactc ccattcatac gcacgcgaat gcgtcagacc 19140 ggaaacgcag ggtcatgcgt taagtttcgc aggttgctgc ggtgcaaagt tcaagcttgg 19200 tgaactctga cctgcgaaat cgcatcactt gactgcgtga gaccaatcga ggattaaagc 19260 tacggtcaca ttagagattg agcatgccaa attctgtcgt gcggcactgc gaaaagggac 19320 gggattaaac aagtttatta gacattttaa aaaaaagcga gcgattgctt catgttttaa 19380 aattctgtcc agggatgtca tgctttgatc ttttgattgt ttttctctaa acataaattt 19440 taaacattga ccaatggctt gcttttttaa atgtctaatc atctagttta atagctactg 19500 ccccttttca cagcgccgta cgacaacatt ttgcatgctc aaactctagt gtgaccacag 19560 caaggctttc taaaaatatc agtgcaggtg tgttgaggca agttggagct acaatctgca 19620 gaacaccggc cctttagggg tttgggcacc cctaccataa gttaaaacat gagcgcatgc 19680 attgaacacc acaagttctc tttcgtgata tttaaaacca tcaataaacc ttaaacactc 19740 ttttttttta ccttttaacc tattttctct attagcctat tgatttttca taatcattgg 19800 aggccgaaat tgaaaattac ttttgattaa ttgcacagtc ctgtaatatt agctattatg 19860 gcgaagtcgt aactaactta actttaaatg acttacactt tgaaacgttt ctccctacat 19920 ccaagtcttg cacaagtagc tctaatgaat gtcctggaag ttttggcgtc agtcacttga 19980 ggcaggtaaa agcaaccaat cgcattgtct cagtgtttgg gaaggggagt actcccgtcg 20040 gttaaccctt ttggaacctt tttatcttaa ttgttaacag agtattattt tattttgaat 20100 gcgattaaat tattgggtgg aacatatcaa tgattggtta atatttgtgg gatcttgtcc 20160 agaaaaaaac atgcattttt attttttaga actaaaaaaa ctttcatttg tttttagtca 20220 atccaaaaag aggacatatt gagcaccatg ggacaagcca tgtacaacac caagagtgtt 20280 gggtaagtaa gttacttaaa atgtatatat tttttattac taattgcatt attaaaattg 20340 gtttcacatt attttcgttg ttattttgat taattttcta tgaattaatt agaaattaat 20400 tgtattattc agtcaaacat agtcttatat ctttaaaaat taagtttttt taatagcaca 20460 cataacaata tttagctagc aaacagaatt actatgcaaa aaagtaatgt atgtatatat 20520 atatatatat atatatatat atatacatac atagggtata tatatatatt ttttttttat 20580 atataaatat atcaaaacaa caaatttttt cgaaacaatg acttcatatt ccatacttaa 20640 aatgtgtgtt cattcttaaa tttctaaatc aaattaattt atttaaaata ataaaaaaga 20700 aaattattta gtttttactc tgctatattt ccaatatcaa atttgtaact ttttatgttt 20760 ttcatataaa aatgtttttt catatgaaaa taagataatt ctgtgaagca tacttatata 20820 aaaaactact gtcaaattta tttgcaatat tttaggcagg tatattataa gtaactgcaa 20880 taaaatttct acaaaattaa tagtaatctt cattttctca gcctttaaat gaaataatta 20940 tagtcactaa ttactttgca attgcaccaa actcaccagg acccagccta ccttgcacac 21000 cagaacacac aatataatga aacacaccag gtcacagcct accgagcaca tgaagacact 21060 cacgaaggac accaggacag acaccatcaa gcacaccagg acagacacca ccaatcccac 21120 caggacacat atagagctca gctggacaca agatatagtc ctcacttgga cccacagtac 21180 agagctcacc aaagtacata ctacagagat caccagggaa cactgagtga actggttaat 21240 ggtaatgatt taagcaaata catctacact acacctttat gttaatttca tgctcattcc 21300 tcactttgta atgcatttta tttagtttgg acttaaacag gctaaactgt atccactaat 21360 acatcttgac atccatgttc cttgcatttt taaagagcga tacatggacc aaacctcaag 21420 tcagactgcc tccgcccttg ttcccagtgg tatgtttttc taagtgtaac ccctcaaagt 21480 ttattagaga ttgtttcacc acatttctaa tcataatagt tttaataact catttctaat 21540 aactgatkta ttttatcttt atcatgatga cagttcatca tttttcatta gatatttttc 21600 aagacacttc tatacagctt aaagtgacat ttactagctt aactaggtta attaggttaa 21660 ctaggcaggt tagggtaatt aggcaagtta atgtgtgatg gtttgttcag tagactatca 21720 aaaattagct taaaggggct aataatattg accttaaaat gttttttttt taaactgctt 21780 ttattctagc caaataaaac aaataagact ttttccagaa taaaaaaaat attattttat 21840 actgtttata tattatactg tgaaaatatc ctgaatctgt aaaacataat ttgggaaata 21900 tttaaaaaaa gaaaaaaaaa tcaaagggcc taaaaattct gtctttaact gtatgtatat 21960 atatatatat atatatgtgt gtgtatatgt gtgtatgtat gtagtgtttt ggtttgtttg 22020 tttaatctta gtctttcaat tactaatgac agaaattagc ttgagtttct ccctctacaa 22080 tatttattaa ctaattgatt aattgttttc tttgcagagt tgtctccttt taaaaggaac 22140 gttttgaaaa ttttattgga aataaaaaat gagcaacgag aacagagggc aatgatacag 22200 caattacagg gagcgccaat acccattcca cttcaagccg aaagcagctt tgctttccca 22260 gcacagtcac tggatgactt tgatgccctt gagaagatgc tgcaaaatga tgctgagaca 22320 aacagattgg tatgtctagt aatgtagtta caattatgtt tcactttttt taggtctagg 22380 acacactaga tcagggataa ccaaacttgt tcctggaggt ccggtgtcca gcagatttga 22440 actccaaccc tataattaaa cacacctgaa caagcaaatc aaggtcttac taggtatact 22500 tgaaacatcc aggcaggtgt gttgaggcaa gttggagcta aaccttgcac aatagtatct 22560 ggatgcagct ttttatgatg taagttgaga ttgcatcact tagcgatgca atgtcagaca 22620 caatgcactc aatggagtga gtgacgtcac tgtgacaggt agggttaaga gtggggttag 22680 gtgagcacat taaaaagcat tgatgcagct cagattgcac tgcaccagtt ctgcattcag 22740 acccctctca ccctgcaggg caccggatct tcagtaacga gattggtaat ccttgcacat 22800 gatgttcata tctttaaata ttatttgaaa atgttaatac atttttaaat aatagtttaa 22860 gatttgaaca tataataaca ttagtaaatg ttacattagt aaaaacaaaa gtttactaat 22920 cccattagcc tattagttcc tcatcactac agcaagagtg ctcttaggat tttgttaatg 22980 tggtggcatg ggacagagga actaataggc tcatgggatt agtaaacatt ttatatttaa 23040 tagccatttg atccttaatc aaatgctaag ttttatcttc atcataatct acactactac 23100 accataaagc atctaagaaa cctgttttga tatatttgca ttaaaattat ggtttgttta 23160 tctagattct tttggtaaat ttgtgaacaa aagttattga tgattgaagt caacagataa 23220 tgccaaaaat aaaaaataaa aatgtcaagc aacctctcag atatcaatca aaatttgcaa 23280 tattccatct agataactta actgaatgtt ttgtctggtt ttgtgtggat ttagatgaac 23340 atttatgtta atgctgatgt tgattctcat tagatttgcc agctgtcact acttggtgga 23400 ctcaacttgc aggacactgt gagacgcatg ttaagccatg ttttaacaaa caggttggca 23460 tcccatttta attgggctgg acgtggaaag aaaaggagct ttgaagcctc aaatttgcaa 23520 aagacaatgt tccgtaagta atagttggtt ttctatacaa acaatctcta aaaaatgtag 23580 acatgattaa atatattaga taaatgtgaa aaaaataaca caatgttagt ttttgttctt 23640 tttttacttt caggtgcatt aagaaatact tcccagggga aagaagcatc aaaaacagac 23700 tttagcgaag tagtaaaaaa atggttacgt tacgctccag acagagatgg tggcagtgga 23760 cgacaataag ttccaaaatg agatcttgct aatcagtttg ttttgttttg ttgtgttgac 23820 tgttgacagt ttcatctaat tttcaaaatt tttaaatgac attaaattta agaaaatttt 23880 taaataaaaa agtgtattag catcttattg tcaataaagg tgtaagtctt ttaggtgttc 23940 agttttaaac ataataaaat taataatatt agattgtttt ggtgagattt ccattaaaat 24000 tcctgtaggc ctgtaacatt cagctatatg aggtaggcta caattatatt gttttaatgc 24060 ttgtaattag aatgattaat tgagtagtaa tattgtagta attaatatat atattttatt 24120 ttatatgtag gtcagtatgt ggcgtgtttg ccgtcaagac catacagtac gtgcattcca 24180 ctgaaatgta agtttgtgca tttttaccgc tacatatgtt tattataaac actgaacatg 24240 ggagtttgtg tgtgctatta ttttttacag actcttagct tctacaatgt ttaaatgtat 24300 atccccagaa caatatctaa tagatattgt aaaagtatct ttaatgtttt tacatttatc 24360 attggtgttt ttacatgctc cttttcacag aagcattgtg tttgttgctg catgcgattc 24420 aaccaacagg agaagatctt aacgctatcc acacttgtac gatgagaatg tgagtttcac 24480 aacttctgcc agtcacaaat ccagagacca agatcctctg tcttagttaa aaaaaatcat 24540 attagggttt agttagccta cattttcttg ttttctattc tcaaagttaa tttacatgcc 24600 agctaaggat ctgttatttt acaataagtt tagtaaaatc atggcaagca gtaaaattgt 24660 gaacaaagta attttttttt taaataaact tttacaacat tttttgtaat attgtcctta 24720 tactctgtgc tcgtttttgt gattgtttta gaacttccga ttcagttagt tcattcagaa 24780 atgactagaa ataacaaatt gcagagaaca ttttattttt aatattatca ctattgagtg 24840 aaattttcct atgcatgtat gttttgatca tttgtaccat taatcttttt ggaaaattac 24900 atatatatat atatatatat atatggttta tatatatata ttgtttaaac ttgttaacct 24960 tcagtaaata tggaatgata ctttataaaa caatgcacta gatgtaacga gcatgtaatt 25020 attgcatttt aactgtgttt aatgcaggat ttctgtgatc tccagttgaa gacatcagat 25080 ttttgactgg acagatgtac caacaccaaa ctgaactatg ctttttcatg tcctatgaag 25140 gtattaatat ggcaaaaacg agcacccata atagtggaat atgtagtagc aacaaatagt 25200 cagtaacact ttccatgaaa atctatattt gtaatgcatt atgcatatat ttaaaaaaat 25260 atgaatatag gtattttatg ctaaacacat aaccttctaa ttataaaact ttcatatttg 25320 ttcttactac tttttcaata tttatggtct gcatgttttt tttttttttc ttttcttttg 25380 tggacttttc cattcaaaac gtcttttact ttaccttatg cagacaaaaa aaaagacttt 25440 atatttcttc aattgatgaa taggattaac ctataataaa aaataaaacc atatcaaaac 25500 caaaactagt taaatcttat tataataata cattaaagaa gaataaactg aaatttaaga 25560 ataaacaaat aaaaaacttc aaataagaaa ccgtttttta tttatttttg tttttgctaa 25620 tacttcacat tttaagcctt ccacatacag taggctactt gtctgtcctg tgtggtttaa 25680 cagggatcat atacctcttt gaaacgtagc cacggccgga gcaagctgca ttggcgctcc 25740 aggcaaacga cttgttgcgg acgccgccct cactattttg cctattctgc caaggtcacc 25800 tctatggacg tgccagcact gaacgtgcct tattgtaaaa ctgtagataa ttaaatgcac 25860 aataatatat ttactttcgt gtagctatgg tagtggattt gttaaatttt ctatactggc 25920 tgcagaaaaa aaagaatatt agtttactta ttcaaagtgt ctaaaaaatt gctttgcagt 25980 cagcctgggc tacagtgccc taaatatatt catacccctt gaacctttcc acatgttgtc 26040 atattaacag ttaaataaac catctgaacc atctgataaa gaacaagctt cctccattta 26100 ggctttttac tttactttaa ctctttcctt tactcctttt tttcgtggat aaggagcggt 26160 gttgtacgca ctcaattgaa gacatccatt atagcttaca taattcgttt tgtttgttaa 26220 gctctaatat ttgtttcaaa actatttcta aattcaattc taatttccag caaacgaaca 26280 agtgaacaat tataaggaag tgtgctcaaa aaatgagttt tatccaaaca cacgtcctat 26340 ggtccaaaac ccgacaggtg gacaaatcta agctttgttt taataaaacg tatatgaata 26400 tgcatataaa taatactact actactaata ataacattat acaaaagcaa gtaaataaac 26460 tgaaaaaaga cccagaaatg aaagcatggt gacagtgttt tttatgtaat agttgtaata 26520 ttttattcct ttaattgttt tcatataaag atatttgcat agtgctctaa atcctataaa 26580 acaatgtgta tgcaattaag gcgcacaaca aacagacttt gctggacttt agatctgctc 26640 tcagctggtc tattgaaaag tcttttttta gttcctcaaa atagcagggc gttaacaaca 26700 ccttaacacg ctaccttttt agaatggcac acctatgagt ccacaaagca gtgtaaatgg 26760 atttgctatt taaacaacat ggcacaaatc ttgaaaatta ctgttgagct tgggtgcatt 26820 tcccaaataa catcataact cgcggctgaa ctatcatagt atgcatagtt tagaaaaaga 26880 ataatgtaac aacgagtgtt tcccaaaacg catagtttct ctgttgcaga tccatggctt 26940 gaaccatgtt agttataacg taaaaccccc ataatgatgc tctaaactgg gtgaagtaac 27000 tgcttcttta gagaagaacc cataacttct ttgtgaaaat tattgtttta cacgcacaca 27060 catttaaaat acaatctaat attagatttg gtaaaaacat gcactagctt tccatattaa 27120 ctgaaatata tgtaaatggc acatacacat atagggtttg ctttactaat attattgaga 27180 agttgaacat tagttatatt ctattccaaa acatatcacg cacaaaagct aacacgtttt 27240 ctaatatcat ataaatcata aaaacaaata atgagtctat ttattatgaa atgaaatgta 27300 atatttatta tttattagaa atgacaaaaa tcagacttta ataattttat tgatgatgat 27360 gacgatgatg attatgatta tgatgataat tcttattaat attaataagt aggattttgt 27420 ttatttattt ttttgaaaag tctactttta caatttaaca cgttaaccac tgcagaaatt 27480 tctaaccagc aggctcatgt catctagaaa ctcaataaac aacctactat aaatgaaaat 27540 aattaactta tgctttttgt tttcacaagc tttgaaaacg taacataaat tccgctttta 27600 aataggtcac ctttagcttt cgccatgagg ctgtgttcaa aatgacatca atgttttcaa 27660 agtgcacagc gaagacagca caaatgtaaa catgaagatc gctaaaactg aactgtaaga 27720 ataggttgaa agcaaattac accttagaga gatgccctta atgctttttt acttttcata 27780 atggcaactt tgaatgttag agtgttctaa ataaatagga catatggcgg aaaagtggca 27840 atagaacaca gccagtaact gtgtcgcgaa ctagctccag agctgtagtt gcgtgtcctc 27900 aagtttgcga cgctgtttgc gagtgttcgt tggagcggcg gatttgggaa acggctaata 27960 aacaaactat ggttgtaacg acggaacttg agtctagttg gcttacgatg gttttgggaa 28020 acgcacccca ggtttgttca ataaaatata atctagccta aataaaatta agtgaaatat 28080 ttagtttcaa agaagcctat attcaactgt ttgttactgt ttgtgacaaa accatccgtt 28140 gcacatgtca attacaattt aattcgttta atcaagtatg ttttttacat ctcccaccat 28200 gcaagtacat acacacttta aaacacacaa acacacacta tttattctca ttctttcttg 28260 ctttattaat atatcgtagt gtataccgtt taataatcaa tttacattat ttataaataa 28320 ataaataaat aaataaataa ataaataaat aaataaataa ataagcggtg ttaatttgcg 28380 ccctgtcatg gttggtatgc acatgtgcaa tgtacagtcg taaggctgca ggcgacactg 28440 ctctcatctt tccaacagag tttgttctaa taaaagcctg ttctgcttat ttaaatgtgt 28500 cattaatagt aacagtttaa tatggcttct ctgaaactgt aaatatgtaa ctttaattta 28560 ggctagatta ttatttaatt taccaaacca acataatctt acggcaacta gtgttttggc 28620 gctgaagcat aggactatag cagacttgtt ttgaagcact tcacttcaaa ggttatacgt 28680 tattcttttt tatctagtag ttaactgacc aatggaaaat acataaaaat taagatacaa 28740 attttagcta acgaaattta tttaaaaaat atatattttt ccaagaaaca tatgaattat 28800 attttctagc taaattatta actgtgcggg gatgatttgt tctgttctgt ggaaaataag 28860 gatttaatta attattctgc ttcacactgc ttcacagttg ctgatgttct ttcatttcat 28920 gttcaggcaa gggaatagca acattagcag aaaagtatac atttagattt caaagaaaat 28980 atattaattt taaaaaggaa acataaaagt ttatcagaaa actttaaaga caccatgact 29040 cttcatttaa taaacacttc gacactcact ctgacaattt taagttaagt cattttaagt 29100 tataagttta catacaactt gcaggaattt tttttttgta aaaatctatt tcaaattgca 29160 tgttgcattt agtaggccta ctggcctaaa taaactttta catcacaaac aaatatataa 29220 tcattacttt tgctggtgca aattttgtgc acgcgcttga gtgcggaaaa ggctatacac 29280 atataattct gcatcttttt ttaggtgtgg ctacgcccca gttaaaaagg aatcataagt 29340 ttgatcctat tctaacgaat gcctttaaat aatgtagtaa gttgtctgca agcatcagtg 29400 ttaattgaaa ttaaaataag ataaaaaaaa aacaacaaat ttaatttgag ataaaaatac 29460 tgttatagta atatttgtga ctgctttgaa ataatgctcg ttgtagacgt cacaggatga 29520 cagatcgtgt agtctggcac atttatgaca aataaaggtt tttatcaaga aaaaaattgc 29580 atgttctcgt gtgatctgac tgtgggttaa taattggcat agttcatgac aatcaatatc 29640 tgttacacct cactgcacca ttgcaaatgt atgtttaatt ttcccccatc ttaatgacca 29700 attcccctca agtggatgca ctgtgtaata tgagctcatc atttctaggt tatatcagtg 29760 gctgctgtta atgcacaggc caggtaataa aaaaataggc ttcatattta attatgggtg 29820 ggtcacagac ggataagatc aatcattggt tctgcaaatt ttaactacct tttctagaat 29880 attaaagggc tttaagctat ttatttatat ctccattctc acacagagtt tctaagtttt 29940 ctgttctttc aacagagttt gttttgccaa cacattctgc cgaggtaatg ggctcagtgc 30000 tgcaaataca ttgcaaagca tctcctgctt attcagattt gtcattaata gtgaaaacta 30060 ttataggctt tattataggc ttctctgaaa cagtgaatgt ttcactttca ttttattcag 30120 gctagttcaa taatttattt aacaaaccaa cacaatctca ctgcaattct tgttttggca 30180 catatatagc agacttgttt ttaagcacat catctcaagt gttattaatt atccttgttt 30240 attgataact gatcaacgaa agatacatta aaaataaata aaaaattata gcaaacgaag 30300 ttagtttcaa aaatatattt tttccaagaa aaaattcaca aattatattt ttgtttgtac 30360 ccctccacta attatactta taatttagaa aaataaggat ttaatgaatg ctccactgta 30420 attgtatcac aaatctcagt tttatatttg taaacaattt aataagttat tatttttaag 30480 attatgtctt tagcatctga tttttccttg ctgctgcatg ctttcatttc tcagtctctc 30540 gatcacagca agttcaggcg agggagtgtt tatgaagaaa cccatggcag gaaaatatac 30600 atttagagtt caaagaaaat atcttagtat taaaaagcag taggcagaaa agagactctg 30660 cctgcatcag aaaaactggg tgagccacgt gtatcgtgaa aatattatga agtaaaacgt 30720 gcccctcgat cattcacagg atacttattt tcagatagca cacacaatct aaaatctttc 30780 tctgtatccc ggacgttttt aacttaactg aatgcattcg tttactggat ccttagactg 30840 aaaggtttac aggggttcac tggtttgaaa atatgtctga tcgtaaaacc tgcttctttt 30900 tatttctaag gtattttagg tatttgttat acactaattc aaattacttg gttgtgggag 30960 acggagcggg cgcagacact aatagaaagt gtcaaattat tctttaaatg tattttataa 31020 ctgaaattta attactaaac aaattgaagc attacgaatt tgccaagaaa tgtcccactt 31080 taccacggta aaataatgct atatatatac ttcctcttcg attcccggag gtataaatcg 31140 ggctaaaact ccagtatttg agcccggctt taaatacaag ataaaaaaag ctattctccc 31200 tcgttagata agcggttgaa agtggataaa agaaacacat ttagatgcaa ggtgtaaacg 31260 aaaattgttg aaggaaatgt aaaaataaat aaatagtgtt ttttttatta atgtaacgag 31320 cttattttca acattttatt tcgtatgaca ttttcaggca atatgcaact attgcatatt 31380 ttgtttgcac aacaataata tacatgtcaa caaggttact gatcttctgt acaataaata 31440 aaatgaaact ttgagtttat ttcttatttt ttgatttgtg caattgtatt catggaaggt 31500 gtgacttatt acttattatt gttacttatt attgagagaa agaatcaagt aaaagtttgc 31560 aactttaaaa atattttctt catgacttta ctgaaatatt gtacacaaat ttcattgtca 31620 ctggtgctca gttttgctac aataatgcta aatacaattg atgtacttta cacacacaca 31680 cacacacaca cacacacaca cacacacaca cacacacaca cacatatata cacgcacgca 31740 cgcacgcaca cacacacaca cacacacaca cacatagtac attttattat gcacatctta 31800 aaaatatgtt tttcttgaga aaattgtttg tctatatatc gaattcaacc tataattcaa 31860 atgttctctc tgtgtatgtt tgagtgtact gcgtgtacag aactaatata tgtaaaatga 31920 agaaattatg acagattgtg gtgtttaatt tgtcatatat actgcaatac ctaaattatc 31980 ttaaaataca tgttgaatta gaacacaaat agacaaatgt atttaatatt ctacagtttg 32040 agtaaatggt tttgactttg tactgcgtgc atagaagttt tgaatgaatg tatgctctca 32100 acgaatgatt actgctgtat atttgtttaa ataaaaaaaa tgatgacaga ttgtggtgtg 32160 ttttgtcata tatgcagatt aaattataat atgtatcctt taaaactgta ttatgtattt 32220 attcacgttg aaaatgttgt ttaagcattc atttcgacgt taaatcattt ggatagattg 32280 tatgttgaat atgtgtggat tctatgctgt tccaatcgac ttaaaatcaa catcatttca 32340 acgttagata gacaaaccaa ttaagtcgat tatctgttga aaatgtgtgg attctataac 32400 gtcgtttcga tcgactgaaa gtcaacatcg attcaacgtt gggtaagtaa ccaattgcgt 32460 cgatttcagg ttagatttga atggagtgcc tgacgtcgtt tcaacattac cccgatgagc 32520 aaaacgatgc tgaatcaacg tcggtgatca acgtggattc gatatcaaaa caaaacgttg 32580 attttaggtt gaaattatgt cattttgcta tctggg 32616 // ID BEL15-I_DR repbase; DNA; ZEB; 6132 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL15-I_DR; KW BEL15-LTR_DR; BEL15_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6132 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1379-1379 (2008). XX DR Genome; chr14; Positions 49307098 49300967. XX CC Positions [5147-5707] - Integrase core CC 'TTATT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 37..3684 FT /product="BEL15-I_DR_1p" FT /translation="MATSRDFNENQQSLRPSRIRRIPGHLDDFVLTSQPQH FT LVSPPLQEDLQHMHVSETITDPDVEYGTSTPYRPDTETTRLRKVEESVWNV FT QEQLRELQSTLKMSIQQSKGSPCMHQDLSYVSATGTGHHRSVPHPSSQQQS FT IQQRSVSLPLLCSGHLSSGNIEKPKYYSPVIPARLPTTNLPQPHSLPPVQM FT TQGLLPQTLQPLSRTASLPIEHHVQQRETRPGSAPAGVMHPPYVYQQPSDF FT HQAYPPQVQSIVHHSITSMTPPTPTVYRAAPSYPSVHVPSMTTVPSHAVNT FT NYVEPQSLTLAPPVMPHFSEPPPLRSVHYSDQYTGNYLPNRPLESISQQPS FT YATPFSSTVPPAVQAPSMMEMAIASSYGIPKPKLINFTTGKESDFALLKKG FT LNSVLGPHLHLTEDYKFQVLLDHLKLPAAFQIAKRYMYDPMPYSRAMQALQ FT QRYGQPRQLVEGEIGSILRTPSLRPGDAQGFEDFALSVSTLVGLLNTLEGS FT SRVELLCGSHVDRLLTKLPPSYRDSFVEYCMSRGILQNGSDRTYTLPQFAE FT WLENKSQALQISRRTAELYSTDKPHIDVRELKPTKPPKVHSTIYYGINHAT FT AGPSTTSSLQPNKEQTKVKKRESFKPYCPFCKCQEHYLNSCSEFVKLNVAQ FT RIAWIKDNDRCWKCGRGHEPAKCTLKKPCSSCGEQHLPILHDTASSINKSV FT LTVNTVPRTVYLDQITHSCRVMLKVVQVRLHNKGRVLDTYAILDDGSERTI FT ILPTAVHHLNLEETDESLALRTIRQEVVEIQGASVSFEVSTSAKPSIRYKI FT HSAFTAQDLNIAKQSCPVDALKHKFAHLRDIPIIPFKEVQPMLLIGSDYPH FT LIAPIAPVLMGPLGGPIAVSTSLGWAIQGPTTFLDQPLETTCLNISMFHPP FT PKDPSHYVERLWQVDILPFQPMKDMTRSKQDRDAITMLETKTKRVEIDGVS FT RYATPLLRKGNAKVLQVGPESVMALLRATERHLKQNPELAEVYNKEIHKLE FT QAGYAVKLTPEEAKVTEESWYIPHHLVHHNSKARVVFNCSYQFHQDSLNDQ FT LLPGPMLSPSLLGVLLRFRQYPVAISGEIRSMFHQVRLLPEDHQLLRFLWR FT NMEKERSPDIYEWRVLPFGTTCSPCCAIYALQRHVTDHREGNEEVVESVLN FT SFYVDNCLQSFLSTNEAKQLIDKMRALLAAGGFEIRQWASNYAEVISHLPT FT EARSEL" FT CDS 3716..6070 FT /product="BEL15-I_DR_2p" FT /translation="MQESTLGLSWHCISDKLSFRNRTICKEEPTMRNIYRV FT LASQYDPLGYILPYTTRAKILVQALWINKCGWDECIGGDMQDKWEQWTNEL FT QYLSRITIPRCYFPICPESNKPKLELHIFCDASERAYGSVAYLRLEGEDGR FT IYTSFVMARSRVAPKKQISMPRLELCGALTGAQLAKLLITELTLPINNTNL FT WTDSTTVLSWIKSESCHYKVFVGTRITEIQELTSPNNWRYVDSKLNPADDI FT TRGKPLYELSQACQWNQGPNFLQQSPEHWPIQPTSQSVEENELRKSFFCAH FT VTVPQSSLPNPEQHSTWAELVQATYQSVHGTEAPPMSAAQRIDTELILLRQ FT AQQDSFPVEIHALQNGKVINTSSRLSTLAPEYDQTLGMIRVGGRLRKAKEL FT DGDTLHPIVLAPDHPITKLIIQDYDNRLLHPGPERVFAELRRSYWIIRGRQ FT AIKKYQHQCRECRTWRSKPVNPKMADLPAARLRLHQPPFWSTGVDCFGPFT FT IKIGRRHEKRWGIIFKCLTTRCIHLDLLHSMDSDSFLMALRRFIARRGKPF FT EIISDQGTNFRGGSRELQEAFSALEPDIKEKLSEQSISFQFNPPHAPHFGG FT AWEREIRSIKSCLRVVLKDQIVTEEVLTTVLIEVEGILNSKPLGYVSSEIS FT DPDPVTPNLLLMGRRDASLPQVVYGKEELLAKRKWRHSQIIADQFWKQFIR FT NYLPNLQPRTKWHHSTPDPSVGQVVMIMDPQLPRAFWPIGRIKQAVPSSDG FT TVRVVDVTIDSRVYRRPVSKLIVLPEMPTDTSDE" XX SQ Sequence 6132 BP; 1803 A; 1513 C; 1274 G; 1542 T; 0 other; aagtggtcct tcgagccgga gtcaacatct atccttatgg caacatcaag ggatttcaat 60 gaaaatcaac agagtcttcg tcctagcagg attcggagaa ttcctggaca tctggatgac 120 tttgtactga cttcccagcc acagcattta gtttcacccc ctttgcaaga ggatttgcag 180 catatgcatg ttagtgagac tatcactgat ccagatgtgg aatatggaac ttctacccct 240 tatagacctg atactgagac tacaaggcta cgtaaagtgg aagagagtgt ttggaatgta 300 caagagcaac tcagagaatt acaaagcacg ttgaaaatgt ctattcagca aagcaaaggt 360 tcaccatgta tgcatcagga cttaagctat gtttctgcca caggtactgg acatcacagg 420 tcagtccctc atccaagtag tcagcagcag tccatccagc aacgtagtgt cagtttgcct 480 cttctctgtt cgggtcatct gagctcagga aacattgaaa aaccaaagta ttattctcca 540 gttataccag cacgcttacc caccacaaac ttacctcagc cacatagtct gcctccagta 600 cagatgaccc aagggttact accacagaca ttacagccat tgtctcgtac agcctctctc 660 cctattgagc atcatgtaca gcagcgtgag acacgaccag gatctgctcc agcaggtgta 720 atgcatcctc catatgttta tcaacagcct agtgattttc atcaggcata tccgccacaa 780 gtgcagtcca ttgtacatca tagcataact tcgatgacac cacctacacc cactgtgtac 840 cgagctgcac ctagctatcc atcagttcat gtaccttcta tgactactgt gccaagccat 900 gctgttaata ctaattatgt ggaacctcag tcattgactc tagcaccgcc tgtgatgcct 960 catttctcag agcctcctcc actgcgatct gtacactatt cagatcaata cactgggaac 1020 tatctgccta acagacccct agaatcaata tctcagcaac cgtcatatgc aacaccattt 1080 tcttccactg taccgccagc agttcaagct ccaagtatga tggaaatggc tattgcttcc 1140 tcatatggaa tacccaaacc taagcttatc aatttcacca ctggaaagga aagtgatttt 1200 gctctgctta aaaaaggact taatagtgta cttggccccc accttcatct gaccgaggac 1260 tacaaatttc aagtgctact tgatcatctt aaattgccag ctgccttcca aatagctaaa 1320 cgatatatgt atgatcccat gccttacagt cgagctatgc aagcattaca gcagcgatat 1380 ggtcagcccc gacaactggt agagggagaa attggtagca tcctccgtac accttccctg 1440 agaccaggtg acgctcaagg ttttgaggat tttgctttat ctgttagcac tcttgttggt 1500 ctcctaaata cattggaggg gtcatccaga gtcgagcttc tgtgtggctc acacgttgat 1560 agattactca ccaagctacc accctcttat agagatagct ttgttgagta ctgtatgtcc 1620 agaggcattc tccaaaatgg gtctgacaga acttacactc ttcctcagtt tgcagagtgg 1680 ctggagaaca aatcacaagc actgcagata tcgaggcgaa cagctgaact gtacagtaca 1740 gataaacccc atattgatgt cagagagctg aaacccacaa agccaccaaa ggtgcattcc 1800 acaatctact atggcataaa tcacgctact gcaggtccct ccaccacctc atcgctccag 1860 ccaaacaaag aacagaccaa ggttaagaaa cgtgagagtt ttaagccata ttgccccttc 1920 tgcaaatgtc aggagcatta cctcaattcc tgttctgaat ttgtcaaatt aaatgtagcc 1980 caaagaattg catggataaa ggacaatgac cgatgttgga agtgtggtag agggcatgag 2040 cctgcaaagt gtactctgaa aaaaccctgt tccagttgtg gtgagcagca cttaccaatt 2100 ctacacgata ctgcatccag catcaacaag agtgttctta ccgtcaatac agtaccccgc 2160 acagtctacc tcgatcagat tactcactct tgcagagtca tgttgaaagt ggttcaggtc 2220 agattacata acaaagggag agttctggac acatatgcta ttctggacga cgggtcagaa 2280 aggaccatca ttttgcccac agcagtacat cacctcaatc tggaagagac agatgaatca 2340 cttgccctta gaacaatcag acaagaagtt gtcgaaatcc agggggcatc agtctccttt 2400 gaagtgtcta cttctgcaaa accaagtatc cgctacaaga tacactctgc ctttacagca 2460 caagatctga acattgccaa gcagtcatgc cctgtggatg ctctgaagca caagtttgcc 2520 cacttgagag acattccaat tatacctttc aaagaggtac agccaatgct gcttattggc 2580 tctgattacc cacaccttat agcaccgata gctccagtgc tcatgggtcc tttaggtggt 2640 cccattgcag ttagtacaag tttgggctgg gccatacagg gtcctacaac atttctagac 2700 caaccattgg agactacatg tctgaacatc tcaatgttcc atcctccacc gaaagaccca 2760 tctcattatg tcgaaaggtt atggcaagta gacatacttc cctttcagcc tatgaaagac 2820 atgacacgat ccaagcagga tagagatgct ataacaatgc tggaaaccaa aaccaaaaga 2880 gtggaaatag atggagtttc tcgctatgca acccctctcc tgcgcaaagg aaatgcaaag 2940 gtcttgcaag taggtccaga atcagtcatg gctcttctaa gagccactga gcgccatttg 3000 aaacaaaacc cagagctagc agaagtgtac aacaaagaaa tccacaaact tgaacaagca 3060 ggttatgcag taaaactcac tccagaagag gccaaagtta cagaggaatc atggtatatt 3120 ccccaccacc ttgtacacca caacagtaaa gcccgtgttg tattcaactg ctcgtatcag 3180 ttccatcaag attctctgaa tgaccagtta ctcccaggac ctatgctgag cccatcattg 3240 ctcggagttc tcctcagatt ccgtcagtat cctgtagcta tcagtgggga gattcggtca 3300 atgtttcacc aggtacgcct ccttccagaa gatcatcagt tactgcgttt cttatggaga 3360 aatatggaaa aggaaaggag tcccgacatt tatgagtggc gtgttcttcc atttggcacc 3420 acatgcagcc cgtgctgtgc catctacgcc cttcagcgtc atgtcacaga ccatcgtgaa 3480 ggaaatgaag aggttgtaga gtctgtgtta aactcatttt atgtggataa ctgtctgcaa 3540 tccttcctct ctacaaatga ggccaaacag ctcatagaca aaatgagggc attactagca 3600 gctggagggt tcgaaataag acaatgggcc agtaactacg cagaagtgat ttctcattta 3660 ccaactgagg cacgatcaga gctgtgatct atggttgact gcgaacaaaa cagacatgca 3720 agagtctaca cttgggctct cttggcactg catctctgac aagcttagct ttcgaaatcg 3780 cacgatatgc aaggaggagc ccaccatgcg caacatttat agagtgctag cctctcagta 3840 tgatcctctt ggttatattc tgccctacac cactagagca aaaatccttg tccaagcttt 3900 gtggataaac aaatgtggct gggatgagtg catcggagga gatatgcagg acaaatggga 3960 gcagtggact aatgaactgc agtacctctc taggataacc attccaagat gctattttcc 4020 catctgtcca gaatcaaaca aacccaaact ggagttacac atattctgtg atgcctcgga 4080 gcgagcttat ggttcagtgg catatttgcg tcttgaaggt gaagacggta gaatctatac 4140 ctcatttgtg atggccagat cccgtgttgc acccaaaaag caaatatcaa tgccccgact 4200 ggaactgtgt ggcgccctta caggagccca actagctaaa ctcctaatca ctgaacttac 4260 tctacctatc aacaacacta atctttggac agattccacc actgtgcttt catggatcaa 4320 atcggaatcc tgtcattaca aggtgttcgt tggcacgcgg ataacagaaa tacaggagtt 4380 aaccagtccc aataactgga gatatgtgga ctctaaacta aatcctgctg atgacattac 4440 aagaggcaaa cccctctatg agctctcaca agcatgtcaa tggaatcaag gccctaactt 4500 ccttcagcag tcaccagaac actggccaat acaacccacc tcacagtcag ttgaagagaa 4560 tgaattaaga aagtcattct tttgtgctca tgtaactgta cctcagtctt ctttaccaaa 4620 tcctgagcaa cactcaacat gggcagagtt agtccaagca acctaccagt ctgtgcatgg 4680 aacagaagcc cccccgatgt cagctgcaca acgtattgat actgaactga tactgctaag 4740 acaagcccag caggacagct ttcctgttga aatccatgca cttcagaatg gcaaggtaat 4800 aaacactagc agtcggttaa gtactctggc accggaatat gatcaaaccc tgggaatgat 4860 tagggttgga ggtcgccttc gtaaggctaa ggaactcgat ggtgataccc tacatcctat 4920 cgtcttggca cctgatcatc caatcactaa actcatcata caagactacg acaaccggct 4980 acttcatccc ggtcctgaac gtgtctttgc agaattgagg agaagttact ggattatcag 5040 aggcagacaa gccataaaga aatatcaaca ccagtgtaga gaatgcagga cctggcggag 5100 taagccagtc aacccaaaaa tggctgatct tcctgctgct cgtcttcgcc tgcatcagcc 5160 tcctttctgg tcaactggag ttgattgctt cggaccattt acaataaaga taggcagaag 5220 acatgagaag cgatggggga tcattttcaa atgtctcacc accaggtgta tccacctaga 5280 ccttttgcat agtatggact ctgactcctt tcttatggct ttacgtcgtt ttattgcacg 5340 acgtggaaaa ccctttgaaa taatatctga tcagggtaca aattttagag gtggaagccg 5400 agaactccaa gaagcattct ctgcacttga gccagatatc aaagaaaaac tgagtgaaca 5460 gagcatttcc ttccagttca accctccaca tgcccctcat ttcggaggag cctgggagcg 5520 tgaaattcgc tctattaaat cttgtcttcg agtggtactg aaagatcaga ttgtgactga 5580 ggaagtacta accactgttc tcatagaggt ggagggaata ctgaattcga aaccactcgg 5640 ctatgtatca tcagaaatat ctgacccgga cccagttact ccaaaccttc tgctgatggg 5700 gcggcgggac gcctcccttc ctcaagtggt ttatggcaaa gaagaattgc tggcaaagag 5760 aaaatggaga catagccaaa ttatagctga ccagttctgg aaacaattca ttcgtaatta 5820 cttgccgaac ttacagccac gtaccaaatg gcatcattct acaccagatc catcggtagg 5880 acaagtagtt atgatcatgg acccccaatt gccacgagca ttctggccaa taggcagaat 5940 aaaacaagct gtacccagca gtgatggaac tgttcgagtt gtagacgtaa ctattgatag 6000 ccgagtctac cgtcgccctg tttctaaact gatagtccta ccagagatgc ctactgatac 6060 cagtgacgag taaagtgaga gaacagtgat gtttatttaa tttctaatct cctatgtaga 6120 ttagggggcg gc 6132 // ID DNA-2-15_DR repbase; DNA; ZEB; 1284 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-15_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1284 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1706-1706 (2008). XX DR [1] (Consensus) XX CC Copies are ~92% identical to the consensus. TSD is 2-bp long. TIR CC is ~40-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1284 BP; 435 A; 180 C; 238 G; 426 T; 5 other; cccttacaaa aaataactgc agtaaactga agtaaaactg cagtacaaca gcagtatata 60 gaagtataac tgcagtaaaa ttaagtacaa ttgcaaaatt gcaatacaat gaagtataaa 120 cacagttcaa atacagtaca ataaaagttc aacagcagta taagttgcag taaactggag 180 taaaaacaag taaactctgt ctttactgca cttgtgctgc tgtagtctaa tatgccaatg 240 gtgtggtaag aactgtactg cacttgtgtt ccaaggggtt tgttcttcaa catcttccaa 300 tgaatatttt gtacttttag atttcaggga ttttgttttt tggattcaag taattttcta 360 attagaattt taagagtttw atttgcaaaa acaaataata aaaaataact gcagtgaact 420 gaacagtaca acagcagtag cctatataca aggataactg tggcaaaatt caatgcaact 480 gcaaaattgc agtataatga ggtataaaca cagttcaact acaatgcaat aaagttcaac 540 agcagtaaaa tctgcagtac aactgaagta acattaaata aactgagagg ggcaatatag 600 tgatatgtta tataatttat tgcacttagg tacataagta ttaaaagtgc actttagttt 660 tattatgtat tgcagtattg tagatgtatt tctgtagtta gttatacttc tgaaaagtgc 720 aatacaatat attgtggtag tactttagca atacaactga agtactgaaa cctcatttgg 780 aaaccattgt ccaccgagca ggaaaagtag tagttgtact gcagttgtat tgtagaagta 840 cttcagttgt attgcaatag tggtatggca gttgtactgt agtagtaytt cagttgtatt 900 gcagtagcag tactgcatta tagtagttgt agtacttcag taccattgca gtagttgtac 960 tgcattatag tagtatcact tcagtactat tgcagtagtt ttactgcatt tgtactgtag 1020 tagtactkca gttgtattgc agtagttgta ctgcagttgt attgctgtag ttgtactgta 1080 gtagtacttc agttgtattg cagtagwtgt actgcagttg tattgtagta cttcagttgt 1140 attgcagtaa tactgcaata tacttaaata atgcagtttt ttcgtgtcca aaaaactgca 1200 cttttcagaa gttaactaaa actgcagtat tcctttaaaa aactgcagta ttcttaaaaa 1260 aactgcagta wttttttgta aggg 1284 // ID MSAT-2_DR repbase; DNA; ZEB; 242 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Minisatellite-like DNA - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-242 RA Jurka J.; RT "Minisatellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2009-2009 (2010). XX DR [1] (Consensus) XX SQ Sequence 242 BP; 66 A; 88 C; 55 G; 33 T; 0 other; ctgatagccc cgccctaaag gactgatagc cccgccctaa aggactgata gccccgccct 60 aaaggactga tagccccgcc ctaaaggact gatagccccg ccctaaagga ctgatagccc 120 cgccctaaag gactgatagc cccgccctaa aggactgata gccccgccct aaaggactga 180 tagccccgcc ctaaaggact gatagccccg ccctaaagga ctgatagccc cgccctaaag 240 ga 242 // ID Gypsy99-LTR_DR repbase; DNA; ZEB; 176 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy99-I_DR; KW Gypsy99-LTR_DR; Gypsy99_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-176 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1508-1508 (2008). XX DR Genome; chr19; Positions 14219218 14219393. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 176 BP; 51 A; 36 C; 35 G; 54 T; 0 other; tgtagataac tgctatgttc atatatggtc tttccggtgc ccggaagaac acagtttccc 60 tatccgggtg gttgtacaag ttagtgatgt tatgagcagc caagcacgtg aaagctaaaa 120 taaagtaagc ctattgttcc tgatactgtc tccatttttc atatcaacaa actaca 176 // ID DNA-4-2_DR repbase; DNA; ZEB; 1284 BP. XX AC . XX DT 28-OCT-2008 (Rel. 13.1, Created) DT 28-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-4-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1284 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1231-1231 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1284 BP; 456 A; 186 C; 161 G; 481 T; 0 other; gggtaattct tcaactacgg gcacttttat gtcctttgat catatatcca aaaatatata 60 taattttgtt caaattcctc tttctttgtc aaactaacaa taaaacctac tttaaaaaat 120 aaataaattt actagctttc tattattatt tttttcatat tattcaacct ccttttaggt 180 gtcaaaatca ctgttttcac cttgtcgcac acccagagtt ttaaacttca acaatgaaaa 240 taacatttta aaatattatt tatctggtat ctattaatgt atcttaatta agcactagtg 300 aaataattta aatattttat agttattaat gtgagactat tatcaatatt attttttata 360 caaaatatag ctgtcgcaca gtatcagtgt catttccacc acaacactgt aatgtcgctt 420 taaaaagttt gtgtgaaaga ttatttaggt ggtttctatg aaaatgaata gtgccatctg 480 agaacatgtt caagatggag ggaatttaaa aatttttatc aacaacactt gaagaaaaat 540 caaggtaaat attgcttgat aaggaaatac atttattcta cgtcatttcc aaacatgttg 600 taccttcaaa gatgttttta aaaaccaaac aaaattaagg taaataagag tgttttgtat 660 acatgaaagg ctagtatcaa ttcaaagcta atcggtgaag ctatttttca ttttttcaca 720 ataaactgtt gaaatgtcat ttccaccact gtcatttcca tcaccacaca catttttaaa 780 aaaatattta aaaagttctt ttcatcacac attaaaagtg tattcacaat gtataagttc 840 atgctctatt taatatacaa attcagttta tttattttct aataagctct taaccaaatt 900 aatatttaaa ttttagagtg tgacaggtgt acaattcagc atacaactaa tttatttcat 960 tgacaaatgt ataattatta tatatatata tatattgtgg aaagattggt taaactagat 1020 acaaaaatga tcttagacaa tgtttgactg ccataattta tttcattttg aaataaaagc 1080 tgtatattga gatgtggtgg aattgacatt tgttcagttc acgatggaaa aaatgtttct 1140 cttcttatct aagtttgtcc tcttacagat cttaaaacta gacaaattgt ttaaacttat 1200 gaggctatgt tacgttaatt ttgaacaaat ttttttttac tcaacttcac aaggctaaaa 1260 gtgcccgtag ttgaagaatg gccc 1284 // ID DNA-8-29_DR repbase; DNA; ZEB; 1020 BP. XX AC . XX DT 12-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-29_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1020 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2152-2152 (2008). XX DR [1] (Consensus) XX CC Copies are ~ 90 identical to the consensus. TIR is ~49-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1020 BP; 320 A; 168 C; 176 G; 356 T; 0 other; caggggcgtc gctagacccc atttactggg gcacgtgccc cagtaaaaat cgccagtgcc 60 ccagtaaatg ctttgagata cgatttactt tatatgcaag tgtatttatt aagagtggta 120 atcccagaat aaagacgtta ttctctatgt gcaaccgaac caacgctgat gaaagcaatg 180 tagtttaatc aaaaagcaaa ctgagcatgt gcgcagaaaa aaacataccc gcgcgcctca 240 cgcaccgctc ctgcttgacg ctgacgcgct gctctgctcc cggtgtaagt cccggttgtg 300 aacatgcacg ccgaaaaaat agccgcgctg ctcacgcgcc gctcatctgc gttgtgaaac 360 cggcgtaaca gtcttttatg cggaggtgtc ggcacgcagt gagttttgca agtcgacaaa 420 acaaagttta aataatatga tggtgatctt atttagacta agcatggctc ctgatagatt 480 tttttatttt tttttgtatg aagcaaaaag gagggatgat gagtcaggga ggtatagact 540 taataaacct aattctcggc catgagtcgg gtctaagaca acacacaaca gataattcta 600 taaaacatct tttttttgta ttctatagaa ttgtctatag aatttcattc taatgaaatt 660 aatattcatg caaaagattt cttaaattat cttagcatca ctccagttgt gtctttagat 720 tgttttccag ttacatttag gattaatttc tgttatttat ttagaataat aattgtataa 780 taacaataat actgttattt atttataatt atatatattt tttagtttta ttaaatatat 840 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 900 attatatata tatatatata tatatatata tatatatata tattattatt tttttatttt 960 ttttattttt ttgaggggga gctgtgcccc agtagagctt tatgtctagc aacgcccctg 1020 // ID Gypsy35-I_DR repbase; DNA; ZEB; 4464 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy35_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW gag; integrase; reverse transcriptase; Gypsy35_DR; Gypsy35-LTR_DR; KW Gypsy35-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4464 RA Dib M.R. and Naveira H.F.; RT "Gypsy35_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 800-800 (2007). XX DR [1] (Consensus) XX CC Gypsy35-I_DR is an internal portion of the Gypsy35_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy35-LTR_DR. CC Gypsy35_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1472-aa polyprotein CC Gypsy35_DR1p (pos. 44-4459) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 47..4459 FT /product="Gypsy35-I_Dr_1p" FT /note="ORF." FT /translation="MPPKKKTQLAADVDVDVDEEDQRDTSGDPVHPEDIPK FT PTGDSAITALAGMFQSFLQYQKERDERQEKESARREQQYKVLTHQITQMQM FT DLDRARHGALSSERSVARVLDYMPQLPKLQDTDDIEHFLTTFERLAEVYKW FT PKEDWAIHLVPLLTGKARSAFVAMSPAHTTDYEKVKEVILKKYEINAETYR FT LRFRALNTPSDESPMELYVRLKDLFSKWVNFKVSSKTNLMETMVLEQYMRV FT LYPEVRTWVKERDPVTAEEAAKLVESYVAARKGFSGAFRYAGSLQAARGKS FT EGSGGSSYSHSQTQIIRTTNPKPIPPKLETSLDTHQTPSSTGVVCYNCGKP FT SHTSPQCPLKKPKSARLCYLPRPTPTTDCSFSREPTITVLLNGKPLTALVD FT TGCARTLVQSQYIARDLWSENTVTVCCVHGDKADLPTAEVYIEVNKQSYLM FT NVGIAANLPYPVLLGTDMPVLADLVQETAWCGVVTRAQSQKLIQSSKDSVQ FT STLKEMPFFTEECVDNAVVSKEDRLERRRNWVADLINSSEQQQFDLAEPEM FT NENDLTIPNELAKLQRDDQTLSDCFQKADNDSVLSSLCGETFLIKNSLLYR FT QTKEEGTQLLVPRAYRREVLKLGHSIPWAGHLGFMKTLMRISKRFYWPGMY FT SEVKDYCKSCPECQMSTGRAPPIAPLVPIPAVETPFERIGVDIVGPVERSQ FT KGNRFILVICDYATRYPEAYPLREVTAKQIASALLHFFSHVGIPKEVLTDQ FT GPNFMSRTLQQVYQLLGIKRVRTTPYHPQTDGLVERFNQTLKSMLKKFVSE FT SGKDWDKWLPYLLFAYREVPQASTGFSPFELLFAHQVRGPLDVLRDSWEAQ FT DKPTKHNILSYVLKMREQLQQSSALAHQNLKDSQVKQKAWYDQKARSRSFQ FT PGDQVLLLLPTSENKLLAKWQGPFQVRRKLGPVTYEIEMPSRQHPLQTFHV FT NMLKQWHDRSSQPESPQDAVKELLVRAVQEEDEIEEQYLPVQQGNGQLDLQ FT HLTVDQQQQLLECIPDHLFLETPGRTNIIEHHIHLKEAKPIRQPVYRVPER FT LLKVMKQELELMQELEVIEPSSSEWSNPIVLVPKKDGSLRFCLDFRKLNSV FT SKFDPYPMPRVDDLVERLSKAKFLTTLDLCKGYWQIALSADSKEMTAFKTP FT FGHYHFRVLPFGLHGAPATFQRMIDQILRGTETFAAAYLDDIIIFSRSWQD FT HLQHLQEVLSRIKSAGLTIRPDKCAIAKEETCYLGHVLGHGVIRPQVGKIE FT AIKNAERPTTKKQVRAFLGLVGWYRRFIPNFSTRAVALTELTKKDKPNMLN FT WTTDCENAFIDLKEALCTEAILQSPDFGKPFTVQTDASERGLGAVLLQGEQ FT GKLHPIAYISRKLLPRETRYSTVEKECLAVKWALDSFRYYLIGRQFTLETD FT HRALTWLNQMKDTNARITRWFLAVQPFQFKVNYRTGLENCAADFLSRTPQR FT VSREGG" XX SQ Sequence 4464 BP; 1369 A; 972 C; 1027 G; 1096 T; 0 other; gttggtgtca gaagtaactc gggtgagtgt cacagaatta ccaaatatgc cgcctaagaa 60 gaaaactcag ctagcagctg atgtggatgt ggatgtggat gaagaagatc agagagacac 120 aagtggagat ccagtgcatc cggaagacat tccaaagccc acaggtgatt ccgcaattac 180 agctctggca ggaatgttcc agtccttcct acagtatcag aaggagaggg atgagaggca 240 ggagaaagag tcagctcgcc gggaacagca gtataaggtt ctcactcatc agatcaccca 300 gatgcagatg gatctggacc gggcccgaca tggagctctt tcttctgaac ggtcagtggc 360 tcgagttctt gattatatgc ctcaacttcc taaattacaa gatactgatg acattgaaca 420 ttttctcacc acttttgaga gactggcgga ggtgtacaag tggccaaagg aggactgggc 480 tattcatctg gttcccctgc tcactggcaa agcaagaagt gcctttgttg ctatgtcacc 540 tgcccatacg actgactatg agaaagtcaa agaagtcatt ctcaagaaat atgaaatcaa 600 cgcagaaacc taccggctga gatttcgagc actgaacaca cctagtgatg agtcgccgat 660 ggagttgtat gtccgcttga aggacctgtt ctctaagtgg gtgaacttta aagtgagcag 720 caagactaac ctgatggaga ccatggtgtt ggagcagtat atgcgtgtgc tgtacccaga 780 agtgaggact tgggtgaaag agagagatcc tgtcacagca gaagaagcag caaagctggt 840 tgagtcctat gttgcagctc gtaaaggatt ctcaggggct ttcagatatg ccggcagcct 900 tcaagcagca agaggtaagt ctgaggggtc ggggggaagt tcgtactctc atagccaaac 960 ccagattata cggaccacca acccaaagcc cataccacct aagcttgaaa cctcattaga 1020 cacacatcag acaccatcta gcactggtgt agtttgttat aactgtggta aaccaagtca 1080 tacaagccca caatgcccac ttaaaaagcc caaatcagcc aggttgtgtt atttgcccag 1140 acccacccct acaactgact gtagtttcag tagagaaccc accattacag ttttgttaaa 1200 cggaaaaccc ttgacagccc tggtagacac aggttgtgca agaaccttag tacaatctca 1260 gtacatagca agagatttat ggtctgaaaa cacagttaca gtgtgttgtg tccatgggga 1320 caaagcagac ttacccactg ctgaagtgta cattgaagta aacaagcagt catacttgat 1380 gaatgttggt attgcagcta atttgcctta ccctgtcttg ttaggaactg atatgccagt 1440 gctagcagat ttggttcagg agacagcttg gtgtggagtg gttactagag ctcagtccca 1500 aaagttaatt cagtcatcca aagattctgt acagagtacc cttaaagaaa tgccattttt 1560 cactgaagaa tgtgtagata atgcagtggt ctcaaaggaa gacagattgg aaagacgaag 1620 aaattgggtt gcagatctaa ttaatagctc agaacaacaa cagtttgatt tggcagagcc 1680 tgaaatgaat gaaaatgact taaccatacc taatgaatta gccaaactac aaagagatga 1740 tcaaacattg tctgattgct ttcaaaaagc tgacaatgac tccgttttga gttctctgtg 1800 tggtgagaca tttctgataa agaacagcct tctttacaga cagactaagg aagagggaac 1860 acaactgttg gtacctagag cgtatcgcag ggaagttctg aaattaggtc attctattcc 1920 atgggctggt catttagggt tcatgaaaac tctaatgaga atctctaaga gattttattg 1980 gcctggcatg tacagtgagg taaaagacta ttgcaagtca tgtccagaat gtcaaatgtc 2040 tactggcaga gcacccccta tagcaccact agtacctatt cctgctgtag agacgccctt 2100 tgagaggata ggggttgata ttgttggacc tgttgagaga agccagaaag gtaacagatt 2160 tatattggtg atttgtgatt atgctactag atatcctgag gcatatcccc ttagagaggt 2220 aacagccaag caaattgcat cagcactgtt acatttcttt tcacacgtag gcatccctaa 2280 agaggtgtta acagatcagg gtcccaattt catgagtcgc acactacaac aagtctatca 2340 gctattaggg attaagagag tacgaacaac tccttaccac ccccagactg atggattggt 2400 ggaacgtttt aaccaaacct taaaatccat gctgaaaaag tttgtgtcag aatccggcaa 2460 agactgggac aaatggctac cttacctcct gtttgcttac cgtgaagttc cacaagcatc 2520 cacaggtttt tcaccctttg aactgctctt tgctcaccaa gtccgaggtc ctctagatgt 2580 attgagagac agctgggaag cccaagataa gccaacgaag cataacatcc tttcttatgt 2640 cctgaagatg agagaacaac tacagcaatc atctgctctg gctcatcaga atctgaaaga 2700 ctctcaagtg aaacagaaag cttggtatga ccagaaagcc aggtccagat ctttccaacc 2760 aggagatcaa gttctgttac tgcttccaac atcagaaaac aaacttctgg cgaagtggca 2820 aggtcctttt caagtgagga gaaaacttgg tccagtcaca tatgagattg aaatgccttc 2880 cagacaacat ccgctacaaa cctttcacgt caacatgttg aagcaatggc atgatcggtc 2940 ctctcagcca gaatcaccac aggatgctgt gaaagagcta ttggtgagag cagtacaaga 3000 agaggatgaa atcgaggagc agtacttacc tgttcagcaa ggtaacggcc aattagacct 3060 tcagcatctg accgtggatc aacagcaaca gctactggaa tgcattccgg atcacctgtt 3120 cctagaaaca cctggaagaa caaacatcat tgaacatcac attcatttaa aagaggcaaa 3180 acccatacgg caaccagtgt acagagttcc ggaaaggcta cttaaagtca tgaagcaaga 3240 gctggagtta atgcaggaac tggaagttat agagccatca tccagtgagt ggagcaatcc 3300 cattgtgctt gtacccaaga aggatgggtc acttagattc tgccttgact tcagaaagct 3360 caactcggtc agtaaatttg acccttaccc catgcccaga gttgacgacc tagtggagag 3420 actcagtaaa gcaaagtttc tcactacact tgacctatgc aaaggttatt ggcaaatcgc 3480 attaagtgca gacagcaaag aaatgacagc atttaaaacc ccttttggtc actatcattt 3540 cagggttctt ccctttggac tgcatggggc acctgccacc tttcagagaa tgattgatca 3600 aattctccgg ggtacagaga ctttcgctgc tgcatacttg gatgatataa tcatcttcag 3660 caggtcctgg caggaccatc ttcaacatct ccaggaagta ttatccagga tcaagtctgc 3720 tggactcacc atccgcccag acaagtgtgc aatagccaaa gaggagactt gttaccttgg 3780 tcatgtcctg ggccatggag tcatccgtcc gcaggtgggg aaaatcgagg ccatcaaaaa 3840 tgcagagcga cctacaacca agaaacaggt acgagctttt cttggcctag taggatggta 3900 tagacgattt atccccaact tttctacaag agctgtagct ctcactgaac tcactaaaaa 3960 agacaagccc aatatgctta actggacaac tgactgtgaa aatgctttca tagatctgaa 4020 agaagccttg tgtacagaag ctattttaca gagtccagat tttggtaaac catttacagt 4080 ccagactgat gcatcagaac gtggtctggg agctgtactt ctacaaggag aacaagggaa 4140 attgcatccc attgcttaca ttagcagaaa actccttcct cgagaaacac ggtattcaac 4200 cgtggagaag gagtgcttgg cagtgaaatg ggcattagac tcttttagat actaccttat 4260 tggtagacag tttacattgg aaactgacca tagagccctg acttggttaa accaaatgaa 4320 agacacaaat gctcgaatta cacgatggtt tttggcagta caaccttttc aatttaaggt 4380 gaactataga actggccttg agaactgtgc agcagatttc ctttctagga caccacagag 4440 ggtgtcacgg gaagggggag gaaa 4464 // ID DNA-2-25_DR repbase; DNA; ZEB; 596 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-25_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-596 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2134-2134 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 596 BP; 195 A; 107 C; 112 G; 182 T; 0 other; cactctaaaa aacacagggt taaaaactac ccaaattggg ttgtttttaa ccctactgct 60 gggtaaatat gggacagaac acatcagggg ttaaattaac ccaagtaatt gggttattat 120 ttttaacccc ataaatgggt tgtttttcta cccaataatt gttgttattt ttacttaaat 180 aataggttaa tttgatccaa aaatgcctgc gaaagatgac ccagcaattg tgttgttttg 240 actcggcatc actgaagtac ggtgggggat gtgcagcttt gattacaggt gtacaaaagg 300 tgctagtaag aaatttttta attcagaaag gtctgactaa gggctgctat gagcataaat 360 ctcattttaa tcatctgaaa acgcaaggaa tctgcgttta tgttacacta ttttcattgc 420 atttaaacaa attagtacat aatatgtggt atcccagtca gcccctgctg tgagttgatt 480 tgacccaaag agctgggttg aaactaccca agactctgaa aataaccccc aaaaattacc 540 caaatgcctc aacccaagat ttgggtagaa aaaataaccc catatttttt agagtg 596 // ID Gypsy141-I_DR repbase; DNA; ZEB; 4223 BP. XX AC chr16; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy141-I_DR; KW Gypsy141-LTR_DR; Gypsy141_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4223 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1543-1543 (2008). XX DR Genome; chr16; Positions 22758817 22754595. XX CC Positions [1423-2001] - Reverse transcriptase CC Positions [3064-3570] - Integrase core CC 'CCAGC' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 30..1016 FT /product="Gypsy141-I_DR_2p" FT /translation="MDLAGVPSPHMNWESSNLPDAWRKFRQHAELMFAGPL FT TKRGEDEKCSYLLLWIGEKGRDIFNTWALTADEAKVLQTYYDKYEAYVMPK FT TNIIFARYKFHERIQGANESFEQFVTELRLLVKDCAYADSEEMVRDRIVFG FT IHSPRVREKLLSVGSELTLDKAMDIARSHEVAQAQLKTFANSSYNPRDQVL FT HAVTAQRESQKRAKKAEVKASHRSSNDVTERGRSCGYCGNQAHGPSEKCPA FT KGKQCGKCGKRNHFAKVCRSVCKKNMYAVSREVSHCEEDPPAEFFVDSVSQ FT KPGDMEQAFADILLGVEETEVSFKLDTGAQVNVIPHI" FT CDS 1000..3978 FT /product="Gypsy141-I_DR_1p" FT /translation="MSFHTFDKIKAQCKLSPTKHSLTGYGGQVLTVKGTCA FT LPCRYKDRETLMNFYIVNTQAPPVLGLKACLDLDLIKLVLSVNALKEDKCI FT LEEYADVFDGIGLFPGECTIHLKSDATPVVYPPRRVPLALCGRLKEELQNM FT ERQGVIIRVTEPTDWVNALVVVEKPRTRKLRICLDPRDLNKAIKRPHYPLP FT TIEDITPKLMGAKYFSVLDARSGYWAIKLTEESSKLTTFNTPFGRYRFRRL FT PFGIISAQDEFQRKIDETYEGLRGVVAIVDDILVYGQTKKEHDDNLHAMLQ FT RSREKGVKLNPEKSIVSATEVRYFGHCLSAEGIKPDPEKVSAIKNMEPPKS FT KAELETVLGMINYLSKFAPCLSDINAPLRQLLKQSSEFIWDSQHDDTFKKM FT KDLITKEPGPVLAYYDPQKKLHLQVDASKYGLGAVLLQDGKPLSYASRSLT FT ECEVSYAQIEKELYAVLFGCKRFHQYVYGREVIVESDHKPLESVMKKPLAA FT APPRLQRMILQLQKYDITIIHRPGKEIPVADTLSRKSIETEHDNLSEGMDM FT QVHMVYKSLPVSDTKLQQIRVETESDSQLIRLQQVILNGWPGERKKCPAEV FT SDFWNFRDELSLTNGIILKGEKIVIPTSLRQEMLTRIHVGHMGVEKSKQRA FT RDVLFWPGMCKQIEHMVETCDTCLEQRRSNTKEPMISHEIPTRPWQVVASD FT LFTWNHEEYMITVDYYSRFFEVDKLTNTTTATVISKLKVIFARHGIPEKFV FT SDNGPQYRSKEFEDFAKTWGFTHTTTSPYYPQSNGLAEKTVQITKSMLTKA FT MMDRKDPYLSLLEYRNTPVDSFKSPAQLLMSRRLRSILPTTNKQLLPEIVK FT VTEARCKRAKQQQHQKKYYDRSAHQLSQLRMGQPVRIQEHGLWKPAVVVHP FT ANTERSYHVRTSDGQVYRRNRRHLLCTKKEAVEPCGSTNIEENQNSNTSPM FT FPVHAEIPAQPLKVREVQPIQYHTRSGRAVRPREILDL" XX SQ Sequence 4223 BP; 1358 A; 927 C; 971 G; 967 T; 0 other; tggtgtcaga agaaaagtaa atttaaaaaa tggatttagc aggagtacca tcgccgcaca 60 tgaactggga atcgtctaat ttacccgacg catggaggaa attcagacag catgcagagc 120 tcatgttcgc aggtccactg acaaaaaggg gagaagacga aaagtgcagc tatttgctcc 180 tatggatagg agaaaaagga agagatatat tcaacacctg ggcgctgacg gcagacgaag 240 caaaggtact gcaaacatac tacgacaagt atgaagcata tgtgatgccc aaaacgaaca 300 taatctttgc cagatataaa tttcatgagc gaatacaagg agcaaatgaa agttttgagc 360 aatttgtaac cgagctaaga ctattagtga aagactgtgc ttatgcagac agtgaggaaa 420 tggtcagaga tcgcatcgtg tttggcattc actctcctcg agtgcgagag aagctgctca 480 gcgttggttc ggagttaacg ctagacaagg ctatggacat agccagatcg cacgaagttg 540 cacaagctca gctcaaaacg ttcgcaaaca gctcatacaa tccccgcgat caagtgctgc 600 acgctgtcac agcacagaga gagtcacaga agagagcaaa gaaagctgaa gtgaaagctt 660 cacatcgcag cagcaatgac gtcacggagc gcggcagaag ttgcggttac tgtggcaacc 720 aggctcacgg tccatcagag aaatgccctg ctaaaggtaa acagtgtggc aaatgtggaa 780 aacgtaacca tttcgccaaa gtatgtcgtt ctgtttgcaa aaagaacatg tatgcagtga 840 gcagagaggt ttcacattgt gaagaagacc cacctgcaga gtttttcgtg gattcagtct 900 cacaaaaacc aggtgacatg gaacaagcat ttgcagacat actactaggg gtagaggaaa 960 ctgaagttag tttcaagttg gacactggag cgcaggtaaa tgtcattcca cacatttgat 1020 aaaattaaag ctcagtgcaa actctcaccg actaaacaca gtctcactgg atatggtggt 1080 caagtgctca cagtaaaagg gacatgtgca ctgccatgca gatataagga cagagaaact 1140 ctaatgaact tttacatagt caacacacaa gcaccgcccg tgttaggtct aaaagcatgc 1200 ctcgacctgg acttaataaa gctagtgctc tcagtaaatg cattaaaaga ggacaaatgc 1260 attttggagg aatatgctga tgtatttgat ggcattggat tattcccagg ggaatgcaca 1320 atccatctga aatccgacgc aactccagtc gtgtatcccc caagaagggt acctctggcc 1380 ctatgtggca gactgaaaga agagctgcaa aatatggaga gacaaggggt cataataagg 1440 gtaacagagc ccactgactg ggtaaatgca ctcgtagttg tggaaaaacc gcgaacccgc 1500 aaacttagaa tatgcttaga cccccgtgac ctcaacaaag cgatcaaacg tcctcactac 1560 cctttaccga cgattgaaga catcacaccc aaattaatgg gtgcaaaata ctttagtgtg 1620 ttggatgccc gttcagggta ctgggcaatc aaactgacag aggagtcatc caagctcacg 1680 acatttaaca ctccatttgg aaggtacaga tttcgacgtc tcccatttgg catcatttcc 1740 gcacaggatg aatttcaacg caaaattgat gagacgtacg aaggtttaag gggtgtcgtt 1800 gcaatcgttg acgatatcct ggtgtacgga caaaccaaaa aggagcatga cgacaacttg 1860 catgcgatgt tgcaaaggtc tcgtgagaaa ggtgtaaaac tcaaccctga gaaaagcatt 1920 gtgagtgcta ccgaggttcg ctactttggt cactgcttat ctgctgaagg aatcaagccg 1980 gaccccgaaa aggtctcagc gatcaaaaac atggagccac caaagagcaa agcagagctc 2040 gaaacagtcc tgggcatgat aaattacttg tcgaagtttg ctccctgttt atcggacatc 2100 aatgcaccac ttcgtcaact gcttaaacag tccagcgagt tcatatggga ttctcagcat 2160 gatgatacat tcaaaaaaat gaaagacctc ataactaaag aacctggacc agttctagca 2220 tattatgacc cgcagaaaaa actgcacctt caagtggacg catcaaagta tggccttgga 2280 gcagtcctac ttcaggacgg aaagccgctc agctatgcat caaggtcact gactgaatgt 2340 gaagtgagct acgctcaaat agaaaaggag ctatatgcgg tcctatttgg ctgcaagcgt 2400 tttcatcagt atgtttatgg ccgtgaagtc atagtagagt ctgatcacaa gccgcttgaa 2460 tcagtgatga aaaaacctct ggctgcagct ccgccgcgac tccaaagaat gattctacag 2520 ctccagaaat atgacatcac catcatacac cgacctggaa aggagattcc agtggcagac 2580 accctctcta gaaaatccat agagacagag cacgacaatc tcagtgaagg aatggacatg 2640 caagttcaca tggtatataa aagcttacca gtcagtgaca caaaactgca acagatccgt 2700 gttgagactg aatcagacag ccagctcatt cggctgcagc aagtgattct gaatggatgg 2760 cctggagaga ggaagaagtg cccggcagag gtcagtgatt tctggaactt ccgagacgaa 2820 ctctcattga ctaatggaat cattctcaaa ggtgagaaaa tagtcatacc aacctcactc 2880 agacaagaaa tgctcacacg tattcatgtt ggtcacatgg gtgtcgaaaa gtcaaagcaa 2940 agagcacgag atgtgttatt ttggccagga atgtgtaagc aaatagaaca catggtcgaa 3000 acgtgcgata cttgcctgga gcaacgtcgt tcaaacacta aggagcccat gatttctcat 3060 gaaataccaa ccagaccatg gcaagtcgta gcttcagatc ttttcacatg gaatcatgaa 3120 gaatacatga ttacagtaga ctactacagc cggttctttg aagtggacaa actcaccaac 3180 acaacgaccg ccactgtaat cagcaaactg aaagtgatct tcgcgagaca tggtatccca 3240 gagaaatttg tctccgacaa tggtccacaa tatcgaagca aggagttcga agactttgca 3300 aagacatggg gtttcacaca taccaccact agcccctatt acccccagag caacggtttg 3360 gctgagaaga ctgtgcagat cacaaaatcg atgttgacaa aagccatgat ggacaggaaa 3420 gatccctatc tcagtttgtt agagtatagg aatacacctg ttgacagctt taagtcacct 3480 gctcagctcc ttatgagtcg cagactgcga tccatactgc caaccacaaa caagcagctc 3540 ctacctgaaa ttgtcaaagt cacagaggct cgctgtaaaa gggcaaagca gcaacagcat 3600 cagaaaaagt attatgatcg ttcagcgcat cagctgtcac aacttcgcat gggacaacct 3660 gttcgcatac aggaacatgg cctctggaag cctgctgttg ttgttcatcc agccaacaca 3720 gaaagatcct atcatgttcg cacttcagat ggtcaggtgt atcggagaaa ccgacgtcac 3780 cttctctgta caaagaagga agctgtggag ccttgtggtt ctacaaacat tgaagaaaat 3840 caaaacagca atacatcgcc aatgtttcct gtacatgctg aaataccagc acagccttta 3900 aaagtcagag aggtgcagcc catacaatat cataccagat ccggtcgagc agtgcgacca 3960 agagaaatcc ttgatttgta atttgaaatt cgtctggggt gtaggcggat cgctgcccct 4020 gtgagagaac gaaatctaaa atctggtatg ggctgtcaga aagaaatgga cactttgttc 4080 ttatattaag tgccttctgt tgcaattctg tttaatccag caccctttat gtttatgcta 4140 aagtaatgtt taataattta gatatacact tcagtgatgc aatgtgtcat tttcttttgt 4200 tgtttttttg aaaaaaaggg aga 4223 // ID L1-5_DR repbase; DNA; ZEB; 5183 BP. XX AC AL645691; XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 19-FEB-2010 (Rel. 15.03, Last updated, Version 2) XX DE L1-5_DR is a non-LTR retrotransposon from the Tx1 clade. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clade; L1-5_DR; MutsuDr2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5183 RA Kapitonov V.V. and Jurka J.; RT "L1-5_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 25-25 (2002). XX RN [2] RP 1-5183 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Molecular Biology and Evolution 21(2), 207-217 (2004). XX DR Genbank; AL645691; Positions 102474 97292. XX CC L1-5_DR is a family of L1-like non-LTR retrotransposon. This CC family was active recently (ORF1 and ORF2 have no stop codons). CC The element is incomplete (its ~300-bp 5' terminus is deleted). CC It encodes two proteins: CC a 373-aa L1-5_DR1p (positions 3-1121) and a 1233-aa L1-5_DR2p CC (positions CC 1225-4923). These proteins are most close to corresponding CC proteins CC encoded by other L1-like elements. CC This is the same as the Tx-like non-LTR retrotransposon family CC MutsuDr2, specifically inserted into 5S ribosomal RNA genes [2]. XX FH Key Location/Qualifiers FT CDS 3..1121 FT /product="L1-5_DR1p" FT /translation="GRKYLKEATVIINVEDVQEVRAEDIIKAVIEKCGHGK FT ILALRPRQGKEYELTMETEELCEQLTDKLMINGVNCEIKKLHNRDFVVSFM FT HLPVYLEDKEILEKLDTWGVCPISKIKRRFYPGTDIEDGTRFLKVRFPKEV FT ASLPYSTKIETAEGTQYFRVMHSHQVKTCRLCMSPEHMLKDCPEFKCYKCE FT ERGHFARDCITVRCPECKNFLNKCECWMEGEEGGGEDQDRQVHEENNKREE FT DNEAIERTMEEEMNNNEKKEGENENEENEVQLSGQDGQWSQMEISDSFKTF FT LSTVEKEELRKTGQAEGMANDEETRMENVDKNKVERGFLRRRTFKVKPNLD FT MAKKKVLTKNRAKYVNRYEVLRGLEEDEEK" FT CDS 1225..4923 FT /product="L1-5_DR2p" FT /translation="MRMCRFLISFLFILMVLNCVSFNARGLMGKDKFEKIR FT EKCKNKEVIGLQETNWKEHAMNDFKKLWQGDIIYNNGDGKSGRGVAFLIRK FT DVLNFKRIVYRDNYGKCLVIEVNYEGHELIIANIHAPTEDKDKKEYFEVLN FT KVLKGFKEIIMMGDFNTVFRKQDMADGMVFKSDIGRKQLNELIKENSLIDI FT WRERNREKREFSRRQIVGNFICQTRIDFILCTRNIENFIENIQYEENSFSD FT HKLLHFKVNIENIQKGPGTWILNTTILKNQDYVQKVKEIIENEKENRMYDE FT DKRIWWENTKYQIRKYTIKYCAVLQRCKKYTEKEVKRSLEKELNKENKDIE FT KIKEIEQKLRDLEEDKYKGAMLRSRSKYTVEGEKCTKFFFDLEKQRGKAGI FT LKEIKGRNGKIAKGNIEILEEIKHFYEDLFKAKGIDEEKERKILNYIKVKL FT EKHDNKECDREIEEEEIEIAINQLNKKKSPGIDGIGNEFYIVFKDILKGIL FT KEVFKDIFKCKEMNERMGMGLMKLIYKRKGAKTELQNYRPITMLNTDLKIL FT AKVLANRLKEVMSKLIKSNQAYAIKGRDIADVTMSIKSTIDYLQEKKMNGF FT LISVDFEKAFDRVEHTYLFDVLKTFGFGENFINWIKILYKGAFTKVKCNGF FT LTDCFKITRSIRQGCPLSALLYSLIAEPLGLAIKQETKIKGIKIEEEEDEG FT KIYQYADDTTIIVKEKKSVKEAMKKVQEFCKGTGSKINENKTQYMRFGKAD FT ILTDCFQFREVEELKILGILIGKNERKATEKMWDDLIRGIESRLNFWRMRE FT LCLKGKALILNVLMTSKLWYKLYVTEMPCWIEARLKKCVQDFLWEGKPPRI FT AYNTIIGATEEGGIGLMDIKQRKNCLRVKIVKKLLQEENSTEWKKVMKYFL FT NKVGNFNLGEDILWLKTKNWMTEKLPGFYQEILSAWGKFLDGVFYQVKGRE FT NLLNQPLFLNKSILKEGKELFFKKWMDVGILRIRDVLYEFKKGFLTKQYIV FT DLMEEAKEEYSVKEIENKLETVKGAIPKEWITRIENMEECGNEKVIHVYLK FT GKLCNFKDCLLKDFYVYFRDSVFQEPIANNFWVQRLNSVKKENIWKNMRGK FT IIETRLECFEYFIRHKAIFTECILTKIHIEQNATCKVCFQEDEGILHLFLY FT CKELECFYKKCQKMLKDLLKDWDEEQLEWNTLVMFGWNMQNKNKKFVNLLI FT MMIKKCVWERRNVAKQEKVVLNV" XX SQ Sequence 5183 BP; 2237 A; 441 C; 1130 G; 1375 T; 0 other; gtggaagaaa gtatttaaag gaggcaacag ttatcattaa tgtggaagat gtacaggaag 60 tgagagcaga agatattatt aaagcggtga ttgaaaagtg tggacatgga aaaattctgg 120 ctttaagacc aagacaaggg aaggagtacg aactaacaat ggaaacagaa gaattatgtg 180 aacaattgac tgacaaactg atgataaatg gggtgaactg tgaaattaaa aagcttcata 240 atagggattt tgttgtttcc ttcatgcacc tgccggtcta tcttgaagat aaggaaattc 300 tggaaaaatt agatacttgg ggagtttgtc ccatatctaa aattaaacga aggttttatc 360 cgggtaccga catagaagac ggaacaaggt tccttaaagt gaggttccct aaagaggtgg 420 catccttgcc atacagcaca aaaatagaga cagcagaagg aacgcagtac ttcagggtga 480 tgcacagcca tcaggtgaag acctgtaggc tgtgcatgag cccggaacac atgctgaaag 540 actgtccaga attcaagtgc tacaaatgtg aggaaagggg gcattttgca agagactgta 600 tcactgtcag gtgcccggaa tgtaaaaatt ttttaaacaa gtgtgaatgc tggatggaag 660 gagaggaagg aggaggagaa gatcaggacc ggcaggtgca tgaagaaaac aacaaaaggg 720 aagaagataa tgaagcaata gaaaggacaa tggaggagga aatgaataac aatgaaaaaa 780 aggaaggaga aaacgaaaat gaagagaatg aagtacagtt atcaggacaa gatggacagt 840 ggtcacaaat ggaaatatca gacagtttta aaactttttt aagcacggtg gaaaaagagg 900 aactaagaaa gactggtcaa gctgaaggaa tggcaaatga tgaagaaaca aggatggaaa 960 atgtggataa gaacaaagtg gaaagaggat ttttaagaag aagaactttt aaggtaaaac 1020 caaacttgga catggcaaag aaaaaagttt taacaaagaa cagagctaaa tatgtaaata 1080 ggtatgaagt gttaagggga ctggaagaag atgaagaaaa atgatgtttt taaaatattt 1140 tatctatttt aaaatgcttt taatctttgt ctcttttaac ggaaggagtt gttggttgag 1200 agcgttggaa ggagatgaag gacaatgagg atgtgtaggt ttttaatttc atttttattt 1260 attttaatgg ttttaaactg tgtgtctttt aatgcaaggg gactgatggg aaaagataaa 1320 tttgagaaaa taagagagaa atgtaaaaac aaggaagtga taggtttaca ggaaacaaac 1380 tggaaagaac atgcaatgaa tgattttaaa aagttgtggc aaggggatat aatatataat 1440 aatggtgatg gtaaatctgg aagaggtgtt gcttttttaa taaggaaaga tgttttaaac 1500 tttaaaagaa ttgtgtatag ggataattat ggaaaatgtt tagtaataga ggtgaactat 1560 gaaggacatg aattaataat agcaaatata catgcaccaa cagaagataa agacaaaaaa 1620 gaatattttg aggttttaaa taaagtatta aaaggtttta aagaaataat aatgatggga 1680 gattttaata ctgtttttag aaaacaagat atggctgatg gaatggtttt taaatcggat 1740 ataggaagaa agcaactaaa tgagttaata aaggaaaata gtttaataga tatatggagg 1800 gaaaggaata gagaaaaaag agagttttca aggagacaaa tagtggggaa ttttatatgt 1860 caaacgagaa tagattttat tttatgtact agaaatattg aaaattttat agaaaacata 1920 cagtatgaag agaacagttt tagtgatcat aagctcttgc attttaaagt gaatatagaa 1980 aacatacaaa aaggaccagg gacttggatt ttaaacacaa ccattttaaa aaatcaagat 2040 tatgttcaaa aagtcaaaga aataatagag aatgaaaaag aaaatagaat gtatgatgaa 2100 gataaaagga tatggtggga aaacacaaaa tatcaaatca ggaaatacac aatcaaatat 2160 tgcgcagtac tacaaagatg taaaaagtat acagaaaaag aagttaaaag atccttagaa 2220 aaagaattaa acaaggaaaa taaagatatt gaaaaaatca aagaaataga gcaaaaatta 2280 cgagatttag aagaagacaa atataagggg gcaatgttga ggagcagatc taaatacaca 2340 gtagaagggg aaaaatgcac gaaattcttt tttgatttag agaagcaaag aggtaaagca 2400 ggaatactaa aggaaattaa agggaggaat ggaaaaattg cgaaaggaaa catagaaata 2460 ctagaagaaa tcaagcattt ttatgaagat ctatttaaag caaaaggtat tgatgaagaa 2520 aaagaaagga agattctaaa ttatataaaa gtaaaactag aaaaacatga caacaaagaa 2580 tgtgacagag aaatagaaga agaagagatt gaaattgcaa taaatcaact aaataaaaag 2640 aaaagtccag gtatagatgg aataggaaat gaattttata ttgtttttaa agatatttta 2700 aaaggaatac ttaaagaagt ttttaaagac atttttaaat gtaaagagat gaatgaaaga 2760 atggggatgg gattaatgaa gttaatatat aaaagaaaag gagcaaaaac tgaattacaa 2820 aattatagac caataacaat gttgaataca gatttaaaga ttttagcaaa agttttagcg 2880 aatagattaa aggaagtaat gtctaaatta attaaatcaa accaagcata tgcaataaaa 2940 gggagagata tagcagatgt aacaatgagt attaaaagca caatagatta tttgcaagaa 3000 aagaagatga atggtttttt aattagtgta gattttgaaa aggcttttga cagagtagaa 3060 catacttatt tatttgatgt actcaaaaca tttggatttg gagagaattt tattaattgg 3120 attaaaattt tatataaagg ggcttttaca aaagtaaaat gtaatggttt tttaacagat 3180 tgttttaaaa ttacaagatc aattagacaa ggatgtcctc tttcagcact attatattcc 3240 ctaatcgcag aaccattagg gttagcaata aaacaagaaa ctaaaattaa aggaataaaa 3300 atagaagaag aggaggatga aggaaaaata taccagtatg ctgatgatac aacaataata 3360 gtgaaggaga aaaagagtgt aaaagaagcc atgaaaaaag tacaggagtt ttgtaaggga 3420 acaggaagca aaataaatga aaataaaaca caatatatga ggtttggtaa agcagatatt 3480 ttaacagatt gtttccaatt tagagaagta gaagaactga aaattttagg aattttaatt 3540 ggtaaaaatg aaagaaaagc aacagaaaag atgtgggatg atctaataag aggaatagaa 3600 tcaagattaa atttctggag gatgagagaa ctttgcttaa aggggaaagc cttaatatta 3660 aatgttttaa tgacatcaaa gctatggtat aaattatatg taacagaaat gccatgttgg 3720 atagaagcga gattgaaaaa gtgtgttcaa gattttttat gggaggggaa acccccaaga 3780 attgcgtaca atacaataat aggagcaaca gaagaaggag ggataggatt gatggatatt 3840 aaacaaagga agaattgtct tagagtgaaa atagttaaaa agcttttaca agaggagaac 3900 tcaacagaat ggaaaaaggt tatgaaatat tttttaaaca aagttggcaa ttttaactta 3960 ggagaagaca ttctttggtt aaaaactaaa aactggatga cggaaaagtt accagggttt 4020 tatcaagaaa ttttaagtgc atgggggaaa tttttagacg gagtatttta ccaagtaaag 4080 ggaagagaaa acttattaaa tcaacctttg tttttaaata aaagtatttt aaaagaaggg 4140 aaggaactat tttttaaaaa atggatggat gtggggattt taagaataag ggatgttctt 4200 tatgaattca aaaagggatt tttaactaag caatatatag tagacttaat ggaagaagct 4260 aaagaggaat acagtgtaaa ggaaatagaa aataaacttg aaacggtcaa aggtgccata 4320 ccaaaagaat ggattacaag aatagaaaat atggaagaat gtgggaatga aaaagtcata 4380 catgtctatt taaaaggaaa gctttgtaat tttaaagatt gtttactgaa agacttttat 4440 gtttatttta gagatagtgt atttcaagaa ccaatagcaa ataacttttg ggtacaaaga 4500 ttgaatagtg tgaaaaagga aaatatatgg aaaaacatga gaggaaaaat aatagaaaca 4560 agattggaat gttttgaata ttttataaga cacaaggcaa tttttactga gtgcatttta 4620 acaaagatac atatagaaca aaatgcaaca tgtaaggttt gttttcaaga agatgaagga 4680 attttacacc tgtttttata ctgtaaagaa ttagaatgtt tttacaagaa atgccaaaaa 4740 atgctaaaag atttattgaa agattgggat gaagaacaat tggaatggaa tactctggtg 4800 atgttcggat ggaatatgca aaacaaaaac aaaaagtttg taaatcttct tataatgatg 4860 attaaaaaat gtgtgtggga aagaagaaat gtagcaaaac aggaaaaagt tgttttaaat 4920 gtgtgaaatg tactaaaacg aaaaatggaa agatacatag aaagactgta ctgctatttt 4980 aaaggagaag atatgttacc aagcttttat gatgtgttta atgatgaagt gtataatgtt 5040 ttaaatggtt taaagtggaa aatgcccaaa gaggatgtta tgttatgaag aatgttgatt 5100 ttaatgtaat tgttttgtga aatttcttga agtatttaaa tgttaatgta atgtgattaa 5160 attttaataa aaaataaaaa aaa 5183 // ID EnSpm-N2_DR repbase; DNA; ZEB; 4015 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE EnSpm-N2_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW EnSpm-2_DR; EnSpm-N2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4015 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(10), 1262-1262 (2008). XX DR [1] (Consensus) XX CC The element is characterized by 2-bp TSDs, mostly TA, and 16-bp CC TIR. It is putatively classified as EnSpm superfamily because its CC similarity with EnSpm-2_DR in their termini sequence. EnSpm-N2_DR CC is a composite element, containing insertions of several other CC TEs (masked out). CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 4015 BP; 1343 A; 561 C; 648 G; 1463 T; 0 other; cactgtaaac cctaatgctg taactaatta attgagttga gttctattaa cttaaatcat 60 taaaattcgt ttatcttaat taaccttgat gagtatttag aactttttct tatttttgag 120 tttgtaaaaa tttaatcttt caagtaaggt gaacaattct gctgggttta atctatacaa 180 aatgtctctt taaagtttca ttaactcaaa ttctgtcagc ctagtgattt tgcgtctgac 240 gtcatccagg ttcacgtacg tgctcatgct tcattcgttt ttttcaagat ggcggatcgg 300 cctcgcttca cactacagta agtaaaatat ttacaaatcc aagaaatatt aaatgtatgt 360 tttaattaat ataatcacag taacagttca atgtaatgta gagttaatca gggtgcgttt 420 acatgtcgta cattacagaa aacatgttta gcctcacttg ttgctaacta acgttaacat 480 tactgtttaa aaagttagaa tcagtgcaat gttagcgtta gttgtaatgt gttaactctg 540 aattttaaca gagtgtgata aagacaaaaa tatttaaaat cggcagatta catttttcaa 600 aaaaatttaa atcttggcgc gagtatgcct tttctcagcc tgtgctgaag ttgtcatgga 660 gacgtgcgct ctcacaaccg catgtcatgg cgctgcgctg aactccatcc agtctgtgaa 720 tgtggtatta tcttcgcgat ctgtaggcac ttcggaaacg aaaggcgcaa gaaacgtaac 780 gttaatcata actcaaatga tctccgccta aacacttaac ggtattttct gatgacgcgc 840 aaaaacgtag tctcaaaact taactctgtc acacgcagct tgttgcttgc tgatccatgt 900 atgaggatag cctgacggag aacatttaag ctgcgctcct cacctgagca ggtacagagc 960 agagcttcat tttacactac cacattaatc accagcactg atcaacactt aatatttcct 1020 cgcgcttaag acgacatctc tgcatatcaa tgattacctg attaatgcga tcagttaagt 1080 gagtttaaat gtcatttagg tcaaatgtat tttaaaggta tgataaaagt tttttttttt 1140 tttttcaatc tattttttaa aatgtctact ctggagagaa tataaacagc ctaaaaaaca 1200 aagtatgcat atctataaca cattaataca aaattatgtt attaaataaa ataataacaa 1260 aacaaaatat tcaagacgtg gtaaagaaga cagttttctg atgtgcaaat tttgtagctt 1320 aaaatatatt tatatcttaa aaatataaat attttagtct atatattttt acatcattag 1380 acaataaggt aagttttgtt atttctagct tttaaaatca ggccttaaat ttctttgtcg 1440 tcctattcag ttttagaaat ttcaatactg tttaaattaa atctgtgttc atcgaattaa 1500 agcaactaac atgataatca gctctattgc taaccgtttt ggaggttaac gttagacatc 1560 aggcaattgt actacttcca ctctcagcta aatgacactg tgactttgtg tacatacaga 1620 attgaaggtc aggtgttgct gactacagat aggttcttaa atggaggcat gtgaaaggtg 1680 ggtaaaatat ttattttgta ctactatgcc tggcatatcc agataaaata aatattacag 1740 tgtatttatc aatgatactt ttgcagcaag gaaagtgaaa cttcagctag accttgagtt 1800 cagtttacaa catgaagatc ctgctttcaa catctgatca acacctgatt ttcatctgag 1860 gtaaaaaata gataagtaaa tgaaacataa taataatatt aatacatttt ttctagaatt 1920 gtatattttg tggtgtgggt ttgttttatg aaattctctc ttgtttcttt ttgctgtttg 1980 cagagtacca acatgttaca aatgagaatt ttccccacaa attctttgca aaactggacc 2040 accacctatt tcatttgatg acaattttaa ggcaaacagc atccaaaact ggcaagacag 2100 cagataccct ggctaatctt ttaaaggttt atgatgagca ggtatgtttg gggtcacgtg 2160 ttaaagtata cattaaaata taatttgaat tattattatt tttttaattt ttaaaaaatt 2220 gattcttgat ttccaggaac tgaatgatgt cagttcatga cggactactg ttatcagagg 2280 tcttttggtc ttgctgcgtg agcgtgactt aagattcttt aggaacacca tggtaagact 2340 tcattcattc ttttcacaat taggcatgtt taatgaaaag atgaaagttt tgaaaacttt 2400 tacttaataa catatttgta atcaagaata gttgatgaat catataatat tggctcagtc 2460 attttgatac tgtccttaca ttaatatcct tgaattatgt ttttagattg ataatcctgc 2520 tgactgaaga tgctttattg gccctgcatt cactgagggt ctctgttgtc ctgaagaatg 2580 aattggacac cacccacagc acacttccag acctttcttg tcttgtatgg attaatgtat 2640 gctcttcaca taatatatag taaaggactt tgaacttgtg caaaaagttt tgcttggcat 2700 ggatgatgga aaataaaaag tctcgtagca tggaaacctt aaagaatgag ttgatgtaga 2760 gctttttttt ttttttttaa cttaataaag actgtctctg gggtagtaac ttcaaaacat 2820 ttatttacct ttaattggta agtatgcatt ttattatata tagggatagt ttgtccaaat 2880 gtaaaattct gtctgacaga atcatagttc aaccaagact gaatatatat atatatatat 2940 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3000 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 3060 atataatctg ctctccgtta aacagaaatt gggggaaaaa ataaatgggg gcgaataatt 3120 ctgacttcaa cgatatatat atatatatat atatatatat atatatatat acagtttctg 3180 tgtagtttac actaaacgag tcacttcaca gaatgtgcac ttattatttt tttttaacga 3240 atgccttaag agtaagtttt acacttatta atgtattatt aaattttatc aaacttatag 3300 ttgttttttc ttgaggctct tatgtctgat atcatattta cattgtgggt aatttttatt 3360 aatttcagaa tagtgttcat tttcaatctc tttatgtcta atagttttta tcgttttaag 3420 acatttgtaa ttgcatagga gacaaatttg aaagtactgt tggacatttt ttctttaaca 3480 ataaatgtct aaaatggtat tgtctgtgtg ttttaaatta cagatttatt ttgacagatt 3540 tatggggttt aaagattaaa ataatggcta attttatttg tgaaactgac gatggagaag 3600 ttaattttaa aaaatgctat caaaatgtat gagctgggtg actaagtatg taaaggtgag 3660 ataaataatt aaaaataagt taattgctat taaaatgtat gagctgggtg atttaagtat 3720 ttaaagttga aagaaattaa agcaattaag ttaattggta ttaaaatgta tgagctgggt 3780 gactaagtat ttaaagtttg gaaaaattaa aacaattaag ttaattgcaa ttaaaacgta 3840 taagctgagt aacttgggta ttttaagtta ggtgaagtgt cttgcatgag tacaacaaac 3900 tcaaaactta atgtttctgt ttacttaaaa tagataattt cataacttaa aaaatttgac 3960 gtaactgatt acctcaaatt ttttgagttt tgtcaactta ttcgggatta cagtg 4015 // ID DIRS-9B_DR repbase; DNA; ZEB; 7179 BP. XX AC . XX DT 07-JUN-2010 (Rel. 15.11, Created) DT 07-JUN-2010 (Rel. 15.11, Last updated, Version 2) XX DE A family of DIRS-type LTR retrotransposons - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-9B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7179 RA Kojima K.K. and Jurka J.; RT "DIRS-type LTR retrotransposons from zebrafish."; RL Repbase Reports 10(11), 1839-1839 (2010). XX DR [1] (Consensus) XX CC This consensus is generated from 8 sequences with >90% identity. CC The consensus is ~85% identical to DIRS-9_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 852..2336 FT /product="DIRS-9B_DR_1p" FT /translation="MTSHLTDQDYNQTNYNPQSVPPQPYSTSLSWPPAPIS FT SSSNSPNFPSTSATHFLPSNTVPPAISQPQSFPPSLQPPASCSIPFQPATH FT PSCFPSSSLPPVLTNTNPTQQIPTPAPVSASRPPFTLSSATPLPPPNNALA FT MEPPQVSNAARNQILSGADVDLITLLSPITPPTAERQVDCGEYSLTLKQPH FT SSHSRILSLAEFNIAFSRYTDIICSVFPHRRRELNDYMAIIAELALSYGGT FT HFYTYHKLFSAKCAIRVTQWNQCPYWGALDTDLHNRVFLGCRNLSCAVCRS FT NLHPTTSCPFIIPSSDTQTAVKSSSYTPRPPNHHIPPLLSSSSFHPPPNRD FT TCQNFNIGRCHKNPCKYLHICSYCGGAHAKIVCSVWKSLNKRSKNYLSTPI FT NVPHLAHELHSHPDTNFSAFLISGLTHGFHPGVSALPSYNLFCPNLQSANT FT EPETVDLLIKKEIENKFMIGPFSAPPFTVSRISPIGIATRKFSDKKTSNH" FT CDS 3816..4604 FT /product="DIRS-9B_DR_3p" FT /note="tyrosine recombinase." FT /translation="MQSSSIRSYLSGIQFFHKLIHGSPSDAILNSQTSLLI FT KGIQKTHPHPPDPRLPITLNILTKCIHTLRKGYQSINTARTLDAMFTLAFF FT GFLRCSEITTTSKFNPSIHPTISDLTVLDKETISFFVKQSKTDQIRKGHHI FT YIFDIPSPTHHFQTLLAFLQFRKLQDPNPLSPLFTDDYNRPVTRFWFQKHL FT KEILRLSGLSPDSFSSHSFRIGAATTAAHNGLSQSQIQALGRWSSDAFNSY FT IRFSRPHLKEAQSSLTRRLSPS" FT CDS 2506..3618 FT /product="DIRS-9B_DR_2p" FT /note="reverse transcriptase and ribonuclease H." FT /translation="MPIHPDFWHLFGIRWRSKFYFSVRLTFGCRSSPKIFD FT TLSEALCWILSNNYGVPYLIHLLDDFLIISPPSSPPAKHLAITQKVFADLG FT IPLAEEKTSGPSTSIEFLGVKLDSNKFEASLPQEKIDRIISLSLIFLEKQI FT CTKRELLSILGHLNFAMRIIPQGRPFISHLLQLAASVNSLEESILLNKPSR FT EELSLWISFLKQWNGCSFFYSDLITSPVDISLFTDAAPSVGFGGFYQGHWF FT ASPWPHQMLNIPQNQYSSALFELYPIVVAAILWGHQWSASCILIHCDNEAT FT VQCVNKGRSHSPPLMPLLRRLVWTSANKQFIVVAKHVPGCKNQIADSLSRF FT SFQKFQELAPEADPHPTPVPPYSEMILP" XX SQ Sequence 7179 BP; 1730 A; 2521 C; 1040 G; 1886 T; 2 other; taagtgtggt ttattcataa actaatttcg agaggatcac gtgcttataa tcaacacggc 60 tggctcctta ttagctccgt aatcagccct attagatgat tacggaagca ttataaataa 120 cctgagttta tcactccagt tatcttcgtc ttgaagaagc tcagtccgcc tcgctacaag 180 caacagcctg actaagcaca agcaaacaag cacaagcaac aaacaacaaa cgaacaacga 240 acaaatcaac atcttcaaca ctcctacaat actatacaaa ccaacggaac accagctatc 300 aacatggaaa acaacggcaa caataacacc aacaacaacc aagaaaacat ccaccaagaa 360 caccccgctc cacccgagac gcccgcggcg ctccactctc actcagcccc gcgaggcaac 420 atactagaag ccgaagcctc taccagaggc cggaggatga ctcgctccac ttcagcccgc 480 acataccaaa ccggatccgt ttctcccgca gcacaaagac aacaaataag ctcacctacc 540 tcatcatatg catcggcctc ctccttcatt cacccgacga aaagaatcac agtcaacgaa 600 ctccggaaac tcttgcagac ctcggcatcc acgctcctcg ttcactcaat aaaccggaac 660 ttctgaaact ctactccaac gccacatcgg actctcaccc ttcccataca tcacctacaa 720 gaggaaacaa tagacaatcc agcgatcgcc ccactccgta tccagcgcaa ggaaagagaa 780 aggaaaaatc aaaacacgca cctcgccgca ctcccgccag accacgcccc caggaaccac 840 aagcacaaca catgacgtca cacctcaccg accaggacta caaccagacg aactacaatc 900 cccaatctgt tcctcctcaa ccctactcta cctctctttc ctggcctcca gcccccatct 960 ccagttcttc taatagccct aatttcccct ctacttcagc tactcacttc ctcccttcta 1020 acactgttcc tccagctatt tctcaacctc aatcttttcc cccctctctc caaccccccg 1080 cttcatgctc catccccttt caacctgcta ctcatccttc ctgcttcccc tcttcatcac 1140 tcccccccgt acttacaaac acaaacccca ctcagcaaat acctaccccc gcaccagttt 1200 ctgcctcacg cccccctttc acactctctt cagccacgcc ccttcctcca cctaacaacg 1260 ccctggcaat ggaaccaccc caagtctcca atgcagcacg aaaccagatc ctctcaggtg 1320 ccgacgttga cctcatcact ctcctatcac ccatcacacc ccccacggcg gaacgccagg 1380 tagattgcgg cgaatattct ctcacactca aacagcccca tagctcccat tcacgcatcc 1440 tatcattagc cgaatttaac atagccttct cccgatacac agacatcatc tgctccgttt 1500 tcccccatag gagacgcgag ctaaatgact acatggccat catcgccgag ctcgcgctct 1560 cgtatggggg aacacatttt tatacttacc acaaactatt ctccgctaaa tgcgcaatta 1620 gagtgactca gtggaatcag tgtccttatt ggggggctct ggacactgac ctccacaacc 1680 gagtcttcct aggttgccgc aacctgtcct gcgcggtctg ccgttctaac ctgcacccta 1740 ctacttcctg tcccttcatt attccttctt ccgacactca gacagcagtc aaatcctcca 1800 gctacacccc tcgccccccc aaccatcaca ttccccctct tctctcctcc tcttctttcc 1860 atccccctcc caaccgtgac acctgccaaa actttaacat tggcaggtgt cacaaaaacc 1920 catgcaaata cttacatatt tgctcctatt gtggtggcgc ccacgccaaa atcgtctgct 1980 ctgtttggaa atccctcaat aaaagatcaa aaaattacct gtcgactcca attaatgttc 2040 ctcacctggc tcatgaatta cattctcacc ctgatactaa cttttctgct tttctcattt 2100 caggtctaac tcacggattc caccctggcg tttcagcact tccttcctat aatctcttct 2160 gtcccaacct acaatcagca aacacagaac cagaaactgt cgatttgctg attaaaaaag 2220 aaattgaaaa caaatttatg attgggccat tctctgctcc tccatttacc gtttcacgca 2280 tcagcccaat tggcatcgct acacggaaat tctcagataa aaaaacgtct aatcattgat 2340 ctatctgccc cacataactc cgtcttccct agcatcaata gcaccatcga gccagacgaa 2400 ttctcactcc actatcacga catagatcaa gctatctccc tcatcaaaat agccggacgc 2460 aacgcatggc tcgctaaagt cgacatcaca tccgctttca aaattatgcc tattcacccc 2520 gatttctggc acctatttgg tatccgatgg cgatcaaaat tttacttctc agtccgactt 2580 accttcgggt gcagaagtag ccccaaaatt ttcgatacgc tctcagaagc actatgttgg 2640 attctatcca ataactacgg cgttccgtat ctgattcatc tcctagatga cttcctaatc 2700 atttccccgc cgtcttcacc gccagctaaa cacttagcga tcacccaaaa ggttttcgct 2760 gatctcggta ttcctctcgc tgaagagaaa acttccggac ccagcacttc catcgaattt 2820 ctgggtgtaa aattagactc taacaaattc gaagcatccc tcccccaaga gaaaatcgat 2880 cgcatcatat ctctttctct aatcttctta gagaaacaaa tatgtaccaa gcgcgaactc 2940 ctatctatcc tcgggcacct aaatttcgcc atgcgaatta tcccccaagg ccgcccgttc 3000 atttctcacc tcctccagct cgcagcttca gttaacagtt tagaagaatc cattcttctt 3060 aacaaaccca gtcgcgaaga actcagctta tggatctcat tccttaagca atggaacggc 3120 tgttcctttt tctatagcga cctaatcaca tccccagtcg acatcagtct atttaccgac 3180 gctgccccct cagttggctt tggaggcttt tatcaagggc attggttcgc ctccccatgg 3240 ccacaccaaa tgctcaatat ccctcaaaat caatactcat cagctctctt cgaactatat 3300 cccatagtcg tcgcagccat tctatggggt catcaatggt ctgcctcttg catcctcatc 3360 cactgcgata atgaagccac tgtacagtgc gtaaataaag ggcgttccca ttccccgcct 3420 ctcatgcctc tactaagacg cctagtatgg acatcagcta acaaacaatt tattgtggtt 3480 gctaaacatg taccagggtg caaaaatcaa attgctgact ctctctctcg tttctctttt 3540 cagaaatttc aagaattggc tccagaagcg gacccccatc cgacacctgt ccctccctat 3600 tcagaaatga tcttgccata gatcaccctc tacataacct ccgacacatt tctgtatctc 3660 tcatcctaca aggaatagct cccagaaccc tccagtcata tctcacagcc tggaattcat 3720 tcaaacactt ccattcagta tacaaagcac ccttccccga cttttccctc ctctcaatta 3780 cctccttcat atcccacctt cacatttcaa aaaatatgca atccagctcc atcagaagtt 3840 acctaagcgg gatccagttt ttccataaat taattcacgg atctccctcc gatgccatac 3900 tcaattctca aacttccctc ctcatcaaag gcatacagaa aacccaccct cacccccctg 3960 atcctaggct acccatmaca ctcaacattc tcactaaatg catccacaca ctacgcaaag 4020 gttaccaatc catcaacaca gctcgcaccc tcgatgccat gtttacccta gcattttttg 4080 gcttccttag atgctcagaa atcacaacca cttccaaatt caacccctcc attcacccta 4140 ccatctccga tttgactgtg cttgacaaag aaaccatttc cttcttcgtc aaacagagca 4200 aaacagatca aattcgcaaa ggtcaccata tatacatatt tgacatacca tcccccaccc 4260 accacttcca aaccctccta gcattcctac agtttaggaa gctccaagac cccaaccccc 4320 tgtctccact tttcacagac gactacaacc gcccagtaac ccgattctgg ttccaaaaac 4380 acctcaaaga aatactacgc ctatcaggcc tctcccccga ctcattctca agccattctt 4440 tcagaatcgg ggccgccact acagcagccc acaacgggct ctcccaaagt cagatccaag 4500 cccttggccg ctggtcctca gatgctttca attcctacat tcgattcagc cgaccacatc 4560 tcaaagaagc ccagtcgtct ctcaccagac gactttcacc atcctgaaat tacagcaact 4620 atctgtaacc cccccccccm ccccccccac ctgctttcta gagccctcat cccagccaat 4680 agccccaagc taccagtagg gaaccttcta agtcttcact tccttcttgc gtcgagtttc 4740 tccgcacctc ccttctctcc tttctagcgt tgagttcctc cgcacctcct tctcttcctt 4800 caagcgttga atgcttccgc tcttctctcc atcccctccc tctttgaacc tccccccaac 4860 atccttactc cctaccccct ttccagcgtc gagtttctcc gctacttttc ttttttcagc 4920 gtcgagttgc tccgctacta tccctccttt ctagcttcaa gttcttctgc tactcttatc 4980 tcttcagctt aattaacaca tctaaagccc gactcccccg gagtcaatca attaccccct 5040 caattacgtc cctcaacaat tgacattttt cttcttttct agcgttgagt tcctccgcat 5100 ctcctatcct aatttctata gcgttgagtt tctccgcatc tcctttcctt ccttccagcg 5160 tcgaatgcct ccgctctttc ctccatcccc tccctctctc tccccccccc aacatcttca 5220 ctccctaccc ctttccagcg tcgagtttct ccgcttcttt tcttttttca gcgtcgagtt 5280 gctccgctac tatcacttcc ttctagcgtt gagtattccg ctactctttt cagcctaagc 5340 tttgactccc ccggagtcct gctcagccca cctcctatag gagtctccac ctccccctcc 5400 ctatttagac tcctgtagga gtataatcct tcagctctaa ctcccacgga gtcgccccaa 5460 gagctacgac tcccacggag tcctcccccc ccccctctct cctgccccgg ccaaatacgc 5520 atctttcctg ccttcttgcg ttgagttcct ccgcatctcc tgtcctatat tccctggcgt 5580 tgagttcctc cgcatctcct tttcttcctt ccagcgtaga atgcttccgc tcttttctct 5640 ctcccctctt ataacccctc tcctctcccc cacccttaac tcccaatccc ccttccagcg 5700 ttgagttcct ccgctacttt tctttcttca gcgtcgagtt tctccgctac tttactatat 5760 tcagcgtcga gttcctccgc tactctttct atatttcaaa gcatcactca cccacgctca 5820 gactcccccg gagtccccgc ccagcccacc acctacagga gtctccacct ccccctccct 5880 tcccagactc ctgtagtagt ccaattctac agctctaact cccccggagt tgaccacaga 5940 gctctgattc ccatagattt tcttttcttt acctttccag cgtcgagttc ctccgcatct 6000 catgctctct ccttcccagc gttgagttcc tccgcatcct ctttacttcc ttccagcgtc 6060 gaatgcttcc gctcttctct ccaccccctc tacaacccct ctccccttcc cactccacac 6120 cccttttccc agcgttgagt tcctccgcta cttttcttct ttcagcgttg aattactccg 6180 ctacttccta tttctagcat caaatctctc tgctactctc cttccttccg ttaggcaccc 6240 tccccaaccc tgccgcccca aacccgactc ccacggagtc cccaaccctc ctagaatcag 6300 ttacaacttc tcccacagta acctatctca cttcaactga catttcagca gccggatatg 6360 gcactgacct cttgcttttg gggggcactt caatacgcgg ctgctgtccc gagcggaaaa 6420 catttgcctt tttggggagt tttcgagacc tacctgacct cgaactcccc tctcgccctt 6480 ctaacgggag ggagccccgg gctcgaggat attacgagct cagggctctc tcccgggaca 6540 gcatgccaaa tacgctttat tgattatcag ctaagtgtga actcttgaag tgtggtttat 6600 tcataaacta atttcgagag gatcacgtgc ttataatcaa cacggctggc tccttattag 6660 ctccgtaatc agccccatta gatgattacg gaagcattat aaataacctg agtttatcac 6720 tccagttatc ttcgtcttga agctcccccc tttccacccc ttcatcctcc tcctttcctg 6780 attgggcgac acggtggccc agtggctagc actgttgcct cacagcaaga acaccgccgg 6840 tccaaccttt cgggccggtc ggtgtttctg tgcggagttt gcatgatctc cccgtgttcg 6900 cgtgggtttc ccccgggttc cccggtttcc tcccaccgtc caaaacatga accatagcca 6960 aatcgactaa aacaaattat cacccaatac aacctcagtt cacacttctc acggtgacaa 7020 gcaggggagt tttcgagacc tacctgacct cgaactcccc tctcgccctt ctaacgggag 7080 ggagccccgg gctcgaggat attacgagct cagggctctc tcccgggaca gcatgccaaa 7140 tacgctttat tgattatcag ctaagtgtga actcttgaa 7179 // ID DNA8-9_DR repbase; DNA; ZEB; 1451 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE DNA8-9_DR_DR is a non-autonomous DNA transposon. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW DNA8-9_DR; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1451 RA Kapitonov V.V. and Jurka J.; RT "DNA8-9_DR_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 259-259 (2004). XX DR [1] (Consensus) XX CC DNA8-9_DR_DR is a nonautonomous DNA transposon. Its is CC characterized by 10-bp terminal inverted repeats (one CC mismatch) and 8-bp target site duplications. It is expected CC to be a member of the hAT or P superfamilies. XX SQ Sequence 1451 BP; 437 A; 257 C; 291 G; 463 T; 3 other; tagggatggg aagattaacc gatatgtatc gatacgcggt catgcgcgtg cacgatgcga 60 gtgcatcggt tgagcagcag aggatgaatg aaattttgga agcaaatcga gatgcatcgg 120 tttttgcgag atgcatcggt ttttccaaga tacagcttat atttttaata tagaatgtat 180 tttttattta ttaacaagtg ttgtcaacct gtttttggcg cataatttaa tcgacctaca 240 taaagtaagc cttagcaacg gatttgcgga tgtgtacact tacactacaa ctcagtgtat 300 caggtgtgcg gatgaagcta gtggtgcgcc ctcagaaagc gcgagaagca aaacaaacat 360 tttattcccc actgagtttt aaatctaaag tatggcaagc aacattatgg atttaaggat 420 ggacgacatg acaggacaga tgcaatttgc aaaatgtgcc gcgcatctgt aaaatacgcg 480 ggcagtacga ctaatctgaa atctcacttg aagcggcgcc acggtgttgt tgtgaaagca 540 tcttccagtg ttcctgcatc cccggctttc gcactgcttc aactgtagct accagctcca 600 aaagtggtga gaaaagcatt gcaagttttt ttttccatgc gcaacagttt tgtgcgctct 660 acgccaataa cgratgctat tgcattgttc atctgcaaag atattcagcc tagtgtcacg 720 gagaacgaag gttttaaaca cctcctcctg ttaattttta tttaattata ataataataa 780 taatattaat aataataatg ataaaaataa tgggtgtcac ggtggcacag tgggtagttt 840 gaccacctca cagcaagaag attgctggtt tgttatattt ttattagcct gttgtttaca 900 gtgacagtga tgtttcaaag cagcataaca ctgctagttc acgaccttaa gttttgaata 960 ttcagtggca aaatatatct ttgtaaaact tgttttcata gttgaaaagt taaatcacaa 1020 ttcttgttgt gcaatgctaa acctttatgt tgaaagagtg caaatatata tattttaata 1080 tatattgcac ttatacattc tgtttatctt gaaaatactt aaataatatg tgctatatgt 1140 tctaaaatgg ccctctactg taaactgaca ctctggctct gagagaaaag ccagtttgta 1200 aaaaaagtct tggttttgct tggttaataa ataatcaaac tcattcaatg gcacagcatc 1260 ctctttcaca tmatctcata aacttgatct aattagttag tgctgaawta tcgcattgta 1320 tcgtgatatg ggtgtgaatc gtatcgtgtt gcatcgcaat atgtcacaaa tgtatcgcta 1380 atatatcgga tcgtattctt tgtatcgaga tgcgtatcgg atcggcatta tagcttagat 1440 gcccaaccct a 1451 // ID Gypsy101-LTRb_DR repbase; DNA; ZEB; 585 BP. XX AC . XX DT 12-FEB-2009 (Rel. 14.02, Created) DT 12-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Gypsy101-LTR_DR; Gypsy101-LTRb_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-585 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 558-558 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 585 BP; 112 A; 187 C; 113 G; 168 T; 5 other; tgtcaccgac tcggtcccag tcattcccct cgctggccag cagaggccgc catctccgga 60 cttctaagca ttacatcatc cacttacact gattgtgcat acacacctga actgaatccc 120 gttaatgacc cacgtaccct atataagcca gcactcaaac actcattcag tgcgaagtct 180 tgtttagccc cggccagcat ttctgagcgt tatttcctgc ctgatctctt gtgcattact 240 ccagcctgtt tcttgactct gctttgcctt ctgcctgccc acgacccacg cttgwtacac 300 ggactctgaw cctcgctgcc tgcccwcgac ccacgcctgt tatacggact ctgatgttcg 360 ctacctgccc acgacccacg cctggttcac ggactctgat acttgcagcc tgccccygat 420 ctatgcctgg taaatcactc tgtgtctgtt catcgccagt ctcgcawcca tcattgactg 480 ctgttgatgt gtgtttgcac tttagtgcat attgtgtgtt tgagttagag ctgtgactaa 540 taaatactgc ataatggatc cctccgtgtc agtctcctcg ttaca 585 // ID DNA-2-22_DR repbase; DNA; ZEB; 419 BP. XX AC . XX DT 26-NOV-2008 (Rel. 13.11, Created) DT 26-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-22_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-419 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1713-1713 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 419 BP; 149 A; 57 C; 54 G; 159 T; 0 other; cactgtaata aattcccaga atttcacaac atgttactgt aatttttcac agtaaattac 60 attagtgtca ctttacagga tttaactgtt aaattttacg tattcagaat tttattgtga 120 aattaaggca ggcatcatga ttacagcaaa ttactgtaaa aatggctata tcttttgcac 180 gttagttata ggaattcaaa cactattaat attttattca acatagaaat taatacacat 240 taacttttat gctagtacag gcactttttt tgatatttag aaatatttac agaattattg 300 ttattatgaa gtacctcaca gtaatttact gtgaatttac atacagtacc ttactgtgaa 360 agtaatgcaa ttattagcca gtaatttact gtaaatttaa ggtcaaatat tttacagtg 419 // ID TDR21 repbase; DNA; ZEB; 386 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; MER1; TDR21. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-386 RA Jurka J.; RT "TDR21: Non-autonomous DNA transposon, related to MER1-group."; RL Repbase Reports 2(3), 14-14 (2002). XX DR [1] (Consensus) XX CC Related to MER1-group. XX SQ Sequence 386 BP; 125 A; 76 C; 62 G; 120 T; 3 other; caggggtcac caacatggtg cccgtgggca ccaggtagcc cgcgaggatc acatgagttg 60 cccgcgggcc tgttctaaaa atagctcacc atagcgccac wtaccagtaa gctgagcata 120 caaatctaat atttttacaa tttttacttt actataaaat acttgtcatt tgtctgaaat 180 atacaatata acttcatgtc atattgmata tttatctgtt tcctaccttg ttaaatcatt 240 gttgacaact attgtgagaa tcattaacat gatcagtgtc ttcacataga tgaatatcat 300 taattattaa taataacata taaaagtaaa ttgagcaaat ttacccaara agtagctctc 360 agtttcaaaa aggttggtga cccctg 386 // ID BELPADRE_LTR repbase; DNA; ZEB; 963 BP. XX AC . XX DT 18-OCT-2005 (Rel. 10.1, Created) DT 18-OCT-2005 (Rel. 10.1, Last updated, Version 1) XX DE BEL-type LTR retrotransposon from Danio rerio - a consensus (LTR DE portion). XX KW BEL; LTR Retrotransposon; Transposable Element; Suzu-like; KW Pao/BEL; BELPADRE_LTR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Copeland C.S., Mann V.H., Morales M.E., Kalinna B.H. RA and Brindley P.J.; RT "The Sinbad retrotransposon from the genome of the human blood RT fluke, Schistosoma mansoni, and the distribution of related RT Pao-like elements."; RL BMC Evol Biol 5(1), 20-38 (2005). XX RN [2] RP 1-963 RA Jurka J.; RT "BELPADRE: BEL-type LTR retrotransposon family from zebrafish (a RT consensus sequence)."; RL Repbase Reports 5(10), 296-296 (2005). XX DR [2] (Consensus) XX CC A fragment of this element was described in Ref. 1 (Genbank Acc. CC No. BK005574). This entry reports a full consensus sequence of CC this relatively young family. The closest relative in Repbase is CC CATCH1 from Takifugu rubripes. XX SQ Sequence 963 BP; 319 A; 172 C; 176 G; 296 T; 0 other; tgtcaagtca aactaatata gaaggagaaa aaaaaaatcc atccgccagt cggagggaga 60 ggcggaagga catgcagtag ttcggcattc acgtaaggaa aaaggaaact gaaacctaac 120 cagaacagaa tctccgccaa aacgaaacct ccgccaaagc cgaatctccg ccaaaattga 180 gcaaaagaaa gtgcacaaca gttaaaaaac catcatcaag tcccggagtg gcagccggtg 240 agaactttac tttggttatt gcatgaaaat cgtgactgtc atgtcagtat ttgagggcaa 300 atcaaccgtg aagcagcatc tgagttcaag ttcggcacag aggaaaaagt tgctccaaca 360 gtgcttatct gcccgctgct taattatttc aagatacaaa tagttatttg caaactagaa 420 gatgttataa agcagaaatg tatagttttg gttattttaa tcatcatttg agtgctatct 480 gactgacgtt aagatgttaa ttttcctccg tgattcatat gcatgcatga gcagatcacg 540 tgcacataag ctgattgctg cagtagcaaa attaatcgca tttcatttat atgtcttaaa 600 actaaatatt taagttcagt atattaaact gtactgttga ttgtgcttta acaagtgata 660 attcagtata atacttgaaa atactgtgtg aaaaatcaga caaaattcac aaatgtttgt 720 atatatatgt ttatttgcaa tttaattaca tatcttgtat ttttgtttat tttgaaggtt 780 tttcaccttt caactttcaa ctttcaactt tctactttca actttccaac gttatacttt 840 ttacctgcct ttctgctgca tttatgatgc aataaatgga tgagtggaat ccagttaaaa 900 ctcccttgat gatgtgtctt cctttcaagc gggaaaactt gtgcataaga gaatagcttg 960 aca 963 // ID Gypsy139-LTR_DR repbase; DNA; ZEB; 399 BP. XX AC chr15; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy139-I_DR; KW Gypsy139-LTR_DR; Gypsy139_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-399 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1540-1540 (2008). XX DR Genome; chr15; Positions 31060597 31060995. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 399 BP; 107 A; 127 C; 70 G; 95 T; 0 other; tgtggcgagt gttccaatgc agcgtcgcca caagtacaaa ccaccaacac ctgtcacctc 60 atcagcacca atcgccacct gcagctcatt aagactggag catataaaca gagaaagaga 120 agacacaggg tgagcttgat ctctccgaga agacacgcta atgctccttt ctctgttttc 180 cccagcagat agcagtggct agaccggcac ctcagactgt tttcaaccat ccaaaccttc 240 atcagcactg aagactttcc acattacccg gccaaaggac gcatacctgc acagctaccc 300 taccctcacc tccttacctt gcactgtaaa taaatcaccc tctgggtcgt tacaccattt 360 ctgtttgaat gttcatgttt tatccctcgt ctcgccaca 399 // ID Gypsy128-LTR_DR repbase; DNA; ZEB; 308 BP. XX AC chr23; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy128-I_DR; KW Gypsy128-LTR_DR; Gypsy128_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-308 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1289-1289 (2008). XX DR Genome; chr23; Positions 24363011 24362704. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 308 BP; 67 A; 65 C; 64 G; 112 T; 0 other; tgtgacgagt ctctttgacc aacccactga tcggccgaaa gcgggaagga cagagactca 60 tctgctcaat tgggaacacc tgagttcccc tctataaaag ccagcagtgt tggcttgtga 120 agaagccgtg cttttgctcc tgtttgtttg ttttgtttct tttagctttg gttaatcttt 180 gattttggct tacctaaagt gagctcttta ttattcttta atgtttaaat aaatactttt 240 ggtaccaaac catctggtgt atgtcctccc tttaattttt gtcatcacct ttgagccggg 300 tcgtgaca 308 // ID hAT-N44_DR repbase; DNA; ZEB; 1087 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N44_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1087 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1313-1313 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1087 BP; 308 A; 208 C; 201 G; 370 T; 0 other; cagggccggc ccgtggcata ggcagtatag gcaaatgcta agggcgctgt acatccaggg 60 gggcgccaga aatgagcgcg gttaagttgg ttttgttttc gatattcctg acacacattc 120 agtgatagac ggcaataact caaaaacctc ttaccctaaa aagatctaaa gtgtgtttcc 180 tacaaaaaga caaagctggt tatgtttcac agctgtcttt ttcttttttt ttcgctcgac 240 attacgacca ctgctccgtt taagccctcg caatatgcgt ttagccggga gagctcggct 300 cgcgcggagc ggagctctca gtaagggacc gtcggaaaag tgcttctttt tttttttttt 360 tttttgctcc gtcctgcgtg ttttatttta aacacaacta attttctctt aaatgagcac 420 aaacagttac taaagtagtc gaatgcttca ttcattcatt cattatagat ctgtgcattc 480 ttacattaac acctctgtta tcaaacaaaa cacaatgaga gattcatttg ctgctcttca 540 ctaaataact atagtaactt taatcaatat gcaaatacaa ttaaaagtga aatagatttt 600 atagctttat ttaatttctg tataggccat tgattttaat aggctaaacc ttttatttta 660 ttgtcaatat tttctaatgt ttgctatgta ttctatcatt tgaagctatt tgttgtccca 720 tattttttac atcccaacgt tgacagatat gacataacat tgctaatcaa taaatagcta 780 tagtgctatc tgttagtttt aacgtcagct aatgttatgg aagggcatag atgttatgga 840 aaatagtaat agtaatgtaa ctacccccat cattccttcc tcaagcaaat gtgtaaatat 900 tagatctatt cagcaataat gttaattgca ttggcatatt tatctgacgt tttcccagct 960 tgtagtagtc gatcaaaaag ctatttagtt tcttatgact atacactagc agtggaattt 1020 actgcgatgt gacggggcgc cacctgaaat cttgcctagg gtgccaaatt ggttagggcc 1080 gggcctg 1087 // ID BHIKHARI_LTR repbase; DNA; ZEB; 1714 BP. XX AC AJ011117; XX DT 19-JUN-2000 (Rel. 5.05, Created) DT 19-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE LTR of retrotransposon bhikhari from Danio rerio. XX KW LTR Retrotransposon; Transposable Element; BHIKHARI; BHIKHARI_LTR; KW DRE011117; LTR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Vogel M.A. and Gerster T.; RT "Promoter activity of the zebrafish bhikhari retroelement RT requires an intact activin signaling pathway."; RL Mech. Dev 85(1-2), 133-146 (1999). XX RN [2] RA Vogel M.A.; RT "BHIKHARI_LTR."; RL Direct Submission to Genbank (29-SEP-1998)Vogel A.M., Abteilung RL Zellbiologie, Biozentrum der Universitaet, Klingelbergstr. 70, RL Basel, 4056, SWITZERLAND. XX DR GenBank; AJ011117; Positions 160 1873. XX SQ Sequence 1714 BP; 492 A; 384 C; 330 G; 508 T; 0 other; tgtagtgagt caacggatcg ttccagaacc gaacaagcca cagaccatgt tatccttaaa 60 agattggttg gtgcctacga gtctgagatt gatttcattt tcaaagcaat ccacaggcac 120 ccagcaaaag gcaaatccac aggataccta gccagtcaac atcacagttg cgtgttaatt 180 aactctacac cctaaaacaa taagaggctc tacaatcaaa aacccaccaa accggttttt 240 gagaagtacc ttttggttcg cttgtccttt ggcgtctgca aggaatacac attctcaggc 300 agacacagaa gcgatcagaa acgcttccag agtttaatgg acatttaagt aaaacagttt 360 atttgagtaa tgtacgtttg ctttatgttt ctgtaggttt ttgtgtgtac atgttccatg 420 tgtttaacaa aactgttata gatagttctt gagtaatagc cctcagtgac aaatgaaatc 480 tacagaatgt ttgtgaaata ctttgtttgg tatcgattga aagctaggaa cattttaaat 540 gcattggttt atatgaagaa accatactat gaaaagtgtt gacgttataa gactttattt 600 taaccactat gctatcaaag cagcacccac accaggtgtc aacccagcct cctagagggc 660 agaattgggg tgtgactcca ccccgcacct tatctgattg gacaagtatt gtgtagaccc 720 agcctctacc aaagcctata aaactttgaa caaagaaatt tcatcgtcgt cttttgccgg 780 tcatctttgc cagtcgtcac cgccgtcgct gcccggacgt cgctcttctc gtgctgtggc 840 cgttacatcg cctcgccgga agcctcgtta catcacggag cagaccgcga ccttcacttg 900 ctgccggctc gacccccgat ctgcttgtgc aaccgcttca acaacaagtc atcgggtcaa 960 cacatccaaa ctctgaaact catcctaact acaggaaccc aggaaggact tcgaacgccg 1020 ccttgtccaa catccgtatc cctagcaacc gactaaggaa acccctcttc accgacaagg 1080 gaattccatc acgtcactga agttgctacg ctgaccaatc agcttcgccg ggatttccct 1140 ttggaccagt cgaggaaacc aaattacatc agaacgcaag tagcacaaac aaggtttcta 1200 tccattgctg actctggtgt gaattatgtt taagtagtaa agaatagcga aattctctgc 1260 ttttatggtt tctctcaggt tgcaatgcta agaacttagc aaaggctaga agttaaatcc 1320 actcagtcaa aactctcctc aaactgcttg tgcgtgtgtg tatgtatgtg cgtttgtttg 1380 ttttacgtag attagtactt tgtgttgtag agtagcaata aacttttgtt tcattttaag 1440 atccgtgttg gtgtgttgtg tgctttataa gttaatgcct cagcttcaga tactgctacc 1500 ttgctcatta aataattaga tactgttttt atattgttat tgttggccac gtaataatat 1560 aatacagaat tgctttacac taaacgttct atttgctgga caaatactta agttaagtgt 1620 aagctttaaa taattctatg tgaatcatgt tcgaatcttt gattcgaatc cccaaagttt 1680 aaacatgatt taaaccaaag agctgattct taca 1714 // ID hAT-N58B_DR repbase; DNA; ZEB; 791 BP. XX AC . XX DT 26-FEB-2009 (Rel. 14.02, Created) DT 26-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE Nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N58B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-791 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(2), 544-544 (2009). XX DR [1] (Consensus) XX SQ Sequence 791 BP; 288 A; 148 C; 147 G; 203 T; 5 other; tagggctgtg yrattaatcg aaatcgaatc gcaatttgaa acgttgcgat tagttaaatc 60 gcaagaggct gcaatataaa atatwtatat gtatatatgt aaacaaaaat aacgaataac 120 atctgaggtg aagtgcttca aaaccagtcc gctatgcttc agaaaattaa acatgctaga 180 ctttctgcga tggtgtcgtg aggcatcgca gacatggtat yggttgtcgt gaactatgcc 240 acattacacg agaagaaacg attgaatctt gtgtcacgac gtccatttgt trtttacgac 300 tccccacgac accctacgtc aaggaaatcg tgccgaaatc gtgccgaaat cgtgtgatct 360 gaccggggct ttataactag ccaattgagt gagcacactt tcgttctgcc caatcagaat 420 tgcgcaactg aactacgccc acaataaaaa aaagaaaaca aacaggaaag cgctgacaag 480 tgggattatg gcatctgctg cttcagaagc attaatagac aaattaatat caaagaaaaa 540 cagcatttgt aatatgggaa tattttggtt tcaaagtcac agacaccaaa caaaaacagg 600 tcatttttaa gagctgtcgc agaattgttg ccacggctta cagattattt gagctgaagc 660 ttgaattaaa ttaaaatcag cttgaagctt gaattaaatc gtaaatcaaa tcgcaatatc 720 tgtcaaaaat aaataaataa aaaaaataaa aaaatcacaa ttagatattt tccccaaatc 780 gcacagccct a 791 // ID Gypsy85-I_DR repbase; DNA; ZEB; 6685 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy85-I_DR; KW Gypsy85-LTR_DR; Gypsy85_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6685 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1481-1481 (2008). XX DR Genome; chr1; Positions 25819230 25812546. XX CC Positions [4912-5388] - Integrase core CC 'TATA' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 278..1618 FT /product="Gypsy85-I_DR_2p" FT /translation="MLDDQEDKLTSEISNIAAGRGRGALLTTPDKGFWFMG FT PEYTSTGRGRVLRPSFETRPDVSCSPIIRSEKQEGTKNESKISSDMSDLIR FT QIGSEIGVAIRDSLLQNRDSGLSPGNSSREMPDSSKTDMPNATIIDATKLN FT LVLKSEITAPPYYRGDGTDKCSIIEWEELMRVYINKREIVDTERVEEVTNR FT QMGRARDIIKVWLSNKTITPTVDSVFAILRHHFSDCSSSGMPLADFYSVKP FT YSNESPLDYWIRLNKAAEVAEHCLREEGRTLGNRSRELAVMFIRHCPDKEL FT SLVFKSKSAQSWTASEIQERLDEMLREQKATGRVTCQHTAKILEPADNLTP FT LSPQGAESAAPPTDNGALDKVLSMLEKALVCNAQSVRGGWSKNSNRQSRVC FT RVCASKDHSTTAHCKMDNLCFKCFSSGHRSFNCEKTVHGEGTSGRVDEELQ FT GN" FT CDS 1825..6336 FT /product="Gypsy85-I_DR_1p" FT /translation="MLDSGSMATTLSANVVPQLIEAGILSQELVPTSDIVL FT VGCGGKQTSPEGMCDLKLEVYGISFSVPVLVVSGQTDQLIIGTNVLKPLIR FT EMKSNEGFWRVLDKPDQLSQNENRQFLQMLSSIERWRGGSIPDRVGTLKTK FT STVILQPMSEHLVWGRLPPNQKFSVGSTVVIEPSMSRCVHRNILIGRVISP FT LWGDGWLPVKMINPTNSEIVLRKNAKVADVYPCIAMEDFDQVQTVFQNVAA FT CSDSSCGSLSAKSTLPESKVVNDSRLNALGLDGLSVADCEVSSFWKNKLIE FT LIKKYDDVFSRHSLDCGEARGFCHRIRLVDDRPFRLPHRRLSPAHYHKLRK FT TLDEMEQKEIIRKSSSEFASPVVLVWKKNGDLRLCTDFRWLNARTVKDAHP FT LPHQADVLAALGGNALFSTMDLTSGYYNIPLHEDDKKFTAFCSPLGLHEYN FT RLPQGLCNSPATFMRMMLTIFGDQNFMSLLCYLDDLLVFGKTEEESLQRLE FT MVFQRLRDHNLKLSPSKCSFLRRSVKFLGHIVSQAGVASDPIKVEAIVNVA FT EKDLMEVDGITPSVSKIRSFLGMVIYYQHFIENCSVIAKPLFQLLTGQKKP FT RKGKGTGRKLVTVRKLTSDDWTDECREAVEELKDSLVKQVLLAHPDFSKPF FT ILSVDASTRGLGAVLSQVQEGHTVARPIVFASKALNHAQSKYPAHRLEFLA FT MKWAIYDKFSHWLRGHKFTVWTDNNPLKYILTKPKLDACEQRWIAKLAPFE FT FDIQYIPGTKNVVADALSREPFAVPRILHRLTRTPYDVLKHEAETLEVGQV FT QDTFHLSCEMTKGKICEQLSESQQHHEVTNLVQSVVGGKVSCGEVSAILGS FT HSHWEEGAAVRAVTYVQQLRCLENATQSPLAKLTHADLLDKQCQDPVISRV FT RFFVERGKQPSRKERGSESRETLQTLRQWAKFTTKLGILYRVCKNPTTRTR FT TYQYVVPAVLRGLVLNGVHDDAGHQGQHRTLWLARQRFYWNSMEKDVKMYV FT AQCKRCILSKTPEPEARAPLVSITTTAPLELVCIDFWSAEDVNNKSVNVLV FT VTDHFTKLSCAYVCPNQIAKTVARVLWNNFFCVYGFPARIHSDRGANFESS FT LISEMLKLSGVEKSHTTPYHPMGNGQAERFNRTLGMMIRSLPPRSKAKWPQ FT LLNSLTFSYNCTRHETTGYPPFFLMFGRTPRLPVDVLFESVLLDGGAVDVS FT KYVQSLGRDLREAMSLAQQNARQQQGKQAEHYNLRCKGHSLQPGDRVLLAN FT KGERGKRKLADHWESTVYIVGSKNCSLNTYGIRHPVTGRVKIVHRNLLMPV FT IFLPLPAWGDSASPEGNFSSHSVSSQSDVQGCEIADRTSQWIASMDEVVMS FT SDDELTNDAIDNRYLGICDHDMGTQAEHVEGLECLSFSEVVSDCSFRPPDI FT RSEHSCVAEPQNVAPISRSAIPTVNDNCEAVTVGEEVLVGSPHHTADFCSL FT AGQASVQQGTRTRFGRLIKPVSRLLHTMSTQRVCPVIDV" XX SQ Sequence 6685 BP; 1890 A; 1244 C; 1712 G; 1839 T; 0 other; gattggtgcc gtgactcgtc gtttggtttg ctgattgctg accaccggag gtgctgcaag 60 agtgaggact gtcgcaggag ctgggtggtg ccgccttcgt taatctgcat ggccggagcg 120 attcctcagt gactggtttg aaaaaaaaaa gaaaaaaaag aaaaaaaaga aagacttttt 180 tttctctttg cttaagttac atggacattt aattcaaagg acattttaaa tttgagtcta 240 tttgttttgc gaagtgtaag ataaaagggg tttcaatatg ttggatgacc aggaggataa 300 gttaacatcc gaaatcagca atattgctgc tgggagaggg aggggtgctc tactaaccac 360 tcctgacaaa ggtttttggt ttatgggacc agaatataca agcactggga ggggcagagt 420 gctaagacca tcatttgaaa ccagaccaga tgttagttgt agcccgataa ttcgatcaga 480 gaaacaagag ggtaccaaaa atgaatcgaa aatttccagt gacatgagtg acctgataag 540 acaaattggc tctgaaattg gcgtagccat cagagacagt ctgttacaga atagagattc 600 aggtttgtca cctggtaata gttctcgaga aatgcctgat tcttccaaaa ctgacatgcc 660 gaatgctaca attatcgatg ccaccaaact aaatttggta ctgaaatctg agataactgc 720 tccaccttat tatcgtggtg atggtacaga taagtgctct ataatagagt gggaagaact 780 gatgagggtt tacattaaca aaagagaaat agtggatact gaaagggtcg aagaagtgac 840 gaacagacaa atggggaggg ctagggacat cataaaggtc tggctgagta acaagacgat 900 tactccaact gtagattcag ttttcgcaat attaagacac catttcagtg attgcagtag 960 ctcagggatg cctttagccg atttttactc tgtgaaacct tactccaatg aaagcccatt 1020 ggattattgg ataaggttga acaaagcggc tgaggtggct gaacattgcc ttagagagga 1080 gggcagaact ttaggtaatc gaagtagaga attggctgta atgtttattc gacactgccc 1140 agacaaggaa ctttccctag tgtttaaaag caaatctgca cagtcatgga ctgcttctga 1200 gattcaggag agactagatg agatgctgag ggaacagaaa gcaacagggc gagttacatg 1260 tcaacatact gccaaaattc ttgaacctgc tgataatttg acaccgctgt ctccccaggg 1320 agcagaatcg gctgcaccgc caacagataa cggtgctctt gacaaagtct tgtccatgtt 1380 ggaaaaggct ctggtttgca atgcacagtc agtacgtggt ggttggtcta aaaattctaa 1440 tagacagagc cgtgtatgca gagtttgtgc cagtaaagac cacagtacta ctgcccattg 1500 caaaatggac aatctttgct tcaagtgctt ttcttctgga cacaggagct tcaattgtga 1560 aaaaactgtg cacggggaag gcactagcgg gcgagtagat gaagaacttc agggaaacta 1620 gagagcctcc gttgtgggga gggcaatgtg aggctaaatg aaagtccctc cttaaactct 1680 gatattgaat cagtatattc attttgttgt gagaatgtag gcagagacaa aactgtgatt 1740 tttcaaaata ctgttcaatt gctgcagaat gacagtctgt tttacacaag tgttttggta 1800 caggacaaag ttcatctgag ggggatgctg gatagtggct ccatggcaac gactttgagt 1860 gccaatgtgg tacctcaact aatagaagct gggatcttga gtcaggagtt agttcccacg 1920 tcagacattg ttctggtagg atgtggagga aaacagacta gtcctgaagg tatgtgtgat 1980 ctgaagcttg aagtgtatgg gatcagtttt agtgtccctg ttcttgttgt aagcggacag 2040 acagaccagc tgatcattgg tacgaacgta ctgaaaccat tgatcagaga aatgaaatcg 2100 aatgaaggat tctggcgtgt tttggataaa ccagatcagt taagtcagaa tgaaaaccgg 2160 caatttttac agatgctgtc aagtattgag cgatggagag ggggctcaat ccctgacagg 2220 gtgggcactc tgaagaccaa gtctacagta attttacaac cgatgagtga acaccttgtc 2280 tggggccggt tgcctccgaa tcagaagttt tcagtgggta gcacagtggt tattgaaccg 2340 agtatgtctc gctgtgtcca cagaaacatt ctcatcggta gagtaatctc acctctgtgg 2400 ggtgacggtt ggctaccggt gaaaatgata aatccgacaa attcagaaat tgtattgcgc 2460 aaaaatgcca aagtggctga cgtttaccca tgcattgcta tggaagattt cgatcaggtg 2520 caaacagttt tccagaatgt ggccgcatgc agcgacagtt catgtggtag cctgtctgcg 2580 aaaagcacat tacctgaatc taaggtagtc aatgactcgc gtttgaatgc tctgggtctc 2640 gatggattaa gtgttgctga ttgtgaggtg tcctcatttt ggaaaaacaa actcattgaa 2700 ttaatcaaga aatatgatga tgttttctct agacattcgt tggattgcgg cgaggcgagg 2760 gggttttgcc atcgaattcg gctggtcgat gacagaccct ttcgtttacc gcatcgcaga 2820 ctctcaccag cgcattatca caaactgagg aagacactag atgagatgga acagaaagag 2880 ataatcagga agtcatcaag tgagtttgcg tcaccagtgg tattggtctg gaagaaaaat 2940 ggagacttgc ggttgtgcac tgactttcgg tggttgaatg ctcgcacggt gaaggacgct 3000 caccctctgc ctcatcaagc ggatgttctt gcagcactgg gggggaatgc gctgtttagt 3060 accatggact taacatctgg gtactataac attcctttac atgaggacga taaaaagttt 3120 acagcttttt gttctcccct tggactgcac gaatataatc gattaccgca aggcctttgc 3180 aacagcccgg caaccttcat gagaatgatg cttactattt tcggtgacca aaatttcatg 3240 agtctgctat gctatctaga tgacttgttg gttttcggta agactgagga agaaagtttg 3300 cagagactag aaatggtgtt tcagcgtctc agggaccata atctaaagct ttcgccgtcc 3360 aagtgtagtt tcctgcgtag gtcagtaaaa tttttaggcc acattgtttc acaggctgga 3420 gtggcaagtg atccgatcaa ggtagaggct attgtgaatg tggcagagaa ggacttaatg 3480 gaggttgatg gtattacgcc ttctgtgagt aaaattaggt cattcttggg aatggttata 3540 tattatcaac atttcattga gaactgctct gtgatcgcaa agcctctttt ccagttgtta 3600 acgggtcaga agaaacctag gaaggggaaa ggaacaggta gaaagttagt tacagttcga 3660 aaactcacct ctgatgactg gacagatgaa tgcagagaag cggttgaaga gctaaaggat 3720 tccttggtta aacaggtttt gctggctcat cctgacttct caaaaccgtt tatactttct 3780 gttgatgcat cgacgagggg acttggagca gtgctctccc aagttcagga ggggcatact 3840 gttgcaaggc caatcgtttt tgcttcgaag gcactcaatc atgcgcaatc caagtaccct 3900 gctcataggt tagagttttt ggcaatgaaa tgggccattt acgataaatt cagtcattgg 3960 ctccgtggac acaaatttac agtatggacg gacaataatc cgcttaagta tattttgacc 4020 aaaccgaagc ttgatgcctg tgagcagaga tggattgcga agttggctcc ttttgaattc 4080 gacatacaat acatacctgg aacgaaaaat gtggtggctg atgccttgag tcgagagcct 4140 tttgctgtac ctaggatttt gcacagactt accaggacac cgtatgatgt tttgaagcat 4200 gaagcagaga cacttgaagt agggcaggtg caagatacgt ttcatttgtc ctgtgaaatg 4260 acaaagggaa agatttgtga gcaattgtct gagtcgcagc aacatcatga agtgactaac 4320 ctggtacaaa gtgtggtagg aggcaaagtg tcctgtgggg aggtgtcagc tattttgggt 4380 tcgcatagtc actgggagga aggtgcagct gttagggcag tcacctatgt acagcagtta 4440 aggtgcttag aaaatgctac acagagccct cttgctaaac ttactcacgc tgatctgctt 4500 gataaacaat gtcaagaccc agtgataagt agagtaaggt ttttcgttga acgtggcaaa 4560 cagccttcta ggaaagaacg aggtagtgag tcgagggaaa ctctgcaaac actcaggcag 4620 tgggccaaat ttacaactaa gctaggcata ctgtatcgtg tgtgtaagaa tccgacaacc 4680 agaacaagga cgtaccaata tgttgtacca gcagtcttaa ggggactggt gttgaatggg 4740 gtccacgacg acgctgggca ccagggtcaa caccgtactc tttggctagc gaggcagcga 4800 ttttactgga actcaatgga gaaagatgtg aagatgtatg tcgcacaatg taaaagatgt 4860 atactgagta aaacacctga gccggaggcc agagcgccac ttgtatcaat aacaacgaca 4920 gcgccacttg agctggtttg tattgatttc tggtcagccg aggatgtcaa taacaaatca 4980 gtcaatgtgt tggtagtgac tgaccacttt acgaagcttt cttgcgcgta tgtctgtcca 5040 aaccaaatcg ccaagaccgt tgctcgtgtt ctttggaata attttttctg tgtatatggt 5100 ttcccagcga ggatccactc tgataggggt gcaaattttg agagttccct aatttcagaa 5160 atgctgaaac tatcgggcgt ggagaaatcg cacacgacac catatcatcc catgggtaac 5220 ggtcaggctg aaaggtttaa ccgtactctt gggatgatga taagatctct acccccgagg 5280 tcaaaagcaa aatggccgca attgctgaat tctctgacat tttcatacaa ttgcacgagg 5340 catgagacaa ctggctaccc gcctttcttt ttgatgtttg ggcgcacgcc gaggttgcca 5400 gtggacgttc tgtttgagag tgttcttttg gatgggggtg ctgttgatgt gagtaaatat 5460 gtccagtctc tgggacgaga tctgagggag gcaatgtcgc ttgcacaaca aaatgcacgt 5520 caacagcaag gcaaacaggc tgagcattat aatctgagat gcaagggtca ttctttacag 5580 cctggcgata gagttcttct ggcgaacaag ggagagagag ggaaacggaa attagcggac 5640 cattgggaaa gtacagtgta catcgtagga agcaaaaatt gttctttgaa cacttatggg 5700 atccgtcatc cagttactgg ccgtgtcaaa attgtacata ggaacctatt gatgccggta 5760 atttttttgc ctctacctgc ttggggtgat tctgcatctc cagaaggtaa ttttagttca 5820 cattctgttt catcacagtc tgatgttcaa ggatgtgaaa ttgctgacag aacgtcccaa 5880 tggattgcta gcatggatga ggttgtaatg tcgagtgatg atgagttgac aaatgatgca 5940 attgacaatc ggtaccttgg catctgtgat catgacatgg ggacacaagc tgaacatgtt 6000 gaggggttgg aatgtctgag cttttctgaa gtagtgtctg actgttcttt cagacctccg 6060 gacattagaa gtgaacactc ctgtgtagct gagcctcaaa atgttgctcc gatttctcgc 6120 agtgccatac ccactgttaa tgataattgt gaagctgtga cagtagggga ggaggttcta 6180 gtgggttcac ctcaccatac ggcagatttt tgttctttag caggacaggc cagtgtgcaa 6240 caaggtacac gcaccaggtt tggtaggctt attaaacctg ttagcagact tttacataca 6300 atgtccacac aaagagtttg ccctgttatt gatgtgtgaa tttctatttc caaagttttc 6360 attgttgatt tctttttgtt tgatagattg catacccgtc aatttggagg gatggacgtg 6420 tttctggtgg tcctagataa agttgtttat taaggaaaat gttatagtga ccatctagta 6480 gtgtgtcatg attaaaatcg tgtatatggc atgatttttc agacgataga ctgaggatta 6540 gctacccttg agttatctga atctctaacg agatagcttt atagctgaat tacctttatt 6600 tggttgtgtg acatgttaac actattcttg aacattaagt gtctatatgc tgtgcaataa 6660 tctagtgcag aacagtgggg gtgga 6685 // ID Gypsy118-LTR_DR repbase; DNA; ZEB; 1033 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy118-I_DR; KW Gypsy118-LTR_DR; Gypsy118_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1033 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1524-1524 (2008). XX DR Genome; chr20; Positions 25868179 25869211. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1033 BP; 214 A; 175 C; 287 G; 357 T; 0 other; tgtaagcggt gcttcccaaa ccaggatgat gcgtactggc gtcatttgcg tctatgacgt 60 ttccaggaga tgaacagctg agttcaattt gtaatgagtc caagccttta aaagcgattt 120 gtagtttctg ggtcaggagg ggtttttctt ggtttttgat tgcaggtgct gctgagccgc 180 tggccaaggc tatcgtgccg ggacgtgggc atgatgttgt gcgcgagcga atgactggga 240 gggatagttg gtagtgctgg ggtttccccg gactaaagtt tcggcggccg ctgttgactt 300 tccaatcgct gtatcccacg tccggtagcg ggtaagcggg ccagcagcac agttatactg 360 ttgcggggtc attgtgtgtg cattgcattt tgttcgccgg gagtttgagt gagtttgatt 420 gctcccggcg cttgtactct gtctgtctgt ctgtgtgtgt gtgcgtgcgt gcgtgcgtgc 480 gtgcgtgtgt gtgtgcgtgc gtgtgacggg ggatgagcgc aggctagcat tacgctccgt 540 aaaggaaatg ttaaatgacg acggcttgat atttagctgg agtggcattg atggtagctt 600 atgatttgat tgctgtgtgg agtctattta tgggtgttat ttatacattt atatttatga 660 gatttactgt gtggtgtatt acaactctgg gttgatgtga aatgatgatg tgttgcacaa 720 ttacggttga taccccttgt taaatgcctt gtggtggtca ttttaattgc cttacctaat 780 aatttttatg cagtccacta acaatcttta catcatgaac tttatgtcct tttgattccc 840 tacctgctgc tccccttgtg tggaaattta atctttagga ttattgtata aaataaaagt 900 gaaattagta tttaaatgaa ttgagtgttt gacgtttact tgattattgt ggcccaaaat 960 aaatactgta ttttgtccag caaagaactt gtgttgtgtg tgaatacttt attccctttt 1020 tgggagggtt aca 1033 // ID Gypsy-17-I_DR repbase; DNA; ZEB; 6546 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-17_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-17-I_DR; Gypsy-17-LTR_DR; Gypsy-17_DR; KW endogenous retrovirus; gag; integrase; protease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6546 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-17_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 7-7 (2005). XX DR [1] (Consensus) XX CC Gypsy-17-I_DR is an internal portion of the Gypsy-17_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy-17-LTR_DR. The consensus sequence was reconstructed CC based on multiple alignment of two proviral copies (they are CC less than 1% divergent from the consensus sequence). CC Gypsy-17_DR retrotransposons are characterized by 4-bp CC target-site duplications. The internal portion contains two CC ORFs encoding the 572-aa Gypsy-17_DR1p gag (pos. 85-1799) and CC 1578-aa Gypsy-17_DR2p pol proteins (pos. 1794-6527) composed CC of the protease, reverse transcriptase, and integrase domains. CC The second protein, including the protease domain, does not CC start from Met. Presumably, the gag-pol fusion protein is CC formed originally due to a ribosomal frame shift. This family CC is likely still active in the genome. One proviral copy is CC flanked by identical LTRs. XX FH Key Location/Qualifiers FT CDS 85..1800 FT /product="Gypsy-17-I_DR1p" FT /translation="MEIVHAENICIRNAVIISGLTHTERDDEVLKHLSDYG FT SIERLIRIDEPKTEFHGQIIVEFKNDSAMQLLEQSLPTAFQSPTSSDVTYT FT IKSLTSVYTPAASSSATHTFIEGLREISKVTGKPLEELLQDELAKLTASAV FT SPPQTESEFLTTESDPEGSQKRVVEPTQTTMVSAHGDVTPSNESVIFQTKS FT NNSLPQISASPSRLNADMGTPLKLTVSDVTPAEVQRLVVEHIVRSDTPSHS FT LTTMRLRPFSGKPSYSANEIDYDTWRTNIEFFCTDSTLTDAQRSQRILDSL FT LPPAADVVKHLGPHSPPSDYLELLESAFGTVEDGDELFAKFMSTFQDAGEK FT PSQFLHRLQKVLSTAIKRGGVSAADRDRHLLKQFCRGCWDNALITELQLEQ FT KRKNPPAFSDLLLLLRIEEDKQSIKAVRMNQHLGAAKHSATPPRRVVTNLH FT SISAACSAVKHDEVEDLKRQVVELQNQIASMKPFKKCKEFKPKEPSVPSKS FT AKTFSPKPSKGTNPQPNKSAKPRPWYCFNCGEDGHIASRCETSPNPSLVGA FT KNRQLKEKQLQWEVCDGVPDPNDLN" FT CDS 1794..6527 FT /product="Gypsy-17-I_DR2p" FT /translation="FKLTSVPFVGQRGTGESSQSPTDNLAVSVSDDSPTPK FT HKCCLKLPSGLIGTKCTARVLIADKEINCLLDTGSQVTTFPLSFYQDVFAN FT QPIQPLHHLLEIEGANGCQVPYLGYIETSITFPKEFVSSDIEVPTLALIVP FT DTRPNAQVLIGTNTLNSLYSEYISSKPLKHHPVPQGYQAVMQVLEFVHRQG FT AEGNLGWVNLNCRVPESIPAGKTVVLEGSVRMSTPVTDRWVVVEAPRASSL FT PGGIMVSSCLLSLTAGGKYLPIVLKNETEHDVVLPPKIRLAEVNSIQCVMP FT NGQNNVLTSSVNLTKNSEDSKIHFNFDNSPLTSEWRERVTRKLNSMHEVFA FT CHDLDFGHTTKTKHHIRLHDETPFKHKARPIHPKDIQAVRKHLQELLDAGI FT IRESESPFSSPIVVVRKKNGEVRLCVDYRKLNLQTIKDAYALPNLEETFSA FT LTGSRWFSVLDLKSGYYQIEVEEIDKPKTAFVCPLGFWEFNRMPQGVTNAP FT STFQRLMERCMGDINLKEVLVFIDDLIVFSATLEEHEERLLRVLHRLKDYG FT LKLSPEKCTFFQTSVRYLGHIVSPSGVETDPDKIKALKTWPSPTNLKELRS FT FLGFAGYYRRFIKDFSKIVKPLNHLTSGYPPLHKSKKTQEIKGHYLNPREP FT FKQRWTSNCQHAFEEIIDKLTSAPILGFANPKLPYILHTDASTTGLGAALY FT QEQEGKMRAIAFASRGLSFSESRYPAHKLEFLALKWAVTEKFHDYLYGSQF FT TVITDSNPLTYILTTAKLDAASYRWLSALSTYSFSLKYRAGKLNLDADGLS FT RRPHEVAMDVVSRKEQERIDKFLTLHLENLGETSLTQDEVEAICDKHIISS FT MPEDVVESASDRTVLVHSLAMSSNAVPNSYEEEELGVSLIPRLSVQDLIEK FT QGADSTISQIISHLNSGEKPSPTVRGELPELSLMMREWNRFVLLDGVLYRK FT RQNGEVLTHQLVLPKEFRATVLRSLHDEMGHMGIDRTLDLARSRFYWPKMA FT QEVEQKIKTCPRCVLHKAPPEKAAPLVNIRTTRPLELLCMDFLSLEPDRRN FT FKDILVITDHFTKYAVAVPTVNQKARTVAQALWDNFIVHYGFPERLHSDQG FT RDFESHTIKELCSISGIKKGRTTPYHPRGNPVERFNRTLLNMLGSMNDEQK FT AHWRDFVKPLVHAYNCTKSEVTGFTPYELMFGRQPRLPIDLAFGLPTTSKR FT LSHSQYVSKLKKHLEESYQIATRNALKNAERNKIRFDKHVVDSTLEVGDRV FT LVKQVRLRGKHKLADKWEPSAYIVVRRVHDLPVYTVRPEGDEGPLRTLHRD FT LLLPCGFLTLPGEKVSNPPSSTSKPRTRQQVSGEDASENGVNDTAESMEDE FT VPEYWIRIPVTNESHENALGTLTSTFDPPVGCDPQLPFVALGDESHVEMDS FT AGLEACLDDSHQEKQIKQITTLQGESSEMSESGKSSEGELSEEMPQCPDDG FT IERSVEEELEEEIRKPLSNAPDNISSNIEQHETSQEPENPMRRSQRRKEKP FT DRLQYSELGNPLVIVAQALFHGLTTAFTNSLNGVDFVETSSPSTSDKAVTC FT QPVRVNATGRA" XX SQ Sequence 6546 BP; 1944 A; 1410 C; 1436 G; 1756 T; 0 other; agaaaaactt ggcgagccag ccaggagcct gttgtacaga tagtacaaat caaatttaat 60 cagggtgttt gagataattt caaaatggaa atagtgcacg ctgaaaacat ttgtattcgc 120 aatgctgtga taatcagtgg gttaactcac actgagagag atgatgaagt gttaaaacac 180 ttgtcagatt atggatcaat tgagcgatta atccgtattg atgaaccaaa gacagaattt 240 catgggcaaa tcattgttga atttaagaat gactcagcga tgcagttatt agaacaatct 300 cttccaactg cctttcagag ccctacctca tctgatgtta cttatactat taagtcactt 360 accagtgtat atacaccagc tgctagcagc agtgcaactc acactttcat tgaagggcta 420 cgagaaattt caaaagtaac tggcaagcca ttggaagagc ttttacagga cgagctggct 480 aaactcacag cctctgctgt gtctccccca cagactgaat cagagtttct gacaactgag 540 tctgaccctg aaggctctca gaaaagagtg gtagagccaa cgcagactac tatggtatct 600 gcacatggtg atgtcacacc ctcaaatgag tcagtcattt tccaaacaaa gtcaaataat 660 tctttgcccc agattagtgc ttcaccttcc aggctcaatg ctgacatggg cacaccactg 720 aaattaacag tgagtgatgt gactccagct gaagtgcagc gattggttgt tgagcatatt 780 gtgagaagtg atactccttc acattccctt actactatgc gcttgagacc tttctctgga 840 aagccctcat actccgctaa tgaaatagac tatgatactt ggcgaacaaa cattgaattc 900 ttttgcactg actccactct cacggatgca caacgctcac aacgtatttt agacagtctc 960 ctcccacctg cagctgatgt tgtgaaacac ttaggtcctc attctcctcc aagtgattat 1020 cttgagctcc tagagtcagc ctttggtact gtggaggacg gagatgagct ctttgcaaag 1080 tttatgagca catttcaaga tgctggggag aaaccatcac aattcttgca taggctgcaa 1140 aaagttttga gcacagccat caagagaggt ggtgtttctg cagctgaccg tgatcggcac 1200 ctgcttaaac agttttgccg ggggtgctgg gacaatgctt taataactga gctccagcta 1260 gagcagaaaa gaaaaaatcc ccctgcattc tccgatttac tcctgctgtt gcgaatagaa 1320 gaagataagc aatctataaa agcagtgcga atgaaccaac acttaggcgc tgctaagcat 1380 tctgctactc ccccaaggcg tgttgtgaca aacctccact ctattagtgc tgcgtgttct 1440 gctgtaaagc atgatgaagt ggaagactta aagcggcaag tagttgagct acaaaatcaa 1500 atagcctcaa tgaaaccatt taaaaaatgc aaagagttca aaccaaagga gcccagtgta 1560 ccatcaaagt cagccaaaac attctctccc aagccaagca aagggacaaa tccccaacct 1620 aacaagtcag ctaagccacg accctggtat tgttttaatt gtggggagga tggccatata 1680 gcttcccgtt gtgagactag tcccaatcca tccttagttg gtgctaagaa ccgccagctg 1740 aaggagaagc agttacagtg ggaagtatgt gacggagttc cagatcccaa tgatttaaac 1800 taacttcagt tccttttgtg ggacaaagag gaactggaga aagcagtcaa agtcccactg 1860 acaaccttgc agtctcagtc tctgatgata gtcccactcc taaacacaaa tgctgtctaa 1920 aacttccttc tggtcttatt ggaaccaaat gcactgcaag agtgttgatt gctgataaag 1980 agattaattg tcttctcgac actggttcac aagtaaccac ttttcctttg tctttctatc 2040 aagatgtatt tgcaaatcaa cccatacaac ccttgcacca cttgcttgaa atagaaggtg 2100 cgaatggctg tcaagttcct tacttgggtt atattgaaac atcaataacc tttccaaagg 2160 agtttgtgag ttcagatatt gaagtgccaa cccttgcctt aattgtccca gataccagac 2220 ctaatgcgca agtactcata ggcacaaaca ctctaaattc cctgtactct gagtacatat 2280 cttccaagcc tctgaaacac catcctgttc cccaaggtta ccaagcagtt atgcaggttc 2340 ttgaatttgt tcacaggcag ggtgctgaag gaaatcttgg atgggtgaac ctaaactgtc 2400 gagtccctga atctattcct gctgggaaga cagtagttct tgaaggttca gtccgaatgt 2460 ccactccagt tactgacaga tgggtggttg tggaagcccc acgtgcctca tctctgcctg 2520 gtgggataat ggtgtcaagt tgtttacttt cccttactgc tggaggaaaa tacttgccta 2580 tcgtgctgaa gaatgagacc gagcacgatg ttgtccttcc accaaagatt cggctggctg 2640 aagtaaattc tattcagtgt gtcatgccaa atggtcagaa taatgtgttg acatcttctg 2700 ttaaccttac aaagaactct gaagactcca aaatacattt caactttgac aactcaccgt 2760 tgacttctga atggagggaa agagtgacaa ggaaactcaa ttcaatgcat gaagtttttg 2820 catgccatga tctcgacttt ggtcacacca ccaagaccaa acaccatatc cgattacatg 2880 atgagacgcc atttaaacac aaagccagac ccattcaccc taaagatatt caagctgtcc 2940 gaaaacatct gcaggaactg ctggatgctg ggattattag ggagtcagag tctccatttt 3000 cctccccaat cgttgttgtg agaaagaaga atggggaggt aaggctttgt gttgattatc 3060 gaaagctaaa cctccaaact attaaagacg cgtatgctct acctaacttg gaagagacgt 3120 tttctgcctt gactgggtct aggtggttct cagtgttaga cctgaagtcc gggtattatc 3180 aaattgaggt agaggagatt gataagccaa agactgcttt cgtttgtccc ctggggtttt 3240 gggaatttaa tcgaatgccc cagggggtta ctaacgcccc aagcacgttc cagcgtctaa 3300 tggaacgctg catgggggat atcaatctga aagaggtttt agtttttata gatgatctca 3360 tagtcttttc ggcaacatta gaagagcatg aggaaaggct cttgcgagtt cttcaccgct 3420 tgaaagatta tggtctcaaa ttgtcaccgg agaaatgtac atttttccaa acatcggtga 3480 ggtatctcgg gcacattgtg tctccaagtg gtgttgagac agatcctgat aagatcaagg 3540 ccctaaaaac ctggccaagt cccacaaact taaaagaact caggtctttt cttggttttg 3600 cagggtatta taggagattt atcaaagatt tttccaaaat agtaaaaccc cttaaccatc 3660 tcacctctgg ataccctcca ctacacaaat caaagaagac tcaagagatt aaaggtcact 3720 acttgaatcc tagagaacca tttaagcagc gttggacatc taactgtcaa cacgcttttg 3780 aggaaatcat tgacaaactt acctctgccc ctatcttggg ttttgcaaat cctaaacttc 3840 catacatttt gcacacggat gcgagtacta cgggtttagg agctgcactt taccaggagc 3900 aggaaggaaa gatgagggca attgcttttg ccagtagggg tttgtcattc agtgagagtc 3960 gctatccagc acacaaactg gaatttcttg ctttaaagtg ggctgtgact gaaaagttcc 4020 acgattacct ttatggaagt cagttcacag ttataacaga cagtaaccca cttacatata 4080 ttctcaccac cgcgaagctg gatgcagcca gctatcgttg gctttcagct ctttctactt 4140 actccttctc cttgaagtat cgggcaggga aacttaatct tgatgctgat ggcttatcta 4200 ggcgtcctca tgaagtggct atggatgttg tctcaagaaa ggaacaagag aggattgata 4260 aatttctgac cttacatctt gaaaacttag gagagacaag tcttacacaa gatgaggttg 4320 aagccatttg tgataaacac ataattagtt ccatgcctga agatgttgtg gaaagtgcaa 4380 gtgatcgaac agtgttagtg cattctcttg ctatgagttc gaatgctgtg cccaacagct 4440 atgaggagga agaacttgga gtctcactca tacctcgcct gtcagttcag gatctaattg 4500 aaaaacaggg tgctgactca actatctctc aaattatctc tcacttaaac tcaggagaga 4560 aaccttctcc cactgttaga ggtgaattac ctgagctgtc tcttatgatg agagagtgga 4620 accgttttgt gcttctagac ggagtactgt atcgaaaaag acagaatgga gaggttctaa 4680 ctcaccagtt agtgttaccc aaagagttta gagcaactgt attgagaagt cttcatgatg 4740 agatgggcca tatggggata gatcgaacac ttgatttggc aaggtccaga ttttattggc 4800 caaagatggc acaggaagtg gagcaaaaga tcaaaacttg tccccgttgt gttctccata 4860 aagctccccc tgagaaagca gctccacttg tcaacatacg aacaactcga cctttagagt 4920 tgttgtgtat ggattttctt tctttggagc cagatcgaag aaattttaaa gacatacttg 4980 tgataacaga tcactttacc aagtatgccg tggcagtacc aactgtcaat cagaaggctc 5040 gaactgttgc tcaggctcta tgggacaatt tcatcgttca ctacggattc ccggagaggt 5100 tacacagtga ccaaggtcgt gattttgagt cacacactat aaaggagctg tgttccatct 5160 ctggaatcaa aaagggaagg actacgccct atcacccacg tggtaatcct gtggaaagat 5220 ttaacaggac acttttgaac atgttaggta gcatgaatga tgaacaaaaa gctcattggc 5280 gtgactttgt taaacctctc gttcacgcgt acaattgtac aaaaagtgaa gtgactggat 5340 ttactcctta cgaattaatg ttcggaaggc agcctcgtct ccccattgac ctagcctttg 5400 gtctccctac caccagtaag cgactgtcac actcccaata tgtgtcaaaa cttaaaaaac 5460 accttgagga aagctatcaa attgccactc gcaatgcact caagaatgct gaaagaaaca 5520 agataaggtt tgacaaacat gttgtggatt ctaccttgga agtaggcgat cgagttttag 5580 taaagcaggt tcgcttgcga ggcaaacata agctcgcaga taaatgggag ccatctgctt 5640 acattgttgt acgccgagta catgacctcc cagtgtacac agtaagacct gaaggggatg 5700 aaggaccctt gcgtactctg catcgggatc ttctgctccc atgtggtttc cttactctac 5760 caggtgagaa ggtatctaat ccaccaagtt ctacttcaaa gcctaggaca cgtcaacaag 5820 ttagtgggga agatgccagt gagaatggag ttaacgatac tgcagagtca atggaggatg 5880 aagttccaga atactggatc agaattcctg ttactaatga atctcacgaa aatgcattgg 5940 gtactttaac atctaccttt gatcctcctg tgggttgtga tcctcaattg ccttttgtag 6000 ctctaggtga tgagtcacat gtggagatgg attctgcagg actagaggca tgcttggatg 6060 attcacacca ggaaaaacag attaaacaaa tcacaacttt acagggtgaa tcaagtgaaa 6120 tgtcagagag tggtaaatcg tcagaaggtg aattgagtga agagatgcct caatgtccag 6180 atgatggaat agaaaggagt gttgaggagg aacttgaaga ggaaattagg aaaccccttt 6240 caaatgctcc tgataatatt tcatctaata tagagcagca cgaaacatct caagagccag 6300 aaaatccaat gagacgttct cagaggagaa aggaaaaacc agatagattg cagtatagtg 6360 aactgggtaa tccattagtt atagttgccc aagcactttt ccatggcctt accactgcct 6420 tcacaaactc tttaaatgga gttgactttg ttgagacatc ttcaccctca acctctgaca 6480 aagcagttac atgtcagcca gtaagagtta atgcaacggg acgtgcatag attctggtgg 6540 ggaggg 6546 // ID Gypsy-169-I_DR repbase; DNA; ZEB; 4338 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy-169-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4338 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 698-698 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 44..4321 FT /product="Gypsy-169-I_DR_1p" FT /translation="MTTPSRMPFAEAIQALTVLHQQQHHALVELKNIQEQR FT FQVLMEAQREDREQIRSLLTQEIRPAPTSAAHPPITLQKMAPEDDPEVFLD FT LFEKMAEACGWPRAEWPVRVIPLLSGEAQIAAQQLPAQNLLEYAHLKRAIL FT QRAGRNPEEQRQLFRSLVLAEGGRPFVFAQQLRDACRRWLVQDGRTTAQLV FT DAVVLEQFITRLPSRTSEWVQCHRPDDLETAIRLAEDHLVARSRVGEITSL FT SSPPLSLPLSLSRPVPRPRVRRPPTPAPRRRCAKADXQNVPRGAGPYQDGG FT VELRPLAVSQTVSGLSPVQSVGPAGPGGIVGRSGPMHWRRGHEDSAPKICS FT VKEVSALIRVPATPIVAPGRDGLYRIPVSIKGGTYQALVDSGCNQTSIHQS FT LLQDPALDMSRTVRVRCVHGDIIHYPLTAIDIQFRGKKHRVEVAVNPHLKH FT PLILGTNWPGFNRLLGVLCAGASWKKKSPDRGRVAQLGESQAVTSRADSGE FT GLGISRCKDFPLEQSRDDTLKHALERVQVIDGKILQPDRPLSYPYFAVIND FT RVYRVTQDAQTKEDTTQLLVPKSRREMLFQAAHSNPMAGHLGQAATLNRLM FT TRFFWPGIHGDVSRWCAACSECQLVNPPATPKAPLRPLPIMEIPFERIGMD FT LIGPLERSARGHRFALVLVDYATRYPEAVALRNISAKSVAEALFRLISRVG FT IPKEILTDQGTAFMSRTMRELYGLLGIKSIRTSVYHPQTDGLVERFNRTLK FT SMIRKFVHEDAKNWDKWLEPLLFAVREVPQASTGFSPFELLYGRQPRGVLD FT VVREAWEDEPSXSKNEIQYILDLRAKLHTLGRLSTENLLKAQDDQRRRYDK FT GTKLRKFSQGDKVLVLLPTSSSKLLAKWQGPFVVTRRVSDLDYEVVRSDRA FT DSRQIYHINLLKQWREPEDVALATLVTNEDDLGPESSGRERPSALVTGGDH FT LSASQLLDIRHLQREYSDVFSPLPGRTNLIQHHIETEPGVVVRTRPYRLPE FT HKKKVIQEELSNMLKMGVVEESHSDWASPIVLVPKTDGSVRFCVDYRKVNA FT VSKFDAYPMPRIDELLDRLGAARYYSTLDLTKGYWQIPLSPISREKTAFTT FT PFGLHQFVTLPFGLFGAPATFQRLMDKILGPHTAYAAAYLDDIIIYSNDWQ FT RHMQHLRAVLSALRRAGLTANPRKCAIGRVEVRYLGFHLGHGQVQPQIDKT FT AAIATCPRPKTKKEVRQFLGLAGYYRRFVPNYSALVSSLTDLTKKEGPDTV FT QWSEQCQQAFSKVKSILCGGPLLHAPDFALPFVLQTDASDRGLGAVLSQEV FT AGVERPVLYISRKLSKSEAKYSTIEKECLAIRWAVLTLRYYLLGREFVLCS FT DHAPLQWLHRMKDTNARITRWYLALQPFKFKVIHRPGVQMAVADFLSRAGG FT GEGCRPDGSPA*" XX SQ Sequence 4338 BP; 981 A; 1171 C; 1242 G; 941 T; 3 other; gtggtgccga aacccgggaa gaagaagaaa cttcgctgcc agaatgacca ctccgtcaag 60 aatgccattt gcggaagcca tccaggccct cacggtcctc caccaacaac aacatcatgc 120 gctggtggaa ctgaagaaca tccaggaaca acgtttccaa gtcctcatgg aggcccagcg 180 ggaggaccgc gagcaaatcc ggagtctgct gacccaggag atccggcccg ctccgacatc 240 tgccgcccac cctcccatca csttacagaa aatggcgccg gaggatgatc cggaagtgtt 300 cctggatctt tttgaaaaga tggctgaggc gtgtggctgg ccgcgggctg agtggccggt 360 acgagtcatc ccgctgctct cgggcgaagc ccagatcgcc gcacagcagc taccggccca 420 gaatctcctg gaatacgctc atctgaagcg agctattctc cagcgggccg gccgcaaccc 480 ggaggagcaa cgtcagctct tccgatcgct ggttttggct gaaggcggcc ggccctttgt 540 gttcgcccag cagctccgtg acgcctgccg cagatggctg gtacaggatg gccgaaccac 600 cgcccaactg gtggatgccg tggtgctgga gcaattcatc acccgcctcc cctcccgaac 660 atcggagtgg gtccagtgcc accggcctga tgatctggag acggccatcc gactggcgga 720 ggatcacctg gtggcgaggt ccagggtcgg cgaaataacc tctctctctt ctccccctct 780 ttctctgcct ctctctctct ctcgtcctgt tcccaggccc agagtgcgga gaccgcccac 840 accagcaccc agacggcggt gcgcaaaggc ggatcygcag aacgtcccaa ggggggcggg 900 gccctatcag gacggagggg tggagctacg acccctggcg gtttcccaga cagtgtcagg 960 tctctctccg gttcaatcgg ttggtcctgc tggccccggg ggtatagtgg gaaggtctgg 1020 gccgatgcat tggcgccgcg ggcacgagga tagcgcacca aaaatttgct ccgtgaagga 1080 ggtgagtgcg ctgattcgcg tccccgccac gccaatcgtc gcccccggtc gcgacgggct 1140 ataccggata cctgtgagta taaaaggggg tacatatcaa gctttggtgg attcggggtg 1200 taaccagact tccatccacc aaagcctgct tcaagacccg gcattggata tgagccgcac 1260 ggtaagggta aggtgtgtgc acggggatat aattcactac ccgctaacgg caatagacat 1320 tcaattccgg gggaaaaaac atagagtaga ggtagcagtt aacccgcacc tcaaacaccc 1380 gctaattctg ggaactaatt ggcctggatt taatagatta ttgggagtct tatgtgcggg 1440 tgcttcttgg aagaagaaat cgccggatag gggacgtgtc gctcagctgg gggaatccca 1500 agcagtgacg tcacgcgctg actcagggga agggctggga atttcccggt gtaaagactt 1560 tcccctggag cagtcgcgtg atgacacgct aaaacatgca ctcgaaagag tgcaggttat 1620 tgatgggaaa atcctccagc ctgatcgacc cctctcctat ccgtattttg cggttattaa 1680 tgatagggtg tatcgagtga cccaagacgc tcagacaaaa gaagatacaa cccagttatt 1740 agtaccaaag agccgccggg aaatgctttt tcaggcggct cattctaatc ctatggccgg 1800 acatttaggt caggcggcca cactaaatcg cctcatgacc cgattctttt ggccgggcat 1860 tcacggtgac gtcagcagat ggtgcgctgc gtgcagtgaa tgtcagctgg taaatccgcc 1920 ggccacccca aaagcgccgt tgcgcccttt accaattatg gagatcccct tcgagagaat 1980 tggtatggat ctcatcgggc cattagagcg atccgcacgc gggcatcggt ttgcattagt 2040 tctggtggat tatgcaaccc gctacccgga agcagtcgcg ctccgtaaca tctcagcaaa 2100 gagcgttgcg gaggcgctgt ttcgcctcat ctcccgtgtg ggaatcccca aggagattct 2160 cactgatcaa ggcaccgcgt tcatgtcacg cacgatgcgc gagctttacg gattattggg 2220 cattaaatct attcgcacca gcgtctatca cccacaaaca gacgggctgg tggaaagatt 2280 taatcgcacg cttaaatcaa tgatccgtaa attcgttcac gaggacgcga aaaattggga 2340 taagtggttg gagcccttat tattcgctgt gcgggaggtt ccccaagcct ccacggggtt 2400 ttcccccttc gagcttctct acggcagaca gcccagaggg gttttggatg ttgttagaga 2460 ggcttgggag gacgagcctt ctaawagtaa aaatgaaatt caatatatcc tggaccttag 2520 agcaaaactc cacacactgg ggcggctctc tacagagaat ttgcttaagg ctcaggatga 2580 ccaacgccgg cgatatgata agggcactaa actccgtaaa ttttcacagg gagataaagt 2640 acttgtactg ctacccactt ccagctctaa attactcgcc aagtggcaag ggccgtttgt 2700 ggtcacacga cgagtcagtg atctcgatta cgaggtggtt cgttcggaca gggctgactc 2760 gcgtcagatt tatcacatta atctgcttaa gcagtggagg gagccggagg acgtggcgct 2820 ggctacgctt gttactaacg aggatgacct ggggccggag agctccggcc gtgaacggcc 2880 gagcgctctg gtcaccgggg gcgatcatct ctcagcgagc cagctcctcg acatccggca 2940 cctccaacgt gaatactctg acgtgttttc gcccctgccc ggtcgtacta acctgattca 3000 gcaccatatc gagaccgaac cgggcgtggt cgttaggacc cggccgtatc gcctacctga 3060 acacaagaaa aaagtgattc aggaagaatt gagtaatatg ttgaaaatgg gagtagtaga 3120 agaatcccac agcgactggg ccagcccgat tgtcttggtg cctaagacgg acggctcggt 3180 ccggttctgt gtggattatc gcaaggtgaa tgctgtgtca aaatttgacg cttatccaat 3240 gccgcgtatt gacgagttgc tcgaccggtt aggtgctgct cgatattact cgacattgga 3300 tttaacgaag ggctattggc agatcccctt atctccaata tcccgcgaaa aaacagcctt 3360 cactacgccg tttggattgc accaatttgt gacgcttccg ttcgggctgt tcggggcacc 3420 ggcgacgttt cagcgcctga tggacaaaat actcggccct cacacagcat atgccgctgc 3480 ttatctagat gatatcatca tttacagtaa tgactggcag cggcatatgc aacatttgag 3540 ggcggtattg tcggcgctga gacgggccgg gctcacggcc aacccacgga agtgcgcaat 3600 tgggcgagtg gaggtaaggt atctgggctt ccacttaggt cacgggcagg tgcagcccca 3660 aattgataag actgcagcta ttgcaacctg tccgagacct aagaccaaaa aggaggtgag 3720 acagtttttg gggctggcag gatattatag gcgttttgtc cctaattatt cggcccttgt 3780 cagctcattg actgatctca ctaaaaagga gggaccggat accgtccaat ggtcggagca 3840 gtgccaacag gccttctcaa aggtaaaatc tatactttgc ggggggccgc tattgcacgc 3900 tcctgacttt gctctcccct ttgttttaca gacggacgca tccgatcggg gactgggggc 3960 ggttctctcg caggaggtgg cgggagtgga acggccggtg ctgtacatta gccgcaaact 4020 cagcaagagt gaggctaagt acagcaccat agagaaggag tgcctggcga tcaggtgggc 4080 cgttctcact ctccgctatt acctcctggg tagggaattc gtcctctgtt cagatcacgc 4140 tcctctccag tggcttcacc gcatgaagga taccaatgcg cggatcaccc gttggtatct 4200 agctttacag ccttttaagt tcaaggtgat ccacaggccg ggcgtacaaa tggctgtagc 4260 cgacttcctc tcgagggcgg ggggggggga gggctgcagg ccggacggct ccccggcctg 4320 aggcgggcgg tgggggta 4338 // ID CR1-16_DR repbase; DNA; ZEB; 4289 BP. XX AC . XX DT 10-NOV-2008 (Rel. 13.11, Created) DT 10-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-16_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-16_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4289 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1690-1690 (2008). XX DR [1] (Consensus) XX CC The 5'- portion is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(182..1003,1007..4018) FT /product="CR1-16_DR_1p" FT /translation="KHLGPRTLGRHNNNGHRLHLHRVFLPKRVPRPKTVEA FT CFTPAPTYNGPWLYQRRKSRSKARTSAALSPLTQKFHLSAALTQKLDFPTS FT NRFEPLTSQTDMLLPPASSGKTLIIGDSIIRHLNIKSNSTEVQNKCFPGAR FT VLDIAVQIPTILNHAKENDTVILHVGANNIRLRQSEILKKDFNELVETVRN FT TSPAIRILISGPLPTYRRGNESYSRLFAFNNWLETYCKNEHLSFINNWDLF FT WERARFYCADGIHPSRTGAHFLSRHVENALHNLQIYTIPVIYCNTNSLGLK FT QGCNQVNDIKVVVTYRDNSNRPTRNINRRKSYLVPIARNYLNDNIPFLQNE FT GQLKMALFNVRSIASKTFLIHDILSEKKLNFMFLTETWLDQNGSISLIEAC FT PPHYTFLQKVRCGKKGGGIAAIFMQSLSCKEVVLDEFLSFECLAIEIIESK FT CLVVIIYRPPKQNQGFISEFSEFLTLCVTRYDKILIVGDFNFHVDNKLDRK FT AAEFLDILYSFSLTQHVNGPTHNKKHTLDLIISLGINVNINEMIDLAISDH FT FCLFFNCQINTILQPKEKIVRKRFINTLTSETFINEIQQAPIYFRPEMNVH FT DKVDILNALLKSTLDTVAPEKIKKRRLGLKKMPWKNDHICQLKRKCRKSER FT LWRKTKLTIHHEILRSNIHEYNKTIRLERQSFYSKIINENSSNPRKLFSTI FT DCLLNKKTANYFDQPSVIMCEDFAIFFNSKITQIRTDIIQIDQNFVDELGS FT YPQNKMERFRNISKENIIQIISQLKSSSSCLDPIPTEFFKKVSNCLIDYVL FT DIVNDSLMTGIFPNAFKKAVVKPLLKKTNLDSTVLNNYRPISNVTFFSKIL FT EKVVLLQLNQFLDEHKIWEKYQSGFRKGHSTETALLKVVNDMRLNIDKGNA FT SVLILLDLSAAFDTVDHCILINRLEKLVGLSGIVLNWFNTYVTERTFLVNI FT DEYTSQEHNILYGVPQGSILGPSLFLLYILPLGSIIQKYGMNYHLYADDIQ FT LYISVEPRDTTALENLSNCMSNIVQWMTANFLKLNKDKTEILIVGKKPERE FT RIEAELCSLGLQSKKELKNLGVIFDCDLNFKSHINQVTKTCFYHLRNVAKI FT RSFLSFDDAKKLIHAFIFSRLDYCNAIYTGLPKGSTMKLQLIQNAAARVLM FT KLKKREHITPVLIELHWLPVHQRIDFKILLLVFKALHNMTPSYISDCLTKY FT NPNRLLRSSSAGLLEFYPVNLXRSGGTSFSHYAPKIWNCLPLEVKESPSIH FT VFKKRVKTHLFKIAYD*" XX SQ Sequence 4289 BP; 1435 A; 697 C; 762 G; 1393 T; 2 other; catatatcac atacaatggc ggcagcagca gcctggagga atgacagggc agacttttcc 60 tcaatcgatc agaaagattt cgaggtacgc tcaatcgagg aggaactttc agttttagag 120 attcaaataa agggattgct tgagaaacat acggagttgt gtgaacggaa aacgttgcta 180 gaaacatctc gggccgcgga cattaggccg acacaacaac aatggacacc gtttacatct 240 acaccgggta ttcctaccca agagagtacc tcggccgaag acagtcgagg cttgttttac 300 acctgctcca acttataatg gaccctggct ttaccaacgg agaaaatcac gctctaaggc 360 cagaacatcg gccgcgctga gtccgctgac acagaagttt catctttcgg ccgctctgac 420 acagaagcta gattttccaa ccagcaatcg ctttgaaccg cttacctcgc aaactgatat 480 gctattgcca cctgcttcat ctggtaagac acttattatt ggcgattcca ttatcagaca 540 tttgaatatc aaatctaatt ctactgaagt tcaaaataaa tgttttccgg gagctcgggt 600 ccttgatatt gctgtccaaa tcccaacgat cttaaatcac gcaaaagaaa atgacacagt 660 tatacttcat gttggtgcaa acaacattcg acttagacaa tcagaaatct taaaaaaaga 720 ctttaatgaa ctggtggaaa ctgtacgaaa tacatctcct gcaataagga ttctaatctc 780 tggaccactg ccaacgtata gaagaggcaa tgaatcttac agcagacttt ttgcattcaa 840 caactggtta gaaacgtatt gtaaaaatga acatttgagt tttattaata actgggacct 900 tttctgggaa agagcgagat tttactgtgc tgatggaata catcccagta ggactggagc 960 acactttttg tccagacacg ttgaaaatgc attgcataac ttgtgacaaa tttataccat 1020 ccctgtaata tattgcaata ctaattcttt ggggttgaaa cagggatgca accaggttaa 1080 tgatattaag gttgttgtca cttatagaga caatagcaat aggcccacac gaaatattaa 1140 tcggagaaaa tcatacttgg taccgatagc aagaaactat ttaaatgata atataccgtt 1200 tttgcaaaat gaaggacaat taaaaatggc actttttaat gtaagatcaa ttgcgagcaa 1260 aacatttttg atacatgata ttctatcaga gaaaaagtta aactttatgt tccttacgga 1320 gacatggctt gatcagaatg gctctatatc tcttatagaa gcttgcccac ctcattatac 1380 gttcttgcaa aaggttcgct gtggaaagaa gggtggcggg atagccgcta tatttatgca 1440 gtcactgtca tgtaaagagg ttgttcttga tgaatttctt tctttcgaat gtctggcaat 1500 agaaataata gaaagtaagt gtcttgttgt tataatatat cgacctccta aacaaaatca 1560 gggttttatt tccgagttct ccgaattttt aactttgtgt gttactagat atgataaaat 1620 attaatagtg ggggatttta actttcatgt tgataacaaa cttgatcgca aagcagcaga 1680 gtttttagat attttatatt cttttagtct cacgcaacac gtaaatggtc ctacacacaa 1740 taaaaagcat acgcttgatc ttataatttc acttggtatt aatgttaaca taaatgaaat 1800 gattgatcta gcaatctcag atcatttctg cttatttttt aactgtcaaa ttaatactat 1860 acttcagcct aaagaaaaaa tagttagaaa gcggtttatt aataccttaa cgtcggaaac 1920 ctttattaat gagattcaac aagcgcctat atattttcgg cctgagatga atgtgcatga 1980 taaagttgat atcttaaacg cgttattaaa atcaactctt gatactgtgg ctccggaaaa 2040 aataaagaaa aggaggttag ggttaaagaa aatgccatgg aagaatgatc atatttgtca 2100 gttgaagagg aaatgtagaa aatcagaacg tctctggagg aaaactaaat taaccataca 2160 tcatgaaatt ttgcggtcta atatacacga atataataaa actattcgtt tagaaagaca 2220 gtctttttat tctaaaatca ttaatgaaaa cagcagtaat cctagaaaac tcttttctac 2280 tatagattgt cttttaaaca aaaaaacagc taattatttt gaccagcctt ctgttattat 2340 gtgtgaggat tttgctatat tttttaatag caaaataacg cagattagaa ccgatattat 2400 acagattgac caaaattttg tagatgaatt gggaagttat ccacaaaata aaatggaaag 2460 gtttagaaat attagtaaag aaaatattat ccaaatcata tctcaattaa agtcctcttc 2520 tagctgtctt gaccctattc caactgaatt ttttaaaaag gtttcaaatt gcttgattga 2580 ctatgtcctt gacatagtaa atgattcatt aatgacagga atttttccta atgcttttaa 2640 aaaagctgtg gtaaagccac ttttaaaaaa aacaaatttg gactccacag ttttaaataa 2700 ttacagacca atttctaatg ttacattttt tagtaagatt ttagaaaaag ttgtattgct 2760 acagctcaat caatttttag atgagcataa aatttgggaa aaatatcagt caggttttag 2820 aaaaggtcat agtacagaaa cagcattatt gaaagtagtt aatgatatgc gcttaaacat 2880 agataaaggt aatgcatcag ttttaattct tctagattta agtgcagcat tcgacacggt 2940 tgatcattgc attttgatta acagactgga aaaacttgta ggtctctctg ggatagtttt 3000 aaattggttc aatacatatg tcacagaaag gacattctta gtaaatatag atgaatatac 3060 atcgcaagaa cacaatatat tatatggtgt tcctcaggga tctatcctag gcccaagtct 3120 atttttatta tatatacttc ctttaggtag tataattcag aaatatggca tgaattatca 3180 tctttatgct gatgacatac aattatatat ttctgttgag ccaagagaca caactgcatt 3240 ggaaaatttg tccaactgca tgtccaatat tgtgcaatgg atgactgcaa attttttaaa 3300 gttgaataaa gataaaacag aaattctaat tgtgggcaaa aaacctgagc gagagaggat 3360 tgaagcagaa ctgtgttctt taggtttaca atcaaaaaaa gaattaaaga acctgggtgt 3420 tattttcgat tgtgatctta attttaaatc tcatattaat caagttacaa aaacatgttt 3480 ttatcattta cgaaatgttg ccaaaattcg ttctttccta tcttttgatg atgccaagaa 3540 attaattcat gcttttattt tctctcggct tgattattgc aatgctatct atactggtct 3600 acctaagggt tctacaatga agttgcaatt gatacaaaat gcagcagcca gagttttaat 3660 gaaattgaag aaacgagagc acattacacc agtattaata gaattacact ggttgcctgt 3720 gcaccaaagg atagatttta agattctttt attagttttt aaagcactac acaatatgac 3780 accttcctac atttccgact gtttaacaaa atataatcca aaccgtttac tacgctcttc 3840 tagtgctgga cttttagaat tttaycctgt aaatctgakg cgatctggcg gcacttcttt 3900 tagccattat gctccaaaaa tttggaactg tctccccctt gaagtgaaag aatcaccaag 3960 tattcatgtt tttaaaaagc gtgttaagac acatcttttt aaaattgcgt atgattaaac 4020 aagtttgtta ggtgtacgca ttttatctat tttgttgtag gtttatgtat gtaatgtatg 4080 gtgtgtgcct gcatatgtaa ggactattgt gtgtgtgatc atgtgaatat atacttacat 4140 tttgtgtaca tgaatgtttt tatttcagta tatgtgataa gtttttgtgg tctattttaa 4200 tgcttttgtt gttgtttgtg aggcactttg agttgcatgt atgttggaaa agtgctatac 4260 aaataaagtt tattattatt attattatt 4289 // ID hAT-N75_DR repbase; DNA; ZEB; 1327 BP. XX AC . XX DT 14-NOV-2008 (Rel. 13.11, Created) DT 14-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N75_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1327 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1742-1742 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1327 BP; 424 A; 204 C; 237 G; 461 T; 1 other; cagaggtggg caaactacgg cccgcgggcc atatgcggcc cgctgaacac tttcatccgg 60 cccgcgaggg agtttttaaa atgctgcttc tgagtaacgg tttggtttca gaattagtaa 120 attaatgatg caattacaag tgaacgccat ttgatggcag tattgcacgc ggtacagtta 180 gtgcaaccag tagaattaga acacactctc ggatattaaa atgcactcga ccgttaatga 240 ctgaaaatga gtcaggcaaa aaaattaaat gactctgaat gtcgtgtgtt taaggaagaa 300 tggacaacaa aatacttctt caccaatatc ggacagaaag cggtatgtct aatatgccaa 360 gaaagtattg ctgtttttaa agactacaac ctaaggggct gatcacaccg aacgcgcatt 420 tacgttccaa aaacgcgagg tgcaccgcac tgcctttttg ttgacaagaa aaaaaaggag 480 cgcggtgcgc ttttttatgt cgttaggcaa cgactgaatc agctgggtat tttgcgagag 540 tgttgctgtt gatattaata taactttatt tgatcaggct cattgttcac attaacacac 600 tcataataac atgacttgga ataacaaaga ctaagattct tcttccgttt tatctcatag 660 ttttataatg attgatatga gcatattaat acaagctcaa atgagattgt tgtgaatttg 720 actcaaatta tataataatt gctggttaaa tgtgttaatt tttgtcatag aatatatttt 780 ccatgcatac atccacttta aaaagttctt ttatcaaaga tttgtggttt catcaagttt 840 tatcaagttt tatgatgatt gatctgtata ttctaaaagc agtgcaaatt agattattgt 900 aaatttgact gaatagtaaa ctttaagaat tatttttaat gtgttaatat gagatttttt 960 ctatcagaca atatcttttc tattctccac tgtaaaaatt attacattat tattattatt 1020 attattatta ttattattaw tattattata ttttataagg gattatttta ttgttttatg 1080 atgattgata agtaaatata aatagcaata gcgtgcaaat taatttgact tgtaaatttg 1140 actcaataat aactattaaa aaaatctgtt ttgtgttaat gtgacatgtt tctgtccgat 1200 attacatgtt ccatgcatac tgccactata aataaaagtt attttttgaa ggatttgaca 1260 tgtggccctt cacttggttt gcaaaacttg atgtggccct cgggtctaaa aagtttgccc 1320 accactg 1327 // ID Gypsy79-LTR_DR repbase; DNA; ZEB; 147 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy79-I_DR; KW Gypsy79-LTR_DR; Gypsy79_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-147 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1474-1474 (2008). XX DR Genome; chr12; Positions 13082321 13082175. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 147 BP; 37 A; 29 C; 38 G; 43 T; 0 other; tgttacacag tgtaatagca tacattgacc tgtatccggt ggggacgctg tcctgcggga 60 tgctaggctg ttgttgcggg agactaccat gcaataaagg cagttctagt tcagctcatg 120 tctgttttat tgtatagaac acaaaca 147 // ID TC1DR2 repbase; DNA; ZEB; 876 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TC1DR2; TDR1; TC1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-876 RA Jurka J. and Drazkiewicz A.; RT "A non-autonomous DNA transposon related to Tc1 superfamily."; RL Repbase Reports 2(2), 19-19 (2002). XX DR [1] (Consensus) XX CC Related to TDR1, TZF28 and TC1_FR (all variants, see pufrep.ref). XX SQ Sequence 876 BP; 317 A; 160 C; 138 G; 249 T; 12 other; gggggaaata agtattgaac atgtcaccat ttttctcaga aaacatattt ctaaaggtgc 60 tgttgacttg aaattttcmc cagatgttgg taacaaccaa ataaatccat atatgcaaag 120 aaaacaaaac taattagttt acaaattaag ttatgtgtaa taaaatgaaa tgacacaggn 180 aaaaaagtat tgaacacatg aagaaaggga ggtgtagaaa ggcagtgaaa gcccagacag 240 cagctgaaat ctctcagtag ttcttcagca accctctgcc cttcgtcatt gtaaatnaat 300 attagctgct tcagtccaac atctacatta tcaggatgat gaagatgaaa ccagggtgga 360 catttcagca agacaatgat ccaaaacaca gccaaggaaa ctctcaaatg ctttcagaga 420 aagaaaatca agctgtagaa tggcccagcc aatcacctga cttgaatcca atagaaaata 480 caaaataaag atcagatttg atagacganc ananaagacn aagacccaca gaaccatcaa 540 gatttttaca ctctgttgaa gtctgtgaaa aactcacacc tgagcaatgc atgtgacttc 600 attctccata tgagaggcgt ctttaagctg ccatcaccaa aaaarccttt tataaaaagt 660 attaaataca tttcagtagt tcagtacttt ctccttgtgt catttcattg ttattacaca 720 tnacaaattt ttcagattta tatatatrta taaatatttt atgtttgtat tgtttgggtt 780 tttnaccaaa atctggttca attccatgtc aacagctcct ttagaaatat tattcccagg 840 aaaatwaaca tgatgtgttc aatacttatt tccccc 876 // ID Gypsy86-I_DR repbase; DNA; ZEB; 4007 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy86-I_DR; KW Gypsy86-LTR_DR; Gypsy86_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4007 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1483-1483 (2008). XX DR Genome; chr1; Positions 53848213 53844207. XX CC 'ATGTT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 37..3927 FT /product="Gypsy86-I_DR_1p" FT /translation="MMNNIRSPEELKLTGNVNENWKVFKQSFELYALAIGL FT DGNERRKIALLLTVAGRSALDVYNTFVFTEEEKDKYEAVIEKFEQYCTPRK FT NETYERYVFRNRLQKESESIEQYVTDLRLKSQTCNFGTLCDSMIRDQIVIG FT VQDKRVRMQLLKETDLTLDRAIKICQASECAMAQLKTFSEEKETAEVDTVR FT SASEKAMLNKKKQQKPNKESRACGKCGNKHARRKCPAFGKDCRKCGGRNHF FT AKCCYSKKKVKLLEKRSDSEDEEAPIFLDSIKEGQTTSKDEWIAHMEVNGT FT DVLLKLDTGAQANILPMKDFQRLIKKPRICDKKVNLKTYDNNTIPTKGVCR FT VSLSYNGQKKNVLFVLVEGNKQAILGLKTCMQLGLIKRVHVINKDMTVKQK FT KKDANTKEVMHSDWIKQYKEVFRGIGRLPGEHKIKLKETAEPVIHPARKVP FT VALKERLKKKLDTLIKEGVIRKIEEPTEWVNSLVIVEKADGDLRLCIDPKD FT LNKAIQREHYRLPTKSDITSAMSGACYFSKLDASSGFYQMVLDEESAKLCT FT FNTPFGRHCFLRLPFGISSAPEVFHRTVQQLFDGIEGVGVFIDDVVVWGRT FT KKEHDARLCRVLNQAQKTGLKLNKNKCEFGVREITFLGERLSEEGVQPDRE FT KIRAIKEMPEPRDKKDLQRALGLVNYLGKFVPNLSANTRSLRNLLEANTEW FT QWKKEHIEEWVWLKDSLTKEPVLKFYDQDKPLKVSTDASKLGLGAVLLQQH FT DIEWYPVAYASRTMTSAERNYAQIEKETLGAVFGCEKFHEYVYGRPVILET FT DHKPLIAISKKPLGEVPPRIQRLMLRLQKYDLAFEFKPGKHLIVADALSRA FT SLQNSTSSTEEDVQVHVDSIRVQIPVSTAKWAEIAKETQKDEHLKRVIEII FT HLPEKGMLEKPYQHFQDELTVLDGVLLKGTKIVVPNTMRKKMLKLVHEGHL FT GMEKCKRRAREVLYWPGMHRDIVTLVQKCEVCQRHRYQQPKEPMKPHDKPK FT EPWRKIGMDLFQLKDKDYLLLIDYYSNYPEFVRLSNTTSEQVIVETKVIFA FT RHGIPTTVVSDNGPQFISQSFKDFARNYGFDHITSSPLYPQSNGLAEKGVQ FT IVKRLLKKAAETGEDPHLAVLNYRASPMENGLSPAELLMNRKLRTRLPSAN FT HHMMQSSVNHAKERQINYYNRTAKPLIPLAQEDVVRVRCDGQWGPLAKVTK FT ETMPRSYEVLTEHGKLMRRNRRHLLKVPQRALRSNESDFQDVEISKQKEQS FT ELQDKLTETDRKNDVERLTERPKRQIQKPKRLIEEM" XX SQ Sequence 4007 BP; 1534 A; 670 C; 948 G; 855 T; 0 other; tggtaccagg agaaataacg aaaattaaaa agagagatga tgaataacat ccgttcacca 60 gaggaattga agttgacggg aaatgtcaat gagaactgga aagtttttaa acaaagtttt 120 gaactttatg cgctcgctat agggctggat ggaaatgaac ggaggaagat tgcgttgctc 180 ttaacagtgg caggacgcag tgcactagat gtgtataaca cgtttgtctt tactgaagaa 240 gaaaaagata aatatgaggc tgtgatagag aagtttgaac aatactgcac accgaggaaa 300 aacgagacct atgaaaggta tgtcttcaga aacagactgc agaaggagtc agaatccatt 360 gagcagtatg ttacagatct gagattaaaa agccagacat gcaattttgg aacgctgtgt 420 gactccatga ttagagatca aatagtaatt ggagtacaag ataaaagagt gaggatgcaa 480 ctgttgaaag agactgacct cacattagat agggcaataa aaatttgcca agcgtcagag 540 tgtgcaatgg cacagctgaa gacattcagt gaagagaagg agacagcaga agtggataca 600 gtacgtagtg cttcagagaa agccatgtta aataagaaaa aacaacaaaa gccaaacaaa 660 gagagcaggg cctgtggaaa gtgtggcaac aaacatgcac ggagaaaatg cccagcattt 720 ggaaaggact gtagaaaatg tggaggaaga aatcacttcg caaagtgctg ttattcaaag 780 aaaaaagtta aactccttga aaagagaagc gacagtgagg atgaagaggc accgattttc 840 ctggactcaa ttaaagaagg gcaaactact tcaaaagatg aatggatagc acacatggag 900 gtaaatggta ctgatgtctt actaaaatta gacacaggag cacaggcaaa catcctgccg 960 atgaaagatt ttcaaagact gataaaaaag cccaggatat gtgacaaaaa ggtgaatctg 1020 aaaacctatg acaacaatac cataccgaca aagggagtat gcagggtaag tttgtcctat 1080 aatggacaga agaaaaatgt tctttttgtg ctcgtagaag gaaacaaaca agcaatactg 1140 ggactgaaaa catgtatgca gcttggatta ataaaaagag ttcatgtcat aaacaaagac 1200 atgacagtga aacagaaaaa gaaagatgca aacaccaaag aagttatgca cagtgactgg 1260 ataaagcaat ataaagaagt gttcagagga ataggacggt taccaggaga acacaagata 1320 aagttgaagg aaacggcaga acctgttatt cacccagcaa gaaaagtgcc agttgcttta 1380 aaggagagac ttaaaaagaa actagacaca ctgataaaag agggagtcat caggaaaata 1440 gaagagccca cagaatgggt gaactcactt gtaatagtgg agaaagcaga tggagatcta 1500 agactgtgta tagatccaaa agatttaaac aaagcgattc aaagagaaca ctatagactt 1560 ccaacaaagt ctgatataac aagtgcgatg agtggagcat gttacttttc caaattggat 1620 gcctcatctg gattctatca gatggtccta gatgaagaaa gcgcaaagct gtgcactttt 1680 aatacgccat ttggaaggca ttgtttctta agattaccgt ttggcatcag ttcagcccca 1740 gaagtattcc acagaactgt gcagcaactt tttgatggaa ttgaaggagt aggagttttc 1800 attgatgacg tggtggtttg gggacgtaca aagaaagagc acgacgcaag actttgcagg 1860 gtcctaaatc aagcacagaa aacaggatta aaactgaata aaaataagtg tgagttcgga 1920 gtgagagaaa ttaccttcct gggagaaagg ttgtctgaag aaggggtaca accagacaga 1980 gagaaaatcc gcgcaatcaa agaaatgcca gaaccaagag acaagaaaga cctccagaga 2040 gctttagggt tagtaaacta tttaggaaag tttgttccaa atctgtcagc aaatacaagg 2100 tcattaagaa acctgttgga ggcaaacaca gagtggcaat ggaaaaagga acatatagaa 2160 gagtgggtgt ggttaaagga cagtttgacc aaagaacctg tcttaaagtt ctatgatcaa 2220 gataagccac taaaagtctc tacggatgca tcgaaattgg gtttaggagc agttttgctt 2280 cagcaacatg acatcgagtg gtatccagtt gcatatgcat ctcgcacaat gacgtctgcg 2340 gaaagaaact atgcacaaat agaaaaggaa actctaggtg cagtgtttgg ctgcgagaaa 2400 ttccatgagt atgtttatgg aagaccagtg atcctggaaa cagatcataa gcctctcata 2460 gcaatctcaa agaaaccact aggagaagtg ccacctcgca tacaaagact aatgctacgc 2520 ctacagaagt atgatctggc ctttgagttt aaaccaggca agcacctcat tgtggcagat 2580 gccctgagca gagctagttt gcagaatagc acaagcagca cagaggagga tgtgcaagtt 2640 cacgtagact ctatccgagt acagatacca gtatctactg caaagtgggc agagatagct 2700 aaagaaacgc agaaggatga acatttaaaa agagtgatag aaataattca cttgcctgag 2760 aaaggaatgt tggaaaaacc ataccaacat tttcaagatg aactgacagt gctagatgga 2820 gttctgctaa aaggaacaaa aatagtcgta cctaacacca tgaggaaaaa gatgttaaaa 2880 cttgtgcatg agggacactt gggaatggag aaatgtaaga gacgggcacg agaggtactg 2940 tactggccag gtatgcacag agacatagtg acattagtgc agaaatgtga agtatgccag 3000 cgacacagat accagcaacc taaagagccc atgaaaccac atgacaaacc aaaggaacca 3060 tggagaaaga ttggaatgga cttatttcaa ctaaaagaca aagattactt gttgcttatt 3120 gactattatt caaattaccc agaatttgtg agactgtcaa acacaacatc agagcaagta 3180 attgtagaaa caaaagtcat atttgcacgt catggtatcc caacaactgt agtaagtgat 3240 aatggaccac agtttatcag tcagagcttt aaagactttg caagaaatta tggatttgac 3300 cacataacat cgagtccgct ttatccacaa tccaacggct tagcagagaa aggagtacag 3360 attgtaaagc gcctgttgaa aaaagcagca gagacaggag aagatcctca cctggctgtt 3420 cttaactaca gggcatcacc tatggaaaat ggcctttcac cagcggaact tctcatgaat 3480 cggaaactgc ggacaagact tccatcagct aatcatcaca tgatgcaaag ttctgtgaat 3540 catgcgaaag agaggcagat taactattac aacagaacag caaaaccatt gattccactg 3600 gctcaagaag acgtggtacg ggtgagatgt gatggacagt ggggaccgtt ggccaaagtg 3660 acaaaagaga ctatgcccag gtcttatgaa gtgttgacag aacatggaaa gttaatgagg 3720 cggaacagaa gacacctgct gaaagtgcct cagagagcct taaggagcaa tgaaagtgac 3780 tttcaggatg tggaaatctc aaagcaaaaa gaacaatcag aattgcaaga caagttaaca 3840 gagactgaca gaaagaatga tgttgagaga ctgacagaac gtccaaagag gcaaatacaa 3900 aaaccaaaaa ggctaataga agaaatgtaa attgttcata tatatgtaaa aaagttatag 3960 tttcaagttt tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa gggaaga 4007 // ID EnSpm-N1_DR repbase; DNA; ZEB; 2530 BP. XX AC . XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 02-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE EnSpm-N1_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; EnSpm-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2530 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(10), 1261-1261 (2008). XX DR [1] (Consensus) XX CC TSD is 2 bp long. The terminal sequence is similar to that of CC EnSpm-2_HM. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2530 BP; 791 A; 438 C; 392 G; 908 T; 1 other; cccaacaaac acacaacgtt gcctcaacgt agcgaccttc ctacaacgtt gtaccaacat 60 tgcaaaaagg ttgaggattc tacgttgttg ctaaaggttg tcacaacgat gtcataaggt 120 tgccatttaa acatcatgag ggtcgcatgc tttaggttgt gcttaggtgg cgtggacgac 180 cactgtacaa catttactgg tcacagctga atcagtttaa atttcaacaa gccgtcgtct 240 cattcatcaa cgaccgagga gacgtgagta ttgctttatt tcttcttgct aaataagata 300 cgtaaatcgt gaatattcaa acaaacctac gatttcgaca acaatctgtt ccaacattac 360 acacacacaa aagccgtaat gaagttatta cgagcgagta acatttcggc attccgttaa 420 cggacggcaa caacattaaa atcattatgt taaatctctc gataaaacat ttcctgatta 480 tttacctacg ccttgtcttt agtcaaatgg acaataatgt ttttgaagtt cagttttttc 540 catatccatc gtgatcagtt agtatgatac ttcatcacac gagccgaccc actttttaca 600 aaacacacaa aataactttt aaaaacaaaa catctctgtg cagattgcgt cccgttaaag 660 gaacagttca ctcaaaaatg atatgtactc acgaattatt tgccctgtaa ataataagtg 720 attccaaacc tttaagtttc tttattctgt tgaacactaa agactgtaag catttgcttt 780 catagtagga aaacactatg gaattaaata gttacagttt tccagcttta tttaatatat 840 cttctttatt gtttagcaga aaaaaagcta aataagtttg gaacaaataa aggatgaggc 900 taaaatggca gaattgtcag ttttgggtga gctatcccta taagttaaag tatataaatt 960 ratgttcata tgcctaatta atatagatag taattaaaag gtctgctagt ctaaatttct 1020 attttgattg gtcttttgat gttttattta aaacccactc ttttctttgc tcaacaggct 1080 atttttacat ggaggtaatc ttctattgat aatttctgga ttttaattat gcgtaagttg 1140 cataaaactc tattgtttaa ccactcactg ttgttttcaa attactcatg taacttgtct 1200 ttcagaatgt tttcagctgt tgaattcacc tcagatcagt gttgctgtaa ttcctgatct 1260 atggaaagaa gaattggatg aggtaagaag tatactactc tatttactat taaatgtagt 1320 tatgttacct aaaacagcat gtccataatg tggaaaatag atcatgctta tgaaaaatta 1380 ctttattatt ttgtacagct gttattgaaa tgacactaag tttaacattc catttacagc 1440 gagcaacgcc acaactacag gacctcttct ttctcttctc ataatttaca gtaagtaagt 1500 attcggagac taactctgac ttctgttaac tcaaacctgt tttccctgta tctctaaatt 1560 tatttaattg tatgtaattt tggtatatga tatattatta aaacattaaa ataaagtagg 1620 ctacataatt actcactgga gattttatat gtaatgaaat tacaataaat gtctgaaact 1680 tcaataactt gctagtcata gtcatttcct gtgtagtgct gttgtgtcaa gctctgatct 1740 gtcctctctt gttcactgcc tattttgact tgtgtctgct gtggatctct gaagctggat 1800 tatccgttga atttgtttct tcctctgact gctttctcca taccaacaac tacgactggt 1860 cttcagctgc gctccccggt ctgctgtatt ctccagtcca gagcttcact gcccagtgtt 1920 tgaattttaa tttatatatt tatttatata tttatttaca ttttaatgat ttcttgttta 1980 cttttcttgt ttgtttaccc tttagtttaa ttatatatat atatatatat atatatatat 2040 atatatatat atatatatat atatatatat atatatatat atatctaaac tttattttat 2100 tttatactgt ttgcttataa ttagatactg tatgtgtact gtatgtatta aatgttgtga 2160 aatatataat aaaaagtttg attatattca tttgtgtggc atttttacac tttaaagttt 2220 acacatgcag ttgttaaatt aaataggaaa aataaatgaa caatatttca tattaagcaa 2280 tactggaaaa gaattgctgc ttatcctgta ttctaggcag tattgataaa ctgccacaaa 2340 aaaatattaa tacaagtagt aagaacgccg cctcgcggtt gctgaatgtc gcacgacaac 2400 gttgctacag ccttctcacg tcttacattt ctacgttgtg acaatgtagc gagcacgtaa 2460 gcggcaacgt tgcctttgaa aaaatacaac gttgtgcaga catcgctaca acgtaaatgt 2520 gtttgttggg 2530 // ID BHIKHARI-5-LTR_DR repbase; DNA; ZEB; 1663 BP. XX AC . XX DT 26-NOV-2008 (Rel. 13.11, Created) DT 26-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE LTR retrotransposon from Danio rerio: long terminal repeat - a DE consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; LTR; KW BHIKHARI-5-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1663 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1777-1777 (2008). XX DR [1] (Consensus) XX CC TSD is 4-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1663 BP; 507 A; 327 C; 304 G; 524 T; 1 other; tgtaacatat tgtaacctgc attctggaga taaacattcc atttaacatt ccagaaaaca 60 atctgggtga atcgccagga caataaaatc acttggtgac gaaaagtctg gcttagtaat 120 cccttggtga catgttaaca agtctgggag acttcacaga accactctct gcctacttca 180 aaagaacgtg acagataaga taagaaagaa agaaaaaaaa aaacctcttt agccaacatt 240 ctgctcacag acaagacatt cctcaaaatt aaacaaagac atttataacc caaatctctt 300 gtttaatttc atttaacttt tgtaatgtgt acaatacctg atatagacta tacacattca 360 cgcttgcagg cctttttaaa agcatcaatg atgtaaggat atgggaacta tgttratgta 420 cttttaatgt gatgtacttg tagagtgttt ggtgtcaata gaatgccctc gacattttat 480 aacacatgat gcatagatcg agaccataac tcgcattgct ttattcttta aagccctgta 540 ttaaatgcct gtccgcatgc tttaggcggg cgctcaaatt tgcatacgcg cagcccagag 600 acaaaggaag gttcgcgcgc aaactttccc ctgaaatcat tggtcagaat gctgtacggg 660 ttgtcacgtg acccgcaggt ataagtcaca cttccctccc aaacatctgg gcagaaagag 720 aacgtcccaa aaaggaacat cattgacggt ggtcccttca agggccacac gtggctttcc 780 taagccgcat ggactttttc ccgccacgct tttctttatt ttccggcaaa gtaaattcag 840 tttaaagttt gcgatcgtgt tcgtccgaac tgcattacaa actccacgca tcaaaggaca 900 tcaagagtag ttttcatcac gacggaagag agacgcataa aagagacctc cagacatgaa 960 aaaccaggtg attttagtta tgcgattaag ctgtgcccct tttatcaaaa aggtgttttt 1020 tataaccttg gtggtccgta atggtgcgtg tggaactgaa agttttgtca ctcttttatc 1080 tgaaatctat attttctagc tttactattt ctctttgctg ttacacgttc tatagacccg 1140 atatctccac gtggtttacc tttgttttgt gtttaatgta tgtttgtgcg tttgtttgtt 1200 cgttacgtct gactagtcaa taaattccat tataatcaaa tgaattgtat tgtttgcctc 1260 acgagaaaat gtcactgaaa tacagattcc cctgcctagc tattataagt tgtttaaaac 1320 ttttgaatga ttaatctact tcacaagaag tagcaattaa attccctttt tctaaatgaa 1380 tagattacag tgtccgctcg gaagagcggc ggctctgctt gtgttcgttt ggtcaaatta 1440 acaataaatt gatccggaat attaatgatt aataataatt cgttattgtt gataattaat 1500 attcataatc tgggaatccg ctcggtcgcg cgagcatgat catttacaat catatgtttg 1560 tttgaccaaa ctgactgaag cttaagccgg aatattaata attaagaata aaccgttatt 1620 gttgattatt aatattcata aaatgagcta atttcatgct aca 1663 // ID DIRS1_DR repbase; DNA; ZEB; 6132 BP. XX AC . XX DT 11-FEB-2002 (Rel. 7.01, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 5) XX DE DIRS1_DR is a DIRS-like LTR retrotransposon - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Gypsy; DIRS1; KW DIRSDR1; DIRS superfamily; reverse transcriptase RNase H; KW phage integrase; DIRS1_DR. XX NM DIRS1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 414-5132 RA Jekosch K.; RT "DIRSDR1: putative non-LTR retrotransposon."; RL Repbase Reports 2(2), 9-9 (2002). XX RN [2] RP 1-6132 RA Kapitonov V.V. and Jurka J.; RT "DIRS1_DR, a family of DIRS-like elements in zebrafish."; RL Repbase Reports 3(1), 1-1 (2003). XX DR [2] (Consensus) XX CC DIRS1_DR is a family of DIRS1-like retrotransposons. These CC elements CC are related to Gypsy-like LTR retrotransposons and endogenous CC retroviruses. CC There are ~100 copies of DIRS1a_DR in the genome, they are ~0.3% CC divergent from the consensus sequence. Therefore, this family CC retrotransposed in the zebrafish genome very recently. CC The unusual structure of DIRS1_DR is depicted in the next figure. CC GTTCCCCTTCGGTTGGGGAACTTCAGTGCCATGAATGGGAGGATTCGGATCAGAAGCCGCTTATCTGGAG CC <====== ======> <--------------------------------------------- CC AGTATTGAACGGGCCAATGAATGAAATTAATTGGCAGCGTAAGCTTGCGCAGGTGTGCGACATCTGCAAT CC ---------------------------------------------------------------------- CC TATCTCAGCATATAAGCACACCTGAAGCCAGCAGACGCCATCCTTTTCGCTTCAGATCCTTTCTGAGTGA CC ----------------------------------------------- CC ...................................................................... CC ...................................................................... CC GGTGCAGTCATTATGGCGCTTTCCATATTCTCCCATTCATGGCACTGAAGTTCCCCAACCGAAGGGGAAC CC <====== ======> CC <~~ CC GTTCGAGGTTACAGAAGTAACCCTTCGTTCCCCGAGGAGGGGAACGGAAGTGCCATATTCCGTCGCCATA CC ~~~~~~~~~~~~~~~~~~~~~~~~~~<====== ======> CC ATGACTGTCCCTTAGCTGTTTGAAAGTCTCTTCAGCTT CC AAAAGGATGGCGTCTGCTGGCTTCAGGTGTGCTTATATGCTGAGATAATTGCAGATGTCGCACACCTGCG CC ---------------------------------------------------------------------- CC CAAGCTTACGCTGCCAATTAATTTCATTCATTGGCCCGTTCAATACTCTCCAGATAAGCGGCTTCTGATC CC ---------------------------------------------------------------------- CC CGAATCCTCCCATTCATGGCACTTCCGTTCCCCTCCTCGGGGAACgaagggttacttctgtaacctcgaacgtt CC ----------------------> <====== CC ======>~~~~~~~~~~~~~~~~~~~~~~~~~~~~> CC Fig.1 Termini of DIRS1_DR. The 163-bp sub-terminal inverted CC repeats are CC underlined by a single line. CC DIRS1_DR encodes three ORFs. ORF1 (positions 414-1632) codes for CC the CC gag-like protein. ORF2 (positions 1633-2597) codes for reverse CC transcriptase and RNase H. ORF3 (positions 2598-5129) codes for CC the CC phage integrase. XX FH Key Location/Qualifiers FT CDS 414..1850 FT /product="ORF1p" FT /note="Gag-like protein." FT /translation="MALRLCVSGCGGFLSPDDGHDHCIACLGVQHVNAVLA FT GGSCRHCDAMTVAQLRSRLTFARERATPVASCSKKAAGARADLRVSAGANP FT PPTGSRTSRSSRRSIQASGGESDPSNQMVALTLADTGDQMSSAASEGGLSL FT SDEDPDPLAPSGQVSAVKSDPEADMLAVLSRAASAVGLEMVYPPAPRPDRL FT DGCYVEDQKAKPSKPLVPFFPEVHSRLTQSWRAPFSARAASASALTALDGG FT AARGYEAIPSVERAIAVNLCPRGASTWRGLPRLPSKACRLSASLGARAYKA FT AGQAASALHAMATYQRYQAQALAELHEGGSNPSLLHELRTATDYALRTTKS FT AACALGRTMSTLVVQERHLWLNLADMRDVDKVRFLDSPISQAGLFGDTVGE FT FTQEFKAVKEQSDAMGNVIYRRGRKPAPPAEPSTSAVPRRGRPPTSAAPPP FT PAPPAKRARRSPRKQAAPPAQGAVKSGKRTAKRP" FT CDS 1633..4110 FT /product="ORF2p" FT /note="reverse transcriptase." FT /translation="MRWAMSSIGVAVSPLRPPSHPPPLFLAEGARQRVLPR FT PRLRLRPSGRGVHLESRQPLLPRAPLSPVNGPRSVPETGHPEKRKLALSPL FT EGGAPITTVLFSATKTSVKEHFFPSPDVTARVLPVRDALPSGSQTLRASPV FT AHERWGDGLPSLSPPAPSPESGCGARANRSPPAFPRDPRASRISTPTPRCP FT TAGTSAIVAMTPLARALPAWLARASPSRWLIRTIRLGYAIQFAKRPPKFTG FT VYFSRVNPLSAPVLREEIAALLAKGAIEPVPPAEMESGFYSPYFIVPKKSG FT GSRPILDLRVLNRCLHKLPFRMLTQRRILQCVRPRDWFAAIDLKDAYFHVS FT ILPRHRQFLRFAFEGRAWQYKVLPFGLSLSPRVFTKLAEGALAPLRLAGIR FT ILSYLDDWLILAHSREQLIMHRDEVLRHLRLLGLQVNREKSKLAPVQRISF FT LGMELDSITMVAHLSEERARLLLNCLRELDSKLVVPLKFFQRLLGHMASAA FT AVTPLGLLHMRPLQHWLHDRVPRRAWHAGTHRVSVTALCRRALSPWNDPSF FT LQAGVPLGQASSHVVVSTDASNTGWGAVCRGHAAAGLWKGAQLHWHINRLE FT LLAVFLALHRFLPVLERQHVLVRTDSTAAAAYINRMGGMRSRRMSQLARRL FT LLWSHPRLKSLRAIHVPGTLNRAADALSRQLLRPGEWRLHPESVQLIWARF FT GEAQIDLFASPENAHCQLFFSLTEGSLGTDALAHSWPRGMRKYAFPPVSLL FT AQFLCKVREDEEQVLLVAPLWPNRTWISELSLLATALPWRIPLREDLLSQG FT QGTIWHPRPDLWNLHVWSLDARKT" FT CDS 2598..5129 FT /product="ORF3p" FT /note="phage integrase." FT /translation="MRSSSGLVCSHRPEGRVFPCLHSSTPPPISAVCVRGS FT SVAVQGPPLRALSVSAGLHQTRGGCPSAPSARGHSHTQLSRRLADFSPLAG FT AIDYAQGRGASASPPTGASGQPRKEQTRPRAEDFFSRDGAGLDHHGSAPLR FT GTRSPVAELSEGARQQTSGPTEVLSEAPGAYGIRSRRHAARVAPYETTSAL FT ASRSGPQTRMARGHTPGLGYCAVSPRPQPLERPLVPTGRCASRTGVQPCCC FT FNRRFQHGLGGRVSRACGCGPLEGCPAALAYQSPRAVGSVPRSPPLFTGAG FT AATRAGQDGQYGGGGVYQPHGGYALSPHVSARPPSAPLESPAAEIAARHSR FT PRHAQSCSRCALTTAVTPWRMETPPRVCSADMGAIRGGPDRSVCFPRERSL FT PVVFFPDRGLSRHGCTGPQLASGHAQVCVSPSEPARAVSVQGQGGRGTGSA FT SCAPLAQPDLDIRALTPRDGPPLADPFERGPTLSGTGHHLAPSPRSLEPPR FT VVPRREEDLGNLPTAVVNTITQARAPSTRRAYALKWSLFTEWCVSRREDPR FT NCQISVVLSFLQEKLDSRLSPSTLKVYVAAISAYHSAVAGGTVGKHNLVIQ FT FLRGARRINPSRPPLMPSWDLALVLTSLRSDPFEPLESVSLRFLSLKTALL FT VALASIKRVGDLEAFSVSDSCLEFGPDYSHVILRPRPGYVPKVPTTPFRDQ FT VVNLQALPPEEADPALSLLCPVRALRIYVDRTQNFRSSEQLFVCYGGRQQG FT SAVSKQRLSHWIVDAISLAYSSRGQPCPPGVRAHSTRSVASSWARARGASL FT TDICRAAGWATPNTFARFYNLRVEPVSSRVLGNPLVIEETTR" XX SQ Sequence 6132 BP; 1117 A; 1898 C; 1706 G; 1411 T; 0 other; gttccccttc ggttggggaa cttcagtgcc atgaatggga ggattcggat cagaagccgc 60 ttatctggag agtattgaac gggccaatga atgaaattaa ttggcagcgt aagcttgcgc 120 aggtgtgcga catctgcaat tatctcagca tataagcaca cctgaagcca gcagacgcca 180 tccttttcgc ttcagatcct ttctgagtga gtcgatgagg gttcctcttg ctgatcagca 240 cttcagagcg aacgagtgtg tctcccggtc cagagtgggt cttcgcggtg gcagacggtc 300 gagctgggtt actcccttgc ctgcggttct ttgggtccgg tcctccagag cggtgcgtat 360 agttgcaact ttcctaaaag agcaacacag tcgtgcagca cgtccttttc aggatggcgc 420 tccgactgtg cgtttctgga tgcgggggtt tcctgtctcc ggatgatgga cacgatcact 480 gcattgcatg tttgggggtc cagcatgtta atgcggtgct cgcgggcggt tcatgtcgtc 540 attgcgatgc catgaccgtt gcacagctaa gatcgcggct aactttcgca agagagcgag 600 ccaccccagt tgcctcctgt tctaaaaaag cagcgggcgc tcgggcagat ctgagggttt 660 cagcgggagc taatccgccg cccacgggct cgcggacctc tcgctcctca cggcgctcca 720 tccaagcttc gggtggtgag agtgatccgt ctaaccagat ggtagctctc acactcgctg 780 acaccggaga tcagatgtcc tccgcggcat cggagggtgg gctttcactg tccgacgaag 840 atccggaccc gctcgccccc tccgggcagg tgagcgctgt caaatcggat cctgaagcgg 900 acatgttagc cgtgctttcc cgggctgctt cggccgtggg gttggagatg gtttatcccc 960 cagctccgcg gccggaccga ctagatgggt gctacgtaga ggaccagaag gcgaagcctt 1020 cgaagcctct cgtccccttc ttcccggaag tgcacagtag gctcacgcag tcctggaggg 1080 cacctttctc tgcccgtgct gcgagtgcct ccgccctcac cgcccttgac ggcggagctg 1140 ccagggggta tgaggcgatc ccgtcagtgg agcgcgctat cgcggtcaat ctttgtccgc 1200 gcggcgcctc tacgtggcgg ggtttgcccc gcctcccgtc caaagcctgt aggttgtctg 1260 cctccctcgg agccagagct tataaggctg cgggccaggc tgcttctgct ttgcacgcga 1320 tggccaccta ccagcgctac caagcgcagg cgctggccga gctgcacgag ggcgggtcca 1380 acccaagctt attacatgag ctgcgcaccg cgaccgacta tgctcttcgg actactaagt 1440 ccgccgcgtg tgcgctgggg aggacgatgt ccacacttgt ggttcaggaa cgccacctct 1500 ggctaaacct ggccgatatg cgcgacgttg acaaagttcg ctttcttgac tcgcccatat 1560 cccaggctgg cctgttcggc gacaccgtcg gtgaattcac ccaggaattc aaggcggtga 1620 aagagcagtc ggatgcgatg ggcaatgtca tctatcggcg tggccgtaag cccgctccgc 1680 ccgccgagcc atccacctcc gctgttcctc gccgagggcg cccgccaacg agtgctgccc 1740 cgcccccgcc tgcgcctccg gccaagcggg cgcggcgttc acctcgaaag caggcagccc 1800 ctcctgccca gggcgccgtt aagtccggta aacggaccgc gaagcgtccc tgagacaggc 1860 catccggaga agaggaaact tgctctttcc ccgctggagg gcggggcccc gataacaacg 1920 gtacttttca gtgccaccaa aacatcagta aaagagcact ttttcccttc cccggatgtg 1980 actgcacgag ttctgccagt ccgggacgcg ctgccttccg gctcgcagac tctacgtgct 2040 tcgccagtgg ctcacgagcg ctggggggac ggtctccctt ccctcagccc tccagccccc 2100 tctccggagt cagggtgcgg agccagagcg aatcgctctc ctccagcttt tccgcgggac 2160 cctcgtgctt cccggatcag cacacccact ccgcgctgcc ccaccgctgg tacgtcagcg 2220 attgtagcga tgactccatt agcgagggct ctgcctgcct ggttagcgcg ggccagcccc 2280 tcgcggtggc tcatacgcac aatcagactc ggttacgcga ttcagttcgc gaaacggccc 2340 cccaagttta cgggcgtgta tttctccagg gtcaaccccc tgtccgcccc tgtcttgcga 2400 gaggagattg ctgccctcct ggcgaagggt gcaatcgagc cggttcctcc agccgagatg 2460 gagagtgggt tttacagccc atacttcatc gtacccaaaa agagcggtgg gtcacggcca 2520 atcctagatc tgcgcgtttt gaaccgctgt ctgcacaagc tgccgttcag aatgctcacg 2580 cagaggcgca ttctccaatg cgttcgtcct cgggattggt ttgcagccat agacctgaag 2640 gacgcgtatt tccatgtctc cattcttcca cgccaccgcc aatttctgcg gtttgcgttc 2700 gagggtcgag cgtggcagta caaggtcctc cccttcgggc tctctctgtc tccgcgggtc 2760 ttcaccaaac tcgcggaggg tgccctagcg ccccttcggc tcgcgggcat tcgcatactc 2820 agttatctcg acgactggct gattttagcc cactcgcggg agcaattgat tatgcacagg 2880 gacgaggtgc ttcggcatct ccgcctactg gggcttcagg tcaaccgaga aaagagcaaa 2940 ctcgcccccg tgcagaggat ttcttttctc gggatggagc tggactcgat caccatggta 3000 gcgcacctct ccgaggaacg cgctcgcctg ttgctgaact gtctgaggga gctcgacagc 3060 aaactagtgg tcccactgaa gttctttcag aggctcctgg ggcatatggc atccgcagcc 3120 gccgtcacgc cgctcgggtt gctccatatg agaccacttc agcactggct tcacgatcgg 3180 gtccccagac gcgcatggca cgcgggcaca caccgggtct cggttactgc gctgtgtcgc 3240 cgcgccctca gcccttggaa cgacccctcg ttcctacagg ccggtgtgcc tctaggacag 3300 gcgtccagcc atgttgttgt ttcaacagac gcttccaaca cgggttgggg ggccgtgtgt 3360 cgcgggcatg cggctgcggg cctctggaag ggtgcccagc tgcattggca tatcaatcgc 3420 ctagagctgt tggcagtgtt cctcgctctc caccgctttt taccggtgct ggagcggcaa 3480 cacgtgctgg tcaggacgga cagtacggcg gcggcggcgt atatcaaccg catggggggt 3540 atgcgctctc gccgcatgtc tcagctcgcc cgccgtctgc tcctctggag tcacccgcgg 3600 ctgaaatcgc tgcgcgccat tcacgtccca ggcacgctca atcgtgcagc cgatgcgctc 3660 tcacgacagc tgttacgccc tggagaatgg agactccacc ccgagtctgt tcagctgata 3720 tgggcgcgat tcggggaggc ccagatcgat ctgtttgctt cccccgagaa cgctcactgc 3780 cagttgtttt tttccctgac cgagggctct ctcggcacgg atgcactggc ccacagctgg 3840 cctcggggca tgcgcaagta tgcgtttccc ccagtgagcc tgctcgcgca gtttctgtgc 3900 aaggtcaggg aggacgagga acaggttctg ctagttgcgc ccctttggcc caaccggacc 3960 tggatatcag agctctcact cctcgcgacg gccctcccct ggcggatccc tttgagagag 4020 gacctactct ctcagggaca gggcaccatc tggcaccctc gccccgatct ttggaacctc 4080 cacgtgtggt ccctagacgc gaggaagact taggtaacct accgactgcg gtggttaata 4140 ccatcactca ggctagagcc ccctccacga ggcgcgccta cgccctgaag tggagtctat 4200 tcactgaatg gtgcgtctct cgcagagaag acccccgaaa ttgccagatt agtgttgtgc 4260 tctctttcct tcaagagaag ttggacagca ggctgtcgcc ctccactctc aaggtttacg 4320 tggccgccat ctccgcttat catagcgcgg tagctggcgg caccgtggga aagcataacc 4380 tggtcatcca gttccttagg ggtgctaggc gaattaatcc atctcgcccc cctctcatgc 4440 cctcttggga tctcgccctc gttctcacga gtctgcgatc cgatcccttt gagccactcg 4500 aatcagtatc tctaagattt ctgtccctga agacagctct gctggttgcg ttggcctcca 4560 tcaagagggt cggggacctg gaggcatttt cggtcagtga ctcgtgcctg gaattcgggc 4620 cggattactc tcacgttatc ctgagacccc gccccggtta tgtgcccaag gttcctacca 4680 ccccctttag agatcaggta gtgaacctgc aagcgctgcc cccggaggag gcagacccag 4740 ccctttcttt actttgtcca gttcgcgctc tgcgcattta tgtggaccgt actcagaatt 4800 ttagatcatc tgagcagctc tttgtctgtt atggcggtcg gcagcaggga agtgccgtat 4860 cgaaacaaag attatcccac tggattgtgg atgccatttc actcgcttat tcgagtcgag 4920 gtcagccgtg tcccccggga gtacgtgcac actccactcg gagcgttgca tcctcttggg 4980 cgcgtgcacg cggcgcctct ctaacagaca tctgtagagc tgcgggctgg gcgacaccca 5040 acacatttgc aaggttttac aatctgcgag tggagccggt ttcctcaagg gtattaggta 5100 accctttggt gattgaggag acaactcggt agggtgttga aacacgcttg ctgcgccatt 5160 ctccctaaca cggaggtacg tgcgcctttt ttatctgtca gtaaagttcc ccgtcaggtg 5220 agccctgcag attcctccgt ggcccccagc actgactcag cggaggagtc acttgctggc 5280 ccactacgtt gtaggtctgc ccgctggtca gcccgcgttt tgggtatagg tgcctgctat 5340 gcgtgatccc cactaggcga tcccatatgc ttattccgcc acggttaagt cccccccctg 5400 ggcggacccg tgtcttccct ctccgctaac cactcttttg ctatgcgtac tccccctttt 5460 tagggctagt ccataggtaa attctgccat ctatcccccc cttgggtaac ggatggcctc 5520 cgcagcgtcc tccctatcgg gattgcacgc ttcccaacgt actgtcgtat ttcctagaat 5580 tatctagatg ctcacgactt cccaaaaaat atatctaaat ccgtaaaact tctgttgaag 5640 taggataaat tagggccagg gacacgttgg aggaccgcgc cccccatgat gtgggtgcgt 5700 cacgcttgct tgactatctc ctcatcgggg gtgttggtaa ggtgcagtca ttatggcgct 5760 ttccatattc tcccattcat ggcactgaag ttccccaacc gaaggggaac gttcgaggtt 5820 acagaagtaa cccttcgttc cccgaggagg ggaacggaag tgccatattc cgtcgccata 5880 atgactgtcc cttagctgtt tgaaagtctc ttcagcttaa aaggatggcg tctgctggct 5940 tcaggtgtgc ttatatgctg agataattgc agatgtcgca cacctgcgca agcttacgct 6000 gccaattaat ttcattcatt ggcccgttca atactctcca gataagcggc ttctgatccg 6060 aatcctccca ttcatggcac ttccgttccc ctcctcgggg aacgaagggt tacttctgta 6120 acctcgaacg tt 6132 // ID Gypsy68-LTR_DR repbase; DNA; ZEB; 190 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy68-I_DR; KW Gypsy68-LTR_DR; Gypsy68_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-190 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1452-1452 (2008). XX DR Genome; chr14; Positions 16059124 16058935. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 190 BP; 63 A; 22 C; 51 G; 54 T; 0 other; tgttacaata tgttatgtat atagacagca ggatgggagg atgtgacaga ccatgtatgg 60 tttgtagtgt atgacgtcat tgctatgcgc atgcatgtga agtaaaggaa gcacagtgaa 120 aacctgcgtg tctgagattc attgatatat agagtgatag ttgagaatta gagtgaacta 180 agcataaaca 190 // ID Gypsy56-I_DR repbase; DNA; ZEB; 4556 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy56-I_DR; KW Gypsy56-LTR_DR; Gypsy56_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4556 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1427-1427 (2008). XX DR Genome; chr16; Positions 35718863 35714308. XX CC 'GGTAT' target site duplication CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(1463..2758,2762..3850) FT /product="Gypsy56-I_DR_1p" FT /translation="MGKQTAVLPLYVVQGDYPPLYGREWLHRIKLNWKEIR FT TVKLKTLDGVLQKHTAVFSKQLGEMKKIKAKISLKPEHKPKFCQPRVVPYA FT LRPKVEKELNRLTEMGVLTPVQYSEWATPIVPVVKKNGAVRICGDFKVTIN FT PVLHTEHYPLPRIEDLFASLAGGQCFSKLDLSHAYLQMKVEEESSKFLTIS FT TQKGLFQYSRLPFGIASAPAIFQRAMDQVLLGLPNVHCYLDDILVTGRTEA FT EHLENLDGVLGRLEEFGLHVEKGKCDFFKDSLEYLGHIIDAEGLHKSPEKV FT SAIVNAPPPSNVTQLRSFLGLLNYYGRFIPNLATIANPLNSLLCKGKRWLW FT SAECNAAFKMAKEQLVSQNVLTHYDPQNPIRLACDASPYGIGAVISHMLPS FT GEEKPIAFASRTLSKAEQNYAQIEREALAIVFGVRKFHYLYGRKFTLFTDH FT RPLTTILGPQNGIPSMAAARMQRWALLLSAHNYTIEYKRAEHHGNADGLSR FT LPLHVEHREKNDAVELFYLGQMEKLPVSATDIRRETMSDFILATVVEMVLQ FT GTQAISLTDNNELSPFISKRSELSVQHGCLMRGMRVVVPHKLQKRVLEELH FT TGHPGIVRMKAIACSYVWWPRLDADIELQVKMCQSCQQIQKMPSQAPFHPW FT EWPSKPWERIHVDFAGPCEGHMYLVVVDAHSKWPEVQVMTSMTAEKTIDVL FT RNMFSHYGLPEVIVSDNGPQFVSQEFASFLKTNHVKHIRSAPYHPSTNGQA FT ERFVQSLKQALKVSKGSSTLQKRLETFLLTYRNTPHPTTKESPS" XX SQ Sequence 4556 BP; 1380 A; 946 C; 1112 G; 1118 T; 0 other; tatttggcga cgaggtggat taatttatcc atggacctac cacagtgaat cgtgagtaga 60 taggcgcacc tgaactttat tttctctgcg tagcaaaagt gagctagatt agctagaatg 120 gctgctacta caggccatat ggaagctttt gaggaaagcg tggaaccatg gaccacctat 180 attgagcgat ttgagcactt tgtagtagca aacagcatta atgctgacaa aaaagtgcct 240 gtgctgctca gcgtgatcgg agggaaaaca tacggattgc tccgaagtct gatcgcgccg 300 gacaagcagg gagagaaaag ttttaaagag atcacagaca cgctacagca gcatttttct 360 ccgaagcccc tgatcatcgc ggagcgcttt cgatttcaca ggcggaatca ggaggagagc 420 gagacggtga cacagtatgt tgcagtgctg agaggattgt ctgagcactg tgaattcggg 480 ggtcatctgg atgatgcact atgtgacaga ttcgtgtgtg gtttgaagag tgaagccacg 540 caaaagcgct tgttgactga aactacactg acgtttcata gagcggtaga actggcggtg 600 tccatggaaa cagcgtcacg cgaagcgcat caactcagtg gatcactgac ggtaaacgca 660 ctgtctttta caaagattaa gactgtaaac aaatgtaaac gatgtggaaa gaataaccat 720 agtgatgaag attgttggtt taaggacaaa acttgtaaca cgtgtgggaa aaaaaggcca 780 tatcagtcac gtttgtagaa agagtagcga tgatagaaaa acaaccaaac tacaactaag 840 caagaacgtc ctctgcagcg cctgcaatgg tcagggtgga aaaacgggcg ctgtgcagaa 900 gtgcagcaca tgtcggggcc gtggcatgag gatcatgatc agacagctgg ctcctggcat 960 ggtccagcag atgcagtccg tctgcaccga ctgcaatgga gagggtaagg tgattcacga 1020 gaaagaccgc tgtaaagagt gtgacggtcg taaagtgtgt aaagaggtga aggtcttgga 1080 ggtgcacgtg gataaaagaa tgaaacatgg acagaagatc acgttcagcg gagaagcaga 1140 ccagtcgccc aatacagagc ctggagacat tatattgatg ctgatgcagt atctagcggt 1200 gatgatacag acaatgattt agctctgtat aaactgtcgc agccaggaga aaagtccagt 1260 atcatggtga aaccagaagt tgaaggactg ccacttgaaa tggaactgga cacaggcgca 1320 gcagtctctt taatttcaac agagacttat aataaaatat aaaatactga aacatctgcc 1380 cttatgctca actgacattg ttcttcgaac atacacagga caagcattac gccctgaagg 1440 tgtaattgac gttcatgtaa aaatggggaa acagacagct gttcttccat tgtatgtggt 1500 gcagggagat tatccaccac tttatggcag agaatggctg catcggatta aacttaactg 1560 gaaagaaata agaacagtca aacttaagac actagatggt gttctgcaaa aacataccgc 1620 tgttttctcc aaacaattag gggaaatgaa gaaaataaaa gctaaaatat cactgaaacc 1680 tgaacataaa ccaaagtttt gtcagcccag agtagtccca tatgccctgc gtccaaaagt 1740 tgaaaaggag ttgaaccgtc tcactgaaat gggagttttg acaccggtgc agtatagtga 1800 atgggctact ccaatagttc cagtggtgaa gaaaaatgga gcagtgcgca tctgtgggga 1860 ttttaaagtg acaataaacc ctgtacttca cactgaacac tatccgttac cacgcatcga 1920 ggaccttttt gcttcattag caggaggaca atgtttcagt aaactagact tgtcccatgc 1980 ttacctccaa atgaaggtgg aggaggagtc ctcaaagttc ctcacaattt ccacacagaa 2040 agggctattc caatattctc gtcttccttt tgggattgcc tccgcgccag caatattcca 2100 aagagctatg gatcaagtat tattaggcct tcctaatgtc cattgctatc tggatgacat 2160 tcttgttact gggcgtacgg aagctgaaca cttggaaaac ttggatggag ttcttggacg 2220 cctggaggag tttggcttgc atgtggaaaa aggaaaatgt gactttttca aggactcact 2280 ggaatatttg ggtcacataa ttgacgctga aggtttgcac aagtcgcctg aaaaagtgag 2340 cgcaattgtc aatgccccac ccccatcaaa cgtcacacag ctgaggtcat tccttggttt 2400 gttaaattat tatgggaggt tcatacccaa tttggcgact atagcgaatc ctttgaattc 2460 tctgctgtgc aagggaaagc gctggctgtg gtcggctgag tgcaatgctg cattcaaaat 2520 ggcaaaggaa caattagtat cacaaaatgt cttgacacac tacgacccac agaacccaat 2580 tcgacttgcg tgtgatgcat caccatatgg cattggtgct gtcatctcac acatgttacc 2640 cagtggtgag gaaaagccta tagcttttgc atcgcgcacg cttagcaagg cagaacagaa 2700 ttacgcgcag atcgagcgcg aggctctggc aattgtgttt ggagttcgga aattccatta 2760 gtatctctac ggacgaaaat tcacactctt cacagatcac cgccccctta ctacaatctt 2820 agggccacaa aatggaattc catcaatggc ggcggcacgt atgcaaaggt gggcattgct 2880 tttgtctgca cacaactaca ctattgagta caaaagagcc gaacatcacg gaaatgctga 2940 tggtctttcc cgtctgccat tgcacgtaga acatagagaa aagaacgatg ctgtagagct 3000 gttctacctc ggacagatgg aaaaactccc tgttagtgcc actgatatta gaagagagac 3060 tatgagcgac tttattttgg ctacagtggt cgaaatggtt ctacaaggaa ctcaagctat 3120 cagcctgaca gataataatg aactctcacc tttcatctcc aaacgcagtg agctgtctgt 3180 tcaacatgga tgtttgatga ggggtatgag agttgtggtt ccacacaaac tacagaaaag 3240 agtgctggaa gaactgcaca ctggacatcc aggaattgtc agaatgaaag ccattgcttg 3300 tagctatgtg tggtggccgc gtctggatgc tgacattgag ttgcaagtga aaatgtgtca 3360 gtcatgccaa caaatacaga aaatgccatc acaagcacct ttccatccat gggaatggcc 3420 aagtaaacca tgggagcgca tacatgtaga ttttgctggg ccatgtgagg gtcacatgta 3480 tctagtcgtg gtcgatgcgc attcaaagtg gcctgaggtg caagtcatga catcaatgac 3540 agcagagaaa accattgatg ttttgaggaa catgttcagt cattatggtc tgcctgaagt 3600 cattgtgagt gataatggac cacaatttgt ctctcaggag tttgcgagtt tcctgaaaac 3660 caatcacgtg aagcatattc gttccgctcc ttatcaccca tcaacaaatg ggcaggctga 3720 gaggtttgtt cagtcactaa aacaagcact gaaagtttct aagggatctt ctactctaca 3780 aaaaaggctg gaaacattcc tgctgacata ccgcaacaca ccacacccaa cgacaaaaga 3840 gtccccttcc tagttgttta tgggacgcca gttgcgcaca cgtcttgatg ccctaaaacc 3900 cagtgtgaca gcagcagtgc ggctctccca agtatctcaa attctccgca gagaaggtcg 3960 tttgagacca agacagtttg gagtaggtga tgctgtgctt gctcgtgatt acaggggaag 4020 agagaaatgg acatcaggag tggtgacggc tcaaagtggt ccagtatcct atactgtgga 4080 tgtgggtact tctgaggaat ggaggcgtca agctgatcaa cttttgtcca ttccaaaata 4140 aacagctgaa aagctttcaa cagacttgcc tgataacaag aatgtgtctg cccctgtaca 4200 gacctctgca gatgccactt cttctgcatc actaacaagg gaacaagatg ttaacaatga 4260 catatcagaa gagtgtcaga cacaacatga cattacacca acacagatta gacgttatcc 4320 agcaagagtg attaaaccac ccaaccgtct gacattataa gaaaacagaa gctgctgaat 4380 taagctataa ttagttatgt tagtgagcct gttctgatct ggggcagaat aatccccgaa 4440 ctggctcgag gcaagaggca gtctaccctc tacctctaga gtaaaataga taaaggtatt 4500 atagttgtgt ttttgaaaaa tgtttttgag ttgtttctgt ttaagagggg aggaaa 4556 // ID hAT-N26_DR repbase; DNA; ZEB; 1459 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N25_DR; hAT-N26_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1459 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1296-1296 (2008). XX DR [1] (Consensus) XX CC putative hAT transposon, TSD=8 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1459 BP; 520 A; 274 C; 237 G; 425 T; 3 other; caaggttatt atagttaacg aaaacgaacg aaaaaacgaa aactaaaatg taaaataatt 60 gtcgttaact gaaataaaaa taaaaacgag agtttttwaa aaaaaactag aactaactga 120 aactgtattg tgtacataca aaactaactg aaactaacta aaattatagc aaaaacctcc 180 ttcgttttcg tatttgtaaa tgtatttaat acataatctt actgtaagcc ttttagaagt 240 aaatataatt cccgcgctgc aggtgtttgt gtttgaccct ttcgcacctc agagtctgta 300 gtcctgctgc cattggccag atcgagccgg ttctcctcca gtcatcctcg ctgttgctcc 360 cgcgacaaaa acaacgagtg gacatgacag cagcacggtg acagcattaa atacagacac 420 ttaaaactat tactttaaca catttgtgcc caataatggg ttttatttct gtatcgcgat 480 cgcgaacgaa tctttttaac cggatcttct aagtgaaccg gttgaactag ttcaccaaat 540 cgaactgaat catttgaaac gattcgcgtc tccagtaagc acttatccac aaactactta 600 ctttttaaca agcctaatac ccccctctga ctctaaataa tccaatatat actctcattc 660 agttattaga acagcgacgt tacactgaga tcaaattgag aagcggtgaa ccgataatac 720 tgcgcatgcg tgattcagtg aaccaaacac aaacagtaca tgacagcctg ctgtgtgact 780 gaactaaact tgaacaactg cagcgcgggt aaaccgaggg cttaaatgag aggatctggt 840 gaaaaacgta agtttatttc ataacagaca atcgtaatgg aatttaaata agtgaatcat 900 gmttcagttg ggttgcaaaa ctttactgta agacattttt cagatcatgg tgatgtttta 960 actgactgaa attactattg ctgactacat aatgttgcta tgttgagtcc taacatccgc 1020 gttaaawatc tgaacttccc atcactactg tgtatctaac ctcagaagca gccttgcaca 1080 catattgtac ttaaggctgc tatcttgtgg gctgttgtat ttgaatttga ggatgggtag 1140 atggttgtgg tagatgatag tgttcatgtg tcctctgaat acaggtttca aaaaatgtat 1200 gactgagttt ttattataca atatttatcc tgttgatttt gaatctcgca cctaacaaat 1260 atcctcattt agaaaaataa aactaatact aaaactaata aaaactaaac taaaactaag 1320 tcatttcaaa atatagaaac taataaaaac tagtaaatct gcctctaaaa ctaattaaaa 1380 ctaactgaat ttgaaaacaa aaagtcaaaa cgaaatagaa actaaaacta atgaaaaatc 1440 caaaactatt ataaccttg 1459 // ID Kolobok-N4_DR repbase; DNA; ZEB; 3002 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous Kolobok transposons from zebrafish- a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-N4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3002 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 8(10), 1348-1348 (2008). XX DR [1] (Consensus) XX CC TSD is TTAA specific. The termini 5'-AG is sheared with other CC autonomous Kolobok elements; therefore the element is classified CC into Kolobok superfamily. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3002 BP; 1026 A; 543 C; 520 G; 912 T; 1 other; agagcatatt gtaggttaga caccccagtg agacaatgtg tgtaagagta aagtaagaat 60 tagattttct ttaaaataaa cgttaaagtt tattattaag ccgtttggag tcgttttaag 120 tcacggaaat ttacttccgc gtttgaaaaa tcgccagacc ggaagtgacg tcggatcagg 180 gagttcgcgc atataaacac tatgaagaca actgatcgtg gtgcgcgttt ggatgccgag 240 ataagatatt ctgagagact cacacaaacg acagaccaca gccatacaat tataagcctg 300 tggacagacc agcagacaga acagcagtca gagaagtgta atttaactct attttgtgag 360 ggagaacaga gcaggtttat ggtgaggcca gtgttggctt acagagagat atgtaagata 420 acaaaatctt atctggcaac agcaaagcta gtattgatca tctataagta ttaatactta 480 ccagtaacag aaactaatga gcatttaaca cagttatgcc acattgacac tatttttaat 540 agtagataag tgaatacaaa acacaaataa cacagaacgt ctggaataat gcagaaacat 600 gtaaacaagg tactggaaaa tcttttttat tttacaaaag gttttcagag cttctgtatt 660 aaaataaaaa caacatcaaa tattaaaaca attaaaataa gtttttaaat atgtcttttt 720 gtcatttttt cattgggtat aagattatct gtatcagatc ccatttaggc wacaatccat 780 aaaaagacaa gtacagttgt tacagtataa aatatttaca aagatttgat caacagatct 840 gatctgcatt tatgattaaa taaacttcaa ccctaattca agttcaaatt ctaaatgcaa 900 ctcaagttca agttttattt atatagcaca aaatcataag cagtgttaat acccagcatt 960 tttactcttt atatatgtat aagtatctgt gtgagtgtgt gtatactgaa aggaaaatat 1020 atcatctgac ccatttaaaa gtaaacctaa aattattttt cagtaagttg gccaacgttt 1080 atgtgggtac cacaccatca tgtctttaat gtgtgttaaa ttgtgaatgc tgtgtaaaaa 1140 gatgccgaca gcctcatcct tgtaaacccg ctggaaggat ctagaaagga gagcaaatgt 1200 agtggccact tctgcattct cgtcgagcct gtgatgaact ccagtagtcc accgtcagac 1260 acgcagtgag ggctgatgca tatattctgg ggaaataatg caaaagttta aatcatgttt 1320 attaataatg tatattttaa aattcaccat atgattttta atataacaca atacagaaaa 1380 cccaaccaca tttgcagacc cctggatagt gaactgcaaa ccatctcaca tatacagtag 1440 actgtaccca ttataagttg agcatgtaag aacaaacttc aattttttga atgttatttg 1500 tttttcttct acaagatgaa tattcctcat atgtgaataa aaagtgcctt tcctcattga 1560 caggacagaa tatttttact ctccctcaag tggtttcaaa ccttattgac tttcttctgt 1620 tgatcacaaa ccaagatatt ttgaagaaag ctaaaaatct gtaaccattg acttctatag 1680 tgggaaaaac agatatggaa gtcaatggtt acacgtttcc aatatttaaa aaaaaaaagt 1740 caaagatagt gtgagtttac ctgcctgcat tatatagaaa tattgtagta attctttgcc 1800 cttctagttt ttgttatcaa ataataccat gcacaaataa ttcacgatag gggtcataat 1860 attttacccg atgaacgagg tgaatgtctg aggtaaaatt ggttttatat tcacacattc 1920 tgactttctc cctaataata taatttcaga agagtattat ttgctaatcc ctaatatcat 1980 gttagcagcc aatgactatc aaagtacaac attgttcact gtcagtttaa tagtgctaat 2040 gttgttttta agtcatattt acgtgtgatt tcagtcatta taataaaagt actatggtaa 2100 acaatactga gagtgtgtca cacaagctgt cactaatcga atgtgcgaat ataaaatcca 2160 ctttacctta gttgaggtta ctctgtgagc caaagtgaag cacagttgag cgagaggcca 2220 gtggagagaa agccttcagc tccgggtcat catctgtctc ccaaagtcta ataaacacac 2280 cacaaagcaa acaaagtcac aacgctgtaa ctatacaaga catttattat attagcaaac 2340 tgatgctact agccgtctta tcatatgtgt acatttttac acgctttaaa aaagtatatt 2400 taatatgctt aacaatacaa gctagcagcc gtgctatcgg ctactccaaa gtaaacacat 2460 cacgttgctt caaattactc aaagaataat gcttacggac ttgaaagttg acaaacataa 2520 acgaaaaacg agttgtctta ccgtgtacag ctcatcgagc tgcatctccg actgatccag 2580 gagtgtttct cgccgttcgc gtcagctatc actcttatcg gattcatccc atttcacgaa 2640 ccaaaatagc aagaggaaac ttgtggactt gatctgttga caggctggag tttgtgcagt 2700 ttccataaac gcaataatat accacgatga tgataaatta cgccaaaact aacaaccagt 2760 tctgctatta gctcctctgc cccactgtaa acagcgcgcg tctgtcagat aagtccctgc 2820 actgacgtca ccggggtggt tagaaaaaaa agcggaagtg ggaaaacaga cacattttct 2880 ttcaaatttg tgcccttgca cttcgtaaaa cattttaata tggtatttta tggtattata 2940 gagtattttc acaaacgatt ctaccttttt tttaaacagt tttttaacct acaatatgct 3000 ct 3002 // ID DNA-2-3_DR repbase; DNA; ZEB; 1765 BP. XX AC . XX DT 24-MAR-2005 (Rel. 10.02, Created) DT 29-MAR-2005 (Rel. 10.02, Last updated, Version 1) XX DE This is a nonautonomous DNA transposon - a consensus sequence. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-2-3_DR; KW Interspersed repeat. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1765 RA Kapitonov V.V. and Jurka J.; RT "DNA-2-3_DR, a family of nonautonomous DNA transposons from RT zebrafish."; RL Repbase Reports 5(2), 35-35 (2005). XX DR [1] (Consensus) XX SQ Sequence 1765 BP; 632 A; 285 C; 247 G; 599 T; 2 other; cagtttttta caatggctat gacacttttt tcaatacatt taacaagttt gctaaactct 60 taacgcagca acacacctaa aacacacaat tggcaaaaca gttaatttta tgctcaaaat 120 cacacattat aaactaaacc tctaaactac ttttcaaaat acaataataa acgaacacac 180 tacaccatgt ctaacaaaac actgcaatca tgtttcaaaa tgaaattatt gttcaaaaca 240 ctaacacatg ttctcttcca acataaacat tcagtcaatc agaacacact gacaataaaa 300 acactatcat cagctgacat tacagaaact ccattagttt atgtgtcagc tctgccctta 360 tatttgaagt ctcgttacag ttttgttttg ttgggttaag taaatgcatg cattttcttt 420 cttttactgc aaaaattcta tacaaaacga agaaaaaacc atctcagaat tgctttataa 480 aatttacagt ttatgtaaaa aaaaaaaaat acagacacag aatttctgca gtacagactt 540 tttttgcaga aggacatcac tgcaaacaga aaaagcacca caattataat aaaataacaa 600 agtaatcttc tattctgccc ggtcttttgc cacatgttct catccacatt atacttgata 660 tcttctgtgg cccggcacgg taactgtggc cctttggcct attatgtttt tcctacggcg 720 acatccccag ccttgtcaac attgaaaatt tcatgtggta cttgattggc ctctaacttc 780 atgactctct gaaagtaatt agacacagtg tatgcaaatg ctgtactgta tatgtactgt 840 atgtactaat gaactccagg tttaaagagt agtttactgc aaaacacact gtgtatagta 900 cagtaatgac tgtattgcat aatcttacat ggacgtattg gtgacagagt tctttgatgc 960 gctcactgtg ctcamtgttc ctttcaaaaa cctcaagagt cctctattag aacatatgat 1020 catgtgattg gaaaaaccaa aacaagacat gagtgtgctt tctagatggg aatcagctgt 1080 catatatata tatatatata tatatatata tatatatata tatatatata tatatatata 1140 tttttttttt ttttccagtt gccaatgaga aatagcatat tgcggttata taaataaata 1200 aagaaatata tagacacaca taaacagctg ttctaattgt gaatagacac acaaatatat 1260 tccctgtatt ttattcaatt gctaatgaga aacatcatat tgcggttata taaataaata 1320 aataaatgta tatatatata tatatatata tatatatata tatatatata aaataaataa 1380 atgtttctac atatatatgg caataaaatg aataaaatac agggaatata tttgtgtttc 1440 aattcacaat tagaacagct gttcactttg actttatcct atataatttc tgtttagaac 1500 atggtgttat ctgttctgac atactgtgtt aaagcatttg taaatttgac tgtaaaatta 1560 cattgttttg ttcttggttg tgcttgtgtg taaaagaagt taaagcattt taaatttgtg 1620 ttaactggat gcattttgtg tcaaaacaac aagaaatgtg ttaattgtat agccyacgaa 1680 gacggatgtt gtgctaactg tgttaagagt ttaggaaact tgttaaatgt attgataaaa 1740 gtgtcatagc aattgtaaaa aactg 1765 // ID hAT-N76_DR repbase; DNA; ZEB; 4133 BP. XX AC . XX DT 07-JAN-2009 (Rel. 13.11, Created) DT 07-JAN-2009 (Rel. 14.01, Last updated, Version 2) XX DE nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW hAT-N76_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4133 RA Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 9(1), 18-18 (2009). XX DR [1] (Consensus) XX CC It contains additional insertions of I-1_DR fragment and TDR18, CC which were not included in the consensus. The youngest copies are CC ~98% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 4133 BP; 1210 A; 710 C; 773 G; 1440 T; 0 other; cccagatagc aggcagtaat cggcccgagc tcggctgccc tccggcgcct ccggctccga 60 ctcggcatcg gcgagtgtta tccgggccga gtgcggcccg cagctcgctc gcgcacatgc 120 ggttgtgatg cggctgtaaa gcactggccc gattccggtc aaccatatcc ggcccgagtc 180 tgtctgtaga gtcctggccg cttctggcaa ggactcggct gatggcaacc gattaagttc 240 ttattttatc tagtaatctg ttagctccac gtttaatgat aatttgagaa aatattcatg 300 gtacgatagc aaaaaaaaaa aaaaaaaaga atatttagtg tggatggtcg caatgtttag 360 acgcaaacaa aacaatatta tttataaata gattatacat gtcatctaga gaaaaactat 420 gtaaaattcg cttatttttg agatagcacg ctcctgtgct gactgttttt ttttggtttt 480 tcaaagattt cacgtggatt ccggtcagaa aggtaagctt tttactgctt gtgttctgta 540 taattagcga atttagagct taaaacattt ccacggtgtt tggttgtaac agcgtttagt 600 aatttaaaac agtttgatac gaaatgtaac ttgctttaag aaacacgact ggagctaatg 660 tatgctaacg ctaacagtta caaacacgtc tgaaagttcc ttttcctttt acacaatgtt 720 agattgtaac gtgtgtagta actgaaaact attttaatta ttttaaagaa acatgaaacg 780 aatttgtgtg tatgctaact ttgcgttaaa gaaagtttct gtagacgaat ccattaacta 840 acttggcaaa acgtgttttt ttcctgattt aacattataa agtaccgtgg atgttgaaaa 900 aaatcattgt acaggcgtat acttatccaa cagtattaat tctgtgatta tgtgtaaata 960 aaaactaatc gttttggatt aataaaatat tattgtttat cttgtttatg cagttagtca 1020 cgaaaaatga tcgaaactcg actgtttact attctcttta aattatccaa gagaaatata 1080 ttactatgca ctttaattca tatgtgttag cctatgtgaa tcaattatat cttttgttca 1140 agctgtaatg gtcagttaat gtaaaactga acgtcttaaa gcagattttg atgttttttt 1200 tttttttgtt ttttatgtta ttgacagtga taaattctga tttaaatgaa tgggtaacaa 1260 tatttgttgg ttaagtctta ttgttcttgt ttaacaaagg ttagtgagat cattttgcag 1320 cacatcacga ccaaagaagt caatggcaca ctcttttgaa cgtgaatagg catgttatag 1380 ttgtgtgcta aaagtccatt ataagctgta aactgctgat acattttcaa catagattga 1440 gcttttcaga cattacatta ctaattgctt cacaaacagc tgaaatggtt tattgttatt 1500 ggctatcatc atcaaacatt tttcacaatg gtaggttaat cttcatgggt acagaataat 1560 gcatgtgctt gaccatctta caaatctatg tctcagattc ataaggctgt cactctgcat 1620 gaatctccga ataaatcttg tgcaaccttc tgtgtactga ttatttacac tataggtatt 1680 atatgcatta attaactttg atagtatttc aggatttgaa cttttcttta cgggaggtta 1740 acaacaatgt gatgttttat tatagatgca tatgaggaag ctcgactcaa acatccacac 1800 gcaactgtga tttctgactg gtagacagat gaggatgatg atcgcctgtc atacatctaa 1860 agacagaaca ggtttgtttg caattatttt aaacacatac tgtcaacatt atgtcattga 1920 tctattacaa cctgtgagtt ttgccctggc actatactaa agtgatgttt tgttttgtaa 1980 atgtatctga ttaagggaca gtatagacac atctatacca tgatgaagaa aaattacttg 2040 gaacaaagag gctggataaa aggcaggtat gaagatttca gaaatcaatg cagcaccaca 2100 agtaccatca ccatcaatga ctatggcaga aatcttaaga ccaccatcaa gatgcacaga 2160 tactgtacat tccagtacac caggcgtgtc caaactacgg cccgcgaggc tttttataaa 2220 tatcaataga atctggcccg ctatacaaaa atgaacgtaa ttcaataaat aaccaccggg 2280 tgtcgctatt acatgcattc aattaagcag cagttcttgt tatgatctat ctatctatcc 2340 gtctatctat ctatctatct atctatctat ctgtctgtct cacgtgtctg tgtgatgtat 2400 gtaaaatttg gcccgcgaca acgtttgttt ttttgcatct ggccctcggc ccaaaaagtt 2460 tggacactcc tgcagtacac ctatgaacat ttacagtctt tgttctagtg cagtttgaga 2520 tcaagacaga acttattgca tctcgtgatg tctttgtaag aattgaagac atcactaaat 2580 ttgttaaggt ttaatgcagc tagaaagtaa agatcaacta gagaatatca gcagtgtcta 2640 acccactgtt gtatttacag aagttgtatg agaataagct acaggttgat ttatactaat 2700 ctgttatttt tcttttttag cacgactgac tgaagtttta aaatcatgca acgtcataaa 2760 gcagcagctg ggcctgattc tcacaaaacc aaaacagacg tgatgaaatt ccagatgtaa 2820 acacatttga ccttcctttg gccaatttga tttggaaaag cttgaaaatc aactaaagga 2880 gcagcccgaa caaatgaaga aactggtaag ttcaagctct tgctaatttc aaagtgtttt 2940 aagtttattt gttttttttg atagtttttt ttttttgctt cgtaggtagc ctactttttt 3000 cctaattttt tctttttttt ttacattagt acttgttcac tactaattat ctgtttatct 3060 tttaatcaca gatggagtga ggacaaatgt ccataccacg ccactgataa agaagtggaa 3120 aaatgcatta caaggtagct acaacttgcc cctgacaggg atggaggaag aaagaagaga 3180 aagctaatgt gtcaaatgtc tacatcacta ttttgtttta ttttttaaac aacattttaa 3240 gtgtattagg atgttccctg atacttgaac aatttgtttc tttcatttgc atttgtttgt 3300 ttattattat tttaatgttt tattaaaaaa tgtttcaaga tgtttcttca gaattctgca 3360 gtacttttta caagtttgac ttggatattt tacggattgt cattggtatt tttaagatgt 3420 ttcttgctat atcttaaatt ggtttctata tttaggttta tataaaattt agctgttttt 3480 ttacaaattt ttaattcaag tctattggga aaacagggtt gttttaaaaa acatctgtgt 3540 ttgtgtgtaa tttttagatt ttttgtttaa atctttctgt gtgtctttat tgtgttgttt 3600 tattgttgtt tttatattga tattcttgtg cactacaaaa ttatttataa aattttagta 3660 ctattttttt catgcaatca ataaacgttt aacatttatt tattttgtca tttgtctgat 3720 tattttcgtt acagaatatg tatgcagttt gcactttatt caaaggttaa acgcagtgct 3780 tacactttgt gtttacatta tagcctgtgt ttgctgttat ataaattcaa atggttgtaa 3840 taccatatat caagtaccaa tgtaatacca aaaggtttta taaggtgtat aaatacggag 3900 tcgcgccagc tccgggccgc agcgcacatt accctcgcgc cgattccggc gcggatgcgc 3960 agcgtcggct cgcttacggc cgccgcatct ggcccgcttg attcgggccg gaatcgggca 4020 gtgagtccac aggcatccgg cccgagtccg gcagccgaag ttgggccgag gggttagtct 4080 ccggcaggcg acaatctagc cggaactggc ccgagtgtat tttgctatct ggg 4133 // ID DIRS-1-LTR_DR repbase; DNA; ZEB; 634 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 21-JAN-2009 (Rel. 14.02, Last updated, Version 3) XX DE A solo-LTR derived from DIRS retrotransposon - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR; MER6; DANA; SINE_DR2; DIRS-1-LTR_DR. XX NM DIRS-1-LTR_DR; SINE_DR2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-634 RA Jurka J. and Drazkiewicz A.; RT "SINE_DR2: SINE-like retroelement from Zebrafish."; RL Repbase Reports 2(3), 11-11 (2002). XX RN [2] RP 1-634 RA Bao W. and Jurka J.; RT "Re-classified to DIRS, and renamed."; RL Direct Submission to Repbase Update (24-OCT-2008). XX DR [1] (Consensus) XX CC Contains ~200 bp segment similar to HE1_SINE, MER6 and DANA CC elements starting around position 201. The 200 bp segment CC contains a hairpin-like GC-rich structure. new comment: This is a CC solo-LTR from DIRS LTR retrotransposon, because it contains a CC split LTR similar with those of DIRS-4_DR. Other example of CC solo-LTR derived from DIRS is DIRS-4N2-LTR_DR. It contains at CC least 3 subfamilies labeled as DIRS-1B-LTR_DR,DIRS-1C-LTR_DR and CC DIRS-1D-LTR_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 634 BP; 157 A; 179 C; 124 G; 174 T; 0 other; ttaagtgaag tttatttata aactaatttc gagaggatca cgtgcttatg attgcttgca 60 gccggtcccg cattattcaa tttatgattc accaatcaga cgattcctaa gccactataa 120 ataccctaag ttccatataa cagccatctt cgttttgaag aatcccccct tccaccccta 180 ctcctcctcc tttcctagat gggtggcacg gtggcccagt ggttagcact gttgcctcac 240 agcaagaacg tcactggttc tagtccttac caagccagcg acgtttctgt gcggagttta 300 cacgttctcc ccgtgctcac gtgggtttcc cccgggttcc ccggtttcct cccaccgtcc 360 aaaaacatgc aacttaagtt aattgactaa tccaaatcgg caccatagac atgctcctag 420 taagtagtta tctcttaaga gcaatcacta tctgttcatt agctactaca gcaggggagt 480 tctcgagatc tacctgagct caaactcccc tctcgccttg caaacgggag ggagccccgg 540 gctcgaggat cttatgagct cagggctctc tcccgggaca gcatgccaaa caagctttat 600 aatcaatcat cagctaagtg tgaactcttg aaat 634 // ID LOOPERN1_DR repbase; DNA; ZEB; 1234 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE LOOPERN1_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; LOOPERN1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1234 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN1_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(4), 19-19 (2002). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed based on 5 copies CC present CC in sequenced DNA that represents ~1% of the genome. Approximately CC 500 copies of LOOPERN1_DR are expected to populate the complete CC zebrafish genome. LOOPERN1_DR copies are 7% divergent from the CC consensus sequence. CC This element is characterized by 16-bp terminal inverted repeats CC and CC putative TTAA targets site duplications, LOOPERN1_DR. CC Its classification is not very certain yet, although it CC is expected to be a member of the piggyBac/Looper superfamily. XX SQ Sequence 1234 BP; 318 A; 273 C; 257 G; 370 T; 16 other; agggtcacga aacaccaaaa cacatttttt gagctgttga cagtcgtata tgtgtcccac 60 actgctaaaa acactattag gacacctata tttcactaaa aagtgtaaat tggttgtttt 120 tgcgttattt caagcaaatt cgtacttccg gtttgaaacg aatttttgaa gctgcgtcac 180 ggccatgaca taatagcgtg tattccagcg tgcagactgg gcgtctgtac cagagtgtgt 240 cttattacgt cttacagtgt gctgcattaa tgcatgagta aggcttggtt caaaccaatc 300 agcgcgctct attgtgcaac ttcattaata ttcattactg tcacagtgta gacggcagag 360 acgccacatt gtgttggcaa aacaagcgtg aagtgttgct ttymtarttt gctgcagtga 420 agttttgttt tcattttctc tctgtgagag cgcagctgga gtcacgtgtg gattaacagt 480 gtacgcgacg cgcgacaaca ataacttacg tgtctaagga ggawyattgt ttacctgaga 540 gctgttctca tctgcaaacg ctgagatccg gattcgcttg tagtctcctc ttaataaaga 600 cgcggctcta gttgctrgtg attgtcctgt ctctacagat ttggtaagtg agcgaccagt 660 gctctctgtt taytcagttt gtatnntatt cgtatcaagt taactattgc accgagtgta 720 aacatgttag cactaacagt tacaaccaaa ctttaacctc atgtmggatt ttgtgaccgg 780 aataacacac gcggctttct gacgctacct gccgtgtgca tctaagtttc mgrgaaatgc 840 rgaggttttt tttctctcat tcgccgtgcg gtatcaaaca ttgcatgaaa aatayacgct 900 tacagcagtt cctcgaatca aatatctcgt ttgtcgcgag gggcatgaat gaattccctg 960 aatgaaagag ccaaactgca gttaaagtcc accatttaat aatttggcaa ataattcgat 1020 tacagatgtc catgtaaaca cagtcacttt ctcccctgtg tktgtgtgtt ttgactctga 1080 aactcagcgc gstcaaatag acactcccac accatgcctc ttttcttcct ccgacactcc 1140 cccctaaaca gagctggaca cgcccacttt tctgactttt tccaaagtag aggtgtgaaa 1200 acaccctgct gaaacgaggg ggtttcatgg ccct 1234 // ID Polinton-1N1_DR repbase; DNA; ZEB; 15469 BP. XX AC . XX DT 15-MAR-2006 (Rel. 11.02, Created) DT 05-APR-2006 (Rel. 11.02, Last updated, Version 2) XX DE A family of nonautonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Polinton-1_DR; Maverick; Tlr; KW Polinton-1N1_DR. XX NM Polinton-1N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-15469 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This nonautonomous transposon is characterized by ~7.3-kb-bp CC terminal inverted repeats and 6-bp target site duplications. The CC consensus sequence was built based on multiple alignment of CC several copies that are >90% identical to each other. This CC nonautonomous family was derived from the autonomous CC Polinton-1_DR. XX SQ Sequence 15469 BP; 5466 A; 2295 C; 2295 G; 5413 T; 0 other; agagagaatt atgatggagc acctgtcaaa aagtttgatg gattgtagcc ccgcccccta 60 aggcgggact tccggtctaa gccccgcccc ttttatataa actttatagt tccggtaatt 120 agattcttat cggtttaata taaaataata gttccggtaa ttagattttt atcggtttaa 180 tataaaataa taattccggt aattagattt ttatcagact aattaaatta taattccggt 240 aattagattt ttatcagact aattaaatta ataggctatt tccaggtaat gaaaatattg 300 gtaattaaat tctaatcaaa cagaaataaa attattatta taataataac ggtaattaaa 360 ttactgttat tagaaccatt attttatatt tttagttatt tgactaaata ttatattaca 420 atatattata ttatattaag ttatataaaa ttaatatttt attaaataaa attatgtatt 480 gtattataat atattatatt aagttatatt aaatcattat tttattaact tatattgtaa 540 ttataccata ttatataata ttattatatt atattattat tataatattt atattagata 600 ttaaattaaa ttatattata ttatatatca ttattattct attctattct attctattat 660 attatattat attatattat attatattat attataatta ataaactatt aacataaaca 720 tttctttatg ttcataccga tgtttcctct gtgtctcagt ctgggcctga gcgtgtgagt 780 gagtgatgat aaggaaaact ttggataaca tcccgtcaac tgcttgtgaa agatagattt 840 atatcagatg tagaatatgt atatctgatt gtatgtgttg ggtgtctccc actaactact 900 atatgttgtt aggtgattag aaagtgtgct gatatctata tatccacatg cagtcaactg 960 tatgcatatg tatgtgtttg tgtgtgcgta cgtgcgtgcg tgcgtgcatg tgtgcgcgct 1020 tgagcgaact tgggtctggc tttgtatgct tgaatccaac gacacagtat ttactaaaaa 1080 tactcattaa taggttatgt tgtttttcac attattacta caataataca tattttaaga 1140 cacacaaaca ttttgatgta tctttgtcgg agtattaaat taaaatatgt ttacaatatt 1200 ttttgtcgcc ttagcccctt tttaattcct gggtggacac agcggaatga accgccaact 1260 taaccagcaa ggttttgcgc agcgtatgcc cttccacccg caacccatct ctgggaaaat 1320 gtttacaata ttaatccagt aaataacaca tcttaatctt tattgacaat aaattgtcac 1380 accttttaaa acaacgttat ctaaaatgta atattctaaa atgttgaaaa acacagatgt 1440 aaattcaagt tgtctttata attaatgaaa acttttatta tgaaaaacaa tttaaaatga 1500 aagtttttcc tttgactaac atgttttcaa tttatttaca tgcatgtcct gtaatttatg 1560 gcacacattg taatacagtt aatccaaagc ctaatgtttt gagcactggt catcactggt 1620 gttggtttaa aaaatcaaca taaaatattt actaaaaagg acttatcact ctttagacat 1680 tattgttttt tctgtttcaa aagataaata aaatgtaatt ttattacaat aaattgtagt 1740 aattgcatta ctagacaact tagttactga attttaaatt caaactgttt ttaattcatt 1800 gcaaatactt gatcatataa agagaaatat gttaattata ctgaataatg gaataacact 1860 gatatttatt ttaagaagaa aattggctct cagtttatct aaaaacagta cagctccagc 1920 ccttagttaa atctacttaa tgttttatag taaaattttc tagtttaact actgaatcca 1980 ctatgtacat gttcatacaa tagtgtcaaa tctagtaagt ttacaaattt gtcaaaatta 2040 gtcaagttta aaagtagttc ttgaaagtag actgattact tttttacagg gtaaatgatg 2100 cttttcttct accaatagat ttaatcaaga ttattgtcac tacacacaat ttgttgggag 2160 tgtatatgtt tctttaaaca atgtttttta ttggcaaaca cattttcaaa tgatcaaact 2220 aaaatctaac ggaatgtttt tcactctgaa tataaaagct ttatctgtgc atttcacttg 2280 gaaaagtgat gtttaaaata ctattcatta aacttactgc atgactggag ttttattttt 2340 tacaaaccaa tagcattgac ctaattcatg cttttgattt tccactctgt agcagttttt 2400 caacacaaaa tgtaaacgtt ttatgcatgt gctaaaaatc tgtaattttg caccactgtc 2460 agagtttgta tgtaacaaag catacatgtt tttaaaactg tcagttttca aaagttactt 2520 aacgttgtct cctttcttct ggagttttta ggaacaacaa tgtttcattt gtgggggaaa 2580 gaaaaaaaaa gtgatgtttt gaaatacatt ggttacacca atatggtgta aaggttagaa 2640 gttggggttt catccagacg gccggggctc aacgcctgat tttttaatgc tttttaacct 2700 ctaaagtgtc ccaagtaaca ttaataacaa cacttagcca caattatttg tatcgcagtt 2760 gaaggctaca cttaatactg ccagcaaaac acaataataa caaagagtaa acactttctt 2820 attttaagaa tactttgcag taaactttaa tgcattagta agtttatttt atccaccttc 2880 tgatgcagaa tgtttttgaa acatttaatt atgtttttat gaccacgggc atctcaccaa 2940 aatatttgac ctaattcaag taaccagtgc tgcagttttt aatcacaaaa tgtctttaca 3000 aacttttttg cgtatgtgtt aaaagtctaa ttttacacaa ctcttatagc gtccgttaca 3060 atgcatacag taaatgtcac aataaccatt tctactgcca attatctttt ggtgtctaat 3120 tcttcacaat attttttcac gagaactatg aatctacaaa ggcaaggcaa ggcaagacaa 3180 ggcaagttta tttctatagc acatttcata cacagtggta attcaaagtg ctttacataa 3240 acaggaataa aagagacaac tataagaaaa ataaaaacaa ataatgaaaa atttaaaaaa 3300 atgataaaaa cagatcaaat gtgttaaaac aagttataaa agaatgaaaa gaagagaaaa 3360 acatattagt gtgatctgtc aatagtgcga tctgtctaca aaactaaaag aatagttgcc 3420 actaaatctg cattaatatc ttgttcaaca gtgtctaatg tactgtgtat taaaactgta 3480 ttcttttgaa ctgtcatgaa agtttgcgtt gtgaatgttt tggttttaaa ccagagacaa 3540 ccaacaatat tatgtatacc ttttgttcaa atactttttc aaatgaaaga taaattatga 3600 aattgtatgt cagtgcatct ttaaaactaa attgggcaaa aaaaaattag gttgtctttc 3660 taaactgact acaaaaccgt tatacatgag aactgtgtgt tattgggttg acttaaacgt 3720 taaccaattt taataaatgt tctacagttt atacatttgc tttaaacaag tgaccagtcc 3780 ttgacacact atgcaacatt ctaattttct aaatatttaa atatacattt ggcataaata 3840 cagggcttta atcagttttg acaatgtttc tatgatttgt ttcacaccat catgttttgc 3900 acctagctca atttttaact tttttacagg aaaggcatag acacttgagt tgataaactt 3960 ttatttcaag taaatataca ctgaaataga gtagtacata acaaacaaac atggattgca 4020 ttaccataat aaagacaaat taactaagaa aaaaaaaata catttataca gtcatacatt 4080 gaagaaaaaa cagggtacac cagaacaata aagacattac tgaatattac caaacaacaa 4140 agaaatgctg ctaaaagcat tctgtcatgt cctagtatgt ggctagtcat aacttccaca 4200 atctcagcaa tatttgttat tttgttttgt gaaaccagtg acacacaaat atctgttata 4260 tatgtggaca tacatagaac ctgcgttacc ccccatctca tcactcaatt cagtcagcac 4320 ctccagaatt tttgtattgg ttgcacagac agcaccattg ttatgaaaac acaggtgtgt 4380 caaatgattt atcgtatcgg cccatgagtt taggtccctc tcacaacgtc catttctctc 4440 ttgagatttt tcacacgttt gccatctcct gtacaatcat gtttagaatc acacgcatca 4500 ttaatgatct tctccagaag aaaaaccata tggtccctat aatgctgttt aaacatgttg 4560 tccacaatag accttatttg acaatcaaaa acagggttgc actcgtactc gggcctttca 4620 tgcgcattat cgtcagtatt ttgagaacaa tagcaaccaa ccatctttgc atggtggctg 4680 tcagaaaaaa atctgaaaat atttttaaga aaacgttttt tctttaatca caatacaaag 4740 tttgacccat aatctgtgta gatcatataa agtagtacct taaagatgcg gcctgtgtcc 4800 ttaaacagcg atagttcagc cctaatagtt cagtgcaaac aagaccaaag taggggctgt 4860 ggtcctcaaa gatagtacct ttttatgtct ttaatcttct ttttactcct gatcaattgg 4920 ttcttgtaga aagctttagg atcagtgact gtagccagaa gcattactcc ttggtagcta 4980 gaagtaaaaa tgggtgcttt atctacacat tcaaacaggc ccgccctcac tcacggggag 5040 gtactatgaa catctgctat ggtgtcataa aactcatgat agttgacatc ttgtctttta 5100 tttaagacat aggcgtgatg atgtagggat acctatttta aaaataaaaa taaagataca 5160 gagtcatacc gtaaatatat tcacaatgaa agattttaga gaactaatat tatagatttt 5220 ttacatttca tggaagtttt aggttgacaa aatataatgc acagctgaag tcatctcaaa 5280 actcattgcc aaaatggaca atacaattaa tttgaatgtt tttaagcaaa tatgtacagt 5340 cttttcaaca ttattctgca gtttgaattg ttcaatacaa ttgtttattt ggtttcctat 5400 ttgggctcaa agcagtcaca tatttaactg aaatttgcat ctttcatgat agtgttcttt 5460 ttcaaagaat gattgagaag aaataggatt acatctacac ttgcatattc tttttagaat 5520 ttaagtgaac tatacttgta agaaaataca attaaattta tgaaataacc catattccat 5580 aacttattac aatgcctttg aggtcaagaa agagatcaag tttaggagcc attacctcat 5640 aattggttat tagataaaag ttattttaaa aagtcaagtt tggttaaaaa cctaggcttt 5700 cttaaatttg ttctgttaaa acataaagaa aattttaaaa atcaagttat gcattatacg 5760 ttctggagat atataaacat acactttacc atcagataat ctacgaataa caatattcac 5820 caattttata caaagggtac caaaaccgtt tgcttatgtt tgttgcagag tagcaataag 5880 attttttttt tttttttttt tttttatgtt atgtgatact aaatctattt tctggcctat 5940 aattttagtg atatacttat tataaagttg aaataaaaag ctgatacatt tagttaaaca 6000 tcatgtgctt gacattttgg taaatagtta aagcttctga ttgtactaat tgaatattta 6060 gcattttgaa aggtaaatat gtgtcagggt atgtaaagct gacacctcta acttgtttgg 6120 ttgagatttg tggaatgttt ccacaaattg aaatattaaa gtgtttttct tattcactat 6180 tcagcaactt cgatgcaaca ttgtaaaagt gttagaaata tacaaaagta catgtcatgc 6240 tgttttgtaa aaacagacag tcatatcagt aaagaactgc acttcctgag acaaacattt 6300 tgttttgctg taaatcttta atattttaat gtatcgggca ggatgtaagc atacaactta 6360 catataactg actttttgta cgacaatctt tcgatgaaca tgtttacagt atattgcaac 6420 attttagagt ggtctttgaa atagttacaa tatcacagat gtaaagtact cgaacaatgt 6480 ttgaacaatg ttagctgatt gctgtactta agtactactt ttatgatttg taccctactt 6540 gagtacagtt aaaattaatg tttactttta cttcattact tttttaaaaa cagtattttt 6600 tttcacaact ttcaatttta tttattctca aaaaaagttc ctcactccta acaaaacgac 6660 aaagctgctt aaattaatgt tatttactat gtaattgaca ttcatttcaa tattccatca 6720 aactggtcag acatgctctt cgattgttta gaagtcatgt cctgtcacat tgagctctac 6780 aatacacatc aaccccagta agtctgctat cagtacactt ctgttaagca gttcataaga 6840 aaatatacat tcagtccatt gtctgtgtac acattcacat aattttattt agatgccgca 6900 cagaaataaa tttaaacctc acttaagtat atttaaactt atacaaacag aaagtgttta 6960 gttatttgtc ctacagtatg cttggtcatt gtttttcgcg tacaatttta ttacagtacg 7020 tgttctaagt acatttggtt agtagttatt ttgacaccct ttgattaatt gggcacatta 7080 tttttctaca taatgttata gacacatgta ttaagacaga gaataaatta gtttgagtcg 7140 acttaactga aatgaactgc gaacatgtca aatctttata aaagacggta atcttataat 7200 aacacaaatt taactgctta ctgcagtgag aaaaatatta aaatgaaatc acaaaaacat 7260 aaaaagtata catgcagcta gacaaaagac ctgaatactg tcaatattta catgtttact 7320 gaattctttt cagtatagag tatactgtaa aagtataatt tcaggtgtga cagcagtaaa 7380 cattaagcac ataagtagct aactataaca tttttattta tacatcctgt caacttatga 7440 gaatagtagt agttcactta tacttgtagt agataaatag tttgtgctgg aaatgatgca 7500 atgaaatgat gcatttatgt ctgcacttaa accatccata tgagaactgt cggtgctgca 7560 tgctcaatgc ttcaacattt tcaccacagg tctgactttc caaccaattt agcgttataa 7620 tttttagtga tacagtgtcc atattactct agtgtcttga aacttgcaag agtacagcta 7680 gtgtggcctt ggttaactag tagtacacaa agcagttacc acaagggata gtgtgttatt 7740 gaaaacatgt catgcaattt gtagaaacca atttagaaaa caactgaagt aaccgctgag 7800 cattttatct cagaataagt gcagtgttag attacctgtg gattttgaaa atgtttataa 7860 tttaataaaa attataaaca ttattaaaaa attataaaaa taattgtaat aaaaataaca 7920 aaagcttcaa acattattca aacatgtaat gaacttacac agtcatatgg tcaaagtata 7980 cttgttgcta atcaaatata gcacataaaa atacattaaa tacccaagag aatatgctgt 8040 agtacaaatt atgtacaatt caaatatacc aaatttaata tttaaaaata tggcatttca 8100 agatagcaaa taattaatta cattgaatta tacaggctct ttgtctagct gcatgtatac 8160 tttttatgtt tttgtgattt cattttaata tttttctcac tgcagtaagc agttaaattt 8220 gtgttattat aagattaccg tcttttataa agatttgaca tgttcgcagt tcatttcagt 8280 taagtcgact caaactaatt tattctctgt cttaatacat gtgtctataa cattatgtag 8340 aaaaataatg tgcccaatta atcaaagggt gtcaaaataa ctactaacca aatgtactta 8400 gaacacgtac tgtaataaaa ttgtacgcga aaaacaatga ccaagcatac tgtaggacaa 8460 ataactaaac actttctgtt tgtataagtt taaatatact taagtgaggt ttaaatttat 8520 ttctgtgcgg catctaaata aaattatgtg aatgtgtaca cagacaatgg actgaatgta 8580 tattttctta tgaactgctt aacagaagtg tactgatagc agacttactg gggttgatgt 8640 gtattgtaga gctcaatgtg acaggacatg acttctaaac aatcgaagag catgtctgac 8700 cagtttgatg gaatattgaa atgaatgtca attacatagt aaataacatt aatttaagca 8760 gctttgtcgt tttgttagga gtgaggaact ttttttgaga ataaataaaa ttgaaagttg 8820 tgaaaaaaaa atactgtttt taaaaaagta atgaagtaaa agtaaacatt aattttaact 8880 gtactcaagt agggtacaaa tcataaaagt agtacttaag tacagcaatc agctaacatt 8940 gttcaaacat tgttcgagta ctttacatct gtgatattgt aactatttca aagaccactc 9000 taaaaatgtt gcaatatact gtaaacatgt tcatcgaaag attgtcgtac aaaaagtcag 9060 ttatatgtaa gttgtatgct tacatcctgc ccgatacatt aaaatattaa agatttacag 9120 caaaacaaaa tgtttgtctc aggaagtgca gttctttact gatatgactg tctgttttta 9180 caaaacagca tgacatgtac ttttgtatat ttctaacact tttacaatgt tgcatcgaag 9240 ttgctgaata gtgaataaga aaaacacttt aatatttcaa tttgtggaaa cattccacaa 9300 atctcaacca aacaagttag aggtgtcagc tttacatacc ctgacacata tttacctttc 9360 aaaatgctaa atattcaatt agtacaatca gaagctttaa ctatttacca aaatgtcaag 9420 cacatgatgt ttaactaaat gtatcagctt tttatttcaa ctttataata agtatatcac 9480 taaaattata ggccagaaaa tagatttagt atcacataac ataaaaaaaa aaaaaaaaaa 9540 aaaatcttat tgctactctg caacaaacat aagcaaacgg ttttggtacc ctttgtataa 9600 aattggtgaa tattgttatt cgtagattat ctgatggtaa agtgtatgtt tatatatctc 9660 cagaacgtat aatgcataac ttgattttta aaattttctt tatgttttaa cagaacaaat 9720 ttaagaaagc ctaggttttt aaccaaactt gactttttaa aataactttt atctaataac 9780 caattatgag gtaatggctc ctaaacttga tctctttctt gacctcaaag gcattgtaat 9840 aagttatgga atatgggtta tttcataaat ttaattgtat tttcttacaa gtatagttca 9900 cttaaattct aaaaagaata tgcaagtgta gatgtaatcc tatttcttct caatcattct 9960 ttgaaaaaga acactatcat gaaagatgca aatttcagtt aaatatgtga ctgctttgag 10020 cccaaatagg aaaccaaata aacaattgta ttgaacaatt caaactgcag aataatgttg 10080 aaaagactgt acatatttgc ttaaaaacat tcaaattaat tgtattgtcc attttggcaa 10140 tgagttttga gatgacttca gctgtgcatt atattttgtc aacctaaaac ttccatgaaa 10200 tgtaaaaaat ctataatatt agttctctaa aatctttcat tgtgaatata tttacggtat 10260 gactctgtat ctttattttt atttttaaaa taggtatccc tacatcatca cgcctatgtc 10320 ttaaataaaa gacaagatgt caactatcat gagttttatg acaccatagc agatgttcat 10380 agtacctccc cgtgagtgag ggcgggcctg tttgaatgtg tagataaagc acccattttt 10440 acttctagct accaaggagt aatgcttctg gctacagtca ctgatcctaa agctttctac 10500 aagaaccaat tgatcaggag taaaaagaag attaaagaca taaaaaggta ctatctttga 10560 ggaccacagc ccctactttg gtcttgtttg cactgaacta ttagggctga actatcgctg 10620 tttaaggaca caggccgcat ctttaaggta ctactttata tgatctacac agattatggg 10680 tcaaactttg tattgtgatt aaagaaaaaa cgttttctta aaaatatttt cagatttttt 10740 tctgacagcc accatgcaaa gatggttggt tgctattgtt ctcaaaatac tgacgataat 10800 gcgcatgaaa ggcccgagta cgagtgcaac cctgtttttg attgtcaaat aaggtctatt 10860 gtggacaaca tgtttaaaca gcattatagg gaccatatgg tttttcttct ggagaagatc 10920 attaatgatg cgtgtgattc taaacatgat tgtacaggag atggcaaacg tgtgaaaaat 10980 ctcaagagag aaatggacgt tgtgagaggg acctaaactc atgggccgat acgataaatc 11040 atttgacaca cctgtgtttt cataacaatg gtgctgtctg tgcaaccaat acaaaaattc 11100 tggaggtgct gactgaattg agtgatgaga tggggggtaa cgcaggttct atgtatgtcc 11160 acatatataa cagatatttg tgtgtcactg gtttcacaaa acaaaataac aaatattgct 11220 gagattgtgg aagttatgac tagccacata ctaggacatg acagaatgct tttagcagca 11280 tttctttgtt gtttggtaat attcagtaat gtctttattg ttctggtgta ccctgttttt 11340 tcttcaatgt atgactgtat aaatgtattt tttttttctt agttaatttg tctttattat 11400 ggtaatgcaa tccatgtttg tttgttatgt actactctat ttcagtgtat atttacttga 11460 aataaaagtt tatcaactca agtgtctatg cctttcctgt aaaaaagtta aaaattgagc 11520 taggtgcaaa acatgatggt gtgaaacaaa tcatagaaac attgtcaaaa ctgattaaag 11580 ccctgtattt atgccaaatg tatatttaaa tatttagaaa attagaatgt tgcatagtgt 11640 gtcaaggact ggtcacttgt ttaaagcaaa tgtataaact gtagaacatt tattaaaatt 11700 ggttaacgtt taagtcaacc caataacaca cagttctcat gtataacggt tttgtagtca 11760 gtttagaaag acaacctaaa ttttttttgc ccaatttagt tttaaagatg cactgacata 11820 caatttcata atttatcttt catttgaaaa agtatttgaa caaaaggtat acataatatt 11880 gttggttgtc tctggtttaa aaccaaaaca ttcacaacgc aaactttcat gacagttcaa 11940 aagaatacag ttttaataca cagtacatta gacactgttg aacaagatat taatgcagat 12000 ttagtggcaa ctattctttt agttttgtag acagatcgca ctattgacag atcacactaa 12060 tatgtttttc tcttcttttc attcttttat aacttgtttt aacacatttg atctgttttt 12120 atcatttttt tatatttttc attatttgtt tttatttttc ttatagttgt ctcttttatt 12180 cctgtttatg taaagcactt tgaattacca ctgtgtatga aatgtgctat agaaataaac 12240 ttgccttgtc ttgccttgcc ttgcctttgt agattcatag ttctcgtgaa aaaaatattg 12300 tgaagaatta gacaccaaaa gataattggc agtagaaatg gttattgtga catttactgt 12360 atgcattgta acggacgcta taagagttgt gtaaaattag acttttaaca catacgcaaa 12420 aaagtttgta aagacatttt gtgattaaaa actgcagcac tggttacttg aattaggtca 12480 aatattttgg tgagatgccc gtggtcataa aaacataatt aaatgtttca aaaacattct 12540 gcatcagaag gtggataaaa taaacttact aatgcattaa agtttactgc aaagtattct 12600 taaaataaga aagtgtttac tctttgttat tattgtgttt tgctggcagt attaagtgta 12660 gccttcaact gcgatacaaa taattgtggc taagtgttgt tattaatgtt acttgggaca 12720 ctttagaggt taaaaagcat taaaaaatca ggcgttgagc cctggccgtc tggatgaaac 12780 cccgaacttc taacctttac accatattgg tgtaaccaat gtatttcaaa acatcacttt 12840 tttttttctt tcccccacaa atgaaacatt gttgttccta aaaactccag aagaaaggag 12900 acaacgttaa gtaacttttg aaaactgaca gttttaaaaa catgtatgct ttgttacata 12960 caaactctga cagtggtgca aaattacaga tttttagcac atgcataaaa cgtttacatt 13020 ttgtgttgaa aaactgctac agagtggaaa atcaaaagca tgaattaggt caatgctatt 13080 ggtttgtaaa aaataaaact ccagtcatgc agtaagttta atgaatagta ttttaaacat 13140 cacttttcca agtgaaatgc acagataaag cttttatatt cagagtgaaa aacattccgt 13200 tagattttag tttgatcatt tgaaaatgtg tttgccaata aaaaacattg tttaaagaaa 13260 catatacact cccaacaaat tgtgtgtagt gacaataatc ttgattaaat ctattggtag 13320 aagaaaagca tcatttaccc tgtaaaaaag taatcagtct actttcaaga actactttta 13380 aacttgacta attttgacaa atttgtaaac ttactagatt tgacactatt gtatgaacat 13440 gtacatagtg gattcagtag ttaaactaga aaattttact ataaaacatt aagtagattt 13500 aactaagggc tggagctgta ctgtttttag ataaactgag agccaatttt cttcttaaaa 13560 taaatatcag tgttattcca ttattcagta taattaacat atttctcttt atatgatcaa 13620 gtatttgcaa tgaattaaaa acagtttgaa tttaaaattc agtaactaag ttgtctagta 13680 atgcaattac tacaatttat tgtaataaaa ttacatttta tttatctttt gaaacagaaa 13740 aaacaataat gtctaaagag tgataagtcc tttttagtaa atattttatg ttgatttttt 13800 aaaccaacac cagtgatgac cagtgctcaa aacattaggc tttggattaa ctgtattaca 13860 atgtgtgcca taaattacag gacatgcatg taaataaatt gaaaacatgt tagtcaaagg 13920 aaaaactttc attttaaatt gtttttcata ataaaagttt tcattaatta taaagacaac 13980 ttgaatttac atctgtgttt ttcaacattt tagaatatta cattttagat aacgttgttt 14040 taaaaggtgt gacaatttat tgtcaataaa gattaagatg tgttatttac tggattaata 14100 ttgtaaacat tttcccagag atgggttgcg ggtggaaggg catacgctgc gcaaaacctt 14160 gctggttaag ttggcggttc attccgctgt gtccacccag gaattaaaaa ggggctaagg 14220 cgacaaaaaa tattgtaaac atattttaat ttaatactcc gacaaagata catcaaaatg 14280 tttgtgtgtc ttaaaatatg tattattgta gtaataatgt gaaaaacaac ataacctatt 14340 aatgagtatt tttagtaaat actgtgtcgt tggattcaag catacaaagc cagacccaag 14400 ttcgctcaag cgcgcacaca tgcacgcacg cacgcacgca cgcacgtacg cacacacaaa 14460 cacatacata tgcatacagt tgactgcatg tggatatata gatatcagca cactttctaa 14520 tcacctaaca acatatagta gttagtggga gacacccaac acatacaatc agatatacat 14580 attctacatc tgatataaat ctatctttca caagcagttg acgggatgtt atccaaagtt 14640 ttccttatca tcactcactc acacgctcag gcccagactg agacacagag gaaacatcgg 14700 tatgaacata aagaaatgtt tatgttaata gtttattaat tataatataa tataatataa 14760 tataatataa tataatataa tataatataa tataatataa tataatataa tagaatagaa 14820 tagaataata atgatatata atataatata atttaattta atatctaata taaatattat 14880 aataataata taatataata atattatata atatggtata attacaatat aagttaataa 14940 aataatgatt taatataact taatataata tattataata caatacataa ttttatttaa 15000 taaaatatta attttatata acttaatata atataatata ttgtaatata ataatataat 15060 ttagtcaaat aactaaaaat ataaaataat ggttctaata acagtaattt aattaccgtt 15120 attattataa taataatttt atttctgttt gattagaatt taattaccaa tattttcatt 15180 acctggaaat agcctattaa tttaattaga ctgataaaaa tctaattacc ggaattatta 15240 ttttatatta aaccgataaa aatctaatta ccggaattat tattttatat taaaccgata 15300 agaatctaat taccggaact attattttat attaaaccga taagaatcta attaccggaa 15360 ctataaagtt tatataaaag gggcggggct tagaccggaa gtcccgcctt agggggcggg 15420 gctacaatcc atcaaacttt ttgacaggtg ctccatcata attctctct 15469 // ID DNA13TA1a_DR repbase; DNA; ZEB; 318 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE DNA13TA1a_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA13TA1_DR; KW DNA13TA1a_DR; Dr000073; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-318 RA Bao Z.; RT "Dr000001, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-318 RA Kapitonov V.V. and Jurka J.; RT "DNA13TA1_DR, an ancient family of nonautonomous DNA transposons RT from zebrafish."; RL Repbase Reports 2(6), 3-3 (2002). XX DR [2] (Consensus) XX CC This element is characterized by 13-bp terminal inverted repeats CC and the TA targets site duplications. CC Its classification is not certain yet, although it CC is expected to be a member of the Mariner/Tc1 superfamily. CC There are several hundred copies of DNA13TA1_DR present in the CC zebrafish genome; they are ~95% identical with the consensus CC sequence. CC DNA13TA1a_DR is a young subfamily of the old DNA13TA1_DR family. CC Their consensus sequences are 92% identical with each other. CC Interestingly, the 13-bp TIR in the DNA13TA1a_DR consensus CC sequence has one mismatch (C at position 3). However, the CC DNA13TA1_DR consensus sequence has the perfect TIR. CC Presumably, one of the ancient DNA13TA1_DR genomic copies served CC as a master copy that started the DNA13TA1a_DR family. CC Unclassified version of DNA13TA1a_DR was identified as Dr000073 CC [1]. XX SQ Sequence 318 BP; 76 A; 71 C; 58 G; 113 T; 0 other; ttcaccttat tctccgccga cgagcgggcg ctgccatttg aatctttttg gctcgagact 60 tccggtctca ttcacttcca ttcattttta gacgttaaaa actgctcgtt acgctgcttg 120 atgttgcaaa ctgatatttt cttattatat tattctactt ggtctgtata gtcatgcaaa 180 catttgtttg tagagcaagt agtttgaccg ttttctgccg tttattattc ctagtcattt 240 ctcccatagg cgactgaatc ggaagttcta aaacaatcgc gaaaacaggc gcacttccgc 300 attttagaat aaggtcaa 318 // ID hAT-N55_DR repbase; DNA; ZEB; 1921 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N55_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1921 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1324-1324 (2008). XX DR [1] (Consensus) XX CC TIR is 38 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1921 BP; 586 A; 344 C; 331 G; 659 T; 1 other; caggcccggc gccaggatgt cagaactgag ggggcatttt aaatccatag gggggcacta 60 gacctggtga ctgattttgg tctctcttgt tyctttcatt taggctattt attcacattt 120 attattactt tactttactg cctaagttac tgacctaagg cagtaatata ggctacgttt 180 taatagtaat aataatatgc acagtttcgc tattactgat tttattaatg atggtaatgc 240 aacgataccg cttattaatg cagacggatc atggcttcag tgccaaacga gtgctttaaa 300 tcatatgtta ttctccttat ttatttccgt tttatttcaa ataacaggca tatgaatgtt 360 taaagtttta ggtaaagacg tcaaattatg cccacacatg caagctgacc cacgatcaga 420 ctattcgttt aaccggtaaa taaatatagt gaaatcagct taaccgagtc atcacgcttc 480 acgtagccta ttacccaacc aacatcagtg ttagggaaaa gctaaatgtt acaatctaat 540 gtcatttaaa acaattactt agttacattt taaggcatta gtgtagccag aaaaatacct 600 aaaaaagata ccaaaattca catgttatta ccgtgaaaat tggttaaatt ttggcacgga 660 taacacgcaa ctcagtatgc tatataactt ttttccaatc attttaaatg taaaaataaa 720 gatttatctt aggctatcat aagtgtgttt ttttcgtgat attgtctggt tgtcatttgt 780 aaaccatcat aaaccattag gttactaggc ctacccgtca ttttatttat taattatttt 840 atttattaag cctattttaa taacttgttt tcaggttatt ttcattttag ggccatgagc 900 tacaagtcaa aacaaactcg atatgctgac tgtattttag ctcttaaaat agatctacaa 960 tatattttat gttgtttaat tttgcatgag caaatagcag tatagcctaa attacatatt 1020 tttaaaagat tatcattcag cctacctgca gcacgtgaga ggtttctcgg ttgatccgcg 1080 gtaaatccac ttaagcacta tacctattcc atttataaca ctgcaaactt gacttcgcag 1140 gccgatcatc tttaaacgca aaaggcattt attaagtgtt ataccttgac tgctgctgca 1200 taatgggatg tttctctaca caactgtaaa agtgcgcttt ttagcaatgt caaaatgaaa 1260 gcaaacattc aaagggttct gcgttttgtc aattttagct taattgtaat aatattaata 1320 ataatattaa taaaataaat taaaatatta tttatagtgt agcctactgt gggactgtgc 1380 agtgaactat ttatttttat taaaacattt ttattaggct acatcttccc gaataactaa 1440 tgtgaaaatg actggcagtt gcgtctctgc taaaattaat ttaacagcaa atttcttctg 1500 cgtttcttcc ttttagagct caagagagtt gtgtttaaag aattagttaa ccaaaacatg 1560 taatgaatta agctgctcgt atcatgtcac ccacttttct cccttgtgag ttgagtgcgc 1620 ctccatcgcc gtgtgcgcgt tttaaaggct atttacgggc gcacctgacc tgacgcgggg 1680 accggatctc tggactatag gactagcttt aatctcacaa ttttcataat ttaattgcta 1740 tgtcatatat tttaaaatat ttaaatgtgt gttctaaagg ttggaacgac gggagtgtta 1800 gtaattaatg acagaaagaa ttttcagttt tgagtgaaca atcctttata aaaactgggg 1860 gggcacaaat tctttctgag ggggcaatgc ccccctttgc ccccccgtag cgccgggcct 1920 g 1921 // ID EnSpm1_DR repbase; DNA; ZEB; 4402 BP. XX AC . XX DT 09-SEP-2003 (Rel. 8.08, Created) DT 09-SEP-2003 (Rel. 8.08, Last updated, Version 1) XX DE EnSpm1_DR is an autonomous DNA transposon - a fossil record. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; En/Spm superfamily; EnSpm1_DR; KW EnSpmN1_DR; Zn-finger; transposase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4402 RA Kapitonov V.V. and Jurka J.; RT "EnSpm1_DR, an autonomous En/Spm DNA transposon from zebrafish."; RL Repbase Reports 3(8), 152-152 (2003). XX DR [1] (Consensus) XX CC This is a fossilized copy of En/Spm DNA transposon related CC to the EnSpmN1_DR nonautonomous element. CC EnSpm1_DR encodes remnants of the EnSpm1_DR transposase. CC Transposase: CC INQKFLKRSWLQGRERRDTAVTGMVRIIEIIHFFADDCRISLNLYIDDFEVTNPLGTSRKKHKMCAIYWV CC IANLPAKHTSSLHTIQLALLCKTDVVKQHGYSKILRPLIEDLTCLEEQGVYVEQLGSCVRGSVLHVLADN CC LGAHSLAGFQESFRVNFPCRFCLATRDDIQIKEVRSGSFEMRTKEKHQQHIQDVLQDRTLVAQYGVKSSC CC PLSDNLKYFHAIGGYPPDLLHDLLEGIVPVELALCLQDLIGKHYFSLDSLNNAIKQFPYKFSDRTDQPQP CC IPKSYKSKQTIGGNGDENWALLRLLPFLIGHKVKENDHTWEIVMLLKDVVELAFSS CC The zebrafish genome harbors several diverged families of CC autonomous CC En/Spm transposons. XX SQ Sequence 4402 BP; 1361 A; 784 C; 898 G; 1359 T; 0 other; acaaaagagg attgtagcac accattagag tattgtaatg cgtattgtgg gtattatatt 60 tggtttgtga tatcgttaca ctcttaatat atataatttt ttcttttcag actaccagtt 120 tgggatatgc agtggaaatg taagtcttgc tcatacacat cagaaaagag ggcacaactt 180 tttaaacact accgcataag ccatggagca tattcacgaa ttgctccagt accatgtctc 240 catactgagt gtccatgtac tttaaaatct tttaacgctt taaaagtaca tttatcaaaa 300 aatcattctt aacaaagaag tgcacaatca agtaactcta gccatgcttt agtttcttac 360 aaatgccatg tttgtgattt taaagaaccg tgcagcgagt cagaatattt tgcccatttg 420 cgtactcatt tgaggaatag tgagaaagtt cagtgtcctt ttaaagattg ttcatttgag 480 acaaatgtgt attccacttt taatgcacat aaaagtaagg cacaccagca acattgccat 540 gaggatttta aattgggggt tatgttaagt actgcatttg gtgacacaca agaaaatata 600 gttcattcac ttaatgacag ctcaggcagt gaagatgaat gtaatatggc agaggaacaa 660 gtcaatttgg aagatacaaa acaaaagttg gaacacagtt tagcatcact ttttctcaaa 720 atgcgaacaa tcttgcatat ttctgaaaat gcgacacaag aaattataca acagttgaat 780 cagattcatt tattgtctga gccactgttg tataatataa tacagcaaac tgttgagaag 840 cactatgata aagtagacag cacagttgtg aagaagctag tcagggcagt aactgaaagc 900 aatgtgctgc tacattgtac aaatgaagga ggtcctcttt caacgtccaa taggagggca 960 tcatacttca tgagagaata tccagttgta agccctgttg agtacaaagt agatagagat 1020 gagcagtatt atgtgtatat cccaattatc caaatgttgc agaaattgct aaataaacca 1080 gaagttcttg aagaggtcct ggcttcaggg gagagagcga agggatacag cagttacagg 1140 gatggtacgc attatagaga taattcattt ttttgcagat gattgtcgaa tatcccttaa 1200 tctctatata gatgattttg aagtgaccaa tccacttgga acgtcacgaa aaaaacacaa 1260 aatgtgtgct atctactggg ttattgcaaa cttaccagca aaacacacat catctttgca 1320 caccattcag ctagctcttc tgtgtaaaac agatgtagtt aagcagcatg gctactcaaa 1380 aatactacgc ccattaattg aggacctgac atgtcttgaa gagcagggag tctatgttga 1440 gcagcttgga tcatgcgtaa ggggatctgt tctgcatgtc ttggcagata atcttggtgc 1500 ccattcgctg gcaggctttc aagagagttt tagggtgaat ttcccttgta gattttgctt 1560 ggccacaaga gatgacatcc aaatcaaaga agttaggtct ggctcttttg aaatgaggac 1620 aaaagaaaag caccaacagc atattcagga tgttctacag gatagaactt tggtggcaca 1680 gtatggtgta aagagtagtt gtccattgag tgacaatttg aagtactttc atgcaattgg 1740 tggctacccc ccagatctgt tacatgatct tttagagggt attgttcctg tggagttggc 1800 attgtgtttg caggatttaa ttggaaagca ttacttttca ctggattctt taaataatgc 1860 catcaaacaa tttccataca aattttcaga caggactgac cagccacagc caattccaaa 1920 atcatacaaa agcaaacaaa caataggggg aaatggcgat gaaaattggg ccctcctaag 1980 gctcctccca ttcttgattg gccataaagt gaaagagaat gatcatacct gggaaattgt 2040 aatgttgtta aaagatgttg ttgaattggc tttttctagc tagttttctg atgacacagt 2100 ggagttcttg aattttaaga tctcagaaca cagagctttg cttcaagatg tgtttccaaa 2160 ctttgtacta cgcccaaagc atcactactt ggagcattat cctcaattga ttagaacata 2220 tggaccactg aggaatttat ggaccatcag atttgagggg aaacataagt ccttcaagaa 2280 agtgatccgt gatgcacaca atttcaaaaa tgtcccctaa tgctggccaa gaagcatcaa 2340 atggcaattg cctatcatat ggatggcagc tcttttttca agccaaaaat acaaatggac 2400 agggtggtta gttctctttt aacagctttt cctgaagctg tacaagagta cctacatctg 2460 cagagtccta gttgtagctc tgtgctagtt tcatctagtg tttacattga tggaattagg 2520 tactgcccaa acatggtatt atcagttggg tcttgttcag gtttgccaga gttcatgcag 2580 atagagaaaa tcttaacagt caacaccgag ataatgtttc tctgcgaacc acaatctgca 2640 tggtacattg agcatttgcg ctcttatgaa ttgtgcagtt tgcattcact tcaagttgtt 2700 ctgccttctg aactgaatga ccccttccct ttggcagctt ataatattcg agggtgttta 2760 tatgtcacat tgaagtatta catctgttgc taaactgcat tgtatcattg catgtttctg 2820 atttctactt tctttgtttt agaatttcac ttagccatgt ccatgacact tcgcgttatt 2880 gtgcacgaca gtgacatccg gaaggttgtc ttagaggaaa caccagctga tgctgatgct 2940 gatgctttaa aacaagtgct gcagaagaaa ctgcaacttg catacccatt caacctccag 3000 tatgaagacc ctttttttaa caatgcatta tgcaatatca gtgacattaa tgacttacca 3060 gagagagcga cggtgaaaat aattcctctg gacacatcac cttcctcaag tcaagctgac 3120 accataattc tctcaggttc tgatacatca gagcagttgt ctgttgaacg acagattgta 3180 tggccagatg agatacctaa attctcggtt aatgttgagt atcttctacg acagggaaac 3240 ctagctttta tgaaagatgg tacaactgct gacgttggaa gagacataaa acatgacata 3300 ctggaggggc ttgctggggc catttacaca tttagggcct atccaagtga tgaggatttt 3360 acccaagttg ctaaatcaaa aatgcgtaaa tctggctgta gtgatgttgc aattggggaa 3420 aagagaggtg ccatgaaggc aacaaagcat caaggcatca aaaaacccag acgatctgag 3480 ttgaatttct tgccaaactt tcctaatggc gaaaatgaag aaagccaaga atgtacaaga 3540 atagaacttg tagaagaaat gaaaaggagg cctctaaaca tgagcttgat tggccaaaag 3600 atggacagta catttgcctt gagacgaaaa gaaattgtcc actctgaacc tcttgtcagt 3660 gacatcttgg aaaagtggcc ggcacttttt ttggaaagtc aggtttgttt tcttttcaaa 3720 acttagtatg attttttaaa tttgtgtttt tgaatcttaa tattgactca atgtttttaa 3780 ttagatcttt gcagaattca gtcgaatttc gggaagaaat ttgagatcgg agttctacac 3840 tgcattagac acacactcag ctcgcttgat agaaattttc aaaaaaagag gaggaaatca 3900 gggaaaaaaa ctagatgaga ttttgcaaca agtttattct aaggtatttt tcctgacatc 3960 atctttgaat catcgtttcc atttgatttt tttcgttgag ctttaatgca tgtctacctg 4020 tctttacttt ttcatagcca tctgatgtga cagttgtccg ctccgcagtt cttcagggtc 4080 ttccagtcct tttgggggat gagtctgaag attttttcag gatgtgtttt gtaagtagta 4140 ttattgtttc tgctttttag agtggtaatg aggcaatcat tcaaatttac tttatttact 4200 gaaatcaatt tcccctttta taaaacaact aaaataaagc aaatgtaatt attttttaag 4260 taaaaggaaa gttttattat ggcttttgga gcaagagtgc tgccttcttt caaaattcac 4320 agtatgtgac atcacagggt tagtttgctt cataggccag tgaggtaatg catgcattgc 4380 tttaattttt gaacatactt gt 4402 // ID Gypsy37-I_DR repbase; DNA; ZEB; 5226 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 21-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy37_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy37-I_DR; KW Gypsy37-LTR_DR; Gypsy37_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5226 RA Dib M.R. and Naveira H.F.; RT "Gypsy37_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 804-804 (2007). XX DR [1] (Consensus) XX CC Gypsy37-I_DR is an internal portion of the Gypsy37_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy37-LTR_DR. CC Gypsy37_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1536-aa polyprotein CC Gypsy37_DR1p (pos. 512-5119) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 512..5119 FT /product="Gypsy37-I_DR_1p" FT /note="ORF." FT /translation="MTSVEEFCSAPSDELLNLCSKKQLWEIVDRYELSGVD FT KRMRKDEVREMIRVKLSELQVLPVTSRRATAGVNPAAEKMSFVGLSFEQQK FT ELIRMQQLHEKEMRVSEMEIEKLRLDDLMRKREMEFEKIKHEQKLAVERQN FT QEFQLKFESLKLQNASGSPVHRTDGSFNLINNLKLLPVFNERDPDVFFSLF FT ESIADERDWPETDRTVMLQSVLVGKAQEAYTALSVDERKKYDSVKSAVLKA FT FELVPEAYRQRFRTWRKGERQTHVEVTRELGNHFDRWCSTSNVKSFSELRE FT LMIIEQLKNIVPDQIAVYINENKPRTAAEAASLADDFVLTHKNRFSSGNRG FT NGNVHFKQSQQIDPYLHSRRAISDSTCRYCLEVGHWKNECPLLKNKQIRGK FT NKNTAPVLLTNCFPQTPAVCSTDDSVIKGNCSEIKGNTGYAPFISQGCVSL FT LEEHESVPVRVLRDTGSSESFILESILPFSSSSYVGKNVLIRGIALQTLSV FT PLHKFSLQSQLVNGEVIMGVRPSLPVEGVDIILGNNLAGESVWPVMTPAPI FT VTTSSVLDQECCELSEDVSCVVTRSKSKMLNESNLNDTRVPVESVAPGLSL FT LSKISRAEFIVAQRDDPGLEKMFAAVLPPEEVESAATGYYINDGVLLRKWL FT ALHEDGGGESVVQVVVPEKFRDVVLKLSHGDLAGHLGVKKTYSRVLQHFYW FT PLLKKDISRFIRTCHTCQVTGKPNQPLKPAPLYPIPPVEQPFEYLLIDCVG FT PLPPSKSGSKFLLTVMCQSTRYPAAYALRNITTRSVVKALSQFISIFGIPK FT VIQSDQGTNFTSKMFAEILKQLGVKHHRSSAQHPQSQGVLERFHQTLKCLL FT RAYCVELNRDWEEGLPWLLLASREVTQSSLGFSPNDLVFGHKIRGLLSVLG FT DQLKGKQPPQSLLEYVNGFKRRLILAGQAAKENLEKAQIKMKTLFDRRSEQ FT RVFKPGDQVLALLSLVSSPFCAKFVGPYTVVRALSDLNYEIATPDRKKKTQ FT ICHVNLLKPFFSRDCQADIKSSGVISPAIEALGQSSAGDREGGKFLDDAVV FT PPRLKNSETLKELEDVLSHLTVVQRAELSHLIREFPSLFGDVPSQTHLIEH FT DVDVGDALPIRQRFYRVPFEKRIKLESEIQYMLKNNIAKPSCSSWASPCLL FT VRKPDGTDRFCTDYRKINKITKPDAFPLPRMEDCVDQVGSARFVSKLDLLK FT GYWQIPMSPRAQEITAFVTPSGLYSYSVMSFGLRNAPATFQRLMNRVVSGL FT EGCAVYLDDVIVYSENWDQHMRRVRELFKRLAEANLTVNLAKCEFARATVS FT YLGKEVGQGMVRPLRAKISAIDNFPIPCTKKELMRFIGMASYYRGFCPNFS FT SVICPLTNLLKNSAKFDWNPHCQKAFENVKLLLTATPVLAAPQSGLPFEIQ FT VDASHVGAGAVLLQTHSDGLSHPISYFSRKFNSYQLNYSVIEKETLALIWA FT LQHFDVYVGGGSPVVIYSDHNPLTFLHSLQSPSQRLMRWILFLQPFNLTIK FT HIKGIKNIFADTLSRAPVEE" XX SQ Sequence 5226 BP; 1400 A; 939 C; 1287 G; 1600 T; 0 other; tatgggggct cgtctaattt aaatttagtt gatttgatct ttggctttcg taatcggtat 60 ttgttgtgta ataggtggga gagacaagga ggcggagatg tttgttggag cggctttggg 120 tttctagtcc gatcaggaag aggtaagttc ttgcggttta aaactgtgtt aatgtaaatt 180 tcccctgtta ggttgtagca gcgcattgcg ttgttttttt tattttagtt tgttattttt 240 gtgcggcttc tgggttgagt gcactgccgt ggagtagctc tgaactattt gttcagttcc 300 acaccagcgg tatctgagtg cttcaaaggg ctcgactctt gggagcgcat gaagattagg 360 aatagttatt tagagttttt tttcgatagg attaggtggg gaattaatga atttgtgcac 420 agtttttgac cgtaagaatt acctgcgtaa attgtaattg ggagtaggtt atttgattat 480 ttgttgtaac tttggttggt tgattttgat aatgacgtca gtggaagaat tttgtagtgc 540 tccttcggat gagctattga atttgtgctc gaaaaagcag ctttgggaaa ttgtagatcg 600 atacgagttg agtggcgttg ataagcgaat gcgtaaggat gaggttagag aaatgattag 660 agttaaattg agtgaacttc aggttttgcc agttacgtcg agacgagcaa cagctggggt 720 taatcccgca gctgagaaaa tgtcatttgt tggtttgtca tttgagcagc aaaaagagct 780 gattagaatg caacagttgc acgaaaaaga gatgcgcgtt agtgaaatgg aaatcgaaaa 840 acttagattg gatgatttga tgagaaagcg tgagatggaa tttgagaaaa taaaacatga 900 gcaaaagtta gctgtagaac gacaaaacca agagtttcaa cttaagttcg agagtttaaa 960 actccaaaat gcgagcgggt cacctgttca tcgcacagac ggctcattca atctcattaa 1020 taatttgaag ttgttgccgg tgtttaatga gcgagatcca gacgtatttt tttcgttgtt 1080 tgaaagcatt gctgatgaac gagattggcc tgagactgac cgtaccgtta tgcttcagtc 1140 cgtattagtg ggaaaagccc aggaggcgta tactgctctt tctgttgatg aaaggaaaaa 1200 atatgatagc gtaaaatctg ctgtattaaa ggcatttgag ctggttccag aggcataccg 1260 ccaacgtttt agaacgtgga gaaaaggtga gagacaaaca cacgtggaag taactagaga 1320 gctaggtaat cattttgatc gctggtgttc tacatctaat gtaaaatcat ttagtgagtt 1380 gcgggaattg atgattattg agcaactgaa aaatatcgtt cctgatcaaa ttgctgtcta 1440 cattaatgag aataaacctc gcactgctgc tgaagctgca tcattagcag atgactttgt 1500 tctgactcat aaaaaccgat ttagttctgg gaatcgggga aatggaaatg ttcactttaa 1560 acagtcgcag caaattgatc catatttaca ttctcgtaga gcgatatctg attcaacgtg 1620 ccggtattgt ctggaggtcg ggcactggaa aaacgagtgc cctttgctaa aaaataagca 1680 gatacgtggc aaaaataaaa atactgcacc tgtgcttttg accaattgtt ttccgcagac 1740 tccagctgtt tgcagtacgg atgactccgt gataaaagga aattgttccg aaattaaagg 1800 taacactggg tatgcaccat ttatatctca aggttgtgtt tctcttttag aggaacatga 1860 aagcgtacct gtaagagttt tgagggatac gggatcgtcc gagtccttta ttttggagtc 1920 catacttcca ttttcatcca gctcgtatgt cggtaagaat gttttgattc gaggaattgc 1980 acttcaaaca ttgtctgttc ctttgcacaa atttagttta cagtcccagc tagtgaacgg 2040 cgaagttatt atgggagtgc gcccttcttt gccggtggag ggagtggaca ttattttggg 2100 aaataatttg gctggggaga gtgtgtggcc tgtaatgaca cctgcaccga ttgtcaccac 2160 gtcctctgtt cttgatcaag aatgttgtga gttatccgaa gatgtgtcat gtgtagtgac 2220 tcgatctaag tctaaaatgt tgaatgagtc taacctgaat gatacccgtg ttcctgttga 2280 gtctgtcgct cctggattat ctctcttgtc aaaaatttct cgtgcagaat tcattgttgc 2340 ccagagggat gatcctggat tagaaaaaat gttcgctgct gttctcccac cggaggaggt 2400 agagagtgcc gcgactggat attatattaa tgatggggtg cttctgcgca agtggcttgc 2460 gttacacgag gatggtggtg gtgagtcagt ggttcaagtt gttgttccag agaaattcag 2520 agatgttgta cttaaattgt ctcatggcga tctagctggt cacttgggtg taaaaaagac 2580 atattctaga gttttgcagc acttttattg gccgctgttg aaaaaagata tctcgcggtt 2640 tattagaaca tgtcatacat gccaagtgac cggaaaacct aatcagccac tgaaaccagc 2700 cccactgtat cccattcctc cagtagagca gccctttgaa tatttgttaa tagactgtgt 2760 tggtccgtta cctccctcaa aatcgggaag taaattcctt ctaacagtaa tgtgtcaaag 2820 tactcggtat ccagcggctt acgcgttgcg taatatcact acccggtccg tagtaaaagc 2880 cctgtctcaa ttcatttcca tcttcggtat tccaaaggtc attcaaagtg accaaggaac 2940 gaactttact tcaaaaatgt tcgcagagat tttaaagcaa ttaggagtta aacatcatcg 3000 ttccagtgcg cagcatcctc agagtcaagg ggtgctggag agatttcacc aaacactgaa 3060 gtgtcttttg cgggcgtatt gtgtcgaatt aaatcgcgac tgggaagaag ggcttccgtg 3120 gctgttgtta gcatcaaggg aggtcacgca atctagtttg ggctttagcc caaatgattt 3180 agtcttcggc cacaaaattc gaggtttact ttctgttctc ggcgatcagt taaaaggcaa 3240 acaaccacca caaagcttac ttgaatatgt gaatgggttt aaaaggcgct tgattttagc 3300 tggtcaggca gctaaagaga atttggagaa ggcacagata aaaatgaaaa ctctttttga 3360 tcgtcggtct gagcagcgcg tatttaaacc gggggatcag gttttggcgt tgctgtcact 3420 agtgagctct ccgttctgcg caaagtttgt gggaccgtat actgtagtgc gtgcattgtc 3480 tgatcttaat tatgaaattg caactccaga cagaaagaaa aaaacccaaa tttgccatgt 3540 aaatctgctg aaaccctttt tctcccgaga ttgtcaagct gatatcaaat cctccggggt 3600 gatcagtccg gcgattgagg cgctgggaca gtcctccgca ggagaccggg aggggggtaa 3660 gtttttagat gatgctgtag tgccacctcg attaaaaaat tctgagactt tgaaagagtt 3720 ggaggatgtg ttgtctcatc taactgttgt gcagcgagcg gagctgtcac atttgatccg 3780 cgagtttcca tcattatttg gggatgttcc atcacaaacg catcttatag agcatgacgt 3840 ggatgtgggg gatgctttac ccatacgcca gcgtttttat cgtgtccctt ttgaaaaacg 3900 cattaaatta gagtctgaga ttcagtacat gctaaagaat aacatcgcaa aaccttcgtg 3960 ttcgagttgg gcatcgccgt gtttgctggt aagaaaaccg gatggaacgg ataggttttg 4020 tacggactat cgaaaaatta ataagatcac taaacctgat gcgtttcctc tcccaaggat 4080 ggaagattgc gttgatcaag tgggctcggc aagatttgtt agtaaacttg atttactcaa 4140 aggttactgg caaataccaa tgagtcctag ggcacaggaa attactgctt ttgtaactcc 4200 ttcagggcta tattcttact ccgttatgag cttcgggtta cgaaacgctc ctgcgacatt 4260 tcagaggtta atgaaccgtg tagtatctgg tttagagggt tgcgcggtct atttggatga 4320 cgttatagtt tacagcgaaa actgggatca acatatgaga cgggtgcgag aattatttaa 4380 acgtttagct gaagctaatt tgaccgtgaa cttggctaaa tgtgaatttg cacgggcaac 4440 tgtttcttat ttgggcaagg aagtgggtca gggtatggtg cgccctttgc gtgctaaaat 4500 cagcgcgatt gataatttcc ctattccgtg taccaaaaaa gagctgatgc ggtttattgg 4560 aatggccagt tattatcgtg gcttttgccc aaacttttca tcggtcattt gtccgctgac 4620 taatctatta aagaactctg caaagtttga ttggaatcca cattgccaaa aagcttttga 4680 gaatgtaaag ttgttattaa ctgctactcc tgtgttggca gcgccccaaa gtggtcttcc 4740 ctttgagatc caagttgacg caagtcacgt gggagccgga gcggtccttc tccagactca 4800 ttctgatgga ttaagtcatc caataagtta tttctcacgc aaatttaatt cgtatcaact 4860 gaattattct gttattgaga aggaaacact ggcgcttatc tgggcacttc aacactttga 4920 tgtgtatgtg ggaggaggtt ccccggtggt tatttattcg gatcataatc ctttgacatt 4980 ccttcattca ttgcagagtc cgagccaacg cttaatgcga tggatccttt tcttacaacc 5040 attcaacttg acgattaaac acattaaggg aattaagaat atctttgcgg acactttgtc 5100 gcgtgcccct gtggaggagt gaggcggtat ttttggtctc ccactgatgt ccctgaccta 5160 ctcctgtatg tatgttatag ttattatttg atttaatttg atgtgggatt attttttagg 5220 ggggtg 5226 // ID BEL8-I_DR repbase; DNA; ZEB; 6706 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL8-I_DR; KW BEL8-LTR_DR; BEL8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6706 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1365-1365 (2008). XX DR Genome; chr18; Positions 48739167 48745872. XX CC Positions [5652-6209] - Integrase core CC 'GCGCC' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 492..6596 FT /product="BEL8-I_DR_1p" FT /translation="MSDLGERPMDDRVNQQLATQPSVRSKVATETSDAVET FT ELRRGKRTRKLTEKGQKLQEEKLERYEQRFKVSYDKWKSLVKEVKWLLSES FT SSIESLQDVMARMTKASAVVGTGYEDLRHLSIPDHDTRRRVDHCEAVTKKM FT EVLVKHYFEGNKEMESGWSDIELSEEVSVGTRISRTSTSSQSKSRSSKFSS FT LSAAKKQEAAAEVAANEATLEVLKEQDRQFKELQKLEAEDKKRIAEQEADD FT LKRREEEAKVKAQHEVENAARRKLLEDKRRELERLETVKKLNAAKARMQIY FT EQSEVSEDDELLSDLSHDCEFKEIHVDKRHHRPDNLSTKNVTLQPTVTTPQ FT IDNTSLVKALTESMSVNRLPVPEPMVFNGDPLKFNDWKTSFQMLIEKKSIP FT AVEKIFYLRRYVGDSVKKALEGHFLLGTDSAYCSAWKILEERYGNPFLIAK FT SFREKLYAWPKIGPKESLELQEFVDFLRSCEAAIPQIKELEVLNDCNENQK FT ILAKLPDWLTSRWNREVMETEENTKTFPSFSQFVKFLTKEAKIVCNPITSL FT YALKSSDKEGTRTLKTRSSVGKVLATNSGGDNLVLVSCVFCQKNDHSLHKC FT HKFMEKTVVERHKFVYKKMLCFGCLKPGHQSRICQNRSVCDICAKNHPTCL FT HEDRSRSVGNHGVTESFKKSTDLYQHRHTRVLNEATSNQVMQEINNTCTSS FT IIPVWVSTTSEPEKEVLVYALLDTQSDTTFILEETTHALNVKKEAVQLKLA FT TMASKITVVPCFKMKGLQVRGFYSKTRIPLPTTYSREFVPASKDHIPTPQT FT ARVWPHLEHIADVLAPRQGCDVGLLIGYNCPQALLPREVVAGEGNQPFAQR FT TDLGWSVIGHGNPCVDSGDIYGTSHQIIVRQVMPSIPASVNLKTKVHYVCR FT TQIKEVVSPIDVIRVFESDFCEQESEDCCMSQEDLRFLGKMKAGTSLNKAG FT HYEMPLPFKGDRPNLPNNKVCAVYRLKCLARRFKKDMQYYSDYTAFMNEII FT ACGDAEKVPPEEAEKCSAWYIPHHGVYHPKKPGKIRVVFDCSARFQGTSLN FT DHLLTGPELTNTLVGVLCRFRKNPVAIMCDIERMFHQFHVKAEDRDYLRFL FT WWDNGNLEAQPSVYKMKVHLFGAASSPGCANYGLKHLAAEGKGMFTEDVIR FT FIQRNFYVDDGLVSVPTEAEAIALVTEAKKLCSTGKLRLHKFISNSKNVIA FT SIPQEDLAKGVKDLDLDLGEMYVERALGVQWSIASDEFQFRVIVKEHPFTR FT RGVLSTIASVFDPLGFMAPFILVGKKILQQLCYNKASWDDSLPEDLQSRWE FT RWLLDLPNLADVKIPRCYVPQISNEAIRFELHHFSDASISGYGECSYLRTV FT SASGEVHCSFVMGKSRVAPTKITTIPRLELSAAVVAVRVSDMLKKELDIKD FT LQECFWIDSKVVLGYINNDARRFQVFVANRVQRIKLSTESNQWKYVGTEDN FT PADSASRGLMVEQLVASSWLKGPKFLWQKDLPSDEVKVGDVITSNDPELRK FT VQVFNIQANKEKSLLERLLKFSDWARMVKAIARLKRYVREAKGFESKAFEP FT TTLEQRKDAELFIIRLVQEATFSHEMTSLKKQLAFTSDAQTKSQLYKLSPF FT LDNQGILRVGGRLANATLHLDVKYPAILPRKNHFSALLIKHYHERVQHQGR FT GMTVNELRSNGIWILGCSSAVSSHIYKCLKCRKFRRCVEEQKMADLPEDRV FT QPTPPFLLCGMDCFGPFYVTDGRRQLKRYGLLLTCMCSRAIHIEMLDALST FT DAFINALRSFIAIRGTVRQIRCDQGTNFVGASREFMSTLKTMDQGQIKKLG FT CELIMNIPSASHMGGVWERQIRTVRNVLTSVLDQCPGALDSTSLRTLLYEV FT MAIVNSRPLTVEHLNDPLGPEPLTPNHILTMKSMIVEPPPGEFVKEDLYLR FT KRWRKVQYLVNNFWSRWRKEYLLNLQQRQKWNKNRRNVKVNDIVILKDDVS FT PRNKWKLAKVVDVYQGKDGKVRQVKLLMSDSSLDGKGKRLSKPSYLERPIQ FT KVVILLEAD" XX SQ Sequence 6706 BP; 2114 A; 1141 C; 1542 G; 1909 T; 0 other; gtaagaactc aaggctcatt gcgagtatga aagaagaagc aggacaattt aaccaatgaa 60 ctcagtcaag tgttctgttg tgcaaagcaa cgaagtgaaa ttatacatgg ttcagcaata 120 tcagaaggaa aggacattgc cggtgtaata tggagacgca agcataactt gaagagcagc 180 acaagtcaag accagcagta agagtggtgc agttaagtgt gattaaaggt acctaactgt 240 ggttaatgac tgcaaagact ttaagtattg tttaatattg tcatgttcct ttcaaaatgt 300 tggtggttta ttattgactt gtatgtaaat taagtactgt ttaactgcct gtttctattg 360 ctgtgaaata agggtaaatt tcactttcag tgtttaatca aattaaggca tgatttatgg 420 tcttaacttt aaattaagga ataaataata aagaaaagga ttttagaagt gaaaaaggag 480 aagtttgaac aatgtcagat ttaggtgagc ggcccatgga tgatagagtg aatcagcaac 540 ttgcaactca gccctcagtg agaagtaagg tggcaacgga aacttctgat gctgtggaga 600 ctgagctacg aagaggaaag aggactcgta agctgactga aaagggtcag aagttgcaag 660 aagagaagtt ggaaaggtat gaacagagat tcaaggtcag ttatgataaa tggaaatctc 720 ttgtcaaaga agtaaagtgg ttattatctg agtcttcttc aattgaatca ttacaagatg 780 ttatggcaag aatgacaaag gcttctgcag ttgtaggcac tgggtatgaa gatctacgcc 840 atttgagtat tcctgaccat gatacacgcc gccgagtaga ccattgtgaa gcagtcacaa 900 agaaaatgga ggttttggtg aagcattatt tcgaagggaa taaggaaatg gagtctggat 960 ggagtgacat tgagttatct gaagaggtca gtgtcggcac aaggatttct agaacatcta 1020 catcgtctca aagtaagtca agatcttcta agttttcaag tttatctgct gccaaaaaac 1080 aagaagctgc tgctgaggtt gctgctaatg aagccacttt agaagtctta aaggaacagg 1140 atcgtcagtt taaggaactt cagaagcttg aagctgaaga taagaagcgt atagctgaac 1200 aagaagctga tgatttgaag cgccgtgaag aagaggctaa ggtgaaagct cagcacgagg 1260 tggaaaatgc agctagacgt aagttattag aagataaacg ccgcgaactg gaacgtttgg 1320 agacagttaa gaagttaaat gcagctaaag cacgtatgca gatatatgaa cagagtgaag 1380 tttcagaaga tgatgaactt ttgagtgact tgtctcatga ctgtgaattt aaagaaatcc 1440 atgtggataa aagacaccac agacctgata atttgtcaac aaaaaatgtc actttacagc 1500 caacagtgac aactccacaa atagataata cttctcttgt caaggctttg actgaatcta 1560 tgagtgtcaa tcgacttcct gtgccagaac caatggtttt taatggggat ccacttaagt 1620 tcaatgattg gaaaacgtcc tttcaaatgt taatagagaa gaaatccatt cctgcagtgg 1680 agaagatttt ctatctacgg aggtatgtag gagattctgt caaaaaggct cttgaaggcc 1740 attttcttct tggaacagat tcggcttatt gttcagcttg gaagattttg gaagaaagat 1800 atggaaatcc atttcttata gccaaatcat ttcgagagaa gctttatgct tggccaaaga 1860 ttggtcccaa ggaaagtttg gaactacaag agtttgtaga tttccttcgt agttgtgagg 1920 ctgccattcc tcaaattaaa gaacttgaag tgttaaatga ttgtaatgag aaccaaaaaa 1980 ttcttgccaa gctgcctgat tggttaactt caagatggaa tagagaggtg atggaaactg 2040 aagaaaatac taaaaccttt ccaagtttca gccaatttgt taagttcctc acaaaggaag 2100 ccaagattgt gtgcaatcct ataacatcac tttatgctct gaaatcaagc gataaagaag 2160 gaacaaggac gttgaagact cgaagctctg tgggaaaggt gttggcgact aattctggtg 2220 gagacaacct cgtcttggtg agttgtgttt tttgtcaaaa aaatgatcat agtcttcata 2280 aatgtcataa gttcatggag aagacagttg ttgaacgaca caagtttgtc tataagaaga 2340 tgttatgctt tggatgtttg aagcctggac atcaatcaag gatttgtcaa aacagaagtg 2400 tctgtgacat ttgtgcaaaa aaccatccta cctgtcttca tgaagatcgt tcaaggagtg 2460 ttggaaatca tggggtgact gaaagcttta agaagagtac agatctttat cagcatagac 2520 acacaagagt tctaaatgag gcaacttcga accaggtaat gcaagaaatt aataatacct 2580 gcacttcttc tattatacct gtatgggtgt ctacaacatc tgaaccagaa aaagaggttc 2640 tagtatatgc tctccttgat acacaaagtg atacaacctt tattcttgaa gagactacac 2700 atgcattgaa tgttaagaag gaagcagttc aattgaaact tgcaactatg gcttcaaaga 2760 ttacagtcgt tccttgtttt aaaatgaagg gtcttcaagt tagagggttt tattcgaaaa 2820 ctaggattcc tctgccaaca acctattcaa gagagtttgt tcctgccagc aaagatcaca 2880 ttcctacacc acaaacagca agagtatggc ctcatcttga acatattgca gatgtccttg 2940 ctcctcgaca aggttgtgat gttggcttat taattggata caactgtcca caggctcttc 3000 ttccacgaga ggtggtggct ggtgaaggaa atcaaccttt tgcacagagg accgatttgg 3060 gatggagtgt aattggtcat ggtaatccat gtgttgactc tggcgatatt tatggaacaa 3120 gtcatcagat tattgtgagg caagtaatgc caagtatacc tgcttctgtc aatctcaaaa 3180 cgaaggtgca ctatgtctgt cgaacacaaa tcaaagaagt ggtttctcca atagatgtca 3240 tcagggtgtt tgaatctgat ttttgtgagc aagaatctga ggattgctgt atgtctcaag 3300 aggatcttcg attcctaggg aaaatgaaag caggtacatc attaaataag gctggacatt 3360 atgagatgcc actaccattt aaaggtgaca ggcctaactt gccaaacaat aaagtctgtg 3420 cagtatatcg tctcaaatgt ttggcaagaa gatttaagaa ggatatgcag tattacagtg 3480 attatactgc tttcatgaat gaaatcattg cctgtggaga tgctgagaag gttccacctg 3540 aagaagctga gaaatgttca gcatggtaca ttcctcatca tggagtgtat catccaaaaa 3600 agcctggaaa aattagagtt gtttttgatt gttctgcaag gtttcaagga acttctttaa 3660 atgaccatct tttgacaggc ccagagctga caaatactct agtgggtgtc ttgtgtcgtt 3720 tcaggaaaaa tccagttgcc attatgtgtg acatagaacg catgtttcat cagtttcatg 3780 tcaaagcaga ggaccgagat tatcttcgat tcttatggtg ggacaacggc aatttggaag 3840 ctcaaccctc agtgtataaa atgaaagttc atttatttgg tgcagcttct tccccaggtt 3900 gtgcaaatta tggtcttaaa catcttgctg ctgaaggaaa aggaatgttc actgaagatg 3960 ttattaggtt catccaaagg aacttttacg ttgatgatgg attagtaagt gttccaactg 4020 aagctgaagc aattgccctt gtgactgaag ctaagaaact ctgtagcact gggaaacttc 4080 gacttcacaa gttcatttcc aacagtaaga atgttatcgc ttcaattcct caagaggatc 4140 ttgctaaagg agtaaaggat ttggatctgg atctgggtga gatgtatgtg gagagagcac 4200 ttggtgttca atggtcaatt gcatctgatg aattccagtt tagagtcatt gtaaaggaac 4260 atccatttac tcgcagagga gtgttgtcaa cgatagcatc tgtgtttgat ccacttggat 4320 ttatggcacc attcattttg gtgggcaaga agatcctaca acagctgtgc tacaacaagg 4380 ctagttggga cgattcattg cctgaagact tgcaatctcg atgggaacgt tggcttttag 4440 atctgccaaa tttagctgat gtgaagatcc cacgatgtta tgttccacag ataagtaacg 4500 aagccattcg ttttgaactc catcatttct ctgatgctag catctcagga tatggggaat 4560 gctcctatct tcgaacagtc agtgcatcag gtgaggttca ttgctctttc gtcatgggaa 4620 agtcacgagt tgcaccaaca aaaataacta ctattccaag actagaactt tcagctgcag 4680 tggtcgctgt acgggtcagt gatatgctga agaaagaatt ggacattaag gatttacagg 4740 aatgtttctg gattgattct aaggtagttc tgggatacat caacaatgat gctaggagat 4800 tccaagtttt tgtagctaat cgtgttcaac gtatcaagtt gagtactgaa tcgaatcaat 4860 ggaaatatgt aggcactgaa gacaatcctg cagattctgc ttctagaggt ttgatggttg 4920 aacaactagt ggcgtcaagt tggctcaagg gaccaaaatt cttgtggcag aaagatcttc 4980 ccagtgatga agttaaggtg ggagatgtga tcacaagtaa tgatccagag cttcgaaagg 5040 ttcaagtttt caatatccaa gcaaataaag aaaaatcatt gttagaacgt cttctcaaat 5100 tctcggactg ggcaaggatg gtgaaggcca ttgccagact taaacggtat gtaagagaag 5160 ctaaaggttt tgaatccaaa gcttttgaac ctactacttt ggagcaaagg aaagatgctg 5220 agctctttat cattcggtta gttcaagaag caaccttttc ccatgaaatg acgagtctca 5280 agaaacaact agcatttaca tctgatgctc aaaccaaaag ccagttgtac aagttaagtc 5340 ctttcctgga taatcaaggc atactgcgag ttggaggtcg cttggcaaat gcaactcttc 5400 atttagacgt aaaatatcct gccattcttc caaggaaaaa tcatttctca gcattactaa 5460 tcaaacatta tcacgaacga gtacaacatc aagggcgagg aatgaccgtg aatgaacttc 5520 ggtccaatgg tatatggatt cttggatgca gtagtgctgt ttcttctcac atatacaaat 5580 gcctgaaatg taggaaattt agaaggtgtg tagaagaaca gaaaatggct gatcttcctg 5640 aagatcgtgt gcagccaact ccaccttttc tattgtgtgg aatggattgt tttggacctt 5700 tttatgttac cgatggtaga aggcaactca aacgctatgg gttattgttg acttgtatgt 5760 gttccagagc catacacatt gaaatgcttg atgctctgtc tactgatgca tttattaatg 5820 ctctccgctc ctttatagcc attcgtggaa cagttcgcca aattcgatgt gatcagggta 5880 cgaactttgt gggggcaagc agagaattta tgagcacatt gaaaacaatg gatcaagggc 5940 agataaagaa acttggctgc gagttgatca tgaacattcc atcagcaagt catatgggag 6000 gtgtctggga aagacagatc cgtaccgtaa gaaatgtttt gacttctgtt cttgaccaat 6060 gtcctggggc acttgacagt acttccctgc gaactctgtt gtatgaggtt atggctattg 6120 tgaatagcag gcctttgact gttgagcatt tgaatgaccc tctaggtcca gaacctctta 6180 ccccaaatca catcttgaca atgaagtcta tgattgtgga accaccacct ggagagtttg 6240 tcaaagaaga tctttatctc cgtaaacgat ggcgcaaagt gcagtattta gtcaacaact 6300 tctggtcgcg ctggagaaaa gagtaccttt taaatcttca gcaaagacag aaatggaata 6360 agaatcgcag gaatgttaaa gtgaatgaca tagtaatcct taaagatgat gtttcaccac 6420 gtaacaagtg gaaattggca aaggtggtag acgtgtacca aggaaaggat ggcaaagtga 6480 gacaagtcaa gttgttaatg agtgactctt cgcttgatgg aaagggtaag cgattgtcta 6540 aaccaagtta ccttgagcgt ccaatacaga aggtagtcat tcttcttgaa gctgattgaa 6600 gactgcttga tttcacacta tcactccatg ggttgcatta tttttcataa ttgtgtttaa 6660 gtgtgattta taaaacagga aaaaatcaca agtgatttgg tgggag 6706 // ID Gypsy155-LTR_DR repbase; DNA; ZEB; 780 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy155-I_DR; KW Gypsy155-LTR_DR; LTR68_DR; LTR_Retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-780 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1572-1572 (2008). XX DR Genome; chr4; Positions 7943130 7943909. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 780 BP; 166 A; 138 C; 140 G; 336 T; 0 other; tgttatgatt tatgtttttt ttttcctgtt tttttcaccc tcctctccgt ttgctcaacc 60 ctaggtttct tattggttta ttattctgtc catcatcatt atgacgctct ccggttccac 120 caatcccagg acgagtcaca agtatttaaa gccctgtcag tccagtgtgt ggtgtgcgcc 180 ctgcctcggc gtaccctttg tgctccacgc aatttgtttg tgcaaatgct tcgtctattt 240 tgtgatcgct cagttactgt gataattcga gttgcttgtt gaactttgtt ggctttaagt 300 aacatatgat gatggttgtg tgattctaaa ctaagattta gcggacttat ccgttttgtt 360 ttgttttagt ttagattgat gttttgttag attagtttga gtgttttgcc acccgtgtat 420 ttttgagttt tgtagtttga ttagtttagt tcgggtgtcg tacacactga agatctcggt 480 ttttatttgt gtttcttttt gacgcctagt ttagacatta gatttaagga ttgttttgtt 540 ttgttaattt ctttgatttg gcttacactc gttgttcttg tttaaagata ataattataa 600 atgtttgttt gttttttttc tcccccattt attattgtta catgtttcaa ttcaattata 660 gagtaaaagc tttaaatgag agttactgaa tcttgcctca ttattgatct gtcgcacaca 720 ttacacctgt ctcacttaaa tgttacagca tcccacttta acatcataaa acacgtaaca 780 // ID LOOPERN5_DR repbase; DNA; ZEB; 1237 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE LOOPERN5_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; Dr000155; KW Dr000190; LOOPERN5_DR; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bao Z.; RT "Dr000155, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 957-1237 RA Bao Z.; RT "Dr000190, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [3] RP 1-1237 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN5_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(6), 20-20 (2002). XX DR [3] (Consensus) XX CC Several thousand copies of LOOPERN5_DR are expected to populate CC the CC zebrafish genome. LOOPERN5_DR copies are ~12% divergent from the CC consensus sequence. CC This element is characterized by 11-bp terminal inverted repeats CC and putative TTAA targets site duplications. CC Its classification is not very certain yet, although it is CC expected to be a member of the piggyBac/Looper superfamily [3]. XX SQ Sequence 1237 BP; 383 A; 255 C; 258 G; 340 T; 1 other; aggggtggtc cactacgata tcatatttta aactttagtt gatgtgtaat gtagctgtgt 60 gaacataaac aacatctctg aatgtaatac gctcaaagtt caatgcaaag ggagacattg 120 gcttttacag agttagctta gcaaagccta cagcgaacaa agtttgggga ctacaaaaaa 180 aatacatccg ggttagtgag atcacaaacg cttcaggtta cgcgcattca ccgcgcacac 240 acacgcgcag caaaggggcg tggccagagg cgctgtaatg ttatagcaga gaaagctaaa 300 atgccgtcca aacgctgcta tttccacaga gcttgttctg tttctgtatt tgggcttcca 360 aaggacacga cacaaagaga gaagtgctta cagtttaatt ttaattatgt tccagagaat 420 tataaaaaat atatagctag catttgacaa aggacagctt ccagaatctc tctcagttca 480 gtgctggatt cggctaaaaa ctcctccaac cataatagac gaagctgtgg attgtgagcc 540 acaacctgta agtgttttta tttgttaaaa ttgatctatt acatgcacag tktctagcgt 600 taacggtatg ttgtagcgag gacgtaaaca aggatgtaaa caacgggaaa tgctgtttgg 660 caccgttaac aatttagcta caaattcata tttatcagtc aaaccgctgt aaacacccac 720 aatcttcagc agcgctgcag tgtctctcta tgcggctgct ttccctgcgt tctacatctc 780 aaataacaaa ctcgcaaaag atatgtgaac gtttcatatt acttacacat gcttattctg 840 aatatatgtg aaagacactt gtcagatttt attttagaga gcaggcgtga ggttcagctg 900 tgtccttttc attttctgtc tgattcaggc ttaaactgac acggctaaca gctactctga 960 ctgactgttt acacagagca gtaggggcgt gacgtggagc cgatccgcga atcacagcac 1020 attatgttag ctgaccaatc agagcctctt gagggcgggc ctttcggagg aactaggaaa 1080 tatgacagtc gttttcatgt tagccgagta gctgtatata atcaaagtaa gatatatgaa 1140 aaaataacgt gattttctac aagtgaagca tgagcacaca ttgctttgca tcttataaac 1200 acaaccaagc cttaaaaata cactctggac cacccct 1237 // ID BEL2-I_DR repbase; DNA; ZEB; 5897 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL2-I_DR; KW BEL2-LTR_DR; BEL2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5897 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1355-1355 (2008). XX DR Genome; chr16; Positions 35789393 35783497. XX CC Positions [4905-5465] - Integrase core CC 'CATTG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 42..5834 FT /product="BEL2-I_DR_1p" FT /translation="MQQSEHASWALPRPRRQVRQPTYLQDYEVSAVRPKSH FT TSRNFNLSLPVPSESVHSQQPHIQWRRPPETQAEYALDPENQYEDISRGWP FT LNPKPCTQVDIEQSELIRGSNANSYTSLYGQRREPMLSALNQAPVAEQCDP FT LFESWPVSSMPVAEAPLMPEEENEEVDTPPPPRPSSNYLPEPTVEEHRVFT FT AIDRMMSELQLMRDRMTPNRIAQTHPHSTHQSHNSPHHSRGLEHNTWCQTQ FT QSGPNISSFGHPPSMHSTSRGPLPQPRVRPYMSQPQNARSREQSWVSTCQS FT GGEYRGPAPKIPQFIHNDPMEFSRFKLALTNLLPADAPELFKYQILVDHVK FT LEEACLIADSFINSPRPYTDTMNALTEKFGQPHHIALRRIAALMESPEIKR FT GDLMAFERFSLHVQSLVGMLQTLGPDGDVELSCGSHVARLLTKLPIEMRAN FT FRRSRFHHPRHTPTLLDLAEWLRYESWCQGYDTTPEPRTSRVTGLRADKRF FT NKPVATVLHGSESSTSNRAQSESQSPSSSKQKSHIKPYCPYCNNTEHFLNK FT CQAIQKLTREQIVEWIKSHKRCWRCGRAHQASQCDLKRLCDICQKRHLRVL FT HDANTRPEESKAESCLVNTSLDVLYLDKPSSHARVLLKVVRVTLQNGDKTL FT DTYAILDDGSERTMLLPEAVNQLNLDKRPETLALRTIRQEVQTLNGASVSF FT KISSTSKPKKTFQIKNAFTSKRLGLADHSYPVASLKKRYKHLADLPLESFA FT FVKPLVLIGADYPQLISPVEPVRLGPPGGPAAIRTRLGWTLQGPAHLIQWA FT PTTQQCLHTSISSPTLELMQNVERLWKMDALPYQSTRLAVRSKEDKEALEL FT LEVKTKRVQDAGIFRYATPLLRKQGVPPLRAQKESVLPSLRALERRLARDP FT QQAEEYCAAISKLEEAGAIRRLTTSEVLDSAESYYIPHHVVRHNGKSRIVF FT NCSFPYQGRSLNDTLLPGPTLGASLLGVLLRFREHAVAISSDIRGMFHQVQ FT LLPEDQPLLRFIWRDMKRADPPAVFQWQVLPFGTACSPCCATFALHRHARD FT HSGVNNAVLHSVEHCFYVDNCLQSLPTAEEAKELVDQLRDVLASGGFELRQ FT WASNVPDVLRHLPAEVRSDSMERWLSHDEPGLPEPALGLSWHWGSDTLGYR FT SRPVEYKTLTMRNIYKVLARQYDPLGFIAPYTTRAKLIVQSMWDKPRDWDD FT PCIPPDLQKAWTTWESELHWLPHIAVPRPYAPGLTEPMVVSRQIHVFSDAS FT TKAYGSVAYIRTEDVQGRIQVAFLAARSRVAPRRQHSIPRLELCGALTAAQ FT LAKKIETELTIKTDKPILWSDSTTVLTWLHSESCRFKVFVGTRIAEIQELT FT AGSTWLYVDTAQNPADDVTRGKALAELAVPNRWSQGPLFLHKPSTDWPSQP FT GELLEQDISEYKKVTLCGTISKANRQEAQGALTYQSWNDLMESVAQEAYGA FT ANQDTCLTAEDYRRAEKSIFSRIQAECFPEELHCLKAGKAVPRSSRLASLS FT PELDPSEGFIRVGGRLRRAEGLDTAFKHPILLDPSHHMTKLLIRDYDERLC FT HPGPERVFAEIRRTFWILRGREAIRREQHQCQGCQRWKAKPSVPKMADLPL FT ARLRLYQPAFYSTGVDCFGPFQVRIGRRSEKRWGIIYKCLTTRAVHLDLLH FT AMDSDSFLMSLRRFIARRGSPAELYSDQGTNFRGGEKELSEWFSRMSPDLQ FT QLLAKQKIAFCFNPPAAPHFGGTWEREIKSVKMALNTVIGAQPISEEVLRT FT VLLEVEAILNSKPLGYTSSNVADLDAVTPSILLMGRLDGTLPPVVYPKSEG FT LSRRRWRHCQVLADHFWARFIRCYLPTLQCRQKWRDTQANLTNGAVVMLMD FT PQYPRALWPIGRVIKVHPSVDGCVRSVDVQVKDRVYTRPVARLILLPAIPA FT ENVEEMA" XX SQ Sequence 5897 BP; 1587 A; 1485 C; 1436 G; 1389 T; 0 other; tttattggtc cttcgagccg gattgtagtt gtgctaggat tatgcagcag agtgagcacg 60 catcgtgggc cctcccacgg ccaagacgtc aggtgagaca gccaacatat ttacaagatt 120 atgaagtgag tgctgtgcga cccaagtcac atacttcaag aaattttaat ttgtccctgc 180 ctgtccctag tgagtcagtg cacagccagc agcctcacat tcagtggcgc agaccccctg 240 agacacaagc agagtatgcg ttagatccag aaaatcagta tgaggatatc agccgtggat 300 ggccacttaa ccctaaacca tgcactcagg tggacattga gcaaagtgag ttaatcagag 360 gaagcaacgc gaacagttat acttcactat atggccagag aagggaaccc atgttatcgg 420 ctttgaacca agcaccagta gctgaacagt gcgatccact atttgaatca tggccagttt 480 cttctatgcc agtagcagag gcccctttga tgcctgagga agagaatgaa gaagtagata 540 cgccacctcc cccccggcct tcctctaact acctaccaga gccaacagtt gaagaacacc 600 gtgtgtttac tgctattgat cggatgatga gtgagttaca gttaatgaga gacagaatga 660 cgccaaaccg catagcacaa actcatccac attcgacaca tcagagtcat aactcccctc 720 accatagtag aggcttggaa cataacactt ggtgccagac gcaacagtca ggacccaaca 780 tatctagctt tggtcacccg cccagtatgc atagcaccag tagaggtcct ttacctcagc 840 cacgagtaag accttatatg agtcagcctc agaatgcaag gagccgagag caatcatggg 900 tatccacatg tcaatcagga ggagagtatc gggggcctgc tccgaaaatt cctcagttta 960 tccataatga ccccatggag ttttcacgat tcaagttggc tctgactaat ctgttgccag 1020 cagatgctcc tgaactcttt aagtatcaga tacttgtaga ccatgtgaag cttgaggagg 1080 cctgtctcat cgctgactcc ttcattaatt caccccggcc atacaccgat acaatgaatg 1140 ctttaacaga gaagtttgga cagccacacc acatagcctt acggcgaata gctgctttga 1200 tggagtcacc ggagatcaag agaggagatc taatggcctt tgaacggttc tcgctgcatg 1260 tccagtcact ggtaggaatg ctccagactt tgggcccaga tggagatgtc gagctgagct 1320 gcggctcaca tgtagcacgc ttacttacca agcttcccat cgaaatgaga gctaactttc 1380 gtcgttcaag gttccatcat ccgagacata cccctaccct gctcgactta gctgaatggt 1440 tacgctatga gtcatggtgc caaggttacg acaccactcc agaacctcgc acctcacggg 1500 tgactggatt aagagcggat aaaagattta ataagcctgt tgctacagtt cttcatggtt 1560 ctgagagtag tacttcaaac agagctcagt cagagagtca gagcccatcc tcctccaagc 1620 agaagagcca tatcaaacct tactgcccct actgtaacaa tacagagcac ttcctcaata 1680 aatgtcaggc cattcagaag cttacacggg aacagatcgt agagtggatt aaatcgcaca 1740 aacggtgctg gagatgtgga agagcccatc aagcctctca atgtgacctg aagaggctct 1800 gcgatatctg ccaaaagagg cacttaagag tactacatga tgctaatact cgtcctgaag 1860 agtccaaagc tgaaagctgc ctagtgaaca ccagcctgga tgtcctgtat cttgataaac 1920 catcgtctca tgctcgagta ctgctaaagg ttgtccgagt caccctacaa aatggagaca 1980 agacactaga tacttatgca atcttggacg atggatcaga gaggacaatg ctcttaccag 2040 aggcagtcaa ccaacttaat ttggataaga gacccgaaac cctcgctctc cgtacaatcc 2100 gtcaggaagt ccagacattg aatggagctt cagtctcatt taaaatatca tccacgtcga 2160 agcccaagaa aacattccag atcaagaatg cttttacttc caagcgctta ggcttggcag 2220 accactcgta ccccgtggcc agtctcaaga agaggtacaa gcacctagct gaccttccct 2280 tggagtcctt cgccttcgtc aagccgcttg tgctgattgg tgcagactac cctcagctga 2340 tctctcctgt tgaacctgtg agactaggcc ctccaggagg tccggcagcc attcgcaccc 2400 gattaggctg gacactccaa ggaccggcac atctcatcca gtgggctcca actacccagc 2460 agtgtcttca cactagtata tcatcaccaa ccctggagtt gatgcagaat gtagagaggc 2520 tctggaagat ggacgctttg ccgtatcaga gcactaggct ggctgtgagg tcaaaggagg 2580 ataaagaggc gttagaactt ctggaagtca aaacaaagcg agtacaagac gcaggcatct 2640 tcagatatgc tacacctttg ctgaggaaac aaggtgtacc ccccctcaga gctcagaaag 2700 aaagtgtctt acctagcctc agggccctag agagacgtct ggccagggac cctcaacaag 2760 ccgaggagta ctgtgcagcc attagtaagt tggaggaggc aggagccatt aggagactca 2820 ctacaagtga agtattagac tctgctgagt catattacat cccgcatcat gtggtgcgtc 2880 ataatgggaa aagtcgtata gtcttcaatt gctctttccc ataccaaggc cgtagcctta 2940 acgacaccct cctaccagga ccaactcttg gagcctctct tctgggagtt ctgctacgtt 3000 tcagggaaca tgccgtggca atcagtagcg acatcagagg catgttccac caagtccaac 3060 ttttaccaga agatcagccg ctcctcagat ttatatggag agacatgaaa cgtgccgatc 3120 cacctgcagt ctttcagtgg caggttctcc cctttggtac tgcatgttca ccttgctgtg 3180 ccacgttcgc actccataga catgcaaggg atcacagtgg agtaaataat gcagtccttc 3240 attcagtgga acattgtttc tacgtggata attgtctcca gagtctgccg acagctgagg 3300 aggccaagga gttagttgac cagctcagag acgtgctggc atctggaggg tttgagctca 3360 gacaatgggc ctccaatgta cctgatgttc tccgacactt acctgctgag gtgaggtccg 3420 acagtatgga gaggtggctt tcgcatgacg aacctggtct tccggaacca gccctgggcc 3480 tcagttggca ctggggatca gataccctcg gttatagaag cagacctgtg gaatacaaga 3540 cactgacaat gaggaacatt tataaagtgc tggcccgcca atacgatcct ttgggattca 3600 tagcgcccta cactactcga gcaaagctca ttgttcagag catgtgggac aaaccccgtg 3660 actgggatga cccatgtatc ccccctgact tgcagaaggc atggacgaca tgggaatctg 3720 agcttcactg gctacctcac attgctgtac ctcgtcctta tgctccaggc ctcactgaac 3780 cgatggtggt ctcaagacag atacatgtat tcagcgatgc ctccactaaa gcgtatggct 3840 ctgtggcata cattaggact gaagatgtcc aaggacgaat acaggtagcc ttcctcgctg 3900 cacgatctag agtggcacct cgtcgccaac actcaatccc tcgccttgaa ctttgtggag 3960 cgttgactgc tgctcaatta gcgaagaaga tcgagacaga gctaaccatc aagacagata 4020 aaccaatact ctggagtgac tcaacgacag tattaacctg gttgcattcg gagtcatgcc 4080 ggtttaaagt ttttgtgggc acgcgtattg cagaaattca ggagctaacc gcagggagta 4140 cctggcttta tgttgacact gcccagaacc cagctgacga tgtgaccagg ggaaaggcct 4200 tggcagagct tgctgttcca aatcgttgga gtcaaggacc gctctttctt cataaaccct 4260 ctacggactg gccatcacaa cctggagaac tccttgagca ggacatttct gagtacaaga 4320 aagtgactct atgtggaaca atctccaaag ctaacagaca ggaggctcaa ggggccttaa 4380 cttaccagtc ttggaatgac ctgatggagt cagtagcaca ggaggcatat ggggcggcca 4440 atcaagatac ctgtttaact gcggaggact atcgtcgagc ggagaagtct atcttcagta 4500 ggatccaagc agagtgcttt ccagaagagt tacattgtct taaagcaggt aaagctgtgc 4560 ctcgcagcag ccgtcttgca tctctatcac cagagctcga tcccagtgag ggatttatac 4620 gagtaggagg gagactccga cgagctgaag ggttagacac agccttcaag caccctattc 4680 tcttagaccc gtcacaccat atgacaaaac tcctcattcg ggattacgat gaacgactct 4740 gtcatccagg tccagagaga gtctttgcgg agatacgtcg cacgttctgg atccttcgtg 4800 gaagggaagc cataaggaga gagcaacacc aatgccaagg gtgccagcga tggaaagcta 4860 aaccctcagt tccaaagatg gcagatcttc ccttagctcg ccttcgccta taccaaccgg 4920 cattttactc aacaggcgtt gattgctttg gtccatttca agtgaggata ggaagacgtt 4980 ctgaaaagag atggggcatt atttataaat gtcttacaac tagagcggtc catttggatc 5040 tcctgcatgc catggattcg gactcgttcc taatgagctt aaggcgattc attgcgcgcc 5100 gtggcagtcc agctgagctt tattctgacc agggcacgaa tttcagagga ggtgaaaagg 5160 agctgagcga atggtttagt cgcatgtctc ctgaccttca gcaactcctc gccaagcaga 5220 aaatcgcctt ctgttttaac ccccctgcag ctccacactt tggggggaca tgggagaggg 5280 agattaagtc tgttaagatg gccttgaata ctgtcattgg agctcagccc atatctgagg 5340 aagtcttacg cactgttcta ttggaggtcg aagcaatact aaactctaaa ccattgggtt 5400 acacttcctc aaatgttgcc gacctggatg ctgtgacgcc aagtatcctt ctaatggggc 5460 ggcttgatgg cacactgccg ccagttgtgt atccaaagag tgaaggatta agccggcgga 5520 gatggcgtca ctgccaagtg ttggctgatc acttctgggc aagatttatc aggtgctatc 5580 tgcccacact acagtgccgc cagaagtggc gtgatactca agccaacctg acaaatggag 5640 ctgtagtcat gttgatggat ccacagtacc ctcgagcact ctggcctatt ggcagggtga 5700 ttaaagttca cccaagtgtg gatggatgtg tacgatctgt agatgtacaa gttaaggaca 5760 gggtttacac ccggcctgtt gcccggctta ttcttctacc tgcaatccca gctgagaatg 5820 ttgaggaaat ggcttgaatg aatcctaatt aacagtacac ttttgcatgg tcaaattgca 5880 agcaatttgg gggcggc 5897 // ID Tc1-4_DR repbase; DNA; ZEB; 1572 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 25-SEP-2007 (Rel. 12.1, Last updated, Version 2) XX DE Tc1-4_DR is an autonomous DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; TIR; KW Autonomous DNA transposon; TA target site; Tc1 superfamily; KW Dr000076; Dr000078; Tc1-4_DR. XX NM Tc1-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1188-2 RA Bao Z.; RT "Dr000076, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1424-614 RA Bao Z.; RT "Dr000078, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [3] RP 1-1572 RA Kapitonov V.V. and Jurka J.; RT "Tc1-4_DR, an ancient Tc1-like autonomous DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 23-23 (2002). XX DR [3] (Consensus) XX CC Tc1-4_DR copies are flanked by the TA target site duplications CC generated upon their integration in the genome. Tc1-4 has perfect CC 51-bp terminal inverted repeats that belong to ~200-bp imperfect CC TIRs. There are approximately 1500 copies of Tc1-4_DR harbored by CC the zebrafish genome, they are ~13% divergent from the consensus CC sequence. The reconstructed 340-aa transposase, Tc1-4_DRp, is CC encoded by this transposon (positions 350-1369). XX FH Key Location/Qualifiers FT CDS 350..1369 FT /product="Tc1-4_DRp" FT /translation="MGRGSPVCQQICEKIIEMFKNNVPQRKIGRHLDISPS FT TVHNIIKRFKESGGISVHKGQGCKPKLNNRDLRSLRRHCIKNRHSSISDIT FT TWAQDYFGKPLSSTTIHSYIHKCQLKLYCAKRKPYVNSVQKRCRLLWARRH FT LGWTITQWKCVLWSDESVFQVFFGRNGRRVLRTKEEKDHPDCYQQQVQKPG FT SVMVWGCVSALGKGNLHFCDGTINAEKYIEILEHNMLPSRRYIFQGRPCIF FT QQDNAKPHSAHITKSWLRRKRIQVLDWPVCSPNLSPIEKVWCILCGKMLQR FT RPCTVAHLKTCLQEEWDKITPETLHHLVSSVPKRLLSVVKRNGNITKW" XX SQ Sequence 1572 BP; 516 A; 294 C; 326 G; 436 T; 0 other; caaccccaaa tcagaaaaag ttgggacagt atggaaaacg caaataaaaa agaaaatagt 60 gatttccaaa tttactttga cttgtatttc attgcagaca atatgaacac aaaatatttc 120 atgttttgtt tgtggtcaac ttcatttcat ttgtaaatat acatcctttc ctgtcattca 180 gacctgcaac acattccaaa aaatgggaca ggagcaattt agggctagta atcaggtaaa 240 ttggttaaat aatgatgtga tttgaaacag gtgatgtcaa caggtgattg taattatgat 300 ttggtacaaa agcagcatcc aagaaaggtc tagtccttta ggagcaaaga tgggcagagg 360 atcgccagtt tgccaacaaa tatgtgagaa aattattgaa atgtttaaaa acaatgttcc 420 tcaaagaaag ataggaagac atttggatat ttcaccttca acagtgcata acataattaa 480 aagattcaag gaatctggag gaatttcagt gcataaagga caagggtgca agcctaagct 540 gaacaaccgt gatctccgat ccctcaggcg gcactgcatc aagaatcgtc attcatctat 600 aagcgatatc accacatggg ctcaggacta ctttggcaaa cctttgtcaa gtaccacaat 660 acatagttac atccacaaat gccagttaaa actgtactgt gccaaaagga agccctatgt 720 taacagtgtc cagaagcgct gtcgacttct ctgggctcgg aggcatctgg gatggaccat 780 cacacagtgg aaatgtgtac tgtggtcaga tgaatcagta tttcaggtat tttttgggag 840 aaatggacgc cgtgtgctcc ggaccaaaga agaaaaggat catccagact gttaccagca 900 acaagtccaa aagccagggt ctgtcatggt atggggttgt gtcagtgccc ttggcaaagg 960 taacttgcac ttctgtgatg gcaccattaa tgctgaaaag tacatagaga ttttggagca 1020 caatatgctg ccttcaagaa gatatatttt ccagggacgc ccatgcatat ttcaacaaga 1080 caatgcaaaa ccacattctg cacacattac aaagtcctgg ctgcggagga agaggataca 1140 ggtacttgac tggcctgtct gcagtcccaa cctgtctcca atagagaaag tgtggtgcat 1200 tttgtgtggc aaaatgctac aacgaagacc ctgtactgtt gcccacctta agacttgttt 1260 gcaggaagaa tgggacaaaa ttacacctga aacacttcat cacttggtgt cttcagtccc 1320 taaacgtctt ttaagtgttg tgaaaaggaa tggcaacatt acaaagtggt aaatgcttta 1380 ctgttccaac ttttttaaaa tgtgttgcaa gaaccaaaat tgaaatacgt gtttatttta 1440 aaaaaaaata atcatgagga acacattaaa taatgtttgt tgtattgtct gcaatgaaat 1500 acaagtcaaa gtacattaac tttttttatt tgcgttttcc atactgtccc aactttttct 1560 gatttggggt tg 1572 // ID CR1-25_DR repbase; DNA; ZEB; 3105 BP. XX AC . XX DT 21-NOV-2008 (Rel. 13.11, Created) DT 21-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-25_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3105 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1699-1699 (2008). XX DR [1] (Consensus) XX CC The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 2..2791 FT /product="CR1-25_DR_1p" FT /translation="SDLRLSVFGCSETWLDGSVLDSEIEIANYSLIRRDRN FT RNGGGVCVYIRSDLSFNVRNEMNNASIEAVWVNVLLPKSKPILIGNLYRPP FT DQSNFYELLDESLNDFVKTHGSEIIICGDLNTDMLNVDAVLYKAFKHFSDF FT FGLKQIIKVPTRITPKSQSIIDLILTSDSDKISCSGVIECSMSDHFLVYLC FT RKVQKVKLMGHNTVLVRSMKNYSPNLFCDILDKVDWSGVWGSKDVEAAWDC FT FKDLFSNALNKIAPYKKIRVKQRSDPWITDEILQLLKIRDESFAKFRKDSD FT CSSYEFYKLKRNEVRNKIREAKRVFFKDKINENKFDPKKLWGVLRNLGYSN FT AKSRKSSKINLKVEDKLIVHPKDIAESFNQYFILLAKKLCGQLPGQPDIYN FT EDRVFSFYRQQDILPDSFSLRPVTESEVLKIISELKLNKAPGVDNIPARFF FT KDSVKVIAPLVSHILNLSIEQGKIPCDFKSARVIPLYKKGSTLDMINYRPV FT SILGVMSKILEKIVHEQIVNYITKKHILYELQSGFRALHSTETCILHMTDK FT IRRAVDGGNYCGMVLLDLQKAFDTVDHQILLYKLKAIGFDKVSLKWVQSYL FT HDRNQRVDIKGILSAPLSINCGVPQGSILGPLFFLLYINDLKIACSEELYL FT YADDAAILVSHHDKRALEEIMSSQLKSVYNWFLDNKLSLHMGKTEAILFAS FT RVKLKKDDNLVVKINDWKIQSKNVINYLGCLIDNKLSGDAMAKKVITKICG FT KIRFLARQSDLLDVHSLQLLAGALIQPHFDYATSFWYSSCSQNLKEKLQKA FT QNKLVRIILKMHPRAHLHKESFNKLGMLTVSNRVCFLKLGMVRKMFNNVVP FT SYLMNYFQMVRQQHSYSTRSWDFNICLFKFRSLVGKNMFLYTSAIMWNKLP FT KAIKEIKDLRLFKKAIKRWLFDEDN*" XX SQ Sequence 3105 BP; 1051 A; 355 C; 641 G; 1058 T; 0 other; ttcggatcta cggttgtcgg tatttgggtg tagtgaaact tggttggacg gttcagtgtt 60 ggactcagaa attgaaattg cgaattatag tttgataaga agagaccgaa atcgtaatgg 120 tggtggcgta tgtgtttata taagatcgga tttgtctttt aatgttagaa atgagatgaa 180 taatgcatcc attgaagctg tgtgggtgaa cgttttatta ccaaagagta aacctatttt 240 aattggaaac ttgtatcggc caccagatca aagtaatttt tatgaattgt tggacgaaag 300 tttaaatgat tttgtgaaaa cccatggatc ggaaattata atatgtgggg atttaaatac 360 tgatatgtta aatgtggacg ctgtgttata taaggctttt aaacattttt cagatttttt 420 tggattgaag caaattatta aggttccaac tagaattact cccaaatcgc agtctattat 480 agatttgatt cttacatctg attcagataa gataagttgt agtggtgtta ttgagtgtag 540 tatgagtgat cattttttag tatatttatg tagaaaagtg caaaaggtga agcttatggg 600 tcataatact gttttagtga ggtcaatgaa gaattatagt cctaatttat tttgtgatat 660 attggataaa gttgattggt ccggggtatg gggatccaag gatgtggagg cagcatggga 720 ttgttttaag gatctgttca gtaatgcttt gaataaaata gcaccatata aaaaaataag 780 agttaaacaa agatcagacc cttggattac agatgaaatt ttacaactgt taaagataag 840 agatgagagt tttgccaagt ttaggaagga tagtgattgt agcagttatg agttttataa 900 attaaagaga aatgaggtta ggaacaaaat aagagaagca aaaagggttt tttttaagga 960 taagatcaat gaaaataaat ttgatccaaa aaaactttgg ggtgtactga gaaatttagg 1020 ttattcaaat gctaaaagta ggaaatcatc taaaattaac ttaaaggtag aagacaaatt 1080 aatagttcat ccgaaggata ttgccgaatc ttttaatcag tattttatat tgttagcaaa 1140 aaaattatgt ggtcagcttc cagggcaacc tgatatttac aatgaagata gagtgttttc 1200 tttttataga caacaggaca ttttgccgga ttcttttagt ctgagacctg ttacagaaag 1260 tgaagtttta aaaattatta gtgagttaaa attaaataag gctcctggag ttgataatat 1320 tcctgctaga ttttttaaag actctgttaa agtgattgcc cccttggtgt cacatatttt 1380 aaatttatct attgaacaag ggaaaatacc atgtgatttt aaaagtgcta gagttatacc 1440 attatataaa aaaggaagta cgttagatat gattaactat agacctgtat caatcttagg 1500 ggttatgtct aaaattctgg aaaaaattgt acacgaacag atagtcaact acattactaa 1560 aaaacatatt ttgtatgagt tacagtcagg ttttagagca ttgcattcca ccgagacatg 1620 tatattacat atgacagaca aaatacggag agctgtagat ggtggcaact attgtggaat 1680 ggtgctgctg gaccttcaga aggcttttga cactgtggac catcaaattt tactatataa 1740 attaaaggca attggatttg ataaagtgtc tttaaagtgg gttcagtcat atctgcatga 1800 tagaaaccag agagtcgata taaagggtat attatctgct cccttatcaa tcaactgtgg 1860 agtacctcaa ggaagtattt taggtccatt attttttctt ttatatatta atgatttaaa 1920 aatagcgtgt tcagaggagc tctatcttta tgctgatgat gccgcaattt tagtatctca 1980 tcatgacaaa agggccctgg aagaaattat gagttcacag ttaaaaagtg tttacaactg 2040 gtttttagac aacaaacttt ctttacacat gggtaaaaca gaggctattt tatttgcttc 2100 aagagtgaaa ttaaaaaaag atgataattt ggtagttaag attaatgatt ggaaaattca 2160 gtcaaaaaat gtgataaact atttgggttg cttaatagat aataaattat caggagatgc 2220 tatggcaaaa aaagttatta cgaagatttg tggaaaaatt aggtttttag caaggcaatc 2280 tgatcttttg gatgttcatt ctttacaatt gttggcagga gctttaattc aaccccattt 2340 tgattatgct acgtcatttt ggtacagtag ctgctcacaa aatttgaagg agaaattgca 2400 aaaggcacaa aataaattag tacgaataat tttaaaaatg catcctagag cacatttgca 2460 caaagagtct tttaataaat taggtatgtt aactgttagt aatagagtgt gttttttaaa 2520 actgggaatg gtacgtaaaa tgtttaataa tgtggttcct agctatttga tgaactattt 2580 tcaaatggtg aggcaacagc attcatacag tactaggagt tgggatttta atatttgttt 2640 atttaaattt agaagtttag tggggaaaaa tatgttttta tatacaagtg caatcatgtg 2700 gaacaaacta ccaaaagcaa tcaaggaaat aaaagattta aggttgttca aaaaagctat 2760 taaaagatgg ttgtttgatg aggataatta gttttgcatg gtggatgagg attttttaaa 2820 ttatgttact tgagtgtgaa taggttggcg gctggaggga cattgtgctg gataggttgg 2880 ctgtttgtac cgctgtggcg actgtttgtt ttattcttca ttagtattat gaagccatac 2940 attgttttaa tttaacagag gaccacaatg gaaataagcc cagggcttta ttgtgttttt 3000 atcctcgaca gtttttttta acaaagtgta tggaatactt tttgtacagt ttgtctaaaa 3060 tctgtcaaat aaaattcaat tcaattcaat tcaattgtac ttatg 3105 // ID Gypsy121-I_DR repbase; DNA; ZEB; 4926 BP. XX AC chr19; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy121-I_DR; KW Gypsy121-LTR_DR; Gypsy121_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4926 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1278-1278 (2008). XX DR Genome; chr19; Positions 39642293 39637368. XX CC Positions [2431-2907] - Integrase core CC 'TTAT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1051..4836 FT /product="Gypsy121-I_DR_1p" FT /translation="MSAEVTTLDDLINLVLLEQFKNTLPDIVVTYIAERNV FT KTASQAAVLADEWELIHKTRADGFRSDRLSPCLSSPSVYLNQARRPKSPSN FT RHDLTCNYCLEEGHWKSNCPVLKSKSAKAQLTKSAGFVKPAALVGPVPLSA FT LSAETPANTSSVLDRQDIFNPFISDGFVSLKSGNRIAVKILRDTGSSASFI FT LESILPFSLESDTGKSLLIRGIGLNTLSVPLHKVTLNCDLVQGDVELGVRP FT VLPVEGVSLILGNNLAGGRVWVGETPPLIVSNPPPYCDCLDESAPQNPDVF FT VACAVTRSASRAQSEVKPVSTEHFFPLTDYPLAVSREELSKEQQSDVTLKA FT QYEQVYDDVDIRNRAQGYFIRDGVLLRKWSPHAGFCVGDPFVQVVVPVKWR FT PVVLKTAHDGLAGHAGIKKTYDRILRYFYWPRIKKDVASYVKTCHTCQLTG FT KPNQSLKPAPLQPISAVCQPFEHLIIDCVGPLPRSKAGSEYLLTVMCQVTR FT YPAAFPLRSINTKAIVKALTQFISIFGLPKVIQSDQGSNFTSNLFAEVLRH FT LGIKHNLSTAYHAQSQGALERFHSTLKSLLRAYCVEMDRDWEEGLPWLMLS FT AREVTQQSLGYSPNELVFAHKVRGVLAVLHDQWTDLDPPEKLSDYVLGFRR FT RLSLAWELAKKQLGSSQKKMKRLFDRRTERRQFSVGDQVVALLPVLGSPFQ FT AKYTGPYKVTHIGSNDNYTILTPDRKKKTQVCHVNILKPYFNRDCVTPMQV FT NPVALVDSFPEHYAHHVMDMQADELKTPDDCVLQGRLKNSETFKKLLFDFK FT HLDATQQQELIQLLSEFPQLFSDVPTQTSVLTHDIDVGEATPIRQRFYRVP FT LNKRDVLRAEVQYLLDNKLAEPSSSCWSSPCLLVRKSDDSFRFCTDYRKLN FT SVTKPDSFPLPRMEDCIDQVGAAKFVTKLDLLKGYWQVYLSERARELSSFI FT TPDGLFSYRVMSFGLRNAPATFQRLMNKIVAGLEGVTVYLDDTIVVSDSWP FT EHLCRLRRLLDRLSQANLTINLAKCEFAGATVTYLGKVVGQGQVRPVRAKV FT QAIDEYPVPVTKKELMRFLGLVGFYRCFCRNFSTVIAPLTDLLKSKALFVW FT SESCQTAFDAVKALLTSAPVLMAPRLGEPFKIMVDASKEGAGAVLVQEDDV FT GIERPVCFFSRKFNSYQRNYSTIEKEALALIWALQHFEVYVGGDSSPLTIY FT SDHNPLTFLHSLKNPNQRLMRWCLFLQPFHLDIRHLKGSENIVADALSRAP FT L" XX SQ Sequence 4926 BP; 1220 A; 957 C; 1229 G; 1520 T; 0 other; attgggggct cgtccttgtg aagactattt tttgcgtcga tcggtttgtt ttggttagat 60 tttcgtttgg taagtgtttt attttattat tatgttttgc ttggattatt agcttatttt 120 gttaggaggg ggaaagtagt ggttggcagg caatttgtgg tctccgtcct gcccaagtgt 180 aatatttgat taaggtctaa tccgtcaaaa gaaacctagc caaattgtgc agaaataaac 240 tgtccccggg taggtgaaga acgtgtcttc ggatacgtta tttttttatt ttagtttgtt 300 gtttttggct ggctcggggc ttggtagcat ttgtctcggg agcacggccg aggtttgggg 360 aattacttgt acattggtgg ttctcccaaa cttgcggtat atagtgcata aattcccggt 420 cttgggccag ttgaagatta gggtctacat tagtgaagag tcggggaggt tttatttctg 480 tgtgattggt tgcaataacg gtgatattat tattattatt attggattgg agttaattag 540 attttttttt tttttttttt cctatggcga gtgttgtgga aacgtttgtt caaactccgt 600 cggaggactt attagactta atgacgaagg atcaattatt agaattagca actcattacg 660 aaattgagct ctccgcactt cagaagagaa cgaaggaggg gataaagcag attttaaaag 720 cgtcgctgtt tgatctgaat gtgctggtgg aagcaccggc tccgatacaa cctgggtttg 780 gtgcactttc attcgagcag cagaaggagt tgttgatgtt gcagcgcgac actgaatgtg 840 aaaaacggcg tttagattgt gagaaagcta aaatagaaca taagagttga ggagagtcaa 900 atttacgaaa atgtgaagtc atctgtttta aagatttatg agcgtgtgcc ggaaagttat 960 cgccagcagt ttcgttcttt ggggcgtgaa tccaatcaaa cgcatgtgga atttgtgaga 1020 gatctaactt tgcgatttaa tcgttggcgt atgtcggcag aagtgaccac cttggatgat 1080 ctgataaatt tggttttatt ggagcaattt aaaaatacgc ttccagatat tgttgtcact 1140 tacattgccg agcgtaatgt taagaccgcg tcccaggcag cagtgctggc cgatgagtgg 1200 gaactcattc ataaaactcg tgcagacggg ttcagaagtg atcgcctttc tccctgctta 1260 agttctccat ccgtttattt gaaccaggca aggcgtccta aaagtccatc aaatagacac 1320 gatttgactt gtaactactg cttagaagaa gggcattgga aatcaaattg tcccgttctt 1380 aaatcaaaaa gtgccaaagc acagctaact aaatccgccg gttttgtaaa gcccgctgcg 1440 cttgtgggtc ctgtaccgct gtccgctctg tcagctgaaa caccggcgaa tacatcaagc 1500 gtgttagacc gccaggatat ttttaatccg tttatttcgg atggttttgt gtcacttaaa 1560 agtggtaata ggatagcagt aaaaattctc cgagatacag gatcatcggc ttcctttatt 1620 ttggaatcta tattgccgtt ttcactggag tctgacactg gtaagtcgtt gttaattcgt 1680 ggcattggat taaatacatt atctgtaccc ttacataaag tgactctgaa ttgtgactta 1740 gtgcagggcg atgtagagct tggcgtgcga cctgttctgc ccgttgaagg cgtgtcactt 1800 attcttggca ataatctggc gggcggacgc gtttgggttg gggaaacgcc acctttgata 1860 gtttctaacc cgcctcccta ttgtgactgt ctggatgaga gtgctccgca aaatcctgat 1920 gtgtttgttg cttgtgcagt gactcgatct gctagccggg cgcaatctga agtaaaacct 1980 gttagtactg aacatttttt tccgttaact gattacccac tggccgtttc tagggaagag 2040 ctctcgaagg aacagcagag cgatgtaact ttaaaagctc agtatgagca agtttatgat 2100 gatgttgata ttcggaaccg ggcacagggt tatttcatca gagatggggt attattgaga 2160 aaatggtctc cacatgctgg tttttgtgtg ggggatcctt ttgtccaggt ggtggtgccc 2220 gttaaatggc gaccggtagt tttaaaaact gcccatgatg gtcttgccgg tcatgctggc 2280 attaaaaaga cgtatgacag aattcttcgt tatttttact ggccacgaat caaaaaagat 2340 gtcgccagtt atgttaaaac ctgtcataca tgccagttga ctggtaagcc taatcagtcg 2400 ttaaaaccgg cgcctctgca gcccatttct gccgtatgcc aacctttcga gcatctcatc 2460 attgactgtg ttgggccact ccctcgctca aaagcgggga gtgaatatct gctaacagtg 2520 atgtgccagg tcactcgtta tccggccgcg tttcctttgc gttctattaa cactaaggct 2580 attgttaaag ctctcactca atttatttct atttttggct tacctaaagt tattcagtcc 2640 gatcagggaa gtaattttac ttcaaacttg tttgccgaag tacttcgtca tttggggatt 2700 aagcacaatt tgtccactgc gtatcatgcg caaagtcaag gtgcgctaga gcgcttccat 2760 tccaccttaa aatccttgct ccgggcttat tgcgttgaga tggacagaga ttgggaggag 2820 ggcctcccgt ggctgatgct gtcagcccgg gaagtaactc aacaaagtct tggttatagc 2880 ccgaatgaac tcgtttttgc tcataaagtt cggggggtac tggctgtttt gcacgatcag 2940 tggaccgatt tagatccgcc agaaaaattg tcggattatg ttttgggttt tcgtcgacgg 3000 ctttcgttgg catgggaact agctaaaaag cagttgggta gttcccagaa aaagatgaag 3060 cgcctgtttg accgtcgcac tgaacgccga cagtttagcg tgggagacca ggttgttgcg 3120 ctcctcccgg tgttgggatc gccctttcaa gccaagtaca ctgggccata taaggtgacg 3180 cacattggca gtaatgacaa ttacactatt ttaacaccag atcgtaaaaa gaaaacacaa 3240 gtttgtcacg ttaatatttt aaagccatat tttaatcgag actgtgtcac accaatgcag 3300 gtaaatcctg tagcgctggt tgatagtttt cctgagcact atgcgcatca tgtgatggat 3360 atgcaggcgg atgaattgaa aacacccgat gattgcgtgt tgcagggtag gttaaagaat 3420 tctgagactt ttaaaaagtt gttattcgat tttaaacatc tagatgctac tcaacaacaa 3480 gaattaattc agttgttatc ggagtttcca cagttatttt ccgatgtacc gacacagacg 3540 agtgttttga cgcatgatat agatgtgggg gaagccacgc ctatccgcca gcgcttttat 3600 cgtgttcccc tgaataaacg tgatgtattg agagcagaag ttcagtattt gctcgacaat 3660 aaactggctg aaccctcttc ctcttgctgg tcatctccgt gtcttttagt tcggaagtcc 3720 gacgatagtt tccgtttttg tacggactat agaaaattaa attcagtaac aaaaccagac 3780 tcctttcctc taccgaggat ggaggattgt attgatcagg tcggtgctgc aaaatttgtg 3840 actaaattgg atttacttaa gggctattgg caggtctact taagtgagcg tgcgagggaa 3900 ttatcgtcgt ttataacacc tgatgggtta ttctcctacc gggtgatgag cttcgggctg 3960 cgaaacgcgc ccgcaacgtt tcagagatta atgaacaaaa tagtcgcggg attagaaggg 4020 gttaccgttt atttggacga tacgattgtt gtgagtgaca gctggccgga gcatctttgt 4080 cgtttgcgcc gccttctcga tcgcctctcg caggccaacc ttaccatcaa tctcgctaaa 4140 tgcgaattcg caggggcgac ggtgacgtac ctgggcaagg tagtgggaca aggacaggtg 4200 cgtccagtga gggcgaaggt gcaggcgata gatgaatatc cggtgcctgt cactaagaaa 4260 gaattaatgc gctttttggg acttgtaggg ttttaccgct gtttttgtag aaacttttcc 4320 acagtgattg cccctttaac tgatctgctt aagtccaaag ctttatttgt gtggtccgag 4380 tcttgtcaaa ctgcgtttga tgcagtgaag gcgttgctta cgtcagcacc agtcctgatg 4440 gctccccggc taggagaacc atttaagatt atggtggatg ccagtaaaga gggagcgggt 4500 gcagtgttgg ttcaagagga tgatgtgggc atcgagcgtc cggtttgttt tttctcacga 4560 aaattcaata gttatcaacg gaattattct accatcgaaa aagaagcgct cgcattaatt 4620 tgggctctcc aacactttga ggtgtatgtg ggaggagata gttcaccttt aacgatttat 4680 agcgatcata atcctcttac ctttttgcac tcgcttaaaa atcctaacca acggcttatg 4740 cgctggtgct tgtttctcca accttttcat ttggacattc ggcacttaaa aggctctgag 4800 aatatagtgg ccgacgccct gtctagggcc cccttgtaaa tatgggtttg tatgcacttt 4860 ctttttatta tgtacatgtt tatggtttat tgtatggaaa cgttgtttcc tgtttatggg 4920 ggaggg 4926 // ID hAT-N51_DR repbase; DNA; ZEB; 368 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N51_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-368 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1320-1320 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 368 BP; 119 A; 64 C; 66 G; 119 T; 0 other; tagggctggg ccgataaaca atattatatc gaatcgcgat aaaatttatg tcaataacaa 60 tgataagctc tggacttttt tactctatat tgatctaaga gccaatcaca cagcagaaat 120 gtgcaacaat gggaatctaa aagtgtgttg atattagaga tgtaccgaat tttcggccac 180 cgaaaattta tcggccgaaa atatgttatg ccatttaatg gaccctctaa tttttgtggc 240 ttcagtcatt gttggggtac tcttttaaat ctggaagcat taacaactta tatcgaaata 300 tatatcgtta tcgttcaata tggaaaataa ttatcgagat tgcatttttg ccatatcgcc 360 cagcccta 368 // ID Gypsy39-LTR_DR repbase; DNA; ZEB; 961 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy39_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; KW endogenous retrovirus; Gypsy39_DR; Gypsy39-I_DR; Gypsy39-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-961 RA Dib M.R. and Naveira H.F.; RT "Gypsy39_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 809-809 (2007). XX DR [1] (Consensus) XX CC Gypsy39-LTR_DR is a long terminal repeat of the Gypsy39_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy39-I_DR. XX SQ Sequence 961 BP; 203 A; 179 C; 188 G; 391 T; 0 other; tgttacgtgc cagcacgtat tggttattta tggttgtatt actatgtgtc tcattgtgtt 60 tttcttttta tcactccagt tcacttcctt gtttttgcct actgctgttc ctgattggct 120 ggttcacatc cacttgatgc aaccaatcct gacactgctt tatgctgatt ggttcctctg 180 tgacaactcg gttccagttt cagccaatca gattactcca gctcactata tatttttttg 240 tctgccagca gtttgacagt ctttgttggc tcaccttgtc actggttttt gtgcttgttt 300 gctcgcttga ttgaacctgt gtttatatcc ccattttgtt aagtattgtt gtgatatata 360 gtccgtctta tcctttattt acactgtgtt gttattttgg tttagtcact gtctgttctt 420 acctgtgtaa tcattcattc cctttcctca tgtttcccta cagaggtttg gtagaaagac 480 tggtaggtaa gtaggtcacg tgatatacat ttcaacacat aggagttcaa ctgtctagag 540 atacactagg gaagggggtg agcgccaacc cgtgtatttt tggttaagtg gatagttagt 600 gtagtttagg taagacgtct tggacgttga agatttattt ttcacattat tgtttattag 660 tctagattag cggggatttg atattagtaa gactgtttgg ttttgattgt ttctttgctt 720 tagcgccacc catagccagt ctttctaatt taggtttttg ttttgattgt gatttgtaaa 780 tattgtaaat agctcactta ctgcttcact tatcaccact tgtaataaat cactttattt 840 gcacagcaat tcttgtatgt ggtcattact ttgagttcaa ttttctttga gtgtttgttt 900 gaacaaaatc cttgatcaag tgtgcaaccc cacttcccta gactgcctag gggttgtaac 960 a 961 // ID DRSATII repbase; DNA; ZEB; 173 BP. XX AC . XX DT 11-AUG-1999 (Rel. 4.07, Created) DT 11-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE Zebrafish satellite type II DNA - consensus. XX KW SAT; Satellite; Simple Repeat; DRSATII; Repeat region; KW satellite type II. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Ekker M., Fritz A. and Westerfield M.; RT "Identification of two families of satellite-like repetitive DNA RT sequences from the zebrafish (Brachydanio rerio)."; RL Genomics 13, 1169-1173 (1992). XX RN [2] RP 1-173 RA Jurka J.; RT "DRSATII."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [2] (Consensus) XX SQ Sequence 173 BP; 61 A; 33 C; 17 G; 58 T; 4 other; aagcttcaaa tccaaattac aaatttacac tggattttgt atcccttcaa tgcttgtaaa 60 gcatctattt gacatctatt tgwmwtctat ttgacatcca cacattgatg cctctttgac 120 cachaaagaa accacacaaa aagtatcata atatataaag tttaattatt tga 173 // ID hAT-N22_DR repbase; DNA; ZEB; 1738 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 03-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE hAT-N22_DR is a nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; hAT superfamily; hAT-N22_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1738 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1292-1292 (2008). XX DR [1] (Consensus) XX CC consensus built from dozens of copies, ~ 85% identical to the CC consensus. TSD=8 bp. TIR= 13 bp. CC The termini sequence of hAT-N22_DR is similar to hAT-N21_DR, thus CC classified as hAT superfamily. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1738 BP; 339 A; 398 C; 228 G; 769 T; 4 other; cagtggtgga aagagtactg aaaaatcata ctcaagtaaa agtaccatta cttgcccaaa 60 aatgtagtgc aagtagagta aaagtatctg ttgtaaatat tactcaaagt atgagtaaaa 120 agtagccctt ttaaaagtac tcaagagtag tgagtagtga gtattacact gtgaaaagtt 180 gatgcgttta catgcagttt gtgcagggat gtgtaaacgt aacattctgt agtgcattta 240 gtgatcttcc tatatttgtt cgttccttcc aaccttcctt ccattgtttg tttgttctgt 300 ccttccttcw ttcattcctt ccttcattcc ttccttcctt ccttcattca tttgttcatt 360 cgtttgttca ttccttcctt cattcatttg ttccttcctt catttgttcc ttccttcctt 420 ccttccttct tccttccttc attcattcgt tcattcattc gttcattcat tccttcattc 480 atttgttcct tcatttgttt gttcattcat atttttgttc cttccttcat taggtccttt 540 gttcattcat tcattcgttc gttcgttcct tccttccttc cttagtttgt tcagtccttc 600 cttgaatcct tccttcattc atttggtcct tccttcgttc attctttcat tcattcgttt 660 gttcgttcat atttttgttc cttccttcat taggtccttt gttcattcat tcattcattt 720 gttccttcat tyatttgttt ctttcttcat tccttccttc cttccttcat ttgttcattc 780 tttcctttgt tccttccttc cttagtttgt tcagtccttc cttgaatcct tccttcattc 840 atttggtcct tccttcgttc ctcccttcat ttgttcgttc attcattcgt tcgttctttc 900 atttattcat tcgtttsttc attcattcat tcgtttgttc gttcatattt ttgttccttc 960 cttcattagg tcctttgttc attcattcct tcgttcgttt cttccttcct ttgttcgttc 1020 cttccttcct tcattcatgt tttttttcct tcattccttc cttccttcct tcattcattc 1080 atttattcat tcattcgttc gttcgttcat atttttgttc cttccttcat taggtccttt 1140 gttcattcat tccgtccttc cttccttcct tagtttgttc agtccttcct ttaaatcctt 1200 ccttcattag tttggtcctt ccttcrttcg gtcctttcct tccattcttc attcgttcca 1260 tttagtgatt gtttaaggct gtttggtgat ttcagtcatc atacagtcga catccttcat 1320 tttctcatca gtgacatgca gtctataaaa agtctctggg tcattgcgtg taaagatttt 1380 ggacatcttc ttggacattt ttaatgcttc caaaaggttt gctgcattta taaagcgccc 1440 atgtcttgag gttgttcagt atgatgcgat ttactttcta tgtgcgattt gattggacag 1500 gaatcacagg actgattttt ctactttagc caatcaccat agacaagaaa gaaataaagt 1560 agtgactgca ggttgaagga aagtagtgga gtaaaagtac cgatacagca ctaaaaatgt 1620 actcaaggga aagtaaaagt acacattttt aaaactactt agtaaattac aattcctgag 1680 aaaaaactac tcaattacag taatttgagt atttgtaatt tgttacttta caccactg 1738 // ID Gypsy-32-I_DR repbase; DNA; ZEB; 4400 BP. XX AC . XX DT 01-JUN-2007 (Rel. 12.06, Created) DT 26-JUN-2007 (Rel. 12.06, Last updated, Version 1) XX DE An internal portion of the Gypsy-32_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-32-LTR_DR; KW Gypsy-32_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase; Gypsy-32-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4400 RA Dib M.R. and Naveira H.F.; RT "Gypsy-32_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(6), 181-181 (2007). XX DR [1] (Consensus) XX CC Gypsy-32-I_DR is an internal portion of the Gypsy-32_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-32-LTR_DR. CC Gypsy-32_DR is characterized by 4-bp target site duplications. CC The internal portion encodes one polyprotein the 1460-aa CC polyprotein Gypsy-32_DR1p (pos. 19-4398) composed of the gag, CC protease, reverse transcriptase, and integrase domains. Some CC insertions fairly recent, according to the high identity between CC their flanking LTRs. Consensus obtained after the alignment of at CC least five independent insertions bearing at least 85% Homology CC over at least 1000bp. XX FH Key Location/Qualifiers FT CDS 19..4398 FT /product="Gypsy-32-I_DR_1p" FT /translation="MATRKRTTRGRKAGLRSAVKDAAWRRLSSEDDEEATE FT PYSSLDPSTESSGGMESLMRDFLKAQKTREEMFLRELQGLRVTVQQAVQAA FT PERPTGNPSPEPERENTGESSLPPPTPPPPALPSHHRAQFAHRPETPLPPF FT QMGDDMENYLRRFERLAQTWQWPKEQWSCRLVPLLTGRALEAYLAMDEVSA FT DNYLQLKDSLLQKFNVSAESYRQRFRAASTPEGESPTETYYRLKHLYQRWI FT RPDIHSVEEIGEQIILEQLLRVMKPEARIWVKQHEPSTGLAAAQLAQQYAN FT AHRSGLRTQPERGSTRHIRHNTDNRHTQELETELTPKLICYACRQYGHKAS FT VCPARKPKLTGMCYVPREGDANSDYLMSQSVDMVNVVVNGRELKALLDTGS FT SISLIKSCQVNNIDYANTTDVCCIHGDVKSYPKAEVLVHIQDEMYLLNVAV FT VTNLSVDMILGRDLPILNELCSKTIKMNCAGLNETENVLTVVTRAQAKTGL FT LPLPDLDDSLLQGATKGERKTRKQKRLLKKLGAPVKKAQLDGLEAKNWQIP FT ENISELQKNDESLKKLFKRAENPNNATMNDEHFVIIDDVLYVQVKEVKRLV FT VPMSVRSIVLHLAHTLPWAGHLAFQKTYARISTRFFWPTMYTDVQTYCNTC FT STCQTTSAVRHRDKAPLQPLPIVSTPFRRIAMDIVGPLERSSAGHRYILVV FT CDYATRYPEAFPLRTVTTSKVVQALTELFSRVGIPDEIITDQGTNFMSRVM FT TQFHQQLGIKALKTTPYHPQTDGLVERFNGTLKSMLRKFVSDTGKDWNKWL FT PFLLFAYREVPQASTGFAPFELLYGWPVQGPLDILKKAWTKSPTVAEDSSI FT VKFILEMRDKLDTYREQAMENLKEAQTKQKRWYDMKSRLRQFQPGQKVLLL FT LPTSTNKLLAKWQGPYEITRKMGPVTYEILQPERKKTRQIYHVNLLKEWKE FT RKEPEKTLMVREVKEEDEEPAEEMVTPREAAELKLNHLEENKKKELQELLS FT KYPLLFQERPGRTEVTQHTIHLTDPTPSRQRPYRIPEKLLKPLKEEIETMK FT QLGVIEPSSSEWSSPIVIVPKKDGSLRICVDFRKLNSQSKFDAYPMPRIDD FT LLERIGQARFITTLDLCKGYWQVPLHHSSKALTAFRTPVGLYQFTVLPFGL FT HGAPATFQRLMDQVLQGCEEYAAAYLDDVVIYSHTWEDHLKHLRQTLEKIQ FT AAGLSLNVVKCEWGRGETNYLGYILGNGGLRPQVDKVEAIKKSQRPRTKKE FT VRSFLGLLGWYRRFIPDFAAIAAPLTDLLQKGTKNPVIWTEACEQAWTVLR FT ERLCAQPVLKSPDFSQRFLVQVDASERGIGAVLAQGPAGRERPVVFLSRKL FT LPRETRYSTVEKECLAIKWSLESLKYYLLGREFDLQTDHRALTWINSMKDH FT NARVTRWYLSLQPFQFKISHCPGRSNLVADYLSRFPDSARHGEGED" XX SQ Sequence 4400 BP; 1332 A; 979 C; 1119 G; 970 T; 0 other; gttggtgtca gaagtgagat ggccacccgg aagaggacaa cgagaggtcg aaaggcgggg 60 ctacgatccg ctgtgaagga cgcagcttgg cgtaggctgt catcggagga tgatgaggaa 120 gctacagagc cttacagcag ccttgatcct tcgacggagt ccagtggagg catggagagc 180 ctgatgaggg atttcctcaa agcacaaaaa acaagagaag agatgtttct tcgggaactg 240 cagggtttgc gagtgacggt ccagcaggcg gtgcaggcag caccggaacg accaacaggc 300 aacccttcac cagagcctga aagagaaaac acaggggagt cttcgctgcc accacctacc 360 ccaccaccac ccgcacttcc atcacatcac cgggcccagt ttgcacaccg tccagaaacg 420 ccgttgccac cgtttcagat gggtgacgac atggagaact atctacggcg ctttgaacgt 480 ctggcgcaga catggcagtg gcccaaggag caatggagct gtcggctggt acccctatta 540 accggtcgag cgctggaggc ttatctggca atggatgagg tcagtgctga taattactta 600 caattaaaag actctctgtt gcagaaattc aatgtgtcgg cggaaagcta tcggcagcgc 660 tttagagcag catcaacgcc ggaaggagag tctccgacag aaacgtatta tcgcctgaaa 720 catctatatc agcgctggat tcgaccagac attcattcgg tggaggaaat cggtgagcag 780 attattctgg agcaactact acgggtcatg aaaccggaag ccaggatctg ggtaaaacaa 840 catgagccaa gcacaggact ggcagcagca cagttagccc agcagtacgc taatgctcat 900 cgctccggtc tgcgcacgca acctgaaaga ggtagtacac gacacattcg tcacaatact 960 gataaccgac atacacagga acttgagact gaactgacac ctaaattgat ctgttatgca 1020 tgtcgtcaat atggacacaa agcctcagtt tgcccagcaa gaaagcctaa gctaacgggt 1080 atgtgctacg taccacgtga gggtgatgct aacagtgact atttaatgtc tcaaagtgtt 1140 gatatggtga atgtggtggt gaacggtcgt gaacttaaag cattgctaga cacaggaagt 1200 tcaatttccc tgattaaatc atgccaggta aacaatattg actatgcaaa taccacagat 1260 gtatgctgca ttcatgggga tgttaaaagc tatccaaagg ctgaagtgct tgttcatatt 1320 caagatgaaa tgtatctttt gaatgtggcc gttgtcacta atttgtcagt tgacatgatt 1380 ttgggtcgag atctaccaat tttgaatgaa ctgtgtagca agactatcaa aatgaactgt 1440 gcagggctaa atgaaactga aaatgtgttg actgttgtga cacgtgctca agctaagact 1500 ggtttgctac cgttgccaga cttggatgat agtctactgc aaggtgcaac caagggtgag 1560 agaaaaacta gaaaacagaa aagactgtta aaaaaactgg gtgcgccagt taagaaagca 1620 caattagacg gcttagaagc taagaattgg caaataccag aaaacatttc tgagctgcaa 1680 aaaaatgatg agtctttgaa aaaactgttt aaaagagctg agaacccaaa caatgcgact 1740 atgaatgatg aacattttgt tatcattgat gatgttttgt atgtgcaggt caaagaggtt 1800 aaacgtttgg ttgtacccat gtctgtccgt tctattgtgt tacacctagc acacacacta 1860 ccatgggcag gacatctagc atttcagaaa acatatgcac gcatcagtac acgtttcttt 1920 tggccaacca tgtacactga cgtccagaca tattgcaaca catgcagcac atgtcagact 1980 accagtgcag tgcgccacag agacaaagca ccattacagc cacttcccat tgtttcaacc 2040 cccttccgcc gcatagctat ggatatagta gggccacttg aaagaagtag tgccggacat 2100 cgctacatcc tagtcgtttg cgactacgcc actcgttacc ccgaagcttt cccactccgc 2160 acagtgacca catcaaaagt tgttcaagct cttactgaac tgttttctag ggttggaatc 2220 ccggatgaga tcatcacaga ccagggaaca aactttatgt cacgggtaat gacacagttt 2280 caccagcagc taggcatcaa agctctgaaa acaacacctt accaccctca gactgatgga 2340 ttggttgaac gtttcaatgg gaccttaaag agtatgctga ggaagttcgt ttcagacaca 2400 ggtaaagact ggaacaagtg gttaccattc ttactgtttg cttacagaga agtgccacag 2460 gcatccaccg gattcgcacc atttgagctg ctgtatggct ggccagtcca gggacctttg 2520 gacatcctta aaaaggcgtg gacgaaatct ccaacagtgg cagaggacag cagcattgtg 2580 aagttcatct tggagatgag agacaagctg gatacctaca gggagcaggc gatggagaac 2640 ctaaaagagg cacagacaaa acagaagaga tggtatgata tgaagagcag actacggcag 2700 ttccaacctg gacagaaggt actgttatta ctacctacat caacgaacaa acttttagca 2760 aaatggcagg ggccatacga aatcaccagg aagatgggtc cagtgacata tgaaatactc 2820 caacctgaga gaaagaagac tcgacaaatt taccacgtga acctgctaaa ggagtggaaa 2880 gagagaaagg aaccagagaa gacattgatg gtgagagaag tcaaggagga ggatgaagaa 2940 cctgctgaag aaatggttac tccgagagaa gctgcagaac tgaagctgaa tcatctggag 3000 gaaaacaaga aaaaggagtt acaagagcta ctaagcaagt accctttgct gttccaggag 3060 agaccaggca ggacagaggt aactcaacat accatccact taacagaccc tacaccatca 3120 aggcaaagac cataccgtat accagagaaa ctgttaaagc cactgaagga agaaatcgag 3180 acgatgaaac aacttggggt gatagaacca tcatcatcgg agtggagcag tcctattgtg 3240 attgtgccaa agaaagatgg atcactgagg atctgtgtgg acttcaggaa attaaattca 3300 cagtcgaagt ttgatgctta cccgatgcct agaatagatg acttactgga gagaataggg 3360 caggcccgct tcataaccac actggatttg tgtaaaggtt attggcaggt gcccctccat 3420 cactcatcta aagccctgac tgccttcaga acacctgttg ggctatacca gttcacggta 3480 cttccctttg gattacatgg ggcacctgct accttccaac gcctgatgga ccaagttctc 3540 cagggttgtg aggaatacgc agctgcttac ctagacgacg tagtgatcta cagtcacacg 3600 tgggaggatc atttaaagca cttaagacag accctggaga agatacaagc agcgggcctg 3660 tccttaaatg tggtgaaatg tgaatgggga cgaggtgaga ctaattatct gggctatata 3720 ctgggcaatg gtggactcag gccacaggta gataaagtcg aagcgatcaa gaagagccaa 3780 cgacctagga ccaagaaaga ggttcgatcc tttctaggac tactgggctg gtatcgtcgc 3840 ttcatccctg atttcgcagc aattgcagca cctttgactg atttactcca gaaaggtaca 3900 aagaacccag tcatttggac tgaagcctgt gagcaggcct ggacagtatt gagagagaga 3960 ctgtgtgcac aacctgtgct gaagagtccg gatttctctc aaagattcct tgtacaagta 4020 gacgcatcgg agagagggat aggtgcagta ctggcccaag gaccagctgg acgagagcga 4080 cctgtggtat tcctgagtag gaagctgttg cccagggaaa cgcgttactc tacagtggag 4140 aaggagtgtc tggcgataaa gtggtcccta gaatccctga aatactacct cctggggcgg 4200 gagtttgact tacagacaga ccatcgggca ctgacgtgga tcaactcaat gaaagaccac 4260 aatgctcgag taactcggtg gtacctatct ctgcagccct ttcaatttaa gatcagtcat 4320 tgccctggga ggagtaatct tgtagccgac tacctgtcaa ggttcccaga cagtgcgcgg 4380 catggagagg gggaggatac 4400 // ID DNA-6-N7_DR repbase; DNA; ZEB; 4244 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 02-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4244 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2137-2137 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 4244 BP; 1431 A; 770 C; 674 G; 1366 T; 3 other; tatggcgctc ctccgagctc tttaacctga cgctgcaacg tgcactcgtc tcaccgaacg 60 tctcactgaa cgtctcactg acgtcttaaa aaatattttt ttttgctatt aaatccatgc 120 aatttaagac gcagacgtga cgtwgacgtc acgttgacgt cacgtagacg tcacgtaaat 180 cacgtgtggc agtgaagtgt caagccagac gttacgcgtg attcacgtgt agcagtgaag 240 ttatttattt atttttttgc taacacgtat cataacacgc tgctcacacg gctcagtgca 300 gtgtgtgagg cagtgcacaa ttttcttgcc cttcatacat tcctctcaat ggacgtgaca 360 tcaacgtctg acttgttaca tgacgtctgc gtctaccttt ttttttaata tatatatttt 420 ttcagcagtt tttagtgcgt ctgtctgctt tttgccggct aacttataca atatacaata 480 attatgtgca aattaaatat aaatatgtta atatataatg atatttgttt tcaagatagc 540 attaaaacgt ataaaaaatg tataacaaaa ataacaaatg tctccggagg taaagtttag 600 gttgtttatt aatatattca tgtcgtttag ttttgtaaaa tcctttttta atttgtttta 660 catttgagtc ttattgtgtt ttaatttttt acatgtttaa tgtagcctac agcatatcag 720 tgatgaccaa ctcatcgatc actatcgcct gttagatctg atacttttgc tgtaaaatat 780 tagaaaaata gaaaattaca ttaaccatta ttgtgttgtt tttaaagcac ctttcttctt 840 tattttacgt ttaaaacata atattaggat tctttgacaa ataaaaaaaa cataacatat 900 tttatttttc atttgtttcc tttctttttt gaaagaaaat gtatattttt attcagcaat 960 aatgttacat tgatagtatt tatattttga ataataattt gccaggattc aatcaatcaa 1020 tcctgtaaaa atatcataga aaaacattta acagcacaac tttgacaata cactgaagga 1080 tcatgcgaca taatagtgtc aaaaaattta actttgcatc acataaataa ataagtatat 1140 taaaatagaa aagcattatt ttaaattgta atattatttc ctaatattac ccaaaccctg 1200 aaccattcat aataataacc aaaacagtga tgtttaatcc taaaacgttg aatagcctaa 1260 tgttatttga aataattaaa aataattaaa atttgcttta aatgttacat ctaaactaaa 1320 aactaaaatc gccagtaggt ggcagaatgt cactattaca gtttttccca attgtttaca 1380 cacaattact tgtacttcag acacaatgac tacaacatgt aactgatgca ccaaccccct 1440 gaaccaatgc agctaaacta caagcacaat tcctgcttta cactcaaatt gcaggtttaa 1500 aacacacttt ttcaaaacac tacacacaat tctctgcatt ttgcacaatt ttcatgcaga 1560 aaatctcttg ttttaacaag gaacacactg tcattcacaa ttgtaaagtc agttgtccta 1620 ctttgcatac taactcatca catgagtaaa cacctgtcac acagaattgc aatttagaaa 1680 tcagagctca tctggtttat aactggtaca ctgtccaccc atgtttttta gttgtttatc 1740 tccttccata ttctccattc cttaatcata ttgaagattt tttttcctgc ttggggatgg 1800 aaagtgtatg ccgaaatcca cactttattt ttcgtttttt actaacagta ttctgtaaat 1860 ccttctgtta atggcaaatg aagaccacat tatgtgcacc tttgtgttgg ttgggatgaa 1920 cactctgtaa actgaaaata cagtaaaata tatttggtat ttgtgtgttt tgtataaaaa 1980 caatattctg aagtatttta caatgcactt ctgtaggcct atgtactgtc tgtagtaact 2040 gtaacactaa acgaaaaagg ctgacagtaa gtcattatga tgaatggaga agaagtggtc 2100 atagtgtttt acattcagca catcagtgtt caactgatct tataaatgtc tattgatgtg 2160 atggtttgtg tgcgtcattt gaaaacaaaa taccatgttg agaagaaatt acattgtgtt 2220 gattgtgttt cattttgcag gagaattgaa gggtttcacc ctagagttct gacaatatga 2280 ggcatgcttt cagaaaatgt gtgtaaacaa ttgggaaaac tgtaaattgt gtggatttca 2340 gtcatacaca ctttccatcc ccaagcagaa aaaaaaatct tcaataggat taaggaatgg 2400 agaatatgga aggagataaa caactaaaaa acatgggtgg gcagtgaacc agttatgaac 2460 cagatgagct ctgatttcta aattgcaatt ctgtgtgaca ggtgtttact catgtgatga 2520 gttagtatac aaagtaggac aactgacttt acaattgtga atgacagtgt gttccttgta 2580 aaaacaagag attttctgca tgaaatttgt gccaaatgca gagaattgtg tgttgtgttt 2640 tgaaaaaaaa gtgtgtttta aacctgcaat ttgagtgtaa agcaggaatt gtgcttgtag 2700 tttagcagaa ttggttcagg gggttggtac atcagtcaca tgttgtagtc attgtgtctg 2760 aagtacaagt aattgtgtgt aaaaaaaact gtaagtgagt cawtgttcat ttacagtttt 2820 tttttttttt ttgattgctc aagcacaatt ttgaaatgaa ggctcatttt gtcaaaacac 2880 tacacacaat ttccacatcc acacacacac aaacagcaga acacatcaga acttttgcaa 2940 aatgaaacac ttcatccaaa actttacaac actataacaa aacacaattt taacaccgta 3000 cagaacacac acgcctcatt caagataaat ctgtttgctt catttacaca ctgctgatct 3060 caatcttaaa cactcgtata atttctacct gactaatact gatataatct gggtttgatt 3120 tgtttcagag gtattgctat aaagtacatg aacatactgt attcaccaaa cagtgataga 3180 ctgaaagcta aatattgccc tatcctaata aaaaccatac aacaaatgta aagtgtttta 3240 ttcagctttc tggtgatgtg tacatttcaa tatataaaaa aaatcccatg ccctgtctcc 3300 tgtttaccaa tcttcctccc ttccgtgctt cctcttccat actgcaacat tgccttccat 3360 atttattgaa gattatttat ttataactcc tgttaaacat atttattgaa gactaatatt 3420 tacagctata ctactgttaa caaatgtatt gtagataaat ataatagcct agtttatacc 3480 taatacccag ttacagtttt tcatgattgc taacacacta aaattaaact tttcctacaa 3540 ttagcaaaac cgtacaatca tggagcaaaa cacaaggcta gatctgcaca actgtaggca 3600 cattgtcagc ttcatactkt ttgcaaaaca ttacacacag tcatttgcaa atcactaaac 3660 acacttggat gcttttgaca cagacatttt ttcataatgg aatttccttg caatttcaaa 3720 gcaaaggctt tcatatgagc acacacatga actataggtc aaacacagcc atcaggtgtg 3780 cacacacact gctgcaaact gtagacacac acaccaatca gctgtaacca caataaaaga 3840 agcaatatag gaataaccat atatggcata ccaaaatgtt taaatcattt gtataaatat 3900 ttaccatttg aaataaaaaa ctataaataa gaaaggtaaa aagcacttaa ttgcaaatga 3960 aacataactg tggataagtg aaaacactga tagagaatga ataggtataa taaagcagaa 4020 ataagagcaa tgtgctgcaa acccttgaag cagacgcaga cgtgacgcag acgtcacgtt 4080 aaagcagcgg aacagtaatg tggcagacgc agacgacacg ttcggcagac gcagacgtca 4140 cgttcggcag acgcagacgt cacgttcggc agacgcagac gtcacgttca gtgagacgag 4200 tgcacgttgc agcgtcaagt tattgaccca ttccgcacgc cata 4244 // ID EnSpm-N4_DR repbase; DNA; ZEB; 212 BP. XX AC . XX DT 15-OCT-2008 (Rel. 13.1, Created) DT 15-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE EnSpm-N4_DR is an non-autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; EnSpm-N4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-212 RA Bao W. and Jurka J.; RT "EnSpm DNA transposons in zebrafish."; RL Repbase Reports 8(10), 1264-1264 (2008). XX DR [1] (Consensus) XX CC TSD is 2 bp long; TIR is 12 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 212 BP; 74 A; 27 C; 30 G; 81 T; 0 other; cactgtaaaa aattatatga tcaattagtc ctgacagcat atgtttttag gtcattgtaa 60 cttattaaac taagttaatc atgttctaac ttaattttat aagttacgca agctgtttta 120 agtcagttta acataatata agttcaatgg actcataagg ttaatttgat tcagcttaaa 180 aatttaaggc aaccaggatt ttttttacag tg 212 // ID LOOPERN7_DR repbase; DNA; ZEB; 1290 BP. XX AC . XX DT 09-JAN-2009 (Rel. 14.01, Created) DT 02-MAR-2009 (Rel. 14.01, Last updated, Version 2) XX DE Nonautonomous DNA transposon - a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW LOOPERN7_DR. XX NM LOOPERN7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1290 RA Jurka J.; RT "A nonautonomous piggyBac-like DNA transposon from zebrafish."; RL Repbase Reports 9(1), 1-1 (2009). XX DR [1] (Consensus) XX CC 87% identity to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1290 BP; 396 A; 262 C; 235 G; 396 T; 1 other; aggtgcagta ggtgatctgc cagaatgcta gcattagcat aatagctttt agaaatttta 60 cgaccctccc ctgccgtcca aagccacgcc tccttaagtc atgaacacgt gaacgcgcac 120 tgattagata taacactaga tcatgtcatt caccagttag aaaactctat ggagacacgc 180 actttatcaa gcgtagttgt tgacaggcca gmgggtgcag gaattatatt tccccaaaac 240 tgtctcacag gctgtcactg ttcaaaaatg aacaaatgtg tgaatataaa tcaaacttca 300 cctcagtaaa gctggttagg cggaagagca catttactta tgttttgtta ttaaactaaa 360 taaaagttag atcagaataa aggttagatc tgctaaaaac agaaatatcc tgtcatcttt 420 ttacattaca cttatgttta cgttaccaag acaacgagtg acttttcatt ttcagttgtg 480 ctcgctgatc ctcgtacatc gcttgttttc agctcctttt acacttgaac aactaaacga 540 atgctcaggt ctatttaacc gactatatta tcgatgctgt aaaaggtact ccatgaactt 600 gaaaatagcc aatcttttgc cggacgtttt cagtggctga acaacacctc tgtgcatata 660 aacccattca taaacaagaa cacagtctac agcacataag ctttgcgtgg ctaaaatagt 720 tttaaaacat acctgtctaa aagaaatact tcagccgtgg tgtcaccctt cagatcgttg 780 ttttgaactg cataacgtca tttccctcca gcgtgcaaaa gtaattccga tattgattca 840 gggtttagaa aagttttgat tctgcatttt tggcagaagc cctttcagaa acaatctttc 900 ttttcatgca tacaaggcca cgttggtctt tcagaaagaa gttcaggttg atgcagagtt 960 tgcaggatga cgtctctctg tcagctgtag atgcgtgctt cacgtgtgcg cgcacgagtg 1020 acgtatctgt ctgcttaaag aggctgcgca gaaattcaaa tttaaatttg ttgacagaca 1080 gtttgagata cctgtcgtaa ttgagttatt tgtcggtccg acaatattta attggatgaa 1140 cattttttat gttttatgcc ttatccagaa tataaaaata catataaata catttagatc 1200 atttacttta atcattacta ttggaatgtg aagagacttt caaccagcac aacaaaaaag 1260 tgtttctgaa gacaatcacc tactgcacct 1290 // ID hAT-N48_DR repbase; DNA; ZEB; 1663 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N48_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1663 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1317-1317 (2008). XX DR [1] (Consensus) XX CC Copies are ~92% identical to the consensus. The TSD is 8-bp long, CC and TIR is 12-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1663 BP; 543 A; 319 C; 322 G; 478 T; 1 other; cagagatgta taaagtacta gagacccaga cttaagtaaa agtacaagtg ctctatcaaa 60 aaagtgactt gagtagaagt tgaagtgctc tttaagcacc atacttaagt ggaagtacta 120 aagtattcaa cattttttgt acttaagtat tgcaagtagt ttatttcaaa atttactact 180 caagtactga aagtaaaagt acaagtattg tgtaatgtag ttattaaaga aagcagtcaa 240 aagacatcat attgttttgt ttattttaaa tatctttttt gggggcacgt gattaagcag 300 aacgaaaaga aacatggctt acgatactgt ttttctctcg cgcttgaacc acagatctga 360 cagattataa cagctttaac acacaccttt caaragttgt gtgtcacaaa aacactccgt 420 aatccactca aaacgctatt gaaacccatt ttcggatcgc aaattaccgg ttgtaaacgc 480 tgtaaacgga agttctaaac attctaccgg aagacgctgg tcgctatggc gccctccatg 540 cagcagccaa gaaaaaggtc tataattgta acgagtaggg gtgtgaatct acactggtct 600 cacggtttgg ttcggttacg attatcatgc catcgattcg gttcaattcg atatctcggt 660 gcattgacga tgctttccat acataattag atttttttct tcacaacaca agattttttt 720 tattaaaatc tataaatata tttatattta tatattattt gtaatacaat tttgttcttt 780 aatacagcag taagatatat gaactgtacc cttaatttga cctgctcaaa gaaaacactc 840 tctttctgaa tgtatcaaac aaaactccaa cacatgcaca gaagacagca tgagcattta 900 tagcaaataa actaatgtaa atattatccc gcttgctttt tgagcgctga caggcgaggt 960 cttacacccc tatgattggt tattgtcttc acctgctcaa cagaaagggc tgcgatcggc 1020 tttagaaatg aaagcgctgt tcaccgatgg cgggaagaac agccgcggtt acagtctgta 1080 tgcgcataat acgcggttat cacaggtaat ccataataga cacagtggag aaaacttgca 1140 cctcaggcac gctcatggtg tctggatcca caagtatcgc tttaacagcc aagaggagag 1200 gaaaagttac ctcacaaaca cgccagcggg tcaaatgtcc aaagtgtgag agagagagag 1260 gtaaagatgg gagttttaca gcatttctcc gtagtagtga aatagcggct cgtgtgctgt 1320 gccgccttca ttgtagtaag agcgttattt actcgccctc gcctccttcg ccatagtatt 1380 ctactgcgac atcgcgcata ggagataaat gacgtcagta cgtaataacc ggttatgatc 1440 tattactgaa ccgatatcaa taattttgac acccctagta acgagtaacg atgcagcaca 1500 taaaaaatct atcggagtaa aagtattaaa ctcatcgaaa atatgtactg aagtaaaagt 1560 ggaagtagga gaaaaaaaca atactccagt agagtacaga tacagccttt tagtacttaa 1620 gtacagtagt gaagtagttc tacttcgtta ctatacatct ctg 1663 // ID LTR-11_DR repbase; DNA; ZEB; 1133 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.01, Created) DT 08-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE Putative long terminal repeat - a consensus. XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR-11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1133 RA Jurka J.; RT "Long terminal repeats from zebrafish."; RL Repbase Reports 9(1), 21-21 (2009). XX DR [1] (Consensus) XX CC 4 bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1133 BP; 250 A; 389 C; 271 G; 222 T; 1 other; tgtaaccccg ccggtcctac accacccaac ccgctccgag ctggtatcga accggcgacc 60 ttccgcatgg gagtcggttg ctctaacaag gaggctaaag accatggcct ctagcgtctg 120 tcgctagagc acctttagag gtcagaggag tgaggtttac ctgcacagca cttactagct 180 ggcctccgtt acactcaccc ccctaaacct cactcccatc cgggtcacgg caccaatgta 240 accccgccgg tcctacacca cccaacccgc tccgagctgg tatcgaaccg gcgaccttcc 300 gcatgggagt cggttgctct aacaaggagg ctaaagacca tggcctctag cgtctgtcgc 360 tagagcacct ttagaggtca gaggagtgag gtttacctgc acagcacttc actagctggc 420 ctccgttaca ctcaccccct aaacctcact cccatccggg tcacggcacc aatgtaaccc 480 cgccggtcct acaccaccca acccgctccg agctggtatc gaaccggcga ccttccgcat 540 gggagtcggt tgctctamca aggaggctaa agaccatggc ctctagcgtc tgtcgctaga 600 gcacctttag aggtcagagg agtgaggttt acctgcacag cacttactag ctggcctccg 660 ttacactcac ccctaaacct cactcccatc cgggtcacgg caccaatgta accccgccgg 720 tcctacacca cccaacccgc tccgagctgg gatcgaaccg gcgaccttcc gcatgggagt 780 cggttgctct aacaaggagg ctaaagacca tggcctctag cgtctgtcgc tagagcacct 840 ttagaggtca gaggagtgag gtttacctgc acagcactta ctagctggcc tccgttacac 900 tcacccctaa acctcactcc catccgggtc acggcaccaa tgtaaccccg ccggtcctac 960 accacccaac ccactccgag ctgggatcga accggcgacc ttccgcatgg gagtcggttg 1020 ctctaacaag gaggctaaag accatggcct ctagcgtctg tcgctagagc acctttagag 1080 gtcagaggag tgaggtttac ctgcacagca cttactagct ggcctccgtt aca 1133 // ID Looper-N7_DR repbase; DNA; ZEB; 1015 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE This is a nonautonomous DNA transposon that belongs to the DE piggyBac superfamily. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW Looper-N7_DR; nonautonomous DNA transposon; piggyBac superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1015 RA Kapitonov V.V. and Jurka J.; RT "Looper-N7_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 283-283 (2004). XX DR [1] (Consensus) XX CC Looper-N7_DR is a nonautonomous DNA transposon that belongs to CC the piggyBac superfamily. Its is characterized by 15-bp CC terminal inverted repeats (3 mismatches) and the TTAA target CC site duplications. XX SQ Sequence 1015 BP; 308 A; 242 C; 217 G; 248 T; 0 other; agggccatga caccccccac tttcggttaa agtctacttc agaatttttt caaaagatgc 60 atgattaatg ggcgtggagc tccgcgagca tcgggcagga gtgggcgtgg ccagcagggg 120 agaacgtgag cgaacgaagc tagctcacaa aatgagacaa accgtgagga gacgcatgag 180 tttatagttt acaaagttaa aatgcaaaga aatgaacagt gatttaatgc cctgctacat 240 ttgttattcg taatttcata tacacataac cacaatttat atcattataa agataagtgt 300 gttcatgtaa accctataaa tgaggacttc tccctcaatc cccgtatcca gcagactcag 360 tgcagcaggt ctcctgacct gtctatttta accattagcc ctgctggtaa tctggaggat 420 ttaggcaaac acagcagcac agtgatgtgt ctgaatgtga acgaactcct gataaaagac 480 agcgtccgcc attctctaat tctcgtgctg ctctcccgac aaaaatgcta gcagcacaca 540 cacagctttg ctgtatgatc ggccctgaca agatcgcggg ggaaaacatg caacaaaccc 600 cgtggatcat ggaaacaaac gcatatgagc cttcatgaac ggctaaatac tgtgtgcccg 660 tgccgtgggt ccgtgacgtg tctcacagct cgggcttgac tctggcaggt ctggcaggtc 720 tcatgaataa ttaagcagcc ggctcttctc ataggataag aaaactccgc tatgaataat 780 aatgagaaac cgacgcgtca tcattgcact tgcagtactg cgctacgtcg ccgattttga 840 tcccgcccca aaaatcattt taaacccgga agctgaaatt agctgacaaa agctcaaaat 900 tatccagttt tccccacaat taaagctgac aggtgctaac attgtcttaa ctgatgctca 960 acacacacaa atctgttaat ataaaaaaaa aagttctcca gggtgtcctg aacct 1015 // ID hAT-N41_DR repbase; DNA; ZEB; 1260 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N41_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1260 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1311-1311 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1260 BP; 434 A; 218 C; 218 G; 389 T; 1 other; taggggtgta acgatacact cagctcacga tacgatgtgt atcacgatat aatgttcacg 60 attcgatatg tatcacgata tttggaataa aaattgaaaa ttaaactgaa atgcaaattt 120 taccaagtaa actatttttt atgtatttct tttgtttcaa cttgttaaac tgaaccttct 180 ttaagaaaat aaattaaaca ggcctgtctc tggaaaataa aacaatttaa aaacttctta 240 aaaaatatta ttgaaatgac agagacaaaa ttaaggaaca atttaagtaa aggtgcaaaa 300 aaaaaaaagc tttctattca attgaattta acagtaagag cttaaggctt tgcttggtca 360 cttgcgtttt ttgtgtgtgc aaaaaaaaaa tccacatttc caggtattta attcatattt 420 aattaaattc atttagagat atctctaatt acaattgtga ctagtcaaaa ctcatttaga 480 gatatctgca aatatttttc aatggaagtc aatggaggaa tatgactagt cataatatat 540 ttgcagatat ctccaatctg atttcgtact agtacaattg taattgcaga tatctctaat 600 tgtcattctg actagtcgaa ttataattat gactagtcaa aatataatta gagatatctc 660 taaatatatt tatggatagt cagaacaaag gtttaaatgc taaaacggct tgccatacgc 720 gcgcgtctat cawagtccgc cctctgtagt aacagctcac ggtcagctca cgtcatgtgt 780 gattttgcgg cgggaaatac atcattatat gaagacagaa tgatatttta gggtgaattg 840 taccttataa atacagtatg tgactctttc aacaccatta ggaggtatcc ctgtcgctgt 900 gcaaagtatg taaatcactg tgaagacttt aaaacttagg atgtttgacc cagtatgcgt 960 gtcaaaaagc ggacgagtct aaagcaatga ctgtacaacc gaaatgttgc cagacgtctg 1020 attttgagag gatgggccat cctctatttc agctccactc atcttggtct gctaacatta 1080 ctgctgctgc aatctgaact cacgtgcgac agcaggtgga acgtagctca cgtgcaaaca 1140 gctcaactaa taagagaaaa cggacgtcat atcgtaatca aatcgtcata acgccacgat 1200 gcatattgtg acatttttgc atcgcaataa atcgtacaac gataaatcgt tacaccccta 1260 // ID Gypsy49-I_DR repbase; DNA; ZEB; 5683 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy49_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy49-I_DR; KW Gypsy49-LTR_DR; Gypsy49_DR; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5683 RA Dib M.R. and Naveira H.F.; RT "Gypsy49_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 829-829 (2007). XX DR [1] (Consensus) XX CC Gypsy49-I_DR is an internal portion of the Gypsy49_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy49-LTR_DR. CC Gypsy49_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein the 1591-aa polyprotein CC Gypsy49_DR1p (pos. 737-5509) composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the high identity between their CC flanking LTRs. Consensus obtained after the alignment of at least CC five independent insertions bearing at least 85% homology over at CC least 1000bp. XX FH Key Location/Qualifiers FT CDS 737..5509 FT /product="Gypsy49-I_DR_1p" FT /note="ORF." FT /translation="MIRKKKDTLIATLKRSLVERGVLEVRTAVLCESSVFS FT SCDIDVESELKFPEMSIQEKQLYLDAEKLRADREGRAMKERETEREFEARK FT LEQQLSVRKLELELEREREERAFQLKKLELELAAKRAETPRADVHPVPEQS FT SALVFDVHRNMRLVPPFSEKEVEKYFDHFERVALSLKWPKQFWTLLLQCVF FT TGKAQDIYSALTLEQSGEYETVKTAVLHAYELVPEAYRQKFRNLIKTENCT FT FVEFAREKESLFERWCTSMKVETKEQLRELLLLEEFKHCVPSAVATYLNEH FT KVSKLADAAIMADEFVLTHQGTFSSASFRSDMNKFRSCQVKQKPAVVGQNI FT RAPVVGNIGKTTVSRDLICFYCKKSGHKISECPVLKKKEKFSKPVSLVAPS FT TVDDTFIESAGHLSHSCGALFDQKVNDFVDYTPFITEGAVSLLGSEEMVPV FT RILRDTGSAQSFLLEGVLPLSSETATGTCVLVRGFEMGFVDVPLHRIHLSS FT TIISGDVVVGVRSALPVPGITFILGNDLAGGNVWGESKVTALPIVISAPSK FT SDVEKSNIFSNLFPACAVTRSMAKQLPENQATDVSLGNTFFSTVDSGEFTC FT DLPESPEKDKNICSSLSPDLEEITNSPQSLDSDATEVLHTASSDVDKYPLT FT SLLQVPREELIHAQQLDDTLKTVVSQAGSNMGDLSSYFFEDGLLCRRVAMF FT DDNLKSRTQIVVPLAFRESVMQLAHQGLAGHTGVRKTYDRIMRQFYWPGVK FT RDVARFIRSCHTCQLTGKPNQKVPTAPLQPIPVTSNPFDHLIIDCVGPLPR FT SRAGHHYLLTIMCQTTRYPAAYPLRSITTKSILKALTNFMSIFGIPKVIQS FT DQGSNFMSKQFAKALRQLRVKHNISSAYHPQSQGALERFHQTLKSLLRSYC FT VELDGDWEEGLPWLLLAIREVTQESIGFSPNELVFGHEIRGPTTVLADVWH FT SSKPPDNLIDYVSGFRRRLYEMQAVAKKKLGKSQKKMQQHFDKKAKFRSFQ FT TGDQVLALLSLPTNPFQAKFTGPYSVAKRLSDNNYLLNTPERRKKVQVCHI FT NLLKPFVAPVCSASVSVAPLCDHSGLEHLSVFPSLSCSMGEDLSEKNEDGS FT LLSQGKIDGRLNNSQILSDLSGHLSYLTEDERNDIIELVMSFPSLFLDVPG FT RTTVIEHDIDVGNSCPIKQNAYRVNPIKRELLKREVNYLLAHNLAEPSFSS FT WSSPCVLVNKPDGSYRFCTDYRRLNSFTKPDCFPLPRIDDCVDHVGSAKYV FT SKFDLLKGYWQVPLTARARELSAFVTPDSFLQYTVMPFGVRNAPATFQRLV FT NRVLHGMSGCEAYLDDVVLYSSTWSEHLDQIKDLFSRLAEANLTINLAKSE FT FGKATVTYLGKIVGNGYVKPIGAKVEAICNFPTPSNRRELRRFLGMIGYYR FT SFCKNFASIVSPLTDLLSPKIEYQWSETCQQAFENCKALLASSPVLTAPNF FT EKSFRLAVDASGCGAGAVLLQEDLNQVEHPVSYFSKKFNRHQQVYSTIEKE FT ALALVLAVQHFEVYLGSIAAPIVVYTDHNPLVFLNRMKNRNQRIMRWSLIL FT QPFHLEIKHVRGKDNVIADTLSRV" XX SQ Sequence 5683 BP; 1528 A; 1057 C; 1255 G; 1843 T; 0 other; taaattgggt ggctcgtccg gcattccttt atgcgggtga gtagtttttt tttggattgg 60 ctattcacct ttatatacta gaaactccgg ttagaaagag tgtatccagc tgggttgtga 120 atacaatcct tccatcggtg cactgtttgt tgttttgttt attattttgc ctttcttatt 180 tttggaggtt atcctaggtg gagattttaa tctctgtcgg gggtacgctt tcggactttt 240 ggtctaaaga ccgattgtca ctgagctcag cgtttaatgc cgccccgagc ctggtacacc 300 tctgaagact agataggaat tagttagttt tcttcttaga taggagctgg agtttctctg 360 agtataaagg tgaatatatt ttgttactga cggcgtttgt gattcgtgcg cttctctctt 420 cctcgcgtac ctacgtgtcc gacttgtggt aagtctttga agtttttttt gtgaatgggt 480 agatgcgtaa cgttcagttg tagttcattc cgaagcagtg gcttatggga attgtagttt 540 ttgtacagag ggcagtttcc gggtgagtac gggagtgttt gtttgtttat ttgtgttgta 600 ttcgtgaatg ctaatacaaa aatgtcttca gtactcgatg atttttttgc ttgtccctcg 660 gaagctttaa ttgagcgttg tacaaaggaa gaattgatac aaattgccga gcgttttagc 720 atcgatctca ctacacatga taagaaaaaa aaaggacacc cttatagcaa ctttgaaacg 780 atctctggtt gaaaggggag ttttagaggt aagaactgct gttctttgtg aatcttcagt 840 gttttcttcc tgtgatattg atgtggaaag tgagttaaaa ttcccagaga tgtcaataca 900 agagaaacag ctgtatttag atgcagaaaa gctccgtgct gatcgcgagg gtcgcgctat 960 gaaagagaga gaaacagaac gggaatttga agctaggaaa ctagaacaac agttgtctgt 1020 acgaaaattg gaacttgagt tagagagaga acgggaagag agagcgtttc agttaaagaa 1080 gttggagttg gaattagcag caaaacgagc ggaaacaccg cgagctgacg tgcatccggt 1140 gccagaacag tcgtctgcat tagtgtttga tgttcatcgt aacatgagat tagttccgcc 1200 attttctgaa aaagaggtag aaaaatattt tgatcatttt gagcgtgttg ctttatcgct 1260 gaaatggcca aaacagtttt ggactttgtt gttgcagtgt gttttcacag gaaaagctca 1320 agatatttat tctgctctca ccctggaaca aagtggtgag tatgaaactg tgaaaactgc 1380 tgtattgcat gcatatgaac tggtgcctga ggcatatcgt caaaaattta gaaatttaat 1440 caaaactgaa aattgtacat ttgttgaatt tgcaagggaa aaagagagtc tatttgagcg 1500 atggtgcact tcaatgaaag ttgaaactaa agagcagtta agagaattgt tacttctaga 1560 ggagtttaaa cactgtgttc cttctgctgt agctacttat ttaaatgaac acaaagtgag 1620 taaacttgca gatgctgcca ttatggctga tgaatttgtt ttaactcatc agggcacatt 1680 tagttcagcc agttttcgaa gtgacatgaa taagtttagg tcttgtcaag ttaaacaaaa 1740 gcctgctgtg gttggacaaa atataagggc acctgttgtt ggtaatattg gaaaaaccac 1800 tgtttcaaga gacctcatct gtttctattg caaaaagtct ggccataaaa tttctgaatg 1860 ccctgttctg aaaaagaaag agaaattttc aaaacctgtt tctcttgtcg ccccatcgac 1920 tgtagatgac actttcatag aaagtgctgg tcatttatct cattcctgtg gtgcattgtt 1980 tgaccaaaag gtgaatgatt ttgttgatta tacacccttt atcactgaag gagctgtgtc 2040 tttattaggt tctgaggaaa tggtccctgt gcgtatcctc cgagacacgg gatcagcaca 2100 gtcatttctt ttggaaggag tgttgccttt aagttctgaa actgctactg gaacttgtgt 2160 attggtaaga ggttttgaaa tgggatttgt tgacgtgcca ttacaccgaa ttcatctttc 2220 ctcaaccata atctctggtg atgttgtagt tggtgttcga tctgcacttc cagttcctgg 2280 aattacattt attttaggaa atgatttagc tgggggaaat gtttggggag agagtaaggt 2340 gactgctttg cccatagtaa tatctgcacc tagtaaatct gatgttgaaa agagtaacat 2400 cttctctaat ttgtttccag cctgtgcagt cacccgatcg atggcaaaac aactgcctga 2460 aaaccaagct acagatgttt cacttggtaa cactttcttt tccactgttg attcgggtga 2520 atttacatgt gatctgccag agagtccgga aaaagacaaa aatatctgct cttcactatc 2580 ccctgatctg gaggaaatta caaattcacc ccaaagccta gactctgatg ccactgaggt 2640 cttacatact gcttcttctg atgttgataa gtatccttta acttctctgt tgcaggttcc 2700 cagagaggaa ctaattcatg cacagcagct tgatgacaca ttgaaaaccg tagtctccca 2760 agcagggtct aatatgggtg atctttcttc ttatttcttt gaagatggtc tactttgtcg 2820 gagagtagcc atgtttgatg ataatttaaa gtctagaacc cagattgtag tccctcttgc 2880 ttttagggaa tctgtgatgc agttagccca tcaaggactc gctggtcata caggcgttcg 2940 gaaaacctat gatcgtatta tgcgacagtt ttactggcct ggagtaaaac gagatgtggc 3000 cagatttata cgttcctgtc acacatgtca acttactggt aagccaaacc aaaaagtgcc 3060 aaccgctcca ttacagccta ttcctgtgac atctaaccct tttgaccatt tgattattga 3120 ttgtgttggt cctttacctc gttccagggc tggtcatcac tatctcctta ccattatgtg 3180 ccagactacc cgttatcctg cggcttaccc actgagatct ataactacga agtccattct 3240 gaaggccctt actaatttta tgtctatatt tggtatcccc aaagtgattc aatcagacca 3300 aggttctaac ttcatgtcca aacagttcgc aaaagccctc cgtcaacttc gagttaaaca 3360 caatatttct agtgcatatc atccacagag ccagggagcg cttgaaagat ttcatcaaac 3420 cctgaagtct cttttaaggt catattgtgt ggagctcgat ggtgattggg aagaaggcct 3480 gccatggttg ctattagcta tacgtgaggt aactcaagaa agtattggct ttagcccaaa 3540 tgaattagtt tttggacatg aaatacgagg gcccactact gttttggctg atgtgtggca 3600 ttcatctaaa cctccagaca accttataga ttatgtaagt ggttttcgtc gtagacttta 3660 tgagatgcaa gctgttgcta aaaagaaatt gggtaagtca caaaagaaaa tgcagcaaca 3720 ctttgacaag aaggccaaat ttcggagttt tcagacaggt gatcaggttt tagcattgtt 3780 gtctcttcct actaatccat tccaagcgaa attcactgga ccttacagtg tagctaaacg 3840 tctttcagac aataattatt tgttgaatac tcctgaacgt cggaagaaag ttcaggtttg 3900 tcacattaat ttgttgaaac cttttgttgc tcctgtttgt tctgcctctg tcagtgtggc 3960 gcccctatgt gatcattctg gtttagaaca tctctctgtg ttcccttctc tgtcttgcag 4020 tatgggtgag gacttatctg agaaaaatga ggatgggagt cttctgtccc aagggaaaat 4080 agatggacgc cttaacaact ctcagattct ttctgatttg tctggccatc tgtcttattt 4140 gacggaagat gagagaaatg acatcattga attagtgatg tcatttcctt cacttttttt 4200 ggatgtccct ggtcgcacta ctgtcattga acatgacatt gatgtaggaa attcttgtcc 4260 tatcaaacaa aatgcttata gggtcaatcc aataaaacga gagcttttaa aaagagaggt 4320 gaactaccta cttgctcata atttggctga accaagtttt agttcctgga gttccccttg 4380 tgtattggtg aacaaacctg atggttcata tcggttttgc accgattata gaagattaaa 4440 ttcttttacc aaaccagact gttttccctt gcctcgaata gatgactgtg tagatcatgt 4500 tggctctgcc aaatatgtaa gtaagtttga tctacttaag ggctattggc aagtaccttt 4560 aactgctcgg gcaagagaat tatctgcttt cgttacccca gactctttcc ttcaatacac 4620 tgtgatgcct tttggggtcc gtaatgcccc cgcaacattc caacgattgg ttaatcgtgt 4680 attgcatgga atgtctggat gcgaggctta tcttgacgat gtcgttttgt atagttctac 4740 ctggtctgaa catcttgatc aaatcaaaga cctattttca cgtttggcag aggctaattt 4800 aacaatcaat cttgcgaagt ccgaatttgg aaaagccaca gtcacctatt tgggtaaaat 4860 agtgggaaac gggtacgtta aaccaattgg agcaaaagtt gaagccattt gtaattttcc 4920 aactcccagt aatcggcgtg aattacgacg atttcttggg atgattggtt actaccggag 4980 tttttgtaaa aactttgcta gtatcgtttc acctttgact gatttattga gtccaaaaat 5040 tgagtaccag tggtcggaga catgtcaaca agcctttgaa aattgtaagg ccttgttagc 5100 tagttcccca gttttgacag ctcccaactt tgagaagtcc ttccgtctcg cagttgatgc 5160 tagtggttgt ggtgcaggcg cggttctttt gcaagaggat ttaaatcaag tagaacaccc 5220 tgtaagctac ttttcaaaaa agtttaatcg tcatcaacag gtttattcca ccattgaaaa 5280 ggaagccctt gctcttgttc tggctgtaca acattttgag gtttatttag ggtcgatcgc 5340 cgcaccgatt gttgtttata ctgatcataa ccccttggtt tttttaaatc ggatgaaaaa 5400 ccgaaaccaa cgaatcatgc gatggagtct aattttacag cctttccatc ttgaaattaa 5460 gcatgtgcgc gggaaagata atgtgatcgc ggatacactc tctagagtat agctaattca 5520 tattgcatgt ttcactcttg ttgttctttt tgcacttcta gggcccagag gtgttggaag 5580 aggaggaaga gtgagctgtt ttatgaaagg atgtatttct gttattgttt gctaattttt 5640 tttttctttt tgattctttg cagaatcttt ttgaggaggg agg 5683 // ID DNA-2-14_DR repbase; DNA; ZEB; 2750 BP. XX AC . XX DT 11-NOV-2008 (Rel. 13.11, Created) DT 11-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-14_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2750 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1705-1705 (2008). XX DR [1] (Consensus) XX CC Copies are ~99% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 2750 BP; 978 A; 456 C; 403 G; 912 T; 1 other; cactgtaaaa aatgttactt agatctaact tataaaaagt gaggcaagtg gttgcatcgg 60 acgttttagg ttgattcaac ttccagcctt ttttaagtat tttaaacact aaaacattgc 120 ttaaaacaaa tgcaataaaa ttaagttgag ccaactaata tttctaaaat tactcaaatc 180 agataaactt actttttttg ttaaacttaa cttacttata cattgctata tgttgactgt 240 actcatttta attaagtatt atgaacttaa tgatctaaat aaaattaacc atgcatatgt 300 catttaaaac gaatttaatt gacttctggc tgtaacctca gaaaataaat cttcgtttct 360 atattgtaag cacaatatac actagtaaaa acagataaac agactttcct gtcaataaca 420 attaaataag aatgggaaga tggattaaag accataacat gtattcaata tcaatcgatg 480 ttccttttaa tagtgactac aaaataggcc attaatgtgg ctacacgtac acaatacata 540 aataaaaaca ttaataatac cagatcctat tgagaaatca tccaaggcaa ctcagtgagg 600 agatttgcat attttgatga agatttagga tgcataatgt ccagtcccac aaagagtctc 660 taatatatct caaatgtata cagttttgat ggatacttga agtcgaggac atcggtaagt 720 ccaaacagga gccatcatgc atttgacagg tttggcaaac ctctgacagt cctatgataa 780 tccaaactgt gttgggctga tggcaaattt ttcccgtggg cagccgcatg cacctctgat 840 tcctcacaac tctattggta gagacaaaaa gaaacaggag ttcaggtggt tgctaccaaa 900 caacattaaa tagctaaata ctgaatagga aataactaca agtgaagtta caaaagtcca 960 gtgagaatga tgcatggatg aaacacctta gtttatcagt taaaaattac cttagttaaa 1020 attacctaac ttctatttaa aaaataaaaa aaggttatag aggttgtgta attacattat 1080 tccaggtgtt tttagttatc aaattttaaa gaaaactgtg attttactca gtgtaacagt 1140 gcatgtcatt gtggtttcct gtcaatcaaa gaattgttaa aatttaggta ttacattttt 1200 tgtttatttt ggcaacttat aaagcataag aatgtgtatg taatttagat attaaaactg 1260 tttgaaaatg gctatatgaa aattatagaa tacaaattct ataatttaat tgtcatcttc 1320 attttgcacc aaactttgca cgtcataata caaattttat taaataaata aataaataca 1380 ttgccatttt acacattaca ttaaggttta cattaaattt ttgctaaagc tacatttaat 1440 tgactttttc ttcaaattta atgtcactgg caaatgacag ttcagtgatt cagtgttatc 1500 agttagcttt attgaaagac tgacagataa taagaaactg cattaggata tttttatttt 1560 tactgttttt actgttgcac acataaaaaa aatcttcaga tatcaaatgt aagttaattc 1620 tatggagccc actattgttt gcaattttac caatgtttta cttacctgca aataattata 1680 gctgtaattg tatctaatgt ttttcttacc tggatgtgaa gtttgaagat ccacaggtaa 1740 caaaatcact ttggtatcta aagaccaaga acatgcattt tagaatgagg tgcatacart 1800 ttaaatggtt ctgtaaacat cagtagtaaa tgtaatttaa gttaaaataa attgtaaatc 1860 tgtaatttat tcactatgca gcattaaaaa aaacttatgg actaaatctc taaaacacaa 1920 actttaagtt atactttctc ttacgttaaa cacaaactac ttaactgttg catacgtttc 1980 tgcatttaat atccatcctt aatgtgctgc tataacttag gtgagctaac tttagctaag 2040 ttagcgctaa caaactgaac ccccacaccc cgaccgccca ccagtgacct tgttaatcgg 2100 ttctgcaacg ttagttatac ctactaagtg ttttaaaact actttcacta ttttttctca 2160 ataatatcac ttttaatcgt tagcataaac tagcgttggt tagcgttagc caacttagct 2220 ttccatgtga gagcagtcta actgttaaca tacctgaggc tacagtacaa tttacaaagc 2280 taaaacctca cctgcctagc gaagacagca gttttgcatg actttacaat tatcattcgg 2340 ctaaatttgc aacttaccgt gttaatcaga tccgctgctg ggagcagtta atgtcgacca 2400 ttgcagaagc tgcgtctgcc tgctaaatcc cgggcaatga aaaaggagtc acgccaaact 2460 attaaaatca cgttttatga acacctgccc acatcgacag ctgctctaat aaattgtctg 2520 tgtcaactca cttttataac atttattcta cacaataacc acgtaaaaaa atctactcaa 2580 ataaattgca ttagttttat tcaaaatata cagttatata aaaactgata tattttaagt 2640 aaagaggaag caaattttat ttttttggta taaaacaaat aaaattaact agattgcaat 2700 tatttaaagt agatagaacc aactttgtga gtgtagcact ttttacagtg 2750 // ID Gypsy153-LTR_DR repbase; DNA; ZEB; 166 BP. XX AC chr14; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy153-I_DR; KW Gypsy153-LTR_DR; Gypsy153_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-166 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1568-1568 (2008). XX DR Genome; chr14; Positions 10035164 10034999. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 166 BP; 46 A; 31 C; 38 G; 51 T; 0 other; tgtatattcg tggcattgtt atttcactac agcacccaga atgcattgtg acgtgaaccg 60 gaaatgccgg ttgttgtagt tgtcatcgga tatagcgaag taaaacggag agcagtaaaa 120 acggtctgtg tgtcatctta ttccttatgc aacgttatta tccaca 166 // ID DNA-2-11_DR repbase; DNA; ZEB; 989 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-989 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1702-1702 (2008). XX DR [1] (Consensus) XX CC TIR =22 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 989 BP; 313 A; 160 C; 170 G; 342 T; 4 other; ggggagagcg gggacgaaag taacgtggga cgaaagtaac aaagcgattt tctcgaagcc 60 ctgacaacat ttgctttcca ggctataaca gcacattcag catgcaaccc ttgatggagc 120 tgccaaatat catctgattt cgggaccgtg tcccagcggt tagctccaaa tttcagtttt 180 taagtgcaaa aagtaaatta ttgtagcggt tcttttttcc ttattacaag ttgttattwy 240 gttttgtttc tcttttcatg tgtcacacta atgatcaatc tacaggttat ttagtgcaat 300 aacacatcct taagtttgaa atgtctgagt ttttcagttt aaaagtaaat caggtttaaa 360 tggcaagagt tcctgtgggg acgaaagtaa cactgttatt tatgtcccac tgctgctaat 420 agmcatacct tatttttcac ttccacatta gagtttgcag tgtttkgatt catatgtttt 480 ataaaattaa tgcatattgc caataataat acaaaaatat tcaaaaattg tagaaaattt 540 taacattttt gcattgttgg attgcttttg aaacatttga tggaactgaa atttaaaatg 600 aaaatgcagt ttaatatttt tttgcaaaaa taaaggacaa aatatgtttg cagtgaacat 660 taacaaggtc atagtcagat atatttaaaa taaacaagat taagccagta aatctagcaa 720 atattgttgt gttacttttg acccacccgt gtgttacttt cgtccctgct gctatggggt 780 caaaagtaac aaatgtgcaa cttttgtgca acttttttta aagtgaagtt atattggagt 840 tgttctacat tgatacaaaa taacacctga ttttgataga ccactcttgt gagttagtaa 900 ccacagcaaa aatatatttc tgtctaaaac attatctttt aaaattaggt tttttaatga 960 aaaatggtac tttcgtcccc gctctcccc 989 // ID DNA-8-18_DR repbase; DNA; ZEB; 1032 BP. XX AC . XX DT 01-DEC-2008 (Rel. 13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-18_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1032 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2141-2141 (2008). XX DR [1] (Consensus) XX CC TIR is 12-bp long. Copies are ~91% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1032 BP; 299 A; 229 C; 186 G; 317 T; 1 other; tagtgatgtg cgataccact ggtttcctat ccgatccgat accaagtaaa ataaaggcga 60 gtatcggcga taccgatccg ataccgatac tttgcccaaa aattactgtt tgggaaatta 120 aagtaacaag tagtgtactt aaaagtgtca ctttatactg aatacgaaga catattacgg 180 gttattcttg tttattgatg aagaattatc atacaaaaaa agagccaact tacatattta 240 ttgacagcat ctgagcatgt atatcttaaa acgcctgcct gaacatctta acataaaaca 300 ataaacattc acacctttta gtttagtgac atgttaatcc ggtccccttg aagattttta 360 acctttttct tttgttttat cattgtcaca aaaacccaag cccccaaaat acagccacat 420 cgccacattt tctccctctg tacctggata gtgcctgcct gctgcaccta aggactctgm 480 aagcgatgtc tgtctcgccc tagcagcacc tgtgcctctc ctttcctcct cttctgaccg 540 cctagtcatg acgtagtcat attctgtctt gtgatttata tgtagatgtt tgattaggtt 600 tgtggtatta ccacagtacc tcactgtttt gttacacgtg tcacaaacag catcgtttga 660 gtttatggaa gtgaagtagc tccacacaaa acttcgcttg cggtcaccca tatccaatga 720 cagcgtgact gaacaacatg tgccacgcca gcggactgac tgagtggcta gctagcgtgc 780 tctgcagttg tacagtgcca tattacgcac atgacttggc aggcggaaga acaagtagtt 840 ccgactaagt tggcattatt cagataaact agggtaattg tggttacttt ccactgtatt 900 tctatttcca ataaactaca tatttgctcc aaattactga aatatcgata ttttgataag 960 ataatcgatt ctagaacata caaagctggt atcggagata tcgatatttt agtatcgatc 1020 cgcacatcac ta 1032 // ID Gypsy93-LTR_DR repbase; DNA; ZEB; 249 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy93-I_DR; KW Gypsy93-LTR_DR; Gypsy93_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-249 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1498-1498 (2008). XX DR Genome; chr18; Positions 27133052 27133300. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 249 BP; 85 A; 38 C; 58 G; 68 T; 0 other; tgtaatgttt gtgtttaact ctaagccaat gatgttatat aagcaggaca ttccatgttc 60 ctattggcta gaaggagtgt cagtcaagtt cctcaaggaa atgatagagg ggataggaaa 120 caagggaaag catgcgatgc tgtaagaaag cgctaagaaa gcgctatgtt ctgagttaag 180 ttgatattaa aaaaaacata cgttgttaca gccagcctgt gagtggtcta atattcaact 240 tacacaaca 249 // ID ZFERV-2-LTR_DR repbase; DNA; ZEB; 462 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 01-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ZFERV-2_DR; KW ZFERV-2-I_DR; ZFERV-2-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-462 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1609-1609 (2008). XX DR Genome; chr14; Positions 54408097 54408558. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 462 BP; 135 A; 77 C; 107 G; 143 T; 0 other; tgtagtgttt ttcagtttaa ataaaacttt gacatgtttt ccatgcataa gaagtcttta 60 atgattgtcc catagggtgt tggttactat gaaatcatct gaagggttaa cagccatatt 120 gtgagcacgg tatagaccag tgttgcagac tgctgagaca agcacatcct gttattgaat 180 ataggggcga gttctaagaa tgacaggatg tatgagccat gtgtgtgtta agaggagaca 240 tttgccttaa aaggttaact cttaagagaa gatgcacgga agacaactgt ggtataaaag 300 gatgtgcctc tgaaggagag tttagagaac tgtacctctc ctgcgcagat tgtatgattg 360 cttttggttt ctctcctgct tgagcaaata aaacttataa tttttgacag attgactccg 420 aacccatttt tggacatgca ctggaccctt tgaagtccaa ca 462 // ID LTR-6_DR repbase; DNA; ZEB; 654 BP. XX AC . XX DT 03-NOV-2008 (Rel. 13.11, Created) DT 03-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE Putative long terminal repeat (consensus). XX KW LTR Retrotransposon; Transposable Element; Nonautonomous; KW solo-LTR; LTR-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-654 RA Bao W. and Jurka J.; RT "Long terminal repeats from zebrafish."; RL Repbase Reports 8(11), 1770-1770 (2008). XX DR [1] (Consensus) XX CC TSD =4 bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 654 BP; 222 A; 129 C; 99 G; 204 T; 0 other; tgttggaata tcaagccaat tccatgcgtg ttcaataaga aaaggtctgg agagggtttt 60 cttcaatgcc aaaaatatta gtaatttatt agatacaaat gaataattcg atgacagagc 120 tctgggttac gttaccccaa tgcaatacaa gaattacatt caagaggttc gcaactaaca 180 tacatttccc tgactccagc atatatacac aactgatatt aacaggtgga gttttggttt 240 aacgtcactt atcagtcatt ctgcagtcct ttggctcttt gtaaccccta gacatacttc 300 ctctttctgc aacccttctc tgcataccag aactgaacaa ttacgtgcat ctttaatata 360 tttcacaact tctacaagca tcatgacaac ttgtgacatg tctagacttt gtattaaaca 420 gcagtctgga acaaaggccc ccaccctaaa tttattctta ttctgctttt tccacacttg 480 tcagcagttt cttctaaaaa gtatgcagca cagtaaaaaa gaggaagtgt gtctggtcaa 540 gatgtggtat aatacaaaag aaaaagattg attaatctgt tgttagtcta aacatttttc 600 taataaaaaa taaagacaca aaagtaataa aaaaataaat tccttttttc caca 654 // ID DIRS-1C-LTR_DR repbase; DNA; ZEB; 618 BP. XX AC . XX DT 21-JAN-2009 (Rel. 14.01, Created) DT 21-JAN-2009 (Rel. 14.01, Last updated, Version 3) XX DE A solo-LTR derived from DIRS retrotransposon - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR; MER6; DANA; SINE_DR2; DIRS-1-LTR_DR; DIRS-1B-LTR_DR; KW DIRS-1C-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-618 RA Jurka J.; RT "DIRS LTR subfamilies from zebrafish."; RL Repbase Reports 9(1), 3-3 (2009). XX DR [1] (Consensus) XX CC See comments in DIRS-1-LTR_DR. 97% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 618 BP; 161 A; 174 C; 114 G; 169 T; 0 other; ttaagtgaag tttatttata aacaaatttc gagaggatca cgtgcttatg attgttcaca 60 gctgttccaa ctttagctaa tgactgatta tcctatcaga cgatccttaa ctcactataa 120 ataaccagac ttttctcatc tcaatatctt cgtcttgaag aaaccccccc acccaaccct 180 actttccctc ctttcaaacg ggcgtcacgg tggccagcga ttagcgctgt tgcctcacag 240 caagaatgcc cctggtttaa ttcctttcca aaccagacgg catttctgtg cggagtttgc 300 tcgttctccc cgtggtcgcg tgggttttcc ccgggtcctc cggtttcctc ccacagttca 360 aaaacaagca ctctaaacaa ttaatccaaa tactttagct aaactctgag tacacctttc 420 agcatccata tctgacactt agctacaaca agcaagaggg ggagtcatcg agatctacct 480 gagctcaaac tcccctctcg ccttgcaacg ggagggagcc cagggctcga ggatcttata 540 agctcagggc tctctcccgg gacagcacgc caaactagct ttattatcaa tcatcagcta 600 agtgtgaact cttgaaat 618 // ID Gypsy134-LTR_DR repbase; DNA; ZEB; 1201 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy134-I_DR; KW Gypsy134-LTR_DR; Gypsy134_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1201 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1530-1530 (2008). XX DR Genome; chr3; Positions 52732042 52730842. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1201 BP; 221 A; 224 C; 300 G; 456 T; 0 other; tgtggccaga caagcgccgc ccctctgtgg ttgcgggtgg cataccacct accctattag 60 taatcagtgg gggaagatat aaaatgacaa ccaggatggt atgtgtcacg agtcccgccc 120 cttccgactg gtcacttccc gctttcagtc actcgccggc agtggtgcgt cttatcggga 180 tcacggtgtt ggtgggcatc tacggctgct cttctgcagt cagcagttcg tttcactgcg 240 ctacaaaact acgtttagcc caacggatat gtttaggact tattaatgat tgattgagcc 300 cgttttcctc tgtgtgggcg ttctgatgct cttgattagg cgaaattgag tgttgcgacc 360 gaaaatgagt gctgggtgcc gtatgacaac ctatagctac gtgggaggtt tctatagacg 420 ctgctgctta cccgcggctt aactgctact taaccgcggc ttaactgctg cttttactgc 480 tgtttaactg ctgctttgat gctgtttaag aatttttggg gatgatgctg ttttaatgta 540 gtaaacatca gtttccctac acattcagtg ctaggtccgt atatttgaag gttgtggttg 600 accgatttga atggtccttg tgttcttttt ttttctttta tttgggtaat ttcgcctagt 660 tttgttagat atatatttat tttttacttt tgtgttggtt tccttgaaat tagtattttc 720 taatgtgatt tctttctgtt acattgtttt ttttttgtat gctttattga aattgtattt 780 gttttttttt tgtaacagtt ttcgtgtatg tatattcaag actgtttaac ccttttgtgt 840 cttcctctta tattttgtat ttttgggttg cttcttcagc agagggttca ttaagagggg 900 ttgtaataca ttagcattgc agtgtgttgt gcgcccaagt gattgtgtgt tagagtgtca 960 gtgaccagtt tattaaaact ttgtagtaac tggaatttgt gttttcagga gcagagcacc 1020 taaggcaggg tggattcctg gtggaggtct ttcttgcacg tgtgctgtgt gtatgcgctg 1080 tgtgtgtgtg tgtgtgcgcg tgtgtgtgtg tgtgtcacag gtcttggtgg ttttgctgtg 1140 gaggccttgt tgtgccctta ttctctgctg tcacgtctca gctcctgcac tcctaaatcc 1200 a 1201 // ID DIRS-7_DR repbase; DNA; ZEB; 5393 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 09-MAR-2009 (Rel. 14.04, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; DIRS-7_DR. XX NM DIRS-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5393 RA Bao W. and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(12), 2160-2160 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5393 BP; 1483 A; 1567 C; 891 G; 1452 T; 0 other; gtgaagtttt taaactaatt tcgagaggag cacgtgatat aattgacagc agctggccac 60 ctatctacac tcattagtta gccaatcaga tctatcctaa cccactataa gtagcctagc 120 tagacattac tcccttatct tcgttttccg aagaaacaaa cgacaggtcc gctctagctc 180 ctccaaaagt tactcacgga caaaacggat cctccacatc tcctactatt cttcaactac 240 aataccacac cacctttaat accatcgtca gcaactacaa aaaatgacaa aaacaacaac 300 aacagcgacg gcggctgaga tcacaacagc atcagcgcag ctacatacaa gaaagaaagc 360 tccgggctcc ctgaaacacc agccccgtct taacaacaac cagcccagca tgaaagtcag 420 ctcgagaccg aggctccaac gagaggccgt aggcctatcc actccagcca gatcacaatg 480 caatccccag ctccgctacg taccttctcc agcatcatca tgtgtcactc atatccaaca 540 acgactgtca ctgaactcct ccaaactcct agacgccggg accataattc aaagccgctg 600 cattaaaccc aaaacacttg cgaatccacg tcttcaccgc cacccccagg acataacacg 660 ctcctgccac accccctatc ctaaacagac attaaacccg caatctctcc gcctccccgg 720 agaatcaagg aaaggattaa aaagcaatat tggatttgga catattgcag caatcataat 780 tctaaacacg gataaaacct atctcccgga caataaagca actcttggga atcccacctc 840 acctcttcca tcctgcagct cccgattcca gtgcacactc cagtcctcct ctttcagcga 900 tccttaccac ctacgcaccc cgcccctcta ccttgcgtat cattgctttt tccccctctc 960 atttttacaa gctcgcagct actgctgatt attttctatt ataaaataga aattacttgt 1020 cgacttttgt caatatgttt atgcttttac tgaattacgt ttgtaactga cctaattttc 1080 tgaatttctc attccaggtt tcgtctactg aagcacagcc gatctcggcg ctcccttccc 1140 acaacctcac tagccgtcca ccaacactac accagaaatc acagagcaat tccctgttaa 1200 gcacatccat accagcccta ctgaaatgga cattagaacg tttctaaaaa cgtctaataa 1260 cggctctttc atctcctcat aattctgcct ttttcagctt taacagcatg attctacctg 1320 atgaatatgc ttaaattatc acgttataaa ccaagcaatc tctctaatct agacggctgc 1380 agcgctcggt tgcaatcaac attacacccg ccacccgatt tctggcaatc ggaggtcaca 1440 attacatttc gtagtccgtt taacattcgg atgcaaaagt agcccacaca tttatgctta 1500 aaaacgatcc gctggatttc gccaataact acggtgtttg cacgtaattc acctacaagt 1560 tcccgtttct ctctccctaa tatcccccag ctaaactagt gaccggtttt ctaatatttg 1620 cattctcatt gcggaagaac ctccggctta agcacttcta tagaattctt aagcatcaat 1680 ttggagtgat agaatttcaa gcattaataa gctacttgaa taaatcgaca atcagtcccc 1740 ttttatattc tggaagaaaa atatcacgct aagcataaac tgctctccat ctagaacatc 1800 aaacacacat gcgcattatc ccgcagggat gccttttcgt cactcaccta tttcatctcg 1860 cagcagcaat ctccggaata gaagagaaat gatctaaatc cgatcttggc cgtaacaaac 1920 tctgcctgaa gggcgtcgcg caccagaggc gccgacggcg cggtgacgcg acgcacttac 1980 gacagagaat agtagattaa tgcacaccag actcaagcat tcatacttta tcaaataaac 2040 taatcaatta tacaaatagc gctccgcggc gcggcagaat acgaagtgtc atgaattgtg 2100 gctgggcact acggacagcc gcattgactt ccttattatt tatttccttc ctgtgaaagt 2160 caatggttac aggttttcag ctttcttcaa aatgttgtct ttagcgttta acacgggagt 2220 cactaacctc tgaaactaag agctactttc gggcaccgat taacgtggag ggctaccagt 2280 tggataaaca cttctcaaat aacaaattgc tcaatttact tataattata atttaagttt 2340 ttggttattt tttaataatg aataatattc atccaggtgt agtcactgat catgttatga 2400 ttctctcaaa gttaccaaca atgatttaac agggtaggaa ataccccgta attaatatat 2460 caacgtgcaa cactatttta tttatctttg caaatattta catttttaaa tgattacttc 2520 tatattagag aaagcttact tgtaagtggt gttatggtga gctattttag aacaggcctg 2580 tgagcaacac acgagagcct cgtgagctat ctggtgccca tcggtaccat atttggtgac 2640 tcctggttta acagaaatga aattcataaa ggttcgtaac catcaggtag cgcttaaagc 2700 atgcgacctg cttatgcttc cttccttaag cgctggaata gaagccattc atttattagc 2760 tatccaattt catcctcagt atatcctttt tacacagatg ctgccccctt cgtggggtat 2820 attatcgggg ccgccaatgc gcttctaacg ggcaatatag tcaatacatt ctgaaaaaaa 2880 aaaaaaaaaa aaaaaaaaat aaataaataa aaaaaaaata ctgctgtaca tgttcctagt 2940 tgcataggaa atcgctgact ctcttttgct ttattttcag aaactggtat cagaagcaga 3000 gccgcagctg actccatctc tccttttaag atgataatcc cataaccaca gcatacacaa 3060 tttgcgtcag cattcattta tacattttaa gttaccccag aacacctcat cagtttctca 3120 ctgctcaaca catccacaca ttccgttcaa taacattttt ggtttcataa caccttaaag 3180 tcataggacc attcaggctt cccaccagat tcataatcca gtcactcatt cagaattgga 3240 gccaccacca cagcagcaca caagggctat cgtaacaaca catacaaaca ctaggaaagt 3300 agtctcccga cgccttcaaa tcttatatct gactcagcca cagccatctc agggaagctc 3360 agaggaccct caccagcaga cgcagtaatc ccagcggcca agggcatggg cctagtccag 3420 ggagagaata cgacccaacc ataccagctt cccgagggca gaggaactac ccagtctccc 3480 aagggcgcgg gcatgaccca gccatctagc ttcctttctt ccttccagct gatttgcact 3540 cagccttctc ccccttttca ctcacctaac tccagtagga gttctctccc tgcccaggcc 3600 cccttgtcac ccccgccccc ccccggccgc tgccacagaa gttttcactg ccccccaact 3660 tttctaactt ctgtaggacc ccgccccccc catggctctg gcccccgcag gggcgttacc 3720 ccgagcttcg attctcgcag gaatcatctc actgccctgg ccttagcttt ttatattatg 3780 tcattccata atgacatata gtatagcact atttattttt ctctctgttt gattttattt 3840 atataaattt atatctatat gcacccacgc atataaatat gaatttatat atagtgctgt 3900 caccctcacg ctctgttccc gcaggaacac ccccagagca cctataccgc ccgtcaccct 3960 ccagtaagaa gtcttcacct ccccctcatc tttcccgact cctactggag cagctaaata 4020 gctcacccaa ccccgagctg tgacacgagt catgtcaccg accctccccg gccctagctt 4080 ttatttatgt ttatttttgt ttatttctca cacttatctt ttatttttta aatttattta 4140 tatatatgta tatatatatg cacccacgca tataaatata tatttatata tagtgctgtc 4200 accctcaagc tctaactccc gcggagttaa tcccgagcac cggacccccg caggggtcat 4260 cgcccaactg ccatttcccc cttcagctgg ggcttcaccg accattcctt ttcccgactc 4320 cagctggaga tggcacatac agctctctct cccgcaggag agccaagagc ttcgactctc 4380 gcaagagtca gtaaaaacgc ccccccaagg ccctagatta ccccattata tatttatata 4440 tctatatgtt tcatataaat atataattat atatagtgct gccacctccc agctcaatct 4500 ccgcaaggag tgttcctcga gcaaattact cctttggagt ccccgccccc cctgcccccc 4560 ttcacccccc tctccagccg gagtccttca ctccccttcc cttgtaatga ctccagcagg 4620 attcccgccc accccatggc tctgaccccc gcaggggtct ccccgagtct ctactccagc 4680 aggagtattc acagcccaag ccaactctgc tcgggttccc gcaggaacct gtttcaccct 4740 ttgctccaaa ggagccctct ttaccttttc tttttcaaat aactatatcc agcagccgga 4800 tatagcattt caagcctttt ggggagtttc ttcgaataca cggctgctgt cccgagcttc 4860 atgcatttgg ggagctctcg agaaccacct gatctcgtac tcccctcaca tgctctatgg 4920 acctggcggg agccctgggc tcaactatct ccgagctcag ggttctctcc cgggacagca 4980 tgccaaacct gcttacagct gtcaagcaat atctaagtgt gaactcttga agtgaagttt 5040 ttaaactaat ttcgagagga gcacgtgata taattgacag cagctggcca cctatctaca 5100 ctcattagtt agccaatcag atctatccta acccactata agtagcctag ctagacatta 5160 ctcccttatc ttcgttttcc gaagaaaccc cccatccacc cctttctcct ccttttctcc 5220 tttacaaaaa ggggagctct cgagaaccac ctgatctcgt actcccctca catgctctat 5280 ggacctggcg ggagccctgg gctcaactat ctccgagctc agggttctct cccgggacag 5340 catgccaaac ctgcttacag ctgtcaagca atatctaagt gtgaactctt gaa 5393 // ID DNA-X-6_DR repbase; DNA; ZEB; 1333 BP. XX AC . XX DT 31-OCT-2008 (Rel. 13.1, Created) DT 31-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE A nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1333 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1255-1255 (2008). XX DR [1] (Consensus) XX CC Copies are ~88% identical to the consensus. The TSD is unknown. CC TIR is 63-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1333 BP; 423 A; 230 C; 231 G; 448 T; 1 other; taggttgttt tcaatcacgt ggttctggtg acgcgcgaaa ttcagatgga gggcaggaag 60 taaaaaactg aatgggagac atagaggaaa gtcygtattt ttgcgattta taccatattt 120 caaaggagat ataaatagaa aataaccaca tatgttgggc atgatcctta tatcatgaag 180 agggccgagt tttctactga actaaaatat atttgcctgt catcgaggcg gtagatactg 240 tataagctat tacgttacat gaaaaaatac tatagtaaat actatagtgt ttggaagcat 300 actatagata attacttgaa ttaaactgtc gtagtaatta tattgttgct gtggtacaac 360 acaactgtag taataaacaa attattcaat aggcttcagt tttctacgct ataattatac 420 accatcacaa tgcaccacag tttacactaa cgttacagta tttattacag tttatcagtt 480 tactatgcta cagtattctg tagcattcat taacaaagag ttgtacacat tatacacttt 540 actatgtggt tcaaaaacat gtttattata aattactata gtttttcccc atgcggatat 600 cttccagaca tcacaaaata acagatgaaa gcatacaaaa gcgtggacgc atagtctaca 660 ttattattta tttatttatt tttgtaaggg ggaaaaagtg ctccacaata ataaccgtct 720 agttttgtcg caagggttgc atttcttttt tgttctgtcg atgaacttac catcaacaaa 780 cattcaccgc tgctgcgcat ctcgatgttc acgttacggt tgtttattct gccaaaacgc 840 cagtaaagac ttggattatt gcaaaacaag acggagattt cattgcagcg attacatggc 900 agggtacgtt atttatattc tcctggattt tgtataagtg tggtggtgtt tgtatctgat 960 ttttatataa cacattaaca tatttataca catagctagg cctgtatata atctttattg 1020 gtgctgtcaa acgaattatc gcgttcaaaa aaagtttgta cacatggatg tattgaaaaa 1080 atgtttatta catgccgcct tttcttccgt ttttcagcca gtttcttgaa ctttcccccc 1140 tttatgagca aaaacttttg gaagtctata aaagctgttc ggcatttctt attttggccg 1200 atttaaacaa ttataaacaa cataaaacag aaacattttg atgttctgat agcgattcct 1260 atgtagaaga atcacttcct gccctccatc tgaatttcgc gcgtcatcaa tgacgtcatg 1320 tgaaaacaac cta 1333 // ID L1-6_DR repbase; DNA; ZEB; 5693 BP. XX AC AL627249; XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 04-AUG-2002 (Rel. 7.07, Last updated, Version 1) XX DE L1-6_DR is a non-LTR retrotransposon from the L1 clade. XX KW L1; Non-LTR Retrotransposon; Transposable Element; L1 clade; KW L1-6_DR; endonuclease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5693 RA Kapitonov V.V. and Jurka J.; RT "L1-6_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 26-26 (2002). XX DR Genbank; AL627249; Positions 156757 151065. XX CC L1-6_DR is a family of L1-like non-LTR retrotransposon. This CC family was active recently (no stop codons in ORF2). CC It encodes two proteins: CC the 299-aa L1-6_DR1p (positions 154-1050) and the 1255-aa CC L1-6_DR2p CC (positions 1710-5474, a conceptual translation). CC These proteins are most close to corresponding CC proteins encoded by other L1-like elements. L1-6_DR1p is a CC putative CC RNA/DNA binding protein, and L1-6_DR2p is composed of the CC AP endonuclease and reverse transcriptase domains. XX FH Key Location/Qualifiers FT CDS 154..1050 FT /product="L1-6_DR1p" FT /translation="MSPQKQATGNKRLKTLDDDVASIVSHAIEAQQEALRE FT MVFGMLTEAIKSALAPFNDYINENGTVLRAIKEEIGSLGKNMASITSKVDD FT LQGNLRKTKKDTNHCLSQLDQIQRKCTDLEDRSRRNNIRLVNLPAGMEGDD FT PIGFLQKTLPKWIPDLSGRSNPIEIDRAHRVYSNSNSNRSRAMIFRLLRYP FT DRQAILQGARKAKPTLPGGARLEFYADYSAATAQKRKAFSAVRSKLQQKGA FT ETFLIYPATLKVSFKGQKYSFETPDEAEKYIEGLERTTSAPARRSLYPQLQ FT EAMEETD" FT CDS 1710..5474 FT /product="L1-6_DR2p" FT /translation="MVNLNIMSLNVRGLNSPSKRTKFLDYLRRKNIDVALI FT QETHLRIADVQRLQNKHYIPIVSSGYNSKKRGVAILFKRNHTFSIENLGSD FT NNGRMAYCCTNIGGKKLMFVNIYAPNSYDALFFPDITKTLLAYNEYSLIIG FT GDMNTVMDTVLDRSNPSIPMIQPQSSKALNWFTKDLNVSDTWRLLNPNVKE FT YTYFSPRHKSYSRIDYILLSPMIFSCVKNIEFLPQSISDHNAITTSLQFST FT LTKPSRWRFNTLLLKNEEYISQLRSELTEFISINQNSVSNQDMFWAFIKGF FT LRNNAISFSARLKKNYLQQISSLEKRCGELERELEKKFATETQQKLKVERA FT TLNDLLRQRTEYLMHITKHKYYIDGSKPSRLLALTLKKHEQRFNINAIKNS FT EQVLTSNPSDINKTFKKFFEKLYSSEIQTNTDHLEKIFESLNLTKLSKEDA FT DNLDSEITLDELHNAVRDLHTGKSPGIDGIPGELYLVLWDLLGHIWLQTIN FT KAITKGHFHKDLNTALITVLPKPEKDLFECSNYRPISLINEDIKIYAKVIA FT KRLQSVISKLIHPDQAGFITGRLASDNVRRVLHVIAESGKLKKPSGLLLID FT AEKAFDRLEWGYLWYTLKNFNFGDQFIKMIQTLYSNPLARVTTVGIISDTF FT CIKRGTKQGCPLSPLLFNLSIEPLAQFIRQCKNITPIQLGSSVHSISLYAD FT DILLFLSDLQNSLPNALTALNEFGRISGFKINYSKSVLMPLNPIGHNVTNN FT TNIKISTKVKYLGIELSPTLQLIPKINYMSIYSKVEADIQRWMRLPSSPPA FT RISTIKMNILPRINFISSMLPLPPPPNYWKKLDSILNKFVWNGKRPRVKWS FT TLQKNKTEGGWACPNFKLYHWSFILLSVVKWFEPDMQCSWKEIEKDLIFPT FT RLQDFLFSGFNHRKCILRYGPILSYTLQIFKKVETYFPSRTTWHNHSPLWH FT NVNLLSGRQPFNQPSWANKGILTLKDINGEKSILDFQELVNKFNISPNTLF FT IYFKIRAALKAHKTPWGKDLHTHPIVKWIYNAPRKGTVSYLYCCLSNINAH FT SFSKERLWLSDMSRPNTTIDWETVWHNTFHASTNPNHQFIHFKTIHRAYLT FT TRIRHIIGLSPDPYCKFCEPQRPDTFLHMMWECPEVQKLWDSVLDILFKVT FT KICFPKDPILLLLNDNSQFPLNAKVRKFWLAASTATKKMLVQRWKPPHDLS FT VKHWLHSLLEILYLELSSARINHAKVDILTAWKIYICNVKEILDT" XX SQ Sequence 5693 BP; 1936 A; 1112 C; 947 G; 1698 T; 0 other; cttccggtgg cgcgcgctgc aagatggctg ctcggcactg agctccttac aaatacctcc 60 gattataaca tttttttctt caaattgagg ataaattcct aattaaaata gaacatcaac 120 tctttctaaa gaatggcaac tagtaaaaag acgatgtctc ctcaaaaaca agctacagga 180 aacaaacgac taaaaacgct tgatgacgac gtggcctcca tcgtcagcca tgcaatcgaa 240 gcgcagcagg aagccttgcg tgaaatggtc ttcggcatgt taacagaggc aataaagtcc 300 gctcttgctc ctttcaatga ctacattaat gaaaatggaa ctgttcttcg agctataaaa 360 gaagaaatag gctctctggg taagaacatg gcgtcgataa catcaaaagt agatgatcta 420 caaggaaacc ttcgcaaaac taaaaaagat actaatcatt gcttgtctca gcttgaccaa 480 atacaacgaa aatgcacgga tttggaagat cgctcacgac gcaacaatat ccgtctggtc 540 aacttacctg cgggaatgga aggagacgac cctatcggct tccttcagaa aacgctccct 600 aaatggatcc cggacctgtc agggcgttca aatccaatag agattgacag agctcatcga 660 gtctactcga attctaactc aaacagatca cgagcgatga tctttcgact ccttcgttat 720 ccggacaggc aggccatact tcaaggtgca cgtaaagcaa agcccacact acccggagga 780 gcgcgtctgg aattctatgc agattacagc gccgcgaccg cacagaagag aaaagcgttc 840 tcggccgtca gatccaaact ccaacaaaaa ggtgcggaaa cttttctaat ttacccagca 900 acactgaaag tctcattcaa aggccaaaaa tactcgtttg agacgccgga tgaagccgag 960 aaatacatcg agggactgga gcgaacaacg tcagcgccgg ctcgcaggtc tttatatcct 1020 cagctccagg aggcaatgga ggaaaccgat taaggagaca atattggact tatcaacact 1080 taaatgacat gattagaaac atttcttagc cccctaaatg gccaaattat acggtaaaca 1140 actattacta ttaatattat tattttgttt tattggtatt taaagttaag gttagatatt 1200 catgatacta attttattta tttaatacta tttattttat tattatatat atttttacta 1260 ttatttgtta tgttatttat ttattttagt tatttatttt attatttttt atttatttat 1320 ttatttttat tactttattt atttatttat tttttttttt gtcgacaagg tgggggctaa 1380 gtttcttggt ttaagattgg aacagcccga attcaaaact tgttacatgt ttaaatagag 1440 ttgatgctga tacactgatg gttgtaacgt tcctcagctg tttaatatca aggaggtact 1500 ggaaggcgaa tagtgttaag acattgcctc gcgatcgcca cagacgctcg ttattgaggg 1560 ggttagggaa agggtggggc actcaggaat aaaatacgca gaaattttta tattcttttc 1620 atatatatcc acgaaatttc tatgtttaaa ttgcacctac tactacaatc tttacatgtt 1680 tatttgtgaa agctttatgt tcagagatta tggttaattt gaatattatg tctcttaacg 1740 tcagagggtt aaactctcct agtaaacgga ctaaattttt agattatctt cgcaggaaaa 1800 acatagacgt tgcacttatc caagaaaccc atctaaggat agcagatgtc cagcgattac 1860 aaaacaaaca ctatattcct atagtatctt caggttataa ctctaaaaag agaggggtgg 1920 caatattatt taaaagaaat catactttca gtattgaaaa tttaggatca gacaataatg 1980 gtagaatggc atattgttgt actaatatag gaggaaagaa gctaatgttt gtcaatattt 2040 atgccccaaa ctcttatgat gctctttttt tcccagacat tacaaaaact cttttagcct 2100 ataatgaata ttcacttatt attggaggag acatgaacac ggtaatggac accgttttag 2160 atcgatcaaa tccttcaatc ccaatgatac aaccccagtc ttcaaaagca ttaaattggt 2220 ttactaaaga cctaaacgtg agtgacacct ggaggttgct caaccctaat gtgaaagaat 2280 acacttactt ctcaccaaga cataaatcct attcaagaat agactacatc ttactatctc 2340 ctatgatatt ttcatgtgta aagaacattg aatttctacc acaaagtatt tctgaccaca 2400 atgcaataac aacatcgctg caatttagta cactgaccaa accctctaga tggagattca 2460 atactctttt actcaaaaac gaggaatata tttcacaact gagatcagaa ttaactgaat 2520 ttattagcat taatcaaaat tctgtttcca accaagacat gttttgggcc tttattaaag 2580 gttttttaag aaataatgca attagttttt ctgcaaggct taaaaagaat tacctccaac 2640 aaatatcatc attagaaaaa aggtgtggag aattggagcg ggaattagaa aagaaatttg 2700 caacagaaac acaacaaaaa ttaaaggtag aaagagctac acttaatgac ctgctaagac 2760 aaagaaccga atacttgatg cacatcacaa aacataaata ctacatagat ggaagcaaac 2820 cgagccgact ccttgcactc actttaaaaa aacatgagca acgctttaat ataaatgcaa 2880 taaaaaactc tgaacaggtt ctcacctcaa acccctcaga tattaacaaa acatttaaga 2940 agttttttga aaaattatac tcttcagaaa tacaaactaa cacggatcat ttagaaaaaa 3000 tttttgagtc tttaaactta acaaaacttt ctaaggaaga tgctgacaac ctagattcag 3060 aaataactct agatgaatta cataatgcag ttagggattt acatacaggg aaatctccag 3120 gtattgacgg catccctggc gaattatatc tagtactatg ggacttactg ggacacattt 3180 ggctacaaac aataaacaag gccataacca aaggccattt tcataaagat cttaatactg 3240 ctttaattac tgttttacct aaacccgaaa aagatctttt tgaatgttcc aactatagac 3300 ctatttctct gattaatgaa gatataaaaa tttatgctaa agttatcgcc aagcgtctac 3360 aatcagtaat aagtaaatta attcatccag accaagctgg ttttataaca ggacgtttag 3420 cttcagataa tgtccgcaga gtactacacg ttatagctga atcaggaaag ttgaaaaaac 3480 ctagtggcct cctccttata gatgcggaaa aagcctttga tcgcttggaa tggggctatt 3540 tatggtacac actgaaaaac tttaattttg gcgatcaatt cattaagatg attcaaactc 3600 tatactctaa ccccttggct agagtaacta cagtgggaat tatttcagac actttttgta 3660 ttaagagggg aacaaaacag ggttgccctc tctctcctct tttgtttaat ctatcaatag 3720 aaccactggc tcaatttatt agacaatgta aaaacattac cccaattcag ttaggctcat 3780 ctgtccattc tatctcttta tatgctgatg acatacttct atttctatca gatttacaaa 3840 attcactacc caatgccctc acagctttaa atgaatttgg aagaatttct ggtttcaaaa 3900 taaactactc taaatcagtt ctgatgcccc ttaatccaat tggccataat gttactaata 3960 atactaatat aaaaatatca acaaaagtca aatatttagg aattgagctg agcccaacct 4020 tacaactcat acctaaaata aactatatgt caatttacag taaggttgag gctgatatcc 4080 agagatggat gcgcctaccc tcatcaccac ccgccagaat ttcaacaata aaaatgaata 4140 tcttgcccag aattaatttc ataagctcca tgcttcccct ccctccacct ccaaactatt 4200 ggaaaaagtt agactcaatt ttaaacaaat ttgtttggaa tggaaaacgt ccccgtgtaa 4260 aatggtccac tctccaaaaa aacaaaacag agggtggatg ggcctgccct aacttcaaat 4320 tataccactg gtcatttatt ctgctctcag ttgtgaaatg gttcgaacct gatatgcaat 4380 gttcttggaa agaaatagaa aaagacctta tttttccaac acgtctgcaa gactttctgt 4440 tctcaggctt taaccatagg aaatgtatat tgcgctatgg acctatttta tcctataccc 4500 tccaaatttt taaaaaggta gaaacttatt ttccttccag aaccacctgg cataatcatt 4560 ctcctctgtg gcacaatgtg aacctacttt ctggtagaca accttttaat caaccatcat 4620 gggccaataa aggtattcta acactaaaag acataaatgg agaaaaatct attttagatt 4680 ttcaagagct ggttaacaaa tttaatatat ctcccaacac tctttttata tattttaaaa 4740 taagggcagc actaaaggct cataagacac catggggtaa ggatcttcat acccacccaa 4800 ttgtgaaatg gatatataat gctccaagaa aaggaactgt ttcctatctc tactgctgtc 4860 tttctaatat taatgcacat agtttctcaa aagaaagact atggctatct gatatgtctc 4920 gacccaatac aactatcgat tgggaaactg tatggcataa tacattccac gcttccacaa 4980 accctaatca ccagtttatt cattttaaaa ctatccacag ggcatattta accacacgta 5040 tacgacatat cataggttta tctccagatc catattgtaa gttctgtgaa ccacagcgtc 5100 cagatacctt tttacatatg atgtgggaat gtccggaggt tcaaaaactt tgggattcgg 5160 tactggacat tttatttaaa gttacaaaaa tttgtttccc caaagaccct atactgttac 5220 tattgaatga taactctcaa tttcccttaa atgcaaaggt ccgtaaattt tggttagctg 5280 cttcaactgc tacaaagaaa atgttggtac aaaggtggaa acctcctcat gatctctcag 5340 ttaagcactg gctacattca ttattggaaa tactatatct ggaactgtca tcagcacgga 5400 tcaaccatgc caaagtggac attcttacgg catggaaaat ctacatatgc aacgtcaaag 5460 aaatcttgga cacttaaccc tcatctgatg ctacggatga caacttaatg gaaattatct 5520 aacagattgg tctaaactat agttattctg tgtttcctga tgtgcgtata atggggtggg 5580 ggggtgggaa atatatatgg ggttatatct gttaaaagta ttttgttatc tgattgtaaa 5640 aattgtatat tctcactatc ggaaataaaa aaaactgaat tacaaaaaaa aaa 5693 // ID Gypsy92-LTR_DR repbase; DNA; ZEB; 160 BP. XX AC chr18; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy92-I_DR; KW Gypsy92-LTR_DR; Gypsy92_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-160 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1496-1496 (2008). XX DR Genome; chr18; Positions 35957464 35957305. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 160 BP; 43 A; 19 C; 41 G; 57 T; 0 other; tgttgtatat tgagttagta acaaattgat tgttcagtta cgacgcactc tagtggtgtg 60 tgtgtgtatg tggttgttgc aatagtgaag taaagacgtg ttcagagcag ctccggtgtg 120 attgtcttaa ttactgttat gtaagtgtac aaacataaca 160 // ID Gypsy132-I_DR repbase; DNA; ZEB; 6523 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy132-I_DR; KW Gypsy132-LTR_DR; Gypsy132_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6523 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1525-1525 (2008). XX DR Genome; chr3; Positions 44236434 44242956. XX CC Positions [2895-3434] - Reverse transcriptase CC Positions [4878-5354] - Integrase core CC 'TAAG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 298..1668 FT /product="Gypsy132-I_DR_1p" FT /translation="MAYGFKADSFKGEENMSDLTFGRGRLFANSSTPVAGR FT GSVGRTSEFCSTRVTEQTPSQLRSNPFLASPDLNDTAWQNLITHIAQEVGQ FT TLISVQGGGRYGEGETNNAQTQSPVAGQSFTETPSLNLTGVKLVMQSEVRE FT PPVFRGDGTDKFTVHEWEDLLDTYLRKRSIPASEHYHEILSRLLGKAKDIV FT KITLRSNTSLKPSENPRVIFDILKQHFSEVRFSSMPLADFYSTVSVIGENP FT VEYWVRLNKAVDAEEALKRLGRQMDNPCQEAAMMFVKYCPDPSLSAVFRLK FT AADKWTASEVQEHLDRYQVEQKEQLAIKSKRNVTMRNVAVHRQSLVENEMM FT ASGSPVVASEHSNLSISSSTQNDDNCMKTLITLLDRALSQNSQAEMSSPRQ FT SQPPTRACKVCRSYEHSTLSHCRRQRLCLACFQPGHIKRNCVNSQHGSGSQ FT ITQQSQGSPTLK" FT CDS join(1800..2615,2619..6257) FT /product="Gypsy132-I_DR_2p" FT /translation="MQTLEVFDELFYAPVYVNDRFQIKGMLDSGSMACTFS FT EAVESRMLCEDVLPKPSPLSQEIMLIGCGGKITKPKCMNEVELKVYGERCL FT VPVLVVPGQRDDLIIGTNVIKFLIHQMKISDDYWRILSDSVSGPPTSCEQF FT LDVMANTCRWRGGIPPHKVGTVKLRQSVTLMAKQEHLVWGKLPRDTAISPG FT STVIVEATSSRTMPRDIIIGRVVTPLWGDHWVPLKVTNLSDKPITLKRNCK FT LADVYPCLAVEDFELFQGFNGVQLPTPDKDPPVTAGDLKKRLETVSLSDVD FT IDSCSASDTAKEKLVQLLETYHDVFSKHALDCGEVRGFVHRIHLTDDRPFR FT LPYRRVPPAHYQKLRQVLSEMEEQGIIRKSVSEYASPLVLVWKRDGSLRIC FT TDFRWLNARTIKDAHPLPHQSDCLVALGGNTLFSTMDLTSGFYNIPMAEED FT KKYTAFATPVGLHEYNRMPQGLCNSPASFMRMMLSIFGDLNFSSLLCYLDD FT LLVFAPDETTALERLEMVFQRLRSYNLKLSPKKCHLMQASVKFLGHIVDGN FT GVSVDPSKVEVIAKLSKTDLMEEDGCTPSVRRIKSVLGMILYYQHFIPNCS FT SVARPLFSLTAGQKRRTKLKTEAKAGTYRKLKPVDWTPECDDALSKLKESL FT LQSVVLAHPNFSLPLILSIDASLEGLGAVLSQIPAGEEKARPIAFASKSLT FT NSQRRYPVHKLEFLALKWSVCEKFSHWLKGHTFTVWTDNNPLTYILTKAKL FT DACEQRWVSKLAAYTFDLKHIAGSKNTVADALSRDPFARTIARRLITEQYG FT SLLSEAETVDKDGVQDTFRLQANNLYVSTLTSPVSCDQVTVKTLLNLSKQW FT KAVTELRAMQAIQYIQDATTTGFDPVSAIPIEEIRRGQEADPSISKIMPYL FT EKQRRPSRREVVAMDAVTQILLRQLDRLKVQDGVVYRVCKDPSSTHKRFQL FT VMPSSLRARALTGVHDLAGHQGRSRTLQLARQRFFWPRMERDIREYVKCCQ FT RCILAKTPEPSARAPLENIRTSAPMELVCLDFWSAEDSKQRSVDVLVVTDH FT FTKLAHAFPCVNQKAKQIAKKLWDHVFCIYGFPSRIHTDQGTNFESTLIAE FT LLKLAGVEKSHTTAYHPMGNGGTERFNRTLGNMLRSLPLKEKQKWPEQIQT FT LTFAYNSTVHETTGYAPFQLMFGRIPKLPVDVVFAQALHDPVIVDYGSYAK FT TLMSYLHEAACIAQKHTEKEQKKQAKNYNRKVKGMCLNVGDRVLLANKSER FT GKKKLADKWDPTVYTVRDRNLQTNIYKLVDNNGKCKVVHRNLILDISFLPI FT EQSQDEEDTQPTAENEETESHAPELSSLEEEENFDGTEGWILDGQDESERQ FT KPEEDDDIGSSCSNLSEGSFCEQDLNATEVNEGSPDPVSDLETASFAKNSE FT PRPHTLNDPDDNTISNTPTHCPTDTQGVKTRAGRMVKKVSRLIESMAQRPF FT YVQRVGSLLTRRSGSVLSLF" XX SQ Sequence 6523 BP; 1928 A; 1344 C; 1520 G; 1731 T; 0 other; gtggtgccgt gacccggatg aagatcagct cttctggaat cttcatccaa agatcaatgt 60 cgtttggtcg ccggtggatg ctgcacttca atcaaggttt tgatgaactg tgatagagag 120 cattgcactt tcatcaaact ggacacttca ctatatcctt tctttatttc tatttgaatt 180 taatatacct gttttcctgt ttaacatata tattttttgg tagttgttct ccgatgctgg 240 tttatctatt ctaaaatttt attcaatgtt tgatcattta catgccaata agtagtaatg 300 gcatatggtt tcaaagcaga cagttttaag ggtgaggaga acatgtcaga tttaactttc 360 ggaaggggaa gactttttgc taattcatct acgcctgttg cagggagggg tagtgtggga 420 cgaacttctg aattttgttc tacccgggta actgaacaaa cacctagtca gttaaggagt 480 aacccatttt tggctagccc agatttaaat gatactgcat ggcaaaatct cataacacac 540 attgctcaag aagtaggaca gactttgatt tctgttcaag gtgggggaag atatggggaa 600 ggagaaacta ataatgcaca gacacagagt ccagttgctg gtcagtcatt cacagaaaca 660 ccctcactta atttgacagg agtcaaatta gtcatgcaat ctgaagtaag agagcctcct 720 gtgttccgag gtgatggaac agataagttc actgtgcatg agtgggagga tttactggac 780 acatacctgc gtaaaagaag cataccagct tccgagcatt atcatgaaat cctttccagg 840 ctgctgggta aggccaaaga cattgtaaaa ataaccctac gcagcaacac atctctaaaa 900 ccttctgaga accccagagt gatattcgat atcctaaagc aacatttcag tgaagtcaga 960 ttctcctcta tgcctttagc agatttctac agcacagttt ccgttattgg agagaaccca 1020 gttgagtatt gggtccggtt gaataaggct gtggatgctg aagaggcttt gaaaagactt 1080 gggcgacaga tggataatcc ttgtcaggag gcagccatga tgtttgtcaa atactgtccg 1140 gaccccagct tgtcagccgt tttccgtttg aaagcagcag acaagtggac agcgagtgag 1200 gtacaagagc atctagacag atatcaggtt gagcaaaaag aacagttggc aattaaatct 1260 aagcgcaatg ttacaatgag gaatgttgca gttcaccgac agtcactcgt ggagaatgag 1320 atgatggcta gtggctcccc agtcgtggca tctgaacatt ctaacttgag tatttcatct 1380 tctacacaga atgatgataa ctgtatgaaa acactcatca ccttacttga ccgtgcgctc 1440 tcccagaatt ctcaagctga aatgagtagt cctcgacaga gccaaccgcc tacgagagct 1500 tgtaaggtgt gtcgttcata tgaacattca actctctcac actgccgacg tcaacgactc 1560 tgtcttgctt gcttccaacc tgggcacata aaaagaaatt gcgtaaatag tcagcatggc 1620 tctggcagcc agataacaca gcaatcgcaa ggttcaccga ctttaaaata gtcgacccac 1680 acctggagag gggacgtgtg ggtagggtag gacaaaccct cgaagatgta gatcttaaac 1740 aagtttatga gaaggcatgt gcgcaggctc cggctgaagt caaagtgatc gcacagaata 1800 tgcaaacttt agaagtgttt gatgaattgt tttatgcacc tgtctatgtg aatgatagat 1860 tccagataaa aggaatgttg gattcaggat ctatggcatg cacttttagt gaagcggtag 1920 aaagtcggat gctctgtgaa gatgtgttgc caaaaccctc acctttgtcc caggaaataa 1980 tgctaattgg atgcggaggt aaaataacca aaccaaaatg catgaatgaa gtcgagctaa 2040 aggtgtatgg tgaaagatgc ttggtgcctg ttctagttgt accagggcaa cgagacgatt 2100 tgataattgg taccaacgtt attaagtttc tcatacacca aatgaagata agtgatgatt 2160 actggaggat tttgtctgac agtgtctcag ggccacctac gtcatgcgaa caattccttg 2220 atgttatggc aaatacttgc cgctggagag gcggtattcc gccacacaag gttgggactg 2280 tcaagcttcg ccagtcagtt actctaatgg ccaaacaaga acaccttgtc tggggaaagt 2340 taccaagaga cactgctata tcgcctggaa gcactgtaat agtggaagcc acttcttcta 2400 gaaccatgcc acgggacatt atcataggtc gtgttgtcac acccttgtgg ggagatcatt 2460 gggttcccct gaaggtcact aacctgtcag ataagccaat caccctcaaa aggaattgca 2520 agttggctga tgtgtatccg tgccttgccg tggaggattt tgagctgttt caaggtttca 2580 atggagtcca gttgcccaca cctgataaag atccataacc tgttactgca ggggacctga 2640 aaaagagact ggagactgtt agtctcagtg atgttgacat agactcttgt tctgccagtg 2700 atacggcaaa ggaaaagcta gtccaattgc tagagactta ccatgatgta ttctccaaac 2760 atgccttaga ttgtggggag gtcagaggct ttgttcaccg catccactta actgacgatc 2820 gcccatttcg acttccatac agacgagtac cacccgctca ctaccaaaaa ttacgtcaag 2880 ttctctcaga gatggaagag caaggaatca tacgtaaatc agtaagtgaa tatgcctcac 2940 ctttggtctt agtatggaag agagatggca gcttaaggat ctgtacggat ttcagatggc 3000 taaatgctag gactattaag gatgcgcacc cactcccgca ccaatcagac tgtcttgttg 3060 ccctcggtgg taataccctc tttagcacga tggatttaac atcgggcttt tataacatcc 3120 caatggctga agaggataaa aagtacaccg catttgcaac acctgtaggt cttcatgagt 3180 acaacaggat gcctcaaggg ttgtgtaaca gcccagcatc tttcatgcgt atgatgttaa 3240 gtatctttgg ggacctgaat tttagtagtt tgctgtgcta cttagatgac ttgctggtgt 3300 ttgcacccga cgagaccaca gccttggagc ggttagagat ggtattccaa cgattgagaa 3360 gctacaacct taaattaagt ccgaaaaagt gccatctgat gcaagcgtct gttaaattct 3420 taggccacat cgttgatgga aatggtgttt ccgttgatcc ttcgaaggtt gaagtcatag 3480 ccaagctctc aaaaactgac ctcatggagg aggacgggtg tactccgtct gtacgtcgaa 3540 taaagtctgt cttgggaatg atcctctatt accaacattt tattccaaat tgctcttcag 3600 ttgccagacc attgttctcc ctaacagctg ggcagaaaag aagaacaaaa ttaaaaactg 3660 aggctaaggc aggcacatat cgaaaactga agccagttga ttggacccca gaatgtgatg 3720 atgctctctc taagttgaaa gagagtttgt tgcagagcgt ggttttagcg catcccaatt 3780 tctcacttcc cctaatcctg tcaattgatg cttctttgga aggacttgga gctgtgctgt 3840 cacagatacc agctggtgaa gaaaaagcac gtcccattgc attcgcaagc aagtccttaa 3900 ccaactctca aaggagatat ccagtgcaca agctcgagtt tttagcccta aagtggagtg 3960 tgtgtgaaaa attcagtcac tggttaaagg gccacacatt cacagtgtgg accgacaata 4020 accccctcac ttatatatta acaaaggcta aactcgatgc ttgcgaacag cgctgggtat 4080 ctaagcttgc tgcttatact tttgatctca agcatattgc tggaagtaaa aatactgtag 4140 ctgacgcttt aagccgggat ccgtttgcgc gaaccattgc tcgtaggctc ataaccgagc 4200 aatatgggag tctgctttct gaggctgaaa ctgtcgacaa agatggagtt caggacacct 4260 tccgtttgca agcaaacaat ctatatgtgt caactctgac aagcccagtt tcctgtgatc 4320 aagtgacagt taaaactctt ttgaacctat cgaagcagtg gaaagctgta actgagctaa 4380 gggctatgca agcaatccaa tatatccaag atgcaacgac tactggattt gatccagttt 4440 cagcgatccc tatcgaagag attcgtcggg gtcaagaagc agatccctcg atttctaaga 4500 taatgcccta tctggagaag caaagaaggc catcaagacg agaggtggtt gcaatggatg 4560 ctgtgactca aattcttctt aggcaactag atcgtttgaa ggttcaagat ggagtagttt 4620 accgtgtttg caaggaccca tcaagcacac acaagaggtt ccaacttgta atgcctagta 4680 gccttagggc tagagcctta actggtgtgc atgatcttgc tggacaccag ggaaggtcaa 4740 gaactcttca actagctcga caacgctttt tctggcctcg gatggagcgt gacataagag 4800 agtatgttaa atgttgtcag aggtgcatat tggccaaaac accagagcca tctgctcgag 4860 ccccacttga gaacatcagg acttcagccc ctatggagtt agtgtgtttg gatttttgga 4920 gtgcggagga cagcaagcaa cgctctgtgg atgtattagt cgtgacggat cactttacaa 4980 agttggccca tgctttccca tgtgtaaacc agaaagcaaa acaaattgca aaaaaactgt 5040 gggatcatgt cttctgtatt tatggttttc catctcgcat tcatacagat caagggacca 5100 actttgaaag taccctcatc gcagagttgc ttaagctggc aggtgttgag aagtcgcaca 5160 ctacggcata ccaccccatg gggaacggag gaacagagcg attcaatcgg accttgggca 5220 atatgcttcg ctccttgcct cttaaggaaa aacagaagtg gcccgaacaa atacagactt 5280 tgacatttgc gtataactcc accgtgcatg agaccacggg ttatgcgcca ttccagttaa 5340 tgtttggtcg cattcccaag ctgccagttg atgttgtatt tgcgcaagca ttacatgatc 5400 ctgtgattgt tgattatggc agttacgcaa aaacacttat gtcttacctt catgaagctg 5460 catgcattgc gcagaaacac acagagaagg agcagaaaaa gcaggctaag aattacaatc 5520 ggaaggtcaa gggcatgtgt ctgaacgttg gagatagggt gcttcttgca aacaagagtg 5580 aacgaggaaa gaaaaaactt gcagataagt gggatccaac tgtttatact gtaagagaca 5640 gaaatcttca aacaaacata tacaagctgg ttgacaacaa tggaaaatgc aaagtggtgc 5700 atcgcaacct gatactggat attagcttct taccaattga acaatctcaa gatgaagaag 5760 atactcagcc tactgctgag aatgaggaaa cagagtctca tgcaccagag ttatccagtt 5820 tggaagagga agagaacttt gatggaacgg aaggatggat acttgatgga caggatgaga 5880 gtgaaagaca aaagcctgaa gaagatgatg acataggatc cagttgttct aatctaagtg 5940 aaggcagttt ctgtgaacaa gaccttaatg caactgaagt caatgagggt agtcctgatc 6000 cggtgtccga tttagaaact gcgtccttcg ccaaaaactc tgaaccacga ccacatacat 6060 taaatgatcc agatgataac accatatcta acacaccaac acattgtccc actgacacac 6120 agggtgttaa aacccgtgca ggtagaatgg taaaaaaagt tagccgcttg atagaatcta 6180 tggcgcaaag gccattttat gttcagaggg ttggctcact tttaacaagg agatcaggat 6240 cagttctttc cttattttag atcagttttg tgatgatgca aaaataacaa ttaagtatgt 6300 gcgccgtaag aatttcagtt aaaatcttac cagattattg tatgttgtac aaggcagcat 6360 attgttctaa gaatgtctag gacttgatca accttgggct tttcttaacc cgttaagtgg 6420 gtaacttgca atgttgaact gcgaataaga tttgctaaaa atctggttat gatagatcga 6480 tgtcatctcg gtcaagatat tagtgaaatt caggaggggt gaa 6523 // ID CR1-42_DR repbase; DNA; ZEB; 4281 BP. XX AC . XX DT 16-MAR-2009 (Rel. 14.03, Created) DT 16-MAR-2009 (Rel. 14.03, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-42_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4281 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(3), 684-684 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS join(157..1266,1230..2351,2330..4117) FT /product="CR1-42_DR_1p" FT /translation="ENAGRLCGTGHCRSVCEDGGRNLKVLCLYKQAVVVTA FT LLVYSLFFNLLCPVDFNLFAEHSANYNAQSKKLVNIADRLIKQIHNISGNF FT GLGKHIVAVTFIFPSLHNRVKSQTKSSTRLYQNEALGLIIMICLLLSGDVH FT PCPGPHYAKPENFSDFISNPISSQWHAEGPRALQFGYVPGFGDGVLPVCNL FT EMGVEVVEARRLGTAGMDRRYGYSSHQESVSDPAILQVRTLGGGSLQSGYA FT HHTNLRRSVSMDACGGSVAEWETGSPVTKAEVPLRRRVRVESLDAGLGSGS FT TSGSVGDFWKSLRISMPLWNKGNSVPNELPNSFTSVNAGLVIPQVFTKPMQ FT HVKTRTINPAIQKNKKKVFFRQLIIQEEKSFFQTVNHSRIIWDPKVKPRGL FT LGGHLNVRSIISKSDEIHHLLLDSNLDFLCLSETWLHKNSPSAALEVPGFK FT IFRKDRVGSKGGGVMIYVKSGIQCNEIVWPNDTDLECIGLNLILSPQMSFT FT LMVIYRPPSSNVSFYENLKQVLKQCNFSKEVIIMGDFNINWDDKIVRKHLK FT QITDHFDLQQMINGPTRLTNSTRSQIDLIFSNRADRILKTFNMLTGISDHN FT LILVARKLSSKRLNSFVRKHDSYGVPKHKLDDFRRAVLQIEWDELLTGTDQ FT DEDSQIFTKKLERTMTDFSCTLKPKNKKHTVPWMNSDILKIIKKRDLALKI FT AHRSKLTHDRQHFTMLRNKVVKCMRKAKADFFLRIIEEQEVTPNRARGNSK FT LIWNQLNYLKGDKRKDRSSIELMVNGEMINKPADIAEVFNNYFVDSVVSIT FT QCFSPNYSEVHQVNTVEQAFSLGLVSESDVNRAITSLKSSRAKDIFGMDVI FT LLKNISESLVKPITKIINLSISQGMFPSAWKSSVIVPIFKSGDPHSSSNYR FT PISILPIVSKIAEKLIATQITNYLNNSSFALHPLQFGFRANYSSEMANCYF FT TEKVKSLLDKGGIVGAVFLDFRKAFDTVNHRILMSKLVGFNFSPQTLKWIE FT SYLSGRTQCVNIQNYKSAPLGISTGVPQGSILGPLLFTLYINDLPSMCPNN FT NIQMYADDTVIFIHGSSVLQVADRLTESLVHLSAWLDQCCLQLNISKTVCM FT FFSKNGRLSVVPDVFVSGERLQVVSEYKYLGVLIDSKLSFQSQVKKVCNRV FT RFSLANFRFIRDQMSCEAALIYYHSMIISHITYCLTTWSQASITTLKPLKS FT LYKQSIKTLDKKSVQYHHCLILKKYNLLSWENLITYVHICLMFKTLHNPTS FT PLNQFINIRTTAHRTTRGGDRGDCIIPLKKSKFSQSVFSVRAAREWNSIPP FT TIRRSCTFRSFQSHVRNWLLTNQHCQH*" XX SQ Sequence 4281 BP; 1342 A; 741 C; 818 G; 1380 T; 0 other; ttgttaacat ttatacatat atcaagctta aataattaac atcaagtcaa tttaattggc 60 tggtcttaaa accactttgt gtgaattcag ctggggcagc aaatatgcaa accaaattac 120 ctttttttgg acagaaaata tgaaattact tactgagaaa atgccggacg gctctgtgga 180 actggtcact gccgtagtgt atgtgaggac gggggcagaa acttgaaggt cctgtgtctg 240 tacaaacaag cagtggtagt cacagctctc ttagtttaca gtttgttttt taacctttta 300 tgtcctgttg attttaattt atttgcagag cattctgcta actacaatgc acaatcaaaa 360 aaactcgtta atattgctga taggttaatt aaacaaatac acaacatctc tggaaatttt 420 ggactcggga agcatattgt agcagtaact tttatttttc catctttgca caatcgagta 480 aagtcacaaa caaagtcatc aacccgactt tatcaaaatg aagctttagg gttgattatc 540 atgatatgtc tgttgttatc gggagacgtt catccctgcc ctggtcctca ctacgctaaa 600 ccagagaatt tttcggattt tatatcaaat cccatatcct cccaatggca tgctgagggt 660 ccaagagcgc tgcagtttgg ttacgtgccg gggtttggtg acggtgtttt gcccgtgtgt 720 aatctggaga tgggtgtgga ggtggtggag gcccggcggc tgggcactgc cggcatggat 780 cgacgttatg gatattccag ccatcaggag tccgtgtcgg atccagctat cctacaggta 840 cgcactttgg gtggcggttc gttacaatct ggctatgctc atcacactaa cctgcgtcgt 900 tctgtctcga tggatgcttg tggggggagt gtcgcggagt gggagacggg gtcgccggtg 960 actaaagctg aggtcccact gcgacggcgt gtccgggtgg aatcgttgga cgcgggcctg 1020 ggttccggat cgacaagtgg aagtgttggg gacttttgga aaagtttgcg catctcaatg 1080 ccactgtgga ataaagggaa ctctgtgcca aatgagttac caaatagttt tacttctgta 1140 aacgctggtt tagttattcc acaagtgttt acaaagccta tgcaacatgt caaaacgaga 1200 actattaatc cagccattca aaaaaataag aaaaaagttt ttttcagaca gttaatcatt 1260 caagaataat ttgggatcct aaagtaaagc caagaggtct cttaggtggt catttgaacg 1320 tacgaagtat catttcaaaa agtgatgaaa ttcatcatct tttactagat tctaacttag 1380 attttttatg cttatctgaa acatggcttc ataaaaactc accgtctgca gcattagaag 1440 tgcctggttt taaaatattt agaaaagaca gagttggttc aaagggaggt ggagttatga 1500 tttatgtgaa atctggtatc cagtgtaatg aaattgtttg gccaaatgac acagatctgg 1560 aatgtattgg tctgaatctt atactatcgc cacaaatgtc tttcacacta atggtaattt 1620 atcgtccacc atcttcaaat gttagttttt atgaaaacct aaaacaagta ttaaaacagt 1680 gtaattttag taaagaagtc attattatgg gagattttaa tattaattgg gatgataaaa 1740 tagttagaaa gcatctcaaa caaataactg atcattttga tcttcagcaa atgattaatg 1800 ggcctacaag attaaccaat tctacaagat cacaaattga tttaattttt agtaacagag 1860 cagacagaat tttaaaaaca tttaatatgt taacaggcat atctgatcac aatttaattt 1920 tagtcgccag aaaattatct agtaagagat taaattcttt tgttaggaaa catgattctt 1980 atggtgttcc aaaacataaa ctagatgatt ttagaagggc cgtccttcag attgaatggg 2040 atgaattact aactggaact gatcaggatg aggatagtca aattttcaca aaaaaacttg 2100 aaagaacaat gacagatttt agctgtactt taaagcctaa aaacaaaaaa catacagtcc 2160 cctggatgaa ctctgatatt ttaaaaataa taaaaaagcg ggatcttgct ttaaaaattg 2220 cacatcgatc aaagctgact catgatagac aacatttcac tatgttacga aacaaggtgg 2280 taaagtgtat gagaaaagca aaggcagatt ttttcctgag aataattgaa gagcaagagg 2340 taactccaaa ttaatatgga accaattgaa ctacttaaaa ggagataaaa gaaaagatag 2400 gtcatcaatt gaattaatgg taaatgggga aatgattaat aagccagctg atatagctga 2460 agtatttaat aattattttg tggactctgt agtttcaatc acacagtgtt tttctcccaa 2520 ttattcagaa gttcatcaag taaatacagt agaacaagct ttttctctag ggttagtttc 2580 cgaatcagac gtcaacagag caattacatc tcttaaatca tccagagcaa aagatatatt 2640 tgggatggat gtgatacttt taaaaaacat tagtgagtct ttagtaaagc cgatcaccaa 2700 aattattaat ctgtccattt ctcaggggat gttccccagt gcttggaaat catctgtcat 2760 tgtgcctata tttaaaagtg gcgatccaca ctcttcaagt aattacagac ctatcagtat 2820 tttacctata gtttcaaaga ttgcagaaaa actgatagcc acacaaatta caaattactt 2880 aaataatagt tcttttgctt tacatccatt gcagtttggc ttcagagcga actactcaag 2940 tgagatggca aattgctact ttactgaaaa agtaaagtct ttattagata aggggggtat 3000 cgttggggct gtgtttcttg attttaggaa ggcttttgat actgtaaatc atagaattct 3060 catgtccaaa ttagttggtt ttaacttctc accacaaact ttaaaatgga tagaatctta 3120 cttgtctggt cgcacacagt gtgttaatat acaaaattac aaatctgccc ctctaggtat 3180 atccacaggt gttccacaag gatctatttt gggcccttta ctttttactt tatatatcaa 3240 tgatctccca tctatgtgtc ctaacaataa catacaaatg tatgcggatg acactgttat 3300 atttattcat ggtagtagtg tgttacaagt tgcagacaga ctcacagaat cactggtaca 3360 tctttcagcc tggttagatc aatgctgttt acagttgaac atctctaaaa ccgtgtgtat 3420 gtttttttct aagaatggta ggctaagtgt agtaccagat gtatttgtat caggggaaag 3480 attacaagtt gtatctgagt acaagtacct tggagtttta atagattcaa aactatcatt 3540 tcagtctcag gttaaaaagg tttgtaaccg ggtcagattc agtttagcaa attttagatt 3600 tattcgtgat cagatgtcat gtgaggctgc tctaatctat tatcactcca tgatcatttc 3660 tcacatcact tattgtttaa ccacttggtc ccaagccagc atcactactc tgaagccact 3720 taaatcttta tacaaacaat ccataaaaac ccttgataag aaatcagttc aatatcacca 3780 ttgtttaata ttaaagaaat acaacctgct tagttgggaa aatttaatta catatgtaca 3840 tatctgttta atgtttaaga ctttacataa cccgacttca ccattaaatc agtttataaa 3900 tattagaaca actgctcacc gaactactag aggaggtgac agaggagact gcattattcc 3960 tctcaaaaaa agcaaattca gtcaatctgt attttcagtg agagctgcta gggaatggaa 4020 ctccatccca cccacaatta gaagatcatg cacttttcgt tcttttcagt ctcatgttag 4080 aaattggtta ttgacaaatc agcattgtca gcattagtac tgtattcttg acacatttgt 4140 actgttgtct ttggcttttt gtcctctgct cccatcagtc tgttcttttt tttctttttt 4200 tcttttcttt acttttttaa ctgtatttct tatgaaagtg tatttaatgt ttaatgttgt 4260 gcttctattt taggtgtgac t 4281 // ID DNA-3-3_DR repbase; DNA; ZEB; 1350 BP. XX AC . XX DT 28-OCT-2008 (Rel. 13.1, Created) DT 28-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 3-bp; KW DNA-3-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1350 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1226-1226 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1350 BP; 481 A; 205 C; 263 G; 401 T; 0 other; aggtgcgttc gacatgaagc tcagctgcag ccggtgatca acgagattag ccgagcgcag 60 gcgggggcgg agttgaaaac ggagccgtca agacggacag ctggacactt cgggactcaa 120 agttgctgca gtcatgatga ccgatcacat ggcgtcatca tatttttttt gttattttta 180 tttataacaa caaaacattt acaaataaaa cagaatacag tctctacaaa ctgtagttcc 240 tttttaaaca ataagagaga attatgaata aaatatataa caaatgcatg agagaaataa 300 gaggaaatga gaggttaata taaacataaa aacgaacagg tctattaaat acaaagcaat 360 aagcataaat tctaggagtt aaacatcaat ctttaggcta atacttcttg tttgaatatg 420 ctaaaaaagg gtttacatat atgtacattt atagatataa aaatttccaa tgtaaaaagc 480 aaaacaaggt gttttgatta ggtcttaata aatgacatta ttttggataa tgaatgaatt 540 ttcaaaaaag cattttaatc caaaaagatt gtgttgaaaa gaacaaaagg taaaataagt 600 gagctatgta gaagtttccc agaaagagca ggtagccttt aaactgctag acaggaaata 660 aaatatatat ttatattata ataataaaac cattttataa ttatttaata aaatagttct 720 ttactttaat ttgttaagag agctttaggg tggcaggatc aatgatgatg attttcttat 780 ttcaagtctg taagacttat ttgagttcat atttaggtta ttcactaaaa tatacactaa 840 aatatatcat ttaaatcgtt catggagatg aatgcagtaa atagtctgta taaaaaagtt 900 aaaaataaac ctgtgcaaac aagctaacct ttataaccag attatttaac aaaagatgat 960 tactgcagta aaatatttgt agtcttttaa taagcaagat gtgtacagga tagagagggt 1020 gtgaaaagtg aattgtttgt tttgaaactt ttgaatcgga atttgtccaa ttataaaccc 1080 gaaatacacg tgtttgttgt catgtggtga actcggccaa tcgtgtaata tcggtgtcgt 1140 catcaaagct cctgcagtcg ctcttcagaa gctgcacggt ctgagtttgc cgtctgatct 1200 cggagcgctg tcgcggtact ttgatgccac atgtgacagt cacgtggcgt cggcgcagca 1260 tcaatccgga caagatttct aaccagaatg caccgcttta agacagattt cgatcgatct 1320 gcgcagcgct gcatgaagtc gaacgcacct 1350 // ID BEL1-I_DR repbase; DNA; ZEB; 5940 BP. XX AC chr17; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL1-I_DR; KW BEL1-LTR_DR; BEL54_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5940 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1353-1353 (2008). XX DR Genome; chr17; Positions 14950837 14944898. XX CC Positions [4943-5527] - Integrase core CC 'GACAA' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 80..5890 FT /product="BEL1-I_DR_1p" FT /translation="MESTHSYKTRSYASCSRSSSSSTSAAAAMARAKAEAA FT KARLKFADEEEKIKVEKAKLEAKIDKMALERETAAAVAEAEVLEAAANTSV FT HSKHEPQLETTPADVMQRTSEYIEQMNLHASTELASQPEPLPSAETVNPHT FT ELLVTNSVPDTRLKTESCSDDAEFYHMQSRRDSVQASQFTPRRYHLHDPTD FT SEVHRRRAHFTQSHIPPVSTTSQMPQSVKDKTDISDFVRYFARRELVNTGL FT IQFNDQPESFRAWQRSFQNAVGGLDLTASEEMDLLVKWLGKESAEHAKRMR FT SVHINQPLKGLDMIWTRLEECYGAPEVIERALFRRIESFPRISSRDYSKLH FT ELSDLLMEIEAAKADGYLAGLTYLDTAKGINPIIQKLPFNLQEKWLSYGAG FT YKERYGVSFPPFEVLVDFISQQARIRNDPGFNFFTQTDSVYKTEKSSWKSN FT KSREVSVNKMNVSTPVHLDTDKDERKSVDAEKLCPIHNKPHLLRNCRSFRE FT KPLEERYAFLKEKCICFKCCIATTHIAKNCKFVTKCSECGSEKHISALHPG FT PVQKAQETVSLPLHGGEQVSTSLEEVEAHCTEVCRDDHSSKSCSKISLAKV FT YPKGQPEKAMKVYVILDEQSNRSLARSEFFDAFKENTPATPYTLRTCSGVT FT EIISRKASGYQMQSLDGKVVFSLPSLLECNNIPDNRAEIPTPSAALHHPHL FT KPIASFIPELEPDVPIMMLLGRDIIRVHKVRSQVNGPNDAPYAQKLDLGWV FT IVGNVCLNGIHKSTAVSTFFTYTKEQRPSIFDSCPNVFCVKGRKSNEQKND FT NGICSMEQLICNREHDQLGKDVFKVTCDDNKVAPSINDLTFLQIMEQGLER FT DENNSWIAPLPFKVPRCRLPNNRCQALKRLLILRRNLERNPQMKQHFFTFM FT DKIFDKKYAEEAPLLLEREESWYLPLFGVYHPKKPGNIRVVFDSSARYEDV FT SLNDVLLSGPDMNNTLVGVLMRFRKEAVAITADIEQMFYGFLVREDDRNFL FT RFLWFRNNDPSEDIVEYRMRVHVFGNSPSPAVAIYCLRQSVKDGDPDVRSF FT VNRDFYVDDGLKSLPTEKAAIDLLKNTQRILAGSNLRLHKIASNSTEVLKA FT FPSQDHASEIKDLDLGSDMLPTQRSLGLNWNLRTDTFTFKIDTQAKPFTRR FT GVLSTVNSIFDPLGFVSPVTIQGKLIMRELNNQNDDWDAPLPKEMEEKWET FT WRQSLIDLKNLNIPRCYTNISPSTATQRELFIFSDASTKAIAAVAYLRLTD FT SFGVSQLGFVMGKAKLSPEQTIPRLELCAAVLAVELAEFITSEIDMHLDDI FT AYFTDSKVVLGYIHNESRRFYVYVSNRVLRIRKSTHPSQWKYVCTTENPAD FT IATRYVTAAHLQNTDWLCGPNFLLNNPCTQEKNEFELVNPSSDAELRPEVS FT TLSTALRNKSLGAQRFSRFSTWKSSIHGTARLLHISRQFCKGSTTQTSGCK FT GWHYCLNAPTVEELLCAEKIIIKTVQQETYAQEYESLKTGNKLSKQSSLKA FT LDPFIDVDGLLRVGGRLTEAQIEGDVKNPLIIPGKHHIATLLIKHHHEKVR FT HQGRHYTEGAVRAAGYWIVGGKRKVCSVIHECVTCRKLRATPQTQKMSDLP FT ADRVSMDPPFTNVGLDVFGPWDVVARRTRGGHAQSKRWAVIFTCQSVRAVH FT IELIESLDTSSFINALRRFLAIRGPVKLIRSDRGTNFVGACRELNILSNID FT KTSVEKFLLDQSCVWQFNPAHASHMGGSWERMIGLARRILDAMLQQLGPSA FT LTHEMLSTLMAEVTAIINARPLVPVSSDPNDPFILTPSALLTQKVTIAPAP FT TFDVDKNLHIRQWRKVQHLAQTFWDKWKKQYLTLLQPRRKWPSSEPDLELG FT SIVLLKDEQQKRNEWPMGLVTQVFPSKDNKVRTVEVKVFKKDGPKMFIRPI FT TKTILLLTPEKLD" XX SQ Sequence 5940 BP; 1899 A; 1250 C; 1278 G; 1513 T; 0 other; gtgaaaaagc ggctgcaaga agagagtaca gcagcagtta tgaagattca agtaaattga 60 taagagtatc aagtgcacaa tggagtctac acactcatat aaaaccagat cgtatgcatc 120 ttgttcaaga tcgagcagct cttctacaag tgctgcagct gctatggcta gagcaaaggc 180 ggaagcagct aaagcacgtt tgaagtttgc agatgaagaa gaaaaaataa aggtggagaa 240 agccaagttg gaggcaaaaa ttgataaaat ggctttagag agagaaactg ctgcagctgt 300 agctgaagca gaagttttag aagcagcagc taacaccagt gttcatagta aacatgagcc 360 tcaactggaa accactccag ccgatgttat gcagcgaaca agtgaatata ttgaacagat 420 gaatctacat gcatctacag aattggcatc acaaccagaa ccactccctt ccgcagagac 480 agtaaatcct cacacagaat tactagttac aaattctgtt ccggatacac gtctcaagac 540 agagagttgt tcagacgatg ctgaattcta tcacatgcaa tcaaggagag attctgtaca 600 agcatcacag tttacaccaa gaaggtacca tcttcatgac ccaacagata gcgaagtaca 660 cagacggagg gcacatttca cacaaagcca cataccccct gtctcaacga caagtcaaat 720 gcctcagagt gtaaaagaca aaacagatat ttcagacttc gtcagatatt ttgctcgccg 780 tgaacttgtt aatactggat taatacagtt taatgaccag ccagaaagct tcagagcttg 840 gcaacggtct ttccaaaatg cagtaggtgg cctggatttg acagctagtg aggaaatgga 900 cttgttagtc aaatggttag gaaaggaatc agctgaacac gctaaacgca tgagatcagt 960 gcacattaat caaccactga aaggccttga catgatatgg actagactgg aggaatgtta 1020 tggtgcacca gaagtaatcg aaagagctct tttcaggcgc attgaaagtt ttccaagaat 1080 ttcaagcaga gattactcaa agcttcatga attaagtgat cttctcatgg aaatagaggc 1140 agctaaagct gatggatacc ttgcaggctt aacatatctg gacacagcaa aaggcatcaa 1200 tcccataata caaaagttgc cattcaatct tcaagaaaag tggctgtcat atggagctgg 1260 ttacaaagag agatatggtg tctctttccc cccctttgag gtcttggttg acttcataag 1320 tcagcaagca agaatcagga atgacccagg tttcaacttc tttactcaaa cagattcagt 1380 ttacaaaaca gagaaatcta gctggaagtc aaacaaatca agagaagtct cagttaataa 1440 gatgaatgtg tctacccctg tacatttaga cacagataaa gatgaaagaa aatctgtgga 1500 tgcagagaaa ctctgtccca ttcacaacaa acctcactta cttaggaact gtcgttcttt 1560 ccgtgagaaa cctttagagg aacgttatgc atttctaaaa gaaaaatgca tctgtttcaa 1620 atgttgcata gctactacac acattgctaa gaactgcaaa tttgttacga aatgttctga 1680 gtgtggaagt gagaagcata tttctgcact acaccctggt cctgttcaaa aagctcaaga 1740 aactgtatcc ttaccgctgc atggcgggga gcaagtttct acgtcactag aagaagtgga 1800 ggctcattgc actgaggttt gcagagatga ccatagcagt aagtcatgct caaaaatcag 1860 tttggcaaaa gtgtatccca aaggtcaacc tgaaaaagca atgaaagtct atgtgatcct 1920 cgatgagcaa agcaacaggt cattagctcg ctctgaattc tttgatgctt tcaaagaaaa 1980 taccccagct actccctaca ccctgagaac ttgttctgga gtgacggaaa tcattagtcg 2040 aaaagcatca ggatatcaaa tgcagtcttt ggatgggaaa gtagtctttt ctctaccaag 2100 tttgcttgaa tgtaacaaca ttcctgacaa tagggcagaa attcccacac caagtgctgc 2160 tctccatcac cctcatctga agcctatagc ttcattcata ccagaattgg agccagacgt 2220 tcccatcatg atgcttctgg gccgtgacat catcagagtt cataaagtgc gcagtcaagt 2280 taatggtccc aatgatgctc cctatgcaca gaaacttgac ttgggatggg tgatagttgg 2340 caatgtttgt ctaaatggca ttcataagtc tacagctgtg agtaccttct tcacttacac 2400 aaaggaacaa cgtccttcta tctttgattc ctgtcctaat gttttctgtg taaaagggag 2460 gaaaagtaat gagcagaaaa atgataatgg aatttgttct atggaacaac ttatctgcaa 2520 tcgcgaacat gatcagttgg gcaaggacgt attcaaggtt acttgtgacg acaataaagt 2580 ggccccatct atcaatgact tgacatttct ccaaattatg gaacaaggat tggagagaga 2640 tgaaaacaat agctggatag ctccgctgcc tttcaaagtt ccaaggtgca gactccccaa 2700 caacagatgt caagccttga aaaggttgtt aatcttgaga cgtaatctag agagaaaccc 2760 acaaatgaaa caacactttt tcacattcat ggacaaaatc ttcgataaaa aatatgcaga 2820 agaagccccc ctgctgttag aacgagaaga atcctggtat ttaccactct tcggggtgta 2880 ccaccctaaa aaaccaggaa acataagagt ggtttttgat tccagtgctc ggtatgagga 2940 tgtttcattg aatgatgttc tattgtctgg gccagatatg aacaacacgc ttgtcggtgt 3000 gctcatgcga tttcgcaagg aagccgtagc tataacagct gacattgaac aaatgttcta 3060 cggtttctta gtgagagaag acgaccgtaa cttcctaagg tttctgtggt ttcgtaataa 3120 tgacccatca gaggacattg tagagtacag aatgcgcgtc catgtatttg ggaacagccc 3180 ttcacctgct gttgccatct actgtctcag acagtctgtt aaggatggtg accctgatgt 3240 aagaagcttt gtgaatcgtg acttttatgt agatgacgga ttgaagtcac ttccgacaga 3300 aaaagcagca atagatctgc tcaaaaacac gcagagaatt cttgctggtt ccaatttacg 3360 gcttcataag atagcctcga acagcacaga agttctaaag gccttcccat ctcaagatca 3420 tgctagtgaa atcaaagact tggatcttgg ttctgacatg cttccgacac agcgcagtct 3480 gggattgaac tggaacctgc gaacggacac gtttacattc aaaatcgaca cacaagcaaa 3540 accctttact cgcagaggtg tattatcaac tgtcaatagc atctttgatc ctcttgggtt 3600 tgtttcccca gtaacaattc aaggtaaact gatcatgaga gaacttaata atcagaatga 3660 tgactgggat gcgccccttc ccaaagaaat ggaggaaaag tgggaaacgt ggcgacaatc 3720 actgattgat ttgaaaaatc taaacattcc aagatgctac acaaatatat caccttctac 3780 agccactcaa agggagttgt tcatcttctc tgatgcttca acaaaagcaa tagctgcagt 3840 tgcttacctt agattgacag attcgtttgg agtcagtcag ctcggcttcg tcatgggtaa 3900 ggcaaaactc agcccagaac aaacaattcc tcgtcttgag ttgtgtgctg cagttttggc 3960 agtggaactg gcagaattca ttacttcaga aattgacatg catttggatg acattgctta 4020 ctttacagat agtaaggtag tgctaggata tatacacaac gaatcacgac gtttctatgt 4080 atatgtcagc aatagagtcc ttcggattcg caagtctact cacccaagtc aatggaagta 4140 tgtgtgcaca acagaaaatc cagcagacat tgcaacaagg tatgttactg ctgctcatct 4200 tcaaaacact gattggcttt gtggaccaaa ctttctgtta aacaaccctt gcacacaaga 4260 aaagaacgaa tttgagcttg taaatccttc ttctgatgca gagctacgtc ctgaagtctc 4320 cacactgagt actgcattga ggaataagag tcttggagca caacgatttt ccaggttctc 4380 tacctggaaa tctagcattc atggcacagc acgccttctt catatatcac ggcaattctg 4440 taaaggttct acaactcaga catcaggctg caagggctgg cattactgtc ttaatgcgcc 4500 tacagtagag gaactcttat gtgctgaaaa aattattatt aaaacagtgc aacaagagac 4560 atatgctcaa gagtatgaga gtttgaaaac gggaaataag ctctctaagc aaagctccct 4620 caaagcttta gaccctttca tagatgtaga tggtctgcta agagtgggag gtcgattaac 4680 agaagcacag attgagggtg atgtaaaaaa tccccttatc atccctggca aacatcacat 4740 tgcaacactt ttgattaaac atcatcacga gaaagtgcga caccaggggc gacactatac 4800 ggagggtgct gtgcgagctg ctggctactg gatagtgggt ggaaagagga aggtttgcag 4860 tgtcattcac gaatgcgtta cttgccgaaa actcagagca acaccacaga ctcaaaaaat 4920 gtctgatcta ccagcagatc gtgtatccat ggatcctcca tttacaaatg tgggactaga 4980 cgtcttcgga ccatgggatg tggtggcgcg gcgaacaaga ggtggccatg cacaaagcaa 5040 aaggtgggcc gtgatattta cctgtcaaag cgtgagagct gtacacattg aacttataga 5100 atctttagat acatctagtt tcattaatgc cttgcggcgt tttcttgcta tccgagggcc 5160 tgtgaaactc attcgctcag acagaggaac aaattttgtt ggagcttgca gagagctcaa 5220 catcttgtct aacattgaca aaacctccgt agaaaagttc cttttggatc agagctgtgt 5280 gtggcaattt aacccggcac atgcttctca tatgggtggg tcctgggaga ggatgatagg 5340 ccttgctaga aggatactag atgcaatgct tcaacagctt ggaccttcag cattaacaca 5400 tgagatgctc tccactctta tggcagaagt cactgcaata ataaatgcaa ggccactcgt 5460 tccagtgtcg tctgacccaa atgatccatt catactaaca cccagtgcac tcttaaccca 5520 aaaggtcaca attgctccag ccccaacttt tgatgtggat aaaaatcttc atatcagaca 5580 gtggcgaaag gtacagcact tggcacaaac cttttgggat aaatggaaaa agcagtacct 5640 aactcttcta caaccaagaa gaaagtggcc atccagtgaa cctgatcttg aattgggaag 5700 cattgtgctt ttgaaggatg aacaacaaaa aagaaatgaa tggccaatgg gacttgtcac 5760 tcaagttttt ccaagcaaag acaacaaggt tcgcacggta gaagtcaagg tctttaaaaa 5820 ggacggacca aaaatgttca tacgccccat cacaaaaact atccttcttc ttacaccaga 5880 aaaacttgac taaatttaag gatctgtaaa tagtggcatg ttaccacgcc agacggggag 5940 // ID hAT-N50_DR repbase; DNA; ZEB; 701 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; 8 bp TSD; hAT-N50_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-701 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1319-1319 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 701 BP; 273 A; 94 C; 116 G; 217 T; 1 other; cagtgttggg ggtaacgcat tacaagtaac gcgagttacg taatataata atattacttt 60 tctaagtaac gagtaaagta acgcattact tttaaaaaat aagtaataat atttgagtta 120 cttttttgaa aatgtaacgc gagttacttt tttagtttaa ttaattagct tttaaaaaaa 180 taaattgctg aattaaaatt aaagtagtca caatgaatca cgcagtcaat gagagaatgt 240 gttcattatt ttgaacgcat cgaagaaaag gaaaagggga ttattgtctc aatggacagt 300 gattttactt aagacaaata gtacaaaagt agcaaacaca ttgctttaaa ctttcaataa 360 tctgtatata cggcatgtat agctctgggg gtcaggaagt tctcagataa cattatccta 420 aaatattatt ttaatntttt tataggtaaa tgtaggtgta ggttatacag tgcgttgtta 480 attgcagagc tcctctattt aaaaaaaaaa aaaaaaaaaa acaagttctg aaagagatca 540 agcctcagcc aggtaataaa aagtaacgca aaagtaactc aaaagtaacg taacgcatta 600 cttaccataa aaagtaacta agtaacgcaa ctagttactt ttttggggag taactcaata 660 ttgtaatgca ttactttcaa aagtaacttt ccccaacact g 701 // ID MSAT-4_DR repbase; DNA; ZEB; 231 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Minisatellite-like DNA - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-231 RA Jurka J.; RT "Minisatellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2011-2011 (2010). XX DR [1] (Consensus) XX SQ Sequence 231 BP; 55 A; 33 C; 77 G; 66 T; 0 other; gtgagagact agtgatgtcc tgtgagagac tagtgatgtc ctgtgagaga ctagtgatgt 60 cctgtgagag actagtgatg tcctgtgaga gactagtgat gtcctgtgag agactagtga 120 tgtcctgtga gagactagtg atgtcctgtg agagactagt gatgtcctgt gagagactag 180 tgatgtcctg tgagagacta gtgatgtcct gtgagagact agtgatgtcc t 231 // ID Mariner-N7_DR repbase; DNA; ZEB; 261 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DNA transposons from zebrafish. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TSD TA; mariner; hairpin; Mariner-N7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-261 RA Bao W. and Jurka J.; RT "Mariner transposons from zebrafish."; RL Repbase Reports 8(10), 1610-1610 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 261 BP; 85 A; 50 C; 47 G; 79 T; 0 other; cactctcaga aataaaggta cgcgagctgt cactggggtg gtaccttttc aaaaggtaca 60 aatttgtacc taaaaggtcc atattaatac ctcaagggta catattagta cctaaaaagt 120 acaaaagtgt tcctcttaaa atttttaggt actataatat atacttttga ggtaccaata 180 tggacccttt aagtacaaat gtgtaccttt tgaaaaggta ccaccccagt gacagctcgc 240 gtacctttat ttctgagagt g 261 // ID Gypsy98-LTR_Dr repbase; DNA; ZEB; 1784 BP. XX AC . XX DT 19-APR-2009 (Rel. 14.05, Created) DT 19-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-98_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW LTR; Gypsy-98-LTR_DR; Gypsy-98_DR; Gypsy-98-I_DR; Gypsy98-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1784 RA Dib M.R. and Naveira H.F.; RT "Gypsy98_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 961-961 (2009). XX DR [1] (Consensus) XX CC Gypsy98-LTR_DR is a long terminal repeat of the Gypsy98_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal CC portion is deposited in Repbase as Gypsy98-I_DR. XX SQ Sequence 1784 BP; 346 A; 381 C; 380 G; 677 T; 0 other; tgtgaccgtg tataggataa gtggtagtga ttgatgttga taacaaccaa tcaggggcca 60 gtgctgccta ttttagagcc ggttggctgt gttaggtgat gtgtcacctc cagccccgcc 120 tttcttgttc cgcgtttgtt gacatctgtt tcaggtatgt tagcggttgt gtgtgtgtgt 180 gtgtgtctct ctctctctct ctctctctct ctctctctct ctctttcgat taacgtttgg 240 attaatcatc gtggtttcat ttttagcctg gtttttcgtt ggttttcctg ttggttgcca 300 tggagttaat gactgccacc cattcagctc gcttgtagca taaatggaga tgactaattc 360 gtaatacatg agtgtactac tcccggcttg gtaagtttct tatgtagaga ttaaagggga 420 atgctagcgt gtgatttttg caaacagtaa ccgtttttgc atgtgcatat ctcttgtagg 480 attctccctc tctgtcctgg gcgattataa ataaagatct agcatctgta acattactcc 540 ggttgtgttt gaaagattct gctatcctga gatctctagt ggaaagaact actactgttg 600 tttactactg ctgcctgctt ctactatctg ctgctgctac atttactgct gcagttacta 660 ctagctgcta ctgctacccg ctactactat ctgctgctgc tacattcact gctgcagtta 720 ctactagctg ctactgctac ccgctactac tatctgctgc tgctacattc actgctgcag 780 ttactactag ctgctactgc tacccgttac tactatctgc tgctgctaca tttactgctg 840 cagttactac tagctgctac tgctatccgc tactactatc tgcctgctgc tactctactg 900 ttgcagttac tgctaactgc tacgactaac tgttacctgc tctatttact gctgttgtta 960 ttacgacctg ctactactat ctcctattgc tactgttata taatttgtga tttttgttct 1020 gtttggttgt gtgattcggt gtgcaaaaga gtgagttttg tgttctgtgt gaatcttgct 1080 tttctatagc gaaactacag aagctgtatt tggtttttct tttcttgtgt gtggtttgtt 1140 gttttgtttt actcttgtat aatttatccc gtatattgat tttgattttc cttgcaggaa 1200 ctgaggtccc atgggtgcag gatcttgagg tggtggtggt gtcactgcac ttgacatttt 1260 tgctgtggac ggtttgggaa gtgtggagtg tgtggatcta gtgtggtgtg tgtgtgtgga 1320 agtgtgcttt ttgaacacaa cggagcactg gtgatttccc cccccccccc cccccccccc 1380 ccccccatgt gtgacctcag ccctgtaagt tagtttattc aatttatttc cccggttaat 1440 ttcattttca tatttacttt gtaataacgt tgttgttgtt gtcttctata aatttgtttt 1500 cttttttttt gttttatttt actgcagttg tgttgggagg ggggtgctgg caaaatgtta 1560 agagaatgta atgtgaccat tttaatctgc cacaaattct gaaaataaat tttgtacttt 1620 tatactatac ccacgtctct tgcctatcca tatgagaact gaacctgtgt ggcctaagag 1680 taaggtttaa gccaatttct atttcctggg agaaaacctc tcaggtggcg tagtcggatt 1740 tgctttgtgg ataaattgtt aatccctact cttccctcct gcca 1784 // ID TDR8 repbase; DNA; ZEB; 1155 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; TDR8. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1155 RA Jurka J. and Drazkiewicz A.; RT "TDR8: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 36-36 (2002). XX DR [1] (Consensus) XX CC Putative transposon. XX SQ Sequence 1155 BP; 343 A; 232 C; 248 G; 323 T; 9 other; ccgtttacac taatgcgttt tagtttgaaa acgcataagt tttgctacgg ttacgccatc 60 cgtccacact acgccggagt tttcgagcgc cgaaaacgga gcgttttgaa aacgctggag 120 aggccgtttt cattctaaaa cgctgctgct ccgtctcagt gtggatgggg gaaaacggag 180 acatctgaaa acggaggcgg ggctgcagac attcgcctct ctgattgggg cttttcctca 240 atattaagta gcctacacac agttcagtcc tgcatcctct ccttgtaagt tcagacttcg 300 caagtttgat caaggctgca ktctcttctt tctcagtttg atatggaaaa cataccgagg 360 acacgggtaa atcttcaaag ggaacagtgt actttataac ttcattcaca tcaccctggc 420 tacgttgttt cactttctca acaataaaat gaaaacatga tttaaggaac tgcctatttt 480 cattttaata ttagcaactt aacagacagc agaaatgttg aggcgtcgtg ctgcatatat 540 gagcgtcatc ttcactgtgt ggatatttat aacaaaacgg agccnataac aactgcctcc 600 tttcaatttc agtgaaaata cgaaacatac cctctctttt gctgaatatc agttttaata 660 atcgataatg gccattataa aagtataaca tacaataagt ttatacatta taggaaataa 720 aggcaagcga tcagtcaata tacagaatnn nnnangtacg tggttacatt aattattaac 780 ttatctttgc gctcagccaa aacacgttac ctgagaacaa gtaacttaat agattccaat 840 gaccaaagtc agggaattgg cctatgtcgt tagataaaga caacaagatg aatgaaatat 900 cacgtttaay aaatatagtg agattagatc cagcgggaga tgcttgatga gcagtccgac 960 tagcagagct ctcatctggg tagataggct gcagcgcttg cccgagagtg tgtgtgtggt 1020 cacgtgatgt gcgttttcag cgttttggtg tggacggaga gctgttcaga aacgctgggt 1080 aaaacgcgag tgtggacgcg gatcgttttc attctacaac gccgttttaa aactaaaacg 1140 cactagtgta aacgg 1155 // ID Gypsy146-I_DR repbase; DNA; ZEB; 5735 BP. XX AC chr25; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy146-I_DR; KW Gypsy146-LTR_DR; Gypsy146_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5735 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1553-1553 (2008). XX DR Genome; chr25; Positions 21227159 21221425. XX CC Positions [2524-3027] - Reverse transcriptase CC Positions [4381-4752] - Integrase core CC 'TATA' target site duplication CC LTRs are 96% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 4688..5665 FT /product="Gypsy146-I_DR_3p" FT /translation="MHNASGSIEHFTTKKRWPEHLSELTYAYNVTPHATTG FT YSPYFLLFGVEPRLPVDALLGHEQSVEGRQDWLAIHQNRLKEAHERAKVYA FT EQKASERLAPLNERVYCPTVNVGQTVYLRHRPAGRNKIQDAWAPTVYRVVE FT VQDTTYTVEPLESGPCRRVHRVDLRPCVGPVVETVVSESGLTPPLTHSPLV FT KVKDDVDPECVVLEEVTFPGRQEPRNIDVGVSRSVEESFPVESDLVRDDEE FT QQSSHPEQVTDECVDPLVEEHVSKKPVPAPRRTSRVNAGVHSNPFNAPRSA FT CNSVSLSPKVFSQVLTTLGSVFFREAVKEVKNMY" FT CDS join(1402..3114,3118..4752) FT /product="Gypsy146-I_DR_1p" FT /translation="MSQAMRGQSIGSTHMNTDRDRLLRHAVGKCPVVDLKI FT GGVSVSCLLDTGSQVSTVTEHFFKKHLAGETEDMLSTPGWLKITAANGLGI FT PYLGYLELPVEAMGVSIPECGFLVVRDTQSSSAVPALIGMNILGRCRQLVH FT VEFDTTLGGELQSDWREVFQQMQSASCFEKRVAARVAGKDGVHIPAWSVST FT LMVKRKKCKLTEGSANLLLEPGNSPLPEGLIVVPSLVSMERPLFPVQVVNF FT STVDKWLQPRTRLGTLCSVESLSPTSGGEVKFQRISAGVEEITVNREENQA FT LTNTLPDFLKKMQLGGTSEQQTELKDLLLRYVDVFALDDEELGFSDKVQHE FT INLVDDVPIFLPYRSVPPTQYGEVREHITKLLKKGVIKPSTSAYASPIVLV FT RKTDGNLRLCVDYRQLNSKTKRDAFPLPRIDECFDALQGAKFFSTLDLASG FT YHQVAVREKDQPKTAFTTPFGIFEYSQMPFGVCNGPSTFQRLMQSTMGDLI FT FQIMLVYLDDILVYSSSFHEHLQRLEVVLKRLKEIGLKVKIEKCCFLQNEV FT KFLGHHISGHGIGTDPEKVEAVRSKTPCTVKELRSFLGFCSYYRKFIEGFS FT KIAGPLHDLVNFCLRKLGMVRRPEFCSLWSTECHAAFELLKDRLISAPIIG FT YADFTLPFIVETDASSEGLGAILYQQQGDSRRVISYASRRLRNAEKNDRNY FT SSMKLELLALKWAVSEKFRGYLLGSKFVVVTDNNPLCHLRTAKLGAVEQRW FT VAQLAVFDFEVKYRPGRHNAAADALSRQPLAGEPANSEDTEYDDCITICNV FT INRGTPLDLELVVAGEQCCKVRQIRALECGTRGEGADTQGGTFTLPGYTRA FT QLKSFQDRDLTLSSFREFWNYKRKPTRQERSDLPKPVLSLLKQWKHLRESD FT GLLYRVVDNPRFGECQQLLLPACLKEQVLESVHNKMGHQGIERTLHLLRQR FT CFWVGMYEDVEGWIKHCQRCVLTKMPQPRIQPPMKSFLASRPLEVVAVDYT FT MLEPASDGCENVLIVTNVFTKFTQAFPTRDQRADTTAKVLLKEWFMNYGVP FT QRLHSDQGRNFESEVVAELCRLYGVKKTRTTPYHPQGNAQCERFNRTLHDE FT KALARTPL" XX SQ Sequence 5735 BP; 1607 A; 1104 C; 1449 G; 1575 T; 0 other; tttttggtac caggagtggg gtatctgact gttgggagct tattttttta ctgaaatacg 60 gcagtggcag gttcgagtgc gtcagtaacc aggaattgga agcagtcaac gttgctgagg 120 gtgtccaggc tggtgtcgtc ggctgaggtg tttgtggcca agggacggga tcacttgacg 180 agaagtggac atcgccatct gcagctaaaa ttcaagagac tgctgtatca ttatacagac 240 tttgagatgc atatcctttg gaaaaaaaaa cgtcttgcat gtactttgtg cttttaataa 300 ggccgtcatt ttaatggagt atgtcacatg ttgatgaaga gggggtagaa ctacagggtg 360 ctgtcggtgg agagggtgta catcagggta tggatgagct ccagggacaa atccaagagc 420 tcagtagaaa gcatgctgaa gccatggcag ctatttccaa tctaaataga gagccaacta 480 ggtcttacct gtatgtgccc agagagcgtc aaattcagcc ttttagtggt gatttcgaca 540 aagatggtag aagtgttaat gagtttatag aagaagtaga aagagtattg cttgtacgaa 600 atcagagccc agaagaccta actgactttg ttctttcctt actaaagggt gctgcgttgg 660 aagaggttag gttaaggaaa gagggccagt ctccacaaac agcaggtatt tttcaatacc 720 tgcgagaggc ctttagagaa aaacgtagtt ctgcacagtt attgcagacc ttctataccc 780 gcaaacagtt agaaggtgaa gacctttgtg catactcgta tgccttgtcg caaatattga 840 attctgtgtt gaaacagtca cctgatgcta ttgcaaatat gaatgttact ctcagagacc 900 aattcattga gggattgcga gatgcctcac tcaggagaga acttcgtaag ttggtgaggg 960 acaaaccaca gtcaactttg aagttaggaa tgaagctcta atgtggtcgc ttgaagatcc 1020 caaattgcga gcgagtagag tagccattaa ccgtaatgta aactctgaag ctacagaagt 1080 ccaatgtgca gccttggcaa tgcctgcgaa gagctctatt acccttgatg acattctaaa 1140 agtagttgca gaacaaggta aagccatcgg cgagctaact caagctgttc agaaatttac 1200 cctgcaatgt ggtaaacccg aatcagagaa tcgtaataaa tcaaaaatgc agcctaggtt 1260 tacagatgac ggccagccta tttgttttaa atgtaatggg gtgggacaca tagccaaaag 1320 ctgtgaaaga aaggccaaac cagcagtcta ggaggatttt tcatcctctt cgccacaggg 1380 aaatgcccac cctcggtcgc tatgagccag gcgatgcgag ggcagtctat tggctccact 1440 cacatgaata cagatcgtga ccgattactc cggcatgcag tggggaaatg tcctgtagtt 1500 gacctaaaga ttggaggtgt gagtgtttca tgtttacttg acaccggaag ccaagtcagt 1560 acggttacag aacatttctt taaaaagcat ttagcaggtg agactgaaga tatgctttcc 1620 acacctggct ggctaaaaat tacagcggcc aatgggttag gcatccctta ccttggttat 1680 cttgagctac cagttgaagc tatgggtgtc agcatacccg aatgtggctt cttggtggtt 1740 cgggataccc aaagctcttc tgctgtgcca gctctcattg gcatgaatat tttaggtcga 1800 tgtaggcaat tagttcatgt tgagtttgat actactctcg gtggagagtt gcaatctgat 1860 tggcgggaag tttttcagca gatgcagagt gccagttgct ttgagaaacg cgttgctgca 1920 cgggtagctg ggaaagacgg ggtgcatatt cctgcttggt ccgtttcaac tttgatggtc 1980 aaaagaaaaa agtgcaagtt aactgagggt tctgcaaacc tgttgctaga gcctgggaat 2040 tcacctttgc ctgagggctt aatagtggta ccttctcttg tttccatgga aaggcctttg 2100 tttccagtac aggttgtaaa tttttccaca gtggataaat ggttgcagcc tcgtacccgg 2160 ctgggtacgc tctgttcagt tgaatcctta agtccaacga gtggtggtga agttaagttc 2220 cagaggatat cagccggtgt agaagagata actgtaaaca gagaagagaa tcaagcattg 2280 acaaatactt taccagactt tcttaaaaag atgcagcttg gtggtacgtc tgagcagcag 2340 acagaactta aagatttgtt gttaaggtat gttgatgttt ttgcactcga tgatgaagag 2400 ctagggtttt ctgataaagt gcagcacgag atcaatctgg tagatgatgt accaattttc 2460 ctgccctatc gcagtgtccc accaacccag tatggtgaag tccgggagca tatcactaaa 2520 cttttaaaaa aaggagtcat taaacccagc actagtgctt atgcttcacc catagtgctc 2580 gtgagaaaga cagatggaaa tcttcgattg tgtgtcgatt acagacaatt gaattcaaag 2640 acaaagcgag atgcatttcc attgcctcgc attgatgaat gttttgatgc gttacaaggt 2700 gccaagttct tttcaactct tgatttggct agtgggtacc atcaggtggc cgtgcgtgaa 2760 aaagaccaac caaagacagc atttactacc ccatttggta tatttgagta ttctcaaatg 2820 cctttcgggg tttgtaatgg accgtctaca ttccagcgtc ttatgcaatc tactatggga 2880 gatttgattt ttcagattat gctggtctat ctagatgata tccttgttta ttcatcttct 2940 ttccacgaac atctgcagag gctggaagtt gtgttgaaaa ggctgaaaga aatagggctc 3000 aaggtgaaga ttgaaaaatg ctgtttcctc caaaatgagg taaagttcct gggtcatcac 3060 atttcaggtc atggcattgg aactgaccct gagaaggttg aggctgtaag aagttgaaaa 3120 actccttgta cagtaaaaga attgagatca tttctaggct tctgtagcta ctataggaag 3180 tttattgaag gattttccaa aattgctggc ccacttcatg atctagtgaa cttttgtctg 3240 agaaagctgg gcatggtgag aaggcctgag ttttgttcac tgtggtctac agagtgtcat 3300 gcagcctttg agcttttaaa ggataggctt atatcagctc caattatagg ctatgcagac 3360 tttactctgc cgtttatagt agaaacagat gccagtagtg agggacttgg tgccatactc 3420 tatcagcaac aaggtgacag tagaagagtg atttcctatg ctagtagacg actgcgcaat 3480 gcggaaaaga atgaccggaa ctatagtagc atgaagcttg agcttctagc tcttaaatgg 3540 gctgtttcag aaaagttccg gggatatttg cttggatcaa aatttgtagt tgttacagat 3600 aataacccac tttgtcatct gagaacagca aaattgggtg cagtcgaaca aaggtgggtg 3660 gcccaattag ctgtttttga ttttgaggtc aaatatcggc ctggccggca taatgctgct 3720 gctgatgccc tctctaggca gccattagca ggggagcctg ctaattctga agacactgaa 3780 tatgatgatt gcattaccat ttgcaatgtg atcaacagag gtacacctct agatctggaa 3840 ttagtggtag ctggtgaaca gtgctgtaag gttagacaga ttcgtgcctt ggagtgtggg 3900 accagagggg agggcgctga tactcaaggt ggtacattta ctcttccagg ctacaccaga 3960 gctcagttaa agagtttcca ggatagagat ttgactctta gctcttttag ggagttttgg 4020 aattataagc gaaagcccac aagacaggag agaagcgacc tgccaaagcc tgtactgtca 4080 ttgttaaagc aatggaaaca ccttagagag tccgatgggt tgttgtatcg tgtggtggat 4140 aatcctcgtt ttggtgagtg tcagcagttg cttttgccag cctgtctaaa agagcaagtc 4200 cttgaaagtg tccataataa aatgggacac caaggcattg agcgtaccct ccacctctta 4260 aggcaaaggt gtttctgggt tggcatgtat gaagatgtgg aagggtggat taaacattgt 4320 caacgttgtg ttctaactaa aatgccccag ccacgaattc aacctccgat gaagtcgttt 4380 ttagcttcaa ggcctttgga agtggtagcg gtagattata ccatgcttga acccgcctca 4440 gatggttgcg agaatgtatt gattgtaact aatgtgttta ctaaatttac tcaggcattc 4500 cccaccagag atcagcgagc tgacactaca gccaaggttt tactcaaaga gtggttcatg 4560 aattatgggg tgccacaaag gttacattcg gatcagggaa ggaattttga aagtgaagtt 4620 gttgctgagc tgtgtcgact gtatggtgtg aagaaaacgc gcaccactcc ctaccatccg 4680 caaggcaatg cacaatgcga gcggttcaat agaacacttc acgacgaaaa agcgttggcc 4740 agaacacctc tctgagttga cttatgccta caatgtaact ccacatgcca cgactgggta 4800 ctcaccctat ttcttgctct ttggtgtaga accgcgtctt cctgttgatg ctcttctagg 4860 ccatgaacaa tctgtagagg gaaggcagga ttggttagcc attcatcaaa accgactcaa 4920 agaggctcat gaaagagcaa aagtgtatgc cgaacaaaag gcttctgaaa gacttgctcc 4980 cttaaatgag agggtttact gtcccacagt taatgtaggt caaacagtgt acttgcgtca 5040 ccgaccagct ggccgaaaca agatccaaga tgcgtgggct cctactgtgt atagagttgt 5100 agaagtacag gacaccacat atacagttga acctcttgag agtggtcctt gtaggagagt 5160 gcacagagtt gatttacgcc cttgtgttgg tccggttgtt gagacggtgg tatcggagag 5220 tggtttgaca cctccattaa cacactcccc acttgtaaag gtgaaagatg atgtagaccc 5280 tgagtgtgta gtattggaag aagtaacatt tcctggtcgt caggaaccta gaaatataga 5340 tgtgggagtc tccaggtcag tagaggaatc atttccggtg gaatcagatc ttgttcggga 5400 tgatgaagaa cagcaatctt ctcatccaga gcaagtaaca gatgagtgcg ttgacccatt 5460 ggtggaggaa catgtgtcta aaaaacctgt cccagcacca agaagaacta gtcgagtcaa 5520 tgctggtgtg cattccaacc catttaatgc accaagatca gcttgtaact ctgtttcact 5580 cagtcctaaa gtgttttccc aggttttgac aacccttggt agtgtatttt tcagagaagc 5640 ggtgaaagaa gttaagaata tgtattgaaa tcattgagga caatgagttc ttaacagggg 5700 agagagtggc cagagggata aatccagttc ttata 5735 // ID Gypsy77-LTR_DR repbase; DNA; ZEB; 1147 BP. XX AC chr12; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy77-I_DR; KW Gypsy77-LTR_DR; Gypsy77_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1147 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1470-1470 (2008). XX DR Genome; chr12; Positions 11510647 11509501. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1147 BP; 290 A; 180 C; 277 G; 400 T; 0 other; tgtggcgaat aaaatgctct cccattttct caaagtgcca caaggaggcg ctgctatgtt 60 ctgccctctg tcgagcttcc aaaaccttac ttccggggtc gggaggaagt ggaaggtcag 120 ctacacaata gaagaccata aaagggggtg ttgagggttg ttccgggtgg ctgttgttgt 180 tgttggagtt ggagttgatg ggttttttat tcgtgtgttg gtggtgaaat cgtggggaga 240 gaaaaagacg agaaacaaat gtgagaaaaa ggtgaagagc tgttggcttt gaagatggag 300 gtgtggcggt aaatattgtg gtgtgaaagt gatggtggac aggctgacca gaaaaactac 360 acgagcgatt aagtgcgagg ggtctgcagg atcttatggg gatcgaaagc caaggtggac 420 ctgtacagaa tcagcgttgt ttgccgggtc agttatttag gtcattcaca ttcattcata 480 cacccataca atccacaccc attgctactc aatttcacaa accttcttta catttctctt 540 tctctctggt ttccgttacc tcagccacac gttattattg tttgtttgat tgattgtttg 600 tttgatgttt tctttctttt attttattga cttatctaag cagtgatttc tttttttgtt 660 gttgttaaaa atcgttcaac caagattgaa caggggtaaa aagaattggt tattatttag 720 aaaatatatt tgttgtgctg tgtgataata atacaacagt tgatttcatt taaagtgtgt 780 tgcgttgtta ttatttatcc ttttcttttg cattgtgtat ggtggttaaa tgccggctgt 840 agctaaccag ggtattggta aatgtttttc tttttggcaa gtaaatcttg tctggcgccc 900 aaagtttaat tacatatatt gataactaat tttttttttg tttgaggtct gctgggagtg 960 agattaaggg ccgggaaaat ctaaaacttc ctagtggtca cgtggactag cggttgagcg 1020 ttggactgtg gagcagacgg gcccgggttc taatcccact aaaactggtt tttgaaattt 1080 gaaatttttt tttttccttc attcaaatta tatttaaact gaattattat taccgaccac 1140 cgttaca 1147 // ID Gypsy9-I_DR repbase; DNA; ZEB; 6027 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE An internal portions of the Gypsy9_DR LTR retrotransposon - a DE fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy9-I_DR; Gypsy9-LTR_DR; Gypsy9_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6027 RA Kapitonov V.V. and Jurka J.; RT "Gypsy9_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 323-323 (2004). XX DR [1] (Consensus) XX SQ Sequence 6027 BP; 1629 A; 1096 C; 1615 G; 1687 T; 0 other; attggaggca ccgctgggat actcttttcc cctgtgttta tattttttgt tagttagatt 60 gaacatggac cttttgcagg ttgtggagtt gtggagtcaa cagataaatc tcgatcttgg 120 taaatatttt attttgacca aagtgccaat tgatgtgagc gaggatataa ttttagaagt 180 gttaaacgca gtaaaagtgt ttggatgtgc taaaatacgg gaccgtcgtg gggacaacac 240 tggtttgcag atggttgagt tagtggagac cagtttggag attgacactg acactgtttc 300 acctgaagtg ggtattcctg gtgaggtggg accatgggga gtacatgtcc catccgtgga 360 agttgttccc gagggggttg taccagatgg ggattttcaa gagaaattaa ttagttggct 420 gagtcaggag ggtaagtctt tagaagacgt gaagccgatt ctcagtactg acaagatgtc 480 taatgtaggt ggagacttgg tacaggctct tgatcggctg gtagaaaggt gtaaccaggc 540 acctgttgag tctctgagtt atcgcaaatt gcgggtgttt tctggccaac aacctattcc 600 ccctggggag gaggattatg actcctggat ggaacaggcc acccaaatgg ttagtgagtg 660 gcagtgtaca gagtctagca aaaaacaacg aattgtggaa agcttgaggg ggtcagcagc 720 tgacatcatt cgatttgtga aaatgggtag ttcagatgtc actgccatgg actatttaaa 780 ggctttagac acagcttatg gcagtacaga aagtgagact gacctaactg tgaagttcag 840 ttgtacgtac caagagccag gcgagaagct ctcaagttat ttgtatcggc tggataagat 900 cctccatcga ctctttttga aagggggtat tagactagaa gatcttaatc gaaaacggat 960 ggaacaggtt gttaaaggtg cactcacctc tgatccagta gctctgagac tcagaatgat 1020 gtctagttta cgtgatccac ttgggttttc tcagctacta cgtgaggtta gagaggagga 1080 aaattggatc agtgcccgga gcagtgctaa tcctgtagta agttcaaaac ctgctgtagt 1140 tgtgcctagg ggggctgcct cggaggaaat agagtgcttg aaaagagacg taaaggaatt 1200 aaccacacaa gtagggaagt tgttgaaaac tgtaacgtct tctcctacag tggatgaaac 1260 ttcacaggtc cattgtatgg gccccgcaaa ggcttctggt ttttctggct tggccgcaaa 1320 caaggggtcc aatactgttt ctagagttgg gattttctgt tataagtgtg gcgaagatgg 1380 tcatactaga agggaatgtc aaggggtaga ggatctgcgc aaggtaaacc agaaactgat 1440 tagacagagt aggcagtcgg gaaactacgg tggaactctg tgaaggaacg gcccagagtt 1500 ccaatttcaa atcgttcacg attgtttttc ttttatgagt acagatgatt ttggacttcc 1560 agtcgggctg ataggaccct ctttagatgt tcctgtacaa attgaaggga tttatgctcg 1620 ggccctgttg gacagtggtt cacaggtgac catcttgtat cgtagttttt atcaaactta 1680 tttgaagcac ttaccagttc aaccagttaa agatttggaa atctggggtt taagcttgca 1740 gaagtatcca tatgatggat atttgcctct tcgtctggag ttcacagctg ctgtagctgg 1800 agtgcctcag gtggttgata ctcttgcact tgtctgtcca gatcctcaac ctgcacagaa 1860 ggtagccctt ttggtgggca caaatactgt tcgacagctt tttgaaacat gtaaggcaga 1920 agcaggtgat aattttatga gtgttctctc cgtgcatccg gtggtgcgag atgcatatga 1980 gaaaatccaa gaagtggaga ttggagagga aatttgtggt ccgcagggta ccatttggtt 2040 tcatcagtgc cggtcagtat cactggaacc tggggaaact ggtcaactag agggaaggcc 2100 tacatttcct tatgatttag aagacgaaca tgtgttaatt gatcagcctg ttgatactat 2160 ggatctggga gctttgatgg ttagacctga ggtacgaccc atttctatga tgcgaaatca 2220 cttagtgaca gttacagtca aaaatatgtc tgatcatagc attaatgtga cccggggcac 2280 cccaattgct caggtattcc ctgtagatgt ggtatccccg attattgctg agagagaagt 2340 gacagattca aaccaaatca ctatggactc tttcaacttt ggagattctc cactttcatt 2400 ggaggcccgg cgagagctat gtgagaaact tctagagcgg aaagatgtgt tttctactca 2460 tgaacgggaa gtaggttgtt cgtgcagtac acttcatgaa atcaggttga ccgattccac 2520 cccttttagg gagagatctc ggcgtttggc ccccaaggac ttggaagatg tgcggaaaca 2580 cttatttgaa ttaaaacagt gtggaataat ttcagagtct aggagtcctt atgcatcccc 2640 tattgtggtc gtaagaaaaa aatcaggaag tgttcgaatg tgtgtagatt accgcacctt 2700 aaatcgcaga actattcctg atcagtatac tgtgcctcgg attgaggatg ctttgcacag 2760 cctctcaggt agccagtggt ttactgtgtt agacttaagg agtgggtatt accagatacc 2820 gatgaaccct gttgacaaag agaaaacagc attcatttgt cctcttgggt tctatgagtt 2880 tgaccgaatg cctcagggca tctgtggtgc accggctacc tttcagcgac tcatggaacg 2940 cgctgtgggt gacatgaatt tcctggaggt gttagtgtat ctggatgatc tgatagtgtt 3000 cggtcgaacc atagaagagc atcatgagag gttgttaaaa gtgttggata ggctgcgtga 3060 ggagggactg aaactgtctg tggacaaatg tcaattctgt cgaacctctg taacttatgt 3120 agggcatgtg gtgtctaaag atggtatttc cactgaccct tcaaaggttg aggcagttgt 3180 taattggcct agacctcaga ctttgacgca actgcgatca tttctgggat tttgtggcta 3240 ttacagaagg tttgtgaggg gatattcaag tctttgccgt ccattgaata aattgttggg 3300 tgaggaagtt tcacaggaac aagaaaaaaa actatcccgg agcttagcga aggaggtttt 3360 tggagccagg tgggatgagt catgtgagag tgcattccag gaattgaagc agaggttgac 3420 tcaagctcca gtgctggcat ttgcagatcc tcagcagtca tatgttttac atgtggatgc 3480 tagtttagat ggtttggggg gtgttctcta tcaaccatac ccggagggtc tgcgtcctgt 3540 agcgttcatc agccgaagtt tgtccccagc tgagagaaat tatccagctc acaagctgga 3600 atttttagct ttaaaatggg ctgtggtaga caggctgcac gaatacttgt atggggttac 3660 ctttgaggtg aggacgcaat aatcccctta cataccttct aacatcagcc aaattggatg 3720 ctgctggcca tcgatggtta tcagccttgt ccacttacag tttcagcatc aagtaccgaa 3780 gtgggaaaaa aatgtagatg ctgattctct atcgcggcgg ccttattcct ctgagactga 3840 aggaggaaag tgggaagaga ttcctgagac tggggtgcgg tctttgtgtc aggcagttgc 3900 ttcaggggag aattcaagtg ttaaggagtg cccttttcag agtttttcag aaagtggtga 3960 gattagagct cctgtaaaga atgtgtttgt ttctgcaacc cagctgaaag attccaatga 4020 ggagttgtgg gaggctcagg gccaggatcc ttatttgaat ataattctct ctgctataag 4080 gaaaaaccag tctcatgaca acgtgaaatg tgagttgccc gaatttgctt gttggcggag 4140 agagtggagt aatttggcag tgtcaaatgg attgttgttt agagtgacca catccgaggg 4200 ggaacttaat cgacagatag tgctcccgtc tgcccttcag agtgatacgc tggcctctct 4260 gcatgattct catggccacc tgggttttga taaaacatat gcactcgtaa aggacaggtt 4320 ttattggcca ggtatgaaag atgctgtgaa acaatactgt aagagttgcg aacgatgtct 4380 ggtgagaaag actttgcctc gcaagacagc gtctatgcag cacatgcaaa gctcaggacc 4440 gatggatctc gtgtgcattg actttctgtc tattgagcca gacaggcata acgtgggtaa 4500 tgtgttagtt gttacttacc attttactcg ttatgcacag gcttatccta cgcgggacca 4560 gaaggccttg acagtagcta aaatcctgtg tgaaaagtac tttgtgcatt atggactacc 4620 aaatcggatc cattcggatc aaggacctga ctttgagagt cacctggtga aagatgtgct 4680 tggtgtgtta ggggttaaaa agtcccgtac cactccgtac caccctcagg gtgatcctca 4740 accagaaagg tttaatcgga cacttctcaa tatgttgggc acgttagaat ccacaaagaa 4800 agctcagtgg agtcagtaca tttcgcagtt agtacatgca tataattgtt cacaaaacga 4860 ggccacgggt tactctccat attttctcat gtttggacgt gagcccactc tgcctgtgga 4920 tgtctgttat ggaacaaatg aaatgggtgg gtccatgtca cctagtgtgt atgtacaaaa 4980 tttgaggaag aatctgagaa ttgcttttga gttggctact gaggttgcta aaaaaaagaa 5040 cctttctaat aaaagaaggt atgatgagaa agtgcaggaa tgtgtcttaa accctggtga 5100 cagggtgctc atccgaaact tgggcttgag agggaagcac aaattggctg atcgctggag 5160 ttcagtggtt tacttggtag ataagcgatt ggaaaatctg ccagtatatc agctgactcc 5220 agaaaatggt gttggacctt ctaaggtttt gcataaagat catattcttc ctgtgagctc 5280 gaaagtgagg tttggggagg agagtaaggt agtcatggtt gatgatgtgc ctaagaagaa 5340 tagattccct aggagaaact tgagaacttc acataatatt gaggttaaag agaaagggtc 5400 atttcaagtt gagggagaaa cagatacgga ctcagaatct gaatatgatt gtgccccctt 5460 ccggtattta gatttttcaa tgtttgacgg ggaggaaatt gggaaagata aaggacccga 5520 aatgaacaat gatgtcgggt tactaacaga taaatgtctg gtgacacccc agttggatca 5580 gggtgatggg aattcggtgg gtaacctgga agaggaagca gttacctcag accctttgga 5640 tgtggatagt aatactgact gtcaagggca ggaaaaggaa gttagtgaaa gtgttgatac 5700 tctgcctgaa atttcattga ggaggtctga gagggagcgt aaaccaccgg caaggtttac 5760 ctacccaaat ttaggtattc ctactgagga gaggatggtc acaacaaaag tgtgtggggt 5820 gctcctaaac cacatgcaaa ctcctaacca atgtcacaaa catgacctac atcataattg 5880 gtggtgtaat tctcatgccc tttgccagtc atgttgggta cacatcagta cattgccatg 5940 tgtagtccct agagtgtaaa ggggaatttc atgcttaatg acatatatgt agcatgggga 6000 catgctggtt ttcatgtggg gagagag 6027 // ID Gypsy-18-I_DR repbase; DNA; ZEB; 6801 BP. XX AC chr10; XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-18_DR LTR retrotransposon - a DE fossilized sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-18-I_DR; Gypsy-18-LTR_DR; Gypsy-18_DR; KW endogenous retrovirus; gag; integrase; protease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6801 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-18_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 9-9 (2005). XX DR Zebrafish.; chr10; Positions 4020218 4027018. XX CC Gypsy-18-I_DR is an internal portion of the Gypsy-18_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-18-LTR_DR. The genome contains one copy of Gypsy-18-I_DR CC that is flanked by identical LTRs. The internal portion CC contains two ORFs encoding Gypsy-like gag and pol proteins. CC The first ORF encodes the 612-aa Gypsy-18_DR1p gag (pos. 225- CC 2060). The second ORF encodes the 1578-aa Gypsy-18_DR2p pol CC protein (pos. 2021-6754) composed of the protease, reverse CC transcriptase, and integrase domains. This retrotransposon CC can be still active. XX FH Key Location/Qualifiers FT CDS 2021..6754 FT /product="Gypsy-18_DR2p" FT /translation="MGPPELSQLAAFKLKSVSITGRTEARKEKCPKKKEHL FT LHQRTYKSPCYQLPTGLVGTRCTTKIQIEGREVNCLLDTGSQVTTIPKSYY FT DQYLQQHPIKSLQHLLEVEGANGQAIPYLGYVELNLKFPQDFLGVEAEVPT FT LALIVPDLTTIPQILIGTNSLDVLYTYCTQGNKRILKSHCHGYQSVINILE FT KRRQQASGAAVGPVKLKGNQQEVVPAGSTVVLEGLVQVKGPLFEKWVSVEP FT ATTPFLPGGILVASCLHSLPSRQRIVQLPVVLRNDSHTDIVIPPKAVIAEA FT HAVQRVEKEFSKVDADSEEADSILPKVPVDFGDSPLPPEWKERITSMLNSM FT PDVFALHDLDYGHTDQVKHRIKLSDETPFKHRARPIHPQDVDAVRKHLQEL FT LEAGVIRESESPFSSPIVVVRKKNNSVRLCIDFRKLNSQTIKDAYALPNLE FT EAFSVLTGSKWFSVLDLKSGYYQIEMEESDKQKTAFVCPLGFWEFNRMPQG FT ITNAPSTFQRLMERCMGDLNRKEVLVFIDDLIVFSSTLEEHETRLKQVLKR FT LREFGLKLSPEKCKFCQTSVKYLGHIVSQHGVETDPAKVEALKTWPTPKNL FT KELRSFLGFSGYYRRFVQDFSKIVKPLNDLTAGYPPLQKENRKKMVESKKQ FT YFNPREPFGGRWTQECQHAFNTVIAKLTSAPVLGFANPKLPYVLHTDASTT FT GLGAALYQDQDGQMRVIAFASRGLTRSESKYPAHKLEFLALKWAVTTKFND FT YLYGAEFTVITDSNPLTYLLTSAKLDATSYRWLSSLSTFIFKIRYRAGSRN FT QDADGLSRRPQTEIPDDLETQKERERIRQFTHQHLTNEQSMLVSEEEVRAI FT CERHQIVQSGDDPDHLHPSVTLVESITVNVDALPQAFQQDDAYQITEIPQL FT SEENLRKRQREDPEIGIVVKQIKSKEKPCFKTILPELIPWFREWSRLELRN FT GVLFRRKQEHGDSSYQLALPADLRDMVLTELHDKMGHMGVERTLDLVRSRF FT FWPKMSAAVEKKIKTCERCVRRKSLPEKAAPLVNIRTNRPLELVCMDFLSL FT EPDQSNTKDILVMTDHFTKYAIAVPTRNQKAQTVAKCLWDNFLVHYGFPEK FT LHSDQGADFESRTIKELCKVAGISKTRTTPYHPRGNPVERFNRTLLQMLGT FT LNNKEKSRWKDYVKPLVHAYNCTKSDVTGFTPYELMFGRQPRLPIDLAFGL FT PVDGQTESHSKYVQDLKSRLEESYKVATENANKVAGRNKKRYDKHVVNSTI FT EVGDRVLVKNVRLKGKHKLADKWETEVYVVTKKAGDLPVYTVKPERKDGPL FT RTLHRDLLLPCGFLSASAPDTPVERKVMQKPRTRAQCAEESENLSEHSESE FT YDPVDYYDPERFTIDTRILPTSGLEPSSSSKHTVNLPVAKPGMKKGEERLV FT SITLVDEENLPDVETSRPAETDRHVLELPLESNLPDLSEIQVLDENIVDES FT EKEMAQIADQEKPDNCVGSSESSDRKDERGTNLTQREIGTSREFQETQREG FT TETESLRRSRRQGEPPHRLNYPQLGNPFCSVIQSLFSSLSTAITNSLEDSN FT PSRRYSPVENI" FT CDS 225..2060 FT /product="Gypsy-18_DR1p" FT /translation="MEIVKRENVNVQNAVIVSGLTLSDIDQDLEAYLIKYG FT SIKQNLVIDDPNSEHHHSAIVEFSHDSAMQNLKPSLPMSIKSTLDDNVVFH FT VRSLASVYTPAASSDVTEDYLMNLQAIAHDSGKTLQEVLKSELQKISMLTF FT PESRASEQVLEKEQPMYDSSMHMTEPENPAKVSTPSQSPHQSRIPDASVDE FT NMSLNIPTSALSPPGVQRVIMEHVIRTSDTVSAPHVSLRLKAFSGRIPRPS FT HEPDFDTWSATVDFLLNDKSLSDSQKTGKILDSLLPPASDVVKHIGPYALP FT SECLKLLESVYGSVEDGDELFAKFIGALQNQGEKPSSYLHRLHVMLSTAIR FT RGGVTEAERDRCLLKQFCRGCWDNSLIIDLQLERKRATPPPFAELVILIRT FT AEDKQSLKEERMKKHLGLNKQTSVPFKLRTATNQQFAYCSEMPDESHKVES FT CQLSAKQKSAKPSSKAENSEIENLKKEITKLQTQITMMKTEPGPKEKSSPS FT VDGISELRRDIAELQAHLVPRFQREYQEQSPTLRAFPAKHRPRLTGTEVTT FT TQRPSEQLYSRPRPGYCFHCGGDGHIAVNCENEPNPRKVEEKRRELRERQA FT KWDLQNSPNLQHLN" XX SQ Sequence 6801 BP; 2206 A; 1488 C; 1481 G; 1626 T; 0 other; aaaacttggc gagccagcca ggagcaaaaa ataaataaat aaaaaattaa ttacaaaaat 60 ttatcaatta taataataat aataaataat aataaataaa taagaataaa ttaaaaactg 120 tacaatattt atagcatagt gtattagttt taatttcttt tcatcaccac cacagttaat 180 attaattcaa aaactgaacc cccgctaact ttagaatatt agccatggaa atagtgaaac 240 gtgagaatgt taatgtacag aatgcagtca ttgtgagtgg cttaaccctc tctgatatcg 300 atcaagactt agaagcctac ctaatcaaat atggatctat caagcaaaac ttagtcattg 360 atgatcccaa ctcagagcat caccacagtg ccattgtaga attctctcat gactctgcca 420 tgcaaaatct gaagccctca ctgccaatgt ccatcaagag cactttggat gataatgttg 480 ttttccatgt gcgcagctta gcaagtgtgt acacccctgc agcaagtagt gatgtcactg 540 aagattattt gatgaacctg caggccattg cccacgacag cggaaaaacg ctgcaagaag 600 tacttaaaag tgagctgcaa aagatttcta tgttgacgtt tccagagagc agagcttcag 660 aacaggtact agagaaggaa cagcctatgt atgacagcag tatgcatatg acagaacctg 720 aaaaccctgc caaggtgagc acaccatcac agtcacctca tcaatcaaga atccctgatg 780 ctagtgtgga tgagaacatg tctttgaaca tacctaccag tgctttgagt ccaccaggag 840 ttcaacgagt aataatggag cacgtaataa gaaccagtga tactgtgtct gccccacatg 900 tttcccttcg cctcaaagct ttttcaggaa gaattcctcg tcccagtcat gaaccagact 960 ttgacacttg gagtgcgact gtggacttcc tattgaatga caagtctctc tctgattcgc 1020 aaaagacagg taaaatcctc gatagtctac tccctccagc ctctgatgtc gtcaagcaca 1080 tcggcccata tgccttgccc tcagagtgtt taaagctatt ggagtctgtt tatggctctg 1140 tggaagacgg cgacgaattg tttgcaaagt tcattggtgc cctgcaaaat caaggagaaa 1200 aaccatcctc ctaccttcac cgtcttcatg tgatgctgag cactgcaatt agacgaggtg 1260 gtgttactga agctgaaagg gaccgttgtc tactcaagca attttgcaga ggatgctggg 1320 acaacagctt aatcatcgat ctacagttag aaaggaagag agccacaccc cctcctttcg 1380 ctgaacttgt gatactcatc cgtactgcgg aagataaaca atctctcaaa gaagaaagga 1440 tgaagaagca ccttgggcta aacaaacaga cttctgtacc attcaagtta cggacagcga 1500 ctaaccaaca gtttgcctac tgcagtgaaa tgccagatga gtcccataaa gtagagtcat 1560 gtcagttgag tgcaaagcag aaatcggcca agccaagtag caaagctgaa aattctgaaa 1620 ttgaaaatct gaagaaagaa ataacaaaac ttcaaacaca gattaccatg atgaaaactg 1680 aacctggtcc caaagaaaag agcagcccca gtgtggatgg tatttctgag ttgagacggg 1740 acatagctga acttcaagct catcttgttc caagatttca aagggaatat caagagcagt 1800 ctcccacttt acgtgctttt cctgcaaaac acagacccag actgactgga actgaagtga 1860 ctacaactca aagacccagt gagcagttgt acagcagacc ccgtcctggc tattgttttc 1920 attgtggtgg cgatggacat atagctgtta attgtgagaa tgagccaaac cctcgcaaag 1980 ttgaagagaa acggcgtgag ttaagagagc gtcaggccaa atgggacctc cagaactctc 2040 ccaacttgca gcatttaaac tgaaatcagt ctctatcaca gggcggacag aggctagaaa 2100 agaaaaatgc cctaagaaga aagaacatct gcttcaccag cgtacttaca agtctccatg 2160 ttatcaatta cctactggct tagtaggaac aagatgcaca accaaaatcc aaattgaagg 2220 gagagaagtg aattgcctcc tggacacagg gtcgcaggtg acaactatcc ctaagtctta 2280 ctatgaccag tacttacagc agcaccccat aaaatcgttg cagcatcttc tagaagttga 2340 aggagctaat ggacaggcca taccctatct gggttatgtt gagcttaact tgaaatttcc 2400 acaagatttc cttggagtag aagcagaagt cccaacttta gctcttattg ttcctgacct 2460 gacaaccata ccacagattc taattggaac taattcactt gatgtgctat acacctactg 2520 cacacaagga aacaaacgta ttctcaagtc acattgtcat ggctatcaaa gtgtgataaa 2580 tatactggag aagagaagac agcaagcttc tggtgcagcg gtgggacctg taaaactgaa 2640 agggaaccag caagaggttg taccggcagg aagcaccgta gttctagaag gactagtcca 2700 agtgaaaggg ccccttttcg aaaagtgggt gtcagtggag ccagccacaa ctcccttttt 2760 acctggtggc attttagtgg caagctgcct acacagtctt ccctcgagac aacgcattgt 2820 gcagttacca gtagtactga ggaatgactc tcacactgac atagtgattc ccccaaaggc 2880 tgtgatagca gaggcccatg ctgttcaacg agtggagaaa gagttttcca aagtcgatgc 2940 tgacagtgag gaggctgact caattctgcc gaaggttcca gttgattttg gagattctcc 3000 tctaccacct gagtggaaag aaagaataac atcaatgtta aactccatgc cagatgtgtt 3060 cgcattacat gacctggact atggtcatac tgatcaagta aagcacagaa taaagcttag 3120 cgatgagacg ccatttaagc acagagctag gccaatacac cctcaagatg tggatgctgt 3180 cagaaagcat ttacaagagc ttcttgaagc tggagttatc agagagtcgg agtccccatt 3240 ctcatctcca atagtcgtag tgcgcaagaa aaacaactcg gtgcgattat gtattgactt 3300 caggaagtta aactcccaga cgattaagga tgcatacgct cttcctaact tagaagaagc 3360 cttttcagtc ttgaccggct caaaatggtt ttcagtcctt gacttgaagt caggctatta 3420 tcaaattgaa atggaggagt ctgataagca gaagaccgct tttgtgtgcc cattaggctt 3480 ctgggaattt aacagaatgc cgcaaggaat tacaaatgca ccgagcacgt ttcaacgatt 3540 aatggagcga tgtatgggag acctgaaccg caaagaagtc cttgtattta tcgacgatct 3600 gatagtcttc tcaagtactc tggaggagca tgagacacgg ctgaagcaag tactcaaaag 3660 gctaagggaa ttcggattga agctttcgcc agagaaatgt aaattctgcc aaacgtctgt 3720 caaatatttg ggccacatag tatcccaaca cggagttgaa actgacccag ccaaagtcga 3780 agcccttaaa acctggccga caccaaagaa tcttaaggag ctcagatctt tcctcggttt 3840 ctctggttat tatcggaggt ttgtgcaaga tttttcgaaa attgtgaaac ccctcaatga 3900 tctaactgct gggtaccctc cactccaaaa agagaatcga aagaaaatgg tggaatcgaa 3960 aaagcagtac ttcaatccga gggaaccatt tggaggacga tggactcaag aatgtcagca 4020 tgcatttaat actgtgatag caaaactaac atctgctcca gtattaggat ttgccaaccc 4080 caaactccct tatgtgttgc atacagacgc tagcaccact ggcctaggcg cagcactata 4140 ccaagaccaa gacgggcaga tgagagtcat cgcctttgca agtagaggac ttacccgcag 4200 tgagtcgaag tatcctgccc ataagttaga atttctggca ttgaaatggg ctgtgaccac 4260 aaagttcaat gactacctct atggagctga gttcacggtg ataactgaca gcaacccact 4320 gacttacctt ttaacctctg caaaattaga cgccactagc tacagatggt tgtcaagctt 4380 gtcaaccttc attttcaaga tacgttaccg agctgggagt agaaaccagg atgcggacgg 4440 actctctaga cggccacaga cagaaattcc agatgacttg gaaacacaga aagagagaga 4500 aagaatcaga cagttcacac accagcacct aacaaacgaa caatcaatgc ttgtttcaga 4560 ggaggaagtg agagctattt gtgagcgtca tcaaattgtt caatctggtg atgatcctga 4620 tcatttacac ccctcagtga ccttagtgga atccataact gtcaatgtgg atgccttacc 4680 tcaagcattt caacaggatg atgcttacca gatcacagag attcctcaac tgtctgaaga 4740 gaacctgaga aagagacaga gagaagatcc agaaattgga attgttgtca agcaaatcaa 4800 gtctaaagaa aagccctgct tcaagaccat tctacctgaa ttaattccat ggttcagaga 4860 atggagtcgt ctggagttga gaaatggtgt actctttagg agaaaacaag aacacggaga 4920 ctcgtcttac caactagcac tgccggcgga tcttcgtgat atggtcttga cagaactcca 4980 tgataaaatg ggtcacatgg gtgttgagcg aactcttgac ctcgttcgat caagattctt 5040 ttggcccaag atgtctgcag ctgtagaaaa gaagatcaaa acttgtgaac gctgtgtcag 5100 acggaaaagt ctacctgaga aagctgctcc gttggtgaac atcagaacca acagacccct 5160 tgagctcgtt tgtatggact tcctgtcgtt agagccggac caaagcaaca ccaaggatat 5220 attggtgatg actgaccact tcaccaaata tgccattgct gtaccaacac gaaaccagaa 5280 agcccaaact gtagccaagt gtttatggga taattttcta gtgcactatg gattccccga 5340 aaaactccat agcgatcagg gtgcagattt tgagtctcgc acaattaaag agctgtgcaa 5400 agtggctggg atatccaaaa ctagaaccac cccatatcat ccaagaggca atccagttga 5460 gcgcttcaac cgaacacttc tacagatgct gggaaccttg aacaataagg aaaagtcaag 5520 atggaaggat tatgtcaaac ctctagtgca tgcatacaac tgcacaaaga gtgatgtgac 5580 aggatttact ccatatgaac taatgtttgg gagacaacca cgactaccaa ttgaccttgc 5640 atttgggtta cctgtggatg gacagacaga atcccactcc aagtatgtcc aagaccttaa 5700 gagcagacta gaagagagct acaaagttgc aacggaaaat gccaacaaag tagcgggacg 5760 aaacaaaaag agatatgaca aacatgttgt taactctact attgaggtcg gtgatcgtgt 5820 gttggtaaag aatgtccgtc tgaaaggcaa acacaaactt gcagataagt gggaaacaga 5880 ggtttatgtg gtaacaaaga aagcaggtga cctaccagtt tacacagtta aacccgaaag 5940 aaaggatgga cctctgagaa ctttgcacag ggacctgttg ttgccttgtg gattcctatc 6000 agcaagtgca ccagacacac ctgttgaaag aaaagtgatg cagaaaccaa gaaccagagc 6060 tcaatgtgct gaggagtctg agaacttgag tgaacattcc gagtcagaat atgatccagt 6120 agactattat gatcctgaga gattcacaat tgacaccaga attttgccta catctggact 6180 tgaaccatct tccagtagta aacatacagt taacttacct gttgctaaac ctggaatgaa 6240 aaagggtgaa gaacgtcttg tatcgataac actcgttgat gaagaaaact tacctgatgt 6300 ggaaacttcc cgcccagcgg aaactgatcg tcatgtatta gaactcccac tagagagcaa 6360 cctgccagat ctgtcagaaa tccaagtcct tgacgagaac attgttgatg aatctgaaaa 6420 agaaatggca cagattgcag atcaggagaa accagacaat tgtgttggat cctctgaaag 6480 ctcagacagg aaagatgaaa gaggtactaa tctcactcaa agagaaattg gcacttccag 6540 agaatttcag gagacccaaa gggaaggaac tgaaactgaa agtctcagac gatcaagaag 6600 acaaggcgaa cccccacaca gactaaatta cccacagcta ggtaatcctt tctgttcagt 6660 tattcagtcc ctgttctcca gtctcagtac tgcaattact aactctttgg aggattcaaa 6720 tccctcaaga agatactctc ccgttgaaaa catctgaatt caagcatgca gagggacctg 6780 catatattca ggaggggaga a 6801 // ID Gypsy59-I_DR repbase; DNA; ZEB; 5559 BP. XX AC chr15; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy59-I_DR; KW Gypsy59-LTR_DR; Gypsy59_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5559 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1433-1433 (2008). XX DR Genome; chr15; Positions 13186185 13180627. XX CC Positions [2393-2896] - Reverse transcriptase CC Positions [4175-4651] - Integrase core CC 'CTTG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 2120..5482 FT /product="Gypsy59-I_DR_1p" FT /translation="MIQSHSVVGSPSFFQQIESLDLSVLSPPEQGQVRALL FT QKYQAVFSDHESDLGCTSLISHDIPLLDDAPVRQRYRRIPPSEYEVVKTHI FT NQLLEAGIVRESCSPYASPIVLVKKKDGGLRMCVDYRHLNAKTRRDAFPLP FT RIEESLDSLSGARWFSTLDLASGYLQVPVSEQDKPKTTFCTPFGLFEWNRM FT PFGLCNAPSTFQRLMERLFGDQRCQSLLLYLDDVVVFSSSIDQHLERLEMV FT LGRLEKEGLKAKLEKCAFFRREVKYLGHVISAQGVATDPGKIEAVTHWRCP FT GTVSELRTFLGFASYYRRFLEGFAKLAAPLHRLVAEVGGSKSGKHRKQSLT FT HVWTEQCQMAFEALKRKLTSAPILAYADFSKPFILEVDASYGGLGAVLSQD FT FEGKVRPIAYASRGLRPSERNMTNYSSMKLEFLALKWAMVEKFREYLLGHK FT CIVFTDNNPLSHLATAKLGATEQRWAAELAAFDFEVKYRAGKSNGNADALS FT RQNLVDKSVVGELAVSSKIPEALQQAISKGCPVQADMRTISTLPGYSVPDL FT RCMQEADPVIGPVIRFWEQGQRPGLEERKLLTKPVLTLLQQWNRFEEREGV FT LYRKIFRPDGGEVCHQLVLPDVLKEEVWIQLHQQHGHQGVERTSELIRQRC FT YWPNMLADVTRWCHECERCQLAKDTGPVARAFMGHLLASKPNEILAIDFTM FT LEPSYCGKENVLVMTDVFSKFSMAVSTRDQRAETVAKVLVEEWFYKYGVPG FT RIHSDQGRNFESTLIRQLCELYQVKKSRTTPYHPAGNGQCERFNRTLHNLL FT RTLPDSRKRDWVSCLSQVIFCYNTTPNQVTGESPHYLMFGQEPRLPVDFLL FT GQVPEPTVGSVHHWVLEHRRRLQVAFEGAQGRLKAAAARRKNLHDQHVGDQ FT PLVEGQLVYLRDHAVRGRHKIKDLWSSVVYRVVHSAEEGGAVYAIAPVTSP FT GKMRRVHRSQLKACVPRGKVCSDQPKGVVEIQEDPQEDAQELDLEAVDLAV FT MVPVVPQATQGVGPRDGSHPMQSRVQEDLPGDLSPVRAIVVEPQGPGVEPV FT AREREVLTEPPNDSVVPPRRSARVGAGQHTNLHHLPRGVGEVRSGPMNHEV FT VSNNISVLFRPWS" XX SQ Sequence 5559 BP; 1303 A; 1203 C; 1564 G; 1489 T; 0 other; tcttggcgta gtcggcaggg tccaccataa tttagcgtca gagacattgg tgtttttttt 60 tcttttcttt cagttttttt ttcagcttct ttttgggaaa caggacggca gtttgtggtc 120 ttttgtagcc tcgtttagtg gttccctcct tcctgggtaa gtgtattttc ctctccttat 180 aataatggat gaagaactac aagaattgcg agatttggtt gttcaattaa gagctgataa 240 cgcgcggctg caacaggctc aggcaccggt tgggttgcct gtccctgatg ctgccctccc 300 tagcacctct gcggttccgc ctgttacgcc gccgccatct ggcgttacac cagagcggtt 360 tgtttttgtg ccacgagatc ggcggtgccc gaagtttaat ggtaaatcgg gttagactat 420 tgacgaatgg gttgaggagg cacaggcatg catgaggttg cgctatatgt cagtagctga 480 taaagctttc tttttaattg atcatttaga aggagaggcc agagaggaga tcagatatcg 540 ttctgattta gaacgtgggg accccaaccg aattattcag gcattgcgtg acgtctatgg 600 ttgctctcag tcatatgtag ccttgcaaga agccttcttc tcaaggaagc aactcgaggg 660 tgagactctt ttagagtttt ctcttgcact catgggctta ttagaaaggg ttaaaaatag 720 agcgccacac gttttcccca atgcagaaat tttgttgcgt gaccaatttg tggagcatgt 780 cacagatagt gccctgagac gcgagttgaa acagatggtt cgtcgccagc ctactgccac 840 cttattagag gtccgtagtg aggcaattag atgggaacaa gaggggatgc ccgggggagc 900 aaggggacgt agtcagtcgg tcccgttaac ttatggtatc caatatgggg tacgcagtga 960 aacacctttg ggggtagtta gatcatcaca acagtcagag ttgggtgaac taagggaaat 1020 gctgcggcaa caacaacagc agttaaacca gttaactcaa acatttgcac gatttcaaac 1080 gtcacatgct cagagtaatt cctctcgatc aggtccaatt gtttgtaggc ggtgtcaaca 1140 gccaggtcac tttgcccgtg attgtagggg gcaaaggagt ttcccacgcc ctcctcaggc 1200 tgcatctggt tttgttatag ctgagaatag acgacagcgc tcgccccaac cgtcgggaaa 1260 ctagccccca ccaggctgcg gagtcacagc ctgggtgggg aaggtattga ctcaagtaag 1320 attcttgcac accactcagt ttctaaattg atttctcctt gcccacatgt gaccgctctc 1380 atagggggtg ttgcagtgcc ctgtttggtt gacaccggct cgatggtgtc aacaattaca 1440 gagggttgtt ttcggcaaca ttttgagcca tggggcctgg accgccttca aagttgtcag 1500 tggttgcagt tacgagcagc taatgggctc actattccat atgtgggcta ctttgagtta 1560 gatgtaaaac tttgtggcaa gttggttact gggtgcggcc ttctggtggt tcgggaccct 1620 ccagatggta tgagtctgga agttcctggg gtactgggga tgaacatcct ctcccggtgc 1680 taccaggaac tgtttggaca gcatggtaat gctctgttta acctacctgt actaacacaa 1740 ttgcctagtg tgtcgagcgc cttacagtac tgtcatcaag taagtgctag gaaaccttct 1800 gatcatgtgg ggtgcgtgcg ggttcgtggg cgtagagtgt gtcgcatccc gggtggcaca 1860 atgaagttag tggctgcaac ctgttcggcc cactattttg gccatgtcgt tttgtttgag 1920 cccccagagt ctggtcttcc tgctggactt ttggcatcac cagctttggt aagcgtaaat 1980 tggggaacag tctatgtacc aaccgtaatt gttggtgcag tggaggtagt gcttcgtcct 2040 gcaacaatga tgggtaattt gaaaggtcta tatagttagt ctgccagctg gggttagtga 2100 agaaaaacca gtggctgcta tgattcaatc tcatagtgtg gttgggagtc cttctttttt 2160 tcaacagatt gagtcacttg atctatctgt tctgtctcca cctgagcagg gtcaggttcg 2220 ggctcttctg cagaagtacc aagctgtgtt ttcggatcat gagagtgatc tgggttgtac 2280 cagtcttatc tcccatgata taccgctgtt ggatgatgct ccagtgcgcc aacgctatcg 2340 gcgaatacca ccatctgagt atgaggtagt aaaaacccat attaatcagt tgttagaggc 2400 tggaatagtt agggagagtt gcagtcctta tgcgtccccg atagtcttgg taaagaagaa 2460 ggacggtggc ctgcgcatgt gtgtagacta ccgtcatttg aatgccaaga ctaggaggga 2520 tgcattccct ctgcctcgca ttgaggagtc cttggactcc ttatctggtg ctcgctggtt 2580 ctctacgttg gatttggcca gcgggtattt gcaagttcca gtttctgagc aggataagcc 2640 gaaaactacg ttctgtacac cgtttggatt gtttgaatgg aaccgtatgc cctttggcct 2700 ttgtaatgct ccaagtacat tccaaaggtt gatggagcgg ttattcgggg accagcggtg 2760 tcagtccctg cttttgtatc ttgatgatgt tgtggtgttt tcttcctcca tagaccaaca 2820 tttggagagg ctggagatgg tactaggtcg tctagagaag gaagggctca aggccaagct 2880 ggagaaatgt gctttcttcc ggcgtgaagt aaagtacttg ggtcacgtta tatctgctca 2940 gggggtagca actgatccag gtaaaataga ggcggtcacg cattggcgct gcccggggac 3000 cgtgtccgag ctacgaacat ttcttggctt tgcgagctat tataggcgtt ttttggaagg 3060 gtttgctaag ctggccgccc ccctccaccg attagtggca gaagttggag ggagtaagtc 3120 aggtaagcat agaaaacaga gtttgactca tgtatggacc gagcagtgtc agatggcttt 3180 tgaggcccta aagaggaagt taacatccgc cccaatatta gcttatgcag acttctcaaa 3240 gcctttcatc ctggaggtag atgccagtta tggaggtttg ggggcagttt tgtcccaaga 3300 cttcgaaggg aaggttcgac ccatcgccta tgccagtcgt ggtctcaggc cctcagagcg 3360 aaatatgact aattatagct caatgaaact agagtttctc gcccttaaat gggctatggt 3420 tgaaaagttt cgtgagtatc ttttggggca caagtgtatt gtgtttacag acaataatcc 3480 tctgagtcac ttggctacag caaaacttgg agccacagag caacgttggg ccgctgagtt 3540 agccgcattc gattttgagg tcaagtacag ggcaggaaag agtaatggaa atgctgacgc 3600 gctctcccgc cagaatttgg ttgacaagag tgtagttggg gagttagcag ttagctctaa 3660 gattcccgaa gccttgcagc aggccattag taagggctgt ccggtccaag cagatatgcg 3720 gactataagc acattgccgg gctactctgt ccctgacctt cgctgtatgc aagaggctga 3780 tcctgttatt ggaccagtaa taaggttctg ggagcagggc caacgtccag gattagagga 3840 gcggaaactt ctaaccaaac cggtcttgac cctactccag cagtggaata ggtttgagga 3900 gagggaagga gttttgtacc gcaagatatt ccgtcctgat ggaggggaag tatgtcatca 3960 gcttgtgtta cctgatgttt tgaaggaaga ggtctggata cagctgcacc aacagcatgg 4020 ccatcaagga gtagaacgca ccagtgagtt aatccgacag cgttgttatt ggcctaatat 4080 gttggctgat gttacccggt ggtgccacga gtgtgaacgc tgtcaacttg ccaaagacac 4140 tgggccggtg gcacgggcat tcatgggcca cctgctggcc tcaaagccta atgaaattct 4200 tgcaatcgac tttaccatgc tcgagccttc atattgtggg aaggagaacg tgttagtgat 4260 gaccgatgtg tttagtaagt tctctatggc agtgtccact cgtgaccaaa gagcagagac 4320 tgtggccaag gttctagttg aggaatggtt ttacaagtat ggggttcctg gacgtatcca 4380 ttcagatcag ggccgcaact ttgaatccac tcttatacgg cagctttgtg agctgtacca 4440 ggtaaagaaa tcccgtacca ccccatacca tccagctggc aatgggcaat gtgagagatt 4500 caatcgcaca ttacataact tgttgcgaac cttgccagac tctagaaaaa gggactgggt 4560 gtcttgtctg tcacaggtga tattttgcta taacaccacc cctaaccaag tgacaggaga 4620 gtctccacat tacctcatgt tcggtcaaga gccccggctg ccagtagatt tcttgctggg 4680 ccaggtacca gaacccactg taggtagtgt acatcactgg gtcttggaac atcgtaggcg 4740 gttgcaggtt gcctttgagg gagcacaggg acgattgaag gctgcagcag ccaggcggaa 4800 aaacctgcat gaccagcatg ttggagatca acccctggtg gaaggtcagt tagtttatct 4860 aagggaccac gcagtaaggg gacgccacaa gattaaagac ttgtggagtt ctgttgtcta 4920 tcgggtcgtc cactccgctg aggaaggagg ggcggtatat gccattgcac cagttacaag 4980 cccaggtaag atgaggcgag tacatcggtc acagttgaag gcctgtgttc cgagaggaaa 5040 ggtctgtagt gaccagccga aaggtgttgt ggagattcaa gaggaccctc aggaggatgc 5100 tcaggagttg gatttagagg ctgtcgattt ggcggtgatg gtgccagtgg tccctcaggc 5160 cactcaaggg gtgggacctc gagatggctc gcacccgatg cagtctaggg ttcaggaaga 5220 tttgcccggg gacctctctc ctgtaagggc cattgtggtt gaaccacaag gaccaggggt 5280 agaacctgta gcaagggaaa gggaagtact taccgagcct cctaacgaca gtgtggtccc 5340 tcctcgtcgt tcagcccgag ttggggcagg acagcatacg aacttgcatc atcttccccg 5400 aggtgtagga gaggtgagaa gcggacctat gaatcatgaa gttgtgtcta ataacatttc 5460 ggtgttgttt aggccttgga gttagtatgg tgggctagtg tttaatttgc tgatttctgg 5520 tccatcgtcg gggcgccgat gcaaaaggtg ggggtagat 5559 // ID HATN4_DR repbase; DNA; ZEB; 192 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE HATN4_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HATN4_DR; nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-192 RA Kapitonov V.V.; RT "HATN4_DR, ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(5), 24-24 (2002). XX DR [1] (Consensus) XX CC ~1000 copies of HATN4_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~9% divergent from the consensus sequence. CC This element is characterized by 75-bp terminal inverted CC repeats and the 8-bp target site duplications. CC Putative classification - nonautonomous DNA transposon from the CC hAT superfamily. The zebrafish genome harbors several families CC of hAT-like elements closely related to HATN4_DR, including CC previously unclassified TDR3 and TDR19. XX SQ Sequence 192 BP; 52 A; 48 C; 47 G; 45 T; 0 other; caggggtgct caaccctgtt cctggagatc taccttcctg cagagttcag ctccaaccct 60 gatcaaacac acctgaacca attaattagg acctgaacag cacttgataa ttacaggcag 120 gtgtgtttga tatgggttgc aactgaaatc tgcaggaagg tagatctcca ggaacagggt 180 tgggcacccc tg 192 // ID Gypsy49-LTR_DR repbase; DNA; ZEB; 360 BP. XX AC . XX DT 25-SEP-2007 (Rel. 12.09, Created) DT 25-SEP-2007 (Rel. 12.09, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy48_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; LTR; Gypsy49-I_DR; KW Gypsy49-LTR_DR; Gypsy49_DR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-360 RA Dib M.R. and Naveira H.F.; RT "Gypsy49_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 828-828 (2007). XX DR [1] (Consensus) XX CC Gypsy49-LTR_DR is a long terminal repeat of the Gypsy49_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy49-I_DR. XX SQ Sequence 360 BP; 72 A; 67 C; 69 G; 152 T; 0 other; tgttacgacc ctgtcttctc tttcctacct ttcggtggcc tcgccatgtg acgtcaccgt 60 taattagtta atttatgggt ggagaaagta tatatttgaa ttttttgtga agaacggcac 120 ttcctctggt ggaccgttat tccacagaga ggtcattttg ctccaactct tttgtgttgc 180 cgtcttatgt tttgttttgt attagttgag agaatttcac ctgctttctt atttatttca 240 aatccctaga cagtttgtta tactttatta ttccctttaa taaatgtttt ttttcattat 300 atatcttttt gtgtgtctga tcctttttat ggtacggtcg ggagcgagct ggccgtaaca 360 // ID DNA-TTAA-1_DR repbase; DNA; ZEB; 1351 BP. XX AC . XX DT 09-JAN-2009 (Rel. 14.01, Created) DT 09-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE non-autonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; target TTAA; KW TSD 4-bp; DNA-TTAA-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1351 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 11-11 (2009). XX DR [1] (Consensus) XX CC The TSD is TTAA specific, and TIR is 440-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1351 BP; 400 A; 278 C; 276 G; 397 T; 0 other; aggggaaatg aagcactcag tcaagtttac attattttgt acattggatt ccactttaat 60 gaaaatatat taaacaaact cgattaatgt aaccatttag cagaaaacgg catgttttac 120 tatagatttt agacctaggg cgccgccatc ttggaatgtc gctgtgtctg acgtcacggg 180 gttggttctg ttccctcagc tgaacagata cagtggaagt aaaggactga acacggagac 240 tgtaaagcct aatttaaccc aatgcgtgaa gttgtggacg tggagctgct gcaaatacaa 300 gcatcagatg tgtgtctaat ataaaagtat atatatttat tctatttttc tttttttccc 360 cttttaatta ccgatcattt gatgcgcttt ggtccactct ctgtattacg ctgcctgact 420 gcctgtctgg gtttcaacca tggtagtaac ggactgaaca cagagactgg acagcctaat 480 ttaacccaat gcatgaagtt gtggacgtgc atcacagagc tggtgcaaat acaacaaata 540 caagcattta agtgtcttaa gttgtgtatt tttcttttgt taataccttt cgtttgatgc 600 gctgtggtcc actctctcac gctgctccag cgctgcctga cgaggggatt tacattcaga 660 ctaatgcaac gattaaaatg atacatgact tcacgcttta ctaagtcacg tctgtgtgga 720 ttatgtcaag acatattccc ttttcaagtc gatcaactcg atctctcaac atgtatgaag 780 gatgtaaaaa acctgccatg tgaaaaaccg cgccgatctt agtttggttt gaacacgaaa 840 agaggtgagc ggctgtagta gtgaaaagtg aaagtacccg cccccattac gtcagcaccg 900 gaccttgttg aaaccatggt tgaaacccag acaggcagtc aggcagcata atacagagag 960 tggaccaaag cgcatcaaat gatcggtaat taaaagggaa aaagaaaaat agaataaata 1020 tatattttta tattagacac acatctgatg cttgtatttg caccagctcc acgtccacaa 1080 cttcacgcat tgggttaaat taggctttac agtctccgtg ttcagtcctt tacttccact 1140 gtatctgttc agctgaggga acggaaccaa ccccgtgacg tcagacacag cgatattcca 1200 agatggcggc gccctaggtc taaaatctat agtaaaacat gccgttttct gctaaatggt 1260 tacattaatc gagtttgttt aatatatttt cattaaagtg gaatccaatg tacaaaataa 1320 tgtaaacttg actgagtgct tcatttcccc t 1351 // ID CATCH3I_DR repbase; DNA; ZEB; 6373 BP. XX AC AL590152; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE Internal sequence of the CATCH3_DR endogenous retrovirus. XX KW BEL; LTR Retrotransposon; Transposable Element; KW BELL/CATCH superfamily; CATCH3I_DR; CATCH3LTR_DR; CATCH3_DR; KW CATCH3_DRp polyprotein; endogenous retrovirus; endonuclease; gag; KW protease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6373 RA Kapitonov V.V. and Jurka J.; RT "CATCH3, an endogenous zebrafish retrovirus from the BEL/CATCH RT superfamily."; RL Repbase Reports 2(5), 4-4 (2002). XX DR Genbank; AL590152; Positions 46181 52553. XX CC CATCH3I_DR is an internal portion of the CATCH3_DR endogenous CC retrovirus, which belongs to the BEL/CATCH superfamily. CC It is flanked by 99% identical long terminal repeats, deposited CC in CC Repbase as CATCH3LTR_DR. The 3' end of CATCH3LTR_DR and 5' end of CC CATCH3I_DR are not precisely defined. CC CATCH3I_DR encodes CATCH3_DRp, a 2007-aa polyprotein (positions CC 209-6229). CC CATCH3_DRp is composed of gag-like, protease, reverse CC transcriptase, CC endonuclease domains similar to other retroviruses from the CC BEL/CATCH superfamily. XX FH Key Location/Qualifiers FT CDS 209..6229 FT /product="CATCH3_DRp" FT /translation="MSLMNCDDDGATTCEQTETIVPFEKFETEHVREKRLI FT KLTPKALLEKLETLQKTRKCKLSKAKNLMAIIKDFMSNREYEKEVQCSFEK FT FIKLRDETKEMHNSVMVLLPSEEKEKQQTWFNGKMLICDEFVDDVEKWLSA FT ESQVSCSGVDARVCQEDVNPDDSISNISSCKSSKSSSKKSSSSVRSGRSSA FT ASARIMAEAEKAALMARAAALKEKHTLEVQQEKLRHRQEQLDIDAEIAAAT FT AKIAVLNNQSLKTCTDDMNAYCDKGTVTLNPHAEAYVPIEVPLQESSRNVI FT QQIHSVPQIDPHFTSQMQSSSVMQNSDGIITSNLCSILQKQNEITTLLVQQ FT QQSSTLPQRDIPVFDGNPLQYKTFMRAFEHGIEDKTKNSRDCLYFLEQYTQ FT GQPREIVRSCQHMDAQRGYVQAKALLKEHYGNEFKIASAYVEKVLAWPSIK FT SEDVNSLQSYALFLRGCCNVMEEIDYMEELEMPSNLKTIIMKLPYKLRENW FT RTVACELMERRKQRARFKDIVIFVEHQVKMSSDPLFGDIQSTQPVQQKFQP FT KTKVRNSFATTVVTNNKSIQCCETDLRSKAPVVCLCCKRDHLLEQCTQMQN FT KLHKDKLNFLKEKGVCFGCFSFGHISKRCDKRLTCTVCGQKHPSILHIQQK FT ERVCTQEQSHSSLSSAHVALTTCGQTGAGDDGDCALSIVPVQVKSSKGQTV FT IQTYAFLDPGSSTTFCSERLMHSLKLKGKETNIVLKTMGNQRAVNSTVLTG FT LEVSNLLDDTFYSLPEVFTQERMSVSRDNIVTQKDLENWPYLKGVEIPQIP FT ADVDLLIGANASRVMEPWEVINSNGDGPYAVRTLLGWVVNGPLQGSRNKVK FT CSGVTVNRISVRKLEDMLGKQYNHDFNENSVEKKEMSREEHAFMEKVSKSI FT EFQDGHYKLNLPFRMEIPMLPNNLCVAKQRLIGLKRKFERNKIFHQEYKDF FT MDEVIKHGYAEIVPLHQLKQDEGKVWYIPHHGVYHPKKQTLRVVFDCGAAF FT KGTSLNDQLLQGPNLTNSLLGVLVRFRQEPIAFMADVKAMFHQVKVAEEDT FT DFLRFLWWPDGDINQDSVEYRMKVHLFGAVSSPSCACYALRRTAEDNKDDF FT PEKVIDTIHTNFYMDDCSKSVPSEEEAVLMIKDLTDVCLAGGFQLIKWISN FT SRIVLQSVPEEHRAQGIKMMDLDKDQLPMERALGLWWCVESDTFNFKITSK FT DHPCTRRGVLSKVCSVYDPLGFLAPFTLTAKILLQELCRIRCGWDDLLPSN FT LQRQWSLWLKELEKMADFKMDRCLKPAGMDELTNVQLHHFADASTLGYGTV FT TYLRMETKQDVHVTFLLSKARVAPLKSVTIPRLELTAAVLAARVDKMLRAE FT MQFPLVDSVFWTDSTSVLKYIKNEDKRFLTFVANRISAIREITTPLQWRYI FT PTTQNPADCCSRGLKADQLLTNREWINGPSYLWKPVEEWPAQIFDSALKAD FT DPEVKRNICVNSTLVSEQSNATHRLITHFSDWTKLKVAVAWLLKLKTVLKQ FT RSQKRKEIQASLDSSGVTSKKTEKELRKLTGSVVTPNLSLQDVNEAEVAIV FT AFSQRQQFKKEIDALSVSPVSKISRDSKLYKLDPVYQDGILRVGGRLRNSA FT MPEERKHPIILAKDQHIATLILRHIHQQLGHSGRNYMLSELRKRYWIIKGN FT AAARRIISSCGHCRRFGVKVGEQKMADLPKERLLPDLPPFSNVGVDYFGPV FT EVKRGRSIVKRYGVIFTCMASRAVHLEVAYSLDTDSCINTLRRFICRRGQV FT TQIRSDNGTNFVAAEKELKESLRSWNQNKIQRAMLQKGVQWSFNPPGGSHY FT GGIWERVIRMVKIILNSILHQQILDDEGFHTVLCEIEAILNDRPITKLSDD FT PNDLESLTPNHLLQMKGKPVLPPGLFEKSELYSKRRWKQVQYMCDLFWKRW FT TNEYLPLLQDRQKWNKERRNFVPGDIVVIVDSSAPRGSWLMGRILEVFPDK FT NGLVRSVRVQTKTSVLERPVTKLCLLYDNMEI" XX SQ Sequence 6373 BP; 2062 A; 1093 C; 1459 G; 1759 T; 0 other; tttgtcaaaa aatggcttct ttgtggatca tcatgcatca tttttggact ttcttgagga 60 gcagccgtga tgaactgaat gaatgaaaca aaggtatgtg tagcgctact taatgaatga 120 ctttgtgtgc acacaatagc gtaaatccaa tgcattggtt gcttaagttt taaagttgct 180 taggaggcaa acgatttcag ctgtgaatat gagtttaatg aattgtgatg atgatggtgc 240 gactacctgt gaacagacag agactattgt gccatttgag aagtttgaaa cagagcatgt 300 gagggaaaag cggctcatta agcttactcc taaggcttta ttggaaaaat tggaaacttt 360 gcaaaagaca cgaaagtgta aactaagcaa ggcaaagaac ttaatggcta ttataaagga 420 ttttatgtcc aaccgagaat atgaaaagga ggtgcagtgt tcctttgaga aattcatcaa 480 attgagagac gaaacaaagg aaatgcacaa ttcggtgatg gttcttttgc ccagtgaaga 540 aaaggaaaaa caacaaacat ggttcaatgg aaagatgttg atttgcgatg agtttgtaga 600 tgatgttgag aaatggttga gcgctgaaag tcaagtttca tgttctggtg ttgatgcaag 660 ggtttgtcaa gaggatgtta atccagatga tagcatttcc aacatatcct catgtaaatc 720 cagtaaatcg tccagtaaga aaagttcaag cagtgtacgt agtgggagat cttctgctgc 780 ttcagcacga atcatggcag aggcagagaa agctgcatta atggcgcgtg cagctgcttt 840 aaaggaaaag catactttgg aggtacaaca agaaaagctg agacatagac aagaacagct 900 ggatattgat gcagagattg cagccgctac tgcaaaaatt gctgttttaa ataatcagtc 960 tctcaaaacc tgcactgatg atatgaatgc atattgtgat aaaggaactg ttactctcaa 1020 cccgcatgca gaggcatatg tgccaataga agtccccctg caagaaagct caaggaacgt 1080 gattcagcaa attcacagtg tgccacaaat tgatcctcat ttcacgtcac aaatgcaatc 1140 ctcaagtgta atgcagaaca gtgatggaat aataacttcg aacctttgta gtatcttgca 1200 aaaacaaaat gagattacca ccctgcttgt acaacaacaa caatccagca cactgcctca 1260 aagagatatt cctgtgtttg atggcaatcc tctgcaatat aaaactttta tgagagcatt 1320 tgagcatgga atagaagaca aaaccaaaaa tagtagagat tgtttatact ttctggaaca 1380 atatacacaa ggtcagccaa gggagattgt gcgcagttgt cagcatatgg atgcacaaag 1440 agggtatgtg caagctaaag cattgttaaa ggaacattac ggcaatgaat tcaaaatagc 1500 ttctgcatat gtagaaaagg ttttggcatg gccttcaata aaatctgagg atgtaaattc 1560 attgcagagt tatgctctct ttttgcgtgg atgctgtaat gtaatggaag aaattgacta 1620 tatggaagag ctagagatgc caagcaattt gaaaaccatc atcatgaagt tgccttacaa 1680 attaagggaa aattggagaa ctgttgcatg tgagttaatg gaaagacgca agcagagagc 1740 acggtttaaa gatattgtga tatttgttga acatcaagtc aaaatgtcat cagatccctt 1800 gtttggtgat atccagagta cccagcctgt gcaacagaaa ttccaaccta agactaaagt 1860 tagaaacagc tttgcaacaa cggttgtcac caacaataaa tcaattcaat gctgtgagac 1920 tgatttaaga tcaaaggccc cagttgtatg tttgtgctgc aaacgtgatc acttattgga 1980 gcagtgtaca caaatgcaaa acaagctgca taaagacaaa ttgaactttc taaaggaaaa 2040 aggtgtttgt tttggctgtt tttcttttgg acacataagt aaaagatgtg ataagcgctt 2100 gacttgtaca gtgtgtggtc aaaagcatcc cagcattttg catatccaac aaaaggaaag 2160 agtgtgcact caagaacaaa gtcattcatc tttaagtagt gcacatgttg cccttacaac 2220 atgtgggcaa acaggggccg gtgacgatgg agactgtgca ttatcaattg taccagttca 2280 agtcaaatcc agcaagggac aaacggtaat tcaaacctat gcatttttgg atcctggcag 2340 ttctaccaca ttttgttcag aacgcttaat gcatagtttg aaattaaagg gaaaggaaac 2400 taatattgtt ttgaaaacaa tgggcaatca aagggctgtt aatagcactg ttttgactgg 2460 attggaagtg tctaaccttc ttgatgacac cttttatagt cttccagaag tcttcactca 2520 agaaagaatg tctgtatcca gagacaacat tgtcactcaa aaggatttgg aaaattggcc 2580 atatctgaaa ggtgttgaga ttcctcaaat tcctgctgat gtggatttgc tgattggagc 2640 aaatgcttcc agggtaatgg agccctggga ggttataaac agcaatggag atggacctta 2700 tgcagttaga accctattgg gatgggttgt aaatggacca ctccaaggaa gcaggaataa 2760 ggtcaaatgt tcaggtgtga ctgttaacag gatttctgta agaaaattgg aggatatgct 2820 tggtaaacaa tacaatcatg actttaatga gaattctgtt gaaaagaagg aaatgtcaag 2880 agaagagcat gcctttatgg aaaaggtaag caagtcaatt gagtttcaag atggacatta 2940 taaattgaat cttcctttta ggatggaaat tcccatgttg ccaaacaacc tttgtgtggc 3000 taaacagcgc ttaattggat tgaaaaggaa atttgaaaga aacaagattt tccatcaaga 3060 atataaagat ttcatggatg aggtaattaa acacggttat gccgaaatag taccgttaca 3120 tcaattaaaa caggatgaag ggaaagtttg gtatatccct caccatgggg tttatcaccc 3180 caagaagcaa acattaagag tggtgtttga ttgtggagct gcctttaaag gtacatcatt 3240 gaatgatcag cttctgcagg gtcctaatct gacaaattca ctgttgggag ttcttgtgag 3300 gtttcgacaa gagcctattg catttatggc tgatgtaaaa gcaatgtttc accaggttaa 3360 agttgcagaa gaagatactg actttcttcg tttcctatgg tggccagatg gtgatatcaa 3420 tcaggactct gtggaataca ggatgaaagt gcatttgttt ggtgctgtgt cgtccccaag 3480 ttgtgcttgc tatgctttac gtagaactgc tgaggacaac aaagatgact ttcctgaaaa 3540 ggtaattgac acaattcaca caaacttcta catggatgat tgttcgaaaa gtgtaccttc 3600 tgaggaagag gccgtcttga tgatcaagga tctcacagat gtctgcttag caggaggttt 3660 ccagttaata aaatggatca gcaatagccg tattgtgcta caaagtgtac ctgaagaaca 3720 ccgagcccag ggcataaaaa tgatggatct ggacaaggac cagttgccaa tggaaagagc 3780 tcttggtttg tggtggtgtg ttgaatccga cactttcaat tttaaaataa ctagcaaaga 3840 tcatccttgc accaggcggg gtgtcctgtc aaaggtatgc tctgtctatg atcctttagg 3900 atttttggct ccattcactc ttacagcaaa aattctcctg caagagttat gtaggataag 3960 atgtggatgg gatgatctgc taccctcaaa tctccaaagg cagtggtctt tatggttaaa 4020 agagcttgag aagatggcag atttcaagat ggacagatgc ctaaaaccag caggaatgga 4080 tgaacttacc aacgttcagt tgcatcattt tgctgacgca agtacacttg gatatggcac 4140 tgtgacgtac cttaggatgg agacaaaaca agatgttcac gtaacattcc tactcagtaa 4200 agctagagta gcacctttga aatcagtaac tattccacgt cttgaattga cagctgctgt 4260 gcttgctgct cgagtggaca aaatgttaag ggcagagatg cagtttccat tggttgattc 4320 agtattctgg acagatagta cgtccgtgtt gaaatacata aaaaatgagg acaaacgttt 4380 tcttaccttt gtcgccaaca ggatctctgc aataagagaa attacaacac cattacagtg 4440 gcgatacata cctacgactc aaaatccagc agactgttgc tccagaggtc tgaaagcaga 4500 tcagctgtta acaaacagag aatggatcaa tggaccaagt tatctgtgga aaccagtgga 4560 agaatggcca gcacaaatat ttgattctgc tcttaaagca gatgatcctg aagtcaagag 4620 gaacatatgt gtaaatagca ctcttgtaag tgaacagtca aatgctactc ataggctgat 4680 tactcatttc tcagactgga caaagttgaa ggttgctgtt gcttggctct tgaaattaaa 4740 gacagttctg aaacaacgaa gtcaaaagag aaaggaaatt caagcctctt tagactcttc 4800 tggagtaact tctaaaaaga cggagaaaga acttcgaaaa ctaactggtt ctgtggttac 4860 accaaattta agtttgcagg atgtgaatga agctgaagta gctattgttg cattcagtca 4920 aaggcaacag tttaaaaagg aaatcgatgc tctgtcagtt tcacctgttt caaaaataag 4980 ccgagacagc aagctgtata aattggatcc cgtctatcaa gacggtattt tgagagtagg 5040 aggacgattg agaaattctg ctatgccgga agaaagaaag catccgatca ttctggctaa 5100 ggatcagcat attgccacgc tcatcttgcg tcatatacac caacagttag gtcacagtgg 5160 aagaaattat atgctttcag agctgaggaa aagatattgg atcattaagg gtaatgcagc 5220 tgccagaagg attatatcga gttgtggcca ttgtagacgt tttggagtta aggtgggtga 5280 gcagaagatg gcagacttgc caaaggaacg actccttccc gatttgcctc cattttcgaa 5340 tgttggggtg gattattttg gtcctgtgga agttaaaaga ggacgctcca ttgtaaaacg 5400 ttacggagtc atctttactt gcatggcaag tcgagcagtt catctggaag tcgcctactc 5460 attggacact gattcatgta ttaacactct tcgaagattt atctgcagac gagggcaagt 5520 aactcaaata agatcagata atggtaccaa ctttgttgcc gcagaaaagg aattaaagga 5580 atcactgaga tcatggaatc aaaacaagat tcaaagggca atgctccaga aaggtgtcca 5640 gtggagtttc aatcctccag gaggttcgca ttatggtgga atctgggaac gtgtaatcag 5700 aatggtcaaa ataattttga attccatcct acatcagcag attttggatg atgaaggatt 5760 ccatactgta ctatgtgaaa ttgaggccat tctaaatgat cgcccaataa ctaagttgtc 5820 agatgacccc aatgatttgg agtcacttac tccaaaccat cttcttcaga tgaaaggtaa 5880 accagtatta ccacctggat tatttgaaaa atctgaatta tacagcaaga ggagatggaa 5940 acaggtccag tatatgtgtg acttattctg gaaaagatgg acaaatgaat atttgccttt 6000 gcttcaagat cggcagaaat ggaataagga aaggagaaac tttgttcctg gtgacattgt 6060 tgtcattgtg gactcgtctg cccctcgagg gtcttggtta atggggcgaa ttctggaggt 6120 ttttccagat aaaaatggac ttgttcgttc tgtgcgagta caaactaaaa ccagtgtact 6180 tgagagaccc gtgacaaaac tttgtctgtt gtatgacaac atggaaatat aatggactga 6240 tggtaaagac tgttaaattt gacacctaca cagaattgat taatgcaatg tttatagtgt 6300 tgctttaatg gctctttgta tgtacattgt tgaattgtta tgtctacctg ccatgtacaa 6360 ttaggggccg ggg 6373 // ID BEL24-I_DR repbase; DNA; ZEB; 6788 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL24-I_DR; KW BEL24-LTR_DR; BEL24_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6788 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1395-1395 (2008). XX DR Genome; chr20; Positions 53248383 53255170. XX CC 'GTACA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 638..6625 FT /product="BEL24-I_DR_1p" FT /translation="MEAVHAEEAMEDTHSTHEEVHAVKQADEMSLPTVEIS FT NPVVQQVRSSTRERSFTERGLEMREQEAKQQEKAFHKAYNSWKETAKDCRS FT TLKMFCSRENLEQIQQDIQNQCNLVHQHYEPILRNHATSPDIVNRMDACAA FT LTAEIIDLVSKRLETIGEDYNEELVKERVRQVLNKDEYGSIFGCTKTNTVM FT SESSLGSGNQSKTSSKISSKRAEAEAELAAKQEQAKAMQEIHDQQAKLNKM FT ESEWKLSEAKMLAEIKQREVEMQLKLEEERKRLHQLQVDKEVKVAAARVKV FT YNDLEGISQFNDKEIDSSIHRFQLNPEAELFLPQQTFKAEHEAQPSQNNVS FT LAQAIADSLSTHRLPVPEPMIFAGDPLKFIDWKMSFIALIERKPLPPGEKM FT FYLKKYLVGEARKAVEGYFYRNSEDAYHGAWKVLQDRYGNSFILQKAFRDK FT LMRWPKIGINDPLALRDFTDFLQGCAEAMPHIKGLSILNDCEENYKLLKKL FT PEWVVHKWNRIVTEELDASGDYPSFKCFTEFLQKETRIACNPITSPFMNMK FT NSDEKFPKRAKALSTKTDVKEFSSKGLNTSSVKSKVSCIVCKDENHSIVQC FT SAFGEKTIEDKKNFIQENQLCYGCLRKGHIVKKCRRRHKCGTCGRNHPTCL FT HEERHMVSPKPLNKTSTEVQASQEVQKVMAHALTQSSSATSSIVPVLISTV FT EEPQREVLTYALLDTQSDSTFILEDLLDDLNAVKQPVQLRLSTMTAVDTIT FT ASNRVCGLQVRGLQAANSIQLRQAYTRDFIPVDKSYIPTKSTALEWPHLKH FT LANQLPPLQNCEVGLLIGYDCPSALAPLEVIIGHENEPFAQRTELGWSIIG FT LSNPHLDRQGNQSFVHRVAVKEIFVPSPNDVLKILESDFNEKGYEDKSVSQ FT EDVRFIQHLSTNIKQKDNGHYELPLPFKSISQPSLPNNRRLAVARLQQLKK FT KLKSNKQYSDDYKVFMKEVVDKGDAELAPEISEGETVWYIPHHGVYHPQKP FT GKLRVVFDCSAKFSGISLNDTLLTGPDLINSLVGVLCRFRKELVAVTCDIE FT KMFHQFLVPPDERNYLRFLWWEDGDWEKEPQDYRMTVHLFGATSSPGCANF FT GLKYLAQQYEVKHPTASEFVKRNFYVDDGLASVRSIDEAKELITDAQALCK FT QGGLRLHKFNSNKEDVLCCIDPSERDIVSKPLNLNPEATPTGRVLGVQWLT FT RDDNFQFNINYKDQPSTRRGILSVISSLFDPIGFVAPFILQGKCILQELCR FT KNIGWDDQLPEDMYSRWEDWKGGLQRLKEVVIPRCYHPDTFNEIIETELHH FT FSDASNIGYGACSYLRFKNDKGKVHCSLVMAKARVAPTKVTSIPRLELAAA FT VLSAKISVMLKTELEMKIDREFFWTDSQVVLAYINNEARRFHVFVANRVQL FT IRDITDSSLWYYVNTLENPADHASRGFHASDIATSTWLRGPKFLWEQEVNP FT TPHTSANLLVGDPEVKPVQTFVTTVSDSSDILSRFRRFSCWSMLLKVVARI FT KRLGLKQKCSTDHITVEERQRAAEVVIKLMQQEAFSKEMRMIENGIALPNS FT SALYQLDPVLDKGLLRVGGRLKKSSLSQDLKHPVILPRDSYITKLILSHYH FT AKICHQGRTQTQMQLRMNGFWVIGGSKSVAKLIHKCVQCRRLRRPTEEQRM FT AELPKERVEVSAPFTFCGMDCFGPFVVKRARKEYKRYGLIFTCLSSRAVHI FT EMIEDLSTDAFINALRCFISLRGAVCKLYCDQGTNFVGARNEFKDCLKQVD FT IKTLEVFLAEKQCEFAFNAPSASHTGGVWERQIRTVRSVLNATIALCPGRL FT DDASLRTLFYEAMAIVNSRPLTVDGINDPNSLEPLTPNHLILMKSDVALPP FT PGKFVKEDMYATKRWRRVQYLVEQFWSRWKKEYLLNISTRQKWHTPRRNLR FT VNDVVIIKEDMLPRSQWQLGRVVETVKESDGFVRRVKVRVAERKLTNKRNQ FT TPKLSIIERPIQKLVVLLEED" XX SQ Sequence 6788 BP; 2204 A; 1319 C; 1484 G; 1781 T; 0 other; gtgaaactcg tcgcttaaag gaaccacgca tctgtttcat cgagacgcag gctggcagct 60 ggcagagtcg tgacgtcatc gcacaggctg atcggaggct gcagttaccc atactagtcg 120 gaggtggcgc gcgtgctgtc attcactgtt gcagatacag tgggagactt tgagtggcaa 180 gagagtttaa gaacatattc ctacgtaagc agcatcaagt catcttgaac ttagtaattt 240 taatgcttta aaggaaaagc aaatatttta tggacatttg agttaaaagg gttacattta 300 aaaaggactt taaataaaag gaaataaata atgttataaa gaacatttac ttcatagttt 360 aagatttgaa ttacattatt tgagtagaat ttaagaaaaa aaaaaagaac ttaatttttt 420 ttatttagca ttagaaaaat tacaatcagt gttaatgtta gctgtactta tttgaaaact 480 gacatttcct tgttattcct gttacaagct aatattttgg atgtcatata ctgctgtttg 540 catttatatt gattttgttt gctaatttat ttgggtgctt acgtgctaga ggtctggtga 600 gagtgttatt attcactgct taagttcctc tttaattatg gaagctgtac acgctgaaga 660 ggctatggaa gatacacata gtacccatga agaagtgcat gcagtcaaac aagcagatga 720 aatgtcactc ccaacagtag aaataagcaa cccagttgtg cagcaggtca gatcaagcac 780 acgtgaaaga agcttcacag aaaggggtct agaaatgcgt gaacaggaag ccaagcaaca 840 agagaaagct ttccataaag cttataacag ttggaaagag actgcaaaag attgcagatc 900 aacactaaaa atgttctgtt cacgtgagaa tctagaacaa attcagcaag acattcaaaa 960 tcaatgtaat ctagtccatc agcattatga gccaatattg cgtaaccatg ctacatctcc 1020 agacattgtc aatcgtatgg atgcttgtgc tgcactgact gctgaaatta ttgatcttgt 1080 gagtaagcgg ctggagacaa taggtgaaga ttacaatgaa gagcttgtga aggaaagagt 1140 gagacaagtc ttaaataaag atgaatacgg atcaatcttt ggatgcacaa aaacaaacac 1200 agtgatgtca gagtcatcgc taggatcagg taatcagtca aagacctctt ccaagatatc 1260 aagtaagcga gctgaagcag aagctgaact tgcagcaaaa caagagcaag ctaaagctat 1320 gcaggaaatc catgatcaac aagcaaagct taataaaatg gagagtgaat ggaagctttc 1380 tgaagctaaa atgctagcag aaataaagca aagggaggta gaaatgcagc ttaagcttga 1440 agaagaaaga aaacggctgc atcagttgca agtagacaag gaagtaaaag tagcagcagc 1500 tcgtgtgaaa gtgtacaatg atcttgaagg gatcagtcaa tttaatgata aagaaataga 1560 ttctagcatt cacagatttc agttaaaccc agaagctgaa ttgttcttgc ctcagcaaac 1620 attcaaggca gagcatgaag cccagccatc tcagaataat gtttctttag ctcaagcaat 1680 agcagactca ctaagcacac atcgactgcc tgtgcctgaa cccatgatct tcgctggtga 1740 tccattgaaa ttcatagatt ggaagatgtc cttcatagct cttatagaaa gaaagcccct 1800 cccacctggt gagaagatgt tctatcttaa aaagtacctt gttggagaag ctcgtaaggc 1860 tgtggaggga tatttctata gaaattcaga ggatgcatac catggagcat ggaaggtcct 1920 gcaagataga tatgggaatt cattcatact acaaaaagcc tttagagaca agctgatgag 1980 gtggcccaaa attgggataa atgacccact tgcactgcga gacttcacag atttcctgca 2040 aggatgtgct gaggcaatgc cacacattaa gggattatct attctaaatg actgtgaaga 2100 aaactacaag ctactaaaga aacttcctga atgggttgta cacaagtgga atcgaattgt 2160 gacggaagaa ttagatgcat caggggacta tccaagtttc aagtgtttta cagagttctt 2220 gcaaaaggaa actagaatag cttgtaaccc cattacttcc ccattcatga atatgaaaaa 2280 ttcagatgaa aagtttccca agcgagctaa ggcgctcagt acaaagactg acgtgaaaga 2340 gttcagttct aaagggctta atacatctag cgtaaagtca aaagtatcat gcattgtctg 2400 taaagatgaa aaccacagca ttgtccaatg ttctgcattt ggagaaaaaa ccattgaaga 2460 caaaaagaac ttcattcaag aaaatcagct ctgttatggg tgtctcagga aagggcacat 2520 tgtaaaaaaa tgcagaagac gacataagtg tggaacatgt ggccgcaatc accccacatg 2580 cttacatgaa gaaagacata tggtgtctcc aaaaccatta aataaaactt caacagaagt 2640 acaagcaagt caggaagttc agaaagttat ggcccatgca ctcacacaaa gctcttctgc 2700 tacttccagt attgtgccag tccttatttc aacagtggaa gagcctcaaa gggaagtact 2760 cacttatgcg ctactggaca cgcagagtga ctcaacattc attttggaag atcttcttga 2820 tgatctaaat gcagtcaagc aaccagttca acttagatta agcactatga cagctgtgga 2880 tactataacg gcaagtaaca gggtttgcgg tctgcaagtt agaggactcc aagcagcaaa 2940 ctccattcag ttacggcaag catatacaag agacttcatt cctgttgata agtcctatat 3000 cccaacgaag agcacagcac tggagtggcc tcacctaaaa catctggcca atcaattgcc 3060 accactgcag aactgcgaag tcggattatt aatcggatat gactgccctt cagctctagc 3120 tccccttgaa gtcataattg gtcatgaaaa tgaaccattc gcacaaagaa ctgaacttgg 3180 ttggagcatt ataggtctct ctaatccaca cctagacagg cagggaaatc agagctttgt 3240 ccacagagtt gcagtcaagg agatatttgt tccatcaccc aatgatgtct tgaaaattct 3300 tgagtcagat ttcaatgaga aaggttatga ggataaaagt gtgtctcaag aggatgttcg 3360 tttcattcaa catctcagta ccaacatcaa acagaaggat aatggacatt atgaacttcc 3420 cctccctttc aagagtatca gccaaccatc actgccaaat aatagaaggc tggcagttgc 3480 ccggctgcag caattaaaga aaaagctcaa atctaacaag caatattcgg atgactacaa 3540 agtcttcatg aaggaagtcg ttgacaaagg tgatgctgaa ctagcccctg agatatcaga 3600 aggagagact gtatggtata ttccacacca cggagtatac caccctcaga aaccaggtaa 3660 gctgagggtt gtgttcgact gttcagctaa gtttagtggt atatctctaa acgacactct 3720 gttgactggg cctgatctaa taaattcatt agttggagta ctctgccgct tcagaaagga 3780 gctagtcgca gtcacctgtg atatagaaaa gatgtttcac cagttccttg ttcctccaga 3840 cgaacgtaat tatctgaggt tcttatggtg ggaggatgga gactgggaga aagagcctca 3900 agactatcgc atgacagtcc atctctttgg cgctacatcg tctcctgggt gtgccaattt 3960 cggtttgaag tacctggcac agcagtacga agttaaacac ccaacagctt cagagtttgt 4020 taaaagaaac ttctatgtag acgatgggct agccagtgtt cgttcaattg atgaagccaa 4080 ggaactaatt actgatgctc aagcactctg taaacaagga ggtttacggc ttcataaatt 4140 caactccaat aaggaagatg ttctctgctg catagaccca tctgaaaggg atattgtctc 4200 taaacctctc aatttaaacc ctgaagcaac accaacagga cgtgtacttg gtgttcagtg 4260 gttaacaagg gatgacaatt ttcaattcaa catcaactat aaagatcagc catcgacccg 4320 ccgtggaatc ttgtctgtca ttagttccct gttcgaccca attgggtttg tggcaccctt 4380 tattcttcag ggaaagtgca tcctgcagga actatgtcgg aaaaacattg gatgggatga 4440 tcagctacca gaagacatgt attcacggtg ggaggattgg aaaggtggac ttcaaaggct 4500 taaagaagtt gtcataccaa gatgttatca ccctgacacc ttcaatgaga tcattgaaac 4560 ggagctgcac cacttctccg atgcgagcaa cattggatat ggagcttgct cttatcttcg 4620 tttcaaaaat gacaaaggta aagtgcactg cagtttggta atggcaaagg ctagagtcgc 4680 acctacaaaa gtcacaagta taccaagact ggaacttgcg gcggctgttc tatctgcaaa 4740 gataagtgtc atgctaaaga cagagcttga aatgaagatt gacagagaat tcttttggac 4800 agattcacag gttgttttgg cttatattaa taacgaagca agacggtttc atgtattcgt 4860 tgctaaccgt gtgcagctca taagggacat tacagattca agcctgtggt actatgttaa 4920 taccttagag aacccggctg atcatgcttc cagagggttt catgcctctg acattgctac 4980 ttcaacctgg ttaagaggtc ccaaatttct gtgggaacaa gaagtgaatc caacaccaca 5040 tacttctgcc aatttgcttg ttggtgaccc cgaagttaaa ccagtccaaa catttgtcac 5100 cacagttagt gacagttccg acattttaag tcgtttcaga cgattttctt gctggtcaat 5160 gcttctgaag gttgtggcaa ggataaaaag acttggattg aaacagaaat gctccactga 5220 ccatataact gttgaggagc gtcagagagc tgctgaagtg gtgatcaagc tcatgcagca 5280 ggaagcattt tccaaagaaa tgagaatgat tgaaaatggg attgctcttc caaattctag 5340 tgcgttgtat caactggatc ctgttttgga caaaggtctt cttcgggtag ggggaagatt 5400 gaagaagtca tcactcagtc aagacctcaa acatcctgtg attctaccaa gagatagtta 5460 catcaccaag ctaattctgt cacattatca tgccaaaatt tgccatcagg gtcgaactca 5520 aactcaaatg cagcttagaa tgaatggatt ctgggtcatt ggaggtagca aatcagttgc 5580 taagctgata cacaagtgtg tgcagtgtcg aagacttaga cgaccgacag aagaacaacg 5640 aatggccgaa cttcccaagg agcgtgtcga agtatctgct ccctttacat tttgtggcat 5700 ggactgtttt gggccatttg tcgtgaagag agctcgcaaa gaatacaagc gatacggtct 5760 aatattcact tgtttgtcat ctcgcgcagt tcatattgaa atgattgaag atttgtccac 5820 agatgccttc atcaatgcat tgagatgctt tataagcctc agaggagcgg tttgcaaact 5880 ttattgtgat caaggtacaa actttgtcgg ggccaggaac gaattcaaag attgcctgaa 5940 acaagtcgac atcaaaactc tggaagtctt tctcgcagag aaacaatgtg agtttgcctt 6000 taatgctcct tctgcaagtc acactggagg tgtctgggaa cgacaaatac gaactgtgcg 6060 cagtgtactt aatgccacaa ttgcactgtg tcctggtaga cttgacgatg cttcacttag 6120 aacattgttc tatgaggcaa tggccatagt taacagtcgc ccgcttacag tagatggaat 6180 taatgatcca aattcactag agcctctaac tccaaaccac ctaatactta tgaagtctga 6240 tgttgcactt ccacctccag gaaaatttgt taaagaagac atgtacgcta caaaaagatg 6300 gcgaagagtc caatacttgg ttgagcaatt ttggagtcgc tggaagaagg aatacttgct 6360 taacatttca acacgccaaa agtggcacac tcctcgcaga aacctcagag taaatgatgt 6420 tgtaataatc aaagaggata tgctcccaag aagtcaatgg cagttgggca gagtggttga 6480 aactgtcaaa gagagtgatg gatttgtgag acgagtaaag gtacgagtag cagaacgaaa 6540 gctcactaac aaacgtaatc agacccccaa gctctcaatc attgaaagac cgattcagaa 6600 attagttgta cttcttgaag aagactgact gttagtaact gggatgatac actctcatca 6660 gaccaaataa ttatttctga aatacttgat agtcatgaaa agtaaacgtg gtcatgttat 6720 agttcaaaat catgttctga ttaatttcag tttgttcttt caattcatca taaaatgatt 6780 ggtgggag 6788 // ID BEL17-I_DR repbase; DNA; ZEB; 5772 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL17-I_DR; KW BEL17-LTR_DR; BEL17_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5772 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1381-1381 (2008). XX DR Genome; chr13; Positions 52486511 52480740. XX CC Positions [4786-5346] - Integrase core CC 'GTTGG' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 421..5694 FT /product="BEL17-I_DR_1p" FT /translation="MRQLQFHMGQQRHESQPPAPVAPPRQYTSTHAVPAPR FT VWPQGGTNDMSFTPTPAPRLKYQSEYKARDQIRSLPHSSALRGNLSYHYVP FT QGDQSLNAHYSNPPYWHLQDPVRHPQIPFRTQTPSPPPEQGPYRGPTPMIP FT DFSHPSPREFSRLKIALENILPANATESFKFQILTDHLKLEEALLIADSYC FT HSQHPYTRTMAALDQQYGQPHQLALQRIAELMDGPNIASGDQKAFRLFALK FT VRSLVGMLEQLGRNGSFELQCGSHVSRLLGKLPHDLRSGFRRYAHPHQVPI FT PTLLDFAEWLDFEIQVQEDTTRFASSQRRMPQTRTRENLRDYKPAIKPTTI FT YLGTEKATVEPIPPIPLSRSSLKPYCPYCDNSKHSLNNCSNFKQLTKDQKQ FT SWIKENNRCWRCGRTHKSADCNLKMRCRQCSSRHLMALHEISVGRPENPKL FT TQDESADPKTCLLNTMQEILLVHKPPTSRKVLLKICRVILSNGSKRMNAYA FT IQDDGSERTIILHSAAQQLGLTGQPEELPLRTIRQELQVLKGAAVSFTVSP FT IAQPTKRFHITSAFTAQQLSLAEHSHPVKSLKERYRHLKGLPLQEFKAVCP FT VLLIGSDYPHLITPVEPVRLGPHGGPAAIKTRLGWTLQGPVQHMPKDVTEQ FT HCLFTSVTSSESDLYKQVEKLWQMDVLPWRSDKACIRSRQDQEAVELLEKR FT TIRVEMDGVKRYATPLLRVKNMPELKAPKEAVLSQLRSTERRLAKNPQQAA FT AYITEIAKLEKSGYVKRVPPNAVTNTPCSWYIPHHMVEHNGKNRIVFNCSF FT QYHGQNLNELLLPGPVLGPSLLAVLLRFREHSVAVSSDIKGMFHQVRLLPE FT DKPLLRFLWRELNVQEQPTVYEWQVLPFGTTCSPCCAIFALQKHILDHSHP FT GDDVQNSVLKSFYVDNCLQSFTSAEAAKEFVDSIKNQLADGGFELRQWSSN FT IPSTINHLPPESISSSAELWISQGKSDIQESTLGLLWNHQSDTLSYKYRAK FT DSRETTMRNIYRILASQYDPLGYLIPYTTRAKILVQQLWDKKRDWDDPHLP FT TDLLQTWTEWEAELPALQNIVLPRCYCSANKDTETSLRDMHVFCDASERAY FT GSVAYLRTEDQYGQVEVAFLTARSRVAPKKQQSIPRLELCAALTGAQLSKV FT LLTELNLPIRHTILWTDSTTVLAWLQSESCRFKVFVGTRVAEIQELTEQHS FT WRYVPSTSNPADDITRGKSLCELNLESCWYQGPHFLKDPANQWPECPSPKE FT IDRDELRKTGTHVQVCFASSDYSQFKTLEELIDFTALQGAANGNPTASDYR FT RAEIDVLQKSQQESFPSDLTQLKAGKSLSSNSKLRALTPELDIETNLIRVG FT GRLRHSPYLGPDNMHPIILDPRHPTTKLIIQSYDSKLRHPGPERLFAEIRR FT KYWILRGREAIKHLQRECLHCQKWRKKPEVPRMADLPPARLRLFKPAFHST FT GMDCFGPYTVKTGRRNEKKWGIIFKCLTTRAVYIDILHSLETDSFLMALRR FT FTARRGKPQELLSDQGTNFRGGERELKEAFTALAPDLQSQLAKQQIEFHFN FT PPNSPHFGGCWEREIRSLKNALMVTLGNQSVTFEVLQTVLAEIEGILNSKP FT LGYTSSDASDSNPITPNCLLMGRLDASLPLTVYPQSELVSRRRWRHSQILA FT DQFWRHYIKFYLPGLQSRQKWQNDTPDTQVGSTVLIVDPQLPRSLWPVGKV FT VDVHPGADTRVRTAKVQVGKKTYTRPVARLIQLPAIPD" XX SQ Sequence 5772 BP; 1690 A; 1487 C; 1254 G; 1341 T; 0 other; tggtccttcg agccggagcg aggactgacc agagaaatag aagatgtcac aacattcagg 60 ggatgagcct gctgtgcgcg tcaggaggaa gtcagtcctc cccgcacgtt atgatgagta 120 cgacctcact ggattcactc tccctaagcc tgttccaccg cccccgtcac cactcactca 180 ttcacctaga ctggctgaat gcgatgatac acaggatgga gcatttggtt ttactccagt 240 cctacccggt catgaagctg acagcccatc acagtggtct gatgatgaag gagcttctga 300 tatcgtagca agagaaaatg cgaaactaca caacagtctg tggacgatac agcaagaaag 360 ggacatattc cagcgtgcaa atgaacaata tgcaaaggag ctcatcagct ccagcaccaa 420 atgagacagc tacagtttca tatggggcag caaagacatg aaagtcagcc ccctgcacct 480 gttgcccctc caaggcagta tacctccacc catgctgtac ctgcaccccg tgtctggcca 540 caggggggca ctaacgacat gagcttcacc cctacacccg ctccacgact caaatatcag 600 tctgagtata aagcaaggga ccagattaga agcctacctc acagttctgc cctcagaggg 660 aacttgtcat accactatgt gcctcaagga gatcaaagcc tgaatgctca ttattcaaat 720 cctccttatt ggcacttgca agatcctgtg agacatccac agatcccgtt taggacacaa 780 actccatcac ctccacctga acaaggacca taccgcggtc caacccccat gattcctgac 840 tttagccatc ccagccctag agagttctca cggctaaaaa ttgctctgga aaatatactc 900 ccagccaatg ctacagaaag tttcaaattt cagatactga ctgaccatct gaagttagag 960 gaggctctgc taatagcaga ctcatactgc cattcacaac atccctatac aaggacgatg 1020 gctgccttag atcagcagta tggacagccg catcaacttg cactgcagcg gattgctgag 1080 ttaatggatg gcccaaatat agctagtgga gatcagaagg cctttagact tttcgccctt 1140 aaagttcgtt ctctggttgg catgctggag cagttgggca gaaatggctc ctttgagttg 1200 cagtgtgggt ctcatgtgtc tcggctcctt gggaagcttc cacatgactt aagatcaggg 1260 tttcggagat atgcacatcc acatcaagtg cccatcccta ccctcttaga tttcgcagaa 1320 tggctagatt tcgaaatcca agttcaggag gacaccacaa ggtttgcgag cagtcaacgc 1380 agaatgcccc aaacacgcac aagagagaac ttaagagact ataaaccagc cattaaaccc 1440 accacaatct atctgggcac ggagaaagca actgttgagc ctataccgcc tattccatta 1500 tctaggagct ctttgaagcc ttactgcccc tattgcgaca acagtaagca ctctctcaat 1560 aactgcagca actttaagca acttactaaa gaccagaaac aaagctggat caaagaaaat 1620 aatagatgct ggcgttgtgg cagaacacat aaatctgcag attgcaacct gaagatgcgc 1680 tgcagacagt gcagcagccg acaccttatg gcactgcatg aaatcagtgt tggaaggcct 1740 gaaaacccaa agctcactca ggatgaatct gcagacccaa agacttgtct gttgaacaca 1800 atgcaggaga ttctccttgt tcataaacca ccaactagta ggaaagtcct acttaaaatc 1860 tgccgggtga ttctcagtaa tggaagcaaa aggatgaatg cctatgcaat tcaggatgat 1920 ggatctgaga gaaccataat cctccatagt gcagctcaac aacttggcct aactggtcag 1980 cctgaagaac tcccacttcg aactattcga caagagttgc aagtcctcaa aggggcagct 2040 gtttcattta ccgtctcacc cattgctcag cctactaaac ggtttcacat cactagtgcc 2100 ttcactgctc agcagctaag tttggcagaa cactcacatc cagtgaaaag tctgaaggag 2160 agatatcggc accttaaagg acttccactc caagaattca aagccgtctg ccctgtattg 2220 ctcattgggt cagactatcc ccacctgatc accccagtgg aaccagtcag gctagggcca 2280 catgggggac cagctgctat aaaaactcgt cttggttgga ccttgcaggg tccagtccaa 2340 cacatgccga aagacgtgac agaacagcat tgtcttttca cttcagtcac atcctcagaa 2400 tctgatctct acaagcaagt agagaaatta tggcaaatgg atgtattgcc atggcgcagc 2460 gacaaggctt gcatcagatc cagacaggac caggaggctg tggaacttct ggaaaagaga 2520 acaataaggg ttgagatgga tggagtgaaa cgatatgcaa cacccctctt gcgtgtcaag 2580 aatatgcctg aactcaaggc accaaaggag gctgttttgt cacaactgag atccactgaa 2640 cggagactgg ctaaaaatcc acagcaagca gccgcctaca tcacagagat tgcaaaactg 2700 gaaaaatcag gttatgttaa gagagtgcca ccaaacgctg tgacaaacac cccctgctcc 2760 tggtacatcc cacaccatat ggtggaacac aatggcaaaa atcgcattgt tttcaactgt 2820 tcattccagt atcacggtca aaacctcaat gaacttctgc ttccggggcc tgtgttgggg 2880 ccatctcttc ttgctgttct cctcagattt agagaacatt ctgttgcggt cagcagtgat 2940 atcaagggaa tgtttcacca agttagactg cttccagagg acaaacccct tttgcgcttt 3000 ctctggagag agctaaatgt ccaggagcaa cccactgttt acgaatggca ggttctcccc 3060 ttcggcacta cttgcagccc ctgctgcgcc atatttgcac tgcaaaaaca cattcttgat 3120 catagtcatc ccggggatga tgttcagaat tcagtactga aatcctttta tgtagacaac 3180 tgcctacaaa gtttcacttc agctgaagca gcaaaggaat tcgtggacag tattaagaat 3240 caactggcag atggaggttt cgagctaaga cagtggtcca gcaatattcc ttccaccata 3300 aaccacctgc cccctgaatc tatatcaagc agtgctgaac tctggatatc gcaaggcaaa 3360 tcggacattc aggaatcaac attagggctc ctctggaatc atcagtctga cacactctcc 3420 tacaaatatc gtgccaaaga tagtagagag actacaatgc gtaatatata tagaatactg 3480 gccagtcagt atgacccatt gggctacctc attccttaca ccactcgtgc caagattctt 3540 gtacaacaac tgtgggataa aaaacgggat tgggacgatc cacatttgcc tacagatctg 3600 ctacaaacat ggactgagtg ggaggcagag ttaccagctc tacaaaacat tgtccttcct 3660 cgatgttact gtagtgccaa caaagacaca gaaacgagcc taagagatat gcatgttttc 3720 tgtgatgcct ctgaacgtgc ttatggctcg gtggcatacc ttcgaacaga ggaccagtat 3780 ggacaagtgg aggtggcatt cctaaccgcc agatctcgtg tagcaccaaa aaaacaacaa 3840 tctatcccca gactggagtt gtgcgctgca ctgactggag ctcaactctc caaagtactc 3900 ttaactgaac tgaacctgcc cattcgccac acaatcttgt ggaccgactc cacaacagta 3960 ctggcatggt tacagtctga atcatgtcga ttcaaagtat ttgttggtac tcgggttgca 4020 gagatccagg agctaactga gcaacactcc tggcgatatg taccctccac cagcaatcca 4080 gctgacgaca tcactcgggg aaagagtctc tgcgagctca acttggaaag ttgctggtac 4140 caaggaccac atttcctaaa agatccggct aatcagtggc ccgaatgtcc tagtcctaag 4200 gaaatagaca gagacgaact acgcaaaact ggtacacatg tacaggtctg cttcgcatca 4260 tcagattaca gccagtttaa gaccctggag gaactaattg atttcacagc tcttcaaggg 4320 gcggccaatg gtaaccccac agccagtgac tataggcgtg ctgagattga tgtcctgcaa 4380 aaatcccaac aagaatcctt cccatcagac ctgacccagt taaaggctgg taaatcactt 4440 tccagcaata gcaagctaag agccctgacg cctgaacttg atatcgaaac taatttaatc 4500 agagttggcg gacgtctacg acatagtcca tatctgggac ctgataatat gcaccccatc 4560 atcttggatc caagacaccc gacaaccaag cttataattc aaagttacga ttcaaaactt 4620 cgtcatccag gtccagagag attgtttgcc gaaatccgcc ggaaatactg gatcctgcgt 4680 ggtagagagg ccatcaagca cttacaacga gaatgtttac attgccagaa atggaggaag 4740 aaaccagaag tacctagaat ggccgacctt cctccagcaa gattgcggct ctttaagccg 4800 gccttccact ctactggcat ggactgtttt gggccttata ctgtcaaaac ggggcgacgc 4860 aatgaaaaga aatggggaat aatattcaaa tgtcttacca ctagagcagt gtacattgat 4920 atacttcaca gtctggaaac cgattccttc cttatggctc tcaggcgatt cacagctcga 4980 cgaggaaagc cacaggagct cttatctgat cagggaacca actttagagg tggtgagaga 5040 gagctgaagg aagctttcac agctcttgcc cctgatctgc agagtcaact cgccaaacag 5100 caaattgaat ttcatttcaa ccctcctaat tcaccccact ttggaggttg ttgggagcga 5160 gaaattcgct ctttaaagaa tgccttgatg gtcacgcttg gaaatcagtc agtgaccttt 5220 gaagttcttc aaacagtgct ggcagaaatt gaaggaattc tgaactccaa accattggga 5280 tatacctcat ctgatgcttc cgactctaat cccattactc caaattgtct cttaatgggg 5340 cggctggatg cctcactccc cctgacagtg tacccccagt cagaactggt cagtagacgt 5400 cgatggcgcc atagtcagat ccttgcggac caattctgga gacactacat caaattctac 5460 ctccctggtc ttcaaagccg ccagaaatgg caaaatgata ctccagacac tcaagttggc 5520 tctacagtgc tgatagttga cccacaacta ccaagatccc tctggcctgt aggaaaagtg 5580 gtagacgtac accctggagc cgatactaga gtgaggacag ccaaagttca ggtgggaaaa 5640 aagacttaca cacgaccagt agcccgcctc atacaactac ctgctattcc agattaaatt 5700 aagcttaaat cccacaacag tataaagcct ttttgtgtgt gtacatattt cttacataaa 5760 tatgggggcg gc 5772 // ID BEL18-LTR_DR repbase; DNA; ZEB; 414 BP. XX AC chr13; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL18-I_DR; KW BEL18-LTR_DR; BEL18_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-414 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1384-1384 (2008). XX DR Genome; chr13; Positions 18129285 18129698. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 414 BP; 134 A; 56 C; 85 G; 139 T; 0 other; tgtaacagac ttaagtaaag gtgttatctt gcatttcttc atgtttaaag atcatttaat 60 ttctattatt tgtgggggac ttttattttg aaaggaaatc cgttttttca gagcagaaaa 120 taaatgagag agcaaaaggt aaagcggacg tttcataaca gtttaactag agctcgtact 180 gaaaaggact gtttacaagt ttaatcagca accccccgaa aaggcataag gtttgtttca 240 tcatatattg ttagtgagaa atacatattt tgtgaagata tctaagaatg ctgatgtatg 300 tttttgtatt ggtttgctca gctcttacgg tgatttcaag agtaaaggaa cttaaataaa 360 gcaacttgaa acatcagtaa agttttggcc atcattcgga tggggtctgc taca 414 // ID DNA-5-8_DR repbase; DNA; ZEB; 1480 BP. XX AC . XX DT 17-APR-2009 (Rel. 14.04, Created) DT 17-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE non-autonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 5-bp; KW DNA-5-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1480 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(4), 857-857 (2009). XX DR [1] (Consensus) XX SQ Sequence 1480 BP; 436 A; 252 C; 282 G; 510 T; 0 other; cagggtcccc gcggggtctt aaaaagtatt aaaaagtctt aaattaaaaa aatcgaaatt 60 tatggcttta aaaagtctta aatttgctgt tctaggtctt aaatattttt gcacaggtct 120 tatttttgcg atgtccatgt aacgctacat ctaatgctca tttaaattct ttttgttgtt 180 gttgctaggt ttttggggtg ttgtagttct ttatttcact agtccaattg taatttgcgg 240 tattacaact acaaaaaagt caacttgcaa tagccaatca gctttctgat attaaactca 300 gtgcgcgcgg cgaatgtgac atcatcgctg tgtttgcgga agttcgcttt gagtgcgcgc 360 gacatgattt cggctggcag agtatgccgt tatttcactg acaaatgtcg caagcgcagt 420 tttataagtc gggagcgctg taatacgaga gagcgaatct ctgctcacgc gccggtttcc 480 tttgttcatg gacaaaactg cttgcgagct ctcaaaacac tggctgctca cgtgcgaatg 540 atctgctctc gctccgtcgg actcctttcg tgcttttgct ctccagagat gcttcagcat 600 actgtagtat taagaatagt gaactgataa aatgaaatac agtgtagttt tactacagta 660 aagtgtgttg aattgtagaa aaatataccc tatattgtaa aaaaaaaaac taaagcactg 720 ggtagtttgt ttttaataca cttgttgtgt taccatagca actatagtat taccacaaca 780 gagtaattga agtactttac tgtactttac aaaacactgt acagcagaca ctagttcact 840 ttattttagt gataaaatgg gattgatagt gggatttatt tgaagagtga atagtacagt 900 aggctataat acctaatcaa gtttttacct attttctatc acagttacat ttctgcttgc 960 cagaatgacc aataattatt ggctagaaat tgcaacagcc atgggaaagg aggaaagttt 1020 ttgtaaaact ttggaaaaac tctgagggat aagtttgtta aaacaacaaa aaagaggccc 1080 ccaaaaacat cttcaatgat aatgattatc tttttattcg tctgatttta ctttctgttt 1140 ttttggaccg ccccctgtca tttaaaaaaa aaatatctca atggtgaacg tgtcaagtat 1200 aaaagcctta atccgtttcg tgagttgtgc gatgcggtgc caggtgaaag tataaatctg 1260 cattaagatg tttgtttttt ctgtagttca atggtgcatt taacctgact ttactactgt 1320 tcaatttaaa taactttatt ttacccctaa ttttgtgctt ttacattttc tctcgcgtgt 1380 ttttgatatt gcgatatagg tcttaaattt aatacttaat ggtcttaaaa aggtcttaaa 1440 aggtcttaaa tttgacatta tgatatctgc agagaccctg 1480 // ID TDR1 repbase; DNA; ZEB; 1236 BP. XX AC . XX DT 25-JUL-2000 (Rel. 5.06, Created) DT 25-JUL-2000 (Rel. 5.06, Last updated, Version 1) XX DE Danio rerio (zebrafish) Tc1-like element Tdr1 (a consensus). XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TDR1; Tc1-like transposon; zebrafish. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Radice D.A., Bugaj B., Fitch H.D. and Emmons W.S.; RT "Widespread occurrence of the Tc1 transposon family: Tc1-like RT transposons from teleost fish."; RL Mol Gen Genet 244, 606-612 (1994). XX RN [2] RA Izsvak Z., Ivics Z. and Hackett B.P.; RT "Characterization of a Tc1-like transposable element in zebrafish RT (Danio rerio)."; RL Mol. Gen. Genet 247(3), 312-322 (1995). XX RN [3] RA Izsvak Z., Ivics Z. and Hackett B.P.; RT "Repetitive elements and their genetic applications in RT zebrafish."; RL Biochem Cell Biol 75(5), 507-523 (1997). XX RN [4] RA Ivics Z., Izsvak Z. and Hackett B.P.; RT "Genetic applications of transposons and other repetitive RT elements in zebrafish."; RL Methods Cell Biol 60, 99-131 (1999). XX RN [5] RA Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (2000)June. XX DR [5] (Consensus) XX SQ Sequence 1236 BP; 432 A; 204 C; 243 G; 338 T; 19 other; cagttgaaat cggaagttta catacrctgt ataaaaaggc acataaccat ttaaaaaaag 60 tcagatgtta atgtgactaa actttttctc ttttaggtaa gtyaggatta tcacatttgt 120 ttctgttctg cttaatagca gaataatgag agagagattt tttgagaaat tgttataact 180 tttcttgaaa gtcaagttta cayacaataa gattattctg cctctggaaa agctcagatg 240 atgatgtcaa ggttttggaa gtttctgatt gacaacattt gagttaatkk gaggcacaac 300 tgtagaatag tattgaagga aaagctcaaa cacactgctt ccttgtgtga caacatggga 360 aaatcaacaa gccagaatca acmaaaaagc cagattacaa ttagctraat tacactggga 420 aaagactaat gtttggagac atgtcctgtg stctgatgga rctaagattg aactgtttgg 480 ccataatgac cagtgttaca tttggaggac aaaggggaaa gctyacaagc ctargaacac 540 catcccaact gtgmagtatg ggggcggcag catcatgttg tggggctgtt ttgctgcagg 600 agggactggt ccacttcaca gcatagatgg catcatgaag aaagaacatt atgkagaaat 660 actgaagcaa catctcaaga catcagccag gaaattaaaa cttggccaca aatgggtctt 720 ccaaacagac catgacccta agcatactgc caaattagtt aaaatgtgct ttaaggacaa 780 cagagtgaat gttttggagt ggccatcaca aagccctgat ctcaatccta tagaaaatdt 840 gtgggcagag ttgaaaaagc ttgtgcgagc aagacagcca acaaatctga ctcagttaca 900 ccaattctgt caggaggaat gggccaaaat tcctgcaaac tattgtgaga agcttgtgga 960 aggataccca aaacatttga ccaaagttat acagtttaaa agcaaagcta aaaaaatacc 1020 aaggaaatgt atgtaaactt ttgactgtct agaaattaat aaaaaaatct ytaaaahaah 1080 ttctctcatt attctggcat ttagcaaatg taaatcattt aggtaatcct aacggaccta 1140 aaatagtaaa cgtttagtat gatttaccat cagacatttw wtaaaaatgg ttatgttcct 1200 ttttttagag tgtatgtaaa cttccgactt caactg 1236 // ID Gypsy51-I_DR repbase; DNA; ZEB; 6568 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy51-I_DR; KW Gypsy51-LTR_DR; Gypsy57_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6568 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1417-1417 (2008). XX DR Genome; chr1; Positions 13304933 13298366. XX CC Positions [4929-5405] - Integrase core CC 'CAAT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 379..1740 FT /product="Gypsy51-I_DR_1p" FT /translation="MAEKGQNTSDLFTPLGGRGRGLYRVEGNTGATPKLLF FT PSSVSLPEVSPCPDLAQPACSTPSDSNNNVTQQLIDLIGELGSQIGESIAG FT RLLASQNVANTTPTTTTQLGQPSLETSKVSFIVRSDIKEPPMFKGDESDKY FT TVNEWIDVMELYLQKSSCPEAEKASTIQSHLLGRAKNIIKVGLKSTGSVSV FT EKIYDMLRHYFSDHPVSLLPLADFYATQPKFKESPVDYWVRLNSAAERANS FT HLQRSGSNMENMNAEVAMMFIRNCPDPELSSVFKCKPMSKWSIEEIQEAID FT EHQRENQCRRIPNTVKPHTLHVSTAKAVPVLEEVKESELTAANATACTPSS FT HSEAKVSQSNSEALERVLNMLERVLERTSQPAHNSPGSFQTHRARFSSCRV FT CGDPAHSTRSHCMRERRCLNCLRVGHQRRDCPNVVEATPQPTGTSHSQGNE FT YARIGEGTM" FT CDS 1746..6299 FT /product="Gypsy51-I_DR_2p" FT /translation="MENETLIDKDELQIFEESCKATPDKNVLFQNVMKLEK FT ADSLFYTDVSIVGANVTLRALLDSGSMACTINKEAECKLKSAGVVLTSSEK FT RPDIILVGCGGVNVQPESICHLEIEVYGYVVNVPMLVVPGQRDEMILGTNV FT IKYIVSQMKNVQGYWHVLSHPESSGNDDIAQFLNMLSGLHRWKGTEMPTKV FT GTAKLTQAVVLCPQREHLVWAKLPANVPISEGSAVLVEPAKSHSSKNVVIG FT RVVASMSSDGWVPVKVLNASDKPVTLRRNAKMADVSPCIALEDLDLSPQLP FT ASHELISSSQVTICMDDNATSSDSRSPNRLDSFGLGDLNIEACEVSQYWKE FT KLVQLICKYEDVFSKNKLDCGKANGFYHRIHLVDNRPFRLPYRRVPPAHYH FT KLRHVLSEMEELEIIRKSSSEWASPLVLVWKKSGDLRICVDYRWLNARTTK FT DAHPLPHQADCLAALGGNAIFSAMDLTSGFYNIEMAEEDKRFTAFTTPMGL FT YEFNRLPQGLCNSPASFMRLMMHVFGDQNFLTLLCYLDDLLVFAPNESEAL FT KRLELVFIRLRAHGLKLAPKKCHLLRRSVQFLGHVIDQKGVSADPDKVQAI FT AAMDERDLMLEDGVTPSQKKIRAFLGMVMFYQRFIQNCSSIAKPLFALTAA FT PKGQRNNGAKLSHWRKLSPSDWKSEQRDSFNMLKSALLETVVLAHPDFNRP FT FILSTDASLDGLGAVLSQIPEGETRARPVAFASKALTRAQSNYPAHRLEFL FT ALKWSICEKFSHWLKGHSFAVWTDNNPLTYILTKPKLDACEQRWVAKLAPY FT NFSIQYIPGSKNVVADAMSRQPFVQGRISQRLVSEPYETLMEQAEQFKKDV FT VQHAFRISAQCQSAEPQCLLSSDEVSAILDSRAEWSAGPKDKVHHWLIQDV FT HNLVPAGQDTLPMFTLGELQNKQLADPVLARVLSYVTRGRRPSRRERTKEA FT PRVLKTLKQWEKLKLLDGILYRVCKDQLTGKKRWQYVVPTSLTDQVLTGIH FT DKAGHQGQSRTLYLARERFFWVQMERDVREHVKCCKRCVVSKTPEPEAKAP FT LESVRTTRPLELVCIDFWSAEDRKGGTVDVLVVTDHFTKMAHAFACSNQSA FT RQVARQLWDRYFCIYGFPERIHADQGANFESELIQELLQFAGIKKSRTTAY FT HPMGNGLTERFNRTLGSMIRALPPRDKQKWPQMLQTLTFAYNCTAHESTGY FT APFYLMYGRIPRLPVDVMFHSIGRDDDIMDYDSYIAKMRDDLKEALNLAQA FT NARASQERQAELYNKGTKGCDIEEGDQVLLANKGERGRRKLADKWESIPYV FT VISKDSRCHTFRIKNTNTGREKVVHRNLMLKANFLPLEVEVGESCDGSLSS FT QEQSDGTLMESLTQNDPVETERIVEWVASLPAKDDSVRGGSDIGYQEDPVD FT SGEQRKLTSIQLQESENDSTGVNSHTVPATLTEELSVPRVDAVPCPRSTSS FT LSISSAPPNVSKIVTQVRTRAGRLVRPVNRLIQNMTQRVNHNAVVGVTKSL FT LF" XX SQ Sequence 6568 BP; 1902 A; 1368 C; 1612 G; 1686 T; 0 other; tctaactggt gccgtgaccc ggatccattg gtcagccgca gccgatcgcc ggggtcaagc 60 agtgtgctga gacgtcctgg tgctgtaccg tgtgacaagg atctacgcag cagatattca 120 agtgattccc attggacatt cgcacacaca tacgtcgtta ttcgtctaca acgaactata 180 cacaattaca cacattttac acatacacat tttggacatc agcatacatt ggacattaac 240 atttatttgg tgacttgatt tttcatttaa tgctatattc tgatttgaga ttttgtacat 300 ttggttattt tgggtttaca tattctcaag tttaagttgt aatagtaact cataattcag 360 tttgtggtat cattaataat ggctgaaaag ggacaaaata catcggattt attcacacca 420 ttaggtggga ggggtagagg tttgtacaga gttgaaggga atacaggagc cacaccaaaa 480 ttgttatttc ccagttcagt ttcgttacca gaggtgagcc cgtgtcctga tttagcgcag 540 ccagcatgtt caaccccttc tgacagtaac aacaatgtca ctcagcaatt aatagatctg 600 ataggagagc tgggtagtca gattggtgaa tccatagccg gtcgcttgtt agctagccaa 660 aatgtcgcta acacaacacc cacaactaca acacagttag ggcaaccttc actcgaaact 720 tctaaggtta gttttatagt gagatcagac attaaagaac caccaatgtt taagggtgat 780 gagagtgaca agtacactgt caatgagtgg atagatgtaa tggagttgta tctgcagaag 840 agtagctgcc ctgaagcaga gaaagcaagc actatccaaa gtcatctcct tggaagagcc 900 aaaaatatta ttaaagtagg gcttaaaagc actggttcag ttagtgttga gaaaatctat 960 gatatgctaa gacactactt tagcgaccac cctgtgtccc ttttgccact ggcagatttc 1020 tacgccactc agcctaaatt taaggagagc cctgtagatt actgggtgag attaaattct 1080 gcagcagagc gggctaacag tcatttacag cgcagtggca gtaacatgga aaacatgaat 1140 gcagaagtgg ccatgatgtt catacgaaac tgtcctgacc ctgagctatc tagtgtgttc 1200 aaatgcaaac ctatgagcaa atggtctatt gaggaaatcc aggaagctat tgatgaacat 1260 cagagagaaa accaatgtcg tagaatcccc aatacagtca agccccacac cctccatgta 1320 tctacagcta aggcagttcc tgttttagag gaagtaaaag aaagtgaatt aactgctgct 1380 aatgctacag cgtgcacacc cagtagtcat tctgaagcta aagtgtccca gtctaattct 1440 gaagcattgg aacgtgtcct caacatgctg gagagagtgt tggaacgaac cagccagccc 1500 gcacataatt ctccagggtc ctttcagaca catagggctc gcttttcctc ttgtcgtgtt 1560 tgtggagatc cagcacattc cacccgttca cattgcatgc gagagaggcg gtgtctaaac 1620 tgtttgaggg tcggacatca gcgtagagac tgtcctaatg tggtggaagc aaccccgcag 1680 ccaactggta cctcacactc tcagggaaac gagtatgctc gcattggaga ggggacaatg 1740 tgagcatgga gaatgaaacc ctcattgaca aggatgaact tcagatattt gaggaaagct 1800 gtaaggcaac ccctgataaa aatgtacttt ttcaaaatgt gatgaagttg gaaaaagctg 1860 acagtctgtt ttacaccgat gtctctattg taggagcaaa cgttacatta cgtgcacttc 1920 ttgatagtgg gtctatggcc tgtaccataa acaaggaagc tgagtgcaag ttaaagagtg 1980 caggggttgt tcttacatca agtgagaaaa gaccagatat catccttgtc ggttgtggtg 2040 gagtcaacgt tcagcctgaa agtatttgtc acctcgagat agaagtgtat ggatatgtgg 2100 tcaatgttcc aatgttagtt gtccctggtc agcgagatga aatgatcctg ggtactaatg 2160 tcatcaaata cattgttagc cagatgaaaa atgtgcaggg ctattggcat gttctgagtc 2220 atccagaaag ttcaggaaat gatgatatcg cacagttctt gaacatgttg tctggcctcc 2280 acagatggaa aggaactgaa atgcctacta aagtagggac agctaaactt acccaagctg 2340 tagtgttgtg tccccagcga gagcaccttg tatgggccaa gttacctgcc aatgtgccta 2400 tctcagaagg aagtgccgtt ctcgttgagc ctgctaaatc acactcaagc aagaatgttg 2460 ttattggccg tgttgtggca tcaatgagta gtgatggatg ggtgccagtc aaagttctaa 2520 atgctagtga caagcctgtg acattacgga ggaatgcaaa aatggcagac gtatcccctt 2580 gtatagccct cgaggacctg gacttgagcc cccaattgcc agccagtcat gagctgatat 2640 cctccagtca agttaccatt tgcatggatg acaatgccac gtcttctgac tcaagatctc 2700 ccaacagact tgactcgttt ggcctaggtg acctcaacat agaggcatgc gaagtgtccc 2760 agtattggaa ggaaaagctt gtgcagctga tctgcaagta tgaagacgtg ttctcaaaaa 2820 acaagttgga ttgtggcaag gcgaatggtt tctaccatcg tatacatcta gttgacaatc 2880 gaccctttag gcttccttat agacgagtgc caccagcgca ctatcataag ctcagacatg 2940 ttctttcaga aatggaggag cttgagatca ttcgtaaatc ctctagcgaa tgggcctccc 3000 cattggtgct tgtctggaag aaaagtggag atctgcgcat atgtgtcgac tatcgatggt 3060 tgaatgcccg cactactaag gatgctcatc ccttgccaca ccaggcagac tgtctagctg 3120 cgctgggtgg aaacgccatt ttcagtgcca tggatctcac ttcgggattc tataacatag 3180 agatggcaga agaagacaaa agatttactg cttttacaac gcctatgggg ctttatgagt 3240 ttaacagact tcctcagggt ttgtgcaaca gtcctgcaag ttttatgcga ctcatgatgc 3300 atgtatttgg agaccagaac ttcctgactt tactatgtta tctggatgac cttttagtat 3360 ttgcgccgaa cgagagtgaa gcactgaaga ggcttgagtt ggtgtttatt cgattgaggg 3420 ctcatggctt gaagcttgct ccaaaaaaat gtcacttgtt acgtcggagt gtgcagtttc 3480 taggtcacgt gatagaccaa aaaggagtat cagctgaccc tgataaagtt caggctattg 3540 ctgcgatgga tgagagagat ctcatgctgg aagatggggt cacaccatct cagaaaaaga 3600 taagagcttt tcttggaatg gttatgttct atcagcgatt catacagaac tgctctagta 3660 tagccaagcc tttatttgca ttgactgccg cgccaaaggg acaaaggaac aacggagcaa 3720 aattgtctca ttggaggaag ctaagcccaa gcgactggaa aagtgaacag cgtgactctt 3780 ttaatatgct gaagtccgct ctcctagaaa cagtggtgct tgcacaccca gatttcaatc 3840 gtccattcat tctatcaacg gatgcttcgc tggatggatt aggcgcagta ctttctcaga 3900 tccctgaagg tgagactaga gcccgtcctg tggcatttgc cagcaaggcc ctcacacgtg 3960 cccaaagcaa ttatcccgct caccgtttag agtttttggc tctaaagtgg tccatctgtg 4020 aaaagttcag ccattggttg aaaggtcatt cttttgcagt atggacagac aacaacccgt 4080 taacctacat cttaacgaag ccaaagcttg acgcgtgtga gcagagatgg gttgctaagt 4140 tagcaccata taatttcagc attcagtaca tcccagggag caagaacgta gtggcagacg 4200 caatgagtcg gcaacctttt gttcagggac gcataagtca gagacttgtg tctgaaccgt 4260 atgagaccct gatggagcag gcggagcagt ttaagaaaga tgttgtccaa catgcatttc 4320 gcattagtgc tcagtgtcag tctgctgaac cccaatgctt gctttcttcg gatgaagtgt 4380 cagccatact ggatagtcgg gcagagtgga gtgcaggacc aaaggataaa gtgcatcact 4440 ggctcattca ggacgtacac aacttggtcc cagcaggtca ggacacatta ccaatgttta 4500 cactgggaga gcttcagaat aaacagttag cagaccctgt cttggccaga gttctttctt 4560 atgtcactcg tggcagaagg ccatcaagac gtgagaggac aaaagaggcg ccaagggttc 4620 tgaagactct taaacagtgg gaaaaactaa agctcctcga cgggattctg taccgtgtgt 4680 gtaaagatca gctcacagga aagaaacgat ggcaatatgt tgttcctact tcactgactg 4740 atcaagtctt gacagggata catgacaaag ctggccatca aggccaatcc aggacactct 4800 acctggctag agaaagattc ttctgggtcc aaatggagcg agacgtcaga gaacatgtta 4860 agtgctgtaa gcgttgtgtg gtgagcaaga cgcctgagcc agaagcgaag gcaccgctcg 4920 agagtgtgag aacaactagg cctctggagc tagtgtgtat cgacttctgg agtgctgaag 4980 atcgcaaggg tggcactgta gatgtactag tagtcacaga tcatttcact aagatggctc 5040 atgcctttgc atgtagcaac caatctgcca gacaggtagc acgtcaactg tgggacagat 5100 atttttgcat ctacgggttc ccagagagga tccatgccga tcagggtgca aactttgaaa 5160 gtgagttgat ccaggagctg ttgcagtttg caggcatcaa gaagtcgagg acgactgcct 5220 atcaccctat gggaaatggt cttactgagc gctttaatcg cacactaggc agtatgataa 5280 gagcattgcc cccgagagat aagcaaaaat ggcctcagat gttacagacg ctaaccttcg 5340 cttataattg caccgctcat gaatccacgg gatatgcacc attctacctg atgtatggaa 5400 ggatacctcg tctgccagta gatgttatgt ttcacagcat tggaagagat gatgacatta 5460 tggactatga cagctatatt gcaaagatga gggatgatct taaggaggct cttaacttag 5520 cccaagcaaa tgcgagagcg agtcaggaac gacaggcaga actctacaac aaggggacaa 5580 aaggctgtga cattgaggaa ggagatcaag tgttgttggc taataaggga gaaagaggcc 5640 gcaggaagtt ggctgataag tgggaatcta taccctatgt ggtgatctca aaagactcta 5700 gatgtcacac ttttcgcatc aaaaacacca acacgggtag agaaaaggta gtgcaccgta 5760 atctcatgct gaaggccaat ttcttgcctc ttgaagtgga agtaggagag tcttgtgacg 5820 gttcactatc cagtcaagaa caatcagatg gaaccttaat ggaaagctta acccagaatg 5880 atcctgtaga aacagagcgg atcgttgagt gggtggcttc cttacctgca aaagacgatt 5940 cagtgagagg aggatcagac atagggtatc aggaggatcc agtagattct ggtgagcaaa 6000 ggaaattaac aagcatccag ctccaagaga gtgaaaatga tagtactgga gtcaattctc 6060 ataccgttcc agcaactctg acagaagaac tcagtgttcc cagagttgat gcagtgccct 6120 gtccaagatc caccagttct ctgtccattt ccagtgcgcc ccctaatgtg tctaaaatcg 6180 taacccaggt acgaacacgg gctggtaggt tagttagacc tgtgaacaga ctgattcaaa 6240 acatgacaca gagagtaaat cacaacgctg tcgtaggggt taccaagtca cttttatttt 6300 aagaagttag gatattcggc caagtcatag gaatataggc aagcaaaata acacagtagt 6360 caagttgagt agtgtaagag gcacttcttt agcaagggaa tatctggaag cctgatcaac 6420 tttcatttgt tctttctccc gggaggttcg ggtaacaagt atgttgcatg actaagtggt 6480 gtaaattgtc agaaattatt tgacttaatt gcttccttat tgaattaaag tgtggtagta 6540 caagttttgt gaagttaagg ggggagta 6568 // ID CR1-38_DR repbase; DNA; ZEB; 3463 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 11-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-38_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3463 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 524-524 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 429..3323 FT /product="CR1-38_DR_1p" FT /translation="MTTFVELFIGLLIIGHLFSAGYTHITTNSDLIMVYTR FT EDLLQLRSADFDAPPNLPDCLRNRVTASTCRRKRGHRGGVRRRFRRRGNRP FT PLPAIILSNVRSLQQKMDELRIKARACFEYREAGLLALTETWLRPEIPSSL FT VEVQGFSLIRADRSAASGKSRGGGICVYVNERWCSHITIKSAVCNPDIELL FT CLSLRPFYLPREFGNILICVAYVPPSAKAAIAASAIRDCIQDQLERTPGAP FT VFILGDFNHCKLETVLSGYDQYVKDNTRDKNVLDKCYGNIKEAYTSRVRPP FT LSNSDHNTVFLIPTYKTLLKKNKPVVKTVNVWSEDSVEALKGCFLCTDWEV FT FHSEDINETATVITDYINFCTENVIPKKEIVIYANNKPYITKDIKDCIKRK FT KLAFHNHDHQGLRVTQKELNGLLRDARIKHKQCLENRLHMSNSKQLWDSMK FT SMSNMQSFTPSISIFDDSQKVNELNDFFLRFENCEYTPDNSSEVMSPRGVT FT YDRPLIDPKQINFYFKHIGSNKSAGPDGILPYTVKCCADELTPVWVPIFQK FT SLDSGIVPSLWKKSLIVPVPKLARPSEYNDFRPVALTSVIMKCFEKCVVSI FT VKSEVASNLDPLQFAYRYGRGTEDAVNSISHLILKHLDNPKAYARLLFIDF FT SSAFNTIQPHLLIEKMEQLNINPFICKWYFSFLTNRTQVVKVKNICSEEKN FT SSTGVPQGCVSSPILFTLFTNECISKHSQNFIFKYSDDTAILSLLYEKDDI FT GVIYQSEISQFIDWCDRNKLKINTKKTEEMVFDPKCIGDHRQVVVHDDPIT FT QSDTYKYLGIHIDSSFTWKTHIDWVCSRLHQRLYFLRRLRLFGVNKKLMML FT FYKAVLESIVRYGITVWFGNLSVQCKSKLMHLLKTAWKIVGHNENFNPQVL FT YENCVLNQVDRIVKDPTHFLFPQYELLPSGRRYRVPKCRLNRYKKSFVPVS FT VGLLNKSIS*" XX SQ Sequence 3463 BP; 1051 A; 632 C; 713 G; 1067 T; 0 other; aaacggctta cagagattat gaaaaaaaat aatacaattg atcatctacc ctcaacatca 60 ccttcagaag aactccataa gaaaatactt attctgcaag cataatatga ctccttgaca 120 agtgaaaatg cagaggatct ccacataaga tcaagccaga ttcattatga acatggtgaa 180 cgagctggca ggtgatcgtg ttaccaactg taacaatcct cagcagctgg ttttatagca 240 gctgttagta aggatgatgc agtattatta cttcactatc ttccggtggc gctgcactgc 300 tggcgacacg ggcacttgtt gctgctgacg ttttttgtga aattactttt gaacttgctg 360 gtatacttaa ttatttaaac tttgtggcgg tacactggta ctttagtagt ttcgggtgca 420 atatcagtat gacgactttc gtagaactct ttattggcct gctaataatc gggcacctgt 480 tttctgccgg ctacacgcac ataacaacga atagcgacct gatcatggta tacacaagag 540 aggacttgct tcagcttcgg tcagccgact ttgatgcacc acctaacctt cccgactgtt 600 tgagaaacag ggtgacggcg agtacatgtc gtcggaaaag aggtcacaga ggcggcgtcc 660 ggcggcgttt tcgccgacgt ggaaacagac ccccgttacc tgcaattatt ctaagtaacg 720 tcagatcatt gcaacagaag atggatgaac tgcgcatcaa agctagagcc tgctttgagt 780 atcgagaggc gggtctcctg gcactgactg aaacatggct tcgtccggag attccgagtt 840 cactggtaga ggtacaaggc ttttctctta ttcgagccga tagatctgcc gcctcaggaa 900 aaagcagagg aggtggaatc tgcgtttatg tgaatgagcg ttggtgcagt cacataacaa 960 taaaatctgc agtatgtaat ccggatattg agctcttatg cttgagtttg agacctttct 1020 acctcccaag agaattcggg aatatattaa tctgtgtggc ttatgttcca ccttccgcta 1080 aggcagctat agccgccagt gctataagag actgtattca agaccaactt gaacgtacac 1140 caggagcgcc ggtgtttatt cttggtgatt ttaatcattg taaattggag actgttcttt 1200 ctggatatga tcaatatgtt aaagacaata caagagacaa aaatgtttta gacaaatgct 1260 atgggaacat caaagaggca tacacgtcca gggtaagacc tcccctttca aactctgatc 1320 acaatactgt gtttttaatt cctacgtata agactctttt aaagaaaaat aaacccgtgg 1380 tgaagactgt gaatgtttgg tcagaggaca gcgttgaagc cttgaaaggc tgttttctct 1440 gtactgactg ggaagttttt catagcgagg atattaatga aactgccacg gtaataacag 1500 attatattaa tttctgcact gagaatgtga taccaaaaaa ggaaatagtc atctatgcca 1560 acaataaacc atatatcaca aaggacatta aagattgtat taagaggaag aagttggcct 1620 ttcataatca tgatcatcaa ggattaagag ttactcagaa ggagttaaat ggattattaa 1680 gagacgctag gataaaacat aaacaatgcc tggagaatag actacacatg tctaattcta 1740 aacaactgtg ggactctatg aaatctatgt ccaatatgca gtcttttact ccctctattt 1800 ctatttttga tgactcacag aaagtgaatg agttgaatga ctttttctta agatttgaga 1860 attgtgaata caccccagac aacagttcgg aggtaatgtc tcctagaggt gttacttatg 1920 ataggccact gattgatcct aagcagataa atttttattt taaacatatt ggctcaaata 1980 aatctgcagg tccggatgga attttgccat acactgttaa atgctgtgct gatgaactga 2040 ctccagtttg ggttcccata tttcaaaaat ccttagactc tggtattgtt cctagtttgt 2100 ggaaaaaatc actaattgtt cctgttccta agttggcgag gccctctgag tacaatgatt 2160 ttagaccggt ggcattaaca tctgttatca tgaaatgttt tgagaaatgt gtggtgtcca 2220 ttgtaaaatc agaagttgct agtaatttag atcctctaca gtttgcctat aggtatggca 2280 gaggtactga agatgcagtt aatagtattt cgcacttgat tcttaagcac ttagacaatc 2340 ctaaagctta tgctcgactg ctttttattg attttagttc cgcttttaat acgatccaac 2400 cacatttatt aattgaaaag atggaacagc tgaatataaa cccattcata tgcaaatggt 2460 atttctcatt tttaacaaac agaactcagg ttgtcaaggt taaaaacatc tgttcagagg 2520 aaaagaacag tagtactggg gttccacaag gctgtgtcag ttcacctatt ctttttacac 2580 tttttacaaa tgaatgtata agtaagcatt cacagaattt tatttttaaa tactctgatg 2640 acacagccat tttaagttta ctttatgaga aggatgatat tggggttata tatcagtcag 2700 aaattagtca gttcattgat tggtgtgata gaaataagct aaaaataaat actaagaaaa 2760 cagaagagat ggtatttgac ccaaagtgca taggtgacca tcgtcaagtg gttgtacatg 2820 atgatcctat tactcagtca gatacctata agtacttggg cattcatatc gacagttctt 2880 tcacatggaa aacacacatt gattgggttt gttctcgttt acaccagaga ttgtattttt 2940 tacggaggtt gcgtcttttt ggagtcaaca agaagctcat gatgctgttt tataaagcag 3000 ttcttgaaag cattgtcagg tatggcatta cagtgtggtt tggcaacctt tctgttcagt 3060 gtaaatctaa actgatgcat ctcttaaaga cagcatggaa aattgtcggg cacaatgaaa 3120 attttaatcc tcaagtttta tatgagaact gtgtcctcaa tcaggtggat aggattgtaa 3180 aggatccaac tcattttttg tttccccagt acgaactgct tccctctggt agaagatata 3240 gagtcccaaa gtgtaggtta aaccgttaca agaaatcctt tgtacctgtg tcagtgggtt 3300 tacttaataa gagtatatct taattggcac cataaatttt gtaattgttg tattttatac 3360 tgttttttat tgcttgtttt agtgagtgtc tgcagcaata tggcgatgcc caaaacaaat 3420 ctcctgtcaa ggacaataaa gttttactac tactactact act 3463 // ID DNA-X-9_DR repbase; DNA; ZEB; 1151 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.11, Created) DT 17-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; DNA-X-9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1151 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1735-1735 (2008). XX DR [1] (Consensus) XX CC TSD is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1151 BP; 346 A; 230 C; 213 G; 361 T; 1 other; ttaaaggtgc cctataatga aaatctgggt ataccaaggc atagtagaat aataaaagat 60 cagtatatgg aaatgggcat actgtgagcc tcagacacca ctgtttcctc gttctcatgt 120 aaattccaca agtgtaaaac cccggtggac aacgggccaa tcagaagaca aaacaaactg 180 tgatgagact cacgggccat gccctctgca tttgcatgta tgcctacttt aggtcattca 240 tccagacctg acgagcagcc acgtcatcaa gagaacacga ctcatatgca gctctgagat 300 cataaaaaaa gctctgagat cattaatatg cacgcctcag gctgygtttg ataagccaca 360 acgttttcta gtgatacaac agtgataatt ttcctccctt attgttattt ttaagcatat 420 atttcactta ctatgtatgt gggactttta ttttgaaaac gcgagcccca aattgtttac 480 tgtgcattac tgggcaacga cagcacgaga catgttttaa actcgcctca ggatgcaaaa 540 tgcaagattt acagattaca catttattca tctccactct gaagaggcat aaggtatgtt 600 tagttacatc ttttgttaat ttatcgtgtt tattgtgaaa tttgatgcgt atgtttcaac 660 aaacacatgt agactgctga atcgtgtcaa atcactcagc tcaactgtca tctgttcatg 720 cagaggttga tattaattcc tggctgtgtt ttctttgact atagcttcaa aatacaacgc 780 gatacatctc tatcactctc ttttgtcaag tttaattata atttagcctt tatccacaac 840 agatctggtg agcaaaatag gttaacgtta ctctgattgg gtttatattt gtccgtgttc 900 agcgtacatc atgctctgtg tgtgtgtctg taagagcggc acgtcagagc ggagctcatt 960 aatattcacg acacgaacca tatatggtca atcatggacc ttctgattct atgggtattt 1020 tatggagtaa taaatgagct tataaaaccg tttctggaga tttttttaac ttaccgaagc 1080 cataaaccta ctatgtagat atcagagaac aatttaactt attaaaccaa tgcagtatat 1140 ggcaccttta a 1151 // ID Helitron-1N1_DR repbase; DNA; ZEB; 989 BP. XX AC . XX DT 30-NOV-2007 (Rel. 12.11, Created) DT 30-NOV-2007 (Rel. 12.11, Last updated, Version 1) XX DE A family of non-autonomous Helitrons - a consensus sequence. XX KW Helitron; DNA transposon; Transposable Element; Nonautonomous; KW Helitron-1_DR; Helitron-1N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-989 RA Kapitonov V.V. and Jurka J.; RT "Helitrons on a roll: eukaryotic rolling-circle transposons."; RL Trends Genet 23(10), 522-529 (2007). XX RN [2] RP 1-989 RA Kapitonov V.V. and Jurka J.; RT "RPA70-encoding Helitrons in zebrafish."; RL Repbase Reports 7(11), 1179-1179 (2007). XX DR [2] (Consensus) XX CC Helitron-1N1_DR is a nonautonomous deletion derivate of CC Helitron-1_DR; that was transposed by the Helitron-1_DR-encoded CC proteins. The genome contains several hundred copies of CC Helitrons. Many of them form clusters composed of 2-20 tandemly CC repeated elements (du to the rolling-circle replication). These CC elements, like most of Helitrons, are inserted into 5'-A|T-3' CC target sites without their duplications. They are frequently CC inserted into (TAA)n microsatellites. The 3' terminal region CC (pos. 975-987) is complementary to the 5' terminal region (pos. CC 23-35). Probably, this complementarity induces formation of a CC circular DNA structure important during the rolling-circle CC replication. The standard 3' short palindrome is at pos. 955-972. CC There is also a short 12-bp palindrome at pos. 6-17. Numerous CC copies of Helitrons-1N1_ZF are <3% divergent from the consensus. CC Therefore, transposition of this family has occurred quite CC recently. XX SQ Sequence 989 BP; 311 A; 178 C; 235 G; 265 T; 0 other; atatagctgc aagcagcaat taaggggcca agcacttttg gccaaaaggt ggcgctgctc 60 cagaagtttt tgagtacttt cagggcatgg ggttgaagat gcataccatg ttttgtaatg 120 atatgtgaat gtgttggtaa aatatagcat ttttggccaa aaatcgaaat gggcgacgcc 180 caaaatggct gacctgtgaa aatcagacat cattcgactc gacatgctct gccggatgta 240 atgagaccag ttttatgagt ttcggacgaa aggttgagac gttataagcc aaaaagcaag 300 ttttttgtat ctccggacca ctagggggca gtgcgccgaa actccgcata taacctcaga 360 ccctagttgt cataacacac accaagtttg gtgtgaatcg gtgaatgcgt tacggagata 420 tcgcctcaag tccattttcg caagttctac gttaaatttg atcgcaagtt aaacgaaaac 480 ggttggtcta atcaacttga attccataac ttttggttgg catggtctgt agatcatgtg 540 attcaatttt ggtgaaaatc ggagaaacgg cctaggacga gttcgatcaa ataggttttt 600 cgaaaaattt aaaatagcgc gaaaaaattc atgacggaaa atgacgtcat agggtgcgtt 660 tgaatcggac tgagccaagg aatcagagga aaaaagaatt ttgattgtag cccattcggt 720 tcaaaagtta tgatgataaa catacgtgaa agtttggaca agtggtggcg ctagagagtt 780 tgattttgag acttcatatt tggtctgatt aatgttaata ctgtcctcta tcattgtgcc 840 aaattataca actttcccgc atacgcctat atgggctgcc attcaaatcc ggcggaagaa 900 acggaagaag aagaagaaga agaataataa taataggaac actaacgaaa gcaataggtg 960 cctacgcacc tacggtgctt ggcccctaa 989 // ID DNA-3-2_DR repbase; DNA; ZEB; 973 BP. XX AC . XX DT 22-OCT-2008 (Rel. 13.1, Created) DT 22-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 3-bp; KW DNA-3-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-973 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1225-1225 (2008). XX DR [1] (Consensus) XX CC Members of this family are ~92% identical to the consensus. TSD CC is 3-bp, and TIR is 58-bp. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 973 BP; 236 A; 206 C; 231 G; 300 T; 0 other; ggccccgttt acactgtcag gtcttgatgc ccaattccga tttgttgcct atatctgatt 60 tgttttgcct gtctgtttac acgttgtttt taattgtgac ccatatccaa ttcatgtgtt 120 tacacttgcc atacaattta cgatgtcgcg catgcataaa ggcgggttct agcatctgtg 180 ccacactagc cagcgatgga agaaattgtc ttgtttttag ctctgcgaaa tatgcgaagt 240 gtagtataag cagtgaatgg tttagtccag atttaccccc acgcagttta tgtaatggta 300 tggccctgat gtgtgtgtgt gtgtggttgg aagatgtagc ctttgcctgt gtgcgcactg 360 gtgttggaga gaataaagtt gttctatgtg tagcccgcag tgcgtgtatt aatagcgaca 420 aaaagcaaaa gttacaacac gaagttcgcc caactttaaa atgattttga ggcggaggag 480 gtgggaaagg gccgtcatcg cagcttgcgc ttatggttta tatgctttat gatgtcctcc 540 accattcaga ggaatttgtg ggtacgagag agatctcagg tctggtggga gcaaattgtg 600 gcgactgacc actttcctcc tccgtgtttc ctctgttgtt gttgtttacc gcgtcggcat 660 ctccacacac gctcttcctt ctacgtcatt aaaccgcgga gaagcaccac attgtcgcaa 720 gtttaatgac gtacagaacg gaatgcctca ataaatccga tctgcctgtt tacatgacag 780 tcgcattgtc acatatccga tctatatctg atttatttcc acatatgaag gaggcctgaa 840 accgatctct gaatatccga atgcatgcgt tttttttcgt gtttacacgg tcatagaaca 900 gatccgatct gtgtcacata tgagcaaaaa aatcggaatt gggtcacatt taactggcag 960 tgtaaacggg gcc 973 // ID DNA-X-12_DR repbase; DNA; ZEB; 16684 BP. XX AC . XX DT 04-FEB-2009 (Rel. 14.02, Created) DT 04-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE nonautonomous DNA transposon - consensus. XX KW DNA transposon; Transposable Element; KW nonautonomous DNA transposon; DNA-X-12_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-16684 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 537-537 (2009). XX DR [1] (Consensus) XX CC The full-length of this element is ~27-kb before masking out CC other TEs. It seem insert into GT/CA satellite. TIR is ~17-bp. CC The TSD length is unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 16684 BP; 5239 A; 2660 C; 2757 G; 6008 T; 20 other; gtgtgtgtgt gtgtgtgtgt gtgtgtgttg gcatatgtgg tttatggaac caattttcag 60 gtttacccac cagcattatg agaccattta gtttatgaga ccatttttgg tggtctcata 120 aactaaaaat ttaagttgat attaaaaatg gtttaaatga tgattattca ttacaatgac 180 ctaaatgata attataagac ttgtttcaaa agttcacatt gttatttggt tattattaca 240 cttggtgtgc tttttatgac ctttacaaat atggtcttgt aaaccatgat gtgtatgtgt 300 gatctttgaa cagcgcctct gtaggcggaa cttggtactg cacttctata cacactcgcg 360 ccaaatctta cacaatttcg cgccatacat tcaatcgcgg gaaaacgtcc ctcatgaaga 420 ctgtggacgt gacagtcgcg cttcaagcag tgaaactcac atcaaacaag taagtcattt 480 gtttttaata ctttgtacat tatttattat tatatgtact gtattattcc acttgttatt 540 caaaatagca agtttattta tatatttata tgcggcgcgt actgttttag acaataataa 600 ttagtgactg ggaacagaag atgagaacaa ataaacgtta gaaatatgtg tttgtaatgg 660 cagtatcaga cgtagttctt caaaatgtgt atttgctaat aatagcacgc gctcggcctg 720 tatctggctc ggaagcgccg gtgtcgattt aacccagaac cgggcagagc tcactcctcc 780 gcggtaaaca attcgccgtc tgattccatg agatgtaacg ttataataga tgcactatgt 840 aatttgtgat tttttttcta cacatacttt tatgtgagaa gcaatgacca ttcgaatatg 900 gtataaataa catggtaaca aggtatatta aacataaata tgacgtagaa acaatttact 960 aaacttgaat tattaagcat gattgaacaa aatgtttgaa tttagtaatg taaatgtgtg 1020 gttttatttt ttctaatgtt tttttgtgat aaacttgtgt tcagttaaac tgtacatttg 1080 aataaatgaa cttttattat gaaaagaggt taactaaccc tagttaaata actttatttg 1140 atcaaattgt caaaatagtg cactactaac ttatattttg ttaaatggtt tgaaatgtga 1200 taactgtatt atgaaatttg ataactatat tactattttg ttctgttttg gtcttcttta 1260 gagaatttta cactacgtca catatattaa tactgctttg catatgtggt attgtattga 1320 tattataaac tgtgaagttt aaccttatat aatattttaa cagtgaatga atggtggtaa 1380 gagtcaaagc agaggattaa atcctttgaa agatgctgct catcatgtga ggttggggaa 1440 ggataaagac atgcttacgt ggaggtacat cagtgacatc aaaggtacca aagttttgta 1500 catgtttttt cttactgcag gtagtcccca gttaatgttt cattataagc atagtgtata 1560 aaatgttaag atgcaaaaag gagtttaatt taacacaaaa aaattaagtc atgaatgatg 1620 atttgatgac atgacattat aaatgtcaac aaataacatt taaataataa taataaaaag 1680 aaatgtccta attgaaaacc acactttaaa atgttaatgt ttaatttaag agtctaaaaa 1740 atgctaatat taatgatttc catattagtg aaaagttaaa actattaaag attgtacact 1800 atccagggcc tgatctctaa atagttattt taaatatata ataaagttat tttcaaatgt 1860 tagcaacatc ttaatagaat tcatttgtat ttatttattt gtgatgttag tcactagggg 1920 tgtaacggat cactgttgat ccgtgatcga tacggatcac aacccacggt tcggaatgcg 1980 cgtgacccgt agattaataa ctttttttga acttgtaggt taatccaaca tttataacaa 2040 tcgcagagag atcgcctccc gtgtcattca aattgcatga atgaaagcat tttggcttcc 2100 ctgtaaaata taatgatgac gttgtgggac ctacctaaat gcttttggct attaaggtgt 2160 ttttttgtca ctgtttgttt agcctgcatt aatgcattca ttatggtaag aataaaatca 2220 ctgtttaatg aaacaagatg ctggtgaaag caactacatt gtgtaaccaa cagatggcga 2280 caaacaaaca tcaaaattat gtcattgaat aggttttcaa atataataca cctacagtat 2340 aatgaaacat gaagttttat gtgtgaattg ttgaatcatt aattgaagat aaatatatgt 2400 tgtacaagat ttctatatat tttgcattat cagcaacagt agcaaagatc atatttcata 2460 ttgaatactt ttttttagct ggttatttcc tgatttttat tttttattaa aattacaggt 2520 tctatattct gtttgttaaa agcaatgttt tatgcagtaa catttttgta taaatggaaa 2580 gcaaattgtc tcctcccttt tttgctgatc caaaaaatgg tccgatccat aactcaaaaa 2640 acgtagagtg atccgaactg cagattttgg atccgttaca ccactattag tcgccctttc 2700 ccaaaccctg gttagcctgt cgattatcat ttgccacaac tagcgctact tcacccatat 2760 tctcctacct catggctctg ttgaagagca tttaataggt tgcacatgcc ttaataaagg 2820 tttctgaggg cgtactcgca ctgtacagtt gccatgaacc aggctgaagc atgcttttcc 2880 ccccttccgt ctccccctac ggcctgcact cacattgcat tcgtgcctgg gcactcttat 2940 gtcatcgaac atgtgctgtt cagtaagagc tctcgcaaag cacagtggag aatagtaata 3000 aatacctttt aaaatatata ttttagtact gcttcagtta acttgaacaa gtgtacttat 3060 actatcaatt ttcacattaa tacagacctc ttgagttctt aatccgggcc tgacactatc 3120 aaatagtaaa gtattagata attgttttgg gccagtttcc tgtacagtgt actttttgtg 3180 cactattaaa ccaacataat gaattttttc aatgatttag tttcttctgt atcagttttc 3240 actgtttgtc ctgaaaagtt tgtgatgcat actctgtgga gtcagtgttc atgtgttgac 3300 agaactgttt ttatttggca gagttacaat tttaatcttg tataaaaaaa gtaccttgtg 3360 tttaatttta aaacattgtt ctgcacgata tcacaacata caattgtaat tttagagtgt 3420 gtcatgtgga ggagttgacc ttctctcctg taacaaacat gaaaaagcaa aggtagggta 3480 tattggattt tattttaaaa attggcatat tttgaactgt atgtagtatt aaatataatg 3540 tttaataatg tctgtttagc acacacttga gttgttttag ctttattttc tgtgttttga 3600 ttatgttttt cagtctcttc atgagccatt atctttgaat tctgaacaga gtgagcttca 3660 aagaggtttg tctatataat tgtcttctaa atgttgatct gaagagatat tttcttattg 3720 ttcctgtatc tgtgcagtta tttttatata ataatacatt taatatgggt gtagcgatac 3780 attattatat tgagaatcgc aatataacaa ttcaaagttt tttctttaga ttagcctcta 3840 ttcacacggt gagtattatg cgtaagatta aacgtttaca cagtggcagt ataaactact 3900 gttgttaaat gatgaaaata atgcaagttg aaatctgatg atgacaaatg cctgattata 3960 ccagtgaaaa tgatctattt gcatgtggac cataagagag tgcaagcata catctaacac 4020 tcatcttcag aattttgaat ttgtattctg atgtcatctc tgtcctgtgc attaaagacc 4080 aatttaaagt ttaacatgcc agatctatgc aaaattgagc attttaacca ttactaggcc 4140 tacatattat aataatattg ttttttacca catttgagaa ataacaaata ttactaatat 4200 ttaaagcttt ttttatagct tacaatattg agatatcaag agaaacaatg tcaagtttgc 4260 tgacctagca ataggctatt gttttaatat ctgtgtgttt gctcatggtt tatggggacc 4320 aagtatgtct actgaatgtg acttagcagt aggctattgt tttgttaaat ctgtttattc 4380 atggtttatg aggacatggt ttgttatgcg agacttaaag ctttcagtaa gtgaatgtca 4440 tgttttcttc atctttcttt gcttctctct ttcagcgcct ccatgagcct gtggctttaa 4500 ccaacacaaa cagagctgtc ttctacagag gtttgttaat tattttgtca tgctaatttg 4560 acactccgtg ctaaaatgac agtttcatat ttctactaca ttcaacgtag cagtcctgtg 4620 tgtttagtca tggtttatga ggaccaagta tgtctatttt atgtgagtta gcagcaagct 4680 attgttttat taaatctgtg tgtttactca tagtttatga ggacatgatt ggttatgcaa 4740 gacttgaagc tttcagtaag tgaatcatgt tttcttcatc tttctttgct tctctctttc 4800 ggtacctcca tgagcccgtc gcttggacaa acacaaacag agttgttttc tacagaggtt 4860 tgttaattct tttgtcaagc tattttgaca ctccgtgcta aaatgacagt tacctatttc 4920 tgctaaatgc agtgtagcag tcctgtgtgt ttagtcatgg tttatgagga ccaagtatgt 4980 ctattagatg tgacttagca gtaggatatt gttttaataa atctgtgtgt ttactcatgg 5040 tttatgagga cacagtatgt gtaggtttac atggtgctas cagtagacta ttataacatg 5100 tgtttcttca tggtttatga gggcattttt tgttatgcga gacttcaaac agttagtaaa 5160 caaatgttac attttcttta tctttgtttc tctctttcag tacctctatg agctcgtcac 5220 tttgacaaac acagagttgt cttctacaga ggtttgttaa ttattttgtc aaactatttt 5280 gacactttgt gctaaaataa cagtttcata tttctgctaa atgcagtgta gcagtcctgt 5340 gtgttttgtc atggtttatg aggacactgt atgtagaatg caatgttagc agtagactat 5400 tgttttataa cgtgtgtgtt tcttcatggt ttatgaggac atgattggtt aaacgagact 5460 taaagctttc agtaagcgaa tctcatgttt tcttcatctt tctttacttt tctctttcag 5520 cacctccata agccagtcgc tttggcaaac acagagttgt cttctacaga ggtttgttaa 5580 ttcttttgtc aagctatttt gatacttcgt gctaaaatga cagtttccta tttctgctaa 5640 atgcagtgta gcagtcctgt gtgtttagtc atggtttatg aggaccaagt atgtctatta 5700 gatgtgactt agcagtaggt tgttttatta aatctgtgta tttactcatg gtttatgagg 5760 acacagtatg tgtaggttga catagtgcta gcagtagaca attataacgt gtgtttcttc 5820 gtgatttata aaggcatgat ttgttatgcg tgacttaaag taatttaaca atgttaaaaa 5880 aatcttatct ttgcttctct ttttcagtac ctccctgagc ccgtcacttt gaccaacaca 5940 gagttgtctt ctacagaggt ttgtttattc ttttgtcatg ttattttgac actccttcgt 6000 gctaaaatga cagtttctta attagactaa atgcagtgta gcaatcctgt gtgtttagtc 6060 atggtttatg aggacacagt atgtgtaggt ttacaatgtt agcagtagac tattgtttta 6120 taacgtgtgt gtttcttcat ggcttatgag gacatgattg gttttgtgag acttaaagct 6180 ttcagtaagc gaatgtcatg ttttcttcat ctttctttgc ttctctcttt cagtacctcc 6240 atgagccagt cactttgtca aacacaaaca gagttgtctt ctacagaggt ttgttaattc 6300 ttttgtaatg ctattttgac acttcatgct aaaattacag tttccttttt cagctaaatg 6360 caatgtagcg gtcctgtgtg tttagtcatg gtttatgagg acacagtatg tctatttaat 6420 gtgacttagc agtaggatgt tgttttatga aaactgtgtg tttcttcatg gtttatgagg 6480 acacagtatg tgtagtttta catgtgctag cagtagacta ttataacgtg tgtttcttta 6540 tggtttataa gggcatggtt tgttatgcaa aacttaaaac agtaagtaaa cgaatggtaa 6600 attttcttta tctttgcttc tctctttcag taccttcagt cactttgaca aacacaaaca 6660 gagttgtctt ctacagaggt ttgttaattc ttttgtaatg ctattttgac actcccgtgc 6720 taaaataaca gtttcctatc tctgctaaat gcagtgtagc agtcctgcgt gtttagtcat 6780 ggtttataag gaccaagtat gtctattgga tgtgacttag cagtagacta ttgttttatt 6840 aaatctgtgt rtttcttcak ggtttatgag gacatgattg gttatgcgag acttaaagtt 6900 ttcagtagtt ttcttcatct ttctttgctt ctcgctttta gtacctatat gagcccgtca 6960 ctttgacaaa cacaaagact tgtcttctac agaggttttt gtaattgttt tgtcaagcta 7020 ttttggtact tgaaaccaat catgtcctca taaaccatga agaagcacac attataaaac 7080 aacagtcttc tgttaacatt gtaaaactac acatactgtg tcctcataaa ccatgaagaa 7140 acacagattt aataaaaaaa aacattctac tgctaagtca catctaatag tcatacttgg 7200 tcctcataaa ccatgactaa acacacagga ctgctacact gcatttagca gaaataggat 7260 actgtaattt tagcacagtg ctaaaataac agtttcctat ttctgctaaa tgcagtgtag 7320 cagtcctgtg tgtttagtca tggtttatga ggaccaagta tgtctattgg atgtgactta 7380 gcagtagact attgttttat taaatctgtg tgtttcttca tggtttatga ggacacagta 7440 tgtgtaggtt tacaatgtta ccagtagact gttattttat aacgtgtgtg tttcttcatg 7500 gtttatgagg acatgattgg ttttgcaaga cttaaagctt tcagtaagcg aatgtcatgt 7560 tttcttcatc tttctttact tctctctttc agcacctcca taagccagty gctttgacaa 7620 acacaaacag agctgtcttc tacagaggtt tgttaattct tttgtaawgc tattttgaca 7680 ctcctatgtg tagacttmca tggtgctagg tgytaaaatt ttcctatttt tacttttagc 7740 tgttttagtc attytatgag ccagtcgatt tgactaacas aragttgttt tttacaragg 7800 tktgttaatt ctttttacaa gaattaacat actgctgtag aaaacaacty tgtgmmtggy 7860 tyattagaaa agtccaatag taggaaaaat aataacatgg aagtcaatga gtgcacgttt 7920 taatttatta agaaaataaa ctaataataa ttaaaataat aaaataattt aaataagtga 7980 ataataatgt ctctctcact ttctcccttc aggaaataac aggtgaatct tgacagaaat 8040 ccaggaagcc agacagtgat ggtggtgatg gtaagaattt ctgtcaataa caatgcagga 8100 agaaactaag taactaaaat tactttraaa ataatgtgca ttttaggcct tttttatttt 8160 aagcaaattt taaaactatt gcgtcgtaga gagataaaca gtgtttgtta ttttaaagtg 8220 ataggactaa gatggtcaat aaataagctt tttatttacc tattaattca cagttttaat 8280 ttacattttg ttgattatat tttactctta tttttaccta ttaaaaatct tatttttttc 8340 ttattattac tcttatttta cttttattat atttatattt tactattatg ttattttgtc 8400 tcaattactg tacattaata gcaaattaat atttaagtag gcctaattat ttatttaaaa 8460 acattaccag cgaaatactg agaaacggtc tggcgctatt tttatataca ggcctatcat 8520 aattttaaag aaagctgcaa ggtgtgctac attattttaa attatttttt ctgtctgtta 8580 cttgaagttc agtgcgtgtg cgctgagaga caaggcaata ttgacaaaat attgaatttt 8640 gttttttata ttgcattgca ttaatttgca gagatgtttt atagtttata caggcctaaa 8700 ataaacgctt tggtacagag ttattatttt aaaacaaacc tgtgtttgta atgatagatg 8760 taggtgtcac aaaatacttt ttaaatcagt gtttttatta atatgattat tatttattgt 8820 gaacaacact taataaatac tgaaatgtcc ttaatacaca ccttattttt ttatcaatta 8880 cagtgcttcc cgcatgtgtg tgccgcatac acacaccatg cggtctcatt catgctttta 8940 aaacatgaat gattcgaatg tacatcaatg aatgatgcac atgccgtgct gcaaaattac 9000 aatgcatgct tttagtaata ttcacgtgaa actgacttcc cgyatgttta taatatactt 9060 gcgrtggtag gaggattttt aaataaaaat atagtcaagg taaaatgttc aaaaactagg 9120 tctgactcac tccttttctt ccgccgtctc attaacacac ctttattaaa ttacaataat 9180 taacaaattg cacaaacaaa cagtaatatt taagagtaaa gtaatgagct atttaaaagt 9240 acactttaaa gaaaaccaca tttacaagag agactgtgcc ttaaagcata tactgctctg 9300 tcacactgct tttatttatt attttaggtt tgtttatgaa tgaattattt aagttaaata 9360 tttcatgtat gtggtttaat atttaattgg tcattaaatc aaatgaagaa gctagctact 9420 ttattttttg acacttcaac cgatcaataa acaaagaaat gaacaaccac agattatttt 9480 attctctctg taaagtattt agatacttag actatactaa ataattagac tattctatat 9540 agtttttaac attgtttggg ctcagttggg attatttgtt aatgaactgc cactcatctt 9600 tccattaaac attttgatat taaaattatg ttttaacttt atgtctccaa gattgcattg 9660 taattttcta agttaaacta aactagttaa actagacaat taaaatatgc tgtaaaaaag 9720 tttaggttga taactatttt attttggctg gaaatgattt actgctggtt ttatctgacc 9780 agagaacaac acaatgttgg aattgacgta gtagcattgt gtaaagtgcg atatgagaca 9840 gcatatttaa tccttgcact attttaagtg aatgcatatg tattccaccg actccaaaat 9900 taggagttaa gagtctcatc caaaaaaccc agaaccgaac atccctaatc gacgccccat 9960 tgatgtttct ttgaacatca ggtgccatta ttaactgttt atcaagttaa atttcaaact 10020 gaacctataa ctgtttttga ttaattcaaa gtatagccca gactattttt tccttttcaa 10080 aagcagtagc agtatagcct aaacaggaca gttttacaaa tgtacagata agcattttta 10140 taactgaaca tcacaaaagg cctacaagaa ttttcagtcc atttttatat acaggcctat 10200 tataatttta aagaaaactg ctaggtgggc tatattattt taattctagc ctatattatt 10260 tttattaatt tcctctgtat ctgtcacttg cgattcaatg cacacgtgct gagcgagacg 10320 agaaataagg caatattgac cattttttat aggctatagg ccactatagt atataatagt 10380 ttatttttac atatagtatt gcattaattt gcattgacgt tttgtagttt gtagaggcct 10440 aaaatgaatg cctcgatttg gagttattta aaacataatc ctgcgttttt aacattcttt 10500 aaaccaatgt tttttattat tattatttaa tgtgaataac ttttaattaa aaattgtaat 10560 gtatttatta caccacctta ttgggaaaaa aaaacgatca attacagtgc ttcccgcagg 10620 tttgaaatat aggacttgca gcattgaaac accatttacc cacatcacat agttaactat 10680 gcgacaaaca aaacttaatt gcatttttaa taataattgt atttattatg acactttttc 10740 ttttaatggg ttaaagtaaa aaaaaaaaaa aagaatgtta caaaagactg aaatcaaaaa 10800 gaaatgcttt atgctattta ggagcaatgt gcagccactg acaggtaaaa ttcgcttttc 10860 tctctcaagt tcaaagcaaa tatatgctgc gaaccacgtt tttttagcct gttaaataaa 10920 attataaata atacattaag aaattgaagt caggtaggac aacaaaagac aaccaaacat 10980 gttccccctg ttagcaaatg taaccaaaaa gtatctttac gtgcaacaag ctgctcatct 11040 gagggtgtgt tcagcacgat ggggaacata gtgagcccca caagatccca tcttaagcca 11100 gagatggtca acatattggc caggattcct cgcaaagaac aagtaaacct cactgttaat 11160 aaatgccaat ttacaatgtt ttatttattt tttttttttt aagactgtgt caacctgtca 11220 tttttgtaat gtaattttga aagtttttct tttatatttg tttcttgttc attcgtttaa 11280 tcttatttaa actttgtgca atttaattac ttatcttgct attttatttt cttattcgtt 11340 ttaaaaacat tctaacattc atcacaattt gaatgctaaa ggattttaga tgtgtatata 11400 aaatagccta tgtaaaatat taacatcatc aaattaataa aataatgaca aatttgaaat 11460 aaatgacaga aaagttaata aaaacagaca ttctctaaaa aatataaaaa tcgcaatatt 11520 aaaatgagta gattttttta aataaaatgc gttgattaac tattactatt ggtgtatata 11580 ttttacagcc agtctagatc tcattacgtc ctctcccgag tatatagccg aagtttttct 11640 tagtcgttta cattaaagaa tctattattt actttttctc tctgtatcgc ggctgagtgt 11700 gcgctgtcag ctgaatatag gcttaaatta aatctaaaaa gcaactagcc tgcgagctaa 11760 tatttgttct ttttttttcg cataacagcc aatttcaacc gcgaggagat accaacagct 11820 tgatatgcaa cattttctaa aatgttattt agctcggaat aaaaacaatt cacaaaaaat 11880 gatttagtgt gtgtgttttt ttaagattgg attagcctat ttcacagtgt gtaaacatta 11940 gaaataataa ttaggctaaa atacagtaga ttaccccttc attcagatat atgtcgttgt 12000 ttgggaaact ctcccctggt ctagaagctt ctctagcttc ttatctgctc cacaagcgca 12060 gcgcgcttct tcctctggct ctgcaccatt tcatttctaa tttcatatat aaatcattaa 12120 tggttgtagg cctaatttac agtaggctat ttatttagaa atagaaaaaa ctttttaata 12180 taatattaat tattattatt attattaaag taggatattg caaacaatat gcaaaataaa 12240 tgcaaacaat attccatgac tccaatatca attgctacga gcctaatata gaaattttaa 12300 ttaacacgta gttcagtcca taagttgctg tgaatagaaa cagcctatat aagcctacat 12360 taccagaagt tggcctatag gtaaaaggta tataaccggg gattttgtgt caaccttgct 12420 tgtgaaatat actcaagata tacccctgcc ctcatctcaa acaaaatgtt tggctattaa 12480 tgaagcagcc tgtctgaggt gttaatgggg cattgcgctt taaccgaaaa caacaacaac 12540 ccaaaagcac taaattggtt acaggggttg cttcagaaaa ttgcctaata ataatagtag 12600 tagtagtaat aataataata ataattatgt aatcaaaaaa aggctgcata ttaataagtg 12660 ggtcgcgggt tataatctaa aagctattgc cagtcaaaca cagcacagcc attacagcaa 12720 aatggagaga tttaaatgta catattgcac atatggaatt atccactcac tgcatgtcgt 12780 ctaccctgat aaacatgcta attgtcttat tattattaat attatattgt atgcacttat 12840 gaaattaaag tttatagata aacaatgctt aaacattaaa atgtgctttt agaaacttaa 12900 aatatttatg tttctaatat ttaattatac atttgctgtg tttaatatag gggatagaat 12960 agcatacagt atataaaaaa aattttgtcc tttataccga aataggccgg ctataagagg 13020 aggcgctcta ttaaacaagc agaaacagca cagactgttc actctgcatg catgtgaaag 13080 aaatgaacag agaaaaataa acagacgata aataataggc gactcacctt acgtgccgat 13140 atgcccattt atttaattca taactgccac taatagctat gcaaaaggga acctattttg 13200 agtgttttgc atgtagtgac agattttcaa atatttgtaa cctggcctat aataaataat 13260 aataattatt gataaagcaa gttatactgc aatcaggtga tgtgtttcat tgactcgata 13320 accttaatat tttgtagtgt gtgctgtgac gcgattttca tttcttgcaa tgtgcgcacg 13380 tttaagattt gaggtaataa catcgccatg atagtcaaag aaatgtaaac attcttttta 13440 ccccaacaca atttaatagg gcgctataaa ggctaaacaa cttcattctt atatttgttt 13500 aatttttgca gcctggggct atacttctca ccatctattg tatcttttct aaagcataca 13560 gtatagtgta aaagcatgtt tattactaag aaaaaagaaa aagtattttt taattgatca 13620 catcacataa tctactgtag gctacatata aatgaaggct aagtgcacac acacagcagc 13680 atgtacaaat cagattatga aaaatgatga agagcatgct tttcctttta tcctttattt 13740 ttagtttttt ttttttattt acacattttt ttaagtggtg aggacaatat tcactggttc 13800 ccccctcctg tataataatt aaatgttaca tttttaagaa cttcacaaat agcctaaata 13860 acacttttca gtatcagtga taatataaat agtaactaat aataataata ataataataa 13920 gaagaagaag aacaaaaaat attagcctat tgataacaat ataagtgaga tgttgaggaa 13980 aaaacagcgt aggctactaa ctctccactg tttttggttc atattttaaa ttatttgaca 14040 gaaaagctaa catgctgaca cattgtctgg attaagcaga cctagttgga gttgcaatgt 14100 ttccagcagc actcaataca cgttcagact ttctatcgca gtgcttgtag cgcaaatgca 14160 caaatgcttt ttcctatgct acatttctcg tcaatgtctt ccatttatca ctttctcttt 14220 tgtatctggc catttttaaa tagggctctg cgatatctgg ctgacaaaaa aatcaaaaaa 14280 taaattttaa tttttataat tttatatatt tttattttat tttttgtgct cataatattg 14340 acctttgcag tttttacctt tttattattc agctaaacta aaaaacggaa gactttctcc 14400 agtggaaaaa tataataggg aatattatga aaacattgct tgcctttttt taaaattaag 14460 aaaagtcttt acatggaaag gtcatatcac tttcagtagt ccacaatgat tcaaagcagc 14520 tactgaaagt tatactcgca tatgttgctg ttaaatcatg tagttgtgtg atgtataacc 14580 atggtgtaag tgcacaccta aggattaacc ttttttccaa gaaagagact tttaaacgat 14640 attgatgaac agttgatatg atctattaaa agtcagtgat gctaacagtt aatcatcttg 14700 tttgctttca ggctctagca ctggcagaac aacctacatg agaagaccat ggacaaaagc 14760 agacgtgaag gctgtaatga gttcaaaaga ctgattatta ttatacacat tttctttttt 14820 tttccttgtc acagagtgac aatatttaac cattttcaaa cccacttctg ttccatgcca 14880 gtgcttcaca gtccatcatc actctagggc agtctctaca ttgctctaca gtttgcctct 14940 gcttctcaac ccatgtctgc cactactcca cagtctggta ctgctccaca gtccaggtcc 15000 acgtctgctt catgagcctg ggcaataccc atcacacaaa aatacataac ccacaaatgc 15060 agtccctcag actaaacatt tacaacagtg ttgagtagca aaaagacatg ttttttgttt 15120 tttatttaca tgtttgaaaa tggtcaaata ttgtcactgt gacaaaaaag tgaaaatgtg 15180 tatgatatta attagtcaca ttataacaca ttatattact gtcacattac gaaatttagt 15240 ttttaatctg ttttgtgttc atataattct gtatgtatca ttcaatttgt ggaatcaaac 15300 ttttatgtct agttttatct ttttttcttt ttatctatct ttttatctat ttttatccat 15360 ctattttttg ttcttgtcag agcgataata gttgatgcat ttcataataa aatgtatata 15420 catgtttaat tcccctgctg ttttcctgga cttcaaaaat aaaaaaaaaa agtttatttt 15480 tttatttctg tcattgagtg gcaatgtttt gtatttgaca gtattctagt gcactaccat 15540 gttgtgttct agtataaata tttaaatttg aaaaagaata atctgttgaa cgacttaatt 15600 aaattttaaa ataccattag tttgtttata agctgtaatc attcattcat tcattttctt 15660 ttcggcttaa cccctttatt tatcaggggt ttccacagtg aaatgaacca ccaacttatc 15720 cagaatatgt tttacgttgc ggttgttctt ccagctgcaa cccatcactg gaaaacaccc 15780 atacacactc atacacacac actacggaca atttagtgta ttcaatcccc tgtaccgctt 15840 gtctttggac tgtgggggaa accggagcac ccggaggaaa cccactccaa cacagagaac 15900 atgcaaactc cacacagagt tcacagaggg gaacatttgt gttctggcca agtaaaagaa 15960 taaaataatg aatatgttat cgtacagttg tgcttctggt tgttcatggt tgaaggttct 16020 ttgccaatta tggtcttttt ttatctgaag ttgcctttca aaatatctac tgtcctataa 16080 aataagagca aaggccctta aacaattaac tcttcatact ttaatgttaa aatgctgtct 16140 ttttcaggtt atttgttcaa cagtacatta aacagattgt ttaactgata caacacaata 16200 agcctacaga attgttaaat caaaagtgta ttataagcaa aatagtattt tgtcatttaa 16260 gaagtgacag gcagaacctt agcaactgag atgggtttag caggtgtcta gctgaggaaa 16320 gccagagtct gttctgttgt cacccttact gcaatagcag aatagtgtga tatgtaaaat 16380 gtgttaatca gtcctcttaa accatgaagg ggctaggttt gtgtgtaaca tgtaaacaga 16440 ccttgtaaac catgattttg tctggcccca taagcctgtg aaatgtcaca tagttaattt 16500 gagtgtgttg tgaaactaaa attgctgttt ttaagcgttg tttgcctttt gtagctcaaa 16560 taaaaatttt tggcacaagt tttatgttcg taggatgttg gcaaaggtat gtccccgtaa 16620 accaccaaaa agtcacattg acctcataaa ccatgtatgc ctgtatgtgt gtgtgtgtgt 16680 gtgt 16684 // ID GYPSYDR2_LTR repbase; DNA; ZEB; 135 BP. XX AC AL603743; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE LTR of putative novel retrotransposon GYPSYDR2. XX KW Gypsy; LTR Retrotransposon; Transposable Element; GYPSYDR2_LTR; KW LTR; retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-135 RA Jekosch K.; RT "LTR of a gypsy-like retrotransposon."; RL Repbase Reports 2(2), 13-13 (2002). XX DR [1] (Consensus) XX SQ Sequence 135 BP; 21 A; 51 C; 22 G; 39 T; 2 other; tctctctgcc tgcccacgaa ccacgcctgt tatacggact ctgatccttg ctgcctkccc 60 ttgacccaag cctgcataac ggattctggt tctcgctcac tcccttccac tcatgcctgk 120 taactcactc tgtgt 135 // ID DNA-1-8_DR repbase; DNA; ZEB; 1561 BP. XX AC . XX DT 29-OCT-2008 (Rel. 13.1, Created) DT 29-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1561 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1216-1216 (2008). XX DR [1] (Consensus) XX CC TIR is 13-bp long. TSD is 1 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1561 BP; 528 A; 268 C; 296 G; 469 T; 0 other; taggctcgtt tgaggtgcgc ctcgccttga ctgcattgta aacgagagca ggcgtgcgca 60 gtaagaggag ggctcaaaaa ttacatcaga agtcggacac ctgccgaatc tcgctgttta 120 gccacctgga aacgtcacca gtggcggaga tctcgttcgc gttttttatc gcagacgaat 180 gacctcatga aaaaatacta tggtgattta tagtaaatat tatagcattt ttgaacatgt 240 ttattcagta ttgggttgtg tttatagttg ttgtgatacc agagcaacta taaaatttaa 300 ataggctatt ttataacatg gttcaaaagc actacagtat acattactat agtatttttt 360 aatgtgggag tggatgcaat agaaatatca ctgtttttac aaaaagaaaa tggtcagaat 420 tacagaagat gaacagctca ttaactaaaa gggtgtatta aaaatataat tctgtatatt 480 caacaaacgt taaaagaata acaaaataaa cagcttatac agcaagaaaa ctttcaagaa 540 acctaatttg gctacattat aaatgaagta gccatagcaa gatgcttttg agtgacgggt 600 catgacaaat gtttgtttga caaataaatt cttagggggt taattatatt gacatgctta 660 ttaggtctat ataaactgct tttactccag ccagactaaa ataaacagcc caagactttc 720 tccagaataa aaactattag actgtaggaa aaactgtgga aatgttcttg gtctattaaa 780 catgactttt gacttcacat acatatacat atacatatgt gtgtgtgtgt gtgtgtgtgt 840 gtgtcttgtt tactacaccc ttttcacttc tcatatgaaa ccacacaaag atgaggcata 900 atgacaaaac agtctgatta cctttgtatt tctgaatata tgtatatatt ggattgactt 960 taatggaata acgctgcata aaaattacaa tacaaaattc acacaaacat acaaaaagtg 1020 aaaatgacaa caaacaggac ttaaaactga gagagaaata aaatacagca cacacacatc 1080 ctaatgattt attagtaaca tttagttata ttggataggt cacgttatgt tttacaaaga 1140 tgtgttttgt ttttgtttgc gtgtaaatct aaataaataa ccatatgtaa tgctctttat 1200 gacttttttt ttaattatca acgaaaatac aaaaaaaagg tattcaacaa atccaggtat 1260 aaaccttagt aaaacaagat aataaagtgt aatgtgtcag tctcatccaa tcagcattaa 1320 gctgtgtcgt cagccagtcg tttcccatca tgcttttggt aacgcagtct tttgagagca 1380 actgcggccg actgctctat ccgaggtacc cgcctcgcga tcgcgagagt tttttggcgc 1440 acgtcagcgt gacatcagag cgaggcgaag gcaaggcgga cacatttcta accggcatgc 1500 atcttgcggt gatcaccggt gatcgattct gcgcagattt tggccaagtc aaacgagcct 1560 a 1561 // ID Gypsy-24-LTR_DR repbase; DNA; ZEB; 787 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-24_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-24-I_DR; Gypsy-24-LTR_DR; Gypsy-24_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-787 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-24_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 22-22 (2005). XX DR [1] (Consensus) XX CC Gypsy-24-LTR_DR is a long terminal repeat of the Gypsy-24_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-24-I_DR. XX SQ Sequence 787 BP; 216 A; 127 C; 173 G; 271 T; 0 other; tgtaacgaac aaacgttaca tgctttaatt gttgttttag tttcttgtga tgcgagaagg 60 actggttgca atattactac aggccactag gttgcactag aggggagtga gaactggata 120 tattccagct caccatagag caagtgagtc gtgtttggta gagggaacca catggtgaac 180 tgaggctacc gacccactcc ggttctgatt gttttgagaa ctttattgtt gctgctcaaa 240 tgtgagtatt gaatacgttt tgagtttgaa agtattttca atggttggtt gtaatttgcc 300 ctatgtttat ttcttatgta gacaggacct gttgtcctag agatgttcag atgaggtgtt 360 tgttggtgct ttcaatggac tgcaatgatt tggtttcaac gttctttatg cttctcattg 420 gcaaatgaac tatatgtgta cacttggaat gacggtcagt gactcttaaa ttcctcatca 480 tactaccgtt ttcccagtgt gaggaagtgg atcttattgc atcgcactca aatccatcta 540 cgcaacattc acccaactat ctatattatt tacaaagact gtttacattc aatgactgct 600 tggttttggt tattgattac ttttgtttgt gtggactaaa tactgtatac acatgtagag 660 aaattaaaca agaaggaaaa agggaaaaac aggcttgtga agtttatttt aatttcagta 720 tgtgtgaaat aaaaattctg tgaactccct cagtgtaaca tcttgcatgc atagaggtgg 780 tgttaca 787 // ID HATN2_DR repbase; DNA; ZEB; 1219 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE HATN2_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW HATN2_DR; nonautonomous DNA transposon; hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1219 RA Kapitonov V.V. and Jurka J.; RT "HATN2_DR, ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(4), 15-15 (2002). XX DR [1] (Consensus) XX CC The consensus sequence was reconstructed based on 12 copies CC present CC in sequenced DNA that represents ~1% of the genome. Several CC hundred CC copies of HATN2_DR are expected to populate the complete CC zebrafish genome. The element was active long time ago; its CC copies are ~14% divergent from the consensus sequence. The genome CC is populated by multiple subfamilies related to HATN2_DR. CC This element is characterized by 17-bp terminal inverted repeats CC and CC the 8-bp target site duplications. XX SQ Sequence 1219 BP; 386 A; 240 C; 220 G; 373 T; 0 other; cagggcttga cattaacttt tttgatcacc agccactgtg gctagtagtt ttccaacatt 60 actagccact cgccattttc actagccaca attttgttgt tgggaaaata tattttatat 120 gcataaattt gactttgaca tgctaaaatt acttgattta gattttgtgt tatgtccaca 180 tgcctcctca ttcatttcac tttttgtgtg tcgtgtatga gcttgctcaa tgagcatgag 240 caaatgggtc atggtttcat agtgtcgcat tgcaattagt ttttatttca catgactttt 300 cagggctcaa aattaaggat ttaccaaatt tatctctgac cacaccaaaa ataaataaat 360 aattatttct tagccacaaa ttttaaatgt gtcaaaacaa gcaaaatata gctgtatact 420 atacttttta tttaacaaaa catttcttta aaaaaaaaca aaaaaaatga cattttaaaa 480 aaattattgt catgtataaa atatgcacag taatctgtcc tggctaaagg tcccagatca 540 ccagttaact acacataaat ggggagttaa tctcatatta aagcaatgtt attgtaaaaa 600 aaagataatt aaaccatatt aacaaaaata actgtaagca aaagaaagca ggtagaaaaa 660 catactgtgt gataatgaaa ggatgatcac acgcacacgg ttaatgttag gcccattaat 720 aatctgttca aagaatggtt aaagcgaaag cggcaactgc tgctgcttac aaaaagacaa 780 atctggagct cttgaaagca gcaggactgt gggtgcacga gcgtcacttt ttgtccagtc 840 tatttcaaag agaaccgatc gcaccagttg cgtttccagg cacggttgct tcgcgcaagg 900 aaacgggagc gtgcactttt taaacagtcg cgtgcatcac atcagctgtt cgcgcgagtg 960 atcatcctct cattcggtat tcacctgctt tgctcgcgcg aacacaatta tttttgtcag 1020 tcgtgctgct tctcgattct aagatcacat ttgcacgcat attgggccat ccttattgtc 1080 gaaccctgct tttaagaaaa ctacaattta aaaattattt tcaaccagcc aaagtggcta 1140 gtgggagtgt ctgtctaacc cgccacagct gaaatctacc cgcatttggc aggttggcgg 1200 gtgttaatgt caagccctg 1219 // ID EnSpmN1_DR repbase; DNA; ZEB; 6581 BP. XX AC . XX DT 09-SEP-2003 (Rel. 8.08, Created) DT 09-SEP-2003 (Rel. 8.08, Last updated, Version 1) XX DE EnSpmN1_DR is a nonautonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW Dr000060; En/Spm superfamily; EnSpmN1_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1773-2625 RA Bao Z.; RT "Dr000060, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-6581 RA Kapitonov V.V. and Jurka J.; RT "EnSpmN1_DR, a nonautonomous En/Spm DNA transposon from RT zebrafish."; RL Repbase Reports 3(8), 153-153 (2003). XX DR [2] (Consensus) XX CC EnSpmN1_DR copies are ~1% divergent from the consensus sequence. CC This element is characterized by 10-bp terminal inverted repeats CC and 2-bp targets site duplications. This nonautonomous family CC belongs to the En/Spm superfamily. This classification was CC made based on detection of En/Spm-like transposases in the CC zebrafish genome that are internal portions of sequences similar CC to EnSpmN1_DR (see EnSpm1_DR). CC RnSpmN1_DR is a composite DNA transposon, it carries copies of CC the DNA-8-2_DR and TDR15 DNA transposons (pos. 1850-2625 and CC 5233-5747, respectively). XX SQ Sequence 6581 BP; 2087 A; 1038 C; 1164 G; 2292 T; 0 other; cactgtaaat gcgaatagta ttccttacta tcaaaattct agtgtgttta cttaatttta 60 agctaattgt tgagtttact agaaatgcat tagtgttgct gacatagtaa ggtataaagt 120 tcccataact aatgtaattt gagtaaaatt gcttataagg gctaagccat gctgttacgc 180 atgcgtactg ggtggtgatt tcccatcccc catcccgagt cacaaccaac cgccacccag 240 caccgccatt gaaggacagg atttcatatt gagaagatgt actgctgtaa atcaaggtaa 300 acttttcgtt ttaattacta ctttgatttt atttgtatag tatcagtaat ctgtttattt 360 cctgcacttt gtgaaattgt ttgctgtttt gtgaactaat taatagccca tgatgttgtg 420 actaaacgtt ttgaccactg cacgttattt gcaatactgt ctactttact agataagcta 480 tttttagctg cttagtcctt tttaataact attaataatt tgtaaagtta gtagtaattt 540 gtaaagtggg cgttatgctc taaaagagcg ggccgcgcca ttttaaatga tgcactgaga 600 gagcaagagc gcgcgcaatg atccgttcaa atagatgtaa agttaacaca tagctgtagg 660 tctacatttt taaaaaagaa aacgaaacct atttatccgc attacaaata cagtaacgtt 720 agttatcgct gaactttagt ggcgcttgtt gtgacgaaaa tatgcgtgtt tttttttttt 780 aagatttact tttatgagat cacctgcagg aagcaggtga ggagatcagc tgtgattcgg 840 tccggcgtca gcgagcggct catgaccaga ccagtcgttt ttttctccat gatcgccggg 900 gcagcagcag ttttaacccg ccgaggcttt tagaacatta taaagcagtt tcagctcaaa 960 atggacccgt gagtgtcgta ggtaatttag gatttcgatg cgattacgat tcgtgtcact 1020 gtatgaggca gatacatatt tacacatgat gctaaagtgt gcttatgcta acaatcccta 1080 tgaaacttaa agttaaactt atgaaaatgc gagagatatt taattaactc atttatttgt 1140 tttatatgct atttgtcttt tctttatgcc gttttgaagg tttattccag tgttgccgtt 1200 acgttattga gtgttttgta ttgctgtaat gacaaataaa tctgaccaga cgcgactgaa 1260 tatgtgtgat ggataacgtt aatgatgaag tggtataact cgtacaacca agcacagaaa 1320 catccctttg tttgctcaat atcacacaaa ggcatcaaca agcctttgtt ataagcaaac 1380 tttctgtaaa gaaacaggct ttgccaaacc tgttagtttg catttttaaa gaaggtttca 1440 ctaattttta acaatttagg gcaaataaat ttgaacaatt acaaggttgt ttaaaatgta 1500 gcacaccatt agagtattgt gagtggcata tttggtttgt aatatcatta taccctttat 1560 ttatttttta tttcagacta ccagtttgtg agctgatatg cagtggaaat gtaagttttg 1620 cacttacaca tcaaagaaaa ctttttgaac aataccgcac aagccataga acatattcat 1680 gaattgttct attaacgtta ccttgtctcc atactgagtg tcagggtaat tttaaatcgt 1740 ttaatgcatt aaaagtacat ttaacaaaat tctcagcaaa gaactgcatg atcaagaata 1800 gtccgaatat tttgtccatt tgaataatca tctgaggact agttagaaat agggttgttg 1860 tgttgtgata ccattaattt atgttacaat actataccag ctgaagtatg gtgataccaa 1920 gtagtattgc aatactgtaa ttcataactt aaatccatga aataaagaaa aatgtcagaa 1980 ataatatatt gtatatgtta ttataggcct acttgaattt aaataattcc cttattgttg 2040 aaaaaagtat ttgcacatct cttcacctaa aatgtattca ttccagcaaa aaaaaataca 2100 ttaaaataaa gatacataaa gaaaagagca tttttccaac aaacctaggc tatatgaagt 2160 ggtcaaaaat tgtttcctgt tgctattttg ggtttttcat ctatcatact tttttcacac 2220 ttattatcgg ataagaatcc aaagtaattc aaaatttcac tttatgagtc attcttttta 2280 atagattgtc acggttgttg tagctattat atcatattga caggaacaga aataaacgaa 2340 ttccgattca aaatgaacca tcacaaaata tgcaataggc tactataaaa ggcgcaaatt 2400 ctacagtttt acaaccttag aaggctccat agactattaa aatacaatga tctattgttg 2460 ctgaatcgtg taagtatttt agtgactttt tcacatgcag cattatgtat tgtaggcaga 2520 ctgttaatga caatactacc gtttacaaac tacagtggca ccgccagtat cttggagcca 2580 tagtatcacg atactaccat agtaccggta aaccgtgcaa ccctagtcag aaagttcagt 2640 gtccttttaa agacatcaat tgagacaaat atttattcca cttttaatgc gcataaaagt 2700 aaggcacacc agcaacaatg ccatgattat atttaatcgg gggttgtgtt tattactgtt 2760 tggtgacaag tatagctcat gcttttgatg acaggcgccg ggagtatttg gcagctttta 2820 ttattcaaga acatttaact atgtcatgtt aaagtgtaac atctgttgct aaatggcatt 2880 gcattgtatc attgcatgtg tttctgattt ctgttttttg ttttagaatt tcgcttagcc 2940 atgtcaataa cactttacgt ttttgtgcac aacagtgaca tccagaagga agtcatagag 3000 gaaacaccag ctgatgcaag tgctgcagaa ggtttgtttt cttttcaaaa ctttgtgtga 3060 ttttgtgaat ttgtgttttt gaatgttatt aatgactaat aatgttttga ttggatcttt 3120 gcagaattca atcgaatatc aggaagaaat gtgagatcca agttctacgc tgcattggac 3180 acacactgct cgtttgattg aaataaaaaa aaaaaaacag gagaaaattg cagaagaaaa 3240 ctagaagaaa ttttcaacat gttaattcaa aggtattttt gtgacatcat ctttgaatca 3300 taatttgcat ttgattgcaa taaattgcat ttgatgtttt cattgggcta taatgcatgt 3360 ctacctgtct ttactttttc aaggtcatct ggcagcagtt ctccggggtc tttcagtcct 3420 ttagagggat gagtctaaag aaattttcag gatgtgtttt gtaagtagta ttattgtttc 3480 tgctttctta gactggaaat gaagcagtca ttccagttta cttcacttat taataaaaat 3540 tttcaatttt ataaaaataa aacaaatgca tttaatgtaa gtatttgaaa gtaagaggga 3600 tgttatatta tggcctttgg agcaagagca ttgctatgtt gcaaatacag cagcttgtgc 3660 cgtcacaggg tcagttagct tcataggcca gtgaagtaat gaatgcattt ctttaatttt 3720 taaacacact tgttttgaaa aggaggaaga ttagcttgaa aaaaaaatct taataagtgt 3780 aatacctatg taatatgttg agagaaatat aatttcagtt gaaagaacct gctggattaa 3840 gatgtaaaaa gatagtgtta aaagccacaa agtcaaagat gctccgtatt ataaccaacc 3900 ctgtgaagtc acctgctgtg ggattgtaga tggctgcacc cttgttctaa aagccatact 3960 aaaacatcaa ttttcttatt aaaaaatagt tattaataca tttttaaaaa gtgtgctaaa 4020 taatgcaaac ttgactcact gctttaatct agcaaaacca aatgcattca tgaatcggtt 4080 cccaatttat taagttttcc ctacaaaata tttatattga ttatgtagca ccaattggtc 4140 tctcacacat ttgcatttaa tttcttgaaa atgttttaaa agtactttta ctgcagtttg 4200 ttttattaca ttacagtgtt tctttaatta aaaaaattat tgttccttta gaatagtatc 4260 atgttcaaga tactggaatt tagtaccaat ctctgctgca attttaagca gcaaatttcc 4320 tgctaacatt ttaagcactt ttgagcacat ttttaaacag cactgttttg ccattctgtt 4380 cacatgctta acagaaatga cactcaacaa tgctcaaatg ttagcaggaa atggatgttt 4440 ttaaaattac tatatggatt ggtaataaat tccactatca tgacaagcct agtagaagca 4500 gtacaggcaa atccatcgtg ttggtctaca ttgaagaagg agcaccatag caaggccttc 4560 ttataagaag catttgaatg taaaatgtac tataaacaca aatagtcatg ctaataagca 4620 actagttaat agggtaattg gttcctacta aagtgttact aaattaagca caaggaattg 4680 tatttttttc acctctgctt ttggtctcct ggccactaca ggctttatta gtgccattta 4740 gaatatcaaa ctttacttat ttttctttct ctattcatag gatgtaattg tagagtcgga 4800 cttcagtgca gaagatgttg gcctgctgac tgtcgtgtgt gaagatgtct aaagaacctc 4860 attatggatg caatgtgggc ctttatactg gaaggaaaag tggttatgga ctttgccaca 4920 tgcagtatgt ctcttttttt gctttgattt acacactgaa ttcggactac ccaaagactt 4980 tgagcaatgc atttgaattt gttcaacgtg tgttgctttt gctgggaggg aagcgtaatg 5040 tgttttattc ggcgagtgcc tatatactgc tgtgctttat gtttaaaggt tcaggtataa 5100 tggcatgttt aattacaaag tgttgctttt taacatgttg ctgtttgcag tgttacattt 5160 aggactgcag cattttggtg tttaaaacac tttttttttt ttttttttac caaaaaagtg 5220 tattgtgatt tatagggttg tcaaaagtat tgagtttggt accaatcggt actgaaataa 5280 ataaaaaaaa agtccatcca tttcccacta acattgcaag aactgattgg ccattgtgtt 5340 cacatgctca gtagatatga ctgattggct ttgaaggtca ttatttttac cacacttcac 5400 cactgttcac caggtgcaaa gatacaaaga cactggataa cttcaaagca ctgtcgatca 5460 gtcgattcat caccagatag cttatatgtt agcttctaaa caaatcagct atcgacatgg 5520 ctttgaaaca cttcagtgtc tgtgtaactt tgtttggact cagtgaaaaa tggtaaacct 5580 atgacagtca cagccaatca cagtcatttc tgttgagcat gtgaacacag tggccaatca 5640 atggtgttta agaacgagct cagtagcact caaatgctgg tggaaaatgg atatttttaa 5700 tatttcagta ttgattgata ccaaactcaa tacttttgac cacactaatg tgatattttc 5760 tgtaggaaaa tctaaactac acagtctaac aaattagctt taaagcttaa tatctgttcc 5820 tctgtgagtg tctatatgct gccatgttta aggtgctggt gcaatggcat gtttaattac 5880 actaattcaa gtatcttttt tccaagtgtc aagtatttaa agtttttttg atagattttc 5940 aaaccttttc gaaaacttgt gttttagggt tgaacatgtt tattatattt tttgtctgta 6000 ttgctgttta cagtatatta cagcacttta caacaggttt ttacaaaact gtattgtgta 6060 caagtgtatt gtgatactaa aatatcacaa gtgtattgtg aaattctgct aattttgtca 6120 aataaaatgt ttacatttaa gacctgtata atgtcattac ttgtaggttg aactacatat 6180 tttatgttca gtttaattga agaaaatgtg gaaaccgatt gcacgtagtt tctttagttg 6240 acattactta aaatggtgga ttaaaacaat gttgagacta ctagcattca agatgttaac 6300 tgaacatggt agaacagtat tagttaaata gagttattaa agtaatctta ctataaaacc 6360 ctaagtaaga ttactaaata atgacaagta tgttaactag aattttctat taaacttact 6420 taatttttaa cagttagttt actttataag ttttagtttt cttacttcaa ggcaataatt 6480 aaacttaact taagatttcc ttgtaagttt atattgcaaa aataaggcaa tcggtttcca 6540 aacttttttt aagtaaactg aacatgttag aatttacagt g 6581 // ID ERV1-N7-I_DR repbase; DNA; ZEB; 8568 BP. XX AC . XX DT 03-DEC-2008 (Rel. 13.12, Created) DT 03-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous endogenous retrovirus from zebrafish, internal DE portion - consensus. XX KW ERV1; Endogenous Retrovirus; Transposable Element; Nonautonomous; KW internal portion; ERV1-N7-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-8568 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2175-2175 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 8568 BP; 3390 A; 1061 C; 1673 G; 2403 T; 41 other; taatttggcg ccttcatcgt ggggctggac tgaggatggt cgaaggaagg tgtcaatggg 60 tcgatccttg gcagacactc acgcagagac caaaaagcgc gcagattaaa aaggtgagca 120 awtttttgct ttattttggt tctctgatga gtgtactgtg gggtggacca gacgcttgaa 180 cttgactttt aaaatacctc accattaaaa agaacctata aattgactaa atttattaaa 240 aagaatgata aagtaaatga taaagtrata aaagaatagt gaagtgaact atttataaga 300 gaaagaggtc ttccaggttg tcaaggtatg tgcttgtact aacaaatggc tgtatattat 360 gcactaacta atggctttgt attatatact aacgaattgc tgtaaattaa atgcgtatga 420 aagagagtta catgcatgaa ttgctacgca agaggcggga gcagatcgcg cggaggtggc 480 actagtgatc gctttttata gktgtgatta gagcaaagga gcgctccgcg tctctgtact 540 cgcctgtgga ttgagtaaga ctctcccgat agatacaggg aaattgggag gaggccttta 600 atctgatctc aaaagcacct ataaggaggg ctggatctca gtgaactgac ctgtgtaagt 660 cttgttagtc tatattttat gtttttatgt ggtccggacc aaggggaatg actgattttt 720 cttacaattt ctagaaagag aattgaataa gaaagcatat actgtgagat ttgtgaatga 780 gaatgcatgt gatgagagaa ttgaatttta aagcatgagt aggttgtgtg cttaaggatg 840 ttgagtgcta gagagaaatt actgaggtgt agtgtgagag aaagtgagta ttataattaa 900 gagaataaaa tatagatata atatgaattg attagagaat taataaaaat tgcacatgct 960 gtgctggagg gacaaagcaa acaatgtggc cacaacaatt aagaaaagca gttttgcatg 1020 taaaaaagtt tgttttgtct cccaggacct gcttgtgatg gtaaagacac tttaactgta 1080 cctccaactg aaaataaata aaagtaatwa agggaggttt ggattccaga ttaaaaaaat 1140 aataataata gtttgcatgc aaatgatctg tgtttgctcg ttaagaggcc ttgatagcga 1200 cagacattaa ctagtaactg ttactcagat ttggaaagtt atttaatgat gatatgcata 1260 gagcagattt agattttaag agtgagttct gaatgtgtgg tgacagttaa gagattaaaa 1320 gcagattgga ttataatgac atgcgcaaga aaattcctgt ggttattgtt ttagtgttgt 1380 gtgtgtctgt gtctgtgtga atggttgtgt aaaattccat attagtggac aagcagagct 1440 gcccctcctg taccgtccct taaatggatc atggcagaga tggacactct ctgtgttaca 1500 ctgagaggaa tgaatgactg aatgagccct gtagatataa agggacaaat gctgagtgaa 1560 tgagtccttt aaatgtgaag aagtagaatg tgagagtgtc gttaagtgtt tgtgtttagt 1620 agtgtgtttg tgataaatag taaagtaaaa taattattat tgtagattat aatataaaga 1680 tggactgcag atattgtgtg ataaagaaca aaamatagaa gagactgatg cagtttaagg 1740 aagagattat aatatttagg aaataaatta attcagttca gatagaaaga tataaagata 1800 aatattaaat taaggagaag tatataacat aarttgagay ytgtgcatac aaagagatca 1860 taatttaatt tcttagaata aaaattaatt tagtgaataa tagtatctga tttaataata 1920 atgaggtgta aaagtgtata ttacaagaga gtataagata aataggataa agtaaataca 1980 atttggaaga aaaataatgt gttatataaa agaataaagt gatatgttaa acatttggga 2040 acagagagga ataacacgtc atagaaaaat attagaatta gaataaaaaa aaaaatatat 2100 atatataata tgaataattt atagttaaat actcaactaa gagagaacag aacagaaaat 2160 aatgacaaat ggcaggaagg aacatttgat gtaagctgta agtatatata aactatgatt 2220 aaaaaataat aattatcaat caaaagaata agatgaaagt taaaaactaa aaagatgatt 2280 gtgtataaaa agaatgagaa aatgtactgt aaaatttaaa cagatatgga aaatgttaga 2340 gaaattcaaa caaaataaag aaaagaccaa ctagtgagaa aacactagaa tagattagga 2400 gaaaaaacaa ataagaaaaa gcacaagctr aagctaaata aatgraaaga gaagaaaatt 2460 taaagaaaca gaaaattaat aaaagatatg tggacttagt gataaatgtt aggaaaaatc 2520 agctcttgga caacaaagag aagaaaatag atgagaggag tctttatttg atgaaggaaa 2580 atgggaaagt ggaaaaatgc ttagattatt aactaagcaa attgctatat taaattaagg 2640 ggcatggaga tggacttttg aagggtcctg gtctttttgc ctgccggaca tccctgaagg 2700 cactgggaaa tagaattagg tgtttggaga aacacaagga gcaacttgtt gactgtgaac 2760 tctttttaca tgaactaaaa gcagtgagcc aggcactgca tgcaaaaagc tacaragact 2820 ttgacccatc aacatgttca agagacactt aaagaaggag aaacaagara ctgatgaact 2880 gatcaaaaaa gctaaacctt ccagcaacat cagctgggtg aaagagaagc cagcaccaag 2940 aggaaggaga aatgctctac actttaactc cttaaataag caggaaatgt ggggattcct 3000 gaatacaaca atgcaaagac tgcaccatct atggaacagt tgcctcgggg tgcgaggaag 3060 agaagttggc tatggcctag caaactcaat ctcaaagccc agtagaacta tgatccaaat 3120 caagtacatt gcaaatttaa acaaaagtgt accctttcct gtatcaaatc agacttctgt 3180 caattagtta gaaaaaaaaa agttattata taaattaaga attaattaaa ttttaattaa 3240 ataaggatta tgaaataaaa atatattata gaaatataaa tataatatat attagttata 3300 acagatagaa gatgtaatta agacaattga taatggggaa aatagattga gttaaacatt 3360 tctagagcag ggctagtaaa aagaagaatt tgtctataca aatgtatgat actgaaagag 3420 atttagattt agaaaattaa tacaaggaaa aaataattga agagaaaaag aacagcacat 3480 gagaataagt caacagtgag aatctctaag tgtaagagat gattttcagg aaaagggggg 3540 gttaatataa tagattttgt ttcatacagt atatagccat acaagtaaaa aaaaaaaaaa 3600 agggaactaa atttgaggac taatgagtaa agcaaaacag accagataga ttcagtgatt 3660 tttccaatta agttaactta caaaaattac atgaaagntg tgtttacaat caagtagata 3720 ggttgcctta ggaagtaaat atcaatcaag agaaagatag aaaawaataa atgcaacatt 3780 caaatgaaca aatagttcaa aatgatttaa agaagtacag ttaatactta aataataatt 3840 attatcattg caggacttgg atttagagtt agaaaaataa ccagtacaaa tgagggttaa 3900 aattaaacca gatgcacatc tgcagagaaa agaagaaaaa agctgaaata ggtataaaaa 3960 caatattgaa gttttattaa gagtgggagt attaacacct agtgattata atatgcaata 4020 catatgtcgt tggatatttc tatttagatt atatttcaat tyaggtatga cagtgtatgg 4080 aattaataag atttgaattt aaattgaata aatactttaa ttatgctgca tcaaagcagg 4140 taaaaagcag aacatataag aacaatcaat gagagaaaat tatccatctg aagttcctac 4200 attttatgaa caaatgttcc tcctgatgtc aaatgttgtt aagtaattag tttctgtcca 4260 gccaatttta ggtaattagc aaaagagggt acctattagt atttacatta cctggaaaaa 4320 ctatacattg atgcgtctaa aaatcttatg atgaatagta gtttgttaca acatagttga 4380 tttaattatc tgcagctcct tatagctcta gctcagttaa aaggcaaaaa acacatgatg 4440 acttttaggg atgactgaaa tttagcattg atggaacaaa gaatctgaca caaacaaaaa 4500 agcactggag aaaatttatg aaacaggaaa gactgcaaaa attaggtact tagttaacag 4560 gaaaraaaaa aaagcattta agatgatgaa agataaatta ggttgaaaaa caatgcagga 4620 aataggtgac aggatgtgag gtagcacacc ttcagtgctt atagagctac aatctcaatt 4680 aaagaacatt gtgaaaatta aacaacaaac agatctctgt gaacaatcta gtatgaaagg 4740 aaggaaaaag acgtttattt aacatactgc actcataaaa gactaattgt tgcaccaacc 4800 acacctctaa gtatacttat taaagatgca catgattttg atcaaagtat tattaggaaa 4860 attaaacagc aaggagattg gtcttcttat taaaaggcag tgcttagtga gttgtccatg 4920 tggcatttgt gctaaatata atgtgaggaa aggaacagca accagtacct aaagtgacat 4980 tcaagcaatt aattggggtt ttgaggttat gataaaataa gtacaaagca agaaataaat 5040 gtttatacta cttaacaggt taggattaaa agctggttgt ggtgatatat tatatttcta 5100 ataagaaaaa ttatgatttc tacatttgga atactctcag aaataattta aaataatgtg 5160 tgtgcattaa tacattattt agagtaatta aagtaaattt aaaatagcta aataagtaaa 5220 taattgatta aataaataaa caaataaata aayaaaactg ttaaagaaat actgataatt 5280 aagaataaat caaaagctaa agtgaattta tcatgcacaa tcacaaggac tggtcaaaag 5340 caccaagaag ataataataa taataataat aataataata ataataataa taataaattt 5400 tatttgtaat gtgtttttct tatactcaaa gcgctacagg caacacacta gaacacaagg 5460 caagcaaaac agcacaacaa tagaagatgc aataaaacaa taaaaaatat gaaccataag 5520 atagaaataa aaagttaaac aaaattatca tccagttaaa attatcatcc taagttaaaa 5580 gcaatgctaa aaaggtgagt cttaagaagt ttcttaaaac agtccagaga ggcagcattg 5640 ctaatgctga tggggagttc attccaaagc ttaggcgcac tgtgtgaaaa agattattag 5700 ggccatagct ttaaaaaaaa aaataataaa aataaaaatg tgaaaatgct aaacatgatt 5760 gagtttggat tagtaaactg gattgatatg taaacagatg ctttacatgc aaactaatag 5820 aaacaagaac ttaacactgc atggaatgct aatgtcctgt ctaatacatt ggcaaataag 5880 aatttttgtc gtattagaaa aactaactgc catgtatgaa gctatttatt cacaggaaca 5940 aagcaaggaa cgacagaagg aagctgcttt ggaagatgtt cctgctgctg ttgctgatga 6000 tgatgcaaat aaagatgagt gaaagcctag aggtaacttc tctaaaggta aaggaatatc 6060 ttattctgat ggagttggac aagagtatcc aacaattgac ttcttawwtt tcttggaaaa 6120 gaggtaaatt atcscacaac aaaaygaagk taaagaacaa tcatctaaag aaaaatgctg 6180 acccaaagga tatgcatgaa taccaaaata tgcaatttar gaagaaattg actataacca 6240 gatctcagat gatcagctat taagaagcta ataacagtga ttggtattgt tgggcaaaat 6300 acacagttgg aaaattacaa atatccaatt gcctaatatg tttataatag cctttataaa 6360 gcttaaagtg gttgtaaatc ctaatgatta tatacattgt gcaaaattca ggaaaataac 6420 tgcaatatac tgtacagtga cttcccattt tgtcaggccc aatcccttga taaactctaa 6480 atttacagag agaataagag aacaatgatc ggttgaaaaa agaaatgtaa agaaatgaac 6540 atcagagttc aaacaataat aagagagaga ttcccacaaa ttatgtgata aataagaaat 6600 taatatatat gaatgttata taatcagtgg attgayaaat gtaagaaaat ttaaaggaaa 6660 ttatgtaata atttggcatt tgaatgacga aatgtagcga aaaagaaaat gcttatatgc 6720 aagccttcca aatggctaca gctacaataa agaatataat gttgccagtt tcagaagttg 6780 caattatgaa agaaaargtk atagggcagt taaccgcaca cttggtaatt kcactggaca 6840 caatgtacaa ttcctaagat gatttccctt aatacaacaa gattaacaaa cagtagacta 6900 ttactggatc tgtggaagag gaatattatg ccccaatatt ctattgaatt ggaaaggagt 6960 ttatactaca gttaaattaa tacaggaagt gataataact gggttaaaga gcaagatgac 7020 atacaaagac cgcagaattg aaagcagcag aattggaaga aatgagttaa gagacaatat 7080 acacctgact cagatatttg catatttgca attggccaac ctacaagaat accacatgaa 7140 gatgaattta aagcaacaga taaattaaaa gcattactcg agtaatagag tgaataattg 7200 cattataaaa tttagaatgg gttaactata tttattacaa ccaacaaaga tttataaatt 7260 acacaaatga tgcccttaaa gcattaggag aacaactggc tctaactagt aaaatggctt 7320 ggtaaaacag tgcatcattt gctgcaccta tattccaaat aatactgccc caggtggcat 7380 ctttcactga agccatgcga aaattgtraa atctgagraa agaartaaaa garaacrctg 7440 ragarkatam aaaagtrwtt tgatttgctt ccatrtaytg tatggatcat taagacaata 7500 gttcctcagg attggaatat tattaataag aattgctata tttgttttta ttcttctttt 7560 ggttgtgtta tgcttctcag atcgatagct aaatctactg ccaagcggat ggcaaatgtg 7620 cgagttaata gaactgtgca tgactgcaaa gattatgtct atgaagtctt tggaacagat 7680 ttggaccaat tgggacacat ttggaacagt tgaatgagtt ttgaaaacaa ttactaccct 7740 gaatgtcttt tatctttaat gtcttaaaaa taataataat aaattcttta gctttttact 7800 aattttataa aggctttaaa tcacaaagat aagagtttta agcaaggcat catccataat 7860 gtaggaaaat gcaaatagaa tgaactgaac agaacttaaa gattgtgtaa tgattttaag 7920 ccctatttac tataataaat ttctgtgtat ttaggagttc atcgaagctg ccggagatca 7980 tatcagatgt acacggggaa agagggaaga tgattacggg tcaagggaca cctcagtatc 8040 caacarcatc gaaaggagaa gaagagcaga ctctactgga ggtttaaaaa cccactgtct 8100 gctccaacac cagacagtac tcagaagtac ctctggcgac gtgggatgaa acatcagata 8160 tagttctttt ataggcgctg agaatgttct ggaaatattg agattttcta ttgtatgttg 8220 attaatcaag tatttatgct atataactac tgcagtgaaa aagtctgaca aagcatatgc 8280 ttttaatatg actaatgatt gttgtgtgtt aaactcccca agaaagtatt tagttaggga 8340 gagcgggatc ttttactcct tctaaagcca aaagtggagg gagatgtttg gttccgtttc 8400 tcctctgaag tgcggcgcaa ggatgctgac aattgataag gatgtagatg ggcaactttt 8460 ttggttgcat aagtatgtta agacttggcg agatgttgga ttcttaagtt gcacttcagt 8520 gagggattta ggtgagggac tttttgatta agtccctaag gggggatt 8568 // ID Copia-8-LTR_DR repbase; DNA; ZEB; 484 BP. XX AC . XX DT 20-FEB-2009 (Rel. 14.02, Created) DT 20-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Copia; LTR Retrotransposon; Transposable Element; Nonautonomous; KW Copia-8-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-484 RA Bao W. and Jurka J.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 555-555 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 484 BP; 101 A; 153 C; 81 G; 148 T; 1 other; tgtcacacaa ccagcgatcg ctccccagag gatcgctggt tctcacacga actcacagga 60 ctacatttcc gcattcccca ggactacaca ttcatacatg cactgctccc aaacacgcac 120 gcctgctcac tcatgtatcc tgattgcaag caccagctga accttgttca cagactgatt 180 acatcccatt cataagccac tctctcacgc taccagtttg ccgagtcttg ttttcccagt 240 rtgacattac aacgcgtttc cctgtcttgt ttctccgtgt ttcgatccta gccttgtatc 300 cctagttatc ctgtttagcc gcctgccttt tgaccttttg cctgtttata tcgactacga 360 ttctggactg cctgtatata cctgtttgca cctgatgacc attgcttgcc tgactattga 420 ataaactgca tattggatct tacctcctgt ggtctcctgt cactccaccc cagtcgtggt 480 aaca 484 // ID DNA-2-27_DR repbase; DNA; ZEB; 1912 BP. XX AC . XX DT 09-JAN-2009 (Rel. 14.01, Created) DT 09-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA-2-27_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1912 RA Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(1), 5-5 (2009). XX DR [1] (Consensus) XX CC 2bp TSD. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1912 BP; 621 A; 354 C; 348 G; 587 T; 2 other; tatggaagca tacgagtagc ataattcaat tcaaaatgta atatgttaaa tggcaatgta 60 tttctgtatt tgaatttgca ttttccaaga catatgagca atgtttggtg cagaatgaaa 120 atgtaaatta aattacattg tttgcatttc tcatttcaaa cactgtttta atatgtaaat 180 tgtatccaca ttttaaagct ttataagtag caaaattaaa ttgaaaatgc tatacacaaa 240 cttaatttga tgtgataaag tacaagcaga aactgtagca aaatgataat ttaaatgtta 300 tttgtcctaa atgcatttac gctcgcgtct aggaaacgca ggattacatt ttcagcataa 360 cagcatggga tgtgtgtagc aaaattaatt tcgaaatgta atacgcaaaa cgataatgca 420 atatgtaaaa ggcagtgtat ttctgtattt cagttagcat tttccaagac atatgtgcaa 480 tgtttgctgc agaatgaaaa tgaaaaaaaa aaaaacataa tgcattttca tttaacacat 540 cgcgacgaat gaaagccgat ttgaaattga aaatgcattc tggcctggcc acgcccacag 600 acggccaccg ttgctgcaag gcgggtttta ggttatgtcg tcacagctgc cgcgaggact 660 attwgggaca tttggcagca taaacatggc gaagtctgtt cctcagcggc ttcgtgaagc 720 tgcacgagcc ctcgagcgtg aactgggaaa tactttagaa aascaatcac ctcctagctc 780 agtaaacttg gcttaaggtt tgatattcgc gcacgcagtt ttagggatca tctgcaaatt 840 acatcgacag aaccaaaacg cacagtaacg caaagcagaa aaaaattgag gagattccaa 900 taaatggtta gaggccagac tgatttaata caggtagcaa agctactata cctctaaccg 960 tatggtacat acaatgcaga aataggcaat agctcttgtc ataagcatac agtacacatc 1020 agctaaagcc cctttaggtt ctgtcgatgt aatttgcaga tgatccctaa aactgcgtgc 1080 gcgaatatca aaccttaagc caagtttact gagctaggag gtgattggtt ttctaaagta 1140 tttcccagtt cacgctcgag ggctcgtgca gcttcacgaa gccgctgagg aacagacttc 1200 gccatgttta tgctgccaaa tgtcccaaat agtcctcgcg gcagctgtga cgacataacc 1260 taaaacccgc cttgcagcaa cggtggccgt ctgtgggcgt ggccaggcca gaatgcattt 1320 tcaatttcaa atcggctttc attcgtcgcg atgtgttaaa tgaaaatgca ttatgttttt 1380 tttcattttc attttcattc tgcagcaaac attgcacata tgtcttggaa aatgctaact 1440 gaaatacaga aatacactgc catttacata ttgcattatc gttttgcgta ttacatttca 1500 aaattaattt tgctacacac atcccatgct gttatgctga aaatgtaatc ctgcgtttcc 1560 tagacgcgag tgtaaatgca tttaggacaa ataacattta aatgatcatt ttgctacagt 1620 ttctgcttgt actttattca catcaaattc agtttgtgta tagcattttc aatttaattt 1680 tgctacttat aaagctttaa aatgtggata caatttacat attaaaacag tgtttgaaat 1740 gagaaatgca aacaatgtaa tttaatttac attttcattc tgcaccaaac attgctcata 1800 tgtcttggaa aatgcaaatt caaatacaga aatacattgc catttaacat attgcattgc 1860 cattttgcat attacatttt gaattgaatt atgctactca tatgcttcca ta 1912 // ID GYPSYDR1 repbase; DNA; ZEB; 4222 BP. XX AC AL591405; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 13-OCT-2008 (Rel. 13.11, Last updated, Version 2) XX DE Gypsy Danio rerio 1. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW retrotransposon; GYPSY/TY-3; GYPSYDR1. XX NM GYPSYDR1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4222 RA Jekosch K.; RT "GYPSYDR1: gypsy-like element from D. rerio."; RL Repbase Reports 2(2), 10-10 (2002). XX RN [2] RP 1-4222 RA Jurka J.; RT "Corrected internal portion: eliminated LTR fragments."; RL Direct Submission to Repbase Update (13-OCT-2008). XX DR [1] (Consensus) XX CC Putative novel retrotransposon similar to GYPSY/TY-3 CC retrotransposons. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 164..4222 FT /product="GYPSYDR1_1p" FT /translation="MSVEEVLLHLAEISSKQNSISEQLTARQDRLEQQLRQ FT AARHHPTPEVSAHHHLTKLSDLDDIDAYLHTFEVIAEREGWPKENWARMLA FT PFLTGEAQRAYFSLETPKNEDYKALKKEILARMGLSTISAAQQFSQWSYDE FT KQPVRTQAAQLSRLGRLWLLGGDPSAVQVAEKVIIEKMMRALPRRLRTLTS FT MRNPESLATLVEAIELAEAHIARDNGERAALPPRRVSAPWRPVEGTARPGG FT RPAVPSPMDEPMPTEPTTHSTPAWTAGCAVHRNIPPEAPTHKVQLEGKTQT FT ATLDTGSAITLVHPKTLKYHHESKGRIPITCVHGDTRHVPARRVTIAAKPG FT SWRIEVGVVPDLPVPLLLGRDWPGFDELLTHHHAPSARSKKKNKPRAHRDR FT KPALMTTESDRGGESSISANLYFDLFQQITAGGDFGRAQREDETLKHCWPQ FT VRIIDGNERLPSPHPLPHFIVENGLLYCVAERRGEKKTLLVVPRTKRETVL FT ELAHTHPMAGHLGAANTVKRIRDRFHWPGLDGEVKRYCQACDICQRTSPQR FT PPPSPLIPLPIIDVPFTRIGMDLVGPLPKSARGHEHILVILDYATRYPEAI FT PLRKATSSAIAKELFLLCSRVGIPTEILTDQGTPFMSRLMADLCHLLKVKQ FT LKTSVYHPQTDGLVERFNKTLKQMLRRVVAEDGRDWDLMIPYVLFGIREVP FT QASTGFTPFELLFGRQPRGLLDVARQAWEQEPAPQRSVIEHVRDMRERIDK FT IMPIVKQHLTEAQRAQQRLYNRPAQPREFHPGDKVMILIPTTTSKFLASWK FT GPYTVVERVGPVNYRVRQPGRRREEQLYHINLMKKWVAAPGHLVAFAEETL FT PVVHIGEQLSPNQKAELQALVGQFKDVFSEKPGRTSIIQHNIITPPGTIVR FT QRPYRVPEARRLAIDEEIQKMRRLGVIEPSRSPWSSPIVMVPKPDGTLRFC FT NDFRKLNEISKFDGYPMPRVDELLDRLGGARFISTIDLTKGYWQLPLSEDA FT KEKTAFSTPGGHWQYRVLPFGLHGAPATFQRMMDILLRPHQPYAAAYLDDL FT IVHSESWEEHLSRLRRVLLDLRRAGLTANPKKCHLGLAEARYLGFHIGRGL FT IQPQQNKVKALQETPQPTTKTQVRAFLGLAGYYRCFIPNFSSIASPLTDLT FT RKGQPERIRWTKEADDAFRALKKSLTSSPVLHAPDFGCPFILQTDASDSGL FT GAVLSQVHGDEEHPIMYVSRKLTPAETRYATVEKEALAIKWAILELRYYLL FT GRKFTLVTDHAPLQWMATAKNNNARVTRWFLSLQDFNFDVQHRAGASHGNA FT DGLSRLWSGWAGLSKHSTPPLNTLLFLRRTPRTRTTLRGGE" XX SQ Sequence 4222 BP; 1086 A; 1244 C; 1098 G; 788 T; 6 other; actggtggag aatgcgggca ttgcagaggg aaaaattcag aagaaacact caccattaag 60 tagaaaataa ccgctgaatt tttctctgtg tttgttacag acaggcatcc gctctctctc 120 tctcccacgc ttcctcttac tcggctgtca acatcccgtc gccatgtctg tggaagaggt 180 actgctccac ctagcggaga tctccagcaa acagaattcc atctctgagc aacttacagc 240 cagacaggat cgactggaac agcagctccg ccaggcggcc agacaccatc cgactcccga 300 agtgagtgcg catcatcatc tcactaaact cagcgacctg gatgatattg acgcttattt 360 acataccttt gaggttattg ccgagagaga aggctggcca aaagaaaact gggcgagaat 420 gttggctccg tttctcacag gagaagcgca acgagcatat ttttcactag agacacctaa 480 aaatgaagat tacaaagcgt taaaaaagga aatattagcc agaatggggc tatccacaat 540 cagcgcagcc caacagtttt cccagtggtc ttacgacgag aaacaaccag tgagaactca 600 agcagctcaa ctttctcgcc tgggaaggct atggttattg ggaggagatc cctcggcggt 660 ccaggtcgct gagaaggtga tcatcgagaa gatgatgcgt gcgttacccc gacgtttgcg 720 aacactcacc agcatgcgaa atcctgagtc actggccacc ctggtggagg cgatcgagct 780 ggctgaagct cacatcgcca gagataatgg ggagagagcg gctctgccac cccggagggt 840 aagtgcacct tggcgaccgg tggagggcac agcgcgacca ggcggcagac cagcggtccc 900 cagcccgatg gacgagccga tgcccaccga gccgacgacg cactcgaccc cggcctggac 960 agcagggtgc gcggtwcacc gcaatatccc tcccgaagct cccacccata aagtccagct 1020 agagggaaaa acacaaacgg ccaccttaga cacaggaagc gccatcactc tggttcaccc 1080 gaagacttta aaataccacc atgaaagcaa agggcgaatt ccaatcacgt gtgtgcatgg 1140 tgatacccgc cacgtacccg cccgaagagt gaccatcgcg gcgaaaccag gcagctggcg 1200 matcgaagtc ggggttgttc cagatcttcc tgtgcccctc ctactgggca gagactggcc 1260 ggggttcgac gaactcctaa ctcaccacca cgctccatcg gctcgttcaa agaagaagaa 1320 caagccacgg gctcatcggg accgcaaacc agcgctgatg accaccgaga gcgacagagg 1380 gggtgagtca tcwatatctg ctaatctata ctttgatctg tttcaacaga taaccgcagg 1440 aggcgatttt ggcagagcac agagggaaga tgaaacgctc aaacactgct ggccacaagt 1500 acggatcatc gacggtaatg agcgacttcc cagccctcac cccctcccac atttcattgt 1560 ggaaaatggt ctgctgtact gtgtcgcaga gaggcggggg gaaaagaaga cactactggt 1620 cgttccgagg accaagaggg agacggtctt agaactggca catacccacc cgatggctgg 1680 acatctggga gcggccaaca cggtgaaaag gatccgcgat cgtttccatt ggccggggct 1740 agacggggaa gtaaagaggt attgccaggc atgtgacatc tgccagagaa cgtctcccca 1800 acgaccaccc cccagccctc taataccact acccatcatc gatgtgccct tcacccgaat 1860 tggtatggac ttggtagggc ctttgccgaa gtcggcccgg ggacatgagc acatccttgt 1920 tatcctcgat tatgccacca gataccctga agcgattccc ctgaggaaag ccacgtcatc 1980 ggccatcgcg aaggagctgt ttctgctatg cagtcgagtg ggaatcccaa crgagatact 2040 gaccgaccag ggcaccccct tcatgtcccg gttgatggca gacctctgtc acctactcaa 2100 ggtaaaacag ctaaaaacct ctgtatatca tccacagacg gacggccttg tcgagcgctt 2160 taacaagact ctgaagcaga tgctccgacg ggtggtggca gaggacgggc gcgactggga 2220 cctcatgatc ccgtacgtgt tattcggtat cagggaagtc ccccaggcct ccacaggatt 2280 tacccccttc gaactgctgt ttggccgcca accccgaggc ctattggatg tggctcgtca 2340 agcctgggaa caggaaccag ccccccagcg gtcggtgatt gaacacgtac gggacatgag 2400 agaacgcatc gacaaaatca tgcccatcgt caaacaacac ctgaccgaag cccagcgcgc 2460 ccagcagaga ttgtataacc ggcccgccca acccagagag ttccacccag gggacaaggt 2520 gatgatactc atacctacca caacgtcgaa gttcctcgca tcctggaagg gaccatacac 2580 cgtggtagaa agggtagggc cggtaaatta tcgagtccgt cagccgggac gaagacggga 2640 agaacaactt taccacatta atcttatgaa gaagtgggtt gcagctccag gtcatctcgt 2700 tgccttcgct gaagaaactc ttcccgttgt ccacattggt gagcaactct caccaaacca 2760 gaaggcggag ctgcaagcct tggttggtca gttcaaggat gtgttctcgg agaaaccggg 2820 ccgaacctcc atcatccaac acaacattat cactcctcct ggcaccatcg tccggcaaag 2880 gccttatcga gttccagagg ctcgcaggct ggctatcgac gaggagatcc agaagatgag 2940 aaggttaggc gtcatcgaac catcccgtag cccgtggtcc agcccaatag tgatggtccc 3000 caaacccgat ggcaccctcc gtttctgcaa cgacttcagg aaactcaatg agatctccaa 3060 gttcgacgga taccccatgc ctcgggtgga cgagctgctg gataggctgg gtggagcccg 3120 atttatctcc accatcgacc tcaccaaagg ctactggcag ttaccactca gtgaagacgc 3180 caaggagaaa accgccttct ccacacccgg tgggcactgg caataccggg ttcttccctt 3240 cgggctccac ggggccccag ccacattcca aagaatgatg gacatcctgc tgaggcccca 3300 ccagccatat gcagcagcct acctggacga cctcatcgtc cactcagagt catgggaaga 3360 acacctatcc cggttacgga gggtgctctt agatcttcga cgggctgggc tcacagctaa 3420 tcccaagaaa tgccacctgg gtctagccga agccagatac ctcgggttcc acattggacg 3480 aggtctcata cagccacagc aaaacaaggt caaggcacta cargaaactc cacaacccac 3540 cacaaagacc caggtacgtg catttctggg gttagcgggc tactatagat gtttcatacc 3600 taacttctca tccatagcca gccctytgac agacctgacc agaaaggggc agccggagag 3660 gataagatgg accaaggaag ccgacgatgc gttccgagcc ctaaagaagt ccctcacgtc 3720 ctcaccggta ctgcacgcac ctgacttcgg ctgccccttc attctacaga cggatgcttc 3780 cgactcgggc ctgggcgcgg tcctctccca ggtccacggc gatgaagaac atcccataat 3840 gtacgtgagt cggaagctga cccccgcaga gacccgctac gcaacggtgg agaaggaggc 3900 cctggcgatc aagtgggcaa tcctggagct aaggtactat ctccttggca ggaaattcac 3960 tctggtgacc gaccacgccc cgctacaatg gatggctaca gcgaagaaca acaacgctcg 4020 ggtcaccagg tggttcctgt cgctccagga tttcaacttc gacgtacaac atcgagccgg 4080 ggcctcccac ggaaacgcag acggactctc acggctctgg tcaggatggg caggtctgtc 4140 gaaacattct accccccctc tcaatacact gctcttcctt cgcaggacac ccaggaccag 4200 gacgacgcta aggggggggg aa 4222 // ID TE-X-1_DR repbase; DNA; ZEB; 1478 BP. XX AC . XX DT 21-OCT-2008 (Rel. 13.1, Created) DT 21-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous transposable element from zebrafish - a consensus. XX KW Transposable Element; Nonautonomous; TE-X-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1478 RA Bao W. and Jurka J.; RT "Transposable elements from zebrafish."; RL Repbase Reports 8(10), 1621-1621 (2008). XX DR [1] (Consensus) XX CC This is the consensus built from copies that are ~10% divergent CC from it. The size of TSD is not know, and TIR is not obvious, but CC the elements are inserted into TA satellites. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1478 BP; 390 A; 298 C; 330 G; 459 T; 1 other; caacagttct gtctggttct tgaatctgat tggctgatag ccgtgcgata ttttgccagt 60 aacatcacac aaaggcctct tcacccttca ctgtgtatta ctccgcccac atacagccag 120 caacaagcag agacgctaca gtttgacaaa tattactgct gttagacaac aaaatgtact 180 tttgaggctt ttttagtcga gaatgtagtt gtttagattg caactatgca gtttatttrt 240 aagaatagtg cctattttaa aatatttaca atttctgaga gacagctcgt cggcgtccat 300 tagccattag tcaaagacgg ttgacgttgt ctatccacaa gatggcgcca ggaccgcata 360 ataagccctt gcttagatta aaactgttgt gtgttttctt gagcgcgaat aagaagctga 420 aaaacggaag cgtcacggtt tttaaggtgg accggataga gtcaataaga agctgcgaat 480 aagaagctgg attcagcctc gtccctcctt atcgcaaaca caacagcagt ctggtaagtt 540 agtgattgcg ctgctttttt ctgaggttaa ttattgtgat attgtaacgg tcgcttaatg 600 tttgatatgc attgtgtata aatgtcatgt cccctttaag aggacttctt ctgtggttgt 660 tttacctttg agatgtaagc acgcatgtat gggagctcag ttcagttctg gcgagctctg 720 aggagaagtg tgtgatggtc gctctaccgc tgtatcattc ttttatttca ataatgtaac 780 attttatttt gcgacggatg tttgagtgtc tttttctttt ccttttgcct actcattgtc 840 gttgactggc atgtatgtag tcgacagcac ttgggagaaa cgcataaagg ccaataggaa 900 gagcgaaata gtttctgtct cctttttttt attccttaaa gtctccgggt aaacgctggg 960 acacctggca cccgtcacaa taccgacggc aacagagaaa tactgtagac tgtaatagac 1020 tgtagggaga tatctctgta gacggatggc atttcatgtc acgttcagcc ttataatctt 1080 aaaatgtgag caaaatcagc tgttttgtca tcactttagg ggggtcacac accagaagcg 1140 ccgcttcaca ttacgctata gagaatcatt caaacactag ctctaaagtg acgttggtga 1200 attagtaacg gcttctgctg ttctgacgtc agctgcagat gtgaatgaat ggcggaagaa 1260 agtagttcct aatacaaaag ggtttttaga ctctccgtgt ttgattttct tttttatata 1320 caagattgtg ccgtcgaact gttgtataaa cgcaatatca cactcgtagc agtgcgatat 1380 ggctgtatat cagcactggt gggacgctaa ggcactgcac gcctcccacc agtgctgata 1440 tacagccata tcgcactgct actcgtgtga tattgctc 1478 // ID Gypsy52-I_DR repbase; DNA; ZEB; 5474 BP. XX AC chr17; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy52-I_DR; KW Gypsy52-LTR_DR; Gypsy52_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5474 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1419-1419 (2008). XX DR Genome; chr17; Positions 23231809 23237282. XX CC Positions [2296-2835] - Reverse transcriptase CC Positions [4084-4560] - Integrase core CC 'CACC' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 73..5283 FT /product="Gypsy52-I_DR_1p" FT /translation="MAELEELKKQFEEMQRRFGEQSGMLEQARAEQREALS FT LAKIVIEQQAAAQKAPSTVYIPRDRKLPEFSGCRSKPGELSIEEWISSMKS FT AFKVMKIPEEDRIEFVKQCLKDEAKMTVKFMLNGKEKSVDEIFDALDQTYG FT DKLPIGSRLKDFYDRKQMPGETIRSYAYDLQEKLSIIQSRDPARIPDADGV FT LKEQLVLGLKDDSLRREMKRRVKAEKDLTFIQLMQEAITWSEEEEVQIPSN FT LRPSTRSRGVVRATSATESSSVPLTLESLHEAVQQIAARQEELFQIVNSKE FT KVKPLVKENKMKSVPLKDSEGRYICYTCGKPGHTSRRCLQNKENSKTQTVE FT REKTADRDTLVLESPGPSVIRSHTADCDTKNVSKSLCESAFGDCLTIEVKI FT AGISTICLLDTGSEVTTITESHFKDHFGEAVLSSANWVRLTAANDLDIPII FT GCLEADVECMGKKLHGKCVFVIKETETHGTELNRLPGIVGMNVLSELKDLF FT MTTGDVKKMNRYSCGFKEARVQRVLANIKMQTETLSCNDKIGFVKVAGKQA FT VTIPPFSERVLEGRCRIPPKVSCQVLVEASSNVSLPKSVLIANVLAKSADG FT KVPVRVLNSSEKPVKLPPRCRIAALSKPQEVVQKELVEFEEKEGTLYVKAV FT QKHQVDVDISTTEQLPVPVQMNLENLTESQRCTLQELLVKHSDVFSKNDCD FT FGYTTAVTHNVPTGDAPPIKQRHRRIPPQVFQEVKKHVQDLVSQGILRESC FT SPWASPAVIVIKKDGSVRFCCDYRRLNKVTCKDAYPLPRVEESLDALGNAQ FT LFSTLDLTSGYFQVAMNEEDRAKTAVTTPFGLFEWTRMPFGLCNAPATFQR FT LMGVVLGDLTFDILLIYLDDIIVFSRDFESHCQRLEIVFNRLREHGLKLKP FT SKCFLLKPEVKFLGHLISSQGIKVDGEKTQVLETWPAPTNVKELRQVLGFM FT SYYRRFVPCFAQLARPLHALVGKGGKGKVIEPLNWTTECQLAFDKLKHCLM FT SPPVLAYPDFSQSFVLTTDGSLHGLGAVLSQRQGDAERVIAYASRSLRGSE FT KNDRNYSAFKLELLALKWAVTEKFKEYLIYSKFSVVTDHNPLRYLETANLG FT AVEQRWVAQLSEFDFEVFYKPGRQNTNADVLSRIPSGEEPEQEDSDKDFIK FT MNSDEVRTCLWPATKNQQRVQASVQVSVARKVPGYSWSDLEEQQKTDPKIA FT PIYRAVLKNKNLSPVEQRNMNAELKKLAKQFIRLELKKGVMFRTILDPRDG FT EEICQLVVPEPLRYKVYESQHDHCGHFGERSTLERMRRNYYWPTMSKDVQN FT WIKECKRCALAKDVFPKIRAPMTCTNVSAPLEVLAMDYTVLEESVGGYENV FT LVLTDMFSRFTVAVPTRNQTAHTTAKALVQHWFVHYGCPARLHSDQGRCFE FT ANVIKELCRVYGIGKSRTTPYHPQGNSQCERFNRTMHDMLRTLPPEKKKNW FT KEYLPELVMAYNSRTHTSTGYSPFYLLFGRDARMPMDILNGRDIEESGVDN FT LDDWVKKHHDRLKTAVEVANSAAQEASRQRKRAYDRKSRGALMKAGDRVLL FT RNHSHRGRNKIQDHWEPLPYIVVKQNHTDTPVYTIRPEKGGPCKVVHRDQL FT RHCTFQSSLPPRTSRHNSRAHTSQEHTDSEDPDVLVVPVTFTPTTLPDTQG FT RRESGDESDTGIREHDNRISDEDSVDQSVNSGGSEAESENESIPEPRRSQR FT LNKGTLPVRYRTDFVLK" XX SQ Sequence 5474 BP; 1708 A; 1044 C; 1303 G; 1419 T; 0 other; tattggcgtc acgaacagga taaattcctg acttttaatt cagtataagt ttgaagtttt 60 tgagccgacg ctatggctga gctagaagaa ttaaagaaac agtttgaaga aatgcaaagg 120 cggttcggtg agcagtctgg catgttagag caagccagag ccgaacagag agaggcactt 180 tctctagcta aaattgttat tgaacaacaa gctgctgctc aaaaagcacc atctactgtg 240 tacatcccaa gagatcgtaa acttccagag ttcagtggat gtcgttctaa gcctggcgag 300 ttgagcatag aagaatggat aagttcaatg aagtccgcct tcaaggtgat gaaaattcct 360 gaagaagaca gaattgagtt tgttaagcag tgtttaaaag atgaagcaaa gatgacagtc 420 aagtttatgc tgaacggaaa ggagaagtca gtggatgaga tttttgatgc tttggatcag 480 acttatgggg acaaacttcc cattggcagc aggctaaaag atttttatga ccgcaaacaa 540 atgcctggag aaaccattcg ttcgtatgca tacgatttgc aagagaaatt gagtataatt 600 cagagcagag acccagctag aattcctgat gctgatggtg tattgaagga acagctcgtg 660 ctaggcctca aagatgattc actacgacgt gaaatgaaaa ggagagttaa ggctgagaaa 720 gatttgacat tcattcagct tatgcaagaa gctataacgt ggtcagaaga agaggaagtt 780 caaattccaa gtaatctgag acccagtact cgttcacgtg gtgttgttcg tgctacctct 840 gcgacagaga gctcttcagt ccctctcacc ctagagtcac tccatgaagc tgttcaacaa 900 attgctgccc gacaggaaga attgtttcag atagtcaata gtaaagaaaa agtgaaacct 960 cttgtaaaag agaacaagat gaaaagtgta cctttaaaag atagtgaagg gagatacatc 1020 tgttatacct gtggtaagcc aggacacaca agccgacgtt gtcttcagaa caaggaaaac 1080 tctaaaactc agacagtaga gagggaaaag acagcagacc gagacacttt agtgctagag 1140 agcccaggtc catctgtcat tcgtagtcat actgcagact gtgatacaaa gaatgtttcc 1200 aagtctcttt gtgagagtgc ttttggagac tgtctgacaa ttgaagtgaa aattgcgggt 1260 ataagtacca tttgtctctt ggatacagga tcagaagtca ctactatcac tgagtcgcat 1320 tttaaggatc acttcggaga agctgtgttg tcttctgcca actgggttcg tctcacagca 1380 gcaaacgacc tggacattcc catcataggt tgtctagaag cagacgtaga gtgtatgggg 1440 aagaagttac atggaaagtg tgtgtttgta atcaaagaga ctgaaaccca cggaacagag 1500 ttgaatagat tacctggcat tgttgggatg aatgtgttga gtgaacttaa agatctgttc 1560 atgactactg gagacgtgaa gaagatgaac agatatagtt gtggctttaa ggaggcaagg 1620 gtccaacgag tgttagctaa catcaagatg cagactgaga cattaagctg caacgataag 1680 attggcttcg taaaagttgc tggcaagcaa gcagttacta tccctccatt cagtgaacgt 1740 gttttagaag gccgctgtag aataccacca aaagtaagct gtcaagtgtt ggtggaggcc 1800 tcttcaaatg tcagcttgcc taaaagtgta ctgatcgcta atgtgctcgc caaatcagct 1860 gatggtaaag ttccagtcag agtcctgaat tctagcgaaa aacctgtaaa attaccacca 1920 cgatgtagaa tcgctgcact gtctaagcca caagaagttg tgcaaaaaga gcttgttgag 1980 tttgaggaga aggaaggtac cctgtatgtt aaagctgtac agaaacatca ggtggatgtg 2040 gatataagca ccacagagca actgccagtt ccagtgcaaa tgaatttaga gaatctcacc 2100 gaaagtcaac gttgcacact acaggaacta ctggtcaagc acagtgatgt gttttcaaag 2160 aacgattgtg actttggtta tacaactgcc gttacacaca atgtaccgac aggagatgct 2220 cctcctatta agcaaagaca tcgcagaata ccacctcaag tgtttcagga agtgaaaaaa 2280 catgttcagg atttggtttc tcaaggtatt ctcagagaaa gctgtagtcc gtgggcatcg 2340 ccagctgtga ttgtaattaa gaaagatggt agcgtacgct tttgctgcga ctacagaaga 2400 ttaaataaag tgacttgcaa agatgcttat ccactccctc gtgtggagga atcactggat 2460 gctttaggta atgcacagct gttctccacc cttgatctta cctctgggta tttccaagtt 2520 gcgatgaatg aggaggatag agcaaagacg gcggtaacaa ctcctttcgg attgtttgaa 2580 tggaccagga tgccattcgg actctgtaat gctcctgcga cgttccaacg tctgatgggg 2640 gtggtgctcg gagatctaac ttttgacata ctgctcatct acctggatga cattattgtt 2700 ttttcaagag actttgaaag tcattgtcag agactggaaa tagtgttcaa ccgattgaga 2760 gagcatggtt tgaaactaaa acccagcaag tgcttccttt tgaaacccga agtgaaattc 2820 ttgggtcatc taatatcctc ccaaggaata aaggtggatg gagagaaaac tcaagtgttg 2880 gagacatggc ctgcacccac aaatgtcaaa gagttaagac aggtccttgg atttatgagt 2940 tattacagga gatttgttcc ttgttttgct caattagccc ggcctcttca tgctctggtg 3000 ggtaaaggag gaaaagggaa agttattgaa cctttgaact ggacaactga atgtcagtta 3060 gcatttgaca aactcaagca ttgtttaatg tctcctccag tccttgcata tccagatttc 3120 agtcagtctt ttgtactcac aactgatggg agcctgcatg gtcttggagc cgtgttgagc 3180 cagagacaag gagatgctga gcgtgtgatt gcgtatgcaa gtcgaagtct tcgtggatca 3240 gagaaaaacg acagaaatta cagtgctttt aagctagagt tgctagctct aaaatgggct 3300 gtaaccgaga agtttaaaga atacttgatc tactccaagt tctctgtggt tacagatcac 3360 aaccctttgc gctacctaga aacagcaaat ctaggagcag ttgaacaacg gtgggtagct 3420 cagttgtcgg agtttgattt tgaagtcttc tacaaaccag gacgacaaaa tacaaatgcg 3480 gatgttcttt cgaggattcc atctggagag gagccagagc aggaggattc tgataaagat 3540 ttcattaaga tgaattcaga tgaagtgcgc acatgcttat ggccagccac taaaaatcag 3600 cagagggttc aagcttcagt ccaagtatct gtggcaagga aagtccctgg atacagttgg 3660 agtgatctag aagagcaaca aaagacagac cctaaaatag cccctatcta ccgtgctgtt 3720 cttaaaaaca agaatctgag cccagttgag cagcgtaaca tgaatgctga gttaaaaaag 3780 cttgctaaac aattcataag actcgagctt aaaaaaggag tgatgttccg taccatcctt 3840 gatcctcgag atggcgaaga aatttgtcag cttgtggttc cagaacctct gcgttacaaa 3900 gtatatgaga gtcagcatga tcattgtggc catttcggag aaagaagtac gctagagcgc 3960 atgagaagaa attattactg gccaacaatg agtaaggatg ttcaaaactg gattaaagag 4020 tgcaaaagat gtgcacttgc aaaggatgtt ttccctaaga ttcgagctcc aatgacgtgc 4080 actaatgtgt ctgcaccact ggaagtcctt gcaatggatt atactgtctt ggaggaatct 4140 gtgggaggat acgaaaatgt tcttgtctta actgacatgt tctctcgttt cacagtggcc 4200 gtgcccacca gaaatcaaac tgctcacact actgcgaaag cactggtgca gcattggttt 4260 gtacattatg gctgtcctgc acgattacat tcggaccaag gtcgttgctt tgaagccaat 4320 gtcatcaaag aactgtgtag agtctatgga attggaaaga gtcgcacaac cccgtatcac 4380 ccccagggta actcccagtg tgaacgattt aatagaacta tgcacgacat gctaagaaca 4440 ttacccccag aaaagaaaaa gaactggaag gagtatctac ctgagttagt tatggcctat 4500 aacagtcgaa cccacacttc aactgggtat tcgccattct acttactgtt cggtagagat 4560 gctcgcatgc caatggatat ccttaatgga agggacattg aggaaagtgg agttgacaat 4620 cttgatgatt gggtgaaaaa acaccatgac agattgaaaa cagctgttga agtagccaac 4680 tctgcagcac aagaggcttc aagacagaga aagagagcct atgatcgcaa atcacgtggt 4740 gcactcatga aagctggaga tcgtgttctg ttaaggaatc attcacacag gggaaggaat 4800 aagatccaag atcattggga acccttacct tacattgtag tgaaacaaaa tcacacagac 4860 acgccagtct atactattcg cccagagaaa gggggtccat gtaaagttgt acacagagat 4920 caactaagac actgcacttt tcagtcatct ttaccgcctc ggacatctag acataattct 4980 agagcacaca caagtcagga acacactgat tcagaggatc ctgatgttct tgtagtccct 5040 gttacattta caccaactac acttccagac acacaaggta gaagggaaag tggggatgag 5100 tcggacacag gtattagaga acatgataat agaatttcag atgaagattc agtagatcag 5160 tctgtaaatt caggtggttc tgaagctgaa agtgaaaatg aaagcattcc tgaacctaga 5220 cgttcacaac gtttgaacaa aggtacttta cctgttaggt acagaactga ttttgtcttg 5280 aaataaactt agacaacaat tgaaacttta aggttttgaa agtatctttt aatgtgtatt 5340 tcactgattg aaaactttat tgttgtactg atgttcatac ctaaagtgtt tacctgctat 5400 tctctgtttt tgttttccgg aaccggatcg ctgaaaggag gtccagtggt atggcagggt 5460 ttagtagggg ggaa 5474 // ID ZFERV_I repbase; DNA; ZEB; 9865 BP. XX AC AF503912; XX DT 23-MAR-2006 (Rel. 11.03, Created) DT 23-MAR-2006 (Rel. 11.03, Last updated, Version 1) XX DE Danio rerio endogenous retrovirus ZFERV (internal portion). XX KW Endogenous Retrovirus; Transposable Element; Interspersed repeat; KW ZFERV_LTR; ZFERV_I. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9865 RA Shen C.H. and Steiner L.A.; RT "Genome structure and thymic expression of an endogenous RT retrovirus in zebrafish."; RL J Virol 78(2), 899-911 (2004). XX DR EMBL/GenBank/DDBJ; AF503912; Positions 693 10557. XX FH Key Location/Qualifiers FT CDS 1198..3267 FT /product="ZFERV_I_1p" FT /translation="MTTYKETWDGICKQIIHHTEPKDRKKISKTLESQWTY FT LSGLNGWTETTKNQGKLLNILLTEEKRADTDLVLARQNRDKTGVWKKGAAN FT REIEKAEDRLRQWSRVAAAMVTKVKSAVWADQRDSPERPPPYSSAEEKPKT FT PSAPMQMPVMIVKGGEIETTTKQANKDFKFMIKQGAIEVEETEEDKKQRQL FT KMALKEVQTKVEIEEAVKKEIAKQFTANIQLARDSVNQTMSRVAGMEEELK FT RNLNEEEGSVRSEGTTSVGGTCEDRDKIVVSGYQLDVDWGRGAALQREHRA FT QGPELDSLSDRTKRWVHAEENREQQMPLFMKNLHSTPKLVMPLIRAATGRK FT EYKPWGHTDMNAVLSKLPEITKGGQRWFTKLLTLTHGTDLALGDVRALWGS FT ILTRTQVELIEREANTTTEENEEPLNRFSTEVGSAMRRIYPTPKLTYQSIK FT FKIMTGESASAYLHRCEAEWEDRTGENPDSSDICKEFFRQAVIKGVPASAV FT VAIENSPDMQGGEGVVWTRHFIHHVDKAVERQGKEETEVEKLKTQLLKMQI FT EAEKEKGKKKKENLQLPVLTGPGVEGARSPVHNIHPISGHINSPHGIPDSY FT PYGPSPNWPPSGNQNQNPVMQGCFGCGAQDHWKRECPYGGQRGQPPARGRA FT PGRGGPQGGRGRGAPRNVSWYSNQQVPRQSPVWGPLDHADY" FT CDS 7908..9836 FT /product="ZFERV_I_3p" FT /translation="MNKINKLVVAWGVMLVIILWVICHMEFRGTTKEKRSV FT KEGGGTGVRITLEREEGKDGMWQFDLCQVIDCGKDQLAWRGYDVYGCLWPT FT TTRPPDGPHCHTWHNVNWKTAPGFIKKMLKNSPLAKADSIQNRLRLSRGYQ FT HSWGGWKNTLIISLKNGNDETDTYITLGVDVEGKDPLGLIKISIKKPKPTG FT APIITDLTENKKKVIQSTDYSNLTPLDLMTLETGYHETNLWLEWVTNAAEE FT LGFEGCLACAAGRPQLNTEPAPLHDYDSWGYKCMLKLTKEKSPKNCTALSN FT LYPPAGNKSSMGVALKRRKGNYTCFNLTTSSASKILVGAFKREWCGQMIQE FT GHDKLGGWGRVGLYYACGEKVVLDRIEPTMEGVCAMIRVAVPMVMIGNRAI FT NGHKNRRKRSVNSDFDLTRNSPTYIDAIGIPRGVPDEYKLADQVAAGFENI FT PVIAALFPVTPNKNVDRINYIHYNVQRLSNLTRDAVSGLKEQLAATSLMAI FT QNRLALDMLLSERGGVCSMFLDTCCTIIPNNTAPDGSVSRALEGLKELSNE FT MKASSGIENAVSKWLSDMFGEWKGVIIAMLTSMGIFLGILVTCGCCCIPCI FT RSLINRLIITAIEKKENPPPYQMPLLAAAEGDDVMEEVEELLDMV" FT CDS 3394..7908 FT /product="ZFERV_I_2p" FT /translation="MMEKDSLSTQSEEVRGFSGETEKWPLTKPLRVQVAGQ FT TLLHSFLCSANVPSALLGRDLLVKLGVKILCHPEGLIIIFPNGLTTNCSTP FT VTTTVRGQWVLQADKTVKARCYWLKLTDKTKLQEVITKWEPWLNTLHKFQT FT PTDPWHCTLLYDISEDEEYEQNFSEVLGRSTSIKCTKLFVAREGVAAEIIL FT TEEQEGYFEMTETSTPHVTIFMNEGHEAKALGPMVKRMQKVGDWGKTDNEW FT EYSQALDGYRLTIDIAEEVIYQMVEVRRIVTTQEGDGEGAERLLQEMPESL FT WSKAAGDVGKWSIEPVLFQIDRTEIVNVRQYKLRPEAVEGIGETIKELEAA FT EVLRRTVSGWNTPILPVLKKTTGKYRMVHDLRLINEKVLTATLPTPNPYTI FT MSKLTPKHSHFTCIDLANAFFCMPLAEQCQGIFAFSYQGAQYTYNRLPQGF FT ILSPGLFNQALRELLDSCTLHEGTIVIQYVDDLLLAAHSNEVCLQDTRKVL FT TLLSTAGLKVSKEKIQISRATVHFLGRIIGQTGTALSDDTKQTVLSHPKPL FT VVKDMMSFLGLIGYSRQYVPNYSERTATLRALAKEVGMKNSRARLNWTQEA FT EAVFCGLKADLATAAALQTPNYELPFFLDVSTTASTTNGVLYQKQHQQRRV FT LHYLSAPLDKIEQKQPTCARYAAGLAKLIEKSEHIVMGHPLHVLTSHSVIS FT FITSSAFTFSAQRQNKVIRILTAPHITYEHQGVNMAHAGEGEPHECIPRAE FT AEEQIRPGLSSIPLTKPQLTLFCDGCCFKTDSGKLVASYAIVEQTDDGYTI FT REQQVLQDRPSAQRAELLALVRALHMAKDKTVNIHSDSAYAVGAATSELTG FT WARVGFVTSSGKPIKHAQEASDLLESIMLPQEVAIIKCAAHTKGKDPVSLG FT NEAADAAAKTVAGYKPLQMTVTAVDELHQIDEHLTTSFLSKEQSLAAAEEI FT SVWLEKGGRKDSQTGLWVGPTGRPIMPANLAGKVLTEAHSLAHSSEKDMTK FT RVSQWWHPFMPHMISGVIASCQTCAEFNVKPTSKPTAGHFPTDRGPGCTVV FT MDFTDMITRVNGKRYLLVLVDQFTGWPEAFPCAREDAVSVVKCLINQYIPR FT HGFPRIIRSDNGTHFKNEHLADVEKLLGIKLRYGAVYHPQSQGKVERLNLT FT LKNKLAKICHKSKLNWVDALPIALMSVRCSINRTTGFTPFELATGRQFPGP FT VAPLHAGDTDSPQMYHDKVCAVINMFSPQKSWPTESEASRPAENTTLWVRL FT KQHKRKWSSPRWSEPLRVTARTSHCVQLAGKGTTWYHLSACMFCPSPDRSL FT ADVRVDLRRGQREGEGEGEGDTEEPEREGEGGERSGQTEVAPALRTQLSAT FT LVFKILQRTGDLLQTPEHIPIAHCISADYALGAGVAKQIRDKYGVEELNTS FT VAQPGDCIKTTHGPRQIYHLVTKWWCRDLPTYEHLEASLIKLCYQCKKDQN FT KILAIPKLGCGLDKLDYTKVKEIIEKVFKEGHIQVILLTK" XX SQ Sequence 9865 BP; 3330 A; 1872 C; 2450 G; 2213 T; 0 other; atttggaggg ggcacccgga ttcgctcaga cgcttcgtcg agttggagga cgatctcaaa 60 tcaacctctc tgcagagctt ctgataaggt aagcagtcac ctttttaata gaagtactgc 120 aagattggaa ttaccaaatt taaggttgat tggggattga ataccaacat aatccgtaaa 180 taatttgaat caaatcaaca atggcaatta aatgagttga cccggcgtgg ttacaaggtt 240 tataaaaact catactggta attaaattgg ttatattatt gagttgaccc ggcgtggtta 300 aacaaattta taaaaactca gaattggtta tattattgag ttgacccggc gtggttaaac 360 aaatttataa aaactcagaa ttggttatat tattgagttg acccggcgtg gttaaacaaa 420 tttataaaaa ctcagaattg gttattaaag gagttgaccc ggcgtggtta agcaaattta 480 taaaaactca gaattggtta ttaaaggagt tgacccggcg tggttaaaca aatttataaa 540 aactcagaat tggttattaa aggagttgac ccggcgtggt taagcaaatt tataaaaact 600 cagaattggt tattaaagga gttgacccgg cgtggttaag caaatttata aaaactcaga 660 attggttatt aaaggagttg acccggcgtg gttaagaaat attataaaaa ctcgtatcgt 720 gttttcctgg catggttgag aattgaagtt tacccggcga ggttaactca ctcgtaaaaa 780 ccataaaaga aagtaaaagg tgaaacgtta aagggtgtaa atttgtgtta aagtgtcaat 840 tgtgtgcgga ttcctgaatg agtgtgtgtg tgtgagtgaa taactccgta tttgggaggt 900 cagagtgctg cgcaccttct ctggaccgtc acttaagtgt gacccggtag gaacggacgc 960 agcattctgg gagctcagag aggagagtgt gattgatgag ccctatattc atagggactg 1020 agagcatgtg aataaaacat cgagtgaatg aaggaatcat aattagatgc tgaaggtgtt 1080 atattacaaa ccaagaaagg tgaaaaatat tcctaggtgg tgggtgtgta gattattagg 1140 acataaggaa gattgtgagg tgtgtgaccc cagtggggag acaaaaccat tattaaaatg 1200 acaacttata aagagacgtg ggatggtatc tgtaaacaga taatacatca caccgaacca 1260 aaagacagaa agaaaataag taaaacatta gaaagtcagt ggacatatct ctcaggtcta 1320 aatggctgga cagagacaac taaaaatcag ggaaagttac tgaacatact tttgacagag 1380 gagaaaaggg cagatacaga tttggtactg gccagacaaa atagggacaa aactggggtt 1440 tggaagaagg gagcagcaaa cagagaaata gaaaaagcag aagatagact gagacaatgg 1500 agcagggtag ctgcagccat ggttaccaaa gtaaaaagtg ctgtgtgggc agatcaaagg 1560 gactcaccag agagaccccc accctatagt agtgcagaag aaaaacccaa aaccccctca 1620 gcaccaatgc agatgccagt aatgatagta aaagggggag aaatagaaac tactactaaa 1680 caagctaaca aagatttcaa gttcatgata aaacagggcg ctatagaggt tgaagaaaca 1740 gaggaagata aaaaacaaag acagttgaaa atggccctaa aggaagttca gactaaagta 1800 gaaatagagg aagcagtaaa aaaggaaata gcaaagcagt ttaccgcaaa cattcaactc 1860 gcaagagaca gtgtaaacca aacaatgtcg agggtggcag gaatggagga agagctgaag 1920 agaaacctca acgaagaaga gggaagtgta agatctgaag ggaccacatc agtagggggt 1980 acctgtgagg acagagacaa gatagtagta tcagggtatc aattggatgt agactgggga 2040 agaggggctg cattacaaag agagcacaga gcacaaggcc ccgagctaga tagcttgtca 2100 gacagaacta aaagatgggt acatgcagag gagaataggg aacaacaaat gcccttgttt 2160 atgaaaaact tacacagcac cccaaaatta gtgatgccac taatcagggc agctacagga 2220 aggaaagaat acaaaccctg gggacataca gatatgaatg ctgtgctaag taaattacca 2280 gaaatcacta agggaggtca gaggtggttc actaaactgt tgaccctgac ccatgggaca 2340 gacctagcac tgggagatgt tagagcatta tggggaagca tactaactag aacacaggta 2400 gaactgatag aaagggaagc taacacgacc acagaagaaa atgaagaacc tttaaacagg 2460 ttcagtactg aagtgggaag cgccatgagg cgaatttacc ccaccccaaa attgacttat 2520 caaagcataa agttcaaaat aatgactgga gaatctgcat cagcatatct gcacaggtgt 2580 gaggctgaat gggaagatag gacaggggaa aatccagact cctctgacat atgtaaagaa 2640 tttttcagac aggctgtgat taagggtgtt cctgcaagcg ctgtagtcgc tatagaaaac 2700 agtccagaca tgcagggggg agaaggggtg gtgtggactc gccatttcat ccaccacgta 2760 gacaaggcag tagagagaca agggaaagaa gaaactgagg tagaaaaact caaaacacag 2820 ctactaaaaa tgcaaataga ggcagaaaag gaaaaaggga agaaaaagaa agaaaatctg 2880 cagctaccag tactgacagg gccgggagta gagggggcca ggtcacctgt acacaacata 2940 cacccaattt cagggcacat taacagccca catggtattc ctgattctta tccctatggc 3000 ccctcaccca actggccacc ttctggtaat cagaaccaaa acccagtgat gcaggggtgc 3060 ttcggctgcg gggcgcagga tcattggaag cgtgaatgtc catatggagg ccagaggggg 3120 caacctcctg cgcggggacg ggctcccggt agaggagggc ctcaaggggg aagaggaaga 3180 ggtgccccca gaaatgtttc gtggtacagc aaccaacagg tgcccaggca aagcccggtc 3240 tggggcccgc tggaccatgc agactattga cgaggcccac agagaaccag tgagggtgcc 3300 ctggctgagc ccctcctaac aatactggtg gacaaccagc cagttcaggc cctggtggac 3360 actggggcca ccttctccac tatacaaaga caaatgatgg agaaggactc actgtctaca 3420 caaagtgaag aagtgagagg gttctctggg gaaactgaaa aatggccatt aaccaaacca 3480 ctccgggtcc aggtggcagg gcagacgttg ctccattcgt ttctgtgctc tgcaaacgtc 3540 ccctctgcgt tactgggtag agatttattg gtaaaattag gagtaaaaat tctatgtcat 3600 ccagaaggac tgataatcat ttttcccaat ggtttgacaa ctaactgctc aacaccagtg 3660 accacaacgg tgagaggtca atgggtcctc caagcagata aaacagtcaa agcaagatgt 3720 tattggttaa aattaactga caaaaccaaa ttgcaggagg taataacaaa atgggagcca 3780 tggctaaaca cactgcataa atttcagacc cccactgacc cctggcactg tacactactg 3840 tatgacatta gtgaggatga agaatatgaa cagaattttt cagaagtact aggcaggtca 3900 actagcatca agtgtactaa attgtttgta gcaagggaag gtgtggctgc agagataata 3960 ttaacagaag agcaagaagg ttattttgag atgacagaaa cttccacccc acatgtgaca 4020 atatttatga atgaagggca tgaggctaag gctttagggc ccatggtgaa acgaatgcag 4080 aaagttgggg attggggaaa gacagacaat gagtgggagt actcacaagc gctagacggg 4140 tataggttga caattgatat cgctgaggaa gtcatatacc agatggttga ggtacgtagg 4200 atagttacca cccaagaggg agatggggaa ggggcagaga gactactgca agaaatgcca 4260 gagtccctgt ggtctaaggc agctggggat gtgggtaaat ggtcaataga gccagtatta 4320 tttcagatag acagaacaga aatagtaaac gtccgacagt ataaacttcg accagaagct 4380 gtggagggaa taggggaaac cataaaagag ttagaggctg ctgaggtctt acgcagaaca 4440 gtgtctggct ggaacacccc aatccttcct gttttgaaaa agacaactgg aaaatacaga 4500 atggtgcatg atttaaggct aatcaatgag aaagtattga cagctacctt acccaccccc 4560 aacccctata ccatcatgtc taaattgaca ccaaaacatt ctcattttac gtgcatagac 4620 ttggctaatg catttttctg catgccactg gcagaacaat gtcaaggcat ttttgctttt 4680 agctatcagg gagcgcaata tacttacaac agactaccac aagggtttat tttaagccca 4740 ggtctgttca accaagcatt aagggagctg ttggacagct gtactttgca tgaaggtacc 4800 attgttatcc agtatgttga tgacttgtta ttggcagcac actccaatga ggtctgcctg 4860 caggacacac gaaaagtact aacactacta agtactgcag ggttaaaggt gagcaaagaa 4920 aaaatacaaa tcagcagggc aacagtgcac ttccttggaa gaataattgg acaaacaggc 4980 acggccctat ctgatgacac caaacaaact gtgttgtcac acccaaagcc actagtagta 5040 aaagacatga tgtcatttct ggggttgata gggtatagca gacaatatgt accaaattac 5100 tcagaaagaa ctgcaacatt gagggcattg gcaaaagaag ttgggatgaa aaacagtaga 5160 gcacgactaa actggacaca agaggctgaa gctgttttct gtgggcttaa ggctgattta 5220 gccactgcag cagccttgca gactccaaat tatgaattgc ctttctttct ggacgttagc 5280 accacggcct ccaccacaaa tggagtgttg tatcaaaaac aacaccagca aaggagggtt 5340 ttgcattatt tgagtgcacc cctagataag atagagcaaa aacagcccac ttgtgctagg 5400 tatgctgctg gcctggctaa attgatagaa aaatctgagc acatagtcat gggacatcca 5460 ttacatgtac tgacgtcaca ctctgttata tcattcatca cttcatctgc attcactttt 5520 tctgcacaga gacagaacaa ggtaattaga atactgaccg ccccacacat cacatatgaa 5580 caccaagggg taaacatggc tcatgcagga gagggtgaac cacatgaatg tatccccagg 5640 gcagaggcag aggaacagat cagaccaggt ttgagtagca ttccattaac taaaccacag 5700 ctaactctgt tctgtgatgg ttgctgtttt aaaactgatt caggcaaact tgtagccagt 5760 tacgccatcg tggaacagac tgacgacggg tatacaataa gggaacagca ggtgttgcaa 5820 gacagaccat cagcacagcg agctgagtta ttggcccttg taagagcctt gcacatggcc 5880 aaagataaga ctgtaaatat tcattcagac tcagcatatg cagtaggggc agctacatct 5940 gaactgactg gttgggcaag ggtggggttt gtaacatcgt ctgggaagcc cataaagcac 6000 gcacaagaag cctctgattt gttagaatca attatgttgc cacaagaggt agcaataatt 6060 aaatgtgcag cacacaccaa aggaaaggat cctgtttcat taggaaacga ggcagcagat 6120 gccgctgcta aaacagtggc agggtacaaa ccattgcaaa tgactgtgac tgcagttgac 6180 gagctgcatc aaattgacga acatctaact acaagttttc tatctaaaga acaaagtttg 6240 gctgcagctg aggaaatttc agtatggtta gaaaagggag gaagaaaaga ttcccagaca 6300 ggactatggg tcggtcctac aggtagacca attatgcctg caaatttagc agggaaagtc 6360 ctgacagagg cccactctct ggctcacagc agcgaaaagg atatgactaa acgagtgtct 6420 caatggtggc accctttcat gccacacatg ataagtggag taatagcctc ttgtcaaaca 6480 tgtgcagagt tcaatgtcaa gccaacctcc aaacccactg cagggcattt ccccacagat 6540 aggggtccag ggtgtacagt agtcatggac tttactgaca tgatcacaag agtaaacgga 6600 aaaaggtatc tgctggtcct agtagaccaa ttcactggtt ggccagaagc tttcccatgc 6660 gctagggaag atgcagtttc tgtggtaaaa tgtttgatca accaatacat tccaaggcat 6720 ggcttccctc gtataatcag gtcagacaat ggcactcatt tcaaaaatga acatctggcg 6780 gatgtagaaa aattgttagg tataaaactc cggtatggag ctgtatacca cccacaaagt 6840 cagggaaaag tggagcgcct caatctaacg ctaaaaaaca aactggcaaa aatttgtcac 6900 aagtccaaat tgaattgggt agatgcactt cccattgcac taatgtcagt acgctgttct 6960 attaatcgaa ctacaggttt tactccattt gaattggcaa caggaagaca attcccgggc 7020 ccagtggccc ccttgcatgc tggagacact gactcaccgc aaatgtatca tgataaagta 7080 tgtgctgtga ttaacatgtt ttctcctcag aagagttggc caacagagag tgaagcatct 7140 agacctgcag aaaacacaac actgtgggtg agattgaaac agcacaaacg aaaatggtct 7200 agcccaaggt ggtctgaacc gttacgagtc acggctagga catcccattg tgtgcaatta 7260 gcaggtaaag gcacaacatg gtatcatttg tctgcttgta tgttctgtcc ttctccagac 7320 aggtctttgg cggatgtcag agttgacctc aggagaggtc aaagagaggg agaaggagaa 7380 ggagaaggag acactgagga gccagaaaga gagggagaag gcggggaaag aagcggacaa 7440 acagaagtag ccccagccct cagaacacag ctaagcgcaa cccttgtatt taaaatcttg 7500 caacgaactg gagacttgct tcagacacca gaacacattc ccattgcaca ttgtataagt 7560 gctgattatg cactgggagc tggggtagct aaacaaatta gagacaaata cggtgtagaa 7620 gaattgaaca cttcagtcgc ccaaccaggg gattgtatca aaactacaca cggtccacga 7680 caaatttacc atttggtgac caaatggtgg tgtagggacc tacccaccta cgaacatctc 7740 gaggctagtc tgataaaatt gtgttaccag tgtaagaagg atcaaaataa aatattagct 7800 ataccaaaat tagggtgtgg attagacaaa ttagactaca ccaaagtaaa agaaataatt 7860 gagaaagtat tcaaagaggg ccacattcag gtaattctcc taacaaaatg aataaaataa 7920 acaaattggt ggtggcctgg ggagtaatgt tggtgataat cctatgggta atctgccata 7980 tggaattcag agggacaaca aaggaaaaac ggagtgtaaa agaaggggga ggaacagggg 8040 tacgtattac cttagaaagg gaagagggaa aggatgggat gtggcagttt gatctttgtc 8100 aagtgataga ttgtgggaag gaccaattgg catggagagg atatgatgtt tatgggtgct 8160 tgtggcccac taccaccaga cctccagatg gtccccactg tcatacttgg cataatgtaa 8220 attggaaaac agccccagga ttcattaaaa agatgttgaa aaatagtcct ttggccaaag 8280 cagacagcat ccaaaatcga cttaggttgt ctaggggata ccaacatagt tggggaggct 8340 ggaaaaacac gttgattata tcgctaaaaa atggcaatga tgaaactgac acgtacatca 8400 ctctaggggt agatgtagaa ggaaaagacc cactggggtt gataaaaatc tccataaaaa 8460 agcctaaacc cacaggggcg ccaataatca cagacctaac cgaaaacaag aagaaagtaa 8520 tacagagcac tgactatagc aatctgaccc cactagatct gatgacgctg gagacaggct 8580 atcacgaaac aaatttatgg ttagaatggg tgacaaacgc agccgaagaa ttaggatttg 8640 aggggtgcct agcttgtgca gcaggaaggc cacaattaaa tactgaaccc gctccattac 8700 atgattatga tagttggggg tacaaatgta tgttaaaact aacaaaggaa aaaagtccca 8760 aaaactgcac tgcactcagt aatctatacc caccagctgg aaataagtca tcaatgggag 8820 ttgcattaaa aagaagaaag ggtaactata cctgtttcaa tttaactacc tccagtgcat 8880 ccaaaatatt agtgggagcg tttaaaaggg aatggtgtgg acagatgatt caggaaggtc 8940 atgacaaact aggaggatgg ggcagagtgg gtttgtatta cgcgtgtgga gaaaaggtgg 9000 tattggatag aatagaaccc accatggaag gagtatgtgc tatgataaga gtcgctgttc 9060 ctatggtcat gataggaaat agggctataa atgggcataa aaacaggcga aaaaggtcag 9120 taaatagtga tttcgattta acacgcaata gccccaccta catagacgca attggcatcc 9180 ccaggggagt tccagatgaa tataaattgg ctgaccaagt agcagcaggg ttcgaaaata 9240 tacctgtaat agcagcatta ttccccgtga ccccaaataa aaatgtagac agaataaatt 9300 acatacatta caacgtacaa aggctgtcaa atctcactag agatgctgtg tctggattaa 9360 aagaacaatt ggctgctacc tctctgatgg ctattcaaaa ccggctagct ttagacatgt 9420 tactatcaga aaggggcggg gtttgttcca tgtttctgga cacgtgttgc actatcatac 9480 caaacaacac tgcccctgac ggttccgtat ccagggccct ggagggacta aaagaattgt 9540 caaatgaaat gaaggctagc tcggggatag aaaacgccgt atcaaaatgg ttgtctgaca 9600 tgtttgggga atggaagggg gtgattattg ctatgctaac ttcaatgggt atcttcttgg 9660 gaattttggt gacatgtgga tgttgttgta ttccctgtat aagaagtctg ataaataggc 9720 tgataataac tgcaattgaa aagaaggaaa acccacctcc ttatcaaatg cccctccttg 9780 ctgcagccga aggggatgac gtgatggaag aagtggagga actattggat atggtgtgat 9840 ctctttcgag atcaagagag ggaat 9865 // ID MSAT-1_DR repbase; DNA; ZEB; 150 BP. XX AC . XX DT 21-AUG-2010 (Rel. 15.11, Created) DT 21-AUG-2010 (Rel. 15.11, Last updated, Version 2) XX DE Minisatellite-like DNA - consensus. XX KW MSAT; Satellite; Simple Repeat; MSAT-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-150 RA Jurka J.; RT "Minisatellite-like DNA from zebrafish."; RL Repbase Reports 10(11), 2008-2008 (2010). XX DR [1] (Consensus) XX SQ Sequence 150 BP; 12 A; 48 C; 30 G; 60 T; 0 other; ttctgcctct ggactgcttt gctcattctg cctctggact gctttgctca ttctgcctct 60 ggactgcttt gctcattctg cctctggact gctttgctca ttctgcctct ggactgcttt 120 gctcattctg cctctggact gctttgctca 150 // ID Gypsy124-I_DR repbase; DNA; ZEB; 5200 BP. XX AC chr16; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Gypsy retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy124-I_DR; KW Gypsy124-LTR_DR; Gypsy124_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5200 RA Jurka J. and Kohany O.; RT "Gypsy retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1284-1284 (2008). XX DR Genome; chr16; Positions 14966709 14971908. XX CC 'GGAT' target site duplication CC LTRs are 98% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 806..4669 FT /product="Gypsy124-I_DR_1p" FT /translation="MYRIDADMKVRMRELELQAAQTLSKTPVRSINVNGGV FT DVATAPADASIVVNSVPSDEKPDALIPSRYFNVAKNIALVPCFREKEVEAY FT FQAFERIATALKWPNEVWALMLQCKLTGKAQEVCASLSVEDSIQNEAMKAA FT ILRAYELVPEHYRQRFRTTKKSASQTYVEFSREKGILFDRWVKACKVTDLN FT SLRKLMLIEEFKNCVPERTALYLNEQKVSTVQQAAVLADEYALMHKTVFKR FT TSENVLSGYENEKIPENKVWASSNLKSRKECRYCHNLGHIMSECHTLQRKQ FT ERQESMQPRGSVLEKTLPFSSIPEPDPCFKPFVFDGFVSLNEGDGGRKSVR FT ILRDTGGSQSFILADALDFCDSTACETSTIIQGIEMGYVTVPLHRVWVTSE FT LASGCFEVAVRASLPVKGIDFIMGNNIAGGKVMPVVQVTNLPSYEKQLDVC FT AETFPEVFPSVVTTRAQLKCDRSENQCLNDSFFSQILGDDVLPESVVKDQT FT ATSSVRSDLGTDLPVSREALIDAQKGDPTLQKCRTNAEVSSPCRRNCHYVW FT KDSVLMRKWVSTLSEPENEDDWVYQIVVPSKFRQSVLSLAHDHLWSGHLGI FT NKTYHRVLRHFFWPGLKSDVANYCKSCHVCQVSGKPNQSVPLAPLCPIPAV FT GEPFERVLVDCVGPLPRAKSGCQYLVTVMCAATRFPEAIPLRNITAKSVTK FT ALTKFFTTFGLPKIVQTDQGSNFLSRLFKNSLKALGVSHVVSNAYHPESQG FT ALERWHQTLKSALRKFCVETGNEWDEAVPFVLFAMREARQDSLGFSPSELV FT FGFEIRGPMKMLKEEFLGISPCKTNVIDFVSHTRERLRKACDLAREALSLS FT QQKMKRFYDQKTVVRNFSPGEKVLILIPTPGSALTAWFSGPYLVKSKVGET FT GYIIHTPDRRRKTRFCHVNMIKSYVDRADTETVSVSAGEWKPATDIKEGVS FT LLTCTLPEDVDDGFSMPIEVLNGGLMKNSELLLTLPSQLTYLSHDQVQDIQ FT KLLESFPDLFSDVPRGTAVLTHDIDVGDALPIKQHAYRCPINKREVMKAEV FT DYLLRNGFAVPSSSSWSSPCILVPKADGSFRFCTDFRKINAVTVPDAFPLP FT RIDDCIDNLGAAKYITKLDLLKGYWQISLSERASNISAFVTPDAFLQYTRM FT AFGLRNTPATFQRLMSIVLGDVSNCNTYLDDVVIHSLTWSEHLSSLSDVFH FT RLSAASLTLNLKKCEFAKASVTYLGKQVGNGFVKPVDVKISAVLEYPVPAT FT RRELRRFLGMVGFYRCFCKNFSSVVAP" XX SQ Sequence 5200 BP; 1358 A; 972 C; 1231 G; 1639 T; 0 other; taaatgggag ctcgtcctga attaagtaaa attcttaaaa ttagtgtttt tcagtgacta 60 gcttatgtgg tgagtaaatc atttaccgac tcctgcctgg gtgagttaga gtaagccaac 120 aggctgtgct ttgcgccagt ctggctacgg ctgactgttt attgttttgc tttattattt 180 tagtttatta tttttggagg tattccaggg gagagaaaaa aactcttaat cggtaccctc 240 tgaagactag ctgagtttag ttagttttct ggttaggaag gtgtaggact gggctgggtg 300 tttggtattt tttatttttt ttgctcacaa catatgtttt gaaggattgg caacttttga 360 tttaaatcat ttcatagatc aacctagtgt tgaggaattg gaagtgtgcc gtaaaaacga 420 tttatttttg atcgctcaac attttgaaat tcctgttgtt aaaactcagc gtaaagcaga 480 gataaaggcg tgtataagtg attttctgat tgacaaatgt atctttcctg taacagagac 540 agtggttacc caagctgctg aatcattagt ggatgcagaa ggttcttcac aagtgcgggt 600 tggtgatttg actacaccag gggatgtggt caatgaggag gtatgtccac ctttttccct 660 gcctaagttt gaaccagctt ctctttctga ggcatcaact gaatttcgtt cagattggct 720 tttaaaactt cgtttagcgc gattacaact tgaaacacaa gatcgagctc aagcgagaca 780 agatgattta aaacatcaga ttgaaatgta tcgaattgat gcggatatga aagtgcgtat 840 gcgagagtta gaattacagg ctgctcagac tttatctaaa acacctgtga gatctattaa 900 cgtaaatggt ggcgtagacg tggctactgc cccagctgat gccagtatcg ttgtgaattc 960 tgttccatct gatgagaaac ctgatgcgtt aattccgtca aggtatttca atgtagcaaa 1020 gaatattgca ctagttccat gtttccgtga aaaagaagtt gaagcgtatt ttcaagcttt 1080 tgaacgaatt gcgactgctt tgaagtggcc aaacgaggta tgggctttga tgcttcaatg 1140 caaattaaca ggtaaggcac aggaggtctg tgcatcactt tctgtggagg atagcattca 1200 gaatgaggct atgaaagctg cgattttgcg ggcatatgag ttagttcctg agcattatag 1260 acagcgtttt cgtactacga aaaaatctgc atctcaaact tatgttgagt tttcgcgtga 1320 gaaaggaata ctgtttgatc gctgggtaaa agcatgtaaa gttacagact taaattcttt 1380 gcgaaagctt atgttaatag aggagtttaa aaattgcgtt cctgaacgta ctgctttata 1440 cttaaacgaa cagaaggtca gcactgtaca acaagcagct gtattagcgg atgaatatgc 1500 gttaatgcac aaaactgtgt ttaagcgtac aagtgagaat gtgttgtccg gatacgagaa 1560 tgagaaaatt cctgaaaata aagtttgggc ttcttctaac ttaaaatctc gaaaagaatg 1620 caggtattgc cataatttgg gacacataat gtctgaatgc catactttac aacgaaaaca 1680 ggagcgacag gaatcaatgc agcctcgagg ttcggtttta gagaaaactc tgcctttttc 1740 ttccattccc gaaccagatc catgtttcaa accttttgtc ttcgacggtt ttgtatcact 1800 gaatgagggg gacggtggta ggaagtccgt acgaattctt cgtgatactg gaggatctca 1860 atctttcatt ttagcagacg ctttagattt ttgcgactcc actgcgtgtg agaccagcac 1920 gattattcaa ggcattgaaa tggggtatgt aaccgttcca ttgcatcgtg tgtgggtaac 1980 atcggagttg gcatctggat gttttgaagt ggcagtgcgt gcatcgttac cagtaaaggg 2040 aattgacttt attatgggca ataatattgc tgggggtaaa gttatgccgg ttgtgcaggt 2100 gactaatctt ccttcttatg aaaagcaact tgatgtatgc gctgaaactt ttcccgaagt 2160 ttttcctagt gtcgttacta ctcgagcaca attaaaatgt gatcgttcag aaaatcagtg 2220 ccttaacgac tctttttttt cccagattct aggagatgat gtgcttccag aatctgtggt 2280 taaagatcaa acggcgacgt cgtctgttcg ttctgatctt ggtacggatt tgcctgtttc 2340 tcgagaagct ttgattgatg cgcagaaggg tgatccaaca ttacaaaaat gtcgcacaaa 2400 tgctgaggtt agttcgccgt gccggcgaaa ctgtcattat gtttggaaag actccgtgct 2460 gatgcgaaag tgggttagta ctttgagtga gccagagaat gaggatgatt gggtatatca 2520 aattgtggtt ccatctaaat ttagacagtc tgtgttgagt ttggctcatg accatctttg 2580 gtccggtcat ctcggaatta ataagaccta ccatcgggtg cttaggcatt tcttttggcc 2640 gggattaaaa tcggacgttg ccaactactg taagtcttgt cacgtctgtc aagtgagtgg 2700 aaagccgaat cagtctgtgc cacttgctcc tctctgtcct atacctgctg taggtgagcc 2760 atttgaacga gttttggtgg actgtgtcgg tcctcttcct cgtgcaaagt ctggttgcca 2820 atatctagta actgttatgt gcgccgctac tcgattccca gaagcgattc ctttaaggaa 2880 tataactgct aaatctgtca ctaaagcgtt aacaaaattc ttcacaactt ttggattgcc 2940 aaaaattgtt caaacagacc agggttcaaa ctttctttct cgcttgttta aaaactcact 3000 aaaagcatta ggagtttctc atgttgtgtc aaatgcgtat cacccagagt cgcaaggagc 3060 tttagagcgt tggcatcaaa cgttaaaatc tgctctccgt aaattctgcg ttgaaactgg 3120 aaacgaatgg gacgaagccg ttccttttgt tttatttgca atgcgtgaag cgcgacaaga 3180 ttccctagga tttagcccat ctgagttggt gttcggtttt gaaattcgag ggccaatgaa 3240 gatgctaaag gaggaattcc ttggtatcag tccttgtaag actaacgtga tcgactttgt 3300 ttctcatacg cgggagcgct tgcgaaaagc ttgtgatctt gctagggaag ctctttcact 3360 ctctcaacag aaaatgaagc gtttctatga ccaaaaaaca gtggtgcgta atttttcgcc 3420 aggtgagaaa gtgttgattt taattcctac ccctgggtct gctctgactg cgtggttttc 3480 tggtccttac cttgtgaaaa gtaaggttgg cgaaacgggc tatataatcc atactcctga 3540 tcggaggaga aagactcgtt tttgccatgt caatatgatt aagtcctatg tcgaccgtgc 3600 tgatactgag accgttagtg tttctgcagg tgagtggaaa ccagcgactg acataaagga 3660 gggtgtgtct ttgctgactt gcacgttacc cgaggatgtg gatgatggct ttagtatgcc 3720 gattgaagtg ttaaacggtg gattaatgaa aaactcagaa ttgttactaa ctttgccttc 3780 tcagcttact tatctgtccc atgatcaggt gcaggatata caaaagcttt tggagagttt 3840 tccagatttg ttcagtgatg tgccccgtgg taccgctgtc ctgacgcatg acatcgatgt 3900 cggagatgca ttaccgatta agcaacatgc ttatcgctgt ccaataaata agagagaggt 3960 aatgaaagcc gaagttgatt atcttttgcg aaatggtttt gctgtaccaa gtagtagttc 4020 gtggagctct ccatgcatat tagtgccgaa agctgatgga tctttccgtt tttgtacgga 4080 ttttcgaaaa attaatgctg ttactgtgcc tgatgcgttc ccacttcccc gtattgatga 4140 ctgcatcgat aaccttggtg ctgcgaaata catcacgaaa ttagatctct taaaaggtta 4200 ttggcagata tcactatcgg agcgcgcatc taatatctct gcgtttgtga ctccggacgc 4260 atttctacag tatacgcgga tggcattcgg tttgcgtaac acaccggcaa cttttcagcg 4320 tttaatgtct atcgtgttgg gtgatgtttc aaactgtaac acctacttgg acgatgtagt 4380 aattcactct ctgacttggt ctgaacatct ttcatctctt tctgacgttt ttcaccgttt 4440 atctgctgcc tccctcacct tgaatctcaa aaagtgtgag ttcgctaaag cttctgttac 4500 ctatctaggt aagcaagtgg gaaatgggtt tgtgaagccc gtagatgtaa aaatctctgc 4560 ggtcttagag tatcctgtac ctgctacgag acgagagttg cgtcgatttc tgggtatggt 4620 gggattttac cgctgttttt gtaaaaactt ttcaagcgtg gtagctccgt aaacgcgatt 4680 gtgcagtcca aaggttgagt tttgttggac tgattgccaa caggcatttt tgtctgcaaa 4740 atcccttctt tgcagtgctc ctgttttctc cgctcctgac gtgtctcgtg cgttccagct 4800 ggaggtggat gctagtgccg tgggagcgtg agatgttctt ctccaggagg gtgctgacgg 4860 agtggtgcat cctgtatcct atttttctgc taagtttaat cgacatcagt taaactattc 4920 caccatagaa aaagaaaccc tggcattact tttagcgttg agacattttg atgtttatgt 4980 ttcaggtagt ttatatccag ttattgtcca tactgaccat aatcctcttg tttttctgag 5040 cagaatgtac aaccataatc agcgcttgat gcgttgggca ttaatggtac agccatacaa 5100 tcttgaaatt cgtcataaaa gaggaactga aaatgtggtt ttccaatggg aaaacaagga 5160 taggagaagt gaaaactgtt tctttcttca gggggggagg 5200 // ID Gypsy158-I_DR repbase; DNA; ZEB; 4391 BP. XX AC chr5; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy158-I_DR; KW Gypsy158-LTR_DR; Gypsy158_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4391 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1577-1577 (2008). XX DR Genome; chr5; Positions 28397284 28401674. XX CC Positions [3261-3713] - Integrase core CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1671..4193 FT /product="Gypsy158-I_DR_1p" FT /translation="MRKPDGSIRICGDYKLTVNRASSVEQYPIPKVEDLFT FT QLAGGQKFSKLDMSHAYQQIMLDESAKKYVTVNTHKGLYTYCRLPFGVASS FT RAIFQRTMEGILQNLTHVVVYLDDILVTGANDEEHLRNLEEVLRRLKSSGL FT RLKRSKCEFLGEEVIFLGHRISAAGVQPVAEKVQAIQEAPIPQTVSELKAY FT LGLLNYYHKFLPSLSTVLAPLHRLLKKETKWTWGSEQEESFVKSKELLHSS FT ALLVHYDPTKPLILACDASPYGVGAVLSHRMEDGTDRPIGFVSRTLNAAEK FT NYSQLDKEGLAVLFGVKKFHTYVYGRPFTIVTDRKPLIALFNELREVPQMA FT SPRIQRWAVTLGGYEYTIVYRAGREHQNADGLSRLPLSEKGVETPAEEERV FT LLLDENDVSLVKAEQVEKWTVKDPVLARVREYVLRGWPKTVEDPAFGVYLK FT KQDELSVQGGCVLWGARVVIPPQGRAAILRQLHSGHPGITRMKGLARSYVR FT WPQMDAAVEGLVKACTRCQESRNTPPCAPLHPWEIPQLPWRRIHIDYAGPW FT QEKMFLILVDAYSKGIEAFPLNISTSGVTIQCLRQSFSQHGLPEIVVSDNG FT SCFMSKEFQEFMSRNGIKHITTAPYHAASNGLAERAVQTFKSLMKKSTGDS FT IEAGLARVLFSYRITPQSTTGKSPAELLCGHKLRSTLDLVHPDFKSQVQNK FT QLKQKWYHDLHAKERHVSVGENVYTKNFSLGPTWIPGTVQKRTGPLSYTIA FT LGNGQVVHRHIDQVRTRQVASPADTAVGLDTSLQELEEWPLPESGNSVGQQ FT ASGMTPKVTVSGTVKAEEQPELRRSARTKKLPSYLRDYA" XX SQ Sequence 4391 BP; 1343 A; 817 C; 1209 G; 1022 T; 0 other; tctgcctcca tattttactg cctcatagaa tattacataa ttggcgacga ggatttaagg 60 agtggcgtgt cagaggacat aattgattga ttgctgagaa atgctggagt aaacagatcg 120 ctcgacatca ctggcagagt ataccggtga ttatcaagac aaaatgactt cactggtggg 180 gacggtgagt ccttttgata atgcatcaca gttatgggat gaatatatcg aaatgctcga 240 atattttttc caagccaatg gaattgacag tgctgagaaa aagaaagcgg tgctgttaag 300 tggtgttggg gcatcgacat atagtttatt aagaagtttg atctgccctg aaaagccagg 360 agacaaaacc tatgaggagc tggtgactgt gctaaagact cattacaatc ctaagccaag 420 tgagatagtg ccaagaatta agtttaactc acgtacacgg aaaaatggtg agactgttgc 480 tgattttgtg gcagaactga aaaagttagc acagcactgt gagtacgggt ctacacttcc 540 acaaatgctg cgagacatac tcgtctgtgg agtgaacgac gaccgaatgc agagaaggct 600 gctatcggaa gtggagttaa cttttgaaaa ggcactgaca atctgccagg caatggagtc 660 tgcaaacaag aatgtgagag atttgcaagg attgttaatg gaggacacag cacaaaccat 720 gaagtcattt aaaggaccgg cagcagtgca tagagtgtac acatttgaga agaagggtcg 780 acaaaaagac acggtgtgtt acagatgcaa aggacaacat gcaccagagg aatgcaagtt 840 tttgaatgag ctctgccata aatgtggaaa gcggggacac ataaaacgcg catgcagggc 900 aaaacaagct gctggggggg ggggtagggg gtcaaaacac tccatttaaa ggacaaaaag 960 gagagaaaaa agagaaaggt aggaggactc atctcgttag agaggaggaa agtaacgaga 1020 gggaggacac cagtgatatt aacacaatat acagtgtatc ccaagtccag ccaaaagtgg 1080 gccctataac acagaaagtg agtgtaaatg gcatggaagt ggactttgaa gtcgacacgg 1140 gttgtggagt gacaattatc agcagagggc aatattctaa gttgtgttct aagttgtgga 1200 aaaaagcaga catgccagaa tggagaccct gcagtatgaa actaaagacc tatacagggg 1260 aaaggatgga agtgctgggt caagctacag taacggtgga aatgcaaaat gtaaaaaaag 1320 aaatgacaat ggtcgttgta gatggagagg ggcctaactt gttgggacga agctgactga 1380 aagactttgg gctgttgcca cagctggtaa atcaagtaac agcagtgtcg acatggaata 1440 tagcagatgt gttagataga cacgcagagg tgtttaaaga agagctgggg cagctcaaag 1500 gcaccacagc taaaattcat gtcaaccccg aggcccaacc acgattcttt aagcccagac 1560 gtattccttt tgcagtcaag cccttggtgg aggctgagct ccagagactg gttgaggaga 1620 aaattattga accagtgcag tttgcggagt gggcagctcc tatcgtgccg atgaggaaac 1680 cagatggctc tattcgaatt tgtggagatt ataaactcac agtaaataga gcatccagtg 1740 tggaacaata tcccattcca aaagtggaag atttgttcac tcagttagct ggaggtcaga 1800 aattttccaa actggacatg agccatgcgt atcagcaaat tatgctggat gaaagtgcaa 1860 agaaatatgt cactgttaac acacacaagg gactgtacac atactgtaga ctaccttttg 1920 gggtggcatc aagtcgagcc attttccaac gcaccatgga gggaattctc caaaacttga 1980 cgcacgttgt ggtgtatttg gatgacatat tagtcacagg agctaatgat gaagagcatc 2040 tgaggaatct ggaggaggtt ctgaggagac taaaatcgag tggattgaga ctaaaaagga 2100 gcaaatgtga gttcctggga gaggaagtga tattcctggg ccaccgcatc agtgctgcag 2160 gcgtgcagcc agtcgcagaa aaggtgcaag cgatacagga agcccccatt ccacagacag 2220 tgagtgaact gaaggcttat ctaggcttgt taaactatta tcataagttt ctgcccagtc 2280 tgtccacagt gttggcaccc ttgcacagat tgttgaagaa ggaaacaaaa tggacatggg 2340 gctctgagca ggaagaatct tttgtgaagt caaaagagct tctccactca tctgctttac 2400 tagtgcacta cgatcctacc aaacctctta tcttggcgtg tgatgcttct ccatatgggg 2460 taggggctgt tttgtctcac aggatggaag acgggactga cagaccaata ggatttgtat 2520 cacggacgtt aaatgcagca gaaaagaatt actcgcaatt ggataaggag ggtttggccg 2580 ttctgttcgg ggtaaagaaa ttccacacat atgtgtatgg cagacccttt accattgtga 2640 ctgatcgcaa gcccctgata gcactgttta atgagctcag agaagtgcca cagatggctt 2700 ctccacgtat tcaaagatgg gctgtgactt taggggggta tgagtacact attgtgtatc 2760 gagcagggcg tgagcatcaa aatgctgatg ggctgagccg gttgccactg tcggaaaaag 2820 gagtggaaac cccagcagag gaggagagag tgctactgtt ggacgagaat gatgtttcat 2880 tggttaaagc agagcaggtg gaaaaatgga ctgttaaaga tcctgtgttg gctcgggtaa 2940 gggagtatgt gcttagagga tggcccaaga cagtggagga tccagctttt ggagtatacc 3000 ttaaaaaaca ggatgaactg agtgtccagg gaggatgtgt attgtggggt gcccgcgtag 3060 tgattccacc ccaaggccgg gcagccatac tcagacagtt acacagtggc catccaggca 3120 taaccagaat gaaaggactg gcacggagct atgtaaggtg gccccagatg gatgctgctg 3180 tggaagggct ggtaaaagca tgtaccagat gtcaggagag caggaacacc cctccttgtg 3240 cacccttaca cccttgggaa attccacagc tgccatggag aaggatccat attgattatg 3300 ctggtccgtg gcaagaaaaa atgttcctga ttctggtgga tgcgtactct aaagggatag 3360 aggcgtttcc tttgaatatt tctacatctg gtgttaccat ccagtgcctg agacaaagtt 3420 ttagccagca tgggttaccg gaaatagtgg tgtcggacaa tggcagttgt ttcatgagta 3480 aggaatttca ggaattcatg agtcgtaatg gcatcaagca cataaccacg gcaccatacc 3540 atgcagcttc taatggtttg gctgaacgag ctgtgcaaac gttcaagagt ctgatgaaaa 3600 agagcacagg agattccatt gaggccgggc tagccagagt gttgttcagt tatcgaatta 3660 caccacagtc gactacggga aaatcacctg cagagctgtt gtgtggacac aaactgaggt 3720 ctaccctgga tcttgttcat cctgacttta aaagtcaagt gcaaaacaag caattgaagc 3780 agaagtggta tcatgatcta catgcaaaag agagacatgt gagtgttggt gaaaatgtgt 3840 ataccaagaa cttcagttta ggacctacct ggattccagg aacagtgcag aaaaggacgg 3900 gacccttgtc gtataccatc gcacttggta atggccaggt agtgcatcgt cacattgatc 3960 aggtgaggac cagacaggtg gcatctccgg ctgacacagc tgtggggttg gatacttcct 4020 tgcaagagtt ggaggagtgg ccattgcctg aatcagggaa cagtgttgga cagcaggcaa 4080 gtggaatgac acccaaggtc actgtgtcag ggactgtgaa ggcagaggag cagccggaac 4140 tcagaaggtc tgcaaggaca aagaagttgc ccagttactt aagagactat gcttagagta 4200 gtggtgagtt ccccagccac agggcaagaa tgcgtcctgg aactcactgg gatggaggca 4260 gtctacccga agtccctagc ttagactgtg taaaaataaa ataaaaaaat tttttaaaaa 4320 aatcataatg ttgttagtat atttgattaa gttgttaagg gactattgtt atcaggggaa 4380 aggggatgta g 4391 // ID Kolobok-N2A_DR repbase; DNA; ZEB; 1355 BP. XX AC . XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-JAN-2009 (Rel. 13.12, Last updated, Version 2) XX DE nonautonomous Kolobok transposons from zebrafish- a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-N2A_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1355 RA Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 8(12), 2166-2166 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1355 BP; 399 A; 311 C; 269 G; 371 T; 5 other; agagccccta ttatacatta aaaagggtca tattttggtt ttaggggtct ccaacaacag 60 gctgatatgc atgcaaggtc aaaaaacact ttcattgtct tataatatgc atttattttt 120 acctaattat cccagcgact cccatatgaa tcgttcagcg attcatttgt tcccaaaccc 180 ctccttagcg cgaagctaat ctgcgctgat tggaccgatg acagyctgtt gtgattggtc 240 gacagcgttc agcgtgagac agagtgaaat gcccagcacg gcttatcaac aatattgaag 300 tagtcasagt gcatagtgga cgtgtgagca taccttccat ctcccttatt tcagacccat 360 cttcagccac ccacccattt gttatttatc ccggaaaact ccagtaattt caaccaacag 420 caagctaaac tattttgaam cttgaccgcc gcacgtgtgt aggaacagct gacggtggcc 480 atagcaacga caaacggcag tggaacgcga gctcacaaac gcatttaaat ccgtaaacaa 540 agcagcacgc gtcgcgtttt caacgtggct ttagacgcga tatgagaata taaagagtta 600 accagataca gtacaagcgg ttacaagtaa caaaacacaa ttaaatacat aatttgcaag 660 ctagagaaaa caaggaggca accattttaa tcgcacttac ttacacttgt gaaatggagg 720 aagaaactga tccatgaact gtgtactgta aagttcctgt caagctctga caaagtccca 780 tacatcaata gtcttttctg atccttcctt taacaaacgg ccgatgaatc ccccgttgta 840 agcgtgtaga ttgctgaagc actcaagtag cacaaggctg ggctataatg ctgaggtatt 900 tttgcaaaaa taaacytcaa ccactgactc ttcacagcct catctttggg tagagaaaat 960 aacactaact ttcccctcac acttccaata atatccagct gagcacagcg tcttcacgac 1020 wtgattcaac cgggatcagc catcttgtgt tgtagtcttt ttctttgcgt actgtgtttg 1080 tgggcggggc tgcaggttta aattttcccg ggtttgcgcg cgcaacaaat gggcggggct 1140 taagtttcgt atcgacgtca cgccgaaacg gctaaagact cgttatcaag atgattcatt 1200 tgaaccacta tgagtcgact cttttataga tgaatcaata gttttaaaca ctgtgcactt 1260 tcagatttaa gccttagctg gatatttcac ttcacttaga gctgtgttac acactacatg 1320 gaaggtcatt ttcaaaaacc cataataggg gctct 1355 // ID SINE3-1 repbase; DNA; ZEB; 590 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE SINE3-1 is a SINE retrotransposon - a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; 5S rRNA; Dr000031; KW Dr000339; Dr000397; SINE; SINE3-1; polIII; conserved; CNE. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 582-2 RA Bao Z.; RT "Dr000031, Dr000339, Dr0003979, unclassified and uncharacterized RT repeats."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-590 RA Kapitonov V.V. and Jurka J.; RT "SINE3-1, a novel class of SINE elements that utilize 5S rRNA."; RL Repbase Reports 2(6), 22-22 (2002). XX DR [2] (Consensus) XX CC This family represents a novel superfamily of SINE elements. CC Its 5' ~120-bp region is derived from 5S rRNA that encodes the CC polIII internal promoter. SINE3-1 elements are CC not flanked by target site duplications. 3'-termini of CC SINE3-1 elements are composed of minisatellites. Therefore, CC retrotransposition of SINE3-1 may depend on CR1-like non-LTR CC retrotransposons. CC About 10000 copies of SINE3-1 are expected to populate the CC zebrafish genome. They are ~90% identical with the consensus CC sequence. This family is composed of many minor subfamilies. CC Members of some young subfamilies are ~95% identical with each CC other. CC The SINE3-1 consensus sequence [2] replaces the unclassified CC Dr000031, Dr000339 and Dr000397 repeats [1]. XX SQ Sequence 590 BP; 133 A; 159 C; 156 G; 142 T; 0 other; tgccacagcc atatcaccct gcagcccaag accggttact cactgaagct aagcagggct 60 gagcctggtc agtacctgga tgggagacca catgggaaaa ctaggttgct gttggaagtg 120 gtgttagtga ggccagcagg gggcgctcaa cctgtggtct gtgtgagtcc taatgcccca 180 gtaaagtgaa ggggacacta tactgtcagt gggcgccgtc tttcggatga gacgttaaac 240 cgaggtcctg actctctgtg gtcattaaaa atcccatggc acttctcgta aagagtaggg 300 gtgtaacccc ggtgtcctgg ccaaattccc tccatcggcc cttacgcatc atggcctccc 360 aatcatcccc atccaccgaa ttggctctat cactgtctct ccactccacc aatagctggt 420 gtgtggtgag cgcactggcg ccgttgtcct gtggctgccg tcgcatcatc caagtggatg 480 ctgcacactg gtggtggtgt ggagagaccc cccctcatga ttgtgaagcg ctttgggtgt 540 atggccatac acaataaatg cgctatataa atacacatta cattacatta 590 // ID Gypsy149-LTR_DR repbase; DNA; ZEB; 306 BP. XX AC chr4; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy149-I_DR; KW Gypsy149-LTR_DR; Gypsy149_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-306 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1560-1560 (2008). XX DR Genome; chr4; Positions 22535231 22535536. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 306 BP; 83 A; 55 C; 60 G; 108 T; 0 other; tgtgtgatac accattgtgt atccacagta gtaatgtttg ttttgtcatt aatgtatcta 60 tttatttatg tatatactgt aaaagtcatt tattatcact actaatgact gggtttaaaa 120 caaaataaca tcaacagttg agaaagcatg cgctgtgggt taccatggcg atacttgttg 180 tgtttacttc agatgctact aagctgctcc tgaggaaaga cactgaagtt ttcttgtgat 240 tttgtttctc ctgataattc aggcacttta atctgcctac gagttccgcg gccgggaccc 300 gtaaca 306 // ID Gypsy-30-LTR_DR repbase; DNA; ZEB; 256 BP. XX AC . XX DT 28-FEB-2007 (Rel. 12.02, Created) DT 02-MAR-2007 (Rel. 12.02, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-30_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-30-I_DR; KW Gypsy-30-LTR_DR; Gypsy-30_DR; LTR; endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-256 RA Dib M.R. and Naveira H.F.; RT "Gypsy-30_DR, a family of LTR retrotransposons from zebrafish."; RL Diect Submission to repbase Update (28-FEB-2007). XX DR [1] (Consensus) XX CC Gypsy-30-LTR_DR is a long terminal repeat of the Gypsy-30_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-30-I_DR. XX SQ Sequence 256 BP; 74 A; 38 C; 56 G; 88 T; 0 other; tgtaagatta aaaatatgtt cagatagatg gtagttaaag tgtggaatct gcgttactgt 60 tcgaaatgtt agatttactt ggatcttcat ctttgtacat tctgattggc tgaatgccag 120 gttggttgtt acattttgta agcgctgact ggtgaaaaag aagtaaagac aggtcacgtt 180 tcacaagccg tctccggtga tttattttat ttactttgag agtgagtcac gcaaaatact 240 cgaaccctat attaca 256 // ID HARBINGER2N_DR repbase; DNA; ZEB; 1018 BP. XX AC . XX DT 08-OCT-2003 (Rel. 8.09, Created) DT 08-OCT-2003 (Rel. 8.09, Last updated, Version 1) XX DE HARBINGER2N_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW 13-bp TIRs; 3-bp TSD; Harbinger superfamily; HARBINGER2N_DR; KW HARBINGER2_DR; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1018 RA Kapitonov V.V. and Jurka J.; RT "HARBINGER2N_DR, a young family of nonautonomous Harbinger DNA RT transposons from Danio rerio."; RL Repbase Reports 3(9), 179-179 (2003). XX DR [1] (Consensus) XX CC HARBINGER2N_DR copies are flanked by 3-bp target site CC duplications CC (usually CWG) generated upon their integration in the genome. CC It has perfect 13-bp terminal inverted repeats similar to other CC HARBINGERs. There are approximately 100 copies of HARBINGER2N_DR CC harbored by the zebrafish genome. The consensus sequence was CC reconstructed based on 25 copies which are ~0.6% divergent from CC the consensus sequence. HARBUNGER2N_DR elements were derived CC from the autonomous HARBINGER2_DR transposon. XX SQ Sequence 1018 BP; 332 A; 179 C; 155 G; 352 T; 0 other; ggctgcgttt cccgataacg attgatctta gcacttaaga gcgttttcta cgagtcattt 60 tgcgaacgtt cgttattgtt tcacgtgcgt ttcccaaaaa tgcacttaac acaattgcac 120 gtagcccagc tttaagtgca acttaggagt cgctatccgt ttgttaagtg ctgaaatgtc 180 acgctataga atggctcgtt attgttgcac atgttatagc aatccatata attcttcttc 240 tacttgtgtg aatgtatatt caactcgaat aacataaaaa aaaaatattt ttgagccagt 300 ttaaaaacat aaattaactg gaaatgtcgt actgtaaaaa cactgtcttg ctccaatctc 360 gcataaaact aattctaaca gtccttgccg gaaatgacat cagcatcatt ttcgatattt 420 atatgaaacc ttaattaagt agaattattt atcaatttct ttatcaaaaa ttgcttatct 480 cacatcaaaa taaataaata agtaaataaa taaataaata aataagctct tacatttatt 540 tttgaaaagt ttagcctaat tatccccacc tgatgtgcat ttgctaatag gacaaaaata 600 tagcctacta ccctcgaaat gtccatatac aacagtgtaa tttttttttt cattctaaca 660 tttattatgt aatttttatt tttttcttta ttcattatta tttgtatatt gaattgtgta 720 atgtgtagaa aatataaatg aatagagata tacgtacaaa taaaatgaat aatgactttg 780 aagagaagta aattgtaggt gcaatttgcc ggttgtccag caggtgccct cataactctg 840 tctccttacg atgcacttaa ggctttacga ttactccaga gcactcgtag atccactaag 900 attttcaagt gctacttaag ttacgatgct tttgggaaac agaccgtaat attaagatca 960 gtcgtacgat catttctacg aacttcttag gcttacgatg cttttgggaa acgcagcc 1018 // ID IS4EU-2_DR repbase; DNA; ZEB; 5366 BP. XX AC . XX DT 29-APR-2007 (Rel. 12.04, Created) DT 01-MAY-2007 (Rel. 12.04, Last updated, Version 1) XX DE A family of autonomous IS4EU transposons - a consensus. XX KW ISL2EU; DNA transposon; Transposable Element; IS4EU; KW Interspersed repeat; IS4EU-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5366 RA Kapitonov V.V. and Jurka J.; RT "IS4EU, a novel superfamily of eukaryotic DNA transposons."; RL Repbase Reports 7(4), 146-146 (2007). XX DR [1] (Consensus) XX CC DNA transposons from the IS4EU superfamily are characterized by CC the TA target site duplications. These transposons are wide CC spread in metazoans, including fish, frogs, lancelet, sea CC urchins, sea squirts, insects and cnidarians. Autonomous IS4EU CC transposons encode two proteins: the IS4EU-TR transposase, which CC is similar to the IS4-like bacterial transposases, and the CC ISEU-EX DNA exonuclease. Based on the conservation of both CC proteins in highly divergent transposons, is is clear that they CC are necessary for transpositions. IS4EU-2_DR is a consensus CC sequence of a young family of autonomous IS4EU transposons that CC were active in the zebrafish genome in a last few million years. CC The IS4EU-2_DR transposon is characterized by 13-bp terminal CC inverted repeats, TA target site duplications, and it encodes two CC proteins: (i) the 475-aa transposase, IS4EU-1_DR1p, composed of CC the THAP DNA-binding domain and catalytic "DDE" domain, which is CC conserved in all IS4EU transposases, and (ii) the 515-aa CC IS4EU-1_DR2p exonuclease. Questions and comments send to Vladimir CC Kapitonov. XX FH Key Location/Qualifiers FT CDS join(76..502,1172..2169) FT /product="IS4EU-2_DR1p" FT /note="IS4EU-TR transposase." FT /translation="MAAQSRSKAFCCVPKCSNSKQKQPYMSFHEFPTNDMQ FT RQKWVHAIRRDEGPKFDIRRASTLVCSQHFTAADFIQGSCRLKPGVIPSRF FT QWNNFHVQPQKQSAFERSSARLGVDVRAPKTVDLQSNDSEVIVKDHDYAAY FT PPPGALDEALEYIAELEARLKKISLETPTVFSRFCVSDHTIRYYTRFPSQE FT VFQVFWESVCPSATNLMYWTKAQRMGQEASPTPSPARKIQLIDELFMYCCR FT VAAGLRERVIADIFGVSMATVSRTVITWANYLYFVLGSVPIWMSREQISSS FT MPEKYSSFSPNVRVILDCTEIFCESPTSLTLHSEIFSNYKSTTTFKGLLGV FT APCGAVTFISRLYTGSISDKEITRRSGILSLLEPGDEVMADKGFTIGDMIT FT NVGATLIIPPFKRDRQFSKTDCQKTQTIARLRILVERVIRRVKENHIFDSA FT VPLSLSGTIDQIWHNCCFMVNYQGPMFLEH" FT CDS join(5276..5123,4984..4881,4483..4120,3826..2904) FT /product="IS4EU-2_DR2p" FT /note="IS4EU-EX lambda-like exonuclease." FT /translation="MERFFTTSLSTLPQIKFEDINRIVAENSLTTGSRLKK FT GYKFFVEEFIHGYQVSNVVDSKVVVRARCYRSMRKNEEPHTLEVICNLPMF FT LESHGEPSVTAYACSCAAGKGLCNHLTALLYQTAHYVQLHLKSVPPTVACT FT SEQQRWHRPRTQVSVMFPVFQHLQIFLVDTFFVYIKGVSPEAVSQLVVQKP FT CSLGKTGVKSTLYKAHTGPLPDPHILASGAKLNQIDPKPLLAHILGGMSEM FT ELVSSQFGPLPRGSPLSYHYPLTAADQPTVDFPNLPVCGSQFSTNLNFVPT FT HHQSFHLQSIQVSHILSGQIEHNTRLQSNCAAWAHARQPRVTASRFREVCY FT VVGESASQSLAARILKGTKQTSAMKRGLELEPEILKQYSETKRVTVLPCGF FT VVHPDAPHLGASPDGRVYDPSEIFPFGLVEVKATSAESIGQASFIKMQKGQ FT AKLKETHKYYWQVQGQLAVTGLQWCDFVTDTQSDITIQRIWRDDVFITSMK FT EKLDMYYYYVYMDKYLSMV" XX SQ Sequence 5366 BP; 1647 A; 1035 C; 1040 G; 1644 T; 0 other; ggccaatttc acacttccgc attcgttact tccggttttg accgcccctg ggtaaactta 60 tttccggtaa aaacaatggc agcgcagtcg aggagcaagg cgttctgctg tgtaccaaag 120 tgcagcaatt caaagcaaaa gcaaccatac atgagttttc atgagtttcc cactaacgac 180 atgcaaagac aaaagtgggt acatgcaatt cgacgagacg aagggccgaa gtttgatatt 240 agaagggcaa gtacgttagt ctgcagtcaa cacttcactg ctgctgactt catccagggt 300 agttgtcgct taaaacctgg agttattcca agtcgctttc agtggaacaa tttccatgta 360 cagccacaaa agcagtctgc ttttgagagg tcaagtgctc gtctaggagt agatgtgcgt 420 gccccgaaaa ctgtggatct gcagagtaac gactcagaag ttatcgtcaa ggaccatgac 480 tacgctgcat acccccctcc tggtaagtat ttgtattaaa gtgtaactta gatgcaaaat 540 tatacacact agggctgcat gattatctcg attagggttg aagtcaattc ataaagtaca 600 attcacctaa aatgtcattt ctgtcctcat gtaatgatgt aatcacagcc gcgaggtcac 660 acgtgagctg tcgcactgtt tgtttacatc agaaaatgat gcgtttgcgt gcagtgaaca 720 gaacctgtaa attcagtgaa taacaggcaa tagttttaaa ataatgctca gtttgttgca 780 caaggcgaca actttgctcc ttaatcagga tatgattgat ttgcatgtgt tttgtttctc 840 acagtcgagg cacattacat ggtttaacag tgccataaag ataaattaaa ttcactaacc 900 ctataattca tcaactatac agttcttaac tataaatcag cactgatttg atttcagctt 960 aatgaatgcg cagttttaga gctgctttat aaatgtatta aggtgtttgc aactctaaat 1020 tcactaaatt tgtaaaggtt ctttggaaaa aaacattgta taaagtgcta caaaattaaa 1080 agtaacttga gtagtttaat gtttttactt agatgttttt attcttatca ttgatattat 1140 tattaaacct ttgtaatttg tcattttgta ggtgctttgg atgaagcctt ggaatacata 1200 gcagagctgg aggctaggct aaaaaaaatt tcactggaga caccaactgt gttcagtagg 1260 ttttgtgtgt ctgatcacac aattcgctac tacaccaggt ttccctcaca ggaagtattc 1320 caggttttct gggagtctgt ttgtccatct gccacaaacc ttatgtactg gacaaaagca 1380 caaagaatgg gtcaggaagc ttcccctaca ccaagtcctg cacgcaaaat tcagctcata 1440 gatgaactgt tcatgtactg ttgccgcgta gctgctggcc ttagggagag agtaattgct 1500 gacatctttg gtgtaagcat ggccacagta agcaggacag tcatcacatg ggcaaattat 1560 ctttattttg ttttaggctc tgttcccatt tggatgtcaa gggaacaaat aagctcttcc 1620 atgcctgaga agtatagttc attcagtcca aatgtcagag taatcctgga ctgtacagaa 1680 attttctgtg agagtcccac atctctgaca ctccactcag agatattctc taactacaag 1740 agcacaacaa catttaaagg actcttggga gtggctccat gtggtgcagt aacattcatc 1800 tctcgtctgt acactggctc aatctctgac aaagagatta caaggaggtc tggcatcttg 1860 agtctacttg aacctggaga tgaggtgatg gctgacaaag gctttacaat tggggacatg 1920 atcaccaatg ttggtgctac actaataatt ccaccgttca agcgtgaccg gcagttcagc 1980 aagacagact gtcaaaaaac acaaaccatt gcacggctca gaattctggt ggaaagagta 2040 atacgacgag ttaaagaaaa ccacatcttc gattctgctg taccactaag cttgtcagga 2100 acaattgacc agatctggca taactgctgc ttcatggtaa attaccaggg accaatgttt 2160 ttggagcatt aaatgagcca gatttctgca tcttctgtgg cctttacctt gaagccagat 2220 tgtctggcta gtcagttaat tttcagttta gtttcagttt aactatcctg aaatgtgtaa 2280 aatagtagaa atcaaaccca tcagagtaat ccatatacaa agaacggaaa acatgacaac 2340 cacataaaat taactcagtg tttgtttaga tacatttgaa ggattctgtg ctgatttgtt 2400 acattgttct gtttgttatt attaaaatag ttacaagtta atgtaatttt atgatttcct 2460 cttaactttt acaaaaaata ttacaatgta tttccccctc tgtgctcaga ggcagatttg 2520 ttacttttat caagcaaaaa tgagcagtct cctgaaaata acaaggacag tttgaataaa 2580 cttttttttt aataattgct attggtcaaa ataaacatat catacagttg ttctgttgtc 2640 cagttgttat ttgacacatt ttattgttag tttcagatat aatacagaat tgtaaacctg 2700 aactgataca acatgaaatt tttacaatta taatagtagt ttaaaaatag acaaagttgt 2760 aaaacaatat aaaattaatt actatacata tatacatagc gtggtagcaa ttttaaaggg 2820 ttagctcaca agtgcaagta tcaaggtgag taattaaaat tataaatttt gtgtgaatta 2880 accctttgct acagcaatgc ttaaaccata gatagatact tgtccatata tacataatag 2940 taatacatgt ccaacttctc cttcattgat gttataaaga catcatctcg ccaaatcctc 3000 tggatggtta tgtctgactg tgtgtcagtc acaaagtcac accactgcag accagttacg 3060 gcaagctggc cttgaacctg ccaatagtat ttgtgtgttt ctttgagttt ggcttggccc 3120 ttttgcattt taataaatga agcctggcct atactttcag cagatgtagc ttttacttcc 3180 actagaccaa atgggaagat ttcagatgga tcatatactc tcccatctgg actggcacct 3240 agatgtggtg catctgggtg aacaacaaag ccacaaggta aaaccgtaac cctctttgtt 3300 tcagaatatt gtttcaaaat ttctggttca agttcaagtc cacgtttcat agcacttgtc 3360 tgctttgtgc cttttagtat acgtgctgct aatgactggc tagcagattc accaaccaca 3420 tagcagactt ctctaaacct gcttgcggtg actcttggtt gacgcgcatg tgcccaggca 3480 gcacaatttg actgtagtcg agtattgtgc tcaatctgac cggataagat gtgtgacact 3540 tgtattgact gcaaatgaaa agattggtgg tgagtgggaa caaaatttaa gtttgtacta 3600 aactgactgc cacacactgg caggttgggg aaatcaactg taggttgatc agcagctgtg 3660 agaggatagt ggtacgaaag aggactgccc cggggtaatg gaccaaactg cgagctgacc 3720 aactccattt ctgacatgcc tccaagtata tgagccaaca aaggttttgg gtcaatctgg 3780 tttagttttg ctccagaagc caggatgtgt ggatctggaa gtggacctaa agaaataaat 3840 gcttgttagt gttaacttat aacatatgaa acaacaaatt actaaatgga gtctgtcagt 3900 gtgtatgtaa ctttatgatg aaatacatct gtcagcctct ctctgttctt tttgtaaata 3960 atgtatgtga atctatattt agatgtgtgt ctgtacattt ctatgtgtgg gtatgtcttc 4020 tgtatgcata tataggatgt ttatatattg atatacatgt atgcgtttct gtttgtcaag 4080 agtatggata cattttgcat acacacagtt tattattacc tgtgtgagct ttgtacagcg 4140 tagactttac tcctgtcttc cccaaagaac atggtttctg aactacaagt tgactcactg 4200 cctctggact gacaccctta atataaacaa aaaatgtgtc aaccaggaat atttgcaaat 4260 gctgaaaaac gggaaacatt acacttacct gtgtccttgg acgatgccat ctttgttgtt 4320 cacttgtgca agccactgtt ggtgggacag attttagatg aagctgcaca tagtgggctg 4380 tttgatacaa cagagctgtt agatggttac ataacccttt gccagctgca caactacatg 4440 cataagcagt gacactaggc tctccatgag actccaaaaa catctgtaaa aaagaacaca 4500 aaagttattt aaaactacaa gtcattaaca atattcaaca ctgaaaaaaa tgctgttgta 4560 ttaacaccgg aactaccaga attgtctaac tacttgaaat gccataccag tcattcagac 4620 ccgttctcat atttaatgaa agcctgagat ttatagatat gtgtgttttt aatttaaact 4680 aaattattca attgcaaata tagcagtcaa aatgaccgtc tcggtagttc tagtgtcaac 4740 actgtaaaat tcattagcct gcaaccgcaa aagctatgta tgttacacca gaacacttaa 4800 aaactgcaac agcagttata ttaaaaataa tagaacttgc tatcagaaca atgatagcta 4860 atctgcaatc ataagccaac cggcagatta catataacct ccagtgtgtg gggttcctcg 4920 tttttcctca ttgaacggta gcatctagct ctgactacca ccttgctgtc gactacattt 4980 gacactaagc atgcataaac aatgaaaaaa gtcaacatat ctgtaatcaa catatatcat 5040 ataacttatt tagaaacgac tgtaacttta ttgccgctaa cgttagcaat atgctagcga 5100 ttgtagctaa ttcaaaactt accttgatac ccatgaataa attcttcaac aaagaattta 5160 tatcccttct tcagtcttga ccccgtggtt aaagaatttt ctgcaacaat tcggttgatg 5220 tcctcgaact taatttgcgg cagagtggag agagatgttg taaaaaatcg ttccatgttt 5280 tctcagcata gaaatcgttt gttgtaactg tagcggaagt aaaattaccc agcggcgacg 5340 agcttccggt tcttgtgaaa ttggcc 5366 // ID Gypsy143-I_DR repbase; DNA; ZEB; 4438 BP. XX AC chr3; XX DT 10-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy143-I_DR; KW Gypsy143-LTR_DR; Gypsy143_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4438 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1547-1547 (2008). XX DR Genome; chr3; Positions 1443548 1447985. XX CC 'TTTAAA' target site duplication CC LTRs are 97% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 123..4193 FT /product="Gypsy143-I_DR_1p" FT /translation="MAEGFRRPDPLVFDGNIAENWRVFEQEFDIFIAAAHS FT EKDPRTKAFILLNLAGSEAIERERTFVYAPAVYTGQGDDRRIHIPAESRED FT PECLKRKFRELCSPQTNITMERHKFNSRNQKPGETIEAYVSDLQNKAQSCR FT FGDLQEELIRDRLVCGILNDGMRKLLLRDNELTLAKAIETCQIHELTDQHT FT KTLAGPKHNLSNVDSVQQMIKRKSFRTPKPRLDKFASQQIHNCSNCGRNHE FT AKRERCPAYGQQCHRCNKWNHFQKCCKSTKVHSARNLRKTVNQIEVSKTSN FT SSEDSFFIDGVALNHQNVTNTDLKQLKKPAISCTVQINGKTTELKIDTGAS FT CNIMSSQIFAHVKQHENLQVSSHVKLVAYGGEEIQTAGSTVLPCHLNGQIY FT TLQFYVVQRDVQPLLGLPDCLRMGLISLNKAVHQVSLKETSFAQQITTEYA FT DLFQDEIGKLPVTYCMKLDKEAQPVVRPARRIPAAMQNKVKAELEKMVAMG FT VLSPVSEPTDWVSSMVATHKKNSDDIRLCIDPRDLNKFLKRPHHPMRTVEE FT VAAQMPNSTVFSVLDAKSSFWQISLDQKSSLLTTFSTPFGRFRFLRMPFGI FT NSASEVFQHAMEQIFAGFPCAIIVDDIIVGGHGEEEHDENLRRVLNRAREV FT NLRLNPQKCKFRLSEVSYVGHIFTNKGLRADPEKTKAITEMQPPDNITALQ FT RFLGMINYLGKFIPNLSELSAPLRELTCKNTEWCWYKQHQDAFDNLKQMIS FT SPPTLKYYDVQKPVTLTCDASQFGLGAACLQEGAPVAYASRTLTQTEVHYA FT QIEKELLAVVFACTKFNDYIYGKQIHIETDHQPLVTILNKPIYTAPARLQR FT MMLRLQKYNFTITYKKGKQMYLADTLSRSPRTVPNEPYDDSTDFEVMSVQH FT ISSFRLNELRTHTAQDPVLQRLCSILKSGWPYSQSKLPVEICEYFPFRDEL FT TIDEDIIMKGQKLVVPESLRSEYITIIHRGHPGLEATKRRARGIVFWPSLN FT KDIENQVLSCSICNSQRSHQQKEPLHLHYIPDLPWSTVATDIFEWNGQHYL FT VLVDSYSGWFEIDLLRDMSSSTVITKLKRHFSVHGSPHRVLSDNGTQFTSQ FT RFKEFAAAWDFTHITSSPEYPQANGLAERAVQSAKQLMEKSKRDGSDVFQN FT LLNIRNIPRDQTLGSPAERLMSRQTRTTLPVSKSALVPASKNNVSVKEQLS FT RKRQYQKQYYDKSSKALRPLLQGEVVRLATSKGHDRIGLVKKLLDEPRSYL FT VESEGREYRRNRKHILPVTEPSPQPFEQSVMSFPSFQTPSVESKHEHPIPE FT EIKEHCTSQETKKKTFIEHSTKVVPKTPYVTRVGRLCKPNPKYMD" XX SQ Sequence 4438 BP; 1470 A; 964 C; 928 G; 1076 T; 0 other; tggtgtcaga agtggtctgc acggaattct gatgacggcg gaaaaaaggc tcgtccatcg 60 tgcttcgaca cagtaagctg tctaagagcg tcgcagagaa gaaaaaaaaa aaaataataa 120 ccatggccga aggtttccgc agacctgatc cgctggtctt tgacggcaat atcgccgaaa 180 attggcgtgt ttttgaacag gaatttgata tattcatagc cgcggcacac tcagagaaag 240 accctcgcac aaaagcattc atactgctta acctggccgg ctcggaagcc atcgagcgcg 300 agcgaacatt tgtttatgca cccgcggtat atacgggtca gggtgatgac cgacgaatac 360 acataccggc agaatcacgc gaagacccgg agtgtctaaa acgcaaattt cgagaacttt 420 gcagtccaca aacaaatata acaatggaga gacacaaatt caattcaaga aatcaaaagc 480 ctggtgaaac tattgaggcg tacgtgagtg atttgcaaaa caaagcacag agttgcagat 540 ttggagattt gcaggaagag ttaataaggg atcgactcgt gtgcggaata cttaatgatg 600 gaatgagaaa actgctgttg cgtgacaacg agttaacatt agcaaaggct atagaaacat 660 gtcagataca tgaactgaca gatcagcaca ccaaaacgtt agctggacca aaacataacc 720 tgtcaaatgt ggatagtgtt caacaaatga taaaaagaaa gtcatttcgt actcccaagc 780 ccagattaga caagtttgct tcacagcaga ttcataattg cagcaattgt ggaagaaacc 840 acgaagctaa acgggagaga tgtcctgcgt atggccagca atgccacagg tgcaataaat 900 ggaatcactt ccagaaatgt tgcaaatcaa ctaaagttca cagtgcaaga aatctaagaa 960 aaactgtgaa tcaaattgaa gtttctaaaa cctctaacag tagtgaagac tccttcttta 1020 tagatggagt tgctctgaat catcaaaatg tgactaatac agatttaaag cagctgaaga 1080 aacctgcaat atcttgcaca gtccaaatca atggaaaaac aactgaattg aaaattgaca 1140 caggagcctc atgcaacatc atgtcctcac aaatatttgc acacgtaaag caacatgaaa 1200 atctgcaggt gtcgagccat gtaaagcttg ttgcttatgg tggagaggag attcaaacgg 1260 ctggatccac tgtgttacct tgtcatctaa atggtcagat ctacacttta cagttctacg 1320 tggtccaaag ggatgtacag ccattacttg ggctccctga ttgtcttcgg atgggcctca 1380 tctctctcaa taaagcagta caccaagtca gtctaaaaga aacaagtttc gctcaacaga 1440 tcaccacaga atatgcagat cttttccaag acgaaatagg gaaacttccg gtgacatact 1500 gcatgaaact tgacaaagaa gctcaacctg ttgttcgtcc agcacgcaga attcctgcag 1560 cgatgcaaaa caaggtgaaa gcagagttag aaaagatggt tgccatgggt gttctcagcc 1620 ctgtatcgga acctactgac tgggtatcat ctatggtggc cactcacaag aaaaactcag 1680 atgatatcag attgtgcatt gatcccagag acctaaataa gtttctgaaa cgtccacatc 1740 accccatgcg cacagtggaa gaagtggcgg cgcaaatgcc gaattccact gtattttcgg 1800 tgctagatgc taagagctcc ttttggcaga tatcactaga ccaaaagtcg tctctactta 1860 caacgttcag tacgccgttt ggtagattta gatttcttcg aatgcccttt ggtattaatt 1920 ctgccagcga agtttttcag catgccatgg aacaaatctt tgctggcttt ccatgtgcaa 1980 tcatagtgga tgacattata gtgggtggac atggcgaaga ggaacatgac gaaaatttaa 2040 ggagagtgct gaatcgtgcc cgtgaagtga acttgagact aaatcctcag aagtgtaaat 2100 ttcgtctaag tgaagtgagc tatgtgggcc acattttcac aaacaaggga ctacgagcag 2160 acccagagaa gacaaaagca atcacagaaa tgcagccacc agacaatatt actgcattac 2220 aacgtttcct gggcatgata aattacttag gaaagttcat accgaatctc agtgagctgt 2280 ctgcacccct gcgcgagctg acctgtaaaa acacagaatg gtgctggtat aaacaacatc 2340 aggatgcttt tgacaatctg aagcagatga tttccagtcc acctacctta aaatactatg 2400 acgttcaaaa acctgtcaca ctcacctgtg atgcatccca gttcggccta ggagctgcat 2460 gtcttcagga aggtgctcca gttgcatatg catcacggac attgactcaa actgaagtgc 2520 attacgcaca gattgaaaaa gaacttctgg cagtggtctt tgcgtgtaca aaattcaatg 2580 actacattta tggtaaacaa attcacatcg aaacagacca ccagccacta gtaaccatcc 2640 tcaacaagcc catctacaca gcgccggcca gactgcaacg tatgatgcta agattgcaga 2700 aatacaactt cacaataaca tacaaaaaag ggaagcagat gtaccttgcc gacacgctat 2760 cacgttcccc tagaacagtc ccaaatgaac cctatgatga ttcaactgac tttgaagtca 2820 tgtcagtcca gcatatttca tctttcaggc tgaatgagtt gcgcactcac actgcacagg 2880 acccagtact tcaacgcctt tgcagtattc tcaaatcagg atggccgtat agccagtcaa 2940 aactgcctgt agaaatttgt gagtattttc cattcaggga cgaactcaca attgatgaag 3000 acataatcat gaagggacag aagctagtgg tacctgaatc actacgtagt gaatatatca 3060 caattattca cagaggtcac cctggccttg aggcaactaa acgcagggca cgaggtattg 3120 ttttctggcc ttctctgaac aaagatattg aaaatcaggt gctgtcgtgc tccatctgta 3180 acagtcagag gtctcatcaa caaaaagagc ctcttcactt acactacatc ccagatctcc 3240 catggtcaac agtagctaca gatatctttg agtggaatgg tcaacactac cttgtattag 3300 tagactcata ttcgggatgg tttgaaattg atttgcttcg ggacatgtcc tccagcacag 3360 tgatcacaaa actgaaaagg cacttttctg ttcacgggag tccacacagg gtcctttcgg 3420 acaatggcac ccaattcaca agtcaacgat ttaaagaatt cgctgcagct tgggatttta 3480 cacacatcac cagtagccca gaatatccac aagctaatgg gctagctgag agggcagtgc 3540 aaagtgcaaa acaactaatg gagaaaagca agagagatgg atcggatgtg tttcaaaatt 3600 tgctcaacat cagaaacatt ccacgtgatc agactctggg ttctcccgca gaaagattaa 3660 tgtctaggca gactcgaaca actctaccag ttagcaaatc tgcccttgtg cctgcctcga 3720 aaaacaatgt ttctgtcaag gaacaactct cgaggaagag gcaatatcaa aagcaatact 3780 atgacaaatc tagtaaagca ctgcgaccac ttttgcaagg tgaagtagtg agactggcaa 3840 catccaaagg tcatgatcga attggacttg ttaaaaaact tttggatgaa ccaaggtcat 3900 atcttgtgga atcagaagga agagaatata gacgaaaccg aaaacacatt cttccagtca 3960 cagaaccatc accacaaccg tttgagcaaa gtgtgatgtc atttccctcg ttccagaccc 4020 cctctgtcga gtctaaacac gaacatccca ttccagagga aatcaaagaa cattgcacct 4080 cacaagaaac caagaaaaaa acatttattg aacactcaac aaaagttgtt cccaaaactc 4140 cttacgtgac cagagttggt cgtctttgca aaccaaaccc taaatacatg gactgagatg 4200 aaaagagaaa acactcaatc gctaagtaac aatactgaat gtataaaagt tgtttaatgc 4260 ttaaaatctg agctgttaaa agttgtatag tctatcttta tgctcagctt accttttaga 4320 gacagttctc aaagaaaaac agaaattttg atttaatttg ttatgcaata atcttttgtt 4380 aagaagaaac tgtcaaagtt aaagaaggag gatgtagaat gtgctctgtc gctttaaa 4438 // ID DNA8-5_DR repbase; DNA; ZEB; 2033 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE DNA8-5_DR is a non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; Nonautonomous; DNA8-5_DR; KW nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2033 RA Kapitonov V.V. and Jurka J.; RT "DNA8-5_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 255-255 (2004). XX DR [1] (Consensus) XX CC DNA8-5_DR is a nonautonomous DNA transposon. Its is CC characterized by 16-bp terminal inverted repeats and 8-bp CC target site duplications. It is expected to be a member of the CC hAT or P superfamilies. Numerous copies of DNA8-5_DR are inserted CC into poly(A) tracts. XX SQ Sequence 2033 BP; 740 A; 378 C; 390 G; 525 T; 0 other; taggggtgta acgatattaa aaccgaaccg aaatatcgcg atacaccaac cacgatacgt 60 atcgcgaaac tctcgtggcg taccgcggta ctctcacgcc ccctgctgcc cattgttgag 120 gttgacgtca cttgtctcta actaattgaa ccaatttaaa cacatcttta ttatcatatt 180 tgattaaatt gttttagtaa tgatgaggat ttgttctaaa tattatattc taaagttagt 240 attttagtgc atgttatgtc atgtgacttg agtaagcatc acactcgtta ctactggacg 300 caggatggct gcttcacata gtactgagat tgcagatccc ccagacacat ttaagtcatc 360 tgtgtggaaa cattttggtt ttccagttga gtacaggaat agaattcgtg tcgtagacaa 420 agcgcctgcc tgtctgtctc aactgtttca cgaaacttac ttgtaaggaa aaacatttaa 480 aaagtgtggt aatttcacaa tgtgagggtg taagcaacag aactgccaac aaaagtgatt 540 taatgtacaa aagtaaaacc agaataatgc ccaaatatac aaaaaaaaga gaaatattta 600 cactaataaa aataatataa taaagtaaca aattcaaaca aaagtatcaa acaaaaatta 660 gttaaactcg agcaaggggg tgatagtgaa gacaaacaaa agaaataaat ataataaaac 720 aattctaact catgaataac ccctcttacc tcgctaaaac tgatgaaaag aaaaaaaagg 780 tcttcagcgc cgtgcgccgc attcttccct atactgaata aaataaacaa agatgacctt 840 cacctcttac ctgatgtctt taacaataca ttttctacgt gtttgcaaaa tggaaatcgt 900 atgacatttt ccctatccac cttacactaa gactgaactt gaggagatac agctatattg 960 tcgggaagga gcggagctaa aaaataatat acaaatcaat tgaaataaaa atgtacaaac 1020 ctcacaaaat ttcttaaagt gggcaaaagt aatgcaaaat aattttaccc aaacgaaagt 1080 taagcaaaaa caacgaaaat cataatcatg tcagaaaaat acacgaaaaa cgagatcaaa 1140 ggaaaaagcg ctctctctcc cgctcttcct gacatgcttt ttacattccc gcgctgcgtc 1200 gccaaatgat gatagatcgg tctcctgacc aatcagctgt caaaaaggcg gacctacaga 1260 aatgacaggc atcagctgtc atgaaggcgg acctacataa atgaccggca tagagggagc 1320 gggagagaga gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga 1380 gagagagaga gagagagaga gagagagaga gagagagaga gagagagaga gatcgggagg 1440 caaacataca gccggctaca accctaacat tagtttcaga cacaaccgca atgtatttta 1500 ctggtgaata tgtcatgaaa aaacacttta caaacacatt atatgtatat ttttcattaa 1560 tttaaacgga aacctatcta tctaaatgaa accatataat caaaaatgag cgtctgtgtc 1620 tgcacagaga tcaagcctct catcggagaa tgtggatatt atggtgtttt tgaagaagat 1680 ctttcactag tctacactgc agttatttat ttaattttag tttacttagt taagatggct 1740 gcactaaaga gaatagtttt ttttctgact agtctatatt ggaggtattt atttaattct 1800 agttgtttac tcgggtatcc tgactgtact aaaaatgcga tgacaaagtt aataaagaaa 1860 aagttaagat gttcttgtta ttaagtgaat tgtcccttag cattgctgtt aacatgacat 1920 caaccctaac cccacagcaa acagaaaccg aaccgtaccg aaccgaaccg tggctccaaa 1980 accgtgaacc gaaccgaacc gaaccgtgcc ttttgtgtat cgttacaccc cta 2033 // ID HATN8_DR repbase; DNA; ZEB; 255 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN8_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000390 Dr000084; HATN8_DR; nonautonomous DNA transposon; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bao Z.; RT "Dr000390 and Dr000084, unclassified and uncharacterized RT repeats."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-255 RA Kapitonov V.V. and Jurka J.; RT "HATN8_DR, an ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 16-16 (2002). XX DR [2] (Consensus) XX CC ~5000 copies of HATN8_DR are expected to populate the CC zebrafish genome. The element was active long time ago, its CC copies are ~7% divergent from the consensus sequence. CC This element is characterized by 140-bp terminal inverted CC repeats and the 7-8-bp target site duplications. CC This element has ~60-bp subterminal inverted repeats CC (positions 35-92 and 153-210) and imperfect 32-bp TIRs CC (11 mismatches). XX SQ Sequence 255 BP; 68 A; 54 C; 53 G; 79 T; 1 other; caacacaatc tcacggcaat tcgtaacttt ttgatttagt ggctaattcg tatgaattcg 60 tacgatctaa ttcgtacaat ttagtacgat ttgctcatcm cccaatgacg gttggggtta 120 ggggtggggt tgggtgccac gcctcctttt taaaatcgta cattttcgta cgactgaact 180 cgtacgaatt cgtacgaatt agccactaaa ctgacaaaac gtaaaatact tacgtttcct 240 cgtgagatca ggctg 255 // ID TDR16 repbase; DNA; ZEB; 313 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Non-autonomous DNA transposon from Zebrafish - a consensus. XX KW DNA transposon; Transposable Element; TDR16; Tol2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-313 RA Jurka J. and Kapitonov V.V.; RT "Non-autonomous DNA transposon from Zebrafish - a consensus."; RL Repbase Reports 2(2), 26-26 (2002). XX DR [1] (Consensus) XX CC 8bp TSD. Present in several hundred copies in the genome. CC Similar to Medaka (Oryzias latipes) DNA transposon Tol2. XX SQ Sequence 313 BP; 113 A; 36 C; 45 G; 118 T; 1 other; ggtcacactt tacaataagg tttattagtt aatgttaatt aatgcattta ctaacatgaa 60 caaacaatga acaatacatt tactacagta tttattcatg ttagttaaca ttagttaatg 120 aaaatacagt agttcattgt tagttcatgt taactcatgg tgcattaact aatgttaaca 180 agcatgaant tggatgttaa taatgcatta gtaaatgttg aactatgatt aataaatgct 240 gtacatgtat tgttcattat tagttcatgt tagtaaatgc attaactaat gaaccttatt 300 gtaaagtgtt acc 313 // ID Kolobok-N3_DR repbase; DNA; ZEB; 370 BP. XX AC . XX DT 13-OCT-2008 (Rel. 13.1, Created) DT 13-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous Kolobok transposons from zebrafish- a consensus. XX KW Kolobok; DNA transposon; Transposable Element; Nonautonomous; KW TTAA TSD; Kolobok-N3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-370 RA Bao W. and Jurka J.; RT "Kolobok-type transposons from zebrafish."; RL Repbase Reports 8(10), 1347-1347 (2008). XX DR [1] (Consensus) XX CC Consensus was built from copies which are ~13% divergent from it. CC The TSD is TTAA, and the TIR is 12-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 370 BP; 116 A; 66 C; 78 G; 110 T; 0 other; agtctgcgtg aaccggaagt tgctgagact tttatttcag tatgttgatg tacttccaac 60 tgaaacagaa tattgagtag ggggcggggc tttctttttg cactgtgcat cattccctca 120 tagcaaacta atggtaagag gggcgtggtt aagaatattg tggctgaagc cgtcaaactg 180 acatcaacag agaaggaccg ccactccaaa cgcagaagca aatggtcaga ctttgattga 240 agattaccaa aacaaacatt tgtttttttt ttttttcagt ggattaactt gcacagatta 300 attgttcacc taaagaataa caatgtgcgc tagcaaaata aacattgtaa attttgattt 360 cacacagact 370 // ID Gypsy64-LTR_DR repbase; DNA; ZEB; 835 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy64-I_DR; KW Gypsy64-LTR_DR; Gypsy64_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-835 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1444-1444 (2008). XX DR Genome; chr14; Positions 47821875 47822709. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 835 BP; 143 A; 217 C; 141 G; 334 T; 0 other; tgtcaggtcc tctgtctgat cacctgggtt cgtttgtgtt ttgcacttgt ttacatgtat 60 ttcagcacgt gtggtgtttt gatcgctctc tcattacata cttgctttga cacatctcct 120 cccatcttcc cactcatccc tattattcaa ttaaaccaat ttcactcacc tgtcttcccg 180 taatttcccc tctatttaaa gcgctctcat gttgtcattt gtcagatcgt tgtcgtttgg 240 ttgtggtcag cgtgcactgt ttcactagtc cagtccagtc agttcctgtt gtctcgagcc 300 cgtcttctgg ttggtttttc cttttatttt atttcttcat tttttgaccc ctctctgtca 360 tctgcaggac ctgtttctgt tttgtttatc cgccgtggtt ggccgattgt ctccgatccc 420 cggcttcaat attccctccc gtcacctgct ccactcttct tctccctctc ggctctgccg 480 cccgccggat ccagtatacc gctgtatcgc cccctgccga gtgaacttga aattacaccc 540 tgaaagacat tttgtccctt ctacggagcg gtttctgttg acagtttttt tagctctttc 600 ttctgaacaa ccttgaagtt tgttttagtt tcattttgtt aattttattt gcctgagtcc 660 ctttaaagat cccagtaata attatatcct cttcggagca gtttttgttg aactttttat 720 ttttataaat ttttgattag aatttttctc cctgtgctta agcccaatag ggtcattttt 780 gtcactaata aattctgaaa gagacctgca cttgtgtccg ttctgtttcc tgaca 835 // ID L1-11_DR repbase; DNA; ZEB; 7324 BP. XX AC . XX DT 30-OCT-2008 (Rel. 13.1, Created) DT 30-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE L1-11_DR is a non-LTR retrotransposon from the L1 clad. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-7324 RA Bao W. and Jurka J.; RT "L1 non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1351-1351 (2008). XX DR [1] (Consensus) XX CC This consensus represents a complete L1 element. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1362..5096 FT /product="L1-11_DR_2p" FT /translation="MSXIXFLSWNVRGLGSQAKRIKVFNHLSKLQADICLL FT QETHLSEXDQNKIKSSQYHSYSAHYNTKQRGVCILINKKISFVHNTTITDP FT EGRFVIINITINNXSLTIGNLYGPNTDDPSFFQXFFSXISNLSNCPXIIAG FT DFNTVLDPTTDRXNNIXXKXIWKSTDXIKQFMSDMGLGDSWRLQHPDIKAF FT SFYSPVHHSYSRIDFFLTSNSILSNISESKIHPIIISDHAPVTLKLNITTR FT HKPISRWRFNTSLLQDQDFXSYFKREWACFLEINDSPETSPSLLWETGKAV FT LRGKIISYSVYKKKKEKEHQAELEQKIKDLEDININNPTEETQDLLRKYKS FT QLNKLINTHTQFLIHRLRQEHFHHSNKSGKYLANQIKHNKEKTTISVIKDS FT AGKPTNSPEEINQIFQKFYSNLYSPEKNPSQDEIHSFLSNINLPQLNEQQI FT NXLESPLSQQEISNALKLMPNNKSPGPDGFPAEFYKHFWSILSPLFIRLIN FT ESKQNSKLPDTMNTATIALLLKPNKDPTLPSSYRPISLINVDTKIIAKALS FT HRIEKIMPSIIHPDQTGFIKGRQSSNNTRRLINLIHHSSINKNKSIIVTLD FT AEKAFDKVNWKFLFSTLERFGFGESFINWIKILYTSPSATVITNGLTSRSF FT TLHRGTRQGCPLSPSLFTIFIEPLAAAIRQNXYIKGIQTLNIHHKISLYAD FT DILLYLQNPQTSLQETIKLIDTFSNISEYSINWNKSAILPLNSTSVDVTSQ FT TSQIPLCTNYITYLGINVSARLSELFALNYTPLLKKIDDDLQRWMNLPLSI FT MGRISVTKMSILPKINYLFSMIPTQPTHTWFKSLDSIITQFYWKNKTPRIK FT LTTLQKPKTQGGLEAPHFYNYFLANQLQNIHKWIKPNSSEYTWQDIEQSIC FT KDINISELPFHSQSIKRHHCFKMPTIEATLTAWWKFHQVTNSPLTPSKYTP FT IWNNLDFLANKKPLNFRTWAAKGITQLQHIFINNNLAPFPHIAQTYSIGSN FT CFLEYLQIKSSIKSKLPNQSINLDLPPQISEFINISPSKKLLSQIYKMISK FT SDNALVLPTTKWQTDLSINPNAAFWTQICKNTFSMTKNANLQLIQYKVLHR FT SHLTGQKLFKMGFTSEICSHCTQNTPDTYLHALWHCTSIKSFWXKVTDSLS FT NLMGCHIPLCPSLCILGDISTININNTNGQFLLVALTIAKKTILMNWKSRN FT TIHISYWKNLLTEYISIEDPLCLKPFPNNTFIST*" FT CDS join(330..1043,1047..1361) FT /product="L1-11_DR_1p" FT /translation="MNSTPSSPSCCTEVSDILRSIENKLSGLDTRIALIEV FT IHKEFQELRHSLEFSQEQIDTLTKENNFLKDSVHALSTQLTSVVKENKSMK FT ENILDLQTRSMRDNLVFTGIPDQTPDDPEKLIKDFMTKQLKLPAETVQTIT FT FHRVHRIRSQQNSNRPRPIIAKFEHYKHKELVQKQGRQLKGTTYGLNEQYP FT KDILDRRKQLFPIRKQKINEGKRAIITVDKLFIDGQLFRDKDITPWLFTHL FT SHNXTQSLYTSSLSYXFVIAPRSHPSPTLKLYVGIXSHFCLVIXXLCTFCQ FT LHVIQTQTQVQXXXFSNTYSYXCICSXXXLVXXXTLLLFTYLFCNXLXFY* FT " XX SQ Sequence 7324 BP; 2503 A; 1476 C; 965 G; 2324 T; 56 other; gatgaccgag taacgcgaga ttctgcgaga cctggtgctt gacccaaaat ggcggcgtcc 60 gtctgctaga caaacaaaca aaaaactttg gtgttatttc catttttctc ctcacttttc 120 atcatcgtat ctcgcttatc aaaaattgga ctatatcact gcatctggac tattttcttt 180 tttcattttt ttttcatcct ggatcaactg gaagactaca tcgacgcact ctacaacaca 240 cactgctgca atatgaacac gccagatcac acagaaaagg taacctcact ccaatctcca 300 aaagaccccg ttctaattca ccgttaacta tgaactcaac gccatcctca ccctcctgtt 360 gcaccgaggt aagcgatatt ctacgctcaa ttgaaaataa actttccgga cttgatacaa 420 gaatcgcatt aatagaagtc atccacaaag aattccaaga actccgtcac agcttagaat 480 tcagtcaaga acagattgac acactcacta aggaaaacaa ctttttaaag gactcagttc 540 acgcactttc cactcagctc acctccgttg ttaaagaaaa caaatcgatg aaagaaaaca 600 ttttggactt acaaacacgc agtatgagag acaatctagt attcacaggt atcccggatc 660 aaactccaga cgaccccgaa aaattaatca aggatttcat gacaaaacaa ctcaaactac 720 cagctgaaac tgtacaaacc atcacctttc accgtgtcca ccgcatccgt tcgcaacaaa 780 acagcaaccg tccccgtcca atcatcgcaa aattcgaaca ctacaaacac aaagaactgg 840 ttcaaaaaca aggcagacaa cttaaaggta caacttacgg actcaacgaa caatatccaa 900 aggacattct cgaccgtcga aaacaactct ttccaatcag aaaacaaaaa ataaatgaag 960 gaaaacgggc aatcattaca gtggacaaat tattcataga tggacaacta tttcgcgata 1020 aggacattac cccatggctt ttctaaactc atctctcaca taatawtact caatctctct 1080 atacatcttc cttaagttat rgttttgtta ttgcacctag gtctcatccc tctcccactt 1140 taaaacttta tgttggtatt tamagtcatt tytgtttggt cattrttktr ttgtgtacat 1200 tttgtcaact tcatgtcata caaacacaga cacaggtaca atwtcmcayt ttttcaaata 1260 catacagtta ctwatgtatt tgcagtamwa mcasacttgt twatcawkat acattrctrt 1320 tattyacyta yttattttgt aatyytttaa wattttatta aatgtcayca atcamatttc 1380 tatcctggaa cgtccgtgga ctcggctccc aggcwaagag gatcaaagtt tttaatcatt 1440 taagtaaayt acaagcagay atatgcctty tacaagaaac tcacttrtca gaayytgatc 1500 aaaacaaaat yaaatcatca caatatcatt catattctgc ccactacaac acaaaacaga 1560 gaggagtstg tatcytaata aataaaaaaa tatcatttgt tcataacacc acmattacag 1620 acccagaggg acgttttgtc ataattaaca ttacaattaa caatwactca ctyacaatag 1680 gaaacttata tggcccaaat acagatgacc catctttttt tcagawttty ttttcckcca 1740 tttcaaatct ttctaattgc cccrtcataa tagcaggtga tttcaacaca gttttagacc 1800 caactacaga cagatytaat aatatcrgca awaaacrcat ctggaaatct acagacwcta 1860 taaaacagtt catgagtgac atgggtcttg gcgatagttg gcgcctccaa catccggaca 1920 ttaaagcatt ctcattttat tcacctgttc atcattcgta ctcccgcatt gacttttttc 1980 tcaccagtaa ttctatttta tccaatattt ctgaatcaaa gatccaccca ataatcatta 2040 gtgatcacgc tcctgtaaca ttaaaactga atataaccac cagacataag cccatttcta 2100 gatggcgatt caacacctct cttcttcaag aycaggattt cracagttat tttaaaagag 2160 agtgggcatg ctttctagag ataaatgact cccctgaaac atccccatct cttctgtggg 2220 aaacaggaaa ggcagtatta agaggaaaaa taatttcata ctctgtttac aaaaaaaaga 2280 aagagaaaga acatcaagct gaactggaac agaaaattaa agatctagaa gacattaaca 2340 taaataatcc aacagaagaa acacaagatt tgctaagaaa atataaatca caattaaata 2400 aattaattaa tacacacact caattcctta ttcacagatt acgacaagaa cacttccatc 2460 atagtaacaa atctggtaaa tatctggcta atcaaatcaa acacaataaa gaaaaaacaa 2520 caatatcagt cattaaggac tcagcaggga aacccaccaa ctcaccagaa gaaataaatc 2580 agatctttca aaaattttac agtaatttat attcccctga aaaaaaccca tcccaggacg 2640 aaattcactc atttcttagc aatatcaact tacctcaact aaatgaacag caaataaatt 2700 stcttgaatc ccctttatca caacaagaaa tttctaatgc acttaaactc atgcctaata 2760 ataaatcacc aggacccgat ggtttccctg ctgagttcta caaacacttt tggtcaattt 2820 tatcaccact tttcatccgg ttgattaacg aatcaaaaca aaattcaaag cttccagata 2880 ccatgaacac agccacaatt gcactcctcc tcaaacccaa taaagaccca acattaccgt 2940 caagctatcg tccaatctca ctaattaatg tagacactaa aataattgct aaagcacttt 3000 cacatagaat agaaaaaatc atgccatcca taatccatcc agatcaaacc ggttttatta 3060 aaggtagaca atcatcaaac aatacacgca gactcataaa cctaatacac cactcatcaa 3120 taaacaaaaa caaatctatc atagtcaccc tcgacgctga aaaggctttc gataaagtaa 3180 actggaaatt cctgttttcc acattagaga ggtttggttt tggggagtca tttattaatt 3240 ggatcaaaat tctctacaca tcaccctcag ctactgtcat tactaacgga ctaacatcac 3300 gcagcttcac tctacaccgg ggaactagac aaggatgtcc actctctccc tctttattca 3360 ccatattcat tgaaccactt gcagcagcta tccgtcaaaa cawttatatt aaaggaattc 3420 aaacattaaa catacaccat aaaataagtc tttatgckga tgacatatta ttatacttac 3480 aaaatcccca aacatcatta caggaaacaa taaaacttat tgataccttt tcaaatattt 3540 ctgaatattc aatcaattgg aacaaatctg ctatactccc attaaattcc accagtgtgg 3600 atgtgacatc ccaaacatca caaatccctc tgtgcactaa ctatatcaca tatttgggta 3660 tcaatgtttc cgccaggctg tcagagttgt ttgcacttaa ctatactccg ttacttaaaa 3720 aaatagatga cgatctgcaa cgttggatga acttaccatt atccatcatg ggcagaatat 3780 cagtaaccaa aatgtccata ctcccaaaaa taaactattt attttcaatg attccaacac 3840 aacccacaca tacctggttc aaatctctag actcaattat aacacaattc tactggaaaa 3900 acaagacccc aagaattaaa ttgacaaccc tacagaaacc aaaaacacaa ggaggattag 3960 aagcaccaca tttttacaac tattttttgg caaatcagct tcaaaacata cacaaatgga 4020 ttaaacctaa ctcatcagaa tacacatggc aagacattga acaatcaata tgcaaagaca 4080 ttaacatttc agaattaccc tttcacagtc aatcaatcaa gagacatcac tgttttaaaa 4140 tgccgacaat agaagcaact ctgacagcct ggtggaaatt ccatcaggtc acaaattccc 4200 cactcacacc atctaaatac accccaattt ggaataacct tgatttccta gccaacaaga 4260 agccacttaa ctttcgcaca tgggcagcca aaggaatcac acaacttcaa catatcttta 4320 ttaataataa tctggcacca tttccccata tagcccagac atacagcatt gggagtaatt 4380 gctttttgga atatttacaa atcaaatcat caattaagtc aaaattacct aaccaatcaa 4440 tcaatctaga ccttccacct caaatctcag agtttattaa tatatctccc tccaaaaaat 4500 tgctctccca aatatacaaa atgatatcaa aatcagacaa tgcattagtt ctaccaacca 4560 ctaaatggca aacagactta tctattaatc ccaatgctgc attctggact caaatttgca 4620 aaaacacatt ttccatgact aaaaatgcta acttacaact tatacagtac aaagtacttc 4680 acagatcaca tttaactggg cagaaattat ttaaaatggg tttcacgtca gaaatatgtt 4740 cacattgcac acaaaacact ccagacacat atcttcatgc cttatggcat tgcacttcaa 4800 ttaaatcatt ttgggwaaag gttactgact cactctccaa cttaatgggt tgtcacatcc 4860 cactgtgtcc ctccctctgc atattaggtg acatatccac aatcaatata aataatacaa 4920 atggtcagtt tctgctggtg gctctaacta tcgcaaagaa aactatcctc atgaactgga 4980 aatcaagaaa taccattcac atctcatact ggaaaaactt attaacagaa tacatctcca 5040 ttgaagatcc tctttgcctg aagccattcc ctaacaacac cttcatcagt acataaacaa 5100 gaggggaggg gagagagggg gtcaggtaga gatgaggaag aaaaaaaaaa aaatctaata 5160 ttaaatctaa tttaaaataa ataaatcaat tggaaaaaaa agtataatgt aaacaattaa 5220 aataaaagaa tcataattcg gtctttatat acttctttac tttccctttt gttctatttg 5280 ttcccctttc ctgtctctat caatgttatt tattcatttt attattatta ttatttataa 5340 tatttgtttt ctatttatta ttattattat tattattata cataaattat tattattatt 5400 attattatta ttattattat ttatatacca gtacttctcc tgaccagtcc aacaacagat 5460 aaaaaaaaaa aaaaaaaaag aaaaagtatg aaagggaaaa aaaagggaaa gaaagaaaga 5520 aagagaagag gaggaaaaaa aagagagagg gaaaaaaagg aaagaaaaaa aatctactta 5580 catgtactaa attcacaggg taaagtgtaa cgtaggcatg gcagtctggg cactgcactg 5640 gactgatcca tgccacacgt tacacttttt ttaatgcatc acaaagcaac tgacaaacac 5700 tatatacctc aatgcccatc acacaactac aacctcctca atttattgtt acagctgttg 5760 ttatttttat tattatcatt attattagta ttattattat tattaccagc atgagtatat 5820 tatagttatc aagttacagt aataatctat catacatatc tttaatatta ttcttccatg 5880 tgggtttgta taattcataa ttagcattga tatagtgttt tcttttttat ttattcagtt 5940 tttattatta ttattctttt tttttttttt ttttgcttat tctcattgct ttgtttgctg 6000 aactttattt ttcttctctc ttattgtttc ttttcttttt cttctactcc tccccttcct 6060 tttctgtttt ttttttattt tttattttat tttatttatt tatttttatt attagtattt 6120 tattaatttt tgtttattat cattagttta ttttctcttt tcctctatct ccccttctac 6180 ttactactta ctacttacta ctattattat ttattattat tattattatt attatttatt 6240 attattatta cttgtcctct ccctgatttt cccatcctcc cctctcctcg acacctttgt 6300 cataaatggt attactttta ttttttgaaa tcattattat tattattatt actgttctga 6360 cttgtcctgt tatctttcca ttatcattat tacaagtatt tataccatta tcattattaa 6420 ggttgcagtt gttgtagttg taacggtagt agttgtagta gtagtagtag tagtagtagt 6480 agtagttgta gttgaagtag tagacagtgt aatagtatcg attgttattg agtaattgtt 6540 acggagattt ttgaagtagt agttcatgta gtagtagatg tcattggtat agtagtagca 6600 gaagtagtaa tggtgtggta gttacagtag tagtagtagt cgttgttggt gtagtaatat 6660 cagtagtaga tgttgttgat gtagatgtag tagaagtgat tgtagtagta gtagtagtag 6720 tggtaatagt agtagtagta gtagtagtag tagcagtaga cgttgttgta ttagttgtag 6780 caaacgctat agtagtatca gtagtggtag tcgttgtagt agtatcagta ttagtagaca 6840 ttgtagtagt agtagtagtc gtggttgtgg ctgtagtaga agcagtagta gttgtaatat 6900 aagtaacttt ctgtagtagt agttggtgta gcacttgtga tttattatta ttgttgtcag 6960 ttattactgt tgtcctttct ttctttttac tctcattttt actatcatac attaataagc 7020 attgttgtta ctccctacct ctgactggtt tctttctata tgttttatct atatctgtga 7080 cacttgcttc atttattgac tgttattgtt ccttatgtat gtactctgac ctgtccacca 7140 agttaccttt acatgcatac acacactcac acacacacac tcacgcacat accagtacat 7200 gactatattg ttgttgtttt tgttttgttt ttgtttcgtt gctttgtatt tgttatttgt 7260 tgacgttttc ttgtatttgt atgatttttg catattctaa taaaaaaaaa aaaaaaaaaa 7320 aaaa 7324 // ID ERV5_DR_LTR repbase; DNA; ZEB; 435 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from Danio DE rerio. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW ERV5_DR_LTR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-435 RA Smit A.F.; RT "ERV5_DR_LTR - ERV1 Endogenous Retrovirus from Danio rerio."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC Near perfect LTRs of ERV5_DR at danRer4 chr9:19054473<-19065147. XX SQ Sequence 435 BP; 143 A; 57 C; 118 G; 117 T; 0 other; tgaggtggat tttgtgctct ttggagtgtg aagttaatct atatattcag gttatatgtg 60 tataatctgc ttgtagactt gcataccatt aacacaagtg gacatgggaa atggcttgag 120 gaaacaacta ggggtcataa aagaagggat gaaaagagag tggtaaaaga gcctgaggtg 180 tctcaggaaa cagatgtcag aggtggaatt tggggttagg ggagggatct tgtgtaatgt 240 atggagaaca aaggaaaata attgttgtcc aagtgtattt aaacaggcca ggagattagg 300 gaggcagaat tttcaggcag agatgctact gggggtctcc tgaaaaatct cctttgttgt 360 cgacaataaa gaactcttct gaaatcaaaa ccatttgact ttgctgatca ttgaagaaaa 420 gagaaaacca cctca 435 // ID BEL22-I_DR repbase; DNA; ZEB; 5948 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL22-I_DR; KW BEL22-LTR_DR; BEL22_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5948 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1391-1391 (2008). XX DR Genome; chr19; Positions 33137093 33131146. XX CC Positions [4950-5531] - Integrase core CC 'ATAGT' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 95..4450 FT /product="BEL22-I_DR_1p" FT /translation="MTTETSSLKTRSCASCSSARSNTSSTGSAAAKARARA FT EAAKARLTFAEEEVNLKLQKAKVEASMEMLQQKKEVAAAVAEAEALEAAID FT EHSEKHSCKLSLNSVPLETTQRTEQYVIDQTKNLDKDLQLCDVPAKIEPST FT SYSISGSQLKPEAKPFLLRHNSVSFQLPDTTLKHSATSPANKYPTPSQQFG FT SQNSINTSNPLNSEYPMFQNSNDCSSNMNDFVRYLARRELVSTGLLQFNDK FT PQNYRAWKRSFQNATFDLNLTPSEEMDLLLKWLGKESAEQVEQIRAIHINH FT PQAGLAMAWDRLDQTYGSAEIIEHALFKRIDAFPKIANRDYSKLSKLSDLL FT MELQSAKAEGDLPGLVLLDTARGIYPIIQKLPFRLQEKWASVGAAYKRHNQ FT VPYPPFAYFVDFVSQEADIGNDPSFSFVSYIDTAPKVEKAAWKSSKQHEIF FT VHKTAVSYGDCADTGGLYKQSDDCDKLCLIHKRPHPLNKCRVFLEKSTDAR FT KAFLREHNVCFKCCSSTTHIAKNCKTKVQCSECKSEGHNTVLHPGHASWKE FT SNPATKHGGEGEQPLSQSQVDSKCTEVCGENQTDRSCSKVCLVKVFPAGQK FT DKAVKVYAILDEQSNRSLVRSQFFEVFSDQSPCAPYILRTCAGVKESAGRR FT ASGYEVESLDGTVCISLPSLIECNDIPNNRDEIPTPDVARSHTHLKSMAHF FT IPELDPKAPIMLLLGRDVIRVHKVRKQVNGPHNLPYAQKLDLGWVIVGNVC FT LGNVHKPLTISTFHTNTIEPERPTLFKPCPNVFHVKETCKDFQIANHCPTY FT PEAKFTCDVDCLGDNVFQRTKEDNQVAPSIQDATFMKIMDEGLRKDSNNSW FT VAPLPFKSPRPRLPNNRIQALKRLISLKHNFERKPIMKEHFLKFMEKIFKN FT GHAELAPPLDEKEEQWYLPTFGVYHPKKVNQIRVVFDSSAQHNAISLNDVL FT LTGPDLNNTLVGVMMRFRKEAIAFTADIEQMFYCFLVREEDRDFLRFLWFQ FT DNDTSKDIVDYRMRVHVFGNKPSPAVAIHCLHKSVQIDNFHVDSEVKQFVM FT RDFYVDDGLKSLPTVEATISLLKKTQEVLSKSNLRLHKIAANSKKVMEAFP FT STDHACDLKDLDFEEDTLPMQRSLGLNWDLHSDCFRFSVSDEVKPYTRRGV FT LSTINRLYDPLGFVAPVTIQGKAILRELTAVSGDWDAPLPQKMEDAWTLWR FT VSLSELANFSIPRVYTEISTSTAMRRELCVFADASIKAIAAVSYLKVTDDE FT GKNHIGFVMGKAKLAPRPEHTVPRLELCAAVLAVELADLISVNLDLQFDAV FT TYYTDSKVVLGYICNETRRFYVYVSNRVQRIRRSSSPDQWRYVPTEQNPAD FT HATRFVSAAHLQKTNWLSGPNFLLTQGPDIQNTFNLVNPSSDPDVRPLVST FT LTTTALTKQLGSERFAKFSSWEFTNWCHYSPYPHSSSF" FT CDS 4512..5903 FT /product="BEL22-I_DR_2p" FT /translation="MEESDRAINIILRAVQEEAYSHEIECIKRQERIPKSS FT PLHTLDPFIDQQGLLRVGGHLHLSSLNQREKTPLLIPGKSHIATLIIRHHH FT ERVQHQGRHFTEGAVRAAGFWIIGGKRRVSSIIHQCVTCRRLRAPLSIQKM FT ANLPAERLSADPPFTNVGLDVFGPWNVSSRRTRGGLLYSKRWAVIFTCMSI FT RAVHIEVIESLDTSSFINALRRFLAVRGPVKHIRSDRGTNFVGACKELQIP FT SNIDSKVVKTYLSDQGCTWTFNPPHASHCGGSWERMIGLARRILDSMFLQL FT KDKLTHEVLVTFMAEVTAIINARPLVPVTTDPHESFILTPAALLTQKVNPV FT AAPTGEFGVTDLYKCQWRQVQHLSNTFWDRWRKQFLPTLQPRRKWQSVHPN FT VNTGSVVLLKNSQVPRNEWPLGLVTQSFPSKDGKVRQVEVKVIKPGGSTLF FT LRPVTEVVLLLSSESKVDE" XX SQ Sequence 5948 BP; 1823 A; 1324 C; 1274 G; 1527 T; 0 other; gtaaaaccac ggccttacaa ctgcaagtgg caaacaaagg caaataatcg tcacacccag 60 ctagtaatca attaacagat gtgttgctgg agccatgaca acagaaacat cgtcattaaa 120 gacgcgctca tgcgcttcat gttcttcagc acgatccaat acatcgtcaa ctggatcagc 180 agctgctaag gcaagagcaa gagcagaggc agcaaaagct cgacttactt tcgctgaaga 240 agaggttaat ttaaagctac aaaaggccaa agttgaggca tctatggaaa tgctgcagca 300 gaaaaaagag gtagctgcag ctgttgcaga agcagaggct cttgaagctg ccattgatga 360 gcattctgaa aaacacagtt gcaagctaag tttgaactct gttcccttgg aaaccacaca 420 gcgaacagaa caatatgtta tcgaccaaac caaaaatcta gacaaggatc tacaactgtg 480 tgatgttcct gcgaaaatag aaccaagtac aagctacagt atttcaggtt cacagctaaa 540 acctgaagcc aagcctttcc ttttacgaca caacagtgtc tcttttcaac tacctgatac 600 caccttaaaa cactctgcaa catcacctgc aaacaaatat ccgactcctt cgcagcaatt 660 cggatcacaa aatagcatca atacgtctaa tcctctgaac tctgaatacc ccatgtttca 720 aaattcaaat gattgcagtt caaacatgaa cgactttgta agatatctgg ctcgtcgaga 780 gcttgtttca acaggcttac tacagtttaa cgacaaaccc cagaattaca gagcctggaa 840 acgatctttt caaaatgcaa cttttgattt aaatctgact ccaagtgaag aaatggatct 900 cctgttgaag tggttaggca aagagtcagc cgagcaagtg gaacagataa gagcaataca 960 tattaatcac ccacaagctg ggctggcgat ggcatgggac agacttgatc aaacatatgg 1020 ctcagcagaa ataatagaac atgctctatt caaacgaatc gatgcctttc ctaaaatagc 1080 taatcgagat tattcaaaat tgtcaaaatt gagtgatctg ctaatggagt tgcagtctgc 1140 taaggctgaa ggagacttac ctggtcttgt tctcttagac actgcaagag gcatctatcc 1200 aataatacag aagttaccat ttcgtctgca agaaaagtgg gcgtcagttg gtgcagccta 1260 taagcgacac aatcaagttc catatcctcc atttgcttac tttgtagact ttgttagcca 1320 ggaggcagat attggaaatg atccgagctt cagctttgtc tcttacatag acacagctcc 1380 taaagtagag aaggcagctt ggaagagcag taaacaacac gaaatctttg tacacaagac 1440 agcagtatct tacggagact gtgctgatac tggcgggctt tataagcaat ctgatgactg 1500 tgacaaactg tgtctcatcc acaaaagacc acaccctctt aacaaatgcc gtgtcttttt 1560 agagaagtct actgatgctc ggaaggcatt tttgagggag cataatgtat gctttaaatg 1620 ctgctcttca acaacgcata tcgcaaagaa ctgtaaaact aaagttcaat gctctgaatg 1680 caaaagtgaa ggacacaaca cggtactcca ccctggacat gcatcctgga aagagtcaaa 1740 ccctgccaca aagcatggcg gggagggaga gcaaccctta tctcaatctc aagtcgacag 1800 caaatgcaca gaagtctgtg gggaaaatca aacagaccga tcgtgctcaa aagtatgcct 1860 tgtaaaggta tttccagctg gccaaaaaga caaagcagta aaagtatatg caatcttgga 1920 tgaacagagt aacaggtctc tggttcgttc acagtttttt gaagttttca gtgaccaaag 1980 tccttgtgct ccttacatat tgagaacatg cgcaggagtg aaggagtcag caggaagacg 2040 ggccagtggc tatgaagtag agtctctaga tggaactgtt tgcatttcac tgccaagctt 2100 gatagaatgc aatgatattc ctaataatag agatgagatc ccaacccctg atgttgctcg 2160 cagtcacaca cacctaaagt ctatggcgca cttcattcca gaactagacc caaaagcccc 2220 aatcatgctt ctcttaggtc gagatgttat acgggtccat aaagtccgta aacaagtgaa 2280 tggcccacac aatttgcctt atgcacagaa gttggatctg ggatgggtta tagtaggtaa 2340 tgtatgttta gggaatgtcc acaaacctct gacaatcagc acattccata caaacactat 2400 agaaccagaa cgtccaactc tctttaaacc atgtccaaat gttttccatg ttaaggaaac 2460 atgcaaagac tttcaaattg ctaaccactg cccaacatac ccggaagcta aattcacctg 2520 tgatgttgac tgtctaggag acaatgtgtt tcagagaacc aaagaggaca atcaagtggc 2580 tccttcaatt caggatgcaa ctttcatgaa aatcatggat gaaggtttac ggaaggattc 2640 aaataacagt tgggtagctc cattaccttt taagagtcct cgtcctcgtc ttcccaacaa 2700 cagaatacaa gccctaaaac gacttatatc actcaagcac aactttgaaa ggaagccgat 2760 aatgaaagaa cactttctca agttcatgga aaagatattc aaaaatggac atgccgagtt 2820 agcgcctcct cttgatgaga aagaagaaca atggtacctg ccaacatttg gagtgtacca 2880 tccaaagaaa gtgaatcaaa tcagagtagt ttttgattcg agtgcacaac acaatgcaat 2940 atctctgaat gatgtgctgt tgaccgggcc tgacttaaac aacaccctgg taggggtaat 3000 gatgcgcttc agaaaagaag caattgcctt tactgcagac atagaacaga tgttctattg 3060 ttttttggta agagaggaag atagagattt tcttcgtttt ctttggtttc aagacaatga 3120 cacttccaaa gacattgtag attaccgcat gagagtgcat gtctttggaa acaaaccctc 3180 acctgcagtg gctatacatt gtctgcataa gtctgttcag attgacaatt tccatgtcga 3240 ctctgaggtc aagcaatttg tgatgcgtga cttctacgtg gatgatggtc tcaagtcttt 3300 gcctacagta gaagctacga tcagtttgtt aaaaaagaca caagaggttt tgtctaaatc 3360 caatctgaga ctacataaga tagcagcaaa cagcaaaaaa gtcatggaag cctttccatc 3420 gacagaccat gcatgtgatc tgaaagatct agactttgag gaagacacgc tgccaatgca 3480 gcgtagtctt ggtctaaatt gggacctcca ttctgattgc ttccgattta gtgtcagtga 3540 tgaggtaaaa ccctacaccc gtcgaggtgt cttatccact attaacagac tctatgatcc 3600 ccttgggttt gtagcgccgg tcacaattca aggcaaagct attctgagag aacttactgc 3660 tgtgagcggt gactgggatg ctccattgcc acaaaagatg gaggacgctt ggaccttgtg 3720 gagagtttcc ttgtcagagc tggctaattt ttctattccc agggtttata ctgagatttc 3780 gacatcgaca gctatgagaa gagaactatg tgtttttgcg gatgcatcaa taaaagccat 3840 tgctgcagtg tcatatctaa aagtcacaga tgatgaagga aagaatcaca ttgggtttgt 3900 aatggggaaa gccaaacttg ccccccgccc tgagcacaca gtgccaagac ttgaactttg 3960 tgcggcagtg cttgctgttg agttggcaga cttaatttca gtaaacctgg acttgcaatt 4020 tgacgctgta acctactaca cggacagtaa ggtagttctt gggtacattt gtaatgagac 4080 caggcgcttt tatgtttatg tgagtaaccg cgttcagcgt atccgaaggt cttctagtcc 4140 agatcagtgg cgatatgttc ctactgaaca aaacccagca gaccacgcaa cacgttttgt 4200 ttctgcagcc cacttacaaa aaaccaactg gttaagtgga cccaactttc tgctcacgca 4260 aggaccagat attcagaaca ctttcaatct ggttaatcca agttcagatc ctgatgtgcg 4320 ccctctagtg tccacactaa ccaccacagc attaaccaag cagcttggtt ctgaacggtt 4380 tgccaaattc tcctcttggg aattcactaa ctggtgccat tactcgcctt atccacatag 4440 ctcatcattt taaaacaaca gagaaggaaa acagttcttg caaaggctgg cactactgca 4500 aagaattcac aatggaagaa tctgatagag ctataaacat catcctccga gcagtacaag 4560 aagaggctta cagtcatgag attgaatgca ttaaacggca agagaggata ccaaagagca 4620 gtccactcca tactttggat ccattcattg atcagcaagg tcttctgagg gtcggaggac 4680 atctccacct ttcaagtctt aatcagagag agaagactcc tttgttaatt cccggcaaga 4740 gtcacattgc cactttgatc attagacatc accatgagcg tgttcagcat caaggtcgtc 4800 actttacgga aggagcggtc cgtgctgctg gtttttggat aattggtgga aaacgaaggg 4860 tgagcagcat catacaccaa tgcgtaactt gcagacggct cagagctcca ctcagcatcc 4920 aaaaaatggc taaccttcca gcagaacggc tctcagcaga ccctcctttt accaacgttg 4980 gactggatgt gtttggccct tggaatgtct cctcccgtcg gacaagaggt ggtctcctgt 5040 acagcaagag gtgggccgta atttttacat gcatgagcat aagagctgtt catatagaag 5100 ttattgaatc tctcgacaca tccagtttca tcaatgctct tagacgtttt cttgctgtgc 5160 gtggacctgt caaacatatt cgctccgatc gtgggacaaa ctttgtaggt gcatgtaaag 5220 agctacagat accttcaaac attgacagca aagttgtaaa gacgtatctg tcagatcaag 5280 gttgtacatg gacctttaat cctccgcatg cttcccattg tggtggatca tgggaaagaa 5340 tgattggtct ggcaaggaga attcttgatt ccatgtttct ccagttgaaa gacaaactta 5400 cccatgaggt gctggtgacc ttcatggcag aggtaacagc tattatcaat gccaggcctc 5460 ttgttcctgt gacaacagac ccccatgagt cattcatact cacaccagca gctctcctaa 5520 cacaaaaggt gaaccctgtt gctgcaccta caggcgagtt tggagttacc gacttgtaca 5580 agtgccagtg gcggcaagtt caacaccttt ccaacacatt ctgggacaga tggcgtaagc 5640 aattccttcc aaccctacaa ccacgcagga aatggcagtc cgttcatcca aatgtcaata 5700 caggaagtgt tgtcctcctt aagaacagcc aagtaccacg gaatgaatgg cctcttggac 5760 tggtaacaca atcattccct agcaaagatg ggaaagtgcg tcaggttgag gttaaggtca 5820 ttaaaccagg aggttccact ctttttctta ggcctgttac agaggtagtg cttcttcttt 5880 cctcagagtc taaagtagat gaataactta gaattatcta gtgatgttta ttaacatcag 5940 gcggggag 5948 // ID HAT1_DR repbase; DNA; ZEB; 3292 BP. XX AC AL603743; XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE HAT1_DR is an autonomous hAT-like DNA transposon. XX KW hAT; DNA transposon; Transposable Element; HAT1N_DR; HAT1_DR; KW hAT superfamily; transposase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3292 RA Kapitonov V.V. and Jurka J.; RT "HAT1_DR, an autonomous hAT-like DNA transposon from zebrafish."; RL Repbase Reports 2(5), 22-22 (2002). XX DR Genbank; AL603743; Positions 79447 82738. XX CC HAT1_DR is an autonomous hAT-like DNA transposon. CC It is flanked by a 8-bp TSD. CC This element encodes a 597-aa transposase, HAT1_DRp. CC HAT1_DRp is encoded by 4 exons (ORF positions 654-881, CC 1152-2111, 2205-2390, 2487-2894). CC The zebrafish genome harbors subfamilies of nonautonomous CC DNA transposons that are derivates of autonomous elements CC similar to HAT1_DR. HAT1N_DR is one of these subfamilies; CC its consensus sequence shares ~85% identical 238-bp and CC 138-bp termini with HAT1_DR. CC HAT1_DR and HATN1_DR elements had imperfect 9-bp terminal CC inverted repeats (1 mismatch) at the time of their CC transposition in the past. CC HARB1DRp1: CC MDAPRKRCRPSVVWEHFHLETPNKVRCMYCDRQLAYCNNTSSMMRHLRSTHPAILGGAED CC GPVPRPASHTAGSSKQGRQKELDEALVDMVVKDLQPFTIVEDMGFRAFVNKLDPSYVLPS CC RKALKTMVMEKYNSTKEKIMGDIQKAQFVSLTADMWSSINMDGYLDVTCHFITPEAKMAT CC VVLGVVRFYQTHTAQHLMEAKTLLMSEWGITSKVQCLVTDNASNMMLSAQLLHLRHIPCF CC AHNLNLIVKKALDQTPLVSEIRQKARKIVGLFRSSCKAKDKLEEMQTLMGRPTLKLIQEV CC ETRWNSTYDMLQRLFEQREPVAAAISSLKADTALLSSMEYDVVQEAMSLLQPFKLATTEL CC SEEQRVSASKLIPLYRMLQHKLTEKKGLAKQESVVQLGSHLQEGLQSRCGGYESFRTLAL CC ATLLDPRFKTVAFGNTGKGQEAERHLTMECASLMRNQTSHDPQLSASMPSSSSSSSSSSS CC STTPAESQDSLWELFDNRVRESNLTSSVTADATVEVKKYITDAFLPRNEDPLKYWKEREN CC IFPNLYVLAQKYLCMPATSVPCERIFSKAGEIICKKRSRPSPSTAEHLIFLNKNLYL. XX SQ Sequence 3292 BP; 1040 A; 644 C; 673 G; 935 T; 0 other; tagggatggc tgatgcgaaa ctgacgtttc gacacagttt cgagatcccg aagcgcaagt 60 gtttcgaaac actgcaccga agcatgatcc gaaacactgg ggtcacgtga ctgaagtgat 120 tcgaaacacc caggtcacgt ggctaaagag atccaaagca tcagggcggc ccaagtgttt 180 cgggacgtgg catacctgcc gaatcagcgt ttgattgaca gggtcgtaaa caaatcacaa 240 tcgcacatct gtctgaatct aatttcaaaa aaaggataaa aatggtgtta tctcatgttc 300 aattcgtggg tttttgtgag aggagagata ttttgcgatt tagagacatt tatagtgcat 360 atttaagcta cattaaaact ttattgatag acaggctata gacactgaca aatagttata 420 ggatacattt gctgatacat tgatacattt ataggcatac atatagctac atccatacac 480 atacatccac ataggtatac atatacatcc ataggcatac acatacaacc atataggcat 540 acatatccat aatttagata gtgtatagta ttgcatagat agatagatag atagatagat 600 agatagatag atagatagat agatagatag atagatagat agatagaata gtaatggatg 660 ctccacgcaa gagatgccgt ccatctgtgg tgtgggagca tttccatttg gaaacaccga 720 ataaagtgag atgcatgtat tgtgataggc agctagccta ttgcaataat acatcatcca 780 tgatgcgcca tttgaggagc actcatcctg ccattttggg tggtgcagag gatggccctg 840 tacctagacc tgctagtcac actgctgggt catctaagca aggtatgcaa tgaatgaaat 900 ctaattgtga atacaaagtg ttgagacagt gtttaaaaaa ttcttatttt acatatagac 960 gtaagcttac gtcaacctaa gccctgcaat ataacatccc attaacaaac tttacagtga 1020 atatctaaat gtgacacctg acaaatacgg cactgtattt aaaattaatt atttgtgtgt 1080 gtttgagcta tttcaggcta aatctcttaa gtatatttga ctgaaagtag gttgtattcc 1140 ttttttaagg cagacaaaag gaactggatg aggctcttgt agacatggtg gtgaaggatc 1200 tgcagccctt caccattgtg gaagatatgg gtttcagggc ctttgtgaac aagcttgatc 1260 caagctatgt tctcccatct cgcaaggcac ttaaaacaat ggtcatggaa aaatacaaca 1320 gcaccaagga gaagattatg ggggacatac agaaggcaca atttgttagc ctcacagcgg 1380 atatgtggtc ctccattaat atggatggat accttgatgt aacgtgccac tttataacac 1440 ctgaagcaaa gatggcaact gttgtattgg gtgttgtaag attttaccaa acacacactg 1500 cacagcatct catggaggct aaaactttgc tcatgtctga gtggggaata acctccaaag 1560 tgcagtgttt ggtgaccgac aatgcatcca acatgatgtt aagtgcccaa ctacttcatc 1620 tacggcacat tccatgtttt gcacacaatc taaatttaat tgtgaaaaag gccctagatc 1680 aaaccccact tgtcagtgaa attcgccaga aagccagaaa gattgtgggg ttgtttagat 1740 caagctgcaa agcaaaggac aaacttgaag aaatgcagac cttgatgggt agacctactt 1800 taaaactcat acaagaagtg gaaaccagat ggaacagcac ctatgacatg ttgcaacgct 1860 tgtttgaaca acgagagcca gtggctgcag caatttctag tttaaaggct gatactgctc 1920 tgttgtccag tatggagtat gatgttgttc aagaagccat gtcattattg caaccattca 1980 aacttgcaac aactgagtta tcagaagaac agagagtctc agcctcaaag ctaataccac 2040 tctaccggat gttgcagcat aagctaactg agaaaaaagg ccttgccaag caggaatcag 2100 ttgtccagtt aggtaagccc caatagccat actgtcactg gccataagtt taatattgtt 2160 gtcagtaaca taattaatct gctttgctct tgttttgcac aggctcacac ctgcaagaag 2220 gactccagtc gagatgtggt gggtatgagt ccttcagaac attggcattg gctacactgc 2280 tagacccaag gtttaaaacg gtggcttttg gaaatactgg caaaggccag gaggctgaaa 2340 ggcatctcac aatggagtgt gcttcactta tgcgcaacca aacaagtcat ggtgagccct 2400 tttacttttc atgtattatg caacttcata taggtttgac atgagtttag actttgtaat 2460 taattttttt ttttcccaca tcagatccac agttgtcagc atcgatgcca tcatcatcgt 2520 cgtcatcatc atcatcatca tcatcaacaa caccagcaga aagccaggac agtttatggg 2580 aactttttga taaccgtgtc cgggaatcca acttgacaag cagtgttaca gctgatgcca 2640 ctgttgaagt aaaaaaatac ataactgatg cgtttttgcc aagaaatgaa gatccactaa 2700 aatactggaa ggaaagagaa aacattttcc caaatttgta tgtgcttgca caaaagtatt 2760 tgtgtatgcc tgcaacaagt gtgccctgtg agaggatttt ttcaaaggct ggagaaataa 2820 tttgtaaaaa aagaagcagg cctagtccat ccacagcaga acatttaata tttttaaata 2880 aaaatcttta tttgtaaaaa cattgacatt gtgtctttca ttaatgttca tattttatga 2940 ccaaagtatt tattatcact gctcattcag atgtggcacc atcattactt attcaacacg 3000 ataatgatat actttgtgac ttttacttat gtcagattcg aacccatggt ctaaacgttt 3060 taattgtgaa cactaaccac tcccccaaac tgctagtatt gcaccgttcc acaacatatc 3120 atataattcg cctgttagac gaagcgcttc tcattgaagc tctttttagt gcaatatttt 3180 aaaatgaccg ctaggtgcca ctgtagagac gtgtttcgaa acgtttcgaa gcttcgactc 3240 atttgcttca actgtttcag tgtttcatga agcctcgctt tgcccatcac ta 3292 // ID DNA-1-3B_DR repbase; DNA; ZEB; 763 BP. XX AC . XX DT 17-OCT-2008 (Rel. 13.1, Created) DT 17-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-3_DR; DNA-1-3B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-763 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1211-1211 (2008). XX DR [1] (Consensus) XX CC The consensus was built from copies that are ~95% identical to CC the consensus. The element is characterized by 1-bp TSD, and CC 43-bp TIR. Elements insert preferentially into TWA target site. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 763 BP; 181 A; 201 C; 192 G; 188 T; 1 other; atgctgtgtt cacaccagac gcggaacgcg cggataaatc gcgctattcg cgcgtaaata 60 gccgcgtgaa catttgagtt tactcgcttc attcgcgcgt caaaccccgc ttcattcgcg 120 tgtcaaattc acwtcagaac agacgcggat tcgcgtgatg ggcagggctt ctgtctgccc 180 gaagactcta gcttcatagc taaatggcta acatggattt tatgaagaaa ataacagtgt 240 ttatgtgctt tatgaagact gaaaaacagc gtcgatacgt ttagggtcgt gtctgagtcc 300 actacattct ttcagaggtg catccagctc tgtgagctca taaactcctc cagaaactta 360 acctggatga cggaggcttt cagcggtgct tctgactgag ccaagccgag tttgatgaac 420 tgttgtcggt gaaggctgga ggatttccct cggaacaccg acaacaggtt ctacgtcaca 480 atcacgcccc cacaagagca agcttctgat tggttaacgc ggcgcgaatg tacgctgaag 540 ttcagatttt cagaattcga gagattcgcg cgaaacgctc gttaagcgcg tcaaacgcgc 600 aaaacgctca attcgccccg cgccattcgc gcaatttgcg tcattcgcgc cgcgccattc 660 gcgcgtatcg cgccgcagga tgtctattcg cgcgtttgca ttgacttaac atgtaaatca 720 ctcgcgcttg acgcgcgtgc cgcgtctggt gtgaacgcag cat 763 // ID HATN9_DR repbase; DNA; ZEB; 1054 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HATN9_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Dr000199; HATN9_DR; nonautonomous DNA transposon; KW hAT superfamily. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Bao Z.; RT "Dr000199, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-1054 RA Kapitonov V.V. and Jurka J.; RT "HATN9_DR, an ancient nonautonomous hAT-like DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 17-17 (2002). XX DR [2] (Consensus) XX CC About 500 copies of HATN9_DR are expected to populate the CC zebrafish genome. The element was active recently, its CC copies are ~1% divergent from the consensus sequence. CC This element is characterized by 8-bp target site duplications. CC HATN9_DR has very short 3-bp terminal inverted repeats. CC Putative classification: nonautonomous DNA transposon from the CC hAT superfamily. XX SQ Sequence 1054 BP; 249 A; 247 C; 260 G; 298 T; 0 other; caagtgtgcc gcaagcctac gtttgagtga aggtttcggc cgttagatcg ccccctgggg 60 gctggctgca gtacaagtca taaagcccgc ctcctccgtg ttaatgaagg agacttgagc 120 ccaaataaaa aaaaatatta cacttgcaat aaaatgtccc gaaagatagt tctggtcgat 180 taaggcactg gttattgtgc tgaaataggt gcagatcttc atttttgtaa acagtttgtt 240 tttagcagta atttaatgct gggcgtgtca tcgtgattga cagctgtgat tgacagtttc 300 tcaaagcgcg gcgtctgagc ttcggcagga gactgaagta gactgaaatg ttattattcg 360 atttctgtgt tattttacca tgacaaaatg agttcagcag taaactatag tttctgacat 420 acatgatcct ggtggaacac tgtttattcg ctaagttcag ggcttttttc gggctttatt 480 agtttgctga tacacgccta gccacccaga tagcaaaaca cagttccggc tagattctcg 540 cctgccggag acttatctct tagagctcag actgactaat gttacctctg ctggacctac 600 tccggatgcc tggactcact acccaattcc agcccgagtc aatcgagcca gatgcggcgg 660 ccgagcgagc cggcgctgcc gcatgcgagc cggaatcagc ccgcgacgcc gcgagtaagt 720 tatgcgctgc ggcctggagc tggcgcgatt gaatatataa aaccctcata tttgttaacg 780 ttattacatc catttgtgtt gatatgacag caaacgcatg ctgagaagtt cggggggcgt 840 ggttgatttt acataaagcg tttggttgga agctcgactc cgctcatttt cgcggctcct 900 cctctggctc catcagacaa tccttctgcg catgtctggc tccaatttca gcagtctttt 960 gcgacagttt gtgcccgtca agcaggcgtt ttgccctcaa ggcgttcaat gggaaaaagg 1020 gctgtcgcgt cgtccatatt ttttacagtc attg 1054 // ID Gypsy-15-LTR_DR repbase; DNA; ZEB; 1965 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-15_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy-15-I_DR; Gypsy-15-LTR_DR; Gypsy-15_DR; KW endogenous retrovirus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1965 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-15_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 4-4 (2005). XX DR [1] (Consensus) XX CC Gypsy-15-LTR_DR is a long terminal repeat of the Gypsy-15_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-15-I_DR. XX SQ Sequence 1965 BP; 580 A; 294 C; 371 G; 718 T; 2 other; tgtaacccag gttactgtag tccttatcaa gccctttaac caaataaata aataagaaca 60 ataactggag tacatgtaaa ataattttat ttgtattttg taatagtcac aatgttatca 120 aaaagaaaaa aagaaacatt atataatcgt agaatacaga taaaattaaa tgcatctatt 180 taactgcgag tgaaaaaaat actagctccg cctcccggaa tgacgtcatc tgcgcgcgtt 240 gagttgagag agaggcagaa tcacagcgtg agagaggagt gaaacgagag aaacgtgaac 300 agctgaagcg agctcgcggc ccgcgtcgtg tatagctgta aaatttgaga aattagaagt 360 aaaacaagca gttgaaactt gatttagttg ctgtaatata ctactgtccg tactcaaacc 420 cccaatcgcg tttggaccaa ggacaggaat cgtctgaagc acagatctga ttaggctgta 480 tttttttttt tggttcataa gtgatttttc cttcaactat aactactata tttcttggtg 540 agtgaatttt atattattgt atttcactaa aagtgattgt tataagtcat ttatggtgta 600 aatggttcat aaatatagtc tgaatgtatg ttaaatggtt aaaatgcaca ttaatgtgaa 660 tctaagtgtg gattaagagc aaatctgctg tttaaatgtt tttgggaagt gtattagtgt 720 tacaaacgga catctgcagt tgtccatgtg ctcttttagc atgtgcaaat ggctcacatg 780 tgttttgcta accgtgtgct taaactagca tgctaacaga atgtgagtgc atctattaag 840 atgttttttc agtttaatag ctatatacaa atacgaaatt gcaattgtag tgaagataat 900 gtggtttaat atgagagttg atactttatt gtacatttat cctattttat ttaaattggt 960 taagtttatg tgagaactga ttgtatgcac tgatttatat tcattgttga aaggtgaatg 1020 gaaattaatg gtgcctttcc accttgtctt ttcaaggttg ggtttttttt tttgtgtgtt 1080 cttttcctct gttttgttkt tccatcttct ctgttcgtga tcctgacacc tgacatttgg 1140 agtgtggagt tctagaagtc caaagtattc ttgtttctgc catcttggag gacattcgct 1200 gttggtggaa actggcgagc cgcccggttt tatggaacca atatttcgtg ttgcccaatt 1260 catctgattg gtaggaaatt tgagtttata ttttccatcg tctatcggac acaggaacga 1320 ctggaagcca gagtcatatt tagttctatt tttctggaac tgagctacta atcagtgtta 1380 aaacaaaaca ctgcaacggg tgcattataa acatttgagt ttattgaact gaacttgaac 1440 atttcacaac caaacaggtg atcaatcata tttgtttgat ggactagtga atyattttgc 1500 cgaatcactt tggtatctga caagaaatta ttttctgggt tcaattttct agccgttttt 1560 tttcctaatt gaagtgtttc aattattact atggtcttac ttacctgagg agtgctgaag 1620 ttgtttatat ttacctttta cattttgttt ttgtgaagaa aacacttgac atagtgtgaa 1680 aagcctttat tttattttat tactttatat ttggggaaat atacatttca aagtccaaat 1740 ttgagaataa ttagtttgtt attttattta atttatttca tttctcacta atatttacaa 1800 tataaatcat atatattttg atacaaaaat tcctgtcttg tgtcatcatt tcccttgtct 1860 tgctgcaaag ttggctccag agcgaactta aacttctgat aacttggtcc aaagttgata 1920 gtataatatt gtaatctcct tggtttctat atcgtgcggg ttaca 1965 // ID Ngaro1_DR repbase; DNA; ZEB; 6578 BP. XX AC . XX DT 22-OCT-2004 (Rel. 9.09, Created) DT 22-OCT-2004 (Rel. 9.09, Last updated, Version 1) XX DE Danio rerio retrotransposon Ngaro1 - a consensus. XX KW Non-LTR Retrotransposon; Transposable Element; Ngaro1_DR; gag; KW retrotransposon Ngaro1; reverse transcriptase; KW tyrosine recombinase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Goodwin J.T. and Poulter T.R.; RT "Direct Submission to Genbank."; RL Direct Submission to Genbank (20-SEP-2002)Department of RL Biochemistry, University of Otago, Cumberland Street, Dunedin, RL New Zealand. XX RN [2] RA Gentles A., Kohany O. and Jurka J.; RT "Danio rerio retrotransposon Ngaro1 - a consensus."; RL Direct Submission to Repbase Update (JUN-2004). XX DR [2] (Consensus) XX CC Ngaro1 is a retrotransposon from Danio rerio that has putative CC gag, reverse transcriptase, and tyrosine recombinase domains. CC It also has a putative recombination target site (see Genbank CC accession AY152729). XX SQ Sequence 6578 BP; 1731 A; 2035 C; 1025 G; 1753 T; 34 other; ttgtaagaag tccataaggc aactcactgt gtaaagccac tttaaacaaa gtgaagtttc 60 ataaactaat ttcgagagga gcacgtgata tgattgagca cagctggcca ctcatttgta 120 atcagtaata atccaatccg agtgatccta gcctactata aatggatcac tccctaactt 180 tttcccctat cttcattttg gaagaccaaa agccaagaac gtgcaactgc aatcgaagct 240 tgacaaacca acaaaccaac aaaccaacca accaaccagc aaacaacaaa caacaaacaa 300 gcaagcaagc cagcctgcca gcagtctaac aactttgcat ccaacaaata aatcgacata 360 atggaaccaa caagcaacaa caacaacgcc ttaaccacca gcgagagctc cgctctcccc 420 gacacgcccg ccgcggcaca acgctacaca gccccgcaaa caaaggaacc cgaagccgat 480 gcccctgtga gaggccgcag gccaatccgc gccgcaacta cacaacgccg ccagatttct 540 ccgtctccaa taacaccaag ccgtcaccta ccttcaccag cttcctcgta tgcctccgcc 600 agatcatcat ctcacgtctc caacaagatt accatccctg agctccggaa aaacctctca 660 gatgcaggaa tccacatacc tccacgatgc aataaagctg atctcctgaa actctactcc 720 gctatcgctc ctcaatcccc gcccaccatg gacgccgcaa gatcttcccg gtcacgtcac 780 gctccacaaa ccacgcccac cacggacaat gcaagatcct cccgctctcg ccacgccccc 840 tatcctcaac attctacttc acagactgaa ctttctcacc caggacaatc aaagaaagcc 900 acccagaaaa gttcaactac cagcaagaaa gcatcaagac cacacattca agaaagccgg 960 accaccccag ccatatcatt tcaacacaca gaaacacctc tcgccaatat ttccactata 1020 tcacatgcta accagccttt caccacctct ctcacttggc ctccagctcc acactctagc 1080 tccaccccta gccccccatt acacaccact gccatcagtc attctcattc tcagcccccc 1140 atacctaatc tccctcgtac atctacacag ctcattcata ccacctctag ctccattcac 1200 aacgctcaac ctctctctaa cccctttact ctctctagca ttccccccta taaccctccc 1260 ccttcccttc accaagcact cactcactct tccagcactg atgcagctca acatcccact 1320 gtcccttcac aaatttcttc atcacacccc ccgtacaacc ttttcacagc caacctcctc 1380 ccagtgccac acaacgctac agtcctggaa ccaccacccg tctcgaatgc tgcaaaaaac 1440 ctcattttat caggtgctga catcgatctc tcatcactcc tttcacccat tacaccccct 1500 acggcagaac gacaggtgga ttgcggcgaa ttcacactca cgctcaaatc ttcaaacacc 1560 actcaatcac gtactctctc gttagcagaa tttcaggtag cattttctag attcactgac 1620 attatttgct ctgtattccc ccataggagg cgcgagttga acgactacat ggccatcatc 1680 tcagagctcg cgctctccta tggaggtact catttctata cctaccacaa attattttcc 1740 gccaaatgcg ctatccgagt aacccagtgg aatcagtgtc cttactgggg ggcgttggac 1800 actgacctcc acaacagggt ttttctaggt tgccgcaatc tttcctgcgc ggtctgtcgt 1860 tcctgtcttc accctaccac ttcctgtccc tttattattc cctcttcaga taaagaatcc 1920 cctaacccaa gatccactag ctacgttccc cgtccttcta cttcagctat ccctgctcta 1980 cttcctcctt ctcttacctc ctctcagctt tcttcytcct ctaacccatg ctttagtttc 2040 aatataggca gatgcttccg tcacccttgc aagtttcctc atacatgcaa ttattgcggg 2100 ggagcgcacg cccgactagt ttgcccaatc ctgaaggcta ataaaaaaat caagaaatta 2160 tttatcgact cctgtcaatg tgtctcaamt tgcatctgaa ttatgttttc atcctgaccc 2220 taacctttct gaatttctca tttcaggttt atcctatgga ttccayccag gtgttttgac 2280 cctgccctct tacaacctcg cttgcccaaa tytacaatcy gccatygctg aaccmgaagt 2340 tgtagatcaa ttratcaaaa aagagatcga caacaaattt atgatcggtc ctttttccac 2400 ccccccattc agctcttttc gcatcagccc tatcggtgtt gccaccagaa aattttctga 2460 caaaaaaacg mctcataaty gatctatcat ccccccataa ctccaccttt tcaagtatca 2520 atagcctaat tgaatcagac caattctygc tcaattacca tgatatcgat caggcaattt 2580 gtttaatcaa atcagttggt cgtaacgcat ggcttgctaa agtcgacatt tcctccgcat 2640 ttaaaataat gcctttacat cccgatttct ggcatttgtt tggcattaat tggagatcca 2700 aattttactt tgcagtccgt ttaaccttca gatgcaaaag tagccccaaa atatttgaca 2760 tgctttcgga ggctatatgc tggattctca ctaataatta tggcgtgtcg catctcatcc 2820 acttactaga tgaytttctc atcatttccc ytcctagtga gccacccgca agacagttag 2880 cgattactca gaaagtcttc gctaatctcg gcatccctct agcwgaggaa aaaacggccg 2940 gccctagcac ytctcttgag ttcctaggaa tyaagctgga ttctaaaaac ttccaagctt 3000 cacttcctaa ggaaaaaatt gatcgaatca ttttcctttc atccatcttc cttgagaagc 3060 aaatttgcac aaaacgcgaa cttttgtcca tcctgggaca cttgaatttt gccatgcgca 3120 taattcctca aggtcgcccc tttgtctctc acctcctcca aactgcagcy tccattaacr 3180 gtctagaaga aactatcccc ctgtctgaag cttgccggar ggaactcagc ctttggattt 3240 ccttccttaa atgctggaat ggctgytcat ttttctatag cgatctggtg tyagctccta 3300 tagacatcca tctgttcacc gatgctgccc cttcwgtagg gtttggaggt tattaccagg 3360 gtcgttggtt ygcttctcca tggccttcac aaatgctaga aatcccattg ccctctcaat 3420 catcagctct ctttgaatta taccccctag tggccgcwac tatcatttgg ggtgacgaat 3480 ggtctgcttc cagcattcta attcacagtg ataatgaagc agctgttcaa tgtgttaaca 3540 aagggcgttc tcatttccct attttaatgc ctttcattca tcgtctagtt tggacatcag 3600 ctaaaaaaca gtttatcatt actgcaaaac atgtacctgg tttcaaaaat caaattgctg 3660 actctctttc tcgtttttcc ttccagagat ttcggcagtt ggccccagaa gcggatccac 3720 atcctactcc catccctcca tattcagaga tgatattgcc ctgaatcatc ccatgcataa 3780 cttacaccaa gcatccatct ccctcattct agaagcagta gcccctagaa ctatacagtc 3840 ttatctcaca gcatggagcg ctttcaaaaa gttccacacc caatacaaca caagttttcc 3900 caatttttcc ctcctcacca ttacatcttt tatcacatac ctacacactt ccaaaaacat 3960 tcaagctagc actattaaga gttacttaag cgggattaat ttctttcaca aattgataya 4020 tggctcccca gctgaatcaa tttctaactc ccaaacaggc cttcttatta aaggcatcca 4080 gaaagtacat ccctcccccc cagaccctag gctwcccatt acacaaagag tcctctccaa 4140 atgcatttct accctccgca aaggttayat ttctacaaat acagatcgca ctctggatgc 4200 catgtttaat ctggcttttt ttggatttct tagatgctct gaattcacca taacttctaa 4260 atttgatcca aacattcacc ctaccatctc agaccttact ttgttagatg agggaactct 4320 ttctttcctc ataaaacaaa gcaaaacaga ccagacacgc aagggtcatt tcatctatat 4380 attcgacctc ccctcaccca cacttccatt tcaaacactc aaggcctttc tactctttag 4440 gaaaacacag aacccgagcc ctttttcccc cttatttgta gatgactcta atcgcccagc 4500 atcacgtttc tggtttcaaa gacacctcaa ggcagttctc agtctctcag gctttccctc 4560 agattcatac tccagtcact cattcagaat atgagcagcc atcacagaag tacaaaaagg 4620 gctctcacag caacaaatcc agatcctaga aaggtggtca cctgatgcct tcaagrcata 4680 catgcgtcct agccactggc cccccaaaga agcccagatg gctcttgtca gccataatcc 4740 cacatccact caagggcgag ggygtgaccc agccaccttc ycttcctctt tttcttcctt 4800 ctagcctgag taacactcag tttcctcccc cagccacata ggtaattgga gtttcatcca 4860 agcccacccg tcaccccgcc accctggccg tttcctgctg gagtcttcac agccccctcc 4920 catttcccga ctcctgccgg accccgcccc ccctaaagct ctgacttccg cagaagtgtt 4980 accccgagct acgacccccg caggggtcgt ctcaccgtcc cttccaggcc ttagcaatct 5040 ttatatttat atatatatat atatatatat atatatatat atatatctat atacacctgt 5100 acatagatat atatatttat aatagcgctg tcactccccc gctccatctc ctaacggagt 5160 gttcctcgag cacctaactc atcccgtcac cctctaacag gagtcttcac tgtccaaatc 5220 ccctttccca gactcctgct agagtaggcc agcttgccct gttccccggg cgccgacccc 5280 cgcaggggtc agtcactgcc ccccccaggc cactgaaact cattatatat gtatatctat 5340 ggactttcgt ttatagatat atatttatat agagcgctgt cactccccgc tctatctcca 5400 gtcggagtgt tccacgagca tcgactccag caagagtctg gccaaacttg ccactcaccc 5460 tctagcagaa atctccaccg cccaaatcac acttcacgat ttctgctaga gatggcaaaa 5520 taaataaaaa attgctgcac ccaactcccg cagcgcccat tctgactctc agaagtctcc 5580 tgatcacccc ctccaggcct tagattaycc cattatatat atatatatat ttatatactc 5640 tcttatatat acatatatat ttatatatag cgctgccact tccctgctct atctcygttt 5700 ggagtgttcc tcgagcattt ttcttgactc ttaatgagcc aaccccgccc accccttatg 5760 gccccccttc actagtctcc acccaacccc ctcccccgct ctggcttcca caggagtcag 5820 tttcaaactt tgctccaact ggagccccct actctttctt ctattcctta aattactata 5880 tccagcagcc ggatatagta aaaaactttc tagctttttg ggggaatatt ctttgaaata 5940 ctcggctgct gtcccgagct agaggcattt tttggggagc gatcgagacc tacctgatct 6000 cggttctcct gatatgcttc tagaccgggc gggagccctg ggctcaaata tctccgagct 6060 cagggttctc tcccgggaca gcatgccaaa cctgctataa gtgccaagca tatctaagtg 6120 ggaactcttg aagtgaagtt tcataaacta atttcgagag gagcacgtga tatgattgtg 6180 caccgctggc cactcatccg taatcagtaa taatccaatc agaatgatcc tagcttagta 6240 taaatggatc actttctcac ccattgcact atcttcattt tggaagaatc cccccttcca 6300 ccccatctcc tcctttttct cccycttcta aaagggggag cgatcgagac ctacctgatc 6360 tcggttctcc tgatatgctt ctagaccggg cgggagccct gggctcaaat atctccgagc 6420 tcagggttct ctcccgggac agcatgccaa acctgctata agtgccaagc atatctaagt 6480 gggaactctt gaaayaaaaa tacgatgaat atgccgagtc caggggctaa tctgctggag 6540 tcagctgagg tgaagagacg gcgaccagtg agacctag 6578 // ID hAT-N58_DR repbase; DNA; ZEB; 808 BP. XX AC . XX DT 22-OCT-2008 (Rel. 13.1, Created) DT 22-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N58_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-808 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1327-1327 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 808 BP; 293 A; 153 C; 147 G; 211 T; 4 other; tagggctgtg cgattaatcg aaatcgaatc gcaatcgcga tttgaaacgt tgcgattagc 60 taatcgcaag aggctgcgat atgaaatata tattatttaa tttcccccgc ccagagcaaa 120 tgcgtgactg ccgtctgtgt gtgacagtct tactagccaa ttgagtgagc acactttcgt 180 tctgcccaat cagaattgcg caaccgaact atgcggaacg cccacaataa aaaaacaaac 240 aaacaaaaca aacaaacagg aaagcgcgga caagtgggat gatggcgtct gccgcttcag 300 aagcattaat agacgaatta atatcaaaga aaaacagcac atcggtaata tgggaatatt 360 ttggtttcaa agtcacagac accaaacaaa aacaggtcat ttgtaagagc tgtcacagaa 420 ttgttgccac agcatgagga aatacaacaa ccagctatat gaggagtgtc ttgctaaaam 480 gtcaactaaa aaagtatcgc aagtgacact caatctagta gcaawctaaa gtctaacttg 540 tttatwattg tttactgttt gctctagaga aaaatgagtg tttaatttct gaatatttaa 600 aaacaaattt gaataaaagt tggagtwagt taatatcttt tcagttcctt tttttaacta 660 acactcactg cattttagca gtaagaagct acgatacaga atattgagct gaagcttgac 720 tacaaaatta aatcgcaaat caaatcgaaa tcgcaatatc tgttaaaaaa atcgcaatta 780 gatattttcc ccaaatcgca cagcccta 808 // ID Gypsy107-LTR_Dr repbase; DNA; ZEB; 770 BP. XX AC . XX DT 29-APR-2009 (Rel. 14.05, Created) DT 29-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-107_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW LTR; Gypsy-107-I_DR; Gypsy-107-LTR_DR; Gypsy-107_DR; KW Gypsy107-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-770 RA Dib M.R. and Naveira H.F.; RT "Gypsy107_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 949-949 (2009). XX DR [1] (Consensus) XX CC Gypsy107-LTR_DR is a long terminal repeat of the Gypsy107_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is Gypsy-deposited in Repbase as Gypsy107-I_DR. XX SQ Sequence 770 BP; 123 A; 172 C; 134 G; 341 T; 0 other; tgtcaggact cctggctgat cggctggttc tatctgttct gtttgtatta gccacatgtt 60 cctttgtttg tgcttctgat tgtttctctc cacctcctcg ttctcataat tttagttgat 120 ttgcttcacc tgtctttgct tattttctcc tcattagctc ccctatttat tctctgggtt 180 tctgctgttc agtgccagat tgttgtttgc ttctgaccct gcaacatcaa ctataggtaa 240 acttaatgtt tttagttaac ctttttctgt ttgactccag gactctctcc ctaccttccc 300 agcctttctg tgtagtccct ggcttgctcg aagtttttgc tcttgtgctg cctgagtttt 360 tgatgtcctg gacactggcg tggttctgac aattacacct gcctctgttt ctactggcca 420 gtctattttt tgttcctggt tatttccagt cattctcttc ccctccatca aagtctgcaa 480 gtattttttt tgtcattttt tgttgttcat ttttttttgg actataatat tttttgttct 540 ttcttgatca ggctgagagg actgcttgct gaagagcatt tttttttccc catttgtcga 600 actgaggttt ttgtgcattt ttcccttgag actgagacat tgcctttaga agattttttt 660 tttcaatttt gcattctggc taatgaactg atctgttaga atttttgcct tccacaataa 720 atcattgtct tcacctgcat ttgggttttt tctgtttact ggccttgaca 770 // ID Mariner-N3_DR repbase; DNA; ZEB; 5507 BP. XX AC . XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative Mariner-type non-autonomous DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Mariner-N3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5507 RA Jurka J.; RT "Mariner-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1613-1613 (2008). XX DR [1] (Consensus) XX CC 2-bp TSD, mostly TA, but other dinucleotides can also be found. CC 98% identical to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5507 BP; 1793 A; 862 C; 881 G; 1971 T; 0 other; cactgtaaac gatttctgtt gttttcacgg tattatactg catttctcaa cagttgttta 60 ctgtataagc tgtgcacagt gtgttactgt atatattgct gttttcacag catgataatg 120 tattctcaac atttccatac tgtgaacatg attaacagta tgctactgca gattttacaa 180 tgaattttgg gaaaaaattc agttactgtg catctaaata cagtactgca aatgaccatg 240 aaaaactcta cacacacttt atattattat tattattatg tttattttat tttttgacaa 300 agtattttgt gaagaaaaat aaaataacca gtttcttcag actgttcagt gcataaaact 360 gtcccctggt tttccatatt tgggtgtact gcaccttaat tttataaggt taacatgaaa 420 atattaaatc atttttaatc aacataactt tgtgcttaac caaaacgtaa actggaacaa 480 aagtaattga ttcatgaaac caaagtcagc ttaaatgtca taatgaccag cctgctttag 540 acggattcat gaggttaggg cacccgtcat caccatggag gtcacatatc tccgcatttt 600 caaaacaata ttttaaaata cccaccatct ctataaataa accacagact tgagtttaaa 660 acaactacat tcccgcatga aaaacggtta aaactaaatt tcataacaca ttcattatag 720 agccatctaa gaccggggca agtctcacat gtctagccga ttataacctc gtttcgccgg 780 atacatgagt gatgatggac accctggagg tcagaatctc cgctatcggc aaagcacgct 840 taaaatttac acatgatatc tataaataaa ccacagattt gagtttgtaa gaactacatt 900 ctcgcatgaa acactgttaa aatttcattt cgtgacatat taaacgcaat atttataata 960 ttagctgggc tttctcatcc gccattacca tttgcttgtt gatgtgaaat gaattctggg 1020 agaaaaaaaa atatatatat atatatatat ttatatataa ttctcaaaac acttcaaacg 1080 aactttattc ttattgtcaa ggtcagatat atgtaataat ttcgctggta aagtgaataa 1140 aaaacaatgt tactatattg taatcaatat cacgtttgca ctggttcggt tcagcactcg 1200 agcgactctg aatcgtattg aaccgaattg aaccgtttgc tcgggaatta tttgatcaca 1260 cgtgttcaca agcacattca gcaatatgca attagtgtat aattagtgta taaatgcgtg 1320 tggccaagca gtataacgaa atagtaatga atcatataca aaaaggaatc ggttttaggc 1380 ggattcgatt ctttactgtt cactaaggag ccgattcata gaaccgaacg gacacgactc 1440 gacatgcagg caacggctct cctaatttga aattcacctg tcgtctttca tccagcgaat 1500 tcatcctcgg taagtgtttt aaagtcaaat tctgtccgtt aaaggattgt tttataagta 1560 tttctgtgta ttgtgtgtgt aatttaaaag aattaaccaa gtgacattag ttgtaggttc 1620 ttatgttatg tcacgctatc cgttaacaaa cggtatttta aacaagaaat aacagacgcc 1680 aggcggtcat tgtagtgttt gcgaggtatt agatacagtg agtttatctt gagtgagttt 1740 ttgaccatac gttaagttac ctgaaaaaaa cataacttta acttattgta actgcattaa 1800 tgaattctct atttcagcac aacactgaga tgaaggcgaa gtcaccatga aaagttaaac 1860 gtttaccgtt aaggtaagct ctggtcttca cataacgttg tcattatgac taagctatca 1920 ttatggtaaa atttttcaac tgtaagttag ttatttaaac taaactcata atgaaatgag 1980 tcttaactgt tttattaaat ggagctgaaa attaaatgca gttgaattat tttaagctaa 2040 cgttaatatt aacaaaatac atattaattg tctatttctc aataatgtta atgagcttga 2100 tgtatagtgt gtcattaatt gtacatttaa tttttggtgt tcaggtgtac ctgaggtgag 2160 tgtacgttct gagtacacag atataagcca aagtcattat cactatatat atatatatat 2220 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 2280 atatccatcc tattctaaga gaaattgatg gggtcaggga atgatttgtg tacaaatgta 2340 ttttaacccc atacagattg gtatggttgt accatagatt tagattaatt aatgttaagt 2400 cttattttac tcttattata tttatctgtt tttcagctac agtaattgat gttgaaaaac 2460 tgatgcaacc aatcactcct cacttgatta ttcaaggtaa tgttgatgtc ttcctcattt 2520 tgattagatt tttaaatgtc aagttgttca cagatttgtt gtgaggagtt tcatgtacaa 2580 cctctttttc ttcttccagc agttcctaag ataaaaaaaa aatctggtgt taaaacagcc 2640 acggctatgc tatttgccta ccaagtgctg agtaaatatt agacaaacat ttttttctat 2700 aactgttgtc tttgtgttta ttagcaattt tactgggcgt atgtgtttgt tttttttttc 2760 aaacagcctg gaaactgtaa gtttagttca atatgttatt ttatgctatt catttttgtc 2820 ttaacatgga ttgcatgttg aagctatcgt tttatagtgg gaaaatgtaa aagagaatca 2880 ctgaatacaa tggcataatg attctgcttg ttcaaagaag gtacttgatt tagttttcct 2940 caagcgacac atcagttttt atgactattg cttgacactt gaattgaagg gacagttcac 3000 cccaaaatga aaataatagc atctttggca tcctccactt gtttcaaaat agttatatat 3060 ttaatttgtt ccgttgaaca caaactcgta ttttttaaag aaatttggtt caaatatgga 3120 tgtcaatagt tactggttac taacattctt caaaatatct gctttagtgt ttaacagaaa 3180 ttcaaactag tttaaaacac gtggagtaaa tagtggcaaa atagtaaact attcctttaa 3240 atattttatt ctgagactta tttatatgtg tcttgtcttt tgtcatattg tccttctagc 3300 ccagattgtg atggacctgg aatgcgacac cagttactct taagacaaga accgaataaa 3360 gcacagatgt ggagggattt gcagccaaac taactggagg gaaggaacct ttaggtaaaa 3420 gcacctgctt tatttagatc tgaagtattt tatttgacat agagatttat tgttttaatt 3480 ctaatttaaa gcaaacaagg ttgagcaccc ctggtctaaa agctcaaaaa tgctgtttcc 3540 gtgataaggt taaatgtagt tgagaaatag attcattagg atcaatgctt aaagtaaaca 3600 ttttcttctc tctgtagacc aaacatgttt actatccctc acattttaat aaattgtttc 3660 tgtttttgtt tcttacagtg tgggtgaaca tactacctgg atgtccaaca aaggctgcac 3720 aagagctcat ttgatagcac atattctgtc ctgtctacat ctctgatgac ctggaagtgc 3780 ttcgacatca gtgagtaata catctttata ttgtcatgca tacaggattt ctgtaagctt 3840 agctttaatg ttagtttcta atcagcagtg actgttttaa aaaattaagt atttaaatta 3900 ttctcaacgt tttactgatg ctgtgcacag taacgtcgtg atgatttagt ttgaggaaat 3960 gtcactagag ggtttgtata aaaaataaat aaataaataa attgtatgaa atgactcgat 4020 aaaatgttta tttgtctcgc tgacatgcca tcactgctgc atattaaaat acagtaatac 4080 gaattgacag cagaatatta ccacaaacat ttcatgaagt aggctagact gctctttagt 4140 tctctactca actttaacac atttcctcgt gtcgctgagc ctctctgcag attttgccaa 4200 aaatgaatgc ttagtgatta ctttgtttta tgcaaaccaa atatcagtaa ctcatcatta 4260 gcattatttt aaaaaatcac ccacggcttt catgttctct aacgcaatct tatctgtaag 4320 ttagaagtgg atttatacgt gttaaaagcc taaaagtatt taattaacat tagtcacgtt 4380 aaattagatt atttttattt atttaatttt aaatatgtag cctcttaaag tgagaatgaa 4440 ctgtgttcgc ccctcttaaa tccgtggaag cactagtagt gtgatcatgg aatcagatat 4500 tcgagctata cagagtcgga cattaagtta agatgagact ttatttttta cacacctgtg 4560 tggacacctg actagcgatc ttcaagaaca atgcccgctg atcttttccc cattaacgtt 4620 agtgtaattg tgaattgaca acaaactgta tttcaccaac aaaaacaatt aaaaaaactg 4680 ttatatatat tataactgtt atatatatat atatatatat atatatatat atatatatat 4740 atatatatat atatatataa acaacagtgc agtgtttcgt ttctgaatga attaaatggt 4800 ttgagggatg attagttttt acttcagtga ttgattttgc ttgtcatttt tgcaggatca 4860 tcagaagtgc tgctgccctt tttcactgtc cctttccaca tcatttggcc ttcatcattt 4920 cctgtcttgt agcagtattt tcacctgttt ttaaaattgt aataattttt gatgcattaa 4980 ttgttgactg gttttaatga tcaattgttt attagtgttc tgttgctttg ttatattgaa 5040 ttattgctac tgttttagca tggtttacac aataaacctt tattgtgaag ttaaagtttc 5100 agtttcttta tttttcttgt atttcttgaa cagttaaaat aacacttgtg caatttttga 5160 acatttttac ccattataaa gtagcactat taaataaaat aaaataacat ttaataactg 5220 taaaatacta tacagtaata atgaaagtaa cgttaaaata ccgtagttct tcatggtaaa 5280 aatgtacaat aaaatactgt tttcagtttt gcagtatgtg tatcactact gtaattggtg 5340 ttacagtaaa aatcaacaac agtaaaatga tgtattacat tttacagtaa atgtaatact 5400 aatgtaatta gtattacagt aatgttactg ttttatgaaa tacagcagct actggataat 5460 tgttgccagt aacttactgt taatttaaca agaaatcgtt tacagtg 5507 // ID HE2_DR repbase; DNA; ZEB; 451 BP. XX AC . XX DT 06-APR-2011 (Rel. 16.04, Created) DT 06-APR-2011 (Rel. 16.04, Last updated, Version 2) XX DE SINE element - a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; HE1_MM; DANA; KW HE1_SINE; HE1_DR1; HE2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Ogiwara I., Miya M., Ohshima K. and Okada N.; RT "Retropositional parasitism of SINEs on LINEs: identification of RT SINEs and LINEs in elasmobranchs."; RL Mol. Biol. Evol 16(9), 1238-1250 (1999). XX RN [2] RP 1-451 RA Jurka J. and Drazkiewicz A.; RT "HE1_DR1: HE1 SINE element from zebrafish."; RL Repbase Reports 2(3), 4-4 (2002). XX RN [3] RP 1-451 RA Jurka J.; RT "SINE elements from zebrafish."; RL Direct Submission to Repbase Update (06-APR-2011). XX DR [2] (Consensus) XX CC ~94% identical to consensus. Subfamily of HE1_DR1. XX SQ Sequence 451 BP; 72 A; 77 C; 165 G; 137 T; 0 other; ggggcgagca gtggcgcagt aggtagtgct gtcgcctcac agcaagaagg tcgctgggtg 60 ctggttcgag cctcggctca gttggcgttt ctgtgtggag tttgcatgtt ctccctgcgt 120 tcgcgtgggt ttcctccggg tgctccggtt tcccccacag tccaaagaca tgcggtacag 180 gtgaattggg taggctaaat tgtccgtagt gtgttgtgtg tgtgtgtgtg tgtgtgtgtg 240 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 300 tgaatgtgtg tgtggatgtt tcccagagat gggttgcggc tggaagggca tccgctgcgt 360 aaaaacttgc tggataagtt ggcggttcat tccgctgtgg cgaccccgga ttaataaagg 420 gactaagccg acaagaaaat gaatgaatga a 451 // ID Gypsy96-I_DR repbase; DNA; ZEB; 4296 BP. XX AC chr19; XX DT 02-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy96-I_DR; KW Gypsy96-LTR_DR; Gypsy96_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4296 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1503-1503 (2008). XX DR Genome; chr19; Positions 20993832 20989537. XX CC Positions [1718-2173] - Reverse transcriptase CC Positions [3188-3667] - Integrase core CC 'TATTG' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 18..953 FT /product="Gypsy96-I_DR_2p" FT /translation="MDAAGSDPVITAVTQQGVLLGQHEARLTNTTREMEYL FT ANQVADLTARIRELQHDTAQGGQADPGPQHTREPRCNNPPPYDGDPNSCRA FT FLSQCSVVFALQPHTYAAERTKVAYVLTLLAGKARDWGTSVWETQAPCCAS FT FEDFRQEMVRLFDRSAQGQEAADQLARLRQAGRSVTEYAIQFKTLAASCDW FT NQGACMSMFRAGLEEEIQDELATIDLPQDFDDLISMALRVEGRLRRRRQRP FT VFRPPWRMEDTSSAVPEAANATSLGSEPMQVGRLRLTPQQKQQRLIQGLCF FT YCGKPGHFAVACPLKAAARQ" FT CDS 1094..4297 FT /product="Gypsy96-I_DR_1p" FT /translation="MDSSLATLWGIPALPLSNPIPARSLNGTLITTVSHTT FT PPLHLIVSGNHHEVTTLLLLDSPSAPIVLGHPWLVQHGPHVDWSSNSVLSW FT SQHCLASCLGPTAFPVSVCPVLQVEAADLSGVPAEYLDLRRVFSKSRATSL FT PPHRPYDCAIELLPGTSPPKGRLFSLSGPEREAMDRYINESLKAGLIRPSS FT SPAGAGFFFVKKKDGSLRPCIDYRGLNDITIKNRYPLPLMSSAFELLQGAK FT VFTKLELRNAYHLIRIREVDEWKTAFNTPTGHFEYRVLPFGLTNAPAVLQA FT LVNDVLRDMVNRFVFVYLDDILIFSPSLKVHTQHVRQVLQRLLENQLYVKA FT EKCVFHVQSVSFLGFIISAGELQADPCKVKAVAEWPTPDSRKALQRFLGFA FT NFYRRFIRNFGQIAAPLTALTSPKVPFKWEVDAQEAFDKLKSRFVSAPVLS FT IPDPERQFIVEVDASDVGVGAVLSQRSRLDGKVHPCAFFSHRLNPSERNYD FT IGNRELLAVRLALGEWRHWLEGAAQPFLVWTDHKNLEYIRSARRLSSRQAR FT WALFFGRFNFTLSYRPGSKNIKPDALSRLFEAPGEREPPDTILPKEMVVAS FT LSWEIEKRVEEARRTSPGPGRVPAGCLFVPTKMRSEVIQWGHSSKLTCHPG FT VRRSLAAIRQRFWWPSMAKDVRQFVAACSVCARNKNSNAPPIGLLHPLPIP FT SRPWSHLALDFVTGLPESKGNNVILTVVDRFSKAAHFIPLPKLPSAKETAQ FT VVIDHVFRIHGLPVNVVSDRGPQFVSRFWKEFCRQIGASTNLSSGFHPQTN FT GQSERANQDLERTLRCLTSHNPGSWCQQLPWVEYAHNSLPSSATGMSPFQC FT SMGFQPPLFPSQEPEAAVPSALAFVQRCRRTWKRAREALAQANKRTKAAAD FT HHRTPAPRYRCGQRVWLSTKDLPLRVASRKLAPRFIGPYRITKVLSPVAIR FT LKLPPMLGRVHPTFHVSRVKPVLYSRLVPSAPIPPPPRLVDGSPVYTVRRL FT LDIRRRGRGFQYLVDWEGYGPEERSWVPAQDVLDPALVEDFRQRQGRPPHA FT APGGVRRGGGT" XX SQ Sequence 4296 BP; 865 A; 1217 C; 1110 G; 1104 T; 0 other; gaacaatcta gccagaaatg gacgcagcag gttcggatcc agtcattact gcagtcaccc 60 agcagggtgt gctgctaggg cagcatgagg caaggttgac caacaccacc agggagatgg 120 agtacttggc caaccaggta gctgatttaa ccgcacggat ccgtgagttg cagcacgaca 180 cagcccaggg ggggcaagcg gatcctggcc ctcagcatac acgtgaaccc cgatgcaaca 240 acccaccacc ttacgatggg gaccccaatt catgtagagc cttcctctct caatgctctg 300 tggtgtttgc cttacagcca cacacctacg ctgctgaaag aaccaaggtg gcatacgttc 360 tgacgctgct cgcaggcaag gcccgtgatt ggggaacatc agtctgggaa acccaagctc 420 cctgttgtgc atcgttcgaa gacttcagac aagagatggt gaggttgttc gatagatctg 480 ctcaggggca ggaggcagcg gaccagttgg ctcggttacg ccaggcaggc cggtcagtca 540 cagagtatgc catccagttt aagaccttgg cggcttcctg cgactggaat caaggggcct 600 gcatgtccat gtttcgagcg gggcttgaag aggaaatcca ggatgagttg gccaccatcg 660 accttcccca agattttgat gatctcatta gtatggccct acgggtagaa ggacgtctgc 720 gccgccgtcg tcaacgtcca gtcttccgcc caccctggag gatggaggac acttcctctg 780 ctgttcctga ggctgctaat gccacctcct tgggttcaga gcccatgcag gtggggcgtc 840 ttcggctgac tccccaacag aaacaacagc gccttatcca gggcctgtgc ttctactgcg 900 gcaagccagg gcactttgca gtggcctgtc cgttaaaagc cgcggctcgc cagtaaagag 960 ggggatcctg gtgagcgcta cccctttatt gaattccccc tcctctcgta ctgttctccc 1020 tgtatctgtc cagtttggtg actcttttaa ttctgccttg gccttggtcg attccggggc 1080 cgaagataat tttatggata gttctttagc taccttgtgg ggaatccctg cactccctct 1140 ttccaatcct atccctgctc gctctcttaa tggcaccctt atcaccactg tatcccatac 1200 cacccctccc ctccatctca ttgtctcggg caatcaccat gaggttacca cgctgcttct 1260 cctagattcg cctagtgccc ccatagtttt ggggcatcca tggctggtgc agcatggccc 1320 tcatgtagat tggtcgagta attcagtttt gtcttggagt cagcattgtc ttgcttcgtg 1380 tttgggtcct actgcttttc ctgtgtctgt gtgtcctgtc ttacaggtgg aggcagctga 1440 tctctctggg gtaccggcgg agtaccttga cctgcgccgg gtgttcagca agtctcgggc 1500 cacttccctg cctccgcatc gtccttatga ttgtgccata gaactcctcc caggcacttc 1560 tccgcccaag ggtcgtttat tttccctttc tggtccagag agagaggcca tggataggta 1620 tatcaatgaa tcactcaaag ccggtctcat ccgaccctct tcctcccccg cgggtgccgg 1680 gttcttcttt gtcaaaaaga aagatggctc cctgcgtcct tgtattgatt atcgagggct 1740 caatgacatc actattaaaa acaggtaccc cttacctctc atgtcttctg cctttgaact 1800 tttgcaggga gccaaggtct ttaccaaact tgaactccgg aatgcttacc atcttatccg 1860 catacgagag gtggatgagt ggaagaccgc ctttaacacg cctacgggac acttcgagta 1920 ccgggtctta ccgttcggtc tgaccaatgc cccggcggtc ttacaggcct tggtcaatga 1980 tgtgttgagg gacatggtta atcgttttgt ctttgtgtac cttgatgata tcttgatatt 2040 ctctccatct ttaaaggtac acacccaaca tgtccgccaa gtgctacaac ggttgctgga 2100 gaatcagctt tatgttaagg cggagaagtg cgttttccac gtccagtcgg tctcgttctt 2160 gggattcatt atttcggctg gagagctcca ggcagacccc tgtaaggtaa aggcagtcgc 2220 cgagtggcca actccagact ctcgtaaggc tctgcagcgg ttcctggggt tcgccaactt 2280 ttatcggcgc tttatcagga attttggtca gatcgctgca cccttaacag ccctcacctc 2340 tcccaaggta ccgttcaaat gggaagttga tgctcaggag gcctttgata aattaaagtc 2400 ccgttttgtc tctgctcctg ttctatctat tccagatcct gaacggcaat ttattgttga 2460 ggtggatgct tcggacgtcg gagtaggcgc agtcttatct cagcggtcac gcctggatgg 2520 aaaagtgcac ccttgtgcat tcttttccca tcgcctgaat ccttcagagc gaaattacga 2580 cataggtaat agggaattgt tggcagtcag gttggccctg ggggagtggc gtcactggtt 2640 agaaggcgca gctcagcctt tcctggtctg gacggatcac aaaaaccttg aatatatccg 2700 ttcagccagg aggctgagct cccgacaggc tcgctgggcc ctcttctttg ggcgatttaa 2760 ttttaccttg tcttacagac ctgggtccaa aaatattaag ccagatgctc tctcccgtct 2820 gtttgaggcc ccaggggaga gggaaccacc cgacaccatt ttacccaagg agatggtggt 2880 ggcatctctc tcttgggaaa ttgaaaaacg ggttgaggaa gccaggagga caagtccagg 2940 accggggagg gtcccagcgg gatgtctctt tgtccctacc aagatgcgct ctgaagtaat 3000 ccagtggggt cactcctcta agttgacctg tcacccagga gtcaggagat cactggctgc 3060 catccgtcag cgattctggt ggccatccat ggccaaggac gtcaggcagt tcgtggccgc 3120 ctgctctgtt tgtgctcgga acaagaattc caatgcgcct cccattggtc tgctccaccc 3180 ccttcccatc ccatcccgcc cctggtcaca cttggccctt gattttgtca ccggcctacc 3240 agaatctaaa ggcaacaacg tcattctcac ggtggtggac cgcttctcca aagcggccca 3300 tttcattccc cttcccaaac tcccctcagc caaggagact gctcaagtgg tgattgatca 3360 tgttttccgg attcatggtc ttccggtcaa tgtggtttct gataggggcc ctcagtttgt 3420 ctcccggttt tggaaggaat tctgtcgaca gatcggggcc tctacgaatc tgtcttcagg 3480 atttcatccc cagaccaatg ggcagtccga gcgggcaaac caggatttgg aacgtactct 3540 ccgctgcctg acgtcccaca atcctggctc ttggtgtcaa caattacctt gggtcgagta 3600 tgcccataat tctcttcctt catctgccac aggtatgtcc ccgtttcaat gctctatggg 3660 gttccaacct cctttgtttc cctctcagga acccgaagct gcagttccgt ctgcccttgc 3720 tttcgtccag cggtgtcgac gcacctggaa gagagccagg gaggccttgg cccaggctaa 3780 taaacggacc aaagcagcag ctgatcacca ccggactccc gctcctcgtt acagatgtgg 3840 tcaaagagtg tggctttcta ccaaggacct gcctctcagg gtagcctcac gcaagttggc 3900 tcccaggttc attggcccat accggatcac caaggtcctg agtccggtgg cgatacggct 3960 caagctccct cctatgcttg gtcgggtgca ccctaccttt catgtgtcaa gggttaaacc 4020 tgtattgtat tcacgtcttg ttccctctgc ccccattccc cctccccctc gtctagtgga 4080 tggctcccca gtttacactg tgcggagatt gctagacatc agacgcagag gtcggggctt 4140 tcaatattta gtggactggg agggttatgg tccggaggag agaagctggg tcccggctca 4200 ggacgttctg gatccggcgc tggtcgagga tttccgtcaa cggcaaggta ggccccccca 4260 tgcagcgcct ggtggcgttc gtagaggggg gggtac 4296 // ID BEL11-I_DR repbase; DNA; ZEB; 5999 BP. XX AC chr14; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL11-I_DR; KW BEL11-LTR_DR; BEL11_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5999 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1371-1371 (2008). XX DR Genome; chr14; Positions 41955505 41949507. XX CC Positions [5000-5584] - Integrase core CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 71..5947 FT /product="BEL11-I_DR_1p" FT /translation="MSSSEKHSLRTRSACTRSSRHSSKSAASAAALEARAK FT AEAALTRATYAQKEIEVKVKQAQVQAQLKVEEARLEATLNALQQEREAEAA FT AAEASVFEAAADIRGTDQLSDNQDPHSSHNSLQRTKEYVEDQQKYKESYMP FT EFAETTEQKLVASNLVDVQQPQPIEQDNSFQSLPARQSAIKIVTNAATPLI FT PTTQAGAIKKEGANTSLFKSAVPDALPPRHNASLTSPPHSHCNEHYNARSE FT VFELAKFLARRDLLTGGLSRFNDKPENYWAWKSSFCNAIEGLDLKPSEELD FT LLVQWLGPESTEHARRIRSVHISYPDVGLSMVWKRLEDCYGSAEAMEAALF FT NKLECFPKLSNKDHQRLRELGDLLLELKIAKAEGYLPGLTFLDTSRGISSI FT LEKLPFNLQEKWMLQGTRYKQEYGVSFPPFSFFCDFICGEARMRNDHSFKL FT TTQNVSLSRFEKYPKAIRAPVTVHKTEIDNATQRNEFNRTNDNLNRQCPIH FT KKPHPLRKCRGFREKTLLDRKAFLRENSICFKCCSSTTHQAKDCKAVIQCS FT ECNSDKHIAALHNGPAPWLEKKDTEPSADYGGEQEEFLSPVATSSCTKVCG FT DPNSRKSCSKICLVNVYPEGEPHNKRKVYAVIDDQSNRSLARSAFFEMFKV FT EDNTAPYMLRTCAGIAKTEGRRADGFIVESFDGKTSLALPSLIECNLIPNN FT RSEIPTPEAAQYHSHLRSVASKIPSLDPKAKILLLLGRDLIQVHKVEEQCN FT GPLHAPFAQRLALGWVIVGDICLNGAHKPATVDVFKTHILANGRPSYMSPC FT PNSMFVKDYFSLHNEPKPSLNPKQKTLSVSEECSIGESVFRCTKNDEKTAP FT SMDDLAFLKIMEDQFHQDESNSWVAPLPFRSPRQRLPNNKVQANNRLMLLT FT RKLRKHPEVKEHFVDFMGKIFENGHAELAPQLQENEECWYLPMFGVYHPKK FT PGQIRVVFDSSAQHQGVSLNDVLLTGPNANNSLLGVLLRFRKERVAVMADI FT QQMFHSFVVAKEHRNFLRFLWYANSDLNSEVREYRMRVHVFGNSPSPAVAI FT YGLRRAALTGEQDYGFEARHFVERNFYVDDGLTSFPTEEEAIKLLKDTQEM FT LAQSNLHLHKIASNKVEVMKAFPSEDLAKELKNLDLNTGLSSVQRSLGVSW FT DMSEDVFIFQVADQMRPYTRRGVLSTINSLFDPFGFAAPVVIQGRLLLRTL FT TTESCDWDAPLSSEKFKEWKIWRDSLKKLELVKIPRTYATISLHQAQRREM FT CVFCDASTNAIAAVAYSKITSISGNVEVGFIFGKVKLAPRPEISVPRLELC FT AAVLAVEIAEIIADEIDIKLDVVTFYSDSKVVLGYICNRSRKFHVYVHNRV FT QRIQRSTHPEQWKYVPTDQNPADQATRSVPAADLSHSMWLTGPRFLFNTTQ FT ETSHTGPFAMIHPDLDIEVRSQVISCATHTSKRDLDPKRFERFSSWHTLVR FT AIARLVHITHSFQKDNDKSTCRGWHICDKPCSVEDLERAKSILIHSVQAEC FT YSEELKCLKGGRGIPKQSPLCNLTPYVDELGLLRVGGRLSQASLDRDEVCP FT IILPGQCHITSLLIQHYHKQVEHQGRAFTEGALREAGFWIIGGKRRISSVI FT FQCVKCRRLRGKFLTQRMSDLPPERLSIDPPFSFVGVDIFGPWMVTSRRTR FT GGQANSKRWAVLFTCMSTRAIHIEVMESMDSSSFINALRRFFAVRGPAKQL FT RSDCGTNFVGACKELKMEAITDDRKVQEYLSNKSCIWIFNPPHSSHMGGSW FT ERMIGVSRRILESMLQGIALSKLSHEVLITFMAEVTAIVNNRPLVPVSTDP FT DAPLILTPATLLTQKSSAVLPPPGEFGEKDLFVRQWRQVQSLANTFWNRWR FT KEYLATLQNRRKWQNESSNLQVGDVVLLKDSHARRNDWPMGIVMETFPGRD FT GRVRKVEVKVVKDGVSKTFLRPISNVVLLLSPKTE" XX SQ Sequence 5999 BP; 1839 A; 1310 C; 1358 G; 1492 T; 0 other; gaatagtaaa aagacaaata ctgatcgaca cattaatgaa tagaagatca agttcagcat 60 tcatttcaga atgtcttcat ctgagaaaca tagtctaaga acaagatcgg catgtactcg 120 atcatcaaga cactcaagca agtctgcagc aagtgcagca gcattggagg cccgtgcgaa 180 ggcagaagct gcactaacta gagccacata tgcacaaaaa gagatagaag tgaaagtaaa 240 gcaagctcaa gtacaagctc agcttaaagt ggaggaggct cgtctagagg ctacactaaa 300 tgcactacaa caagaaagag aggcagaagc agctgcagct gaggcatcag tgtttgaggc 360 tgcagcagat ataaggggta cagatcaact ctctgataat caagacccac attcttcaca 420 caattctctg caaagaacta aagaatatgt tgaagaccag caaaaatata aagaaagtta 480 catgccagag tttgcagaaa ccactgaaca gaagctagta gcttctaatc tggttgatgt 540 tcaacagcct caacctatcg agcaagataa ttcatttcaa agtctgccag caagacagtc 600 agcaattaaa attgtcacta atgctgctac ccctcttatc cctaccactc aggctggagc 660 gatcaaaaaa gaaggtgcaa acacatcatt gtttaaatca gcagtgccgg atgcactacc 720 tccaagacac aacgcttcac tcacaagccc cccacactca cattgcaatg agcactataa 780 tgctcgctca gaagtgtttg agcttgcaaa gtttctagct cgacgtgatc ttctgacagg 840 tgggctttca agatttaatg acaagccaga gaactattgg gcatggaagt ccagcttttg 900 caatgctatt gaaggacttg atcttaaacc tagtgaggaa ctagacctac ttgttcagtg 960 gctgggtcca gaatcgactg aacatgcaag acgcattagg tcagttcata ttagttatcc 1020 tgatgttggg ctcagcatgg tctggaaacg gctagaggat tgctatggtt cagccgaagc 1080 tatggaggca gcgctcttta acaagcttga atgtttccct aagctgtcaa acaaggatca 1140 tcaacgtctc agagagctgg gagacctact ccttgaatta aagatagcca aagctgaagg 1200 ttatctgcca ggcctgacct tcttggatac ttcaagaggc atcagctcca tcctagagaa 1260 actgccattt aatttgcaag aaaaatggat gttgcaagga actcgttaca aacaggagta 1320 tggagttagt tttcctcctt tttcattctt ttgtgatttc atttgtggtg aggcccgcat 1380 gagaaatgat cacagcttta aactgactac acaaaatgtg tcattatcca gatttgaaaa 1440 atatccaaag gcaatcagag ctcctgttac tgtccacaaa acagaaattg acaatgccac 1500 tcaaagaaat gaatttaata ggacaaatga taacctcaac agacaatgtc ctattcataa 1560 aaagccacat cccctcagaa agtgtagagg tttcagggag aagaccttgc tggacaggaa 1620 agctttcctc agagagaact ccatttgctt taaatgctgt tcctcgacta cccaccaggc 1680 aaaggattgt aaggcagtaa tccagtgctc tgaatgcaac agtgacaagc acatagccgc 1740 cctacacaat ggtccagccc cctggttaga aaaaaaggat acagaacctt cagcagatta 1800 tggcggggag caggaagaat ttctttcccc agttgcaaca tcttcctgta caaaagtatg 1860 tggggatcca aatagcagaa agtcatgttc aaagatctgc ttggtaaatg tatatccaga 1920 aggtgagcct cataacaaaa ggaaggttta cgcagtcata gatgatcaga gtaataggtc 1980 tctggcgaga tcagcttttt ttgagatgtt taaagttgaa gacaacacag caccttacat 2040 gctcagaaca tgtgcgggca ttgcaaagac tgaaggacga agagctgatg gatttatagt 2100 agaatccttc gatggaaaga catctcttgc tcttccatca ctgatagaat gtaatctcat 2160 cccaaacaac agatcagaga ttcctactcc agaggcagca caataccata gtcatttaag 2220 atcggtagcc tcaaaaattc caagtctaga tccaaaggcc aaaattctgc ttctgctcgg 2280 gagagatctc attcaggtac ataaagtgga ggaacaatgt aatggaccac tacatgctcc 2340 ctttgcccag aggctggccc ttgggtgggt gatagttgga gatatctgcc taaacggagc 2400 ccacaaacct gctacagtgg atgtttttaa gactcacatc ttagcaaatg gacgccccag 2460 ctacatgagt ccctgtccaa acagcatgtt tgtgaaggac tatttcagtc tgcacaatga 2520 gccaaaacct tcattgaacc ccaagcagaa aaccttatca gtatctgaag aatgctccat 2580 tggagagtca gtttttcgct gcacaaagaa tgatgaaaaa acagctccct ccatggatga 2640 cttggcattc ttgaagatta tggaagatca attccatcag gatgaatcca atagctgggt 2700 tgcccctcta ccattccgca gccctcggca gcgcctcccg aacaataagg ttcaagccaa 2760 caatcgtctt atgttgctga cacgcaaact aagaaagcac ccagaggtaa aggagcactt 2820 tgtggatttt atgggcaaaa tctttgaaaa tggccatgca gagttggcac cacagttaca 2880 ggaaaatgaa gagtgctggt atctgcccat gtttggggtg taccatccta agaaacctgg 2940 acagataaga gtagtgttcg actccagtgc acaacatcaa ggagtgtcac taaatgatgt 3000 tctcctcact ggccctaatg caaacaatag ccttttaggg gtattactgc gattcaggaa 3060 agaacgagta gcagtgatgg ctgatatcca gcaaatgttt cactcatttg tggtagcaaa 3120 agaacatcgt aatttcctac gctttctttg gtatgccaac agtgacttaa acagtgaggt 3180 gcgagagtat cgcatgcgag tgcatgtttt tggtaacagc ccatctccag cagttgccat 3240 atatggactc agaagagctg ctctcacagg cgagcaagac tatggatttg aggcaaggca 3300 ctttgtggaa agaaactttt atgtggatga tggccttacc tccttcccca cagaagagga 3360 agccatcaaa ctgctcaaag acactcagga aatgctggcc cagtctaatc tgcatcttca 3420 caagattgct tctaataaag tggaagttat gaaagcattt ccttcagagg atcttgcaaa 3480 ggagttaaag aacctagatc tgaatactgg actctcatca gtccaaagaa gtcttggagt 3540 gagttgggac atgagtgaag atgtgtttat ctttcaagtt gcagatcaga tgagaccata 3600 cactagacgt ggggttctgt caacgataaa cagtttgttc gacccatttg ggtttgctgc 3660 acctgttgtc attcaaggca gacttctact cagaacacta accacagaat cttgtgactg 3720 ggatgcacct ctatcaagtg agaaattcaa ggaatggaag atttggagag actcgttaaa 3780 gaagcttgag cttgtcaaaa tccctcgcac atatgccacc atctctcttc atcaagcgca 3840 aagaagagaa atgtgtgtct tctgtgatgc gtccacaaat gccattgcag ctgtcgcata 3900 cagcaaaatt accagtataa gtggtaatgt agaagttgga ttcatatttg gcaaggttaa 3960 actagctccg cgaccagaga tctctgtccc tagacttgaa ctgtgtgcag ctgttctggc 4020 tgtggaaata gctgaaatta ttgcagacga aattgacatc aagctggatg ttgtgacatt 4080 ctattcagac agcaaggttg tcttaggtta catctgtaat agatccagga agtttcacgt 4140 gtatgtgcac aacagagtgc aaaggattca gaggtccacg caccctgagc agtggaaata 4200 cgtcccaacc gatcaaaacc ctgctgacca agccacaaga tctgttccag cagctgatct 4260 ttcccatagc atgtggttga ctggtcccag attcttgttt aatacaacac aggaaacatc 4320 tcacacaggt ccctttgcca tgatacatcc agatttggac attgaagtac gctcccaggt 4380 tatttcctgt gccactcaca cttcaaaaag agacttagat cccaaacgct ttgagagatt 4440 ctcaagctgg catacactgg ttcgggcaat agccagactt gtgcacataa ctcattcttt 4500 tcagaaggat aacgataaaa gcacttgtag aggctggcac atatgtgaca aaccctgctc 4560 agtagaagac cttgaaagag cgaagtccat cctcatccat agtgtacaag cagaatgcta 4620 ttcagaagag ctgaaatgcc ttaagggagg caggggcatt ccaaaacaaa gtcctctttg 4680 caatctaaca ccttatgtag acgaactcgg cctgctcagg gttggtggcc gcctctcgca 4740 agcgagcctt gatagagatg aagtgtgccc aataattctc cccggacagt gtcacatcac 4800 atctcttctc attcaacatt atcacaaaca agttgaacat cagggacggg cctttactga 4860 gggtgcctta agagaggctg gcttttggat cataggtggc aaaaggcgca tcagtagtgt 4920 gatttttcag tgtgtaaagt gccgcagact gcgaggaaag tttttaactc aaagaatgtc 4980 tgacttacct cctgaaaggc tcagcataga ccctccattc tcctttgtag gggtggatat 5040 ttttgggcct tggatggtca cctcacgtcg aactagagga ggacaagcaa actcaaaacg 5100 ctgggcagtc ttgttcacat gtatgagcac aagggctata cacatagaag tcatggaatc 5160 aatggatagc tccagcttta taaacgctct ccgccgtttc tttgctgttc gaggtccggc 5220 aaagcagttg aggtcagact gcggtaccaa ctttgttgga gcgtgcaagg agctaaagat 5280 ggaggcaatt acagatgaca ggaaggtaca agaatatctg agtaacaaga gctgcatttg 5340 gatatttaat cctccccact catcccacat ggggggaagc tgggagcgta tgatcggagt 5400 ctcaagacgc atccttgaat ctatgctaca aggaatcgca ctatcaaagt tatcccatga 5460 ggttttgatc acctttatgg cggaagtcac agctatagtt aacaaccgtc cactagtgcc 5520 tgtgtctacg gatccagatg ctccattaat tcttacccca gcaactctac tgactcagaa 5580 gtcttcagca gttcttcctc cgccaggaga atttggtgag aaggatctct ttgttcgtca 5640 gtggagacag gttcagagct tggccaatac attctggaat cgttggagga aagaatatct 5700 tgccacctta caaaaccgca gaaaatggca aaatgagagc tccaatctac aagtgggaga 5760 tgttgtgctg ttaaaagaca gccatgctag gaggaatgac tggcccatgg gaattgtgat 5820 ggaaacattt cctgggagag acgggagagt aagaaaggtt gaggtcaagg tggtaaaaga 5880 cggagtgtcc aagacgtttt tacgacctat atcgaatgtt gttttactgt tgtctccaaa 5940 gactgagtaa acataatatg aatatcattg atggcgtctt tcagacacca gacggggag 5999 // ID Gypsy87-LTR_DR repbase; DNA; ZEB; 294 BP. XX AC chr10; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy87-I_DR; KW Gypsy87-LTR_DR; Gypsy87_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-294 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1486-1486 (2008). XX DR Genome; chr10; Positions 24999642 24999935. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 294 BP; 99 A; 50 C; 76 G; 69 T; 0 other; tgtagggagt gaaatttgaa gtgaaactaa tgttctgtac ctttaaataa gtttggaaat 60 agtatcgcgg gatcagagag acagagagag aaggagacgg agagagagag aaaccgcacg 120 aagtaatctg taatggctga ctaatgatgt tcaccctgta tgtttaataa atcaagggtt 180 aatacctgcc gatcgagctc cgtgtgtact ccttgcaact aagccataga acccacaaga 240 cggcgtgtgg tgagaactgg aaagtttata tctaaagaac caacgcgcgt taca 294 // ID hAT-N40_DR repbase; DNA; ZEB; 1313 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N40_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1313 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1310-1310 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1313 BP; 380 A; 267 C; 290 G; 376 T; 0 other; tagggttggg cgatgttgac caatttggca tcgtacgatg tctaatgtga aacatcgcga 60 tggacgatgc catcgtcgtc gtaggcgggg ggtgggggtg gttgttaaat atcaattaat 120 tcataacgaa ttaattaatt gtagcctacc gttttcacta cctgacccac atggtctttg 180 ctttacccat aaccaaacca taaataaata aagttacaaa caaatgacca cctgtcaatc 240 actttttttc cgcgggactc tggcatgaat aggcacagtg atctgtgtcg ttctaatggc 300 ttttcccaca gctggtggag gagacgagag ttcggttgga gttcactgta atttgtattg 360 tgattgatga taacttcctt attttattat tattatgagg aaataataac aagtaaagat 420 gctagggctc gaccgttttg gtcgcatatg cgcccaaaat tttatctatg cgaccttaaa 480 atatatttgg aagcatttgg gcgagtgcat acattttcat tgttttcact gcaaaatcct 540 caccgcgtgc gcggatttag gagacattta tgcagaatgc atctttgcac ctaaaaacgc 600 caaacgcagc actcaggaac agattaaaac agttgttatg tgaacttgct gaagtaaaaa 660 agccacgcaa tgcttgcccc gcctttgtgc tcttcttatt ggcccactgc tctagaactg 720 aacgcgaatg attggttaat atcagctgtc aatcactcag tcaatgcctt ctgccttcag 780 atgacaggag ctacaaccgc gaaaatgtga agcgctgaag atgagagaat gaaaataaca 840 ctgacagatt tgttttagaa aacacctaca gttactaata atggcacatc ttattagtga 900 tcatgtgctg aatgttaatc caccatttac acttttccaa gagtggataa aagacttcca 960 gtggcttcaa gtccgctatg aaaatgacta aagccattca ctgtaaagtc tgtgggacgg 1020 ggtttgcagg taacagcgtt tgccactgaa tctacacatt ttaagcgtga gaggtgctgt 1080 ataatccttc atttaatcct cacgatgctg tcagccgtgc aagcggcagc tatatgtaac 1140 atttaacaca gctcatgaaa agaccgttat tgctattaag atgacatgca aataataagt 1200 tatatatcaa tataaatgtg gggcggggcg atggatcgcg atgccggctc agcatcgtga 1260 agtttgtcgg ccatcggcga tggacgatgg catcgtctat cggcacaacc cta 1313 // ID CR1-43_DR repbase; DNA; ZEB; 898 BP. XX AC . XX DT 07-APR-2009 (Rel. 14.04, Created) DT 07-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-43_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-898 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(4), 856-856 (2009). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 3..752 FT /product="CR1-43_DR_1p" FT /translation="SPLLFILYTNMCQSTFENHFILKYADDTVIVSLLQGD FT ENSHGPIVDHFVQWCEESNLKLNATKTKDMIIDFRRNHTAAEVTNIRGQKV FT ECVENYKYLGTIIDRSLNFEANCDAVCKKGHQRLYCLRKLSRFHIGPNMLT FT LFFYRAFIESILSFALVAWFGGLSLKNRNRLSQVVRWSSRLIGESQLSIES FT LYSRQLLRIAQNISSCSSHPLSEEFILLPSGRRFQMLFSRTKRFRNSFVPS FT SIVALNKL*" XX SQ Sequence 898 BP; 258 A; 158 C; 186 G; 296 T; 0 other; tatccccact tttatttatt ctgtacacaa acatgtgtca gagcacattt gaaaatcatt 60 tcattttaaa atatgcagac gacactgtga ttgtaagcct cctccaagga gatgaaaata 120 gtcatggtcc catagtagat cattttgttc agtggtgtga ggagtcaaat ctaaaactta 180 atgcaacaaa aactaaggat atgattattg attttagaag gaatcataca gcagctgagg 240 tgactaatat taggggtcaa aaagtagaat gtgtggagaa ctacaaatat ttgggtacca 300 ttattgaccg atctttaaac tttgaggcaa actgtgatgc tgtgtgcaag aaggggcatc 360 agcgcctata ttgtctcagg aaactatcac gttttcacat tggccccaat atgcttactc 420 ttttttttta tcgtgctttt attgagtcta ttctatcctt cgctctggtg gcatggtttg 480 gaggtctgtc tctcaaaaac aggaatagac tgagccaggt tgttaggtgg tctagcaggt 540 tgattggtga atcacagctg agtattgaat ccctgtatag taggcagctt ttacggattg 600 cccagaacat tagtagctgt agctctcatc ctctgtctga ggagtttatc ctgctccctt 660 cgggtcgaag gtttcagatg ctttttagta ggacaaaacg gttcaggaat agctttgtcc 720 catcatctat tgtcgctctg aataaattgt aatcttttat aaggtacttt tgtatgtgta 780 tatgtgggta tgtatatgtg tgatttaatt gtttgtatca aatgccaaac tttgctgcaa 840 aaagaatcta cctacgggta ctaataaagt caactaacta actaactaac taactcac 898 // ID LRS_DR repbase; DNA; ZEB; 9916 BP. XX AC . XX DT 01-FEB-2010 (Rel. 15.02, Created) DT 01-FEB-2010 (Rel. 15.02, Last updated, Version 2) XX DE Long repeat sequence - consensus. XX KW LRS_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9916 RA Tracey A.; RT "Unclassified repeat from zebrafish."; RL Repbase Reports 10(2), 241-241 (2010). XX DR [1] (Consensus) XX CC This is an unclassified repetitive element included for sequence CC assembly purposes. CC The consensus sequence was derived from sequence data generated CC by the Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 9916 BP; 2179 A; 3147 C; 2856 G; 1734 T; 0 other; gagaaagtaa aaaaaaaatc attttgaaaa cgaaatgcat tgattttatt tttttgctgt 60 ttgaaacaac taacactcat ttgggggatg gtgggtctct agcggggggg gtctttggaa 120 cgaccgctgg caaacgctat ggatgagctc accgggggcg ttggttggac cagggggaga 180 gcccaccaag ccatgcccgg atagagacca ctctctccgg catgacccgc gcctggaggg 240 gggggtagcg cgtccctggg tttcgctgac ctgccgtcga gttttcgata acagactcaa 300 atgaaggact tagtgcaatt ttcgatttcg cggttgaacg aaccaggggt ctaacgaggg 360 ccccgttgcc ccaaaacaag cgttcaaacc cgggcaaacc aaatgcctgc tgccctggtc 420 ctccgatcca ggcctctggc cagcaaagct acccaaaaca agcgttcaaa tccgggcaaa 480 ccaatgcgca ctgctgccct ggtccttcca aggacccagg cctcttggcc agcaaagcta 540 ccccaaaaca agcgttcaaa tccgggcaaa ccaagtgcgc actgctgccc tggtccttcc 600 aaggatccag gcctcttggc agcaaagcta ccccaaaaca agcgttcaaa tccgggcaaa 660 ccaagtggcg tgctgccctg gtcctccaag gatccaggcc tcttggccag caaagcaccc 720 caaaacaagc gttcaaatcc gggcaaacca agtgcgctgc tgccctggtc ctccaaggac 780 ccaggcctct tggccagcaa agctgcccca aaacaagcgt tcaaatccgg gcaaaccaag 840 tggcctgctg ccctggtcca ccaaggaccc aggcctcaat ggccagcaaa gctaccccaa 900 aacaagcgtt caaatccggg caaaccaagt gcgcgtgctg ccctggtccc caaggatcca 960 ggcctcttgg ccagcaaagc ccccaaaaca agcgttcaaa tccgggcaaa ccaagtgcgc 1020 ctgctgccct ggtccttcca aggatccagg cctcttggcc agcaaagctc cccaaaacaa 1080 gcgttcaaat ccgggcaaac caagtgcgcc tgctgccctg gtcctccaag gaccaggcct 1140 cttggccagc aaagctgccc caaaacaagc gttcaaatcc gggcaaacca agtgcgcact 1200 gctgccctgg tccttccaag gatccaggcc tcttggccag caaagctaac ccaaaaacaa 1260 gcgttcaaat ccgggcaaac caagtgcgcg ctgctgccct ggtccttcca aggacccagg 1320 cctcaatggc cagcaaagct accccaaaac aagcgttcaa atccgggcaa accaagtgcg 1380 cgctgctgcc ctggtcctac caaggaccca ggcctcaatg gccagcaaag ctgccccaaa 1440 acaagcgttc aaatccgggc aaaccaagtg cgcactgctg ccctggtcct tccaaggatc 1500 caggcctctt ggccagcaaa gctaccccaa aagaagcgtt caaatccggg caaaccaggt 1560 gcgcgtctgc cctggtcctt ccaaggatcc aggcctcttg gccagcaaag ctaacccaaa 1620 aacaagcgtt caaatccggg caaaccaagt gggcgctgct gccctggtcc ttccaaggac 1680 ccaggcctca atggccagca aagctacccc aaaacaagcg ttcaaatccg ggcaaaccaa 1740 gtgcgcactg ctgccctggt ccttccaagg atccaggcct cttggccagc aaagctaacc 1800 caaaaacaag cgttcaaatc cgggcaaacc aagtgcgcgc tgctgccctg gtcctaccaa 1860 ggacccaggc ctcttggcca gcaaagctac cccaaaacaa gcgttcaaac ccgggcaaac 1920 caagtgcgca ctgctgccct ggtccttcca aggatccagg cctcatggcc agcaaagcta 1980 acccaaaaac aagcgttcaa atccgggcaa accaagtgcg cactgctgcc ctggtccttc 2040 caaggatcca ggcctcttgg ccagcaaagc taacccaaaa acaagcgttc aaatccgggc 2100 aaaccaaatg cgcactgctg ccctggttct accaaggatc caggcctctt ggccagcaaa 2160 gctaacccaa aaacaagcgt tcaaatccgg gcaaggccag tggtctgaac accatgggag 2220 tcagggttcc atggcaggcc accgcgcgca aaggcctatg tgggtcacca acggtaagcg 2280 atgggtcaga aagagccagg gggggggggt ctttggtgat cagtgcttgc ccgggataag 2340 tgagggtgta ttgggcagag agcgcgtgac atgaagcctc ggaggccaat ggttcaccag 2400 cggcaccatt tacgcaccag ctgagaccta agtgcgctca ccgggggcgt tggttggacc 2460 agggggagag ctcaccaagc catgcccgga aagagaccac tctctccggc atgacccgcg 2520 cctggagggg gggtagcgcg tccctgggtt tcgctgacct gccgtcgagg tttcgataac 2580 agactcaaat gaaggactta gtgcaatttt cgatttcgcg gttgaacgaa ccaggggtct 2640 aacgagggcc ccgttgcccc aaaacaagcg ttcaaacccg ggcaaaccaa gtgcgcactg 2700 ctgccctggt ccttccaagg atccaggcct cttggccagc aaagctaccc caaaacaagc 2760 gttcaaaccc gggcaaacca agtgcgcact gctgccctgg tccttccaag gacccaggcc 2820 tcatggccag caaagctacc ccaaaacaag cgttcaaatc cgggcaaacc aagtgcgcac 2880 tgctgccctg gtccttccaa ggacccaggc ctcatggcca gcaaagctac cccaaaacaa 2940 gcgttcaaat ccgggcaaac caagtgcgcg ctgctgccct ggtccttcca aggacccagg 3000 cctcatggcc agcaaagcta ccccaaaaca agcgttcaaa cccgggcaaa ccaagtgcgc 3060 gctgctgccc tggtcctacc aaggacccag gcctcaatgg ccagcaaagc taccccaaaa 3120 caagcgttca aacccgggca aaccaagtgc gcgctgctgc cctggtcctt ccaaggaccc 3180 aggcctcaat ggccagcaaa gctaccccaa aacaagcgtt caaatccggg caaaccaagt 3240 gcgcactgct gccctggtcc ttccaaggat ccaggcctct tggccagcaa agctacccca 3300 aaacaagcgt tcaaacccgg gcaaaccaag tgcgcgctgc tgccctggtc cttccaagga 3360 tccaggcctc ttggccagca aagctacccc aaaacaagcg ttcaaatccg ggcaaaccaa 3420 gtgcgcctgc tgccctggtc cttccaagga tccaggcctc atggccagca aagctacccc 3480 aaaacaagcg ttcaaacccg ggcaaaccaa gtgcgcactg ctgccctggt ccttccaagg 3540 atccaggcct catggccagc aaagctaacc caaaacaagc gttcaaaccc gggcaaacca 3600 agtgcgcact gctgccctgg tccttccaag gatccaggcc tcttggccag caaagctaac 3660 ccaaaacaag cgttcaaatc cgggcaaacc aagtgcgcac tgctgccctg gtccttccaa 3720 ggatccaggc ctcatggcca gcaaagctga cccaaaacaa gcgttcaaat ccgggcaagg 3780 ccagtggtct gaacaccatg ggagttaggg ggtccacggg agtcaagggt ccatttctga 3840 gctgctaacc cgtctcccag cgtccatcac cccgcagcaa acataagacc ccatcgcaga 3900 cggaggctag aacgacgtgc ggctggtgcc ccatacacga gcccagggcc ggtggtgacc 3960 cattcacaaa cccaaagacc gattgggtca ccagcggtac gtcttggcca gttgggtcac 4020 cagcagcacg tctgtctgat ccatgcggtt tggaggagtt agtgtccccg ggcttaatag 4080 tggggtgccc gggcacggct ggtgattagg tgcaaatcta gggggtcctt tggaaccagc 4140 gcttgccgcc ggagagtgcg tgtacctcgg gcagagcgcg ctggatctca gaggccccag 4200 gggctgggct cgctccccaa gaagggtgtg tgtgcctcgg gcagagcgcg caggcacagg 4260 ccccccaccg ctggatctca gaggccccag gggctgggct cgctccccaa gaagggtgcg 4320 tgtgccccgg gcagagcgcg cggggcacag gcctccccaa cgctggatct cagaggcccc 4380 aggggctggg ctcgctcccc aagaagggtg tgtgtgcccc gggcagagcg cgcggggcac 4440 aggcctcccc aacgctggat ctcagaggcc ccaggggctg ggctcgctcc ccaagaaggg 4500 tgcgtgtgcc ccgggcagag cgcgcggggc acaggcctcc ccaacgctgg atctcagagg 4560 ccccaggggc tgggctcgct ccccaagaag ggtgtgtgtg cctcgggcag agcgcgcagg 4620 gcacaggcct ccccaacgct ggatctcaga ggccccaggc cagggagagg ggctgggctc 4680 gctccccaag aagggtgtgt gtgccccggg cagagcgcgc ggggcacagg cctccccaac 4740 gctggatctc agaggcccca ggccaggggc tgggctcgct ccccaagaag ggtgtgtgtg 4800 ccccgggcag agcgcgcggg gcacaggcct ccccaacgct ggatctcaga ggccccaggg 4860 gctgggctcg ctccccaaga agggtgtgtg taccccgggc agagcgcgcg gggcacaggc 4920 ctccccaacg ctggatctca gaggccccag gggctgggct cgctccccaa gaagggtgtg 4980 tgtaccccgg gcagagcgcg cagggcacag gcctccccaa cgctggatct cagaggcccc 5040 aggggctggg ctcgctcccc aagaagggtg tgtgtacccc gggcagagcg cgcggggcac 5100 aggcctcccc aacgctggat ctcagaggcc caggccagga ggggctgggc tcgctcccca 5160 agaagggtgt gtgtacctcg ggcagagcgc gcggggcaca ggcctcccca acgctggatc 5220 tcagaggccc catcaatggt tcaccagcgg ctccatcaat ggttcaccag cggcacatct 5280 ttacgcatct agcgccaaag gtccaacagg aatacgttat gcctggcgct caccggggcg 5340 ttggttagac ccaggcccga gcccaccaaa ccatgcccga cgggagacca cccttcccgg 5400 cctacttgtc gggacgtccc gcgcctggag gtttttgccc gacacccaca atctctgacc 5460 tgccatcagg gtttcgggga aggcgaagaa atatggactt agtctctgac agtccagtcg 5520 ccagacccca cggtgactga tcgataggag ctagtgtggc gccccagggg ctaggccccc 5580 acccaggccc cccaaagcac caatggggtc ggcctctatg ggtgtggcag atcctccccg 5640 tcctgggcgc tctatcaggg aggcgtgggg gtgtccctcc gcacacagcg cccccccttc 5700 ccgccccgat ctgtgggtaa aggacccaga tggcccgtct ggctaaatga cccagagctt 5760 gtctgctgct cagcccgccc gccagcccgc ccccctggac tccgtttccc ccgcgccgat 5820 actcggcccg tgcccccgcc ccacgcccct cgccctgcga gggctggggg atgttcggtg 5880 ggttcggtcg agtcacccgg cgtggtgaat cgaggcccca ggggcagcgt agggaacgag 5940 ctggtgtggg cttcgattct acccccgccc gagtgtgtgc gtctgcatcc cgtccgtccg 6000 cccgcaacgc cgcccacccc ctgggtttat ccttgctggc tcccggctga cccccacccc 6060 gcctaccctg ctcccggggg agtgagggtg gttggttggg aggggaagct agtggactcc 6120 ggcgaggtga aacgtgccag ggtcagtggt gccggggtta cgggctgtct ggacccccct 6180 ttcgatggtt ggtgcgtatc ccatgttggt tgtagcaggt cacaaaagct cccgcggctg 6240 gcgcgtgtgg agctgtcccc ccccccgatg gggctcaaac aatccccttt ttggggtgga 6300 aaaagtgaac gtgaatggct gtggaaaagc tcccgctggc cagaacagca aaatgaaggg 6360 gggataggga gtgaggtggg acaccccttc cccaactcga gtcgtgacaa gagagagagg 6420 tgtcgaaccg ctgaccgggg caacgatgaa ggggcggtgg ggtgcattgt ttaacgggac 6480 accccctctg gttcctttga aaagggggga agggagtgag gtggacaccc ctctcccccc 6540 tacttgagaa gtcgtaacaa aagaaggctt gaaccgaatg gatctccgta gggcatcggc 6600 ttacacctcc tggtgcgggg gggtgtcctt cccctttaag aaatacgggt ccaccaacgt 6660 catgacgcgg aagtggacgc cgcctccggc cgcttattaa ccgccgcacg tggcgtgcgg 6720 gcccctttcc ccctctgcat tggctctagc acggtcatgg ctgaatgtgc cctatgcttc 6780 aacgtctaca gccggcttgc gcctcacctg acggccgttc acaaggtggc caactcggac 6840 gagaagcggc tgctgctcgc gctggccgcc ggccgcgtgg acacgcggaa gacaccgtgc 6900 ccggtcccgg gctgtcgccg gacgccagcc cgcctagaca ggcacttgag gcagcacgcg 6960 gagctctcga cctcgggaag gaaagaggct atggggaggg cgaaacgccg gaaagtagct 7020 cgggagttgc agtggctgcg gtccacccag cctgcaatcc ctgtcgtgtc ccagctagca 7080 tcgtcctccg agggggagcg ggactcggag gacccagagg ccggccgccc gtgcgccgac 7140 cgcggctgca ggcgcgccac cgagcgcata caggctcagc tcacggacct ggggaaacag 7200 gttaccaaga tgcggtccac cctgctccag atcacacgcc tctaccggga gctgaggaag 7260 ggagaagggg ggagaaggag gaggaagagg gcgaggagaa ccaggtcgcc tcctgcacca 7320 ccggctcccg ggcgaggggc ggccggaggt ccgacgcccc ccgagcctcc ggaggctttg 7380 gacccactgc ctacccgttc ccggaccacg tccccgcgct gagttgagta tatttcaggc 7440 acgtcactgt cgctctgctg ggtgtgtggc ttcggggaag ttgactcttg gttgtgctcg 7500 ttccacagac cttctcttgg gggagttcga agggtaccaa ctgggcagcg agcccacccc 7560 ccgcctgcgg aacaacgtca cttcgaagct ggggagaatc aaagccttcc ttggttacat 7620 ggccaggggc accgcggagc caggggactt ccttttcctc aaccaaccag cccgaatccg 7680 agcgtgggcc gcccggctag gtcagacgcg catggccgag cccaccaggc agcactacct 7740 gaagaacgtg gctcagttcc tagactacct ctcggagacg ccgccggccg cctgtcagct 7800 ctccagcacg gctctggttc tgattcgaag ggaggtcaga gccctcatcc gcggcatacg 7860 ccggcgtgtc gtcgtgcacg aggtaaggac caagcaggcg aaggaaagcc gactgatccc 7920 caaggccagc ctggtgcgct gtcaccggac cgctgggagg aaaattcccg ccctgctagg 7980 taagcaagcc cttccgccgt tccagcgatt ccccctgttc gttcttccgg ggcaccaaca 8040 gcccttggtt catccagcca cgtctgtctg cccctctccc ccgcgaatgg gactcattcg 8100 gggggaaggg gggcagacgg ctggaacccc aacagctgct ggtggcccgg agaggaattg 8160 tgcgtttctt cttcaggggc accagcagtc catgaactgc tggtgcccct gaagaagaaa 8220 cgcacaattc ctctccgggc caccagcagc tgttgtttga gctgctggcg ccccggagaa 8280 caagtcacca gcagccaaga gctgttggtg acccagaaag gaaatagaac cgtcggtggt 8340 ctaatctgtc taatgtcctt caccagatag cctcgaatcc aacccaagca ctaggcaaca 8400 gtggcgcttc tatggctttc tgactggcta cctaacctcc atctctgggc accgctgtgg 8460 agtcttccag aatctcacaa tccaggaggt tgaagaggcc tccagaagcc ccgacgagtc 8520 tgcttatgtc attaacgtga gtatggcgca agtaaggtgg gggtcaccag tagcatgctc 8580 tctcactcac tcactcactc actgcttgtt ctctgatggc agattaccac tcacaaaaca 8640 aacagagcct ttggggcggc tcagctgtcc ctaaacaggg aggaatacag ctggttccgc 8700 aggtttttgg cgctgcgggc tggtctcccc ggagggagcc aggctaccta tttctttttc 8760 acttccagag ccagtccttg tcggaccctg aacaagtact ttcagtctgc ttggctcagt 8820 atgggccttc caggcaaacc cacctttact gacgtacgca ctgcgatcgc gactcatgtg 8880 agtaggcatt gtgacgcccc tgtaactacc agcggcggct tctatcctaa tcaagcttct 8940 gtctctctcc gacgcaggca aagaatgcac actcttcaga ggatcgccgc aaggtggcgc 9000 aattcatgtg ccatgacact tcaacctcag ataagttcta cgcacttcac ctcggacctc 9060 tccaagcacg cgagcgccgc agactctttg aaagggccct ggtggaggag gaggaggagg 9120 aggaggatgg ggaggcagcg ggcactgaaa gccccccgcg gaaagggcgc aagaggacag 9180 agacttccgt ctctcccctg gtaaaaatca cattctcttt ggcgtggaca gcagaacgtg 9240 tggcattggc taacttccct ttatgtgtct cttttccagg agggaaccag caggaggacg 9300 ctgccatggc gccctgccaa agggaaaagc cagggcgctc gcccgctcga acaaacagaa 9360 gactctgaat cgtcctgaat aaatattgta caacttaacg gtcaaatggt gttcagtgtc 9420 ttcattattc cttttattta ttatctgtgt ggtaaaagta aattcagtgg ggacaaaaat 9480 gagccctgtc ttcttagttc aacgctgtat gcctgctgcg tccaggtcct gccttggcta 9540 cgttccgatt gggacagatt agccctaact tgtgtgttaa aagtgtcacg tgggatgccc 9600 tcgtccaggt tctgccttgg ctacgttcca attgggacaa actatccctg ccttgagtgt 9660 taaaagtgac tcgcgggacg cccccgtcca ggttcagcct tggttgatag ctcataggga 9720 gaatgaatcc cttttccagc acaagggggt gatgggctgg acaaatctgt ccaggtctgc 9780 aaatgctttg aacgcccaga aggccttgag tccaattctg tatattagca agcttctccc 9840 gggacagctc ggtccttaga gatgcatgaa taggtcagga cagggcagct gtggagtgga 9900 ggttaacttt atggag 9916 // ID KibiDr2 repbase; DNA; ZEB; 5400 BP. XX AC . XX DT 19-FEB-2010 (Rel. 15.02, Created) DT 19-FEB-2010 (Rel. 15.02, Last updated, Version 2) XX DE Danio rerio retrotransposon KibiDr2 DNA, complete sequence. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; KibiDr2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5400 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol. Biol. Evol 21(2), 207-217 (2004). XX DR [1] (Consensus) XX FH Key Location/Qualifiers FT CDS 173..1339 FT /product="KibiDr2_1p" FT /translation="MASVQVQGASALSFRNGCRCVADPEVTVEDILQIIGE FT KVGFENIVSASRMNKAVVVFLKSESLINELTVSGIWVKEAFVPITPLSAPA FT TKVTISNVPPFVSNEAILKELQRFGKIASPVKGIPLGCKNAALKHVLSFRR FT QVYMFLNSPERTLEASFRIIHEESSFIIYASTESMRCYGCRDFGHKRFLCP FT QKNEQRASSSQAETSNADTLRVQGTDEMTGQQEVSELEVNITGSVEKPGCS FT TDSNADVSNAKCDESKAQSDTEVVCEGGTEQESGSKESFDDEMEGLSQCTD FT DGLREDDEQWSEVVKKANKDLYTLDQINAFLDKTKGKIGVEISDYFPDLDK FT FVSSVMSVRKKHNYDELSQQKRFRLRKHLTNIRSTGKIGKTRGKSK" FT CDS 1348..5148 FT /product="KibiDr2_2p" FT /note="apurinic endonuclease and reverse FT transcriptase." FT /translation="MFKLLVFLFVSLSLSLFYMEILRVGSLNTNGLRDEKK FT RNLLLESIRIKQLNVIFLQETHSNSNNEAEWGLCWKGEKVLSHGTNLSAGV FT AILFSSALKVQILMINEIELGRLVAVRAKINDLSFLFINVYAQNNGADRLK FT LFSNLECFLKQQQDGDFIIIGGDWNCTLDFKYDRNSEEPHMQSASRLAGIL FT KNQYFSDVWREQNASIKQYTWVKINDGRFSGARLDRFYVSNNVKNRVVGAN FT IIPSFLSDHKLITVECTLLNRTHKSYYWHFNNKLLQDKYFCETFKSFWEAW FT KNEKGLYENIIQWWEIGKVQIKVLCQQYAAQSSGVFKKTIELLEKEIEEIE FT KSMTVNDAGNLQELWTEKKNQLGSILNEKVKGALVRSRFSTIKDMDGPTSY FT FFNLEKKKGQEKLMFVLKDDDGQDTSDPVLMRRLAVRYYSNLFAAETTDEQ FT SRNELLQDIPSLTSTQKELLEVDFKFEEVTAAVMDLSSGTAPGIDGLSSEF FT FKTFWTLIGADYYEVLKECMKKKILPVSCQRAVLTLLPKKGDLTLLKNWRP FT VAVLCSEYKIIAKCLSNRLNNVLHEIIHKDQSYCIKGRSILDNLHLVRDVY FT DFAFKNNINLGFLSLDQEKAFDKVDHNYLFETLKAFGFGDSFISMVKVLYN FT KATCMVKMAGGLSVPIEIHKGIRQGCPLSGQLYSIVIEPLLCKLRRWLTGL FT SVPELDFQEPVKISAYADDLTVIVKDHSDVHFLREIIEQYGKASSARMNWS FT KSNALWCGQATNGPMLPGNLQWGRMGFKYLGVFLGTDDFIKQNWEGLVEKV FT CARLSRWRWLLPQLSYRGRVLICNNLVASSLWHKMMILEPPRDLVARIQKQ FT LIDFFWSGQHWLRASVLYLPIQEGGQGLVDIGSRVEAFRLQTAQRLLYGKD FT VCWAGIACALLKKAGNMGLDRHLFLMDTNKLDLTGLTAFYRSLLRTWSRFK FT VSRELGSIRGLWSKEEPLLFNSAMDLNIFKSDSFRKALWSAKITKIGHLVS FT GGQWINAEALALRLGMRSVRVAERLLIEIQENLPVNIRLTSEEDVSVFPEL FT ILSADTGEWQETEGGLLSFRIPQPSLFEDAGKKTLYTLCVKVVHFHMLESL FT KESKWQTLFEPDSSPKGCWRTLYKPPIEKRSGDLQWRIVHGIIATNRHRAH FT IDPQVMEGCPFCNMGETVFHLFLNCSRLEMFLYQLNGYCQMLGEVFTPSLF FT IYGPKYNRNKMHVHVLLNFIFGKAKLAIWLSRKRKLTGLGLTDVFWFLRNL FT LNPVLK" XX SQ Sequence 5400 BP; 1689 A; 798 C; 1229 G; 1684 T; 0 other; ggtgcatgct gggagtgtga ggtgtagctg agtgcgtgtt tgagagcgtt gagtgattgt 60 tttttctatc tcttttccga tcattctttt cttctctttt tcttttttct tttgtagtat 120 agacagtgta gagaaagttt tttcccgtac ggaagactta acaggcgccg gtatggcgtc 180 tgttcaagta caaggagcgt cagctctgtc atttcggaat gggtgcaggt gtgtggcaga 240 ccctgaagtg acagtggagg atatcctcca gataatagga gaaaaagtcg gatttgaaaa 300 catcgtctcg gcttcgcgga tgaacaaggc cgtggttgtg tttttaaaat ccgagtcact 360 aataaacgag ttgactgtga gcggaatatg ggtaaaagag gcgtttgttc cgataacccc 420 gctatccgct ccagcaacaa aagttactat ctcaaatgtt ccaccttttg taagtaacga 480 ggctatccta aaagaactac aacgttttgg taagattgct agccctgtta aggggattcc 540 actcggatgc aaaaatgccg cgctgaaaca tgttctgtct tttaggcggc aggtttacat 600 gtttctaaac tccccagagc gcacattgga agcgtccttt cgcatcattc atgaagagag 660 ctccttcatt atatatgcca gcacagaaag tatgaggtgc tatggatgca gagattttgg 720 gcacaaacgt tttttgtgcc ctcaaaaaaa tgaacaacgc gcttcttcat cccaagctga 780 gacgagtaac gcggacacat tgagagttca ggggacagac gaaatgacag gacagcaaga 840 ggtgagtgag ctggaagtta atattactgg cagtgtggag aaacctggct gtagtactga 900 tagtaatgca gatgtgagta atgctaagtg tgatgaatct aaagctcaga gtgatactga 960 agttgtgtgt gaaggtggta cagaacagga aagtggatca aaggaaagtt ttgatgatga 1020 aatggagggg ttgtcacagt gcactgatga tggtctgaga gaagatgatg aacaatggtc 1080 tgaggttgtg aagaaggcta ataaggatct atatacccta gatcaaatca atgcctttct 1140 tgataaaaca aaaggaaaaa ttggtgttga gataagtgat tattttcctg atttagacaa 1200 atttgtctct tcagttatgt ctgtcaggaa gaaacataat tatgatgagt tgtctcaaca 1260 gaagcgcttt cgtctgagga aacatctaac taatatacgg tcaactggaa agataggaaa 1320 aacaagggga aaatctaaat aattaggatg tttaaactac ttgttttcct ctttgtttcc 1380 ctctctctct ctcttttcta tatggaaatt ttgagggtgg gttctttaaa cacaaatggt 1440 ttgagggatg aaaagaagag gaacctattg ttagaatcca taaggattaa gcaactaaat 1500 gttatttttt tgcaagaaac ccatagcaat tctaataatg aagctgaatg gggtttgtgt 1560 tggaaagggg agaaagtgct tagtcatggg acgaatctta gcgctggtgt ggctatatta 1620 ttctcctctg ctctaaaagt tcaaatttta atgataaatg agatagagtt aggtagattg 1680 gtagctgtaa gagccaaaat taatgatctt tctttcctgt ttattaatgt ctatgctcaa 1740 aataatgggg cagatagatt aaagcttttt tctaatttag agtgtttttt aaaacaacag 1800 caagatgggg actttattat aataggaggt gattggaatt gcactttaga ttttaaatat 1860 gataggaata gtgaagaacc acatatgcaa tcagcttcac gtctggctgg aattttaaag 1920 aatcaatatt tttcagatgt gtggagggaa caaaatgctt caataaaaca gtacacttgg 1980 gtaaaaataa atgatggtag gttttctgga gcacgtttag atcggtttta tgtatcgaac 2040 aatgtaaaaa atagagtggt tggtgcaaat attattcctt catttctatc agatcacaaa 2100 ttaatcacag ttgagtgtac attgttaaac aggactcata agagctacta ttggcatttt 2160 aacaacaaac tattgcaaga taaatatttt tgtgaaacat ttaaatcttt ttgggaggct 2220 tggaagaatg agaaggggtt gtatgaaaat attattcaat ggtgggaaat tggtaaagtg 2280 caaataaagg ttctatgtca gcaatatgct gcccaatcct ctggggtttt taaaaaaaca 2340 atagagttat tggaaaagga aatagaggaa attgaaaaaa gtatgactgt taatgatgct 2400 ggtaacctac aagaattgtg gactgaaaaa aagaaccagt tgggttctat tttaaatgaa 2460 aaagttaaag gggcacttgt aagaagccgt ttctcaacca taaaagacat ggatggtcca 2520 acatcttact ttttcaactt ggagaaaaaa aaaggacaag aaaaattgat gtttgtttta 2580 aaagatgatg atggtcagga cacctcagat ccagttttaa tgcgtagact tgctgtacgt 2640 tattattcta atttatttgc tgctgaaacc actgacgaac aaagcagaaa tgaattgtta 2700 caggacattc cttctttaac ttcaacgcaa aaagaacttt tagaggttga ttttaagttt 2760 gaagaagtta ctgctgctgt tatggacctt tcttcaggaa cagcacctgg gatagatggg 2820 ttatcttctg aattttttaa aacattttgg actttaatag gagctgatta ctatgaagtg 2880 cttaaagaat gtatgaaaaa gaagattctt cctgtcagtt gtcaacgggc tgttttaact 2940 ttgttaccga aaaaaggaga tttaactctt ttaaaaaatt ggaggccagt agcagttttg 3000 tgttctgaat acaaaattat agcaaaatgt ttatcaaaca ggctaaataa tgtattgcat 3060 gaaatcatac acaaagatca atcatactgt ataaaaggaa gatcgatttt agataattta 3120 cacttagtac gggatgttta tgattttgct tttaaaaata atattaatct tggatttttg 3180 tcattagacc aggaaaaagc ttttgacaaa gttgatcaca actatctttt tgaaactcta 3240 aaagcttttg gttttggtga cagttttatt tccatggtta aagttcttta taataaggct 3300 acatgtatgg ttaaaatggc tggtggtctt agtgttccta ttgaaattca caagggtatc 3360 agacaaggtt gtcctctttc aggacaactc tatagtatag ttattgagcc actgttgtgt 3420 aaattaagaa gatggttaac tgggttaagt gttcctgaac tagatttcca agagcctgta 3480 aagatttctg cttatgctga tgatttaaca gttatagtaa aagatcatag tgatgttcat 3540 tttttaagag aaataattga acaatatggc aaagcatcat ctgcaagaat gaactggagt 3600 aaaagtaatg ctctatggtg tggtcaagct actaatggtc ctatgcttcc gggtaatttg 3660 cagtggggca gaatgggttt taaatatcta ggagtctttc taggcactga tgattttata 3720 aaacaaaatt gggaggggct tgtggagaaa gtgtgtgcga ggttgtctcg ctggagatgg 3780 ttgctacccc agctttccta tagggggaga gtattgatat gcaacaactt ggtggcctca 3840 tctctttggc ataaaatgat gattttagag ccccccaggg acctggtggc aaggatccaa 3900 aagcaactca tagacttttt ttggtcagga cagcactggc taagggcatc cgttttatac 3960 ttgcctatcc aggaaggcgg acaaggactt gtggatattg gatccagagt tgaagccttt 4020 agacttcaga cggctcagag gttgctctat gggaaggacg tgtgttgggc aggaattgcc 4080 tgtgcacttt taaagaaagc tggtaacatg ggcctagatc gtcacttatt tttaatggac 4140 actaacaagc tggatttaac tggtctaaca gctttttaca gatcactgct gagaacatgg 4200 tcacgattca aagtttcaag agaactcgga agtatacggg gcctctggtc gaaagaagaa 4260 ccactccttt ttaattctgc tatggacctg aacattttta aatctgattc tttcagaaaa 4320 gctttgtggt ctgcaaagat cactaaaatt ggacatttgg tttctggagg acaatggata 4380 aatgctgagg ccttagcctt aagattgggc atgagatcag tacgggtggc tgaaagactg 4440 ttgattgaaa tacaggagaa tttaccagtg aacattagat tgacttcaga ggaggacgtt 4500 tcagtatttc cagagttgat tctctcagcg gatacaggag aatggcagga gacagaagga 4560 ggacttctct cttttagaat acctcagcca agtctgttcg aagatgcagg gaaaaaaact 4620 ctgtatacac tttgtgtaaa agtagtgcat ttccacatgc ttgaaagcct gaaggaatca 4680 aagtggcaga ctctttttga accagattct tctcccaaag gttgctggag gaccctgtac 4740 aaacctccta ttgaaaaaag gtctggagac cttcagtgga ggatagtgca tggtataata 4800 gccacaaaca gacacagagc acacattgat ccacaggtga tggaggggtg tcctttttgc 4860 aatatgggag aaacagtgtt tcacttattt ttaaattgct ctagattaga aatgttcctt 4920 tatcaactga atggatactg tcaaatgtta ggagaagttt ttacacccag tctttttatt 4980 tatgggccta aatacaacag gaacaagatg catgtccatg ttttattaaa ttttattttt 5040 ggaaaagcaa aattggctat ttggttgtca cggaaaagga aactgacagg tctgggtttg 5100 actgatgttt tttggttttt aaggaactta ttaaatcccg tattaaaata gaatatgcgt 5160 attataagtt gattgatgat ttggacactt ttagatctaa atggtgtatt aatcaatgcc 5220 tctgtgaaat tggtcaaaat ggttgtttgg aaatacttgt ttgattgttg aatgttagcc 5280 agctcttata tatatatata tgtttatttt ctaatgtatg tatatgtata aggtttgtga 5340 tttttcatta atgtggtaaa aaaatggttg taaataaaga gattgttaaa gtcaaaaaag 5400 // ID hAT-N31_DR repbase; DNA; ZEB; 1462 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N31_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1462 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1301-1301 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1462 BP; 409 A; 294 C; 290 G; 468 T; 1 other; tagggttggg tatcgtttgg gtttttttcg ataccggtgc taaatcgata cttttaaaac 60 ggtaccggtg ccaaaacggt gcctgaaccg atacttttta gccacaaaat taattgacaa 120 aaaaaaatta tttatatata tatatatata tatatatata tatatatata tatatataaa 180 tatatattta taataagttt aaagtaaaca tcaattcata ttttatatat ttgaattgca 240 ttaatatatt tcttttgatc atttgtttgt tagcataaat gcaagatttg cattatgcta 300 ttaacattac attagcttac tactaacctg tggaaaggct gtcatcaaaa tcagggaaca 360 ccttttttct gggtttgggg cttgtgacta cttttacatg gacatcagtg atctaatgat 420 ttgccttaat ctgaataaga caataatatg attcaggtgt ttacatgagy tgctttttga 480 atgttacatt catgattccc agttacatgt tatagcacat agatccatta acgtcattgc 540 gtcaccagcc tataaatgtt tcctccgcag tttgtgtctg tggtccttta agataatcaa 600 aaatcactgt ttacgtggta gactcttgat cagagtattg tcttaatcat attaaaatca 660 gagtattggt gtccatgtaa acgtactcag ataatgtgcc agatgatgtc acaagctgtc 720 aaagacggtc cattcctcac ctttaagttg attccatgag ccctcacatt tcataagttt 780 gacgtgtttc tcccttacac gcaacttttg taaagcatct gctgcatgtt ggtgtgtcag 840 aatccttgct tgtaaaatat agccatactt tagaatgctt agtttaagca aattagatga 900 acgcgatcat gcgtgttgat gaagtgagtc gctcgccgca tgcgccgtac tgcgcgcttt 960 gccttctata agctacaata acaaacgcct gacatcgctg acatccgtat ccctcaaagc 1020 tgtttagaat taaacattta gagatggtgt gacacaacgc ttacttatgt gtgcctttaa 1080 tgcacccctt tatgcttctt aagggcgtca cacagcggat gcgccgctca gagccgcggc 1140 acggcgcgca catgacagtt gatagtatca cacaccagac gtgcacattc gcatgttatt 1200 taaaatgaaa ctattcagat ggcgctctgt ggcatggcag aaatatgaac agtcatgagt 1260 cgtggctggg cgccgcggac agccgctgac ttgcgaattc tatggcgctg tgcgtcgccg 1320 agcggcgctt ctggtgcgcg acctgcttta cgttcttgtt gcagcaccga aattaggcac 1380 cgaaattcat atgctgattt gatccggtgc ataccaaagg taacggttcc ataatggcac 1440 cgggtttcgg tacccaaccc ta 1462 // ID DIRS-2_DR repbase; DNA; ZEB; 5291 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 04-JUN-2010 (Rel. 15.07, Last updated, Version 6) XX DE DIRS-like LTR retrotransposon - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS1; DIRSDR1; KW reverse transcriptase RNase H; phage integrase; DIRS1_DR; KW DIRS-2_DR. XX NM DIRS-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5291 RA Jurka J.; RT "A family of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(9), 928-928 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 277..4899 FT /product="DIRS-2_DR_1p" FT /note="tyrosine recombinase at C-terminus." FT /translation="MAEKNFKRCVPPCPRFITAGDSHDLCLECLGEEHALA FT AFENADCGHCDVLSRKELRSRREFFNKAPVAHAPRGSGPARAEAERRLRSW FT GSQLDLADEMETDSVLSLSGSVRSNPPSGASEARSAVSSAPRGRAASSVSE FT ETEAPQQQIQRGSGNLPPQSVEYEELVEVISRAADRFEVIDWQPAREQQQL FT RLTGMLDERELPSRVDPPLRDLPFCPELHDAVSKSWKNPYSAWLMTPKTAI FT YSAVRGLGEKACTVMPMIEEDLARHRRSDLKSARVPPLLSRPLRVTSGLVS FT KAYMTAGQSVGCLHTMSVLQAYQADLIKECLDGGGATPEQLREALRASDLA FT LRATKEAASSLGRSMATLVATERHLWLTQAEVSETDRAILMDAPISSSGLF FT GDAVDRVAESLDRVKKRSSSLGDFLPPRSKSQGAVKRQPQPSTSSSYHEVQ FT RIKQSPDVLGVRLSRAPPGHREHHSALSLNQGPWRKGEATSPLMLVGGSVS FT PGVGVPQCLRALGAPPPLQGGQRTTRGQSREAGTPSTFFGSVETPADGVPV FT GPVHSRTWLQNTVLCTPTSLQRYCTHHSEARTGSGYGTGSTGLINKRRNRA FT CSPTRQRVRVLQPVFYSSQKGWGIASHIRSEKSKSVRRGPQVQDVNHQKRG FT VTNSVRGLVCDDRPERRILPYFHPSPTQEIPEVRLRGRSVPVSGSSIRPSS FT VTSNLYQNSRSGTSSTSYAGDTYPKLHRRLANSSSDSRYGSSASRCRSHPY FT QKVGVSVKHRKKCACSSQDDHFFRCAMGLHDDASTSVPAMNRFDSVNRTQS FT QTRPVHHCETLSEVVGSHGSSSQRDSVRSAVHETPAVVAQIQGVFPQGESF FT PHDQGLAALPSSLKYVENALVPVPGPSVGGCLSSRHAYDKCFSDGLGSNGL FT DPHDQGLAALPSSLKYVENALVPVPGPSVGGCLSSRHAYDRCFSNGLGSNP FT EGASRSGTMGRTSSLLAHKLPGDDGRVSGLKTLSPRSKGPSCFSLHEQHIG FT GRLHQPAGGSEVSNAMQTSTSDPPVGPEQNPVHQGNVCPGPSEHGSRSPVE FT AGGEIQGMETSSPRGGGVHLGKIRESAGRPVCFPRDHALRTMVFSLASSPS FT GTGCHGSDIAEATSVCFSPDRSAPRSPGEGPSRLSTVTAGSPGLAYQDLVF FT GPHSPAGGSLVGDPHQQGPSVPGGRNDTSSPTRPVETVGVASEGAHLIEYG FT LSTEVAQTILSSRAPSTRKLYALKWALFSAWCREHQLNPVSCQVASVLEFL FT QDRLSAGLAASTLRVYVSAIAAYRSPLDDESLGQDPLIRRFLRGAIRLRPV FT STHRVPTWDLTLVLEGISVPPFEPLQEASDKFLTLKTAFLLAISSLKRVGD FT LQALSVAPSFLEFAPGMSKAFLYPRPGYVPKVPTHVARPAVLQAFHPPPFQ FT SSDQEKLNLLCPVRALNTYVNRVINWRKSEQLLVCFGPSKRGSPANKQTIS FT NWIVETISFTYQAAGRPAPKFVKAHSTRAVGASKASISGSALSDICLAAGW FT STPHTFVRHYQLDVDPSPGSSILTA" FT CDS 1904..2647 FT /product="DIRS-2_DR_2p" FT /note="reverse transcriptase." FT /translation="MVSQWVLFTVEHGYKIQFCARPPRFNGIAPTIVKPEQ FT ALVMEQEVLALLIKGAIERVLPLDRESGFYSRYFIVPKKDGGLRPILDLRN FT LNRSVGALRFRMLTIKNVVSQIQSEDWFVTIDLKDAYFHISILPQHRKYLR FT FACGGEAFQYRVLPFGLALSPRTFTKIVEAALAPLRMQGIRILNYIDDWLI FT LAQTHDMAVRHRDVVLTHIRRLGFRLNTARSVLVPARTTISLGVLWDSMTM FT RARLSPP" FT CDS 2996..3625 FT /product="DIRS-2_DR_3p" FT /note="ribonuclease H." FT /translation="MIKVSRRCLRALSMWKMPWFLSQGLVLGAVCHRVMLM FT TDASLTGWGATLRGLPAAGRWGEHHRYWHINCLEMMAVFLALKHFLPDLRG FT HHVLVCTNNTLVVAYINQQGGLKSRMLCKLAHRILLWAQNKILSIRAMYVP FT GHLNMGADLLSRQGVRSREWKLLHPEVVESIWERLGKAQVDLFASQETTHC FT VLWFSLSHPAPLGLVAMVQT" XX SQ Sequence 5291 BP; 1230 A; 1357 C; 1380 G; 1324 T; 0 other; ttccccttct agggaacttc aacactgcgt ctaaccagaa cgctagggga acacctcttt 60 tatacgcgtc ttgaagcaca tgtgaaatca atctaatgta attaagcagg tgtcgtcaga 120 ccagagagta taaaagcctg tactgagcat tcagtatcaa cttctttgct ttcaagaagc 180 acgcacgtga aaatacaccc tctttctgtg aactttcatt actgatttgc atacacaaaa 240 tacaaaaaaa ctgacaactt acttttttat tttggtatgg cagagaaaaa ctttaaacgt 300 tgtgtgcctc catgccctcg ctttattacg gctggtgact cacatgattt gtgtttagag 360 tgtttgggag aagagcatgc cctggcagca tttgagaatg ctgactgtgg acactgtgac 420 gttctctccc gtaaagagct gcgtagtcgg agagagttct ttaataaagc tcccgtggcg 480 cacgctcctc gcggttcggg tcccgctcgt gctgaggctg agcgtcgact tcggtcgtgg 540 ggttcgcagc tagatctggc ggatgagatg gagacggact ctgtcctttc tctctctgga 600 tccgtgagat ctaatcctcc ttcgggagcg tcagaagcac gctctgcggt ttcttctgcg 660 cctcgtggga gggcggcgtc ctccgtttcc gaggaaaccg aggcgccgca gcaacaaata 720 caaagagggt cggggaatct gccgccccag tcagtggaat atgaggagtt agtggaggtg 780 atttcacgtg ctgctgacag gttcgaagta atagattggc agccagcacg tgagcagcag 840 cagctgcgtc tgacaggaat gctggatgag agagaattac ccagcagagt agatcctcca 900 ctaagggacc tccccttttg tcccgagcta catgatgcgg tttctaaatc atggaaaaat 960 ccgtattcag catggttaat gacaccaaaa acagctattt attcagcagt tcgtgggcta 1020 ggggaaaagg catgtacagt aatgccaatg atagaagagg acttagcacg tcatcgtcgt 1080 tcagatctaa aatctgcaag ggtccctcct ttgttgtcga gaccattaag agtaacatcg 1140 ggtctagtca gtaaagcata tatgacggct ggtcagtctg ttggatgcct gcacaccatg 1200 tcagtgctgc aggcatatca ggctgaccta attaaagagt gcctagatgg tgggggagca 1260 acacccgaac agcttcgaga agctcttcgg gcgtcagatc tagctttaag agctactaaa 1320 gaggcagcct ctagtttggg gcgatctatg gctaccctgg tggctactga gcggcacctc 1380 tggctgacac aagcagaagt gtcagaaacc gatagagcta ttcttatgga cgctccaata 1440 tcgagctcag ggctcttcgg cgacgccgtc gatcgcgtcg ccgaatccct cgacagagtt 1500 aaaaaacgct ctagctccct cggggacttt ctccccccaa gatcaaaaag tcagggggct 1560 gttaaaagac agccccagcc gtcaaccagc tcctcatatc atgaagtaca aaggataaaa 1620 caaagtcctg acgtgttggg agtcaggctt tccagggccc cccctggaca ccgagagcac 1680 cattcagcgc tctcactgaa ccagggtccg tggagaaagg gagaggccac ctcaccactt 1740 atgttggtgg ggggctctgt gtctcccggg gtgggcgttc ctcagtgtct gagggcattg 1800 ggggccccac cccctctgca ggggggtcaa agaacaacca gaggccagtc tcgagaggct 1860 ggtaccccta gcacattttt tggcagcgtg gaaacacctg ccgatggtgt cccagtgggt 1920 cctgttcaca gtagaacatg gctacaaaat acagttttgt gcacgcccac ctcgcttcaa 1980 cggtattgca cccaccatag tgaagccaga acaggctctg gttatggaac aggaagtact 2040 ggccttatta ataaaaggcg caatagagcg tgttctccca ctcgacagag agtcagggtt 2100 ttacagccgg tattttatag ttcccaaaaa ggatggggga ttgcgtccca tattagatct 2160 gagaaatcta aatcggtccg tcggggccct caggttcagg atgttaacca tcaaaaacgt 2220 ggtgtcacaa attcagtccg aggactggtt tgtgacgata gacctgaaag acgcatactt 2280 ccatatttcc atccttcccc aacacaggaa atacctgagg ttcgcttgcg ggggcgaagc 2340 gttccagtat cgggttcttc cattcggcct agctctgtca cctcgaacct ttaccaaaat 2400 agtcgaagcg gcactagctc cacttcgtat gcaggggata cgtatcctaa actacataga 2460 cgattggcta attctagctc agactcacga tatggcagtt cggcatcgag atgtcgttct 2520 cacccatatc agaaggttgg ggtttcggtt aaacaccgca agaagtgtgc ttgttccagc 2580 caggacgacc atttctttag gtgtgctatg ggactccatg acgatgcgag cacgtctgtc 2640 cccgccatga atcgcttcga ttcagtcaac cgtacacaga gtcaaactag gccagttcat 2700 cactgtgaaa cactttcaga ggttgttggg tctcatggca gcagcagcca gcgtgattcc 2760 gttcggtctg ctgtacatga gacccctgca gtggtggctc aaatccaggg ggttttccct 2820 caaggggaat cctttccgca tgatcaaggt ctcgcggcgc tgccttcgag ccttaagtat 2880 gtggaaaatg ccctggttcc tgtcccaggg cccagtgttg ggggctgtct gtcatcgcgt 2940 catgcctatg acaaatgctt ctctgacggg ctagggagca acgggcttga tccgcatgat 3000 caaggtctcg cggcgctgcc ttcgagcctt aagtatgtgg aaaatgccct ggttcctgtc 3060 ccagggccta gtgttggggg ctgtctgtca tcgcgtcatg cttatgacag atgcttctct 3120 aacgggctgg ggagcaaccc tgagggggct tcccgcagcg ggacgatggg gagaacatca 3180 tcgttactgg cacataaact gcctggagat gatggccgtg tttctggcct taaaacactt 3240 tctcccagat ctaaggggcc atcatgtttt agtctgcacg aacaacacat tggtggtcgc 3300 ttacatcaac cagcaggggg gtctgaagtc tcgaatgcta tgcaaactag cacatcggat 3360 cctcctgtgg gcccagaaca aaatcctgtc catcagggca atgtatgtcc cgggccatct 3420 gaacatggga gcagatctcc tgtcgaggca gggggtgaga tccagggaat ggaaacttct 3480 tcaccccgag gtggtggagt ccatttggga aagattaggg aaagcgcagg tagacctgtt 3540 tgcttcccaa gagaccacgc attgcgtact atggttttct ctctcgcatc cagcccctct 3600 gggactggtt gccatggttc agacatagcc gaggctacgt ctgtatgctt ttcccccgat 3660 cgctctgctc ccaggagtcc tggagagggt ccgtcaagac tgagtacagt tactgctggt 3720 agccccggtt tggcctacca ggatttggtt ttcggacctc atagccctgc tggcgggtct 3780 ctcgtgggag atccccatca gcagggacct tctgtcccag gcgggaggaa tgatacttca 3840 tcccctaccc gacctgtgga aactgtgggt gtggcctctg agggggccca cctcatagag 3900 tatggactgt caaccgaggt tgctcagacc attctaagct ccagggctcc ctccacaagg 3960 aagctttatg ccctaaaatg ggctctcttt tcagcttggt gcagagaaca ccagctgaac 4020 ccagtcagct gccaggtagc ctcagtgctg gaatttctcc aagatcgcct gtctgctggg 4080 ttagctgcat ccactctgag agtgtacgtg tcagctatag cggcctaccg ttctccccta 4140 gatgatgagt cactaggaca ggatccgcta attcgtcgct tccttcgtgg agccataagg 4200 ctaaggcctg tcagcacaca cagggtaccg acatgggatt taacattggt gctcgagggc 4260 atctctgttc ccccatttga gccactgcag gaggcgtcag ataagtttct gacactaaaa 4320 acagctttct tattagctat ttcttcctta aaaagggttg gtgacctcca ggctttgtcg 4380 gttgcacctt catttctgga gtttgctcca ggcatgtcca aagcctttct ttatcccaga 4440 ccggggtacg tgcctaaggt gcccactcat gtggcgagac ctgctgtgct acaggccttt 4500 cacccgcccc catttcagtc gtcggaccaa gagaagttaa acttactctg cccagttaga 4560 gctctgaata catatgttaa ccgggttatc aactggagaa agagtgaaca gttactggtc 4620 tgcttcggac cctcaaaaag ggggagtccg gcaaataagc agacaataag taattggata 4680 gttgagacta tctcatttac ctatcaggct gctggacgcc ctgcacctaa atttgttaag 4740 gcccactcca caagggctgt cggggcctcc aaagcttcta tttcgggctc agccctttct 4800 gacatttgtt tggcggcagg atggtcgact ccacatacat ttgtgcgtca ctatcaactc 4860 gatgtagacc cctcaccagg gtcctctatt ctcactgcgt agtgtgcgtt cacagtcagc 4920 agtgagtctg gcctagtggg tattgcgttc ccctagcgtt ctggttagac gcagtgttga 4980 agttccctag aaggggaacg tctcgggtta cgtatgtaac catagttccc cgagagggaa 5040 cgagacactg cgtattccgc catactctct tctgcctgtt acttctttca agcaaattcg 5100 aagttgatac tgaatgctca gtacaggctt ttatactctc tggtctgacg acacctgctt 5160 aattacatta gattgatttc acatgtgctt caagacgcgt ataaaagagg tgttccccta 5220 gcgttctggt tagacgcagt gtctcgttcc ctctcgggga actatggtta catacgtaac 5280 ccgagacgtt t 5291 // ID Gypsy107-I_Dr repbase; DNA; ZEB; 4297 BP. XX AC . XX DT 28-APR-2009 (Rel. 14.05, Created) DT 28-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy-107_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; endogenous retrovirus; KW gag; reverse transcriptase; integrase; Gypsy-107_DR; KW Gypsy-107-LTR_DR; Gypsy-107-I_DR; Gypsy107-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4297 RA Dib M.R. and Naveira H.F.; RT "Gypsy107_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 948-948 (2009). XX DR [1] (Consensus) XX CC Gypsy107-I_DR is an internal portion of the Gypsy107_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy107-LTR_DR. CC Gypsy107_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 312-aa gag CC Gypsy107_DR1p (pos. 19-952) and 1077-aa polyprotein (pos. CC 886-4116, conceptual translation) composed of the protease, CC reverse transcriptase, and integrase domains. Some insertions CC fairly recent, according to the hight identity between their CC flanking LTRs. Consensus obtained after the alignment of at CC least five independent insertions bearing at least 85% Homology CC over at least 1000bp. XX FH Key Location/Qualifiers FT CDS 17..952 FT /product="Gypsy107-I_Dr_1p" FT /note="Gag-protein." FT /translation="MDPADEDTVRAALAQQGALLGRHESQLTSSNRVLEMM FT GSQIEDLTAQLLQLRTEQMSAHSTLLPVHRSPEPRANPPPHYDGDPSVCRA FT FLTQCSVVFSLQPATFSSEEAKVAYVITLLTGRAREWGTTVWQAKFPYCFN FT FNDFEKEMIKVFDRSVHGREAARLLVSLRQGDRSVSDYSIQFRTLAASCGW FT NAAAQWDQFLHGLRDEIQDEIAAQELPTEFDGLVDLAIRVDNRLALRQRNR FT MARFAWKQGGSQDPLSPTSLDTFQESEPMQIGRTRLSAEERKRRITNRLCL FT YCGGSGHFASTCPVKGNARQ" FT CDS 886..4116 FT /product="Gypsy107-I_Dr_2p" FT /note="Polyprotein." FT /translation="MPLLWRIWPFCFNMSGKRQCSPVARGVLVSVTPVNDP FT PLARTLLPATVQWGNSYHPCAVLVDSGAEGNFMDSALAAKWGLPTTPLQTP FT LTARALNGAHLALISLTTVPVSLIISGNHREEIVFYLLESPCNSIVLGHPW FT LIKHNPHIEWAKNSVLSWSPFCLKHCLGAALSPGSVLSVSQVEAADLSGVP FT VIYHDLRLVFSKSRAISLPPHRPYDCAINLLPGTSPPRGRLYSLSGPERVA FT MDKYISDSLAAGLIRSSSSPAGAGFFFVEKKDGSLRPCIDYRGLNDITIKN FT RYPLPLMSSAFELLQGAGFFTKLDLRNAYHLVRIREGDEWKTAFNTPTGHF FT EYRVMPFGLTNAPAVFQTLVNDVLRDMVNKFVFVYLDDILIFSHSLQEHIQ FT HVRQVLQRLLENQLYIKAEKCLFHTRSVPFLGFIVSAEGIRVDPAKVRAVS FT NWPTPDSRKALQRFLGFSNFYRRFVRNYSQIAASLTALTSTKTPFQWSSQA FT QAAFERLKTCFSSAPVLSFPDPERQFIVEVDASEVGVGAVLSQRSLADGKV FT HPCAYFSHRLSPAERNYDIGNRELLAVKLALDEWRHWLEGTSEPFLVWTDH FT KNLEYVRSAKRLNSRQARWALFFGRFNFILSYRPGSKNVKPDALSRQFDGP FT GDSLTPKPIIHPGRVVAAVVWDIESQVREALQGAKVPEGCPANLLFVPKLL FT RSSVLQWSHSSKLTCHPGVRRTLASIRQRFWWPSMAKEVSLFVAACSVCAQ FT NKSSNKPPVGLLQPLPIPTRPWSHISLDFITGLPLSNGNSVVLTVVDRFSK FT AVHFIPLAKLPSAKETALVVVDHVFRIHGLPVDVVSDRGPQFVSNFWREFC FT RQIGASTSLSSGFHPQTNGQSERANQALEQMLRCLVSHNPSSWSQQLTWAE FT YAHNSLPNSSTGLSPFQCCMGYQPPLFSSQEPEAAVPSVQALIQRCRRTWQ FT RAREALLQAERRVRTSANRRRTPAPRYICGQRVWLSTKDLPLKVPARKLAP FT RFIGPYPIVKVLSPAAVRLRLPTSLRRVYPVFHVSRIKPVFRSSLNPSVPI FT SPPLPPVSLRALQRSQLGDC" XX SQ Sequence 4297 BP; 879 A; 1177 C; 1055 G; 1186 T; 0 other; gaacaatctg gccaatatgg acccagcaga tgaagatact gtgagagcag ccctggccca 60 gcagggagct ctattaggtc gccatgaaag tcagctcacc tcatctaacc gggtcttgga 120 aatgatgggc tcccagatcg aagacctaac cgctcagctt ctgcagctcc gtactgagca 180 gatgtcagcc cactcaactc tccttcctgt ccatcgttcc ccagagccca gagcgaaccc 240 cccacctcac tatgatggag acccttctgt ctgtcgagcc ttcctgaccc agtgctcagt 300 ggtattctcc ctccaaccag ccacattttc ttctgaagaa gccaaggtag cctacgtgat 360 cacccttctg actggccggg ctcgtgagtg gggaaccaca gtatggcaag ccaaatttcc 420 ttactgtttt aactttaatg attttgaaaa agagatgatc aaagtttttg accgctctgt 480 tcatgggcgt gaggcagcaa gacttttggt ttctcttcga cagggggatc gctctgtttc 540 tgattattcc attcagtttc gaactctggc agcctcatgt ggatggaatg cagcagcaca 600 atgggaccaa tttttgcatg gactcagaga tgaaatccaa gacgagatag cagctcaaga 660 gctccctact gaatttgatg gtctggtgga cctggccatc agggtggata atcgtttggc 720 actacgtcag cgcaaccgta tggctcgttt tgcttggaaa caaggagggt cccaagatcc 780 tctttccccc accagtcttg atactttcca agagtctgag cccatgcaga ttggccgtac 840 ccgactctca gcggaggaga ggaagcgccg cataaccaat aggctatgcc tttactgtgg 900 cggatctggc cattttgctt caacatgtcc ggtaaaaggc aatgctcgcc agtagccagg 960 ggggtattgg tgagcgtaac tcctgtcaat gatcccccct tagcacgcac tctcctacct 1020 gccactgttc agtggggtaa ctcctatcac ccatgtgctg tcttggtcga ctctggagca 1080 gaggggaatt tcatggactc tgctctggct gccaaatggg ggcttcctac aactcctcta 1140 cagactccct tgactgctcg agccctaaat ggagcacatc ttgccctcat ttctctcact 1200 actgtcccag taagtctcat tatttctggc aaccaccgtg aggagattgt gttttatctg 1260 ctagagtccc cctgtaattc cattgtgtta ggacacccct ggctgataaa acacaatcct 1320 catattgaat gggccaaaaa ttctgtcctg tcctggagtc ctttttgtct taaacattgt 1380 ttgggtgctg ccttgtctcc tggctctgtg ctttctgttt ctcaggtgga ggcagctgac 1440 ctgtcgggtg ttccagtgat ctaccatgat cttcgtctgg tctttagcaa atcccgggcc 1500 atatctctcc caccgcaccg accttatgat tgtgccatca atctgctccc aggaacttct 1560 cctcctcggg gcaggttgta ctccctctct ggtcctgagc gtgtagctat ggacaagtac 1620 atcagcgact ctctggcagc cggcctcatc cgatcctcct cctccccagc gggtgcgggg 1680 tttttcttcg tggagaagaa ggatgggtcc ctgcgaccct gcatcgatta tcgagggctg 1740 aatgatatta caattaaaaa caggtacccc ctacctttaa tgtcttcagc ctttgagttg 1800 ttgcagggtg caggattctt cacgaagtta gacctccgca acgcttacca cctggtccgc 1860 atacgtgagg gggacgagtg gaagacagcc ttcaacactc ccactgggca ctttgagtac 1920 cgggtaatgc cctttgggtt aaccaatgcc ccggctgtct tccagaccct ggtcaatgat 1980 gtgctgagag atatggtgaa caagtttgta tttgtatatc tagacgatat tttgatcttt 2040 tcccattctc tccaggagca cattcagcac gtcagacagg tgttgcaacg gctgctggag 2100 aatcagctgt acatcaaggc tgagaagtgc ttgttccata ctaggtcggt tccgttcctg 2160 gggttcattg tctctgcaga gggtatccgt gtagatccag ccaaggtaag ggccgtctcc 2220 aactggccaa ctcctgattc tcgcaaggct ctgcagcgtt tcttaggctt ttcaaatttt 2280 tatcgtcgtt ttgtccgcaa ctatagtcaa attgctgcat ccttgactgc actcacctcc 2340 acaaaaactc cctttcagtg gtcttctcag gctcaggctg cttttgaaag gcttaagaca 2400 tgcttctctt ctgctcctgt tctttctttc ccagatcctg agaggcaatt tattgtggag 2460 gtggatgcct cggaggttgg ggttggggcg gtcctttcac agaggtccct agcagatggc 2520 aaggtacacc cttgtgccta cttctcccat cgtctatccc cagctgaacg gaactatgat 2580 ataggtaaca gggagttatt ggcggttaag ttggctttgg atgagtggcg ccattggctg 2640 gaggggactt cagagccatt cttggtctgg acagatcata agaatcttga atatgttcgt 2700 tcagccaaga gattaaactc acggcaggct cgctgggcat tattttttgg gcgctttaat 2760 tttattttgt cttaccgtcc tggttccaaa aatgtaaagc cagatgctct ttcccggcaa 2820 ttcgatggcc caggggacag cctcacaccc aagcctatta tccaccctgg gcgggtggtg 2880 gcggctgttg tctgggacat tgaatcacag gtcagagagg ctcttcaggg agctaaggtg 2940 cctgagggat gcccagcgaa tctgttgttt gttcccaaat tgcttcgttc ttctgtccta 3000 cagtggagcc actcatctaa actcacttgc cacccaggag ttaggaggac attggcatct 3060 atccgtcaaa ggttttggtg gccttccatg gccaaagagg tcagtctgtt tgttgctgcc 3120 tgctcggtct gtgctcagaa taagtcatcc aataaacctc ctgttggtct gcttcaaccc 3180 cttcccatcc ccacccgccc ttggtcccac atatctctgg attttataac cggccttccc 3240 ctttctaatg gtaattcggt tgttcttaca gtggtggatc gcttttctaa ggcggtccac 3300 tttattcccc tagctaagct tccctctgct aaggagacgg cactggtggt ggtagaccac 3360 gtcttccgga tacatggcct tccggtggat gtggtctctg acaggggacc ccagttcgtc 3420 tccaatttct ggagggagtt ctgccggcag attggtgctt ccaccagtct gtcgtcaggc 3480 ttccatccac agaccaacgg gcagtccgag cgggccaacc aggccctaga gcagatgctt 3540 cgttgtcttg tctctcacaa cccctcctcc tggagccagc agcttacctg ggccgaatat 3600 gcccataatt ctttacccaa ttcttctact ggtctctccc catttcagtg ttgcatgggt 3660 tatcagccac ctcttttttc ttcacaggag cctgaggctg cggttccctc tgtccaggct 3720 cttatccaaa ggtgccgtcg cacctggcaa agggctagag aagccctcct tcaggctgaa 3780 agacgtgtca gaacctccgc taaccgtcgt cggactccgg ctcctaggta catttgtggg 3840 cagagggtat ggctttccac aaaggacctg ccacttaagg tgccagcccg taaactggct 3900 ccgcggttca ttgggccata ccctattgtc aaggtcctca gtccggcggc ggtacgtctc 3960 aggctcccaa cttcccttcg tcgtgtctat ccagtatttc atgtatcccg aattaaacct 4020 gtttttcgta gctctcttaa tccttctgtt cctatttccc cccccctccc ccccgtctca 4080 ttgagggctc tccagcgttc acagttagga gattgttaga atgcaggcgt agaggacgtg 4140 gattccagta ccttgtagat tgggaggggt atggtgcgga ggagagatgt tgggtcccgg 4200 ctcgcaatat tttggaccag tcgttgatta gggaatttca cagccgacag tctcctccct 4260 tgggatcgcc cagtggcgat cctggagggg ggggtac 4297 // ID ERV1-1-I_DR repbase; DNA; ZEB; 9909 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 03-OCT-2008 (Rel. 13.1, Last updated, Version -1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW ERV1; Endogenous Retrovirus; Transposable Element; ERV1-1_DR; KW ERV1-1-LTR_DR; ERV1-1-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9909 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1594-1594 (2008). XX DR Genome; chr20; Positions 8886575 8896483. XX CC Positions [4563-4997] - Reverse transcriptase CC Positions [6675-7151] - Integrase core CC 'CTAT' target site duplication CC LTRs are 99% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 3447..5852 FT /product="ERV1-1-I_DR_3p" FT /translation="MLTTNANEEPMLPIKIEGKTIPMLLDTGAVYTSVNSN FT YASHFPLSGKYAKTIGFSGKVQLIPMTTPVCIQTKNQSVTLPILVSSQTPV FT NLLGRDALCKLGLHIWCTSDGVYIDAKGIENQMVITEPKANVYWIGQIEND FT VKQTINKWGKYIEAQIDEAQIPKSGFHCTMVYDSERNPDIEQQWQKETEGQ FT KIEMISQHIILGKQGAALTIVKNKFAEKWFNVTDSVPHIAIYVGNNWKTKD FT LGQMVKKAEQSKWVSTENPLIFQSDDENYIQIVCTTNLMGIPQEVIIGQDK FT ITQMSTKSELINDLETEMFKEMECQVPPELWSLYDNDVGLIRSANPVRVQI FT KPNTQLPRKAQYPLRPEAELGIKNTIEGLLKAGVLEETTSYCNTPIMPVIK FT ADKNRWRLVHDLRAINDIVEDFPAEVPNPHTLLTNVPPNAKYFTVIDLCSA FT YFSVPLAKESRYLFAFTYGGKQYTYTRMPQGFKHSPHVFNQILKEDLEDLV FT MDSTLLQYVDDLIICSPSLEQCHKDSIKVLTKLAQGGHKVSKNKMQYCQPQ FT VEYLGRLIAFGTRAIAPAQLEGISKTPLPQTVGQMMTFLGMTGFSSDWVED FT YAIKTAPLRKLMKQAGLQNLKNSLTWNTDARIAFETIKKELQKTPALGNPD FT YSKEFSLYVANRVDGYASAVLMQETCSGRKKQPIAYYSTKLDNTAQGFPPC FT YQQGLAAIHYAYEKASSITMGYPVTIYTHHKTVELIEQGKFVLTQARILAY FT STLLTYPDVTIKRCKTFNPAELIPLAFEGEPHDCVANSLTFTRLRPDLESS FT " FT CDS 6399..7574 FT /product="ERV1-1-I_DR_4p" FT /translation="MWHQRGAKKDSQDIWRTHEGQIVAPTALLNILISDAH FT GFDHCAKGEVIRKIKQQGYWSPYLHAMVGEFLSSCEICAKYNVRKGTATPI FT GHIPVPEGPFKHLVMDYVDMIKRVQGKRYMLVIIDRFSRRVKAVPSADLGA FT GTVIKFLTREVIPRFGIPSEICSDNGSAFVQKTFKQVLQHLRIEQRLGCIY FT HPQSQGIVERVNGTLKAKLNKICASTKLNWVDALPLALMSYRMQTNRNTHL FT TPHEMLTGRPMPAPFCRGPYKGPPLEQLQMELKSYMKKLTAIHKAIYDQET FT RRQPLEEGETPGPVVPGDQVYLKVFRRKWNEPRREGPYRVVRATPTAIQVE FT GSTTWYHLNHCTRVPKEKVTAGEHQQETVSPEKPTAESDGEQNVAENDE" FT CDS 7543..8868 FT /product="ERV1-1-I_DR_1p" FT /translation="MVSKMWQRMMNKTIRQVTMIICLSIIYMPQNQATRNN FT ITQNPTTLNDDLQLKSNISDTGSITQRPNYTSMLTTLQSLRNESSSANTTE FT LSLEEYEKQRRDVDNLYVIHEEHTELDILLETAQSNQWFQWASFTASEISK FT ENCLFCTKTPMSQVVVVPNPFTYEHCAEFNRDFCHMKQDLRLFCPAECLGY FT LSHSTLKTLFRHVGRLKSKCRSFGINENLKVKNIQVEIPLWFSIDYTKEYE FT CFVKPSGVVDVGKFKGKCSVIWNLDPQKTLRNYDDPPFASSFRATGTKHGK FT IIEPAKNCENQTIALPDIDRYKDQTLTLADFFWMCGNRKLLPSLPLGWTGT FT CTRVRIIQEIQVINWETESTIGESDTINYRVKRSYKPDPTVYIDSIGQPRG FT IPDQFKARNEVKSGIESILIWISQNKNTEWINYIYYTNKDLLITQMKH" XX SQ Sequence 9909 BP; 3871 A; 1579 C; 1988 G; 2471 T; 0 other; aattggtgac cccgacgtga tttggctagg agcaacggag tggggacctc gctggcggag 60 tctgatgcgg gcgacacaat tgcagtggcc actgtgaaaa aaggtaagca attttttgct 120 tatttaatga aaaattgtgt ggctcgacgg actctgcttg tgagggtaaa aacactcaaa 180 gtgctcttcc aaacttaaat aattaatacg tttaaaaaag ggttttgatt tcagaagaaa 240 atgttgcatg caacaatctg tgtcttgctt taagagggcc ttgaggataa agataagaca 300 ttaacaacta attgttactt atattaaaga ttttttgaag gaagtaagga taaaatctta 360 atagattgtg acgattaata agttaaaaca gattaataaa gtagagagaa gcgcgcaaag 420 acctcggagt acccgttctg ttttaataat aataaatatt atgattcagg aggctggggg 480 aaaaaaggca aaaatcctgg aaaatgctat ttttactgta gagcagtaga gattggaagc 540 agaagtaaca ggtcacttag gaaagtgttg caagcctgtg tgtgttgggc tctgtgtccg 600 tgagattgcg gtattgaata aggttaatac atgaacccca taggataagg ggggtagaaa 660 agtaaatgct ttaaaaagtg tgataaaata tttcatgaac cccataggat aaggggggta 720 gaggagtgaa tgtttttaga agtgtgtatt caataaagca ttgaagtgaa tgaatgtttg 780 aagaaagaat tgtaaataaa aattgaggaa agacttggga aaacaaagta gaacaaatgg 840 aaaagttaac tagttaaaac aaaaaagtag taataataat aattaggctt gtgctgcagc 900 aattaaaatt taaaataaaa gtgagggtaa aacaataata ataaaataaa aataaagata 960 aaaattaaat aaataaatta acaatcaaaa taaataaata aatataactt gactaaaatg 1020 aataaaggat aaaatagaag gataaacaat taatattgat tgatatttag acaagaaaaa 1080 agaagtcaca ataataaaaa actcaataaa taaataaaaa taataataaa aaaaaataat 1140 aaacaaaaaa gaaaaaataa ttagaagtac aaattatata aaatttaaac tacttattta 1200 aagaataaat aaaaaaataa gtaaaaagaa aaaaaataat actactaata aataataata 1260 ataaataata aaaataaata aatataaata aataaataaa atttagatac caaacaagtg 1320 gaaaaagaaa attgattaaa gtcaaataca aataataaaa acaaagtttt aattataata 1380 ataataaaaa taattataat aagaagaaga acaacaatca agtgataaaa caaataaata 1440 aaaaataaca aaataaagac acataatagt ttacaaataa agaagatagc aatggcaagc 1500 tctccagtag aaatactagg atcaaagtat ccattgtgta aagagcttat tgataaatat 1560 tcaaaaaagt agcaacttag aactaaaaat atgatgacaa tctggccaca aggtggaact 1620 tttgatgtag cattatgtga agaaatggat actctaataa aaaattataa aagaaaaggt 1680 agtagtagga gacaaaaaaa aaagatgaaa aaagagcaaa agagagagaa gtgttaaaca 1740 tgttcaaaaa aggagaagat aatttaagaa atattaaaaa ggcaagaaaa atgctgaaag 1800 aggctgacaa agaaaccact gctaaagagc aaatgtttaa tagccctcca ccttatttac 1860 aagctgaagg gcaattccct atgcttaaag ggctcatgga ggtgtcagga gaacttaaat 1920 tggaaggaga gatggaaatg gaggaaaaac aattaggggc aagcgtaaaa tcaaaacgat 1980 cagcaaataa gcaaaataaa ggggcatatg cacccttgga ttgttataaa caggcccgtg 2040 aagcattaga acaaatgaat aaacagtgtg aggacaaatt agagtatagg gagccaaaag 2100 aacagacaaa atcatataat aaggaaaggg gagcatgggg tggaattaaa gatattgaca 2160 tgggaagtac aattggagaa actgaaaaac tgttaaaaga cacatatgac aaagtaaaac 2220 aaaaaaggag attgttattc agttcagatg aggaagaaga ttgggaggag tcaatacttg 2280 atacaggaaa ctgggaaagg gggaacaggc ttaggccatt taaacaatgc ccaattttaa 2340 taaaaggtgc acaagcacag tatgtccctt ggcctacatt ggatttggag ggtttgattg 2400 ctaggctccc aaacatccat gagggggcag caaagtggat aagggtattt gaagaggaat 2460 cagttggaaa attgatttct ttgggagaca ttaaatctct tcttgcaaag actgtaggag 2520 gagcaaagat ggatgaaatc cttcacgctt ctaaccttga cagggctgtg aactctcaac 2580 agatggatgg gactattttt gatgccttcc gtccagcagt gtggcaggca ctgcgtgcag 2640 agtatccact aagactggac cctaatatgt taagaggtga ttagttgggg gaaacagaga 2700 atccaataac gtatgttcag agacattcac gacaatggaa acatgacact gaaattgatc 2760 ctgaaagaga ccctataatg gctacactgt ttcgacaagc cataattgac tctatgcctt 2820 cctcagtgaa gagcaggtta gaagatgtgg tggggttaaa ctctaaatct cacaaagagt 2880 tttgtgatta tgtggcacat gcagttgagc agtacaggaa aaatgaacaa aaactcaaaa 2940 agcaagaaag tgagctgcaa agaaaattag ctcaattgca acttgatgaa ctgactaaaa 3000 agaacaagaa aaagatccaa gctcttgtag agacagagga gggggaacca ctaacaatga 3060 tggctccagt tacagctcca gcaccagcta tgcagtctgc ctcacaagct aaacatccaa 3120 ctaataccca aatgactcca cccataataa acaattatac ccaacagcca gcagccgcga 3180 cctggagaaa aagacccaga gagcaaagag ggaatggcaa agcctatgtg acacctacag 3240 gcatgtgttg gggttgcgga ttatcaggac acaccaaaaa agactgtccc accaatccat 3300 gggagcagtt tcctcggggt ggaagggagg taacttggca tcaaacatat ccaaatccta 3360 accaaggtcc agttaatcct tggagaggac ccaataaggg atattgaaga tgcccagaga 3420 atcctatagg ggataatcag cttccaatgt taacaactaa tgctaacgaa gaacctatgc 3480 taccaattaa aatagaggga aaaacaatac caatgctcct tgatactgga gctgtttaca 3540 cttctgttaa ttcaaattat gcgtcacatt ttcccctgtc tggtaaatat gcaaagacaa 3600 taggattctc gggaaaagtt caattaattc caatgactac tccagtttgt atacaaacta 3660 aaaatcaaag tgtgacgcta cccattctag tatcaagtca aactcctgtc aatttgctag 3720 gaagagatgc attatgtaaa ttaggactac atatttggtg tacttcagat ggagtgtata 3780 tagatgcaaa aggaattgaa aatcaaatgg taattaccga accaaaagct aatgtgtatt 3840 ggataggaca aatagaaaat gatgtaaaac aaacaataaa taaatgggga aaatacattg 3900 aagcacaaat tgatgaagca cagataccaa aatcaggttt tcattgcaca atggtttatg 3960 attctgaaag aaatccagac atagaacagc aatggcagaa agaaacagaa gggcaaaaaa 4020 ttgaaatgat ctcacaacac atcatattag gtaagcaggg agcagctcta accatagtaa 4080 aaaataaatt tgctgaaaaa tggtttaatg ttacagattc agttccacat atagctatat 4140 atgttggaaa taactggaaa acaaaagatt tgggacaaat ggtaaaaaag gcagaacaga 4200 gcaaatgggt atcaacagaa aatccactaa tctttcagtc agatgatgaa aattatatcc 4260 aaattgtgtg tacaaccaat ttgatgggaa ttccccaaga agttattatc ggacaagaca 4320 aaataacaca aatgagtaca aaatcagaat taataaatga cctagaaact gaaatgttta 4380 aagaaatgga atgccaagtg ccaccagaat tatggtcttt atatgataat gatgttggct 4440 tgataagatc agctaatcca gtaagagttc aaattaaacc aaatacccaa ctgcccagaa 4500 aagcacaata tccattgcgg cctgaggcag aattaggaat caaaaacact attgagggtt 4560 tattaaaagc aggagtatta gaagaaacaa ccagttattg taacactccg ataatgcctg 4620 taattaaagc agataaaaac agatggcgtc tggttcatga tttaagggca attaatgaca 4680 ttgtagagga ttttccagct gaagttccta acccacatac tctcttgaca aatgtccctc 4740 ctaatgctaa atatttcaca gtgattgacc tttgttcagc ctatttcagt gtgccattag 4800 caaaagaaag taggtacttg tttgctttca catatggggg taaacaatat acttacacca 4860 gaatgcccca ggggttcaag cattcgccac atgtattcaa ccaaattctt aaagaagatt 4920 tggaggatct tgtaatggac agtactctgt tacaatatgt agatgactta atcatctgct 4980 ctccttcatt agaacaatgt cataaagatt caattaaagt cctcactaag ttggcacaag 5040 gaggtcataa agtatctaaa aacaaaatgc agtactgtca acctcaggtt gaatatttgg 5100 gacgattgat tgcctttggc acacgagcta tagctccagc tcagttggaa ggaataagta 5160 aaacaccatt accccaaaca gtaggacaga tgatgacttt tttaggaatg acaggtttta 5220 gttcagactg ggtagaggac tatgcaataa aaacagcacc actaagaaag cttatgaaac 5280 aagcaggatt acaaaactta aaaaactcat taacatggaa tacagatgca aggatagcat 5340 ttgaaacaat aaaaaaagaa ttgcaaaaaa caccagcatt aggaaatcct gattactcta 5400 aagaattttc tttgtatgta gcaaatagag tggatggtta cgcatcagct gtactgatgc 5460 aagaaacttg tagcggaaga aaaaagcagc ctatagcata ctatagcact aaacttgata 5520 atacagctca agggtttcca ccatgctatc aacaaggtct cgcagcaatt cattatgctt 5580 atgaaaaagc atcatcaata accatgggtt atccagtcac aatttatact caccataaaa 5640 ctgtagagtt aatagaacaa ggaaaatttg ttttaactca agctcgcatt ttagcttatt 5700 caactctatt aacatatcca gatgtaacta ttaagcgatg taaaacattt aatccagctg 5760 aattaattcc attagcattt gaaggagaac cacacgactg tgtggcaaat tcattaacat 5820 ttactagatt aagaccagat cttgaatctt cctgaggcag aagtaactta ttttgtggat 5880 ggttcaagtt ttagggacca tttaggaatc catacaggat atgcagtagt caaaaaagag 5940 aaggaagaat ttgtatcagt aatatctcat cattgtgttc aaccatgctc tgctcaatta 6000 gcagaattaa aagcgcttac tactgcttgc caattagcaa agggactgac tgcaaatatt 6060 tatacagatt cagcatatgc acatggtgta tgtcacttat tcggagcagt atggaaacaa 6120 cgaggcttta agaaaactga tgggtctcct atccaacatg cagaacaaat aagtgaatta 6180 atttcagcaa tgatgcaacc aaaacgattg gcaataataa agtgtcaagc acataaaaaa 6240 ggaaatgact ttgtcataaa aggaaataac atttagaagc taaaaaagca tcagggtgtg 6300 aagtagcagt tctaacccca gtagtgctaa tagagccaca acctcaatta gatgatattg 6360 tacgaataca acaacaagca ggtccctatg aacaatcaat gtggcatcaa aggggggcta 6420 aaaaagactc acaagatatt tggcgtacac atgaagggca aattgttgca ccaacagcac 6480 ttctgaatat tttaatatca gatgcacatg gttttgatca ctgtgctaaa ggagaagtaa 6540 taagaaaaat caaacaacaa ggatattggt ctccttattt acatgcaatg gttggtgagt 6600 ttttgtcctc atgtgaaatt tgtgctaaat ataatgtgag aaaaggaaca gcaacaccta 6660 taggccatat accagtacct gaaggaccat ttaaacattt ggtaatggat tacgtagata 6720 tgataaaacg agtccaaggc aaaagataca tgcttgtgat aattgatagg ttcagtagga 6780 gggtgaaagc agtaccatca gcagatctgg gagcaggaac agttattaaa tttttaacta 6840 gggaagtaat acctagattt ggaataccat cagaaatttg ttcagataat ggatcagcat 6900 ttgtccaaaa aactttcaaa caagtattac aacatttaag aatagagcaa agactaggat 6960 gtatttatca tccacaatca caaggaattg ttgaaagagt caatggaacc ttaaaagcta 7020 aacttaacaa aatctgtgcc agtactaaac ttaactgggt ggatgctttg cctcttgctt 7080 taatgagtta ccgcatgcaa actaatagaa acacacatct aacaccgcat gaaatgctaa 7140 caggccggcc tatgcctgca cctttttgta gagggcccta taaaggacca ccactagaac 7200 aattacaaat ggaattaaaa tcatatatga agaaactaac tgcaatacac aaagctattt 7260 atgaccagga aaccaggcga cagccacttg aggagggaga aacgccagga ccggtggtgc 7320 caggtgatca ggtgtatcta aaggtgttcc ggagaaagtg gaatgagcca agacgggaag 7380 gaccatatcg agttgtgaga gccaccccaa ctgcaattca agtggagggg agtacaacgt 7440 ggtaccatct aaaccactgc acgagggttc caaaggaaaa agtgacagca ggagaacacc 7500 agcaggaaac tgtcagccca gaaaagccca cagcagagag tgatggtgag caaaatgtgg 7560 cagagaatga tgaataaaac tattagacaa gtgactatga taatatgtct aagcatcata 7620 tacatgccac aaaatcaagc cactagaaac aatataactc aaaatccaac cactctcaat 7680 gacgacctcc aacttaagag taacatctca gatacaggaa gcataactca aaggcccaat 7740 tacacttcca tgctgaccac actgcagtca ctaaggaatg aaagtagtag tgccaataca 7800 actgaactgt cactagaaga atatgaaaaa cagcgaaggg atgttgacaa cttgtacgtc 7860 attcacgagg aacacacaga actagacata cttttggaaa cagcacagag taatcaatgg 7920 tttcaatggg cttctttcac agcatcagaa attagcaaag agaactgcct tttctgtaca 7980 aaaactccaa tgagtcaagt ggttgtggtc ccaaatcctt tcacttatga acattgtgct 8040 gaatttaaca gagatttctg tcatatgaaa caagacttga gacttttctg tccagctgaa 8100 tgtttgggtt acttaagcca ctcaacacta aaaactttat ttagacatgt tggacgtttg 8160 aaatctaaat gtagaagctt tggcattaat gaaaatttaa aagtgaaaaa tattcaagta 8220 gaaattcctc tatggttttc tattgattac acaaaagaat atgaatgttt tgttaaacct 8280 tcaggagtag tagatgttgg caaattcaaa ggaaaatgta gtgtaatttg gaatcttgat 8340 ccacaaaaaa ctcttagaaa ttatgacgat ccaccatttg cgtcatcatt tcgagcaaca 8400 ggaacaaaac atggaaaaat aatagagcca gcaaaaaatt gtgaaaatca aactatagct 8460 ttacctgata tagatagata taaagatcaa acgctaacac tagcagactt cttttggatg 8520 tgtgggaata gaaagctgtt accttcttta cccttaggat ggactggtac ctgtactaga 8580 gtacgtataa tacaagaaat acaagttata aattgggaaa ctgaatctac aataggtgaa 8640 agtgacacaa ttaattatag agttaaaaga agttataaac cagaccctac agtatacata 8700 gattccatag gtcagccacg aggtatacca gatcaattca aagcaagaaa tgaggttaaa 8760 tcaggaattg aatcaatact tatttggata tcacaaaaca aaaacacaga atggataaat 8820 tacatttact acaccaacaa agatttatta atcacacaaa tgaagcatta atgtcattgg 8880 gggaacaatt agatgcaact agtagaatga catggcaaaa tagattagcg cttaattgga 8940 tgttagcgga taaatcgggc gtatgtgtgc tctttggcga ccagtgttgc acctacatac 9000 caaataatac agcgccagaa ggtacgttta caaaagctat gacaaaactc aaagaattac 9060 aggtagagat ggctacaaat gcggggagag atgaaaaaat gtgggactgg tttgatctga 9120 aactaggagc atggggagcc tggctagcta aaatgggaat gtttttaggt gtggcaatat 9180 taataggagc attattattt tgttgtgtac tgccattatt aaaatcatta attgtcaaag 9240 cttcaacaaa gcaaatggaa atgctacaat atcacggaag tgggaaaatg atgcttgtaa 9300 atgaccatta tggacagtct gggcactggg cagatcaact taaatggcaa actcaaatat 9360 caacaagttc tgaagagtct gaaagctcag acgaagaaaa ggaatacaaa tgattcatac 9420 cctgggtgta agtgctagaa cctctacccc aagggaggcc ctcaacgatg cgcaaagtat 9480 ctgggctgaa gaggaacacg ttgtgatggg caactacttt tgatgtttgt gattattgtg 9540 tgagatatat tatgtgtgta aaagattaat gataacaata tctatatata ttaatgtatt 9600 ctacataact gtgacaccca caggcaagcg ctgaaccaat tgcacgctca tgctttaaac 9660 actattttct atgaaacaga aaaaacagag gaagaagtat tttactatcc ctatgtcaac 9720 agtaggggta gatgtttggc ttctacttct gccacagtgc ggtagcataa tctggtcaat 9780 tggtaactat tatgtggcta atatcagtca ctagatagtg gtaactaagg gaaccggatt 9840 gtgaaattgc actatgggta aaaaacaata atattaagac ttcagaaaaa gtcttaagag 9900 gagggaata 9909 // ID TDR12 repbase; DNA; ZEB; 242 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 31-DEC-2007 (Rel. 13.01, Last updated, Version 2) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; TDR12. XX NM TDR12. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-242 RA Jurka J. and Drazkiewicz A.; RT "TDR12: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 22-22 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. Putative non-autonomous Mariner CC element. XX SQ Sequence 242 BP; 84 A; 36 C; 41 G; 81 T; 0 other; tatagaccat tttgagggat gtaaacaaaa acaatggtcc caacgtattt cctgttttac 60 atttttaatt tctatagctt ccgagaatcc aaaaagagcc acatattgat aaataatgtt 120 atgatagctg ttttaacatt aagttatgat tgaattgcct cttgttacag ttatgaaata 180 gtttgataac aagcaggaaa tgttcatggg ccaatgacat gaccacattg aaatggtcta 240 ta 242 // ID HARBINGERN6_DR repbase; DNA; ZEB; 393 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE HARBINGERN6_DR is a nonautonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Nonautonomous; KW 18-bp TIRs; 3-bp TSD; Dr000179; Harbinger superfamily; KW HARBINGERN6_DR; nonautonomous DNA transposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 2-392 RA Bao Z.; RT "Dr000179, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-393 RA Kapitonov V.V. and Jurka J.; RT "HARBINGERN6_DR, an ancient family of Harbinger-like RT nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(6), 7-7 (2002). XX DR [2] (Consensus) XX CC HARBINGERN6_DR copies are flanked by 3-bp target site CC duplications. CC It has perfect 18-bp terminal inverted repeats similar to other CC HARBINGERs. There are several thousand copies of HARBINGERN6_DR CC harbored by the zebrafish genome; they are ~10% divergent from CC the consensus sequence (some subfamilies are much younger). XX SQ Sequence 393 BP; 112 A; 94 C; 93 G; 94 T; 0 other; ggctgcgtcc gaaaccgcat acttccatac tatatagtac gctaaaatca gtatgcgagc 60 cgagtagtat gtccgaattc atagaattcg aaaatcagta tgcgagaagt acccggatga 120 cttactactt ccggcgagat tctgaagtgc gcatcccatg cacgctgcgc tatcccatga 180 tgccccgcga gagaattcat gaatgggagt gaagcgacgc aactgacgca ggtaggtcac 240 gtgaccatga caaaatggcg gatgtagtac gtccgaattc cattcatact actcacattc 300 atactgtata gaacgtactt ttctaacggc cgagtagtac gtttaaattc aaatgcagta 360 cctactgagt agtaggcggt ttcggacgca gcc 393 // ID DNA-8-27_DR repbase; DNA; ZEB; 1883 BP. XX AC . XX DT 11-DEC-2008 (Rel. 13.12, Created) DT 11-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE A nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-27_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1883 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(12), 2150-2150 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1883 BP; 615 A; 350 C; 328 G; 589 T; 1 other; tagggctgca actaacgatt attttaataa tcgattaatc tgtcgattat tttttcgatt 60 aatcggataa aaaaaaaaaa acaagaaaag cattaatttc caacccttta ttcaaaaaag 120 ctcatccctg agctgttata ataataataa taataataat aataataata ataataataa 180 taataataat aataaataat aaactaacta gttttgtcct gttttcaatc caaatatcta 240 aatgttttta aatgaagata catactagac acatgaaata gcataagaaa tcacttaacc 300 tcaaaattaa gtgattgttt gcttaaaaca agctaaatta tttgccaatg gggtaagaaa 360 aattatctta ataagatata atctcaaaca gaagttttca gcatcacttc attttctcat 420 gccatttttc ttgtctggtc ttgatgtaag attttttaga tatttgaact ggaaacgaga 480 caaaatgact aagaaagaaa agttttctgc agtgcagcta tacatgacaa cagatggaaa 540 aatgaaggat ccaaaaaagg cgagtgaata aaaatgtgtc tttagtcttg caatgcaaag 600 taactctacc accgctgctt tactgctgtt attaactctt tcactgctga gtttaaaaga 660 ttctatctgg cccgggagtc agtttttttt agaacattca cccttaatgt ttccgtggcg 720 acgcatcatg tttgtcacgt gacacaccgc agccacaaca gtgatgtatg acagatgtat 780 cccttttatt tagttttttc tgttttaatt caacatattc acacgcttgc tcatcatgtc 840 atcaactacc tgatattcca attcacataa acgtgtaagt gatcgtgatc tataacgtga 900 gatggacgtc tccatgttta cttgcggccg ctgcctcatt cataaaaagc agaacacgca 960 ggattcagca ggtacattca taagttactc ccaaaataca tgcaagtaag tggctggtga 1020 acttagagaa gcgtagattc aactttatag ttactttcac tatgtgtaat gtgtatctga 1080 tatgaataaa acacatcggc aaattatatt tgaatggatt gttagacttc cttgtgtgtt 1140 ttacaaactc taaatgtatt gttttactag tacttgtgtg tatttacata tctaaaacat 1200 aaatgaagta ttaggcggtt aaaacgctgc ataattgtga taatatgaca cgtctttcag 1260 cggagttaag ttggttttgt tttcgaaaag aaaaacaaaa acaaaaaaaa gagagcactt 1320 ttccgacggt cccttactga gagctctgct cgagctctcy cggctcaacg catattgcga 1380 ctgcttaaac ggagcagtgc tcgtaatatc gagcgaaaaa aaaaaagaca gctgtgaaac 1440 ataacctgct ttgtcttttt gtaggaaaca cagtttagat ctttttaggg taagagtttt 1500 ttgagttaat gccgtctatc actgaatgtg tcaggaatac cgaaaacaaa accaacttaa 1560 ccccgctgtc tttctctggc tcagcgccag acacaaacac cagcaccagg gactttacgg 1620 ttctcacttc aaaacggaac aaaagtagga ttctgtagtg attcttccgc atcatccctg 1680 tcttctttga ggcgctgaac tctgtatcag tgcgcgagag acggagaggc cgtgtttact 1740 ccgctccgcg ctgagctaaa gtgttctttt tttaatcaaa cgtctccgtg tcgcgcgaca 1800 caacgaatcg ataatgaaat tcgttgccaa cgcttttagt aatcgatttt tatcgattta 1860 atcgattcgt tgttgcagcc cta 1883 // ID CR1-6_DR repbase; DNA; ZEB; 2341 BP. XX AC . XX DT 20-OCT-2008 (Rel. 13.1, Created) DT 20-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE CR1-6_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2341 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1201-1201 (2008). XX DR [1] (Consensus) XX CC The 5'-terminal is uncertain. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1..1806 FT /product="CR1-6_DR_1p" FT /translation="LTQPVSFSRVFHENSALEFSNSYTNFLSNANASLPSC FT SDTEQLAETFNSACDFALNEVAPLKSKRSRVSSIPQPWLNSSTRTLRQECR FT RAERRWKKDGLQVSYQILKDCLNSYQDSVSEAKSKYFSNLIAKSASKPKVL FT FKTINSVLNPTMCSVPVVNLETCIKFSDFFAQKIQDIKSHLFPTDIDHSDS FT VPPSNSFTNFQPVSSSQLLNTISQMKLTFCPSDVLPCRLFKEVLDTICPSL FT LALINSSLINGVFPDVFKHAVVQPLLKRPNLDPAVPDHYRPISKLSFISKV FT LEKIVLSQLTSYLDIFNILDPFQSGFRAKHSTESALLKVTNDILLSMDSGS FT SAILILLDLSAAFDTVDHNILLRRLECVVGVQGLALQWFSSYLKGRSFSVN FT IGSFTSPPKPIHCGVPQGSILGPLLFSLYMLPLSSILLKHNISYHCYADDL FT QLYFPIKLGKDLSLENLFLCLTDIKTWMTNNSLQLNENKTEVLILGPAATN FT NSIKTQLGALNDNLHNHAKNLGVIFDPLLQFDKHVNAVVKSSFFQLRLVAK FT VKRFLSKKRFRNRNPCFNLLSVRLLQFFVHWPTSVNTLSATVGPKCSRQIN FT NRN*" XX SQ Sequence 2341 BP; 637 A; 521 C; 371 G; 812 T; 0 other; cttactcaac ctgtctcttt ctctcgtgtt ttccacgaaa attctgctct tgaattttct 60 aattcttaca ccaactttct ttcaaatgct aatgcttctt tgccatcttg ttcagacact 120 gaacaacttg ctgaaacctt taattctgct tgtgattttg ctttaaatga agttgctcct 180 cttaaatcaa agaggtctag ggttagttca attcctcagc cctggttaaa ttcatctacc 240 cgtactctca ggcaagagtg cagaagggct gagcgcaggt ggaaaaagga tgggcttcag 300 gtttcctacc agattttaaa agactgccta aattcatatc aagactcggt cagtgaggct 360 aaatctaaat atttttcaaa tttgattgct aaaagtgcaa gtaagccaaa agtccttttt 420 aaaactatta attcagttct taatccaact atgtgctctg ttcctgttgt caatctggag 480 acgtgcataa agttctctga cttttttgct cagaaaatac aagatattaa atcacacctc 540 tttcccaccg acatagatca ttctgattcg gtccctccat caaacagttt cacaaatttt 600 cagcctgtat cttcttctca gctacttaat acaatttctc agatgaaact caccttttgt 660 ccttcagatg tgctgccttg ccgtcttttt aaagaggttt tagatacaat ctgcccctcc 720 ctcttagcct taataaacag ctccttaata aacggggttt ttccagatgt cttcaaacat 780 gctgttgtac agcctctatt aaagaggcct aatcttgatc ccgctgtgcc tgaccactac 840 aggcccatat ctaagctctc atttatctca aaggttttag agaaaattgt gctttcacaa 900 ttgacctcat atctggacat ttttaatatt ctagacccct tccaatctgg ttttagagca 960 aaacacagca ctgagtctgc actactaaag gtcacaaatg acattttact ctccatggat 1020 tctggatctt ctgcgatttt aatactttta gacctaagcg ctgcttttga taccgtagat 1080 cacaacatat tattgagacg acttgagtgt gttgttgggg ttcagggcct tgccttgcag 1140 tggttctcct catatctaaa ggggagatct ttttctgtaa atattggatc ctttacttca 1200 ccacctaaac ctattcattg tggtgttcca caaggatcta ttcttggccc tcttttattt 1260 tctctttata tgcttccttt aagctcaata ctcctcaaac ataacataag ctatcactgc 1320 tatgctgatg atctccaact atactttcca atcaaactag gaaaagattt atcattagaa 1380 aatcttttcc tatgccttac cgacattaag acctggatga caaataactc tctgcaacta 1440 aatgagaata aaacagaagt gttaatttta ggtcctgctg ctaccaacaa ttcaattaaa 1500 actcagctag gagccctaaa cgacaatctg cacaatcacg ccaagaatct cggagttatc 1560 tttgatccac tcttgcagtt tgacaaacat gtaaatgcag ttgtgaagag cagtttcttc 1620 cagcttagat tggtggcaaa agttaaacgc tttctttcta aaaaaagatt tagaaatcgt 1680 aacccatgct ttaatctcct ctcggttaga ctactgcaat tctttgtaca ttggcctacc 1740 tcagtcaaca ctctctcggc tacagttggt ccaaaatgca gccgccagat taataacagg 1800 aactaagaaa agagatcata tctcccctgt ccttgcctca ttgcactggc tacctataaa 1860 atacagggtt gattttaaaa ttcttctttt cacttataaa gctttgcaca attctgcccc 1920 ggcctatgtc agcgagctta ttaggccata tacagcttgt agattcttaa gatcacatga 1980 tcagtttctt ttgtctgtcc ctcggtctcg ctgcaagtca aagggtgatc gagctttcag 2040 tgttgcggcc ccaaggctct ggaatagtct ccctctgaat atcagaatct ctccttcttt 2100 ggatgctttt aaatctaact taaagacaca tttttactct cttgcctttg actgatgcat 2160 gcatgttatt tttgtattat ttatatttta gtattacttt taaatatatt ttattgtatt 2220 tttaatacca gttattttta gttttattgt ctagtttatt ttattgtaaa gcactttgga 2280 tcaactacgg ttgtgttaaa ttgtgctcta taaataaagt ttgccttgcc ttgccttgcc 2340 t 2341 // ID ERV4_DR-LTR repbase; DNA; ZEB; 331 BP. XX AC . XX DT 29-AUG-2008 (Rel. 13.08, Created) DT 29-AUG-2008 (Rel. 13.08, Last updated, Version -1) XX DE Long terminal repeat of ERV1 Endogenous Retrovirus from Danio DE rerio. XX KW ERV1; Endogenous Retrovirus; Transposable Element; LTR; KW ERV4_DR-LTR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-331 RA Smit A.F.; RT "ERV4_DR-LTR - ERV1 Endogenous Retrovirus from Danio rerio."; RL Direct Submission to Repbase Update (05-AUG-2008). XX DR [1] (Consensus) XX CC LTR of complete ERV4_DR with identical LTRs, identical copies of CC which are at danRer4 chr14:71365456-71373010 and CC chr18:19912497-19920051. XX SQ Sequence 331 BP; 87 A; 73 C; 64 G; 107 T; 0 other; tgaaagaaat atccatgtag tcactctcca aatattctgt tttgtaactg tgtcttatct 60 tattgtaagt tgtccctgta ctaggcgcct tgacgttgcg ccttgacatt cttgagcact 120 ggataaaatt gttctgcctt actgagctga atatttgcac gtattcatag cccatgtaac 180 cctgagtatg atgtataaaa gctgggaata ccccagtctt ctttagaaga gagcaataaa 240 ctgttggatt cacatctctc agttgttgtt gccttcctgg agctccagct tactgaggaa 300 acacacagac ggtctgcaat ttatcccaac a 331 // ID DNA-2-33_DR repbase; DNA; ZEB; 501 BP. XX AC . XX DT 28-FEB-2009 (Rel. 14.02, Created) DT 05-MAR-2009 (Rel. 14.02, Last updated, Version 1) XX DE Nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; 2-bp TSD; KW DNA-2-33_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-501 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 9(2), 529-529 (2009). XX DR [1] (Consensus) XX SQ Sequence 501 BP; 184 A; 65 C; 68 G; 183 T; 1 other; cactgtaaaa cctaacagtg aaaatcacta aatgaaataa gttagtttaa cttataattt 60 ttaaaaaagt tactttgact taatgaaaaa gagttatggt aactcataaa ctgattatat 120 taagcagaac tcaaaccgaa tgagttatga agttatgaat taattaatta tatatatttt 180 gtgttrtatt tgtgttaact ctaatgctag aagcaatacc aaaccatttg agtagctata 240 tagttgtttg gacttaattt gttctattaa ctgaactcat actgatttga taactgaact 300 taaaatgttt aagttactaa tactctaaat gtttgatgac acaaattatc aacacaaata 360 atatatgtgt atatgactca atatatacta agtttatagg tttacagtac tcaaaccatt 420 ttgtttcaaa acttaaatgg tttgccgcaa tcggtttcct caaacagttt gagttaactt 480 aacttatcgg gttttacagt g 501 // ID hAT-N42_DR repbase; DNA; ZEB; 713 BP. XX AC . XX DT 16-OCT-2008 (Rel. 13.1, Created) DT 16-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N42_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-713 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1312-1312 (2008). XX DR [1] (Consensus) XX CC Members are ~90% identical to the consensus. TIR is 16 bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 713 BP; 224 A; 158 C; 126 G; 205 T; 0 other; tagggatgtc ccgatcaggt tttttttgcc ctcgagtcca agtcagagtc atttgatttt 60 gagtatctac cgataccgaa acccgatccg atacttctat aatacataaa aaaaataaag 120 aagagcgaag aaacagatcc aggatgttcc ttatttttta tttaattcac cttattttaa 180 cattcaacaa ctctgttaac aaacagagca cttctgtgag gtagcttgaa caatcaagta 240 ataaataaca tcaattcttc acttttggac tttagtgcaa cagtaaatat ataaaaaaaa 300 tctaatataa aaacaaatag cacctcaact taaaataccc ggcaggcaat cccaagttta 360 catatctcac agtttgctat ggcaatgttg tcgtcatcaa ctttataata cctccagacc 420 gcagacatac tcgcgacttt ccgcttcaag ctgcttccgt gttctacttt gccggcattt 480 aaccaatagc gtctctgcaa caaagtgatg tcatgccgca cgcgttgctg tttctgtgtg 540 aagtcaagaa agaggtctaa ctctgtgcca ccgcataact atagatgtgt taaaaaataa 600 tgagaatata tattcgagtc ctgatcggga ggtaacgtcc gattccgatc agagtctgaa 660 accgcgtgat cgggcccgat ttccgatcac gtgatcggat ctggacatcc cta 713 // ID DNA-TA-3_DR repbase; DNA; ZEB; 1041 BP. XX AC . XX DT 26-NOV-2008 (Rel. 13.11, Created) DT 26-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD TA; KW DNA-TA-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1041 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1732-1732 (2008). XX DR [1] (Consensus) XX CC TSD is 2-bp (TA) long. TIR is 26-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1041 BP; 326 A; 189 C; 188 G; 336 T; 2 other; acgtagacta accatgttcc gtacactttt cgctattcag cgcactcaaa cgaccactcc 60 tccgtcattg aaagccagat aagcatgctt gtgtgttcat tctctggggg aatacctatt 120 gcacaatcca aatcttgatt gatgtctttt ggaccagtag tttttgagaa aagatgggat 180 aagtaccgcc ttcgcatgtg tacggaagat gataggtatt aacggtgttc cgtacacgac 240 taaaatagtg ttccgtacag ggcgcataac tttaatttac tggcaaaaat gggcattgtt 300 tgaagtctta ttaaaatctt atgtcagaat atactattat tgtgtttaac ttgaagttaa 360 tgatatatta aataactctt ttatgaaagt tttaaatgaa atagcttcaa aatcatttac 420 tttttatatc atgtttattt tattacagca tgacaattta aaatattaaa acattaaatt 480 catgcacatt raagctaact tgactgacct tttatgaata taaacatcat yactgcaaaa 540 ggaggatcca aacagcgttt gcagcctgtc tctcaggctc tctctggcgg agagttgcga 600 catgcggtaa ttacccacaa ttgatgttta aacatcccca tttcaaagca tttattcaac 660 aaaagataat ttatttaaca atttgtgata acataatatt cacttcataa agactgtaat 720 accgtgtgat gcataaatta aatgcaaaag catattgtcg gggctttaaa tgtctcccct 780 cccaggcgac actcgcactt taagcttgcc cactttctat tttcaagcgc attaggtgga 840 cgtagcaaac ttactgattt tactcggaaa catatagtag gaaggtctct gtctctgttg 900 tggttggggc tgaaagagtg agaattaaag attacacaga tatatagctt gcctacgacc 960 cataagtatt tattcagaga ttaacgttaa aaggtcatct gtacggaaca tcgttgtgta 1020 cggaatatgg ttagtctacg t 1041 // ID DNA-1-1_DR repbase; DNA; ZEB; 1659 BP. XX AC . XX DT 14-OCT-2008 (Rel. 13.1, Created) DT 14-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 1 bp; DNA-1-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1659 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1209-1209 (2008). XX DR [1] (Consensus) XX CC The consensus, built from elements which are ~5% divergent from CC it, is characterized by ~84 bp TIR. The TSD is 1-bp long. This CC element inserts preferentially into the CWG target site, and CC duplicates the central W; the classification of this element is CC unknown. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1659 BP; 502 A; 311 C; 292 G; 553 T; 1 other; ggggcctgtt tcagaaagga ggttaagtga aaactcagag tattttaacc ctgaaatgag 60 agaaactctg ggttttccgt ttcaaaatgg caggtttgtt aaactcgaga aatcagggta 120 agtcaagcct gtttctgaaa gaaaggtaac tttaactcag agtcagttac tgtggtaact 180 tactctgtga atctaacctg gtccggagca ggttttattc tctaaactct gagtttctgt 240 cggtctcctc ccctttttta aagatgaagc ggtatttctc gccttagcct tacgtttcca 300 cacacctatt ttagagctca ttttggagat gcccataaaa acgattgatg gaaacgtcat 360 gattcacata acttttgaaa atacgcataa taaaacatgt gcataactga gtaggttaaa 420 cttttatttg ataagaaaag atgcgcataa actacgaagt gaacacttaa caaattacag 480 tatgtacatt aaaaaaaagg tttgttagag atgatgatga aaatgtgtgt gaatggacaa 540 accagcagac tgagcacact gtagaacatc ttaaatgttg tttcggtcat tctaaaatgc 600 cttaactgtt tcagtattag tgtatattat taattacctc cgagcgtctg gagcgtgtca 660 agatctccgc gtctcatggc ttcaaacgcc cccacgtgtt cattgtgtgt cagcattgtc 720 ttctgaggcg caagtacatt tattaaataa aaaatcattg acgcagcttc ttctaccgca 780 gtaaattctg tttttactgt tgatatttgg cgcaagttaa tcaggaagtg acgattgtgt 840 tctctttgac tcgttggata aaaacgctgc tttattcgca tcttttaggc gttattccag 900 ttttgcacaa atttatgtaa tagcctatat tttgatggaa acacagctat tgcctacatt 960 tttgtcacac acagaacaaa gtcacgtacg agaatggctt gtcctttttt cataaataat 1020 tagcttaacc ttcgacatgt actgttacta gattaatggg attattattc tgtctaaaaa 1080 ttatttttag cactgcttac cttctwaatg ttatatcaac ctctatcact tgatcaataa 1140 aaaggcagac acacaagtgc actgttaaat ctattaaaac tgataaacgt gtataaaagt 1200 ttagaaaaaa tttataaacg tcacacatta cattacttat tttattatac ttaaacgttt 1260 aaatacttta aattaaaaat tacgcattaa cttgagcagc tgttttccca cgctaactgt 1320 ctctgtttca ctgatggttg cttcttttta aatatataca ctcatatttg ctataacttt 1380 gcatgagcag atcaagttca gccggagaaa gaaatgctga tctctttttt ttaagatgtt 1440 gccatggtca ctcgtaatat ctgcgctcca ttgataatgc ctttttatag tcacggtgtg 1500 cgcgcttaac tctgagttgg tctactcaaa gttgattgac ccaactcaga tcagctattc 1560 tgaaaccgaa aactctgagt ttttaatctc tcggtaaatc aactcagagt tcaagtttaa 1620 actccgagtt gtttgaacct ccttactgaa acaggcccc 1659 // ID hAT-3N1A_DR repbase; DNA; ZEB; 1245 BP. XX AC . XX DT 30-SEP-2005 (Rel. 10.09, Created) DT 30-SEP-2005 (Rel. 10.09, Last updated, Version 1) XX DE hAT-3N1A_DR is a nonautonomous DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; hAT-3_DR; hAT-3N1B_DR; hAT-3N1A_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1245 RA Kapitonov V.V. and Jurka J.; RT "hAT-3N1_DR, a family of nonautonomous hAT DNA transposons from RT zebrafish."; RL Repbase Reports 5(9), 255-255 (2005). XX DR [1] (Consensus) XX CC hAT-3N1A_DR is a nonautonomous DNA transposon that belongs to the CC hAT superfamily. It is a subfamily of the hAT-3N1_DR family. This CC element is a deletion derivative of the hAT-3_DR autonomous CC transposon. XX SQ Sequence 1245 BP; 377 A; 227 C; 248 G; 392 T; 1 other; caggggtcgg caacccgcgg ctctagagcc gcatgcggct ctttagcgct gccctagtgg 60 ctccctggaa ctttttcaaa aaatgtttga aaatggaaaa cgatggggga ggtaaatata 120 tttttgtttt aatatggttt ctataggagg acaaacaatc ttaacgtttt ccaatgctgt 180 aaaagtgtgt agaatattta atttcaacat ttctgtcaac gaagatttgc gtcatagcct 240 gcgacacacg tttctatcag cagggcggga tgccaggcag gtagctgttg taaacaaacc 300 ggcggctgtg tgatgcgcca tggagcgcaa gctgtctttg ccaaagatgt acagcggggc 360 acgctttctc attttccctc cctgagagaa ttcaaagaag cccatcccga tcactcactc 420 aacggtgatt atttacaagg tgcgatcgtt gatatgcaaa ctgcatttgg gagcagattt 480 tgcgagttgc gaaaggaaaa aacgagactg tctttccttg tcacacaccc ctggagattg 540 acccttcctt gttgagcaca ttcccaggaa tgactcgagc tgatcttgaa atggaaatgg 600 caggctgcgt tttattgcac tgtttgttgc ccagataagg ggcacgttat gcacgttcat 660 tttgatgttg ttttttggaa tcctcaaaat aaaaaaaaac atcaaaaaat ctgatttctt 720 tactgcattt ctttaatttc atcaaagcaa aacatgatgc attaatattg taatgaagtt 780 aaacttgaac attaaacata attcattrat attgtaatga agttaaactt gaacattaaa 840 cataattcat taatattgta atgaagttaa actttaacat taaacataat tcattaatat 900 tgtaatgaag ttaaactcga acattaaaca taatgcatta atattgtaat taagttaaac 960 ttgagacggc atcgtacaac agagtagtca cgtggtgcgt cattctctac aggatgcact 1020 gcagggaaaa aaaaacattt aatcatgaag gctcagtatg tatttgtagc caactaaatc 1080 atttttatag taggctaata tagctaatat aaatacatac agcatgtgtt gccttcatta 1140 taaggcttat ataaggcttt taattttttg cggctccaga catatttgtt tttgtttttt 1200 ttggtccagt atggctcttt caacattttg ggttgccgac ccctg 1245 // ID hAT-N59_DR repbase; DNA; ZEB; 1212 BP. XX AC . XX DT 23-OCT-2008 (Rel. 13.1, Created) DT 23-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N59_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1212 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1328-1328 (2008). XX DR [1] (Consensus) XX CC This element generates 8-bp TSD; most elements are inserted into CC TA satellites. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1212 BP; 336 A; 275 C; 301 G; 300 T; 0 other; cagaggtcgc gttaaccgaa tattttccgg acaatttttt tcagcagtga cggaaaaatc 60 tgaaggtgac gtcattttga cggattatgc tgagggtgat ctattgtaat ttgtagctaa 120 ctgtaattcc cactcctgtc cagctgatgg cgattgccct tgttttgtct cctctttgtc 180 accgctgcgt acagttgcaa aaatctcaca gcagctgagt gtgagaagtt tctttaaacg 240 gccaaatagt tgtgatatta ataataaagg tgaaaaaaag aggaagtgat gcagtggacg 300 atgaagaaca aacagacaaa aatcctcagt cgccgacagc tgttgttacg ggccaaccaa 360 gcggcaacaa ggaggtcagg tggtttggag tgcgttgtga gcaccatgca ctaccgaacg 420 cagaggggtg aagccgcgcc cctggtgctg aaggagcgcg tgctgcagta ttttttcccg 480 gtgaaacgag tgcttgcaag accgggaaat cccacattca gaaaatccag caaacttctg 540 atcacttcct taggagaaat gtttcctaac tacaaaacgc tggctgaagt ggcgctaccc 600 agatagcagg tagtaatcgg cccgagctcg gctgcccttc ggcgcctccg gctccgactc 660 ggcatcggcg agtgttatcc gggccgagta tggcccgcag ctcgctcccg cacatgcgtt 720 tgtgatgcgg ctgtaaagca ctggcccgat tcctgttaac catatctggc ccgagtctgt 780 ctgtagagtc cgggccgctt ctggcaagga ctctccagtg tggcagcaga acgcggcttc 840 agcctccaga acaaatttaa aactgtcacg agaagtcgac tttccgaggc aaagacacag 900 aatttaatga caactgcctc tgcatcagtt tctcttgatg cgtttgatta cgcacaggca 960 agcacccagt tcaggtcgac gcaggaggaa gatatgaggt aaggcagata tgagttattg 1020 ttcatttgtc aaataaataa atgaataatt tagcctatat gctcgtcgtg caagttcatt 1080 attaaaatat gaatgaaaaa agcgattaga aaatatgcaa attaattgac ggtcaataat 1140 aggttatgac ggaattttta caaccctctc cgtcaaaatg acggacaatg acaaagtcta 1200 acgcgacctc tg 1212 // ID L1-13_DR repbase; DNA; ZEB; 5781 BP. XX AC . XX DT 13-FEB-2009 (Rel. 14.02, Created) DT 13-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE L1-13_DR is a non-LTR retrotransposon from the L1 clad. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; L1 clad; L1-13_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5781 RA Bao W. and Jurka J.; RT "L1 non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 551-551 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 163..1050 FT /product="L1-13_DR_1p" FT /translation="MMSKNKSAANAPPTTRTSVASKLATMATAAKSEETQE FT DDNGVTMSQLVAELTKQRTSIKEDISALIQESIVSLQTSVNALTETMASFQ FT SRLCATETLASENFSALAAAEKTIKALKEQNTTLLDRVDDLENRSRRANLR FT ILNIPENSEKGQPTVKFVSEMLMEIMGEEVFEKPPELERAHRSLGPKPQDG FT RPPRPLVVCFHRFQEKEKALRWARQNEVKYKGSTLRIYPDISADLARKRAM FT FKDVKQLLYQKKVRFQLLHPARLRVHYEEQTFLFDTPEKAQQFFDQRIATH FT DNS*" FT CDS 1617..5429 FT /product="L1-13_DR_2p" FT /translation="MANTGDKENTKSSAVRFVSWNVRGLNGPVKRTRIFTH FT IKKLKTEIAFLQETHLRIEDHNRLRKNWVGQIYHSSFNHRTRGAAIIIHKK FT IQFIASETISDPQGRYIIVSGHLFEVPVVLVSVYAPNWDDVNFVKKLFSLL FT PDLNSFHLILGGDLNCVMDTTMDRSNPKIVPLLKMSQVFAGYMNQIGCIDP FT WRFLFPDKKVFSCFSHVHQTYSRIDYFFADKRLIPAIKNIEYSTIIESDHA FT PVIFDICFSYNYTTRPVWKLNTTLLSDSNFCKFISRAIDNFLINNKSDSIS FT PSLLWETFKAVIRGEIISYSTSRNKERKQTEEKLIESIQKIDRQYADTPTP FT ELYREKLSLKTKYELLSTEKTERNLFFARARFYEHGDKAGRLLAQQLKSKS FT ASRLIPKIKNTKQDITVDPQEINNIFYKYYSDLYTSEFPQEDSPMSTFLSN FT INLPTVKMDQRKNMDEPLQLQEVENSIKAMQSSKTPGPDGFPIEFYKKFSS FT QICPLLLNMFNHSFEQTKLPPTLTQAHITVLLKPDKDALYCSSYRPISLLN FT VDVKILSKLLASRIENIIPEIISQNQTGFIKGRHSFINIRNLLNVVHSPAS FT EDSPEVIISLDAEKAFDRVEWKYLFAVMHKFEFGSKFISWMRLLYHQPKAA FT VLTNKIVSQYFSLSRGTRQGCPLSPLLFILAIEPLSTILKSSQSIRGIKRM FT GVEYKVSLYADDLLLYITDPLSCISEIDKILKEFGNFSGYKLNFSKSICFP FT INNMAXXIADTDLPFCFSKSGFKYLGINITRSYTDLFKANYSPILKKLELD FT FQRWNVIYLSLAGKVNCVKMNVLPRLLYLFQSLPIFLPKSFFQSTNRLLSS FT FIWGGKKPRIRRDFLERPKGDGGLALPNLLNYYWAANLQKIIYWLKFPHTE FT WCEAEAKSCKSTSLPALITMKTPFSPSQYSSSPVVISTLKIYNQFRQALQF FT QDFSLETPICNNHLFPAANLDATFKKWQDLGILTCRDFFIDNIFANFNDMS FT KKFNLQKSDFFRYLQVRHFVQTQCSVFPQMPAESGLDLILKTPVHPKGLIS FT KMYNLIMTFXNTTLEKIKMEWIEELGINISEETWDKAVKIINKTSSCARLN FT LIQMKILYRIHYSKTKLAKIYPNIDETCDRCNSCKADLTHMFWSCDKLRRF FT WSSIFEILNSAFNLRIQPNPEMAIFGVTNEDIHLAKEIENAFAFVTLIARR FT RILMEWKSPMPPKLSLWLSDVMLFLKTEKIKYFLRGSTKKFFKTWDPLIIH FT IETNVLLQP*" XX SQ Sequence 5781 BP; 1956 A; 1126 C; 930 G; 1760 T; 9 other; ttgaaatttg ggacttccgg tttggcaatg tgagaagacg acgcctatca acagagctcc 60 cgatcatttc tgataatata actttaaata gtggcattta aataatctaa aacgtttttt 120 taagtgttaa aacagtgctt aacacgaacg tgtcgagact acatgatgag caagaataaa 180 tcagccgcaa atgccccgcc aacgacgaga accagcgtgg ctagcaaatt agccacgatg 240 gctaccgcgg ccaaatccga agaaacccaa gaagacgata acggcgtaac aatgagccaa 300 ctcgtagcag aactaacaaa gcaacgaacc tcgataaaag aagacatctc agctctgatt 360 caagaatcaa tagtgtctct tcagacatcg gtcaatgcct tgacggaaac gatggcgagt 420 ttccagtctc gcttatgcgc cacagagaca ctcgcgagcg aaaacttttc agcgctggct 480 gcggccgaaa aaactattaa agccctgaaa gagcaaaaca cgacactatt ggatcgagtg 540 gacgacttgg aaaaccggtc gcggagagca aatctcagga ttttaaatat cccggagaac 600 agcgagaaag gtcagccgac tgttaaattt gtgtccgaga tgctaatgga aatcatgggt 660 gaagaagtct tcgaaaaacc tccagaattg gaacgagcac accgttcgct cggtccaaaa 720 ccacaggatg gccgacctcc tcgaccactt gtcgtgtgtt ttcacagatt tcaagagaaa 780 gaaaaagcat taagatgggc caggcagaat gaggtaaaat acaagggctc gactttgagg 840 atttatcctg acatctccgc cgatctcgcc agaaagcgtg ccatgttcaa agatgtcaaa 900 caactgctgt atcaaaaaaa ggtacgattc cagttactgc atcctgcacg gctacgagtt 960 cattatgaag aacaaacatt ccttttcgat acgccagaga aagcgcagca atttttcgac 1020 cagcggattg ctacacatga taactcttga cttcacaaac actgtgaact gatgaaggat 1080 cacgttgact ttcagacact gaacttgtaa gatacataat atatttaact gggctgctat 1140 gcagattgtt taatataaat agatatccgc gttgaacaat tccgcagtaa gttgagatat 1200 gttttatcta actttattat taagggaaac tgtcttatta ttatttaaga tctgcagata 1260 ctttattttt atttattttt attatgcagc actgtattta ttaacggtac ttaaacacat 1320 ttatttgtaa gtttgattgg tcactttatg cctaacttat attactttgg gagctttgag 1380 gagaatagaa gcacatagac ttggaatatc gggagcagag agaagtttgt gaactgcggt 1440 ttgtcattgt tcaaagggca aatctagatt gtattatagt ttttgttgtt taccaaggtt 1500 caaccacgtt atctcgttat ttttattata attttttttt tttttttttt ctctcccccc 1560 aactccccaa cccacaccca atatacaaag tttgggtaca tagctgttga ataaaaatgg 1620 caaacacagg cgataaagaa aacacaaaaa gttcagcagt acgctttgtt agctggaatg 1680 ttagagggtt aaacggacca gtgaaaagaa ctcgcatatt tacacatatt aaaaaattaa 1740 aaacagaaat agcctttcta caggaaactc acttgcgtat tgaagaccat aaycgactta 1800 ggaaaaattg ggttggtcag atttatcact ccagctttaa tcacagaaca aggggagctg 1860 caatcataat ccacaaaaaa atacaattca ttgcatctga aactatttca gaccctcaag 1920 gacgatatat tattgtatcc gggcaccttt ttgaggtacc agttgtcttg gtcagtgtct 1980 acgcacccaa ctgggatgac gttaactttg tgaaaaaact tttttccttg ttaccagatt 2040 taaactcttt ccatctcata cttggtggag atctgaattg tgtaatggat acaactatgg 2100 accgttccaa ccctaaaatt gtacctcttt taaaaatgtc tcaagtattt gccggatata 2160 tgaatcagat tggctgcata gatccatgga ggtttttatt tccagataaa aaggtattct 2220 catgtttttc acatgttcac cagacgtatt cgcggataga ttattttttt gctgataaaa 2280 gacttatccc ggcaataaaa aacatagaat actcaactat tattgaatcc gatcatgccc 2340 ctgtaatttt tgatatttgt ttttcatata actatacaac aagaccagta tggaagctaa 2400 acacaacctt actctcagac tcaaactttt gcaaattcat atctcgtgca attgacaatt 2460 ttttaataaa taataaatca gactctatat caccatcact attatgggaa acgtttaagg 2520 cggtaataag gggagaaata atatcatact caacctcgcg taataaggag aggaaacaaa 2580 ctgaggagaa acttattgaa tcaatacaga aaatcgaccg ccagtatgct gataccccaa 2640 cacccgaact gtatagagaa aaacttagtt taaagacaaa atatgaatta ttatcaactg 2700 aaaaaacaga acgtaactta ttttttgcac gtgctagatt ttatgaacac ggcgataaag 2760 caggccgtct tctcgcccaa caattaaaaa gtaartcggc atctcggctc atcccaaaga 2820 ttaagaatac aaaacaagac attacagtag acccmcaaga aataaataat attttttata 2880 aatattactc tgacctgtac acctcagaat tcccacaaga agacagtccc atgtcaacat 2940 ttttatctaa tattaatcta cccactgtta aaatggatca aagaaagaac atggatgaac 3000 ctttacaact tcaagaagtg gaaaattcaa tcaaagctat gcagagtagt aaaacccctg 3060 gaccagatgg atttcccatt gagttttaca aaaaattttc ctcacaaata tgccctttac 3120 tgttaaatat gtttaaccac tcgtttgaac aaaccaaatt acctccaacc cttacacagg 3180 cccatattac agttctcctg aaaccagata aagatgccct ttactgtagc tcttatagac 3240 ccatttcctt actaaatgtt gacgtcaaaa ttctatctaa attgttagcc tcaagaatag 3300 agaacataat accagaaata atttcacaaa accagacagg ctttataaag ggacgccact 3360 cattcattaa catcagaaac ctcctcaatg ttgtgcactc tcctgcatca gaggatagtc 3420 ctgaagtgat tatctctcta gatgcagaga aggcattcga cagagtcgag tggaaatact 3480 tatttgctgt aatgcacaaa ttcgaattcg gctcaaaatt tatttcttgg atgcgcttgc 3540 tttatcatca accaaaagca gccgtactca ctaataaaat agtgtctcaa tatttctcct 3600 tgtctagagg aacacgacaa ggatgccccc ttagcccttt gctatttatt ttggctatag 3660 aaccattgtc aacaatactc aaatcatcac aatccatcag gggtattaaa agaatgggtg 3720 tagagtataa agtctctttg tatgctgatg atctgttatt atatataact gacccactat 3780 catgcatctc tgaaattgat aaaatattaa aagaattcgg taatttttca ggttataaat 3840 taaacttctc taaaagtata tgcttcccta taaataatat ggcagwtmaa attgcagata 3900 ctgacttacc cttctgtttc tccaaatcgg gatttaaata cttaggaatt aatattaccc 3960 gttcatacac agatttattc aaagcaaatt atagtcctat acttaaaaaa ctagaattag 4020 atttccaaag atggaatgtt atttatttgt ctttagcagg taaagtcaat tgtgttaaaa 4080 tgaatgtact acccagatta ttgtacttat ttcaaagcct cccaattttc ttgccaaaat 4140 ctttttttca gtcgaccaat agactactgt cttcctttat atggggtggc aaaaagccaa 4200 gaatacgcag agactttctc gaaagaccta aaggagatgg aggactggct cttccaaatc 4260 tgttaaacta ctattgggca gcaaatttac agaaaataat ctattggtta aagttcccac 4320 atacagaatg gtgtgaagca gaagctaagt cttgcaaatc aacttcccta ccagcactta 4380 ttacaatgaa gacacctttc tcaccatcac aatattcttc gagtccagtg gtaatttcga 4440 ccctcaaaat atataatcaa tttagacaag cacttcaatt ccaagacttt tctttagaaa 4500 cccccatatg caataatcat ctttttccag cagccaacct agatgctact tttaagaaat 4560 ggcaggacct gggcatactt acatgtagag attttttcat tgataatatt tttgccaatt 4620 tcaatgacat gagtaaaaaa tttaatcttc aaaaatcaga cttttttcga tacctccagg 4680 ttcgccactt tgttcagacc cagtgctctg tattccctca aatgcctgct gaatctggac 4740 tggatctaat tttaaagacc cctgtacacc ccaaaggtct catttccaaa atgtataacc 4800 ttattatgac attccrtaac acaacattag agaagattaa aatggaatgg atagaagaac 4860 taggaattaa tatttctgaa gaaacttggg ataaagcagt taaaattata aacaagactt 4920 catcatgtgc acgtcttaat ctgattcaga tgaaaattct ttaccgtatt cactacagca 4980 aaaccaaact ggcyaaaatc taccccaata tagatgaaac atgcgatcgt tgcaactcat 5040 gcaaggcaga cctgacccat atgttctggt catgtgacaa actgagacga ttttggtctt 5100 caatatttga aattttaaac tcagccttca atttaagaat tcaaccaaat ccggaaatgg 5160 ctatttttgg agtaacaaat gaggacatcc atttagctaa agaaatagaa aacgcctttg 5220 cctttgtaac attaatagct agaagaagaa ttcttatgga gtggaaatca ccaatgccac 5280 ctaaactgtc actttggctt agtgatgtaa tgctattttt gaaaacagaa aaaattaaat 5340 attttctcag aggatcaacc aaaaaatttt tcaaaacatg ggaccctcta ataattcata 5400 ttgagactaa tgtactcctc caaccataaa ttcccaaccc cacccccacc caccyttttt 5460 tttttttttt ttttttttwa atttaattta cttatttatt tattttttca tttatttatt 5520 ttttctgtaa atttatttta ttttcatttt tattattatt aatatatttt ttatttaatg 5580 catttccttt tcatcaaatt tatttattta tttatttttt ttttaattta tttatttttt 5640 ttattatgtc aatttgtctt tttttttgtt caatttatgt atggttaaaa aaaaaaaaga 5700 aaaaacgaaa atacctttgt aattgaatac ctttaatgta ctgtaccttt tgtcaaaaac 5760 acaataaaaa aaaaaaaaaa a 5781 // ID DNA-6-N5_DR repbase; DNA; ZEB; 3301 BP. XX AC . XX DT 19-NOV-2008 (Rel. 13.11, Created) DT 19-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE nonautonomous DNA transposon from zebrafish - consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; TSD 6-bp; KW DNA-6-N5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3301 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1717-1717 (2008). XX DR [1] (Consensus) XX CC TSD is 6-bp long. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 3301 BP; 1028 A; 626 C; 637 G; 1008 T; 2 other; tatgaaggta aatcmgttgc gaaactcgag agcttgcaaa tgtaaatgtg agcacttgtg 60 ataataatat ttgtatgtgt aggttgaaat gtacacttac agctctgatg tgaacagctg 120 aaatcctcga tttgcacaca cacacaacaa yccacacact tgtgttttaa attggaagtt 180 gcagattaat agttgtgtat ttgtaaaatg tcattcacaa atacaaagtg acagacacac 240 gtgtgcaagg caaatttgat tgcatctttg ctctacaacc acaggtcggc actcacaacc 300 tctcagattc gtcagtacaa ctacaaatct cccgcgctct gacttttgca acacatctcc 360 cgcgatctct gtagcagaac tcatctgtgc actcgcagat gcagaggcgt gagcagcgcg 420 gggcggggga ggggcggggc aggtgtaaag ctgtttgatt ggctgaagcc tgaccaatga 480 acaatgccag cagccaacag gactgaggtg caaaataata atttcccacg attatttcat 540 tagttagggt ttatttaaac tcttttctga agagattctt gttaaaaatc ataaattcta 600 ctgtaatgta cataccagta aaagagcagc aaagccagtt tattggctac agccagccaa 660 tgagatgggg cgtttctgtt tgtcagcaca cagggcagct cacgttgttg gaggcctcga 720 gcaatcagag ggtaagttat gattttttaa taattatcta tatgttcaat actgttagct 780 aagctaagga ctgtgctggg tgcttctctt gtttgtttct tatataagaa aacagttatg 840 agttatctag gagctgcttt caatcatgtt atatggttct aaagatacga ttcaacccac 900 atgaaataag aagttgtaat agcatttata gtaaacaata ttacgttttt gaaccatact 960 aactataata aactgtatta tactgtatat tttgcagtat ctacaacttt gttaatgaat 1020 gcaacagcac actgtagtat taactagcat gaactttaat gaattgaagt atactttata 1080 tgtaaaacct aatacatttt aaagcagttt catttctttc taatttatac atctacatgc 1140 attacatctg atttaggtcc acaacttgca tccttcaaca tgatatgtgt gttcaaagga 1200 agagcaacta ggtgagggag acagtgtcca tcgctcggca gagggtccac tttgagcatc 1260 tcatctgcag ggttaagcaa cataagctga acagtcatca tctctctcta ttacagcgag 1320 catcaaccag ctttacacaa tggcttgctt tctgaccatt agtaaaggct tgagcaaagg 1380 attaaagagg agataacatt tgatgggtaa tgatgaagtt ggatttacca cctccatctc 1440 attatgaaca aggaaaaacc ctgaaatatt tcaatacaag taactagaca tttgcgctgc 1500 acttatttgg actgtaaatt ttatttgaca aattacattc ccttacagaa caggtgtcaa 1560 actcagttcc tggagggctg cagctttgca aagtttagtt ctaaccctaa ttaaacacac 1620 ctgatcaaac taattgagtc cttcaggttt gtttgaaact ttcacttaag tgttttgaag 1680 cagggttgca actaaaaacg tgcagggtta cggcccttca ggaactgagt ttgacacccc 1740 tgccttacag taaagtgtaa aaatactaca ttaaacaatt atcaaaataa aagttgtgtg 1800 tgtgggtgag tgtgtgtttt gttcaaatct gtagtcatct tggcaggtaa aaagagaaaa 1860 taaatggtcg gcttcctctt gaatgatctg aatcattttg aagtgtttgt agatgagctg 1920 gatcatcatg tcatctttag cataaataac atccttttca agctatgcag cttctctgag 1980 ttcctcatcg acttgaaaca tgctgatctc tcctgtcact ccatccttac aggaaactga 2040 gaaataaagg tagtaacttt cataattaaa aaagtgatat aaaggacaca ctgtacacaa 2100 acatgtaaat agatagatgg atagatagat atgatgcagg ttataataac aaaaaccggt 2160 atgatctgta agaaactgaa acgcctagtg ttatttttaa ttgctattaa tgtgtgcgtg 2220 tgtgtaaata tacttcaatt cattaaagtt catgctagtt aatactacag tgtgctgttg 2280 cattcattaa caaagttgta gatactgcaa aatatacagt ataatacagt ttattatagt 2340 tagtatggtt caaaaacgta atattgttta ctataaatgc tattacaact tcttatttca 2400 tgtgggttga atcgtatctt tagaaccata taacatgatt gaaagcagct cctagataac 2460 tcataactgt tttcttatat aagaaacaaa caagagaagc acccagcaca gtccttagct 2520 tagctaacag tattgaacat atagataatt attaaaaaat cataacttac cctctgattg 2580 ctcgaggcct ccaacaacgt gagctgccct gtgtgctgac aaacagaaac gccccatctc 2640 attggctggc tgtagccaat aaactggctt tgctgctctt ttactggtat gtacattaca 2700 gtagaattta tgatttttaa caagaatctc ttcagaaaag agtttaaata aaccctaact 2760 aatgaaataa tcgtgggaaa ttattatttt gcacctcagt cctgttggct gctggcattg 2820 ttcattggtc aggcatcagc caatcaaaca gctttacacc tgccccgccc ctcccccgcc 2880 ccgcgctgct cacgcctctg catctgcgag tgcacagatg agttctgcta cagagatcgc 2940 gggagatgtg ttgcaaaagt cagagcgcgg gagatttgta gttgtactga cgaatctgag 3000 aggttgtgag tgccgacctg tggttgtaga gcaaagatgc aatcaaattt gccttgcaca 3060 cgtgtgtctg tcactttgta tttgtgaatg acattttaca aatacacaac tattaatctg 3120 caacttccaa tttaaaacac aagtgtgtgg attgttgtgt gtgtgtgcaa atcgaggatt 3180 tcagctgttc acatcagagc tgtaagtgta catttcaacc tacacataca aatattatta 3240 tcacaagtgc tcacatttac atttgcaagc tctcgagttt cgcaacggat ttaccttcat 3300 a 3301 // ID BEL4-I_DR repbase; DNA; ZEB; 6339 BP. XX AC chr16; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW BEL; LTR Retrotransposon; Transposable Element; BEL4-I_DR; KW BEL4-LTR_DR; BEL4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6339 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1359-1359 (2008). XX DR Genome; chr16; Positions 11384236 11390574. XX CC Positions [5264-5851] - Integrase core CC 'CCTAA' target site duplication CC LTRs are 100% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 104..6277 FT /product="BEL4-I_DR_1p" FT /translation="MAEEALGKSVKQLKKERTTAKSSFTKQANFLCREAHR FT LIESELKEEFKKLSSGARKVFETNDDYKAGLIAEIEATEDGAEVQLSAQQE FT ADFKKAVEECNAKFDEVTEIVQDNLWQRYGQSDVLTTISAAEKACDRTAAM FT PVDGTGYESYEVQLELSMRLVKEATVTLLKWERWIPAPERKDLESHVRKLK FT ECSNELESRKAEFAWARRAAEDTQALDTSMTNNPNIMVPAGQIIPVVKIKP FT TSLPKFTGNKRDYYRWKRDWENLQKQGEPTGSVEVKKIQLVDSVDDRIAKE FT LRLSSYNTADDIFRVLDNRYGNKITIAMEIVEELEKMPALRGNQPRRVIEL FT IQMVEKALADLTDLGNVGAMKNPLVIKSIESKLPELLKREWLVYMTDPANN FT ITPDKHFDELLQFLQKQEEVLERLEQLRISERVEKPERKFEKNYAFTRTTK FT KDSTEAGCIVCGAEKHRDRIFFCRKFKDLKLTEKKSIIRKLGACKKCLGVH FT EDDSRCSDNYLCRNKDCKKGDFSDHHYFLCPKWDLKNLNEGKTISKDSRKG FT SGLTAEQEHFLAELSPELAEKCKKAFTNKVKMINCLGNDQQGLVSKSGLKE FT LPVIMMLMEVTTNAGQKIGALIDLASDTNYITHEAAERLKLRGEKITLVVH FT GVGKMAIRVSTNRYLLRIRVKTPEGTVKAHQLICYGLEEIAKVHRIVSQEK FT LREFFPEVELEELKRPGKIDLLISHREGRLAPQRVKIVGDLVLWDSPLGKT FT VAGAHPDLFEVVDMAAYETKTHFARSMRTAAARYEEIIEKAERPAITDIVK FT EGKSPRSSLASNREFIDWWKWDSIGTACEPKCGGCRCGNCQPGGKEMTLAE FT ERELEIIREGLTYVEMDTHSTAPHWDTKYPWIQDPATLPYNRSGVEAAFLR FT TEKQLKKVPEWQDIYAAQVHDMVKRGAATKLTKQVIEEWKGPVWYVSHLVA FT PNPHSTTTPVRLVWNSSQKFKGLSMNDLLLKGPDVLNPIRAVLLRFRKGVF FT AALGDIKKMYNSVWLEEREVHLHRFLWRDTQDEEISEYAITRVNIGDRPAG FT CIAQLAMRETARLAKFAHLKEECRVLEEDSYVDDILTSHNSQERLKEITNG FT LEEILAAGGFALKPWVRSEQSGRNEVVDPSQVPKESISQSKTIILPNQLRD FT EDNKALGTGYLVEEDKLYIMASINFSVRKKKMRTGQNLLKSEVRLRTPNPL FT TRRHLLSQVAGLYDPVGLVTPAKQKGAILVRKAFQETGKGSLTRETWDMPL FT SENLREEAIRLFEEYVQLNQVKFHRSLTPPGWKGKPWGITFSDGSDKTYGA FT VLYLRWNTDQGIDVRLVESKAKLTPLDQKGDAVKAEICGAVFAARLRKYFE FT KHGRMEVERWFHLVDSQTVLGAIQRDSYGYQTFFANRVGEIQKSGSVSDWW FT WIPGDINIADIITRGGTPEDLAEESEWQEGPQFLRYSVEEWPKKSAAEVAA FT DAKENVNKLQRKTFSAVLTRSQTKRDLAEAGKGIPEVSILGDSINVVSDSG FT ENGKRPNLPKGTPRDLIIKKLVDEEKYSNLTKLVRVIGWVWRAVKKWLGFR FT DQDSNERKRKEVLQNKIKQTMLTVKEEEYVLKELFLAAQQGVTFPDTTLSR FT LAVYRENSGLLVCGGRIQIFNEEKIAVPILPFSAWVSMLLAREAHSANHEE FT VAGTLLRMRKKAWVIKGRRLAKKVVDSCVICRKARAKRCEQIMADLPPERT FT EPAAPFEYTTLDLFGPYIVKDEVRKRVHLKVWGIVYCCMASRAIHTDIVSD FT QSAEGFMLAYQRFTALRGHPRKLWSDPGSNFIGVKPALTELYKFLDKLETS FT ELEEKAAKHGTEWVWKIHPASSPHRNGAAEAAVRIVKRALHNLGGDGVFTW FT SEFQTFLYMAANLANERPIDARTQSREDCVEYVTPNSLLLGRSGPKGDPGD FT FDFVGYPYKRLKVIQGEVNKFWRKWSQLAGPNLFVRNKWHTKERNVAVGDV FT VWLADQNALRSQYKLARVIQVNADDKGIVRDVNVKVFTSYPVSNVKPAKAK FT VTTKKLTDRIPATILHRDVRRLVVLIPIEEQH" XX SQ Sequence 6339 BP; 2175 A; 1163 C; 1575 G; 1426 T; 0 other; gttgataacc aagagtgtag cagctggaca taaagatcaa cggctttgtg acgtccaaag 60 acaaagttct gctgacacca agtgaaagaa aaattcaagg aaaatggctg aggaagcatt 120 aggaaaatca gttaaacagc taaaaaagga aagaaccact gcaaaaagta gctttaccaa 180 acaagctaac tttctctgta gagaagcaca ccgacttata gaatcagagc taaaagaaga 240 atttaaaaag ctttcttctg gagcaagaaa agtgtttgaa actaatgatg actataaagc 300 tggattaata gcagagattg aagctacaga agatggtgca gaggtacaac tttcagcaca 360 gcaagaagct gactttaaga aagctgtgga agaatgtaat gcaaagtttg acgaagtaac 420 tgagatagta caagataatc tctggcaaag atatggacaa agtgatgtgt taaccacaat 480 ttcagcagct gaaaaagcct gcgatcgcac agccgctatg cctgttgatg gcactggata 540 tgaaagttat gaagtgcaac tggagctctc gatgagactg gtgaaagagg caactgtaac 600 tttgctaaaa tgggaaagat ggattccagc acctgagaga aaggatcttg aaagtcatgt 660 tcgcaaactg aaagagtgta gcaatgagct tgaatcaaga aaagctgaat tcgcttgggc 720 ccgaagagct gctgaagata cacaagcatt agacacgagt atgacaaaca atccaaacat 780 aatggtacct gcaggacaga tcataccagt ggtaaaaatt aaaccaacaa gtctacctaa 840 attcactggt aataagagag attactatcg ttggaagaga gactgggaaa atctccagaa 900 gcaaggtgag cccactgggt ctgttgaagt gaaaaagatt cagttagtgg atagcgtgga 960 tgacagaatc gcaaaagagc tcagattgtc ttcctacaac actgctgatg acatattcag 1020 ggtgttagac aaccgatacg gtaacaaaat tacaattgcc atggaaattg tagaggaact 1080 agagaaaatg ccagctttga gaggaaacca accaagaagg gtaatagaat taatccaaat 1140 ggtggaaaaa gccttggctg atctcacaga tcttggaaac gttggagcaa tgaagaatcc 1200 tcttgtgata aaatctatcg aaagcaagtt acctgaactc ttgaaaagag aatggctcgt 1260 atatatgaca gatcctgcaa ataacatcac accagacaaa cactttgatg agctgttaca 1320 attccttcag aaacaggaag aagtgcttga aaggctggaa caacttagga tctcagaaag 1380 agtggaaaag ccagaaagaa aatttgagaa gaattacgct ttcactagaa cgacaaagaa 1440 agactccaca gaagctggtt gcatcgtgtg tggggcagag aaacatagag acaggatttt 1500 cttctgcaga aaatttaaag acttgaagct gacagagaag aaatctatca tcaggaaact 1560 aggagcatgc aaaaagtgct tgggagttca tgaggatgac agcaggtgca gtgataacta 1620 cctttgcaga aacaaagatt gcaaaaaagg agacttctca gaccaccatt acttcctctg 1680 tccaaaatgg gacctcaaga acctgaatga agggaaaacc atcagtaaag atagtagaaa 1740 aggaagtgga ttgactgcag agcaggaaca tttcctggct gaactttctc cagaattggc 1800 tgagaaatgc aaaaaagcat tcaccaacaa agttaagatg ataaactgct taggaaatga 1860 ccagcaggga ttggtcagta aaagtggatt aaaggagctt cctgttatca tgatgctgat 1920 ggaagtaacc acgaatgcag gacagaagat tggcgctttg atagatctgg cttctgatac 1980 aaactatatc acccatgaag ctgctgaaag acttaagctg agaggtgaaa aaatcactct 2040 agtggttcat ggagttggca agatggcgat cagagtaagc acaaatagat accttctcag 2100 gattagagtc aaaactcctg aggggacagt gaaggctcat cagttgatct gctatggttt 2160 ggaagaaatt gccaaagttc atagaatcgt aagtcaagag aaacttagag aattctttcc 2220 agaggtggaa ttagaagaac tgaaaagacc aggaaaaata gatctcctca tcagtcaccg 2280 agagggcaga ctcgctccac aaagagtgaa gattgttggg gatcttgtcc tgtgggacag 2340 cccgctgggg aaaacagtgg ctggagcaca ccctgatttg tttgaagtgg tggatatggc 2400 tgcatacgag accaaaacac attttgctcg gtccatgaga acagctgcag ccaggtatga 2460 agaaatcata gaaaaagcag aaagaccagc aattactgat attgtcaaag agggaaagag 2520 cccaagaagt tcactagcaa gcaacagaga gtttattgac tggtggaaat gggatagtat 2580 tggcacagct tgtgagccaa aatgtggagg atgtcgctgt ggaaactgtc agcctggagg 2640 gaaggagatg accctagcag aagaaagaga actggagatt ataagagagg gtctcacata 2700 tgttgaaatg gatacccaca gcacagctcc acactgggat acaaagtatc catggattca 2760 ggacccagct acacttcctt acaacagaag tggagtagaa gctgctttcc tcagaacaga 2820 aaagcaactg aaaaaggtac cagaatggca ggacatttat gcagctcaag tccatgacat 2880 ggtgaaaagg ggtgcagcaa caaagcttac taaacaagta attgaggaat ggaagggacc 2940 tgtttggtat gttagtcacc tggtagcacc aaatccgcat tcaactacca ctccagtgcg 3000 acttgtctgg aacagtagtc agaaattcaa agggctaagc atgaatgatc ttctgttaaa 3060 aggaccagat gtactcaacc caatcagagc tgtcttgcta agattcagga agggagtatt 3120 tgctgctctt ggagacataa aaaagatgta caattcagtg tggctggaag agcgtgaggt 3180 tcatcttcac agatttctct ggagagacac tcaagacgaa gaaataagtg agtacgctat 3240 cactagagtt aacattggag atcgtcctgc tggatgcatt gcacagttag ctatgcgtga 3300 gacagctagg ctggcaaagt ttgctcattt aaaagaagag tgcagagtcc ttgaggagga 3360 cagttatgtg gatgacatcc ttacatccca taatagccag gagagactga aagagataac 3420 aaatgggcta gaggagattc tagcagctgg cggatttgca ctgaaaccat gggttcggtc 3480 ggagcaaagt gggaggaatg aagtggttga tccaagtcaa gttccaaaag aatcaatctc 3540 tcaatctaaa acaatcatac taccaaacca gctacgagat gaagacaaca aggctttggg 3600 cacagggtac ttggtagagg aggataagct gtacatcatg gcttccatta atttctcagt 3660 aagaaagaag aaaatgagaa cgggtcagaa cctcttaaaa agtgaggtga gactgagaac 3720 accaaaccct ttgaccagaa ggcacctgtt aagccaagtt gctggcctct atgacccagt 3780 tggtcttgtc actcctgcca agcagaaggg tgcaattctt gtcaggaaag ctttccaaga 3840 gacaggaaag gggagtctaa cccgtgagac atgggacatg ccactttctg aaaacctcag 3900 ggaagaagct atcagattat ttgaggaata tgtgcagctt aatcaggtca agttccacag 3960 aagtcttaca cctcctggct ggaaaggaaa accctgggga atcactttct cagatggtag 4020 tgataaaaca tatggagctg tattgtacct aagatggaat acagaccagg ggatagacgt 4080 cagactagtt gaatctaaag caaaactaac accactagac cagaaaggag atgctgtgaa 4140 ggctgaaatc tgcggtgcag tttttgctgc acgactcagg aagtactttg aaaagcatgg 4200 gcggatggaa gtggagcgct ggtttcacct ggtagacagt caaacggtgc taggggccat 4260 tcaacgtgat agctatggct accaaacttt ctttgcaaat agagttggag aaatccaaaa 4320 gtctggatca gtaagtgact ggtggtggat accaggggat atcaatatcg ctgacatcat 4380 tactagagga ggcactcctg aagatcttgc tgaagagtcc gaatggcagg agggaccaca 4440 attcctcaga tattcagtag aggagtggcc aaagaaatca gctgctgagg ttgctgctga 4500 tgccaaggaa aatgttaaca aacttcaacg gaaaacattc tcagcagttc tgacaagatc 4560 tcaaacaaag agggacctgg cagaagcagg taaaggaatt cctgaagttt caatccttgg 4620 tgattcaatc aatgttgttt ctgactctgg tgaaaatgga aaaagaccaa acttaccaaa 4680 aggaacacct cgtgacttaa ttattaagaa actagtggat gaggaaaaat acagcaattt 4740 aaccaagcta gtcagagtca tcggctgggt ttggcgtgct gtaaagaagt ggctaggatt 4800 cagggatcag gactcaaatg agagaaaaag aaaagaagtg ttgcaaaaca agatcaaaca 4860 aaccatgctg actgtgaaag aggaagagta tgttctgaaa gaactctttc tagctgctca 4920 acagggtgtg acttttcccg acactacact tagcagactg gctgtgtata gagaaaattc 4980 aggactcttg gtttgtggtg gaagaataca gatctttaat gaagaaaaga ttgcagtccc 5040 aatattacct tttagtgcat gggtgtccat gttattagcc cgagaagccc acagtgcaaa 5100 tcatgaggag gtagcaggga cacttctaag aatgaggaaa aaggcgtggg tgattaaagg 5160 tcggagacta gccaagaaag ttgtagacag ctgtgtgatc tgcaggaagg caagagcaaa 5220 gagatgtgag caaatcatgg cagaccttcc tcctgaaagg acagaaccag cagcaccctt 5280 tgaatacaca actttagacc tgtttggacc atacatagta aaggatgaag tgaggaagcg 5340 agtacatctc aaagtttggg gaatcgttta ctgctgtatg gcatctagag cgatacacac 5400 tgacattgtg agtgaccaat ctgctgaagg gtttatgcta gcctaccaaa gattcacggc 5460 tttgaggggt catccaagaa aattatggtc cgaccctggc tcaaatttta tcggagttaa 5520 acctgctctc acagaactct acaagttcct ggacaagtta gaaacatctg agctagagga 5580 aaaagctgct aaacacggaa ctgaatgggt ttggaagatc catcctgcaa gttctcctca 5640 caggaacgga gcagcagagg ccgctgttcg catagtaaag cgagcccttc acaacttggg 5700 aggtgatggt gttttcactt ggagtgaatt tcaaacattc ctttatatgg cagctaacct 5760 ggctaatgag aggcctattg atgcaaggac gcagagcaga gaggattgtg tagaatatgt 5820 cactcccaac tctctcctgt taggaagatc tggaccaaag ggagacccag gtgactttga 5880 ttttgttggc tatccttata aaagattaaa agtcatccaa ggagaagtta acaaattctg 5940 gaggaaatgg agtcagttag cagggccaaa cttgtttgta agaaacaaat ggcacaccaa 6000 ggaaaggaat gttgctgttg gagatgtcgt ctggttagca gaccaaaatg cactgaggag 6060 tcagtacaag ttggccagag tcatccaagt caatgctgat gacaaaggta ttgtcagaga 6120 tgtgaatgtt aaagtcttca caagctaccc agtctcaaac gtaaagcctg ctaaagccaa 6180 agttacaacc aaaaagttga ctgacagaat ccctgcaact atcctccaca gagacgtaag 6240 acgtttagtt gtcttaatac ctatagagga acagcactaa acacgcacta aattctgtga 6300 acaacttggg ttcaaagaac cagtagttca agtgggagg 6339 // ID Gypsy-31-LTR_DR repbase; DNA; ZEB; 954 BP. XX AC . XX DT 26-APR-2007 (Rel. 12.05, Created) DT 17-MAY-2007 (Rel. 12.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-31_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Interspersed repeat; Gypsy superfamily; Gypsy-31-I_DR; KW Gypsy-31_DR; LTR; endogenous retrovirus; Gypsy-31-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-954 RA Dib M.R. and Naveira H.F.; RT "Gypsy-31_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(5), 155-155 (2007). XX DR [1] (Consensus) XX CC Gypsy-31-LTR_DR is a long terminal repeat of the Gypsy-31_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy-31-I_DR. XX SQ Sequence 954 BP; 241 A; 173 C; 154 G; 386 T; 0 other; tgtgatggcc agtgtggtgg ccacacattg agagtgaggc atgcacgcac acatacatac 60 acaaacatat atatatatac acacacacac acacacacac acacacacac acacacacac 120 acacacacac acacacacac acacacacac acacacgtta tacatattca tttaagatta 180 tttattggtc acaatttcag ttattgtgtt tatacatttt tcgtttcatt catatctgat 240 ttaaataaga ctcatacact ttttcagttt ctcattacct ttctttatta ttcacagtac 300 atttattact attttgggtg cacacaaaat cagttatctg tactgtatgt tttgttaatt 360 tagttttctt ttgagtttaa gttaccgtgc tgcagtgccg acgaggaaca gagtttccgg 420 gtgtggtccg cccaagtaaa ggtccgacgt gccgcaagat gccgtctttt ggttccgcct 480 gccgatacca ttggcttaat ggcttggagg gagagctcat gtttagttta ggttcaataa 540 tgatttgcaa tgtttgttta ttcagattta tatttagttt tgccttattt ataatttatt 600 taacttttat atttagattt gtatttagat gtttggctta gtttattgta tttgtaaatt 660 gtatgttttt gtctccccct tcttgtttaa tgtggactag tccctgtgct ataaatgtgg 720 tctgtcttgt catttcatgg agttctgttt ttggtttaga catgatttgt ttggtttgaa 780 ctcagtttgt tttccttgtt tagtttattg ttctttcaat tgatttaatt atttaattaa 840 tttatttttg aactttttgt gaaccttttt gactttatta aaaaataatt attttttgaa 900 aatcttcttt tttttgtatg ccccttactg tacgtgctgg ctcggtccct caca 954 // ID Gypsy10-I_DR repbase; DNA; ZEB; 6147 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 19-MAY-2005 (Rel. 10.06, Last updated, Version 2) XX DE An internal portions of the Gypsy10_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW Gypsy superfamily; Gypsy10-I_DR; Gypsy10-LTR_DR; Gypsy10_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX NM Gypsy10-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6147 RA Kapitonov V.V. and Jurka J.; RT "Gypsy10_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 313-313 (2004). XX DR [1] (Consensus) XX CC Gypsy10-I_DR is an internal portion of the Gypsy10_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy10-LTR_DR. Gypsy10_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one 1988-aa CC polyprotein composed of gag, protease, reverse transcriptase, CC and integrase domains (pos. 142-6105). XX FH Key Location/Qualifiers FT CDS 142..6105 FT /product="Gypsy10-I_DR1p" FT /note="Polyprotein" FT /translation="MSFRSDPQLQSELLEWCREAVIDPAHALLLTGVPQNT FT ETACIEKVAESVKLFGRVRVRDSKGGATPTTLLVLCECREVVDPTNCPEEL FT HPTDGEEAWMIILATERESAHVAPAEFADKLSKFLKDEGKSMTDVHALFSP FT QSANPSSPESIIRAVGEILEKTVRPSSDGSAYRRLRTFSGIVPTPVGEETM FT DHWIEQAKLMISECECSEKEKRRRIVESLKGPALEIIKAVRMSCPDAEAVK FT YVEALESTFGSSESGEDLYFAFRLLRQCPGESLSAFLRRMEKSLTKVIQKG FT GLTSANADKARVEQLIRGAVESDMMLLQLRLRERKENPPTFLSLLNEVREA FT EEIEATRRKITATAKPIHLQEESICPTVVHELKAEIQELRAQIKSDRSNIA FT SSMMIENRVKSRQLSPTDSKEVVKETEVQVLKKQVQQLQQQLAIMSVGQSN FT STSHAQLPQTPVSSSSRHQLSKPKADYFCYRCGEEGHIATKCKAPENATLV FT INKLVRSLKRAKGTKSDPNSHAEDRACFSKKSQIHSCESKGLPKGLVGPAS FT TIAVKVGGHPCCALWDSGSQVTIVFDSWYSEHLSNVPILPLSGLSIWGLSS FT SNYPYKGYIVVDVTFPVALTGAEETVTILALVCPDPKGPQQFPLIIGTNAS FT FFQRMTNGEGTNITRSAHSLRIQTHLDTTSFAVDQPEGQVRWKGPGMLNVP FT SQGERYASCKIESDKPLRKDIFLIETSPVDSLPAGLLVSPVVFSSSAVDVN FT NFRILIQNETSKELSVPPGTVVAQIFPTDTVTVAHGVTESNDQINPDVFNF FT GDSNIPTAWEKRLRLKLAERKNVFSTQEWDVGLAKGVTHQIRLHDPHPFRE FT RSRRIAPADIDDVRRHLKDLLAAGIIEESRSPYASPIVIVRKKSGAVRMCI FT DYRTLNSRTTPDQYVTPRIDDALDCLAGSKWFSVLDLRSGYYQIAMSEEDK FT EKTAFICPLGFYQFQRMPQGITGAPATFQRLMEKVVGDMHLLQVIVYLDDL FT IVFGSTLEEHEERLMKVLDRLEEWGLKVSIDKCQFCQPKVKYVGHIVSAAG FT IAPDPEKVAAVTQWKEPTDLKSLRSFLGFCGFYRRFVKNYSSIVRPLTELT FT KGYPPAKGKSKVEGKKYFKETDQFGERWDNACKQAFQEIIKCLTQAPVLAF FT ADPSLPYVLHIDASLSGIGAVLNQEHSDGLRPVAFASRKLSASEQRYHIHQ FT LEFLALKWAVVDKFHDYLYGVPFVVRTDNNPLTYVLTSAKLNATGHRWLAA FT LATYNFSLQYKPGKHNIDADVLSRYPAEPATSFSWTEIPQSGVKAICQLSN FT LSWSDEKSRLIDQLGVSPGSIPAVYSCPALLDVCHLEQLTHADLKLSQEQD FT PVIGKVKQDIARNKPLTPIKGSDPTLTLLQRQGPKLVIRNNLLYRVTKNQS FT GKEKVQLVLPEKYHLTVLRSLHDDSGHLGVEKTTELLRDRFYWPRMTSDIE FT QYIKNCGRCITRKTLPQKSAPLSHITSSGPLDLVCIDFLSLEPDSKGIANV FT LVITDHFTRYAQAFPTKDQRAVTVAKVLVEKFFVHYGLPSRIHSDQGRDFE FT SRLIQELLGMLGIRKSRTTPYHPQGDPQPERFNRTLLSMLGTLEPAQKSKW FT SQHITQLVHAYNCTKNEATGYSPYQLLFGREARLPIDICFGISPAGEKGVT FT HLQYVEKMKAELQQAYQLAAETSLKAHQRNKKLYDTRVKPQLLTVGDRVLI FT RNLAVKGKNKLQDRWNSLPYVVVEKFKDLPVYKLRPERGMGAIRTMHRDHL FT LPVGENVRFSKPNDSNPSTQSPVTRAQSGKRVQKEKKVENVEQVRDENHET FT SESEDDNLCYYYPKLIPALRTLPTPQIAVEPEIPAEYGPELNSECEARKDT FT VEEEAREVLNQPAEAVDDQGVGDNDHRNVPKPGSEPAVCRKSTREVKPVIK FT LSYDDLGRPTDKPLTMVHRGMVVHIEDLSKTRKSCNTVWCHPMAQCSQCVP FT TAPGPIVRTVIQF" XX SQ Sequence 6147 BP; 1843 A; 1324 C; 1415 G; 1565 T; 0 other; caaattgggg gctcgtccgg gatacactct caccttttcc tgatacacta agaagagtaa 60 attaaacacc ttaatacacg ttaagaacac tgaagaggaa aaactgacac catcctgtaa 120 ctaactgctt attattgaga aatgtctttc agaagtgacc cacaactgca gagtgaactt 180 cttgagtggt gcagagaagc agtaatcgac ccagctcatg cgttgttgct gacaggagta 240 ccacaaaaca ctgagactgc ttgcattgaa aaagtggccg agagtgtgaa gctttttgga 300 cgagtacggg ttcgagactc aaaaggtgga gccactccga caactctgtt agtgctgtgt 360 gaatgcagag aagttgtaga tcccactaat tgccctgagg aattgcatcc tacagatggt 420 gaggaagcct ggatgattat cttagctacg gaaagagaat cagctcatgt tgctccagca 480 gagtttgctg acaagctctc caagtttctg aaggatgaag gcaagtctat gactgatgta 540 catgctctat tttctccaca gagtgcaaat cccagttcac ctgagtcaat aattcgtgca 600 gtgggtgaaa ttcttgaaaa aacagtaaga ccatcgagtg atggaagtgc ctaccgtcgt 660 ttacgtacct tctctggtat tgttccaacc cctgtaggag aagaaactat ggatcactgg 720 attgaacaag ctaagttgat gatatctgaa tgtgagtgct ctgaaaagga gaaacggaga 780 agaattgtgg aaagtttaaa gggaccagcc ctggaaatca ttaaagctgt ccgtatgtca 840 tgtcctgatg ctgaagcggt gaaatatgtg gaagctctgg aaagtacttt tggatcttct 900 gagtctggag aagacttata ctttgctttt cggcttctta gacagtgtcc tggtgagtca 960 ctttctgctt ttctgagaag aatggagaaa tcactgacta aagtcatcca aaaaggggga 1020 ctgacttctg ctaatgctga taaggctaga gtagagcaat tgattcgagg agctgttgaa 1080 tctgacatga tgctattgca gttgcgatta agggagcgga aagaaaatcc accaactttc 1140 ctgagtctgt taaatgaggt tcgtgaggct gaggagatag aagccactcg acgcaagata 1200 actgctactg caaagcccat acacttgcag gaagaaagca tttgtcccac tgttgtacat 1260 gaacttaaag cagaaattca agaattgagg gctcagataa aaagtgatcg ttcaaatatt 1320 gcttcatcca tgatgatcga gaacagagtg aaatcacgcc aattaagccc cacagactca 1380 aaggaagtag tcaaagagac tgaggttcag gtgttgaaaa aacaagtaca gcagttacaa 1440 caacagctag ctataatgag tgttggtcaa agtaattcaa caagccatgc ccagttacca 1500 caaacccctg tttcaagttc atcgcgacac cagctttcaa aacccaaagc tgactacttt 1560 tgttaccgat gtggtgaaga gggacacatc gcaactaaat gtaaagcccc tgaaaatgcc 1620 actcttgtaa tcaacaagct agtacgttcc ttaaaaaggg ctaaaggaac aaagagtgat 1680 ccgaacagcc atgctgaaga tagagcttgt ttttcaaaaa agagccagat acacagctgt 1740 gagtcaaagg gtcttcccaa gggtttagtt ggaccagcgt ctaccattgc agtgaaagta 1800 ggaggacatc catgctgtgc tctttgggac agcgggtctc aagtcactat tgttttcgac 1860 tcctggtatt ctgaacacct gtcaaatgtg cccatacttc ctctttctgg cctatccatc 1920 tggggcctaa gttcatccaa ttatccctat aaaggataca ttgtagttga tgtcacattc 1980 cctgttgctc ttactggtgc agaagaaaca gtcactatcc ttgctttagt ctgcccagac 2040 cctaaaggac cacagcagtt tccattaatc atcggaacca atgctagctt cttccagcga 2100 atgacaaatg gtgaaggtac taatatcaca cgcagtgccc actcactcag aattcaaaca 2160 cacctggaca ccacatcttt tgctgtcgac caacctgagg gccaagtgag atggaaaggc 2220 ccaggtatgc ttaatgtccc atcacaaggt gagcgatatg cttcatgcaa aattgagtct 2280 gacaaacctt tgagaaaaga catctttctc attgaaacct ctcctgttga ttcccttcct 2340 gctggactgc ttgtttcccc tgtcgttttt tcttcatcag cagtggatgt aaacaatttc 2400 aggatcttga ttcaaaatga gaccagcaaa gaactctcag ttcctccagg gactgtagtt 2460 gctcaaatat tccctacaga tacagtcact gttgctcatg gagttacaga gtctaacgat 2520 cagattaatc ctgacgtgtt taactttggt gattcgaaca tacctacagc ttgggagaaa 2580 agattacgct tgaagctggc tgagcgaaag aatgtgtttt cgacacagga atgggatgta 2640 ggcttggcta aaggagtcac ccatcaaatt agactgcatg atcctcaccc attcagagag 2700 cgttcaaggc gcattgcccc agccgatatc gatgatgtca gaaggcatct gaaagatctt 2760 ctagctgctg gtatcattga ggagtccaga agcccatatg catcgcctat agtaatagtg 2820 cgcaaaaaga gtggtgctgt gagaatgtgc attgattacc ggactctaaa cagtcgcact 2880 acacccgatc aatatgtcac ccctcgcatt gatgatgcat tagactgcct agcgggaagc 2940 aaatggtttt cagttttgga tttgcgaagc ggctattacc aaattgcgat gtctgaggaa 3000 gacaaagaaa aaactgcatt catttgccca ctggggttct accagtttca acgtatgcca 3060 caaggaatca ctggggcccc agcgacgttt caaagattaa tggagaaagt ggttggagat 3120 atgcatctat tacaagtgat tgtctatctc gatgacctca ttgtctttgg aagcacactg 3180 gaagagcatg aggagcgatt gatgaaagtc ctcgaccgac tggaggaatg ggggctgaaa 3240 gtgtctatcg acaagtgcca gttctgtcag ccaaaagtca agtatgtggg acacattgtt 3300 tctgctgcag gaatagctcc agaccccgag aaagtagctg cggtgactca gtggaaagag 3360 cctactgacc tgaaatcttt aagatctttc cttggatttt gtggatttta ccgccgtttc 3420 gttaagaatt actcctccat tgtaagacct ctgacagagt taacgaaagg ttacccacct 3480 gcaaaaggaa agagtaaggt ggaaggaaag aagtacttca aggagactga tcaatttggt 3540 gagcgttggg acaacgcatg taaacaagct tttcaagaga taatcaagtg tttaactcag 3600 gcacctgtac ttgcctttgc tgatccatcc ctaccatacg tactccacat tgatgcaagt 3660 ctgagcggga ttggtgctgt gctgaatcaa gaacactctg atggacttcg accagttgct 3720 tttgctagca gaaagttgag tgcttcagaa cagagatatc acatacacca gctagagttc 3780 cttgcattaa agtgggctgt tgtagacaag tttcacgatt acctgtacgg agttccattt 3840 gtcgtgagaa ctgataacaa tcctctaact tacgtactga caagtgcaaa gttgaatgca 3900 actggtcata gatggctagc agccttggca acatataact tcagcctgca gtataagcca 3960 ggcaaacaca acattgatgc tgacgtgctt tcccgttatc ctgcggaacc tgccacttcc 4020 ttctcttgga ctgaaattcc acagtctgga gtaaaagcta tttgtcagtt gtccaacttg 4080 tcttggagtg atgaaaagtc cagactgata gaccagttag gtgtttcacc tggtagcatc 4140 cccgctgttt actcttgtcc tgcgttgctt gatgtttgtc acctagaaca gttgactcat 4200 gctgacctaa aattgtcaca agaacaggat cctgttattg gcaaagtaaa acaagacatt 4260 gcacgaaaca agccactcac ccctataaaa ggttctgatc ctaccctcac tctcctgcaa 4320 cgccaaggtc ccaaacttgt cattcgaaat aatctgttgt acagagtcac caaaaaccaa 4380 agtggaaaag agaaagttca actagtgttg ccagagaaat accatttaac agtgctgcgg 4440 tctctgcatg atgattctgg tcacctagga gtagagaaga ccacagagtt actgagagat 4500 cgtttttact ggccacgtat gaccagtgac attgagcaat acatcaaaaa ttgtggtcgt 4560 tgtattacac gcaagacctt acctcaaaag tctgccccat taagccacat taccagcagt 4620 ggtccactag acttggtttg tatcgacttc ttgtcccttg agcctgatag taagggtatt 4680 gctaatgtgc tagtaataac tgaccacttc acccgctatg cacaagcatt tcctactaaa 4740 gaccagcgag ctgtgacagt tgccaaagtg ttggtagaaa agttttttgt tcattacgga 4800 ttgccctcac gcattcattc cgatcaagga agagactttg aaagccggtt gatacaggaa 4860 cttctgggaa tgttggggat ccgcaaatca cggaccacac cttaccaccc acaaggtgac 4920 ccacagccag aacgctttaa ccgtaccctt ctgtcaatgc taggtactct ggagccagcc 4980 caaaaaagta agtggagtca acacatcact cagcttgtac acgcttataa ttgtacaaaa 5040 aacgaggcta caggctactc cccttatcag ttgctttttg gaagagaggc tcgtctgccc 5100 atagacatat gttttggcat ttcacctgct ggtgagaaag gagtgactca tctgcaatac 5160 gttgagaaaa tgaaagctga actgcaacaa gcatatcagc tggcggctga gacttcgttg 5220 aaggctcatc aaagaaacaa gaagctctat gacacaagag tgaaacctca actgttgact 5280 gttggagaca gagtgctcat tcgaaatctt gctgtaaaag gaaagaacaa actccaggac 5340 agatggaatt ccttaccata tgtggttgtg gagaagttta aggacttacc ggtctataaa 5400 ctgaggcctg agcgtggaat gggagcaata aggacaatgc atcgagacca cttgttacct 5460 gttggagaga atgtgagatt cagtaagccg aatgactcca atccctcaac acagtcacct 5520 gtaacaagag cacaatcagg aaaaagagta caaaaggaaa agaaagttga aaatgttgag 5580 caagtaagag atgaaaacca tgagacatct gagagtgagg atgacaacct ttgttattac 5640 tacccaaagt taattcctgc tctgagaact ctgccaacac ctcaaattgc tgtggaacct 5700 gaaatacctg ctgaatatgg tccagaactg aattcagagt gtgaagcgag aaaagataca 5760 gtggaggagg aggcgagaga ggttctcaat cagcctgctg aagctgtgga tgatcaaggt 5820 gtaggagaca atgaccacag aaatgttccc aaaccggggt ctgaacctgc tgtgtgtcgt 5880 aagtctacga gagaagtgaa accagtgata aaactgagtt atgatgattt gggccgaccc 5940 actgataaac cattaactat ggttcaccga gggatggtag tacacattga agatttgtca 6000 aagacccgaa agagctgcaa cacagtttgg tgtcacccca tggctcagtg ttcccagtgt 6060 gtccctacag cccctggccc tattgtcaga acagtaattc aattttaaat gtctcatgag 6120 ggcatgagaa gtttagaagg gggagga 6147 // ID DNA-8-8_DR repbase; DNA; ZEB; 1125 BP. XX AC . XX DT 12-NOV-2008 (Rel. 13.11, Created) DT 12-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE DNA-8-8_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; TSD 8-bp; DNA-8-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1125 RA Bao W. and Jurka J.; RT "DNA transposons from zebrafish."; RL Repbase Reports 8(11), 1723-1723 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1125 BP; 324 A; 222 C; 195 G; 383 T; 1 other; caggggtctc aaactgccgg cccgcrggcc atttacggcc cgccctcctc ctccctccgg 60 cccgcaactg acgtcaaaaa tataacgaga ttcggcccgc caaaacatat atttttgaac 120 cactctatca ttgttgtgca gtcgcgtcta gccacttgtg gttttgaccc gccacctagt 180 cgacatttaa agtactgcat tcagattagt tttactttca ttttctctca gtgtacggtc 240 gagagtaaca cacatgcgtt ttaattctgt ggctgattta aacgttaaaa tatatgtcca 300 taagtgctct attttttact aaagctctat ttttgtaaat attcaagggt tgttttgatt 360 attatcagtt ttataccact tgtattttgt tgtacaaaca cttcttcatg gcttacacgt 420 tatgctggtg gtgtaacaaa tatgataatt ttgctaatat tcgatttaca ttcaaatggt 480 ctcatttaat agattttcct catatgctta ttaagatttg cataaaaact atgtacatta 540 gtaaaattgt actgctagtt gaattgaact tgttaaagta aatgtgaata tgtacatact 600 tttcccctct ttgttgtagt tttacaaaaa aaaagaaaga tattgagaag aacaattgcc 660 agtaaagtca cttaagttac ccaaactgct ttctgctgtt cttacgctat tgggacaata 720 caattggttg ggacagaata ataattatta tgcctattat ttgtagtatt ttcatagcaa 780 tttaaactac cccttcaata tgtacaacaa gaaacaaaaa gctagagttt tgactgcata 840 tactctgtgg aagaatgact gagtgtgtgt ctaatgctcg gccatacaca caacaaccac 900 agatatattt cagcagctga tgtgtgacac gataaacgtg ctgatttgca ttatcactgg 960 tgacaggtaa tgcttgaaat ataaacgtca tgtctgttgt cctaagatgc atcaattatg 1020 tcaggtttcc actttttttt cttgtgcttg aatgttaaaa cggccctcct atgaattgtc 1080 agtcactgaa aatggccccc cgccaatttg agtttgagac ccctg 1125 // ID I-1_DR repbase; DNA; ZEB; 5283 BP. XX AC AL672145; XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 01-FEB-2007 (Rel. 12.03, Last updated, Version 2) XX DE I-1_DR is a non-LTR retrotransposon from the I clade. XX KW I; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; RNaseH; I clade; I-1_DR. XX NM I-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5283 RA Kapitonov V.V. and Jurka J.; RT "I-1_DR, a first example of vertebrate non-LTR retrotransposons RT that belong belong to the I clade."; RL Repbase Reports 2(6), 18-18 (2002). XX DR Genbank; AL672145; Positions 25824 31106. XX CC I-1_DR is a non-LTR retrotransposon that belongs to the I clade. CC This element is characterized by 13-bp target site duplications. CC It can be an active element. It encodes two proteins: CC 417-aa I-1_DR1p (positions 3-1253) and 1250-aa I-1_DR2p CC (positions CC 1238-4987). These proteins are most close to corresponding CC proteins CC encoded by other I-like elements. I-1_DR1p is a putative RNA/DNA CC binding protein (it includes zinc-finger), and I-1_DR2p is CC composed CC of the AP endonuclease (aa positions 9-222), reverse CC transcriptase CC (positions 500-730) and RNaseH (positions 968-1095) domains. XX FH Key Location/Qualifiers FT CDS 3..1253 FT /product="I-1_DR1p" FT /translation="MAASSGGGAVQAVGNDWIENGQEWGDGNDGSGNEVEE FT MEGSESCEPWTNCRGKKRKKRKNKLDSDEEMRSKVKEGNEEYNVFVRLVQE FT GATFEDWSPIQLTKALYKEIGEVRCAKKLRNGCLLVSCKDEAQQKKAIKVN FT KINGKKVKCSEVYDRKLIRGVITGIPVSESLNNVIEGITNAKIKEAKRLKT FT RWNGAICDSLSIMLTFDETKLPDKVFIGYMSYEVKMYIPPPVRCYKCQKYG FT HIAAICKGKMRCSKCSGEHEYGQCDKEAKLKCCNCGGEHSSAYRGCEVNKR FT MQEIQRIKVTQGISYAEATKKVRPTREQMGQMPTMREERKDMIKCKKCDKI FT KEETLIVDKNEFVLFMAEVINCSAQTTSRTERIKIIVKAAEKYMGIKGIPW FT ESVRNTLNEEVQQQQSQAWGGVA" FT CDS 1238..4987 FT /product="I-1_DR2p" FT /translation="MGWSCIMVIFILQWNARSLISNGQEFKKYIDDLNEKP FT DIVCVQESWLISRLEFNIKGYNAVRMDRKIGKGGGIITFIKKGIQYREVKR FT GNELEYVIVEVWSSEGNIKIINFYNPCRLLEREQLEEIWEGINGKIIWCGD FT FNAHSTLWGNRNDNNGRVIEEFIEEEELVCINDGTGTRLNTARGTESAIDI FT TIVTKDIADRCEWEVLRGNTVGSDHYPIKTQVGIECAKEIEVREEKWILER FT ADWDKFREISEDLLQKIEDNLDVENMCKRISGGIIEAAKMAIPKSKPKIIN FT KIVPWWTKECRKAIKERNKAFKKMKTTHNFQNLMKYKQAQAIVRKTVKKSK FT KEYWRQFCESIGRTTPVERVWGMIKKMKGNGKEYGYPMLMDGQRVIINNKE FT KAEIIARTLIKVHSTDNLSQEEKRGRVETYERYQFDLGKDDGDQVLNINFS FT GTELSRALKKLGKTAPGRDGICYTMLENLTDKGKEVLLKLYNKIWEVGVIP FT KEWKKAVIIPIKKPGKDPKQPTSYRPIALTSHIGKTMERMINDRLVYWVET FT KRKIGNYQSGFRKGRGTMDPILRLEDDIKKAQVNRESVIAVFLDIEKAYDM FT LWRDGVLIKLNQIGVKGRILRWVKEFLSERSITVKINGTFSECYSVENGTP FT QGSIISPFLFSIMFDGIFKEIENNTGVALFADDGAIWKRGRNITFIMKKMQ FT QILNTVQEWTVKWGFRISQEKTKAMLFTKKKIREDLKLKLGGKDLENVESF FT KYLGVWFDRRLTWNTHISKMVDKCKRVLNVMRCLCGVDWGASRVALKSIYT FT GLIRSVIDYGCMVYGSAANTTLKQLDVIQNQALRVCCGAMKTTPVAALQVE FT MGEMPLHLRRDQLEVVYWANLKGHNENHIAQTVLMQCQEREKRGIKGYGWT FT IQQKINDMEIDTIKISPTIVFPVVPTWLLDDLEVDFEIMKEKQENEIDSKQ FT VENYIREKYSETTEIYTDASRIGQRVGVSFSIPKLKIEVTKRINNNLAVYT FT AELVAIWLALKWVEDNKPIKAVIASDSSSALISIKNVVSESRQDIIYEIVQ FT LGNNIIKSGVIISLLWVPAHIGVSGNEMADKLAKQAAQQTMIDMDIKYSKS FT EIKSIVKTKILGKWQHIWNNGSTGRQYYTIQNIVGKGRETRKNKKEEDKFS FT RMRFNHTSLNSTLHMINKHADGMCECNNQETVEHVLMHCPIYQTERNILFT FT QLQEKQVEPNIKNILKLSTGDVCFRYVYNYLKDTGLINRI" XX SQ Sequence 5283 BP; 2143 A; 603 C; 1253 G; 1284 T; 0 other; gaatggcggc atcaagtgga ggaggagcag tgcaggcggt tggcaacgac tggattgaga 60 atggtcagga gtggggagac ggaaatgacg gatcaggaaa tgaagtagag gagatggagg 120 gtagcgagag ttgcgaaccg tggacaaact gtagaggtaa aaagcggaaa aaaagaaaaa 180 acaaactaga tagtgacgaa gaaatgagat ccaaagtaaa ggaaggaaat gaagagtata 240 atgtgtttgt tagactagta caggaagggg cgacatttga ggattggagt cctatacaac 300 taacgaaagc tctgtataag gagattgggg aggtaagatg tgctaaaaaa ttaaggaatg 360 gatgcttatt ggtgtcatgt aaagatgagg ctcaacaaaa gaaagcaatc aaagtaaata 420 aaataaatgg taaaaaagtg aaatgctctg aggtctatga cagaaaactt ataagaggag 480 taatcacagg cataccggta agtgagtcat taaacaacgt gattgaagga ataacgaatg 540 ctaaaataaa agaagctaaa cgcttaaaaa caagatggaa cggagccata tgtgacagtc 600 tttcaataat gctgacattt gatgaaacaa aactacctga caaagtcttc ataggataca 660 tgagctatga agtgaaaatg tacataccac cgcctgttag gtgttacaaa tgccaaaaat 720 atggtcatat tgcagcgatc tgtaaaggga aaatgagatg tagcaaatgt agtggagaac 780 atgagtatgg acaatgtgat aaggaagcaa aactcaaatg ttgcaactgt ggtggagaac 840 atagttcagc atacagaggg tgtgaggtaa ataaaagaat gcaagagata cagagaataa 900 aagtcactca aggtatatct tatgcagaag caacaaagaa agttaggcct acaagggaac 960 agatgggtca gatgccgaca atgagagaag aaaggaaaga catgattaag tgtaaaaaat 1020 gtgataaaat aaaggaagaa acactgattg tggataaaaa tgaatttgtt ctctttatgg 1080 cagaggtgat aaattgctca gcacagacaa caagtcggac ggaaaggatt aaaataatag 1140 ttaaagccgc ggaaaaatac atgggtatta aaggaatccc ttgggaatca gtgagaaaca 1200 cgttaaatga agaagttcag caacaacagt cacaagcatg gggtggagtt gcataatggt 1260 aatatttata ctgcagtgga atgcgaggag tttaatatct aatggtcagg aatttaagaa 1320 atatatagat gatttaaatg aaaaacctga tatagtgtgt gtacaagaat catggttaat 1380 atcaagacta gaatttaata ttaaaggcta taatgcagta agaatggata ggaaaatagg 1440 taaaggtggg ggaattatta catttataaa aaaaggtatc caatatagag aagtaaagag 1500 agggaatgaa ctagaatatg ttattgtgga ggtatggtca agtgaaggta atattaaaat 1560 aataaatttt tataacccat gtagattgtt agagagggaa cagttagaag agatatggga 1620 gggtattaat ggaaaaatta tctggtgtgg agattttaat gcacacagta cattatgggg 1680 taacagaaat gataataatg ggagagtaat tgaagaattt attgaagaag aagagctagt 1740 gtgcattaat gacgggacag gaactagatt gaatacagca agaggcacag aatcagcaat 1800 agatattaca atagttacaa aagatattgc agataggtgt gaatgggagg tattaagagg 1860 taacacagtt ggaagtgacc actacccaat caaaactcaa gtaggaatag aatgcgcaaa 1920 agaaattgaa gtgagagagg agaaatggat tttagaaaga gcagattggg ataaatttag 1980 ggaaatcagt gaagatttgt tgcaaaagat tgaggataac ttagatgttg aaaatatgtg 2040 taaaagaatt agtgggggaa taattgaggc agcaaaaatg gcaataccta aatcaaaacc 2100 taaaataatt aataaaattg ttccatggtg gacaaaggag tgtagaaaag ctataaagga 2160 gagaaacaaa gcttttaaaa aaatgaaaac gacacacaac tttcaaaatc tcatgaaata 2220 caaacaagca caggcaatag ttaggaaaac ggttaaaaaa tcaaaaaaag aatattggag 2280 acaattttgt gagtcaattg gtagaacaac gccagtagag agagtatggg gaatgatcaa 2340 aaaaatgaaa ggaaatggaa aagaatatgg atatccaatg ctgatggatg ggcagagggt 2400 tattattaat aataaagaga aagcagaaat tatagcaaga acattaatta aagtacacag 2460 cacagacaat ttaagccaag aggaaaaaag agggagggtg gagacttatg aaagatatca 2520 gtttgattta gggaaagatg acggggatca ggtactaaat ataaatttct cagggactga 2580 gctgagtagg gcattaaaga aattagggaa aacggctcca gggagagatg gaatttgtta 2640 tactatgtta gaaaacctaa ctgataaagg aaaggaggtg ctgttgaagt tgtataacaa 2700 gatatgggag gtaggagtta ttccaaaaga atggaaaaaa gcagttatca ttcccattaa 2760 aaagcctggg aaagatccca aacagccaac cagttataga ccaatagccc taacatcgca 2820 tattggaaaa acgatggaaa gaatgattaa tgacagatta gtgtactggg ttgaaactaa 2880 aagaaagata ggaaattatc aaagtggatt taggaaaggt agagggacaa tggatccaat 2940 attgaggctt gaagatgata ttaaaaaagc acaggttaac agggaatcag taatagcggt 3000 gtttttagac atagagaaag catatgacat gttgtggaga gatggagtgt taattaaact 3060 taaccaaata ggagttaaag gacgcatatt gagatgggtt aaagaatttt tatcagaaag 3120 atccataaca gttaaaataa atggtacatt tagtgaatgt tacagtgttg aaaatggcac 3180 accacaaggg agtattatta gccctttttt gttttctata atgtttgatg ggatctttaa 3240 ggagatagaa aataatacag gagttgcatt atttgctgat gatggtgcga tttggaaaag 3300 agggagaaac ataacattta taatgaagaa aatgcaacag atattaaata cagtgcagga 3360 atggacagtt aagtggggat ttagaatatc tcaagaaaaa actaaagcca tgttatttac 3420 caaaaagaaa ataagagagg atttaaaatt gaaattggga ggtaaagatt tggagaatgt 3480 tgaatctttt aaatatctgg gagtgtggtt tgataggaga cttacatgga acacacatat 3540 tagtaaaatg gtggataaat gtaaaagagt gttaaatgta atgaggtgcc tatgtggtgt 3600 agactggggt gctagtagag tggcattaaa atcaatttat acagggttga taagatcagt 3660 gatcgattat ggatgtatgg tgtatggatc agctgctaat acaacattaa aacagctaga 3720 tgtaatccaa aaccaagcat taagagtatg ctgtggagcc atgaaaacca caccggtagc 3780 agcattacag gttgaaatgg gagagatgcc tctacacctt aggagagatc agctggaggt 3840 agtatactgg gcgaacttga aaggccataa tgaaaatcac atagcacaga cagttctgat 3900 gcaatgccaa gaacgagaga aacggggcat taaagggtat ggatggacaa tacaacaaaa 3960 aataaatgat atggaaatag atacaataaa aatttcgccc actatagtat ttccagtagt 4020 tcccacatgg ctgcttgatg acttggaagt tgattttgaa ataatgaaag aaaaacaaga 4080 aaatgagatt gacagcaaac aagtggaaaa ttatataaga gaaaaataca gtgaaacaac 4140 tgaaatatac acagatgcat caagaattgg acaaagggta ggagtgtcat tcagtattcc 4200 aaagttaaaa attgaagtta caaaaagaat taataataat ttagcagttt atacagcaga 4260 attggtagcg atatggttgg ctctaaagtg ggtagaggac aataaaccaa ttaaagcagt 4320 cattgcgtct gattcaagtt cagcactcat aagtattaaa aatgtagtat cagaatcacg 4380 gcaggacata atatatgaaa tagtacagtt gggaaataat atcattaaat caggagttat 4440 catttcatta ttgtgggtac cagcgcacat aggggttagt ggaaatgaga tggcagacaa 4500 gttggctaaa caagcagcac agcaaactat gatagacatg gacatcaaat acagcaagtc 4560 agaaatcaaa agtatagtta aaaccaaaat attaggaaaa tggcaacata tatggaataa 4620 tgggagtaca ggacgtcaat attacaccat acagaacata gttgggaaag gtagggaaac 4680 aaggaaaaac aaaaaggaag aagacaagtt ctcaagaatg agatttaatc acacatcact 4740 caatagcaca ctacatatga tcaacaaaca tgcagatggg atgtgtgaat gtaacaacca 4800 ggaaactgta gaacatgtat taatgcactg cccaatatat cagacagaaa gaaatatatt 4860 attcacacag ttacaggaaa aacaagtgga acctaacata aagaacatac taaagttgag 4920 cacgggtgat gtatgtttta gatatgttta taactatttg aaagacacag ggttgattaa 4980 tagaatttag tatttagatg aaaagttatc tatatcatat agaggtatta tagagaaaga 5040 aatatcaaag tttgggacat ttagtaattt atttatatgt ataggaaaaa gaaataaaaa 5100 tttgatttta aatttttttt ttttgggttt tttttttttt tttgtgattt ttatggttgg 5160 gctttgttta actataaaac atggtttagt taactccgga tccacactcc aatccagatg 5220 gtggcggtaa tgcaccaaaa gctggttgcc aaccgccata aaacacacaa gaagaagaag 5280 aag 5283 // ID hAT-N70_DR repbase; DNA; ZEB; 1325 BP. XX AC . XX DT 31-OCT-2008 (Rel. 13.1, Created) DT 31-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative nonautonomous hAT DNA transposon - a consensus. XX KW hAT; DNA transposon; Transposable Element; Nonautonomous; KW 8 bp TSD; hAT-N70_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1325 RA Bao W. and Jurka J.; RT "hAT DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1339-1339 (2008). XX DR [1] (Consensus) XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 1325 BP; 477 A; 187 C; 172 G; 488 T; 1 other; gagatatatc cagctgcagg cccgcagaac cacacacgtt tgaggcacac acaatctagg 60 acacacaatc taggcaaacc atatatggta actaactctt tagcattatc gccacctatt 120 gaatttcaca aatatggttt cccatttcag tcattattat tattattata tcgtaacgat 180 cattttaaca tctctgcagt ttctgaactt aaattatgta aataaaaata attggttaaa 240 gacacttatg actttattca tgtgtccata tacacaaaga aaacataata gaccgtctaa 300 ttgttggatt aagtgtaata aactaattca agctcttaat attatcatac ttggttaaaa 360 tcactttgtt gtaaaatata aaaagcaaac aataatatgt cactttaaaa taaattagcc 420 tactatagtt aaccatttaa acaatatata tagttataaa ctgattatct taactgatta 480 taattataag tttatatata aactactatc ttttactaaa ctgtcactgt taaagctgtt 540 atatatatat atatatatat atatatatat atatatatat atatatatat atatatatat 600 atatatgtat atgtgtgtgt gtgtgtgtgt gtgtgtgtat atatatatac acttatgatt 660 tttttattat taatagttaa tttgttccta tttctaaatt ttggtttaaa aacaatttat 720 tgctgtacta ttaaattact acagtattaa atgtgggaaa tcattaatta aaaggttatt 780 agaaaattaa gctaataaac actattaaat cgcttgagta aaatattaat atagtgttat 840 acatttgtta catgtaaaaa gatttttatt acatataatg tattattgcc tatattaact 900 tgctattatt tccttcatca taaattaatc cattatataa attaaacctg tccaaatgag 960 atcattcagc aaggccagca aacgagtatc tgccatatta ttgtcagata ataatgactg 1020 atcagctata atgactgatt aaaaactgtt castctatta tattgaatgg cgatcggaaa 1080 caatgctccc cattcggccg tctcaatcgc aatatttgtg aaataaagta aagtaaacac 1140 tattgtttat tgtgtacaat atttattaat catatatatc aaaatatatc gatgacgatg 1200 tccaaaaaga cgttaataac tttaacatcc gtcgtaacca tatatggttt gcctagattg 1260 tgtgtcctag attgtgtgtg cctaaaacgt gtgtggttct gcgggcctgc agctggatat 1320 tattg 1325 // ID Gypsy86-LTR_DR repbase; DNA; ZEB; 139 BP. XX AC chr1; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy86-I_DR; KW Gypsy86-LTR_DR; Gypsy86_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-139 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1484-1484 (2008). XX DR Genome; chr1; Positions 53844206 53844068. XX CC CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 139 BP; 43 A; 22 C; 36 G; 38 T; 0 other; tgtattgata ggtgttttga agacctatca taagtgacca ctagatggca gtacgaggcc 60 atggtaagcc tagttgagtt atgtgagaag aataaagcat tctgacaaaa gtactcggtg 120 tgtgagcctc gtcaataca 139 // ID Gypsy89-I_DR repbase; DNA; ZEB; 4368 BP. XX AC chr10; XX DT 01-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: internal portion. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy89-I_DR; KW Gypsy89-LTR_DR; Gypsy89_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4368 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1489-1489 (2008). XX DR Genome; chr10; Positions 10914557 10918924. XX CC Positions [3265-3744] - Integrase core CC 'TATAT' target site duplication CC LTRs are 96% similar to each other. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 832..4353 FT /product="Gypsy89-I_DR_1p" FT /translation="MNEGEVLHLLWEAGSFPGNMSRVVGKSQISFRCRGPG FT ARVGVIPCPIFSGLTIQAVISLQQPHEVEIFIDSGAAGNFMDLKCAQDLSC FT DLLTLQNPLKVSAVDGNPLGLGHIHQCTTALHVRIGDHEENLQFHIIHSPK FT IPIILGHPWLVTHNPLIDWSKGVIVSWRCKGSSLNTMFGQSTKDSTLCHCP FT LPLETQPEPPRHTKAIPLDIMTIPKPQIVPKSLNTPPEISRVPPEYSDLAE FT VFSKTRAASLPPHRPYDCPIDLLPGTCPPRGKLYSLSGPERAAMEKYVHES FT LDSGFIRPSTSPAGAGFFFVGKKDGSLRPCIDYRGLNSITVKNRYPLPLMT FT TAFKILQGATIFTKLDLRSAYHLVRIRQGDEWKTAFNTPTGHYEYQVMPFG FT LANAPAVFQSFIYDVLREMLNIFVFVYLDDILIFSHNPEEHVIHVRKVLIE FT LLKHGLFVKLEKSEFHVSSVSFLGFIVSKGSLQIDPSKTRAVLDWPQPTSI FT KEVQRFLGFANFYRRFIRNFSSIAEPLTSLTKKANTPFTWNDKASTAFNTL FT KHRFTSAPILTLPDPELPFILEVYASDIGVGAVLSQRSKADNKLHPCAFYS FT HRLTPTQANYDIGNRELLAIKLALEEWRHWLEGASHHFLIWTDHQNLTYIQ FT NAKRLNARQARWSLFFNRFKFTLSFRPGSKNIKPDALSRQFEPSEKPCCPE FT PIVPGSRIVAPILWDLESSIRKAHQHHPDPGNGPSGSLFVPNSMRSDVLQW FT VHFSSPSGHPGINRTYKLILRKFWWPMMRKDVQEFVSSCPTCAQSKEPRTR FT PQGLLQPLSIPSRPWSHISLDFITGLPPSQGNTVILVVVDRFSKMCHLLPL FT PKLPTANQTAELLMRHVFKMHGFPQDIVSDRGPQFSSRFWKAFGRLIGSSI FT SLSSGYHPQSNGQTERVNQEVEKTLRCLAGDNHSSWTSHLVWVEFAHNTLH FT HSSINMSPFECVYGFPPLLFPGQEPQVEVPAATRFIHRCLRSWQKARTSLR FT KAVHNQQKQANRHRRAGPSLRTGQRVWLSTRNLPLRVESRKLAPRYVGPFK FT IIKKINPVAYRLLLPSALKIHPTFHVSQLKPVLCSTLHPNRAPTPALRIVD FT GHPAFTVRRLLKCRQVRGRTQYLVDWDGFGLEERTWASAKDILDPSLIKEF FT HQAQREHKGNVRRRS" XX SQ Sequence 4368 BP; 1101 A; 1186 C; 960 G; 1121 T; 0 other; gcaggactga accatggaaa accgcccagc agaggaggct gcaccagtct caaatgtgga 60 tcgcattctt gcggcacttt cagaacaagc caccgctatt caccagcatg accaaacttg 120 cgaaattctt agcctcttga atccccactc tcagactaca atggcccagg taattacgcc 180 acagtctgtt cccttgtcaa cacctgagcc caagttgccc cctccccagc gctttgatgg 240 cagtttcgag aactgcaggg gattcattac tcaatgtact ttgatctttc aactacaacc 300 tagttgtttt cctactgact gcagtaaggt ggcatatata attacactcc ttacgggcaa 360 ggctctggat tgggcctcag ccttgtggga taagaagtcc cctgtaacca ctaatctcca 420 gctcttccta gcagagatga agagagtgtt cctccattca gcagacagtg gagaggcagg 480 acgccgtctc ctaagactat ctcaggggtc ccgcagtgcc gctgagtttg ccattgagtt 540 ccgtactctg gccaccgaga gtgggtgggg ccaatctgca cttagagcga tctttcacaa 600 tgcattgtct cccaagctca aggatgaact ggcattcaga gactacccat cagaccttga 660 gtccctcatt ggtttagcca ttcgggtgga taaccgcctt aaggagtggc agacagagag 720 agagcaggac acaattatga cccgtcatga caatgttgtt tgtcccacac tggatcctga 780 ggaacctatg caattagggg ctacaaggat ctcgctctct gaaagaaccc gatgaatgaa 840 ggagaagtgc tgcatttatt gtgggaagca gggtcatttc cgggcaacat gtccagagtt 900 gtcgggaaaa gtcagatctc gttccggtgt aggggccccg gggcgagagt aggtgtgatt 960 ccctgtccca ttttctctgg tttgaccatc caggcagtca tctctctcca acaaccccac 1020 gaggtggaga tcttcattga ctcaggtgca gcaggaaact ttatggacct caagtgtgct 1080 caagatcttt cctgtgatct cctaaccctg caaaaccccc ttaaagtctc agcagtagac 1140 ggcaatccgc taggcttagg ccacatccac caatgcacca cagctctcca tgtgagaatt 1200 ggagatcatg aagaaaacct tcagtttcac atcattcatt cccctaagat ccccataatc 1260 ctgggacacc cctggttagt aacccataac ccacttattg attggtctaa aggagttata 1320 gtaagttgga gatgtaaggg gtccagtttg aacaccatgt ttggtcagtc aaccaaggat 1380 tcaacccttt gccattgccc tcttcccctt gagactcagc cagaaccccc caggcacacc 1440 aaggccatcc ccctggacat catgactatt cccaagccac agattgttcc caagtcgctc 1500 aatactcccc ccgagatatc ccgagttccc cccgaatact ctgacttagc tgaggttttc 1560 agtaagactc gagcagcctc actgccccct cacaggccat atgattgccc cattgatcta 1620 ttacctggaa cctgcccacc ccgcggaaag ttgtattcgc tctctgggcc tgaaagagct 1680 gctatggaga agtatgtaca tgagtcccta gatagtggtt ttatccgccc ctccacgtca 1740 ccagccgggg caggattctt ttttgttggt aagaaagatg gttcccttag accctgcatt 1800 gactacagag gtctcaacag cattacagtc aagaatcgct accctcttcc cctaatgaca 1860 acagccttca agatcctaca aggagctacc atctttacca agctggatct ccgtagtgca 1920 taccatctag tcaggatcag gcaaggggat gagtggaaga ccgcgtttaa cactcccact 1980 ggccactatg agtaccaagt tatgcccttt ggattagcca acgcgccagc tgtcttccag 2040 tcatttattt atgatgtact acgagagatg cttaacatct ttgttttcgt ctatttggac 2100 gacattctca tcttctctca taaccctgag gaacatgtga tacatgttcg gaaggtactg 2160 attgaacttc tcaaacatgg gcttttcgta aagctagaga agagtgaatt tcatgtttcc 2220 tctgtctctt tccttggatt cattgtttca aagggcagct tacagataga tcccagcaag 2280 actagggctg tgctggactg gccacagccc acctctatca aagaggtgca gcgcttccta 2340 gggtttgcaa atttttatag gaggtttata aggaacttca gttctattgc cgaaccacta 2400 acttccctga ccaagaaggc gaatacccct ttcacctgga acgataaggc cagtacagcc 2460 tttaacaccc tcaaacatag attcacttca gcacctatcc tcactttacc agatcctgag 2520 ttgcctttta tcctggaggt gtatgcatca gatatagggg ttggagcagt tctatcccaa 2580 agaagtaaag cagacaacaa acttcatcca tgtgccttct attcccatag acttacgccg 2640 actcaggcca actatgacat tggaaaccgc gaactactgg ctattaagtt agctttggag 2700 gagtggaggc actggcttga gggagcatct caccacttcc tcatttggac tgatcaccag 2760 aacctcactt acatacaaaa tgccaagaga ttgaacgcta gacaagctag atggtcactg 2820 ttctttaacc gtttcaaatt cacactttcc ttccgcccag ggtccaaaaa catcaagcct 2880 gacgctctct caagacagtt tgaaccttct gagaagccat gctgccccga acctatcgtg 2940 cctggctcca gaattgtggc tccaatactg tgggatttgg agtcctcaat cagaaaggca 3000 catcaacatc accctgatcc aggtaatgga ccatctggga gcttatttgt ccctaactct 3060 atgcgttctg atgtcttaca gtgggtacat ttttcctccc cctcaggaca ccctggtatt 3120 aacagaacct acaaattgat tttgaggaag ttctggtggc caatgatgcg aaaagatgtg 3180 caagagtttg tttctagttg ccccacctgt gctcagagca aagaaccccg aactcgaccc 3240 cagggacttc ttcagccact ttccatcccg agtcgcccct ggtcccatat ttctcttgat 3300 tttatcacag gattgcctcc atctcaggga aatacggtaa tccttgtggt ggtggacagg 3360 ttctctaaga tgtgtcactt gttaccatta ccaaagcttc ccactgcgaa ccagacagct 3420 gaactactaa tgagacatgt ctttaagatg catggcttcc cacaggatat agtctctgat 3480 cggggccccc agttctcatc taggttttgg aaagcttttg ggcggctcat tggctcatcg 3540 atcagtctgt cctctggcta ccacccccag tctaatgggc agacagaaag ggtaaaccag 3600 gaagtagaaa aaacgctaag gtgcctggca ggagacaatc attccagttg gacctcccat 3660 ttagtttggg tagaatttgc tcacaatact ttacatcact cttccatcaa catgtctcct 3720 tttgaatgtg tctatggctt tccccctctt ttgtttcctg gtcaggaacc acaggtggaa 3780 gttccagctg ccacccggtt tattcaccgt tgcctgcgtt cctggcagaa agctcgtact 3840 tccctgagaa aggctgtcca caatcagcag aaacaggcca acagacatcg cagggcagga 3900 ccctccctcc gaactggcca aagagtatgg ctctcgaccc gaaaccttcc tttgcgggta 3960 gagtcacgta agttggctcc acgctatgtt ggtcctttta agattattaa gaagatcaat 4020 cctgtagcct accggcttct actacccagt gccctgaaga tacatccgac atttcatgtt 4080 tctcagctaa aaccggttct ttgctccacc ctgcatccca acagagcacc cacacctgct 4140 ttacgcattg ttgatggcca ccctgctttc acggtccgca gactacttaa gtgtcgccag 4200 gtacgaggga ggacacagta cttggtcgac tgggatgggt ttggtctgga ggagcgtacc 4260 tgggcctcag ctaaagacat tttggaccct tccctcatca aggaattcca ccaggcacaa 4320 agagagcaca aagggaacgt caggagacgt tcttagagga ggggctac 4368 // ID Mariner-N1_DR repbase; DNA; ZEB; 270 BP. XX AC . XX DT 31-DEC-2007 (Rel. 12.12, Created) DT 31-DEC-2007 (Rel. 12.12, Last updated, Version 1) XX DE Putative Mariner-type non-autonomous DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW Mariner-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-270 RA Jurka J.; RT "Mariner-N1_DR: Putative Mariner-type DNA transposon from RT zebrafish."; RL Repbase Reports 7(12), 1214-1214 (2007). XX DR [1] (Consensus) XX CC ~70 bp TIRs, TA TSD. Present in ~10,000 copies in the genome. XX SQ Sequence 270 BP; 93 A; 53 C; 48 G; 76 T; 0 other; cactctcaga aataaaggta caaaatctgt cactggggcg gtaccttttc aaaaggtaca 60 tttttgttcc taacaggtcc ttaaaggtac atattaatac ctaaaaagta caaatgtgta 120 cctcacagta ccttaaaggt acaaaagtga accttaaagg tacaaaagtg taccctaaag 180 gtactaatgt ccacctgtat ggtacaaaag tgtacctttt gaaaaggtac caccccagtg 240 acagattttg tacctttatt tctgagagtg 270 //