ID DIRS-2_DR repbase; DNA; ZEB; 5291 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 09-MAR-2009 (Rel. 13.09, Last updated, Version 5) XX DE DIRS-like LTR retrotransposon - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS1; DIRSDR1; KW reverse transcriptase RNase H; phage integrase; DIRS1_DR; KW DIRS-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5291 RA Jurka J.; RT "A family of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(9), 928-928 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 277..4899 FT /product="DIRS-2_DR_1p" FT /translation="MAEKNFKRCVPPCPRFITAGDSHDLCLECLGEEHALA FT AFENADCGHCDVLSRKELRSRREFFNKAPVAHAPRGSGPARAEAERRLRSW FT GSQLDLADEMETDSVLSLSGSVRSNPPSGASEARSAVSSAPRGRAASSVSE FT ETEAPQQQIQRGSGNLPPQSVEYEELVEVISRAADRFEVIDWQPAREQQQL FT RLTGMLDERELPSRVDPPLRDLPFCPELHDAVSKSWKNPYSAWLMTPKTAI FT YSAVRGLGEKACTVMPMIEEDLARHRRSDLKSARVPPLLSRPLRVTSGLVS FT KAYMTAGQSVGCLHTMSVLQAYQADLIKECLDGGGATPEQLREALRASDLA FT LRATKEAASSLGRSMATLVATERHLWLTQAEVSETDRAILMDAPISSSGLF FT GDAVDRVAESLDRVKKRSSSLGDFLPPRSKSQGAVKRQPQPSTSSSYHEVQ FT RIKQSPDVLGVRLSRAPPGHREHHSALSLNQGPWRKGEATSPLMLVGGSVS FT PGVGVPQCLRALGAPPPLQGGQRTTRGQSREAGTPSTFFGSVETPADGVPV FT GPVHSRTWLQNTVLCTPTSLQRYCTHHSEARTGSGYGTGSTGLINKRRNRA FT CSPTRQRVRVLQPVFYSSQKGWGIASHIRSEKSKSVRRGPQVQDVNHQKRG FT VTNSVRGLVCDDRPERRILPYFHPSPTQEIPEVRLRGRSVPVSGSSIRPSS FT VTSNLYQNSRSGTSSTSYAGDTYPKLHRRLANSSSDSRYGSSASRCRSHPY FT QKVGVSVKHRKKCACSSQDDHFFRCAMGLHDDASTSVPAMNRFDSVNRTQS FT QTRPVHHCETLSEVVGSHGSSSQRDSVRSAVHETPAVVAQIQGVFPQGESF FT PHDQGLAALPSSLKYVENALVPVPGPSVGGCLSSRHAYDKCFSDGLGSNGL FT DPHDQGLAALPSSLKYVENALVPVPGPSVGGCLSSRHAYDRCFSNGLGSNP FT EGASRSGTMGRTSSLLAHKLPGDDGRVSGLKTLSPRSKGPSCFSLHEQHIG FT 
GRLHQPAGGSEVSNAMQTSTSDPPVGPEQNPVHQGNVCPGPSEHGSRSPVE FT AGGEIQGMETSSPRGGGVHLGKIRESAGRPVCFPRDHALRTMVFSLASSPS FT GTGCHGSDIAEATSVCFSPDRSAPRSPGEGPSRLSTVTAGSPGLAYQDLVF FT GPHSPAGGSLVGDPHQQGPSVPGGRNDTSSPTRPVETVGVASEGAHLIEYG FT LSTEVAQTILSSRAPSTRKLYALKWALFSAWCREHQLNPVSCQVASVLEFL FT QDRLSAGLAASTLRVYVSAIAAYRSPLDDESLGQDPLIRRFLRGAIRLRPV FT STHRVPTWDLTLVLEGISVPPFEPLQEASDKFLTLKTAFLLAISSLKRVGD FT LQALSVAPSFLEFAPGMSKAFLYPRPGYVPKVPTHVARPAVLQAFHPPPFQ FT SSDQEKLNLLCPVRALNTYVNRVINWRKSEQLLVCFGPSKRGSPANKQTIS FT NWIVETISFTYQAAGRPAPKFVKAHSTRAVGASKASISGSALSDICLAAGW FT STPHTFVRHYQLDVDPSPGSSILTA" XX SQ Sequence 5291 BP; 1230 A; 1357 C; 1380 G; 1324 T; 0 other; ttccccttct agggaacttc aacactgcgt ctaaccagaa cgctagggga acacctcttt 60 tatacgcgtc ttgaagcaca tgtgaaatca atctaatgta attaagcagg tgtcgtcaga 120 ccagagagta taaaagcctg tactgagcat tcagtatcaa cttctttgct ttcaagaagc 180 acgcacgtga aaatacaccc tctttctgtg aactttcatt actgatttgc atacacaaaa 240 tacaaaaaaa ctgacaactt acttttttat tttggtatgg cagagaaaaa ctttaaacgt 300 tgtgtgcctc catgccctcg ctttattacg gctggtgact cacatgattt gtgtttagag 360 tgtttgggag aagagcatgc cctggcagca tttgagaatg ctgactgtgg acactgtgac 420 gttctctccc gtaaagagct gcgtagtcgg agagagttct ttaataaagc tcccgtggcg 480 cacgctcctc gcggttcggg tcccgctcgt gctgaggctg agcgtcgact tcggtcgtgg 540 ggttcgcagc tagatctggc ggatgagatg gagacggact ctgtcctttc tctctctgga 600 tccgtgagat ctaatcctcc ttcgggagcg tcagaagcac gctctgcggt ttcttctgcg 660 cctcgtggga gggcggcgtc ctccgtttcc gaggaaaccg aggcgccgca gcaacaaata 720 caaagagggt cggggaatct gccgccccag tcagtggaat atgaggagtt agtggaggtg 780 atttcacgtg ctgctgacag gttcgaagta atagattggc agccagcacg tgagcagcag 840 cagctgcgtc tgacaggaat gctggatgag agagaattac ccagcagagt agatcctcca 900 ctaagggacc tccccttttg tcccgagcta catgatgcgg tttctaaatc atggaaaaat 960 ccgtattcag catggttaat gacaccaaaa acagctattt attcagcagt tcgtgggcta 1020 ggggaaaagg catgtacagt aatgccaatg atagaagagg acttagcacg tcatcgtcgt 1080 tcagatctaa aatctgcaag ggtccctcct ttgttgtcga gaccattaag agtaacatcg 1140 ggtctagtca gtaaagcata 
tatgacggct ggtcagtctg ttggatgcct gcacaccatg 1200 tcagtgctgc aggcatatca ggctgaccta attaaagagt gcctagatgg tgggggagca 1260 acacccgaac agcttcgaga agctcttcgg gcgtcagatc tagctttaag agctactaaa 1320 gaggcagcct ctagtttggg gcgatctatg gctaccctgg tggctactga gcggcacctc 1380 tggctgacac aagcagaagt gtcagaaacc gatagagcta ttcttatgga cgctccaata 1440 tcgagctcag ggctcttcgg cgacgccgtc gatcgcgtcg ccgaatccct cgacagagtt 1500 aaaaaacgct ctagctccct cggggacttt ctccccccaa gatcaaaaag tcagggggct 1560 gttaaaagac agccccagcc gtcaaccagc tcctcatatc atgaagtaca aaggataaaa 1620 caaagtcctg acgtgttggg agtcaggctt tccagggccc cccctggaca ccgagagcac 1680 cattcagcgc tctcactgaa ccagggtccg tggagaaagg gagaggccac ctcaccactt 1740 atgttggtgg ggggctctgt gtctcccggg gtgggcgttc ctcagtgtct gagggcattg 1800 ggggccccac cccctctgca ggggggtcaa agaacaacca gaggccagtc tcgagaggct 1860 ggtaccccta gcacattttt tggcagcgtg gaaacacctg ccgatggtgt cccagtgggt 1920 cctgttcaca gtagaacatg gctacaaaat acagttttgt gcacgcccac ctcgcttcaa 1980 cggtattgca cccaccatag tgaagccaga acaggctctg gttatggaac aggaagtact 2040 ggccttatta ataaaaggcg caatagagcg tgttctccca ctcgacagag agtcagggtt 2100 ttacagccgg tattttatag ttcccaaaaa ggatggggga ttgcgtccca tattagatct 2160 gagaaatcta aatcggtccg tcggggccct caggttcagg atgttaacca tcaaaaacgt 2220 ggtgtcacaa attcagtccg aggactggtt tgtgacgata gacctgaaag acgcatactt 2280 ccatatttcc atccttcccc aacacaggaa atacctgagg ttcgcttgcg ggggcgaagc 2340 gttccagtat cgggttcttc cattcggcct agctctgtca cctcgaacct ttaccaaaat 2400 agtcgaagcg gcactagctc cacttcgtat gcaggggata cgtatcctaa actacataga 2460 cgattggcta attctagctc agactcacga tatggcagtt cggcatcgag atgtcgttct 2520 cacccatatc agaaggttgg ggtttcggtt aaacaccgca agaagtgtgc ttgttccagc 2580 caggacgacc atttctttag gtgtgctatg ggactccatg acgatgcgag cacgtctgtc 2640 cccgccatga atcgcttcga ttcagtcaac cgtacacaga gtcaaactag gccagttcat 2700 cactgtgaaa cactttcaga ggttgttggg tctcatggca gcagcagcca gcgtgattcc 2760 gttcggtctg ctgtacatga gacccctgca gtggtggctc aaatccaggg ggttttccct 2820 caaggggaat cctttccgca tgatcaaggt 
ctcgcggcgc tgccttcgag ccttaagtat 2880 gtggaaaatg ccctggttcc tgtcccaggg cccagtgttg ggggctgtct gtcatcgcgt 2940 catgcctatg acaaatgctt ctctgacggg ctagggagca acgggcttga tccgcatgat 3000 caaggtctcg cggcgctgcc ttcgagcctt aagtatgtgg aaaatgccct ggttcctgtc 3060 ccagggccta gtgttggggg ctgtctgtca tcgcgtcatg cttatgacag atgcttctct 3120 aacgggctgg ggagcaaccc tgagggggct tcccgcagcg ggacgatggg gagaacatca 3180 tcgttactgg cacataaact gcctggagat gatggccgtg tttctggcct taaaacactt 3240 tctcccagat ctaaggggcc atcatgtttt agtctgcacg aacaacacat tggtggtcgc 3300 ttacatcaac cagcaggggg gtctgaagtc tcgaatgcta tgcaaactag cacatcggat 3360 cctcctgtgg gcccagaaca aaatcctgtc catcagggca atgtatgtcc cgggccatct 3420 gaacatggga gcagatctcc tgtcgaggca gggggtgaga tccagggaat ggaaacttct 3480 tcaccccgag gtggtggagt ccatttggga aagattaggg aaagcgcagg tagacctgtt 3540 tgcttcccaa gagaccacgc attgcgtact atggttttct ctctcgcatc cagcccctct 3600 gggactggtt gccatggttc agacatagcc gaggctacgt ctgtatgctt ttcccccgat 3660 cgctctgctc ccaggagtcc tggagagggt ccgtcaagac tgagtacagt tactgctggt 3720 agccccggtt tggcctacca ggatttggtt ttcggacctc atagccctgc tggcgggtct 3780 ctcgtgggag atccccatca gcagggacct tctgtcccag gcgggaggaa tgatacttca 3840 tcccctaccc gacctgtgga aactgtgggt gtggcctctg agggggccca cctcatagag 3900 tatggactgt caaccgaggt tgctcagacc attctaagct ccagggctcc ctccacaagg 3960 aagctttatg ccctaaaatg ggctctcttt tcagcttggt gcagagaaca ccagctgaac 4020 ccagtcagct gccaggtagc ctcagtgctg gaatttctcc aagatcgcct gtctgctggg 4080 ttagctgcat ccactctgag agtgtacgtg tcagctatag cggcctaccg ttctccccta 4140 gatgatgagt cactaggaca ggatccgcta attcgtcgct tccttcgtgg agccataagg 4200 ctaaggcctg tcagcacaca cagggtaccg acatgggatt taacattggt gctcgagggc 4260 atctctgttc ccccatttga gccactgcag gaggcgtcag ataagtttct gacactaaaa 4320 acagctttct tattagctat ttcttcctta aaaagggttg gtgacctcca ggctttgtcg 4380 gttgcacctt catttctgga gtttgctcca ggcatgtcca aagcctttct ttatcccaga 4440 ccggggtacg tgcctaaggt gcccactcat gtggcgagac ctgctgtgct acaggccttt 4500 cacccgcccc catttcagtc gtcggaccaa gagaagttaa 
acttactctg cccagttaga 4560 gctctgaata catatgttaa ccgggttatc aactggagaa agagtgaaca gttactggtc 4620 tgcttcggac cctcaaaaag ggggagtccg gcaaataagc agacaataag taattggata 4680 gttgagacta tctcatttac ctatcaggct gctggacgcc ctgcacctaa atttgttaag 4740 gcccactcca caagggctgt cggggcctcc aaagcttcta tttcgggctc agccctttct 4800 gacatttgtt tggcggcagg atggtcgact ccacatacat ttgtgcgtca ctatcaactc 4860 gatgtagacc cctcaccagg gtcctctatt ctcactgcgt agtgtgcgtt cacagtcagc 4920 agtgagtctg gcctagtggg tattgcgttc ccctagcgtt ctggttagac gcagtgttga 4980 agttccctag aaggggaacg tctcgggtta cgtatgtaac catagttccc cgagagggaa 5040 cgagacactg cgtattccgc catactctct tctgcctgtt acttctttca agcaaattcg 5100 aagttgatac tgaatgctca gtacaggctt ttatactctc tggtctgacg acacctgctt 5160 aattacatta gattgatttc acatgtgctt caagacgcgt ataaaagagg tgttccccta 5220 gcgttctggt tagacgcagt gtctcgttcc ctctcgggga actatggtta catacgtaac 5280 ccgagacgtt t 5291 // ID CR1-4_DR repbase; DNA; ZEB; 1902 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE CR1-4_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1 clad; CR1-4_DR; ORF2; Non-LTR retrotransposon; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1902 RA Kapitonov V.V. and Jurka J.; RT "CR1-4_DR, a family of CR1-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(5), 6-6 (2002). XX DR [1] (Consensus) XX CC CR1-4_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish recently. CR1-4_DR copies are ~5% CC divergent CC from the consensus sequence. There are ~100 copies CC of CR1-4_DR present in the zebrafish genome. CC The 5' portion is incomplete; the consensus was built from CC five copies. CC The consensus encodes CR1-4_DRp, a 566-aa portion of the reverse CC transcriptase. 
CC CR1-4_DRp (positions 2-1696): CC XLCNSFLTFFETKIKNIHHQLHSNSSAPDFSQHVSEIITHFSFDTFTLPSTAEIVGHIRKSKTTNCLLDP CC LPTCLVKTCLPSLSTLITNIIHKSLDSGSVPSLFKTAVITPVLKNLELIHPIWPTTDQFPNLPFVSKILE CC KCVASQIHNYLSINNLFELFQSGFRPNHSTETALVRITNDLLLAADSGLLSILILLDLSAAFDMVLHEVL CC LNRLASLGISGTLLLWFKSYLEDRTQYVQIKDFKSRSQIVTTGVPQGSVLGPLLFIIYLLPLGHIFRKYN CC IQFHCYADNTQLYLSTKPSCSFPPSALSRCLAEIKIWLSANFLKLNSDKTEALLIGTKSVLDKADNFTID CC IDNSTIFPSVQVKSLGVILDSTLSFEGHINNITRTAYFHVRNITRLRPSLTTNNTAIFIHALVTSRLDYC CC NALLSGLPSKLLRQKLQLVQNHAARVISRTPSHEHVTPLLYQLHWLPVKYRIDFKILLLTFKALHNLAPQ CC YLTELLHIYTPSRTLRSANNFTLVPPRTRLSTMGDRSFSSMAPRLWNSLPLDLRSSDSLHTFKSRLKTHL CC FKQAFL CC There is only a ~60% identity between CR1-4_DR and other CR1-like CC elements from zebrafish. XX SQ Sequence 1902 BP; 509 A; 461 C; 277 G; 655 T; 0 other; gttgtgtaac agcttcctga ctttttttga aacaaaaatt aagaacattc atcatcaatt 60 acattcaaat agttcagccc ctgacttcag ccaacatgtt tctgaaatca ttacacattt 120 ctcttttgat acttttacct taccatctac cgctgaaata gttggtcata tacggaaatc 180 caaaaccacc aactgcctgc ttgatcctct tcctacatgc ttagttaaga cctgtcttcc 240 atcattgtcc acactgatta ctaacattat tcacaaatca ctggattctg gatctgtccc 300 atctttattt aaaactgctg taatcacccc agtactaaaa aacctggagc tgattcatcc 360 aatttggcca actacagacc aatttccaaa tttgccattt gtctcaaaaa tacttgaaaa 420 atgtgttgcg tctcaaatcc ataactatct ctctattaac aatttgtttg agctcttcca 480 gtctggtttt cgtcccaacc acagcactga gactgctctt gtcaggatta ctaatgatct 540 actactggca gcagactctg gtttactgtc aattcttatt ctcctggact tgagtgcagc 600 ttttgacatg gttttgcacg aggttctttt gaataggctt gcctcactag ggatctctgg 660 cacccttctt ttatggttta agtcatatct cgaagatcgt actcagtatg ttcaaattaa 720 agattttaag tcaagatcgc agattgtcac tactggtgtc ccacagggtt ctgtactggg 780 tccactcctg tttatcatat atctactgcc tcttggtcac attttcagaa aatacaacat 840 acaatttcac tgttatgctg acaacactca actctacctg tccaccaagc cctcctgttc 900 ttttcctcct tctgctttaa gcagatgttt agctgaaata aaaatctggc tttcagctaa 960 ctttttaaaa ttaaacagcg acaaaactga agcccttctc atcggcacta aaagtgtttt 1020 ggataaagct 
gataatttca caatagacat tgataacagc acaatttttc cctctgtgca 1080 ggtaaagagt ttgggtgtca tcttggatag cacactctca tttgaaggtc acattaataa 1140 tattacacgt actgcatatt tccacgtgcg taatatcact cgtctccgcc cttctctcac 1200 aactaataac acagccattt ttatccacgc attagttact tcacgtcttg actactgtaa 1260 tgcacttctt tctggacttc cttccaaact tctccgtcaa aaactccaac tggttcagaa 1320 ccatgcagct cgtgtcatct ctaggacccc atctcacgag cacgtcacac cactcctcta 1380 tcagcttcac tggcttccag taaagtatcg tattgatttt aaaatattac ttctaacttt 1440 caaggcactt cataatctcg ctcctcaata tctcaccgaa cttctccata tttacacccc 1500 ttctcgtacc cttagatcag ccaacaactt caccctggta ccacctcgca ctcgattgag 1560 cacaatggga gacagatcct ttagctctat ggctcctcgg ctatggaact cgcttcccct 1620 agatctaagg agcagtgata gtcttcacac ttttaaatcc cgtctaaaga cccatctttt 1680 taagcaggct tttctttaac aatttttttt gtcatgtttt ttattatgtt cttttatatt 1740 cgctattgcg ttttaacctg tttggtcaat gattgttttt agtaatgttt aatttgtttt 1800 agcatgtctg ttgatgctta ttgtaaggcg accttgggtg tcttgaaagg cgccatttac 1860 aaataaaatg aattattatt attattatta ttattattat ta 1902 // ID Gypsy-21-I_DR repbase; DNA; ZEB; 6426 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-21_DR LTR retrotransposon - a DE consensus sequence. XX KW GYPSY superfamily; Gypsy-21-I_DR; Gypsy-21-LTR_DR; Gypsy-21_DR; KW LTR retrotransposon; endogenous retrovirus; gag; integrase; KW protease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6426 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-21_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 15-15 (2005). XX DR [1] (Consensus) XX CC Gypsy-21-I_DR is an internal portion of the Gypsy-21_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. 
CC Its long terminal repeat is deposited in Repbase CC as Gypsy-21-LTR_DR. The consensus sequence was reconstructed CC based on multiple alignment of five proviral copies (they are CC less than 1% divergent from the consensus sequence). CC Gypsy-21_DR retrotransposons are characterized by 4-bp CC target site duplications. The internal portion contains two CC ORFs encoding the 562-aa Gypsy-21_DR1p gag (pos. 86-1771) CC and 1585-aa Gypsy-21_DR2p pol proteins (pos. 1672-6426) CC composed of the protease, reverse transcriptase, and integrase CC domains. The second protein, including the protease domain, CC does not start from Met. Presumably, the gag-pol fusion protein CC is formed originally due to a ribosomal frame shift. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy-21_DR1p" FT /translation="MEIIEQENIKVPNSLIVSGTTDTESDIDLTEHLGKYG FT CINRIVRIDSPGSPHHKNVIVEYESGSAVKILEPQLPFIFENPHQADIQYE FT IKALSSIYVETVTTSATKGYMEKLKSIAKLSGRRFEDLLQEELSKCREPVA FT QADDDTTDVASDLRSGIPQVNPAQGSQEMPMGAAFVPLVGTNVPNVNPPEI FT QRVVVEHIVKSEEATSHMHAPLKLRFFSGRSPRPANEVDYEIWHNSVELML FT QDPAVSDLVRSRKIIDSLLPPASDIVRTLGLHATPRAYLDLLDSAFGTVED FT GDELFAKFLSTMQNAGEKPSQFLQRLQVALAQAIRRGGVPSSEADRHLVRQ FT FCRGCWDDALISELQLEQKKANPPSFSELLLLLRTAEDKISTKELRMRKHL FT GASKLRTSSYSVCASSDEVVSTQTIVSDLKKQIAELQNQVEGLTKAKKQAQ FT CAPEGRVCDLQKQVVGLKGQIVEAKPSKSAKTQSATRMNKTQSVQTNLSEV FT VSSVPLKQLSKPKPWYCFHCGEDSHIASTCDRDANPALVAAKRKLLKERQS FT SWEAQMGSSHRTPLK" FT CDS 0..0 FT /product="Gypsy-21_DR2p" FT /translation="CKSCLGSCQKKAFKRAAVVLGGTNGVIPSHSFKIESA FT PVVGQTGAEKCSRRPKNSKFCNHHASSRSKPFILPKGLVGAKCTAEVSIAG FT QKCSVLFDTGSQVTTVSQTYYEQNLSHLEIKPLEHLEVEAANGQFVPYLGY FT VEIDVVFPKEFLGAEITLTTLALVTADTSSNVQSPVLIGTNTLDLAYEAQF FT DSEVAQPSHVLPYGFKVVMNVLRHRFKLKTNSMIGLVRAQGTKPEVVPAGQ FT TLVLEGSINSRELSSDKWVLMEAPLQSSLPGGLLCASSLVTLSQRSFQKIP FT VILKNETEHDILVPAKSVIAELYSLQSVNVKEPATESNSQGKELPPYTLDF FT GDSPIPNEWKERISQKLCAMSDVFALHDLDFGKTDKVTHRIRLHDQTPFKQ FT RARPIHPQDFDAVRKHLQELLDAGVIRESESPFSSPIVVVRKKNGDVRLCV FT 
DYRKLNLQTIKDAYALPNLEETFSALTGSRWFSVLDLKSGYYQIEVEESDK FT HKTAFVCPLGFWEFNRMPQGVTNAPSTFQRLMERCMSDVHLKEVVVFLDDL FT IVFSDTLEEHERRLLRVLHRLREFGLKLSLEKCRFFQTSVKYLGHIVSSSG FT VETDPDKVAALKTWPVPKNLKELRSFLGFSGYYRRFIRDYAAIVKPLNDLT FT SGYPPHRKSSKVRERRDYHNPKEPFGSRWSANCQKAFEAIIEALTTAPVLG FT FADPKLPYVLHTDASTVGLGAALYQEQEGQLKVIAYASRGLSRSEARYPAH FT KLEFLALKWAVTEKFYDYLYGNQFTVVTDSNPLTYVLTSAKLDSTSYRWLS FT ALSAFSFKLQYRAGKQNIDADSLSRLPHGALKNDVASQKEQERIRQFALNH FT LFDDVQTMSPEVIQAICDKHIVYCKSVDCGVTPMTLVESLAIHVDAIPGSF FT DHDENNFGCPVVPTFGENDLKEKQRSDPIIREVIVQLETGETIPPLVRKEL FT PMLSLLMRELSKLELQNGILYRRRHDGDTITYQLVLPESLRSMVLTCLHDD FT MGHLGVDRTLDLVRSRFYWPKMLVDIERKIRTCPRCVCRKSLPDHAAPLVN FT IQVTRPLELVCMDFLSIEPDSRNTKDVLVITDFFTKYAVAVPTCNQKSRTV FT AKALWENFIVHYGIPEKLHSDQGADFESKTIKELCELMGIHKIRTTPYHPR FT GNPVERFNRTLLNMLGTLKKCDKVHWSSFVKPLVHAYNCTKHDSTGFSPYE FT LMFGRQPRLPIDLAFDVPLNREEYKTHSQFVHDLKCRLKKTYDLAMKSTAK FT VGERNKARFDKHVVESVLDIGDRVLVKNVHLRGKQKLADKWESLVYVVVKR FT AGDLPVYTVKPEGKEGPLRTLHRDLLLPCDLLQLPEEVLALPSTRKRPPTR FT KNPCNKVQDQVDFDSDDDDSSLELMHSDLLIPTNITFTEVYEAGTGPQTEV FT PSIPIVQDTVGHISTENLPVSADSPIECLPEIENFQIDNAPTLNVLPVDDP FT DVVESDENLPAEVPVNLGDQQIPVEETVKVSEHETETKKHENDGNVIRKSD FT RVRQKPRIFTYPELGNPLVSIVQSLFQSLSTAVTDSIIENNSFTKAADAVV FT TQPVSFMHRDVHRVNGGGA" XX SQ Sequence 6426 BP; 1825 A; 1292 C; 1508 G; 1801 T; 0 other; gtttggcgag ccagccagga gcagtaagta gagagtgact atagtaagaa atcagacata 60 tttaaataga ctttacaacg taaagatgga aatcattgaa caggagaaca ttaaggttcc 120 aaattctctt attgtaagcg gaactacaga cacagagagt gatattgatc tgaccgagca 180 tctaggtaaa tatgggtgca taaatagaat tgtccgcatt gacagtcctg ggtcccccca 240 tcacaaaaat gtgatagttg aatatgagag tggaagtgca gtgaaaatcc tagagcccca 300 gttaccattc atttttgaaa atccacatca ggctgatatt cagtatgaga tcaaagcttt 360 gtccagtatt tatgttgaaa ccgtcactac tagtgccacc aagggctata tggaaaagtt 420 gaaaagcatt gcaaagctta gtggtaggcg ttttgaggac ctcttacaag aagagctttc 480 taaatgcagg gaacccgtag cacaggctga tgacgatact acagatgttg cctctgattt 540 gcgctcaggt attcctcaag ttaaccctgc tcaggggagc caggaaatgc ctatgggtgc 600 
tgcatttgtt ccattagttg ggacgaatgt tcctaatgta aatcccccag agatacagcg 660 tgttgtagtt gagcatattg tgaagagtga agaagctact tctcacatgc atgctcctct 720 caagctaaga ttcttttctg ggcggtcccc tcgtccagcg aatgaggtag attatgagat 780 ttggcacaac agtgtagaac tcatgctaca agacccagcc gtgtctgatt tagtcagatc 840 tagaaaaatt attgacagcc ttctacctcc agcatcagac attgtgagga cgctaggtct 900 gcatgcaacg cctagagctt atcttgatct tttagattca gctttcggaa cggtggagga 960 tggtgacgaa ctctttgcta agttcttgag cacaatgcag aatgcaggag agaagccatc 1020 acagttttta cagcgattgc aagtagctct tgcacaggct attagaagag gtggtgtgcc 1080 ctctagtgag gctgatcgac acttagtaag acaattctgt aggggttgtt gggacgatgc 1140 cctcatttca gaattacaac tagaacaaaa gaaggccaat cccccttcct tctcagaatt 1200 gttattactg ttgcgaacag cagaagacaa aattagcaca aaagagctcc gtatgagaaa 1260 acatcttggt gcatccaagc tgcgaacatc ctcttactct gtgtgcgctt catcggatga 1320 ggttgtatcg actcagacta ttgtttcaga tttgaaaaag caaattgcag agcttcagaa 1380 ccaagttgag ggtttaacga aagctaaaaa gcaagctcag tgtgctccag agggcagggt 1440 ttgtgacttg caaaagcaag tagtagggtt gaagggtcaa attgttgaag ctaagccaag 1500 caagtctgca aaaactcaat ctgctactag gatgaacaaa acccagtccg tgcaaacaaa 1560 cttgagtgaa gttgtaagca gtgtccctct taagcagctg agtaaaccta agccctggta 1620 ctgctttcac tgcggtgaag acagtcatat tgcttctaca tgtgatcgtg atgcaaatcc 1680 tgccttggta gctgccaaaa gaaagctttt aaaagagcgg cagtcgtcct gggaggcaca 1740 aatggggtca tcccatcgca ctcctttaaa atagaatcag ctccagttgt gggacaaacg 1800 ggggctgaaa aatgtagcag gcgtcccaag aatagtaaat tttgcaatca ccatgctagc 1860 agtagaagca aaccttttat tttgcctaag ggcttggtag gggcaaaatg cactgctgaa 1920 gtctctatag cgggtcaaaa gtgtagtgtt ctttttgaca caggttcgca ggtaaccact 1980 gtttctcaga cctattatga acagaatttg tctcatctag agataaaacc gcttgaacat 2040 cttgaggtgg aagctgcaaa tggacagttt gttccgtatc tgggctatgt tgagattgat 2100 gtagtgttcc caaaagaatt ccttggagca gagatcacac ttaccactct tgctttggtc 2160 actgcagata ctagcagtaa tgtccagtct cctgttctta ttggtacaaa cacccttgac 2220 ttagcctatg aagcccagtt tgattctgaa gtagcccagc cttcacatgt attaccgtat 2280 ggattcaaag 
tcgtcatgaa tgttctcaga caccgattca agctaaaaac caatagcatg 2340 attggacttg ttcgagccca gggtacaaag cctgaagttg tccctgcagg acaaactctt 2400 gtgcttgagg gttcaataaa ttccagagag ctttcctctg ataagtgggt tctgatggag 2460 gctcctcttc agtcttcctt gcctggaggt cttttgtgtg catcttctct tgtcactctt 2520 tcccagagat catttcagaa aatcccagtg atcctgaaaa atgaaactga gcatgacatt 2580 ttagttcctg caaagagtgt cattgcagaa ttgtattcac tgcagagtgt gaatgttaaa 2640 gaacctgcca cagagtccaa cagtcagggt aaagagttac caccgtacac tttggatttt 2700 ggtgactcac caatacctaa tgagtggaag gagagaattt cacaaaagtt gtgtgcgatg 2760 tcagatgttt ttgctcttca tgatttagat ttcggcaaga ctgataaagt gacgcatcgc 2820 ataaggctac atgaccaaac cccatttaaa cagagagctc gccccattca cccgcaagat 2880 tttgatgctg tgcggaaaca tctgcaggag ttattagatg ccggtgtcat ccgggagtcg 2940 gagtcccctt tttcttcacc gatcgtagtt gttcggaaga agaatgggga tgtccgtctc 3000 tgtgttgact accgtaaact taacctgcaa acgatcaaag atgcttacgc tttgccaaat 3060 ttggaggaaa ctttttccgc tcttactggt tctcgttggt tctctgtttt ggacctcaaa 3120 tctggttatt accagatcga ggttgaggag tccgataagc acaaaaccgc ttttgtctgt 3180 ccgttaggtt tctgggagtt caatcgaatg ccgcaagggg ttacgaatgc tcccagtacc 3240 tttcaaagac tgatggaacg gtgtatgagc gatgttcatt taaaagaggt cgttgttttt 3300 ctggatgact tgatagtgtt ttctgacacc ttagaggagc acgagcgtag gttgttgaga 3360 gtgttgcatc gcttgcggga gtttgggttg aagctttctc tggaaaaatg cagatttttt 3420 cagacttctg tgaaatatct tgggcacata gtgtctagca gtggtgtaga gaccgatcca 3480 gacaaggttg cggcactgaa aacttggcct gttccgaaaa acctgaagga actcagatca 3540 tttttaggtt tttcagggta ttaccgcaga tttattcgcg attatgctgc cattgtgaag 3600 cctttaaatg atttgacgtc agggtatccg ccacatagaa agagctctaa agttagggaa 3660 cggagagatt accacaatcc caaagagccc tttgggagcc gttggtctgc aaattgtcag 3720 aaagcgtttg aagccatcat cgaagcactt actactgctc cagtgcttgg ttttgccgat 3780 cctaaacttc cgtatgtttt acatacggac gccagcactg tggggttggg agcggctttg 3840 taccaggagc aagaagggca gttaaaagta attgcttatg ccagccgtgg actttctcga 3900 agtgaagccc gttatcccgc tcataaatta gagtttttgg cccttaaatg ggcggtaact 3960 gaaaaattct acgattatct 
gtacggtaac caatttactg tagttaccga cagtaatcca 4020 cttacttatg tattaacctc ggctaagtta gattctacga gctacaggtg gctttccgct 4080 ctttcagcat tttcctttaa attgcagtac agagctggaa aacagaacat agatgctgac 4140 agtttgtcca gacttcctca tggtgcttta aaaaatgatg ttgcatctca gaaggaacag 4200 gaacgaattc gtcaatttgc actgaaccac ctgtttgatg atgtccagac catgtcacct 4260 gaagtcattc aggccatatg tgataagcac attgtctatt gcaagtccgt ggattgtggt 4320 gttactccta tgactttagt tgagtccctt gcaatccatg ttgatgcaat cccaggcagt 4380 tttgatcatg atgagaataa ttttggttgt ccggtagtgc caacatttgg agagaatgat 4440 ttaaaagaaa aacaaagatc tgaccccatt atccgtgaag tcattgtcca gttagaaaca 4500 ggtgagacaa tccctccttt agtacggaaa gaacttccta tgctctcttt gcttatgagg 4560 gaactcagta agttggagtt gcaaaatgga atcctctatc ggaggaggca tgatggagat 4620 accatcactt atcagctagt tcttcctgag tctttgcgta gtatggtttt gacatgtcta 4680 catgatgaca tggggcattt gggtgtagat cgtacactgg accttgtgag atctaggttt 4740 tattggccta aaatgcttgt ggatattgaa agaaagatcc ggacctgccc taggtgtgtg 4800 tgtcgtaaat ctttaccaga tcatgctgca cccttggtta atattcaagt gactcgaccc 4860 cttgagctag tgtgtatgga ttttttgtcc attgaaccag attcccgaaa caccaaagat 4920 gtgctggtta tcacagattt ctttaccaag tatgcggttg ctgttccaac ttgtaatcag 4980 aagtcgcgta cagtggctaa agctctttgg gagaatttta tagttcatta tggaattcct 5040 gaaaaattac acagtgatca aggggccgac tttgagtcaa agactataaa agaactatgc 5100 gaattgatgg gaattcataa gattaggaca accccttacc accccagggg gaaccccgtc 5160 gaacgattca accgtactct tttaaacatg ctcggtacgt tgaaaaaatg tgacaaagta 5220 cattggagta gttttgtgaa acctcttgtt cacgcgtata actgcaccaa gcatgattcc 5280 actgggttta gcccctatga attaatgttt ggacggcagc cgcggttacc tatagatttg 5340 gcatttgatg ttccgctgaa cagagaggag tacaaaactc attcgcagtt tgtgcatgat 5400 ttgaagtgta ggttgaaaaa aacgtatgat ttggcaatga aaagtactgc taaagttgga 5460 gaaagaaaca aagctcgttt cgataagcat gttgttgaat ctgttttaga cattggggat 5520 agggtcctgg taaaaaacgt gcacttgagg ggaaaacaga agttggcaga caagtgggaa 5580 tcacttgtat atgttgttgt gaaaagagcg ggtgatctcc cagtgtacac tgtcaagcca 5640 gagggtaaag aaggccctct aagaacgctt 
cacagagacc ttttactgcc ttgtgatttg 5700 ttacagttac ctgaggaagt acttgcgttg cccagtactc ggaaacgtcc tccaactcgt 5760 aaaaacccct gtaacaaagt acaagaccag gtagattttg actctgatga tgatgacagc 5820 agcttagagc tgatgcacag tgatctttta ataccgacga acataacgtt tactgaggtt 5880 tatgaagctg gaaccgggcc acaaactgag gttccctcaa ttccaatagt tcaagacact 5940 gtcggtcaca tttcaactga aaatctacct gtttcagcag acagtccgat cgaatgttta 6000 cctgaaatag agaattttca aatagacaat gcacctacct tgaatgttct acctgttgat 6060 gatccagatg ttgtggagtc tgatgaaaac ttacctgcag aggttcctgt caatttaggt 6120 gaccagcaaa tacctgtaga ggaaactgta aaagttagtg agcatgaaac tgaaacgaag 6180 aaacatgaaa atgatggtaa tgttatcagg aagtctgata gagttaggca gaaaccacgg 6240 atttttactt atcctgaatt gggtaacccg ttggtctcta tagtgcagtc tctttttcag 6300 agtcttagca cggctgtgac tgactccatc attgaaaaca acagctttac aaaggcagct 6360 gatgcagtag tcacacagcc tgttagtttc atgcacagag acgtgcatag agttaacggg 6420 ggaggg 6426 // ID Gypsy-15-I_DR repbase; DNA; ZEB; 6739 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-15_DR LTR retrotransposon - a DE consensus sequence. XX KW GYPSY superfamily; Gypsy-15-I_DR; Gypsy-15-LTR_DR; Gypsy-15_DR; KW LTR retrotransposon; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6739 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-15_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 3-3 (2005). XX DR [1] (Consensus) XX CC Gypsy-15-I_DR is an internal portion of the Gypsy-15_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy-15-LTR_DR. Gypsy-15_DR is characterized by CC 4-bp target site duplications. 
The internal portion encodes CC two proteins: the 628-aa gag Gypsy-15_DR1p (pos. 112-1995) and CC 1554-aa Gypsy-15_DR2p polyprotein (pos. 2022-6683) composed CC of the protease, reverse transcriptase, and integrase CC domains. PBS is complementary to Arg-tRNA. The internal CC portion is flanked by 99% identical LTRs. XX FH Key Location/Qualifiers FT CDS 112..1995 FT /product="Gypsy-15_DR1p" FT /translation="MDVVRRENITVANSLIVSGLTLTELDNELEAYLLRYG FT SIRRNVIIDDPASDFHKLLIVEFNENSAFQTLHPHLPLTLGSLSDPSITFR FT VRALAAVCDPPVVSTATEGYLEQLKAIAKESGRPFQTVLQEELEKLKETHS FT VDQTLAESQKIEVADTQSRDNTLISESPNVAEIDPQDSESKNPTKNRIIYV FT SPPLPETTADDNLTTVFPSSSTNIMGDPQVQRMVVEHVVKTSDAMMSQQTS FT IRLRVFSGKSPRPPNEPDYDTWRASVDYLLNDPSISDLHRTRKILDSLLPP FT AADVVKHVRPPALPAVYLELLESVYGSVEDGDELLAKLMGTFQNQNEKSSD FT YLHRLQVLLSAVIRRGGIKESERDRYLLKQFCRGCWDSRLIVDLQLEKKEG FT QLLTFAELTILIRTQEDKNASKEERMRKHFGMTKPANVYPKTRAISNQVSA FT CACDVSSTYSSEAGSLKKQVAEIQAQVATLKQSPDKKSIKGQSERAELVAL FT KRTVEDLCVQVAAVKASVAEGLKGNNPEQSEIARLQRQVAELQAQGIVQKA FT YQAPHMQRSPGTEIGRALKKEPLRTNRPRPGYCFRCGDDGHLAVNCENPAN FT PPKVEEKRLKLREQQHQWDILHGRPAQFLN" FT CDS 0..0 FT /product="Gypsy-15_DR2p" FT /translation="MEAKLYNSRPGCYKQIPELSQTTVSFKHLPKGLVGTK FT CTAQVTIGGMEVNCLLDTGSQVTTIPHSFYKAHLSDFPLEPLKNLLEVEGA FT NGQAVPYLGYIELTLKFPKEFIGAEVEVPTLALVVPDLTSFSQILVGTNSL FT DVLYGKCAQDCAADVKSSFPGYQAVLKVLEARWRQASSETLGYVKFKGNSP FT EIVPAGGMVVLEGQAHFNGPHTEKLVTLEPPSVPLPNGLLIASCLHTSPNK FT RLSKLSVLLRNTTQTDIAVPPKVMLAEIHAIQSVLNQHHQSSDAKAEESIP FT TCANLTFDFGDSLPTTWKERITKLLNSMPEVFSLHDLDFGHTKKVKHQIKL FT NDETPFKQRARPIHPQDIDAVRRHLQELLVTGVIRESDSPFASPIVVVRKK FT DGSVRLCVDFRKLNAQTIKDAYALPNLEEAFSTLTGSKWFSVLDLKSGFYQ FT IEMEEVDKAKTAFVCPLGFWEFNRMPQGITNAPSTFQRIMERCMGDLNRKQ FT VLVFIDDLIVFSDTLEEHESRLLQVLNRLKEYGLKLSPEKCRFFQTSVKYL FT GHIVSHNGVETDPAKVEALKTWPRPRNLKELRSFLGFAGYYRRFVRDFSKI FT VKPLTDLTAGYPPLRKSCNTKQKDCEYFNPKAEFDTRWTTDCQDAFDSIID FT NLTSAPILGFANPKHPYVLHTDASTTGLGAALYQEQEGQLRVIAYASRGLT FT KGESRYPAHKLEFLALKWAVTSKFNDYLYGAEFTVVTDSNPLTYILTSAKL FT 
DATSYRWLSSLSTYNFKLQYRAGSQNQDADGLSRRPHGELVDDLTSLKERE FT RIRQFTLHHLMESEDESPVVMAEVVKAICEKHQVVGSPQGLHCIPSVTLVE FT SLTHCVDVLPYEFQHEDEHGLPSLPHLSQAALAELQRKDPELKIVIERVES FT GVKPCKLRELSSAVSLWLKEWKRLELRSNVLYRKRQEHGASSYQLALPTSL FT RNTVLQSLHDDMGHLGIERTLDLVRTRFFWPKMSHAVVQKVKTCERCVRRK FT TPPEKAAPLVNIQTSRPLELVCIDFLSLEPDQSNTKNILVITDHFTKYAMA FT IPTRNQTAQTVAKSLWDHFLIHYGFPEKLHSDQGADFESRTVKELCKVAGI FT HKVRTTPYHPRGNPVERFNRTLLQMLGTLENERKSRWKEYVKPLVHAYNCT FT RHDTTGYTPYELMFGRQPRLPVDLAFGLPVDTPNKSHSQYVENLKNRLRES FT YEMATKNAGKIAERNKQRFDKHVVALTLEEGDRVLVRNVRLRGKHKLADKW FT EQNVHVVVKKAHNLPVYTVKPEGKDGPLRTLHRDLLLPCGFLQSNKLVEPP FT KQKPARKPLTRFSLNNEMQESDLISENSESEEEHIVSNVPEGTLSFETQII FT VGPEYIPVGESGVSLTVLDPAVEDVSVPESVVSNPEEPAKKHLPGVEPVEK FT ETNELIEVEKNSNALESSNTVPFVLTEKNSELEQSSELWSESPDQTAKNVL FT DSFEWETEQNLIPSNVGHTEILQNEQPTCNEPDDILLRRSQRERRPPKKFE FT YPQLGNPLTLVIQSLLQGLDTALCSSLEKSVVAPVRHL" XX SQ Sequence 6739 BP; 2013 A; 1449 C; 1549 G; 1728 T; 0 other; gtaaagttgg cgagccagcc aggagtctaa ttattgcagc aagggtgtca aacgacaaga 60 aaaagggaat tgtatcagta gcaaaccgtt ttaaaaattt gagctgtcat aatggatgtc 120 gtaagacgag aaaatataac tgtagcgaac tctctcatag taagcggtct aacgttaact 180 gagttagata atgagctgga agcatatttg ctgagatacg gctctatccg tcgcaacgtg 240 ataattgatg acccagcatc agactttcac aagctgctga ttgtggagtt taatgaaaac 300 tctgcgtttc aaactttgca tccccatttg cccctgactt tgggaagtct ttctgatcca 360 agcattacct ttcgggtacg cgctttagcc gctgtgtgtg acccacctgt cgttagcact 420 gccactgaag ggtaccttga gcaattgaag gccatagcca aagagagtgg aaggcctttc 480 caaaccgtgt tgcaggagga gttagagaaa cttaaagaaa ctcattcagt agaccaaact 540 ctagcagaat ctcaaaagat agaggttgct gacacacagt ctagagataa cactctgatt 600 tctgagtcac ctaatgtagc agagatagac ccccaagatt ccgaatccaa aaacccaacc 660 aagaatagaa tcatttatgt gtccccgcct ttacccgaaa ctaccgctga tgataacctt 720 acaacagttt ttccttcctc ttcgacaaac ataatgggcg atccccaggt tcaaagaatg 780 gtagttgagc atgtagtaaa aacgagtgat gccatgatgt ctcagcagac atccattcgt 840 cttagggtct tttcagggaa gagtccccgc ccccctaacg aacctgatta tgacacctgg 900 cgtgccagtg 
ttgactattt gcttaatgac ccatctattt cagacctgca tagaacacgt 960 aaaatcctgg acagtctctt acctccagcc gcagatgtag ttaaacatgt gcgtccccca 1020 gcccttcctg ctgtctatct tgagttgctg gagtccgttt atggttctgt tgaagatgga 1080 gacgaactgt tagcaaaact aatgggtact tttcagaatc aaaatgaaaa atcatccgac 1140 tatctccatc gccttcaagt cctgttaagc gcagtaatca ggcgaggtgg tataaaagag 1200 agtgaacgtg accgttatct tttaaaacag ttctgtagag ggtgctggga tagccgcctc 1260 attgttgatc ttcagcttga gaaaaaagag ggccagttac ttaccttcgc tgaattaacc 1320 atactaattc gaactcagga agacaaaaat gcttctaagg aggagcgtat gaggaaacac 1380 tttgggatga caaagccagc aaatgtttac ccaaagacac gagctatctc aaaccaagtg 1440 tcagcttgtg catgcgacgt gtctagcact tatagttccg aagcaggatc tttaaagaaa 1500 caagtcgcag aaattcaagc tcaagtcgcc actttaaaac agtctcctga taaaaagagt 1560 attaaaggtc aatcagaaag ggctgagtta gttgctttaa agaggactgt tgaagacctt 1620 tgcgttcagg tggctgctgt aaaagcatct gttgctgagg gactaaaagg gaacaatcca 1680 gagcaatcag aaattgccag attgcagcga caggtagcag agctacaagc acaaggtatt 1740 gtacagaaag catatcaagc tcctcatatg cagagatccc ctggaactga aattggcaga 1800 gctctcaaga aagagccttt aagaactaac agacctaggc cagggtactg ttttcgatgc 1860 ggagatgatg ggcatttggc agtcaactgt gaaaaccctg caaatcctcc aaaagttgag 1920 gaaaagcggc tcaagctgag agagcagcag catcagtggg atatcctaca tggaagaccc 1980 gcccagtttt taaactaggt gaggtctcta tagcggggca tatggaggcc aaactttata 2040 atagccgccc tggatgttat aaacagatac ccgagctcag tcaaaccact gtgtctttca 2100 aacatttacc caaaggtctg gtaggaacca aatgtacagc ccaagtcacc attgggggga 2160 tggaggtaaa ctgcctttta gacacggggt cgcaggtcac cacaataccc cattcgtttt 2220 acaaagcaca tttatctgat ttccctttgg agcccttgaa aaatctactg gaggtagaag 2280 gagctaatgg acaggctgtg ccatatttag ggtacatcga acttacctta aaattcccca 2340 aagaattcat cggggcagag gttgaggttc ctacattagc tttagttgtt ccagatttaa 2400 ccagtttttc ccaaatttta gttggaacaa actcgttaga tgtgctttat ggtaaatgtg 2460 ctcaagattg tgcagctgat gtcaagtcta gttttcctgg ctatcaagct gtgcttaaag 2520 tgttggaagc tagatggagg caggccagca gtgaaaccct tggttatgta aaattcaagg 2580 gaaactcccc tgagatagta 
cctgcaggag gaatggtggt gttagagggt caagcccatt 2640 ttaatggtcc ccacacagaa aaactggtaa cactcgaacc accctccgtt cctttgccca 2700 atggtcttct tattgctagt tgcttgcaca catcaccgaa taaacgtctt tccaagctgt 2760 cagttctgtt aagaaatacc acgcaaactg acatagcagt tcctcctaaa gtcatgttag 2820 ccgagattca tgctattcaa agtgtcctga accagcatca tcagagttca gatgctaaag 2880 ctgaagagtc aatacccacc tgtgccaact taacatttga ctttggcgac tctctgccca 2940 cgacctggaa agaaaggata acaaaactgt taaactctat gccggaagtt ttctccctgc 3000 atgatttgga ttttggtcac acaaagaagg tcaagcacca aataaagtta aacgacgaga 3060 caccattcaa acaaagggcc aggcccatac atccccaaga catagacgct gtgaggaggc 3120 acctccaaga gttgctagtt actggtgtta tccgggagtc tgattctcca tttgcttcac 3180 ccatagttgt tgtccggaaa aaggatggct cagtgcggct atgtgttgac ttccggaagc 3240 taaatgcaca gacaataaaa gacgcctatg cgttaccaaa tttagaggag gctttctcca 3300 cactgacggg ctcaaaatgg ttttctgtgc tcgatctgaa gtctgggttt tatcaaatag 3360 agatggagga agttgataag gccaagactg catttgtctg tccacttggc ttctgggagt 3420 tcaaccgtat gccgcaggga attacaaatg cccctagcac ctttcaaagg attatggaac 3480 ggtgcatggg agatctaaat cggaagcaag tccttgtctt tattgatgac ctcattgttt 3540 tttctgatac tttagaggaa catgagtccc ggttgttgca agtcctaaac cgacttaagg 3600 agtatggatt gaaattgtca cctgagaagt gccggttctt ccaaacctca gtgaagtacc 3660 ttggccacat tgtttctcac aatggggtgg aaacagaccc tgcaaaggta gaagctttaa 3720 agacctggcc aaggccaaga aacctaaaag agctaaggtc ctttttaggc ttcgctggat 3780 attacaggag gtttgtgcgt gacttttcaa agatagttaa accgttaact gaccttactg 3840 caggatatcc tcctcttaga aagagttgta acacgaagca gaaagactgt gaatatttca 3900 atcccaaagc ggaatttgac actcgatgga ctacagactg tcaggatgca tttgactcca 3960 taatcgacaa tctcacatct gcacctatat tgggctttgc aaaccccaaa catccctatg 4020 tgctacacac cgatgcaagt accaccgggc tcggtgcagc tttgtaccaa gaacaagagg 4080 ggcagctgcg agtcatagct tatgctagta gagggttgac taaaggtgag agcaggtacc 4140 ctgcacataa acttgaattt ttagcgctaa aatgggctgt aacttccaag tttaatgact 4200 acctttacgg tgcagaattt actgttgtga cagatagcaa ccctctaaca tatatattaa 4260 cttcggcaaa acttgacgct accagttacc 
gctggttgtc cagtctatcg acttataatt 4320 ttaagctgca gtacagggca gggagtcaaa accaagatgc agatggtctc tctcgaaggc 4380 cacatggtga gcttgtggat gacctaacct cactaaaaga gagggaaagg attaggcaat 4440 tcactttgca ccatcttatg gagtcagaag atgagtcacc tgttgtgatg gcagaagtag 4500 tgaaagcgat ctgcgaaaag catcaagtag ttgggtcacc ccaaggactc cattgtatcc 4560 cttcggttac tttggttgag tctcttaccc actgtgtgga tgtccttcca tacgagttcc 4620 agcatgagga tgaacatggt ctcccaagtc tccctcatct ctcacaagct gctttggcag 4680 agttgcagag aaaggatcca gagttgaaaa ttgtcattga aagagtggaa agtggggtta 4740 agccttgtaa gttaagggaa ctatcttctg ctgtgagctt atggttaaag gaatggaagc 4800 gtcttgagtt gaggagtaat gttctgtaca gaaagaggca ggaacacgga gcttcatcat 4860 accagttggc tttacctacc tcacttagaa acaccgtatt acagagtctc catgatgaca 4920 tgggtcatct tggtattgaa cgaacactgg atcttgtgag gacaagattc ttttggccga 4980 aaatgtctca tgcagtggta cagaaggtaa aaacctgtga acgctgtgtt cggcggaaaa 5040 cacctcctga aaaagcagct cctttggtta atattcaaac aagtaggccc cttgagttgg 5100 tgtgcattga tttcctatcc ttagagccgg accaaagcaa cactaagaac attctggtca 5160 tcactgacca ttttacaaag tatgctatgg ccatacctac tcgaaaccaa actgcccaaa 5220 cagttgcaaa aagtctttgg gaccacttct taatacacta tgggtttccc gagaagctgc 5280 atagtgacca aggagccgac tttgagtcac gtactgtcaa ggagctgtgt aaggttgcag 5340 gaatacacaa ggtcagaaca accccatacc atcctagggg gaatccagtg gaacgattta 5400 atcgtacact gctccaaatg cttggaacac tggaaaatga gaggaaatct aggtggaagg 5460 agtatgtaaa acccctagtg catgcctata attgcaccag gcatgacaca actggatata 5520 ctccctacga gctcatgttt gggcgacaac ctcgtcttcc tgttgacttg gcattcgggt 5580 tgccagtgga cactcccaac aagtctcact cacagtatgt ggaaaacttg aagaatcgtt 5640 tacgtgaaag ttacgagatg gctaccaaaa atgctggaaa gattgcagaa cgtaacaagc 5700 aaaggtttga caagcatgta gttgccttaa ctctggaaga aggtgaccga gttctagtga 5760 ggaatgtgcg tttgcgaggc aaacataaat tagctgacaa atgggagcaa aatgttcatg 5820 ttgttgtcaa gaaagcacat aacctaccgg tgtatactgt caaaccagaa ggaaaggatg 5880 gtccgttacg aactttacac cgtgacctct tgttaccctg tggatttttg caatcaaata 5940 agcttgtaga accaccaaaa cagaaaccag ccaggaagcc 
tctaaccaga ttttccctta 6000 acaatgagat gcaggaatca gacttaattt ccgaaaactc agaatctgag gaggaacaca 6060 tcgtcagtaa tgtgcctgaa ggaacattaa gttttgaaac tcaaattatt gttggtcctg 6120 agtacatacc agttggggag tctggtgtta gcttaacagt cctcgatcct gctgtggaag 6180 acgtgtctgt tccggaatct gtagtaagta atccagaaga acctgcaaag aaacacttac 6240 ctggtgtgga acctgtggaa aaagaaacaa atgagttaat tgaagtagaa aaaaattcca 6300 atgcacttga gtcaagtaac actgtgccat ttgtcctgac tgagaaaaac tcagagttgg 6360 agcaatcctc agagctttgg agtgagtccc ctgatcaaac agccaagaat gtgctagata 6420 gttttgagtg ggaaactgaa caaaatctga tacccagcaa tgtgggacat actgaaattt 6480 tgcagaatga gcaacccact tgtaatgaac cagatgatat tttactcaga cgatcacaga 6540 gggagcgacg gcctcccaag aagtttgaat atccccagtt agggaatcca cttaccttag 6600 ttatacagtc tttgttacaa ggccttgata cagctctctg ttcttcttta gagaagtcag 6660 ttgttgctcc agtgcgtcat ctgtgaatac tgtttgctgt gcaatgcaaa gggacttgca 6720 tgtattcgag aggggaggg 6739 // ID LOOPERN3_DR repbase; DNA; ZEB; 1011 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE LOOPERN3_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; LOOPERN3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1011 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN3_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(5), 27-27 (2002). XX DR [1] (Consensus) XX CC About 1000 copies of LOOPERN3_DR are expected to populate the CC zebrafish genome. LOOPERN3_DR copies are ~10% divergent from the CC consensus sequence. CC This element is characterized by 10-bp terminal inverted repeats CC and CC putative TTAA targets site duplications (less likely, TTTAA). 
CC Its classification is not very certain yet, although it is CC expected to be a member of the piggyBac/Looper superfamily. XX SQ Sequence 1011 BP; 290 A; 190 C; 218 G; 286 T; 27 other; agggcaccta tggtraaaaa tctacttttc aagctgtttg gacagacmtg tgtgtaggta 60 tagtgtatag accgtcatat tggggtgata taaacacacc cagtcctttt tttttcaatt 120 taactacata aaaacggtsg accaattgga gcggttttca gatcgaccgc aactttacgt 180 aggagtgcgg tccccccgcc caccgaattg attgacagct gcgcgtaaca tgttccggta 240 gtcatgtgta tatgtcaaca agaccagacg tgcgcaaagc aaccgggaat aaaaggtctg 300 ttcagttcgc taggatcatc aatcatcatc aaatgtgaty aagagtaagt ttcacatgtt 360 taaaatgttt taaaacagtg catgtgtgta atkaattaca gcgatttact tcagctttac 420 ttcatcagca cagccgcgtg tcagaacaat tataaaagaa gacgcttcaa tcccggtttg 480 tggacgttaa atcaggttta ttttgtacat taacataaca gatatccaca cagyastkga 540 grttagccta tcctgacaca tttgcgtgca aaaacagtgc taagctaagc gcgctctgtc 600 tgtctgcctc tgtgtgtgtg tgtgtctctg tgtgtgcgtg tgtgtgtgtg tgtgtgngtk 660 aactttgtaa cgatattgtg tgtgactcat caatgcaact kcacaatact sattrgtaaa 720 gttcttactg tagtatctca caaacgctac gtgagacctt cttcctttaa gtctgtctgt 780 tgtctgacgc agcygaggga ggaggcatgt agaaatagta ggcgggrarg actcgyctta 840 aaggcgcagt acgacaaaac maccccctgs tgraaaamwg tataaaacag satctwgtaa 900 aaggtataat gaaaaatctg atgggtrttt tgakctgaaa ctttatatac acattctaga 960 gacgcaaaag acttatatta aatctgaaaa aaggggtaac ctaggtgccc t 1011 // ID TZF28 repbase; DNA; ZEB; 1613 BP. XX AC U51227; XX DT 10-APR-1997 (Rel. 3, Created) DT 10-APR-1997 (Rel. 3, Last updated, Version 1) XX DE Danio rerio transposon Tzf.28. XX KW DNA transposon; TZF28. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1613 RA Lam L.W., Lee S.T. and Gilbert W.; RT "Active Transposition in Zebrafish."; RL Unpublished. XX RN [2] RP 1-1613 RA Lam L.W., Lee S.T. 
and Gilbert W.; RT "Direct Submission."; RL Direct Submission to Repbase Update (13-MAR-1996)Wan L. Lam, RL Molecular and Cellular Biology, Harvard University, 16 Divinity RL Avenue, Cambridge, MA 02138, USA. XX DR GenBank; U51227; Positions 1 1613. XX SQ Sequence 1613 BP; 531 A; 305 C; 337 G; 440 T; 0 other; tacagtgtat ccgcaaagta ttcatagcgc ttcacttttt ccacattttt tatgttacag 60 ccttattcca attccaaaat ggattaaatt aatttatttc atcaacattc tacccacaat 120 accccataat gacaatgtga aaaaaatttt tttttttaat tattgcataa aaaaaaaaaa 180 gctgaaaaat cacatgtaca taagtattca cagcttttgc agtgaagcta aattgagctc 240 aggtacgttc tgtttcaagt gttcattctt gaaatgtttc agacaggtta attggaattt 300 cacctgtggt aaattaagtt gatttggaca tgatttgaaa aggcatacac ctgtctatat 360 aaggtcccag ggttgacagt gcatgtcaaa gcacaaacca aacatgaaga caaaggaatt 420 gtctgcagac ctccgataca ggattgtcgt caaggcacaa ggctggggaa ggttacagaa 480 aaaatttctg ctgctctgaa agttccaatg agcacagcga cctccatcat ccatgtggaa 540 gatgtttgga accaccagga ctcttactag agctggccag ccatctaagc taagtgatca 600 ggagagaagg gccttagtta gggaggtgat caataactca atggtcactc tgtctagctc 660 cagccatctt ctatggagag aggagaacct tacagaagga caaccatctg tgcagcaatc 720 caccaatcag gcctgtatgg tagattagcc agtgttaacc actcctcgtc tggaatttgc 780 aaaaaggcat ctgaaggatt ctcacaccat aagaaacaaa attctctggt ctgatgagac 840 taaaattgaa ctctttagag tgaatgccag gcgttacttt tggagaattc aggcaccgct 900 catcaccagg ctaacaccat cactacagtg aagcatggtg gtggcatgca tcaagctgtg 960 gggatgtttt ttcagcagca tgaaatggaa gactagtcag aatagaggga agatgaatga 1020 agcaatgtac agagacatcc tgaagtgaaa accttcttca gagtaatctt gatttcagac 1080 tggggtgacg gtttatcttc cagcaggaca atgaccgaaa gcacactgcc aaaatatcag 1140 tggagtggct tcacaacaac tcgatgaata ccttaagtga accagccaga gcccagacct 1200 aaatcctttt gaatatctct gaagagatgt aaaatggctg tacaccgtcg cttcccatcc 1260 aacctgatag agtttgagag gtactgcaaa gaggaatggg cacaaattct caatgacagg 1320 tgagccaagc tgtggcatga tattcaaaaa gagttgaggc tgtagttgct gccaaagatg 1380 catcaacaaa gtattgagca aagactgtac atttttatat gaacgtgatt ttttgaccct 1440 
tttaatttta ataaatttga aacaatttca aaaaatgttt tttcacattg gcattatggg 1500 caactgtgtg tagaatgttg aggaaataaa tgaaattaat ccgtttttga ataaggtaaa 1560 aaggtggaaa aaaatgaggt agaattatga tatcactttc ccgatgcact gca 1613 // ID DIRS-10_DR repbase; DNA; ZEB; 5763 BP. XX AC . XX DT 17-JAN-2009 (Rel. 14.01, Created) DT 17-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW reverse transcriptase RNase H; DIRS-10_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5763 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 9(1), 16-16 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 3929..4693 FT /product="DIRS-10_DR_1p" FT /translation="CVDGGTVGRHNLIIRFLRGARRMVPPRPPLMPSWDLA FT VVLTSLREEPFEPLDSVSLRFLSLKTALLVALASVKRVGDLEAFSVSDSCL FT EFXPAYSHVVLRPRPGYVPKVPTTPFKDQVVSLQALPAEEADPALSLLCPV FT RALRTYVDRTQSFRSSDQLFVCYGGRQKGSAVSKRRLAHWRVDAISLAYLS FT QGEPCPPGVRAHSTRSIASSWALARGASLTDICRAAGWATPNTFARFYNLR FT MEPVSSRVLGNPW*" XX SQ Sequence 5763 BP; 1074 A; 1711 C; 1597 G; 1365 T; 16 other; ttccccttcg gatggggaac tccaatgcta tgtggaaaac cttccacaat atggggtttt 60 cgtcagaaac caatcatctg aaagagtata aaatcgggcc aatgaaatgc caaatgaatt 120 ggcagcgtca gcatgcacag ctggcgtcaa tgacaatcag tcaagtatat aagacgcggc 180 tagtgcaatg ctcgacatcc ttttcgcttt cagagccttt cactagcttc tgagagagtc 240 tttgagggtt ctccwacctg tgtctacaga gagagatcga gaagcagctt ctcccggtcc 300 agagcgcgta tacgcagtgg cagacggtcg agctgggttt ttctcccttg cctggcgttc 360 tttgggtccg gtcctccaag agcggtttgt atatagggaa aaagttttcc taaaagagca 420 acacggtttt gcagcgcgtc tttttcaaga cgtcgctccg accgtgcgtt tctggatgcg 480 gtggtttcct 
atccccggat gatgggcacg agcactgcgt ttcatgtctg ggggtccagc 540 atgttaatgc ggtgctcgcg ggcagtgcat gccgtctctt tgatgtcatg tctgctgcgc 600 agttaagatc gcggctcgct cttgcaaaag agcgacccac cccagttgtc ccccgcactg 660 cggttggcac tcgggcagat ctgaggatta cagtgggagt aaatccgccg ccttcgggct 720 gcagacctct cgctcctcca cgcgctccat ccaagcttca ggtgagaata tgcgctccct 780 cttcggatgg tcgctctctc acttgatgac accgaggatc agatgtccat cgctgcatcg 840 gaggatgggc tgtcattgtc tgatgaggat gcgaacccgc tcgctccctc cctccggggt 900 ggagagcacg gcgttggcat ctaaaaagca ggcatgatgg ccgtgctttc ccgggctgct 960 tcggccgtgg gcctggtgat ggtttatccc ccagccccgc gaccggaccg actagatggg 1020 tgttatgtgg aggattcaaa gccttccgtc cccttcttcc cggaagtgca cagtaagctc 1080 acgcagtttt gagggcactt tttttctgcc cgatctgcat gcgcttccat cctcaccatc 1140 cttggagggt gggctgtcaa tgtctgatga ggattcgaac ccgctcctcc ctccgggttg 1200 atgagcgcgg tgtcgaatct agaagcagac tttgtggccg tgctttccca ggctgcttcg 1260 gccatggctc tggagagtgg ctctggagat ggtttatccc ccagccccgc ggccggactg 1320 actagagggg tgtaaaaacc tttcttcctg gatgcgcaca gtaggcttac gcagtgtgct 1380 gcgtgcgcct ccaccctcac cttgcatgct atggccacct accagcgcta ccaagcgcag 1440 gcgctggccc agctgcggcg aggatggttc cgacccagga ctgggcatga gctccgcacc 1500 ctgggaagga cgatggccac tttagtggcc aggaacgcca cctcaggcta aatctggtga 1560 tgtgtgtgat gttgacaaag ttcgctttct taactcaccc atatcccggg ctggcctgtt 1620 cggcgacacc gtcagtgaac tcgcccagga gttcacgccg gtgatggagc agtcggaggc 1680 gatgggttat aatctatcgg cgggatcgta agaccgctcc ctcccaccga gccatccaca 1740 tccactgctt ttcgccgagg gcgctcgcct gtagcttttt gctccgccgc cccgcctgtg 1800 cctccggcca agcggctgcg ccgagcatct cgcaggcaac cagcgccccc ctgccccagg 1860 gtgccgctaa gttcggtaaa cagaccgcga agcatccctg agacgggcca tccggagagg 1920 agggaatttr ctctttcccc gctggagggt ggggctctay atttaaangc ngwaaawaaa 1980 aaaaaaaaac gccatcaaat cttcaaagag cttttttctc tttcctcgga tgtgacagcc 2040 tgaacactgc cagtttggga cgctatgctt tccagctcgc aggatcggtg catttcgcca 2100 atggctcaca gagcgcgaga gaacggtctc ctttctctcc ctctcgcagc ccctcctccg 2160 gagtttgggt gcgagaccag 
agcgagtctc tcgcctctcg ctctcccgcg ggaccccagc 2220 gctccccggg tgagcccacc cactccacgc tgcctcaccg ctggcatgtc agcgatcgtg 2280 cgggcgggtt cacttgcgag ggctctgcct gcctggttag cgcgggccag cccatcgcaa 2340 tggctcatcc gtacgatcag actcggctat gcgatacagt tcgygaaacg gctttccaag 2400 ttcacgggcg tgtatttcac caggctcagt cctgcgtccg cccctgtctt gagggacacg 2460 tatatgcatt tctccatact tcctcgcggg cgtctgcatg ctcagttctc tcgacatttg 2520 gctgatttta gcccactctc ggggacaatt gactatgcac agagacaagg tgctccggca 2580 cctccacctg ttggggtttt agatcaactc gagaaaagag ctggctcgcc cccgtgcaga 2640 gcccctcctt tctcgggttg gagctggact cgatcaccat aaaggcgtgc ctctcacgag 2700 agcgcaccga gccagtgctg gactgtctga gagagtttga cagaaaaatg tggtccccct 2760 gaaatctttt cagaggctcc tggggcatat ggcatccgtg gccgcgacct ccccgctcgg 2820 attgctccat atgagaccac cacggcattg gcttcacgat cgggtcccca gacgcgcatg 2880 gcaggcgggc acataccgag tgactccact ccgctgtgtc gcctcgcccc caccccctgg 2940 agggacccct ctttcctacg ggccagagtg cccctgggtt aggcgtccag gcatgttgtc 3000 atttygacag atgcttccag tacgggttgg ggggccgtgt gttgcgggca tgctgctgcg 3060 gacctgcgga agggaaccca gctgcattgg catatcaact gcctagagct gttgacagtg 3120 tttctcactc tgcgccgctt tttaccggcg ctgagggggc aacacgtgct ggtcaggacg 3180 gacagcacgg cgacgggagc gtatatcaac cgtatggggg atgtgcgctc ccgccgcatg 3240 tctcagctcg ctcgccgtct gctcctccgg agtcacacgt ggctgaagtc gatgcgtgct 3300 gttcacatgc cgggcgagcc caaccgtgcg gccaactggc tctcacggca gctccttgcc 3360 ccgggagaat ggcgactcca ccccgagtct gttcagctgt catgggcact gccccacagc 3420 tggcctcggg gcacgcgcaa acttgcgttt tccccagtga gcctgctcgc gcagttactg 3480 tgcaaaccca gggaggacga ggagcaggtc ttgttagttg cgcctctctg gcccaaccgg 3540 acttggattt ttgaactctc cctcctcgcg acggcccccc cctggrgggt ccctttgaga 3600 gagcacctac tctctcaggg acagggcacc atcgggcacc ctcgcccaga tctgtggaac 3660 ctccacgtgt ggtccataga cgcgaggaag acttaggtaa cctaccgatg gcggtggtta 3720 ataccgtcac tcaggctaga gcaccctcta cgaggcatgc ctatgccctg aagtggagtc 3780 tattcactga gtggtgcgct tctcgctgag aagacccccg atcttgccag atcagtgttg 3840 tgctttcttt ccttcaggac aggctggagc 
gaaggctgtc gccctccaca ctgaaggtct 3900 acgtggccgc tatttccgct catcatgatg cgtagatggc ggcaccgtgg ggaggcataa 3960 cctcatcatc cggttcctca gaggtgcgag gcgtatggtt ccaccccgcc cccctctcat 4020 gccctcttgg gacctcgcgg tagtgctaac gagcctacgt gaggagccct tcgagccact 4080 cgattcagta tccttgagat ttctgtcctt gaagacagct ctgctggtcg cgttggcatc 4140 ggttaagagg gtcggggacc tggaggcatt ttcggtcagt gactcgtgcc tagaattcrg 4200 gccggcctac tctcacgttg tcctgagacc ccggcccggc tatgtgccca aggttcctac 4260 cacgccgttt aaggatcagg tagtgagcct gcaagcgctg cccgcggagg aggcagaccc 4320 agccctttca ttattgtgtc ctgttcgcgc tttgcgaacc tatgtggacc gcactcagag 4380 ctttagatcc tctgaccagc tctttgtctg ttacggtggt cggcagaagg gaagtgccgt 4440 atctaaacgg aggttggccc actggagagt ggatgccatc tccctcgctt atctgagcca 4500 gggtgagccg tgtcccccag gggtgagagc gcactccaca cggagcatcg catcctcttg 4560 ggcgttagca cgcggcgcct ctctgacaga catttgcaga gctgcgggct gggcaacacc 4620 taatacattt gctaggtttt acaatctgcg aatggagccg gtttcctcaa gggtattagg 4680 taacccttgg tgattgagga aacaattygg ttggggtgtt gaaacacgct tgctgcgcca 4740 ttctccctaa cacggaggta cgtgcacttt ttcagctttg tcagttcagt tccccgttcg 4800 gtgaacccta cagagttcct ccgaggcccc cagcatctga ctcagcggag gagtcagacg 4860 ttggcccgtt acgttgttgg catgcccgct ggtcagccmg cccgcattgt tctgggtata 4920 ggtgcctgct atgygtgatc ccctgcgggc gatcccataa gctcactcaa ccacggttta 4980 gtcccccctt gtgttagggc gggctcgtgt cttccctccc cgctaaccat cctctttatg 5040 tacccctccc ccatttccgg ggctgggcca caggctgtca ccaggtctcc cctccttggg 5100 tagcaggtga actccgcagc gtcctcccta tcgggactga acgctttccc aacgtactgt 5160 cgtattaaaa cccttttact gggttatttt cgactccccg aaaaatatag ctaaacctga 5220 acaggtaagt aaggtragta agggccaggg gacacgttgg aagactgcat ctcgcggcgt 5280 tgtaggtgcg ctcgctctac tgcrtggcgc acccttcgcc agggacgcgg taaggtgctt 5340 tcgttgtggc gttttccata gatttcccca tattgtggaa ggttttccac atagcattgg 5400 agttccccat ccgaagggga acgctacggt tactaaagta accctcgttc cccgaggggg 5460 ggaacggaaa tgctatgtac cttcgccaca acgattgtcc cttagctgtt gagcgtgaaa 5520 gtctcctcag ctcaaaagga tgtcgagcat tgcactagcc 
gcgtcttata tacttgactg 5580 attgtcattg acgccagctg tgcatgctga cgctgccaat tcatttggca tttcattggc 5640 ccgattttat actctttcag atgattggtt tctgacgaaa accccatatt gtggaaggtt 5700 ttccacatag catttccgtt ccccccctcg gggaacgagg gttactttag taaccgtagc 5760 gtt 5763 // ID DIRS-4N2-LTR_DR repbase; DNA; ZEB; 387 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DIRS-like LTR retrotransposon family , LTR- a DE consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-4_DR; DIRS-4N2-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-387 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(10), 1270-1270 (2008). XX DR [1] (Consensus) XX CC The element is the solo 3'- LTR portion of an assumed DIRS LTR CC retrotransposon, derived from recombination. It contains two CC split LTRs which show similarity to those of DIRS-4_DR. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 387 BP; 108 A; 107 C; 73 G; 99 T; 0 other; gtgatgtttt ataaacaaat ttcgggagga gcacgatcag tctttgacga ggctaattca 60 tcacacacac aatcattcta ttatccaatc agctcaagag caaaccccta taaatagtca 120 aacacgtcat acctccgttt tctcttgact tcagcgtccc tccaccaccc caactccaca 180 ctattaaacc agatacctat ttaaatctga ggggggagcg ttctggagcc gggctagaca 240 ctgcgctcgg accctatctc tctttatcct gataagggga ataacacgag ttagggtgtc 300 ttcccgagct cagagccctc tccccggaca gcacgccaaa tacgcatatt ctattcagtc 360 aaatatctgt gagtgtgaac tcgtgaa 387 // ID CR1-26_DR repbase; DNA; ZEB; 3390 BP. XX AC . XX DT 25-NOV-2008 (Rel. 13.11, Created) DT 25-NOV-2008 (Rel. 
13.11, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-26_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3390 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 8(11), 1700-1700 (2008). XX DR [1] (Consensus) XX CC The 5'-part is incomplete. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1..2877 FT /product="CR1-26_DR_1p" FT /translation="SWRPSWQTCLLAPHINVVFLLFFVCFLSFPRCIHAMH FT YQHIADKLAGSCNLRQTFRIYQFYSTGLTTHCPLPQPLQRRRRKRHKRGKR FT GGLHARLKSRANRPPLPSILLANVRSLDNKLDELRARITSQREVRECCALI FT FTETWLSEKVPGTAVQLQTHSLHRGDRTTASGKAKGGGVCVFINNSWCGDV FT QTVHKHCSPDVEFLLLKCRPYYLPREFTAVFIAAVYIPPRANATAALGKLF FT NVFNAQEMAHPDAVIIAAGDFNQCNLRTVLPKYHQHVSIPTRENNTLDNVY FT SNIRGAYRAAPRPHFGQSDHISLFLYPAYRQRLKQTNPITKLVKIWNPQTE FT STLQDCFALTDWDVFKTAATKEDLSVNVQDYAEYVTGYISTCVENIIPTIQ FT VKKFPNQKPWINSKVRHMLNARSLAFTSGNENEYKSAKYGLRSAIKEAKRQ FT YKEKLDNTYSTASAGQMWRGLQHITDYKSTTNVTISASESLPDDLNTFYAR FT FESSSINTEQRHTQTQTSQPALPPPXVTPAAVHKALRKINPRKAAGPDNIP FT GQALKACASELVDVFTSIFNLSLSQSSVPTCFKTTTIVPLPKKSPLTCLND FT YRPIALTPIIAKCFERVVLPLIQSSIPDTLDPLQYAYRTNRSTSDAIAAAL FT HTSLSHLEDKDTYIRMLFIDYSSAFNTVIPHKLTHKLSELGLHPTLCDWLL FT DFLTGRPQSVRIGNKSSSTILTNIGTPQGCVLSPILYTLFTHDCVASHKNN FT TIIKFADDTAVIGCITGGDEAAYRKEVASLVTWCENNNLTLNTDKTKEMIV FT DMRKERRTHQPLFIRKLEVERVSSFKYLGVHISEDLTWTLNTTQLVKKAQQ FT RLYFLRKLRKLGLSSKILSNFYSCVVESILTNCITVWYGNATEKDRKRLQR FT VVRTAEKIIRSPLPSLQTIYHHRVHKRTASILKDPTHPQHGYSHSYPQGGG FT IGV*" XX SQ Sequence 3390 BP; 989 A; 878 C; 678 G; 844 T; 1 other; tcatggcgcc cctcgtggca gacgtgtttg cttgctccgc acataaatgt cgtgttttta 60 ttgttttttg 
tttgtttttt gtcttttccg cggtgcatac atgcaatgca ctatcaacat 120 atagcagaca aacttgctgg atcttgcaac ctccgccaaa cttttcgaat ttaccagttt 180 tattccaccg gacttacaac acattgtccg cttccacaac cccttcaaag gcggcgacgc 240 aaacggcaca agcgtggtaa gaggggagga cttcatgcta ggctaaagag ccgtgctaac 300 cgaccaccgc tacctagcat cttgctggct aacgtgcggt ctctggacaa caaactggat 360 gagctaagag caaggattac atcgcaacgg gaagtaagag aatgctgcgc tctgattttc 420 acagaaacgt ggctctccga gaaagtccca ggaaccgctg ttcagctaca gacccattca 480 ttacacagag gagaccggac cacagcctcc ggtaaggcta aaggaggagg tgtgtgcgtg 540 tttattaata actcgtggtg tggagacgta cagactgttc ataagcactg ctcgccagac 600 gtggagtttc tactgctgaa atgccgcccc tattatctac caagggaatt tactgccgtg 660 ttcatcgccg ctgtttacat ccctccgcgg gcgaacgcta cagcagcact cggcaaactt 720 ttcaatgttt tcaacgcaca agaaatggca catcctgatg cggttattat cgctgcgggc 780 gactttaacc agtgtaactt acggactgta cttcccaaat atcaccaaca tgtgagtatt 840 cccactcgtg aaaataacac actggacaat gtttacagta acatacgcgg tgcatacaga 900 gctgcccccc gcccccactt tggtcagtca gaccacatct ccttgttttt gtatccagct 960 tacagacaaa gactgaagca aacaaaccca atcactaaac tggttaaaat ctggaatcca 1020 cagacagaga gcacccttca ggactgtttt gctcttacag actgggatgt gtttaaaact 1080 gcagccacca aggaggactt gtctgttaat gtacaggact atgctgagta tgtgactggg 1140 tatatcagca cttgtgttga aaacatcata cccaccatac aagtcaagaa gttccccaac 1200 cagaagccct ggataaacag caaggtgcgt cacatgctga atgctcgttc tctggcattt 1260 acatcaggca atgagaatga gtacaaatct gcaaaatatg gactgagaag tgccatcaaa 1320 gaggctaaga ggcagtataa agagaaactg gataacacct actccactgc ctcagctgga 1380 caaatgtggc gaggcctgca gcacatcaca gactacaaga gcaccacaaa tgtcacaatc 1440 agtgcctcag aaagcttgcc tgacgacctc aatacatttt atgcccgctt tgagtcctcc 1500 agcatcaaca cagagcagag acacacacaa actcaaactt cccaacctgc cctcccccct 1560 cctgyagtga caccagctgc agtacacaaa gcactgagaa aaatcaaccc ccgcaaagca 1620 gccggacctg acaacatccc gggacaggcc ctcaaggctt gtgcttcaga gctggttgat 1680 gttttcacct ccatctttaa cctttccctt agtcaaagct ctgtcccaac ttgcttcaaa 1740 accacaacca tcgtccccct tcctaaaaag 
agccctctga cctgtctgaa tgattacagg 1800 ccaatagcac tcactccaat cattgccaaa tgttttgaga gagtggtact acccctcatt 1860 cagagcagta taccagacac tttggacccc ctgcagtatg cataccggac caataggtcc 1920 acctcagatg ccattgctgc tgcactacat acttccctct ctcacctgga agataaagac 1980 acctatatca ggatgctttt tatcgattac agttccgcat tcaacacggt tatcccccat 2040 aaactcaccc acaagctgtc tgaactcgga ttacacccca cactctgtga ctggctctta 2100 gatttcctca ctggcagacc gcaatctgtc aggattggaa ataaaagctc aagcaccatc 2160 ctcaccaaca tcggcacccc acagggatgt gttttaagcc ccatcctcta cactttattc 2220 acacatgact gtgtcgcatc tcacaagaac aacaccatca ttaagtttgc ggatgacact 2280 gcagtgatag gctgtatcac tgggggagat gaggcagctt ataggaagga ggtggccagt 2340 ctagtgacat ggtgtgaaaa caacaacctc accctcaaca cagacaagac caaggagatg 2400 atagtggaca tgaggaagga aaggagaact catcagccac tgtttattcg caaacttgaa 2460 gtggaaagag tgagcagttt taaatacctg ggggtccaca tcagtgagga cctcacctgg 2520 acactgaaca ccacccagct ggtcaagaaa gcacaacagc ggctgtactt cttaaggaag 2580 ctaaggaaac tcggtctgtc atctaagatc ctcagcaact tttacagctg tgtggttgag 2640 agcatcctga ccaactgcat tactgtatgg tatggaaacg ctactgaaaa ggaccgcaaa 2700 cgtctgcaga gagtggtgag gactgcagag aagatcatta ggtccccact gccttctctg 2760 cagactatct accatcacag agtccacaag agaactgcct ccatcctgaa agaccccact 2820 catccacaac acggttattc acactcctac cctcagggcg gaggtatagg agtgtgagat 2880 gcaggactgc cagactcaag aactctttct tcccatcagc catcagactt ctaaacagat 2940 aaccaacgca cataacagtc tatttttctg ctcagtacaa cactacatac tccatcccat 3000 tttgcactat tttattcttt attctttttt tttgcacata atccaattgc actaggcact 3060 tttttgcata ataagcacaa taaaaaaaaa aaaacaaaac aaaacaaaaa acactgtaca 3120 ctgtttacat ctgtttcatt actcaaatca ggtttacata ttgtttactt tcatagatat 3180 ttatatactt acattttaca atcaatcttc agtcacttct gtgtatatat gtgtatttat 3240 gtgtatgttg tgtatgatgt gtatgttgtg tgtatgactt cactgtggac ggcaaagtaa 3300 gaatctcatt gtacagggag acgtgtttcc ttactgtgca catgacaata aacagttgaa 3360 ttgaattgaa ttgaattgaa ttgaattgaa 3390 // ID Gypsy-35-LTR_DR repbase; DNA; ZEB; 929 BP. XX AC . 
XX DT 08-APR-2009 (Rel. 14.04, Created) DT 08-APR-2009 (Rel. 14.04, Last updated, Version 1) XX DE Solo-LTR of the Gypsy LTR retrotransposon; a consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR; Interspersed repeat; GYPSY superfamily; Gypsy-35-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-929 RA Bao W. and Jurka J.; RT "a family of Solo-LTR from zebrafish."; RL Repbase Reports 9(4), 855-855 (2009). XX DR [1] (Consensus) XX CC The internal portion of this Solo-LTR is not identified. XX SQ Sequence 929 BP; 178 A; 186 C; 180 G; 385 T; 0 other; tgttacggtc cgtgggagtt tgtatttttg tgccttctcc ctctctgttt tgcttgtctc 60 ccttcgctcc cttcgcatct ccttgtctgt cattcgctct ttgtctcgcc ctatcacctt 120 ttgttccgct actttttctc tactaggtac ctgctgattg gatcccgagc ctgtcactcc 180 tcattagggc gtttcctcgt ggaccaattg gacttcgttc cgcggacttt aaattcgacc 240 ggcacgcctt gtttttcaga gtgctatgtg tgtgcgtgtt ggtttgtgtt cgctcgctgc 300 tgttcataat gtatgctttg ttctgtgttt ctgaataatt acatcgatgt aaattaagtc 360 atggttaaag agtttatcat tcctttttgg ggaaaaggga atagtttact aagtcgcgcg 420 acatacattt tgcccgtagg taacttcaat gttcagaaac actaggtaag tgggtgagcg 480 ccgcattttg tatttttgct tattatttca gggagtttag gtgtggcgtc gcaaacgctg 540 aagatttcgg tttctttttc acattttgat ttttattagt aagtttagag ggggatcttg 600 tgttctagtt taggtggctt ttggttttgt ttagttcttt gctttggcgc cactctagtt 660 cctttttccc ctaaaacttt tgattgtatt tcttcttttt ttcttcttct tcttcttctt 720 attattatta ttatatatac attgtacaaa gcgcacacat gtaagcctct cttttgttgg 780 cttcattttt cttctcctcg taagaggaat aaacgaattg aatgtaaatt ttggtgtcct 840 ggtgtatttt ccatccatct actcagtaat aatttaatat tataagtgtt acgtttattt 900 tccctagaca acttaattaa accgtaaca 929 // ID Gypsy114-LTR_DR repbase; DNA; ZEB; 723 BP. XX AC chr20; XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 10-OCT-2008 (Rel. 
13.1, Last updated, Version 1) XX DE LTR retrotransposon from zebrafish: long terminal repeat. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Gypsy114-I_DR; KW Gypsy114-LTR_DR; Gypsy114_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-723 RA Jurka J. and Kohany O.; RT "LTR retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1518-1518 (2008). XX DR Genome; chr20; Positions 11146997 11146275. XX SQ Sequence 723 BP; 205 A; 102 C; 161 G; 255 T; 0 other; tgtaacgaag ttataatttt cattgtgaat cctcctgaat tatgcagcag ttatgtgttt 60 tatagagtaa gggcgccctc atgtggtttg tttgggtacc gcagacatac aagttaggca 120 ggaagtgacc tagcgcagca gatgctgtga gacagccatg ctgtgggtag gtgtttttca 180 tgctccgtca tagtttatta atattagctt aagaaagtag cgatttcaaa catttagttt 240 ataatttggt tatattatca ttaagcatta agtttgactt gtgacatgta gtttttatgg 300 ctttttataa atgtctactg gattatgctc tttagcaagg ccatgcggtt gatttgattt 360 tctatttcat ttagatgttg tatgtctgca gcaagcaatc cactaataag ggctgtactg 420 tttattattt caggtatgtc aatgtttatt tgttaattgt gaatgttacg ttttatgtta 480 aatgttcata tgattatcag ttaatatatg gatattaatg atgtaatagg cagcagatgc 540 tgtgagacag ccatgctgtg gatgttgtat gtctgcagca agcaatccac taataagggc 600 tgtactgttt attatttcag tatatcagta aaaaggagta atcaaatact gtgtccgggc 660 cttcattaaa gagcattgca cacaacgcag aggaagaaga gttgaaggac cgaagacgtt 720 aca 723 // ID DIRS1_DR repbase; DNA; ZEB; 6132 BP. XX AC . XX DT 11-FEB-2002 (Rel. 7.01, Created) DT 11-FEB-2002 (Rel. 7.01, Last updated, Version 1) XX DE DIRS1_DR is a DIRS-like LTR retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW DIRS superfamily; DIRS1; DIRS1_DR; endogenous retrovirus; KW phage integrase; reverse transcriptase RNase H. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 414-5132 RA Jekosch K.; RT "DIRSDR1: putative non-LTR retrotransposon."; RL Repbase Reports 2(2), 9-9 (2002). XX RN [2] RP 1-6132 RA Kapitonov V.V. and Jurka J.; RT "DIRS1_DR, a family of DIRS-like endogenous retroviruses in RT zebrafish."; RL Repbase Reports 3(1), 1-1 (2003). XX DR [2] (Consensus) XX CC DIRS1_DR is a family of DIRS1-like retrotransposons. These CC elements CC are related to gypsy-like LTR retrotransposons and endogenous CC retroviruses. CC There are ~100 copies of DIRS1a_DR in the genome, they are ~0.3% CC divergent from the consensus sequence. Therefore, this family CC retrotransposed in the zebrafish genome very recently. CC The unusual structure of DIRS1_DR is depicted in the next figure. CC GTTCCCCTTCGGTTGGGGAACTTCAGTGCCATGAATGGGAGGATTCGGATCAGAAGCCGCTTATCTGGAG CC <====== ======> <--------------------------------------------- CC AGTATTGAACGGGCCAATGAATGAAATTAATTGGCAGCGTAAGCTTGCGCAGGTGTGCGACATCTGCAAT CC ---------------------------------------------------------------------- CC TATCTCAGCATATAAGCACACCTGAAGCCAGCAGACGCCATCCTTTTCGCTTCAGATCCTTTCTGAGTGA CC ----------------------------------------------- CC ...................................................................... CC ...................................................................... 
CC GGTGCAGTCATTATGGCGCTTTCCATATTCTCCCATTCATGGCACTGAAGTTCCCCAACCGAAGGGGAAC CC <====== ======> CC <~~ CC GTTCGAGGTTACAGAAGTAACCCTTCGTTCCCCGAGGAGGGGAACGGAAGTGCCATATTCCGTCGCCATA CC ~~~~~~~~~~~~~~~~~~~~~~~~~~<====== ======> CC ATGACTGTCCCTTAGCTGTTTGAAAGTCTCTTCAGCTT CC AAAAGGATGGCGTCTGCTGGCTTCAGGTGTGCTTATATGCTGAGATAATTGCAGATGTCGCACACCTGCG CC ---------------------------------------------------------------------- CC CAAGCTTACGCTGCCAATTAATTTCATTCATTGGCCCGTTCAATACTCTCCAGATAAGCGGCTTCTGATC CC ---------------------------------------------------------------------- CC CGAATCCTCCCATTCATGGCACTTCCGTTCCCCTCCTCGGGGAACgaagggttacttctgtaacctcgaacgtt CC ----------------------> <====== CC ======>~~~~~~~~~~~~~~~~~~~~~~~~~~~~> CC Fig.1 Termini of DIRS1_DR. The 163-bp sub-terminal inverted CC repeats are CC underlined by a single line. CC DIRS1_DR encodes three ORFs. ORF1 (positions 414-1632) codes for CC the CC gag-like protein. ORF2 (positions 1633-2597) codes for reverse CC transcriptase and RNase H. ORF3 (positions 2598-5129) codes for CC the CC phage integrase. 
XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="ORF3p" FT /translation="MRSSSGLVCSHRPEGRVFPCLHSSTPPPISAVCVRGS FT SVAVQGPPLRALSVSAGLHQTRGGCPSAPSARGHSHTQLSRRLADFSPLAG FT AIDYAQGRGASASPPTGASGQPRKEQTRPRAEDFFSRDGAGLDHHGSAPLR FT GTRSPVAELSEGARQQTSGPTEVLSEAPGAYGIRSRRHAARVAPYETTSAL FT ASRSGPQTRMARGHTPGLGYCAVSPRPQPLERPLVPTGRCASRTGVQPCCC FT FNRRFQHGLGGRVSRACGCGPLEGCPAALAYQSPRAVGSVPRSPPLFTGAG FT AATRAGQDGQYGGGGVYQPHGGYALSPHVSARPPSAPLESPAAEIAARHSR FT PRHAQSCSRCALTTAVTPWRMETPPRVCSADMGAIRGGPDRSVCFPRERSL FT PVVFFPDRGLSRHGCTGPQLASGHAQVCVSPSEPARAVSVQGQGGRGTGSA FT SCAPLAQPDLDIRALTPRDGPPLADPFERGPTLSGTGHHLAPSPRSLEPPR FT VVPRREEDLGNLPTAVVNTITQARAPSTRRAYALKWSLFTEWCVSRREDPR FT NCQISVVLSFLQEKLDSRLSPSTLKVYVAAISAYHSAVAGGTVGKHNLVIQ FT FLRGARRINPSRPPLMPSWDLALVLTSLRSDPFEPLESVSLRFLSLKTALL FT VALASIKRVGDLEAFSVSDSCLEFGPDYSHVILRPRPGYVPKVPTTPFRDQ FT VVNLQALPPEEADPALSLLCPVRALRIYVDRTQNFRSSEQLFVCYGGRQQG FT SAVSKQRLSHWIVDAISLAYSSRGQPCPPGVRAHSTRSVASSWARARGASL FT TDICRAAGWATPNTFARFYNLRVEPVSSRVLGNPLVIEETTR" XX SQ Sequence 6132 BP; 1117 A; 1898 C; 1706 G; 1411 T; 0 other; gttccccttc ggttggggaa cttcagtgcc atgaatggga ggattcggat cagaagccgc 60 ttatctggag agtattgaac gggccaatga atgaaattaa ttggcagcgt aagcttgcgc 120 aggtgtgcga catctgcaat tatctcagca tataagcaca cctgaagcca gcagacgcca 180 tccttttcgc ttcagatcct ttctgagtga gtcgatgagg gttcctcttg ctgatcagca 240 cttcagagcg aacgagtgtg tctcccggtc cagagtgggt cttcgcggtg gcagacggtc 300 gagctgggtt actcccttgc ctgcggttct ttgggtccgg tcctccagag cggtgcgtat 360 agttgcaact ttcctaaaag agcaacacag tcgtgcagca cgtccttttc aggatggcgc 420 tccgactgtg cgtttctgga tgcgggggtt tcctgtctcc ggatgatgga cacgatcact 480 gcattgcatg tttgggggtc cagcatgtta atgcggtgct cgcgggcggt tcatgtcgtc 540 attgcgatgc catgaccgtt gcacagctaa gatcgcggct aactttcgca agagagcgag 600 ccaccccagt tgcctcctgt tctaaaaaag cagcgggcgc tcgggcagat ctgagggttt 660 cagcgggagc taatccgccg cccacgggct cgcggacctc tcgctcctca cggcgctcca 720 tccaagcttc gggtggtgag agtgatccgt ctaaccagat ggtagctctc acactcgctg 780 acaccggaga tcagatgtcc tccgcggcat 
cggagggtgg gctttcactg tccgacgaag 840 atccggaccc gctcgccccc tccgggcagg tgagcgctgt caaatcggat cctgaagcgg 900 acatgttagc cgtgctttcc cgggctgctt cggccgtggg gttggagatg gtttatcccc 960 cagctccgcg gccggaccga ctagatgggt gctacgtaga ggaccagaag gcgaagcctt 1020 cgaagcctct cgtccccttc ttcccggaag tgcacagtag gctcacgcag tcctggaggg 1080 cacctttctc tgcccgtgct gcgagtgcct ccgccctcac cgcccttgac ggcggagctg 1140 ccagggggta tgaggcgatc ccgtcagtgg agcgcgctat cgcggtcaat ctttgtccgc 1200 gcggcgcctc tacgtggcgg ggtttgcccc gcctcccgtc caaagcctgt aggttgtctg 1260 cctccctcgg agccagagct tataaggctg cgggccaggc tgcttctgct ttgcacgcga 1320 tggccaccta ccagcgctac caagcgcagg cgctggccga gctgcacgag ggcgggtcca 1380 acccaagctt attacatgag ctgcgcaccg cgaccgacta tgctcttcgg actactaagt 1440 ccgccgcgtg tgcgctgggg aggacgatgt ccacacttgt ggttcaggaa cgccacctct 1500 ggctaaacct ggccgatatg cgcgacgttg acaaagttcg ctttcttgac tcgcccatat 1560 cccaggctgg cctgttcggc gacaccgtcg gtgaattcac ccaggaattc aaggcggtga 1620 aagagcagtc ggatgcgatg ggcaatgtca tctatcggcg tggccgtaag cccgctccgc 1680 ccgccgagcc atccacctcc gctgttcctc gccgagggcg cccgccaacg agtgctgccc 1740 cgcccccgcc tgcgcctccg gccaagcggg cgcggcgttc acctcgaaag caggcagccc 1800 ctcctgccca gggcgccgtt aagtccggta aacggaccgc gaagcgtccc tgagacaggc 1860 catccggaga agaggaaact tgctctttcc ccgctggagg gcggggcccc gataacaacg 1920 gtacttttca gtgccaccaa aacatcagta aaagagcact ttttcccttc cccggatgtg 1980 actgcacgag ttctgccagt ccgggacgcg ctgccttccg gctcgcagac tctacgtgct 2040 tcgccagtgg ctcacgagcg ctggggggac ggtctccctt ccctcagccc tccagccccc 2100 tctccggagt cagggtgcgg agccagagcg aatcgctctc ctccagcttt tccgcgggac 2160 cctcgtgctt cccggatcag cacacccact ccgcgctgcc ccaccgctgg tacgtcagcg 2220 attgtagcga tgactccatt agcgagggct ctgcctgcct ggttagcgcg ggccagcccc 2280 tcgcggtggc tcatacgcac aatcagactc ggttacgcga ttcagttcgc gaaacggccc 2340 cccaagttta cgggcgtgta tttctccagg gtcaaccccc tgtccgcccc tgtcttgcga 2400 gaggagattg ctgccctcct ggcgaagggt gcaatcgagc cggttcctcc agccgagatg 2460 gagagtgggt tttacagccc atacttcatc gtacccaaaa 
agagcggtgg gtcacggcca 2520 atcctagatc tgcgcgtttt gaaccgctgt ctgcacaagc tgccgttcag aatgctcacg 2580 cagaggcgca ttctccaatg cgttcgtcct cgggattggt ttgcagccat agacctgaag 2640 gacgcgtatt tccatgtctc cattcttcca cgccaccgcc aatttctgcg gtttgcgttc 2700 gagggtcgag cgtggcagta caaggtcctc cccttcgggc tctctctgtc tccgcgggtc 2760 ttcaccaaac tcgcggaggg tgccctagcg ccccttcggc tcgcgggcat tcgcatactc 2820 agttatctcg acgactggct gattttagcc cactcgcggg agcaattgat tatgcacagg 2880 gacgaggtgc ttcggcatct ccgcctactg gggcttcagg tcaaccgaga aaagagcaaa 2940 ctcgcccccg tgcagaggat ttcttttctc gggatggagc tggactcgat caccatggta 3000 gcgcacctct ccgaggaacg cgctcgcctg ttgctgaact gtctgaggga gctcgacagc 3060 aaactagtgg tcccactgaa gttctttcag aggctcctgg ggcatatggc atccgcagcc 3120 gccgtcacgc cgctcgggtt gctccatatg agaccacttc agcactggct tcacgatcgg 3180 gtccccagac gcgcatggca cgcgggcaca caccgggtct cggttactgc gctgtgtcgc 3240 cgcgccctca gcccttggaa cgacccctcg ttcctacagg ccggtgtgcc tctaggacag 3300 gcgtccagcc atgttgttgt ttcaacagac gcttccaaca cgggttgggg ggccgtgtgt 3360 cgcgggcatg cggctgcggg cctctggaag ggtgcccagc tgcattggca tatcaatcgc 3420 ctagagctgt tggcagtgtt cctcgctctc caccgctttt taccggtgct ggagcggcaa 3480 cacgtgctgg tcaggacgga cagtacggcg gcggcggcgt atatcaaccg catggggggt 3540 atgcgctctc gccgcatgtc tcagctcgcc cgccgtctgc tcctctggag tcacccgcgg 3600 ctgaaatcgc tgcgcgccat tcacgtccca ggcacgctca atcgtgcagc cgatgcgctc 3660 tcacgacagc tgttacgccc tggagaatgg agactccacc ccgagtctgt tcagctgata 3720 tgggcgcgat tcggggaggc ccagatcgat ctgtttgctt cccccgagaa cgctcactgc 3780 cagttgtttt tttccctgac cgagggctct ctcggcacgg atgcactggc ccacagctgg 3840 cctcggggca tgcgcaagta tgcgtttccc ccagtgagcc tgctcgcgca gtttctgtgc 3900 aaggtcaggg aggacgagga acaggttctg ctagttgcgc ccctttggcc caaccggacc 3960 tggatatcag agctctcact cctcgcgacg gccctcccct ggcggatccc tttgagagag 4020 gacctactct ctcagggaca gggcaccatc tggcaccctc gccccgatct ttggaacctc 4080 cacgtgtggt ccctagacgc gaggaagact taggtaacct accgactgcg gtggttaata 4140 ccatcactca ggctagagcc ccctccacga ggcgcgccta cgccctgaag 
tggagtctat 4200 tcactgaatg gtgcgtctct cgcagagaag acccccgaaa ttgccagatt agtgttgtgc 4260 tctctttcct tcaagagaag ttggacagca ggctgtcgcc ctccactctc aaggtttacg 4320 tggccgccat ctccgcttat catagcgcgg tagctggcgg caccgtggga aagcataacc 4380 tggtcatcca gttccttagg ggtgctaggc gaattaatcc atctcgcccc cctctcatgc 4440 cctcttggga tctcgccctc gttctcacga gtctgcgatc cgatcccttt gagccactcg 4500 aatcagtatc tctaagattt ctgtccctga agacagctct gctggttgcg ttggcctcca 4560 tcaagagggt cggggacctg gaggcatttt cggtcagtga ctcgtgcctg gaattcgggc 4620 cggattactc tcacgttatc ctgagacccc gccccggtta tgtgcccaag gttcctacca 4680 ccccctttag agatcaggta gtgaacctgc aagcgctgcc cccggaggag gcagacccag 4740 ccctttcttt actttgtcca gttcgcgctc tgcgcattta tgtggaccgt actcagaatt 4800 ttagatcatc tgagcagctc tttgtctgtt atggcggtcg gcagcaggga agtgccgtat 4860 cgaaacaaag attatcccac tggattgtgg atgccatttc actcgcttat tcgagtcgag 4920 gtcagccgtg tcccccggga gtacgtgcac actccactcg gagcgttgca tcctcttggg 4980 cgcgtgcacg cggcgcctct ctaacagaca tctgtagagc tgcgggctgg gcgacaccca 5040 acacatttgc aaggttttac aatctgcgag tggagccggt ttcctcaagg gtattaggta 5100 accctttggt gattgaggag acaactcggt agggtgttga aacacgcttg ctgcgccatt 5160 ctccctaaca cggaggtacg tgcgcctttt ttatctgtca gtaaagttcc ccgtcaggtg 5220 agccctgcag attcctccgt ggcccccagc actgactcag cggaggagtc acttgctggc 5280 ccactacgtt gtaggtctgc ccgctggtca gcccgcgttt tgggtatagg tgcctgctat 5340 gcgtgatccc cactaggcga tcccatatgc ttattccgcc acggttaagt cccccccctg 5400 ggcggacccg tgtcttccct ctccgctaac cactcttttg ctatgcgtac tccccctttt 5460 tagggctagt ccataggtaa attctgccat ctatcccccc cttgggtaac ggatggcctc 5520 cgcagcgtcc tccctatcgg gattgcacgc ttcccaacgt actgtcgtat ttcctagaat 5580 tatctagatg ctcacgactt cccaaaaaat atatctaaat ccgtaaaact tctgttgaag 5640 taggataaat tagggccagg gacacgttgg aggaccgcgc cccccatgat gtgggtgcgt 5700 cacgcttgct tgactatctc ctcatcgggg gtgttggtaa ggtgcagtca ttatggcgct 5760 ttccatattc tcccattcat ggcactgaag ttccccaacc gaaggggaac gttcgaggtt 5820 acagaagtaa cccttcgttc cccgaggagg ggaacggaag tgccatattc cgtcgccata 
5880 atgactgtcc cttagctgtt tgaaagtctc ttcagcttaa aaggatggcg tctgctggct 5940 tcaggtgtgc ttatatgctg agataattgc agatgtcgca cacctgcgca agcttacgct 6000 gccaattaat ttcattcatt ggcccgttca atactctcca gataagcggc ttctgatccg 6060 aatcctccca ttcatggcac ttccgttccc ctcctcgggg aacgaagggt tacttctgta 6120 acctcgaacg tt 6132 // ID TDR12 repbase; DNA; ZEB; 242 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR12. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-242 RA Jurka J. and Drazkiewicz A.; RT "TDR12: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 22-22 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. XX SQ Sequence 242 BP; 84 A; 36 C; 41 G; 81 T; 0 other; tatagaccat tttgagggat gtaaacaaaa acaatggtcc caacgtattt cctgttttac 60 atttttaatt tctatagctt ccgagaatcc aaaaagagcc acatattgat aaataatgtt 120 atgatagctg ttttaacatt aagttatgat tgaattgcct cttgttacag ttatgaaata 180 gtttgataac aagcaggaaa tgttcatggg ccaatgacat gaccacattg aaatggtcta 240 ta 242 // ID CR1DR1 repbase; DNA; ZEB; 3984 BP. XX AC AL591176; XX DT 04-MAR-2002 (Rel. 4, Created) DT 04-MAR-2002 (Rel. 4, Last updated, Version 1) XX DE CR1 Danio rerio 1. XX KW retrotransposon; non-LTR; CR1; CR1DR1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3984 RA Jekosch K.; RT "CR1DR1: CR1-like repeat from Danio rerio."; RL Repbase Reports 2(2), 7-7 (2002). XX DR [1] (Consensus) XX CC Putative novel non-LTR retrotransposon similar to CR1-like CC retrotransposons. CC Contains one reading frame (pos. 1-3984) with two stops. 
XX SQ Sequence 3984 BP; 1215 A; 949 C; 711 G; 1109 T; 0 other; atgccgcttc cgtctctgtc cttgtgtgca ggagaagcat cgatggaggc gttggagctg 60 gagctggaag aagtggagtc ccagattcgc gcgctggtgg tgagaaggtc gctgctacgg 120 gaacaacttc gtgttgtacc taatgctaag gccgtctcat cacctaaggt acgtggaaat 180 tacaaccaca tcattccctc tacctcaacc ccgcgtcctt ctctgtccag gcccagcgca 240 cccggggcgc ggctcagcca ggcgtcgttc acgccgacac ccggctacca cggcgcctgg 300 gtgcagccgc gcaaggtgct tcccagatcc cggggcagaa cgtctcctcc tgtgttcgag 360 atctccacgg agaaccgctt ctcccctctc cgcgagtcgg gtcccgatgt ggccatcatc 420 ggtgactcga tcgttcgtca cgtccgtgcc gcctcctcaa aaggtaataa agtacgtact 480 ttctgctttc ctggtacccg tgtgagaaat atttctacac agattccaac catcctgggc 540 gctgccgaga gccctggtgc cgttgtcctc cacgtgggga caaacgacac cgggctccgg 600 cagtcggaga tcctaaagaa ggacttcagg agcctgatcg agacggtacg acgcacctcg 660 cccgccacgc agatcatcgt ttctgggccg cttcctacct accgccgagg aaatgaaagg 720 ttcagtagac ttttagctct gaatgaatgg ctaataacat agtgtaaaga acaaaaattg 780 ctctttgcta ataactggaa tcttttctgg gagcgtccta ggctcttccg tcctgacggc 840 ctgcacccca gtcgagccgg agctgaactc ctgtcggaca acatctccag attacttcgc 900 accatctgac tagcaggtaa aaattcacac tatagccacc tagactcttg ttcaccccac 960 ttaaacatca gtaacgcata tctggcgaat cctatagaga ctgtgtctgt tcctcgtatt 1020 attagattaa gaaataaacg tactgtgtgc tccagaaaaa atctattaag aatcaaacca 1080 gaaaaaccag tagaaagtga aaatacaaat ttcgtaaaac ttggtctcct aaacatcagg 1140 tcacttgcac ctaaagcact tatcattaat gaaataataa cagaaaacaa tgttaatgca 1200 ctctgtctca ctgaaacctg gctgaaacaa aatgactata ttagcttaaa tgaagcaact 1260 cctccaggat tcttatataa acatgaggct cgtcaaactg gtcgtggtgg tggagttcca 1320 tcaatcttta gtgatttcct taatattaaa cagagaaacg gacttatgtt tagctccttt 1380 gaagtattat cgcttaatgt tcagcttcca gatactatac aaaaacctat gttatctctc 1440 gctttaatca ccatatatag acccccagga ccctatgtca aatttctaaa agaattttct 1500 gattttattt ctgacttact agtcaaaact gataaaatgc taattgtagg tgactttaac 1560 atccacatag atgacgctaa tgatacatta ggactcgcgt ttatggattt aatacactca 1620 cttgggataa agcaaaacgt tgtgggccca 
acccatcgct taaagcatac attagatcta 1680 attctgtctt atggaatcga ggttattgac gaagacatta taccacaaag tgatgatatt 1740 acagatcact acctcttact atataagctg tgtttacctg aaatcagcaa acccgctcca 1800 atactccgcc ctagtagaac tattgttccg tcaactaaag atgaatttat caataactta 1860 cctgatcttt ctctatttcg taatgcacct gcaaactcaa atgatcttga tgtagtaacc 1920 agcagtatgg atgccatctt tactagcaca ctaaatactg tggcacccat caaattaaaa 1980 aaggctagag agattaaaac tataccatgg tataatagtc atactcgtgc gctcaaaaca 2040 gcaacccgtg ccctggaacg taaatggaaa aaaacaaatt tagaggtctt tagaattgcg 2100 tacaaagaca gtatgtccag ctataggagg gctctaaaat ctgccaggac cgagcacctg 2160 cgcaaactga tagaaaataa tcataacaat cctagatttt tatttaacac catctctaaa 2220 ttagctaata atcggtcatc cttggaacaa actactccac cgcaaattag tagtgatgac 2280 ttcatgaatt ttttcagtaa taaaatagaa ggctttagac agaaaatagg agatgccaaa 2340 ctttctgcac cggcttatac tccaaatcct gtaaatattt cattaaatca taataataac 2400 ctacactgct tcaaaatcat agaacatgaa gagttagtaa aaattataaa tagctctaaa 2460 ccagctacgt gtatgctgga ctcaattcca acaaaattac tgaaagagct gctacctgct 2520 ataggagaac ctcttcttaa cattatcaac tcttctttat ctataggcca tgttccaaac 2580 tcttacaagc tagctgttat taagcctatt attaagaaac cgcaactaga caccaacaac 2640 ttagctaact ataggcctat ttcaaatctt ccatttatgt ctaaaatact agaaaaagtt 2700 gtttccactc aattatgctc ttttctgcag acgaacaata tttttgaagt gtttcagtca 2760 ggtttcaggg ctcaccacag tacagaaacc gccttagtga aaataaccaa cgatttactc 2820 ttagctgctg accgagggtg cgtctcgcta ttagttttac tcgatcttag tgcggcattt 2880 gataccattg accacaatat cctcataaat cgcttaaagt ctacaggtgt ccagggacag 2940 gctctacaat ggtttaagtc atacttaact gaccgctacc aatttgtgaa tcttaatgga 3000 cagccttcac aaatctgccc agtaaagtat ggggtgcctc aaggatcagt tttaggccct 3060 ttactgttta caatttatat gctacctctg ggagacatta ttagaagaca tgggatcagc 3120 tttcactgct atgcagatga tactcaatta tatatttcaa ctaaacctga cgagacgtct 3180 gaactttcta aactaactga gtgtttcaaa gacatcaaag actggatgac cgacaatttt 3240 cttctcttaa actcagacaa aacagaaatg ttacttattg ggcctaaatc ttgcacacag 3300 cagatctcgc aactcaattt acaattagaa ggatacaaag 
ttagctttag ctctactata 3360 aaagatttgg gtgtcatatt tgacagcaat ctaactttta aaaaccatat atcccatgtc 3420 actaaaactg ccttctttca tctaagaaat atcgctaaat tacgaaatat gctatccatc 3480 tcagatgcag aaaagctagt ccatgctttt atgacttcga gactggatta ctgtaatgct 3540 ctatttgctg gctgcccagc atcctctatt aacaaacttc aattagtaca aaatgcagca 3600 gccagagttc tgaccaggtc tagaaaatat gatcatataa ccccaatttt atcctcctta 3660 cactggctgc ctgttaagtt tcttattgaa cttaaaatat tacttctcac ccataaagct 3720 ctaaataatc tagctcctgt ttatctaacc aaccttctgt ctcgctacaa accaactcgc 3780 tctttaagat ctcaaaattc agggcttctg gtagtaccta gaatagcaaa atcaagtaaa 3840 ggaggtcgag ccttctcttt catggctcct acactctgga atagccttcc tgataacgtc 3900 cgaggctcag acacactctc ccagttcaaa actagattaa agacctatct gtttagtaaa 3960 gcatactctc aatgcatcac ctag 3984 // ID TDR10 repbase; DNA; ZEB; 655 BP. XX AC . XX DT 04-MAR-2002 (Rel. 4, Created) DT 04-MAR-2002 (Rel. 4, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; TDR10. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-655 RA Jurka J. and Drazkiewicz A.; RT "TDR10: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 20-20 (2002). XX DR [1] (Consensus) XX CC TA-target site duplication. 
XX SQ Sequence 655 BP; 164 A; 175 C; 149 G; 155 T; 12 other; taagggcgta ctcacactak gyacagttgc cttgaaccgk gccgaagcac gcttgtcccc 60 cctcccctct cccccgacgg cctgcactca cattgcattc gagcccgagc acgcttacgt 120 catcgatgat gcgctgttca gtttaanaga agagaagcgc tctcgctcag cacagtggag 180 attgctttag ttatattgtt ttagtcgttt gatatgcagt gacacgcagt caaatatttt 240 gctgaacaga tcaaccactt ttgacgctca taaataatca taaaagtcct cgtgctgcag 300 gwattaggag gtttgctgaa ggtgcagctg tcatgcagtg aggggtttgc gtctttaata 360 awctacgaca gtttgcgttc attgaamagt aagaatgatt aataaatcca tatgaaacag 420 tcccttaaaa gtsacgtckc gtcttcagtt tcgggctcag gcgcgctttg cactcacact 480 acaagcgtac cgcgccaaag cccaagtgaa ccrcgctctg gcacacctct tccaaccggg 540 ccagggccgg ccaastgaac catgcctgag cccaattcag agcactcaca cttctcaaac 600 gaaccgggaa acgggcctgg gcacngttcg gatagcatag tgtgagtacg cccta 655 // ID HARBINGER2_DR repbase; DNA; ZEB; 3727 BP. XX AC Contig:ctg25784.2; XX DT 08-OCT-2003 (Rel. 8.09, Created) DT 08-OCT-2003 (Rel. 8.09, Last updated, Version 1) XX DE Autonomous Harbinger-like DNA transposon. XX KW Harbinger; DNA transposon; Transposable Element; KW DNA-binding protein; HARBINGER2N_DR; HARBINGER2_DR; KW Harbinger superfamily; TDR; transposase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3727 RA Kapitonov V.V. and Jurka J.; RT "HARBINGER2_DR, an autonomous Harbinger-like DNA transposon from RT zebrafish."; RL Repbase Reports 3(9), 180-180 (2003). XX DR Genbank; Contig:ctg25784.2; Positions 85260 81301. XX CC HARBINGER2_DR is an autonomous Harbinger-like DNA transposon. CC It is characterized by 13-bp TIRs and the CAG target site CC duplication. The zebrafish genome harbors ~100 HARBINGER2N_DR CC nonautonomous elements derived from HARBINGER2_DR. The CC HARBINGER2N_DR consensus sequence is nearly identical to CC HARBINGER2_DR (only 4 indels, it matches positions 1-603 and CC 3311-3727). 
CC HARBINGER2_DR encodes two proteins, the 276-aa HARBINGER2_DR-1p CC (2 exons, positions 642-1437 and 1558-1592) and the 368-aa CC HARBINGER2_DR-2p transposase (positions 2140-3246). CC HARBINGER2_DR-1p is similar to Myb-like proteins present CC in different eukaryotes. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="HARBNGER2_DR-1p" FT /translation="MAETKKQRKVIFQKEEINIILEEVELQKHIIFSRFKG FT SHTNKEKQKMWDDIATKLTATRGIKRSGNEVRKKWQDFSSLAKRKRALQRT FT TINKTGGGPNDAPILTAEEEKALSILGTTASDGICGGIDLHGGEGLRQPEP FT ESGPPCSQEQSGSPPIDRQLPSPSPPNSPTDQPPSSIVQATATTPRFENIC FT GCSQDLVQLEREKLDVLKDIRQSLKEANEMNYNFQREIIELKKAKMALEER FT RLSLEEMSFARPSISVPIILPDESDPGEQTTNLNQ" FT CDS 0..0 FT /product="HARBNGER2_DR-2p" FT /translation="MAALQRVFQLRVRQRERQRQRQRRPRSTLCTINAFIR FT QHQNPLDMLDDMAVIHRYRLPRGEIVQLLNVIGPQLMRATRRNFALSPDVQ FT LLAAVRYYATGSFLQVLGDGLGLSKPSVSRAVQAVTYALLPLAAEHIKFPA FT SRQAMSDIQEYFLTHYHIPQVIGVIDGTLIPISTPSVDGHTYICRKGYPAI FT NCQVICDHNCLITDIVARWPGSTHDSYIFTNSSVGQEAQNSNGHWRLLGDS FT GYPLRPYLFTPVANPVSNSEAHFNEAHRVARSTVERTLGRWKLRFRAIHKS FT SGGLLFVPQKCCAVITVTAMLHNIAVRARVPLDIREEDEEVEEENEVEMRI FT HDDQPRHVQYMAGFGARQQVIDTFF" XX SQ Sequence 3727 BP; 1202 A; 697 C; 734 G; 1094 T; 0 other; ggctgcgttt cccgataacg ttgatcttag cacttaagag cgttttctac gagtcatttt 60 gcgaacgttc gttattgttt cacgtgcgtt tcccaaaaat gcacttaaca caattgcacg 120 tagcccagct ttaagtgcaa cttaggagtc gctatccgtt tgttaagtgc tgaaatgtca 180 cgctatagaa tggctcgtta ttgttgcaca tgttatagca atccatataa ttcttcttct 240 acttgtgtga atgtatattc aactcgaata acataaaaaa aaaatatttt tgagccagtt 300 taaaaacata aattaactgg aaatgtcgta ctgtaaaaac actgtcttgc tccaatctcg 360 cataaaacta attctaacag tccttgccgg aaatgacatc agcatcattt tcgatattta 420 tatgaaacct taattaagta gaattattta tcaatttctt tatcaaaaat tgcttatctc 480 acatcaaaat aaataaataa gtaaataaat aaataaataa ataaataagc tcttacattt 540 atttttgaaa agtttagcct aattatcccc acctgatgtg catttgctaa taggacaaaa 600 atataaataa gcacatacct aggggtggag acactagcaa aatggctgaa accaaaaaac 660 aaagaaaagt catttttcaa aaagaggaaa ttaatattat 
tttagaggag gtggagctac 720 aaaaacatat aattttcagc aggtttaagg gcagtcacac aaataaagaa aaacaaaaaa 780 tgtgggacga tattgctaca aaacttactg caacaagggg gattaaaaga tcagggaatg 840 aggtcaggaa gaagtggcag gacttttcaa gcctagctaa aagaaaaagg gcactgcaga 900 ggacaacaat taataaaaca ggtggtgggc ctaatgacgc ccccattctt acagcagaag 960 aagaaaaggc actgtcaatt cttggaacaa ctgcctcaga tgggatttgt ggtgggattg 1020 acctccatgg aggagaaggg ttgcgtcaac ctgaacctga gtcaggtcca ccatgttcac 1080 aagagcaatc aggttcacct cctattgaca gacagctacc atctcccagt cctcccaaca 1140 gtccaacaga ccagcctcca tcttcaattg ttcaggctac agccacaaca ccgcgatttg 1200 agaatatctg tggttgtagt caggacttgg tacagctgga gcgagagaaa ttagatgttc 1260 taaaagatat cagacaatct cttaaagagg ccaatgagat gaattacaac tttcagaggg 1320 aaatcattga acttaagaag gccaagatgg ctttggaaga gaggagactt tcactggagg 1380 agatgagttt tgcaaggccc tccatttcag tgccaattat cttgccagat gagtcaggta 1440 agattgttca atgttgttta atcatgaata attatatgtt ttaataaaat atgttgtgca 1500 acatgaaata ttcttttcca tatgtacact catcacattt tgccatgtct tttgcagatc 1560 ctggggaaca aactacaaac ttgaatcaat aaaaattgtt taaatgcaat gttttgatgt 1620 ttgttaatcc actcaaagtc taaggctcaa tgaaatacta taaagcagaa gaattcttat 1680 tatattatca tatcataagt gtttcaaatt taaatatata cttgtagtct ctttaaaaaa 1740 agtaataagc atgcaagtcc tatatatttt actacagctc gagtcttgat gtgtgtggca 1800 atatagtatt agggcctatc tattagccta tccatatagg tgcagaaaaa ttcttaatgt 1860 gcacattctt tattagcact gttttatata aatgtaaatc tttgaccatt atgaacctaa 1920 caagagaata aattcaggga agaggtaatt aatgcaattt ctaaaatcaa tgattcagtt 1980 tcttcaagtg gggcctactg caattagtga aatagatgac aatcagtgtg tgacatgtga 2040 atattatcac agcctgaaaa actttaatca tatggttcct gtgagcaatt ttaggatagt 2100 atataaagtt cacacttcag tagtcttgca gagtaaatca tggctgcact acagcgagtt 2160 ttccagctga gagtgaggca aagagaaagg caaagacaaa gacagagacg accacggagt 2220 acactatgta ctataaatgc cttcatcagg cagcatcaaa atcctctgga tatgcttgat 2280 gatatggctg tcattcacag gtaccgtctg ccacgaggag aaatagtcca gctgctcaat 2340 gtcattgggc ctcagctgat gcgtgctaca agaaggaatt ttgccttgtc 
ccctgatgtg 2400 caactcttag ctgctgtgag atattatgca acaggcagtt ttcttcaggt acttggagat 2460 ggacttggac taagtaaacc atctgtgtcc agagctgtac aagcagtcac ctatgcactg 2520 cttccacttg cagctgaaca catcaaattt ccagcatcaa gacaggccat gtcagacatt 2580 caggagtatt ttctaacaca ttaccatata ccacaagtca ttggagtcat tgatggtact 2640 ttaattccca tcagtacgcc ttctgtggat ggccatacat atatatgccg caaaggttat 2700 ccagcaatca actgccaagt gatctgtgac cataactgtt taatcacaga cattgttgca 2760 aggtggcctg ggagcacaca tgactcctat atctttacca actcatctgt gggtcaagag 2820 gcccaaaact caaacggaca ttggaggctg cttggtgaca gcggatatcc attgaggcca 2880 tatctgttca cccctgttgc taatcctgtc agtaacagtg aggcacattt caacgaggct 2940 caccgtgttg cccgaagtac tgtggagcgc actttaggaa ggtggaagct acgctttcgg 3000 gctattcaca agtccagtgg tggtcttctc tttgtgccac aaaagtgctg tgctgtaata 3060 acagtaacag ctatgttgca caacattgca gtgagggcaa gagtgccctt ggacatcagg 3120 gaggaagatg aagaggtgga ggaagagaat gaagttgaga tgagaattca tgatgatcag 3180 ccaagacatg ttcaatacat ggctgggttt ggggcacgcc agcaagtcat tgacacattt 3240 ttttgagttc ctcctttatt tgtgtcatct ttaaatcaca ttaaatattt tctgttaaat 3300 acaacatatg tagcctacta ccctcgaaat gtccatatac aacagtgtaa tttttttttt 3360 cattctaaca tttattatgt aatttttatt tttttcttta ttcattatta tttgtatatt 3420 gaattgtgta atgtgtagaa aatataaatg aatagagata tacgtacaaa taaaatgaat 3480 aatgactttg aagagaagta aattgtaggt gcaatttgcc ggttgtccag caggtgccct 3540 cataactctg tctccttacg atgcacttaa ggctttacga ttactccaga gcactcgtag 3600 tccactaaga ttttcaagtg ctacttaagt tacgatgctt ttgggaaaca gaccgtaata 3660 ttaagatcag tcgtacgatc atttctacga acttcttagg cttacgatgc ttttgggaaa 3720 cgcagcc 3727 // ID LOOPERN3_DR repbase; DNA; ZEB; 1011 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 27-SEP-2008 (Rel. 7.05, Last updated, Version 2) XX DE LOOPERN3_DR is a nonautonomous DNA transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; LOOPERN3_DR. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1011 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN3_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(5), 27-27 (2002). XX DR [1] (Consensus) XX CC About 1000 copies of LOOPERN3_DR are expected to populate the CC zebrafish genome. LOOPERN3_DR copies are ~10% divergent from the CC consensus sequence. This element is characterized by 10-bp CC terminal inverted repeats and putative TTAA targets site CC duplications (less likely, TTTAA). Its classification is not very CC certain yet, although it is expected to be a member of the CC piggyBac/Looper superfamily. XX SQ Sequence 1011 BP; 290 A; 190 C; 218 G; 286 T; 27 other; agggcaccta tggtraaaaa tctacttttc aagctgtttg gacagacmtg tgtgtaggta 60 tagtgtatag accgtcatat tggggtgata taaacacacc cagtcctttt tttttcaatt 120 taactacata aaaacggtsg accaattgga gcggttttca gatcgaccgc aactttacgt 180 aggagtgcgg tccccccgcc caccgaattg attgacagct gcgcgtaaca tgttccggta 240 gtcatgtgta tatgtcaaca agaccagacg tgcgcaaagc aaccgggaat aaaaggtctg 300 ttcagttcgc taggatcatc aatcatcatc aaatgtgaty aagagtaagt ttcacatgtt 360 taaaatgttt taaaacagtg catgtgtgta atkaattaca gcgatttact tcagctttac 420 ttcatcagca cagccgcgtg tcagaacaat tataaaagaa gacgcttcaa tcccggtttg 480 tggacgttaa atcaggttta ttttgtacat taacataaca gatatccaca cagyastkga 540 grttagccta tcctgacaca tttgcgtgca aaaacagtgc taagctaagc gcgctctgtc 600 tgtctgcctc tgtgtgtgtg tgtgtctctg tgtgtgcgtg tgtgtgtgtg tgtgtgngtk 660 aactttgtaa cgatattgtg tgtgactcat caatgcaact kcacaatact sattrgtaaa 720 gttcttactg tagtatctca caaacgctac gtgagacctt cttcctttaa gtctgtctgt 780 tgtctgacgc agcygaggga ggaggcatgt agaaatagta ggcgggrarg actcgyctta 840 aaggcgcagt acgacaaaac maccccctgs tgraaaamwg tataaaacag satctwgtaa 900 aaggtataat gaaaaatctg atgggtrttt tgakctgaaa ctttatatac acattctaga 960 gacgcaaaag acttatatta aatctgaaaa 
aaggggtaac ctaggtgccc t 1011 // ID GYPSYDR1 repbase; DNA; ZEB; 4463 BP. XX AC AL591405; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Gypsy Danio rerio 1. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW retrotransposon; GYPSY/TY-3; GYPSYDR1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4463 RA Jekosch K.; RT "GYPSYDR1: gypsy-like element from D. rerio."; RL Repbase Reports 2(2), 10-10 (2002). XX DR [1] (Consensus) XX CC Putative novel retrotransposon similar to GYPSY/TY-3 CC retrotransposons CC with perfect 294 bp LTR (GYPSYDR1_LTR) and an open reading frame CC (pos 267-4463, the last two bp of the stop codon belong to the CC LTR). XX SQ Sequence 4463 BP; 1150 A; 1316 C; 1151 G; 846 T; 0 other; gagcaccctt tcaccccaat atcacttgta aataaaggca ccctccgggg cattgtttgt 60 aacattatct acggtgtttg tgttttccct ctgcctcgct acaactggtg gagaatgcga 120 gcattgcaga gggaaaaatt cagaagaaac actcaccatt aagtagaaaa taaccgctga 180 atttttctct gtgtttgtta cagacaggca tccgctctct ctctctccca cgcttcctct 240 tactcggctg tcaacatccc gtcgccatgt cggtagaaga ggtactgctc cacctagtgg 300 agatctccag gaaacagaat tccatctctg agcagcttac agccagacag gatcgactgg 360 aacagcagct ccgccaggcg gccagacacc atccgactcc cgaagtgagt gcgcatcatc 420 atctcactaa actcagcgac ctggatgata ttgacgctta tttacatacc tttgaggtta 480 ttgccgagag agaaggctgg ccaaaagaaa actgggcgag aatgttggct ccgtttctca 540 caggagaagc gcaacgagca tatttttcac tagagacacc taaaaatgaa gattacaaag 600 cgttaaaaaa ggaaatatta gccagaatgg ggctatccac aatcagcgca gcccaacagt 660 tttcccagtg gtcttacgac gagaaacaac cagtgagaac ccaagcagct caactttctc 720 gcctgggaag gctatggtta ttgggaggag atccctcggc ggtccaggtc gctgagaagg 780 tgatcatcga gaagatgatg cgtgcgttac cccgacgttt gcgaacactc accagcatgc 840 gaaatcctga gtcactggcc accctggtgg aggcgatcga gctggctgaa gctcacatcg 900 cccgagataa tggggagaga 
gcggctctgc caccccggag ggtaagtgca ccttggcgac 960 cggtggaggg cacagcgcga ccaggcggca gaccagcggt ccccagcccg atggacgagc 1020 cgatgcccac cgagccgacg acgcactcga ccccggcctg gacagcaggg tgcgcggttc 1080 accgcaatat ccctcccgaa gctcccaccc ataaagtcca gctagaggga aaaacacaaa 1140 cggccacctt agacacagga agcgccatca ctctggttca cccgaaaact ttaaaatacc 1200 atcatgaaag caaagggcga attccgatca cgtgtgtgca tggtgatacc cgccacgtac 1260 ccgcccgaag agtaaccatc gcggcgaaac caggtagctg gcgaatcgaa gtcggggttg 1320 ttccagacct tcctgtgtcc ctcttactgg gcagagactg gccggggttc gacgaactcc 1380 taactcacca ccacgctcca tcggctcgtt caaagaagaa gaacaagcca cgggctcatc 1440 gggaccgcaa accagcgctg atgaccaccg agagcgacag agggggtgag tcatctatat 1500 ctgctaacct atactttgat ctgtttcaac agataaccgc aggaggtgac tttggcagag 1560 cacagaggga agatgaaacg ctcaaacact gttggccaca agtacggatc atcgacggaa 1620 atgagcgact tcccagccct caccccctcc cacatttcat tgtggaaaat ggtctgctgt 1680 actgtgtcgc agagaggcgg ggggaaaaga agacactact ggtcgttccg aggaccaaga 1740 gggagacggt cttggaactg gcacataccc acccgatggc tggacatctg ggagcggcca 1800 acacggtgaa aaggatccgc gatcgtttcc attggccggg gctagacggg gaagtaaaga 1860 ggtattgcca ggcatgtgac atctgccaga gaacgtctcc ccaacgacca ccccccagcc 1920 ctctaatacc gctacccatc atcgatgtgc ccttcacccg aattggtatg gacttggtag 1980 ggcctttgcc gaagtcggcc cggggacatg agcacatcct tgttatcctc gattatgcca 2040 ccagataccc tgaagcgatt cccctgagga aagccacgtc atcggccatc gcgaaggagc 2100 tgtttctgct atgcagtcga gtgggaatcc caacggagat actgaccgac cagggcaccc 2160 ccttcatgtc ccggttgatg gcagacctct gtcacctact caaggtaaaa cagctaaaaa 2220 cctctgtata tcatccacag acggacggcc ttgtcgagcg ctttaacaag actctgaagc 2280 agatgctccg acgggtggtg gcagaggacg ggcgcgactg ggacctcatg atcccgtacg 2340 tgttattcgg tatcagggaa gtcccccagg cctccacagg atttaccccc ttcgaactgc 2400 tgtttggccg ccaaccccga ggcctattgg atgtggctcg tcaagcctgg gaacaggaac 2460 cagcccccca gcggtcggtg attgaacacg tacgggacat gagagaacgc atcgacaaaa 2520 tcatgcccat cgtcaaacaa cacctgaccg aagcccagcg cgcccagcag agattgtata 2580 accggcccgc ccaacccaga gagttccacc 
caggggacaa ggtgatgata ctcataccta 2640 ccacaacgtc gaagttcctc gcatcctgga agggaccata caccgtggta gaaagggtag 2700 ggccggtaaa ttatcgagtc cgtcagccgg gacgaagacg ggaagaacaa ctttaccacg 2760 ttaatctcat gaagaagtgg gttgcagctc caggtcatct cgttaccttc gctgaagaaa 2820 ctcttcccgt tgtccacatt ggtgagcaac tctcaccaaa ccagaaggcg gagctgcaag 2880 ccttggttgg tcagttcaag gatgtgttct cggagaaacc gggccgaacc tccatcatcc 2940 aacacaacat tatcactcct cctggcacta tcgtccggca aaggccttat cgagttccag 3000 aggctcgcag gctggctatc gacgaggaga tccagaagat gagaaagtta ggcatcatcg 3060 aaccatcccg tagcccgtgg tccagcccaa tagtgatggt ccccaaaccc gatggcaccc 3120 tccgtttctg caacgacttc aggaaactca atgagatctc caagttcgac ggatacccca 3180 tgcctcgggt ggacgagctg ctggataggc tgggtggagc ccgatttatc tccaccatcg 3240 acctcaccaa aggctactgg cagttaccac tcagtgaaga cgccaaggag aaaaccgcct 3300 tctccacacc cggtgggcac tggcaatacc gggttcttcc cttcgggctc cacggggccc 3360 cagccacatt ccaaagaatg atggacatcc tgctgaggcc ccaccagcca tatgcagcag 3420 cctacctgga cgacctcatc gtccactcag agtcatggga agaacaccta tcccggttac 3480 ggagggtgct cttagatctt cgacgggctg ggctcacagc taatcccaag aaatgccacc 3540 tgggtctagc cgaagccaga tacctcgggt tccacattgg acgaggtctc atacagccac 3600 agcaaaacaa ggtcaaggca ctacaggaaa ctccacaacc caccacaaag acccaggtac 3660 gtgcatttct ggggttagcg ggctactata gatgtttcat acctaacttc tcatccatag 3720 ccagcccttt gacagacctg accagaaagg ggcagccgga gaggataaga tggaccaagg 3780 aagccgacga tgcgttccga gccctaaaga agtccctcac gtcctcaccg gtactgcacg 3840 cacctgactt cggctgcccc ttcattctac agacggatgc ttccgactcg ggcctgggcg 3900 cggtcctctc ccaggtccac ggcgatgaag aacatcccat aatgtacgtg agtcggaagc 3960 tgacccccgc agagacccgc tacgcaacgg tggagaagga ggccctggcg atcaagtggg 4020 caatcctgga gctaaggtac tatctccttg gcaggaaatt cactctggtg accgaccacg 4080 ccccgctaca atggatggcg acagcgaaga acaacaacgc tcgggtcacc aggtggttcc 4140 tgtcgctcca ggatttcaac ttcgacgtac aacatcgagc cggggcctcc cacggaaacg 4200 cagacggact ctcacggctc tggtcaggat gggcaggtct gtcgaaacat tctacccccc 4260 ctctcaatac actgctcttc cttcgcagga cacccaggac 
caggacgacg ctaagggggg 4320 gggaatgtag cgagccatgt aataccacgt cgccctgggc acaaatccac accagctgga 4380 tctctcatca acctcggatc gctcacagct ggacctcatc agccagggag agatataagc 4440 cagccccaca cagacggaag tga 4463 // ID BHIKHARI_I repbase; DNA; ZEB; 1851 BP. XX AC AJ011117; XX DT 19-JUN-2000 (Rel. 5.05, Created) DT 19-JUN-2000 (Rel. 5.05, Last updated, Version 1) XX DE Internal portion of retrotransposon bhikhari from Danio rerio. XX KW BHIKHARI_I. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Vogel M.A. and Gerster T.; RT "Promoter activity of the zebrafish bhikhari retroelement RT requires an intact activin signaling pathway."; RL Mech. Dev 85(1-2), 133-146 (1999). XX RN [2] RA Vogel M.A.; RT "BHIKHARI_I."; RL Direct Submission to Genbank (29-SEP-1998)Vogel A.M., Abteilung RL Zellbiologie, Biozentrum der Universitaet, Klingelbergstr. 70, RL Basel, 4056, SWITZERLAND. XX DR GenBank; AJ011117; Positions 4453 6303. 
XX SQ Sequence 1851 BP; 617 A; 491 C; 413 G; 330 T; 0 other; atggctcact ccaaagatcc tgttggccac tggaaggacc tggaaacgtg gctgagcgtt 60 gtaacaggca gtctcttccc taaagctgct gaaacactgc agcccctgac gcaaaaccaa 120 ttggatgaga acatagacag catcatgaag caagacccaa gtcaaagctt caaccacaag 180 gagctggcca aaatcactgg tactttgagt cacacactca tagccaccct caaattgagt 240 gacagacacg cctcccaact ccaacacaag ctgacacgcc tgcaagcccg catcgagcag 300 ctagagctag aggctcagga acgtctggaa caaccaagtg aggtgaatga aggtaccaca 360 gaggagatcg acaaactaca agaagcttta acagcaatca cagaagaaag agaacaagcc 420 agagcagacc acgctgacgt cgctaacaag ctagattatg ctgaacagct actgaaggaa 480 gcgaaggtgg acttaagaga caagaaggcc agaatcaaag cccttgaaac tcacctgagc 540 gaagcaagac atgagatcga cagactaatg caggaagtgg atgacatcaa agaggagtcc 600 gccagtgaac tcaggcatgc ctatgcactg cgctatgaac ctccaaagac aagatgtgca 660 ccagcctcgc ccctgccaag caggacagga tcccctgtcc ctgagctctc acctattgaa 720 agaggtgaga aaccatgcca aagatcttcg ccaacacctt ctgaagagcc ttaccttacc 780 ccacagcgac gagagcctgt gatagccagt cacagatcgt catacagtct ggaccttaaa 840 gaccttgaca agctggccag aaacattggc aagtttactc caagtgtgtc aggtggtttg 900 gaggtccacg cttatttgca agacattgat ttccatctgg aaatgagacc caatgtctct 960 gataaagaaa gactgtattt gcttcgagcc acatccagca ctgaggtgcg cagcttcctg 1020 gaccgacaac cagctcgggt aaagaacgat taccgcttgc tccaagaagc cctcatcaga 1080 gagtttgcaa accctgaatc agatcaagga cttttaggtg ctctggagac aaaacaagct 1140 cgcaatgagt ccccacatgc ttactacaac cgactcaggc aagccttttt cggaactcgc 1200 aacgaaccga atatggaaga agacctgaac tttaagattc tcttcctgag aaacctccat 1260 cctgcagtaa gccaacatct cggagtactc gcatgcccac gaacaatgag catccagcaa 1320 ttgcgagatt tgacacagaa agcccacgac aaacaaaaga tggtgttaga gaaaaacact 1380 aaaactgcca cagtttttga ctttaacacc catccagaac tggcactaga gggtgcccaa 1440 cgctcaaaca gcgtgagacc accattccca acatggaatg cgtcttcgtc caatagacaa 1500 cggaactcct acgatgacac aagacctaaa caaaggaaca gtcactggca tggaccgcat 1560 ggacaacaac gctcgcctga acaccaccgg gaaagaaacc aatgggggtc aaacaaaagc 1620 tggtcacctt ctaagggaag acatcaaaac 
cctggatcat caagtccaag gagtcaacga 1680 aggtactcca aaaacttcca ccctgacaac gctcagactc aatcccagca agaggaaaat 1740 gccccactgg gatttaaccc tcaagaactg gtgaaattga tgatgaaaga gtttctcaaa 1800 tgcatagaag aggacaggaa acgggaaaag gaaaaagcag attcagcctg a 1851 // ID DIRS-7_DR repbase; DNA; ZEB; 5393 BP. XX AC . XX DT 02-DEC-2008 (Rel. 13.12, Created) DT 02-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; DIRS-7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5393 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(12), 2160-2160 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX SQ Sequence 5393 BP; 1483 A; 1567 C; 891 G; 1452 T; 0 other; gtgaagtttt taaactaatt tcgagaggag cacgtgatat aattgacagc agctggccac 60 ctatctacac tcattagtta gccaatcaga tctatcctaa cccactataa gtagcctagc 120 tagacattac tcccttatct tcgttttccg aagaaacaaa cgacaggtcc gctctagctc 180 ctccaaaagt tactcacgga caaaacggat cctccacatc tcctactatt cttcaactac 240 aataccacac cacctttaat accatcgtca gcaactacaa aaaatgacaa aaacaacaac 300 aacagcgacg gcggctgaga tcacaacagc atcagcgcag ctacatacaa gaaagaaagc 360 tccgggctcc ctgaaacacc agccccgtct taacaacaac cagcccagca tgaaagtcag 420 ctcgagaccg aggctccaac gagaggccgt aggcctatcc actccagcca gatcacaatg 480 caatccccag ctccgctacg taccttctcc agcatcatca tgtgtcactc atatccaaca 540 acgactgtca ctgaactcct ccaaactcct agacgccggg accataattc aaagccgctg 600 cattaaaccc aaaacacttg cgaatccacg tcttcaccgc cacccccagg acataacacg 660 ctcctgccac accccctatc ctaaacagac attaaacccg caatctctcc gcctccccgg 720 agaatcaagg aaaggattaa aaagcaatat tggatttgga catattgcag caatcataat 780 tctaaacacg gataaaacct atctcccgga caataaagca actcttggga atcccacctc 840 acctcttcca tcctgcagct cccgattcca gtgcacactc cagtcctcct ctttcagcga 900 tccttaccac ctacgcaccc cgcccctcta ccttgcgtat cattgctttt tccccctctc 960 atttttacaa gctcgcagct actgctgatt attttctatt ataaaataga aattacttgt 1020 cgacttttgt caatatgttt atgcttttac tgaattacgt ttgtaactga cctaattttc 1080 tgaatttctc attccaggtt tcgtctactg aagcacagcc gatctcggcg ctcccttccc 1140 acaacctcac tagccgtcca ccaacactac accagaaatc acagagcaat tccctgttaa 1200 gcacatccat accagcccta ctgaaatgga cattagaacg tttctaaaaa cgtctaataa 1260 cggctctttc atctcctcat aattctgcct ttttcagctt taacagcatg attctacctg 1320 atgaatatgc ttaaattatc acgttataaa ccaagcaatc tctctaatct agacggctgc 1380 agcgctcggt tgcaatcaac attacacccg ccacccgatt tctggcaatc ggaggtcaca 1440 attacatttc gtagtccgtt taacattcgg atgcaaaagt agcccacaca tttatgctta 1500 aaaacgatcc gctggatttc gccaataact acggtgtttg cacgtaattc acctacaagt 1560 tcccgtttct ctctccctaa tatcccccag ctaaactagt gaccggtttt ctaatatttg 1620 cattctcatt gcggaagaac ctccggctta 
agcacttcta tagaattctt aagcatcaat 1680 ttggagtgat agaatttcaa gcattaataa gctacttgaa taaatcgaca atcagtcccc 1740 ttttatattc tggaagaaaa atatcacgct aagcataaac tgctctccat ctagaacatc 1800 aaacacacat gcgcattatc ccgcagggat gccttttcgt cactcaccta tttcatctcg 1860 cagcagcaat ctccggaata gaagagaaat gatctaaatc cgatcttggc cgtaacaaac 1920 tctgcctgaa gggcgtcgcg caccagaggc gccgacggcg cggtgacgcg acgcacttac 1980 gacagagaat agtagattaa tgcacaccag actcaagcat tcatacttta tcaaataaac 2040 taatcaatta tacaaatagc gctccgcggc gcggcagaat acgaagtgtc atgaattgtg 2100 gctgggcact acggacagcc gcattgactt ccttattatt tatttccttc ctgtgaaagt 2160 caatggttac aggttttcag ctttcttcaa aatgttgtct ttagcgttta acacgggagt 2220 cactaacctc tgaaactaag agctactttc gggcaccgat taacgtggag ggctaccagt 2280 tggataaaca cttctcaaat aacaaattgc tcaatttact tataattata atttaagttt 2340 ttggttattt tttaataatg aataatattc atccaggtgt agtcactgat catgttatga 2400 ttctctcaaa gttaccaaca atgatttaac agggtaggaa ataccccgta attaatatat 2460 caacgtgcaa cactatttta tttatctttg caaatattta catttttaaa tgattacttc 2520 tatattagag aaagcttact tgtaagtggt gttatggtga gctattttag aacaggcctg 2580 tgagcaacac acgagagcct cgtgagctat ctggtgccca tcggtaccat atttggtgac 2640 tcctggttta acagaaatga aattcataaa ggttcgtaac catcaggtag cgcttaaagc 2700 atgcgacctg cttatgcttc cttccttaag cgctggaata gaagccattc atttattagc 2760 tatccaattt catcctcagt atatcctttt tacacagatg ctgccccctt cgtggggtat 2820 attatcgggg ccgccaatgc gcttctaacg ggcaatatag tcaatacatt ctgaaaaaaa 2880 aaaaaaaaaa aaaaaaaaat aaataaataa aaaaaaaata ctgctgtaca tgttcctagt 2940 tgcataggaa atcgctgact ctcttttgct ttattttcag aaactggtat cagaagcaga 3000 gccgcagctg actccatctc tccttttaag atgataatcc cataaccaca gcatacacaa 3060 tttgcgtcag cattcattta tacattttaa gttaccccag aacacctcat cagtttctca 3120 ctgctcaaca catccacaca ttccgttcaa taacattttt ggtttcataa caccttaaag 3180 tcataggacc attcaggctt cccaccagat tcataatcca gtcactcatt cagaattgga 3240 gccaccacca cagcagcaca caagggctat cgtaacaaca catacaaaca ctaggaaagt 3300 agtctcccga cgccttcaaa tcttatatct gactcagcca 
cagccatctc agggaagctc 3360 agaggaccct caccagcaga cgcagtaatc ccagcggcca agggcatggg cctagtccag 3420 ggagagaata cgacccaacc ataccagctt cccgagggca gaggaactac ccagtctccc 3480 aagggcgcgg gcatgaccca gccatctagc ttcctttctt ccttccagct gatttgcact 3540 cagccttctc ccccttttca ctcacctaac tccagtagga gttctctccc tgcccaggcc 3600 cccttgtcac ccccgccccc ccccggccgc tgccacagaa gttttcactg ccccccaact 3660 tttctaactt ctgtaggacc ccgccccccc catggctctg gcccccgcag gggcgttacc 3720 ccgagcttcg attctcgcag gaatcatctc actgccctgg ccttagcttt ttatattatg 3780 tcattccata atgacatata gtatagcact atttattttt ctctctgttt gattttattt 3840 atataaattt atatctatat gcacccacgc atataaatat gaatttatat atagtgctgt 3900 caccctcacg ctctgttccc gcaggaacac ccccagagca cctataccgc ccgtcaccct 3960 ccagtaagaa gtcttcacct ccccctcatc tttcccgact cctactggag cagctaaata 4020 gctcacccaa ccccgagctg tgacacgagt catgtcaccg accctccccg gccctagctt 4080 ttatttatgt ttatttttgt ttatttctca cacttatctt ttatttttta aatttattta 4140 tatatatgta tatatatatg cacccacgca tataaatata tatttatata tagtgctgtc 4200 accctcaagc tctaactccc gcggagttaa tcccgagcac cggacccccg caggggtcat 4260 cgcccaactg ccatttcccc cttcagctgg ggcttcaccg accattcctt ttcccgactc 4320 cagctggaga tggcacatac agctctctct cccgcaggag agccaagagc ttcgactctc 4380 gcaagagtca gtaaaaacgc ccccccaagg ccctagatta ccccattata tatttatata 4440 tctatatgtt tcatataaat atataattat atatagtgct gccacctccc agctcaatct 4500 ccgcaaggag tgttcctcga gcaaattact cctttggagt ccccgccccc cctgcccccc 4560 ttcacccccc tctccagccg gagtccttca ctccccttcc cttgtaatga ctccagcagg 4620 attcccgccc accccatggc tctgaccccc gcaggggtct ccccgagtct ctactccagc 4680 aggagtattc acagcccaag ccaactctgc tcgggttccc gcaggaacct gtttcaccct 4740 ttgctccaaa ggagccctct ttaccttttc tttttcaaat aactatatcc agcagccgga 4800 tatagcattt caagcctttt ggggagtttc ttcgaataca cggctgctgt cccgagcttc 4860 atgcatttgg ggagctctcg agaaccacct gatctcgtac tcccctcaca tgctctatgg 4920 acctggcggg agccctgggc tcaactatct ccgagctcag ggttctctcc cgggacagca 4980 tgccaaacct gcttacagct gtcaagcaat atctaagtgt gaactcttga 
agtgaagttt 5040 ttaaactaat ttcgagagga gcacgtgata taattgacag cagctggcca cctatctaca 5100 ctcattagtt agccaatcag atctatccta acccactata agtagcctag ctagacatta 5160 ctcccttatc ttcgttttcc gaagaaaccc cccatccacc cctttctcct ccttttctcc 5220 tttacaaaaa ggggagctct cgagaaccac ctgatctcgt actcccctca catgctctat 5280 ggacctggcg ggagccctgg gctcaactat ctccgagctc agggttctct cccgggacag 5340 catgccaaac ctgcttacag ctgtcaagca atatctaagt gtgaactctt gaa 5393 // ID SINE_DR2 repbase; DNA; ZEB; 622 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 01-APR-2002 (Rel. 7.03, Last updated, Version 1) XX DE Putative Zebrafish SINE element - a consensus. XX KW SINE; Non-LTR Retrotransposon; Transposable Element; MER6; DANA; KW SINE_DR2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-622 RA Jurka J. and Drazkiewicz A.; RT "SINE_DR2: SINE-like retroelement from Zebrafish."; RL Repbase Reports 2(3), 11-11 (2002). XX DR [1] (Consensus) XX CC Contains ~200 bp segment similar to HE1_SINE, MER6 and DANA CC elements CC starting around position 201. The 200 bp segment contains CC a hairpin-like GC-rich structure. 
XX SQ Sequence 622 BP; 152 A; 167 C; 125 G; 172 T; 6 other; ttaagtgaag tttatttata aactaatttc gagaggatca cgtgcttatg attgatcacg 60 gctggtcccg cattagctaa ttcatgattc accaatcaga tgattcctaa gccactataa 120 ataaccngag tttcttatca cagttatctt cgttttgaag aatcccccct tccaccccta 180 ctcctcctcc tttcctgatg ggnggcacgg tggcccagtg gttagcactg ttgcctcaca 240 gcaagaatgt cactggttca agtccttacc aggccagtng acgtttctgt gcggagtttn 300 catgttctcc ccgtgctcgc gtgggtttcc cccgggttct ccggtttcct cccacngtcc 360 aaaaacatgc aacataagtt aattgactaa tccaaattag caccatagac aagctctaaa 420 gnagttatct cttgcaatca ctatctgttc attagctact aagcagggga gttctcgaga 480 tctacctgag ctcaaactcc cctctcgccc tgcaaacggg agggagcccc gggctcgagg 540 atctttgagc tcagggctct ctcccgggac agcatgccaa acaagcttat aaaatcatca 600 gctaagtgtg aactcttgaa at 622 // ID REX1-1_DR repbase; DNA; ZEB; 3394 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 08-JAN-2009 (Rel. 7.05, Last updated, Version 2) XX DE REX1-1_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; CR1 clade; AP endonuclease; KW REX1 subclade ORF2; REX1-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3394 RA Kapitonov V.V. and Jurka J.; RT "REX1-1_DR, a family of CR1-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(5), 29-29 (2002). XX DR [1] (Consensus) XX CC REX1-1_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish a few million years ago. The consensus CC sequence encodes one protein, REX1-1_DR1p (positions 288-2876). CC The 863-aa REX1-1_DR1p protein is composed of the AP endonuclease CC (positions 1-200) and reverse transcriptase. REX1-1_DR copies are CC ~9% divergent from the consensus sequence. Approximately 1000 CC copies of REX1-1_DR are present in the zebrafish genome. 
XX FH Key Location/Qualifiers FT CDS 288..2876 FT /product="REX1-1_DR1p" FT /translation="HGRLKAKLKLIPHRLSLPSIFLANVQSLVNKMDEIRL FT RINHSKRLWNCNVMIFTETWLNSGIPDNAVFLTEHNTFRADRTADDFDQRH FT CSANLEFLMVKCRPFYLPREFTFTIVTAAYIPPDADAKLVMNELPAISKQQ FT TAHPEATFIVAEDFNHSNLKTVLPKIHQKDFCHKSGNKTLDHVHTNMAEAY FT VMNPPPPLGSIRSPFLFLTPKYSLLINRVKPSVRTIKVWPAGVDSTLQDRF FT QHTDWSIFASQANYGPYIDIISYTSSVLEYITTAIDSVTTQKQISTYPNQK FT PWMNKEVCLLLKARNTAFRSGDAQAYSTSRANLKRGIKKAKHCYKLKLEEH FT FSNSDPRCMWQGIQAISNYKPSQTTSTATNVSFLNELNDFYARFESDNKEA FT YTRITSSTDHSPITLTSSEVYTALSQINVCKAAGPDGIPGHVLKACAEQLA FT GVFTDIFNLSLNLAAVPTCFKTTSIVPVPKHCSPTCLNDYRPVALTPIIMK FT CFERLVLAHLKDSLPSTLDPHQFAYRGNRSTEDADSIALHSVLTHLDNKNI FT YARMLFVDFRSAFNTVIPSKLMIKLRDLDIDTSLCNWVMDFLTNRPQNVRS FT GHICSTTVTLNTGVPQGCVLSPFLYSLFTINCRPVNRSNTIIKFADDTTVI FT GLISNNDETAYREEIQHLATWCTDNNLLLNTNKTKELIVDFRKGRTGSHDP FT IHINGMAVEPVSSFKFLGTHISKDLSWTTNTSSLVKKAHQRLFFLRQLKKN FT QLSSAVLVNFYRCTIESILTNCVTVWYGSCSVAERKALQRVVKTAQRITGT FT TLPAIEDIQKKHCLRRARSILKDTSHPAHRLFSLLLSGRRFRLPRTKTSRL FT RNSFFPRAPF" XX SQ Sequence 3394 BP; 958 A; 937 C; 653 G; 846 T; 0 other; gcaagatggt ggcgcataca cattacgagg ctgagcgtct ctccagtttc tgcagttttg 60 cagtattatt cctgcttatt tcgggtctgt tcgtgcagaa cagtggtgcc tttacatcgt 120 acacccgaca ggagatcttg gatatttgtt tgtgcattcc ggacagtttt attagcaatc 180 ttcgactcat ccctgagatt gccagaacac ccgaggctga gcggcccggc tggccgggcg 240 gatgtgctta aaggcggcgt cgagacggta aacaaaggcg ggggtagcac ggcaggctaa 300 aagctaagct aaagctaata ccacaccggc tctctttacc cagcatcttt ctcgccaatg 360 tacagtcact ggtgaacaaa atggatgaga ttcgactgcg cataaaccac agcaaaagac 420 tatggaactg taatgtcatg attttcacag aaacatggct aaacagcggg ataccagaca 480 atgctgtatt tttaactgag cataacacat ttcgagcaga cagaacggcg gatgacttcg 540 atcagagaca ttgctctgct aacctggaat ttctaatggt taaatgtaga ccgttttatc 600 taccaaggga gttcacattc accattgtaa ctgctgctta tattcctcct gatgctgatg 660 ccaagcttgt tatgaatgaa cttccagcca tcagcaaaca acagactgct cacccggagg 720 caacttttat tgttgcggaa gattttaatc actcaaactt aaagacagtg cttcccaaaa 780 
ttcatcaaaa agatttctgc cacaaaagtg gaaacaaaac cttggaccat gtacacacaa 840 acatggctga agcctatgtt atgaaccccc ctcccccact tgggtcaatc agatcacctt 900 ttttgttcct cacgcccaag tactcactcc tcatcaaccg tgtgaagcca tcagtgagaa 960 ccatcaaagt gtggccagcg ggggtagact ccacactcca ggacaggttc caacacacag 1020 actggagtat attcgcttcc caggccaact atggccctta catagacata attagttaca 1080 cttcctcagt tctggaatac atcaccaccg ccatagacag tgttacaacc cagaaacaga 1140 tcagtacata cccgaatcag aagccatgga tgaacaagga ggtgtgcctc ctgctgaagg 1200 cacgcaacac tgccttcaga tcaggggatg cacaggccta cagcacttcc agggctaatc 1260 tgaaaagggg catcaaaaag gccaagcact gttacaagct aaagctagag gagcactttt 1320 ccaactctga tcctcggtgc atgtggcagg gcatccaggc catcagcaac tacaaaccca 1380 gccagactac atccacagcc acaaatgtct ccttcctgaa cgagctaaat gacttttatg 1440 ctcgctttga aagtgacaat aaagaagcct acaccaggat cacttcctca accgaccact 1500 cacctatcac actcacctcc tcagaagtct acaccgcact gagtcagatc aatgtgtgta 1560 aggctgctgg accagacggt atccctgggc acgtcctcaa agcatgtgca gaacagctcg 1620 ctggggtatt cacagacatt ttcaacctgt cacttaacct agcagctgtg ccaacatgct 1680 ttaaaaccac ctctattgtg ccagtgccca aacactgcag cccaacatgc ctgaatgact 1740 accgccctgt agcactcaca cccatcatca tgaagtgctt cgagcggttg gtcctggcac 1800 atctgaaaga ctctctgcca tccacactgg acccacatca gtttgcctac cgtggcaaca 1860 ggagcacaga agatgcagac tccatagcac tgcactctgt actcacacac ctggacaata 1920 aaaacattta tgcacgaatg ctgtttgttg acttccgctc agcattcaac actgtcatac 1980 cctccaagtt aatgatcaaa cttagagacc tggatatcga cacgtctctc tgcaactggg 2040 ttatggactt tctgactaac agacctcaga atgttagatc aggccacatc tgctccacca 2100 ccgtcacact caacactggt gtaccacagg gctgtgtgct gagccccttc ctctactccc 2160 tttttaccat caactgtagg cctgtgaaca gatccaacac catcatcaaa tttgcagatg 2220 acaccacagt gattggtcta atcagcaaca atgatgagac ggcctacagg gaggagatac 2280 agcatctggc cacttggtgc acagacaata atctgctcct taacaccaac aagaccaagg 2340 agctcattgt ggacttcagg aagggacgaa caggctcaca tgatccaatc cacatcaatg 2400 ggatggccgt tgagcctgtc tcatccttca agttcctggg gacccacatc tcaaaggacc 2460 tttcctggac 
caccaacacc tccagtctgg tcaagaaggc tcaccagcgc ctatttttct 2520 taaggcaact taagaagaac cagctttcat cagccgtctt ggtgaacttc taccgctgca 2580 caatagaaag catcctgacc aactgcgtca cagtctggta tggaagctgc tctgttgctg 2640 agcgtaaggc actgcagcgg gtggtgaaaa ctgcccaacg catcacaggg accacactgc 2700 cagccataga ggacatccag aagaaacact gtctgcgccg agcacgcagt attcttaagg 2760 acacctctca ccctgctcac agactgtttt cactcctgct ttccggcagg cgcttcaggc 2820 tcccccggac aaaaacgagc agactgagga acagcttttt ccccagagct cccttttgaa 2880 ctctgcccct cactgactct tttgccccac cccaatacac ccccactctc ctctaactta 2940 tactcctcac aatcactgca ctatttaaca tttgcacatt taaaatttgc acatattcat 3000 tgcactacat tgcactgatt cacttatttg aactgtacac acccactgca catggacatt 3060 tgtaattatg tacacaccca ctgtacatat acatttgtaa ttatgtttat ttatctgcac 3120 acttctgatt attaatagca acctgtacat atattcattt attgtaaatc tctgttcata 3180 gctaatacaa cctgtatata atgttcatag tacatccatc tgtaaatatc accatagttt 3240 ttctataact gcactttata acttataccc gtatcctgca cttgctgcta ttgcactgct 3300 ggttagacct aaactgcatt tcgttgcctt gtacttgtac atgtgtaatg acaataaagt 3360 tgaatctaat ctaatctaat ctaatctaat ctaa 3394 // ID I-2_DR repbase; DNA; ZEB; 5528 BP. XX AC . XX DT 03-OCT-2008 (Rel. 13.1, Created) DT 11-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Non-LTR retrotransposon from zebrafish - consensus. XX KW I; Non-LTR Retrotransposon; Transposable Element; Nonautonomous; KW I-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5528 RA Jurka J.; RT "I-type retrotransposons from zebrafish."; RL Repbase Reports 8(10), 1342-1342 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX FH Key Location/Qualifiers FT CDS 462..1673 FT /product="I-2_DR_1p" FT /translation="MANKQNPKRKLSDQDLDCNFGNGISLLPKSENWQRFI FT LLESLQSDMPLSKLSPFAIQKGISGIAGTVKDCKKLRSGQILVECSKKVQA FT ENLLRANMLAGVAMKTFHHPTLNHSKGVIRTRELEDMEETEITTELMTQDV FT IHVKRITIRREDRLIKTGTYILTFSRPLPPEKIQIGYLSVNVDIFTPNPLR FT CFSCQKFGHGSISCKNRPTCVNCGEEKHGDQCKKTPKCSNCAGEHPSSSKD FT CPTWIKEKEIQKIKCTKKVSYLEARKLVENSSFFKLEKTFAAVMKPKLQSV FT GVQTDLTWINEKFEEIKSQTPVNTKCTDHQSQTVLHQTQVNHKEHSSTGQH FT SSKNKNDKKPNPQKEKTSQEYKEMKTSSSKDIEMSDDKKPRSRSTSPKTKT FT KGHSYVLPT*" FT CDS 2066..4765 FT /product="I-2_DR_2p" FT /translation="MGDFNAHNPLWGGKQLDAKGKKMEKLINENDLCLLND FT GSYTYLHPGHGSFSAIDLTICDATLATDFSWYVCDDLCGSDHFPLIITKTA FT ADTQQRPQKWKLEKANWYSFQALCYERLDGKQQDKENPIKWFTEKIIEIAD FT ESVPKTSTKQSKRRNPWFDDQCKELIKARKKAVRCFQKHPTSENLIRIKIC FT RAQARRYIREAKRQSWNKFVSSLTINTPSKKIWDAIRKMKGREGPQLKHID FT NHGTLLTSKHDIANILAETFAKNSSTENYQPNFQKIRTNQETIKLNFISQN FT TEIYNQPFSMEELLNSLKCCHDTAVGPDKVHYQFLKHLPQLSLCLLLDNFN FT EIWKSGNIPPSWQEATIIPXPKPGKEHKDPNNYRPIALTSCVCKTMERMVN FT NRLVWKLESDHQISDFQCGFRRGRCTLDHLINLESYIRNGFIKKEHVVAVF FT FDLEKAYDTTWKYGILKDLYKMGFRGNLPIFISNFLSNRTFRVQVGTTMSD FT PQIQQQGVPQGSILSVTLFMVKINSVTDVIGRNMMCSLYVDDICICYRGKN FT MNIIERQLQLCINKVSNWSTENGFKFSKTKTVCMHFCLLRSLHHDPELFLD FT EEPIKIVKETKFLGLHFDCKMTFIPHIKALKNKCLKNLDLIKVLAKTKWGA FT DSTVLMRLYRLLVRSRLDYGSIIYSSARKSYLQMLDPVHHQGLRLALGAFR FT TSPAQSLYTEANEPPLHIRRLELSLQYALKIKTNQHNPAFKPIFQPQYLDL FT YEERPSYIQPFGVRIRTHIKNLNVNLDSIHQTLICPVPPWRLNKLQVNLDL FT SKHKKADTFPHTYQQAVAGIRHFYPTHNPIYTDGSKKDGHVSAAIVMGQSH FT HGIRIPDQSSIFTAEAKALFLALEHIENAEGHNFIIFSDSKSCLQH*" XX SQ Sequence 5528 BP; 1952 A; 1082 C; 983 G; 1503 T; 8 other; ccccccgtgg ggacccgggg tagaataggt ccccagcacc cccttgctga tcgtaagagg 60 cgacaaatgg ggcaacttgt ttcagccgtg agttgtgacc cgtgtcagtg gaaaaagaga 120 tcctggtatt tgaggaatgt aactgttgcg actattgcag atccaacaca gtactttggc 180 tcctatttca cttagacggg aggttggaag ggcccgatcc aatcaatcgg ctagtcaaga 240 aatgccctgg attatttaac ttttttaaac atctagcaaa tttccagaag gctttggctg 300 agggtcaatt gagtaatgca 
tatttttaaa tatgcttgct tgagtatatc attcttatat 360 ccttgatgta tacccggtgg agatccgcgg ctctgtgcat ccccttgttg ggctccgagg 420 tgggtgagga gtaagagtga tgaatctgaa atcatacaaa aatggcaaac aaacaaaacc 480 caaaaagaaa attatcggat caggacttgg attgtaactt tggaaatggc atcagtctac 540 tacctaaaag tgaaaactgg cagagattta tacttctaga atctctacaa tctgacatgc 600 cactctctaa actgtctcct tttgctattc aaaaaggcat ctctgggatt gcaggaacag 660 tgaaagattg taaaaaacta agatctggac aaattcttgt tgaatgctcc aaaaaggttc 720 aagcagaaaa cctactacgt gcaaatatgc ttgctggtgt tgctatgaaa acttttcacc 780 atccaacttt aaatcatagc aaaggagtaa tccgtaccag agagctagaa gacatggaag 840 agactgaaat tactacagag ttaatgactc aagatgtaat acatgttaaa agaatcacaa 900 tcaggagaga agatcgtctc ataaaaacag gaacctatat tttaacattc agtagacctc 960 tacctccaga aaagattcaa attggatact tgagtgttaa tgttgacatt tttacaccta 1020 atccactcag gtgtttcagc tgccaaaagt ttggacatgg atcgatttct tgcaaaaaca 1080 ggcctacttg tgtaaactgt ggagaggaaa aacatggtga tcaatgtaaa aaaacaccaa 1140 agtgcagcaa ctgtgcagga gaacatccaa gctcatccaa agactgtcct acctggatta 1200 aagaaaaaga aatccagaaa atcaagtgca ctaagaaggt gagctatttg gaagctcgaa 1260 agcttgtaga gaactcctca ttctttaaat tggaaaaaac ttttgctgct gtgatgaaac 1320 caaaactgca gtctgtcggc gttcagacag atttgacgtg gataaatgaa aaatttgaag 1380 aaataaaatc tcaaactcca gtaaatacta agtgtactga tcatcaaagt cagactgtac 1440 ttcatcaaac tcaagtaaat cacaaagaac attcatctac aggacaacac tcatccaaaa 1500 acaaaaatga caaaaaacca aatcctcaaa aagaaaaaac aagccaagag tataaggaga 1560 tgaagaccag cagctcaaaa gacatcgaaa tgtcagacga caaaaaacct cgaagtcgga 1620 gcacttctcc taaaactaaa accaaagggc actcgtatgt cttaccaact tgacagtgga 1680 aagtcatatt attcagtgga actgccgtgg cataagagcc aatttttctg aattacaacg 1740 tcttgcttgc atttataatc ccctggcttt ctgtctccaa gagacccatc ttactccaga 1800 gagcaatata tctctgaaac attttacatg cctaaatgca tatggcccaa accttcaacg 1860 tccttgtggt ggaacatcaa tattacttag acatgatgtt attcatagta atgttgacat 1920 aaatacaaac ctacaagtag tggcagtccg tataactttg caaaatacta ttactctatg 1980 ttctgtttat attcccccag aagctactgt ctcacatcaa 
gatttggaaa atctggtgga 2040 acaaatcccc cctccattta ttctcatggg agattttaat gctcacaatc ctctgtgggg 2100 aggaaaacaa cttgatgcaa agggaaagaa gatggaaaaa ctaattaatg aaaatgacct 2160 gtgtctttta aatgatggtt catacacata cttacaccca ggacatggat ccttctctgc 2220 cattgactta acaatatgtg atgcaacttt agccactgac ttctcatggt atgtctgcga 2280 tgatctctgt ggaagtgatc actttccttt gattataacc aaaacagccg cagatacaca 2340 acagagaccc caaaaatgga aacttgaaaa ggcaaattgg tactcatttc aagccctttg 2400 ttatgagaga cttgatggca agcaacagga taaagaaaat ccaatcaaat ggttcacaga 2460 aaaaattatt gaaatagcag atgaatctgt accaaaaaca tcgacaaaac agagcaagag 2520 aaggaatcct tggtttgatg accaatgtaa agaactcatt aaagcacgga aaaaagcagt 2580 aagatgtttt caaaarcatc caacttctga aaaccttatc aggataaaaa tttgcagagc 2640 acaggcacgg agatatataa gagaagcaaa gagacagagc tggaataaat ttgtgtcaag 2700 cctgacaata aatacaccat caaagaaaat atgggatgca attcggaaaa tgaaaggaag 2760 agagggacca caactaaaac acatcgataa tcacggcact ctgctgacta gtaaacatga 2820 cattgctaat atacttgcag aaacctttgc aaagaattcc tctacagaaa actatcagcc 2880 aaatttccaa aaaatcagaa ctaaccaaga aacaattaaa ttaaatttca tttcccaaaa 2940 cacagagatc tacaatcaac ctttttctat ggaggaactt ttaaactctc taaagtgttg 3000 ccatgacaca gcagtcggac cagacaaagt acattatcag tttcttaaac atcttcccca 3060 attgagtctt tgcctcctcc tggataattt caatgaaatt tggaaatcag gaaatatccc 3120 accatcatgg caagaagcca ccataattcc twtaccaaaa cctgggaagg aacataaaga 3180 ccccaacaat tacagaccta tagcattgac gagctgcgta tgtaaaacta tggagaggat 3240 ggtaaataac cgacttgtct ggaagcttga gtccgatcat cagataagtg atttccaatg 3300 tggtttcaga agaggaaggt gtacattaga tcatctcatc aatctggagt cttacatacg 3360 aaatggattc atcaaaaagg agcatgttgt agctgtattt ttcgaccttg aaaaagcata 3420 tgacactaca tggaaatatg gcattcttaa agatctgtat aaaatgggat tcaggggaaa 3480 tttacccatc ttcatttcaa actttttatc aaatagaact ttccgagttc aagtaggaac 3540 cactatgtca gaccctcaaa tacaacagca aggagttcct caaggcagca tcctttcagt 3600 gactcttttt atggttaaaa ttaatagtgt cacagatgtc attggaagaa acatgatgtg 3660 cagtctttat gtggatgaca tttgtatatg ctacagaggg aaaaacatga 
acataattga 3720 aagacagcta cagttatgca taaacaaagt gagtaattgg tctacagaaa atggtttcaa 3780 attttccaaa accaaaactg tatgtatgca tttctgtctt ttacgatctc tacatcatga 3840 tccagaactc tttttggatg aggaacccat caagattgtt aaagaaacaa aattccttgg 3900 actccatttc gattgtaaaa tgaccttcat cccacatata aaagctctaa agaacaaatg 3960 cttgaagaac ttggatctaa ttaaagttct tgctaaaaca aaatgggggg cagattctac 4020 tgttttaatg aggttgtata gacttttggt gcgctcacga ctggactatg ggagtataat 4080 ttacagctcc gctagaaaat cttatttaca aatgctggat cctgtacacc accaaggact 4140 tagactagct cttggagctt tcagaacttc acctgctcaa agcctataca ctgaagcaaa 4200 tgaaccacct cttcatatta gacgcttgga attatctctc cagtatgcat taaaaataaa 4260 gacaaatcaa cacaacccag ctttcaaacc aatctttcaa cctcaatatt tggatttgta 4320 tgaggaaaga cccagctata ttcaaccatt cggtgtgcgg atcagaacac acataaaaaa 4380 cctwaatgta aatcttgatt ccatccacca gactctcatt tgcccggtcc caccatggag 4440 attaaataag ttgcaagtaa atctggatct aagcaagcat aagaaggctg atacattccc 4500 acatacatat caacaagcag ttgctggcat aagacatttt tatcctaccc ataatccaat 4560 atacacagat ggatcaaaaa aggacggtca tgtttctgca gcaatcgtaa tgggtcaatc 4620 acaccacggc attcgcattc ctgatcaaag ctcaatcttt acagctgaag caaaggcact 4680 ctttttagcg ctggaacaca ttgaaaatgc tgaaggacac aattttatta ttttttctga 4740 ttctaaatct tgtctacagc attaaattct tttaaactgg agcatcccat tattattgac 4800 atttttatca aagtgaatga attacagaga aagttttata atatagtctt ctgctggatc 4860 ccaggacata ctggactact tgggaatgaa caagctgaca aagccgccaa aacagcccta 4920 acaggaaagc tgatggagtg caaaatccca ccttcggatc ttaagccact aataaaaggm 4980 tatattctca acaaatggca atcagaatgg gatcagtgcc cagaaaataa actttttgaa 5040 attcaacctg aaataggaaa aaaatctaat ttatttttta agtcaagaca cgatgagatt 5100 gtgtatagaa gatgccgcat tggacacacg agactcacac atgaacattt actcaaagga 5160 gaagaaccac caaaatgcct ttattgcaac agaaaccaaa cagttaaaca yattcttgtg 5220 gaatgcccca tttttaatgt tatcagaaaa caatatttat ctggagagac tttaaaggaa 5280 atatttttaa acgtgaatcc atctaaagtt gtagaatttt tatcaaaagg aaatcttaaa 5340 aaactktttt agatttttat cttatatatt atcattatta ttwttattta tatattttac 
5400 cttgtacatt atttattgct gttgatgact tttaattgtt agaatatttt tacttgatct 5460 attttctttt tgccatgaca tagccataga agctgaaatg gcaataaaat waaatctctc 5520 tctctctc 5528 // ID RTEX-2_DR repbase; DNA; ZEB; 3260 BP. XX AC . XX DT 23-FEB-2009 (Rel. 14.02, Created) DT 23-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE RTE-like non-LTR retrotransposon - a consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; RTEX-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3260 RA Bao W. and Jurka J.; RT "RTE-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 564-564 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(2..1780,1716..1955,1861..2238,2186..2836) FT /product="RTEX-2_DR_1p" FT /translation="NGRVRGDSLGRYTYSSSLGSSTVDYMITDLDLFSFSA FT FTVKPLTPLSDHSQITLYIKRNKNTNIFSQSSKLXNIKNNYRWVQDSSEKY FT RNAIEDPQIGLLLDKFMENSYPDNGDGVNLAVENINYIFDYLATXSNIXTT FT KKTFKQKQENEKWXDFDCKTIRKNLRQLANQKHRQPENTDLRLQYXEELKK FT YKNTIRKKKEEYTQKQLKTIEESVDSNNFWDNWNLLNKKKREQLTIQNGDT FT WKNHFEKLFSKINMSTEQSLIYNKLTHTEQIFGKYQNPLDYPITEKELLDK FT IQVLPTKKASGPDGILNEXIKNINHKFQLAIMKLFNLVLSVGHFPDIWNKG FT LITPIFKSGDKSDPNNYRGICVSSNLGKLFCSIINTRLIGFLTEHNVLSKS FT QIGFLPNYRTSDHIFTLQTLIDKYVHQNKNKIFACFVDFQKAFDSIWHEGL FT LSKLLESGIGGKTYNIIKTMYSNNQCAIKIGNKRTEFFXQGRGVRQGCPLS FT PTLFNIYINQLAMLLEQADTPGLTLYDSSVKFLLFADDLVLLSPTEEGLQQ FT QLDILHSFCQTWALTVNPKKTKTLIFQKKTQMSGKETQLHPWYDKNYSRKK FT PRCQGKKHSFTLGMTKIEPAISYTYLGLKISANGKLNLAVNELKEKARRAF FT YAIKKINTNRNSNSNLAQNFPVSDKRKQEGPFTPLKKSTQIEIPIRIWLKI FT FQSVIEPIVLYGSEVWGPLLSHEFDKWDKNPVESLHAEFCRSILRVQRNTP FT SNACRAELGQYPLLMRIEKQSVKFWKHIKMSDPNSYHFKALKTPRNGHVTP FT TLIILKPSKHQEMDIEKSLLIQMVLKLQTQTNTTNMTNSRQHQDTDMLIHK FT 
IKPNQIINARKEEYLIYWNEAAGKQSKLQCYLDLNRDYTTATYLSAVKDSK FT LRTTMTKYRLSAHSLTVETGRYRQNWQPRESRICPHCAQAEVETEEHFLTH FT CTNYQHIRETFYTKLQSIYPQFTELDNKTQLQYLLGEKNECVLLATQYINA FT CHKKREQNQ*" XX SQ Sequence 3260 BP; 1230 A; 618 C; 572 G; 829 T; 11 other; caatggcagg gtgagaggag actctctggg cagatacacc tacagctcaa gtcttggtag 60 ctcaacagtt gactacatga tcacagattt agatctgttc tctttcagtg cattcactgt 120 taaaccccta acacctctat cagatcacag ccaaattaca ctttatataa aaaggaacaa 180 aaatactaat atattttcac aatcyagtaa attgtrtaac attaaaaata attatagatg 240 ggtacaagac agctctgaga aatacaggaa cgcaattgaa gacccccaaa taggtttact 300 tttggataaa tttatggaaa atagttatcc tgataatgga gatggtgtta atctagctgt 360 agaaaacata aattacatat ttgattattt ggcaacgatr tctaatatta amactactaa 420 gaaaacyttt aaacaaaaac aagagaatga aaaatggyat gactttgatt gtaaaacaat 480 aaggaaaaac ttaagacaac tagcaaatca aaaacacaga caaccagaga atacagattt 540 acgtcttcag tactrtgagg agcttaaaaa atataaaaac acaatcagaa aaaagaaaga 600 agagtacacc caaaaacagc tcaaaacaat tgaagaatct gttgactcaa acaatttctg 660 ggacaactgg aacctcctta ataaaaagaa acgtgaacag ctaacaatac aaaatggaga 720 cacctggaaa aaccactttg aaaaactgtt tagtaaaata aacatgagta cagaacaatc 780 gcttatatat aataaattaa cccatacaga gcaaatattt ggcaaatatc aaaatccgtt 840 agactaccca attacagaaa aagagctact agataaaata caagtcctac caacaaaaaa 900 ggcaagtggt ccagacggta tccttaatga aatkattaaa aacataaatc acaaattcca 960 attggctata atgaaactgt ttaatttggt tctgagtgtt ggtcatttcc ctgacatatg 1020 gaataaagga ttaataacgc ccatattcaa gagtggagac aaatcagacc ctaataatta 1080 cagaggcatc tgtgtgagca gtaatctggg gaagctgttc tgcagcatca tcaacaccag 1140 actcataggc ttccttacag agcacaatgt cctcagcaaa agtcagattg ggtttctgcc 1200 aaattacaga acatcagacc atatcttcac ccttcagact ctgattgaca aatacgtcca 1260 tcaaaacaaa aacaaaatat ttgcttgctt tgtagatttt cagaaagcat ttgattcaat 1320 ttggcatgaa ggtctgctgt ctaaacttct agaatcaggt attggcggta aaacatacaa 1380 cattataaaa acaatgtatt cgaacaatca atgcgcaatt aaaataggaa ataagcgaac 1440 agaattcttc astcaggggc ggggtgtgag acagggctgt ccactgtcac caaccctctt 1500 
caatatttac atcaatcaat tggcaatgct cctagagcaa gcagatacac cgggtctcac 1560 actatacgac tccagtgtga agttcctgct gtttgcagat gatctggtgc tgctgtcgcc 1620 aacagaagag ggtctacagc agcagctgga tatcctgcac agcttctgtc agacctgggc 1680 cctgaccgtt aacccaaaga aaactaaaac cctaatattc cagaaaaaaa cccagatgtc 1740 agggaaagaa acacagcttc acccttggta tgacaaaaat tgaacctgcc ataagctaca 1800 cataccttgg gttgaaaata agtgctaatg gaaaactaaa tttggctgtg aatgaactga 1860 aagagaaagc aagaagggcc ttttacgcca ttaaaaaaat caacacaaat agaaattcca 1920 attcgaatct ggctcaaaat tttccagtca gtgattgaac ctatagttct atatgggagt 1980 gaagtgtggg gtcctctcct cagtcatgag tttgataagt gggataaaaa tccagttgaa 2040 agcctacatg cagagttctg taggagcatc ctcagggtac agaggaacac acccagcaat 2100 gcatgcaggg cagaattagg ccagtatcct ctactcatgc gcattgagaa acaatctgtt 2160 aaattttgga aacacattaa aatgagtgac cccaactctt atcattttaa agccctcaaa 2220 acaccaagaa atggacattg aaaaaagcct cctgattcag atggtcctga agctgcaaac 2280 acaaaccaac acaacgaata tgactaacag caggcagcat caggacacgg acatgctcat 2340 ccacaaaatt aagcccaatc aaataataaa tgcacgaaaa gaagagtatc tgatttactg 2400 gaatgaagca gcmggaaaac aaagtaaact tcaatgttat ttagacctaa atagagatta 2460 taccacagca acatatctga gtgcagtaaa ggacagtaaa ctaagaacaa caatgaccaa 2520 atacaggctg agtgctcaca gtctgactgt agagacgggc cgatacagac agaactggca 2580 acccagagag agccgcatct gcccacactg tgcccaggca gaggtggaga cggaggaaca 2640 cttcctcacc cactgcacaa actatcagca catcagagaa acattctaca ccaaactaca 2700 gagcatttac ccacaattca ctgagttaga caataaaaca cagcttcaat atttactagg 2760 ggaaaaaaat gagtgtgtcc ttctagcgac acaatacatt aatgcctgtc acaaaaagag 2820 agagcagaac caataaaaac atcagtgatg cgcctgcaaa cacacacaca cacacacaca 2880 cacatagatg attataaatg aatcattgta aatatactga ttattattgt ttttattact 2940 gttttatcct aaaatgttga ttgttaaatg tacagtttat tctgatctat ttttntattg 3000 atttgtatga ataatatgat atgttgaata ataaaaataa tatgatgtat tgttgatatt 3060 ttatttaatt attatatata aatatcatct ttttttacta ctactattta ctgtgtgtac 3120 tgttttttat tatatatgta tgttactttt tatgtaaact ttataactgc tttggcaata 3180 catttgtaac 
atttgtcatg ccaataaagc aattattgaa ttgaattgag agagagggag 3240 agagagagag agagagagag 3260 // ID TDR18 repbase; DNA; ZEB; 572 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon. XX KW DNA transposon; Transposable Element; TDR18. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-572 RA Jurka J.; RT "TDR18: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 28-28 (2002). XX DR [1] (Consensus) XX CC TA target site duplication. XX SQ Sequence 572 BP; 207 A; 80 C; 81 G; 202 T; 2 other; tacagttgaa gtcagaatta ttagcccccc tgaattatta gcccccctgt ttattttttt 60 ccccaatttc tgtttaacgg agagaagatt ttttttttca acacatttct aaacataata 120 gttttaataa ctcatttcta ataactgatt tattttatct ttgccatgat gacagtaaat 180 aatatttkac tagatatttt tcaagacact tctatacagc ttaaagtgac atttaaaggc 240 ttaactaggt taattaggtt aactaggcag gttagggtaa ttaggcaagt tattgtataa 300 cgatggtttg ttctgtagac tatcgaaaaa aatatattag cttgcttaaa ggggctaata 360 attttgacct taaaaatggt ttttaaaaaa nttataaatt aaaaactgct tttattctag 420 ccgaaataaa acaaataaga ctttctccag aagaaaaaat attatcagac atactgtgaa 480 aatttccttg ctctgttaaa catcatttgg gaaatattta aaaaagaaaa aaaaaatcaa 540 aggggggcta ataattctga cttcaactgt at 572 // ID Gypsy38-I_DR repbase; DNA; ZEB; 5385 BP. XX AC . XX DT 21-SEP-2007 (Rel. 12.09, Created) DT 02-OCT-2007 (Rel. 12.09, Last updated, Version 1) XX DE An internal portion of the Gypsy38_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; GYPSY superfamily; integrase; Gypsy38_DR; Gypsy38-LTR_DR; KW Gypsy38-I_DR. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5385 RA Dib M.R. and Naveira H.F.; RT "Gypsy38_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 7(9), 806-806 (2007). XX DR [1] (Consensus) XX CC Gypsy38-I_DR is an internal portion of the Gypsy38_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy38-LTR_DR. CC Gypsy38_DR is characterized by 4-bp target site duplications. The CC internal portion encodes one polyprotein, the 1525-aa CC Gypsy37_DR1p (pos. 594-5168), composed of the gag, protease, CC reverse transcriptase, and integrase domains. Some insertions CC are fairly recent, judging by the high identity between their CC flanking LTRs. The consensus was obtained from an alignment of at least CC five independent insertions sharing at least 85% identity over at CC least 1000 bp. XX FH Key Location/Qualifiers FT CDS 594..5168 FT /product="Gypsy38-I_DR_1p" FT /note="ORF." 
FT /translation="MSSELDIFFESPNEESFDKLTKEQLIELANKYDVHLT FT TKDKKLKESIAVVVKAGLVRLGLFEAEKSVVGEEREIFTPSSKYETSLSFE FT QQKELMLLELEKAKINGELEMRRLEVEALRFRLIGEGKLGESANSAIVSAK FT KGLDVTNNIKLLPKFSETDVDTFFRLFERLSTSMAWSEQEQTMMLQCVLVG FT KAQKAYAALSAVDSGSYSKIKEAVLKAYELVPEAYRQRFRNIRKSNAQTYS FT EFVSELKLQLDRWCSASEVKTYENLYELVLLEQFKATLPEHVVLFLTERKV FT KSAEDAAVISEYVLTHKVERMGSGVKNKLRSDVFGESAALRPSAFEMSKTE FT KADKALKPYGDNNDRCAYCHMKGHWKKECLVLKGKLSRVKSEQKAVLTVSS FT VQTDENANDDPVVLQKPVMMVRAANESYSPFITDGYVSLKENAEKTPVKIL FT RDTGASESFILESALPFSQETSTGSNVLIQGIGLNTMSVPLHRLFLQSELV FT SGPVVLGIRPSLPVEGVSVILGNNLAGDRVWPDVPPPPVVTTTPVLGETDI FT STDFPETFVSCAFTRAMRKQGIDEQELLEREIPVTSNKSVPGFIAVPSITR FT HDWEVAQCEDVSLKPLFDVALSQDAAESSSSCYFVHNNVLLRKWTPNKEED FT LGGAVIQVVVPVSLRDAVLAAAHGGMSGHVGVNKTYQHLLQYFYWPRVKSD FT IRRYIKICPTCQKTGKPNQTLKPAPLYPIPVLEPPFQHLIVDCVGPLPPSK FT SGSKYLLTVMCQSTRYPAAYPLSSITTRSVVKALSQFISIFGIPKIVQSDQ FT GTNFTSKMFSEVLQQLGIRHNKSSAYHPESQGALERFHQTLKSLLKAYCTE FT LKGDWELGLPWLLLAAREFVQESLGFSPNQLVFAHSVRGTLSVMTDSVVPN FT EPPQSLLKYVLGFRRRLLLAGELAKEKLEKAQKKMKGWFDRKSAVRKFSSG FT DQVLALLPLPESPFCAQFSGPYTVLRSVSDQNYVLSTPERRKSSQLCHVNL FT LKPYYSRDGIETSKSPVMLANTTLVSESENDVKIPDDAILHPRLNNSESLK FT RLDELLKHLPEKHCAELTSLLLAFSNLFSDTPTRTDVIEHDIDIEGSKPVR FT QRFYRVSLDKQKKLEAEVNYMLQNNIAKPSFSDWASPCLLVGKPDGSQRFC FT TDYRKVNAITKPDSFPLPRIEDCVDQVGSASYVSKFDLLKGYWQVPLTPRA FT QEISSFITPFGLFSYSVMSFGLRNAPATFQRLMNRVTSGLEGCAVYLDDIV FT IYSDTWDQHLTRIRDLFTRLTAANLTVNLAKCEFARATVTYLGKVVGRGEV FT RPVRAKVLAIDNFPPPETKRELMRFLGMVGFYRSFCSNFSSVVAPLTDLLK FT SKVKFDWTKKCEDAFENVKRMLTSSPVLAAPRLADPFKLQVDASHIGAGAV FT LLQADENGIDRPISYFSRKFNSYQLNYSIIEKEALALIWALQHFEVYLTSG FT ITPIVIYTDHNPLTFLHSLQNPNQRLIRWSLFLQPFALDIRHIKGVDNVLA FT DTLSRAPYG" XX SQ Sequence 5385 BP; 1521 A; 920 C; 1251 G; 1693 T; 0 other; aatgggggct cgtcctcaag taacttggga accttgatta aaaacacctg ttagatttcc 60 atgtggaagc tccaggagtt ttgttagtgt gcggtggcgt ttgttttgtt ttgttttgca 120 gtcccttgct ttctcttgca ttgctttgag aaatttgaaa atagcatcgg aagtgggtaa 180 gtttattttg atatctatgc tatcacgaat gtgtttctat ttagcccgat 
gatatttggg 240 agtgtagttt gaggtgatag tgtatttgat cacaaccgtt tgatgtttct aacacagggc 300 tgatatcccc atgttaggca tagcaacgct ttgcgttggt ttttgttgtt ttagtttgtt 360 atttttgtgc taacgccgag ttaagagcac tgccgctttt gtcggtctag tggtttgtcc 420 gtagatttag acatctgccg gtatttgagt gcttaaagat acttgactct tggtgaaagc 480 atgaagatta gggtgagtta gaatagggaa aaataacaaa agcatagggg gataatcagg 540 ttgtgtttga aatttataga aattcgtttg aaaaggaaaa aataaaggga aaaatgtcct 600 ccgaattaga catatttttt gagtcaccaa atgaagaatc atttgacaaa ctaacaaaag 660 agcagttgat agagttggcg aataaatatg acgtacactt gacaacaaaa gataagaaat 720 tgaaggaatc tattgctgta gttgtaaagg caggtttagt tcgtttaggg ttatttgaag 780 ctgaaaagtc tgttgttgga gaagaacgtg aaatatttac accgagttca aaatatgaaa 840 catcattgtc atttgaacaa caaaaagaat taatgctttt ggagttagag aaagcaaaaa 900 ttaacggtga actcgaaatg cgcagattag aggttgaagc actgcgcttt cgactaattg 960 gtgagggaaa gttaggtgaa agtgcgaaca gtgcaattgt gtctgctaaa aaaggtttgg 1020 atgtgacaaa taatattaag ttgttaccga aatttagtga aacagacgtt gacacatttt 1080 tcagattgtt tgaaagatta agcacatcga tggcctggtc agagcaggaa cagactatga 1140 tgcttcagtg cgtacttgtg ggaaaagcac agaaagcata tgctgcttta tcagcggtag 1200 atagtgggag ttatagtaag atcaaagaag ctgtgctaaa agcttacgag ttggttcctg 1260 aggcttacag gcagagattt agaaatattc gaaaatctaa tgcacaaaca tattctgaat 1320 ttgtgtctga actaaaactt cagttagatc gttggtgctc tgcgtcggaa gttaaaacgt 1380 atgagaattt gtatgaacta gtcctgttgg aacaatttaa agctacgctt cctgaacatg 1440 ttgttctttt tttgacagaa cgcaaagtga agtctgcaga agatgctgct gttatctctg 1500 agtatgtact cacgcataaa gttgagagaa tggggagtgg cgttaaaaat aaattaagat 1560 ctgatgtttt tggcgaaagt gcagctttac gaccttctgc ttttgaaatg agtaaaactg 1620 aaaaggcaga caaggcttta aagccttatg gtgataacaa tgaccgctgt gcgtattgtc 1680 atatgaaggg tcattggaaa aaagagtgtc ttgtactaaa agggaaattg tcacgtgtaa 1740 aatctgagca aaaagctgtc ttgactgtat cttcagttca gactgatgaa aatgctaatg 1800 atgatcctgt tgtgttgcag aagccagtga tgatggttag agcagctaat gaaagttatt 1860 ctcctttcat tacagatgga tatgtgtcac ttaaggaaaa tgctgagaaa acacctgtaa 1920 
aaattcttcg tgatacgggt gcatctgaat catttatttt agaatctgct ctaccttttt 1980 cacaggaaac ttcaaccggg agtaatgtgt taatacaagg aattggttta aatacaatgt 2040 ctgtcccttt gcataggttg tttcttcagt ctgaattagt gagtgggcca gtggtgttgg 2100 gaattcgtcc ttctttacca gtagagggag tgtcagttat tttggggaat aatctggcag 2160 gtgatcgggt atggcctgat gttccaccac ctccagtggt gacaaccact cctgttttgg 2220 gtgagactga tatttctaca gattttccag agacatttgt gtcatgtgca tttacacgtg 2280 ctatgcgaaa acaagggatt gatgaacaag aattgttgga gagggaaatt cctgtgactt 2340 ctaataaatc tgttccaggc tttattgcag taccctcaat tactcgtcat gactgggaag 2400 tggctcagtg tgaagatgtt tctctgaaac ccttgtttga tgtggctttg tcccaggatg 2460 cagcagaaag ttctagttca tgctattttg ttcacaacaa cgtgctgcta cgaaaatgga 2520 ctccgaacaa agaggaggat ttgggtggag ctgtgataca ggttgtagtt ccggtttctt 2580 tgcgtgatgc cgtattagca gctgctcatg gaggtatgtc tggtcatgtg ggggtgaaca 2640 aaacttacca acacttgctg cagtatttct attggcctcg tgtaaaatct gacataagac 2700 gatacattaa aatatgccca acatgtcaga agactgggaa acccaatcag actttgaaac 2760 ccgctccttt gtatcctata cctgttttgg aaccaccttt tcaacatctg atagtggact 2820 gtgtagggcc attaccacct tctaagtcag gaagtaaata tttactaaca gtgatgtgtc 2880 agagcactcg atatcctgca gcatatcctt taagttctat tacaactaga tctgtagtga 2940 aagcgttgtc acagtttatt tcaatctttg gaattccgaa gattgttcag agtgaccaag 3000 ggacaaattt cacatcaaaa atgttttctg aggtgttaca gcaattagga atacgtcata 3060 acaagtcaag cgcttatcat cctgagagcc aaggtgctct ggaacgcttt caccagacat 3120 tgaaatcttt attgaaagcc tattgcacag agttgaaagg agattgggag ctgggtttgc 3180 catggttgct attagcagcc agggagtttg ttcaagaaag tctgggtttt agtccaaatc 3240 agctcgtttt tgcacactct gtacgaggaa ccttgtctgt aatgaccgat agtgttgtgc 3300 caaatgaacc acctcaaagt ttgctcaaat atgttttagg ttttcgaaga cgtttgctgc 3360 tggctgggga actagccaag gaaaagctgg aaaaagccca gaagaaaatg aagggttggt 3420 ttgataggaa gtcggctgtt cgtaagttta gttcaggaga tcaggttttg gctttgcttc 3480 ccctgcctga gtcgcctttt tgtgcacaat tttcaggacc ctacacagtg ttgcgatcag 3540 tgtctgatca gaattatgtg ttgtctactc ctgagcgtag gaagtcatct cagttgtgtc 3600 atgtgaatct 
attaaagcct tattatagta gagatggaat tgagactagt aaatcaccag 3660 tgatgttggc taacactact ttagtgagcg aatctgaaaa tgatgttaaa attccggacg 3720 atgctatttt acatccccgg cttaacaatt cagagtcttt gaaacgctta gatgagttgc 3780 tgaaacatct gcctgagaaa cattgtgctg aactgacttc actgttactt gcattttcta 3840 atctattttc agatacgcct acccgcactg atgtaattga acatgacatc gacatagagg 3900 gatcaaaacc agttagacag cgattttacc gtgtgtcttt ggacaagcag aagaagctgg 3960 aagctgaagt aaattatatg ctgcagaata acatagcgaa accttccttt tcagattggg 4020 cgtcaccctg tttgcttgtt ggtaagcctg atggctcgca acgattttgc acagattata 4080 ggaaggtaaa tgcgataaca aaaccagatt catttccttt acctaggatt gaagattgtg 4140 tggatcaggt gggttctgcc tcctatgtga gcaaatttga tttgctcaag ggatattggc 4200 aggtaccact gactccacgt gcccaagaaa tatcttcatt tattacacct tttggccttt 4260 tttcttattc tgttatgagt tttggtttgc gaaacgctcc agctacgttt cagcgattaa 4320 tgaacagggt aacatcagga ttggaagggt gcgctgttta tctagatgac attgtcatct 4380 atagtgacac atgggatcag catctaactc gcattcgtga cttattcacc cgtctgaccg 4440 ctgcaaatct cacagtgaat ctagcaaagt gtgagtttgc tagagccacc gtgacctatt 4500 tgggaaaggt agtgggcagg ggggaagtcc gacctgttcg agcaaaggtt ttggccattg 4560 ataattttcc acctcccgaa acaaagagag aattgatgcg atttttaggt atggtagggt 4620 tttaccgaag cttttgttcc aatttttctt ctgtggtggc tcccctcaca gatttgctta 4680 agtcaaaagt aaaatttgat tggacaaaaa agtgtgaaga tgcgtttgaa aacgtaaaga 4740 gaatgctaac ttcatctcct gttttggcag ctccacgact ggctgatcca tttaaacttc 4800 aggtggatgc aagccacatc ggggctggtg cagttttgtt acaggctgat gaaaatggta 4860 tcgatcgacc tattagctat ttttcacgaa agtttaactc gtaccaacta aattattcaa 4920 ttatcgagaa ggaagcttta gcattgattt gggcacttca acattttgag gtttacctga 4980 cttctggtat caccccaatt gtgatatata ccgatcataa tccccttacc ttcttgcatt 5040 ctctacaaaa tccaaatcag cgtctgattc gatggtctct ctttttacaa ccatttgctc 5100 tggatattcg tcacataaag ggcgtagaca atgtgttggc tgacaccttg tctagagccc 5160 cttatggcta gtggtttgaa tgtgtttctt ttttttagtt ctaggatgtg tggctttccc 5220 ttaaattgct gcctccagat attgggagat tgctggtgat tgctcaacag ttttctaaga 5280 aaaaggaaag tttcattttc 
attttgtttt tgttttaatt tatttgggag agtttataaa 5340 gttaaggaca gagattatta taatctcttt cttttgaggg ggagg 5385 // ID Gypsy12-I_DR repbase; DNA; ZEB; 6169 BP. XX AC chr8; XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE An internal portion of the Gypsy12_DR LTR retrotransposon - a DE fossilized copy. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW GYPSY superfamily; Gypsy12-I_DR; Gypsy12-LTR_DR; Gypsy12_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6169 RA Kapitonov V.V. and Jurka J.; RT "Gypsy12_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 317-317 (2004). XX DR Zebrafish.; chr8; Positions 22688544 22682376. XX CC Gypsy12-I_DR is an internal portion of the Gypsy12_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy12-LTR_DR. Gypsy12_DR is characterized by 4-bp target CC site duplications. The internal portion encodes two proteins: CC the 469-aa gag Gypsy12_DR1p (pos. 153-1559) and 1543-aa CC polyprotein (pos. 1475-6105, conceptual translation) composed CC of the protease, reverse transcriptase, and integrase domains. CC PBS is identical to that in Gypsy9_DR. The internal portion is CC flanked by 99% identical LTRs. 
XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy12-I_DR1p" FT /translation="MDISQTAQWSTEENINSSRAIVLSNVPLNTSDETIEK FT VLNTVKVFGRTQIHGRRGDVTGKHLFVLVETRADLDPSTIPPEIGIESEAG FT PWPVHFVGRLQVQNPAPENDTFQSKLLTLMQQEGKSMDEVKAILMGEHSPK FT SDINVDLVDAIGKLVDRCNQASNDGPSYRKLRLFSGLKPVPPGEEEYEIWM FT EQAAQMISEWQCTEASKKQRIVESLRGPAADIVRFLKVSQPSATATEYLAA FT LETAYGTTECGPDLMAKFRHTYQDNGEKLSAFLYRLDKLLHRALLKGGIDV FT AGINKARMEQLIKGALTNDMVALRIRMTHTLQNPPSFTQLMKEIREEEHWV FT AARENVKASVATVISPQSDGPSELQSLKKEVKELSSQMSHLLNVATATCAS FT ECAPQKTSSKNSESVKRDKSQPTKLTQQPVPGIFCYKCGEDGHKKWECKGQ FT EDLRKVNQKLIKMHRLQGNWAGVQ" FT CDS 0..0 FT /product="Gypsy12-I_DR2p" FT /translation="MQGTRGPQESKSKADQNASFAGKLGRSSVKERHGAPG FT TTRSNCDSFLLDASKPRLPEGLIGPVSEVPVQIEGVYAKALLDSGSQVTLL FT YRSFYDTYLKHLELQPVENLEIWGLSSHKYPYDGYLPLRLEFTESVAGVHQ FT IIDTLAIVCPDPVKREGIAILLGTNTSLVKKLLESCRKQAGEEFLNVLTIH FT PVIREAFETIQLTDFSQDDSDTHGTVWFIKHNPVVLKPNQVRQLPGLLKFP FT GQSTESLVLVDRVADSDTSSEHLDVRPELHAASVVSSRQVTVTVRNMSNRE FT IWVKRGTPLAHVLPVSLVPQLTAKQPPVQNPLSPASFDFGDSPMPEEAKQS FT LREKMMQRKDVFSLHEWDVGCSKSTTHEIRLNDSRPFRERSRRLAPADLED FT VRLHLQELQSSGIISESRSPYASPIVVVRKKSGKVRMCVDYRTLNQRTTPD FT QYTVPRIEDALHSLSGSKWFSVLDLRSGYYQIPMSDADKEKTAFICPLGFY FT QFERMPQGICGAPATFQRVMERTVGDMNFLEVLVYLDDLIVFGRTIDEHEE FT RLLKVLDRLSDEGLKISLDKCQFGRTSVNYVGHIVSQDGISTDPSKIEAVV FT SWPKPQTVTELRSFLGFCGYYRRFVKDFSKFCRPLNELLKGYPSTRKNRDS FT LVCNTKPCYKSSEPFGSRWSAQCDTAFETLKKCLTQAPVLAFADVQKPYVL FT HVDASMDGLGGVLYQEHESGLRPVAFISRSLSPSERNYPAHKLEFLALKWA FT VVDRLHDYLYGVPFEVRTDNNPLTYITKSAKLDAAGHRWLSALTTYNFSLK FT YRPGRRNVDADALSRRPHTYRSEDDEWQEIPAVGVRTFCQAVSLEKRAENG FT FYTCVVKQAGAHMSAVPKAYSHVMEVAADHLPLFSSSDLQTAQRNDSLLGE FT VWKAVCDKKPASSIRSSHPSMKILKREWEKLVVNNGMLYRIVRQSNHKVKQ FT QLVLPKQFHSSVLKSLHDDIGHLGFEKSYGLVRDRFYWPHMKPDVESYCKT FT CERCIKRKTLPQRAAPLSHMQSSGPLDLVCIDFLSIEADSRNVCNVLVVTD FT HYTRYAQAFPTRDQKASTVAKTLWEKYFIHYGLPTRIHSDQGRDFESQLVS FT EMLTMLGIKKSRTSPYHPQGDPHLRDLTELXNMLGTLQPSQKSKWSQHIAR FT LVHAYNCTVNEATGFSPYFLMFGREARLPVDVCFGVSADSSSSGSYSKYVS FT 
KMKQELQAAYQLAQVSSEKMNQSNKARYDQKVRYHSLSVGDRVLIRNLGLK FT GKQKLADRWSENPYVVESQLSGIPVYRLKPVDGNGPIKVMHRNHLLPLGQE FT VRLKPKVDLGPTSLPKNLRHRSVKDKRKTAQSENPPIAVDVFSREHDSSDS FT DSEYGCYVEDMAPISSESAQEIQGETPEQAVECLDYSNAHRVSEIPVIPCQ FT SETSEFQLTDADRNTDVVDDVMTEVSESTSQTTVQTTDNETQPEIVPFEVR FT RSNRERKPSTRFTYDKLGVPYLHSVSSKCCGINALTTDVLNVYGSLNKSHA FT WWCNPNALCKTCKNRPVLVPCKQMVAM" XX SQ Sequence 6169 BP; 1783 A; 1200 C; 1458 G; 1728 T; 0 other; tttttggagg caccgctggg atcttgtttt tttccttttt accagatttt tttttctctt 60 tagcagaaaa aaatatatat atctgaaata gtgttaatta atattgatat atttttctgg 120 cttattttgc atagtaactc attagttgaa gtatggatat ctctcagact gctcaatgga 180 gtacagagga gaacataaat tcctcacgtg ccattgtgtt aagcaatgtt cctttgaaca 240 ctagtgatga gactattgag aaagtgttaa acacagtgaa ggtttttggt cgtactcaaa 300 ttcatggtcg acgtggtgat gtgactggaa aacatttgtt tgtgttagtg gagactagag 360 ctgatcttga tccaagcacc ataccacctg aaataggtat tgaaagtgaa gctggaccct 420 ggcctgtaca ctttgtaggt agactacaag ttcagaaccc tgctcctgaa aatgacacat 480 ttcagtccaa gttgttaaca ttaatgcagc aggagggcaa gtctatggat gaagtgaaag 540 ccattttgat gggggagcat tctcctaaat ctgatattaa tgtggattta gttgatgcca 600 taggtaaatt agtggacagg tgtaatcaag cgtctaatga tggacccagt tacagaaaac 660 taaggttgtt ttcaggtctg aaacctgttc ctccaggtga ggaagaatat gaaatctgga 720 tggagcaagc cgcacaaatg atcagcgaat ggcaatgcac tgaagcttct aagaaacaac 780 gcattgttga gagtttgcga ggtcctgctg ctgatatcgt taggtttcta aaagtgagcc 840 agccatctgc cactgcaact gagtacttgg ctgctcttga aactgcgtat ggaactactg 900 agtgtgggcc tgacttgatg gctaaatttc gtcacactta ccaggataat ggagaaaaac 960 tttcagcttt cttgtatcgc ttagataaac ttcttcacag agcgttgtta aagggtggga 1020 ttgatgtagc tggaataaac aaagctagaa tggagcagct aattaaggga gcacttacca 1080 atgatatggt tgctctgcga atcagaatga ctcacacttt gcagaatccc ccatctttta 1140 cacagttaat gaaggaaata cgtgaggagg aacactgggt ggctgcaagg gaaaatgtca 1200 aagcttcggt tgccactgtt atctctcctc agtcagatgg gccctctgag ttacaaagcc 1260 tgaagaagga ggtgaaggag ctatcttcac agatgagtca cctattgaat gtggctactg 1320 caacgtgtgc ttctgagtgt 
gctcctcaga aaacatctag taaaaactct gagagtgtga 1380 aacgagacaa atcacaacca actaaactca cccagcaacc agtgcctggg atcttttgct 1440 acaaatgtgg tgaggatgga cataaaaagt gggaatgcaa gggacaagag gacctcagga 1500 aagtaaatca aaagctgatc aaaatgcatc gtttgcaggg aaactgggca ggagttcagt 1560 gaaggaacgg cacggggctc ctgggacaac acgttccaat tgtgattctt ttttacttga 1620 tgccagtaaa ccaagattgc ctgaagggtt gataggacct gtttctgaag tgcctgtcca 1680 gatagaaggt gtttatgcaa aagcccttct tgacagtggc tcacaggtga ctctattata 1740 ccgcagtttt tatgacactt atctaaaaca cttggaactt cagcctgtgg aaaaccttga 1800 gatatggggt ttaagttcgc ataaataccc ctatgatggg tacttgcccc ttagacttga 1860 gtttacagag agtgtagctg gagtgcatca aataattgac acacttgcga ttgtatgccc 1920 tgaccctgta aagcgagaag gaatagccat tttgctcggg actaacacta gtctggtgaa 1980 gaagctactt gagtcttgtc gtaaacaagc tggcgaagaa ttccttaatg tgctgaccat 2040 acatcctgta atcagagaag catttgagac tattcaacta acagattttt ctcaagatga 2100 ctccgacacg catgggacag tttggttcat aaagcataac ccagttgtcc taaaaccaaa 2160 ccaagttagg cagcttcctg gtctattgaa atttcctggt caatcgactg agtcattagt 2220 attagttgac agagtagcag atagtgacac aagttctgag cacctagatg tgagacctga 2280 actgcatgca gcatctgttg tatccagtcg gcaagttaca gtgactgtga ggaacatgtc 2340 taatagagaa atatgggtga agagaggaac tccgcttgca cacgttcttc cggtgtcctt 2400 agtgccacaa ttgactgcta aacaaccacc agtacaaaat cctttgtcac ctgcttcttt 2460 tgattttgga gattccccaa tgcctgagga agcaaaacag agcttacggg agaaaatgat 2520 gcagagaaag gatgtgtttt ctctacacga gtgggacgtg ggctgttcaa aaagcaccac 2580 tcatgagata aggttgaatg attcgcgccc tttcagagag cgatctcgtc gtcttgcccc 2640 tgctgactta gaagatgtgc gactgcattt acaagaactg cagagtagtg gtattatttc 2700 tgagtctcgc agcccctacg cttcacctat tgttgttgtg cgtaaaaagt cagggaaggt 2760 tagaatgtgt gtcgactatc gaacacttaa tcaacgaact acaccagacc aatatactgt 2820 gccccgcatt gaagatgctc tccatagtct atcgggaagt aagtggttca gtgttctcga 2880 tttgaggagt gggtactacc agatacccat gagtgatgct gataaggaaa agactgcgtt 2940 catatgcccg ttagggttct atcaatttga acgtatgcct cagggtattt gtggagcacc 3000 cgccactttt caaagagtca tggaacgtac 
tgtaggggat atgaactttt tggaagtgct 3060 tgtatacctt gatgatttga ttgtctttgg gcgaaccatt gatgagcacg aagagcgtct 3120 tttgaaagtg ctcgataggc taagtgatga gggactaaag atctcccttg acaagtgtca 3180 gtttggtagg acttcagtga actatgtagg acatatagtg tcacaagatg gaatttcgac 3240 agatccgtcc aagatagagg ctgttgtatc ctggcctaag ccccagacag tgacagagct 3300 caggtctttt ctaggattct gtggatatta caggcgcttt gttaaggatt tctcgaagtt 3360 ttgccgccct cttaatgaat tgctgaaggg atatccatct accaggaaga acagagattc 3420 acttgtttgc aacactaagc cctgctataa gtcctctgaa ccgtttggtt ctcgatggtc 3480 ggctcagtgt gatacagctt ttgaaacgtt gaaaaagtgt ttgacacaag caccagtgtt 3540 agcctttgct gatgtacaga agccctatgt cttgcacgtg gacgcaagca tggatggact 3600 aggaggagtt ttgtatcaag aacatgagag cggattacgt ccagtagctt ttatcagtcg 3660 cagcttatcc ccttcagaga gaaactatcc agcccataaa ctggaattct tagcactgaa 3720 gtgggctgtt gtggatcgac tgcacgacta tctatatggt gttccgtttg aggttagaac 3780 cgacaataat cctctaacct atataacaaa atcagcaaaa ctggatgcgg caggtcaccg 3840 ttggctgtct gccttgacca cctacaattt cagcctaaaa tacagacctg gccgcagaaa 3900 tgttgatgct gatgcgttgt cgaggcgtcc gcatacttac cgcagtgagg acgatgagtg 3960 gcaggaaatt ccagctgtag gggtgaggac tttttgtcaa gctgtgtccc ttgaaaagag 4020 agcagagaac ggattttaca cttgtgtggt gaaacaagct ggagcacaca tgtctgctgt 4080 ccccaaagct tactctcatg ttatggaagt agctgctgat cacctaccct tgtttagttc 4140 aagtgacctt caaacagctc agaggaatga ctctttactt ggcgaagtgt ggaaagctgt 4200 gtgtgataag aaacctgcta gtagcattcg aagtagccat cctagtatga agattctgaa 4260 acgggagtgg gaaaaattgg tagtaaataa tgggatgctt tacagaattg tcagacagag 4320 taatcacaaa gtgaaacaac agcttgtact accaaaacaa tttcacagct cagtgctgaa 4380 gtctttgcat gacgacattg gacatttagg gtttgagaag tcctatggtc tagtccgaga 4440 tcgattttat tggccacaca tgaagcctga tgtggaatcc tactgtaaga cctgtgagcg 4500 ctgcatcaaa agaaagacgt tacctcaaag agcggctcct ttgtcacata tgcagagttc 4560 aggacctctg gaccttgtat gtattgattt tctttccatt gaagctgact ctcgaaatgt 4620 gtgcaatgtc ttagtggtta ctgaccatta cacacgctat gcccaagctt ttcccactag 4680 agatcaaaaa gcttcaacag tggcaaagac tttgtgggag 
aaatatttca tacattatgg 4740 tctccctact cgaattcact ctgaccaagg cagagatttt gagagccagt tagtgtctga 4800 aatgctgact atgctaggga ttaagaaatc tagaacatca ccatatcatc cccaaggtga 4860 tccccacctg agagatttaa cagaactctg ttgaatatgc tgggtacctt acagcctagt 4920 cagaaaagta aatggagcca gcacatagca cgtttagtac atgcttataa ttgtactgtc 4980 aatgaagcta caggtttctc tccttacttt ttgatgttcg gccgagaagc tagactgcct 5040 gttgatgttt gctttggtgt gtctgctgat agctcatcat ctggttccta ttcaaagtat 5100 gtgtccaaaa tgaagcagga attacaggca gcttatcagt tggctcaggt ttcctctgaa 5160 aagatgaatc aaagtaacaa agcaaggtat gatcagaaag ttcgctatca tagtttaagt 5220 gtgggggaca gagttctgat ccgaaacctt ggtctcaagg gaaaacaaaa acttgccgat 5280 aggtggagtg aaaatcctta tgtagtggaa agtcaattgt ctggtattcc agtttatcgt 5340 ttgaagcctg ttgatggtaa tggaccaatt aaagtcatgc accggaatca cctcttgcct 5400 ttaggacaag aagtaaggct aaagcccaag gtagatttag gtcctacttc tttacctaag 5460 aatttaaggc atagaagtgt gaaagacaag cgtaaaactg ctcagtcaga aaacccgcct 5520 attgcagttg atgttttctc aagagaacat gattcttcag attcagactc tgaatatggg 5580 tgttatgttg aggatatggc accgatttca tctgaaagtg ctcaggagat acagggtgaa 5640 actcctgaac aagctgtaga atgcttagac tatagcaatg ctcaccgtgt gtctgaaata 5700 ccagttatcc catgtcaaag tgaaacctct gaatttcagt taaccgatgc agataggaac 5760 actgatgtag tggatgatgt aatgactgag gtttctgaga gtacatcaca gactactgtt 5820 cagactactg ataatgaaac tcagccagag attgtacctt ttgaagtacg taggtctaat 5880 agggagagaa aaccttctac cagatttact tatgacaagc tgggtgtacc ataccttcat 5940 tctgtatcat ctaaatgctg tggtattaat gcacttacta cagatgtgct taatgtttat 6000 gggagtttaa ataaatcaca tgcttggtgg tgtaatccca atgctttgtg taaaacctgt 6060 aaaaaccgac ctgtacttgt gccctgtaaa cagatggttg ctatgtaatt gatctgcagc 6120 aatttagtaa ccagcatggg gacatactgg atatttggtg gggggagta 6169 // ID Gypsy7-I_DR repbase; DNA; ZEB; 5822 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 07-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE Gypsy7-I_DR is an internal portion of the Gypsy7_DR LTR DE retrotransposon - a consensus sequence. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW GYPSY superfamily; Gypsy7-I_DR; Gypsy7-LTR_DR; Gypsy7_DR; KW endogenous retrovirus; gag; integrase; protease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5822 RA Kapitonov V.V. and Jurka J.; RT "Gypsy7_DR LTR retrotransposon from zebrafish."; RL Repbase Reports 4(11), 291-291 (2004). XX DR [1] (Consensus) XX CC Gypsy7-I_DR is a consensus sequence of the internal portion of CC Gypsy7_DR LTR retrotransposons. Its long terminal repeat is CC deposited in Repbase as Gypsy7-LTR_DR. The internal portion CC encodes the 1658-aa Gypsy7_DRp polyprotein (pos. 476-5449) CC composed of gag, protease, reverse transcriptase, and CC integrase domains. Given that some Gypsy7_DR genomic copies CC are flanked by 100% identical LTRs, it is possible that CC Gypsy7_DR elements are still transpositionally active. The gag CC domain is similar to the Arc protein important for long-term CC spatial memory in vertebrates (mammals, birds). Presumably, CC Arc was derived some 300-400 million years ago from a CC gypsy-encoded gag protein. 
XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy7-I_DRp" FT /translation="MANVNPSPSTSVDIDPPDIATPVWPPVQQRQFSSPSN FT IPTYHSTPTQLDPYGRTQVHFHTTTPGVTSTVQPDPMQLCTSASTVESPPS FT TATQHALPGYLPTPGREIHQLTAHVQGNWDRVFDCLKRQDKAVKELTEKSS FT KSFSLHEAKLAKMESTHQQLLNTLTAQRKDDTETADQLTKAVKVMVTQEIQ FT RSESTLISEIRFMVEQAQLELQKDIQATKEHSDKNFERLSSDLNHCSTEIN FT AIKNQLDNLQTEISDVIPPIKQVSDPPSSAPVSVSTQSSSSVTAPMPFQTP FT VIKSDHLKLTFPTFGRPSDDADPLLYVTRCKDFLALHPLDDPDILATFRTV FT LYGTARDWWEVARSAISTWSEFETAFLSAFLSEDYEDELAERVRTRTQAEK FT ESIRDFAFTYRAMCKRWKPTLTESELVKMILKNIKPHLASQLRSRVHTVDE FT LVKLGLQLEKDYVQQLHYVEHVTQPSPQRIAPNRVEKPPVLCWRCKGLHPP FT GSCPHYSSSVQTTQSSSHPPPTGNKRYFQTQKHGGNPSNNAMSVTLPSKSL FT PKSTVTKSVVIPQQLIVPIYIGAWRGKAILDTGASYTLLHESLWKEIDPQA FT SLHPWTLGPLYLANGEAEVPLGWTNFEIILHDKVFPTQAAILTPKALAYSV FT VLGLDFIYSSGLQINVVDQTYSFKSNPNEEYPFQPGHASVPVGRSQHLNKN FT AQTQHSSKTLSLLSSIPPPLPFPVVSQLAPSSDDQALIEMAVAEAHLPLES FT KPQLLHLLQSNPKVCTLQLGRTTVLQHCIYTTHPVPVKQRPYRLTPGKQAI FT VEEQIEEMLKAGVIEQSCSPWASPVVLVPKKDNSLRFCVDYRKLNAMTESD FT AYPIPNITEILESLSGASTFSSLDLNCGFWQVPMDDKSKLMTAFITSRGLY FT HFNVMPFGLKNAPATFQRLMEIVLRDLLGKICYVYIDDIVIYSPTLTQHLH FT DIQTILERLEKAGLTLNLKKCSFCLPEITFLGHVVSHQGVAADPKKVEVIH FT AYPVPQNLKDVQRFLGLAGWYHRFVPNFSRIAEPLNNLKKKGRQFKWDSLC FT QQAFDNLKFCLTTPPILGHPDLNIPFTVYTDASDSGLGAVLTQRKEQGGEE FT VIAYASRTLTKAEVNYSTTEKECLAVVWALDKWQHYLEPRMFTVVTDHSAL FT QWVMNSTKPASRLMRWALRLQRYDFVIEYRKGRLNVAPDALSRMYSMPGCN FT LYTTEKDLPDFPVTPQTIWEEQHQDTDIMKIFQALAKNEQQEQAQYTVLED FT KLYHITHLADETVHYKVVIPSTLRPTVLEWYHDTPLSGHLGIYKTYKRIQD FT VAYWPGMWTDIKKYVKNCAKCQVTKWDNRKPAGKLQQVTTSRPNEMWGVDI FT MGPMPKSGKQNEYLLVFVDYFSKWVELFPMRHATAQTIATILRQEMLTRWG FT VPDFILSDRGAQFVSSLFTELCGKWNITPKLTTAYHPQTNMTERVNRTLKS FT MIAGFVEDNHKTWDTYLPELRFALNSAIQESIGMTPAELHLGRKIHSPMDK FT LLHRRDLSPTKPAYDMVHKITQLQRQAKENYTKAQKRQLRSYDKNRRDVFF FT RERERVWVRNFPISSAQHHFSAKLAPKWKGPYRIIQQLGPVNYQVSLEDTG FT EDVRNVHVCNLKPCFPTAEELEAREKNCTKILPQQDQKRF" XX SQ Sequence 5822 BP; 1796 A; 1330 C; 1183 G; 1513 T; 0 other; taagtggcgc ccgaacaggg accctgaaca cttaaaaaaa aaaaaaaaaa 
aaaaaaaaaa 60 aaaactgaac actttaaaag acattgaaca ccattgaaac cttactttat tttgggaaat 120 tttgaacttt gaaactcatt tgaattgttt gactgatttc agttgacaac aacctttttg 180 actttttgaa ctgttttgct ttactgacaa agactttgga aaaaacaatt ttgttaaaaa 240 aaaaaaaaaa aaaaaaagtt gttatagaac atttgtgtca ttgtacagga attgtactga 300 ttttgatttt gtataccttg tgaatttggt acttgtggta cttttgacat ctctctctct 360 atatatatat tttttgtgac cacattgcct aaaaagtcaa cattttcttt tttgattttt 420 attttttcaa aacccttata ttgttccttg acttacactc atacacacta taaacatggc 480 caacgtcaat ccttcccctt caacttccgt ggatattgat ccaccagata tagccactcc 540 agtttggcca ccagtgcagc aaagacagtt ctcatcacct tccaacattc ccacatacca 600 ctccacaccc actcagttag acccttatgg aaggactcaa gtgcatttcc acaccaccac 660 tccaggtgtt acttctacag ttcaacctga cccaatgcaa ctgtgcacaa gtgcatccac 720 ggtagaatca ccgccctcca cagcaaccca gcatgctctt cctggatacc tccctacacc 780 tggaagagaa attcatcaac ttactgctca tgtacaagga aactgggatc gtgtatttga 840 ctgtctgaaa cggcaagata aagctgtgaa ggaactcacc gaaaaatcgt ctaaatcttt 900 ttccctgcat gaagcaaagc ttgcaaaaat ggaatccact catcagcaac tcctgaacac 960 cttaactgca caacgaaaag atgacacaga gacagcggat caactcacta aagctgtgaa 1020 ggtgatggtg acacaagaaa tccaaaggag tgaaagtacc ttaatttcag agattcgctt 1080 catggtggaa caagctcagt tggaattgca gaaggatatt caagctacca aggaacactc 1140 tgacaagaat tttgaacgcc tttccagtga tctaaatcac tgcagcactg aaattaatgc 1200 cataaaaaac caacttgaca atcttcaaac agaaataagt gatgtcatcc cacctataaa 1260 gcaagtgtct gatcctccaa gcagtgcacc tgtatccgtt tcaacacagt cttcatcttc 1320 agtgactgct ccaatgcctt ttcaaacacc tgttataaaa agtgatcatt taaagttaac 1380 ttttccaacg tttggaagac cttcggatga tgctgatcca ctgctatatg taacacgctg 1440 caaagatttc ctggccttac accctctaga tgatccagac atcctagcta ccttccgcac 1500 tgtcctgtac ggtacagccc gggattggtg ggaagtggct cgctctgcta tttccacatg 1560 gagtgagttt gaaactgctt ttctctcagc tttcctttca gaagactatg aggatgagct 1620 ggcagagagg gttagaacta gaacacaagc agagaaagag tcaattagag actttgcttt 1680 tacatacaga gcaatgtgta aacgatggaa gcccacatta actgagagtg aattagtaaa 1740 aatgattcta 
aaaaacataa aacctcacct agccagccaa cttcgaagcc gtgtccatac 1800 agtggatgag ttggttaaac tgggccttca gcttgagaag gattatgttc agcagttaca 1860 ttatgtagaa catgtgactc aaccctcacc acaaagaatt gcccccaacc gagttgagaa 1920 acctccagtt ttgtgttgga gatgcaaagg tctgcatcca ccaggtagtt gtcctcacta 1980 ttcctcctct gtgcaaacca ctcaatcatc tagtcaccct cctcctactg gaaataaacg 2040 ttattttcag acccaaaagc acggaggtaa tccatctaac aatgccatgt ctgttacact 2100 tccttcaaag tcattaccca agtcgactgt tactaaatct gtggtcatac cacaacagct 2160 gatagttcca atttacattg gggcttggag aggaaaagcc atattggata cgggtgccag 2220 ttacacttta ctccatgaga gtttgtggaa ggagatcgat ccccaagcca gcctccatcc 2280 ctggacactt ggcccactct atctggccaa tggagaagcc gaagttcctt taggatggac 2340 gaattttgaa atcatattgc atgacaaagt ttttcctact caagctgcca ttctcactcc 2400 aaaagccttg gcttactctg tagtcttggg tttagatttc atttattcaa gtggtctaca 2460 gattaatgta gttgaccaga catactcttt taagtccaac cctaatgaag agtacccttt 2520 tcaacctgga catgctagtg ttcctgtggg aagatcccaa catttgaaca aaaatgcaca 2580 aacccaacat tcaagtaaga cactatctct gctcagctct attcctccac cattaccgtt 2640 tccagtagta tcccaacttg cacccagtag tgatgatcaa gctctgattg agatggctgt 2700 tgccgaagca cacttaccac tagaaagtaa gccacagtta cttcatcttc tccagtcaaa 2760 cccaaaagtc tgtactcttc agcttggaag aaccactgtt cttcaacatt gcatttacac 2820 cactcaccca gtacccgtta agcaacgtcc ttatcggttg acacctggaa aacaagccat 2880 agtagaggaa cagattgaag agatgctaaa ggctggtgtc atcgaacagt cttgttctcc 2940 atgggcatct ccagtagttc ttgttcctaa gaaagacaac agtcttaggt tctgtgtgga 3000 ctacagaaaa ttgaatgcga tgacagaaag tgatgcttat ccaataccta acatcacaga 3060 gattttagag tctctttctg gagcatccac attctcatcc ttggacctca actgtggatt 3120 ttggcaggta ccaatggatg acaaaagcaa gttgatgact gcattcatca cctctagagg 3180 gttatatcat ttcaatgtta tgccctttgg actgaaaaat gctcctgcta ccttccaacg 3240 tttgatggaa atcgtcctga gagatttact tgggaaaatt tgctacgtct atattgacga 3300 cattgtcatt tactcaccca ccttgaccca acatcttcac gacatccaga ccatcttgga 3360 gagactggaa aaagcaggtc taaccctaaa cctaaaaaaa tgttcctttt gcctacctga 3420 aattaccttt ctaggacacg 
tagtgagtca ccaaggagtt gcagctgacc ccaagaaggt 3480 agaggtcatt cacgcttacc cagtcccaca aaaccttaag gatgttcagc gattcttagg 3540 actggcagga tggtatcacc gttttgtacc aaatttttca cgcattgctg aaccactgaa 3600 taatctgaaa aagaaaggac gacaattcaa gtgggattca ctatgccagc aagcatttga 3660 caatctaaag ttctgtctta ccacacctcc catcctgggc catccagatc ttaacatacc 3720 ttttactgtg tatactgatg ccagtgactc aggactaggg gctgttttga cccagcgtaa 3780 agagcagggt ggcgaagaag taattgctta tgccagtaga accttgacta aggcagaagt 3840 gaattactcc accacggaga aagagtgtct ggctgtggtg tgggctttag acaagtggca 3900 acactacctg gaacctagaa tgtttacagt ggttacagac cattccgctc tgcaatgggt 3960 catgaattcc accaaaccag ccagtcgact catgagatgg gccttgcgct tgcaacgcta 4020 tgattttgtg atcgagtaca gaaaaggacg gctgaatgtt gctcctgatg cgttgtcccg 4080 tatgtattcc atgccaggct gtaacttgta caccacggaa aaggatctgc ctgatttccc 4140 tgtcacccca caaaccatct gggaggaaca acatcaagac acagacatta tgaagatctt 4200 tcaagctctg gccaaaaatg agcaacagga acaagcccag tacactgtgt tggaagacaa 4260 gctgtatcac atcacccacc tagcagatga aactgttcac tacaaagtag tcattccatc 4320 tactcttaga ccaacagtac tagaatggta ccatgatact cccttaagcg gacacttggg 4380 aatttacaag acgtacaagc gaatacaaga tgttgcttat tggccaggaa tgtggacaga 4440 cataaaaaaa tatgtcaaaa attgtgccaa atgtcaagtc accaaatggg acaaccggaa 4500 acctgctggc aagttacaac aagttacaac atcacgacca aatgagatgt ggggagtgga 4560 tataatgggt ccaatgccga agtctggaaa acaaaatgag tacttactcg tatttgtcga 4620 ctatttctcc aaatgggttg aactgtttcc catgcggcat gccacagcac agaccattgc 4680 caccatacta agacaagaaa tgttgactcg gtggggagtc cctgacttca tattgtcaga 4740 cagaggagcg cagtttgttt cttctttatt cacagagctg tgtggaaaat ggaacatcac 4800 tccaaaactt accactgctt atcacccaca gaccaacatg acagaaagag tgaatcgcac 4860 tttgaagtct atgattgcag ggtttgtgga ggacaaccac aagacctggg atacatactt 4920 accagagtta cgttttgctt taaattctgc aatacaggaa tccattggga tgacgcctgc 4980 cgaacttcac ctaggtcgga aaatccacag tcccatggat aaactgctgc acagacgtga 5040 tctctcacca accaagcctg catacgacat ggtacacaaa ataacacagt tacaaaggca 5100 agccaaagaa aattacacaa aggctcaaaa 
acggcagtta aggagctatg acaagaacag 5160 aagagatgtg ttcttcagag aaagagagcg tgtatgggtc cgtaattttc ccatctctag 5220 tgcacaacat cacttcagtg ctaaactagc tccaaagtgg aaaggaccat accgcattat 5280 ccagcaacta ggtcctgtga actaccaggt atctcttgaa gacactggtg aggatgtgag 5340 aaatgttcat gtgtgtaatc ttaaaccatg tttccccacg gcagaggagc tggaagcaag 5400 ggagaaaaat tgcacaaaga tcctcccaca gcaggatcaa aaaagatttt aaaaatgtaa 5460 atcctcgtga gcattgaaca acatgggttg ttctcacgaa gggggggaga gtgtgacgag 5520 atggatgctt taatgtttat ttttctcaag tgccactagg gggcgctgct ccgaccggtc 5580 cttcctatcg cttccagaca atacttccgg gggtcggaag gaagcggaag ggcaggtaaa 5640 caatcggaga ttataaaaga gggagaaaag gccagagaag gggcttcttg ttgttttggt 5700 gtcgggtatt tggtggcaag aattggagaa ggaggagaag aacggtgtaa ggtggagagc 5760 ttgttggttt caatgactgt gtgaggaaaa aactgagggg tggaagcaat tcgtggacgt 5820 gg 5822 // ID ANGEL repbase; DNA; ZEB; 308 BP. XX AC . XX DT 20-JUN-2000 (Rel. 4, Created) DT 20-JUN-2000 (Rel. 4, Last updated, Version 1) XX DE Non-autonomous DNA transposon - ANGEL. XX KW Zebrafish MITE; ANGEL; Miniature Inverted Repeat Element (MITE); KW DNA element; consensus. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-308 RA Izsvak Z., Ivics Z., Shimoda N., Mohn D., Okamoto H. RA and Hackett B.P.; RT "Short inverted-repeat transposable elements in teleost fish and RT implications for a mechanism of their amplification."; RL J Mol Evol 48, 13-21 (1999). XX RN [2] RP 1-308 RA Izsvak Z., Ivics Z. and Hackett B.P.; RT "Repetitive elements and their genetic applications in RT zebrafish."; RL Biochem Cell Biol 75, 507-523 (1997). XX RN [3] RP 1-308 RA Ivics Z., Izsvak Z. and Hackett B.P.; RT "Genetic applications of transposons and other repetitive RT elements in zebrafish."; RL Methods Cell Biol 60, 99-131 (1999). 
XX RN [4] RP 1-308 RA Jurka J.; RT "Consensus."; RL Direct Submission to Repbase Update (JUN-2000). XX DR [4] (Consensus) XX SQ Sequence 308 BP; 109 A; 47 C; 50 G; 99 T; 3 other; ttaagggata gttcacccaa aaatgaaaat ttactcaatt tactcaccct yaagttgttc 60 caaaccttat gagtttcttt cttctgttga acacaaaaga agatattttg aagaatgttg 120 gactgtaacc attgacttcc atagtaggaa aaacaaatac tatggaagtc aatggttaca 180 ggtttccaac atttttcaaa atatcttctt ttgtgttcaa cagaagaaag aaactcaaah 240 aggtttgaac aatnaagggt gagtaaatga tgacagaatt ttcatttttg ggtgaactat 300 ccctttaa 308 // ID TDR2 repbase; DNA; ZEB; 941 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW Harbinger; DNA transposon; Transposable Element; Tc1 superfamily; KW Tdr2. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-941 RA Gottgens B., Barton M.L., Grafham D., Vaudin M. and Green R.A.; RT "Tdr2, a new zebrafish transposon of the TC1 family."; RL Unpublished. XX RN [2] RP 1-941 RA Jurka J. and Drazkiewicz A.; RT "TDR2: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 30-30 (2002). XX DR [2] (Consensus) XX CC TA-target site duplication. 
XX SQ Sequence 941 BP; 294 A; 183 C; 190 G; 272 T; 2 other; tacactacct gacaaaagtc ttgtcgtcga tcccagttgt aagagcaaca aataataact 60 tgacttctag ttgatcattt ggaaaagtgt cagaaggtag atttttctca gatgaatcat 120 ctgttgaact gcatcccaat catcacaaat actgcagaag acctattgga acctgcatgg 180 acccaagatt ctcacagaaa tcagtcaagt ttggtgaagg aaaaamtcat ggtttggggt 240 tacattcagt atgggggcgt gcaagagatc tgcagagtgg atggcaacat caacagcctg 300 aggtatcaag acatttgtgc tgcccattac attacaaacc acaggagagg gcaaattctt 360 cagcaggata gcgctccttc tcatacttca gcctccacwa catcaaagtt cctgaaagca 420 aagaaggtca aggtgctcca ggattggcca gcccagtcac cagacatgaa cattattgag 480 catgtctggg gtaagatgga ggaggcattg aagatgaatc caaagaatct tgatgaactc 540 tgggagtcct gcaagaacgc tttctttgcc attccagatg actttattaa taagttattt 600 gagtcattgc agagatgtat ggatgcagtc ctccaagctc atgggagtca tacacaatat 660 taattctttt tccactgcac catgacttta tattctatac tgtacattat ttctgttaag 720 tgacaagact tttgtctaag caaagtcaga ccttactgtc ctaattaaat aattaaaaat 780 caaggcatga tcatatttta ttttggtaaa ataagtgtaa tctagaggcc tttgcctttc 840 atataagcca cttctgatac caaatgatca actagaagtc aagttattat ttgttgttcc 900 taaaacttgg ataggcgaca agacttttgt caggtagtgt a 941 // ID DIRS-4N1_DR repbase; DNA; ZEB; 4462 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE nonautonomous DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-4_DR; DIRS-4N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4462 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(10), 1269-1269 (2008). 
XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 4462 BP; 1232 A; 1229 C; 763 G; 1237 T; 1 other; aagtgaagtt tcataaacta atttcgagag gagcacgtga tatgattgtg caccgctggc 60 cactcatccg taatcagtaa taatccaatc agaatgatcc tagcttagta taaatggatc 120 actttctccc cattgcacta tcttcatttt ggaagaatgc ttcccgctac aaacgctcca 180 gcatttaaac tacgcttcag catcattcaa ccttctggca tggaagaaca aacaatcaac 240 aacaacaaca acaacttctc caaccagaac cccgctctcc cctaaactcc agcagctgcc 300 tcgccagcaa tcgagccaga atccccgcaa gaagccggca tatcaactct tccatcttct 360 acttcatctg cccttcacta cacaagccac aacttcgtct aaacttatga cgccggcgat 420 tcaagccacc ggagagaagc cgaactcaca gatcacctaa cgttaacggt ccactttcgg 480 taacgtgcat gctttaaagg gaagtctggc gagatcagcg tgaatagaac gctttaaatg 540 aagaactcac ctctcagcct cccgctggtc agcagatcca taacattctg acagctgaaa 600 tagttcgaag gcaaatgacc aggcaatcta atccataaac attattccat aacaccttca 660 ttcgtcgata tacatccatg aaaaggtgca ctgatatcca aaccagctgc tggtgagagt 720 ggagccatac tttcacgcat aaacattgta gcgatctgat ctacaaaatg gccgccggcm 780 tttgcactct gaactcttga ccgtgactcc aaggagccaa tagcttaaag gggaagtatc 840 accatccaat gagctttcca aaactggaga cggtcccgcc ttctctcccg acaaattcat 900 gaatggactg cgtgcagaga ctttgattcc ggccaaataa tcatgttgag attacagaca 960 tgttacttat catggatgga gtaaagccat gataaacatt acattcacac agttcctatt 1020 aagtggcaac atcgtttttt aagtttgatg taggcagttg atatatttac attttattaa 1080 taattatata tacatatatt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagtaaataa 1140 aattctgaaa tgagtattac ccagcaagtg atagacaacc tacacatcaa gcagtaaccc 1200 ttttgtatca cgccaattgt gactctcata tcaatacacc tgaatgtcag tcagtttata 1260 atgacagtaa gaaaacaatt tcctgctcga ctgcaacaaa caagaggagg acatttctgg 1320 ccatggagta gcttttcata cttatacttt gtgttaaata cattcgagtt ttaaaaactt 1380 ctatatcagt aaggggttta aactccactc catcatcatc tcaccagctc tccaagaggt 1440 ttcctggact tccagcatag aaacgcccca tacgctcagt tctccttcta attaccaccc 1500 tgcaggagga caaatgccca 
cttctacaac cctggcacat tgtgacatta agtattggaa 1560 catgcattaa caaaggcgct tccattcaca atcatccata cacttatgaa gactcctccc 1620 acgaatatca gccaatacaa ttaatcatgt ctgctgtaca tgtgccatgt tgctaagatt 1680 tcatttctaa ttctctcttg ttttccattt tagagttcaa gactgccagc tcctgaaact 1740 gccctcattc aacccatcac actcttcatc cattcctaat gcatttacac tcttcgcaaa 1800 gatactttct gcctatacag cccacaccct tgataccata ttcacatata cccccccccc 1860 cccccattta ctcttcatat cactattgac aatatttagc actcataacg ctccaatgct 1920 gactatcatt tgtacttccg ctatagcaga gtcgctccgc cgagcctcat tctccttctg 1980 caaccccccc cccctcttct tcccatttat agatgacaca accgccctgt cacacgattc 2040 tgatttaaga gatgcttata gcagttcttt gctccacagg tatcccccag atcaactccc 2100 agtcactctt acacagggtg tctccggagt atcagaaggt gttgggggct gataagtcaa 2160 tatagaggag tttgagggct taaagtatta gaaagtctta gattccttta caaggtaata 2220 catttattgt tgaacaaggt atcatcgtat gctaaagttt gactgtattc aagccgtgaa 2280 tatcaggata ctaggtagtt acgtaatcag taaaattcta ctaaggtttg acagcttgcc 2340 gcttgtttac tgcaacacta tcagtctgca gctctatagg tgacaacctg tcgaatcggc 2400 tagattttaa tatacatgtg tctgtattcg aatgtttagt tggattcatt tgttacattt 2460 taatatttag taaatattcc gtaagataaa tctcgccagc ttttttgatt gaaaatggtg 2520 ataaggtctt taaatgtgtg ggaaaagtat taaaggtgtt gaattacttc tcttattcct 2580 gtatactatg tttcagaata tgagcagcca tcacagaagt acaaagggca gcaacaaatc 2640 cagatcctag aaaggtggtc acctgatgcc ttcaagacat gcgtcctagc cactggcccc 2700 ccaaagaagc ccagatggct cttgtcagcc ataatcccac atccactcaa gggcgagggt 2760 gtgacccagc caccttacct tcttcttttt cttccttcta gcctgagtaa cactcagttt 2820 cctcccccag ccacataggt aattggagtt tcatccaagc cccccgtcac cccgccaccc 2880 tggccgtttc cgctggagtc ttcacagccc cctcccattt cccgactcct gccggacccc 2940 gcccccccta aagctctgac ttccgcagaa gtgttacccc gagctacgac ccccgcaggg 3000 gtcgtctcac cgtcccttcc aggccttagc aatctttatt tatatatata tatatatata 3060 tatatatcta tatacacctg tacatagata tatatttata atagcgctgt cactcccccg 3120 ctccatctcc taacggagtg ttcctcgagc acctaactta tcccgtcacc ctctaacagg 3180 agtcttcact gtccaaatcc cctttcccag 
actcctgcta gagtaggcca gcttgccctg 3240 ttccccgggc gccgaccccc gcaggggtca gtcactgccc cccccaggcc actgaaactc 3300 attatatatg tatatctatg gactttcgtt tatagatata tatttatata gagcgctgtc 3360 actccccgct ccatctccag tcggagtgtt ccacgagcat cgactccagc aagagtctgg 3420 ccaaacttgc cactcaccct ctagcagaaa tctccaccgc ccaaatcaca cttcacgatt 3480 tctgctagag atggcaaaat aaaaaattgc tgcacccaac tcccgcagcg cccattctga 3540 ctcacagaag tctcctgatc accccctcca ggccttagat tatcccattt tatatatata 3600 tatatatatt tatatactct ctcatatata catatatatt tatatatagc gctgccactt 3660 ccctgctcta tctctgtttg gagtgttcct cgagcatttt tgactcttaa tgagccaacc 3720 ccgcccaccc cttatggccc cccttcacta gtctccaccc aaccccctcc cccgctctgg 3780 cttccacagg agtcagtttc aaactttgct ccaactggag ccccctactc tttcttcatt 3840 ccttaattac tatatccagc agccggatat agtaaaaact ttctagcttt ttgggggaaa 3900 ttctttgaaa tactcggctg ctgtcccgag ctagaggcat tttttgggga gcgatcgaga 3960 cctacctgat ctcggttctc ctgatatgct tctagaccgg gcgggagccc tgggctcaaa 4020 tatctccgag ctcagggttc tctcccggga cagcatgcca aacctgctat aagtgccaag 4080 catatctaag tgggaactct tgaagtgaag tttcataaac taatttcgag aggagcacgt 4140 gatatgattg tgcaccgctg gccactcatc cgtaatcagt aataatccaa tcagaatgat 4200 cctagcttag tataaatgga tcactttctc cccattgcac tatcttcatt ttggaagaat 4260 ccccccttcc accccatctc ctcctttttc tccccttcta aagggggagc gatcgagacc 4320 tacctgatct cggttctcct gatatgcttc tagaccgggc gggagccctg ggctcaaata 4380 tctccgagct cagggttctc tcccgggaca gcatgccaaa cctgctataa gtgccaagca 4440 tatctaagtg ggaactcttg aa 4462 // ID DIRS-1-LTR_DR repbase; DNA; ZEB; 622 BP. XX AC . XX DT 01-APR-2002 (Rel. 7.03, Created) DT 24-OCT-2008 (Rel. 7.03, Last updated, Version 2) XX DE A solo-LTR derived from DIRS retrotransposon - consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW LTR; MER6; DANA; SINE_DR2; DIRS-1-LTR_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. 
XX RN [1] RP 1-622 RA Jurka J. and Drazkiewicz A.; RT "SINE_DR2: SINE-like retroelement from Zebrafish."; RL Repbase Reports 2(3), 11-11 (2002). XX RN [2] RP 1-622 RA Bao W. and Jurka J.; RT "Re-classified to DIRS, and renamed."; RL Direct Submission to Repbase Update (24-OCT-2008). XX DR [1] (Consensus) XX CC Contains ~200 bp segment similar to HE1_SINE, MER6 and DANA CC elements starting around position 201. The 200 bp segment CC contains a hairpin-like GC-rich structure. new comment: This is a CC solo-LTR from DIRS LTR retrotransposon, because it contains a CC similar split LTR similar with those of DIRS-4_DR. Other example CC of solo-LTR derived from DIRS is DIRS-4N2-LTR_DR. XX SQ Sequence 622 BP; 152 A; 167 C; 125 G; 172 T; 6 other; ttaagtgaag tttatttata aactaatttc gagaggatca cgtgcttatg attgatcacg 60 gctggtcccg cattagctaa ttcatgattc accaatcaga tgattcctaa gccactataa 120 ataaccngag tttcttatca cagttatctt cgttttgaag aatcccccct tccaccccta 180 ctcctcctcc tttcctgatg ggnggcacgg tggcccagtg gttagcactg ttgcctcaca 240 gcaagaatgt cactggttca agtccttacc aggccagtng acgtttctgt gcggagtttn 300 catgttctcc ccgtgctcgc gtgggtttcc cccgggttct ccggtttcct cccacngtcc 360 aaaaacatgc aacataagtt aattgactaa tccaaattag caccatagac aagctctaaa 420 gnagttatct cttgcaatca ctatctgttc attagctact aagcagggga gttctcgaga 480 tctacctgag ctcaaactcc cctctcgccc tgcaaacggg agggagcccc gggctcgagg 540 atctttgagc tcagggctct ctcccgggac agcatgccaa acaagcttat aaaatcatca 600 gctaagtgtg aactcttgaa at 622 // ID L1-2_DR repbase; DNA; ZEB; 5496 BP. XX AC AL645691; XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 04-AUG-2002 (Rel. 7.07, Last updated, Version 1) XX DE L1-2_DR is a non-LTR retrotransposon from the L1 clade. XX KW L1 clade; L1-2_DR; Non-LTR retrotransposon; endonuclease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5496 RA Kapitonov V.V. 
and Jurka J.; RT "L1-2_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 22-22 (2002). XX DR Genbank; AL645691; Positions 96682 91187. XX CC L1-2_DR is a family of L1-like non-LTR retrotransposon. This CC family was active recently (ORF1 and ORF2 have two stop codons CC only). CC The element is incomplete (its ~100-bp 5' terminus is deleted). CC It encodes two proteins: CC 469-aa L1-2_DR1p (positions 1-1407) and 1280-aa L1-2_DR2p CC (positions CC 1513-5353). These proteins are most close to corresponding CC proteins CC encoded by other L1-like elements. CC KIKRRFYPGTNIEDGTRFVKTRFPKELASLPYSTRIETAEGPQYFRVMHSH*VKTCRLCLSPDHVVKDCP CC DFRCYKCEERGHFAKFCTAVKCPDCNKVLNKCECWIGEEEEEVEQQVGRQMYEGDNIQSEDKTTTQEKST CC ESESEKLQENETNGIDRVTEQEGTTWTQMDMTDSLKSVLEAAELSDSNNKDLNQGQQEDTFWTQMDITDS CC FQKALDTEETKGQSNDEQADLEGHLQKDGNKETQGKSAKRRRSLKINLI. XX FH Key Location/Qualifiers FT CDS 1..630 FT /product="L1-2_DR1p" FT /translation="YIPGSVWLQLVREAHQIFFVYFCFCLKYIFVKLFFVC FT LVCPPSSVSCSGGVFFIYLFFFAMDGSTVTDLAMADGVEKANDNGLDRRQQ FT ETNQSEKEARKRIYLKEATVTVDIGQATEVRAIDVIKAETERIGDGKILAV FT RPKHNKEYEVTLEREEDADELMDELTIKGINCAVKRLQNRDYVVSFMHLPV FT YVADKDILDKLDHWGVCPIS" FT CDS 0..0 FT /product="L1-2_DR2p" FT /translation="MNVFMSFMNVFIFFMVLGIVSFNARGLLDIRKFEKVK FT EMCKREDVILLQETNWREECMKEIRKRWSGEMLYNNGDGRLGRGVAILLKE FT NSGVLCKTIYNDKEGKCMICEMEYVKKKVIMVNVHAPTEENKKKEYYNVLR FT DYLKKHERVIIMGDFNTVFSKLEMAEGMVFKTDKGRKELKILMEEMNLIDV FT WRERNEQTKEYSRRQIVGNFCCQTRIDFILCTRNVEGFINKIKYEETSLSD FT HKPLFMKLDWSNVKRGPGVWVLNTAVLKNEDYVLSVKEIIQKEKGNEIYNE FT DKRMWWENVKYLVKKFTIKYCRQLQNCKKYKEKELKEKLENELKNENGKNI FT QKIKELQGRLNEMEEEKFEGARLRSKAKFTVEGEKCTKFFFDLEKRRGKSE FT MIREIRSKNGNVVEQHEEILEEIRSYYEKLFCTEGIKEKEKGELLNLIKSR FT VEEGEKRECDEEIREEEIKRAISGLNKKKSPGIDGLGSEFYIVFKDILSSI FT LKEVYDEIFENGEINKRMGMGLMKVIYKGKGDKVDLKNYRPITMLNTDLKI FT LAKVLANRLKEVMPSIIKTNQAYSIKGRDIADTTMSIKDTIRYINDKQKDG FT FLISLDFEKAFDRVEHDFLFGVLKSFGFGENFIKWVQILYRGAVTRIKCNG FT FLTDCFKIRRSIRQGCPLSALLYALVAEPLGLAVKHEDRIKGIEVEGGVNK 
FT IFQYADDTTLILQDLASVKQAMETVQHFCKGSGAKINENKTGYLRFGRTEA FT LSGHFTFKEMDEIKILGIVIGRDEKKAEVTMWEEILGGIERRLRFWKLMSL FT TLKGRVLILSVLMVSKLWNILYVSSMPLWMEKRLKQCFLDFLWEGKPPRIA FT YATLIGEVGKGGLGLIDVEQRKNSLRVKMVRKYLDEDNKAAWKRTMEYFLS FT KSGNFNMGDNILYMRMKKFMTEGLPDFYKELIGAWGKFLTCVHFNIQGREN FT ILNQPLFLNSGILNQEKVVFFRKWWEVGITRVRDVLYEFKEGFLPVQYVID FT VMDEAKEDFNRQDLIKEYDIIKNAIPAEWLTRIENMEENKQSKDVIVRFGE FT KWWNLKDSTVKMIYGFFRDGVFKKPRANENWIRMFKDVNEDNIWANIKGKL FT VQSKVENLEYLIRNKAVFTDIILNKIGMEESVTCKVCQDADEGFLHLFLYC FT NELKDFNEKCKSIILTLKGERDDELEWEKVLMLGVNKECNNEKLINLLVML FT RKSAIWERRVAAKKEKAVLDVWNVFKRKVEKYVECLFYYFKLEDMQEAFYD FT VFTQEVSKILNDTGMKMPF" XX SQ Sequence 5496 BP; 2128 A; 527 C; 1367 G; 1474 T; 0 other; tacattccag gaagtgtttg gctgcagttg gtgagagagg ctcaccagat tttctttgtt 60 tatttttgtt tttgtcttaa atacattttt gttaaacttt tttttgtttg tttagtttgt 120 cctcccagca gcgtgagctg ttcgggagga gtgtttttta tttatttatt tttttttgca 180 atggacggat ctacggtgac agacctggca atggcggacg gagttgagaa ggcaaatgac 240 aatggactgg acagacgaca acaagaaaca aaccaatcgg agaaggaagc aaggaaaagg 300 atttatctaa aagaagcaac tgtaacagtg gacataggac aagcaacaga ggtgagagca 360 atagatgtaa ttaaagcaga gacggagagg attggggatg gaaagatttt ggccgtaaga 420 ccaaaacaca acaaggaata tgaagtaaca cttgaaagag aggaagatgc tgatgagtta 480 atggacgaat tgactattaa agggataaac tgtgcagtta agaggctaca aaaccgtgat 540 tatgttgtct ccttcatgca tctgcctgtc tatgttgctg ataaagatat tttagacaaa 600 ttggatcatt ggggagtttg tcccatttca aaaattaaaa gaaggtttta tccgggcaca 660 aatattgaag atgggacgag gtttgtgaaa accagattcc ccaaagaact ggcgtccctc 720 ccgtacagca caagaataga gacagcagag ggtccacaat actttagggt gatgcacagt 780 cattaggtga aaacatgcag gctgtgcttg agcccagatc atgtggtaaa agactgtcct 840 gattttaggt gctataagtg cgaggaaagg gggcactttg caaagttttg cactgctgta 900 aagtgcccgg attgtaataa ggttttgaat aaatgtgaat gttggattgg ggaagaggag 960 gaggaggtag agcagcaggt gggcaggcag atgtatgaag gagacaatat ccagtcggag 1020 gacaaaacaa caacacaaga aaaaagtaca gaatctgaaa gtgaaaaact acaagagaat 1080 gagactaatg gaatagacag agtcacggaa caggaaggga 
caacatggac acaaatggat 1140 atgactgaca gtttaaagag tgttttggaa gcagcagaat tgagcgattc gaataataaa 1200 gacttgaatc aaggacaaca ggaagacaca ttttggacac aaatggacat cacagacagt 1260 tttcaaaagg cattggacac agaggagaca aaaggccaaa gtaatgacga gcaagccgat 1320 ttagagggac atttacaaaa ggatggaaac aaagagacac aggggaaatc agcaaaaaga 1380 agaagatcgt taaagataaa cctaatttag agactgtaag aaaaaaactg ctaaaagatg 1440 aagaaattga atgcgcaaat aagtatgagt tgctaaaggg cttggaagac atggactgag 1500 atgatgtttt ttatgaatgt ttttatgtct tttatgaatg tttttatatt ttttatggtt 1560 ttaggaattg tgtcttttaa tgcaagaggg cttttagaca tcaggaaatt tgaaaaagtg 1620 aaagaaatgt gtaaacgaga agatgtgatt ttacttcaag agacaaactg gagggaagaa 1680 tgcatgaagg aaataagaaa aaggtggagt ggggaaatgt tatacaataa tggggatggg 1740 aggctaggga gaggagttgc aattttatta aaagaaaaca gtggggtttt atgtaaaaca 1800 atctataatg acaaagaggg aaagtgtatg atatgtgaaa tggagtatgt aaagaaaaaa 1860 agtaattatg gtgaatgttc acgccccaac agaggagaac aaaaagaaag agtattataa 1920 tgtacttaga gattatttaa agaaacacga aagagttatt atcatgggtg attttaacac 1980 tgtttttagt aaattagaaa tggctgaggg aatggttttt aaaacggata aggggagaaa 2040 agaactaaaa atattgatgg aggaaatgaa tttaattgat gtgtggagag aaaggaatga 2100 acagacaaaa gagtactcaa gaagacagat agtggggaat ttttgttgtc aaacaagaat 2160 tgattttatt ttatgcacaa gaaatgttga agggtttata aacaagatta aatatgaaga 2220 aacaagtctg agtgaccata agccactttt tatgaagcta gactggagta atgtgaaaag 2280 agggccaggg gtatgggttt taaacacagc ggttttaaag aatgaagact atgttttaag 2340 tgtaaaggaa attattcaaa aggaaaaagg gaatgaaatc tataatgagg acaaaagaat 2400 gtggtgggag aatgtgaagt atttagttaa aaagtttacg ataaaatatt gtagacaatt 2460 acaaaattgt aaaaaatata aggaaaagga gctgaaagaa aaactagaaa acgaattgaa 2520 aaatgagaat ggaaaaaata tacaaaagat taaagaactg caaggaagat taaatgaaat 2580 ggaggaggag aaatttgaag gtgcaagatt aagaagtaaa gcaaaattta cagtagaggg 2640 ggaaaagtgc actaaatttt tctttgatct agagaagaga agagggaagt cagaaatgat 2700 tagagaaata aggagcaaaa atgggaacgt agtagaacaa catgaggaga ttttggaaga 2760 aataagatca tattatgaga aattgttttg cacagaggga ataaaagaaa 
aagaaaaagg 2820 ggaattacta aatctaataa aatcaagagt agaagaaggg gaaaaaagag aatgtgacga 2880 ggagataaga gaagaagaaa taaaaagagc aattagtgga ttaaacaaaa agaaaagtcc 2940 aggaatagat gggttgggaa gtgaatttta tattgttttt aaagatattt tatctagtat 3000 tttaaaggaa gtatatgatg agatttttga gaatggtgag ataaataaaa gaatggggat 3060 gggcttaatg aaggtgatat acaaaggaaa gggggataaa gtagatttaa aaaactatag 3120 acctataaca atgcttaata ctgatttgaa gattttagcc aaagttttgg ctaatagact 3180 aaaagaagtg atgccaagca taataaaaac aaaccaagca tatagtataa aaggacgaga 3240 cattgcggat acaactatga gtattaaaga cacaattaga tatataaatg ataagcagaa 3300 agatggtttt ttaattagtc tggacttcga gaaagctttt gatagggtgg agcatgactt 3360 tttatttgga gtgttaaaga gttttggttt tggggaaaat tttataaagt gggttcagat 3420 tttatataga ggagcggtaa caaggataaa atgcaatggg tttttaacag actgttttaa 3480 gataagaagg tcaatcagac agggttgtcc gttatctgca cttttatatg ctttagttgc 3540 agaaccactg ggattagctg tgaagcacga ggacagaata aaaggaatag aggtagaggg 3600 gggagtgaat aaaatatttc aatatgctga cgataccaca ttaatactac aagatctggc 3660 aagtgtaaag caagcaatgg aaacagtaca gcatttttgc aaggggtcag gggctaaaat 3720 aaatgaaaat aaaacagggt atttgagatt tgggagaact gaggctttat ctggacattt 3780 tacttttaag gaaatggatg aaataaaaat tttaggcatt gtaattggga gggatgaaaa 3840 gaaagcagaa gtaaccatgt gggaggaaat tttaggaggg attgaacgga ggctgaggtt 3900 ttggaaatta atgtctttaa ctttgaaggg gagggtttta attttgagtg ttttaatggt 3960 ttctaaatta tggaatattt tatatgtgtc atcaatgcca ctgtggatgg aaaaaaggct 4020 gaaacaatgt tttttagatt ttttatggga agggaaacct ccaagaatag catatgcaac 4080 gttaattgga gaagtaggca aagggggtct aggtttaata gacgtggagc aaagaaaaaa 4140 cagtttaaga gtaaaaatgg taaggaaata tttggatgaa gacaataagg cagcatggaa 4200 aagaacaatg gaatattttt taagtaaaag tggcaatttt aatatgggag ataatatttt 4260 atacatgagg atgaaaaaat tcatgacaga gggtctacca gatttttata aagaattgat 4320 tggagcatgg ggaaaatttt taacttgtgt acattttaac atacaaggac gcgagaacat 4380 tttaaatcag cctttattct taaacagtgg cattcttaat caagagaaag tggtgttttt 4440 taggaaatgg tgggaggtgg gaataacaag agtaagggat gttttatatg aatttaagga 
4500 aggattttta ccagtacagt atgttattga cgtaatggat gaagcgaagg aggattttaa 4560 cagacaagac ttaataaagg aatacgacat aatcaaaaat gccatacctg cagaatggtt 4620 aacaagaata gaaaatatgg aagaaaataa acaaagtaaa gatgtgattg taagatttgg 4680 agagaaatgg tggaacttga aagatagtac tgtgaaaatg atttatgggt tttttagaga 4740 tggggttttt aagaaaccgc gtgcaaatga gaactggata cggatgttta aagatgtaaa 4800 tgaagacaac atatgggcta atataaaggg caaattagta cagtcaaaag tggagaattt 4860 ggaatatttg atcagaaata aagcagtttt tacagatata attttaaaca aaatagggat 4920 ggaggaaagt gtcacatgta aagtatgtca agatgcagat gaaggattct tacacctgtt 4980 tttatattgt aatgagttga aagattttaa tgagaaatgc aaaagcatta ttttaacttt 5040 gaaaggagaa agagatgacg aacttgagtg ggaaaaggtg ttgatgttgg gagtgaacaa 5100 agaatgtaat aatgaaaagc tcataaattt actggtgatg ttaaggaaaa gtgcaatatg 5160 ggagagaaga gttgctgcaa aaaaggaaaa agctgtatta gatgtgtgga atgtatttaa 5220 gaggaaggtg gagaaatatg ttgaatgtct gttttattat tttaagttgg aggacatgca 5280 ggaggctttt tatgatgttt ttactcaaga agtttcaaag attttaaatg acacaggaat 5340 gaaaatgcct ttttaaaaat gtgattatac cctttaagga gttctacttg caacatttta 5400 tatttaaaat gttttattgt gaagatatga tgtaataagg acctttttaa tgttttgtct 5460 gaagtgaaat atgtataaat aagtgaattg taaaaa 5496 // ID DIRSDR1 repbase; DNA; ZEB; 4714 BP. XX AC AL590134; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE DIRS-1 Danio rerio 1. XX KW DIRS; LTR Retrotransposon; Transposable Element; DIRS-1; DIRSDR1; KW non-LTR; retrotransposon. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4714 RA Jekosch K.; RT "DIRSDR1: putative non-LTR retrotransposon."; RL Repbase Reports 2(2), 9-9 (2002). XX DR [1] (Consensus) XX CC Putative novel non-LTR retrotransposon similar to Distyolstelium CC DIRS-1 CC with one ORF (pos. 
1-1252, 1446-3125 and 3356-4714).Exon 1 CC is similar to GAG proteins, exon 2 to reverse transcriptases and CC exon 3 CC to Lambda recombinases. XX SQ Sequence 4714 BP; 835 A; 1501 C; 1346 G; 1032 T; 0 other; atggcgctcc gactgtgcgt ttctggatgc gggggtttcc tgtctccgga tgatggacac 60 gatcactgca ttgcatgttt gggggtccag catgttaatg cggtgctcgc gggcggttca 120 tgtcgtcatt gcgatgccat gaccgttgca cagctaagat cgcggctaac tttcgcaaga 180 gagcgagcca ccccagttgc ctcctgttct aaaaaagcag cgggcgctcg ggcagatctg 240 agggtttcag cgggagctaa tccgccgccc acgggctcgc ggacctctcg ctcctcacgg 300 cgctccatcc aagcttcggg tggtgagagt gatccgtcta accagatggt agctctcaca 360 ctcgctgaca ccggagatca gatgtcctcc gcggcatcgg agggtgggct ttcactgtcc 420 gacgaagatc cggacccgct cgccccctcc gggcaggtga gcgctgtcaa atcggatcct 480 gaagcggaca tgttagccgt gctttcccgg gctgcttcgg ccgtggggtt ggagatggtt 540 tatcccccag ctccgcggcc ggaccgacta gatgggtgct acgtagagga caagaaggcg 600 aagccttcga agcctctcgt ccccttcttc ccggaagtgc acagtaggct cacgcagtcc 660 tggagggcac ctttctctgc ccgtgctgcg agtgcctccg ccctcaccgc ccttgacggc 720 ggagctgcca gggggtatga ggcgatcccg tcagtggagc gcgctatcgc ggtcaatctt 780 tgtccgcgcg gcgcctctac gtggcggggt ttgccccgcc tcccgtccaa agcctgtagg 840 ttgtctgcct ccctcggagc cagagcttat aaggctgcgg gccaggctgc ttctgctttg 900 cacgcgatgg ccacctacca gcgctaccaa gcgcaggcgc tggccgagct gcacgagggc 960 gggtccaacc caagcttatt acatgagctg cgcaccgcga ccgactatgc tcttcggact 1020 actaagtccg ccgcgtgtgc gctggggagg acgatgtcca cacttgtggt tcaggaacgc 1080 cacctctggc taaacctggc cgatatgcgc gacgttgaca aagttcgctt tcttgactcg 1140 cccatatccc aggctggcct gttcggcgac accgtcggtg aattcaccca ggaattcaag 1200 gcggtgaaag agcagtcgga tgcgatgggc aatgtcatct atcggcgtgg ccgtaagccc 1260 gctccgcccg ccgagccatc cacctccgct gttcctcgcc gagggcgccc gccaacgagt 1320 gctgccccgc ccccgcctgc gcctccggcc aagcgggcgc ggcgttcacc tcgaaagcag 1380 gcagcccctc ctgcccaggg cgccgttaag tccggtaaac ggaccgcgaa gcgtccctga 1440 gacaggccat ccggagaaga ggaaacttgc tctttccccg ctggagggcg gggccccgat 1500 aacaacggta cttttcagtg ccaccaaaac 
atcagtaaaa gagcactttt tcccttcccc 1560 ggatgtgact gcacgagttc tgccagtccg ggacgcgctg ccttccggct cgcagactct 1620 acgtgcttcg ccagtggctc acgagcgctg gggggacggt ctcccttccc tcagccctcc 1680 agccccctct ccggagtcag ggtgcggagc cagagcgaat cactctcctc cagcttttcc 1740 gcgggaccct cgtgcttccc ggatcagcac acccactccg cgctgcccca ccgctggtac 1800 gtcagcgatt gtagcgatga ctccattagc gagggctctg cctgcctggt tagcgcgggc 1860 cagcccctcg cggtggctca tacgcacaat cagactcggt tacgcgattc agttcgcgaa 1920 acggcccccc aagtttacgg gcgtgtattt ctccagggtc aaccccctgt ccgcccctgt 1980 cttgcgagag gagattgctg ccctcctggc gaagggtgca atcgagccgg ttcctccagc 2040 cgagatggag agtgggtttt acagcccata cttcatcgta cccaaaaaga gcggtgggtc 2100 acggccaatc ctagatctgc gcgttttgaa ccgctgtctg cacaagctgc cgttcagaat 2160 gctcacgcag aggcgcattc tccaatgcgt tcgtcctcgg gattggtttg cagccataga 2220 cctgaaggac gcgtatttcc atgtctccat tcttccacgc caccgccaat ttctgcggtt 2280 tgcgttcgag ggtcgagcgt ggcagtacaa ggtcctcccc ttcgggctct ctctgtctcc 2340 gcgggtcttc accaaactcg cggagggtgc cctagcgccc cttcggctcg cgggcattcg 2400 catactcagt tatctcgacg actggctgat tttagcccac tcgcgggagc aattgattat 2460 gcacagggac gaggtgcttc ggcatctccg cctactgggg cttcaggtca accgagaaaa 2520 gagcaaactc gcccccgtgc agaggatttc ttttctcggg atgaagctgg actcgatcac 2580 catggtagcg cacctctccg aggaacgcgc tcgcctgttg ctgaactgtc tgagggagct 2640 cgacagcaaa ctagtggtcc cactgaagtt ctttcagagg ctcctggggc atatggcatc 2700 cgcagccgcc gtcacgccgc tcgggttgct ccatatgaga ccacttcagc actggcttca 2760 cgatcgggtc cccagacgcg catggcacgc gggcacacac cgggtctcgg ttactgcgct 2820 gtgtcgccgc gccctcagcc cttggaacga cccctcgttc ctacaggccg gtgtgcctct 2880 aggacaggcg tccagccatg ttgttgtttc aacagacgct tccaacacag gttggggggc 2940 cgtgtgtcgc gggcatgcgg ctgcgggcct ctggaagggt gcccagctgc attggcatat 3000 caatcgccta gagctgttgg cagtgttcct cgctctccac cgctttttac cggtgctgga 3060 gcggcaacac gtgctggtca ggacggacag tacggcagcg gcggcgtata tcaaccgcat 3120 ggggggtatg cgctctcgcc gcatgtctca gctcgcccgc cgtctgctcc tctgtagtca 3180 cccgcggctg aaatcgctgc gcgccattca cgtcccaggc 
acgctcaatc gtgcagccga 3240 tgcgctctca cgacagctgt tatgccctgg agaatggaga ctccaccccg agtctgttca 3300 gctgatatgg gcgcgattcg gggaggccca gatcgatctg tttgcttccc ccgagaacgc 3360 tcactgccag ttgttttttt ccctgaccga gggctctctc ggcacggatg cactggccca 3420 cagctggcct cggggcatgc gcaagtatgc gtttccccca gtgagcctgc tcgcgcagtt 3480 tctgtgcaag ggaggacgag gaacaggttc tgctagttgc gcccctttgg cccaaccgga 3540 cctggatatc agagctctca ctcctcgcga cggccctccc ctggcggatc cctttgagag 3600 aggacctact ctctcaggga cagggcacca tctggcaccc tcgccccgat ctttggaacc 3660 tccacgtgtg gtccctagac gcgaggaaga cttaggtaac ctaccgactg cggtggttaa 3720 taccatcact caggctagag ccccctccac gaggcgcgcc tacgccctga agtggagtct 3780 attcactgaa tggtgcgtct ctcgcagaga agacccccga aattgccaga ttagtgttgt 3840 gctctctttc cttcaagaga agttggacag caggctgtcg ccctccactc tcaaggttta 3900 cgtggccgcc atctccgctt atcatagcgc ggtagctggc ggcaccgtgg gaaagcataa 3960 cctggtcatc cagttcctta ggggtgctag gcgaattaat ccatctcgcc cccctctcat 4020 gccctcttgg gatctcgccc tcgttctcac gagtctgcga tccgatccct ttgagccact 4080 cgaatcagta tctctaagat ttctgtccct gaagacagct ctgctggttg cgttggcctc 4140 catcaagagg gtcggggacc tggaggcatt ttcagtcagt gactcgtgcc tggaattcgg 4200 gccggattac tctcacgtta tcctgagacc ccgccccggt tatgtgccca aggttcctac 4260 cacccccttt agagatcagg tagtgaacct gcaagcgctg cccccggagg aggcagaccc 4320 agccctttct ttactttgtc cagttcgcgc tctgcgcatt tatgtggacc gtactcagaa 4380 ttttagatca tctgagcagc tctttgtctg ttatggcggt cggcagcagg gaagtgccgt 4440 atcgaaacaa agattatccc actggattgt ggatgccatt tcactcgctt attcgagtcg 4500 aggtcagccg tgtcccccgg gagtacgtgc acactccact cggagcgttg catcctcttg 4560 ggcgcgtgca cgcggcgcct ctctaacaga catctgtaga gctgcgggct gggcgacacc 4620 caacacattt gcaaggtttt acaatctgcg agtggagccg gtttcctcaa gggtattagg 4680 taaccctttg gtgattgagg agacaactcg gtag 4714 // ID Gypsy-22-I_DR repbase; DNA; ZEB; 5713 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 
10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-22_DR LTR retrotransposon - a DE consensus sequence. XX KW GYPSY superfamily; Gypsy-22-I_DR; Gypsy-22-LTR_DR; Gypsy-22_DR; KW LTR retrotransposon; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5713 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-22_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 17-17 (2005). XX DR [1] (Consensus) XX CC Gypsy-22-I_DR is an internal portion of the Gypsy-22_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-22-LTR_DR. Gypsy-22_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1686-aa Gypsy-22_DR1p (pos. 605-5662) CC composed of the gag, protease, reverse transcriptase, and CC integrase domains. The consensus sequence was built from five CC copies less than 2% diverged from the consensus sequence. 
XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy-22_DR1p" FT /translation="MAQFFSHSTSTYVDIDAPDAPTISTPVWSPPVQTQTL FT SGLPPHDQIMHNISPVHSFPSHATPVQTLPTALGGHILMQPLSLSQEHVSM FT QLCTSADVSSPSTMEQHDLPGNLPTPRREIQQVSSYVQGNLDNLMVTMKKQ FT EKCLHELTQKLKTSSSQHVNQITTLTAKIESNKQEIVTILTGAKQQEAADA FT DQLVKAVQLMLATEFQKFESTLTSAVVDKVEKLRRDVHHDLKSIQQTLQGS FT LDQLTTNLQQCEEKISKCQTCVTQLKKDLQVHNVKDVEPQTETKAAPVTST FT LSTETVSTLPNTMVKSDHLKLTFPTFGRHTDDTDPLLYLTKCQDFLALHPL FT TDADLLATFRTVLYGTARDWWEVSRSNIATWKEFESAFLSAFLSEDYEDEL FT AERVRTRVQGDRESIRDFAFTYRALCKRWKSTLTETEIVKMILKNIKPYLA FT SQLRSRVNTVEDLVKLGYLLERDYEEQRRYESRMAHKQASSQKSFSNRPVE FT KQPIQCWRCSGPHPPGNCPMYLTPPSQQSSTQHHPNHGKSFHAAKSGGRPT FT NIIVAASETPQSTKEVPNVFLPSTTMSSLAIPQQLVVPISIGSWFGKAILD FT TGASYTLIHESLMQHFDTSAQLQNWSSGPLYLANGKAEIPLGWLNITIQIH FT GKSFVVPAVVLPSQALAYAIILGLDFIFFSGLKIHVSERKYSFTSDPTEEH FT PFQPGYASEPLVKMTPMTEKKTLRKNKLNLTLLSAVPPPQTSLGMLQTDHV FT DDATQIWNAVSEAQLPKEEKQQLLQILQNNPRVCTQRTGKTKLLQHRIYTT FT SQVPIKQKPYRLSPVKQQVMEEQLEQMLREGIVEPSHSSWASPVVLVPKKN FT GKLRFCVDYRKVNAITENDAYPLPNITEILESLSGSTIFSTIDLNSGYWQV FT MMDPDSKAKTAFIVSDGLYQFNVMPFGLKNAPATFQRLMETVLGELRRKIC FT LVYIDDIIVYSPSVTQHFCDLQTILHRLEAAGLTINLEKCKFFLPEITFLG FT HVVNAKGITADPSKVEAILSFPTPNNLKEVQRFLGLAGWYHRFVQNFSKIA FT EPLNALKKKGQVFKWTAQCQQSFDQLRSCLTSPPILGHPDLKIPFIVYTDA FT SDTGLGAILTQRKDPGSEEVIAYASRTLTGAEVNYTATEKECLAVVWALEK FT WQHYLEYKLFTVVTDHSALQWVMGSTKTNSRLIRWVLRLQKFNFIIEYRKG FT KLNVAPDALSRSPLTTISPVTAVYTKQQTDQHTELPVSDVVLWEEQHSDEE FT TTKLLQAVAEEPNQLEQYEVIEDKLYHKTYLKNDQVHYRVYVPNRLRPTLL FT HHYHSHPLSGHHGIYKTYKRIQAVAFWPGLWTDVKRHVKECVKCQTIKYDN FT QKPAGKLQSTITSRPNQMLGVDIMGPLPRSTQQNEYLLVFVDYYSKWVEFF FT PMRQANAQSVAVIFRREILTRWGVPDFILSDRGTQFISSVFKNVCEKWGVT FT QKLTTAYHPQTNMTERVNRTVKSMIASYVDDNHSKWDQFLPEMRFAMNTAI FT QETTGVTPAELQIGRKLHGPMDKILHGQNLIPDNTSYDVVCHIQQLKSQVQ FT ENCRRAQQRQLRNYNKKRREAGFKNKDRVWLRNFPQSSAQHKFSAKLAPKW FT KGPYRVLKQLGPLNYRIALEETGEDVRTVHVCNLKECFPTAEELEVQEKKR FT LRELFEETSEEEEFFGF" XX SQ Sequence 5713 BP; 1818 A; 1219 C; 1139 G; 1537 T; 0 other; gaaatggcac ccgaacaggg 
acatgaacac tttaagggac atgaacactt taagggacat 60 tgaacacttt aagggacttg aataattgaa actttcattg attgacttga ataactgaaa 120 atttctttga ctgactgact gacctgaatt actggacagg gttttgtttg tgaatgttta 180 atagtttaaa attgtttttt tcttgctgaa tgtgagaact attgacaaaa atcatcaaat 240 aacagaaatt gtgtaaaaaa aaaaaaaaaa aaaaacttgt catgtatgtg gtttttggta 300 taatgatttg agtttattgt gaaaatttgc ttgagcaacc agtgtaagct gaatttattt 360 tttgtgtcaa agtgtgtggg aacttgaact agaaaatgtg atcgtcatct gtcttgtatt 420 tttgtacatg catttgaaag gtttctctta cattgataat atggacatga tgtaattgat 480 tgcagagaac agtttctttg tgagtatttt ttgttttatt ggggtttttt tttatttatt 540 tatttttttt cctgctgtcc caaacacaca caaatcctaa tcacacccaa caccaacaca 600 agtcatggcc cagttctttt cccattcaac gtcaacatat gttgatattg atgcacctga 660 tgccccaaca atttccacgc cagtgtggtc acctccagtg caaactcaaa cactttctgg 720 acttccacct catgaccaaa ttatgcataa tatttcaccg gtccatagtt tcccaagcca 780 tgcaactcca gtacagacac ttcctactgc tctgggaggc catattctaa tgcaacctct 840 ttctttgtcc caggaacatg ttagcatgca actgtgcacg tctgcagatg tttcatcacc 900 ttctacaatg gaacagcatg atctgcctgg gaaccttcca acccctagga gagaaattca 960 gcaggtcagt tcttatgtgc aaggtaattt ggacaatctg atggtaacaa tgaaaaaaca 1020 agagaaatgt ttgcatgaac ttactcaaaa gctgaaaact tcatcatccc agcatgtgaa 1080 tcaaatcacc acccttacag ccaaaataga gtccaataag caagagattg tcaccatact 1140 cactggagct aagcaacaag aggctgcaga tgctgaccag ttggtcaaag ctgtacaatt 1200 gatgcttgca actgagtttc agaaatttga atcaaccctt acctcagcag ttgttgacaa 1260 agtggaaaaa ctccggagag atgtccacca tgatctcaaa tcaatccagc aaaccctcca 1320 gggaagtttg gatcagctta ctacaaatct tcagcaatgt gaagaaaaaa ttagtaaatg 1380 tcagacctgt gttacacagt tgaaaaagga tttacaagtg cacaatgtga aagatgttga 1440 acctcaaaca gaaacaaaag ctgctccagt tacaagtaca ctctccactg agactgtttc 1500 aactttacca aataccatgg tcaaaagtga tcaccttaaa ttaacttttc ccacctttgg 1560 acgacataca gatgatactg accctttatt gtacctaaca aaatgtcaag atttcctggc 1620 acttcatcct ttgacagatg cagatctttt ggctaccttt cgcactgtct tatatggcac 1680 agctcgggac tggtgggaag tgagtcgctc caatattgcg 
acttggaagg aatttgagtc 1740 tgcatttctt tctgcattcc tgtctgaaga ctacgaagat gagcttgctg agcgtgttcg 1800 tactagagtt caaggagaca gagagtcaat tcgagatttt gcatttactt atcgagcact 1860 ttgtaaaaga tggaaatcca ctctaacaga gactgaaatt gtaaaaatga tcctaaaaaa 1920 tatcaaacca taccttgcca gccaactgcg cagtagagtg aacaccgtgg aggatctagt 1980 taaattggga tatctattgg aacgagacta tgaagaacaa agacggtatg aaagtcgaat 2040 ggctcacaaa caagcaagtt cacaaaaatc tttctccaat cgacctgttg agaaacagcc 2100 tattcagtgt tggaggtgca gtggtccaca tccaccggga aattgcccaa tgtatttaac 2160 tccaccttcc cagcaatctt ccacgcaaca tcacccaaac catggaaaga gttttcatgc 2220 tgcaaagtca ggaggtcgac ctacgaacat cattgtagca gcctcagaaa caccccaatc 2280 aacaaaagaa gttccaaatg ttttccttcc atctacaact atgtcatctc tggccattcc 2340 acaacaatta gttgtcccaa ttagtattgg atcatggttt ggaaaagcca tactggacac 2400 tggagcaagc tacacgctaa tacatgaaag tctaatgcag cattttgata cctctgccca 2460 gctacaaaac tggtcgagtg gacctcttta cttggctaat ggaaaagcgg agataccctt 2520 aggatggtta aacatcacta ttcaaataca tggtaaatcc tttgtagtac ctgctgttgt 2580 cctcccatct caagctcttg catatgccat catcttgggt ttggacttca tattctttag 2640 tggtctgaaa attcatgtta gtgaacgcaa gtattctttt acgtctgatc ctactgaaga 2700 acacccattt caacctggat atgcaagtga acctctagtt aaaatgacac ccatgacaga 2760 aaaaaagacg ctcagaaaga acaaactcaa tctcaccttg ttaagtgctg tccctccacc 2820 tcaaacctca ttgggtatgc tacaaactga tcatgtcgat gatgcaacac agatctggaa 2880 tgctgtaagt gaagcacagc ttcccaaaga agaaaagcaa cagttactac agatcctgca 2940 gaataacccc agggtatgta ctcaaagaac tgggaaaacc aagttacttc aacaccgtat 3000 ctacaccacc agtcaggtac ctatcaaaca aaagccatat cgtttgtctc ctgtaaaaca 3060 acaggtgatg gaggagcaat tggaacaaat gttgagagaa ggtattgtag aaccatcaca 3120 ttcttcttgg gcttcaccgg tggtgttggt tcctaagaaa aatggcaagt taagattctg 3180 tgtggactac cgcaaagtaa atgcaataac ggaaaatgat gcttaccctc ttccaaacat 3240 cacagagata ctagaatctc tctctggatc aacaattttt tctaccatcg atttaaacag 3300 tgggtattgg caagtgatga tggatcctga cagcaaagca aagactgctt ttattgtgtc 3360 tgatgggcta tatcaattta atgttatgcc ttttggatta aaaaatgcac 
ctgctacatt 3420 ccagaggtta atggagaccg tactagggga actaagacga aagatatgcc ttgtctacat 3480 tgatgacata attgtgtatt ctccctcagt gacccaacac ttctgtgatc tgcaaaccat 3540 cctccacagg ctagaagctg ctggactgac catcaacctg gaaaaatgca agtttttcct 3600 accagagatc acgtttctag ggcatgtggt aaatgctaaa ggtatcacgg cagatccgag 3660 caaagttgag gccattctct cttttcctac acccaacaat ctgaaggaag ttcagcgatt 3720 cctgggacta gccggctggt accaccggtt tgtgcaaaac ttttcaaaaa ttgctgagcc 3780 cctcaatgcc ctgaagaaaa aaggacaggt gtttaaatgg acagcacagt gtcaacaatc 3840 atttgaccag ttaagatcat gccttacctc acctcccatt cttggccacc ctgacctcaa 3900 aatacctttc attgtataca ctgatgccag cgatacagga ctgggtgcta ttctcactca 3960 acgaaaggat ccaggtagtg aagaggtcat tgcctatgcc agtcgcacct taactggggc 4020 tgaggtaaat tacactgcaa cagaaaaaga atgtttagct gttgtttggg cattggagaa 4080 gtggcaacat tacctagagt acaagctctt cacagtagta acggaccatt ctgctcttca 4140 gtgggtgatg ggatccacaa aaactaacag ccgccttatt cgatgggtct tgagactgca 4200 gaaattcaac ttcatcattg aataccgaaa agggaagtta aatgtagctc ctgatgcttt 4260 gtccagatca cctctcacta caatttctcc tgttacagca gtttacacga agcagcaaac 4320 agaccagcac acagagcttc cagtttctga tgttgtccta tgggaggaac aacattcaga 4380 tgaagaaaca acaaaactac tacaggctgt agcagaagaa ccaaatcaat tggaacaata 4440 tgaagtgata gaggacaagc tgtaccataa aacttacctg aagaatgacc aagtacatta 4500 tcgtgtgtat gttccaaacc gtcttcggcc gacactactt catcactatc attcacaccc 4560 gctaagtggc catcatggaa tatacaaaac ttataaacga atccaagcag ttgctttttg 4620 gccaggctta tggactgatg tgaagagaca cgttaaagaa tgtgtgaagt gtcaaacgat 4680 caagtatgat aatcagaaac cagcaggaaa acttcaatca accatcacct ctcgacctaa 4740 tcaaatgctt ggagttgata ttatgggacc cctacctcgg agtacacagc aaaacgagta 4800 tttgttggtg tttgttgact attactccaa atgggttgag ttctttccca tgcgtcaggc 4860 aaatgctcag agtgttgctg tgatttttag aagagaaatc ttaacccgtt ggggggttcc 4920 agacttcatt ctctctgatc gaggaacaca gttcatctca tctgtgttta agaatgtatg 4980 tgaaaagtgg ggagttacac agaagctaac aacggcttat caccctcaga ccaatatgac 5040 agaacgggtg aatagaactg tcaaaagtat gattgcgtca tatgtggatg acaaccacag 
5100 caaatgggat cagttcttac ctgagatgag attcgcaatg aatactgcca tccaggaaac 5160 cacgggagta actccagcag aactgcagat tggaaggaaa ctgcatggac caatggataa 5220 aatacttcat ggtcagaatc ttatccccga taatacctct tatgatgtag tctgtcacat 5280 acaacaactg aagtcacaag ttcaagagaa ttgccgaagg gcacaacaac gccaactccg 5340 aaactacaac aaaaagagaa gagaggccgg tttcaaaaat aaagacagag tatggttgcg 5400 caatttccct cagtccagtg cacagcacaa gttcagtgcc aagctggcac caaaatggaa 5460 aggaccttac cgagttttaa aacaactggg acctttaaat tatcgtatag ccttggagga 5520 aactggagaa gatgtgcgca cagtacatgt gtgtaatctt aaagaatgtt tcccaacagc 5580 tgaggagttg gaagtccaag aaaaaaaacg acttcgggaa ttgtttgaag aaacctctga 5640 agaggaggaa ttttttggat tttgattgta tttcttttaa caaccatggg ttgttttgtc 5700 aaggggggga gaa 5713 // ID DIRS-4_DR repbase; DNA; ZEB; 6796 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6796 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(10), 1271-1271 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX FH Key Location/Qualifiers FT CDS 523..3816 FT /product="DIRS-4_DR_1p" FT /translation="MEGTNTPALPETPAQTQQQTQINTEAPIRGRRPIRST FT VTRTHRRTQSPSPNRNLPSPASSYASARSSSITSNKMTVSELRQSLTNAGI FT SIPTRCNKSELLKLYEAIPSPTPPPQDSRPTRSRHTPYPQPSATQHSRNPP FT GPPKKATKKTNKKQPQATGQXAPSTNQHTVNPPDNYATPGLPTPLLWPPAP FT QSSENSSPTLQAIPPTLNPPQFSLSSNLPHSSTQLIPNLSSNLPHSSTQLI FT PTQSFPTSSNALPLPANFSSTNPPFFPSTSLQAPTSITNPPQQNAFCTNTS FT SARAPFTLATATPLPIPHNAPVLEPPQISNTVRNLILSGADIDLSTLLSPI FT APPSADRQVDCGEFTITLKSPVSSQPRTLTIAEFHVAFSRYTDTICSVFPH FT RRRELNDYMAIISELALSYGGTHFYTYHKLFSAKCAIRVAQWNQCSYWGAL FT DTDLHNRVFLGCRNLSCAVCRSNLHPTTSCPFIIPSTEKELQTPRSTSYVP FT RPSTSAIPALLPPPSSQNPPSSLACNNFNAARCFRHPCKYLHICSYCGGAH FT ARVVCQVWKANKKHRSYLSTPVNISNLYHELCMHPDPNFSEFLISGLSNGF FT HPGVSTLPSYNLACPNLQSANAEPEVVEQLIKKEIDNKFMIGPFLAPPFST FT YRVSPIGVATRKFSGKKRLIIDLSSPHNSAYSSVNSIISPDEFSLNYHDID FT QAISLIKLVGRDAWLAKVDITSAFKIMPLHPEFWHLFGINWKSQFYFAVRL FT TFGCRSSPKIFDMLSEALCWILANNYGIPHVVHLLDDFLIISPPNSPPAKH FT LEITKAVFAKLGIPLAEEKTAGPSTFIEFLGINLDSNKFQASLPKEKVDRI FT ISLSSIFLEKQECSKRELLSILGHLNFAMRIIPQGRPFVTHLLQLAASVQS FT LEENISLSDPCRNELSLWISFLKCWNGCSFFYSDLISSPVDIHLYTDAAPS FT IGFGGYYQGRWFASDWPPQMLEVPSHQYSSALFELYPIVVATLLWGDEWSA FT SSILIHCDNEAVVHCINRGRSHSPALMPLLRRLIWTSAKKQFILTAVHVPG FT FHNQIADSLSRLHFQKFRELAPEAEQHPTPIPPYSEMIFQ*" FT CDS 3825..4916 FT /product="DIRS-4_DR_2p" FT /translation="MHDLHQASISLIMQAVAPRTLQAYLTAWKTFKHFHSL FT YNTTFPNFSLLTITSFITYLHSHKHIQANSIKSYLSGIQFFHKLMYGSSSE FT SITNSQTSLLIKGIQKTRPPLPDTRLPITHNILAKCISTLRKGYFSFHTDH FT TLDAMFILAFFGFLRCSEFTVTSKFDPSIHPTIADLTLIDEETIAFLIKQS FT KTDQSRKGHYIYIFNIPSPTSPFQTLLAYTHYRKTLSASPLDPLFIDDTHH FT PVTRFWFQKHLKYVLTNSGFPSESYSSHSFRIGAATTAAHKGLSQQHIQTL FT GRWSSDAFKTYIRLSHSHLREAQRTLTSRCSYPSGQRHEPSTRKEHNPAIP FT ASRGQRNYQVSQGRGHDPAI*" XX SQ Sequence 6796 BP; 1875 A; 2179 C; 1015 G; 1725 T; 2 other; aagtgaagtt tttaaactaa tttcgagagg agcacgtgat ataattgacc gcagctggcc 60 gccwatctac actcattagt tagccaatca gatctattcc aaattactat aaatagccta 120 gctagatatt actcccttac cttcgttttc cgaagaacaa ggacaaaccc 
tgctcctaac 180 aaactaccga aaccctcgac aacaaacaac aacaacaaca acgacaacag ctacaacaac 240 agctacaaca acagctacaa caacagctac aacaacagct acaacaacag ctacaacaac 300 agctacagct acaacaacag ctacaacaac aacaacaaca acaacaacag caacaacaac 360 agcaacaaca gcctacatca acaataacag ctaacgcttc aacaacaaca gcttcaacaa 420 caacagctac tacaacaaca gctacaacaa aacagcgaca acaacagaaa gaactcacaa 480 cctaaacaga acaatccaac atcaaagcca tcaacaagca acatggaagg aaccaacaca 540 ccagctctcc cggaaacacc agcccaaaca caacaacaaa cccaaataaa caccgaggct 600 cctatcagag gccgaagacc catccgctcc acagtcacaa gaacccatcg ccgcacgcaa 660 tctccatctc caaaccgcaa cttaccgtct cccgcttcat cctacgcctc tgcaagatct 720 tcatccatca catccaacaa aatgactgtt tctgaactcc gccagtcact cacaaacgcc 780 ggaatttcca tccccacccg ctgcaataaa tccgaacttc tgaaactgta cgaagccatc 840 ccgtcaccaa ctccgcctcc ccaagacagt agaccaactc gctcccgcca caccccctat 900 ccacaaccct ccgctactca gcactcaaga aacccccctg gaccacccaa gaaagcaacc 960 aagaaaacta ataaaaagca acctcaagct acaggacaga magcaccttc taccaaccaa 1020 cacacagtga atccaccgga caattatgcc actccaggac ttcccacccc cctcctttgg 1080 cctccagccc cacaatccag cgaaaactcc agtccgactc ttcaagcaat tcccccgact 1140 ctcaaccctc ctcagttctc tctttcttct aatctccctc attcttcaac tcaacttatt 1200 cccaatcttt cttctaatct ccctcattct tcaacccaac ttattcccac tcaatccttt 1260 cctacaagct ccaacgctct ccctctgcct gctaattttt catctactaa tccccccttt 1320 tttccctcta catccctcca agcacccact tccattacta accctcccca acaaaatgct 1380 ttctgtacta acacatcttc cgcacgagcc cccttcaccc tagccacagc cacacccctt 1440 cccattccgc ataacgctcc agtcctggaa ccacctcaga tctccaacac agtcaggaac 1500 ctcatcctat caggtgcaga catagacctc tctacactcc tttcacctat tgcacctccc 1560 tcggcagatc gacaggtgga ttgcggcgaa ttcaccatta cacttaaatc accagtcagc 1620 tctcaacctc gcacactcac aatagccgaa ttccacgtag ctttctcacg ttatacagac 1680 accatctgct ctgtctttcc ccataggagg cgcgagctga acgactatat ggctatcatt 1740 tcggagctcg cactctccta tgggggaacg catttctata catatcacaa attattttca 1800 gcaaaatgcg ctattcgcgt tgctcaatgg aatcagtgtt cttattgggg ggctttggac 1860 
actgatctcc ataacagagt ttttctagga tgccgcaatc tttcctgcgc ggtctgccgc 1920 tcaaaccttc acccaaccac ttcctgtccc ttcataatcc cctccactga gaaagaacta 1980 caaaccccaa gatccactag ttacgtaccc cgcccttcta cctctgctat ccctgctctt 2040 ctcccccctc cctcctctca aaaccctcca tcatctctag cttgcaataa ctttaacgca 2100 gccagatgtt tccgccaccc ttgcaaatac ttacacattt gcagttactg cggtggcgct 2160 catgctcgag tggtctgcca agtgtggaaa gcaaataaaa aacatagatc ctatttgtcg 2220 actcctgtca atatttctaa tctttaccat gaattatgca tgcaccctga tcctaacttt 2280 tctgaatttc tcatttcagg tctgtctaat ggattccacc ccggtgtttc gactcttcct 2340 tcctataacc tcgcatgtcc taaccttcaa tctgctaacg ctgaaccaga agtggtggag 2400 caattaataa agaaagagat cgataataaa tttatgatcg gtccctttct tgcccccccg 2460 tttagcacct atcgagtcag cccaattgga gtagcgacca gaaaattttc gggcaaaaaa 2520 cggctaatta tcgacctgtc ttctccccat aattccgcct attcaagtgt caacagcata 2580 atttcacctg acgaattctc tctgaattac cacgatatag accaagccat ttctttaatt 2640 aaactcgtcg gacgcgacgc ctggctcgcg aaagtagaca tcacgtcagc tttcaaaatt 2700 atgccattgc atcccgagtt ctggcatctc tttggcatta attggaaatc ccaattctac 2760 tttgcagtcc gtttaacctt cggctgcaga agtagcccca aaatcttcga catgctttca 2820 gaagcattat gctggatcct cgctaacaat tacggcattc cgcacgtagt ccacctacta 2880 gatgatttcc tcataatttc ccctccaaat tccccacctg ctaaacacct agagattacc 2940 aaagcagtgt ttgccaaact cggcatccct ctagctgaag aaaaaaccgc cggccccagc 3000 accttcatag aattcttagg catcaatttg gactctaaca aatttcaagc atctttaccc 3060 aaagagaaag tcgatcgcat catttctcta tcttccatat ttttggagaa acaagaatgt 3120 tctaaacgcg aactgctgtc aatattagga catttaaatt tcgccatgcg catcatacct 3180 caaggacgcc cgttcgtcac tcacctcctt caactcgcag catcagttca gagtctagaa 3240 gaaaatatat ccttatccga tccatgccga aacgaactca gcctctggat ttccttcctt 3300 aagtgctgga acggctgttc tttcttttat agtgatttaa tttcatcccc cgtagacatc 3360 catctttata cagacgctgc accctccata ggatttggcg gttactacca aggccgctgg 3420 ttcgcatccg attggccccc ccaaatgtta gaggttccat cacaccaata ttcatctgca 3480 ttattcgaac tataccccat agtcgtcgcg accctattat ggggagatga atggtctgct 3540 tccagcattc 
tcattcactg tgacaatgaa gccgtcgttc actgcattaa tagagggcgc 3600 tctcactccc ccgctctaat gccgcttctc cgtcgcctta tttggacttc agccaaaaaa 3660 cagtttattt taactgctgt acatgttcct ggttttcata atcaaattgc tgactctctc 3720 tctcgtcttc attttcagaa attcagagaa ttagcgccgg aggcggagca gcacccgacg 3780 cccatccctc cttattcaga gatgatattc caataaatca tcccatgcac gatctgcacc 3840 aagcatccat atctctcatt atgcaagcgg tggctccaag aaccttacaa gcttatctca 3900 ctgcatggaa aacattcaaa catttccatt cactatacaa cactacattc cccaatttct 3960 ccctacttac aatcacatca tttatcactt accttcattc tcacaaacat atccaggcaa 4020 actcgattaa gagctattta agtggcattc agttttttca caaactcatg tacggctcca 4080 gttctgaatc tatcactaac tcacaaacta gccttcttat taaaggcatt cagaagaccc 4140 gcccccccct cccagacaca aggctaccca tcacacacaa catactagct aaatgcattt 4200 ccacactcag gaaaggctat ttttcatttc atacagatca taccctagat gcaatgttta 4260 ttcttgcctt ttttggattt ctaagatgtt ctgaatttac agttacatct aaattcgatc 4320 cctctatcca ccctactata gcagatctga ccttgattga tgaggaaaca attgctttcc 4380 tcattaagca aagtaaaaca gatcaatcca gaaagggaca ttacatctac atattcaaca 4440 ttccctcccc cacaagccca ttccaaactc ttctagctta cacacactac aggaaaacac 4500 taagtgcaag tcccctagac ccccttttca tagacgacac acaccaccca gtgacacgct 4560 tttggttcca aaaacacctt aaatatgtcc taaccaactc aggcttccca tcagaatcat 4620 actccagtca ctcattcaga attggagccg ccactacagc agcacacaaa gggttatcac 4680 aacaacacat acaaacacta ggaaggtggt cttctgacgc cttcaaaacc tacatccgac 4740 tcagccacag tcatctcagg gaagcccaga ggaccctcac cagccgttgc agttatccca 4800 gcggccaaag gcacgagcct agtacaagga aagaacacaa cccagctata ccagcttccc 4860 gagggcagag gaactaccaa gtctctcaag ggcgcgggca tgacccagcc atctaatttc 4920 ttttcttcct tccagctgat ttgcactcag ccttctcccc cttttcactc acccaactac 4980 agtaagagtt ctttccctgc ccaagccccc ccgtcacccc cgccccccct ggccgctgcc 5040 acagaagttt tcactgccca ccaacttttc taacttctgt aggaccccgc cccccccatg 5100 gctctggccc ccgcaggggc gttaccccga gcttcgattc ccgcaggaat catctcacgg 5160 cccggcctta gctttttata ttatatgtca tttcaatgac atatagtata gcactattta 5220 ttttctcttt gtttgatttt 
atttgtataa attcacatct atatgcaccc acgcatataa 5280 atatgaattt atatatagtg ctgtcaccct cacgctctgt tcccgcagga acacccccag 5340 agcacctata gcgcccgtca ccctccagaa agagtcttca ccttcccctc atctttccag 5400 actcctactg gagccagcca aatagctcac ccagccccga gctgtgacac tagccatgtc 5460 accgaccctc cccggcccta gcttttattc atttatttct gtttatttct cacatttatc 5520 ttttattttt taaatttatt tatatatatg tatatatata tgcacccacg catataaata 5580 tatatttata tatagtgctg tcaccctcac gctctaactc ccgcggagtt aatcccgagc 5640 accggacccc cgcaggggtc atcgcccaac tgccatttca ccctccagct ggggcttcac 5700 cgaccattcc ttttcccgac tccagctgga gatggcacat acagctctct ctcccgcagg 5760 agagccaaga gcttcgactc tctcaagagt cagtaaaaaa cgcccccccc caaggcccta 5820 gattacccca ttatatattt atatatctat atgtttcata taaatatata attatatata 5880 gtgctgccac ctcccagctc aatctccgca aggagtgttc ctcgagcaaa ttactccttt 5940 ggagtccccg cccccccctg ccccccttca cccccctctc cagccggagt ccttcactcc 6000 ccttcccttg taatgactcc agcaggattc ccgcccaccc catggctctg acccccgcag 6060 gggtctcccc gagtctctac tccagcagga gtattcacag cccaagccaa ctctgctcgg 6120 gttcccgcag gaacctgtgt caccctttgc tccaaggagc cctctttaca tttcctttca 6180 aataactata tccagcagcc ggatatagca tttcaagcct tttggggagt ttcttcgaat 6240 acacggctgc tgtcccgagc ttcatgcatt tggggagctc tcgagaacca cctgatctcg 6300 tactcccctt acatgctcta tggacctggc gggagccctg ggctcaacta tctccgagct 6360 cagggttctc tcccgggaca gcatgccaaa cctgcttaca gtcgtcaagc aatatctaag 6420 tgtgaactct tgaagtgaag tttttaaact aatttcgaga ggagcacgtg atataattga 6480 ccgcagctgg ccgccaatct acactcatta gttagccaat cagatctatt ccaaattact 6540 ataaatagcc tagctagata ttactccctt accttcgttt tccgaagaaa ccccccatcc 6600 accccctatc tcctcctttc ctccctttaa aaaggggagc tctcgagaac cacctgatct 6660 cgtactcccc ttacatgctc tatggacctg gcgggagccc tgggctcaac tatctccgag 6720 ctcagggttc tctcccggga cagcatgcca aacctgctta cagtcgtcaa gcaatatcta 6780 agtgtgaact cttgaa 6796 // ID Looper-N8_DR repbase; DNA; ZEB; 1107 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 
9.1, Last updated, Version 1) XX DE This is a nonautonomous DNA transposon that belongs to the DE piggyBac superfamily. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; piggyBac superfamily; KW HARBINGERN7_DR; Looper-N8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1107 RA Kapitonov V.V. and Jurka J.; RT "Looper-N8_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 4(10), 284-284 (2004). XX DR [1] (Consensus) XX CC Looper-N8_DR is a nonautonomous DNA transposon that belongs to CC the piggyBac superfamily. It is characterized by 16-bp CC terminal inverted repeats and TTAA target site CC duplications. This is a composite transposon; it contains CC a copy of HARBINGERN7_DR (pos. 483-368). XX SQ Sequence 1107 BP; 323 A; 259 C; 231 G; 293 T; 1 other; aggtgcagta ggtgatctgc caaaatgcta accgcttagc atattatctt tggacggcgg 60 ggagggactg tgattcaaag ccacaccccc tgaaatcgcg agcgcgcgca ccgcgtcaca 120 acagccgaca gacaacccac tagttcatgt cattcgccag ttagttagtg ccagcgccgt 180 gcaggaatta catgtattac tttgccacac ttacatgcta tttcagagcg aatattcaga 240 gtagcatggt aaacagtata ggaaggctgt cattgttcaa aaatgaccca atgtgaatat 300 aaaagcaact tcagctcaat aaagcaggtt aggcggaata gcgctattta ctggtgtttt 360 gttgttaaac taaatataaa tctacattat agatgctgtc aaagaacctt atgaaactga 420 aaataatcac atcaatcttt caccgggaga tttcagtggc tgaacaacac gtctgtgcat 480 ataagtcatt cataacacaa cacaatctaa cataaaatta gcctgactaa aatagtttca 540 aaacagaaca ttacctgtct aacagtaata cttcagccat ggtgtcgtcc ttcctccagc 600 gtgctaaagt aactccaata ttgattcagg tttttaaaag tttcaattca gcatttgatt 660 tctcccggtc cgtctcgtga ccgcgcggcc gctttaaggg cgaattatac ccgcgcctgg 720 ctttacagta atcggcccga gctcggctgt ccttcggcgc ctccggctcc gactcggcat 780 ctgcgagcgg cccgcagctc gctcgcgcgc atgcgtttgt gatgcagacg ggcacgcgct 840 aatggcggat ctgcgtgaac 
agatgcgcag aagtcggaat ctacatttgc tgacagacag 900 tctgacctgc ytatcggaat taagggagat gacggtccga ctctatttaa ttggatgaac 960 attttttagt tttatgcttt acccagaata taaaaataca tataaacaca tttagatcat 1020 ttactgtaat cattactatt ggactgtgaa gagactttca accagcacaa caaaaaatgt 1080 ttctgaagac aatcacctac tgcacct 1107 // ID TDR5 repbase; DNA; ZEB; 531 BP. XX AC . XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Zebrafish non-autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TDR5. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-531 RA Jurka J. and Drazkiewicz A.; RT "TDR5: a non-autonomous DNA transposon from Danio rerio."; RL Repbase Reports 2(2), 33-33 (2002). XX DR [1] (Consensus) XX CC Contains 13 bp TIRs. XX SQ Sequence 531 BP; 155 A; 109 C; 113 G; 150 T; 4 other; cccagcaggc acacaacatc ataagacgtt aatattaggt tagatttagg ttgtgatgtc 60 aggtgaccaa aattcaatgt ctagccagca tctaaggaca atgttatttt gacgtccaat 120 aacaacgtca aatgacgttg atatttggtt gattttaggt tgtgttggaa agtgaccaaa 180 atccaacgtc gagccaacat cttaaaccaa cgtcatattg acgtcaaata ctgacattta 240 ttcgtcaggt atggcaacca aaatccaacg tctgatagac gtcatagtgg taatgtccac 300 acaacgtcaa gctgtaacat cattagacgt tgatatttgg ttgattttag gttgtgttgg 360 aaagtgacca aaatgcaang tcngtccgac gttggacatt gacgtcagcc tgatgttggg 420 ttctgacgtc aacccgattt tcatttccaa acaaaatgca acgtcccacg acnttggggt 480 aatgtccaca acgtcaatct gacgtcatgt tgacntcctg tgcctgctgg g 531 // ID LINE_DR repbase; DNA; ZEB; 647 BP. XX AC . XX DT 31-JUL-2000 (Rel. 4, Created) DT 31-JUL-2000 (Rel. 4, Last updated, Version 1) XX DE Danio rerio LINE-like sequence - a consensus. XX KW LINE_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. 
XX RN [1] RP 1-647 RA Okada N., Hamada M., Ogiwara I. and Ohshima K.; RT "SINEs and LINEs share common 3' sequences: a review."; RL Gene 205(1-2), 229-243 (1997). XX RN [2] RP 1-647 RA Jurka J.; RT "LINE_DR."; RL Direct Submission to Repbase Update (JUL-2000). XX DR [2] (Consensus) XX CC An internal portion similar to a 35 bp SmaI SINE element CC is not included. XX SQ Sequence 647 BP; 167 A; 183 C; 103 G; 170 T; 24 other; aactaaactt ctctgamcac attwctagaa ctgctcgatc gtgcagattc gcactctata 60 acatcagaaa gatccraccc ttcttwtctg aacatgcagc tcaactcctt gttcaagctc 120 ttgttctctc caaactggat tactgcaact ctctactagc tgggcttcca gctaactcta 180 tcaagcctct tcaactgctc cagaacgcag cagcacgagt ggtcttcaat gaaccgaaac 240 gagcacatgt cactccgctg ctagtccgtt tgcactggct gccagttgct gctcgcatca 300 aattcaaagc tctgatgttt gcctacaaag tgacttctgg ccttgctcch tcttatctgc 360 tctcacttct gcagatctat gtgccctcca gaaacttgcg ttgtgtgaat gaacgccgcc 420 tcgtggttcc atcccaaaga gggragwaat cactttcscg aatgctcack ctsrcrytca 480 atctgcccag ttggtggaat gaactcccta actgcawmas aacrgcagag tcactygctr 540 yyttcargaa acgactaaaa actcaactat ttagtctcca cttcacttcc taatctgcaa 600 ttgcctctyt gaatatcaca ctaactgtac ncacaaaaaa aaaaaaa 647 // ID Gypsy109-LTR_Dr repbase; DNA; ZEB; 536 BP. XX AC . XX DT 29-APR-2009 (Rel. 14.05, Created) DT 29-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE A long terminal repeat of the Gypsy-109_DR LTR retrotransposon; a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW endogenous retrovirus; Interspersed repeat; GYPSY superfamily; KW Gypsy109-LTR_DR; Gypsy-109_DR; Gypsy-109-LTR_DR; Gypsy-109-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-536 RA Dib M.R. and Naveira H.F.; RT "Gypsy109_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 953-953 (2009). 
XX DR [1] (Consensus) XX CC Gypsy109-LTR_DR is a long terminal repeat of the Gypsy109_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC internal portion is deposited in Repbase as Gypsy109-I_DR. XX SQ Sequence 536 BP; 107 A; 174 C; 106 G; 149 T; 0 other; tgtcaccgac tcggtcccag tcattcccct cgctggccag cagaggccgc catccccgga 60 cttctagcat tacatcatcc acatagactg attgtgcaca cacctgaact gaatcacggt 120 aatgacccac gccacctata taagccacac tcaaaccact gttcagtgtg aagtcttgtt 180 tagccccggc cagcattact gaccgttctt tttcctgcct gatctcctgt gcataacccc 240 ggactgtttc tgactctgag ttgccttctg cctccccacg acccttgctt gatacacgga 300 ctctgaacca cgctgcctgc cctcgaccca cgcctgtctt aaggattctg aaccacgccg 360 cctgccactg atctatgcct ggtaaatcac tctgtgtctg tcagccgcca gccccacgac 420 ctttattgat tactgttgat gtgtgttcgc actttagtgc gtgttggatg tttgtgtttg 480 actgtgtcta ataaatactg caaaatggat ccctccgtgt cagtctcccc gttaca 536 // ID VIRDR1 repbase; DNA; ZEB; 3816 BP. XX AC AL591144; XX DT 04-MAR-2002 (Rel. 7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE Retrovirus Danio rerio 1. XX KW ERV1; Endogenous Retrovirus; Transposable Element; retrovirus; KW leukemia virus; VIRDR1. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3816 RA Jekosch K.; RT "VIRDR1: Danio rerio retrovirus."; RL Repbase Reports 2(2), 38-38 (2002). XX DR [1] (Consensus) XX CC Putative novel retrovirus similar to leukemia viruses with a CC reading CC frame in pos. 637-3816 with several frame shifts. 
XX SQ Sequence 3816 BP; 1151 A; 690 C; 936 G; 1039 T; 0 other; ggtggtaaaa ttttcagaca tgcatttgtt tgttcaccaa aatgacacgt ttccatcatc 60 ggaagggaca ttttgtgtaa actaaatttg attttaacgg cagattcctc tggtgtgaga 120 gtgattgaag gggaggaatt ttgtcgttta caaattgaat cacaagagcc gaagtgggcc 180 tatgaatggc tgattgaaga caatgaatgg gctaattaaa tttgcaaatt ggctaaagaa 240 cgtgttaaac cttttgacag tgatatgatg tctcctggta aactgcattg tacgtcacat 300 gtcgtaattt acagatgagg aatttgagaa agcttggttt gagacaaatg tgaatgaaac 360 gttgatttta gagaaaatgt actggaagca tagtctgtgt gcagtttcag tgtctctgtc 420 agaaaagcag ctttcattct atctcttgtc agcacaggct gtgcctcaca tatcagtttg 480 caagggtaaa caacaatctt gggctgattt agggccgttt gtgaagcagt gtgttgatgg 540 aatttagcaa aagcatttag agtagacagt gaaactgaaa cagctgtttc aaaaactgtg 600 actgctattg acaaactttg cgtaaagaat tattgcatgg ttgactttaa cgctgaagac 660 atacacccag ctttagcaga aataccaagt gaattgtggg ctaaaagcaa atatgatgtt 720 ggtttaatca aaggttgtga tccagtgaaa atcactgcaa aatctgataa cagaccttgt 780 caacaacaat atcctttgaa aagggaagcc attgagggca ttaccccagt atttgaggct 840 ttactgaaac agggtgttat tgtaccatgc aacaattctg aggttcgcac ccctattttt 900 cctgttaaaa agataaggga taatgggatg cctacagaat ggcgttttgt acaggattta 960 caagcagtaa atgcagctgt caaacaaagg gctccattag ttccaaatcc gtacacgatt 1020 ttgtcacaaa ttcctgaaaa atcacaattg tattcagtgg ttgatctggc aaatgcattt 1080 ttcagtgtgt cagtggacaa agacagccaa ttttaggttt gcatttaatt tcaatggaaa 1140 aggctacacc tttacatgtc tgtgtcaggg ttttacagca tctccaaact tgcaatgaag 1200 cgttgttaag aagtttggaa cctttgactc tgacagctgg aactgctttg ttacagtatg 1260 ttgatgactt gttgatatgt gctgaggatg aggagacatg tgtgaaagac actgtgactc 1320 tccttaaaca cctagctaag gagggccaca aagtcagttt gacaaaattg cagtttgtta 1380 aacaaaaggt aacatttttg gggcatgtca ttacaccaca cagtaaatct ctgtctgaaa 1440 aaaaggggga gtggtataaa aaaatttacc aaaaccgctg acgaaaaaac aaatgttgtc 1500 atttttggga atgtgctcat attgtcgcac atttattcca aattatgcaa ttttggaaca 1560 acccctgaga gccctaacat tagggaaggg gatgaaatcc actgacaaac tagagtggac 1620 gaaagaggca gagcaggcat ttgtaaacaa 
gaagttacaa atggctgagg cccctgcatt 1680 gggtttacct gtaccaacaa aaccatttgt tcagatggta gatgagagaa atgggttcat 1740 gacgtcattg ctcctacagg atcatggagg tagattgcga cctgtggcct actttgagca 1800 aacttgacct tgtagcagca ggcctgacac gttgcttaag agcagtggtg gctgcggaaa 1860 aagctgttat ggcttcaaga gattttgttg gttattctga tctgatattg atggtgccac 1920 attctgggtc catgatactt caagaacaaa aaacattgca tctgtcaaca gcccgctggc 1980 tgagatatca cactatcttg ttagatatgc caaatgtgac tgttaaacga tgtactggtt 2040 tgaatgcagc tactcttctt cctactgagg aggatgggga agagcatcat tgttgtttaa 2100 cagcacttga acaggtgtgt tcgccacgac ctgacctctc tgacgaacca cttgaaaatt 2160 gtgacaatgt cctctttgtg gatggttcaa agatggtcag cgttcaaaga tccacaaaca 2220 ggccagaata aggttggtta cactgtaaca actgaatttg atgtgatggc ttctggtaaa 2280 ttaccagggc actattcggc acaggccgca gagcttgtgg cgttgacaga ggcatgtaaa 2340 ttgatggcag aaaaagaagc taccatttac actgactcta ggtatgcatt tggggtagct 2400 catgattttg gggctctgtg gaaacacaga aaatttctaa agtctgatgg tcgacaaata 2460 ctcaatgctc ctttagtggc agcgctgctt gatgcgattt aactacctga caaactggcc 2520 atttgtaaat gcgcagcgca caccagcaat aaagattcgg tttctgcagg taactccata 2580 gcagacgcag cagcaaaagt tgcggcgtcc cgagacgagg acaactctga atgttctctg 2640 ctatctgtcg ataatgacaa tgacgtgtgt tcttctttgc aggatatgca gaccttcgcg 2700 acggggctgg agaaaaaaca agtggagaca gtctggctgt gtgatgaaag ataatgtgtg 2760 gaagtgtgct gagggcaagg cgtgtttacc aaaacatttt ttccaacatg atgcaaaatt 2820 acttcacggt aaagatcatg tgtcaaaaac agcaatggtt gcgcaaatga acgaactgtg 2880 gttcacaaag gggttcacta catttgcgga caatttctgt agacgatgtg taatttgcaa 2940 cacacaatgt ggccagagcg ataaaggttc cactatcatc tcatccacct ccaacagggc 3000 cttttgaata tttgatgatg gatttcattg aattgtcccc atgcaatggg aaaaggtatt 3060 gtttggtgat ggttgacatg tggtcaaagt aggttgaagt ttttccaacc tcaaaacaag 3120 attcggctgc ggtagcaaaa gcgttactga ctgaaatcgt gccgagatgg ggaataccac 3180 taaaaataag ctcagataat ggtacccatt ttgttaatga agcaatcaaa caagtgggcc 3240 aatatttgga aattgatttg agaacacatt gcagttacca ttcagcttca ggcgaagctg 3300 ttgagagaga aaatggtatt ctgaaaaaac aaattggcaa 
aatgttgtga agacacaggg 3360 cttacatggg ttcaagctct acccattgtc ctgatgtaca tgagaatgag aaaaagacca 3420 aaaatgaatt tgagcccgtt tgaaattgtc tttggcagac caccgcgtgt aggtgtgaat 3480 gggggtaaac agcagctgcc ctcaacagat gtgtgtgagg atgacatgtt gaattactat 3540 aaagaaatgt ctcatgtgtt gtccaatgtt tgtgtgcagg taaaggccgc attggggaag 3600 gcagctgaca gaccactaca caatctgagg cctggcgatt tcgtggtgat aagggacctg 3660 aggagaaaga gctggagagc gaaacgttgg ctgggtccat ttcaagtgct actgaccact 3720 gagacagcgg tgaaggtcgc ggagcgggcg acgtgggtgc acgctgggca ctgcaggaaa 3780 attccaccac ctgaggagga ttctgcgagg gagtag 3816 // ID EXPANDER1_DR repbase; DNA; ZEB; 3350 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 10-NOV-2010 (Rel. 7.05, Last updated, Version 2) XX DE EXPANDER1_DR is a RTE-like non-LTR retrotransposon - consensus. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; ORF2; RTE clade; REX3; EXPANDER1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3350 RA Kapitonov V.V. and Jurka J.; RT "EXPANDER1_DR, a family of RTE-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(5), 15-15 (2002). XX RN [2] RP 1-3350 RA Kojima K.K. and Jurka J.; RT "Consensus update."; RL Direct Submission to Repbase Update (10-NOV-2010). XX DR [2] (Consensus) XX CC EXPANDER1_DR is a family of RTE-like non-LTR retrotransposons and CC it CC was active in zebrafish a few million years ago. CC There are several thousand copies of EXPANDER1_DR fossilized in CC the CC zebrafish genome; they are ~91% identical with the consensus CC sequence. CC The 5' portion of EXPANDER1_DR is incomplete. CC The consensus encodes a 3' portion of the reverse CC transcriptase. CC EXPANDER1_DR is 87% identical to EXPANDER1 from the fugu genome. 
CC Such a high nucleotide identity between non-LTR retrotransposons CC from the species diverged more than 100 million years ago CC strongly indicates that horizontal transfer events were involved CC in evolution of EXPANDERs. CC [2] Consensus update and extension. The full-length sequence is CC >74% identical to RTE elements in fugu, medaka, stickleback, and CC lamprey. XX FH Key Location/Qualifiers FT CDS 501..3338 FT /product="EXPANDER1_DR_1p" FT /note="apurinic-like endonuclease and reverse FT transcriptase, N-terminal truncated." FT /translation="LEKVGLTSTHSLGSGTQLLEKGWTFLYSGVAHGERQR FT AGVGLLIAPQLSRHVLEFSPVNERVASLRLRVGDRSLTVVCAYGPNTSAEY FT PAFLESLGGVLEGAQTEDSIVLLGDFNAHVGNDSDTWRGVIGRNGLPDLNP FT SGVLLLDFCANHSLSITNTMFEHKGVHQCTWHQDTLGRRSMIDFVVVSSDL FT RPYVLDTRVKRGAELSTDHHLVVSWIRWQGGKLDRPGRPKRIVRVLWERLA FT EPQVRGIFNSHLRKSFDQIPREAGDMETEWSMFSDSIVNAAVRSCGRKVCG FT ACRGGNPRTRWWTSEVRDAVKLKKESYRAWLACRTPETADGYRRAKRAAAR FT AVAEAKTRAWEEFGETMEEDYRSAPKRFWQTVRRLRRGKHLHIDAVYSGSG FT ELLTSTGDVVGRWKEYFEDLLNPTDMSFIEEAETGDAGVDSPITQAEVTEV FT VCKLRSGKAAGVDEIHPVYLRSLDVVGLSWLTRLYNIAWRSGTVPLDWATG FT VVVPIFKKGDRRVCSNYRGITLLSLPGKVYARVLERRIRPMVEPRIQEEQC FT GFRPGRGTLDQLYTLTRVLEGSWEYAQPVHMCFVDLEKAFDRVPRGILWRV FT LGEYGVRGDLLRAVSSLYEQSRSLVRIAGNKSDLFPVHVGLRQGCPLSPIL FT FIIFMDRISRRSLGLEGVRFGDHRISSLLFADDVVLLASSDMDLQHALGRF FT AAECDAAGMRISTSKSEAMVLHRKKVVCHLQVGGKSLPQVEEFKYLGVLFT FT SEGRMEREIDRRIGAAAAVMRSMYRSVVVKKELSRKAKLSIYRSIYVPTLT FT YGHELWVMTERTRSRIQAAEMSFLRRVAGRTLMDRVRSSDTREELGVEPLL FT LHIERSQLRWLGHLFRMPPGRLPREVFQACPTGRRPRGRPRTRWRDYVSRL FT AWERLGIPPEELEEVSGDREVWGSLLRLLPPRPGPGKAAEDE" XX SQ Sequence 3350 BP; 705 A; 802 C; 1101 G; 742 T; 0 other; tgaactgcca ccttatcgtg gtggaggggt ttgagtgcct gagtaatcct aagagctatg 60 ttgtcggggg ctaatgcccc tggtagggtc tcccaagaca aacaggtctt aggtgacagg 120 tcagactaag tttggttcaa aaacccctta tgagttcaac aacatcaagg actgtgatgt 180 cgcccggtat ggcacagctg gggccccacc ctggtgccag gccttgggtt ggggctcgta 240 tgtgagcacc tggtggccgg gttttttccc acggaacctg gccgggctca gcccgaagga 300 gcgacgtgct 
accatcctcc cgcaggccca ccacctgcag ggggagctgt aaggggcagg 360 tgcattgtgg aatgggtggc ggtcgaaggt taggactacg acaacccgat cgtcaggcac 420 agaaactggc tattgggaca tggaatgtca cctcactttg aggaaaggag cccgaactgc 480 ggtaggttga acggtactga ctagagaaag ttgggctcac ctccacgcac agcttgggct 540 ctggaactca acttcttgaa aaggggtgga ctttcctcta ctctggagtt gctcacggcg 600 agaggcagcg ggctggtgtg ggcttgctta tagctcccca actcagccgc catgtgttgg 660 agttttcccc ggtgaacgag agggtcgcct ccctgcgcct tcgggtcggg gataggtctc 720 tcactgtggt atgtgcctac gggccaaaca ccagtgcaga gtacccggcc ttcttggagt 780 ccttgggagg ggtgctggaa ggtgcccaga ctgaggactc cattgttcta ctgggggact 840 tcaatgccca cgtgggtaat gacagtgata cctggagagg cgtgattggg aggaacggcc 900 tccctgatct gaacccgagt ggtgttttgt tattggactt ctgtgctaat cacagtttgt 960 ccataacgaa caccatgttc gagcataagg gtgtccatca gtgcacatgg caccaggaca 1020 ccctagggcg gaggtcaatg atcgactttg tggttgtatc atctgatctc cgaccgtatg 1080 tcttggacac tcgggtaaag agaggggcag agctgtcaac tgatcaccac ctggtggtga 1140 gttggatccg ctggcaaggg ggaaagctgg acagacctgg taggcccaag cgtattgtga 1200 gggtcctctg ggaacgtctt gctgaacccc aagtcagagg gattttcaat tctcacctcc 1260 ggaagagctt tgaccagatc ccgagggagg ctggagacat ggagactgag tggtccatgt 1320 tctccgactc cattgttaac gcggccgtga ggagctgtgg tcgtaaggtc tgtggtgcct 1380 gtcggggcgg caatccacga acccggtggt ggacatcgga agtaagggat gccgtcaagc 1440 tgaagaagga gtcctacagg gcttggttgg cttgcaggac tcccgagaca gctgatgggt 1500 atcgacgggc caagcgtgct gcagcccggg cggttgcgga agcaaaaact cgggcctggg 1560 aagagttcgg ggagaccatg gaggaagact atcggtcggc cccaaagaga ttctggcaaa 1620 ctgtccggcg cctcaggagg ggaaagcatc tccacatcga cgctgtttac agcggaagtg 1680 gggagctgtt gacctcaact ggggatgttg tcgggcggtg gaaggaatac tttgaggatc 1740 tccttaaccc caccgacatg tctttcatcg aggaagcaga gactggggat gcaggggtgg 1800 actcacccat cacccaagct gaggtcactg aggtagtttg caagctccgc agtggcaaag 1860 cagcgggggt ggatgagatt caccctgtgt atcttaggtc tctggatgtt gtggggctgt 1920 cgtggctgac acgtctttac aacatcgcat ggaggtcggg gacagtacct ctggactggg 1980 caactggggt ggtggttccc atttttaaga 
agggggaccg gagggtgtgc tccaactata 2040 gggggatcac actcctcagc ctcccgggga aagtctatgc cagggtactg gagaggagga 2100 tccggccaat ggttgaacct aggatccagg aggaacaatg cggttttcgt ccgggccgtg 2160 gtacactgga ccagctctat accctcacca gggtgctcga gggttcatgg gagtatgccc 2220 aaccagtcca catgtgtttt gtggacttgg agaaggcatt cgatcgtgtc cctcgcggca 2280 ttctgtggag ggtgctcggg gagtatgggg tcagaggcga tctgttgagg gccgtctcgt 2340 ccctgtatga acagagtagg agtctggttc gcattgccgg caataagtca gatttgtttc 2400 cagtgcatgt tggactccgg cagggctgcc ccttgtcacc aattctgttc ataattttta 2460 tggacagaat ttcaaggcgc agccttgggc tggagggggt ccggttcggg gaccacagga 2520 tttcatctct gttattcgca gacgatgttg ttctgttggc ttcatcggac atggaccttc 2580 agcatgcact ggggcggttt gctgccgagt gtgacgcggc tgggatgaga atcagcacct 2640 ccaagtccga ggccatggtg ctccaccgga aaaaggtggt ttgccatctc caggttggag 2700 gaaagtcctt accccaggtg gaggagttta agtatctcgg ggttttgttc acgagtgagg 2760 gaaggatgga acgtgagatt gacaggcgga ttggtgcagc ggcagcagta atgcggtcga 2820 tgtaccggtc cgttgtggtg aagaaggagc tgagccgaaa ggcaaagctc tcgatttacc 2880 ggtcaatcta cgttcctact ctcacctatg gtcatgagct ttgggtcatg accgaaagga 2940 caagatctcg gatacaagcg gccgaaatga gtttccttcg aagggtggca gggcgcactc 3000 ttatggatag ggtgaggagc tctgacaccc gggaggagct cggagtagag ccgctgctcc 3060 tccacatcga gagaagtcag ctgaggtggc tcgggcatct gtttcggatg cctcctggac 3120 gcctacctag ggaggtgttc caggcatgtc ccaccgggag gaggcctcgg ggaagaccca 3180 ggacacgctg gagggactat gtctctcggc tggcctggga acgcctcggg atccccccgg 3240 aggagctgga ggaagtgtct ggggataggg aagtctgggg ttctctccta agactgctgc 3300 ccccgcgacc cggccccgga aaagcggcag aagatgaatg aatgaatgaa 3350 // ID DIRS-3_DR repbase; DNA; ZEB; 5342 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-3_DR. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5342 RA Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(10), 1268-1268 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS join(308..1777,1752..3764) FT /product="DIRS-3_DR_1p" FT /translation="MEKTFRSCVPPCPRFLTEQDTHDLCVQCLGQEHAQSA FT LEGGGCVHCDSLPGRVLRSRRALFDKEPAAGVPRGTGPARAEAERRLMSWG FT SQLDLAEGMETSSRASSPPSTVRSRSSQRELEARSSASSSRADGRLLRLPA FT SEGVESMEGLRGQNDVPPLCPEYDELAEVLANVTAKYNINWEVERQEVRQK FT AGLCDERILPSRDRPLRGGLPFNKDLQKEIIYTWKNPYTARVISQQGSIYS FT AVAGLNEHGCRTMPKMEEHLARHLLPQASSAKTPALPSKPXKLTSMLVGKA FT YAAAGQSVGCLHTMSLLQAYQADLLKEALDNGGAGPDTVREXLRASDLSLR FT ATKEVARSLGRSMAAMVVTERHLWLNQADIKEEDKRIFLDAPISPAGLFGD FT SVDRVAKSFERAKKRSGSYGSFLRPKGHPSGAMQRDQPQPSTSSSARYRAE FT QKESVASRQPPKRKWGGKRSQEAKTVTYGGPRTIVVKRSGPQKKSNGLGLR FT RSPSXGTSSGSPQRGAKATSSLPTGGLSTNPPTVGASGRGDPRRYPQTFHP FT KSGDVSGLAPSTEGPRAVKSRASCGEFASVPRVSYSGEHRDQSRETGTPAE FT IFGRVETAAECFAVGPAYHREGLQNSVCFTPLSLQRGAPYPSEARAGSSDG FT TRSRISSEEESHRTDPSSRHRVRVLQPLLHCSKEGWRAASDIRSKTAKSLR FT TNTEVQNVDYQHHRVTNTVRGLVCHDRSEGRILPHIHPSKSQEVPQIRLRG FT QGLSVQGSSVRPSTLTPHIYQSRRRGFGSAAATRDSNSQLPRRLAHSSPIE FT GVSGSTSRCCSRSHREIGVAAKPKEKCACASSDDHFFRCAMGLHDDVCVSV FT PHPNRVDSGSRKEDQTGPSHHXQTVPEAVGSTSGSLQHNPIGSATHETITV FT VAQNQGVFPEGKSVPHNQSIEALPTSVIDMEKALVSVPGPYTRSNISAPGS FT DIRCLTQGLGSDPEGPSRIGTVEGPSSPHAYQLLGDVSSVSGSKALLPSSE FT RSPCVSEDRQHIGGLLHQPSRGFEFAPSVQVGESNPSVGSGEAALSEGSLH FT PRXDECGSGPPVETGVGAWGMATTPKSGGGHLAEIRQSRRRSIRLSXNNTL FT PIVVLPHKHIQHLWGWMPWCRRGRGFVCTLFPQSLCSRESWRGSVKGGTTY FT CW" XX SQ Sequence 5342 BP; 1235 A; 1398 C; 1430 G; 1270 T; 9 other; tggttccctt tcgggaactc tacgctgcgt ctctttagga gacacaatgg gataccatcc 60 ctctcttata accttctgaa 
gcacaggtgt aatcaatcca atgtaattgg cgggacgcaa 120 cgtgcgtgtg gcgtcagacc gaaggtataa aagccccaca ctcacagtga acttcagctt 180 tttctctctt cactctgcga atgagttcgt tgattttgaa gcactcacag cactgataag 240 gcactacaaa aaccatttaa actgaatcaa agcacacact ttcattttct atttaaaaat 300 tagcaaaatg gaaaagacat ttagatcatg tgtacckcca tgtcctcgtt tcctcacaga 360 acaggacacg catgatttgt gtgtgcagtg cctcggtcag gagcacgcac agtcagccct 420 tgaggggggt ggctgtgtgc attgcgattc acttcctggy cgagtgctgc gaagtcgccg 480 cgctctgttt gataaagagc cggcggcagg tgtgcctcgc ggcactggtc ccgcccgtgc 540 tgaggccgag cggcgattaa tgtcgtgggg atcgcagctc gatctggcag aggggatgga 600 gacaagctcg agggcttctt ctcctccttc tactgtgaga tcgaggtcca gtcagcgcga 660 gctggaagcc cgttcatcgg cttcttcctc tcgcgctgat gggcggctgc tccgcctccc 720 ggcttctgag ggagtcgaga gtatggaggg ccttcggggc caaaatgacg ttcccccatt 780 atgccccgaa tatgatgagc tggcagaggt cctggccaat gtaacagcta aatataatat 840 taactgggaa gtagagaggc aggaagtgcg ccaaaaagct ggtttgtgtg atgaacgcat 900 cctgccatca cgagacaggc ctctgagggg gggtcttcca tttaacaaag atctccaaaa 960 agagatcatt tacacttgga aaaatccata cactgcccgt gttataagcc agcagggatc 1020 aatatattca gccgttgctg gtctgaatga gcacggatgc cgcacaatgc cgaaaatgga 1080 ggagcatttg gcgcgccatt tgctacctca ggcatcgtcg gcgaagaccc ccgccttgcc 1140 gtcaaaacca rtaaaattaa catcgatgtt agtcggcaag gcatatgcgg cagcaggcca 1200 gtctgttggg tgtctgcaca caatgtcact tttgcaggca taccaggcgg acctgctgaa 1260 ggaagccctt gataatggtg gggcggggcc cgatacagta agagaartac ttcgggcctc 1320 ggacctgtct ctccgtgcca ccaaagaagt agctagatct ttaggccgct ctatggcggc 1380 catggtggta acggagagac atttatggct gaatcaagcg gacatcaagg aggaagataa 1440 acgwattttc cttgatgctc ccatttcgcc cgcgggtttg ttcggcgact ctgtagatcg 1500 ggtcgccaaa tcgtttgagc gggcgaaaaa gaggtcgggc tcctacggga gcttccttcg 1560 accaaaaggg catccttctg gggctatgca gcgggaccag ccccagccgt caaccagctc 1620 atctgcccgg tatagagccg agcaaaaaga gagtgtggct tcccgtcagc ccccaaaaag 1680 gaaatggggt gggaaacgct ctcaagaggc gaagacggtg acatacggtg gtccgaggac 1740 catcgttgtg aaacggtctg ggcctcagaa gaagtcctag 
caytgggact tcttcgggca 1800 gcccccaacg aggagcaaag gccacctcct cgttgcccac agggggtcta tccaccaacc 1860 ctcccaccgt tggtgcctca gggcgtggag acccccggcg atatccacaa acgtttcatc 1920 ccaaaagtgg tgacgttagt ggactcgccc cctctactga gggtccaaga gcagttaaat 1980 caagggcctc ctgtggtgag ttcgcctcag tgcccagagt tagctactca ggggaacata 2040 gagaccagtc tcgagagact ggtacccctg cagaaatttt tggcagagtg gaaacggctg 2100 ccgaatgttt cgcagtgggt cctgcttacc atagagaagg gctacagaat tcagtttgct 2160 tcacgcccct ctcgcttcaa cggggtgctc cataccctag tgaagccaga gcaggctcta 2220 gtgatggaac aagaagtaga atctcttctg aggaagagag ccatagaaca gatccctcct 2280 ctagacatcg agtcagggtt ttacagccgt tacttcattg ttccaaagaa ggatggaggg 2340 ctgcgtccga tattagatct aagacagcta aatcgctccg tacaaacact gaagttcaaa 2400 atgttgacta tcagcaccat cgtgtcacaa atacagtccg aggactggtt tgtcacgata 2460 gatctgaagg acgcatactt ccacatatcc atccttccaa gtcacaggaa gttcctcaga 2520 ttcgccttcg ggggcaaggc ttatcagtac agggttcttc cgttcggcct agcactctca 2580 ccccgcacat ttaccaaagt cgtcgacgcg gctttggctc cgctgcggct acaagggatt 2640 cgaattctca attacctcga cgattggctc attctagccc gatcgaggga gttagcggtt 2700 caacatcgag gtgttgttct cgctcacata gagaaattgg ggttgcggct aaaccaaaag 2760 aaaagtgtgc ttgtgccagc tcagacgacc acttttttag gtgtgctatg ggactccacg 2820 acgatgtttg cgtgtctgtc ccccacccga atcgagtcga ttcgggcagc cgcaaagagg 2880 atcagactgg gccaagccat cacwgtcaaa cagttccaga agctgttggg tctactagcg 2940 gcagcctcca acataatccc attgggtctg ctacacatga gaccattaca gtggtggctc 3000 aaaaccaggg ggttttcccc gaggggaaat ccgttccgca caatcaaagc atcgaggcgt 3060 tgcctacgag cgttatcgat atggaaaaag ccctggtttc tgtcccaggg ccctacacta 3120 ggagtaatat ctcagcgcct ggctctgaca tcagatgcct cacgcaaggg ctggggagcg 3180 accctgaggg gccttcccgc atcgggacag tggagggacc atcatctcca catgcatatc 3240 aactgcttgg agatgttagc agtgtttcag gctctaaggc acttcttccc tcaagtgaga 3300 ggtcaccatg tgttagtgaa gaccgacaac acatcggtgg tctcttacat caaccatcaa 3360 gggggtttga attcgcgccc tctgtgcagg ttggcgaatc aaatccatct gtgggctcag 3420 gggaggctgc tctctctgaa ggcagcttac atcccaggyc cgatgaatgt 
gggagcggac 3480 ctcctgtcga gacaggggtt ggagcctggg ggatggcgac tacacccaaa agtggtggcg 3540 gccatttggc agagattcgg cagagccgac gtcgatctat tcgcctgtca raaaacaaca 3600 cattgcccat tgtggttctc cctcacaaac acatccagca cctttggggc tggatgccat 3660 ggtgcagacg tggccgaggc ttcgtctgta cgcttttccc ccaatcgctc tgctcccggg 3720 agtcctggag agggtccgtc aaggggggta caacctattg ctggtagccc cttattggcc 3780 cacacgagtg tggttctcgg acctagtgtc tctcctcgac ggtctcccat gggagattcc 3840 cgtccagaga gacctcctgt cccaggcgga gggaatgata gtacaccccc gcccggacct 3900 ctggaaactg tgggtgtggc ctctgagggg gcccaccttg tagatcttgg tttgtcaact 3960 gaggttgttc aaaccatact aagctccaga gctccctcca cgaggagatt gtatgccacc 4020 aagtggaaac tttttacttc gtggtgtaca gaccaccacc tggatccagt ccactgccct 4080 gcggggtcag tgctgcaatt tctccaggag cgttttgaat tcggtttgac tccgtcaacg 4140 cttaaaggtt atgtagcggc aatgtccgca taccgtactg atggtcttgg caaagaccct 4200 ctggtggtca gattcctccg tggaacgagg aggttgaggc ctgcctgcgc caataggttt 4260 cctacttggg atttgtcgat agtgcttgag ggcctgtcga cagccccctt tgaaccaatt 4320 gaggatgtgt cagaaaagtt tctgaccctt aaaacgtttt ttttttgttg gccattacat 4380 ccatcaaaag agtaggagat ttacaagcat tgtctgtagc tccctcttgt ctggaattct 4440 cacctggtat ggtgagagca tttctgtacc ccagagcagg gtacgttcct aaggtcccca 4500 ctgaggtggt gcgacctact gtgctgcagg cctttaatcc tccaccattt atgacgccag 4560 atcaggagcg cttgaatctg ctttgcccag tgcgggcact agatgcatac gtacatcgta 4620 cgtctgcttg gcgtacaaca caacagttgt ttgttttgta tggctcaccc aaaattgggg 4680 cgccagcatc taagcagtct ctgagccggt ggatagtcga ggctatttca ctagcatatg 4740 aagccctgca tcggcctcta cctgaggcga tcagggctca ctcaaccagg agtatggcgg 4800 cttcgaaagc ctttcgttct ggccagtccc tgactggcat ctgcaatgca gctgggtggt 4860 ctaccccaca tacctttgta aggttttatc agttggacct ggaccctact ccgggttcca 4920 gtgtcctagc aatataggag aactcatccc cggcatgtaa aactatggcg tgtttgggat 4980 agcgttccca ttgtgtctcc taaagagacg cagcgtagag ttcccgaaag ggaacgcgtc 5040 aggttacgaa tgtaaccatg gttccctaag ggaacgagac gctgcgtcgc gttgccatac 5100 tttttcatac ctggtgcgct ccgttcgaga ggtataagct gaagttcact gtgagtgtgg 
5160 ggcttttata ccttcggtct gacgccacac gcacgttgcg tcccgccaat tacattggat 5220 tgattacacc tgtgcttcag aaggttataa gagagggatg gtatcccatt gtgtctccta 5280 aagagacgca gcgtctcgtt cccttaggga accatggtta cattcgtaac ctgacgcgtt 5340 tt 5342 // ID Gypsy-16-I_DR repbase; DNA; ZEB; 6775 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-16_DR LTR retrotransposon - a DE consensus sequence. XX KW GYPSY superfamily; Gypsy-16-I_DR; Gypsy-16-LTR_DR; Gypsy-16_DR; KW LTR retrotransposon; endogenous retrovirus; gag; integrase; KW protease; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6775 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-16_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 5-5 (2005). XX DR [1] (Consensus) XX CC Gypsy-16-I_DR is an internal portion of the Gypsy-16_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy-16-LTR_DR. The consensus sequence was reconstructed CC based on multiple alignment of nine proviral copies (they are CC less than 1% divergent from the consensus sequence). CC Gypsy-16_DR retrotransposons are characterized by 4-bp CC target site duplications. The internal portion contains two CC ORFs encoding the 645-aa Gypsy-16_DR1p gag (pos. 109-2043) CC and 1601-aa Gypsy-16_DR2p pol proteins (pos. 1971-2043) CC composed of the protease, reverse transcriptase, and integrase CC domains. The second protein, including the protease domain, CC does not start from Met. Presumably, the gag-pol fusion protein CC is formed originally due to a ribosomal frameshift. This family CC is likely still active in the genome. Each of the nine proviral CC copies is flanked by identical LTRs. 
XX FH Key Location/Qualifiers FT CDS 109..2043 FT /product="Gypsy-16_DR1p" FT /translation="MDIIEKENVDISKAVIVGGMTLTETDSDLESWLLRYG FT SINRHLLIDDPDCEFHRHAIIEFTHNSAMKTLMPLLPLTVVSMSNPSTTFM FT VRALSCVYPHIASDSATNGYLEELQNIASFSGKSIEEVLQTELLKIKFGPS FT HAESLPVLDKKLEFPNAARSQILDRSTVSSPNRLLSPVISQSMITEQTAFP FT SSRISPFHEVESTNSKNLSKESLNHRSTKPTVTVSSHPALTMDIIDPPSVQ FT KVVVEHIVRTNDTAPMHHTSFRLRSFSGKIPRPVNEPDFDTWRASVDLLLT FT DPSISDLNRARKIIDSLLPPAADIVKHVSPNSLPAVYLELLESVYGSVEDG FT DELLARFMNSFQNNGEKPSTYLHRLQVLLSTAIRRGGIFEEERNRYLLKQF FT CRGCWDSSLIADLQLERRKATPPSFAELVVLIRTEEDKNASKEERMRKHLG FT LNKHYPAPSKFRLSAHQISAHQSETQDDQTDTSLAKQVCELQAQVVALQKP FT SSQKEKKKNAKPDEVSELRNVVTELQAQITAMQTTATPKIKSDVEATEIAD FT LKRQIADLKVQLTAPDMYRNRTRNLLPEPRATDCYRASKLPESRPRPGYCF FT RCAEDGHLASSCSNAPDPTKVAEKKRKLRERQAQWDTQQVAIMNPLN" FT CDS 0..0 FT /product="Gypsy-16_DR2p" FT /translation="EKTQVKGATSPVGYPTSSNHESFKLRTVSVEGHTETK FT RNNNCPEKRKQLFNQNACDAPPLRNLPRGLVGVKCTAQITVGNKRVSCLLD FT TGSQVTTVPWSFYQENLSNCPLKSLDNLLEVEGANGQTVPYLGYVELTLKF FT PREFLGTETEVPTLALVVPDLMNTPQVLIGTNSLDALYSNYVQQSASFPQS FT NFHGYRAVQKVLEARYKQASADVVGCIKFKGHVPEVVPAGCTVVLDGHVLV FT NCPHVGKCVALESPTSPALPGGLLVASCLHSLPSKRHQQLPVVLRNETQTD FT ITIYPRTIIAEMRAVQEVIKSGQVNSSTVNKELSACSNLKFDFENSPLTPE FT WKKRITDQLNSMPEVFALHDLDYGHTNKVTHRIKLNDETPFKHRPRPIHPQ FT DIDAVRKHLQDLLAAGIIRESESPFASPIVVVRKKDNSVRLCIDFRKLNSQ FT TIKDAYALPNLEEVFSALTGSKWFSVLDLKSGYYQIEMEEADKSKTAFVCP FT LGFWEFNRMPQGITNAPSTFQRLMERCMGDLNRKEVLVFIDDLIIFSESLE FT EHESRLMHVLKRLKEYGLKLSPEKCKFFQTSVRYLGHIVSENGVETDPVKI FT EALKTWPRPRNLKELRSFLGFSGYYRRFIQDYSKIIKPLNDLTVGYPPLQK FT RHLQENKNKQYLDPKKEFGDRWNQPCQQAFDMIIEKLTSAPVLGFADPKLP FT YVLHTDASTTGLGAALYQKQEGQMRVIAFASRGLTRSESRYPAHKLEFLAL FT KWAVTSKFSDYLYGTEFVVVTDSNPLTYILTSAKLDATSYRWLSSLSTYNF FT KLQYRAGSQNCDADGLSRRPHGELLDDPASQKERERIKQFTLHHLDEFGVE FT DSLILPEAIKAICDRHQIGNSSHKCKFSNPSIALVESLALHADVLPNEFEQ FT ENEHGLPVIPYLSNEELKRQQRMDPDLKFIIDCLQRNEKPSSSKDQSLAVT FT LWIREWSRLELRDGLLYRKKQDQESTHYQLALPVALRGTVLKSLHNDMGHM FT GMERTLDLVRTRFFWPKMSSSVEEKIKTCERCVRRKAFPEKAAEMMNIKTT FT 
RPLELVCMDFLSLEPDQSNTKDILVITDHFTKYAVAVPTRNQKAQTVARCL FT WENFLVHYGFPERLHSDQGRDFESSLIKELCLVAGIHKVRTTPYHPRGNPV FT ERFNRTLLQMLGTLENKKKSCWKEFVKPLVHAYNCTRNDVTGYTPYELMFG FT RQPRLPVDLAFGLPVDRSTKSHSQYVKDLKEGLRESYEIAIKNSAKVAQRN FT KRRFDKHVVVSTLDVGDKVLVRNLRLRGKNKLADKWEPDVYVVIRKAGDLP FT VYVVQPDGKTGPVRTLHRDLLRPCGYLSENEIEEMSPPNVQRKPRTRSSSA FT LEYAPKEHQMSDQSESEDDSLYIRNAGRQLESITTTVLPSSQSPVLVRNLP FT GIEPIEPLPVVVNPEKETLPDSRLEEDLTENQRDDVNENFLPVLNPADIDP FT KEIEPERSGNSVEVQIHRRALELDPVDVPHSNDQNPHVRNVSSNQPIVDED FT LDTSGPRRSKRQCRPPNKLEYHKLGNPLTLVIQSLLQGLSSAFTTSLEEPI FT LTRDQPFVVPDPFPIAVTTQPRTCPRTCLNSGGE" XX SQ Sequence 6775 BP; 2156 A; 1428 C; 1416 G; 1775 T; 0 other; taccaaaaag tggcgagagc cagccaggag agagattgca acaacagtgt ctaattacag 60 tattcgagtt cactataaat attacaaatc ggagagaact ttaacgtcat ggatatcata 120 gaaaaagaga atgtagatat ctcaaaagca gtaattgtgg gtggaatgac actgactgag 180 acagactcag atttagagtc atggctttta agatatggta gtattaaccg acatcttcta 240 attgatgacc ctgactgtga gtttcatcga catgctatca tagagtttac acataactcc 300 gcgatgaaaa cattgatgcc tcttttgcct ttaactgtag ttagtatgtc aaacccaagt 360 accactttca tggtacgtgc tttaagctgc gtttaccccc atattgctag tgatagtgct 420 actaatggat atctggagga attgcaaaac attgctagtt ttagtgggaa atccattgag 480 gaagtactcc aaacagagtt actgaagatt aaatttggtc cttctcatgc tgagtcacta 540 cctgttttgg ataaaaagct tgaatttcca aatgcagcac gttctcaaat acttgatcgt 600 agcacagtca gttcaccaaa tagactgctg tccccagtca tatcacaaag tatgattact 660 gaacaaacag cttttccttc atctagaatt tcaccatttc atgaagttga atcaacaaat 720 tcaaaaaacc tgtccaagga aagccttaac catagaagta ctaaacccac agtaacagtg 780 tcatcccatc cagcacttac catggatata attgatcctc ctagcgtgca aaaggtagta 840 gttgagcaca ttgtccgcac aaatgacaca gctccaatgc accatacctc ttttcgcctc 900 cgatctttct ctggaaaaat tcctagacct gttaatgagc cagattttga cacttggcgt 960 gccagtgttg atctcctact gacagatcct tctatatctg acttaaatcg agccagaaaa 1020 atcatagaca gtctgcttcc ccctgctgca gatattgtta aacatgtctc ccctaacagt 1080 ttacctgcag tatatctgga attgctggag tctgtatatg gctctgtaga agacggagat 1140 gagttattag 
ccagatttat gaatagcttc caaaacaatg gtgagaagcc ttcaacttac 1200 ctgcacagat tacaagttct cttaagcaca gctattcgac gaggtgggat atttgaagaa 1260 gagagaaacc gatatcttct aaagcagttt tgtcgcggct gttgggacag ttccctcatt 1320 gctgaccttc aattagaaag gagaaaagcc actcctcctt catttgcaga attagtagtt 1380 ctcatccgta cagaagaaga taaaaatgcc tctaaagaag aaagaatgag aaaacattta 1440 gggctaaata aacactatcc tgccccctcg aaattcagac tgtcagctca ccagatatct 1500 gcccaccaaa gtgaaacgca ggatgatcaa actgacacat ctctcgcaaa gcaagtgtgt 1560 gaacttcaag ctcaagttgt tgcactgcaa aagccttcaa gccagaaaga aaagaaaaaa 1620 aatgcaaaac cagatgaagt gagtgagctg agaaatgttg tcactgagtt acaggcacag 1680 attacagcca tgcaaactac agccactcca aaaattaaaa gtgatgtaga agcaactgaa 1740 attgctgact taaagagaca gattgctgat ttaaaggttc aactgactgc ccctgatatg 1800 tatagaaacc gcaccagaaa cttgctgcct gaacctagag caacagattg ttacagagct 1860 agtaaactac ctgaaagtag acctcgtccg gggtattgtt ttagatgtgc ggaagatggt 1920 catcttgcca gcagctgtag taatgctcct gaccctacta aagttgctga gaaaaaacgc 1980 aagttaaggg agcgacaagc ccagtgggat acccaacaag tagcaatcat gaatccttta 2040 aactgaggac ggtctctgta gaggggcata cagagactaa aagaaataat aattgccctg 2100 agaaacgtaa acaattgttc aaccaaaatg cgtgtgacgc accccctttg agaaatttac 2160 caagaggatt agtgggagtg aagtgtactg cccaaataac tgttggtaat aaaagagttt 2220 cctgccttct ggacacaggg tcccaagtaa ctactgttcc ctggtcattt tatcaagaga 2280 atttatcaaa ttgtccactt aaatcattgg ataacttgct ggaagtggaa ggggcaaatg 2340 gtcaaacagt gccttatctt ggatatgtgg aattgactct taagtttccc agagagttcc 2400 ttggaacaga gacagaagtg cccactttag ccctggtagt cccagatttg atgaacactc 2460 cccaagttct aattggcaca aattcattag atgctcttta cagcaactat gtccaacaat 2520 ctgcttcctt tcctcaatct aacttccatg gttaccgtgc agtgcaaaaa gttttagaag 2580 caagatacaa acaagcaagt gctgatgtag tgggctgtat caaattcaag ggacatgttc 2640 cagaggtagt acctgcagga tgtacagtgg ttcttgatgg acatgttcta gttaattgtc 2700 ctcatgtagg gaaatgtgta gctctagagt caccaacttc acccgcttta cctggtggtt 2760 tgctagttgc cagctgtttg cattccttac ccagcaaaag gcatcaacag ttaccagttg 2820 tgttacggaa tgaaactcag 
accgacatta ccatctatcc cagaactata attgctgaaa 2880 tgcgggcagt ccaagaagta ataaagagtg ggcaagtaaa ttccagcact gtcaataaag 2940 aactttctgc ttgttccaat ctcaaatttg actttgaaaa ttccccattg acacctgaat 3000 ggaagaaacg aataacggat caattaaatt ccatgcctga agtcttcgcc ttgcatgact 3060 tagattatgg acatacaaac aaagtcactc accgaataaa gcttaatgat gagactcctt 3120 tcaaacacag acctcgaccc atacatcctc aggacattga tgcagtacga aaacatttgc 3180 aagacttgtt agcagctgga attatccgag agtcagaatc cccctttgcc tcccccatag 3240 tagttgtaag aaagaaagac aattctgtac gtctttgcat tgacttcaga aagctgaact 3300 cacaaaccat taaagatgcc tatgccctgc caaatctgga agaggtcttt tcagcactaa 3360 ctggttcaaa atggttctct gtccttgact taaaatcagg atattatcag attgagatgg 3420 aggaagctga caaaagtaaa actgcctttg tgtgtccctt ggggttctgg gagttcaata 3480 ggatgcccca aggcattacc aatgccccaa gtacgtttca aaggctgatg gaaagatgca 3540 tgggtgactt gaatagaaaa gaggtgttgg tcttcatcga tgatctgatc attttctctg 3600 aaagtttaga agagcatgaa tcaaggctga tgcacgtttt gaaaaggctc aaagaatatg 3660 gactgaagct atcgcctgaa aagtgcaagt ttttccagac ttctgttcga taccttggtc 3720 atattgtatc agaaaatgga gtggagactg atccagtgaa aatcgaggcc ctaaaaacct 3780 ggccaagacc aagaaatctc aaagaattaa gatcttttct gggattttct ggatactata 3840 ggaggttcat tcaggattat tccaagataa tcaaacccct taatgacctc acagtagggt 3900 atccacctct tcaaaaacgt cacctacaag agaacaagaa taagcaatat ctggacccca 3960 aaaaggaatt cggagacaga tggaatcagc cctgtcaaca ggcctttgac atgattattg 4020 agaaactcac ctctgcacct gttctgggat ttgcagaccc aaagcttcct tatgttctgc 4080 atactgacgc cagtaccact gggcttgggg cagccttata ccagaaacaa gagggacaaa 4140 tgcgggtcat tgcttttgca agcagagggt tgacaagaag tgaaagccgg tatccagctc 4200 acaagctaga atttctagct cttaaatggg cagtcacatc taaattcagt gactatttgt 4260 atggaacaga atttgtggtc gtaactgata gcaacccttt aacttacatt ctgacatctg 4320 caaagcttga tgctaccagt tatcgctggt tgtcaagtct gtccacttac aatttcaagc 4380 tccagtatag ggctggaagt caaaactgtg atgcagatgg cctttcaaga cgaccacatg 4440 gtgagctttt agatgaccct gcatctcaga aagagagaga gagaattaaa cagtttaccc 4500 ttcatcattt agatgaattt ggagttgaag 
attctcttat cctcccagag gccataaaag 4560 ccatctgtga tcgacatcag attggaaatt cctcacataa atgcaaattt tccaaccctt 4620 ccattgccct tgttgagtcc ttagccctgc atgcagatgt attaccaaat gagtttgaac 4680 aagaaaatga gcatggtctt ccagtcattc cttacctgtc caacgaggag ttaaagagac 4740 agcagagaat ggatcctgat ctcaaattca ttatcgattg tttacagcgg aatgaaaaac 4800 cttctagttc aaaagaccag tcgcttgctg ttactctgtg gataagggaa tggagcagat 4860 tagaattaag ggatggattg ctttatagga agaagcaaga tcaggaaagc actcactatc 4920 aattagcctt acctgtagct ttacgtggaa cagtgttgaa gagtctccat aatgatatgg 4980 gacacatggg catggagagg acacttgacc ttgtcagaac cagattcttt tggccaaaaa 5040 tgtcatcatc tgtggaagag aaaattaaga catgtgagag atgtgtacgt agaaaagcgt 5100 ttcctgaaaa agcagctgaa atgatgaaca tcaagactac cagaccattg gagttggtct 5160 gtatggactt cttgtcttta gagccagatc agagtaacac caaagatata ttagttatta 5220 cagatcactt caccaaatat gcggtggctg tgcctaccag aaaccagaag gcgcagactg 5280 tggctagatg tttatgggaa aactttctag tacattatgg atttccggaa agactgcaca 5340 gtgatcaagg acgagatttt gagtcaagcc tcatcaaaga gctatgtctc gtcgcaggta 5400 tacacaaagt gagaactact ccttaccacc caagaggaaa tccagtggag agattcaata 5460 ggacccttct ccaaatgttg ggtactcttg aaaacaaaaa gaagtcatgc tggaaggagt 5520 ttgtcaagcc tttggtgcat gcctacaatt gcactcgaaa tgatgtaaca ggatacactc 5580 cctatgaact tatgtttggt agacagccca ggctgccagt cgacttagcc tttgggttac 5640 ctgtggatcg ttctaccaaa tcccactctc agtatgtaaa agatctgaaa gaaggtttaa 5700 gagagagcta tgaaattgcc atcaaaaact ctgcaaaagt agcccaacgt aacaagcgca 5760 gatttgacaa acatgtggtt gtttctactc ttgacgtggg agataaagtc cttgtgcgaa 5820 atttgaggct aagaggcaag aacaaactgg cagacaaatg ggaaccagat gtctatgtag 5880 ttatccgtaa agctggagat ctcccagtat atgtagtcca gcctgacgga aagaccggtc 5940 cagttcgaac tttacacaga gacttacttc gaccttgtgg atatttgtct gaaaatgaaa 6000 ttgaagaaat gagtcctcca aatgttcaac gtaagcccag aactaggtct agctctgctc 6060 tagaatatgc tcccaaagaa catcaaatga gtgatcagtc agaatctgag gatgactctc 6120 tatatattag aaatgcagga cgccagttgg aatccattac aacgactgtg ttaccttcat 6180 cacaaagtcc agtgcttgta aggaacttac ctggcataga 
gcccattgaa ccactccctg 6240 tcgtggtgaa ccctgaaaaa gaaaccttac ctgattccag attagaagaa gacttaactg 6300 aaaatcaaag agatgatgtc aatgagaatt tcttacctgt gctgaaccct gctgatattg 6360 accccaaaga aattgaacct gaaagaagtg gaaatagtgt tgaagtacag atccatagaa 6420 gggcgcttga attagatcct gttgatgtac cccattctaa tgatcaaaat cctcatgtca 6480 gaaatgtttc aagtaaccag ccaatagtgg atgaagacct agataccagt ggccctagac 6540 gctcaaaaag gcaatgtaga cctcctaata agcttgagta tcataaacta ggaaatccct 6600 tgacactagt cattcagtcc ttactacaag gtctgagttc tgcctttacc acatcgttag 6660 aagaacccat actcactaga gatcagccct ttgttgtgcc agaccctttt ccaattgcag 6720 tgacaaccca accccgtaca tgcccgagga cgtgcctgaa ttcagggggg gaatg 6775 // ID DIRS-8_DR repbase; DNA; ZEB; 6621 BP. XX AC . XX DT 12-DEC-2008 (Rel. 13.12, Created) DT 12-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; DIRS-8_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6621 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(12), 2161-2161 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX SQ Sequence 6621 BP; 1850 A; 1868 C; 1054 G; 1845 T; 4 other; gtgaagttta ttcataaact aatttcgaga ggagcacgtg attatgattg aacacggctg 60 gtcctgcatt agcatgcttg atccaccaat caggccattc ctaaccacta taaagagcca 120 gggttttctc actacagtca tcttcgattt gaagaataca ctgctctgca tcagctgcta 180 ctgctactgc tactgctact gctactgcta ctgctactgc tactgctact gctactgcta 240 ctgctactgc tactgctact gcagctacat caacttctcc agctacatca acttctccag 300 ctacaacttc tccagcaaca acatcaactt ctccagctac atcaacttct ccaacaacaa 360 caacttctcc agctacatca acttctccaa caacaacaac ttctccagct acatcaactt 420 ctccaacaac aacaacttct ccagctacat caacttctcc aacaacaaca acttctccag 480 ctacatcaac ttctccaaca acaacaactt ctccagctac atcaacttct ccaacaacaa 540 caacttctcc agctacatca acttctccaa caacaacttc tccagctaca tcaacttctc 600 cagctacatc aacttctcca acaacttctc cagctacatc aacttctcca gctacaccaa 660 cttatccaac tacaacttct caaccttcta cttcaaattg cctcacacaa tgcaatctct 720 gtgtcttcaa cccaattctc caacaagatc acagtaaaaa ctcgatcacc atccatgaac 780 tttgcctaaa ctctcgatgc ctgcgattca agctgaactc tcttctaggg gtgcaccagc 840 ggtaacatac atgacacttt ttaaggggaa gcctcacaaa ctacaactaa gattaaacat 900 ccatctttaa aggatgattc tagcgtgaat agaatgactc acctctcaat caaggttgta 960 gacaatccaa tccttaaaac atcagtccac aacttatgca tacattgata aacatccatg 1020 ttaatcctcc aaactagtcg ctgtggtgca ttcaatagag tagcgtctcc aatcataaca 1080 ctagttctct ggcaaacaaa atggccgcca gcctttttct gcaactttga ttgacactcc 1140 ttcgagccaa tagctgaaag gagaagcgtc accatccaat gaactctcaa aaaatttgag 1200 atggtcccac cctctctcat gacaagctca tgaatgatta tcattacatt tacacaatta 1260 ctactaaatg gtgacattaa ttttaagttc agggtttgat gttggcagat gataaataaa 1320 taaatacata aattaataaa taaataataa agaaaataaa taattataaa atgtaaaaaa 1380 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaact gccacatctg atagcagttt 1440 aacttctaaa aagtttactc tgctactgaa aaacattcaa cgcattaggc atagaagtcc 1500 aatcgtaata tgccccagtg actcatgtcc atacacttat aagccataag cttataatgg 1560 caagttggtt agatgtgaaa acagcaaaga aacaaacttc ccgctttgac ttccacgaat 1620 gagaggagga catttctgac atgtctgatg 
tttcatcaat tgactctgcg ttagatacaa 1680 accaaggggt ttaaactcca ctataattcc acctaccctg tcttcaagag gtacatcagg 1740 atcactagac gagcagattt cccatttaat atgccatcct taggagggac aaatgcctac 1800 ctctactatc tttgctcgtt gcgacaatga ggcttcgtac catgcattaa caaaggcact 1860 tccacactta gttgcccatg ctcttattaa gacgccccgc atggttatca gctaaaataa 1920 ttcattgtgt ttgccataca tgtaccatgt tgtaatgtca tttctgattc tctcttgttt 1980 ttctcatttc agagatcatg tcagctggtt gcagaagctg accctcatcc aacctattac 2040 actccacatc catttctaaa tacatttaca ctctttgcaa atgacacttt tccactcaca 2100 tagcccacac cctcgatgcc atattcacac tagccttttt aatttaagtc ttctgaacta 2160 agtatgacat ttcaatccaa cgatacacct aaccatatct gatctatctt cgcaggatga 2220 agggatgcag gatgatatcc gttcccccac acacctgttc taaacaggcc cggattggct 2280 aatcgggagg accgggagaa ttcccggtgg gccggtctgt tttttttttt tttggccgcg 2340 agggccggtg tccctagctc cagaatctgt tgctctcagc agtgacactt ttttaattca 2400 tttacttgac cacagccttt ttattcatta ttttaccaca acttttctct tttgttgtgg 2460 tcgagtggtt ggcacgttag gttaagacgc tgcggacctg ggttcgatcc ttgctagagt 2520 aatttagtgt tttcattttt attgataaga catataatac tgttagggtt gtagaacatt 2580 tgaagttcta aagcagctgt tttctcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2640 aaaaaaaaaa aagacatgat agtgccatta gaaactgatt tggaaatatt tatttactta 2700 ttttaatata gtcagtcgtg aactgaggtg ggctggtctg aggcttgaaa ctccagagct 2760 gaaaaggtgt cccactccgg ccctggttct aaacctctta gcatttttac accacagaag 2820 agcaatacca cctcggggaa gcccagccga tcctctccag ccaacttcca ccatcatgca 2880 ttattgggta tgaagtatga ctctgtcatt tccttcctat ttcccttcct atttaacata 2940 ggagctcctt cactaccctt ccctgttttt ctagctttaa ctttttctct ccaaatttaa 3000 actatcctat acttctattc actttcattc accacagctc cgaccaattc aggggttgtc 3060 ttgagtttca gttgctgcaa caggaacgtt ctaaatgagc ttgcatacac ttacaaattt 3120 tctaaatacc actatcgttg cactccccgc actaatacag cagtaaatgc ttctgcggga 3180 gcacaagttg cctccatact ggcccactac accactgcag ccgcagaaat gctccgcaga 3240 gttgcacacc caaagcacca ctgctaccgc agtgaccctc tagcatatgc cttattcacc 3300 ttcatcttta catatcacta aaggcagtat ttaacgtccc 
tgcgggagca ctcaaaactt 3360 actaatatcg actgtttagg tgctccactt aactatacac cccattaaca tcactgcaac 3420 tgcagtgacg ctctagctga gccccttttt cactttcatc tctaaatatc actgctaata 3480 gtacttagta tttccgtgtg aggctctaat actgagtacc actgcacctc tgctmytgca 3540 gagacgcttt gccgagcctt attctccctc tgcaccactg ctgcaacagt taagctttgc 3600 ctgagctgca actgcagaga cggctgttta gccttaattc tctgcaccac tgattttatt 3660 gcacttctgt aactgcagag aagctctgct gagattattt tttcccctgc accaatgctg 3720 cagcagtgac gccctgcctg aacagtcttc caacaacaga tataaactct gtcattggtc 3780 ctctaacacc tctaaatcct caatttgcct cagccaatat ttcctcagag aagcccagct 3840 gattctcggt tgcagacttc aacctagttt ccaagcagca cgactctgtc tgttcttcct 3900 actttctgtc ttagttgcat aggactcatt tttactaaac tttccttttt attttattaa 3960 acttcccttc tcttyctctt cataattttt caatttcact tgcactcatt ctccgcagct 4020 cggactccca caggggtagc cccaagctcc cactgccaca gcagtgaagc tctttaarag 4080 ctcacatgta cacttttcaa aaaatttaac cattatcaca ctccgccgca ataatacagc 4140 cgtaaaggcg tagcgggaac tcgtattgct tccatactga ctacaattac acctctacag 4200 cagctccact taattacatt gcattgtagc atcaatttat tctccatcac cacactcccc 4260 gcaacaatac agccataaag gcttagcggg aactcgtatt gcttacatac tgaccaccat 4320 cacaccttta cagcagctcc acttaattac attgcattgt agtatcaatt tattttccat 4380 caccacactc cccgctataa tacagctgta aaggcttagc gggaacatgt attgctttca 4440 cacatataat gcaccagact ctgaatacct atgcacccct gcagccacag agtcgctcta 4500 ctgtgcctga ttcctatctg ctacacagct gttaatcagc tctccaaagc gcacttccct 4560 ttagaaactg actttcaccg cacctctgca gccgcagaga cgctcagccg agcatcactc 4620 ccctctacat cactgccgca gcagtgacgc ttagcgagcg catatacact tccacttagc 4680 aatccactac tgatacactc tccagcactg ccaaagcagt taaacgtctc tgcggaggcg 4740 tatttacctt ccagatactg acttccattg cacctctgca gcagcagaga cgctcagccg 4800 agcatcattt acatcccctg caccactgct gcagcagtga cgctctgctg gagctcatgc 4860 acacttccat tacacttata ccacttctgt cactcccagc actgctaaag cagttaaaac 4920 gctgctgcag aagcgtattc ttccccttag gactgactac caccgcacct ctgcagtcgc 4980 agagacgctc agccgagcat cactcccctc cacatcactg ccgcagcagt 
gacgctcagc 5040 gagcgcatat acactttcat ttagcaaccc aatactgata cactctccag cactgctaaa 5100 gcagttaaac gtccctgcgg aggcgtattt cccttacaaa tactgacttc cattgcacct 5160 ctgcagcagc agagacgctc agccgagcat catttacatc ccctgcacca ctgctgcagc 5220 agtgacgctc tgctggagct catgcacact tccattacac ttataccact tctgtcactc 5280 ccagcactgc taaagcagtt aaaacgctgc tgcagaagcg tattcttccc cttagtactg 5340 actaccatcg cacctctgca gtcgcagaga cgctcagccg agcttcatcc cccctgcacc 5400 actgctgcag cagtgtcgct ccaccggagc tcacgcacac taccatttat ccataccact 5460 actgacacac tccccattgg actgccaaag cggttaatca gttaaccgct tctcatctgc 5520 agctacagct ccgcagaacc ttacagcttt attttccatc actactatac taaaccttta 5580 cgaaactcac cacatatgtt ttaaaccttt acacttaccc cactcactct cccgctggtc 5640 cttacaatta acagaacccg ggagcacaca tagtcataca taagcacttt cagttaattt 5700 ttacacccac accagtctct gttgctcctc caagctattt ctgtatcact tttcagcagc 5760 cggatatggc attaatctcc tgtgcctttt ggggggttct tcaaatacgc ggctgctgtc 5820 ccgagcggag cattttgggg agttgtcgag atctacctga gctcgaggct cccctctctt 5880 cctccaaacg ggagggagcc cagggctcaa gaaccttcga gctcagggct ctctcccggg 5940 acagcatgcc aaacttgctt ataatcaatc atcagctaag tgtgaactct tgaagtgaag 6000 tttattcata aactaatttc gagaggagca cgtgattatg attgaacacg gctggtcctg 6060 cattagcatg cttgatccac caatcaggcc attcctaacc actataaaga gccagggttt 6120 tctcactaca gtcatcttcg atttgaagaa tccccccttc cacccctacc ttttcacctt 6180 tccctccata gggcagcacg gtggctcagt gactagcact gtcgcctcac agcaagaacg 6240 tcaccggttc tagttcctta acaggccggt ggtcgtttct gtgtgtagtt tgcatgttct 6300 tcccgtgctt gcgtgggttt tccccgggtt ctccggtttc ctcccacatt ccaaaaacat 6360 gtacaacaag ttaatcgtta aatctaaatt tcaatacagg taatctaata atgcagcata 6420 tcttttaata gccttcaatc ttaatcttta gctattataa aaaggggagt tgtcgagatc 6480 tacctgagct cgaggctccc ctctcttcct ccaaacggga gggagcccag ggctcaagaa 6540 ccttcgagct cagggctctc tcccgggaca gcatgccaaa cttgcttata atcaatcatc 6600 agctaagtgt gaactcttga a 6621 // ID GYPSYDR1_LTR repbase; DNA; ZEB; 294 BP. XX AC AL591405; XX DT 04-MAR-2002 (Rel. 
7.02, Created) DT 04-MAR-2002 (Rel. 7.02, Last updated, Version 1) XX DE LTR of putative novel retrotransposon GYPSYDR1. XX KW Gypsy; LTR Retrotransposon; Transposable Element; LTR; KW retrotransposon; GYPSYDR1_LTR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-294 RA Jekosch K.; RT "Gypsy-like element from D. rerio (LTR)."; RL Repbase Reports 2(2), 11-11 (2002). XX DR [1] (Consensus) XX SQ Sequence 294 BP; 60 A; 112 C; 50 G; 72 T; 0 other; gagcttcatt tgacatgact ccctgcgcta acgcttgtct ctctctgtct ctcccacagc 60 cgagtccagc tcgtgaccga tccagcactc tacatctcac cagccttcac catctccccg 120 gagcacttga gcacgcacct ttaagaagag cttcatttga catgactccc tgcgctaacg 180 cttgtctctc tctgtctctc ccacagccga gtccagctcg tgaccgatcc agcactctac 240 atctcaccag ccttcaccat ctccccggag cacttgagca cgcaccttta agaa 294 // ID CR1-40_DR repbase; DNA; ZEB; 3349 BP. XX AC . XX DT 25-FEB-2009 (Rel. 14.02, Created) DT 25-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; AP endonuclease; CR1 clad; CR1-40_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3349 RA Bao W. and Jurka J.; RT "CR1-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 525-525 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX FH Key Location/Qualifiers FT CDS join(103..306,310..2955) FT /product="CR1-40_DR_1p" FT /translation="MYTLSLPTTDTHFGKLAPSLHIEDRTLSSLTTLCLQT FT QQQSPLSGSHNRSDEGNAAGKEEREPASSSDDAVDFDLRSPPFCWLTFSHW FT TTNFASYGRAFPSNGKQETAALSALQKPGYLRRYQTQLLNFRGSPCTARIE FT RKNSQGKAKEEVFVSLLTTHGVMRGTYTQLRSFCSPDLEYLMLRCRPYWLP FT REFTAVIITAVYIHPQANTEQALRELYGSISEQETAHPEAAFIVTGDFNNA FT NLRKIAPKYYQHITTNTRGDRILDHCYSPFRDAYKSLPRPPFGKSDHSSVL FT LLPAYRQKLKREPPTLRTFQSWTDQSDSILQDCFDHVDWDMFRAACDDDIE FT VYSDTVTCFIRKCIEDXVPTKTVRIYPNQKPWINGEVRTALSVRASAYKSR FT NAEEQKQANYNLRKTIKAAKRQYRDKVEGQFNTNNARSMWQGLNYITDFKS FT NKPATVNIAASLPDELNSFYARFEAQDSARTLRAPAAETETASTLSVSVAD FT VXRSFRRVNIRKATGPDGIPGRVLKACAHQLAGVFTDIFNLSLSQSVVPTC FT FKTATIVPIPKSAKTTCLNDWRPIALTPIFSKCFEKLIKKHICSVLPAHTD FT PLQFAYRNNRSTDDAIAFTLHTALSHLENKNTYVRMLFVDYSSAFNTIVPA FT KLVVKLQALGLHSSLCNWILDFLSSRRQVVRMNNITSSTLILNTGAPQGCV FT LSPLLYSLYTHDCTAKHSSNVIVKFADDTTVVGLITDNDETAYREEVHTLT FT QWCEENHLSLNISKTKELVVDFRREKREHTPITINGTPVERVSTFKFLGVH FT IAEDLTWTAHTDAVLRKAQQRLFFLRRLRRFGMSPHILRSFYTCTVESILS FT GCITTWYGNSTSSNRKGLHRIVRTAGRVVGGELPSLQDIYTRRCMRKAKRI FT ISDSSHPSHRLFSLLPSGRRFRSIRSRTSRLKESFFPQTIRLMNT*" XX SQ Sequence 3349 BP; 895 A; 911 C; 736 G; 803 T; 4 other; catcaagatg gcgccgagca tggccgccgt gttgcgagct cccagcaaac tttgttgtgt 60 tttgtgtgtt ttacttgtat ttttgtcgtt tttctgtgct ggatgtacac actctcatta 120 cctacgacag acacacactt cgggaaattg gctccctcgt tgcacatcga agaccggact 180 ttgagttctt taacgacgct ctgtttacaa acacagcaac agagcccttt gtctgggtca 240 cacaaccgaa gcgacgaagg aaacgcagcc ggaaaagagg aaagagagcc ggcgtcctcg 300 tcagactgag acgccgtgga tttcgacctc cgctccccac cattctgctg gctaacgttc 360 agtcactgga caacaaactt tgcgagctac gggcgcgcat ttccttccaa cgggaaacaa 420 gaaactgctg cattatctgc cttacagaaa cctggctatc tgcggaggta ccagacacag 480 ctgttgaact ttcggggttc tccgtgcacc gcgcggatag aacgaaagaa ctcacaggga 540 aaagcaaagg aggaggtgtt tgtttcttta ttaacaactc atggtgtgat gagaggaaca 600 tacacccagt taagatcatt ttgttctcct gatctggaat accttatgct tcggtgtcgg 660 ccatactggc taccaaggga gttcacagct 
gttatcatta cggctgtcta catccaccct 720 caagccaaca cagagcaggc gctcagggaa ctgtacggga gcataagcga gcaggaaacc 780 gcacacccgg aggcagcgtt tattgttaca ggggacttta acaacgccaa tctcaggaaa 840 atcgctccaa aatactatca acacatcacc acaaacacgc gtggtgaccg gattctggac 900 cattgctatt ctccgttccg ggacgcatac aaatccctcc cccgcccacc gtttggcaaa 960 tcagatcact cttctgttct gctcttgcct gcttacaggc agaaactgaa acgggaacca 1020 cccaccctca ggacgtttca gagctggacg gaccaatcgg attccatact tcaagactgt 1080 tttgatcacg tggactggga tatgttccgg gcagcgtgtg atgacgacat tgaagtgtac 1140 tcagacacag tcacatgctt catcaggaaa tgcatagaag acrtggtccc aacaaaaact 1200 gtccgtatct accccaacca aaaaccatgg atcaatggcg aagttcgaac agccctatca 1260 gtgcgagctt ccgcctataa atccagaaat gctgaggaac aaaaacaagc aaattacaac 1320 ctcaggaaaa ccatcaaagc agcaaaacgt caatacagag acaaggtaga gggtcaattt 1380 aacaccaata acgcaaggag catgtggcag ggacttaatt acatcacaga ctttaaaagt 1440 aacaaacccg ccactgtaaa cattgctgcg tctctcccgg acgagctcaa ctcgttctac 1500 gcccgctttg aagcccagga cagcgcgcgc acactgcgcg ctcccgcggc cgaaactgaa 1560 accgccagca cactctctgt ttctgtagcg gacgtaasga gatctttccg tcgcgtgaac 1620 atccggaaag ctacgggccc agatggcatc cctggacgtg tacttaaagc atgcgcacac 1680 cagctagcgg gggttttcac ggacatcttt aacctctcgc tctctcagtc tgtggttccc 1740 acatgcttta agacagctac tattgtgccc ataccaaaat cagctaaaac cacatgcttg 1800 aatgactggc gtccgattgc tctgacaccc atcttcagca agtgctttga gaagctgatt 1860 aaaaagcaca tctgctctgt actgcccgct cacactgacc ctctgcaatt tgcatacagg 1920 aataaccgct ccactgatga tgcaattgct ttcaccttgc acactgctct gtctcacctg 1980 gaaaacaaaa acacatatgt gagaatgctg tttgtggact acagctcagc attcaacacc 2040 atagtgcctg ccaagctggt ggtgaagctc caggctctgg gtctacacag ctctctgtgc 2100 aactggatcc tggacttcct gtcaagcaga cgccaggtgg tcagaatgaa caacattaca 2160 tcatccacac tgatcctcaa cactggtgct ccacagggct gtgttctcag cccactcctg 2220 tactccctgt acacacatga ctgtacagcc aaacacagct ccaacgtcat cgttaaattt 2280 gctgatgaca caacggtggt gggcctaatc acagataatg atgagacggc ctacagagag 2340 gaggtgcaca ctctgacgca gtggtgtgag gaaaaccacc 
tctcactcaa catcagcaaa 2400 accaaggagc tggtggtgga tttcaggaga gagaagagag aacacacccc catcaccatc 2460 aacgggacac cagtggagag agtcagcact ttcaagtttc ttggagtaca catcgctgag 2520 gatttgacat ggactgctca cacagacgca gtgctgagga aggcacaaca acgcctcttc 2580 ttcctcaggc gtctcaggag gtttggaatg agcccccaca tcctccgctc gttctacacc 2640 tgcactgtgg agagcatcct gtctggctgt atcaccacct ggtatggaaa tagcaccagc 2700 agcaatcgca aaggcctaca taggattgtg cgaactgctg gacgcgtagt aggaggtgag 2760 cttccctccc tccaggacat ctacaccagg cggtgcatga ggaaagccaa gagaattatc 2820 agcgactcca gccacccaag ccatagactt ttctctctgc taccctcagg cagacggttc 2880 cgcagcatcc ggtcacgcac cagccggctg aaggaaagct tcttccctca gactatcagg 2940 ctgatgaaca cttaacacac cccacacaga ctcttccata cccctcactg cacaccatca 3000 atatgtagca tgcactgcac tttaaccaat ccatacttga aacaatactg cctacaacta 3060 tgtggacacc tattcattgt acatatcgct gtcaatttta cattgtcctg tttttttatt 3120 tttttttttg rggagtactg tgttattttg cactgtcgtt gtatttgcac tgtctgtatt 3180 ttgcactgtt gttgtatttg cactgtcatt ttatttgcac tgtctgtatt ttgtactgtc 3240 tggagccagc acctaagctt ttcactcatc atagcacacg tgctgctgat gatgtgacaa 3300 taaaagtgat ttgatttgat tttatttgat ttaytgcatc taaatggat 3349 // ID Gypsy6-I_DR repbase; DNA; ZEB; 5410 BP. XX AC . XX DT 17-NOV-2004 (Rel. 9.1, Created) DT 17-NOV-2004 (Rel. 9.1, Last updated, Version 1) XX DE Gypsy6-I_DR is an internal portion of the Gypsy6_DR LTR DE retrotransposon. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW GYPSY superfamily; Gypsy6-I_DR; Gypsy6-LTR_DR; Gypsy6_DR; KW endogenous retrovirus; gag; pol. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5410 RA Kapitonov V.V.; RT "Gypsy6_DR LTR retrotransposons from zebrafish."; RL Repbase Reports 4(10), 261-261 (2004). 
XX DR [1] (Consensus) XX CC Gypsy6-I_DR is an internal portion of the Gypsy6_DR LTR CC retrotransposon, whose LTR is deposited in Repbase as CC Gypsy6-LTR_DR. This retrotransposon belongs to the Gypsy CC superfamily. Some copies of Gypsy6_DR are flanked by identical CC LTRs. The internal portion encodes the 354-aa Gypsy6_DR1p gag CC (pos. 160-1221) and the Gypsy6_DR2p 1418-aa pol (pos. CC 1092-5345) polyproteins. The polyprotein is composed of the CC protease, reverse transcriptase and integrase domains. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy6_DR1p" FT /translation="MEDELRELRELVTQLKADNERLRQEQVPAALPGPSNI FT SIPVVSDPPLIDAGPSSTDRFVFVPRDRKCPKFSGRSGIDINEWVEEAEAC FT MRLRHLSSADRAFFLFDHLEGEAREEIRYRPQSEREDPKRVIQVLRDLYGC FT TKSYVALQESFFSRRQQEGETLEFSLALMSLLEMVKGQSPHGMPNAEILLR FT DQFVENVNDCTLRRELKQFVRRQPTATLVDVRGEALRWEREGMPGGARGRS FT QSVPSVYGIQYGVQGNRSVSSGAKSEMTELRDMLLKQQQQLNQLVQSMSLL FT QSCSPSLPPPKVNPVICRRCRQPGHFARNCDGGRMSVRAPSAQITSALTRR FT EQSSSSQPSEN" FT CDS 0..0 FT /product="Gypsy6_DR2p" FT /translation="MSSARPFCEKLRWWADVGPCTISPNNFCFNKTRTVIL FT QPAVGKLVPTKLQGHNLVGKRVGSNDHMPCLMSTCPKLVVTIGGVQVPCLV FT DTGSMVSTITESCFMTNFGLWGQEQLRSCHWLQLRAANGLSIPYIGYLELD FT IELCGRVVSGCGVLVVKDPPGGMCAQTPGVLGMNVLSRCYQELFGQHGTGL FT FDLPAVSQAPSFIFQALQNCHQAGVQPVKDAEGQVRVRGRRACRIPGGTVK FT FVATTCSVQYSGNAVLFEPPISGLPAGLLASPALLKVDGGTVYVPIVNVGT FT MDVVLYARTIVGVVSKVDVVALPPEVAEVNMVAARVSAQSSPSVQEQIATL FT DLSKLPEVEQGKVRALLLKYLPVFSSYDGDLGCTNLISHDIPLLDEIPVRQ FT RFRRIPPSEYEVVKAHINQLLETQVIRESSSPYASPIVLVKKKDGGLRMCV FT DYRRLNAKTRKDAFPLPRIEETLDSLAGACWFSTMDLASGYNQVPVTEKDK FT PKTAFCTPFGLFEWNRMAFGLCNAPSTFQRLMERLFGDQQCQSLLLYLDDI FT IVFSSSIDEHLARMEVVLSRLQREGLKAKLSKCAFFQREVRYLGHVISSEG FT VSTDPGKVEAVANWPCPTSVTELRSFLGFASYYRRFVEGFSKLAAPLHRLV FT AQLANPKQRKGNAHDFAASWSTECQSSFEGLKIKLTSAPVLAYADFSLPFI FT LEIDASQGGLGAVLSQEQQGKVRPIAYGSRSLRPTERNPSNYSSMKLEFLA FT LKWSMTEKFREYLLGQKCIVFTDNNPLSYLNSAKLGVMEHRWAAHLSAFDF FT EIKYRSGRSNRNADTLSRQNFSSTGEVQGLCPGVAVPAVLQQVAPDGLVTQ FT 
VNQVTAFPCPSGSDMGALQEADTVIGEVLVFWRRKLLPTSEERKQLSRLAV FT ILLRQWGRLVEIEGVLYRRVSRPDGGEEVLQVLLLAVMKTEVLTQLHQQHG FT HQGVERTSQLVRQRCYWPGMFADIARWCQECERCQCAKGTPSAPSSFMGHL FT LASRPNEILALDFTLMEPSRSGLENVLVMTDIFTKYTLAIPTRDQRAETVA FT QVLVAEWFCKFGVPGRIHSDQGRNFESTLIQQLCGLYGVVKSRTTPYHPAG FT NGQCERFNRTLHDLLRSLPPSKKSDWPLCLPQVLFAYNTTPHQSTGESPYY FT LMFGQEPKLPIDFLLGRVKEPSAGSVHGWIVEQQDRLQVAFEGARERLGAA FT ADRRKARHDLQVREAPLKEGQLVYLRDHSVRGRCKIQDLWSSVVYQVLKAP FT TESGVVYTIAPVADLSKTKHVHRSLLKAQLGPNLLPSPPEPAMGDGMQPSD FT EECDGDLLVLVPETPELRGRSRSQGSVPPVPRVVDEELEGRATGETVQPEV FT VPSNPVRAAEVVVRRTVRSTAGQHSNLHHLPRAVGTGTVSSIATNPFGTFF FT RPWD" XX SQ Sequence 5410 BP; 1239 A; 1187 C; 1482 G; 1502 T; 0 other; ttctggcgta gtcggcagga ttccccctca ttgtgtgcag agacgtgtgt ttgtttttgt 60 atattttcat ttttgtagga attgaattgc ggcagtctcc ctcccctatt ctttttggtg 120 agtcaacaaa cgttgacttt ttgcttgtac gttgtagcaa tggaagatga attgcgtgag 180 ttaagggagt tggtaactca attaaaagct gataatgagc gactacggca agagcaggtg 240 ccagctgcac tgccgggtcc atctaatatt tctattcctg ttgtttcaga tcctcccctc 300 attgatgctg gtccctcgtc aaccgaccga tttgttttcg ttcctcgaga ccgtaaatgt 360 ccaaaattca gtggccggtc cggaattgac atcaatgagt gggtggaaga agcagaggct 420 tgtatgcgtc ttcgccattt gtcttcagct gatcgggcat ttttcttgtt tgatcacctg 480 gagggagagg caagagagga aattcgttat aggccacaga gtgaaaggga ggatccaaag 540 cgggtgattc aggtattgcg cgatctatat ggctgtacta agtcttatgt agctcttcag 600 gagtcattct tttccagaag acagcaggaa ggggagactt tggagttttc cttagccctg 660 atgagccttc tggaaatggt taaaggtcag tcacctcatg gcatgcctaa tgcagaaatt 720 ttactgcgag atcagtttgt ggaaaacgtt aatgattgca cccttcgtcg tgaacttaag 780 cagtttgttc ggcgtcaacc tactgccaca ttggttgacg tacgtggtga agcacttcgg 840 tgggaaagag agggcatgcc tgggggagcg cggggccgaa gtcagtctgt tccatcagtt 900 tatggtattc agtatggggt gcaggggaat cgaagtgtta gtagtggggc aaagtctgaa 960 atgactgaat tgcgggacat gttgctgaag cagcagcaac aattaaatca actagttcaa 1020 agtatgtctc tgcttcagag ttgttcaccc agtttgccgc cacctaaagt taaccctgtt 1080 atttgcagaa gatgtcgtca gccaggccat tttgcgagaa attgcgatgg 
tgggcggatg 1140 tcggtccgtg caccatcagc ccaaataact tctgctttaa caagacgcga acagtcatcc 1200 tccagccagc cgtcggaaaa ctagttccca ccaagttaca gggccataac ttggttggga 1260 aaagggtagg ctctaatgat catatgccgt gtttaatgtc tacatgtccg aagctcgttg 1320 taactatagg tggggttcag gtcccttgtt tggttgacac cggttccatg gtgtccacca 1380 ttactgagag ttgtttcatg actaattttg ggctgtgggg tcaagaacag cttagatcat 1440 gtcattggtt acagcttaga gctgcaaatg gtctttcaat tccttatatt ggttatttgg 1500 aattagatat agagctttgt gggcgagtag tttcaggctg tggcgtgctg gttgttaagg 1560 atcctcctgg gggcatgtgt gcacaaacac ctggtgtatt gggtatgaat gtgttgagcc 1620 gctgctacca ggagctattc ggccagcatg gtacaggcct ttttgattta ccggcagtat 1680 cacaggcccc tagttttatc tttcaggcct tacaaaattg tcatcaggct ggagttcagc 1740 cagttaaaga tgcagaagga caagtcagag tgcgtggacg tcgggcgtgt cgcatcccag 1800 gtggcactgt aaaatttgtt gctacaactt gttcggtgca gtattctggt aatgctgtac 1860 tgtttgaacc tccaatttct ggtctccctg caggtttgct ggcctctcct gcgctcctaa 1920 aggtggatgg tggtacggtc tatgtgccca tagtcaacgt gggcactatg gatgtggtat 1980 tgtacgccag aactattgtg ggcgttgtga gtaaggttga tgtagttgcg ttacccccag 2040 aggtcgcaga ggtaaacatg gtggccgcta gggtaagtgc acagtcttct ccttctgtgc 2100 aggagcaaat agccacttta gacctgtcaa aactgcctga ggtagagcag ggtaaagtta 2160 gggcattgct gttgaagtat ttgcctgtgt tttccagtta cgatggtgat ttgggttgta 2220 caaacctgat atctcacgat ataccattgt tagatgagat ccctgtcagg cagcggttca 2280 ggcgcatccc tccgtctgag tatgaggtgg taaaggcaca tatcaaccaa ctgctagaga 2340 cccaggtgat tagagaaagt tccagtcctt atgcttcgcc cattgtcctg gttaagaaaa 2400 aagacggtgg tctgcgcatg tgcgtagact accgtcgttt gaatgcgaaa accagaaagg 2460 atgcattccc tctaccacgt attgaggaaa ctttggactc gctggctggg gcctgttggt 2520 tttccaccat ggacctagcc agtgggtata accaggtgcc tgtaactgag aaggacaagc 2580 ctaagactgc cttctgtacc ccttttggcc tttttgagtg gaataggatg gcgtttggac 2640 tgtgtaatgc cccaagcacc ttccaacgat tgatggaacg gttgtttggg gatcaacagt 2700 gccaatccct cctcctgtat ttggatgata ttattgtctt ttcctcctct atagatgaac 2760 atctggcacg gatggaggtt gtcctgagcc gtctgcagag ggaagggttg aaggccaagt 
2820 tatccaagtg tgctttcttt caaagggaag tgcgttattt gggtcacgtc atttcgtcag 2880 agggggtctc taccgatcca ggtaaagtgg aggcagtggc caactggcct tgcccgacca 2940 gcgttaccga gttgcgctca tttttggggt ttgctagcta ctaccgtcgt tttgtggagg 3000 ggttttccaa attggctgcc cctctccata ggctggtggc tcagcttgca aacccaaaac 3060 agcgaaaggg caatgcccat gactttgcgg cttcttggtc cacagaatgt caaagtagct 3120 ttgaggggtt gaaaattaag ctaactagtg ctccagtgtt ggcctatgct gatttttctc 3180 tgccttttat tttagagatc gatgccagtc agggaggctt gggggcagtc ctctcacagg 3240 aacaacaagg caaggtgcga ccaatagcat atggcagccg cagtcttagg cccaccgagc 3300 gtaatccatc taattatagt tcaatgaaat tggagttttt ggcactcaag tggtccatga 3360 ctgaaaagtt cagggagtat ttgttaggcc aaaaatgtat tgtctttact gataacaacc 3420 cccttagcta cctgaattct gccaagttag gtgtaatgga gcatcgttgg gctgctcact 3480 tgtccgcatt tgactttgaa ataaagtata gatcgggtag gagcaatcgt aacgcagaca 3540 ctttgtcccg gcagaacttt tccagtacag gagaagttca aggcctgtgt ccaggggtgg 3600 ctgttccagc tgtgttgcag caagtggccc cagatggatt ggtgacccag gtcaatcaag 3660 ttactgcctt tccctgcccc tctggcagtg acatgggtgc tctgcaggaa gcagacacag 3720 tcattggtga ggtgttggtg ttttggaggc gtaagttgct ccctacctcg gaggagcgta 3780 agcagctctc tcgcttggca gtcatcttgc ttcgccaatg gggccgcctt gtggaaattg 3840 aaggggtgct ctatcggcgt gtgtcacggc cggatggcgg agaggaagtt ctccaggtgt 3900 tactactagc cgtcatgaag actgaggtct tgacccagct gcatcagcag catggacatc 3960 agggggttga gcgtacttcc cagctggttc gtcagagatg ttactggccg ggtatgtttg 4020 ctgacattgc acgctggtgc caggagtgtg agcgctgcca gtgtgctaaa ggcaccccct 4080 ctgctcccag tagttttatg ggacatcttc tggcttctcg gcctaacgag attttagctc 4140 tcgatttcac tttgatggag ccttcaagat ctggcctaga aaatgtgttg gtcatgactg 4200 acatatttac aaagtacacc ttagctatac ccactagaga ccaacgggcc gagactgtag 4260 cccaggtcct tgtggccgaa tggttttgta aatttggggt gccaggtcgc atccactccg 4320 accaaggtcg taattttgag tccactttga ttcagcagct gtgtgggttg tatggagttg 4380 taaagtcccg tactactcca taccaccctg ctggaaatgg tcaatgtgag cgtttcaata 4440 gaacgttgca tgatctgctg cgttctcttc caccttctaa aaagagtgat tggccacttt 4500 
gtctccctca ggttctcttt gcgtacaaca ctactcctca ccagtcaact ggagaatccc 4560 cgtattattt gatgtttgga caggaaccta aacttcccat tgatttcctc ttgggtagag 4620 ttaaagagcc gtcagccggc agtgttcatg ggtggattgt ggaacagcaa gatcggttac 4680 aagtcgcttt tgaaggtgct cgtgagcgat tgggtgccgc agccgaccgg cggaaggccc 4740 ggcatgatct tcaggtaagg gaagcaccac tgaaggaggg tcaactagtt taccttcgcg 4800 atcatagtgt gcgaggtaga tgtaaaatcc aagacctgtg gagctcagtg gtctaccaag 4860 tcctaaaagc acctactgaa agtggagtgg tctatactat tgctcctgtt gcagacctca 4920 gtaaaaccaa acatgtacac agatccttgt tgaaggccca gcttggtccg aatctgcttc 4980 ctagtccacc tgaacctgca atgggtgatg gcatgcagcc ttcagatgag gagtgtgatg 5040 gagacttgct ggttctagtt cctgagacac cagagcttag aggaaggtca aggtcacagg 5100 gttctgtccc ccctgttcct cgggtggtag acgaggaact ggagggtcga gcaactgggg 5160 agactgtgca gcctgaagtg gtgccctcta accctgtaag agcagcagaa gttgtggtaa 5220 ggagaactgt gagaagtacg gcgggtcagc attctaacct gcaccacctc ccacgagctg 5280 taggtacagg gacagtgtcc agtatagcga ctaacccttt tggtacattc tttcgccctt 5340 gggattaggc tcttcgatta atgatgttta gttcaccgtc ggggcgacgt tgcagaaatt 5400 gggggtggaa 5410 // ID I-1_DR repbase; DNA; ZEB; 5283 BP. XX AC AL672145; XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE I-1_DR is a non-LTR retrotransposon from the I clade. XX KW Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; endonuclease; RNaseH; I clade; I-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5283 RA Kapitonov V.V. and Jurka J.; RT "I-1_DR, a first example of vertebrate non-LTR retrotransposons RT that belong belong to the I clade."; RL Repbase Reports 2(6), 18-18 (2002). XX DR Genbank; AL672145; Positions 25824 31106. XX CC I-1_DR is a non-LTR retrotransposon that belongs to the I clade. CC This element is characterized by 13-bp target site duplications. 
CC It may be an active element. It encodes two proteins: the CC 417-aa I-1_DR1p (positions 3-1253) and the 1250-aa I-1_DR2p CC (positions 1238-4987). These proteins are closest to the corresponding CC proteins encoded by other I-like elements. I-1_DR1p is a putative RNA/DNA CC binding protein (it includes a zinc finger), and I-1_DR2p is CC composed of the AP endonuclease (aa positions 9-222), reverse CC transcriptase (positions 500-730) and RNaseH (positions 968-1095) domains. XX FH Key Location/Qualifiers FT CDS 3..1253 FT /product="I-1_DR1p" FT /translation="MAASSGGGAVQAVGNDWIENGQEWGDGNDGSGNEVEE FT MEGSESCEPWTNCRGKKRKKRKNKLDSDEEMRSKVKEGNEEYNVFVRLVQE FT GATFEDWSPIQLTKALYKEIGEVRCAKKLRNGCLLVSCKDEAQQKKAIKVN FT KINGKKVKCSEVYDRKLIRGVITGIPVSESLNNVIEGITNAKIKEAKRLKT FT RWNGAICDSLSIMLTFDETKLPDKVFIGYMSYEVKMYIPPPVRCYKCQKYG FT HIAAICKGKMRCSKCSGEHEYGQCDKEAKLKCCNCGGEHSSAYRGCEVNKR FT MQEIQRIKVTQGISYAEATKKVRPTREQMGQMPTMREERKDMIKCKKCDKI FT KEETLIVDKNEFVLFMAEVINCSAQTTSRTERIKIIVKAAEKYMGIKGIPW FT ESVRNTLNEEVQQQQSQAWGGVA" FT CDS 1238..4987 FT /product="I-1_DR2p" FT /translation="MGWSCIMVIFILQWNARSLISNGQEFKKYIDDLNEKP FT DIVCVQESWLISRLEFNIKGYNAVRMDRKIGKGGGIITFIKKGIQYREVKR FT GNELEYVIVEVWSSEGNIKIINFYNPCRLLEREQLEEIWEGINGKIIWCGD FT FNAHSTLWGNRNDNNGRVIEEFIEEEELVCINDGTGTRLNTARGTESAIDI FT TIVTKDIADRCEWEVLRGNTVGSDHYPIKTQVGIECAKEIEVREEKWILER FT ADWDKFREISEDLLQKIEDNLDVENMCKRISGGIIEAAKMAIPKSKPKIIN FT KIVPWWTKECRKAIKERNKAFKKMKTTHNFQNLMKYKQAQAIVRKTVKKSK FT KEYWRQFCESIGRTTPVERVWGMIKKMKGNGKEYGYPMLMDGQRVIINNKE FT KAEIIARTLIKVHSTDNLSQEEKRGRVETYERYQFDLGKDDGDQVLNINFS FT GTELSRALKKLGKTAPGRDGICYTMLENLTDKGKEVLLKLYNKIWEVGVIP FT KEWKKAVIIPIKKPGKDPKQPTSYRPIALTSHIGKTMERMINDRLVYWVET FT KRKIGNYQSGFRKGRGTMDPILRLEDDIKKAQVNRESVIAVFLDIEKAYDM FT LWRDGVLIKLNQIGVKGRILRWVKEFLSERSITVKINGTFSECYSVENGTP FT QGSIISPFLFSIMFDGIFKEIENNTGVALFADDGAIWKRGRNITFIMKKMQ FT QILNTVQEWTVKWGFRISQEKTKAMLFTKKKIREDLKLKLGGKDLENVESF FT KYLGVWFDRRLTWNTHISKMVDKCKRVLNVMRCLCGVDWGASRVALKSIYT FT GLIRSVIDYGCMVYGSAANTTLKQLDVIQNQALRVCCGAMKTTPVAALQVE FT
MGEMPLHLRRDQLEVVYWANLKGHNENHIAQTVLMQCQEREKRGIKGYGWT FT IQQKINDMEIDTIKISPTIVFPVVPTWLLDDLEVDFEIMKEKQENEIDSKQ FT VENYIREKYSETTEIYTDASRIGQRVGVSFSIPKLKIEVTKRINNNLAVYT FT AELVAIWLALKWVEDNKPIKAVIASDSSSALISIKNVVSESRQDIIYEIVQ FT LGNNIIKSGVIISLLWVPAHIGVSGNEMADKLAKQAAQQTMIDMDIKYSKS FT EIKSIVKTKILGKWQHIWNNGSTGRQYYTIQNIVGKGRETRKNKKEEDKFS FT RMRFNHTSLNSTLHMINKHADGMCECNNQETVEHVLMHCPIYQTERNILFT FT QLQEKQVEPNIKNILKLSTGDVCFRYVYNYLKDTGLINRI" XX SQ Sequence 5283 BP; 2143 A; 603 C; 1253 G; 1284 T; 0 other; gaatggcggc atcaagtgga ggaggagcag tgcaggcggt tggcaacgac tggattgaga 60 atggtcagga gtggggagac ggaaatgacg gatcaggaaa tgaagtagag gagatggagg 120 gtagcgagag ttgcgaaccg tggacaaact gtagaggtaa aaagcggaaa aaaagaaaaa 180 acaaactaga tagtgacgaa gaaatgagat ccaaagtaaa ggaaggaaat gaagagtata 240 atgtgtttgt tagactagta caggaagggg cgacatttga ggattggagt cctatacaac 300 taacgaaagc tctgtataag gagattgggg aggtaagatg tgctaaaaaa ttaaggaatg 360 gatgcttatt ggtgtcatgt aaagatgagg ctcaacaaaa gaaagcaatc aaagtaaata 420 aaataaatgg taaaaaagtg aaatgctctg aggtctatga cagaaaactt ataagaggag 480 taatcacagg cataccggta agtgagtcat taaacaacgt gattgaagga ataacgaatg 540 ctaaaataaa agaagctaaa cgcttaaaaa caagatggaa cggagccata tgtgacagtc 600 tttcaataat gctgacattt gatgaaacaa aactacctga caaagtcttc ataggataca 660 tgagctatga agtgaaaatg tacataccac cgcctgttag gtgttacaaa tgccaaaaat 720 atggtcatat tgcagcgatc tgtaaaggga aaatgagatg tagcaaatgt agtggagaac 780 atgagtatgg acaatgtgat aaggaagcaa aactcaaatg ttgcaactgt ggtggagaac 840 atagttcagc atacagaggg tgtgaggtaa ataaaagaat gcaagagata cagagaataa 900 aagtcactca aggtatatct tatgcagaag caacaaagaa agttaggcct acaagggaac 960 agatgggtca gatgccgaca atgagagaag aaaggaaaga catgattaag tgtaaaaaat 1020 gtgataaaat aaaggaagaa acactgattg tggataaaaa tgaatttgtt ctctttatgg 1080 cagaggtgat aaattgctca gcacagacaa caagtcggac ggaaaggatt aaaataatag 1140 ttaaagccgc ggaaaaatac atgggtatta aaggaatccc ttgggaatca gtgagaaaca 1200 cgttaaatga agaagttcag caacaacagt cacaagcatg gggtggagtt gcataatggt 1260 aatatttata ctgcagtgga atgcgaggag 
tttaatatct aatggtcagg aatttaagaa 1320 atatatagat gatttaaatg aaaaacctga tatagtgtgt gtacaagaat catggttaat 1380 atcaagacta gaatttaata ttaaaggcta taatgcagta agaatggata ggaaaatagg 1440 taaaggtggg ggaattatta catttataaa aaaaggtatc caatatagag aagtaaagag 1500 agggaatgaa ctagaatatg ttattgtgga ggtatggtca agtgaaggta atattaaaat 1560 aataaatttt tataacccat gtagattgtt agagagggaa cagttagaag agatatggga 1620 gggtattaat ggaaaaatta tctggtgtgg agattttaat gcacacagta cattatgggg 1680 taacagaaat gataataatg ggagagtaat tgaagaattt attgaagaag aagagctagt 1740 gtgcattaat gacgggacag gaactagatt gaatacagca agaggcacag aatcagcaat 1800 agatattaca atagttacaa aagatattgc agataggtgt gaatgggagg tattaagagg 1860 taacacagtt ggaagtgacc actacccaat caaaactcaa gtaggaatag aatgcgcaaa 1920 agaaattgaa gtgagagagg agaaatggat tttagaaaga gcagattggg ataaatttag 1980 ggaaatcagt gaagatttgt tgcaaaagat tgaggataac ttagatgttg aaaatatgtg 2040 taaaagaatt agtgggggaa taattgaggc agcaaaaatg gcaataccta aatcaaaacc 2100 taaaataatt aataaaattg ttccatggtg gacaaaggag tgtagaaaag ctataaagga 2160 gagaaacaaa gcttttaaaa aaatgaaaac gacacacaac tttcaaaatc tcatgaaata 2220 caaacaagca caggcaatag ttaggaaaac ggttaaaaaa tcaaaaaaag aatattggag 2280 acaattttgt gagtcaattg gtagaacaac gccagtagag agagtatggg gaatgatcaa 2340 aaaaatgaaa ggaaatggaa aagaatatgg atatccaatg ctgatggatg ggcagagggt 2400 tattattaat aataaagaga aagcagaaat tatagcaaga acattaatta aagtacacag 2460 cacagacaat ttaagccaag aggaaaaaag agggagggtg gagacttatg aaagatatca 2520 gtttgattta gggaaagatg acggggatca ggtactaaat ataaatttct cagggactga 2580 gctgagtagg gcattaaaga aattagggaa aacggctcca gggagagatg gaatttgtta 2640 tactatgtta gaaaacctaa ctgataaagg aaaggaggtg ctgttgaagt tgtataacaa 2700 gatatgggag gtaggagtta ttccaaaaga atggaaaaaa gcagttatca ttcccattaa 2760 aaagcctggg aaagatccca aacagccaac cagttataga ccaatagccc taacatcgca 2820 tattggaaaa acgatggaaa gaatgattaa tgacagatta gtgtactggg ttgaaactaa 2880 aagaaagata ggaaattatc aaagtggatt taggaaaggt agagggacaa tggatccaat 2940 attgaggctt gaagatgata ttaaaaaagc acaggttaac 
agggaatcag taatagcggt 3000 gtttttagac atagagaaag catatgacat gttgtggaga gatggagtgt taattaaact 3060 taaccaaata ggagttaaag gacgcatatt gagatgggtt aaagaatttt tatcagaaag 3120 atccataaca gttaaaataa atggtacatt tagtgaatgt tacagtgttg aaaatggcac 3180 accacaaggg agtattatta gccctttttt gttttctata atgtttgatg ggatctttaa 3240 ggagatagaa aataatacag gagttgcatt atttgctgat gatggtgcga tttggaaaag 3300 agggagaaac ataacattta taatgaagaa aatgcaacag atattaaata cagtgcagga 3360 atggacagtt aagtggggat ttagaatatc tcaagaaaaa actaaagcca tgttatttac 3420 caaaaagaaa ataagagagg atttaaaatt gaaattggga ggtaaagatt tggagaatgt 3480 tgaatctttt aaatatctgg gagtgtggtt tgataggaga cttacatgga acacacatat 3540 tagtaaaatg gtggataaat gtaaaagagt gttaaatgta atgaggtgcc tatgtggtgt 3600 agactggggt gctagtagag tggcattaaa atcaatttat acagggttga taagatcagt 3660 gatcgattat ggatgtatgg tgtatggatc agctgctaat acaacattaa aacagctaga 3720 tgtaatccaa aaccaagcat taagagtatg ctgtggagcc atgaaaacca caccggtagc 3780 agcattacag gttgaaatgg gagagatgcc tctacacctt aggagagatc agctggaggt 3840 agtatactgg gcgaacttga aaggccataa tgaaaatcac atagcacaga cagttctgat 3900 gcaatgccaa gaacgagaga aacggggcat taaagggtat ggatggacaa tacaacaaaa 3960 aataaatgat atggaaatag atacaataaa aatttcgccc actatagtat ttccagtagt 4020 tcccacatgg ctgcttgatg acttggaagt tgattttgaa ataatgaaag aaaaacaaga 4080 aaatgagatt gacagcaaac aagtggaaaa ttatataaga gaaaaataca gtgaaacaac 4140 tgaaatatac acagatgcat caagaattgg acaaagggta ggagtgtcat tcagtattcc 4200 aaagttaaaa attgaagtta caaaaagaat taataataat ttagcagttt atacagcaga 4260 attggtagcg atatggttgg ctctaaagtg ggtagaggac aataaaccaa ttaaagcagt 4320 cattgcgtct gattcaagtt cagcactcat aagtattaaa aatgtagtat cagaatcacg 4380 gcaggacata atatatgaaa tagtacagtt gggaaataat atcattaaat caggagttat 4440 catttcatta ttgtgggtac cagcgcacat aggggttagt ggaaatgaga tggcagacaa 4500 gttggctaaa caagcagcac agcaaactat gatagacatg gacatcaaat acagcaagtc 4560 agaaatcaaa agtatagtta aaaccaaaat attaggaaaa tggcaacata tatggaataa 4620 tgggagtaca ggacgtcaat attacaccat acagaacata gttgggaaag 
gtagggaaac 4680 aaggaaaaac aaaaaggaag aagacaagtt ctcaagaatg agatttaatc acacatcact 4740 caatagcaca ctacatatga tcaacaaaca tgcagatggg atgtgtgaat gtaacaacca 4800 ggaaactgta gaacatgtat taatgcactg cccaatatat cagacagaaa gaaatatatt 4860 attcacacag ttacaggaaa aacaagtgga acctaacata aagaacatac taaagttgag 4920 cacgggtgat gtatgtttta gatatgttta taactatttg aaagacacag ggttgattaa 4980 tagaatttag tatttagatg aaaagttatc tatatcatat agaggtatta tagagaaaga 5040 aatatcaaag tttgggacat ttagtaattt atttatatgt ataggaaaaa gaaataaaaa 5100 tttgatttta aatttttttt ttttgggttt tttttttttt tttgtgattt ttatggttgg 5160 gctttgttta actataaaac atggtttagt taactccgga tccacactcc aatccagatg 5220 gtggcggtaa tgcaccaaaa gctggttgcc aaccgccata aaacacacaa gaagaagaag 5280 aag 5283 // ID DNA-8-2_DR repbase; DNA; ZEB; 824 BP. XX AC . XX DT 09-SEP-2003 (Rel. 8.08, Created) DT 09-SEP-2003 (Rel. 8.08, Last updated, Version 1) XX DE DNA-8-2_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; 2-bp TSD; DNA-8-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-824 RA Kapitonov V.V. and Jurka J.; RT "DNA-8-2_DR, an ancient nonautonomous DNA transposon from RT zebrafish."; RL Repbase Reports 3(8), 151-151 (2003). XX DR [1] (Consensus) XX CC This element is characterized by 8-bp terminal inverted repeats CC and 2-bp target site duplications. CC Its classification is not certain yet, although it is CC expected to be a member of the hAT or MuDR superfamily. CC The DNA-8-2_DR copies are ~89% identical to the consensus CC sequence. CC There are several hundred copies of this element present in the CC genome. 
XX SQ Sequence 824 BP; 277 A; 143 C; 128 G; 272 T; 4 other; tagggttgtc gcgataccat taattcatct tacgatacta taccagctga agtatcayga 60 taccaagtag tattgcgata ctgtaattcc ataactcaaa ttataaagaa aattgtcaga 120 aatactatat tttatgttat aataggccta cttgaattta attcataatt cccttattat 180 taaaaagtat tatttgcacg tcacttcacc taaaatgtat tcattctttt tgtttatttt 240 gtagataact gaccaacaaa aatacattaa aataaagata caaattatac caaatgaaat 300 ttgttacaaa agagcatttt tccaacaaaa ttaggctata tgaagtggtc aaaaattgtt 360 tcaatgtttc ctgttgctat tttgggtttt tcytaatcta ttttttttca gtcttatcag 420 gtaacaatcc aaagtaattc aaaatttcac tttgtgagtc gttcttttta aggagattgt 480 cgcggttgtt gtagctacta aatcatattg acaggaacag aaatgaacyg attcagatgt 540 gcgttcgaac tgaagcgtca gaaaacgtgc aataggctac cataaaaggn gcgaattcta 600 cacagttttg caaccttaaa gggctccaca gactattcaa ataaaatggt ctattgttgc 660 acaaacgtgt gagcatttta gtgacttttc cacatgcaac attatctatt gtagatagac 720 cattaatgac gatactaccg tttacaaact acagtggcac cgccagtatt ttggagccat 780 agtatcacga tactaccata gtaccggtaa accgtgcaac ccta 824 // ID Gypsy109-I_Dr repbase; DNA; ZEB; 4299 BP. XX AC . XX DT 29-APR-2009 (Rel. 14.05, Created) DT 29-APR-2009 (Rel. 14.05, Last updated, Version 1) XX DE An internal portion of the Gypsy-109_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; GYPSY superfamily; integrase; Gypsy109-I_DR; Gypsy-109_DR; KW Gypsy-109-LTR_DR; Gypsy-109-I_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4299 RA Dib M.R. and Naveira H.F.; RT "Gypsy109_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 9(5), 952-952 (2009). XX DR [1] (Consensus) XX CC Gypsy109-I_DR is an internal portion of the Gypsy109_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. 
Its long CC terminal repeat is deposited in Repbase as Gypsy109-LTR_DR. CC Gypsy109_DR is characterized by 4-bp target site duplications. CC The internal portion encodes one polyprotein, the 1422-aa CC Gypsy109_DR1p (pos. 29-4294), composed of the gag, CC protease, reverse transcriptase, and integrase domains. Some CC insertions are fairly recent, according to the high identity CC between their flanking LTRs. The consensus was obtained after the CC alignment of at least three independent insertions bearing at CC least 85% homology over at least 1000 bp. XX FH Key Location/Qualifiers FT CDS 32..4249 FT /product="Gypsy109-I_Dr_1p" FT /note="Polyprotein." FT /translation="MQVLSHEVTAQAQVLTTHQQQLSHLTQLTDELVKSLQ FT NLQAASTAQLTANYSPSQPFVTQTQTVSGARLAFPDKFSGNPAKCKGFLLQ FT CKLFISQQPHLFKDENSKIAFVCSLLTGKALDWATAVWPDSTPIFPSFNDF FT LKRFCTVFDHPEGGRNAGEELLCVQQRSQPAAEFALHFRTLAAQSGWADDP FT LKTLYRKALNPELQKEMACRDDGKSLDQLIELSIRLDHLLRSRKPLCSVTP FT SPVSPESPTEPMQLGRTRLTPEEREQRRRNHLCLYCGLSGHMKILCPNKPP FT PKTLPVSATTVFTIANDILSVPVYLRCGEIEISTLAMVDSGAAGNFIDHSF FT ATTHSIPLTSCDSSLAITAIDGRPLGEGHIKFRTLPISLQTGSLHKEELSF FT LAIDSPRHTIILGLPWLQLHDPQISWKTGEIIKWSNNCFNHCLQSVLPVQI FT NTISSTEDPELSQIPEVYQDLIEAFNKQKATKLPPHREHDCAIELLPGTTP FT PRGRIFPLSQPETEAMNNYISEELEKGFIRPSTSPASAGFFFVKKKDGSLR FT PCIDYRGLNDITVKFRYPLPLVPAALEQLRSAQYFTKLDLRSAYNLIRIRQ FT GDEWKTGFSTANGHYEYLVMPFGLANSPSVFQAFINEIFRDMLNQWVIVYI FT DDILIYSNSLPEHIQQVRAVLKRLIQNQLYAKASKCEFHQTCISFLGYIIS FT PEGVAMDQQKVDSVTQWSKPETIRQLQRFLGFANFYRRFIRNFSTVAAPLT FT AMVKANNARLKWNPEAIRSFNQLKSRFTTAPILRHPDPNLPFVVEIDASNT FT GIGAVLSQRSQTTNKLHPCAFYSRKLNPAERNYDVGNRELLAMKAALEEWR FT HWLEGAKHPFTVITDHKNLEYIRSCKRLNPRQARWALFFTRFDFKVTYIPG FT SKNVKADALSRLFDEEALADDVEPILMDSLVLAPIQWDIETEILQASEQNP FT TPQACPENRIFVSPLLREKLISEVHNHPSSGHPGSTATVQLIQSRYWWPSI FT NKDVIKFINNCSPCQMAKHSRHRPAGLLQPLEVPRRPWSHIAIDFITDLPQ FT SQGNTTILTVVDRFSKSCRLIAIPKLPTALETAELLCECVFRYYGLPEDIV FT SDRGPQFTSRLWSAFFKNLQVNISLTSGYHPQSNGQTECLNQEIGRFLRTY FT CHSNQAEWNKFLIWAEYAQNSLRKPSTGLTPFQCVLGFQPPLFPWSGEPSE FT
LPAIDTWFKKCEEVWNAAHTHLSHAIRRFKEQADRHRRPGPTYSPGQWVWL FT STRDLRLRLPCKKLSPRYVGPFQIERQISPVSFRLTLPNHYRISPTFHVSL FT LKPAVGPAEVDREVAAGEQGPPPIMVDGEEAYRIHEILRSRRRGGQLQYLI FT DWEGYSPEERSWINRKDILDPTLLNEFHLQHPEMPAPRPRGRP" XX SQ Sequence 4299 BP; 1118 A; 1248 C; 875 G; 1058 T; 0 other; gaagactttg caaaacacgg atcccgcagc catgcaggtc ttgtcccacg aagtcacagc 60 tcaagctcag gtattaacta cacatcagca acagttgtct catttaaccc aactcacaga 120 tgaactggtg aaatcactgc aaaacctgca agctgcttcc acagcgcaac tcaccgccaa 180 ttactctcca agtcaaccct ttgttacaca gacccagact gtatccggag ctcgtttggc 240 attccccgac aaattttcag gtaacccagc taagtgcaaa ggctttttac tccagtgcaa 300 actgtttatc tctcaacagc cccatctgtt taaggatgaa aacagtaaaa ttgcttttgt 360 gtgttctctg ctcacgggaa aagcattaga ctgggctact gcagtttggc ctgacagcac 420 cccgatattt ccctcattta atgactttct caaacgtttt tgcactgtgt tcgatcatcc 480 tgagggtggt cgtaatgctg gtgaggagct cttgtgtgtt caacagagaa gtcaacctgc 540 agccgaattc gctctacatt tccgcacact ggctgcacaa tctggctggg ctgacgatcc 600 tctaaagacc ctatacagga aagctctaaa ccccgaactg cagaaagaga tggcatgtcg 660 tgatgatggg aaatcgttgg accaactcat tgaactctca atcaggttag accatttact 720 ccgctcccgt aaacccctgt gttctgtcac tcccagtcct gtatcccctg agagtcccac 780 tgaacctatg caactgggca gaacccgact aacccccgag gaacgtgaac aaagacggag 840 aaaccatctg tgcctgtatt gcggtctttc gggtcatatg aaaatcctgt gtcccaacaa 900 acctccgccc aagacccttc cggtgagtgc aaccaccgta ttcacgattg ccaatgacat 960 tctgagtgta cccgtttatt tacgatgtgg tgaaattgag atctcgactc tcgccatggt 1020 tgactcagga gccgctggca actttataga tcactcgttt gccacgaccc actccattcc 1080 tctaacctcc tgtgattctt ccctagccat cactgctata gacgggcgcc ccctggggga 1140 aggacacata aaattccgaa ctctgccaat ctctcttcaa acaggctctc tccataaaga 1200 agaactctcc ttcttagcaa ttgactctcc tcgacacaca attatcctcg ggttgccctg 1260 gctacaactt catgaccccc aaatttcctg gaaaacgggt gagatcatta aatggagcaa 1320 taattgtttt aaccattgcc tgcagtctgt cctccctgtc cagattaata ccatttccag 1380 tactgaagac cccgaattaa gtcaaatccc tgaagtttat caagatctca tcgaagcctt 1440 taacaaacag aaagccacta agcttccgcc 
tcatcgtgag catgactgtg ccattgagtt 1500 actgccaggt acaacgcctc ctcgtggccg gatttttccc ctctcacaac ctgagaccga 1560 agccatgaat aattacatct cggaggaact ggaaaaaggc tttatacgac cttccacgtc 1620 acccgcctca gctgggtttt tcttcgtcaa aaagaaggac ggtagcctac gcccatgcat 1680 tgactacaga ggactgaatg atatcacagt taagtttcgc tatcctttac cactagtccc 1740 agcagccctc gaacaactac gctcagcaca gtactttacg aagttggacc tccgcagtgc 1800 ttacaacctc attcgtatcc gacaggggga cgaatggaaa accgggttct ccaccgctaa 1860 tggccactat gaatatttgg ttatgccctt cggcctagca aacagtcctt cagtgttcca 1920 ggctttcata aatgagatat tcagagacat gctcaatcag tgggtcatcg tgtacatcga 1980 cgacatcctc atctactcca attccctacc tgaacacatt caacaggtca gagccgtctt 2040 aaaacgccta atccagaacc agttgtacgc caaagcctcc aagtgtgagt ttcaccaaac 2100 atgtatatca tttctgggtt atatcatcag tcccgaaggc gtggccatgg atcagcagaa 2160 ggtagattct gtcacgcagt ggtccaaacc tgaaaccatc cggcaactac aacgtttcct 2220 ggggttcgca aacttctata gaaggttcat ccggaacttc agtacagtag ccgctcctct 2280 cacagccatg gtaaaggcca ataacgctcg cctgaaatgg aatccagaag caattcgatc 2340 attcaaccag ctcaagtcac gcttcacaac cgcgcccatc ctacgtcatc ctgaccccaa 2400 tctaccattc gtggtcgaaa tagatgcctc caacacgggc attggagccg ttctatccca 2460 gaggtcccaa acgactaaca aactccatcc ttgtgccttt tactctcgca aactcaatcc 2520 agctgagaga aactatgacg ttggcaaccg ggaactctta gctatgaaag cggcattgga 2580 ggagtggaga cactggcttg agggcgctaa acacccattc accgtcataa ctgaccacaa 2640 aaatcttgag tacatccggt cctgcaagag acttaaccca aggcaggcaa ggtgggctct 2700 attctttact cgctttgact tcaaggtcac ttacattccc ggttcgaaaa atgtcaaggc 2760 tgacgctcta tctcgcctct ttgatgaaga agcattggct gatgatgtcg agccaatcct 2820 aatggactcc ctagtcctag cacccattca atgggacatt gagactgaaa ttctccaagc 2880 atctgagcaa aaccctactc cgcaggcatg tcccgaaaac agaatctttg tttccccgtt 2940 gctccgagaa aaacttattt ctgaagttca caaccacccc agttccggtc atccaggtag 3000 cacagcaacc gtccaactca tccagtcccg ttattggtgg ccatcaatca ataaagatgt 3060 gattaaattc ataaacaact gctctccctg tcaaatggcc aaacactccc gtcaccgtcc 3120 agccgggcta ctccaacccc tagaagttcc acgtcgcccc 
tggtcacata tagctatcga 3180 cttcatcaca gacctacctc aatcccaagg aaataccacc atccttaccg ttgttgaccg 3240 tttctctaag tcttgccgac tcattgccat acctaaactg cctacagctt tggaaacggc 3300 agagctactt tgcgaatgtg tcttccgcta ctatggtcta cctgaagaca ttgtttcaga 3360 tcggggtccc caatttacct cccgtttgtg gtccgcattc ttcaagaacc tgcaggttaa 3420 catcagtctc acttccggct atcacccaca atccaacggc cagactgaat gcctcaatca 3480 ggagattggt cgatttctcc gcacctattg tcactccaac caagctgaat ggaacaaatt 3540 cctcatatgg gctgaatacg ctcagaactc cctgagaaaa ccatctacag gtctgactcc 3600 cttccagtgt gtactcggct ttcaaccccc tctattccct tggtctggcg aaccttcaga 3660 acttccagcc attgacacct ggttcaagaa atgtgaggag gtatggaacg cagctcacac 3720 ccatctatcg catgccatcc gaagatttaa agaacaggct gatcgtcacc gtcgtcctgg 3780 tcccacgtat tccccaggac agtgggtgtg gctatccact cgagatctgc gcctgagact 3840 accctgcaag aaactcagcc ctaggtacgt gggtcctttt cagatagaga gacaaatctc 3900 tcctgtttct tttcgactga cacttcctaa tcattaccgt atttctccta cattccatgt 3960 ctctctgctc aagcctgctg ttggtccagc cgaggtggat agggaggtgg cagccggtga 4020 acagggtccc ccacctatca tggtcgacgg agaagaggct tatcggatcc acgagatcct 4080 gagatccaga cgccggggcg gacaacttca gtatctcatc gactgggagg ggtacagccc 4140 ggaggaaaga tcttggatca accgtaagga cattctcgac ccaactctgt tgaatgagtt 4200 ccacttgcaa catccggaaa tgccggcccc tcgcccccgt ggaagacccc ggcgtcgcga 4260 ttcttctcac ttcaggagcc gttcgttgga ggggggctc 4299 // ID Gypsy-24-I_DR repbase; DNA; ZEB; 4529 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-24_DR LTR retrotransposon - a DE consensus sequence. XX KW GYPSY superfamily; Gypsy-24-I_DR; Gypsy-24-LTR_DR; Gypsy-24_DR; KW LTR retrotransposon; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4529 RA Kapitonov V.V. 
and Jurka J.; RT "Gypsy-24_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 21-21 (2005). XX DR [1] (Consensus) XX CC Gypsy-24-I_DR is an internal portion of the Gypsy-24_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-24-LTR_DR. Gypsy-24_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1464-aa Gypsy-24_DR1p (pos. 114-4505) CC composed of the gag, protease, reverse transcriptase, and CC integrase domains. The consensus sequence was built from CC several copies less than 4% diverged from each other. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy-24_DR1p" FT /translation="MEDRAASVDYSDEAFTHLRDPVANLIDTFSELYLDSE FT EEKESKDESPQQGQEDISKIDDFPSPPPPLDFHDDQLQFENDEHDQRIHSI FT EKHLADLEHRVSSFVTAETLNANLRACEDRINYYVQRELDRVQQKCYAKVE FT DLSRSIVDCLKRRDKQLEQQFKAIKPIMSTPMHSSIVTSHKTSQTPSRIDA FT TQDTTKQGTYLSPSSFPTSIKLELPTFGNADSEDPLDFIERFEEYDELRPL FT HHEEMLAALSVSLKGTAKSWWKAEKSSITDWLSFKEKFLFSFLNEDHKEVA FT AQKLADYKQKVNESIRDYAFNYRAMSLKINPAMSESELVQATLRNCNPRLA FT SLLRGTVKSIDDLVRLGTQIEKDWSESRKRWSQGKEEDQKKKSSAVKGQPN FT RLMLIDPCLCDNVLQAPVILNHSYFNAVIDTGSTFSLLQKKLWERLKKKDE FT QLTRSDQTFMLANGQSQKTLGKVLWACEIYGVKHEVTFHVMDDDSLAVPVI FT LGLDFLKKAKVTIDFNVSRIYLPDANSSHPVCFNKTTEHAAVKFYAAQEEV FT GVSHDERLKLIDQALENSHTTTKVKSQLKALMCDWPSVCTNKLGRTDLIKH FT VIKTTDDLPLRKRPYRVSKAKNDFIEEQIQELLQQKIIKPSTSPWASPVVV FT VEKKDGGSRLCIDYRGLNAKTFLDAYPMPQITDILDSLQGAKVFSTLDLKS FT GYWQLEMDPASMEKTAFVTASGLYEFSSLPFGLKNAAASFQRLMEQVLRDL FT KNKCCMVYIDDIIVYSPDVQTHLNHLEQVFHSLHKAGLTLNLKKCKFICAS FT LDYLGHTISADGVNVNSDKVEAIRTFPIPKTLKELQRFLGLAAWYHRFIPD FT FSSKTAPLHLLKRKDVKWNWSDECQRAFDVIKDELTRAPVLCTPNFDLSFK FT VQTDASDVGLGAVLTQEVEGQERVIAYASRLLRGAEKSYSASEKECLAVVW FT AVEKWHHYLEGRPFEVITDHASLVWLFQHPKPSSRLERWTIRLQGYHFTVR FT YRKGQCNIVPDVLSRREEVNSQAVLLHTPAKKNFSTVSCDLPLDLSQIACE FT QEKDTECQEIMVKAKSQRTTDLKRTHYICKNGVLFRSIPDSKEGQRLQVVI FT 
PEKLREVTLSYAHDSPLSGHLGRFKTLMRLLEFAYWPSIRTDVWEHCKICE FT KCQRYKPTNLKPAGDLQSVPIVEPGYMLGMDIMGPFPRSSRQNEYLLVIVD FT YFTKWVEVFPMRTAKSNTIVRILIEEIFTRWGTPAFIVSDRGRQFTSNLLD FT QLCKQWQITPKLTTAYHPQSNLTERVNRNLKTMIAMFVEQNHRTWDQWIYE FT FRFALNTAWHESTGYSPAEIALGRQLKGPLQRALHNPPDPNQPAYNTLERQ FT KILYDVVRDNVEKAQSKQRKYYNMKRRTQNFEEGDLVWVRTHPLSKADDAF FT MAKISPKWKGPARIVKKLGPVNYKVTMLSDVAQVDTYHTQNLKIWHGADF" XX SQ Sequence 4529 BP; 1467 A; 856 C; 984 G; 1222 T; 0 other; gatggcaccc gaacagggac attcgtgaat tgtattttac gaagtccttt gaattataaa 60 gagatatttc actgatactg aaattcttca caagcaaata ttacaaattt acaatggagg 120 acagagcagc tagtgtggat tattcagatg aagcctttac tcatctcaga gatcctgtag 180 ctaatcttat agatactttt agtgaattgt atctagattc agaagaagaa aaagaaagta 240 aagatgaatc tccacagcag ggacaggaag acattagtaa aattgatgac tttccatcac 300 cacccccacc ccttgacttt catgatgatc aattacagtt tgagaatgat gaacatgatc 360 agcgtataca tagcattgaa aagcacttgg ctgatcttga gcatagagta agcagttttg 420 tcaccgctga gacactgaat gcaaatttaa gagcatgtga agataggatt aactactatg 480 tacagaggga gttagatcgt gttcagcaaa aatgctatgc taaagttgag gatttgagta 540 ggagtattgt ggattgcttg aagcgtagag acaaacagct agagcaacaa tttaaggcta 600 tcaaaccgat catgtccact cctatgcatt ccagcatagt tacttctcac aaaacatctc 660 aaacaccctc tagaattgat gcaactcaag acaccacaaa gcaaggcact tacctttctc 720 catcgtcctt cccaaccagt attaaacttg aactgcctac ttttggaaat gcagattcag 780 aagaccccct tgatttcatt gagcggtttg aagaatatga tgaacttcga cctctacatc 840 atgaagagat gttagcagct ttatctgtaa gtcttaaagg tacagctaag agttggtgga 900 aggctgagaa gagcagtatt acagattggt tgtcattcaa agaaaaattc cttttttcat 960 tcttgaatga agatcacaag gaagtggctg ctcagaaatt ggctgattac aaacaaaaag 1020 tcaatgaaag tataagagac tatgctttta actatagagc aatgtcactg aaaataaacc 1080 ctgcaatgtc tgaatctgaa ttggtacaag caacattgag aaactgtaat cctagattgg 1140 cttcattatt aagaggaaca gtgaaaagta ttgatgatct ggttcgtctc ggtacacaaa 1200 tagagaaaga ttggtcagaa agtaggaaaa gatggagtca aggaaaggaa gaggatcaaa 1260 agaagaaatc ttcagcggtg aaaggacaac caaataggct catgcttatt gacccttgtt 1320 
tgtgtgataa tgtactacag gctcctgtta tcttgaatca ctcatacttc aatgctgtaa 1380 ttgatacagg aagtacattt tctttgctgc agaagaagtt gtgggagaga ttgaagaaaa 1440 aagatgagca attgactaga agtgatcaaa cgttcatgct cgcaaatgga cagagtcaga 1500 aaactctagg taaagtatta tgggcatgtg aaatttatgg agtgaagcat gaagtcacct 1560 ttcatgtgat ggatgatgac agtttggctg ttcctgttat attaggcttg gattttctca 1620 aaaaggctaa ggtaaccatc gactttaatg tttcacgtat ctacctacct gatgctaaca 1680 gtagtcaccc tgtatgtttt aacaaaacaa ctgagcatgc tgctgtgaag ttttatgctg 1740 cacaggaaga agttggagta agccacgatg agaggttaaa actgattgac caagccttgg 1800 aaaattctca cactacgacc aaggtaaaga gtcaattaaa agctcttatg tgtgattggc 1860 catcagtatg tactaacaaa ctgggccgta cagaccttat caagcatgtg atcaagacca 1920 ctgatgactt gcctttaaga aagagaccat atagagtttc taaagccaag aatgatttta 1980 ttgaagaaca gatacaggag ttgcttcaac aaaaaatcat caaaccttct acatctcctt 2040 gggcttcacc tgtggtagtg gtagagaaaa aggatggggg atctagatta tgcattgact 2100 accgagggct taatgcaaaa acttttctag atgcttatcc tatgcctcaa atcacagata 2160 tactggactc tcttcaagga gctaaggtgt tcagcacgtt ggacttaaag agtggatact 2220 ggcagttaga aatggatcct gcaagtatgg aaaaaacagc ttttgtcact gcttcggggc 2280 tatatgaatt ctcgtctctt ccctttggcc ttaaaaatgc agctgcgtct ttccaacggc 2340 tgatggaaca ggtactgaga gatcttaaaa acaaatgttg tatggtttat atcgatgaca 2400 ttattgtata ctcacccgat gtccaaactc acctgaatca tcttgaacaa gtgtttcaca 2460 gcctacacaa agctggtctc acacttaacc taaagaaatg taagttcatt tgtgcttcac 2520 ttgactactt gggccatacc atctcagcag atggagtcaa tgtgaattca gacaaagtgg 2580 aggctatcag aacatttcca attcccaaga ccttaaagga attacaaaga tttttaggac 2640 tggcagcttg gtaccatcga tttattcctg atttctcctc caaaacagct cccttacacc 2700 tcttgaagag gaaagatgtg aagtggaatt ggtctgatga gtgtcaacgt gcctttgatg 2760 ttatcaaaga tgagctcact agagcacctg tgttgtgtac acctaacttt gacctttcct 2820 tcaaggtaca gactgacgca agtgatgtgg gtttaggggc tgtgctcact caagaagtgg 2880 aaggacaaga gagagttatt gcctatgcat ctcggttgct cagaggggct gagaagtcct 2940 attccgcctc agagaaagag tgtctggcag tagtgtgggc agtagagaag tggcatcatt 3000 accttgaagg 
tagaccgttt gaggtaatca ctgatcatgc ttccttagtc tggcttttcc 3060 aacatcctaa accttcatct agattggaaa gatggacaat cagactacaa ggataccatt 3120 ttactgtaag ataccgaaaa ggtcagtgta acatagtgcc cgatgtgttg tccaggagag 3180 aggaagtgaa ctcacaggct gtattactgc atacaccagc caaaaagaat tttagtactg 3240 tctcatgtga tctgcctttg gacttatcac aaattgcttg tgaacaggaa aaggataccg 3300 aatgccaaga gatcatggtt aaagccaaaa gccagagaac cacagatctg aagaggactc 3360 attatatttg caagaatgga gtcttattca ggagcattcc agattcaaag gaaggccaaa 3420 gactacaggt tgtaatccct gagaaattga gagaggtaac tttgtcttat gcgcatgaca 3480 gtcctttaag cggtcatttg ggtaggttca aaactcttat gcgactgcta gaatttgcat 3540 attggccatc catacgtact gatgtttggg aacattgcaa aatttgtgag aaatgtcagc 3600 ggtacaaacc tacaaaccta aaacctgctg gtgacctaca aagtgtgccc atagttgaac 3660 ctgggtatat gctgggtatg gacatcatgg ggccgtttcc acggagctca cgccaaaatg 3720 agtatctact agttatagtg gactatttca ctaagtgggt agaggttttt ccaatgagaa 3780 ctgccaagtc taacactatt gtacggattc tcatagaaga aatattcact agatggggaa 3840 ccccagcctt catcgtatcc gaccgaggta ggcagtttac ctccaacctg ttggatcagt 3900 tatgtaaaca gtggcaaata actccaaaac tcaccaccgc ctatcatcct caatcaaatc 3960 ttacagaaag agtcaatcga aatttgaaaa ccatgattgc catgtttgtt gaacaaaacc 4020 accgtacctg ggatcagtgg atatacgagt tcagatttgc tctgaatact gcttggcatg 4080 aaagtactgg ttattcacct gctgagatag cacttggacg acagttgaag ggacccctac 4140 agagagcctt gcacaatccc ccagatccta accaaccagc atataatacc ctagaacgcc 4200 aaaaaattct atatgatgta gtcagagaca atgtagagaa ggcacagagt aagcagagga 4260 agtattacaa tatgaagaga agaactcaga attttgagga gggagatttg gtatgggtaa 4320 gaactcaccc tctctctaaa gctgatgatg cattcatggc taaaatttct cctaaatgga 4380 aaggcccagc taggattgtg aaaaagctag gaccggtaaa ttacaaggtc actatgttat 4440 cagatgttgc gcaagtagat acttatcata cacaaaactt aaaaatttgg catggtgcag 4500 atttttaaaa aaacacgggg agggatgta 4529 // ID RTEX-1_DR repbase; DNA; ZEB; 4732 BP. XX AC . XX DT 23-FEB-2009 (Rel. 14.02, Created) DT 23-FEB-2009 (Rel. 14.02, Last updated, Version 1) XX DE RTE-like non-LTR retrotransposon - a consensus. 
XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; RTEX-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4732 RA Bao W. and Jurka J.; RT "RTE-like non-LTR retrotransposons from zebrafish."; RL Repbase Reports 9(2), 563-563 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 1..792 FT /product="RTEX-1_DR_1p" FT /translation="SFGTPSHIIIHTGTNDLRAQQERVGQLVCRVAEKAAE FT TCPNAKITISTLLPRRDIHPDTINRVNADISRGCALLPNVHLTHHTSITVR FT DLYDHVHIKKDKVNVFAKALKDTAWGRQTSAHTTNRLSPPYHMENMKQPPP FT GHHWGPPPHTRHFQPTQAQQQRPRPSSGHSQKTQTMPGPPQRETPTAGHHQ FT RPTPSNRLNPRQVNNSRRSPQHPTTSTTAADSTPRPALLPQPRNYAQALKG FT PAKPLEMGEIRQLLQYISAQLT*" FT CDS 906..4364 FT /product="RTEX-1_DR_2p" FT /translation="MTLSISLWNIQGLKSSAFGLKSLNTEFQANIKNKDII FT ILQETWSKTNTTTHCPPNYREIILPSQKLNTTRQGRDSGGQIIWYNSKLHK FT YIDTVKTGKYHAWLKIHKDLLSSQKDCFLCAIYIPPSESPYYSEDMFDTLE FT KETSHFQAQGHVLICGDMNARTGQQPDFTNAQGSKYINSNLPGIQTSFSHL FT HRNNHDHIVNKSGKELLQICRSLGLYIVNGRIRGDRLGRFTFCSPLGNSTV FT DYMITDIDPSSLRAFTVRELTPLSDHSQITLYLKKTITNPCTQPNKLFNIR FT KPYRWAENSAEDYQNAVNSPKTQQILDNFLVNAYAHTKQGVNMAVKDINNI FT FESTAKQAKLKVKTRQNNPPKNDKNWFDQECLSIRKHLRNLSNQKHRDPNN FT AEIRLLYCETLKQYKQTLRTKKAQYTQKQLTTIENSINSNQFWDNWKNLIK FT NDHEELPIQNSEIWETHFQTLFNKVETDTNPKQNQITKTLATLESTIKDNQ FT NPIDFPITIMELKDKIKSVKPKKASGPDGILNEMIKQTSPKFQSAILKLFN FT LVLSVGHFPDIWNQGLITPIFKNGDKFDPNNYRGICVSSNLGKLFCSLINA FT RLLDFITTHNVLSRSQIGFLPKYRTSDHIYTLHSLIEKHTVQNKGKIYACF FT IDFKKAFDSIWHQGLLYKLIESGIGGKTYDLIKSMYTESKCGIKISTKRTK FT YLSQERGVRQGCCLSPTLFNIYINELALSLERSTAPGLDLHDSQIKCLLYA FT DDLLLLSPTEQGLQQNLQLLDQYCQTWALTVNLNKTKIITFQKRARAQGTQ FT HTFTLGTNQITHTTQYNYLGLNITSTGNFNPAVNELRDKARRAFYAIKRQC FT PIDIPVQIWLKILESVIEPIALYGSEVWGPLTNPEQDLAKWDKHPIETLHT FT 
ELCKNILHVHRHTTNNGCRAELGKYPLIIKIQKRAVKFWKHLKLSDPDSYH FT YKALQDRELSRRADPLSQLAQSFRVSETSPEELNTLLPLTQITHQIKNSYT FT HHWDTQTQLQSKMQCYLALKRQYTLADYLHTVTDKGLRNTLSRYRLSGHQL FT AIETGRHRQTWLPVEERLCPHCPQQPIETELHFLTECTKYSEIREKFYPKL FT THTHKNFESLSNNEKLPILLGECVCCCVLAAQFVHSCHRLRNPQ*" XX SQ Sequence 4732 BP; 1675 A; 1247 C; 831 G; 978 T; 1 other; agttttggca caccctcaca catcatcatt cacacgggca ccaacgacct gagagcccag 60 caggaaagag tcgggcagct agtctgcaga gtagcagaga aagctgcaga gacctgcccc 120 aatgcaaaaa tcaccatctc caccctcctg cctcgcagag acatccaccc cgacaccatc 180 aacagagtca acgctgacat ctccagagga tgtgctctac tgcccaacgt gcacctgact 240 catcacacca gcatcacagt aagagacctc tatgatcacg tacacataaa aaaggacaaa 300 gtcaatgtgt tcgcaaaagc actgaaagac acagcatggg gcagacagac atcagctcac 360 acaacaaaca gactctcacc accttaccac atggaaaaca tgaaacaacc tccacctgga 420 catcactggg gacccccacc tcacacgaga cactttcagc ctacacaagc acaacagcag 480 agaccgcggc ccagctctgg acacagccaa aaaacccaga ccatgccagg acccccccaa 540 agagaaacac ctacagctgg tcatcaccaa agacctactc catcaaacag actcaatcca 600 agacaagtga acaacagcag gaggagtcca cagcatccaa caaccagcac aaccgctgct 660 gacagcacac ccagaccagc actgctgcca caacccagaa actacgccca ggctctcaaa 720 ggaccggcga aacctctgga gatgggtgaa atcagacagc tgctacaata catcagcgcc 780 cagctgacgt gaacagcccc ccaatttaca atatacaacc accatgtata tatatatata 840 tataactaat atattagtaa aggtttatct caaacttaaa aaacaagaac tttacctatc 900 tctaaatgac cctgtcaata tcattgtgga atatacaagg cctaaaatca tcagcctttg 960 gactaaaaag cttaaacaca gaattccaag caaacataaa aaataaagac attattattc 1020 tccaggagac atggagcaag acaaacacta ccacacattg cccacccaac tacagggaaa 1080 ttattcttcc ctcacagaaa ctcaacacaa ctcgacaagg gagagactca ggaggacaaa 1140 tcatctggta caactcaaaa ctccacaaat acatygacac agttaaaacc ggaaaatatc 1200 acgcatggct caaaatccac aaggatctac tgtcgtccca aaaagactgt ttcttatgtg 1260 ccatatacat cccaccatca gaatccccct actacagtga agacatgttt gacactctgg 1320 agaaagagac gagccacttc caggcccaag gacacgtgct catctgtgga gacatgaacg 1380 ccagaacagg acaacagccg gacttcacca acgcacaggg 
aagcaaatac atcaacagca 1440 acctaccagg tatacagacc agcttctccc accttcacag aaacaaccac gatcatatag 1500 tcaacaaaag tggaaaagag ctcttgcaga tctgcaggag tctgggactg tacattgtca 1560 acggtcggat aagaggggac agactcggga gattcacatt ctgctcacct cttggcaata 1620 gcacagtaga ctatatgata acagatatag acccttcatc tctcagagca ttcactgtta 1680 gagaactcac cccactttct gaccatagcc aaattacttt atacctcaaa aagacaataa 1740 caaacccttg cacacagccc aataaactat ttaacatcag aaagccatac agatgggctg 1800 agaacagtgc agaagactac caaaatgcag taaacagccc aaaaactcaa caaatcctag 1860 ataacttcct ggttaacgca tatgcccaca ccaaacaagg agttaatatg gcggtaaaag 1920 acataaacaa tatattcgaa agtacagcta aacaggcaaa attaaaagtt aaaaccagac 1980 aaaataatcc acccaaaaat gacaaaaact ggtttgatca agaatgcctg tcaattagga 2040 aacacctcag aaacctgtca aatcagaaac acagagaccc aaataatgca gagattcggc 2100 ttctctattg tgaaacacta aaacaataca aacaaacact cagaaccaaa aaggcacaat 2160 acacccaaaa acaactgaca acaatagaga actccattaa ctcaaatcaa ttctgggaca 2220 actggaaaaa cttgatcaaa aacgatcatg aagagctacc gatccaaaat tcagaaatat 2280 gggagaccca tttccaaaca ctattcaata aagtagaaac agacacaaat cctaaacaaa 2340 atcaaataac aaaaacactg gcaacactag aatcgactat caaggataat caaaacccaa 2400 tagacttccc catcactata atggagctta aagacaaaat taaatctgtt aaacctaaaa 2460 aagcatctgg acctgacgga atattaaacg aaatgataaa acaaaccagc cctaaatttc 2520 aatcagccat cctaaaatta tttaatctag ttctgagtgt tggtcacttc cctgatatct 2580 ggaatcaagg attgataaca cccatcttta aaaatggaga taaatttgac cccaataatt 2640 acagagggat ttgtgtgagc agcaacctgg gaaagttatt ctgtagttta ataaatgccc 2700 gactactgga cttcatcacg acacataatg tcttaagcag aagtcaaatt ggatttttac 2760 caaaataccg cacatctgac cacatttaca cactgcactc gctaattgaa aaacacactg 2820 tccaaaataa aggtaaaata tacgcatgct tcattgactt taaaaaagct ttcgactcaa 2880 tttggcacca aggcttactt tataagctga ttgaaagtgg cataggagga aaaacatatg 2940 accttattaa atcaatgtac accgaaagca aatgtggcat caaaattagc acaaaaagaa 3000 caaaatatct ttcccaggag cgtggagtga gacaaggctg ctgcctgagc ccaacactat 3060 tcaacatcta cataaacgag ctggcgctca gtctggagcg atccaccgct 
ccgggtctcg 3120 atctccacga ctctcagatc aaatgcctgc tgtacgcaga cgacctgctg ctactatcgc 3180 caaccgaaca gggccttcag cagaacctgc agctgctgga ccagtactgc cagacctggg 3240 ccctgaccgt caacctaaac aaaaccaaaa tcatcacctt ccaaaaaaga gccagagccc 3300 agggaacaca acacacattc acactaggta ccaatcagat aacacacaca acacagtata 3360 attatttagg cctgaacatc acctccactg gaaactttaa tcctgcagtg aatgagctga 3420 gagataaagc ccgcagagcc ttctacgcca tcaagcgtca atgtcccata gacatccctg 3480 ttcagatctg gctgaagatc ctagagtccg tcatcgagcc catcgccctc tacggcagtg 3540 aggtgtgggg cccactgaca aaccctgaac aagatctggc caaatgggac aaacacccta 3600 tagaaaccct gcacacagag ctgtgtaaga acatcctaca cgtccaccgg cacacaacaa 3660 acaacggatg cagagcagaa ttaggcaaat accctctgat aataaagata cagaaaagag 3720 ctgtcaaatt ctggaagcac ttaaaactca gcgacccgga ctcataccac tataaagccc 3780 tgcaggacag agaactgagc agaagagcag acccactctc ccagctggcc cagagcttca 3840 gggtctctga gacgtctcct gaggagctga acacactcct gccactgact cagatcacac 3900 accagatcaa gaacagctac acacaccact gggacactca aacacaactg caaagcaaaa 3960 tgcagtgtta tctggccctg aagagacaat atactctggc agactatctt cacacagtga 4020 cagataaagg tctgaggaac accctgagca gatacagact cagcggacac cagctggcta 4080 tagagacggg ccggcacaga caaacatggc tgccggtgga ggagcggctg tgcccacact 4140 gccctcagca gccgattgaa acagaactgc acttccttac ggagtgcaca aaatactcag 4200 agatccggga gaagttctac ccgaaactca cacacacaca caaaaacttt gagtccctgt 4260 caaacaatga gaaactgccc atcctactgg gggagtgtgt gtgctgctgt gtgttagcag 4320 ctcagtttgt gcactcctgc caccgtctga ggaacccaca atgatccgct ccactgtctc 4380 caacacacaa cacaggaact ctttactcgt ggacttactg ttgatattat tgtcaccact 4440 tttatcctgt ttcacaaatt accattaatc tatttttatt tttattaata tttttagttt 4500 tactgatata ttgttatttt tattaatcta tttttatttt ttacatttct tttctttatt 4560 tctatattat attttatgtt tatatttgtt tgcactactg ccctttttgc actgtcttgt 4620 ctgtacgcag cgacgctgca ctgctttggc aatacgaatg tacagctatt tgtcatgcca 4680 ataaagcacc taaaatgtga aatgtgaaaa tgtgagagag agagagagag ag 4732 // ID Gypsy13-I_DR repbase; DNA; ZEB; 6488 BP. XX AC . 
XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE An internal portion of the Gypsy13_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW GYPSY superfamily; Gypsy13-I_DR; Gypsy13-LTR_DR; Gypsy13_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6488 RA Kapitonov V.V. and Jurka J.; RT "Gypsy13_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 319-319 (2004). XX DR [1] (Consensus) XX CC Gypsy13-I_DR is an internal portion of the Gypsy13_DR CC LTR retrotransposon that belongs to the Gypsy superfamily. CC Its long terminal repeat is deposited in Repbase CC as Gypsy13-LTR_DR. Gypsy13_DR is characterized by 4-bp target CC site duplications. The internal portion encodes two proteins: CC the 556-aa gag Gypsy13_DR1p (pos. 95-1755) and the 1577-aa CC polyprotein (pos. 1756-6486, conceptual translation) composed CC of the protease, reverse transcriptase, and integrase domains. CC PBS is identical to that in Gypsy9_DR. Some internal portions CC are flanked by 100% identical LTRs. PBS is complementary CC to Arg-tRNA.
XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy13-I_DR1p" FT /translation="MDIIEAEGIKIPNSVIISGLTQDKSDDELFDFLKQYG FT SFAKTVFISDKDSEFYQSVILEYTSGQALHSLEPQLPYTHQLSSDPSVTYH FT VRALSSVFTQYKGTSVTKSYLEGLKEVAKLSGTDFEIVLSQMLSQMTAELT FT PTSADTEADDEDLDEPQAQVCPEESFTPAPVKISQPDDSLSQHTSAKPNKP FT PLLTSSEVLNPPEVQKLVIEHVVRTGEVATQGLMQQRLRVFSGKCPRPGSE FT VDYDTWRSSVELMLKDSTLSDLNVSRKIVDSLLPPAADVIKHLSSEAPSSA FT YLQLLDSAFGVVEDGDELLAKFMNTLQDAGEKPSTYLYRLQTALRVTIKRG FT GVSPEEADRHLLKQFCRGCWDNDLITDLQLERRRNNPPSFGQLLLMLRTEE FT DKHTAKVTRMKQHLGSSKPRAVMHSQRTWVSSEVEQGEVSNMVSLAAETKE FT IKRQIAKLQSQLASLVPAHKTQKKASQQAVVNKQDKKKSDTANQLTRTPVS FT QRQKDRPRPWYCFTCGEDCHIASSCTSEPNPTLVNAKRKLLREKQLLWDSQ FT NANSNPDLN" FT CDS 0..0 FT /product="Gypsy13-I_DR2p" FT /translation="FKLESVLVVGQTGAELESQCPNTNHVFEKSQVSKVQS FT TSLPKGLIGAMSIAEVTIANEKCSCLLDTGSQVTTVPKSFYEQHLSGYPIK FT SIDDILEVEGANGLSVPYEGYIEMGITFPEELLGVSVEIPTVALVVPDVKA FT HNQSMVLIGTNTLDVLYKKYLSADPPKFQPCSYGYKVVLKTLEIRWRQNTS FT GVLGHVRLRSRAPKVLMAGQTVVVGGSVSNPCRIDQTILVEHSPDSSLPGG FT VFVKRCLLNQPENQMNSIPVVLTNETNHNITIPPRCVIAELHAVDSLQSLS FT KTSNSNGEGDFTLNFGDSPLPQVWKERISKKLREIPEVFSHHDLDFGHTQK FT VKHSIKLHDETPFKQRARPIHPQDIEAVRRHLQDLLASGVIRESESPFSSP FT IVVVRKKNGDVRLCIDYRKLNIQTVRDAYALPNLEETFSALTGSKWFSVLD FT LKSGYYQIEVDEADKPKTAFVCPLGFWEFNRMPQGVTNAPSTFQRLMEKCM FT GDINLKEVLVFLDDLIVFSDTLEEHETRLLNVLFRLKEYGLKLSLEKCKFF FT QTSVRYLGHIVSEHGVETDPEKVQALKTWPVPKNLKELRSFLGFGGYYRRF FT IKDYSKIVKPLNDLTSGYPPLRKGAKKCNKGSQYHNPKESFGDRWTPSCEE FT AFRTLIEKLTSAPILGFADPKLPYFLHTDASTKGLGAALYQEQDGQMRAIA FT FASRGLSHSESRYPAHKLEFLALKWAVTEKFNDYLYGNHFTVITDSNPLTY FT ILTTAKLDATSYRWLSALSTFSFKLQYRAGKQNVDADSLSRRPQEPIPETA FT WSSKEQERIHQFVQHHHHDAADIVSAPNDVVHAICEKHLINQDAASGVALV FT GSLALRPDAIPDEYGEDCNLDGLPVMPYLPHEIGEKQRADSVLREVIFHLE FT LGEKPSPTVRKEIPTFPLFLKEWNRLELRDGILYRRRQENNLLTYQLVLPE FT ELRPLVMSSLHDDMGHLGIERTVDLIRSRFYWPKMAADVERKVKECSRCVR FT RKAQPQKAAPLVNFHATRPLQLVCMDFLSLEPDRSNTKDILVITDFFTKYA FT VAYPTPNQKAKTVAKCLWENFVTHYGFPERLHSDQGPDFESHVIRELCEVS FT GIKKSRTTPYHPQGNPVERFNRTLLGMLGTLEEKDKAHWKDFVKPLVHAYN FT 
CTKHEVTGFTPYELMFGRQPRLPVDLAFGLPHHGKSDVIPHSEYVKQLKSH FT LKESYLLASQGMLKTAEKNKTRFDKSVTHSSLEVGDRVLVRNVRLRGKHKL FT ADKWESEVYVVVKKAGDLPVYTVHPENSESPLRTLHRDLLLPCGYLPLSNS FT SNPSPPKMSRPRTRQNPGFQQPEEDCSFLSPEDEDYEWYGDNHQNVEPLQF FT STVYDVPQPVKGKSVSLADESAGQNLGKDGTQCKEGTQNDNLPVTPPVDNL FT PDDLPANDPPVYNAPVHSSVDNPSDDVLEVCETTETPGKDQPETDNSKDLP FT TQRESKVEEANDSENSVSCEREEQQSEQEIPSLRRSARERDKPERLTYFQL FT GNPLSHVVQSLFQGLSTALVNSLNSVEDLGDSPITSDIPANIVTTQPLRAC FT KGTCIVSRGEA" XX SQ Sequence 6488 BP; 1949 A; 1372 C; 1449 G; 1718 T; 0 other; aagtggcgag ccagccagga ggtggcgctg ttgctgagta ttaattacat tctaagccat 60 tatttttgcc tacgttgacc agtaacaatt gacaatggat ataatagagg cagaagggat 120 taaaattccg aactcagtga tcattagcgg attaacacaa gacaaaagtg atgatgagct 180 gtttgatttc ctaaaacaat atggttcttt tgccaaaaca gtttttatta gtgacaagga 240 ttctgagttt tatcaaagtg ttatacttga atataccagt ggccaagctt tgcattctct 300 agagccacag ctaccttata ctcaccagct atcaagtgac ccaagtgtta cttaccatgt 360 gagagcatta tctagtgtgt ttacacagta taagggaacc agtgtcacaa agtcatacct 420 ggagggactg aaagaagtag caaaattgag tgggactgat tttgagattg tcttgagcca 480 gatgctgtca caaatgactg ctgagcttac tccaacgtcc gctgacacag aggctgatga 540 cgaggatctg gatgaaccgc aagctcaggt atgtcctgaa gagagtttta ccccagcccc 600 tgtcaaaatt agtcaaccag atgactcact gtcacagcat acctctgcta aaccaaataa 660 gccccccctc ttaacctctt cagaggtgtt aaatccacct gaggttcaaa aactcgtaat 720 tgaacatgtg gtgagaactg gtgaagtggc tactcaagga cttatgcagc aaaggcttag 780 ggtattttca ggaaaatgtc ccagacccgg aagtgaggtc gattacgaca cttggcgctc 840 cagtgtagag ctaatgctga aagattccac cttatctgat ttgaacgtat ccaggaaaat 900 agtagacagt cttttaccac ccgcagcaga tgttataaaa catcttagct ctgaagctcc 960 atcatcagct taccttcagt tgctggattc tgcttttgga gttgttgagg atggagatga 1020 actccttgca aagttcatga atactctgca ggacgctggt gaaaagccat ctacttactt 1080 gtacagattg cagacagctt tgagagtgac aataaaaaga ggtggtgtct cacctgaaga 1140 agcagatcgg catcttctca agcagttctg tagaggctgc tgggataatg atctaattac 1200 tgacctgcag ctagagcgaa ggcgtaataa tcctccttca tttgggcagc ttctacttat 1260 
gttacgcact gaagaagaca aacacaccgc aaaagtcact cgtatgaagc agcatcttgg 1320 gtcttctaaa ccaagagcag taatgcactc gcaaaggacg tgggtttctt ctgaggtgga 1380 gcaaggagaa gtttcaaaca tggtatcact tgcagctgaa actaaggaga tcaagagaca 1440 gatagcaaaa ctacaaagtc agttggctag ccttgttcct gcgcataaaa cccagaagaa 1500 agcttcacag caagcagtag taaataaaca ggacaaaaag aagtcagata ctgctaacca 1560 gttaactagg actccagtta gccaaagaca aaaggacagg cctagaccat ggtactgctt 1620 tacctgtggt gaggattgtc atattgcatc ctcttgcact tctgagccaa accccacact 1680 tgttaatgcc aagcgtaaac ttttgagaga aaaacagcta ctatgggact ctcagaacgc 1740 caactccaac cctgatttaa actagaatca gtccttgttg tgggacagac aggggctgag 1800 ttggaatcac aatgtcccaa tactaatcat gtttttgaaa aaagtcaggt ttctaaagtg 1860 caaagcacta gcttgccaaa ggggttaata ggtgctatga gcattgctga agttactata 1920 gccaatgaaa aatgcagctg tttattggat acaggctctc aagtgacaac agttccaaag 1980 tccttctatg aacagcatct ctcaggatac ccaattaagt ccattgatga tatcctggaa 2040 gtagaaggcg caaatggtct atctgttcca tacgaaggct acatagaaat gggaattacc 2100 tttccagaag aattactagg agtgagcgtt gaaataccta cagtagcctt agtagttcct 2160 gatgtaaaag ctcacaatca gtcaatggtt cttattggga caaacacctt ggatgttctc 2220 tataagaagt atttaagtgc tgatccccca aaatttcaac cttgttcata tggttacaaa 2280 gtggtactta aaactcttga aataagatgg agacaaaaca caagtggtgt tcttggccat 2340 gtacgattga ggagtcgagc acccaaagtt ttaatggctg gtcagactgt agtggtggga 2400 ggttcagttt caaatccatg cagaatagat cagactatac ttgtggagca ctcacctgat 2460 tcgtctctac ctggaggggt gtttgtcaag cggtgtcttc ttaatcagcc tgagaaccag 2520 atgaatagta taccagttgt gcttaccaat gagacaaatc ataacattac tatcccaccc 2580 agatgtgtga tcgctgagct ccatgctgtg gattctttac agtctctttc aaaaacttcc 2640 aatagtaatg gagagggtga tttcactctg aacttcggtg attccccctt acctcaggta 2700 tggaaggagc gcatttctaa aaaactcaga gaaatacctg aagtgttcag tcatcatgac 2760 ctagattttg gccacaccca gaaagtgaag catagcatta aattgcacga tgaaactccc 2820 tttaaacaga gggcacgacc tattcatcca caagatattg aggcagttcg taggcatcta 2880 caagatcttc ttgcaagtgg agtcatccgg gaatcagagt cacctttctc ttccccaatt 2940 gtggtggtga 
ggaagaagaa tggggatgta cgtctatgta tagattatcg caagctgaat 3000 attcaaacag tgagagatgc ctatgcattg ccgaatcttg aggaaacatt ctcggctctg 3060 acaggatcta aatggttctc tgtcctcgat ttaaagtctg ggtactacca gatagaagtg 3120 gatgaggcag acaaacccaa aaccgccttt gtctgtccgc tgggattttg ggagtttaat 3180 cgcatgccac agggtgtgac gaatgcccca agtacattcc aaagattaat ggaaaaatgt 3240 atgggggaca ttaatctgaa ggaggtactt gttttcttgg acgacttaat agttttctca 3300 gacacattgg aagaacatga gactcgacta ttaaatgtcc tgtttcgtct aaaggaatat 3360 ggtctgaaac tctccttgga aaaatgtaag tttttccaga cttcagtccg ctatttggga 3420 catatcgtgt cagagcatgg agtggagact gaccctgaga aggtccaagc tttaaaaacc 3480 tggcctgtac ctaaaaacct aaaagaactt aggtctttct taggctttgg gggatattat 3540 cgtcgtttca tcaaggacta ctctaaaata gtgaagccac ttaatgatct gacctcagga 3600 tatcctccgc taagaaaggg tgccaagaag tgtaacaaag gaagccagta ccataaccca 3660 aaagagtcct ttggtgatcg atggacgcct tcttgtgaag aagcatttcg aacccttata 3720 gaaaaactca cttctgcacc tattctggga tttgctgacc ccaaactccc ttattttctc 3780 cacactgatg caagtacaaa gggactaggg gcagcacttt atcaagaaca ggatgggcag 3840 atgcgtgcaa tagcattcgc aagcagaggg ttgtctcaca gtgagtctag atatcctgct 3900 cacaaacttg aattccttgc cctaaaatgg gcagtgactg aaaagtttaa tgactactta 3960 tacggtaacc atttcaccgt tattacagat agcaaccctt tgacttatat cctcactaca 4020 gcaaaattgg atgctacaag ttatcgatgg ctgtcagctc tttccacctt ctctttcaaa 4080 ttgcaataca gagctggcaa gcagaatgta gatgcggata gtctctcaag gagaccccag 4140 gaacccattc ctgagactgc ttggtccagt aaagagcagg aaagaatcca tcaatttgta 4200 caacaccatc accatgatgc tgctgacatt gttagtgccc caaatgatgt agttcatgct 4260 atttgcgaaa agcaccttat caatcaagac gcagcttctg gggttgcact ggtgggatca 4320 cttgcactcc gtcctgatgc tatacctgat gagtatggag aggattgtaa tttagatgga 4380 ttacctgtta tgccttacct accacatgag attggtgaaa agcagagagc agactcagtc 4440 cttcgagaag tcattttcca cttggagttg ggggagaaac cttctcctac agtgcgaaaa 4500 gagattccaa cttttcctct ttttctaaag gagtggaatc gacttgagtt gcgagatgga 4560 atactctata gaagaaggca ggaaaataat ttactcactt accaactagt actccctgag 4620 gagttaagac ccttggtgat 
gagcagttta catgatgaca tgggtcacct agggattgag 4680 aggactgtag atctgattcg atctcgtttc tactggccaa aaatggctgc agatgtggaa 4740 cgaaaagtca aggagtgtag tcgctgtgtg cgtaggaagg cacaacccca aaaagcagct 4800 cctctggtca attttcatgc cactaggcca ttacagctgg tgtgtatgga tttcctttca 4860 ttggaacctg acaggagtaa taccaaagac atcttggtta ttacggactt ttttaccaag 4920 tatgcagtgg cttatcctac acctaatcag aaggctaaaa cagtagcaaa gtgtctttgg 4980 gagaactttg tcacacacta tgggtttcct gaacgcttgc atagtgacca gggccctgat 5040 tttgagtcac atgtcatcag agagctctgt gaagtgtcag gcataaagaa aagccgaaca 5100 actccctatc acccgcaggg aaacccggtt gaacgcttca ataggaccct gctgggcatg 5160 ttgggtactt tggaggagaa agataaagcc cattggaaag actttgtcaa gccacttgtc 5220 catgcgtaca attgtacgaa gcatgaagtc actgggttta ctccttatga attgatgttt 5280 ggcagacagc caagattgcc tgttgacctt gcttttggtc tgcctcatca tgggaaatca 5340 gatgtcattc ctcattcaga gtatgtgaaa cagctgaaat cacacttgaa ggaaagctat 5400 ttgcttgctt cacaaggtat gttgaaaact gctgaaaaga acaaaaccag gtttgacaaa 5460 tctgtcactc attcttcttt ggaggtcggg gatcgtgtcc tggtgcgaaa tgtcagactg 5520 cgtggtaagc acaagttagc ggacaagtgg gaatctgagg tgtatgtggt ggtaaagaag 5580 gctggcgact tacctgtcta caccgttcat cctgaaaaca gtgaaagtcc ccttcgtacc 5640 cttcataggg accttcttct tccttgtggt tacctgccat tgtcaaacag ttcaaacccc 5700 tccccaccca aaatgtcaag acctagaaca aggcagaatc caggtttcca acaacctgaa 5760 gaggattgtt ctttccttag tccagaagat gaggattatg agtggtatgg tgataaccat 5820 caaaatgtgg aaccattgca gttctctaca gtgtatgatg ttccccaacc agtaaaaggg 5880 aagagtgtgt ccttagctga tgaatcagct ggtcaaaacc tcggaaaaga tggaactcaa 5940 tgcaaggaag gaactcaaaa tgataaccta cctgtaactc ctccagttga caacttacct 6000 gatgatcttc cagccaatga tcctccagtt tataatgcac ctgtacactc ttcagttgat 6060 aacccatctg atgatgttct ggaagtctgt gaaacaactg agacacctgg aaaagaccaa 6120 cctgaaactg ataacagtaa agacttaccc actcagagag agagtaaagt tgaagaggca 6180 aatgattcag aaaactctgt ttcttgtgaa agagaagaac aacaatcaga acaagaaata 6240 ccttcactaa gacgttctgc aagggagaga gacaaacctg agagacttac ctactttcag 6300 cttggaaatc ctctgtctca tgttgttcag 
tctctgttcc aaggattaag cactgctctt 6360 gttaactctc tcaatagtgt tgaagatctg ggtgattctc ctataacttc tgacattcca 6420 gcaaacatag tgactaccca accgctcaga gcatgcaaag ggacttgcat tgtttcaaga 6480 ggggaagg 6488 // ID DIRS1_DR repbase; DNA; ZEB; 6132 BP. XX AC . XX DT 11-FEB-2002 (Rel. 7.01, Created) DT 10-JAN-2009 (Rel. 7.01, Last updated, Version 4) XX DE DIRS1_DR is a DIRS-like LTR retrotransposon - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; gypsy; DIRS1; DIRSDR1; DIRS superfamily; KW reverse transcriptase RNase H; phage integrase; DIRS1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 414-5132 RA Jekosch K.; RT "DIRSDR1: putative non-LTR retrotransposon."; RL Repbase Reports 2(2), 9-9 (2002). XX RN [2] RP 1-6132 RA Kapitonov V.V. and Jurka J.; RT "DIRS1_DR, a family of DIRS-like endogenous retroviruses in RT zebrafish."; RL Repbase Reports 3(1), 1-1 (2003). XX DR [2] (Consensus) XX CC DIRS1_DR is a family of DIRS1-like retrotransposons. These CC elements CC are related to Gypsy-like LTR retrotransposons and endogenous CC retroviruses. CC There are ~100 copies of DIRS1a_DR in the genome, they are ~0.3% CC divergent from the consensus sequence. Therefore, this family CC retrotransposed in the zebrafish genome very recently. CC The unusual structure of DIRS1_DR is depicted in the next figure. CC GTTCCCCTTCGGTTGGGGAACTTCAGTGCCATGAATGGGAGGATTCGGATCAGAAGCCGCTTATCTGGAG CC <====== ======> <--------------------------------------------- CC AGTATTGAACGGGCCAATGAATGAAATTAATTGGCAGCGTAAGCTTGCGCAGGTGTGCGACATCTGCAAT CC ---------------------------------------------------------------------- CC TATCTCAGCATATAAGCACACCTGAAGCCAGCAGACGCCATCCTTTTCGCTTCAGATCCTTTCTGAGTGA CC ----------------------------------------------- CC ...................................................................... 
CC ...................................................................... CC GGTGCAGTCATTATGGCGCTTTCCATATTCTCCCATTCATGGCACTGAAGTTCCCCAACCGAAGGGGAAC CC <====== ======> CC <~~ CC GTTCGAGGTTACAGAAGTAACCCTTCGTTCCCCGAGGAGGGGAACGGAAGTGCCATATTCCGTCGCCATA CC ~~~~~~~~~~~~~~~~~~~~~~~~~~<====== ======> CC ATGACTGTCCCTTAGCTGTTTGAAAGTCTCTTCAGCTT CC AAAAGGATGGCGTCTGCTGGCTTCAGGTGTGCTTATATGCTGAGATAATTGCAGATGTCGCACACCTGCG CC ---------------------------------------------------------------------- CC CAAGCTTACGCTGCCAATTAATTTCATTCATTGGCCCGTTCAATACTCTCCAGATAAGCGGCTTCTGATC CC ---------------------------------------------------------------------- CC CGAATCCTCCCATTCATGGCACTTCCGTTCCCCTCCTCGGGGAACgaagggttacttctgtaacctcgaacgtt CC ----------------------> <====== CC ======>~~~~~~~~~~~~~~~~~~~~~~~~~~~~> CC Fig.1 Termini of DIRS1_DR. The 163-bp sub-terminal inverted CC repeats are CC underlined by a single line. CC DIRS1_DR encodes three ORFs. ORF1 (positions 414-1632) codes for CC the CC gag-like protein. ORF2 (positions 1633-2597) codes for reverse CC transcriptase and RNase H. ORF3 (positions 2598-5129) codes for CC the CC phage integrase. 
XX FH Key Location/Qualifiers FT CDS 414..1850 FT /product="ORF1p" FT /note="Gag-like protein" FT /translation="MALRLCVSGCGGFLSPDDGHDHCIACLGVQHVNAVLA FT GGSCRHCDAMTVAQLRSRLTFARERATPVASCSKKAAGARADLRVSAGANP FT PPTGSRTSRSSRRSIQASGGESDPSNQMVALTLADTGDQMSSAASEGGLSL FT SDEDPDPLAPSGQVSAVKSDPEADMLAVLSRAASAVGLEMVYPPAPRPDRL FT DGCYVEDQKAKPSKPLVPFFPEVHSRLTQSWRAPFSARAASASALTALDGG FT AARGYEAIPSVERAIAVNLCPRGASTWRGLPRLPSKACRLSASLGARAYKA FT AGQAASALHAMATYQRYQAQALAELHEGGSNPSLLHELRTATDYALRTTKS FT AACALGRTMSTLVVQERHLWLNLADMRDVDKVRFLDSPISQAGLFGDTVGE FT FTQEFKAVKEQSDAMGNVIYRRGRKPAPPAEPSTSAVPRRGRPPTSAAPPP FT PAPPAKRARRSPRKQAAPPAQGAVKSGKRTAKRP" FT CDS 2598..5129 FT /product="ORF3p" FT /translation="MRSSSGLVCSHRPEGRVFPCLHSSTPPPISAVCVRGS FT SVAVQGPPLRALSVSAGLHQTRGGCPSAPSARGHSHTQLSRRLADFSPLAG FT AIDYAQGRGASASPPTGASGQPRKEQTRPRAEDFFSRDGAGLDHHGSAPLR FT GTRSPVAELSEGARQQTSGPTEVLSEAPGAYGIRSRRHAARVAPYETTSAL FT ASRSGPQTRMARGHTPGLGYCAVSPRPQPLERPLVPTGRCASRTGVQPCCC FT FNRRFQHGLGGRVSRACGCGPLEGCPAALAYQSPRAVGSVPRSPPLFTGAG FT AATRAGQDGQYGGGGVYQPHGGYALSPHVSARPPSAPLESPAAEIAARHSR FT PRHAQSCSRCALTTAVTPWRMETPPRVCSADMGAIRGGPDRSVCFPRERSL FT PVVFFPDRGLSRHGCTGPQLASGHAQVCVSPSEPARAVSVQGQGGRGTGSA FT SCAPLAQPDLDIRALTPRDGPPLADPFERGPTLSGTGHHLAPSPRSLEPPR FT VVPRREEDLGNLPTAVVNTITQARAPSTRRAYALKWSLFTEWCVSRREDPR FT NCQISVVLSFLQEKLDSRLSPSTLKVYVAAISAYHSAVAGGTVGKHNLVIQ FT FLRGARRINPSRPPLMPSWDLALVLTSLRSDPFEPLESVSLRFLSLKTALL FT VALASIKRVGDLEAFSVSDSCLEFGPDYSHVILRPRPGYVPKVPTTPFRDQ FT VVNLQALPPEEADPALSLLCPVRALRIYVDRTQNFRSSEQLFVCYGGRQQG FT SAVSKQRLSHWIVDAISLAYSSRGQPCPPGVRAHSTRSVASSWARARGASL FT TDICRAAGWATPNTFARFYNLRVEPVSSRVLGNPLVIEETTR" FT CDS 1633..4110 FT /product="ORF2p" FT /note="reverse transcriptase" FT /translation="MRWAMSSIGVAVSPLRPPSHPPPLFLAEGARQRVLPR FT PRLRLRPSGRGVHLESRQPLLPRAPLSPVNGPRSVPETGHPEKRKLALSPL FT EGGAPITTVLFSATKTSVKEHFFPSPDVTARVLPVRDALPSGSQTLRASPV FT AHERWGDGLPSLSPPAPSPESGCGARANRSPPAFPRDPRASRISTPTPRCP FT TAGTSAIVAMTPLARALPAWLARASPSRWLIRTIRLGYAIQFAKRPPKFTG FT VYFSRVNPLSAPVLREEIAALLAKGAIEPVPPAEMESGFYSPYFIVPKKSG FT 
GSRPILDLRVLNRCLHKLPFRMLTQRRILQCVRPRDWFAAIDLKDAYFHVS FT ILPRHRQFLRFAFEGRAWQYKVLPFGLSLSPRVFTKLAEGALAPLRLAGIR FT ILSYLDDWLILAHSREQLIMHRDEVLRHLRLLGLQVNREKSKLAPVQRISF FT LGMELDSITMVAHLSEERARLLLNCLRELDSKLVVPLKFFQRLLGHMASAA FT AVTPLGLLHMRPLQHWLHDRVPRRAWHAGTHRVSVTALCRRALSPWNDPSF FT LQAGVPLGQASSHVVVSTDASNTGWGAVCRGHAAAGLWKGAQLHWHINRLE FT LLAVFLALHRFLPVLERQHVLVRTDSTAAAAYINRMGGMRSRRMSQLARRL FT LLWSHPRLKSLRAIHVPGTLNRAADALSRQLLRPGEWRLHPESVQLIWARF FT GEAQIDLFASPENAHCQLFFSLTEGSLGTDALAHSWPRGMRKYAFPPVSLL FT AQFLCKVREDEEQVLLVAPLWPNRTWISELSLLATALPWRIPLREDLLSQG FT QGTIWHPRPDLWNLHVWSLDARKT" XX SQ Sequence 6132 BP; 1117 A; 1898 C; 1706 G; 1411 T; 0 other; gttccccttc ggttggggaa cttcagtgcc atgaatggga ggattcggat cagaagccgc 60 ttatctggag agtattgaac gggccaatga atgaaattaa ttggcagcgt aagcttgcgc 120 aggtgtgcga catctgcaat tatctcagca tataagcaca cctgaagcca gcagacgcca 180 tccttttcgc ttcagatcct ttctgagtga gtcgatgagg gttcctcttg ctgatcagca 240 cttcagagcg aacgagtgtg tctcccggtc cagagtgggt cttcgcggtg gcagacggtc 300 gagctgggtt actcccttgc ctgcggttct ttgggtccgg tcctccagag cggtgcgtat 360 agttgcaact ttcctaaaag agcaacacag tcgtgcagca cgtccttttc aggatggcgc 420 tccgactgtg cgtttctgga tgcgggggtt tcctgtctcc ggatgatgga cacgatcact 480 gcattgcatg tttgggggtc cagcatgtta atgcggtgct cgcgggcggt tcatgtcgtc 540 attgcgatgc catgaccgtt gcacagctaa gatcgcggct aactttcgca agagagcgag 600 ccaccccagt tgcctcctgt tctaaaaaag cagcgggcgc tcgggcagat ctgagggttt 660 cagcgggagc taatccgccg cccacgggct cgcggacctc tcgctcctca cggcgctcca 720 tccaagcttc gggtggtgag agtgatccgt ctaaccagat ggtagctctc acactcgctg 780 acaccggaga tcagatgtcc tccgcggcat cggagggtgg gctttcactg tccgacgaag 840 atccggaccc gctcgccccc tccgggcagg tgagcgctgt caaatcggat cctgaagcgg 900 acatgttagc cgtgctttcc cgggctgctt cggccgtggg gttggagatg gtttatcccc 960 cagctccgcg gccggaccga ctagatgggt gctacgtaga ggaccagaag gcgaagcctt 1020 cgaagcctct cgtccccttc ttcccggaag tgcacagtag gctcacgcag tcctggaggg 1080 cacctttctc tgcccgtgct gcgagtgcct ccgccctcac cgcccttgac ggcggagctg 1140 ccagggggta tgaggcgatc 
ccgtcagtgg agcgcgctat cgcggtcaat ctttgtccgc 1200 gcggcgcctc tacgtggcgg ggtttgcccc gcctcccgtc caaagcctgt aggttgtctg 1260 cctccctcgg agccagagct tataaggctg cgggccaggc tgcttctgct ttgcacgcga 1320 tggccaccta ccagcgctac caagcgcagg cgctggccga gctgcacgag ggcgggtcca 1380 acccaagctt attacatgag ctgcgcaccg cgaccgacta tgctcttcgg actactaagt 1440 ccgccgcgtg tgcgctgggg aggacgatgt ccacacttgt ggttcaggaa cgccacctct 1500 ggctaaacct ggccgatatg cgcgacgttg acaaagttcg ctttcttgac tcgcccatat 1560 cccaggctgg cctgttcggc gacaccgtcg gtgaattcac ccaggaattc aaggcggtga 1620 aagagcagtc ggatgcgatg ggcaatgtca tctatcggcg tggccgtaag cccgctccgc 1680 ccgccgagcc atccacctcc gctgttcctc gccgagggcg cccgccaacg agtgctgccc 1740 cgcccccgcc tgcgcctccg gccaagcggg cgcggcgttc acctcgaaag caggcagccc 1800 ctcctgccca gggcgccgtt aagtccggta aacggaccgc gaagcgtccc tgagacaggc 1860 catccggaga agaggaaact tgctctttcc ccgctggagg gcggggcccc gataacaacg 1920 gtacttttca gtgccaccaa aacatcagta aaagagcact ttttcccttc cccggatgtg 1980 actgcacgag ttctgccagt ccgggacgcg ctgccttccg gctcgcagac tctacgtgct 2040 tcgccagtgg ctcacgagcg ctggggggac ggtctccctt ccctcagccc tccagccccc 2100 tctccggagt cagggtgcgg agccagagcg aatcgctctc ctccagcttt tccgcgggac 2160 cctcgtgctt cccggatcag cacacccact ccgcgctgcc ccaccgctgg tacgtcagcg 2220 attgtagcga tgactccatt agcgagggct ctgcctgcct ggttagcgcg ggccagcccc 2280 tcgcggtggc tcatacgcac aatcagactc ggttacgcga ttcagttcgc gaaacggccc 2340 cccaagttta cgggcgtgta tttctccagg gtcaaccccc tgtccgcccc tgtcttgcga 2400 gaggagattg ctgccctcct ggcgaagggt gcaatcgagc cggttcctcc agccgagatg 2460 gagagtgggt tttacagccc atacttcatc gtacccaaaa agagcggtgg gtcacggcca 2520 atcctagatc tgcgcgtttt gaaccgctgt ctgcacaagc tgccgttcag aatgctcacg 2580 cagaggcgca ttctccaatg cgttcgtcct cgggattggt ttgcagccat agacctgaag 2640 gacgcgtatt tccatgtctc cattcttcca cgccaccgcc aatttctgcg gtttgcgttc 2700 gagggtcgag cgtggcagta caaggtcctc cccttcgggc tctctctgtc tccgcgggtc 2760 ttcaccaaac tcgcggaggg tgccctagcg ccccttcggc tcgcgggcat tcgcatactc 2820 agttatctcg acgactggct gattttagcc 
cactcgcggg agcaattgat tatgcacagg 2880 gacgaggtgc ttcggcatct ccgcctactg gggcttcagg tcaaccgaga aaagagcaaa 2940 ctcgcccccg tgcagaggat ttcttttctc gggatggagc tggactcgat caccatggta 3000 gcgcacctct ccgaggaacg cgctcgcctg ttgctgaact gtctgaggga gctcgacagc 3060 aaactagtgg tcccactgaa gttctttcag aggctcctgg ggcatatggc atccgcagcc 3120 gccgtcacgc cgctcgggtt gctccatatg agaccacttc agcactggct tcacgatcgg 3180 gtccccagac gcgcatggca cgcgggcaca caccgggtct cggttactgc gctgtgtcgc 3240 cgcgccctca gcccttggaa cgacccctcg ttcctacagg ccggtgtgcc tctaggacag 3300 gcgtccagcc atgttgttgt ttcaacagac gcttccaaca cgggttgggg ggccgtgtgt 3360 cgcgggcatg cggctgcggg cctctggaag ggtgcccagc tgcattggca tatcaatcgc 3420 ctagagctgt tggcagtgtt cctcgctctc caccgctttt taccggtgct ggagcggcaa 3480 cacgtgctgg tcaggacgga cagtacggcg gcggcggcgt atatcaaccg catggggggt 3540 atgcgctctc gccgcatgtc tcagctcgcc cgccgtctgc tcctctggag tcacccgcgg 3600 ctgaaatcgc tgcgcgccat tcacgtccca ggcacgctca atcgtgcagc cgatgcgctc 3660 tcacgacagc tgttacgccc tggagaatgg agactccacc ccgagtctgt tcagctgata 3720 tgggcgcgat tcggggaggc ccagatcgat ctgtttgctt cccccgagaa cgctcactgc 3780 cagttgtttt tttccctgac cgagggctct ctcggcacgg atgcactggc ccacagctgg 3840 cctcggggca tgcgcaagta tgcgtttccc ccagtgagcc tgctcgcgca gtttctgtgc 3900 aaggtcaggg aggacgagga acaggttctg ctagttgcgc ccctttggcc caaccggacc 3960 tggatatcag agctctcact cctcgcgacg gccctcccct ggcggatccc tttgagagag 4020 gacctactct ctcagggaca gggcaccatc tggcaccctc gccccgatct ttggaacctc 4080 cacgtgtggt ccctagacgc gaggaagact taggtaacct accgactgcg gtggttaata 4140 ccatcactca ggctagagcc ccctccacga ggcgcgccta cgccctgaag tggagtctat 4200 tcactgaatg gtgcgtctct cgcagagaag acccccgaaa ttgccagatt agtgttgtgc 4260 tctctttcct tcaagagaag ttggacagca ggctgtcgcc ctccactctc aaggtttacg 4320 tggccgccat ctccgcttat catagcgcgg tagctggcgg caccgtggga aagcataacc 4380 tggtcatcca gttccttagg ggtgctaggc gaattaatcc atctcgcccc cctctcatgc 4440 cctcttggga tctcgccctc gttctcacga gtctgcgatc cgatcccttt gagccactcg 4500 aatcagtatc tctaagattt ctgtccctga agacagctct 
gctggttgcg ttggcctcca 4560 tcaagagggt cggggacctg gaggcatttt cggtcagtga ctcgtgcctg gaattcgggc 4620 cggattactc tcacgttatc ctgagacccc gccccggtta tgtgcccaag gttcctacca 4680 ccccctttag agatcaggta gtgaacctgc aagcgctgcc cccggaggag gcagacccag 4740 ccctttcttt actttgtcca gttcgcgctc tgcgcattta tgtggaccgt actcagaatt 4800 ttagatcatc tgagcagctc tttgtctgtt atggcggtcg gcagcaggga agtgccgtat 4860 cgaaacaaag attatcccac tggattgtgg atgccatttc actcgcttat tcgagtcgag 4920 gtcagccgtg tcccccggga gtacgtgcac actccactcg gagcgttgca tcctcttggg 4980 cgcgtgcacg cggcgcctct ctaacagaca tctgtagagc tgcgggctgg gcgacaccca 5040 acacatttgc aaggttttac aatctgcgag tggagccggt ttcctcaagg gtattaggta 5100 accctttggt gattgaggag acaactcggt agggtgttga aacacgcttg ctgcgccatt 5160 ctccctaaca cggaggtacg tgcgcctttt ttatctgtca gtaaagttcc ccgtcaggtg 5220 agccctgcag attcctccgt ggcccccagc actgactcag cggaggagtc acttgctggc 5280 ccactacgtt gtaggtctgc ccgctggtca gcccgcgttt tgggtatagg tgcctgctat 5340 gcgtgatccc cactaggcga tcccatatgc ttattccgcc acggttaagt cccccccctg 5400 ggcggacccg tgtcttccct ctccgctaac cactcttttg ctatgcgtac tccccctttt 5460 tagggctagt ccataggtaa attctgccat ctatcccccc cttgggtaac ggatggcctc 5520 cgcagcgtcc tccctatcgg gattgcacgc ttcccaacgt actgtcgtat ttcctagaat 5580 tatctagatg ctcacgactt cccaaaaaat atatctaaat ccgtaaaact tctgttgaag 5640 taggataaat tagggccagg gacacgttgg aggaccgcgc cccccatgat gtgggtgcgt 5700 cacgcttgct tgactatctc ctcatcgggg gtgttggtaa ggtgcagtca ttatggcgct 5760 ttccatattc tcccattcat ggcactgaag ttccccaacc gaaggggaac gttcgaggtt 5820 acagaagtaa cccttcgttc cccgaggagg ggaacggaag tgccatattc cgtcgccata 5880 atgactgtcc cttagctgtt tgaaagtctc ttcagcttaa aaggatggcg tctgctggct 5940 tcaggtgtgc ttatatgctg agataattgc agatgtcgca cacctgcgca agcttacgct 6000 gccaattaat ttcattcatt ggcccgttca atactctcca gataagcggc ttctgatccg 6060 aatcctccca ttcatggcac ttccgttccc ctcctcgggg aacgaagggt tacttctgta 6120 acctcgaacg tt 6132 // ID TE-X-4_DR repbase; DNA; ZEB; 16300 BP. XX AC . XX DT 01-DEC-2008 (Rel. 
13.12, Created) DT 01-DEC-2008 (Rel. 13.12, Last updated, Version 1) XX DE nonautonomous transposable element from zebrafish - a consensus. XX KW Transposable Element; Nonautonomous; TE-X-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-16300 RA Bao W. and Jurka J.; RT "transposable elements from zebrafish."; RL Repbase Reports 8(12), 2179-2179 (2008). XX DR [1] (Consensus) XX CC This element inserted preferentially into CA minisatellite; TSD CC length is unknown. Sequences corresponding to DNA-2-21_DR and CC TDR18 is masked out. XX SQ Sequence 16300 BP; 4863 A; 2481 C; 2216 G; 4730 T; 2010 other; cacacacaca cacacacaca catacttgtt tttgtgcatt gtgggggtca cctgtcacgg 60 tggttacgta atggggaccg ccctttctga tgatgttaga gacataaaaa aaatagtttg 120 ccacacaaaa acacaaacta tttcataaaa tcacttcaga ttttggatat tccgcaaatt 180 tttttttaga cctgacacag tggtcacttg tggggacatt tcaggttttg tgtagaagtg 240 gggtccgcca catacacaac cgatttttag acaaagtggg gaccaattct acacacactt 300 tactggtttt gagacaaagt aaggaccagt tccagataca cattaccggt attgagtcaa 360 agtagggacc aggtctacac acacattact ggtattgagt caaagtaagg accagatcta 420 cacacacatt actggtattg agtcaaagta aggaccagat ctacacacac attaccggta 480 ttgagtcaaa gtaaggacca ggtctacaca cacattaccg gtattgagtc aaagtaagga 540 ccaggtctac acacacatta ctggtattga gtcaaagtaa ggaccagatc tacacacaca 600 ttaccggtat tgagtcaaag taaggaccag gtctacacac acattactgg tattgagtca 660 aagtaaggac caggtctaca cacacattac cggtattgag tcaaagtaag gaccaggtct 720 acacacacat taccggtatt gagtcaaagt aaggaccagg tctacacaca cattacyggt 780 attgagtcaa agtaaggacc aggtctacac acacattact ggtattgagt caaagtaagg 840 accagatcta cacacacatc actggttttg agtcaaagta aggaccagat ctacacacac 900 attaccgttt tttagtcaaa gtaaggacca gttctacaca cacattaccg gtttttaggc 960 aacgtgggga tcagatcagc acccttacta atattaaatt ttaaacctca atcaagtaat 1020 ttgacagtaa actgctttag aattttcatt tgaagagcag 
gtcagtttat aaaaaatgta 1080 atttgaatac agaaggaacc tcaagatcca aacaggaaat ggtccagaat agatttaata 1140 aatgttatat ttattataaa taatttccac aaaaccttta aacacaaaaa aaacagtaaa 1200 cacaaaatcc atgaaactct cttgcatgtc taaaagcagt tctcattggg tataacaagc 1260 ttccacataa ctcttccatg tctgcataca catttagtta agtgcaatag caaacaatat 1320 taaacggtta gttcacccca aaatgaagat tctgttatta attcctctcc ttcatattgt 1380 ttcaatccct tgctgtctat ggaggatgag gagttcttgc acttcatcta agttatcttc 1440 ctttgagtct gaagatgaat gaaggtctca ggggattaga gtgagatgag ggcaagtaat 1500 taataatata atgtacattt ttgggtaaac tcttaaaacc caccagtatg ttctatttct 1560 gtttctgtcg tattttaact atttgaaaaa aaaaaaacat taaaaatatt taaaaatata 1620 tttataaaga actgaacccc acaaatgcac agagaccttc atcatataac caagccacca 1680 aaactccaaa tagacatcat ttgttaagca ttaactctac attaatagta agcagtttat 1740 aaacacagct agatgctcta ttgttgactt ataagcatgt gtaatgtgct caatttttgt 1800 gttttcatac tttgtttata attcattttt cattactaaa tgaagtattg cattatttac 1860 aaactgtcta agaataattg gtgtttttta agatcattaa gaatgagtaa ataaatgatt 1920 aaaaaactat ataaatgaac atttatacat ttactattca ggcatataat cagggctaaa 1980 aaaaaaaaaa aagaaaaaaa agaaaatcca atcggttcac tctaagcaga atttatattt 2040 tttcggttct gttcttgcgt tgcttaatag acaaaatatt ccgaaaacag ttttctagaa 2100 aaaaaaaaaa aaaaaacttt atttaattac aaagtattgc agggctctag agtgctctgt 2160 ttttttaatg gtgcaactaa aaaatctata cgaggggtaa gattaaatga attttcaata 2220 agtatcatta aagatatatt tataatatga acttgctttt ggttattgat tgacacgctg 2280 tttaaaggtg ctacacacag ccccactcct tggtatcgag cgcaggcccg gcgctacatt 2340 tagatgttgt ataatatgaa atagcaataa aaattatgaa aattgttatg aaatagtttt 2400 tcagttagcc aamctaataa aaatatttag tacactgcac agtgcaatta ttatttaaat 2460 ttattatatc cattattatt aatattagta aaattgacaa aacacagaac tcttttgaac 2520 gtttgctttc attttgacat gctaaagcgc acttttacag ttttgtaaag aaacatccca 2580 ttcatgcagc agcagtcaag ttaggacact taaaaatgcc ttttgtgttt aaagatgaac 2640 ggcctgcgaa gtcaagttta cattgttata aattaaatag atatagccta gtgtgtaagt 2700 gaatttgctg cagatccacc tctcacatgt gctgcaggta gggtaaatga 
taatcttttt 2760 aaaaaattta ctttaggcta tactgctatt tgctcatgca aaattaaaca ccagaaaata 2820 tatcgtaggc ctatattaag agctaaatta cagttagcat cataattagt tcaattatgt 2880 gctgtaggct attgattgac caacaccccc tgtatacagt ttacccactg caggtctaca 2940 gttctgatgt gtattttgaa ttacttgttt taaatgacag taagacatgc tatatttagt 3000 ttttattgta aaatatacat tataatccag tacaaagtaa aataatattt attaatggcc 3060 tattggcagc gtatagccgt taagcttata gtctaagcat gtcattttat tattattaac 3120 ataaattaca aactagcatc cttatgattt tatttcgttt ctgtatattt ttatcaaacg 3180 aaaaatgcaa tgagttatac tttggtttta attttattag aaatttaata gccgtttagg 3240 caacactact tctacatgtc attggctctg attgcgtttt ttacagataa cctatttaca 3300 aagattttta ttaaacaagt gcactaccac aagatatttt tgtaggtgta aaaatctaaa 3360 aagtatcatt atcatattct tgccttattc atctttaata taaatataaa atttaacaca 3420 ttcattgatt aaactaggga tgctaacgat tactcgattg atttattgtc ggtaaccctt 3480 taaaaccgat agaccttatc gatgaccgat taagcatggg catttaataa gttatgcttt 3540 gctctttgag atgcgtccac gatttcacac attacataat caagtgtgcg cagtttaggc 3600 tacgttaccc tattttaggg agtcgtgcac tctcttgact ttgcatattg gttatgcaca 3660 tattggtcat acctattatg ctcagatctc gtttatccgc tggtgctgcc attaatccgc 3720 tggtcaagta atcaaaaagt aggctacatg acgtttaatt atgggcaggt tattaagacc 3780 aaacattggt tgtgcaaact gtattacgtt ctctaaaatt ttaaagtgtt ttaagctaca 3840 ttatcttttt atagactggc ataggcctaa atacgcatca cggccagaca tttatagttt 3900 atataaaaac gagcgtcacc gtcgcatttg ttcacatgct ttaaattaat gctttttgtc 3960 tcccaaattc atataataaa gagaacgata aaataacatt attaaccgtt taactgtaaa 4020 catttctgtt cagaacgatt aaagttgatc tttttttttc ggttttcgtt tctgttcctg 4080 aaaaatgtca tttgattctg tttttcgttt ccgttccttg aaccggtttg gagccctgca 4140 tataatacgt ttttctgcat gttaaataat gctttattaa ctcaacttca tgcagttttg 4200 tgatctaatc taaagtgagg aatattgatg ctttataaat cccttataaa tgacaattaa 4260 aggctcagaa tcattcattt attcatttat tttcattttg gttttctctc tttattaatc 4320 cggggtcgca acagcggatt gaaccaccaa cttatccagc acgtttttat gcagcggatg 4380 cccttccagc tgcaacccat ccctgggaca catccataca catacacact cattcacgct 
4440 catttactac ggaccatttt aggctaccca attcacctgt accgcatgta ggagcacctg 4500 gaggaaaccc acaccaacac agggagaaca tgcaaacttt acacagaaat gccaactgac 4560 ccagccaggg ctcgaaccag cgaccttctt gctgtgaggt gacagcacta cctactgtgc 4620 cactgcgttg ccaggctcag tatcaaataa acaataagta tttgcaatcg tacctaaaca 4680 aatacaatta ctgtaaattt taaacattgc tgaataacag gagtgtcgaa atacaacatc 4740 atttgataaa acaacaatat aataatttaa caatacatta cgtgtaaagc aattttatat 4800 ttagacaaga ttgcaatgat ttacttttca tttgatacag tttcatttga gcatcttgta 4860 ataagtagaa attaatagtc atttttttcc tgtatataat gtaactgagc cttatattgt 4920 catttataat ggatttataa agcatgaata gccctcgctt tagattagat cacaaaactg 4980 catgaagttg agttcataaa gcatttatta acatactaaa caacttatta tatgcctgaa 5040 taataagaat tatcaatgtt attttgaata gtttattaat catttacttg cacattctga 5100 attatctaaa aaaaaaaaca ccaactactt ttaaatatct ggtttgttaa tgatgtattt 5160 ttcattacta aattaagtat tgcattattt accaagtatg aaaacaatca ttaagcacat 5220 tataattgtg cttataagtc aataatattt gtaatttata tttataaata taatttataa 5280 actgcttact aatgtctatt aatgtagcgt taatgcttaa caaacgttca actaactatt 5340 tgctaatgtt tcataaataa ttcattgtgt gcagttatta tagtgttacc caacatttat 5400 atatcttatt aatcaggcat atagtaatgg ttactctgtc taataataaa tgctttatga 5460 actcaacttc atgcagtttt gtgagctaat gtaaagtgag gactattgat gctttataaa 5520 tcccttataa atgacaatta aaatctcagc tatattctaa acagaaaaaa agaaataaat 5580 taaaaggggg agtgatttat agctttagag tgatttatat atatatatat atatatatat 5640 atatggtgaa gcagtggcgc ggtaggtagt gctgtcgcct cacagcaaga aggtcgctgg 5700 gtcactggtt tgaacctcgg ctcagttggc atttctgtga gaagtttgca tgttctccct 5760 gcattcgtgt gggtttcctc cgggtgcaaa ggcacagtcc aaagacatgc tacaggtgaa 5820 ttgggaaggc taaattgtcc ctagtgtatg agtgtgacag gcccatatga caggctggcc 5880 ggaaggtgta aaaaagtcgt tatcatatgg acaagtctaa aactgaagga cttgttccaa 5940 aaagagccga ccccgcaatc atacgggact aaggccaaag aagatatata tatatatata 6000 tatatttttt ttttttttca ttaatttcaa gatacacaaa tgaaactatc taatgaaaaa 6060 taaatctttg caatcttatc aaaatataaa attactgtac agtgcaaaca tcacagaaac 6120 
actgaaatga atattgtcat attgttaatt gtgttgtttt gaaatgatgt tgtattttta 6180 ttttgcaatg tttaaacttt acagaatttt tttaataaga ttgcaaagat ttattgttca 6240 ttagatactg agccttaaat tgtcatttat aagggattta tcaactataa acagtcctta 6300 ggtcacaaaa ctgcatgaag ttgtgttatt atagcattta ctaacacaca tggttacaat 6360 tactatatgc ctgaatataa catgtataaa tgttcatttg aatagtttat taatcattta 6420 ataactcatt ctgaattatc ttaaaaacca ccaactactc ttatatacaa atagtttgta 6480 aataatgcaa tacataatat agtaatgaaa aatgaatcag taacaaagca tgaaaataca 6540 attaagtcaa gtcaagaata cagcgtttga agctgtattt ataaactgct tcctaaagtc 6600 tattaatgta gagttagtgc taaactaatg atgtatagtg tgtagttact atgaaagtgt 6660 tattcagatt ttttatctaa agagccgaga ttcagttttt aatataaaaa tccaaagctg 6720 accaatctct ggtttttaga gccaatggtt cagctttcac cacagcatgt acttcctgtt 6780 tgctccagga tcttctcttt atgggaccta tgggtggtta aaatataaaa aagtggtttt 6840 agttgcataa atgttcagtc atttttacaa catgtgctgt tatgttagtt taatattgtg 6900 atagaaaatg ctagaaaaat atgtcttgcc tattattaaa aatacttaag atggagatgg 6960 attaaaaaat acttcctatt aacttcaaaa ctggttaatg gggttacact ttattccata 7020 gtccagttta gacattcttc taactatgcg taactctgca actacgttaa ttatctcacc 7080 gtaaagtatt atgtattaga gtatttgcat actgttagaa tgttcaataa gactaacgct 7140 catctgatta tcagtagagc agtggttctc aaactgtggt acgtgtacca ctagtggtac 7200 gcgggcttcc ttctagtggt acgcggagga atgaaatatg tcatgtacat gctacatata 7260 ttttaaaatt tatcaaaaat gatgtatata atatgccata tatgacatat agcctatatt 7320 tctgaggtaa tctgccacgt tttttttatc tgagcagagt tgtagccgct ttactgagcc 7380 tactacgcta ctgtatttca atactgctca tttttggtgg tacttggaga gacaattttt 7440 ttctgaggtg gtacttgatg aaaaaagttt gagaaccact gcagtagagt attagcagac 7500 tgttagctca agtttagctc aaaacactac actgtaacac atatctgttc attaccagtt 7560 cctgtatttt gtgatttaca agtgtttttt tttttttttt tgtggtggtg aattgcatta 7620 tgggatgttg atctctgctc tgtgcacttt tgatgttgaa aattcaactc tacagtttaa 7680 caaagtgact tttactgaca ttttagtagt ttgaaataat ataatgtata agaaataata 7740 tatagagata aataagtctt tagaataaca gaaaatgtac cggcagttta ttacaaggtt 7800 tctgtagcat 
aatatacaac aacaacccaa acaaaaagtc actttttttt actttttaag 7860 attaaaagtt gttggaagaa agatcaagat cccataatgc aattcaaaag cagaaacaaa 7920 cagatttcat aaaaacatga atcacaaatt atgaaaaact gctaatttaa ttattattac 7980 ttatttttat taattttaga gtgtagagta ttagctgaca gttaaggtta cctcaataaa 8040 accaactctc attggtctca gtaaatagta cagcagactg ttaggttacg ctcaaaatta 8100 gtacaatatg actaactctt atataaattt agtagagcat tagcagtgtt gattctaaca 8160 gacagtctaa tggctgttag ttgacaagta gttgcagtta cttgtagtta gtagtagtag 8220 tagtagtaaa gtaaattatc aaaataaagt gtaaccagat atgcaattag tatatttttc 8280 caaagtattt agcaatattg tttaactaac atgaaatgag acatcaagtg caactgtact 8340 gaaatcatca tggaagtaat gtgcttctgt cctgttttac cctacatgtt atgtaaatga 8400 gcttgtgcat ttggacttgt acaataacac aagtcagtca caaattaact ttaatgatca 8460 tacctctctt ttttgtgaat gctgggtctg cttcactatc ttcacagaca ctgtaccatg 8520 gtgatgatac ttgaggggac tctgaaagta aaaatcaagt ctgagagttg acttctctgc 8580 ataaagctta ccaaaatatt tatttgattt tgtgttgttc tttttaatgg gtaaaagcgt 8640 gtgatggtag cttaaaaata gttttcaaac atagcacata catatcatga taaaaataaa 8700 cattctgaaa aaagaaaaat tctgatcaga aaaataaata atatttttac cttttagatt 8760 ggacttcacc tcctccaaaa tcacgatcca ctgtaacttt ctctgaatgt aaaaacccac 8820 acaaaaatac atcatttaaa acaatatggt atatgtttgc aaatacaaat gtgtatataa 8880 catccagaaa atctggcact gtattacatg tttgtgtgta aacccacgat ttgatcaaga 8940 ttccaccctt taaaatctgc tgagcgtctg ttttcaagtc ccagaaaaac tgagttgtgg 9000 gaacagagca gttctcttct gcagtcatac agtcataaca cacacataca tatgaagagt 9060 ttggttgcaa aatgcgataa acggcatttt ttgacatttg gattttatta ttggaaatca 9120 ctgttcagat gtaccttaat ctatgtgtga attgctgcca ttaataacat ttaagaatgx 9180 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9240 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9300 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9360 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9420 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9480 xxxxxxxxxx xxxxxxxxxx 
xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9540 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9600 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9660 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9720 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9780 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9840 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9900 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 9960 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10020 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10080 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10140 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10200 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10260 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10320 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10380 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxtatacagt tgaagtcaga 10440 attattagcc cccctgaatt attagacagc ttgtttatta tttccccaat ttctgtttaa 10500 cggagagaag atttttttca acacatttct aaacataata gttttaataa ctatttctaa 10560 taatcatttc txxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10620 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10680 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10740 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10800 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10860 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10920 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 10980 xxxxxxxxxx xatatatata tatatatata tatatatata tatatatata tatataaaag 11040 tatatacagt tgxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 11100 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxttttt tcttctggag 11160 aaagtcttat 
ttgttttatt tcagctagaa taaaagcagt tttaattttt tatgaaccat 11220 tttaaggtca atattattat ccctccaaxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 11280 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 11340 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 11400 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 11460 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxtg tatataatta aaaatatata 11520 ttttatatat aattttataa atattttata aaaaattatt ttatatattt ttatatataa 11580 tttataattt ttattttgaa aaatataatt tagacagtag tactcaaggg tccaggggtc 11640 tgttcttcgg acctcgctta aataatctaa gaatatatta cagatcctgg atcttttaat 11700 cttgataact catcttaggc taatttggtt cttcaaaaaa gtttgcgaat cagattaaac 11760 tgatctgaga ggtgttgtga ctgtgtgttg tgacagatct atcgatcctc aaaatcatta 11820 tcagcaatac aacgattggc tgacggcaca gcagcgtgat gacatctgat taatggacag 11880 ttatcaaaat tacatgaaat ccgtagtaaa cggttcgtta aatctgatat gcaataacag 11940 agttgtgagt cacgtgctgg agcagagcag ttctcttcct ctgtcatatg gtcatccttc 12000 aataaaagct gaatataaaa aaaatatata ttagttagac agcagtactc aagggtcgcg 12060 gtccccacta tgtcagacct tactgtgtgt gtattcagga ctaagtgtgt ttcacagctt 12120 ctctaacatg cctctggtcc ccacaatgtc agtgcatatg ttcaccaatt aggacacaca 12180 ctctgtttca tttctatatt ttgattaagt taaataatta caattaaata caccactact 12240 acaacaacaa ctactattac tactacaact aataatatta ataataataa taataataat 12300 aataattgtt attattatta ttataaacac acctcatcag agttgtgggt ctcctactgg 12360 agcagagcag ttctcttcct ctgtcatatg gtcatccttc catccttcaa taaaagctga 12420 ataaaaaaaa aaaaaatata tatattatat atatatatat atatatatat atatatatat 12480 atatatatta gttagacagc agtactcaag ggtcccggtc cccactatgt cagaccttac 12540 tgtgtgtgta ttcaggacta agtgtgtttc acagcttctc taacatgcct ctggtcccca 12600 caatgttaca ctagtgcata cgttcataaa ttaggacaca cactctgttt catttctata 12660 ttttgattaa gttaaataat tacaattaaa tgcactacta ctacaacaac aactactact 12720 actactamta ataataataa taataacaac aataataaac acacctcatc agagagttgt 12780 gggtctcgta ctggagcaga gcagttctct tcctctgtca tatggtcatc cttcaataaa 
12840 agctgaatat aaaaaaatat atattagtta gacagcagta ctcaagggtc ccggtcccca 12900 ctatgtcaga ccgtactgtg tgtgtattca ggactaagtg tgtttcacag cttctctaac 12960 atgcctctgg tccccacaat gttacactag tgcatatgtt caccaattag gacacacact 13020 ctgtttcatt tctagatttt gattaagtta aataattaca attaaatgca ctactactac 13080 aacaacaact actactacta ctactaataa taataataat aacaacaaca ttaaacacac 13140 ctcatcagag ttgtgggtca cgtactggag cagagcagtt ctcttcctct gtcatatggt 13200 catccttcaa tataagctga aaataaaaaa atatatatta gttagacagc agtactcaag 13260 ggttccggtc cccactatgt cagaccgtac tgtgtgtgta ttcaggacta agtgtgtttc 13320 acagcttctc taacatgcct ctggtcccca caatgttaca ctagtgcata cgttcaccaa 13380 ttaggacaca cactctgttt catttctaga ttttgattaa gttaaataat tacaattaaa 13440 tgcactacta ctacaacaac aactactact actactacta ataataataa taattattat 13500 tattattaaa tttaaacaca cctcatcaga gggttgtggg tctcctactg gagcagagca 13560 gttctcttct tctcgtcata tggtcatact tcaataaaag ctgaatataa aaaacaaaaa 13620 ataataatta gacagtagta ctcagggctc cyggtcccca ctatgtcaga ccgtactgtg 13680 tgtgtattga ggactaagtg tgtttcacag cttctctaac atgcctctgg tccccacaat 13740 gttacactat tgcatatgtt cataaattag gacacacact aagttaaata attacaatta 13800 aatgcactac tacagcaaca acatctacta ctactactac tactactaat aataataata 13860 ctaataaaca tacctcatca gagagttgtc ggagcagttt tcttctgcag tcatatgggc 13920 atccttcaat gaaagctatt taaaaacagc tttgttatat aagtcatgtc atgactataa 13980 agctttcatg acagtcttat gaacccccct ttaaagtaaa gcattaccca attaattaaa 14040 cttttttacc cattaagaca agaagaagat attaagagat tgatttcatt atctcttatg 14100 gatgatgcaa actctttatt tttgaacaga tttaccctta atctatatta acattaaaaa 14160 actcttagct ttgctgaatg tattaaggat taatgtatta agtgatcaaa atgtcaaacc 14220 tttctccaag gccaatcaca tgcgccatag tcaaacgtta tttcctctcc aaagaaaata 14280 tccctgattg caaaaaggca caaatgaggt tttctctgaa caacgatctt attgacagtg 14340 cagttagggt gcagatcgtc ctcatgatcg gcatcaatgc tgcaaaacta agagaaaaga 14400 gcaagcaaca ttttaattat gccctctcat agcaataaaa agctaaacta tggcatgaat 14460 acaaacacaa tgaatataat atatacatgt acatttgaat 
ggggaaaaca cttaaccacc 14520 tgttcttgtt ttaccatata aagtcaaaca taaagacagt atttttatta atatctcatc 14580 tatgctttgc ttccacaagc tctatcattt gtactcaacc acaaaacttc ctttaataaa 14640 tgtggacaac gcaaactctg acctaaaaga tttaatgaaa cgtgtagggt tcaaaaacta 14700 caaaaacctc acattagcat cctctctgtg tgtgaattat aaatacagta aaccttttga 14760 agtggatcag aacctttcct caatgttgtt ctaatactat tgaacacctg ttcttctttc 14820 ttaggacagt tttaaagaac tttttttcca cttcaattgt ttacaggtca cactttacaa 14880 tcaggtttta ttagttaatg tacttactaa catgacctag taatgaacaa tgcatgtaca 14940 gcatttatta atcatagttt aacatttact aatgcattat tcaaatctaa attcatgctt 15000 gtaccagtaa taatgcattg caataattaa tgcattgtgc gtgttcaact agtaatgaac 15060 aactttattt tcattaacta aagttaacaa acatgaacaa atactgtaat aaagtattat 15120 ttattgttca tgttagtaaa tgcattaacc agcattaact aatttaacct tattgtaaaa 15180 ggctaccagt tgacagcaaa taaataaata aaacacacac agtgaacatt tatgtacttt 15240 tatatatgtt ttacactaag ttggagttta ttataattat ttatctaaaa taaaacgttt 15300 ctacacgtca tagaacaata aattaagtac atttctagga ccagatagcc ccacatttta 15360 attataacct ctatattact acattaacgt ataaatagta tataatagtg ttatttcagg 15420 attaacgtta aatgtttaaa gcaattaacc atttaaaaag tacgatacaa gaactatgaa 15480 aacatacctt agtaaccttc tatatattcg gtttccagtg aatgggtctt gtcagtggag 15540 ctggtgatgt gtttctccgc atcttttaat ggttttatcc tccgtcgatc catttagccg 15600 cccggtgatt ggtgactgtc tgaccgcggc tcatgcactc caccccggcc tctgactcac 15660 gtagcaccgg gcagttagca cgttagcaca ttagcgcgag taacttttac tcgtgtaaac 15720 aagttaaatt acataccgaa ctgtagtttg tgtcttatca gtgttatatt aatacgtaat 15780 ttgaaacaac atcgattaaa acacccagga aattcgttaa aacatttaat taactatacc 15840 tgataactgt caaaatatgc aacataagta tttgtcatat aacctatttt attacattat 15900 gatggtgaaa aacattctaa cttacccaaa aattcgaata atttgatgat agatcttaat 15960 taagttgatt aaagtcggat tcatttatga gattgcgcat gtgcagtagg cagagtgtgt 16020 aagtgtgtgt cagggtgcag taccgccatc cgactacagg accgctatag agcagtgcgc 16080 cgacacacac ttacacactc aattcaggta aatggtcaag gtggggatca gtgaaaaaac 16140 tgcactaggc atatcagcaa 
cgccccctga gtcttttaaa ggcaaaacat agtggggacc 16200 gggcgatcgg tccccactat gtttttggtc cccactttgt gagtgtgcta tcatgctgag 16260 gtccccacca tgtaataaaa acaaacacac acacacacac 16300 // ID DIRS1a_DR repbase; DNA; ZEB; 5979 BP. XX AC . XX DT 11-FEB-2002 (Rel. 7.01, Created) DT 11-FEB-2002 (Rel. 7.01, Last updated, Version 1) XX DE DIRS1a_DR is a nonautonomous DIRS-like LTR retrotransposon - a DE consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; Nonautonomous; KW endogenous retrovirus; nonautonomous LTR retrotransposon; DIRSDR1; KW DIRS superfamily; reverse transcriptase RNase H; phage integrase; KW DIRS1_DR; DIRS1a_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5979 RA Kapitonov V.V. and Jurka J.; RT "DIRS1_DR, a family of DIRS-like endogenous retroviruses in RT zebrafish."; RL Repbase Reports 3(1), 2-2 (2003). XX DR [1] (Consensus) XX CC DIRS1a_DR is a subfamily of DIRS1_DR retrotransposons. CC There are ~100 copies of DIRS1a_DR in the genome, they are ~0.7% CC divergent from the consensus sequence. Therefore, this family CC retrotransposed in the zebrafish genome very recently. CC There is a 11% divergency between the DIRS1a_DR and DIRS1_DR CC consensus CC sequences. CC Presumably, DIRS1a_DR is a nonautonomous family because CC ORF1-ORF3, CC that are intact in DIRS1_DR, are corrupted by stop codons. The CC DIRS1a_DR was reconstructed based on multiple alignment of 7 CC copies. 
XX SQ Sequence 5979 BP; 1115 A; 1821 C; 1643 G; 1400 T; 0 other; gttccccttc ggggggaact tcagcactat aagtggattt gattgtaaaa tccacgcatt 60 gggaggttcg gttcagaagc tactcgtctg aaagagtatt gaacgggcca attaagaatg 120 aattggcagc gcaagcctgc gcaggtgagc ggcataagca atcaactgag tatataagct 180 cacctggcgc cagcagacgc tatccttttc gcttcagaga ctttctgatc gagtcgatga 240 gggttcctcc tgctgtgacc agcgattctg agcgaacgag agcattctcc cggtccagag 300 tgtgtacacg cagtggcaga cggtcgagct gggtttctcc cttgcctggc gttctttggg 360 tccggtcctc cagagcggtg cgtataaagt tgcaaacttc acagaaagag caacacagtc 420 gtgcagcacg tccttatcag gacggcgctc cgactgtgcg tttctggatg cggtggtttc 480 ctgtcctcgg atgatgggcg cgggcactgc gttacatgtc tgggggtcca gcatgttaat 540 gcgctgctcg cgggcagttc atgtcgtcat tgcgatgcca tgactgttgc gcaagatcgc 600 ggttagcctt tgcaaaaggg cgaaccaccc cagttgtccc ctgctctgca gcgggcactc 660 gggcagatct gagggtttca gcgagagata atccgtcgcc cacgggcccg cggacctccc 720 gctcctctaa gcgctccatc caagcttcgg gcgggggaag cgatccgtct aacagatggt 780 agcccttacg cccgatgaca ccggagacca gatgtccacc gcggcatcgg agggtgggct 840 ttcattgtcc gatgatgatc cagacccgct cgcccccttc gggctggtga gcgctgtcac 900 tacggatcct gaaacggaca tgttagccgt gctttcccgg gctgcttcgg ccgtgggttg 960 gagatggttt atcccccagc tccgcggccg gaccgactag aggggcgttc ccttcttccc 1020 ggaggtgcac agtaggctca cgcggtcttg taaagcactt ttttctgctc gtgctgcgtg 1080 ttcctccacc ctaaccactc ttgacggtga aacagccagg gggtatgtgg cgattcctca 1140 ggttgagtgc gcgatggcgg taaatccgcg cggcgcctct tcttggcggg gtccgcctcg 1200 tctcccatcc aaagcctgta agttatctac ctccctcgga gctagagctt acatagctgc 1260 gggccaggct gcttccgcct tgcatgcgat agccacctac cagcgctacc aagcgcaggc 1320 gctggccgag ctgcacgagg gtgggtccaa cccaggctta cgagcttcgc accgccgccg 1380 acttagctct tcggactact gagtccgctg cgtgtgcgtt ggggaggacg atgtccacat 1440 tagtggtcca ggagcgccac ccctggctga tatgcgcaaa gtcgacaaag tccgctttct 1500 tgacttcccc atatccccag ccaggttcgg cgacactgtc tgtgaattca cccaggaatt 1560 caaggcggtg agagagcagt tggtgggtga tgtcttatcg gcggtccgta gcccgctccg 1620 cccgccgtgc cattcatacc tgctcctcgc 
cgaaggcgcc cgcctacgag agctgctccg 1680 cccccgcaca cgcctccggc gaagcgagcg cgtcgggcac ctcggaagca ggcagccccc 1740 ctgcccagaa cgccgctaag tccggcaacg gaccgcgaag cgcccctggg acaggccatc 1800 tggagaagag ggaacttgct ctttccccgc tggagggcgg ggccccattt ccaacggcac 1860 tttttactgc catgaaaaca ttatgaaagg gcacttttca cttccccaga tgtgacagcc 1920 cgaaatctgc cagtctggga cgctatgctt tctagcttgc agattcggtg cgtttcgcca 1980 gtggctcacg agcgctggga ggacggtctc ctttctccca cccctcgagc ctcccctccg 2040 gagctcgggt ttggagtgag agcaaatatc tcacctccag cttttccgtg ggacccgcga 2100 gcttcccgga tcagcacacc cactccgcgc tgccccactg ctggtacgtc agcgattgta 2160 gcgatgagtc cattagcgag agctctgcct gcctggttag cgcgggccag ctcttcgcgg 2220 tggctcatac gcacaatcag actcggctat gcgattcagt tcgcgaaacg gccccccaag 2280 tcacgggtgt gtattcacca gggtcagccc cctgtccgcc cctgtcttgc gagaggagat 2340 tgctgtcctc ctggcgaagg atgcaatcga gccgctccct ccagccgaga tggagagcgg 2400 gttttacagc ccacgcttca tcgtgcccaa aaagagcggt gggtcacggc caatcctaga 2460 tctgcgcgtt ttgaaccgct gcctgcacaa gctgccgttc agaatgctca cgcagaagcg 2520 catcctccgg tgcgttcgtc ctctgggttg gtctgcagca ttagacctga tggacgcgta 2580 tttccatgtc tccactcttc ctcgccaccg acagtttctg cggtttgcgt ttgaaggtcg 2640 agcttggcaa tacaaagccc tccccttcgg gctctctctg tctccgcggg tcctcaccaa 2700 gcccgcggag ggtgcctcag cgccccttcg gctcgcgggc atctgcatac tcaatttctt 2760 tacgactggc tgatttttgc cctctctcgg agcagttgat tatgcacaga gacaaagtgc 2820 tctggcactt ccacctgtgg gggtttcagg tcaaccgaga aaagagcaaa ctcgcccccg 2880 tgcagaggat ctcttctctc gggctggagc tggactcggt caccatggca gcgcgcctct 2940 ccggagagcg cgctcagctg atgctgtact gtctgagaga gctcgacagt aaaatagtgg 3000 tcccactgaa actatttcag aggctcctgg ggcatatggc atccgcagcc gcttcatgcc 3060 gctcggatta ttctatatga gaccacttca gcactggctt cacgatcgag tccccagacg 3120 cgcatggcac gcgcgcgcac accgagtctc tgttactgcg ctgtgtcgcc gcgccctcag 3180 cccttggagc gacccctcgt tcctacaggc ctgggtgtct ctagaacagg cgtccagtct 3240 tgttgtcgtt tcagcagaca cttccaacac gggctggggg gctgtgcgtt gcgggcatgc 3300 ggctgcggac ctgtggaaag gtacccagtt gcattggcat 
atcgcctgga gctgttggca 3360 gtgttcctcg ctctccaccg tttttttccg gtgctggagc ggcaacacgt gctggtcagg 3420 acggacagta cggcggcggt ggcgtatatc agccgtatag ggggtatgcg ctctcgccgc 3480 atgtctcagc tcgcccgccg tctgctcctc tggagtcatc cgcggctgaa atcgctgcac 3540 gccattcata ttcaggcaag ctcaaccgtg cagccgatgc gctctcacgg cagccttgcg 3600 tcctggagaa tggagactcc accccgagtc tgttcagctg atatgggcgc gattcgggga 3660 agcccagatc gatctgttgc ttcccccgag atcgctcatt gccagttgtt ctttccctga 3720 ccgagtgctc tcggcacgga tgcactggct tacagctggc ctcggggcac gcgcaaatac 3780 gcgtttcccc cagtgagcct gctcgcgcag ttactgtgca aggtcaggga ggacgaggaa 3840 caggttgctg gttgcgcccc tctggctcaa ccggacctgg atgtcagagc tctccctcgc 3900 gatagccctc ccctggcagt cccttcgaga gagcacctac tctctcaggg acagggcacc 3960 acctggcacc ctcgccgatc tttgaagaga tttttagacg cgaggaagac ttaggtaacc 4020 tccgattgcg gtggctaata ccgtcactcg ggctagagcc ccctccccga gcgcgcctat 4080 gccctgacgt ggagtctatt cactgaatgg tgtgtctctc gctgagaaga cccccgtaat 4140 ttgccagatc agcgttgtgc tttctttacg ccgagagaag ttggagagca ggctgtcgcc 4200 ctccacactc aaggttacgt ggctgccatc tccgctctca taacgcggtg gctggcagca 4260 ccgtgggaac gcataacctc atcatccggt tcctcagggg cgttaagcga attaatccac 4320 cccgccccct ctcatgccct cttaggatct cgccctcgct acacaagccg cgtcagatcc 4380 cttcgatcct cgactcagta tctttctgtc cctgaagaca gctctgctgg tcgcgttgat 4440 atcgattaga gggtcgggga cccggaggca tttttcggtc agtgactcgt gcctgtaatt 4500 gggctggctt ctctcacgtc ctgagacccc gcccgcgata tgtgcccaag gttcctacca 4560 ctccgtttta atacgaggta gtgagcctgc aagcgctgcc ctcggaggag gcagacccag 4620 cccttcttta ttgtccagtt cgcgttttgc gtattatccg gaccgcactc agagtttaga 4680 tcatctgagc agctcttcgt ctgttatagc ggtcggcagc agggaagtgc cgtaccgaaa 4740 taagttccca ctagattgtg gatgcctttc tttcactatc agagccgaga tgagccgcgt 4800 cccccgagag cgcgtgcgca ctccactcgg agcttcgcat cctctcgagc gcgcgcacgc 4860 ggcgcccctc taacagacat ctgtagagct gcgggctggg tgacacccaa cacatttgca 4920 aggttttaca atctgcgagt ggagccggtt tcctcaaggg tattaggtaa cccttggtga 4980 ttgaggaaac aattcggtag ggtgttgaaa cacgcttgct gcgccatttt 
ccctaacacg 5040 gagatacgtg cgccttttta tctgtcagta aagttccccg tcaggtgagc cctgcagatt 5100 cctccgtggc ccccagcact gactcagcgg aggagtcact tgctggccca ctacgttgta 5160 ggtctgcccg ctggtcagcc cgcgttttgg gtaaaggtgc ctgctatgcg tggtccccac 5220 taggcgatcc catatgctta ttccgccacg gttaagtccc ccccctgggc ggacccgtgt 5280 cttccctccc cgctaaccac tcttttgcta tgcgtactcc ccctttttag ggctagtcca 5340 tatgtaaatt ctgccatcta tccccccttg ggtaacggat ggcctccgca gcgtcctccc 5400 tatcgggatt gcacgcttcc caacgtactg tcgtattttc ctagaattat ctagatgctc 5460 acgacttccc aaaaaatata tataaatccg taaaacttct gttgaagtag gataaattag 5520 ggccagggac acgttggagg accgcgcccc ccatgatgtg ggtgcgtcac gcttgcttga 5580 ctatctcctc atcgggggtg ttggtaaggt gcagtcatta tggcgctttc aatgggctcc 5640 caatgcgtgg attttacaat caaatccact tatagtgctg aagttccccc cgaaggggaa 5700 cgttcgaggt tactaaagta acccttcgtt ccccgaggag gacggaagca ctatactccg 5760 tcgccataat gactgtccct tagctgttga aagtctcttc agcttaaaaa ggatagcgtc 5820 tgctgcgcca ggtgagctta tatactcagt tgattgctta tgccgctcac ctgcgcaggc 5880 ttgcgctgcc aattcattct taattggccc gttcaatact ctttcagacg agtagccctc 5940 ctcggggaac gaagggttac tttagtaacc tcgaacgtt 5979 // ID DIRS-9_DR repbase; DNA; ZEB; 6431 BP. XX AC . XX DT 08-JAN-2009 (Rel. 14.01, Created) DT 08-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW reverse transcriptase RNase H; DIRS-9_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6431 RA Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 9(1), 15-15 (2009). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX FH Key Location/Qualifiers FT CDS 481..2838 FT /product="DIRS-9_DR_1p" FT /translation="MEPPPISNSARNQILSGADIDLISLLSPVAPPAAERQ FT VDCGEFSVTLKPSANTQSRTLTLAEFSIAFSRFTDIICSVFPHRRRELNDY FT MAIIAELALSYGGTHFYTYHKLFSAKCAXRVTQWNQCPYWGALDTELHNRV FT FLGCRNLTCAVCRSSLHPTTSCPXIIPPXDPTQXPSKSTSYVPXPPSRNIP FT SLLSTXSKPSLPNRDICQNFDXGRCHGXPCRYLHLCSYCGGAHAKIXCPIL FT RANNKKSKNYLSTPVNISNLARELNSHPDTNFSDFLISGLTHGFHPGVSAL FT PSHNLICPNLQSATSEPETVDYLIKKEIDNKFVIGPFKAPPFNISRISPIG FT VATRKFSGKKRLIXDLSXPHNSSFPSINSTIPLEEYTLNYHDIDQAISLIK FT IAGHNAWLAKVDISSAFKIMPIHPDFWHLFGIYWRSKFYFAVRLTFGCKSS FT PKIFDMMSEALCWILSNNYGIPYLIHLLDDFLIISPPSSPPAKHLAITQQV FT FADLGIPLAEEKTSGPSTSIEFLGINLDSHKFQASLPKEKIDRIISLSQIF FT LEKQMCTKRELLSILGHLNFAMRIIPQGRPFISHLLQLSTTVQGLEEIIIL FT SKPSRDELCLWISFLKQWNGCSFFYSDLTTSPIDINLYTDAAPSIGFGGYY FT KGHWFASTWPPQMFNSIPKDQCSSALFELYPIVAAAILWGDEWSTFSILIH FT CDNEATVHCINKXRSHSQALMPFLRRLIWISAKKQFIMIAEHVPGCKNQIA FT DSLSRFSLQKFRQLAPEADPHPTPVPPYSEMILP*" XX SQ Sequence 6431 BP; 1491 A; 2220 C; 915 G; 1756 T; 49 other; aaaacaaaaa cacgcascaa gccacwcccc cgcaagaccg cacccccagg aaacccagcc 60 gccaccgatg acgtcacacc tctcccacca ggactccaat caaacgaact caatccccaa 120 tctgtccctc ctcaacycta ttctacctct ttctcctggc ctccagcccc ctctagctct 180 cccattagcc ctcatctccc ttctacttca gctactcagt tcctcccttc taactctgct 240 cctcctgctc ttcctcaaca tcaatctttt cccccccctc tccaaccctc cgcttcatgc 300 tccatcccct tccaacctgc ttctcatccc tcctccttcc cctcttcttc actccccctc 360 gctcacacta acactaaccc tacccgsymc ttttctatcc ccacaccagt ttcttctaca 420 cgcccccctt tcactctgtc ttctgccacg cccctccctc cgccgaataa cgctctagct 480 atggaacctc cccccatctc taattcagca cgcaaccaaa tcctctcagg tgcggatatt 540 gacctcatct cactcctctc acccgtmgca ccccccgcgg cagaacgcca ggttgattgc 600 ggcgaatttt cagtaaccct caaaccgtca gctaacactc agtcacgcac cctaacctta 660 gccgaattta gcatagcctt ctcacgattc accgacataa tttgttccgt attcccccat 720 aggagacgcg agctaaatga ttacatggcc attattgccg agctcgcgct ctcctatggg 780 ggcacccact tttacactta ccacaarcta ttctccgcta aatgcgcamt gcgagttacc 840 cagtggaatc 
agtgtcccta ctggggggct ttggacactg agctccacaa cagggtmttt 900 ttaggttgcc gcaatctaac ctgygcggtc tgccgctcca gtctgcaccc cactacctcc 960 tgtcctttma ttatccctcc tyctgatcca actcagmcac cttctaaatc taccagctac 1020 gttcctcrcc cccccagtcg taacattcct tctcttcttt ctaccycttc taaaccctcc 1080 cttcctaacc gtgacatctg ccaaaacttc gacatkggca gatgtcacgg aawgccatgc 1140 agataccttc ayctgtgctc ctactgtggc ggcgcccacg ccaaaatart ctgcccaatc 1200 ytaagagcaa acaataaaaa atcaaaaaat tacttgtcga ctcctgtgaa tatttctaac 1260 cttgctcgtg aaytaaattc tcaccctgat actaactttt ctgattttct catttcaggt 1320 ctaacycacg gattccaccc aggtgtttca gctctccctt cwcataatct aatctgtcct 1380 aacctgcagt ctgcgacctc cgaacccgaa acagtcgatt atcttattaa aaaagaaatc 1440 gacaacaaat tcgtgatcgg accttttaag gctcctccat tcaatatttc acgcattagc 1500 cccattggcg tcgcaactcg aaaattttcc ggcaaaaaac gcctcatart ygayctttcg 1560 kccccacata attcctcttt ccctagcatt aacagcacga ttccactaga agaatatacg 1620 ctcaactatc acgacatcga tcaagcaatc tctcttatca aaatagccgg ccacaacgcc 1680 tggctagcca aagtagacat ctcttctgcc tttaaaatca tgccaatcca cccagacttc 1740 tggcaccttt ttggcattta ttggcgatca aaattctatt ttgcagtccg actaaccttc 1800 ggatgcaaaa gcagcccaaa aatatttgac atgatgtcag aagcattatg ctggattcta 1860 tccaataatt acggaattcc atacctcatc caccttctag acgattttct cattatttct 1920 cccccgtcat ctcctccagc caaacaccta gcgatcaccc aacaagtttt cgctgatctc 1980 ggaattcctc tagcagagga aaaaacttca ggtcccagta cttcaatcga atttctgggc 2040 attaatctag actcgcacaa attccaagca tccctcccca aagagaagat cgatcggatc 2100 atttctctat cccaaatctt cctcgaaaaa cagatgtgca caaaacgaga actcctatca 2160 attctcggcc atctaaattt cgctatgcgc atcattccac agggccgccc ctttatttca 2220 cacctccttc aactatccac cacagttcaa ggtttagaag aaataattat tctctctaaa 2280 ccaagtcgcg atgaactctg cttatggatc tctttcctta agcaatggaa cggctgttcc 2340 tttttctata gcgacttaac aacatccccc atcgacatta acctatacac agacgctgcc 2400 ccctctattg gtttcggcgg ctactacaaa ggacactggt ttgcctcmac atggccaccc 2460 caaatgttca attccattcc aaaagaccaa tgttcttcag ccctattcga actctacccc 2520 attgtcgcag cagccatctt 
gtggggggac gaatggtcta cttttagcat tctcattcac 2580 tgcgataatg aagccacagt gcattgcatc aacaaarggc gctcccactc ccaagcactt 2640 atgccatttt taagacgcct tatctggata tctgctaaaa aacaatttat catgattgct 2700 gaacatgtac ctggttgcaa aaaccaaatt gctgactctc tctctcgctt ctctttacag 2760 aaattccggc aattggcccc ggaagcggac cctcacccaa cgcctgtacc tccgtattca 2820 gaaatgatat tgccataaac cacccwcttc ataatctyca ccaaacttct ctatctctca 2880 tcctgcaagc aatagctcct agaaccctcc attcatacct cacagcatgg aattcrttca 2940 aacaattcca taytctacac caacttcctt tccctgattt ttctctcctc tctatcacyt 3000 ccttcgtatc ccaccttcac acygcaaatc acctacaagc cagttcaata aaaagctacc 3060 ttagcgggat ccagtttttt cacaaattaa ttcatgggtc tccttccgay gccatcacaa 3120 attcgcaaac ctccctcctt atcaagggta ttcagaaaaa ccaccctcac cagccctgat 3180 gccagacaac ccatcacact caaaatcctt acctcatgca tccacaccct tcgcaaaggg 3240 tatatttcca cccatacagc ccgcacccta gatgccatgt ttaayctagc attttttggs 3300 tttcttagat gttccgaatt aacagttaca tctaaattta acccatctac tccaccccac 3360 catctcagat ctagctttgc aagataagga aaccatctct ttccttatca aacaaagcaa 3420 aacagatcaa atccagaaag gacactctat ctacattttc gacatacctt cccccactcg 3480 cccattccaa accctcctag cmtatctata atctaagaaa atctcaagaa gctaaccctc 3540 tggccccgct ttttactgac gacgctaacc gtccagtaac tcgattctgg ttccaaaaac 3600 accttaaaga aattcttcgc ctatcaggtt tttccccaga gcctttttcc agccactcat 3660 tcaggattgg cgcagccact acagcagcct ctaacgggct ctcccacaat cagatccara 3720 cccttggtcg ctggtcttct gaagctttca aatcytacat acgcctcagt aaataccacc 3780 tcaaagaagc acaacaggct ctaaccagac ccccaccatc ctaattacag caactacccg 3840 caaggctcca actcacaaag gtacctaata gagccccacc tgctcgctag agccctcatc 3900 ctagccaata gccccaagct accagtagca accttatagg tcttcacttc cttcttgcat 3960 cgagtttctc cgcacctccc ttctctcctt tctagcgttg agttcctccg cacctccttc 4020 tcttccttca agcgttgaac gcttccgctc ttctctccat cccctccctc tttgaacctc 4080 cccccaacat ccttactccc tacccccttc cagcgtcgag tttctccgct acttttctct 4140 tttcagcgtc gagttgctcc gctactatcc cttccttcta gctccaagyt cttccgctac 4200 tcttatctct tcagcctaag taacacatct 
aaagcccgac tcccccggag tcaattaccc 4260 cctcaattac atccctccac aatcgacatc tttcttcttt tctagcgttg agttcctccg 4320 catctcctat cctaagttct atagcgttga gttyctccgc atctcctttc cttccttcaa 4380 gcgtcgaatg cctccgctct ttcctccatc ccctccctct ctctcccccc cccaacatct 4440 tcactcccta cccctttcca gcgtcgagtt tctccgcttc ttttcttttt tcagcgtcga 4500 gttgctccgc tactatcact tccttctagc gttgagtatt ccgctactct cttcagccta 4560 agctttgact cccccggagt cctgctcagc ccacctccta caggagtctc cacctccccc 4620 tccctatcta gactcctgta ggagcataat ctttcagctc taactcccgc agaggtcgct 4680 ccaagagcta cgactcccac ggagtcctcc ccctccccct ctctcctgcc ccggccaaat 4740 acgcatcttt cctaccttct agcgtcgagt tcctccgcat aggctcccgt cctatattcc 4800 ctagcgtcga gttcctccgc atctcctttt cttccttcca gcgtagaatg cttccgctct 4860 tttctctctc ccctcttata acccctctcc tctcccctac ccttaactcc caatccccct 4920 tccagcgttg agttccttcg ctacttttct ttcttcagcg tcgagtttct ccgctacctt 4980 actacattct agcgtcgagt tcctccgcta ctctttctat atttcaaagc atcactcacc 5040 caagctcaga ctcccccgga gtccccgccc agcccaccac ctacaggagt ctccacctcc 5100 ccctcccttc ccagactcct gtagtagccc aactctacag ctctaactcc cacggagttg 5160 accacagagc tccgactccc atagatttct tttctttacc tttccagcgt cgagttcctc 5220 cgcatctcat gctctctcct tcctagcgtc gagttcctcc gcatcctctt tacttccttc 5280 cagcgtcgaa tgcttccgct cttctctcca ccccctctac aacccctctc cccttcccac 5340 tccacacccc ttttcccagc gttgagttcc tccgctactt ttcttctttc agcgttgaat 5400 tactccgcta cttcctattt ctagcatcaa atctctctgc tactcttctt ccttccgtta 5460 ggcacccttc ccaaccctgc ctccccaaac ccgactccca cggagtcccc gaccactcct 5520 agaatcagtt acaacttctc ccacagtaac ctctctcact ttaacttata ttccagcagc 5580 cggatatagc actgaatctc ctgccttttg gggggttttt tcttcgaata cgcggctgct 5640 gtcccgagcg raaaacattt gcatttttgg ggagttctcg agatctacct gagctcaaac 5700 tcccctctcg ccctgctaac gggagggagc cccgggctcg aggatctcat gagctcgggg 5760 ctctctcccg ggacagcatg ccaaataagc tttattaatc atcagctaag tgtgaactct 5820 tgaagtgaag tttattcata aactaatttc gagaggatca cgtgcttatg attwatcacg 5880 gccggccccg tattagctaa tccgtaatca gcccaatcag 
atgattccta aagcactata 5940 aataacccga gtttttcact tcagtttatc ttcgtcttga agaamccccc cttccacccc 6000 ttcatcctcc tcctttacct gaaattcggg cggcacggtg gcccagtggc tagcactgtt 6060 gacctcacag caagaatacc gccggtccta cttcgatcgg accggtgagt gtttctgtgt 6120 ggagtttgca tgttctcccc gtgttcgcgt gggttttccc cgggttctcc ggtttcctcc 6180 caccatccaa agaacatwaa acatacccaa attgactaaa tcaaattatc acctaataca 6240 acctcagctt acacttctca cggckacaac ggcaggggag ttctcgagat ctacctgagc 6300 tcaaactccc ctctcgccct gccgacggga gggagccccg ggctcgagga tctcatgagc 6360 tcggggctct ctcccgggac agcatgccaa ataagcttta taaatcatca gctaagtgtg 6420 aactcttgaa a 6431 // ID DIRS1a_DR repbase; DNA; ZEB; 5979 BP. XX AC . XX DT 11-FEB-2002 (Rel. 7.01, Created) DT 11-DEC-2008 (Rel. 7.01, Last updated, Version 2) XX DE DIRS1a_DR is a nonautonomous DIRS-like LTR retrotransposon - a DE consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW endogenous retrovirus; nonautonomous LTR retrotransposon; gypsy; KW DIRSDR1; DIRS superfamily; reverse transcriptase RNase H; KW phage integrase; DIRS1_DR; DIRS1a_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5979 RA Kapitonov V.V. and Jurka J.; RT "DIRS1_DR, a family of DIRS-like endogenous retroviruses in RT zebrafish."; RL Repbase Reports 3(1), 2-2 (2003). XX DR [1] (Consensus) XX CC DIRS1a_DR is a subfamily of DIRS1_DR retrotransposons. There are CC ~100 copies of DIRS1a_DR in the genome, they are ~0.7% divergent CC from the consensus sequence. Therefore, this family CC retrotransposed in the zebrafish genome very recently. There is a CC 11% divergence between the DIRS1a_DR and DIRS1_DR consensus CC sequences. Presumably, DIRS1a_DR is a nonautonomous family CC because ORF1-ORF3, that are intact in DIRS1_DR, are corrupted by CC stop codons. The DIRS1a_DR was reconstructed based on multiple CC alignment of 7 copies. 
XX SQ Sequence 5979 BP; 1115 A; 1821 C; 1643 G; 1400 T; 0 other; gttccccttc ggggggaact tcagcactat aagtggattt gattgtaaaa tccacgcatt 60 gggaggttcg gttcagaagc tactcgtctg aaagagtatt gaacgggcca attaagaatg 120 aattggcagc gcaagcctgc gcaggtgagc ggcataagca atcaactgag tatataagct 180 cacctggcgc cagcagacgc tatccttttc gcttcagaga ctttctgatc gagtcgatga 240 gggttcctcc tgctgtgacc agcgattctg agcgaacgag agcattctcc cggtccagag 300 tgtgtacacg cagtggcaga cggtcgagct gggtttctcc cttgcctggc gttctttggg 360 tccggtcctc cagagcggtg cgtataaagt tgcaaacttc acagaaagag caacacagtc 420 gtgcagcacg tccttatcag gacggcgctc cgactgtgcg tttctggatg cggtggtttc 480 ctgtcctcgg atgatgggcg cgggcactgc gttacatgtc tgggggtcca gcatgttaat 540 gcgctgctcg cgggcagttc atgtcgtcat tgcgatgcca tgactgttgc gcaagatcgc 600 ggttagcctt tgcaaaaggg cgaaccaccc cagttgtccc ctgctctgca gcgggcactc 660 gggcagatct gagggtttca gcgagagata atccgtcgcc cacgggcccg cggacctccc 720 gctcctctaa gcgctccatc caagcttcgg gcgggggaag cgatccgtct aacagatggt 780 agcccttacg cccgatgaca ccggagacca gatgtccacc gcggcatcgg agggtgggct 840 ttcattgtcc gatgatgatc cagacccgct cgcccccttc gggctggtga gcgctgtcac 900 tacggatcct gaaacggaca tgttagccgt gctttcccgg gctgcttcgg ccgtgggttg 960 gagatggttt atcccccagc tccgcggccg gaccgactag aggggcgttc ccttcttccc 1020 ggaggtgcac agtaggctca cgcggtcttg taaagcactt ttttctgctc gtgctgcgtg 1080 ttcctccacc ctaaccactc ttgacggtga aacagccagg gggtatgtgg cgattcctca 1140 ggttgagtgc gcgatggcgg taaatccgcg cggcgcctct tcttggcggg gtccgcctcg 1200 tctcccatcc aaagcctgta agttatctac ctccctcgga gctagagctt acatagctgc 1260 gggccaggct gcttccgcct tgcatgcgat agccacctac cagcgctacc aagcgcaggc 1320 gctggccgag ctgcacgagg gtgggtccaa cccaggctta cgagcttcgc accgccgccg 1380 acttagctct tcggactact gagtccgctg cgtgtgcgtt ggggaggacg atgtccacat 1440 tagtggtcca ggagcgccac ccctggctga tatgcgcaaa gtcgacaaag tccgctttct 1500 tgacttcccc atatccccag ccaggttcgg cgacactgtc tgtgaattca cccaggaatt 1560 caaggcggtg agagagcagt tggtgggtga tgtcttatcg gcggtccgta gcccgctccg 1620 cccgccgtgc cattcatacc tgctcctcgc 
cgaaggcgcc cgcctacgag agctgctccg 1680 cccccgcaca cgcctccggc gaagcgagcg cgtcgggcac ctcggaagca ggcagccccc 1740 ctgcccagaa cgccgctaag tccggcaacg gaccgcgaag cgcccctggg acaggccatc 1800 tggagaagag ggaacttgct ctttccccgc tggagggcgg ggccccattt ccaacggcac 1860 tttttactgc catgaaaaca ttatgaaagg gcacttttca cttccccaga tgtgacagcc 1920 cgaaatctgc cagtctggga cgctatgctt tctagcttgc agattcggtg cgtttcgcca 1980 gtggctcacg agcgctggga ggacggtctc ctttctccca cccctcgagc ctcccctccg 2040 gagctcgggt ttggagtgag agcaaatatc tcacctccag cttttccgtg ggacccgcga 2100 gcttcccgga tcagcacacc cactccgcgc tgccccactg ctggtacgtc agcgattgta 2160 gcgatgagtc cattagcgag agctctgcct gcctggttag cgcgggccag ctcttcgcgg 2220 tggctcatac gcacaatcag actcggctat gcgattcagt tcgcgaaacg gccccccaag 2280 tcacgggtgt gtattcacca gggtcagccc cctgtccgcc cctgtcttgc gagaggagat 2340 tgctgtcctc ctggcgaagg atgcaatcga gccgctccct ccagccgaga tggagagcgg 2400 gttttacagc ccacgcttca tcgtgcccaa aaagagcggt gggtcacggc caatcctaga 2460 tctgcgcgtt ttgaaccgct gcctgcacaa gctgccgttc agaatgctca cgcagaagcg 2520 catcctccgg tgcgttcgtc ctctgggttg gtctgcagca ttagacctga tggacgcgta 2580 tttccatgtc tccactcttc ctcgccaccg acagtttctg cggtttgcgt ttgaaggtcg 2640 agcttggcaa tacaaagccc tccccttcgg gctctctctg tctccgcggg tcctcaccaa 2700 gcccgcggag ggtgcctcag cgccccttcg gctcgcgggc atctgcatac tcaatttctt 2760 tacgactggc tgatttttgc cctctctcgg agcagttgat tatgcacaga gacaaagtgc 2820 tctggcactt ccacctgtgg gggtttcagg tcaaccgaga aaagagcaaa ctcgcccccg 2880 tgcagaggat ctcttctctc gggctggagc tggactcggt caccatggca gcgcgcctct 2940 ccggagagcg cgctcagctg atgctgtact gtctgagaga gctcgacagt aaaatagtgg 3000 tcccactgaa actatttcag aggctcctgg ggcatatggc atccgcagcc gcttcatgcc 3060 gctcggatta ttctatatga gaccacttca gcactggctt cacgatcgag tccccagacg 3120 cgcatggcac gcgcgcgcac accgagtctc tgttactgcg ctgtgtcgcc gcgccctcag 3180 cccttggagc gacccctcgt tcctacaggc ctgggtgtct ctagaacagg cgtccagtct 3240 tgttgtcgtt tcagcagaca cttccaacac gggctggggg gctgtgcgtt gcgggcatgc 3300 ggctgcggac ctgtggaaag gtacccagtt gcattggcat 
atcgcctgga gctgttggca 3360 gtgttcctcg ctctccaccg tttttttccg gtgctggagc ggcaacacgt gctggtcagg 3420 acggacagta cggcggcggt ggcgtatatc agccgtatag ggggtatgcg ctctcgccgc 3480 atgtctcagc tcgcccgccg tctgctcctc tggagtcatc cgcggctgaa atcgctgcac 3540 gccattcata ttcaggcaag ctcaaccgtg cagccgatgc gctctcacgg cagccttgcg 3600 tcctggagaa tggagactcc accccgagtc tgttcagctg atatgggcgc gattcgggga 3660 agcccagatc gatctgttgc ttcccccgag atcgctcatt gccagttgtt ctttccctga 3720 ccgagtgctc tcggcacgga tgcactggct tacagctggc ctcggggcac gcgcaaatac 3780 gcgtttcccc cagtgagcct gctcgcgcag ttactgtgca aggtcaggga ggacgaggaa 3840 caggttgctg gttgcgcccc tctggctcaa ccggacctgg atgtcagagc tctccctcgc 3900 gatagccctc ccctggcagt cccttcgaga gagcacctac tctctcaggg acagggcacc 3960 acctggcacc ctcgccgatc tttgaagaga tttttagacg cgaggaagac ttaggtaacc 4020 tccgattgcg gtggctaata ccgtcactcg ggctagagcc ccctccccga gcgcgcctat 4080 gccctgacgt ggagtctatt cactgaatgg tgtgtctctc gctgagaaga cccccgtaat 4140 ttgccagatc agcgttgtgc tttctttacg ccgagagaag ttggagagca ggctgtcgcc 4200 ctccacactc aaggttacgt ggctgccatc tccgctctca taacgcggtg gctggcagca 4260 ccgtgggaac gcataacctc atcatccggt tcctcagggg cgttaagcga attaatccac 4320 cccgccccct ctcatgccct cttaggatct cgccctcgct acacaagccg cgtcagatcc 4380 cttcgatcct cgactcagta tctttctgtc cctgaagaca gctctgctgg tcgcgttgat 4440 atcgattaga gggtcgggga cccggaggca tttttcggtc agtgactcgt gcctgtaatt 4500 gggctggctt ctctcacgtc ctgagacccc gcccgcgata tgtgcccaag gttcctacca 4560 ctccgtttta atacgaggta gtgagcctgc aagcgctgcc ctcggaggag gcagacccag 4620 cccttcttta ttgtccagtt cgcgttttgc gtattatccg gaccgcactc agagtttaga 4680 tcatctgagc agctcttcgt ctgttatagc ggtcggcagc agggaagtgc cgtaccgaaa 4740 taagttccca ctagattgtg gatgcctttc tttcactatc agagccgaga tgagccgcgt 4800 cccccgagag cgcgtgcgca ctccactcgg agcttcgcat cctctcgagc gcgcgcacgc 4860 ggcgcccctc taacagacat ctgtagagct gcgggctggg tgacacccaa cacatttgca 4920 aggttttaca atctgcgagt ggagccggtt tcctcaaggg tattaggtaa cccttggtga 4980 ttgaggaaac aattcggtag ggtgttgaaa cacgcttgct gcgccatttt 
ccctaacacg 5040 gagatacgtg cgccttttta tctgtcagta aagttccccg tcaggtgagc cctgcagatt 5100 cctccgtggc ccccagcact gactcagcgg aggagtcact tgctggccca ctacgttgta 5160 ggtctgcccg ctggtcagcc cgcgttttgg gtaaaggtgc ctgctatgcg tggtccccac 5220 taggcgatcc catatgctta ttccgccacg gttaagtccc ccccctgggc ggacccgtgt 5280 cttccctccc cgctaaccac tcttttgcta tgcgtactcc ccctttttag ggctagtcca 5340 tatgtaaatt ctgccatcta tccccccttg ggtaacggat ggcctccgca gcgtcctccc 5400 tatcgggatt gcacgcttcc caacgtactg tcgtattttc ctagaattat ctagatgctc 5460 acgacttccc aaaaaatata tataaatccg taaaacttct gttgaagtag gataaattag 5520 ggccagggac acgttggagg accgcgcccc ccatgatgtg ggtgcgtcac gcttgcttga 5580 ctatctcctc atcgggggtg ttggtaaggt gcagtcatta tggcgctttc aatgggctcc 5640 caatgcgtgg attttacaat caaatccact tatagtgctg aagttccccc cgaaggggaa 5700 cgttcgaggt tactaaagta acccttcgtt ccccgaggag gacggaagca ctatactccg 5760 tcgccataat gactgtccct tagctgttga aagtctcttc agcttaaaaa ggatagcgtc 5820 tgctgcgcca ggtgagctta tatactcagt tgattgctta tgccgctcac ctgcgcaggc 5880 ttgcgctgcc aattcattct taattggccc gttcaatact ctttcagacg agtagccctc 5940 ctcggggaac gaagggttac tttagtaacc tcgaacgtt 5979 // ID Tc1-4_DR repbase; DNA; ZEB; 1572 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE Tc1-4_DR is an autonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; TIR; KW Autonomous DNA transposon; TA target site; Tc1 superfamily; KW Dr000076; Dr000078; Tc1-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1188-2 RA Bao Z.; RT "Dr000076, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1424-614 RA Bao Z.; RT "Dr000078, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [3] RP 1-1572 RA Kapitonov V.V. 
and Jurka J.; RT "Tc1-4_DR, an ancient Tc1-like autonomous DNA transposon from RT zebrafish."; RL Repbase Reports 2(6), 23-23 (2002). XX DR [3] (Consensus) XX CC Tc1-4_DR copies are flanked by the TA target site duplications CC generated upon their integration in the genome. CC Tc1-4 has perfect 51-bp terminal inverted repeats that belong to CC ~200-bp imperfect TIRs. CC There are approximately 1500 copies of Tc1-4_DR harbored CC by the zebrafish genome, they are ~13% divergent from CC the consensus sequence. CC The reconstructed 340-aa transposase, Tc1-4_DRp, is encoded by CC this transposon (positions 350-1369). XX FH Key Location/Qualifiers FT CDS 350..1369 FT /product="Tc1-4_DRp" FT /translation="MGRGSPVCQQICEKIIEMFKNNVPQRKIGRHLDISPS FT TVHNIIKRFKESGGISVHKGQGCKPKLNNRDLRSLRRHCIKNRHSSISDIT FT TWAQDYFGKPLSSTTIHSYIHKCQLKLYCAKRKPYVNSVQKRCRLLWARRH FT LGWTITQWKCVLWSDESVFQVFFGRNGRRVLRTKEEKDHPDCYQQQVQKPG FT SVMVWGCVSALGKGNLHFCDGTINAEKYIEILEHNMLPSRRYIFQGRPCIF FT QQDNAKPHSAHITKSWLRRKRIQVLDWPVCSPNLSPIEKVWCILCGKMLQR FT RPCTVAHLKTCLQEEWDKITPETLHHLVSSVPKRLLSVVKRNGNITKW" XX SQ Sequence 1572 BP; 516 A; 294 C; 326 G; 436 T; 0 other; caaccccaaa tcagaaaaag ttgggacagt atggaaaacg caaataaaaa agaaaatagt 60 gatttccaaa tttactttga cttgtatttc attgcagaca atatgaacac aaaatatttc 120 atgttttgtt tgtggtcaac ttcatttcat ttgtaaatat acatcctttc ctgtcattca 180 gacctgcaac acattccaaa aaatgggaca ggagcaattt agggctagta atcaggtaaa 240 ttggttaaat aatgatgtga tttgaaacag gtgatgtcaa caggtgattg taattatgat 300 ttggtacaaa agcagcatcc aagaaaggtc tagtccttta ggagcaaaga tgggcagagg 360 atcgccagtt tgccaacaaa tatgtgagaa aattattgaa atgtttaaaa acaatgttcc 420 tcaaagaaag ataggaagac atttggatat ttcaccttca acagtgcata acataattaa 480 aagattcaag gaatctggag gaatttcagt gcataaagga caagggtgca agcctaagct 540 gaacaaccgt gatctccgat ccctcaggcg gcactgcatc aagaatcgtc attcatctat 600 aagcgatatc accacatggg ctcaggacta ctttggcaaa cctttgtcaa gtaccacaat 660 acatagttac atccacaaat gccagttaaa actgtactgt gccaaaagga agccctatgt 720 taacagtgtc cagaagcgct 
gtcgacttct ctgggctcgg aggcatctgg gatggaccat 780 cacacagtgg aaatgtgtac tgtggtcaga tgaatcagta tttcaggtat tttttgggag 840 aaatggacgc cgtgtgctcc ggaccaaaga agaaaaggat catccagact gttaccagca 900 acaagtccaa aagccagggt ctgtcatggt atggggttgt gtcagtgccc ttggcaaagg 960 taacttgcac ttctgtgatg gcaccattaa tgctgaaaag tacatagaga ttttggagca 1020 caatatgctg ccttcaagaa gatatatttt ccagggacgc ccatgcatat ttcaacaaga 1080 caatgcaaaa ccacattctg cacacattac aaagtcctgg ctgcggagga agaggataca 1140 ggtacttgac tggcctgtct gcagtcccaa cctgtctcca atagagaaag tgtggtgcat 1200 tttgtgtggc aaaatgctac aacgaagacc ctgtactgtt gcccacctta agacttgttt 1260 gcaggaagaa tgggacaaaa ttacacctga aacacttcat cacttggtgt cttcagtccc 1320 taaacgtctt ttaagtgttg tgaaaaggaa tggcaacatt acaaagtggt aaatgcttta 1380 ctgttccaac ttttttaaaa tgtgttgcaa gaaccaaaat tgaaatacgt gtttatttta 1440 aaaaaaaata atcatgagga acacattaaa taatgtttgt tgtattgtct gcaatgaaat 1500 acaagtcaaa gtacattaac tttttttatt tgcgttttcc atactgtccc aactttttct 1560 gatttggggt tg 1572 // ID LOOPERN4_DR repbase; DNA; ZEB; 917 BP. XX AC . XX DT 11-JUL-2002 (Rel. 7.06, Created) DT 11-JUL-2002 (Rel. 7.06, Last updated, Version 1) XX DE LOOPERN4_DR is a nonautonomous DNA transposon - a consensus. XX KW DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; Dr000048; LOOPERN4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 917-64 RA Bao Z.; RT "Dr000048, an unclassified and uncharacterized repeat."; RL http://www.genetics.wustl.edu/fish_lab/repeats/. XX RN [2] RP 1-917 RA Kapitonov V.V. and Jurka J.; RT "LOOPERN4_DR, a nonautonomous DNA transposon from zebrafish."; RL Repbase Reports 2(6), 19-19 (2002). XX DR [2] (Consensus) XX CC About 1000 copies of LOOPERN4_DR are expected to populate the CC zebrafish genome. LOOPERN4_DR copies are ~15% divergent from the CC consensus sequence. 
CC This element is characterized by 11-bp terminal inverted repeats CC and putative TTAA target site duplications. CC Its classification is not very certain yet, although it is CC expected to be a member of the piggyBac/Looper superfamily. XX SQ Sequence 917 BP; 284 A; 174 C; 185 G; 274 T; 0 other; atggcaccta tgatgaaaat caacttttgt aagctgtttg aacagaactg tgtgtaggtt 60 tatgtgtgtc cacagtcata ttggagtgat ataaacccaa caagtatctt tattaaaatt 120 tcctgacgtt aaaataggat ccaaatccca gtgattttga ggcccaccgc aacgtgacca 180 ttaggagtgc ggttttcccc gcccaccgaa ttgattgaca ggcgccatgt ctctataata 240 acatgtatac acatgtccac agaacatttt ttgcaaagaa actgggatta aaacatctgt 300 tacaactctc tgtgatctgc tccttaataa ttagttttat aagttttaaa acgtgttttt 360 aaaacagtgc atgtttgtaa taaagacagt aaaattgcta tgtaattctt aaccgctata 420 atcaccacgg ccgcatggtg tcagtaaatg cgcataagtt tgtaaagtta aataaatgtg 480 tgtgtgtgtg tgtgtagtgc atgacaaact gtgtgtactg caaacggcat ttgtgtgtga 540 ctcatcattt cagaaaggct tgaataaact ccaccacaaa tacatcaaat aaacttactt 600 ggtatttttg actaatgagc tgtatttcag cttcatccgt gagtctgtct ctgtcactga 660 ctgctgttta tctgacgtaa cgcatgatga gaagcagaca tgcacgtggg aacggtgggc 720 ggggagaagc agctcatttg catttaaagc cacaggctac aaaaacagct acactgtcct 780 cagacaccaa aatgggcaga ttctgcaggc tataataaat aatctgatgg gtattttgag 840 ctgaaacttt acagacacat tctggagaca ccaaagtctt atcttacatc ttgtaaaaga 900 ggtaaaatag gtgccct 917 // ID DIRS-4_DR repbase; DNA; ZEB; 6796 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 09-MAR-2009 (Rel. 13.1, Last updated, Version 2) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW reverse transcriptase RNase H; phage integrase; DIRS-4_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6796 RA Bao W. 
and Jurka J.; RT "Families of DIRS-like retrotransposons in zebrafish."; RL Repbase Reports 8(10), 1271-1271 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 523..3816 FT /product="DIRS-4_DR_1p" FT /translation="MEGTNTPALPETPAQTQQQTQINTEAPIRGRRPIRST FT VTRTHRRTQSPSPNRNLPSPASSYASARSSSITSNKMTVSELRQSLTNAGI FT SIPTRCNKSELLKLYEAIPSPTPPPQDSRPTRSRHTPYPQPSATQHSRNPP FT GPPKKATKKTNKKQPQATGQXAPSTNQHTVNPPDNYATPGLPTPLLWPPAP FT QSSENSSPTLQAIPPTLNPPQFSLSSNLPHSSTQLIPNLSSNLPHSSTQLI FT PTQSFPTSSNALPLPANFSSTNPPFFPSTSLQAPTSITNPPQQNAFCTNTS FT SARAPFTLATATPLPIPHNAPVLEPPQISNTVRNLILSGADIDLSTLLSPI FT APPSADRQVDCGEFTITLKSPVSSQPRTLTIAEFHVAFSRYTDTICSVFPH FT RRRELNDYMAIISELALSYGGTHFYTYHKLFSAKCAIRVAQWNQCSYWGAL FT DTDLHNRVFLGCRNLSCAVCRSNLHPTTSCPFIIPSTEKELQTPRSTSYVP FT RPSTSAIPALLPPPSSQNPPSSLACNNFNAARCFRHPCKYLHICSYCGGAH FT ARVVCQVWKANKKHRSYLSTPVNISNLYHELCMHPDPNFSEFLISGLSNGF FT HPGVSTLPSYNLACPNLQSANAEPEVVEQLIKKEIDNKFMIGPFLAPPFST FT YRVSPIGVATRKFSGKKRLIIDLSSPHNSAYSSVNSIISPDEFSLNYHDID FT QAISLIKLVGRDAWLAKVDITSAFKIMPLHPEFWHLFGINWKSQFYFAVRL FT TFGCRSSPKIFDMLSEALCWILANNYGIPHVVHLLDDFLIISPPNSPPAKH FT LEITKAVFAKLGIPLAEEKTAGPSTFIEFLGINLDSNKFQASLPKEKVDRI FT ISLSSIFLEKQECSKRELLSILGHLNFAMRIIPQGRPFVTHLLQLAASVQS FT LEENISLSDPCRNELSLWISFLKCWNGCSFFYSDLISSPVDIHLYTDAAPS FT IGFGGYYQGRWFASDWPPQMLEVPSHQYSSALFELYPIVVATLLWGDEWSA FT SSILIHCDNEAVVHCINRGRSHSPALMPLLRRLIWTSAKKQFILTAVHVPG FT FHNQIADSLSRLHFQKFRELAPEAEQHPTPIPPYSEMIFQ*" FT CDS 3825..4916 FT /product="DIRS-4_DR_2p" FT /translation="MHDLHQASISLIMQAVAPRTLQAYLTAWKTFKHFHSL FT YNTTFPNFSLLTITSFITYLHSHKHIQANSIKSYLSGIQFFHKLMYGSSSE FT SITNSQTSLLIKGIQKTRPPLPDTRLPITHNILAKCISTLRKGYFSFHTDH FT TLDAMFILAFFGFLRCSEFTVTSKFDPSIHPTIADLTLIDEETIAFLIKQS FT KTDQSRKGHYIYIFNIPSPTSPFQTLLAYTHYRKTLSASPLDPLFIDDTHH FT PVTRFWFQKHLKYVLTNSGFPSESYSSHSFRIGAATTAAHKGLSQQHIQTL FT GRWSSDAFKTYIRLSHSHLREAQRTLTSRCSYPSGQRHEPSTRKEHNPAIP FT ASRGQRNYQVSQGRGHDPAI*" XX 
SQ Sequence 6796 BP; 1875 A; 2179 C; 1015 G; 1725 T; 2 other; aagtgaagtt tttaaactaa tttcgagagg agcacgtgat ataattgacc gcagctggcc 60 gccwatctac actcattagt tagccaatca gatctattcc aaattactat aaatagccta 120 gctagatatt actcccttac cttcgttttc cgaagaacaa ggacaaaccc tgctcctaac 180 aaactaccga aaccctcgac aacaaacaac aacaacaaca acgacaacag ctacaacaac 240 agctacaaca acagctacaa caacagctac aacaacagct acaacaacag ctacaacaac 300 agctacagct acaacaacag ctacaacaac aacaacaaca acaacaacag caacaacaac 360 agcaacaaca gcctacatca acaataacag ctaacgcttc aacaacaaca gcttcaacaa 420 caacagctac tacaacaaca gctacaacaa aacagcgaca acaacagaaa gaactcacaa 480 cctaaacaga acaatccaac atcaaagcca tcaacaagca acatggaagg aaccaacaca 540 ccagctctcc cggaaacacc agcccaaaca caacaacaaa cccaaataaa caccgaggct 600 cctatcagag gccgaagacc catccgctcc acagtcacaa gaacccatcg ccgcacgcaa 660 tctccatctc caaaccgcaa cttaccgtct cccgcttcat cctacgcctc tgcaagatct 720 tcatccatca catccaacaa aatgactgtt tctgaactcc gccagtcact cacaaacgcc 780 ggaatttcca tccccacccg ctgcaataaa tccgaacttc tgaaactgta cgaagccatc 840 ccgtcaccaa ctccgcctcc ccaagacagt agaccaactc gctcccgcca caccccctat 900 ccacaaccct ccgctactca gcactcaaga aacccccctg gaccacccaa gaaagcaacc 960 aagaaaacta ataaaaagca acctcaagct acaggacaga magcaccttc taccaaccaa 1020 cacacagtga atccaccgga caattatgcc actccaggac ttcccacccc cctcctttgg 1080 cctccagccc cacaatccag cgaaaactcc agtccgactc ttcaagcaat tcccccgact 1140 ctcaaccctc ctcagttctc tctttcttct aatctccctc attcttcaac tcaacttatt 1200 cccaatcttt cttctaatct ccctcattct tcaacccaac ttattcccac tcaatccttt 1260 cctacaagct ccaacgctct ccctctgcct gctaattttt catctactaa tccccccttt 1320 tttccctcta catccctcca agcacccact tccattacta accctcccca acaaaatgct 1380 ttctgtacta acacatcttc cgcacgagcc cccttcaccc tagccacagc cacacccctt 1440 cccattccgc ataacgctcc agtcctggaa ccacctcaga tctccaacac agtcaggaac 1500 ctcatcctat caggtgcaga catagacctc tctacactcc tttcacctat tgcacctccc 1560 tcggcagatc gacaggtgga ttgcggcgaa ttcaccatta cacttaaatc accagtcagc 1620 tctcaacctc gcacactcac aatagccgaa 
ttccacgtag ctttctcacg ttatacagac 1680 accatctgct ctgtctttcc ccataggagg cgcgagctga acgactatat ggctatcatt 1740 tcggagctcg cactctccta tgggggaacg catttctata catatcacaa attattttca 1800 gcaaaatgcg ctattcgcgt tgctcaatgg aatcagtgtt cttattgggg ggctttggac 1860 actgatctcc ataacagagt ttttctagga tgccgcaatc tttcctgcgc ggtctgccgc 1920 tcaaaccttc acccaaccac ttcctgtccc ttcataatcc cctccactga gaaagaacta 1980 caaaccccaa gatccactag ttacgtaccc cgcccttcta cctctgctat ccctgctctt 2040 ctcccccctc cctcctctca aaaccctcca tcatctctag cttgcaataa ctttaacgca 2100 gccagatgtt tccgccaccc ttgcaaatac ttacacattt gcagttactg cggtggcgct 2160 catgctcgag tggtctgcca agtgtggaaa gcaaataaaa aacatagatc ctatttgtcg 2220 actcctgtca atatttctaa tctttaccat gaattatgca tgcaccctga tcctaacttt 2280 tctgaatttc tcatttcagg tctgtctaat ggattccacc ccggtgtttc gactcttcct 2340 tcctataacc tcgcatgtcc taaccttcaa tctgctaacg ctgaaccaga agtggtggag 2400 caattaataa agaaagagat cgataataaa tttatgatcg gtccctttct tgcccccccg 2460 tttagcacct atcgagtcag cccaattgga gtagcgacca gaaaattttc gggcaaaaaa 2520 cggctaatta tcgacctgtc ttctccccat aattccgcct attcaagtgt caacagcata 2580 atttcacctg acgaattctc tctgaattac cacgatatag accaagccat ttctttaatt 2640 aaactcgtcg gacgcgacgc ctggctcgcg aaagtagaca tcacgtcagc tttcaaaatt 2700 atgccattgc atcccgagtt ctggcatctc tttggcatta attggaaatc ccaattctac 2760 tttgcagtcc gtttaacctt cggctgcaga agtagcccca aaatcttcga catgctttca 2820 gaagcattat gctggatcct cgctaacaat tacggcattc cgcacgtagt ccacctacta 2880 gatgatttcc tcataatttc ccctccaaat tccccacctg ctaaacacct agagattacc 2940 aaagcagtgt ttgccaaact cggcatccct ctagctgaag aaaaaaccgc cggccccagc 3000 accttcatag aattcttagg catcaatttg gactctaaca aatttcaagc atctttaccc 3060 aaagagaaag tcgatcgcat catttctcta tcttccatat ttttggagaa acaagaatgt 3120 tctaaacgcg aactgctgtc aatattagga catttaaatt tcgccatgcg catcatacct 3180 caaggacgcc cgttcgtcac tcacctcctt caactcgcag catcagttca gagtctagaa 3240 gaaaatatat ccttatccga tccatgccga aacgaactca gcctctggat ttccttcctt 3300 aagtgctgga acggctgttc tttcttttat agtgatttaa 
tttcatcccc cgtagacatc 3360 catctttata cagacgctgc accctccata ggatttggcg gttactacca aggccgctgg 3420 ttcgcatccg attggccccc ccaaatgtta gaggttccat cacaccaata ttcatctgca 3480 ttattcgaac tataccccat agtcgtcgcg accctattat ggggagatga atggtctgct 3540 tccagcattc tcattcactg tgacaatgaa gccgtcgttc actgcattaa tagagggcgc 3600 tctcactccc ccgctctaat gccgcttctc cgtcgcctta tttggacttc agccaaaaaa 3660 cagtttattt taactgctgt acatgttcct ggttttcata atcaaattgc tgactctctc 3720 tctcgtcttc attttcagaa attcagagaa ttagcgccgg aggcggagca gcacccgacg 3780 cccatccctc cttattcaga gatgatattc caataaatca tcccatgcac gatctgcacc 3840 aagcatccat atctctcatt atgcaagcgg tggctccaag aaccttacaa gcttatctca 3900 ctgcatggaa aacattcaaa catttccatt cactatacaa cactacattc cccaatttct 3960 ccctacttac aatcacatca tttatcactt accttcattc tcacaaacat atccaggcaa 4020 actcgattaa gagctattta agtggcattc agttttttca caaactcatg tacggctcca 4080 gttctgaatc tatcactaac tcacaaacta gccttcttat taaaggcatt cagaagaccc 4140 gcccccccct cccagacaca aggctaccca tcacacacaa catactagct aaatgcattt 4200 ccacactcag gaaaggctat ttttcatttc atacagatca taccctagat gcaatgttta 4260 ttcttgcctt ttttggattt ctaagatgtt ctgaatttac agttacatct aaattcgatc 4320 cctctatcca ccctactata gcagatctga ccttgattga tgaggaaaca attgctttcc 4380 tcattaagca aagtaaaaca gatcaatcca gaaagggaca ttacatctac atattcaaca 4440 ttccctcccc cacaagccca ttccaaactc ttctagctta cacacactac aggaaaacac 4500 taagtgcaag tcccctagac ccccttttca tagacgacac acaccaccca gtgacacgct 4560 tttggttcca aaaacacctt aaatatgtcc taaccaactc aggcttccca tcagaatcat 4620 actccagtca ctcattcaga attggagccg ccactacagc agcacacaaa gggttatcac 4680 aacaacacat acaaacacta ggaaggtggt cttctgacgc cttcaaaacc tacatccgac 4740 tcagccacag tcatctcagg gaagcccaga ggaccctcac cagccgttgc agttatccca 4800 gcggccaaag gcacgagcct agtacaagga aagaacacaa cccagctata ccagcttccc 4860 gagggcagag gaactaccaa gtctctcaag ggcgcgggca tgacccagcc atctaatttc 4920 ttttcttcct tccagctgat ttgcactcag ccttctcccc cttttcactc acccaactac 4980 agtaagagtt ctttccctgc ccaagccccc ccgtcacccc cgccccccct 
ggccgctgcc 5040 acagaagttt tcactgccca ccaacttttc taacttctgt aggaccccgc cccccccatg 5100 gctctggccc ccgcaggggc gttaccccga gcttcgattc ccgcaggaat catctcacgg 5160 cccggcctta gctttttata ttatatgtca tttcaatgac atatagtata gcactattta 5220 ttttctcttt gtttgatttt atttgtataa attcacatct atatgcaccc acgcatataa 5280 atatgaattt atatatagtg ctgtcaccct cacgctctgt tcccgcagga acacccccag 5340 agcacctata gcgcccgtca ccctccagaa agagtcttca ccttcccctc atctttccag 5400 actcctactg gagccagcca aatagctcac ccagccccga gctgtgacac tagccatgtc 5460 accgaccctc cccggcccta gcttttattc atttatttct gtttatttct cacatttatc 5520 ttttattttt taaatttatt tatatatatg tatatatata tgcacccacg catataaata 5580 tatatttata tatagtgctg tcaccctcac gctctaactc ccgcggagtt aatcccgagc 5640 accggacccc cgcaggggtc atcgcccaac tgccatttca ccctccagct ggggcttcac 5700 cgaccattcc ttttcccgac tccagctgga gatggcacat acagctctct ctcccgcagg 5760 agagccaaga gcttcgactc tctcaagagt cagtaaaaaa cgcccccccc caaggcccta 5820 gattacccca ttatatattt atatatctat atgtttcata taaatatata attatatata 5880 gtgctgccac ctcccagctc aatctccgca aggagtgttc ctcgagcaaa ttactccttt 5940 ggagtccccg cccccccctg ccccccttca cccccctctc cagccggagt ccttcactcc 6000 ccttcccttg taatgactcc agcaggattc ccgcccaccc catggctctg acccccgcag 6060 gggtctcccc gagtctctac tccagcagga gtattcacag cccaagccaa ctctgctcgg 6120 gttcccgcag gaacctgtgt caccctttgc tccaaggagc cctctttaca tttcctttca 6180 aataactata tccagcagcc ggatatagca tttcaagcct tttggggagt ttcttcgaat 6240 acacggctgc tgtcccgagc ttcatgcatt tggggagctc tcgagaacca cctgatctcg 6300 tactcccctt acatgctcta tggacctggc gggagccctg ggctcaacta tctccgagct 6360 cagggttctc tcccgggaca gcatgccaaa cctgcttaca gtcgtcaagc aatatctaag 6420 tgtgaactct tgaagtgaag tttttaaact aatttcgaga ggagcacgtg atataattga 6480 ccgcagctgg ccgccaatct acactcatta gttagccaat cagatctatt ccaaattact 6540 ataaatagcc tagctagata ttactccctt accttcgttt tccgaagaaa ccccccatcc 6600 accccctatc tcctcctttc ctccctttaa aaaggggagc tctcgagaac cacctgatct 6660 cgtactcccc ttacatgctc tatggacctg gcgggagccc tgggctcaac tatctccgag 
6720 ctcagggttc tctcccggga cagcatgcca aacctgctta cagtcgtcaa gcaatatcta 6780 agtgtgaact cttgaa 6796 // ID DIRS-2_DR repbase; DNA; ZEB; 5291 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 26-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DIRS-like LTR retrotransposon - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; DIRS1; DIRSDR1; KW reverse transcriptase RNase H; phage integrase; DIRS1_DR; KW DIRS-2_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5291 RA Jurka J.; RT "A family of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(9), 928-928 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX FH Key Location/Qualifiers FT CDS 277..4899 FT /product="DIRS-2_DR_1p" FT /translation="MAEKNFKRCVPPCPRFITAGDSHDLCLECLGEEHALA FT AFENADCGHCDVLSRKELRSRREFFNKAPVAHAPRGSGPARAEAERRLRSW FT GSQLDLADEMETDSVLSLSGSVRSNPPSGASEARSAVSSAPRGRAASSVSE FT ETEAPQQQIQRGSGNLPPQSVEYEELVEVISRAADRFEVIDWQPAREQQQL FT RLTGMLDERELPSRVDPPLRDLPFCPELHDAVSKSWKNPYSAWLMTPKTAI FT YSAVRGLGEKACTVMPMIEEDLARHRRSDLKSARVPPLLSRPLRVTSGLVS FT KAYMTAGQSVGCLHTMSVLQAYQADLIKECLDGGGATPEQLREALRASDLA FT LRATKEAASSLGRSMATLVATERHLWLTQAEVSETDRAILMDAPISSSGLF FT GDAVDRVAESLDRVKKRSSSLGDFLPPRSKSQGAVKRQPQPSTSSSYHEVQ FT RIKQSPDVLGVRLSRAPPGHREHHSALSLNQGPWRKGEATSPLMLVGGSVS FT PGVGVPQCLRALGAPPPLQGGQRTTRGQSREAGTPSTFFGSVETPADGVPV FT GPVHSRTWLQNTVLCTPTSLQRYCTHHSEARTGSGYGTGSTGLINKRRNRA FT CSPTRQRVRVLQPVFYSSQKGWGIASHIRSEKSKSVRRGPQVQDVNHQKRG FT VTNSVRGLVCDDRPERRILPYFHPSPTQEIPEVRLRGRSVPVSGSSIRPSS FT VTSNLYQNSRSGTSSTSYAGDTYPKLHRRLANSSSDSRYGSSASRCRSHPY FT QKVGVSVKHRKKCACSSQDDHFFRCAMGLHDDASTSVPAMNRFDSVNRTQS FT QTRPVHHCETLSEVVGSHGSSSQRDSVRSAVHETPAVVAQIQGVFPQGESF FT PHDQGLAALPSSLKYVENALVPVPGPSVGGCLSSRHAYDKCFSDGLGSNGL 
FT DPHDQGLAALPSSLKYVENALVPVPGPSVGGCLSSRHAYDRCFSNGLGSNP FT EGASRSGTMGRTSSLLAHKLPGDDGRVSGLKTLSPRSKGPSCFSLHEQHIG FT GRLHQPAGGSEVSNAMQTSTSDPPVGPEQNPVHQGNVCPGPSEHGSRSPVE FT AGGEIQGMETSSPRGGGVHLGKIRESAGRPVCFPRDHALRTMVFSLASSPS FT GTGCHGSDIAEATSVCFSPDRSAPRSPGEGPSRLSTVTAGSPGLAYQDLVF FT GPHSPAGGSLVGDPHQQGPSVPGGRNDTSSPTRPVETVGVASEGAHLIEYG FT LSTEVAQTILSSRAPSTRKLYALKWALFSAWCREHQLNPVSCQVASVLEFL FT QDRLSAGLAASTLRVYVSAIAAYRSPLDDESLGQDPLIRRFLRGAIRLRPV FT STHRVPTWDLTLVLEGISVPPFEPLQEASDKFLTLKTAFLLAISSLKRVGD FT LQALSVAPSFLEFAPGMSKAFLYPRPGYVPKVPTHVARPAVLQAFHPPPFQ FT SSDQEKLNLLCPVRALNTYVNRVINWRKSEQLLVCFGPSKRGSPANKQTIS FT NWIVETISFTYQAAGRPAPKFVKAHSTRAVGASKASISGSALSDICLAAGW FT STPHTFVRHYQLDVDPSPGSSILTA" XX SQ Sequence 5291 BP; 1230 A; 1357 C; 1380 G; 1324 T; 0 other; ttccccttct agggaacttc aacactgcgt ctaaccagaa cgctagggga acacctcttt 60 tatacgcgtc ttgaagcaca tgtgaaatca atctaatgta attaagcagg tgtcgtcaga 120 ccagagagta taaaagcctg tactgagcat tcagtatcaa cttctttgct ttcaagaagc 180 acgcacgtga aaatacaccc tctttctgtg aactttcatt actgatttgc atacacaaaa 240 tacaaaaaaa ctgacaactt acttttttat tttggtatgg cagagaaaaa ctttaaacgt 300 tgtgtgcctc catgccctcg ctttattacg gctggtgact cacatgattt gtgtttagag 360 tgtttgggag aagagcatgc cctggcagca tttgagaatg ctgactgtgg acactgtgac 420 gttctctccc gtaaagagct gcgtagtcgg agagagttct ttaataaagc tcccgtggcg 480 cacgctcctc gcggttcggg tcccgctcgt gctgaggctg agcgtcgact tcggtcgtgg 540 ggttcgcagc tagatctggc ggatgagatg gagacggact ctgtcctttc tctctctgga 600 tccgtgagat ctaatcctcc ttcgggagcg tcagaagcac gctctgcggt ttcttctgcg 660 cctcgtggga gggcggcgtc ctccgtttcc gaggaaaccg aggcgccgca gcaacaaata 720 caaagagggt cggggaatct gccgccccag tcagtggaat atgaggagtt agtggaggtg 780 atttcacgtg ctgctgacag gttcgaagta atagattggc agccagcacg tgagcagcag 840 cagctgcgtc tgacaggaat gctggatgag agagaattac ccagcagagt agatcctcca 900 ctaagggacc tccccttttg tcccgagcta catgatgcgg tttctaaatc atggaaaaat 960 ccgtattcag catggttaat gacaccaaaa acagctattt attcagcagt tcgtgggcta 1020 ggggaaaagg catgtacagt aatgccaatg atagaagagg acttagcacg 
tcatcgtcgt 1080 tcagatctaa aatctgcaag ggtccctcct ttgttgtcga gaccattaag agtaacatcg 1140 ggtctagtca gtaaagcata tatgacggct ggtcagtctg ttggatgcct gcacaccatg 1200 tcagtgctgc aggcatatca ggctgaccta attaaagagt gcctagatgg tgggggagca 1260 acacccgaac agcttcgaga agctcttcgg gcgtcagatc tagctttaag agctactaaa 1320 gaggcagcct ctagtttggg gcgatctatg gctaccctgg tggctactga gcggcacctc 1380 tggctgacac aagcagaagt gtcagaaacc gatagagcta ttcttatgga cgctccaata 1440 tcgagctcag ggctcttcgg cgacgccgtc gatcgcgtcg ccgaatccct cgacagagtt 1500 aaaaaacgct ctagctccct cggggacttt ctccccccaa gatcaaaaag tcagggggct 1560 gttaaaagac agccccagcc gtcaaccagc tcctcatatc atgaagtaca aaggataaaa 1620 caaagtcctg acgtgttggg agtcaggctt tccagggccc cccctggaca ccgagagcac 1680 cattcagcgc tctcactgaa ccagggtccg tggagaaagg gagaggccac ctcaccactt 1740 atgttggtgg ggggctctgt gtctcccggg gtgggcgttc ctcagtgtct gagggcattg 1800 ggggccccac cccctctgca ggggggtcaa agaacaacca gaggccagtc tcgagaggct 1860 ggtaccccta gcacattttt tggcagcgtg gaaacacctg ccgatggtgt cccagtgggt 1920 cctgttcaca gtagaacatg gctacaaaat acagttttgt gcacgcccac ctcgcttcaa 1980 cggtattgca cccaccatag tgaagccaga acaggctctg gttatggaac aggaagtact 2040 ggccttatta ataaaaggcg caatagagcg tgttctccca ctcgacagag agtcagggtt 2100 ttacagccgg tattttatag ttcccaaaaa ggatggggga ttgcgtccca tattagatct 2160 gagaaatcta aatcggtccg tcggggccct caggttcagg atgttaacca tcaaaaacgt 2220 ggtgtcacaa attcagtccg aggactggtt tgtgacgata gacctgaaag acgcatactt 2280 ccatatttcc atccttcccc aacacaggaa atacctgagg ttcgcttgcg ggggcgaagc 2340 gttccagtat cgggttcttc cattcggcct agctctgtca cctcgaacct ttaccaaaat 2400 agtcgaagcg gcactagctc cacttcgtat gcaggggata cgtatcctaa actacataga 2460 cgattggcta attctagctc agactcacga tatggcagtt cggcatcgag atgtcgttct 2520 cacccatatc agaaggttgg ggtttcggtt aaacaccgca agaagtgtgc ttgttccagc 2580 caggacgacc atttctttag gtgtgctatg ggactccatg acgatgcgag cacgtctgtc 2640 cccgccatga atcgcttcga ttcagtcaac cgtacacaga gtcaaactag gccagttcat 2700 cactgtgaaa cactttcaga ggttgttggg tctcatggca gcagcagcca gcgtgattcc 
2760 gttcggtctg ctgtacatga gacccctgca gtggtggctc aaatccaggg ggttttccct 2820 caaggggaat cctttccgca tgatcaaggt ctcgcggcgc tgccttcgag ccttaagtat 2880 gtggaaaatg ccctggttcc tgtcccaggg cccagtgttg ggggctgtct gtcatcgcgt 2940 catgcctatg acaaatgctt ctctgacggg ctagggagca acgggcttga tccgcatgat 3000 caaggtctcg cggcgctgcc ttcgagcctt aagtatgtgg aaaatgccct ggttcctgtc 3060 ccagggccta gtgttggggg ctgtctgtca tcgcgtcatg cttatgacag atgcttctct 3120 aacgggctgg ggagcaaccc tgagggggct tcccgcagcg ggacgatggg gagaacatca 3180 tcgttactgg cacataaact gcctggagat gatggccgtg tttctggcct taaaacactt 3240 tctcccagat ctaaggggcc atcatgtttt agtctgcacg aacaacacat tggtggtcgc 3300 ttacatcaac cagcaggggg gtctgaagtc tcgaatgcta tgcaaactag cacatcggat 3360 cctcctgtgg gcccagaaca aaatcctgtc catcagggca atgtatgtcc cgggccatct 3420 gaacatggga gcagatctcc tgtcgaggca gggggtgaga tccagggaat ggaaacttct 3480 tcaccccgag gtggtggagt ccatttggga aagattaggg aaagcgcagg tagacctgtt 3540 tgcttcccaa gagaccacgc attgcgtact atggttttct ctctcgcatc cagcccctct 3600 gggactggtt gccatggttc agacatagcc gaggctacgt ctgtatgctt ttcccccgat 3660 cgctctgctc ccaggagtcc tggagagggt ccgtcaagac tgagtacagt tactgctggt 3720 agccccggtt tggcctacca ggatttggtt ttcggacctc atagccctgc tggcgggtct 3780 ctcgtgggag atccccatca gcagggacct tctgtcccag gcgggaggaa tgatacttca 3840 tcccctaccc gacctgtgga aactgtgggt gtggcctctg agggggccca cctcatagag 3900 tatggactgt caaccgaggt tgctcagacc attctaagct ccagggctcc ctccacaagg 3960 aagctttatg ccctaaaatg ggctctcttt tcagcttggt gcagagaaca ccagctgaac 4020 ccagtcagct gccaggtagc ctcagtgctg gaatttctcc aagatcgcct gtctgctggg 4080 ttagctgcat ccactctgag agtgtacgtg tcagctatag cggcctaccg ttctccccta 4140 gatgatgagt cactaggaca ggatccgcta attcgtcgct tccttcgtgg agccataagg 4200 ctaaggcctg tcagcacaca cagggtaccg acatgggatt taacattggt gctcgagggc 4260 atctctgttc ccccatttga gccactgcag gaggcgtcag ataagtttct gacactaaaa 4320 acagctttct tattagctat ttcttcctta aaaagggttg gtgacctcca ggctttgtcg 4380 gttgcacctt catttctgga gtttgctcca ggcatgtcca aagcctttct ttatcccaga 4440 
ccggggtacg tgcctaaggt gcccactcat gtggcgagac ctgctgtgct acaggccttt 4500 cacccgcccc catttcagtc gtcggaccaa gagaagttaa acttactctg cccagttaga 4560 gctctgaata catatgttaa ccgggttatc aactggagaa agagtgaaca gttactggtc 4620 tgcttcggac cctcaaaaag ggggagtccg gcaaataagc agacaataag taattggata 4680 gttgagacta tctcatttac ctatcaggct gctggacgcc ctgcacctaa atttgttaag 4740 gcccactcca caagggctgt cggggcctcc aaagcttcta tttcgggctc agccctttct 4800 gacatttgtt tggcggcagg atggtcgact ccacatacat ttgtgcgtca ctatcaactc 4860 gatgtagacc cctcaccagg gtcctctatt ctcactgcgt agtgtgcgtt cacagtcagc 4920 agtgagtctg gcctagtggg tattgcgttc ccctagcgtt ctggttagac gcagtgttga 4980 agttccctag aaggggaacg tctcgggtta cgtatgtaac catagttccc cgagagggaa 5040 cgagacactg cgtattccgc catactctct tctgcctgtt acttctttca agcaaattcg 5100 aagttgatac tgaatgctca gtacaggctt ttatactctc tggtctgacg acacctgctt 5160 aattacatta gattgatttc acatgtgctt caagacgcgt ataaaagagg tgttccccta 5220 gcgttctggt tagacgcagt gtctcgttcc ctctcgggga actatggtta catacgtaac 5280 ccgagacgtt t 5291 // ID DIRS1_DR repbase; DNA; ZEB; 6132 BP. XX AC . XX DT 11-FEB-2002 (Rel. 7.01, Created) DT 19-MAY-2005 (Rel. 7.01, Last updated, Version 2) XX DE DIRS1_DR is a DIRS-like LTR retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW DIRS superfamily; DIRS1; DIRS1_DR; endogenous retrovirus; KW phage integrase; reverse transcriptase RNase H. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 414-5132 RA Jekosch K.; RT "DIRSDR1: putative non-LTR retrotransposon."; RL Repbase Reports 2(2), 9-9 (2002). XX RN [2] RP 1-6132 RA Kapitonov V.V. and Jurka J.; RT "DIRS1_DR, a family of DIRS-like endogenous retroviruses in RT zebrafish."; RL Repbase Reports 3(1), 1-1 (2003). XX DR [2] (Consensus) XX CC DIRS1_DR is a family of DIRS1-like retrotransposons. 
These elements are CC related to gypsy-like LTR retrotransposons and endogenous retroviruses. CC There are ~100 copies of DIRS1a_DR in the genome; they are ~0.3% CC divergent from the consensus sequence. Therefore, this family CC retrotransposed in the zebrafish genome very recently. CC The unusual structure of DIRS1_DR is depicted in the following figure. CC GTTCCCCTTCGGTTGGGGAACTTCAGTGCCATGAATGGGAGGATTCGGATCAGAAGCCGCTTATCTGGAG CC <====== ======> <--------------------------------------------- CC AGTATTGAACGGGCCAATGAATGAAATTAATTGGCAGCGTAAGCTTGCGCAGGTGTGCGACATCTGCAAT CC ---------------------------------------------------------------------- CC TATCTCAGCATATAAGCACACCTGAAGCCAGCAGACGCCATCCTTTTCGCTTCAGATCCTTTCTGAGTGA CC ----------------------------------------------- CC ...................................................................... CC ...................................................................... CC GGTGCAGTCATTATGGCGCTTTCCATATTCTCCCATTCATGGCACTGAAGTTCCCCAACCGAAGGGGAAC CC <====== ======> CC <~~ CC GTTCGAGGTTACAGAAGTAACCCTTCGTTCCCCGAGGAGGGGAACGGAAGTGCCATATTCCGTCGCCATA CC ~~~~~~~~~~~~~~~~~~~~~~~~~~<====== ======> CC ATGACTGTCCCTTAGCTGTTTGAAAGTCTCTTCAGCTT CC AAAAGGATGGCGTCTGCTGGCTTCAGGTGTGCTTATATGCTGAGATAATTGCAGATGTCGCACACCTGCG CC ---------------------------------------------------------------------- CC CAAGCTTACGCTGCCAATTAATTTCATTCATTGGCCCGTTCAATACTCTCCAGATAAGCGGCTTCTGATC CC ---------------------------------------------------------------------- CC CGAATCCTCCCATTCATGGCACTTCCGTTCCCCTCCTCGGGGAACgaagggttacttctgtaacctcgaacgtt CC ----------------------> <====== CC ======>~~~~~~~~~~~~~~~~~~~~~~~~~~~~> CC Fig.1 Termini of DIRS1_DR. The 163-bp sub-terminal inverted repeats are CC underlined by a single line. CC DIRS1_DR encodes three ORFs. ORF1 (positions 414-1632) codes for the CC gag-like protein. ORF2 (positions 1633-2597) codes for reverse CC transcriptase and RNase H. ORF3 (positions 2598-5129) codes for the CC phage integrase.
XX FH Key Location/Qualifiers FT CDS 1633..4110 FT /product="ORF2p" FT /note="reverse transcriptase" FT /translation="MRWAMSSIGVAVSPLRPPSHPPPLFLAEGARQRVLPR FT PRLRLRPSGRGVHLESRQPLLPRAPLSPVNGPRSVPETGHPEKRKLALSPL FT EGGAPITTVLFSATKTSVKEHFFPSPDVTARVLPVRDALPSGSQTLRASPV FT AHERWGDGLPSLSPPAPSPESGCGARANRSPPAFPRDPRASRISTPTPRCP FT TAGTSAIVAMTPLARALPAWLARASPSRWLIRTIRLGYAIQFAKRPPKFTG FT VYFSRVNPLSAPVLREEIAALLAKGAIEPVPPAEMESGFYSPYFIVPKKSG FT GSRPILDLRVLNRCLHKLPFRMLTQRRILQCVRPRDWFAAIDLKDAYFHVS FT ILPRHRQFLRFAFEGRAWQYKVLPFGLSLSPRVFTKLAEGALAPLRLAGIR FT ILSYLDDWLILAHSREQLIMHRDEVLRHLRLLGLQVNREKSKLAPVQRISF FT LGMELDSITMVAHLSEERARLLLNCLRELDSKLVVPLKFFQRLLGHMASAA FT AVTPLGLLHMRPLQHWLHDRVPRRAWHAGTHRVSVTALCRRALSPWNDPSF FT LQAGVPLGQASSHVVVSTDASNTGWGAVCRGHAAAGLWKGAQLHWHINRLE FT LLAVFLALHRFLPVLERQHVLVRTDSTAAAAYINRMGGMRSRRMSQLARRL FT LLWSHPRLKSLRAIHVPGTLNRAADALSRQLLRPGEWRLHPESVQLIWARF FT GEAQIDLFASPENAHCQLFFSLTEGSLGTDALAHSWPRGMRKYAFPPVSLL FT AQFLCKVREDEEQVLLVAPLWPNRTWISELSLLATALPWRIPLREDLLSQG FT QGTIWHPRPDLWNLHVWSLDARKT" FT CDS 2598..5129 FT /product="ORF3p" FT /translation="MRSSSGLVCSHRPEGRVFPCLHSSTPPPISAVCVRGS FT SVAVQGPPLRALSVSAGLHQTRGGCPSAPSARGHSHTQLSRRLADFSPLAG FT AIDYAQGRGASASPPTGASGQPRKEQTRPRAEDFFSRDGAGLDHHGSAPLR FT GTRSPVAELSEGARQQTSGPTEVLSEAPGAYGIRSRRHAARVAPYETTSAL FT ASRSGPQTRMARGHTPGLGYCAVSPRPQPLERPLVPTGRCASRTGVQPCCC FT FNRRFQHGLGGRVSRACGCGPLEGCPAALAYQSPRAVGSVPRSPPLFTGAG FT AATRAGQDGQYGGGGVYQPHGGYALSPHVSARPPSAPLESPAAEIAARHSR FT PRHAQSCSRCALTTAVTPWRMETPPRVCSADMGAIRGGPDRSVCFPRERSL FT PVVFFPDRGLSRHGCTGPQLASGHAQVCVSPSEPARAVSVQGQGGRGTGSA FT SCAPLAQPDLDIRALTPRDGPPLADPFERGPTLSGTGHHLAPSPRSLEPPR FT VVPRREEDLGNLPTAVVNTITQARAPSTRRAYALKWSLFTEWCVSRREDPR FT NCQISVVLSFLQEKLDSRLSPSTLKVYVAAISAYHSAVAGGTVGKHNLVIQ FT FLRGARRINPSRPPLMPSWDLALVLTSLRSDPFEPLESVSLRFLSLKTALL FT VALASIKRVGDLEAFSVSDSCLEFGPDYSHVILRPRPGYVPKVPTTPFRDQ FT VVNLQALPPEEADPALSLLCPVRALRIYVDRTQNFRSSEQLFVCYGGRQQG FT SAVSKQRLSHWIVDAISLAYSSRGQPCPPGVRAHSTRSVASSWARARGASL FT TDICRAAGWATPNTFARFYNLRVEPVSSRVLGNPLVIEETTR" FT CDS 414..1850 FT 
/product="ORF1p" FT /note="Gag-like protein" FT /translation="MALRLCVSGCGGFLSPDDGHDHCIACLGVQHVNAVLA FT GGSCRHCDAMTVAQLRSRLTFARERATPVASCSKKAAGARADLRVSAGANP FT PPTGSRTSRSSRRSIQASGGESDPSNQMVALTLADTGDQMSSAASEGGLSL FT SDEDPDPLAPSGQVSAVKSDPEADMLAVLSRAASAVGLEMVYPPAPRPDRL FT DGCYVEDQKAKPSKPLVPFFPEVHSRLTQSWRAPFSARAASASALTALDGG FT AARGYEAIPSVERAIAVNLCPRGASTWRGLPRLPSKACRLSASLGARAYKA FT AGQAASALHAMATYQRYQAQALAELHEGGSNPSLLHELRTATDYALRTTKS FT AACALGRTMSTLVVQERHLWLNLADMRDVDKVRFLDSPISQAGLFGDTVGE FT FTQEFKAVKEQSDAMGNVIYRRGRKPAPPAEPSTSAVPRRGRPPTSAAPPP FT PAPPAKRARRSPRKQAAPPAQGAVKSGKRTAKRP" XX SQ Sequence 6132 BP; 1117 A; 1898 C; 1706 G; 1411 T; 0 other; gttccccttc ggttggggaa cttcagtgcc atgaatggga ggattcggat cagaagccgc 60 ttatctggag agtattgaac gggccaatga atgaaattaa ttggcagcgt aagcttgcgc 120 aggtgtgcga catctgcaat tatctcagca tataagcaca cctgaagcca gcagacgcca 180 tccttttcgc ttcagatcct ttctgagtga gtcgatgagg gttcctcttg ctgatcagca 240 cttcagagcg aacgagtgtg tctcccggtc cagagtgggt cttcgcggtg gcagacggtc 300 gagctgggtt actcccttgc ctgcggttct ttgggtccgg tcctccagag cggtgcgtat 360 agttgcaact ttcctaaaag agcaacacag tcgtgcagca cgtccttttc aggatggcgc 420 tccgactgtg cgtttctgga tgcgggggtt tcctgtctcc ggatgatgga cacgatcact 480 gcattgcatg tttgggggtc cagcatgtta atgcggtgct cgcgggcggt tcatgtcgtc 540 attgcgatgc catgaccgtt gcacagctaa gatcgcggct aactttcgca agagagcgag 600 ccaccccagt tgcctcctgt tctaaaaaag cagcgggcgc tcgggcagat ctgagggttt 660 cagcgggagc taatccgccg cccacgggct cgcggacctc tcgctcctca cggcgctcca 720 tccaagcttc gggtggtgag agtgatccgt ctaaccagat ggtagctctc acactcgctg 780 acaccggaga tcagatgtcc tccgcggcat cggagggtgg gctttcactg tccgacgaag 840 atccggaccc gctcgccccc tccgggcagg tgagcgctgt caaatcggat cctgaagcgg 900 acatgttagc cgtgctttcc cgggctgctt cggccgtggg gttggagatg gtttatcccc 960 cagctccgcg gccggaccga ctagatgggt gctacgtaga ggaccagaag gcgaagcctt 1020 cgaagcctct cgtccccttc ttcccggaag tgcacagtag gctcacgcag tcctggaggg 1080 cacctttctc tgcccgtgct gcgagtgcct ccgccctcac cgcccttgac ggcggagctg 1140 ccagggggta tgaggcgatc 
ccgtcagtgg agcgcgctat cgcggtcaat ctttgtccgc 1200 gcggcgcctc tacgtggcgg ggtttgcccc gcctcccgtc caaagcctgt aggttgtctg 1260 cctccctcgg agccagagct tataaggctg cgggccaggc tgcttctgct ttgcacgcga 1320 tggccaccta ccagcgctac caagcgcagg cgctggccga gctgcacgag ggcgggtcca 1380 acccaagctt attacatgag ctgcgcaccg cgaccgacta tgctcttcgg actactaagt 1440 ccgccgcgtg tgcgctgggg aggacgatgt ccacacttgt ggttcaggaa cgccacctct 1500 ggctaaacct ggccgatatg cgcgacgttg acaaagttcg ctttcttgac tcgcccatat 1560 cccaggctgg cctgttcggc gacaccgtcg gtgaattcac ccaggaattc aaggcggtga 1620 aagagcagtc ggatgcgatg ggcaatgtca tctatcggcg tggccgtaag cccgctccgc 1680 ccgccgagcc atccacctcc gctgttcctc gccgagggcg cccgccaacg agtgctgccc 1740 cgcccccgcc tgcgcctccg gccaagcggg cgcggcgttc acctcgaaag caggcagccc 1800 ctcctgccca gggcgccgtt aagtccggta aacggaccgc gaagcgtccc tgagacaggc 1860 catccggaga agaggaaact tgctctttcc ccgctggagg gcggggcccc gataacaacg 1920 gtacttttca gtgccaccaa aacatcagta aaagagcact ttttcccttc cccggatgtg 1980 actgcacgag ttctgccagt ccgggacgcg ctgccttccg gctcgcagac tctacgtgct 2040 tcgccagtgg ctcacgagcg ctggggggac ggtctccctt ccctcagccc tccagccccc 2100 tctccggagt cagggtgcgg agccagagcg aatcgctctc ctccagcttt tccgcgggac 2160 cctcgtgctt cccggatcag cacacccact ccgcgctgcc ccaccgctgg tacgtcagcg 2220 attgtagcga tgactccatt agcgagggct ctgcctgcct ggttagcgcg ggccagcccc 2280 tcgcggtggc tcatacgcac aatcagactc ggttacgcga ttcagttcgc gaaacggccc 2340 cccaagttta cgggcgtgta tttctccagg gtcaaccccc tgtccgcccc tgtcttgcga 2400 gaggagattg ctgccctcct ggcgaagggt gcaatcgagc cggttcctcc agccgagatg 2460 gagagtgggt tttacagccc atacttcatc gtacccaaaa agagcggtgg gtcacggcca 2520 atcctagatc tgcgcgtttt gaaccgctgt ctgcacaagc tgccgttcag aatgctcacg 2580 cagaggcgca ttctccaatg cgttcgtcct cgggattggt ttgcagccat agacctgaag 2640 gacgcgtatt tccatgtctc cattcttcca cgccaccgcc aatttctgcg gtttgcgttc 2700 gagggtcgag cgtggcagta caaggtcctc cccttcgggc tctctctgtc tccgcgggtc 2760 ttcaccaaac tcgcggaggg tgccctagcg ccccttcggc tcgcgggcat tcgcatactc 2820 agttatctcg acgactggct gattttagcc 
cactcgcggg agcaattgat tatgcacagg 2880 gacgaggtgc ttcggcatct ccgcctactg gggcttcagg tcaaccgaga aaagagcaaa 2940 ctcgcccccg tgcagaggat ttcttttctc gggatggagc tggactcgat caccatggta 3000 gcgcacctct ccgaggaacg cgctcgcctg ttgctgaact gtctgaggga gctcgacagc 3060 aaactagtgg tcccactgaa gttctttcag aggctcctgg ggcatatggc atccgcagcc 3120 gccgtcacgc cgctcgggtt gctccatatg agaccacttc agcactggct tcacgatcgg 3180 gtccccagac gcgcatggca cgcgggcaca caccgggtct cggttactgc gctgtgtcgc 3240 cgcgccctca gcccttggaa cgacccctcg ttcctacagg ccggtgtgcc tctaggacag 3300 gcgtccagcc atgttgttgt ttcaacagac gcttccaaca cgggttgggg ggccgtgtgt 3360 cgcgggcatg cggctgcggg cctctggaag ggtgcccagc tgcattggca tatcaatcgc 3420 ctagagctgt tggcagtgtt cctcgctctc caccgctttt taccggtgct ggagcggcaa 3480 cacgtgctgg tcaggacgga cagtacggcg gcggcggcgt atatcaaccg catggggggt 3540 atgcgctctc gccgcatgtc tcagctcgcc cgccgtctgc tcctctggag tcacccgcgg 3600 ctgaaatcgc tgcgcgccat tcacgtccca ggcacgctca atcgtgcagc cgatgcgctc 3660 tcacgacagc tgttacgccc tggagaatgg agactccacc ccgagtctgt tcagctgata 3720 tgggcgcgat tcggggaggc ccagatcgat ctgtttgctt cccccgagaa cgctcactgc 3780 cagttgtttt tttccctgac cgagggctct ctcggcacgg atgcactggc ccacagctgg 3840 cctcggggca tgcgcaagta tgcgtttccc ccagtgagcc tgctcgcgca gtttctgtgc 3900 aaggtcaggg aggacgagga acaggttctg ctagttgcgc ccctttggcc caaccggacc 3960 tggatatcag agctctcact cctcgcgacg gccctcccct ggcggatccc tttgagagag 4020 gacctactct ctcagggaca gggcaccatc tggcaccctc gccccgatct ttggaacctc 4080 cacgtgtggt ccctagacgc gaggaagact taggtaacct accgactgcg gtggttaata 4140 ccatcactca ggctagagcc ccctccacga ggcgcgccta cgccctgaag tggagtctat 4200 tcactgaatg gtgcgtctct cgcagagaag acccccgaaa ttgccagatt agtgttgtgc 4260 tctctttcct tcaagagaag ttggacagca ggctgtcgcc ctccactctc aaggtttacg 4320 tggccgccat ctccgcttat catagcgcgg tagctggcgg caccgtggga aagcataacc 4380 tggtcatcca gttccttagg ggtgctaggc gaattaatcc atctcgcccc cctctcatgc 4440 cctcttggga tctcgccctc gttctcacga gtctgcgatc cgatcccttt gagccactcg 4500 aatcagtatc tctaagattt ctgtccctga agacagctct 
gctggttgcg ttggcctcca 4560 tcaagagggt cggggacctg gaggcatttt cggtcagtga ctcgtgcctg gaattcgggc 4620 cggattactc tcacgttatc ctgagacccc gccccggtta tgtgcccaag gttcctacca 4680 ccccctttag agatcaggta gtgaacctgc aagcgctgcc cccggaggag gcagacccag 4740 ccctttcttt actttgtcca gttcgcgctc tgcgcattta tgtggaccgt actcagaatt 4800 ttagatcatc tgagcagctc tttgtctgtt atggcggtcg gcagcaggga agtgccgtat 4860 cgaaacaaag attatcccac tggattgtgg atgccatttc actcgcttat tcgagtcgag 4920 gtcagccgtg tcccccggga gtacgtgcac actccactcg gagcgttgca tcctcttggg 4980 cgcgtgcacg cggcgcctct ctaacagaca tctgtagagc tgcgggctgg gcgacaccca 5040 acacatttgc aaggttttac aatctgcgag tggagccggt ttcctcaagg gtattaggta 5100 accctttggt gattgaggag acaactcggt agggtgttga aacacgcttg ctgcgccatt 5160 ctccctaaca cggaggtacg tgcgcctttt ttatctgtca gtaaagttcc ccgtcaggtg 5220 agccctgcag attcctccgt ggcccccagc actgactcag cggaggagtc acttgctggc 5280 ccactacgtt gtaggtctgc ccgctggtca gcccgcgttt tgggtatagg tgcctgctat 5340 gcgtgatccc cactaggcga tcccatatgc ttattccgcc acggttaagt cccccccctg 5400 ggcggacccg tgtcttccct ctccgctaac cactcttttg ctatgcgtac tccccctttt 5460 tagggctagt ccataggtaa attctgccat ctatcccccc cttgggtaac ggatggcctc 5520 cgcagcgtcc tccctatcgg gattgcacgc ttcccaacgt actgtcgtat ttcctagaat 5580 tatctagatg ctcacgactt cccaaaaaat atatctaaat ccgtaaaact tctgttgaag 5640 taggataaat tagggccagg gacacgttgg aggaccgcgc cccccatgat gtgggtgcgt 5700 cacgcttgct tgactatctc ctcatcgggg gtgttggtaa ggtgcagtca ttatggcgct 5760 ttccatattc tcccattcat ggcactgaag ttccccaacc gaaggggaac gttcgaggtt 5820 acagaagtaa cccttcgttc cccgaggagg ggaacggaag tgccatattc cgtcgccata 5880 atgactgtcc cttagctgtt tgaaagtctc ttcagcttaa aaggatggcg tctgctggct 5940 tcaggtgtgc ttatatgctg agataattgc agatgtcgca cacctgcgca agcttacgct 6000 gccaattaat ttcattcatt ggcccgttca atactctcca gataagcggc ttctgatccg 6060 aatcctccca ttcatggcac ttccgttccc ctcctcgggg aacgaagggt tacttctgta 6120 acctcgaacg tt 6132 // ID ERV2-I_DR repbase; DNA; ZEB; 3091 BP. XX AC AL591210; XX DT 05-JUN-2002 (Rel. 
7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE ERV2-I_DR is an internal portion of the ERV2_DR endogenous DE retrovirus. XX KW ERV2-I_DR; ERV2-LTR_DR; ERV2_DR; endogenous retrovirus; KW LTR retrotransposon; class I ERV; gag. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3091 RA Kapitonov V.V. and Jurka J.; RT "The ERV2_DR endogenous retrovirus from zebrafish."; RL Repbase Reports 2(5), 13-13 (2002). XX DR Genbank; AL591210; Positions 31851 28761. XX CC ERV2-I_DR is an internal portion of the ERV2_DR endogenous CC retrovirus. It is flanked by 99% identical ERV2-LTR_DR. CC ERV2_DR belongs to class I of endogenous retroviruses (4-bp TSD). CC It encodes ERV2p, a 455-aa gag-like protein that is closely related CC to that from ERV1_DR. The C-terminal portion of this protein includes CC the zinc knuckle motif present in gag proteins of many retroviruses. CC ERV2p (positions 718-2085): CC MFRQNTRLINTTEKGSLTTPSLSDQDFKSLLRQHMDSITESVRLNSTKKFSIDPDDVWTLDECKKVLDAC CC IRKNNVKVIICCYREFNLSRAKQQAQAKSELEKQIQELHAKITSLTKQLNLKKLKTDIEEVSQPDNKHPD CC LQIVSEKDISIQSDSDLPIGSVEVCGATRSKRTESSHNNTSVVQIQTVSRTLGPKEIDRLSQSLPSARTN CC FSEFSRALISKMRLYDMSLTEVTQLMSQILTESEFNSFEHTVTSKLQHASKDDLREGVLKALKNIVGPKI CC DWSKVTCCVQRKDESVNEFTERFCQSAITYSGLADNSDSVLDDKGPLVRIWSDGLAAEYRKALPFLNITW CC SSTTLRNNMNSLALWERDSYIKDRVRFEAAATKVNTEKTSNHKCLRKNICCHYCGKKGHWMRECRKNKKN CC FKEMNRVNQLLNSLLNALNIPLPSSLNPLSSPQRC CC ERV2-I_DR is 72% identical to ERV1-I_DR.
XX SQ Sequence 3091 BP; 1048 A; 623 C; 547 G; 873 T; 0 other; acttggtgcc gtgagccgga tagagactga aattatcaaa ttctcaacaa tttgctacaa 60 aagactgaaa tctacctgaa attcaacttg aaccatcaag atcaaggaaa cttcttcaga 120 cggaatcttc agctcaacgg caatttgaag tcactcagcg gacgtcacaa attctcaact 180 atttgctgac tacaaccctg atgaacatta aagctacctg aatacaaaga tcagatcact 240 ccagactgaa tcttttcagc tcaacctcga tttggtgagt ttcactctat caaaagaacc 300 atacagacca gtctatacac atggttattc tctgctccat cagagaaaag ttaacaaaca 360 gctgcagggt ttgtttaaca aaaagctatc ctctatttca atagagaagt aacagtttat 420 cctctgttta gacagagaag taaagcctta cgtgctattt tcttttaaaa agttatcctc 480 tcttcagaga agttaagcct tatgcgctat tttttttctt ttatttcgtt ttccctcttt 540 tcaaccagag aagtttaagt ttagctatac aagctaattg atgttatctc ttgttcaggc 600 aggaagtttt agcaacatgt gctaatattg ttaccctctg tttcgccaga gaagaaatta 660 ctacactttc atctgaaaag ttttagtatt tttactttgt gataataagc tcacaaaatg 720 tttagacaga atacaagact gatcaataca actgaaaaag gtagtttaac aactccttca 780 ttgtcagatc aggatttcaa atcgctgcta cgacagcaca tggactcaat tacagagtca 840 gtcagactaa attcgacaaa gaaatttagc attgacccag atgatgtttg gacattagat 900 gaatgcaaga aagtgcttga tgcatgcatt cgcaaaaaca atgtaaaagt cattatctgc 960 tgctacagag aattcaattt atctcgagct aaacaacaag ctcaagctaa atcagaatta 1020 gaaaaacaga tccaagagct tcatgctaag ataacctcat taacaaaaca actgaaccta 1080 aagaaattaa aaactgacat tgaggaggtt agtcagcctg ataataagca tcctgactta 1140 caaattgtct ctgaaaaaga catttccatc cagtcagata gcgatttacc tataggctca 1200 gtagaggttt gtggtgcaac aagatctaag agaactgaga gttcacacaa caacacttct 1260 gttgttcaaa ttcaaacagt ttccagaaca ctaggaccta aagaaataga cagactgtct 1320 caaagcttac catcagcacg cacaaatttt tcagagttta gcagagcact aatcagcaaa 1380 atgcgtcttt atgacatgtc attaacagaa gtcacccagc taatgtctca aattctcact 1440 gaatctgaat tcaacagttt tgagcatact gtgacctcta aattacaaca tgccagtaag 1500 gatgatttga gagagggtgt tttgaaagct ctaaagaaca ttgttggccc aaagattgac 1560 tggtcaaaag tgacttgttg tgtgcaaagg aaagatgaat ctgtgaatga attcactgag 1620 agattttgtc aatccgccat aacttacagt 
ggattagctg ataattcgga cagtgtgcta 1680 gatgataagg gacccctagt ccgcatctgg tcagatggcc ttgcagctga atacagaaaa 1740 gctttgccat ttcttaacat cacctggtct tccaccactc tcagaaataa tatgaacagt 1800 ttagctttgt gggaaagaga ctcttacatc aaagacagag tcagatttga agcagccgcc 1860 actaaggtca acacagaaaa gacatcaaat cacaaatgtc ttaggaaaaa tatctgttgt 1920 cattactgtg gtaaaaaagg acattggatg agggagtgta gaaagaataa aaagaacttt 1980 aaagagatga acagggtaaa tcaacttctg aacagtttgc taaatgccct caacataccc 2040 ctcccctcat cactaaatcc attgagcagt ccacaaagat gctaaccatt ttttggctgt 2100 aagtgcttaa tgtcccagtt catttacaat taagatgaaa gactctttgt aaaaagacac 2160 aaaggaaact tcttcctttt tattggggaa ctctgcttaa gtccacttac agccttgaat 2220 gtgtttaaca cattttacaa tcactaccca ttcaatgtta tgatgtctga taggactgat 2280 atttattgaa tgatagtttg tgtcacacat tttaaagttt gcccaccaca ctcagggaca 2340 gaacaggaca cacaaaccag tgaggagccc agggcagaac tgaaagcaac aggcttcagg 2400 ggccgcaccc tgtggtgaag actccctttt cttttttgtt tttcttcccc catcctcacc 2460 tcactgtttt ataggtgtca taattttaag attaggagac agaaaataca caacacaata 2520 cgatcaccaa cttagcacaa ccactataca atcacctgaa taactagaca catgaactgt 2580 atgatcatac aagtctatgc atgaatactg ctggtctgct gtttccgtgt tttcttgtgt 2640 gcaggtaaac acaaacactg catcttctca gaaacactct gaaccagagc atcactacgg 2700 agatccatga atgacaggaa gtcatcagat caactctgtg ctggagaaga accacaagaa 2760 acacacctca caaaagactg aacactattc acaagccatg gactttcaga agatctgcag 2820 tttaactaca tgatttttgt gtcaccatgt aggatttatg acattacaac tgattgtcat 2880 aaacgtgtga acagttttga aatctacact tacatctacg tagatattga aatgacacag 2940 aataagttaa atgtgcctgt aacttttcaa gttacatgac tagaagatgg ggcttctgtt 3000 atcaaacaga agccattgaa tggttttagg aaagcttaca aatttatata tttttttgtg 3060 agagttttta aataactctc aaagggggaa c 3091 // ID HARBINGER3_DR repbase; DNA; ZEB; 3599 BP. XX AC . XX DT 06-NOV-2003 (Rel. 8.1, Created) DT 06-NOV-2003 (Rel. 8.1, Last updated, Version 1) XX DE Autonomous Harbinger-like DNA transposon - a consensus. 
XX KW Harbinger; DNA transposon; Transposable Element; KW DNA-binding protein; HARBINGER3N_DR; HARBINGER3_DR; KW Harbinger superfamily; transposase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3599 RA Kapitonov V.V. and Jurka J.; RT "HARBINGER3_DR, an autonomous Harbinger-like DNA transposon from RT zebrafish."; RL Repbase Reports 3(10), 184-184 (2003). XX DR [1] (Consensus) XX CC HARBINGER3_DR is an autonomous Harbinger-like DNA transposon. CC The consensus sequence was built from several copies that are CC 90% identical to each other. This transposon is characterized CC by 3-bp target site duplications and 12-bp terminal CC inverted repeats. CC Protein machinery encoded by HARBINGER3_DR was involved in CC transpositions of HARBINGER3N_DR nonautonomous elements. CC HARBINGER3_DR encodes two proteins, 343-aa HARBINGER3_DR-1p (2 exons, CC positions 368-1037 and 1119-1480) and 221-aa HARBINGER3_DR-2p CC (2 exons, positions 3345-2940, 1762-1503). CC HARBINGER3_DR-1p is a Harbinger DNA transposase. CC HARBINGER3_DR-2p is a DNA-binding protein that contains the CC myb/trihelix motif. CC QAVLESQQDIVRAIGDINNHLKNISNALTDISQSLKELVKK.
XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="HARBINGER3_DR-1p" FT /translation="MAVLALLEDIVNGRIRRERVFRDHGDFLAHDDDWLIS FT RFRFPRAILLDLCAELGPLLERETARSHALPVPLQVLTTLGFLATGSFQRE FT LADRSGLSQSSLSRAMPAVWDGIIRMSSRYIRFPYHAVDQPNIKAQFAAIA FT GFPNVIGAIDCTHIAIKAPSEDEFAYVNRKHFHSINVQIICDAQMRLTNIV FT ARWPGSTHDSFILTNSMVGMRLQGGRVRDGWLLGDRGYPLKTWLLTPLNNP FT QTDQERRYNDAHSHTRSVVERAIGQLKCRWRCLDKSGGVLLYRPNKVCRIV FT LACGVLHNVAHRHGIPNGEPVAPPDDPDPGPVCIQPNQQAIQARQRVVAAI FT " FT CDS 0..0 FT /product="HARBINGER3_DR-2p" FT /translation="MAKANKKRNFTECELEVLLSEVDRRKTVLFASLSSGI FT NNKRKKIEWESLADAVNAVGSERRTVSELKKKWSDVKVQVKRRTAAHRQSV FT DRTGGGTGDTALTPFEERVASIVGDTLLSGVVSVSVGDTDVLEEAHEDGAG FT TSTDTDFVPPEEPEPSVSGATPRVSSASASAEARPSGRVLT" XX SQ Sequence 3599 BP; 1011 A; 749 C; 760 G; 1079 T; 0 other; gggcctcatg tatcaacgct gcgtacgcac aaaaactttg cgtacgccag gtttcacgct 60 cagaatcgcc cacgtttgga tttactaacg atgaaatgaa cgtgggaatg tgcgcagctc 120 cacgccagct tcttggctgg cgtacgcaca ttttttgtgc gtgtctgttt tatttccatt 180 ggcgactcct agaggcaatt atgtcaaatt gcacactaca aagtatacca ctaacacatg 240 tgaaaaacat tgctattaat gtataattga taaaatgggt taattgccac aatatttttg 300 taccaattat gtgatttaga acatataaaa gcatttgcaa atgcatacaa ccctttagtc 360 tatggcaatg gcagtgttgg ccttattaga ggacattgtc aatgggcgaa tccgaaggga 420 acgcgttttt agagatcacg gtgattttct ggcccacgac gatgactggc ttattagccg 480 tttcagattt ccaagagcta ttctcttgga tctctgtgct gagttgggtc cactgttgga 540 aagagagaca gcgaggagcc atgcattacc cgttccctta caggtgctga caacgcttgg 600 tttcctggca actggttctt tccaaaggga actggcagac cgctcggggt taagccagtc 660 gtctttgagc cgtgcaatgc cagctgtatg ggacgggatc atccgcatgt ctagcaggta 720 tataaggttt ccataccatg cagttgacca gccaaacatt aaagcgcaat ttgcagcgat 780 cgccggtttt cctaatgtaa tcggagcgat cgactgcacg cacattgcta taaaggcgcc 840 atctgaagac gaatttgcat acgtgaatcg gaaacatttc cattcaataa atgtgcaaat 900 aatatgtgat gctcaaatgc gcttaacaaa tattgtggca aggtggcctg ggtcaaccca 960 tgattcattt atccttacaa acagcatggt tgggatgagg ctccaaggtg gcagggtgcg 1020 tgatgggtgg cttcttggtg 
agtgatgtat ttaaagatat tattccagct aagtttattt 1080 tattttattt tgagcgtaat tatttaactg catatcagga gaccgtggtt atccattaaa 1140 gacgtggctg ttaacccccc tcaacaaccc acaaactgac caagagcgca ggtacaatga 1200 tgcccattct cacactcggt cagttgtaga gcgggcgatt gggcagctga aatgccggtg 1260 gcgctgcctt gataagagcg gaggggtgct gctataccgc cctaacaaag tgtgccgcat 1320 cgtgctggcc tgtggtgtgc tgcacaatgt tgcgcacaga catggcatac ctaatggtga 1380 gccagtggca ccgccagatg acccagaccc aggaccagtg tgtatacaac ccaaccaaca 1440 agccattcaa gcccgccaac gtgtggttgc ggcaatataa aaaaggtaat cagagaccaa 1500 gtttattttt tgaccaattc ttttaatgac tggcttatgt ctgttaaagc attacttata 1560 ttttttaggt gattattaat atcaccaatg gccctaacaa tatcctgttg tgattccaga 1620 acagcctgcg tcaggacgcg gcctgatgga cgggcctcgg cactggcact agcactggag 1680 acacgggggg tagcgccgga gacactgggt tctggttcct caggagggac gaagtccgtg 1740 tcagtagacg ttcctgcacc atctaagtgt aaataaaaac aagcacgtta actgatattt 1800 gagtccatta tttgggtaca cgcatattta taaatgaaat agatagggga gcgcggggcg 1860 caaagtaacg cagggttaaa tgtaacaaag agttttaagg tttttgctca gggttaacca 1920 tggcatgctt ccgaggtatt caccatagtg tctgcagatg tctacctggc aattattgaa 1980 aaagtttgcc acaatttgga taagaaacac caatttttgc cgcataaaag tattttttat 2040 tatagtctat ctatctatct atctatctat ctatctatct atctatctat ctatctatct 2100 atttttttta tctgtgaaag tatgcacgta aaatcttaca ttgttgtaat taccgtcttc 2160 taggctaaaa gattaaatta ttttgaaaga tgtaagaaac acacagtgac tgctagccag 2220 ttttgaggaa tacatgacac agcggggtta gttgtaacag gatgttacaa ttaagccaca 2280 cactgttgga caccaactaa actaacaaaa tattttttat tttaattttg agtgtaatta 2340 cactttttat aataaaatgt tttttattag aaagtatttt taaatagaaa aacatttatt 2400 gctaataatg caaataaatg caatgtataa taatggataa atggataata aggcaaatag 2460 attcaaatgt gtttatccaa ataaataaat agactgtgct atgtagaata aaaataaatg 2520 agccaattac tgaggaaata tggctacttt tttttaaagt aaactgaatt taaattaatc 2580 gtatgaaatc tgccttttta agcatgtgtt acaactaacc ccccatctgt tgcaatctgg 2640 cccacaggtg taaatggtaa cctttttact cctggcactt ttgccaatac tgcacagaaa 2700 cgatagttcc ggcgatcata atttcagtgc 
tcatttgtag gagaggcttg tgtgttgttg 2760 gtaaaaaaaa tggtttgtca aaccccatta ctttgttcat tatttgacaa aaaccaaaaa 2820 gtgttacttt gtgccccact ctcccctaca taaattaatt gtactgctac atgtattgga 2880 aacttaaatg caaaccggtt atttaacaaa gttaagacat aaaatgagaa gtaacttacc 2940 ttcgtgggct tcctctaata catccgtgtc tcccacagac acggatacta ctccagacag 3000 taaagtgtca cccacaattg aggcaactct ctcctcaaag ggtgttagtg cagtatcccc 3060 tgttcccccg cctgttcggt ccacactttg acggtgcgcc gctgttctcc tcttaacctg 3120 cacctttaca tcggaccatt tcttttttaa ttcactcaca gtgcgacgtt cagaccccac 3180 tgcgttaact gcgtcagcta aactctccca ctctattttt ttccttttgt tattaattcc 3240 ggaggacaaa cttgcaaata gcacagtttt tctccggtct acctccgata ggagcacctc 3300 caattcacat tctgtaaagt ttctcttttt gtttgcttta gccattgctt tttcgtttgg 3360 ttttgccaaa gtgaagtcat taccatattt ataaggggga ggaggcaggg aggggttttg 3420 cgctcgtgca cgtgcgctca atttcacgtt aattcggatg tacaaagaga atatgcgtgg 3480 gattcggcgt acgcagtgtt tcatacatct gaatttttta ctgcgtacgc acatttacag 3540 ctttgtgcgt acgcaatgtt ttagtatgaa ttccacgcaa gtcttcgtac atgaggccc 3599 // ID Gypsy-26-I_DR repbase; DNA; ZEB; 4616 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-26_DR LTR retrotransposon - a DE consensus sequence. XX KW GYPSY superfamily; Gypsy-26-I_DR; Gypsy-26-LTR_DR; Gypsy-26_DR; KW LTR retrotransposon; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4616 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-26_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 25-25 (2005). XX DR [1] (Consensus) XX CC Gypsy-26-I_DR is an internal portion of the Gypsy-26_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-26-LTR_DR. 
Gypsy-26_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1503-aa Gypsy-26_DR1p (pos. 64-4572) composed CC of the gag, protease, reverse transcriptase, and integrase CC domains. The consensus sequence was built from 4 copies less CC than 1% diverged from the consensus sequence. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy-26_DR1p" FT /translation="MAALAKETEGDDSLLPDYSSEPFTTHREPIQDLIHSL FT ASLYLENEPEIGEQEEGDFDDSLLPPPPPTIEDDDSLLTMIMATRMHKVES FT RVEAFENSSDVCFKDIQHQLNEQSNRIAVIESQVQHMLKQYQDHPIDVQQL FT ENSLSTMLKKECTQVKDTLETKVQELGQAIMDCLKRRDGQLKSLIQPSGGA FT TSTPHFSHTILDHGSCRPVHFKTPIKLEFPKFGSLDGEDPITYLERCDEYL FT AVQPLNDSEIISMLPSVLTHTAKDWWVAEKKRVRTWTQFKSVFLQSFLADD FT HDVEVERRIRERKQGVDESIRTFAYQYRALCLRLKPSMTEREILQAALRNC FT NPRIASILRGTVTTVDELVRVGTLIEKDINEERSFWRQRHQEANAKSTEGN FT KFFKGRQSNPHIAVCSDSSERSPVTLTLPLTIKGHQYQAILDTGSTYSLIQ FT ESCWKRLKSNHEVLQSSRGQSFSLANGCVQSALGKIAWQATIHGHDYPIRA FT YVMKDCDLAFPVLLGLEFLKMSGITVDFRNSSYSLPEEYGVIHSFTSSSPS FT PIVSLHLALPLIPTPSTDLTIIKELVDRADVSKAHRRQLEGLMLDWPTVCT FT ETLGQTNLIHHQIHTIDEIPVRKKAYPVPVNKQKFIDEEIARMLDKGIIRP FT SVSPWASPVVLVPKKDGSTRFCVDYRALNSKTPLDGFPMPQIQDILESLYG FT ATIFSTLDLKSGYWQVKMDEDSIKKTAFVTKNAQYEFLRLPFGLRNAAATF FT QRLMNNVLRDYMGEFCFVYLDDIVVYSKTIQDHFQHLKLLFAKLQDSGLTL FT NLKKCSMLQRTITYLGHVVSEEGVRTEDTKIKAVQDFPVPKNLKEVQRFLG FT LASWYHRFISHFSERAAPLYALKGKNAIWNWTVECQSAFDDLKYALQRAPV FT LMPPDFTKVFRVQTDASDIGLGAVLTQDFDGAEHVIAYASRLLHGAEKSYS FT TAEKECLAVVWAVEKWRQYLEGRNFEVLTDHSALTWVFNNPKPSSRLTRWA FT LRLQCFSFLVKYRKGSCNVVPDALSRGIPGQEVVGHIAICQANKTDPNLPV FT SWDEIGKAQKLDSSLQALWEAAKQATTDSSRIAYCVQNDYLFRRVPNKDQG FT CVYQLVIPASLREQFLHFAHSNPLSGHLGRMKTLKRLLDSVYWPEIRKDVW FT SFCTQCKTCQIYKPRISKLSGLLQSTPVVEPGYMLGVDLMGPFPKSNRSNE FT FLLVFVDYCSKWVELFALRSAKTHLITNILTKEMFTRWGTPAYLVSDRGPQ FT FTAQLLNETCKRWGVVQKLSTAYHPQTNLTERINRTLKTMLSSYVHDNHRD FT WDKWIPEFRYAINSAWQESTGFTPAEVALGRKLKGPLDRLIQRPPNPDHLA FT YNTLERQKAFLEQVIAKTSQAQERQGKYYNQRRKPKSFEEGDLVWILTHPL FT 
SRAADSFMAKLAPKWQGPGKIVKKVNNVNYRVVMLDKPSQCDTYHVEKLKE FT FYGTV" XX SQ Sequence 4616 BP; 1402 A; 951 C; 986 G; 1277 T; 0 other; tatttggcgc ccgaacaaaa gacaaaattt gattgaattc ttgagagata cattctgctt 60 tctttgagtc atgtttttgt taatgtgtct atgaggccat aatttcaaaa gtcctaacat 120 tacactctat ttcagcattc taaagttgac attatttgaa gagctttagt agcttcaaaa 180 tggctgcctt ggctaaagag acagagggtg atgattcttt gttacctgat tattcttcag 240 agcctttcac tacacatcga gagcctattc aagaccttat tcattctctc gctagcttgt 300 acttggaaaa tgaacctgaa attggtgaac aagaagaagg ggattttgat gattctttgt 360 tacctcctcc acctcctaca attgaagatg atgatagctt gttaacaatg attatggcca 420 cacgaatgca taaagttgaa agtagagttg aagcatttga aaattcctct gatgtttgtt 480 tcaaagacat tcagcatcag ctgaatgaac aaagtaatag gattgctgtc attgagagtc 540 aggtccagca catgctaaaa cagtaccaag atcaccccat agatgtacaa cagctcgaga 600 attcgttatc taccatgtta aagaaggagt gtacccaagt gaaagatact ttagaaacaa 660 aggttcaaga gttggggcaa gcaattatgg attgcctgaa acggagagat ggacaattga 720 aatcattaat ccagccttct gggggtgcaa cgtctactcc acacttcagc cacaccatat 780 tagaccatgg atcttgtcgt cctgtacatt ttaagactcc tatcaagcta gagtttccaa 840 aatttgggag tttagatggg gaagacccta ttacatacct ggaacgatgt gatgaatatc 900 tggctgtaca acctttaaat gactctgaga tcatatccat gcttccttct gtattgacac 960 acacagctaa agactggtgg gtagccgaga agaagagggt aagaacatgg acacaattca 1020 agtctgtttt tcttcaatct ttcttagcag atgatcatga tgttgaagtg gaaagaagga 1080 tcagagaaag gaaacaaggg gttgatgaaa gtattcgaac atttgcctac cagtacagag 1140 cattatgtct gagactgaaa ccttccatga cagagcgtga gattctccaa gcagcactac 1200 ggaactgtaa cccaagaata gcaagcattt taagaggtac tgtaactact gtagatgagt 1260 tggtacgtgt aggaacactt atagaaaaag atatcaatga agaaagatca ttttggagac 1320 agaggcacca agaagctaat gcaaagtcca ctgagggtaa taaatttttt aagggccgac 1380 agtcgaaccc tcacatagct gtatgttctg atagcagtga aagatctcct gtcacattaa 1440 cattgccctt aaccattaaa ggtcatcaat atcaagcaat tctggatact gggagcactt 1500 actctttaat tcaagagtcc tgctggaaac gactaaagtc aaatcatgaa gttttgcaat 1560 cgagtagagg acagtccttt tctcttgcaa atggatgtgt 
acaatcagcc ttggggaaga 1620 tagcttggca ggctaccatc cacggacatg actatcctat cagagcatat gtaatgaaag 1680 attgtgactt ggcttttcct gttttgttgg gcctggaatt tttgaagatg tctggaatca 1740 cagtcgattt tagaaattct tcctattctt tacctgaaga atatggggta atccactcct 1800 tcacctcttc ctccccttca ccaatagtaa gtctgcacct tgcactacct ctaataccaa 1860 caccttctac tgatctaacc attataaagg agttagtgga tcgagctgat gtctctaaag 1920 cccatagacg tcaactagaa ggactgatgc ttgattggcc cactgtatgc actgaaactt 1980 taggtcaaac caacttgatt catcatcaaa tccatacaat tgatgaaatt cctgtgcgaa 2040 agaaggccta tcctgttcca gtcaacaaac agaagtttat agatgaggaa atagcaagaa 2100 tgcttgacaa aggcattata agaccttctg tatctccatg ggcatcacca gttgtacttg 2160 tgcctaaaaa agatggcagt acccgctttt gtgttgatta tagagctttg aactccaaga 2220 ctcctcttga tgggtttcca atgcctcaaa ttcaggatat ccttgagtcc ttgtatggag 2280 caaccatatt tagcacatta gacctcaaat ctggctactg gcaggtaaaa atggatgaag 2340 acagcatcaa aaagactgct tttgtcacca aaaatgccca atatgagttt cttcgtcttc 2400 cttttggcct gcgaaatgct gctgcaacct ttcagaggct catgaacaat gttctgagag 2460 actacatggg agagttttgc tttgtctatc ttgacgacat tgtggtttac tcaaaaacca 2520 tccaagatca ctttcaacat ctcaagctac tctttgcaaa attgcaagac tctggtttaa 2580 cactcaatct caagaaatgt tctatgttgc agaggaccat tacttaccta ggacatgttg 2640 tttctgagga aggagtacgg actgaagaca ctaaaatcaa agcagttcag gattttcctg 2700 tcccaaaaaa tctcaaagag gtacagagat ttttaggtct tgctagttgg taccatcggt 2760 tcatttctca cttctcagag cgagctgctc cattgtatgc actgaaaggt aagaatgcaa 2820 tctggaactg gacagttgaa tgtcaaagtg cctttgatga tctcaaatat gcactacaac 2880 gagcaccagt attaatgccc ccggatttca ccaaagtctt tagagtgcag actgacgcca 2940 gtgacatagg actaggagct gtattgacac aggattttga tggtgcagaa cacgtcattg 3000 cctatgcttc acgtctttta catggagcag aaaaatcata ctccactgca gagaaggaat 3060 gtcttgcggt cgtgtgggct gtagagaagt ggaggcagta tttggaagga cgaaactttg 3120 aagtactgac ggatcattct gcgctgactt gggttttcaa taaccctaaa ccatcttcac 3180 gcttaaccag atgggcatta cgactacaat gtttcagttt cctggtcaag taccgtaagg 3240 ggtcctgcaa tgtggtacct gatgccctat ccagaggtat accagggcaa 
gaggttgtag 3300 gtcatattgc catctgccag gctaacaaga ctgatcctaa tttgccagtc agctgggatg 3360 aaattgggaa agctcagaag cttgattctt ctttgcaagc tctatgggaa gcagctaagc 3420 aagccaccac agattccagc cgcattgctt attgtgtaca gaatgactac ctctttcgca 3480 gggtgcctaa caaagatcaa gggtgtgttt accaactagt catccctgca tcactgagag 3540 aacagttttt acacttcgcc cattcaaatc cactgagtgg acacttagga aggatgaaaa 3600 ccttgaagag attacttgac agtgtctatt ggcctgaaat ccgtaaggat gtttggagct 3660 tttgcaccca gtgtaagact tgtcaaatat ataaaccgag aatctcgaaa ctgtctggat 3720 tgttgcagtc aacacccgta gtagagcctg gctatatgct gggagtggat ctaatgggtc 3780 ctttcccaaa aagtaatcga tcgaatgaat tcttattagt ctttgtggat tattgcagca 3840 agtgggtgga actttttgct ttaagatctg caaaaactca cctcataact aacatcttga 3900 caaaagagat gttcactcga tggggaactc cagcatatct cgtctctgac cgcggccctc 3960 aatttacagc ccaattgctc aatgagacct gcaagcgatg gggagtagtt cagaagctca 4020 gtactgccta tcatccgcag accaacttaa ctgagaggat aaaccgaacc ctaaaaacaa 4080 tgttgtcctc ttatgtccat gacaatcacc gtgactggga taaatggatc cctgagttca 4140 ggtatgccat taactcagca tggcaggaaa gtacgggttt cactccagct gaagttgctt 4200 tgggacgtaa gttgaagggt cctctagaca ggctgatcca gagaccacct aatccagacc 4260 atctagcata taacaccctt gaaaggcaga aagctttcct tgagcaagtg attgcaaaga 4320 ccagccaagc acaagagaga caaggaaaat actacaatca acgtagaaaa ccaaagtcct 4380 ttgaggaagg agatttggtt tggatcctca cacaccctct gtctcgagct gcagattcct 4440 ttatggccaa attagctccg aaatggcaag gtcctggcaa aattgtaaaa aaggtaaata 4500 atgtcaacta cagagtggta atgcttgata aacccagtca atgtgatact taccatgtgg 4560 aaaagttgaa ggaattttat ggaactgtat aactcttttc tttaggggaa ggggtg 4616 // ID DIRS1_DR repbase; DNA; ZEB; 6132 BP. XX AC . XX DT 11-FEB-2002 (Rel. 7.01, Created) DT 20-SEP-2005 (Rel. 7.01, Last updated, Version 3) XX DE DIRS1_DR is a DIRS-like LTR retrotransposon - a consensus. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; DIRS1; DIRSDR1; DIRS superfamily; KW reverse transcriptase RNase H; phage integrase; DIRS1_DR. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 414-5132 RA Jekosch K.; RT "DIRSDR1: putative non-LTR retrotransposon."; RL Repbase Reports 2(2), 9-9 (2002). XX RN [2] RP 1-6132 RA Kapitonov V.V. and Jurka J.; RT "DIRS1_DR, a family of DIRS-like endogenous retroviruses in RT zebrafish."; RL Repbase Reports 3(1), 1-1 (2003). XX DR [2] (Consensus) XX CC DIRS1_DR is a family of DIRS1-like retrotransposons. These CC elements CC are related to Gypsy-like LTR retrotransposons and endogenous CC retroviruses. CC There are ~100 copies of DIRS1a_DR in the genome, they are ~0.3% CC divergent from the consensus sequence. Therefore, this family CC retrotransposed in the zebrafish genome very recently. CC The unusual structure of DIRS1_DR is depicted in the next figure. CC GTTCCCCTTCGGTTGGGGAACTTCAGTGCCATGAATGGGAGGATTCGGATCAGAAGCCGCTTATCTGGAG CC <====== ======> <--------------------------------------------- CC AGTATTGAACGGGCCAATGAATGAAATTAATTGGCAGCGTAAGCTTGCGCAGGTGTGCGACATCTGCAAT CC ---------------------------------------------------------------------- CC TATCTCAGCATATAAGCACACCTGAAGCCAGCAGACGCCATCCTTTTCGCTTCAGATCCTTTCTGAGTGA CC ----------------------------------------------- CC ...................................................................... CC ...................................................................... 
CC GGTGCAGTCATTATGGCGCTTTCCATATTCTCCCATTCATGGCACTGAAGTTCCCCAACCGAAGGGGAAC CC <====== ======> CC <~~ CC GTTCGAGGTTACAGAAGTAACCCTTCGTTCCCCGAGGAGGGGAACGGAAGTGCCATATTCCGTCGCCATA CC ~~~~~~~~~~~~~~~~~~~~~~~~~~<====== ======> CC ATGACTGTCCCTTAGCTGTTTGAAAGTCTCTTCAGCTT CC AAAAGGATGGCGTCTGCTGGCTTCAGGTGTGCTTATATGCTGAGATAATTGCAGATGTCGCACACCTGCG CC ---------------------------------------------------------------------- CC CAAGCTTACGCTGCCAATTAATTTCATTCATTGGCCCGTTCAATACTCTCCAGATAAGCGGCTTCTGATC CC ---------------------------------------------------------------------- CC CGAATCCTCCCATTCATGGCACTTCCGTTCCCCTCCTCGGGGAACgaagggttacttctgtaacctcgaacgtt CC ----------------------> <====== CC ======>~~~~~~~~~~~~~~~~~~~~~~~~~~~~> CC Fig.1 Termini of DIRS1_DR. The 163-bp sub-terminal inverted CC repeats are CC underlined by a single line. CC DIRS1_DR encodes three ORFs. ORF1 (positions 414-1632) codes for CC the CC gag-like protein. ORF2 (positions 1633-2597) codes for reverse CC transcriptase and RNase H. ORF3 (positions 2598-5129) codes for CC the CC phage integrase. 
XX FH Key Location/Qualifiers FT CDS 414..1850 FT /product="ORF1p" FT /note="Gag-like protein" FT /translation="MALRLCVSGCGGFLSPDDGHDHCIACLGVQHVNAVLA FT GGSCRHCDAMTVAQLRSRLTFARERATPVASCSKKAAGARADLRVSAGANP FT PPTGSRTSRSSRRSIQASGGESDPSNQMVALTLADTGDQMSSAASEGGLSL FT SDEDPDPLAPSGQVSAVKSDPEADMLAVLSRAASAVGLEMVYPPAPRPDRL FT DGCYVEDQKAKPSKPLVPFFPEVHSRLTQSWRAPFSARAASASALTALDGG FT AARGYEAIPSVERAIAVNLCPRGASTWRGLPRLPSKACRLSASLGARAYKA FT AGQAASALHAMATYQRYQAQALAELHEGGSNPSLLHELRTATDYALRTTKS FT AACALGRTMSTLVVQERHLWLNLADMRDVDKVRFLDSPISQAGLFGDTVGE FT FTQEFKAVKEQSDAMGNVIYRRGRKPAPPAEPSTSAVPRRGRPPTSAAPPP FT PAPPAKRARRSPRKQAAPPAQGAVKSGKRTAKRP" FT CDS 2598..5129 FT /product="ORF3p" FT /translation="MRSSSGLVCSHRPEGRVFPCLHSSTPPPISAVCVRGS FT SVAVQGPPLRALSVSAGLHQTRGGCPSAPSARGHSHTQLSRRLADFSPLAG FT AIDYAQGRGASASPPTGASGQPRKEQTRPRAEDFFSRDGAGLDHHGSAPLR FT GTRSPVAELSEGARQQTSGPTEVLSEAPGAYGIRSRRHAARVAPYETTSAL FT ASRSGPQTRMARGHTPGLGYCAVSPRPQPLERPLVPTGRCASRTGVQPCCC FT FNRRFQHGLGGRVSRACGCGPLEGCPAALAYQSPRAVGSVPRSPPLFTGAG FT AATRAGQDGQYGGGGVYQPHGGYALSPHVSARPPSAPLESPAAEIAARHSR FT PRHAQSCSRCALTTAVTPWRMETPPRVCSADMGAIRGGPDRSVCFPRERSL FT PVVFFPDRGLSRHGCTGPQLASGHAQVCVSPSEPARAVSVQGQGGRGTGSA FT SCAPLAQPDLDIRALTPRDGPPLADPFERGPTLSGTGHHLAPSPRSLEPPR FT VVPRREEDLGNLPTAVVNTITQARAPSTRRAYALKWSLFTEWCVSRREDPR FT NCQISVVLSFLQEKLDSRLSPSTLKVYVAAISAYHSAVAGGTVGKHNLVIQ FT FLRGARRINPSRPPLMPSWDLALVLTSLRSDPFEPLESVSLRFLSLKTALL FT VALASIKRVGDLEAFSVSDSCLEFGPDYSHVILRPRPGYVPKVPTTPFRDQ FT VVNLQALPPEEADPALSLLCPVRALRIYVDRTQNFRSSEQLFVCYGGRQQG FT SAVSKQRLSHWIVDAISLAYSSRGQPCPPGVRAHSTRSVASSWARARGASL FT TDICRAAGWATPNTFARFYNLRVEPVSSRVLGNPLVIEETTR" FT CDS 1633..4110 FT /product="ORF2p" FT /note="reverse transcriptase" FT /translation="MRWAMSSIGVAVSPLRPPSHPPPLFLAEGARQRVLPR FT PRLRLRPSGRGVHLESRQPLLPRAPLSPVNGPRSVPETGHPEKRKLALSPL FT EGGAPITTVLFSATKTSVKEHFFPSPDVTARVLPVRDALPSGSQTLRASPV FT AHERWGDGLPSLSPPAPSPESGCGARANRSPPAFPRDPRASRISTPTPRCP FT TAGTSAIVAMTPLARALPAWLARASPSRWLIRTIRLGYAIQFAKRPPKFTG FT VYFSRVNPLSAPVLREEIAALLAKGAIEPVPPAEMESGFYSPYFIVPKKSG FT 
GSRPILDLRVLNRCLHKLPFRMLTQRRILQCVRPRDWFAAIDLKDAYFHVS FT ILPRHRQFLRFAFEGRAWQYKVLPFGLSLSPRVFTKLAEGALAPLRLAGIR FT ILSYLDDWLILAHSREQLIMHRDEVLRHLRLLGLQVNREKSKLAPVQRISF FT LGMELDSITMVAHLSEERARLLLNCLRELDSKLVVPLKFFQRLLGHMASAA FT AVTPLGLLHMRPLQHWLHDRVPRRAWHAGTHRVSVTALCRRALSPWNDPSF FT LQAGVPLGQASSHVVVSTDASNTGWGAVCRGHAAAGLWKGAQLHWHINRLE FT LLAVFLALHRFLPVLERQHVLVRTDSTAAAAYINRMGGMRSRRMSQLARRL FT LLWSHPRLKSLRAIHVPGTLNRAADALSRQLLRPGEWRLHPESVQLIWARF FT GEAQIDLFASPENAHCQLFFSLTEGSLGTDALAHSWPRGMRKYAFPPVSLL FT AQFLCKVREDEEQVLLVAPLWPNRTWISELSLLATALPWRIPLREDLLSQG FT QGTIWHPRPDLWNLHVWSLDARKT" XX SQ Sequence 6132 BP; 1117 A; 1898 C; 1706 G; 1411 T; 0 other; gttccccttc ggttggggaa cttcagtgcc atgaatggga ggattcggat cagaagccgc 60 ttatctggag agtattgaac gggccaatga atgaaattaa ttggcagcgt aagcttgcgc 120 aggtgtgcga catctgcaat tatctcagca tataagcaca cctgaagcca gcagacgcca 180 tccttttcgc ttcagatcct ttctgagtga gtcgatgagg gttcctcttg ctgatcagca 240 cttcagagcg aacgagtgtg tctcccggtc cagagtgggt cttcgcggtg gcagacggtc 300 gagctgggtt actcccttgc ctgcggttct ttgggtccgg tcctccagag cggtgcgtat 360 agttgcaact ttcctaaaag agcaacacag tcgtgcagca cgtccttttc aggatggcgc 420 tccgactgtg cgtttctgga tgcgggggtt tcctgtctcc ggatgatgga cacgatcact 480 gcattgcatg tttgggggtc cagcatgtta atgcggtgct cgcgggcggt tcatgtcgtc 540 attgcgatgc catgaccgtt gcacagctaa gatcgcggct aactttcgca agagagcgag 600 ccaccccagt tgcctcctgt tctaaaaaag cagcgggcgc tcgggcagat ctgagggttt 660 cagcgggagc taatccgccg cccacgggct cgcggacctc tcgctcctca cggcgctcca 720 tccaagcttc gggtggtgag agtgatccgt ctaaccagat ggtagctctc acactcgctg 780 acaccggaga tcagatgtcc tccgcggcat cggagggtgg gctttcactg tccgacgaag 840 atccggaccc gctcgccccc tccgggcagg tgagcgctgt caaatcggat cctgaagcgg 900 acatgttagc cgtgctttcc cgggctgctt cggccgtggg gttggagatg gtttatcccc 960 cagctccgcg gccggaccga ctagatgggt gctacgtaga ggaccagaag gcgaagcctt 1020 cgaagcctct cgtccccttc ttcccggaag tgcacagtag gctcacgcag tcctggaggg 1080 cacctttctc tgcccgtgct gcgagtgcct ccgccctcac cgcccttgac ggcggagctg 1140 ccagggggta tgaggcgatc 
ccgtcagtgg agcgcgctat cgcggtcaat ctttgtccgc 1200 gcggcgcctc tacgtggcgg ggtttgcccc gcctcccgtc caaagcctgt aggttgtctg 1260 cctccctcgg agccagagct tataaggctg cgggccaggc tgcttctgct ttgcacgcga 1320 tggccaccta ccagcgctac caagcgcagg cgctggccga gctgcacgag ggcgggtcca 1380 acccaagctt attacatgag ctgcgcaccg cgaccgacta tgctcttcgg actactaagt 1440 ccgccgcgtg tgcgctgggg aggacgatgt ccacacttgt ggttcaggaa cgccacctct 1500 ggctaaacct ggccgatatg cgcgacgttg acaaagttcg ctttcttgac tcgcccatat 1560 cccaggctgg cctgttcggc gacaccgtcg gtgaattcac ccaggaattc aaggcggtga 1620 aagagcagtc ggatgcgatg ggcaatgtca tctatcggcg tggccgtaag cccgctccgc 1680 ccgccgagcc atccacctcc gctgttcctc gccgagggcg cccgccaacg agtgctgccc 1740 cgcccccgcc tgcgcctccg gccaagcggg cgcggcgttc acctcgaaag caggcagccc 1800 ctcctgccca gggcgccgtt aagtccggta aacggaccgc gaagcgtccc tgagacaggc 1860 catccggaga agaggaaact tgctctttcc ccgctggagg gcggggcccc gataacaacg 1920 gtacttttca gtgccaccaa aacatcagta aaagagcact ttttcccttc cccggatgtg 1980 actgcacgag ttctgccagt ccgggacgcg ctgccttccg gctcgcagac tctacgtgct 2040 tcgccagtgg ctcacgagcg ctggggggac ggtctccctt ccctcagccc tccagccccc 2100 tctccggagt cagggtgcgg agccagagcg aatcgctctc ctccagcttt tccgcgggac 2160 cctcgtgctt cccggatcag cacacccact ccgcgctgcc ccaccgctgg tacgtcagcg 2220 attgtagcga tgactccatt agcgagggct ctgcctgcct ggttagcgcg ggccagcccc 2280 tcgcggtggc tcatacgcac aatcagactc ggttacgcga ttcagttcgc gaaacggccc 2340 cccaagttta cgggcgtgta tttctccagg gtcaaccccc tgtccgcccc tgtcttgcga 2400 gaggagattg ctgccctcct ggcgaagggt gcaatcgagc cggttcctcc agccgagatg 2460 gagagtgggt tttacagccc atacttcatc gtacccaaaa agagcggtgg gtcacggcca 2520 atcctagatc tgcgcgtttt gaaccgctgt ctgcacaagc tgccgttcag aatgctcacg 2580 cagaggcgca ttctccaatg cgttcgtcct cgggattggt ttgcagccat agacctgaag 2640 gacgcgtatt tccatgtctc cattcttcca cgccaccgcc aatttctgcg gtttgcgttc 2700 gagggtcgag cgtggcagta caaggtcctc cccttcgggc tctctctgtc tccgcgggtc 2760 ttcaccaaac tcgcggaggg tgccctagcg ccccttcggc tcgcgggcat tcgcatactc 2820 agttatctcg acgactggct gattttagcc 
cactcgcggg agcaattgat tatgcacagg 2880 gacgaggtgc ttcggcatct ccgcctactg gggcttcagg tcaaccgaga aaagagcaaa 2940 ctcgcccccg tgcagaggat ttcttttctc gggatggagc tggactcgat caccatggta 3000 gcgcacctct ccgaggaacg cgctcgcctg ttgctgaact gtctgaggga gctcgacagc 3060 aaactagtgg tcccactgaa gttctttcag aggctcctgg ggcatatggc atccgcagcc 3120 gccgtcacgc cgctcgggtt gctccatatg agaccacttc agcactggct tcacgatcgg 3180 gtccccagac gcgcatggca cgcgggcaca caccgggtct cggttactgc gctgtgtcgc 3240 cgcgccctca gcccttggaa cgacccctcg ttcctacagg ccggtgtgcc tctaggacag 3300 gcgtccagcc atgttgttgt ttcaacagac gcttccaaca cgggttgggg ggccgtgtgt 3360 cgcgggcatg cggctgcggg cctctggaag ggtgcccagc tgcattggca tatcaatcgc 3420 ctagagctgt tggcagtgtt cctcgctctc caccgctttt taccggtgct ggagcggcaa 3480 cacgtgctgg tcaggacgga cagtacggcg gcggcggcgt atatcaaccg catggggggt 3540 atgcgctctc gccgcatgtc tcagctcgcc cgccgtctgc tcctctggag tcacccgcgg 3600 ctgaaatcgc tgcgcgccat tcacgtccca ggcacgctca atcgtgcagc cgatgcgctc 3660 tcacgacagc tgttacgccc tggagaatgg agactccacc ccgagtctgt tcagctgata 3720 tgggcgcgat tcggggaggc ccagatcgat ctgtttgctt cccccgagaa cgctcactgc 3780 cagttgtttt tttccctgac cgagggctct ctcggcacgg atgcactggc ccacagctgg 3840 cctcggggca tgcgcaagta tgcgtttccc ccagtgagcc tgctcgcgca gtttctgtgc 3900 aaggtcaggg aggacgagga acaggttctg ctagttgcgc ccctttggcc caaccggacc 3960 tggatatcag agctctcact cctcgcgacg gccctcccct ggcggatccc tttgagagag 4020 gacctactct ctcagggaca gggcaccatc tggcaccctc gccccgatct ttggaacctc 4080 cacgtgtggt ccctagacgc gaggaagact taggtaacct accgactgcg gtggttaata 4140 ccatcactca ggctagagcc ccctccacga ggcgcgccta cgccctgaag tggagtctat 4200 tcactgaatg gtgcgtctct cgcagagaag acccccgaaa ttgccagatt agtgttgtgc 4260 tctctttcct tcaagagaag ttggacagca ggctgtcgcc ctccactctc aaggtttacg 4320 tggccgccat ctccgcttat catagcgcgg tagctggcgg caccgtggga aagcataacc 4380 tggtcatcca gttccttagg ggtgctaggc gaattaatcc atctcgcccc cctctcatgc 4440 cctcttggga tctcgccctc gttctcacga gtctgcgatc cgatcccttt gagccactcg 4500 aatcagtatc tctaagattt ctgtccctga agacagctct 
gctggttgcg ttggcctcca 4560 tcaagagggt cggggacctg gaggcatttt cggtcagtga ctcgtgcctg gaattcgggc 4620 cggattactc tcacgttatc ctgagacccc gccccggtta tgtgcccaag gttcctacca 4680 ccccctttag agatcaggta gtgaacctgc aagcgctgcc cccggaggag gcagacccag 4740 ccctttcttt actttgtcca gttcgcgctc tgcgcattta tgtggaccgt actcagaatt 4800 ttagatcatc tgagcagctc tttgtctgtt atggcggtcg gcagcaggga agtgccgtat 4860 cgaaacaaag attatcccac tggattgtgg atgccatttc actcgcttat tcgagtcgag 4920 gtcagccgtg tcccccggga gtacgtgcac actccactcg gagcgttgca tcctcttggg 4980 cgcgtgcacg cggcgcctct ctaacagaca tctgtagagc tgcgggctgg gcgacaccca 5040 acacatttgc aaggttttac aatctgcgag tggagccggt ttcctcaagg gtattaggta 5100 accctttggt gattgaggag acaactcggt agggtgttga aacacgcttg ctgcgccatt 5160 ctccctaaca cggaggtacg tgcgcctttt ttatctgtca gtaaagttcc ccgtcaggtg 5220 agccctgcag attcctccgt ggcccccagc actgactcag cggaggagtc acttgctggc 5280 ccactacgtt gtaggtctgc ccgctggtca gcccgcgttt tgggtatagg tgcctgctat 5340 gcgtgatccc cactaggcga tcccatatgc ttattccgcc acggttaagt cccccccctg 5400 ggcggacccg tgtcttccct ctccgctaac cactcttttg ctatgcgtac tccccctttt 5460 tagggctagt ccataggtaa attctgccat ctatcccccc cttgggtaac ggatggcctc 5520 cgcagcgtcc tccctatcgg gattgcacgc ttcccaacgt actgtcgtat ttcctagaat 5580 tatctagatg ctcacgactt cccaaaaaat atatctaaat ccgtaaaact tctgttgaag 5640 taggataaat tagggccagg gacacgttgg aggaccgcgc cccccatgat gtgggtgcgt 5700 cacgcttgct tgactatctc ctcatcgggg gtgttggtaa ggtgcagtca ttatggcgct 5760 ttccatattc tcccattcat ggcactgaag ttccccaacc gaaggggaac gttcgaggtt 5820 acagaagtaa cccttcgttc cccgaggagg ggaacggaag tgccatattc cgtcgccata 5880 atgactgtcc cttagctgtt tgaaagtctc ttcagcttaa aaggatggcg tctgctggct 5940 tcaggtgtgc ttatatgctg agataattgc agatgtcgca cacctgcgca agcttacgct 6000 gccaattaat ttcattcatt ggcccgttca atactctcca gataagcggc ttctgatccg 6060 aatcctccca ttcatggcac ttccgttccc ctcctcgggg aacgaagggt tacttctgta 6120 acctcgaacg tt 6132 // ID L1-1_DR repbase; DNA; ZEB; 5811 BP. XX AC AL645691; XX DT 02-MAY-2002 (Rel. 
7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE L1-1_DR is a non-LTR retrotransposon from the L1 clade. XX KW L1 clad; L1-1_DR; Non-LTR retrotransposon; endonuclease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5811 RA Kapitonov V.V. and Jurka J.; RT "L1-1_DR, a non-LTR L1-like retrotransposon from zebrafish."; RL Repbase Reports 2(4), 18-18 (2002). XX DR Genbank; AL645691; Positions 114238 120048. XX CC This element is characterized by 12-bp target site duplications. CC It may be an active element. It encodes two proteins: CC 294-aa L1-1_DR1p (positions 172-1053) and L1-1_DR2p (positions CC 1665-5489). These proteins are closest to the corresponding CC proteins encoded by other L1-like elements. L1-1_DR1p is a CC putative RNA/DNA-binding protein, and L1-1_DR2p is composed of CC the AP endonuclease (aa positions 1-200) and reverse CC transcriptase domains. 
XX FH Key Location/Qualifiers FT CDS 172..1053 FT /product="L1-1_DR1p" FT /translation="MAGKLRKYKYTGKTAANQEDANTPSMMSEPAEHTDIL FT EIKAELISSIKTEITSLFQKELKTALSNEFEMVKAELQAVKSEIASNASAV FT RSDLEAIKTTVSDMERGLSSCSDDVTELQNTVRKLEKNVVTLQEKCLDMEG FT RMRRSNIRILNVAEDPGACTPASVSKLLKDTLKMDKDILIDRSHRTLQAKR FT ADGKPRAIVAKLHYYQDCVEILRRVRETGPLHHNGATIFIFPDYPPSVARA FT RSAFNEVRKLLRGKDGVRYGILHPARLRITHNGTEKQFQDAAEALTYVKNN FT IL" XX SQ Sequence 5811 BP; 1884 A; 1132 C; 979 G; 1816 T; 0 other; gtgcccggcg ctactgagca ggcgatggag taaggtaaaa gttgggtgag ctctgcaaaa 60 aacgtaatat aacttataaa tttaaggcgc tttaatcgaa aatatgagat gctacatcta 120 tctaagactt tactacttca cataaagcag ctgggttctc tataaatcca aatggcgggc 180 aaactacgta aatataagta cacggggaaa accgctgcta atcaagagga tgctaataca 240 cctagcatga tgtccgagcc agccgaacac actgacattt tggagattaa agctgagctg 300 atctcatcga tcaaaacgga aataacctcg ctctttcaaa aagaactgaa aactgcactg 360 tcaaacgaat ttgaaatggt caaagctgaa ttacaagcag ttaaatccga gatagcaagc 420 aacgcttcgg ctgttcgctc agatcttgag gcaattaaaa cgacagtgtc agacatggaa 480 cgaggcttat ccagctgctc agacgatgtt acagaactgc aaaacaccgt gcgcaaactg 540 gagaaaaatg tagtaaccct acaagagaaa tgcttggaca tggaagggcg aatgaggagg 600 tcaaacatta gaatactgaa tgtagccgaa gatcctggcg catgcactcc agcctcagtg 660 tcgaagctgc tcaaagacac cctcaaaatg gataaagata tactgatcga tagatctcat 720 cgcactctcc aggcaaaacg tgcagatggt aaacccaggg ccattgttgc aaaattgcat 780 tactaccagg actgtgtgga aattctacgc cgggttcgtg aaaccggacc ccttcatcat 840 aacggcgcaa ctatattcat cttccccgac tacccgccga gcgtggcccg tgcaagatcg 900 gcttttaacg aagtccgaaa actactacga ggaaaagatg gtgttcgtta tggcattctt 960 cacccggcca ggctccgaat cacacacaac gggacggaaa agcagttcca agatgcagca 1020 gaagccctga cttatgtgaa aaataatatt ctttaagacg gctcgaccgt ctctgattga 1080 gtagtaccaa gccacttcag tgactttgtt ttttttcata cactcacact cccatctaaa 1140 tgaccatgtt tgagtgagta tatgaacatt acatctatgc agagactgac tgaattccta 1200 tttcatggat gaggagtgaa aaacaatatt actgatacaa aatcacttac tattttcagt 1260 taaacttaaa agcaggaaat ataattggga cagtacaatt 
actgagttta ttattatcat 1320 tactatgctg ttatgtaact gtcatccttg ttttaatagg tgattgttaa taatatgcgc 1380 caatttaatt tattattttt taatttcatt tttgttttct ttctttaata atgtgcagag 1440 caacttgtga ggttaaaact ccccaagtaa gcactttatg tgtggatatt gttgcgaggg 1500 gttaaagttg caccatgttc tatttggtgt ttgggaatgg gtaaatgtcg cacttcattt 1560 tatacttcta cttcttgttt tgttttctac aatcttattg gaagggtctt ttctgttata 1620 tttaaacgaa gcgtttgtat gtaagcttac attttttaag acgtatggtt aaaccacaca 1680 atgttaacgc atcaggcatt tgccaggtga atcttataag ttggaatgtt aaatctttga 1740 atcatccagt gaaacgtgga aaggttctct cacacttaaa acagttaaat acagatatcg 1800 ctttcctaca agaaacccac ctgaaaactt ttgatcactt tagactaaga ggaggatggg 1860 tgggacaact ctttcactcg acttttcact ccaaatctag aggaacagca attctcatta 1920 gtaaaacggt ttcatttgag gcatcaaaaa tcgaagctga tccagcaggc cgttatataa 1980 tggtagtggg tagactaaat aatactccgg tagttatggt aaatgtatat gcacccaatt 2040 gggatgacag tgcattcttt acgggtctct tctcacgaat acctaatata gatactcatc 2100 atcttatatt aggaggagat attaattgcg tactatcacc ctcactggat cgcagctctc 2160 tcaaaccaat gataccaagt cgtacaactc aagtgattaa ccaacttctt aaaacctatg 2220 gaatgattga tgtttggaga ttccaaaatc ctgggtgtag aggttattca ttttattcac 2280 cagttcataa gacatattca cgtatagatt atttttttct ggacagtgaa ctacttcctc 2340 tagttagtga atgcaaatat aatgcaatag tgatatcaga tcatgcgcca ttattaatca 2400 ctctagatat gccaattaca tcaaacaact atcggccatg gcgatttaat acactacttc 2460 tctctgatgt ggagtttgtt aaatttatat catcagaaat tagagaatat ttagtgcaca 2520 atcagactcc aggaatatct tctagtctta tttgggaatc tcttaaagcc tatcttcgag 2580 gccaaattat atcatatagt gccagattaa agaaaaaaca acatgagcgg cttaaaaaaa 2640 ttgaaaatga tatttttaaa cttgatgaaa ttttggcaca ctcatctaca cctgacatgt 2700 ttagacagcg tttagctctt cagtctgaat ttaatttatt atgtacaaaa caaacagaaa 2760 atcttttaat taagtccagg cataagatgt atgaacatgg tgaaaagata gggaagatct 2820 tagctcacca acttcgacaa caaaatgcag cacattccat tatgtcagtt aatgataaca 2880 ctggcactaa attgacgaat cccttagaga tcaaccatcg gtttagagaa tactattcac 2940 aattatatac ttcggagtct tgtaaagatg agtcattatt tgattctttt 
tttaagaaaa 3000 ttagtctacc cactattgat caagagttcg ctctagacat ggagaatcca ttttcaaaag 3060 acgaatttat tagagcagtg tcatctatgc aaaacggaaa atcaccaggc ccagacggtt 3120 ttccaagtga attctttaaa aagttctctg gcgaacttgc ccctattcta ctttccctat 3180 atgaagaatc ctcagtcacg ggctccttgc cagagactat gaatcaagca attatttctc 3240 taatctataa aaaagataaa aatccatcag aatgcagctc ttatcgacca atttcattgc 3300 tgaatgttga cagtaagata ttcgccaaaa tattagcgca tcggctggaa atagtgctac 3360 ctacaatagt ttctggtgac cagacaggct ttattaaaaa ccgatattca ttctataata 3420 tacgcagact tctaaatatc ctccaccatc ccactccatc tgatgttccg gaagtccttc 3480 tctcacttga tgctgagaag gcttttgatc gggtggagtg ggactacctc ttttacactc 3540 ttaaaaaatt tggatttggc acaaagttca tttcatggat taaaatctta tactcatcac 3600 ctatggcagc aatacgtaca aattgtcaca tttctccttt cttttcgtta gaaaggggaa 3660 ccagacaagg ctgccctctg tcccccttat tatttgcatt ggtaattgaa cctctgtcca 3720 ttgcgatacg aaatgatatc aatatcaagg gtatacagag ggacaacttt gaacataaaa 3780 tttctctcta tgcagatgac accctcctat atatatctga accactaaca actctaccac 3840 aaattatgac attactgact gcctttggga aaatatcagg ttataaaata aatatgcaaa 3900 aaagtgagct tatgcccatt aataatgctg gtagaaagat tatttttacc tcactaccat 3960 ttaaaataac taaagacaaa ttcaaatatt taggtatatg gatcactaat aaatacaaac 4020 atttgtacaa agttaatttc cctccactga tagattccat aaaaaaagac cttgaacgtt 4080 ggaatccgtt accattgtca ctgggaggta gaataaacac tataaaaatg aatatattac 4140 ccagatgttt atatcttttt cagtgcatac ccgtattctt aacaaaatca tttttcttac 4200 ttttagataa attaatatca tcttttatat ggaatggaaa aaatgcacgt atccgtaaaa 4260 atattttaca acgacaccga gaccatggag gattgtcatt acccaacatt cagcagtatt 4320 actgggcagc taatattcga gcaatgctac actggtcaaa tccatcatat gacagtggcc 4380 ctaattggtt atctttagaa aacacatcaa atttttcaac ctctctccat gctctgctat 4440 gctcaaattt tccgacacct gaacctttat ctaaatactc tttaaaccca gttgtcaaac 4500 actcactcaa aatatgggca caatttagaa gaagttttgc acttaaagga ctatcagcct 4560 atgcccccat agcaagaaat catatgttca ccccctctac tatagacaaa acttttgaca 4620 tctggtctat gaaaggtctt aagatattaa aagatatgtt tattgatggg caatttgctt 
4680 cattccaaca agtaaaagtt aagtttcaaa ttccaaattc ccacttcttt agatacctcc 4740 agctgcgaag ttttgtgtcc tcctcaatga gtcactatcc ctcactgcct cccccctccc 4800 tgcttgactc tattatggag ctaagcccat actcaaaagg acttattggc aaaatatatt 4860 ctataattaa ttcccacaat ctggaacccc tagtaaaatt aaaaagaaaa tgggaggtgg 4920 agctagagat agaactatca gaagatatgt ggcaatccgt tttagacaat atccactcat 4980 cttcaatttg tttaaaacat agagttatac aatttaaagt agtacataga ttacattggt 5040 ccaaagtgaa actagccaaa tttaaaccaa atatagaccc taactgtagc attgagccag 5100 ctactttatc tcatatgttt tgggcttgtt caaaattaaa gaaattctgg cacctaatat 5160 tcaaattcct ctcggacgca ttaaatacct atgtagaacc tgaggccata atttcaattt 5220 ttggaatcac accacagtcc ttatgtttta acaaaagcaa gataaatgtg attgcctttg 5280 ctacgctttt agctagaaga ttaatattgc tgaaatggaa ggaaaaactt cctccaacct 5340 ttaagcaatg gcttatggaa cttctacacc acctgacctt agaaaaaata cgatacactt 5400 ttggaggctg tactgatatg ttttttctca cctggcaacc tgttttagat cacgtaaaaa 5460 agatggaccc ctcagtcatt ttagaagagt agaacctttc cttgtttgtt tgtttgtttg 5520 tttgtttgtt tttttttttt tttttctctt ttctcttcaa ttttttgtat tttctatttt 5580 cttttttctc cccttattaa tgagagtaac atatgtttta cttattataa ctatttattt 5640 gtattttttc ttaatattat ttatttattt tttttttttg aacctaaatg tatgtgcggc 5700 aggttttgtt tgtttgttgt gtaaaaaaaa aaaaaaattg aaaagctatt ttgtaatgta 5760 tgtttcatat ctaatatgtt caataaaaat acttttggaa aaaaaaaaaa a 5811 // ID hAT-2_DR repbase; DNA; ZEB; 2536 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 07-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE hAT-2_DR is an autonomous DNA transposon - a consensus sequence. XX KW hAT; DNA transposon; Transposable Element; HAT superfamily; KW hAT-2_DR; transposase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2536 RA Kapitonov V.V. and Jurka J.; RT "hAT-2_DR autonomous DNA transposon from zebrafish."; RL Repbase Reports 4(11), 305-305 (2004). 
XX DR [1] (Consensus) XX CC hAT-2_DR is an autonomous DNA transposon. This transposon is CC characterized by the 8-bp target site duplications and 16-bp CC terminal inverted repeats. The consensus encodes the 611-aa CC hAT transposase (pos. 255-2087). The genome contains about CC 1000 copies of hAT-2_DR nonautonomous elements that are ~97% CC identical to the consensus. Nearly all these copies are CC nonautonomous elements. The transposase-encoding elements are CC present just in 2 copies. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="hAT-2_DRp" FT /translation="MPANTMDRFLRPPDSKPSSSGLKPKRRRYDDQYLSLG FT FTWTGPADEPRPLCVVCQDILANDSMRPAKLRRHLETKHDEVAGKPPEFFK FT RKLQTLQGQKKIVEDFVKLNGKATEASYRVALRIAKAGKAHTTGETLILPA FT AKDICSVMLGEAAASKIDSVPLSDNTISRRISDMAQDVKEQVLDGVRHSPF FT YALQIDESTDVASCAQLLTYVRYVKNMDIHEEFLFSSPLPAHTTGEQIFNQ FT LNEFVRKNDVEWERCCGICSDGAKSMTGRYSGLMSRVKEVAPNAIWTHCTI FT HRQALAAKKMPNDLRSVLDEAVKIINLIKARPLNARLFHILCDELGAHYKQ FT LLLHTEVRWLSRGRVLSRLLDLREEVLLFLSNVQSTLVQHMSDLSWIARLA FT YLSDIFERLNALNLSLQGRDCNVFSAFEQVSSFRRKLDLWATRVEKGCLDM FT FPTLADFMQEAGSVVHIQPLVAEHLRGLCQQFTHYFSNETILDEWIRNPFK FT FKPAESDVLSIQDEEALIDLTSNHELQQMITHSSIEHFWLSVQNEFPELTQ FT KALRKLLPFVSTYLCEPEFSALTFIKNKYRSRLQVEDDLRLFLTSLQPRIS FT LLCAARKQLHTTH" XX SQ Sequence 2536 BP; 746 A; 506 C; 560 G; 724 T; 0 other; caggggtttt caaagtgtga ggcgcgcctc ccctgggggg cgccagagca tgtcagggga 60 ggcgcgggaa aaaatattat ataataaaaa tataattatt aagtttaatt attatatgta 120 ttttttatta tatttaaacg ttttaattaa acaaagctaa aaaaataata cgtcaaaaat 180 aagaaaacct tttttaccca gaaggccata gctgtgaatt cgcttctgtt tggcaagccc 240 gccaatacag gtatatgcct gctaatacaa tggatcggtt tctgagaccc cccgattcaa 300 agccttcaag ttcagggctt aaacccaaaa gacgacgata tgatgatcag tatttgagtt 360 taggatttac gtggacagga ccagctgatg aaccacgacc tttatgtgtg gtttgtcaag 420 atattttggc taatgacagc atgagacccg ctaaacttcg gcgacacctt gaaaccaagc 480 atgatgaggt agcaggaaaa cctccagaat ttttcaagag aaaacttcaa acccttcaag 540 gtcagaaaaa aattgtggaa gattttgtca aattaaatgg aaaggccact gaagcttcat 600 
atcgcgttgc attgcgtatt gccaaggcag gcaaagcaca taccaccggg gagacgttaa 660 ttctgccggc agcaaaagac atttgttctg tgatgctagg agaggcagcg gcttctaaga 720 tcgattctgt cccactctct gacaacacaa taagtcggcg catctcagat atggcacagg 780 atgtgaagga acaagtttta gacggcgtca gacacagccc attttatgca ctccagatcg 840 acgaatccac agatgtggcc agctgcgctc agctgttaac atatgtgcgg tacgtgaaaa 900 acatggacat tcacgaagag tttctattta gtagtccttt gccagcccac acaacaggtg 960 aacaaatttt taaccagctg aacgaatttg tgagaaagaa cgatgtagag tgggaacgct 1020 gctgtggcat atgcagcgat ggggcaaaat caatgacggg ccgctacagc ggtctcatgt 1080 cgagagttaa agaggtagct ccgaatgcca tatggaccca ctgcactatt catagacaag 1140 ccttagctgc caagaagatg ccaaatgatc ttcggagtgt cctcgacgaa gctgtgaaaa 1200 ttattaacct cataaaagca cgacctctaa atgctcgtct tttccacatt ttatgcgatg 1260 aattgggagc gcattacaaa cagctgcttt tgcacaccga agtccgctgg ctgtctcggg 1320 gcagagttct atcacgactt ttggatttgc gtgaggaagt actacttttt ctgtcaaatg 1380 tgcaatccac tctggtgcag cacatgagtg atttgagctg gatcgcaagg ttggcttatt 1440 tgtcggacat attcgaacgc ctcaacgcgc ttaatttatc attgcagggc agagactgca 1500 atgtgttttc ggcatttgag caagtttcct cgttccggag aaagctggat ctatgggcca 1560 ctcgtgtgga gaaaggatgc ttagacatgt ttcccacgct ggctgacttt atgcaagagg 1620 cagggtcggt ggttcatatt caacctttgg tcgctgaaca cctaaggggg ctgtgtcagc 1680 aattcacaca ctacttttcc aacgagacaa tactggatga gtggattcgc aatccattca 1740 agttcaagcc agcagaaagt gacgtactgt ctatccaaga cgaagaggct ttgattgatc 1800 tgactagtaa tcatgaactg cagcaaatga ttacacactc ttccattgaa catttctggc 1860 tctccgttca aaatgaattt cctgaactta cacaaaaagc actgaggaaa cttttaccat 1920 tcgtttcaac gtatttgtgc gaaccagaat tttctgcttt gactttcatc aagaacaaat 1980 atcgttcacg tcttcaagtg gaggatgacc ttcgtctctt tctgacgtca ctacaaccac 2040 gaattagtct tctctgtgca gcaaggaaac aactgcatac tacccactaa ggtaggaata 2100 taattatgta agcatattaa tcaatagtag agatatttaa attttggcgt tgtttaaagt 2160 tctttcccca ctctgttaca gatcatctga ctgttttgac tgggatggac agttcttgga 2220 tctgttctat tcatattgaa ccatcatcct agttttaaaa acgttatatt aaaatacttg 2280 ttattactct 
gtaaataatt gcactttcat gcaaatgatg ataaaagtga gttaacagtc 2340 tgacatctgt ctgtgtcttt atatatactg tatatatata tatatatata tatatatata 2400 tatatatata tatatatata tatatgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgcgtg 2460 tttgggagga ggggggcgcc aatggataag ttgtgtcaaa agggaggccc actgtcttag 2520 actttgaaaa accctg 2536 // ID L1-4_DR repbase; DNA; ZEB; 5548 BP. XX AC AL807749; XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 18-MAY-2005 (Rel. 7.07, Last updated, Version 2) XX DE L1-4_DR is a non-LTR retrotransposon from the L1 clade. XX KW L1 clade; L1-4_DR; Non-LTR retrotransposon; endonuclease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5548 RA Kapitonov V.V. and Jurka J.; RT "L1-4_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 24-24 (2002). XX DR Genbank; AL807749; Positions 100605 95058. XX CC L1-4_DR is a family of L1-like non-LTR retrotransposons. This CC family was active recently (no stop codons in ORF1; CC a few stop codons in ORF2). CC It encodes two proteins: CC the 459-aa L1-4_DR1p (positions 171-1547) and the 1294-aa CC L1-4_DR2p CC (positions 1577-5461, a conceptual translation). CC These proteins are most similar to the corresponding CC proteins encoded by other L1-like elements. L1-4_DR1p is a CC putative CC RNA/DNA-binding protein, and L1-4_DR2p is composed of the CC AP endonuclease (aa positions 1-200) and reverse transcriptase CC domains. 
XX FH Key Location/Qualifiers FT CDS 171..1547 FT /product="L1-4_DR1p" FT /translation="MDTLVKELELKMDYGSMDKATPENNNSRDSNNGIDDN FT DTWATVVARRRKTKSDTSREGSGEMQQTKGKANAEHLENSQQSSHRNEMIN FT KLKSQARFQRQYKKETTLTMTVKDPENITVTMIIKAVEDKTGIGKLFGLRK FT KSNFDYELTMENETDCDHLMDGLMINQQFCEVSKLCATERMVSFLNLPNYI FT QDSEIIQKLVDWGVSPILPLRRRYHPGTTVADGTRFIRVKFPKEVMSLPYN FT VKFDTEEGPKYFRVIHDQQIKTCRLCGSAEHEKKDCPQFVCRECLEQGHFT FT RDCKAPRCQGCKKTILWCRCESDEEETGVMETNKQMEKSSNEEREEEVQEL FT PPDDLNEQEEEQAMSEEDEGDTADTEAQDLMKDDGHDMEEEQGAAGENTES FT RIEEEVSDDDDNEEINIGTKDRTIDSINRRRRKTVQLNIQQVLKKQKLRKE FT AKAKLKTERTDLRF" FT CDS 0..0 FT /product="L1-4_DR2p" FT /translation="MDCFIWIFFCLDGLFSFIFLMNNLCLVSINVRGLSSK FT VKFENVIALTKKCDVICIQETGWNENIVNDLKKCWDGEILYNNDPNKKKGM FT AILIRRGIGYTFDVLFKDNYGRILTIKIMNKDEEIRICNIHAPNEDLERVT FT FFKDLSVLMSGWNNVIVLGDFNTVLERIDVDDHMVFRADVGRRELKHMIEK FT HKYVDVWRERNRAKREYSRRQWVNTVLKQSRLDYVLCTRNVESFISNIFYK FT IFSCSDHDFLYVMMDFSGVERGPGVWVFNTELLKNDFYKIEMENIIINSVN FT DELYDEEISVWWDNVKLEAKRFSIECSKKMQKAKRAKERQLNKEWENEMEK FT ITEGNMDIRRIVILEEKLKKLEEEKCMGARIRSKIKNTVEGERSTKFFYDL FT EKTRQKADLIKNVSTKEKTVKDKESILRTVKDFYETLFKAKGVHEEDKDFL FT LNQIKVKVSEEDKKLCDSDITEEEINEAITQLSNGKSPGLDGLSSEFYKTF FT KDVLIPILKDLFIAIFKKGQLSESMKKGMIKIIYKNKGDKDYLQNYRPLSM FT LNTDYKILAKILANRLKKVVPTLITTNQAYGVIGRDIADTVTSIRDLIWYI FT KEKKDEGFLFSIDLEKAFDRVEHSYLFDIIQKFGFGENFIKWIKCFLYRDI FT FSCFKINGFLTDYMEISRSIRQGCPLSALLYTLVAEPLGLAINGEKKIKGF FT KIEINRTEQKIYQYADDTTLFLKDFKSVGKAMEIFDKYCRGSVAKVNKEKT FT EYMKMGKVDVQQGNWEYKEQKKYINILGITLGYDENKTREIIWDELINKME FT KRLCFWKQRVLFLKGKVLVLNSLFLSKMWYVLSVVSLPTWVYKKLKTMILN FT FLWDDKPSKIAYNTIIGKVDEGGLRLIDPWIRIKSMRIKTLKKFLNEDNIL FT WKSIMSYFINKCGQIRDDFLWMAFKDRMIENIPEFYEELLRTWKCFYNNIQ FT TEIEGRKLYLQQPLFLNQNNKSKKQMFYENWYAVGFRQVKDILYEIKPGFL FT PTQAIIDTLEEIEDVDDKEKIEDQYKKLRLALPDHWIKTIEENE" XX SQ Sequence 5548 BP; 2269 A; 553 C; 1134 G; 1592 T; 0 other; tgctttcaag gaagtgtgag gtggcagtag ggagagaaag gctctcccat tttgatttgc 60 tttattgttt tttttcttga ttttgcttaa attaaattgt attaattttg tttagtttta 120 gttaaacccc agacagtgtt 
atctgtttgg ggttaaaacg ttttgaaagg atggacactt 180 tggtaaagga actggaacta aaaatggact atggcagcat ggacaaagct actcctgaaa 240 acaacaattc aagagattca aacaacggca tcgacgataa tgacacatgg gcaactgttg 300 tggcaagaag gaggaaaact aaatcagaca caagtagaga aggaagtgga gaaatgcaac 360 aaactaaagg taaagcaaat gctgaacact tggaaaacag ccaacaatca agtcatagaa 420 atgagatgat aaacaaactg aaaagtcaag ctagatttca gcgacaatat aaaaaagaaa 480 caactctgac aatgactgtg aaagatcctg aaaatatcac tgtaacgatg attataaagg 540 ctgtggaaga taagactgga attggaaaat tgtttggact gaggaaaaaa tccaattttg 600 actatgaact tactatggaa aatgaaacgg actgtgatca cttaatggat ggactaatga 660 ttaaccaaca attttgtgaa gtatcaaaac tctgcgcaac tgagagaatg gtttcttttt 720 tgaacttacc caactatatt caagatagtg aaatcatcca aaagctggtg gactggggag 780 tttctccaat tctcccactg agaagaagat atcatccagg aacaactgtg gctgatggaa 840 caaggtttat cagagtgaaa tttccaaaag aagttatgag tcttccttac aatgtaaagt 900 ttgatacaga ggaaggacca aaatatttta gagtgataca tgatcagcag ataaaaacat 960 gcagattatg tggaagtgct gaacatgaaa aaaaagactg cccacaattt gtgtgtagag 1020 aatgtctgga gcaggggcat tttacgcggg actgtaaagc cccacggtgc caaggctgca 1080 aaaagacaat attgtggtgc agatgtgaat cggatgagga agagactgga gttatggaaa 1140 caaataaaca aatggagaaa tcaagcaatg aagaacggga agaggaagta caggaattac 1200 caccggatga tttgaatgaa caagaagagg agcaggccat gagtgaagag gatgaaggag 1260 atacagcaga cactgaggca caagatctaa tgaaagacga tggacacgac atggaagaag 1320 aacaaggagc agcaggtgaa aatacagaaa gcagaattga ggaagaggta agcgatgatg 1380 atgataatga agaaataaat attgggacta aagacagaac aatagacagc ataaacagaa 1440 gacgcagaaa aactgtacaa ttaaatattc agcaagtgct taagaaacaa aaattacgaa 1500 aggaagcaaa agcaaaacta aaaactgaaa gaactgatct aagattttag atcataaaaa 1560 agactacaaa tgattgatgg attgtttcat ttggattttc ttttgtttgg atggcttgtt 1620 ttcttttata tttctaatga acaacttatg tttggtttca attaatgtaa gagggctgtc 1680 atccaaagtg aagtttgaaa atgtaattgc tttaacaaaa aaatgtgatg ttatatgtat 1740 acaagagact ggatggaatg aaaacattgt taatgattta aaaaaatgtt gggatgggga 1800 aatattgtat aataatgacc caaataagaa aaaaggcatg 
gcaatattaa ttagaagggg 1860 aataggatat acatttgatg ttttatttaa agataattat ggaaggattt taactattaa 1920 aattatgaat aaggatgaag aaataagaat atgtaatata catgctccaa atgaagactt 1980 ggaaagagtt acttttttta aagatctaag tgttttaatg agtggatgga ataatgttat 2040 tgttttagga gattttaata ctgttttaga aagaatagat gtagatgatc atatggtgtt 2100 tagagcagat gttggaagaa gagaactgaa acacatgatt gaaaaacata aatatgtaga 2160 tgtatggaga gagagaaata gagccaaaag agaatactca agaaggcagt gggtgaatac 2220 agttttaaaa caaagcagat tagactatgt tttatgtaca agaaatgtag aatcttttat 2280 ttcaaatatt ttttacaaga tttttagctg tagtgaccat gattttctgt atgtaatgat 2340 ggatttcagt ggagttgaaa gaggaccagg tgtatgggtg tttaatacag agcttttaaa 2400 gaatgatttt tataaaattg aaatggaaaa cattattatt aatagtgtga atgatgagtt 2460 atatgatgaa gaaataagtg tgtggtggga caatgtaaaa ttagaggcca aaagattttc 2520 aatagaatgt tcaaagaaaa tgcagaaagc caaaagagct aaagaaagac aattaaacaa 2580 agaatgggag aatgaaatgg aaaagataac agaaggaaat atggatatta ggagaatagt 2640 gatattagaa gagaaactga aaaaactaga agaggaaaaa tgtatgggag ctagaataag 2700 aagcaagata aaaaatacag tggaaggaga aagaagtaca aagttctttt atgatctaga 2760 aaaaacacga caaaaagcag atttgataaa gaatgtctca acaaaagaga aaactgtcaa 2820 agataaagaa agtattttaa gaacagttaa agatttctat gaaactttgt ttaaagcaaa 2880 aggagttcat gaagaagata aggatttttt attgaatcaa ataaaggtta aagtaagcga 2940 agaggataaa aaactgtgtg atagtgatat aactgaagag gagatcaatg aagctataac 3000 acaattaagt aatgggaaaa gccctggttt agatggtttg tcatctgaat tttataagac 3060 ttttaaagat gttttaattc caattttaaa agatcttttt attgctattt ttaaaaaagg 3120 acagttgagt gagagtatga agaaaggaat gattaaaatt atttataaaa ataaaggtga 3180 taaagattat ttgcaaaatt atagaccttt aagtatgctt aatacagatt ataaaatatt 3240 agcaaagatt ttagcaaaca gacttaaaaa ggtagttccc actcttatta ctactaacca 3300 ggcttatggt gttataggta gagatatagc agacacagta acaagcatca gagatttaat 3360 ctggtacata aaagaaaaaa aagatgaagg atttttattc agcatagatc tagaaaaggc 3420 ttttgataga gttgagcata gctatttatt tgacataata cagaaatttg gctttggtga 3480 gaattttatt aagtggataa aatgtttttt atacagatat ttttagttgt 
tttaaaataa 3540 atggattttt aaccgactac atggagattt ctagatctat aagacaagga tgtcctttat 3600 cagcgttatt atacacatta gttgctgaac cattaggctt agctataaat ggagaaaaga 3660 aaattaaagg gtttaaaata gaaatcaata gaacagagca gaaaatttac cagtatgctg 3720 atgataccac tctattttta aaagatttta aaagtgttgg aaaagctatg gaaatatttg 3780 ataaatattg tcgaggatcg gtagcaaaag taaataaaga aaaaactgaa tatatgaaga 3840 tgggaaaagt agacgttcaa caaggaaatt gggaatataa agaacaaaaa aaatacataa 3900 atatcttagg cattacactg ggatatgatg aaaataaaac tagagaaata atttgggatg 3960 aacttataaa taaaatggaa aaaagattat gtttttggaa acagagagta ttgtttttaa 4020 aaggaaaagt actggtatta aattctcttt ttctatctaa gatgtggtat gttttaagtg 4080 ttgttagtct acctacgtgg gtgtataaga aattaaaaac tatgatttta aactttttat 4140 gggatgataa accatctaaa attgcatata acactatcat tggaaaggtg gatgagggag 4200 gactaagact tatagatcca tggataagaa taaaaagcat gagaattaaa acattaaaaa 4260 agtttttaaa tgaagacaat attctatgga aaagcataat gagttatttt attaataaat 4320 gtggacaaat aagagatgat ttcttatgga tggcatttaa agaccgcatg atagaaaata 4380 ttcctgagtt ttatgaagag ttgttgagaa catggaaatg cttttataat aatatacaaa 4440 ctgagattga agggagaaaa ctttatttac agcaacccct atttttaaat cagaacaata 4500 aaagcaaaaa gcaaatgttt tatgagaact ggtatgcagt gggttttaga caagtaaaag 4560 acattttata tgaaataaaa cctgggtttt taccgactca agcaataata gatacactgg 4620 aggaaataga agatgtagat gataaagaaa aaattgaaga tcaatacaag aagttaagat 4680 tagcattacc agatcactgg atcaaaacta ttgaagagaa tgaaaaaaga aactgaaaat 4740 agaaaaataa aagttttttt taaaaatgga tgaagataag attagtatca atgattgtcc 4800 tatcaagatg ttttatacat gtttgtgtaa cactgtgttt aaaaaaacct aaatcaagag 4860 aattttggga aaagttattt gaaaactttg atacttcaaa tatatggaaa aatgtaagat 4920 caattttaaa aagtccagca ttggaaaact tagattttat gttaagacac aactgcataa 4980 tgacagagat tatctttaaa aagattgggg tatcacaaga tgatttgtgt aaagtgtgtt 5040 tggaaaaaaa ggaaggcgtg ttacacctat ttttaaattg taaaaagttg agtgatttta 5100 tgaagatgtt gaaaacaatg gtatgcaatt ttctgtatga tgaaaacatt attttagaag 5160 aatgggatac actgttttat ttggttttaa tgggaaaaca aaaaataagt ttgctcttaa 
5220 ttatatgttg actcttgcaa gatatacaat atggaaaaga agaaatatta tgaaacaaaa 5280 gaaaaaagaa attccattgg ttttgttgta taaacagatt gtgactgagg aaataatggt 5340 tatatatgac tattgcaaaa tgtatgaaaa gatggacatt tttgaaaaat gtattagaaa 5400 aaataatcca tatattgtac aaacttggac tggttttaaa gttttcttac ctggagattt 5460 ttaaatattt tatcttttaa atatttgtat gtataaatgt catgattgtt gatgatgtat 5520 tttttaagaa agaaaaaaaa aaaaaaaa 5548 // ID DIRS-5_DR repbase; DNA; ZEB; 6895 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6895 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(10), 1272-1272 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX FH Key Location/Qualifiers FT CDS join(320..2326,2271..3593) FT /product="DIRS-5_DR_1p" FT /translation="MERDDTPAPLGTPAQGQQAPTGNQTNTEAPIRGRRPI FT RSTVSRTHRRTQSPSPITPNRNLPSPSSSYASARSSSLISNKMTVAELRQT FT ITNAGISIPNRCNKAELLKLYETIPSPTPPPQDSRPTRSRHTPYPQPTSAQ FT QATNHPGPPKKATRKTNKKLPQATGQSAPVTITFQNTDNPQENHATPGLPT FT PLLWPPAPLSSGNSIPALPDISPSLNPPHSILPSNLPHSSTQFFPTQTFPT FT IHNAVPLPTNFPSSTTSFFPSTSLHQAPTVITNPPQQSTLRTNISSARPPF FT TLSTATPLPIPQNAPVLEPPPISNAIRNLILSGADIDLSTLLSPIAPPSAE FT RQVDCGEFTITLKPPVSSQTRTLSIAEFHVAFARYTETICSVFPHRRRELN FT DYMAIISELALSYGGTHFYTYHKLFSAKCAIRVTQWNQCTYWGALDTDLHN FT RVFLGCRNLSCAVCRSNLHPTTSCPFVIPSADKELQTPKSTSYVPRPSTST FT IPSLLPPPSSQNPPSQICQSFNIARCFRHPCKFLHTCSYCGGAHARVVCQV FT LKANKKHRNYLSTPVDISNLYSELCLHPDPNFSEFLISGLSNGFHPGVSTL FT PSYNLACPNLQSANAEPDVVDHLIKKEIDNKFMIGPFLAPPFSTYRISPIG FT VATRKFSEEKTTNNRPVFSSQPENFLRKKRLIIDLSSPHNSAFSSINSLIS FT PDEFSLNYHDIDQAISLIKLVGRNAWLAKVDITSAFKIMPLHPDFWHLFGI FT NWKSQFYFAVRLTFGCRSSPKIFDMLSEAICWILANNYGIPHVVHLLDDFL FT IISPPHTPPAQHLATTKAVFARLGIPLAEEKTAGPSTRLEFLGINLDSQKF FT EASLPKEKIDRIISLSSIFLEKHECSKRELLSILGHLNFAMRIIPQGRPFI FT THLLQLAASVHSLEDNITLSDPCRNELSLWISFLKCWNGCSFLYSDLIASP FT VDIQLYTDAAPSVGFGGFYQGRWFASDWPSQMLETPLPQYSSALFELYPIV FT AAALLWGDEWSASSILIHCDNEAVVHCINRGRSHSPALMPLLRRLIWTAAK FT KQFIITAIHVPGFHNQIADALSRFLFQRFRQLAPEAEQHPTPIPPYSEMIF FT Q*" FT CDS join(3602..4663,4694..5233) FT /product="DIRS-5_DR_2p" FT /translation="MHELHQTSISLILQAVAPRTLQAYLTAWKTFKHFHFT FT YNTIFPDFSLLTISSFITYLHFHKNMQANSIKSYLSGIQFFHKLMYGSISE FT SIANSQVSLLIKGIQKARPPTPDARLPITHNILSKCISTLRKGYTSFHTDR FT TLDAMFILAFFGFLRCSEFTVTSKFDPSIHPTIADLTLIDEETISFLIKQS FT KTDQSRKGHCIYIFNIPSSTSPFQTLLAYIHYRKSLSNSPLAPLFIDDTHN FT PVTRFWFQKHLKATLHHSGFPSESYSSHSFRIGAATTAAHKGLTQQHIQTL FT GRWSSDAFKSYIRLSHSHLKEAQRTLTSRNANPSGQGHRHNPGTSQDPAMP FT APQGRKSYPIMRFPFFQLELHSASLPRHSPYYSRRSSCPSPLLPLRPPRPL FT PQKSPLPPPHSRTSVGPRPLALNPYNVTQSFDSGRNHISLAPALAFIYYIS FT FLIMLYNIVLFYLFSLFDLFYINVYSYAPTHININLYIVPPPSCSVPAGTL FT PEHLFRPSPSSRSLHLPIISPDSYWSWPQISSPNPEL*" XX SQ Sequence 6895 BP; 1832 A; 2196 
C; 1072 G; 1794 T; 1 other; aatgaagttt cataaactaa tttcgagagg agcacgtgat atgattgact gcagccggcc 60 actcatctaa gctcattagc tagccaatcg gaacgatcca aacccactat aaataaccta 120 gctaaaatgt acacccctat cttcgttttc cgaagaagca cagaaggacg gacacagctc 180 ctcttcaaat cctcaaatct tcatctacca taccatcgcc tgcgacatta acttcaacct 240 actacaaatc aacaacgaca gcaacaacaa aactcaaaaa attcaacgga cacgccatca 300 tcaactaaac aacaacatca tggaaagaga cgatacaccc gctcccctag gaacaccagc 360 acaaggacaa caagccccaa ccggaaacca aacaaacacc gaagctccca taagaggcag 420 aagacccatc cgctctacag tttcaaggac ccatcgccgt acacagtctc catcccccat 480 aaccccaaac cgcaacttac cttctccatc ttcatcatac gcatctgcaa gatcctcatc 540 cctcatctca aataaaatga cagtcgccga actccgtcag accattacaa acgccggtat 600 atccatcccc aaccgctgca acaaagctga actgttgaaa ctctacgaaa ccatcccttc 660 accaactcct ccccctcagg acagcagacc aactcgatcc cggcacaccc cctatcctca 720 accgacttct gcacagcaag caactaacca ccccggacca cccaagaaag caaccaggaa 780 aacaaataaa aagctacctc aagctactgg acagtctgca cccgttacca ttacctttca 840 aaacacagac aatccacagg aaaatcacgc cactccagga cttcccactc cccttctctg 900 gcctccagct ccactttcca gcggaaactc cattcctgct cttccagaca tctctccctc 960 tctcaaccct cctcattcta ttctcccttc taaccttccc cattcttcaa ctcagttttt 1020 tcccactcaa acttttccta cgatccataa cgccgttcct cttcctacta atttcccctc 1080 ttctactacc tcctttttcc cctctacatc cctccaccaa gcacccactg tcattactaa 1140 ccctccccaa cagtctactc ttcgtactaa catctcttcc gcacggcccc ccttcactct 1200 aagcaccgcc acaccccttc ccattccgca aaatgctcca gtcctggaac cacccccgat 1260 ctccaatgcc atcagaaacc tcatcttatc aggtgccgac atagaccttt caacactcct 1320 ttcacccata gcacctccct cggcagagcg acaggtggat tgcggcgaat tcactattac 1380 ccttaaacca ccagtcagtt cacaaactcg cacactctcc attgccgaat ttcacgtagc 1440 cttcgcacga tacacagaaa ccatctgctc agtttttccc cataggaggc gcgagctgaa 1500 tgactatatg gccatcatct cagagctcgc gctctcctat gggggaacac atttctacac 1560 atatcataaa ttattctcag ctaaatgcgc aattcgcgtc actcagtgga atcagtgtac 1620 ttattggggg gctttggaca ctgatctcca caacagagta ttcttaggat gtcgcaatct 1680 
atcctgcgcg gtctgccgct ctaaccttca cccgaccact tcctgtccct tcgtaattcc 1740 ctccgccgat aaagaactac aaaccccaaa atccaccagc tacgtacctc gcccttctac 1800 ttccactatc ccctctctac ttcctcctcc ctcctctcaa aaccctcctt ctcaaatctg 1860 tcaaagcttt aatatcgcta gatgctttcg ccacccgtgc aaattcctgc acacttgtag 1920 ctactgcggc ggcgcacacg ctcgtgtcgt ctgccaagta ctaaaagcaa ataaaaaaca 1980 tagaaattac ttgtcgactc ctgttgatat ttctaatctg tattctgaat tatgcttgca 2040 ccctgatcct aatttttctg aatttctcat ttcaggtctg tctaatggat tccaccctgg 2100 tgtttcgacc cttccttcct ataacctcgc atgtcctaat ctccaatccg ctaacgccga 2160 accagatgtg gtggatcatc taatcaagaa agagatcgat aataaattta tgatcggtcc 2220 ctttcttgcc cccccgttta gcacctatcg gattagtcca atcggcgtag caaccagaaa 2280 attttctgag gaaaaaacga ctaataatcg acctgtcttc tcctcataat tctgcctttt 2340 caagcattaa tagtttaatt tcacccgatg aattctcatt gaactaccat gacatagacc 2400 aagcaatttc tctaattaaa ctcgtcggcc gtaacgcttg gctcgctaaa gttgacatta 2460 cgtcagcttt taaaattatg ccgttacacc ctgatttctg gcacctcttt ggcatcaatt 2520 ggaaatccca attttatttc gcagtccgtc ttacgttcgg ctgcagaagc agccccaaaa 2580 ttttcgacat gctttcagaa gctatatgtt ggatcctcgc taataattac ggaatcccgc 2640 acgtagtcca cctccttgat gatttcctca tcatctctcc cccccatacc ccacctgctc 2700 aacacctagc gactactaaa gcagttttcg ctaggctggg tatccccctt gcagaagaaa 2760 aaaccgctgg acccagcact cgcttagaat ttctaggcat taatttggac tcccaaaaat 2820 ttgaagcttc gctgcccaaa gagaaaattg atcgaatcat ttctctatct tccatatttt 2880 tggagaaaca tgaatgttct aaacgcgaac tgctatcaat attaggacat cttaatttcg 2940 ccatgcgtat cattcctcag ggacgcccgt ttatcactca cctcctacaa ctcgcagctt 3000 ccgtccacag cttagaagat aacataacgt tatccgaccc ctgccgcaat gaactcagcc 3060 tgtggatttc cttccttaag tgctggaacg gctgctcatt cctgtatagc gatctaattg 3120 catcccccgt agacatccag ctatacacgg acgcagctcc ctcggtagga ttcggtggtt 3180 tctaccaagg ccgctggttc gcctctgatt ggccctctca aatgctggaa actcctctac 3240 ctcaatattc gtctgcttta ttcgaattat accccatagt agccgctgcc ttattatggg 3300 gagacgaatg gtctgcctct agcattctca ttcactgtga caacgaagcc gttgtgcact 3360 gcattaacag 
agggcgctct cactctcccg ctttaatgcc gcttctccgt cgccttattt 3420 ggaccgcagc caaaaaacaa tttatcataa ctgctataca tgtgcccggt tttcataacc 3480 aaattgctga cgctctttct cgctttcttt tccagagatt cagacaacta gcgccggagg 3540 cagagcagca cccgactccc atccctcctt attcagagat gatattccaa taaatcatcc 3600 aatgcatgag ctgcaccaaa catccatatc cctcattctg caggctgtgg ctccaaggac 3660 cttacaagca tatctcactg catggaaaac attcaaacac tttcatttca catacaacac 3720 catattccca gatttctccc tgcttacaat aagctcattt attacatacc ttcattttca 3780 taaaaacatg caggcaaact ccattaagag ctatttaagt ggtattcagt tttttcacaa 3840 actcatgtac ggctccattt ctgaatccat tgccaactct caagtcagcc ttcttattaa 3900 aggcatacag aaagcacgcc cccccacccc agatgccaga ttgcccatca cacataacat 3960 actctccaaa tgcatttcca cgctcaggaa aggctacaca tcttttcata cagaccgcac 4020 actagatgca atgtttattc ttgccttttt cggatttctc agatgttctg aatttacagt 4080 aacatcaaaa tttgatcctt ctatycaccc cactatagct gatctgaccc tgattgatga 4140 ggagacaatt tctttcctca tcaaacaaag caaaacagat caatcaagaa aaggacattg 4200 catctacata tttaacattc cctcctccac aagccccttc caaacactcc tagcttatat 4260 acactatagg aaatcactaa gcaacagtcc cttagccccc ctgttcatag acgacacaca 4320 caacccagtg acacgctttt ggttccaaaa acacctcaaa gctaccctac atcattcagg 4380 cttcccatca gaatcatact ccagccattc attcagaatc ggagccgcca ccacagccgc 4440 acacaaaggg ttaacgcaac aacacataca aacacttgga agatggtctt ccgacgcctt 4500 taaatcttac attaggctga gccacagcca tcttaaggaa gcccagagga ccctcactag 4560 cagaaatgcc aatcccagcg gccaagggca caggcataat ccagggacaa gtcaagaccc 4620 agccatgcca gctccccagg ggcggaagag ctacccaatt atgtaaagga gcaggcacga 4680 cccagcctcc taacggtttc ccttcttcca gcttgagttg cactcagctt ctctacctag 4740 acactcacct tactacagca gacgttcctc ctgcccaagc ccccttctgc cactccgccc 4800 ccccaggccg ctgccacaga agtctccact gcccccaccc cattctagga cttctgtagg 4860 accccgcccc ttggctctga acccgtacaa cgttacacag agctttgatt ccggcaggaa 4920 tcatattagt cttgccccag ccctagcttt tatatattat atatcatttc ttataatgct 4980 atataatata gtcttatttt atttattttc tctgttcgat ttgttttata taaatgtata 5040 ttcatatgca cccacgcata 
taaatataaa tttatatata gtgccgccac cctcatgctc 5100 agttcccgct ggaacactcc cagagcacct attccgcccg tcaccctcca gtagaagtct 5160 ccatctcccc atcatctctc ccgactccta ctggagttgg ccacaaatta gctcacccaa 5220 ccctgagctg tgacacgagc acgagtcact gaccctcccc ggccctagac ctttattttt 5280 ctttatttct tatttatttt tcacatttat cttttatttt ttaaatttat ttatatatct 5340 atatatctat atgcacccac gcatataaat atatacttat atatagtgct gtcaccttca 5400 agctctaaca ctcgcaagag ttgctcacga gcattcgacc cccgcagggg tcatcgccca 5460 aacgccactc acccttcagc tggggcctcc acccgccctc catcgttccc gtccctagct 5520 ggagatggca cttgcagctc tatctcccgc tggagagccc aaaagagcta gactcccgct 5580 ggagtcaagt aaaatcgccc cccccaaggc cttagttttt ccccattata tatatatata 5640 tatctatata tcacatatac atatatattt atatatagtg ctgtcaccct caagctctaa 5700 ctctagcaag agttgctccc gagcattcga cccccgcagg ggtcatcgcc caaccgccac 5760 tcacccttca gctggggcct ccacccgccc tccatcgttc ccgtccctag ctggagatgg 5820 cacttgcagc tctatctccc gctggagagc ccaaaagagc tagactcccg ctggagtcaa 5880 gtaaaatcgc ccccccaagg ccttagtttt ccccattata tatatatata tatatctata 5940 tatcgcatat acatatatat ttatatatag tgctgtcacc tcacagctct atctccgcaa 6000 ggagtgttcc tcgagcaatt actccttagg agcccctgac ccccccgcag ccctagtaac 6060 cccctcatcc agctggagtc ctcacttttc actcctcatc attttgactc caactggatc 6120 cccgtccatc ccacagctct gacccccgct ggggtttccc ttagtttcta atccagctgg 6180 agtatttata gcccacgcca actcggcacg ggctcccgca agagcccgta tcccccttgg 6240 ctcccatcgg agccccttca ctttcaacca ctatatccag cagccggata tagcatttca 6300 agcctttcgg ggagtttctt cgaatacacg gctgctgtcc cgagtctcat gcatttgggg 6360 agctctcgag aacacctgac ctcgtactcc cctcacatgc tttatggacc tggcgggaac 6420 cctgggctca actatctccg agctcagggt tctctcccgg gacagcatgc caaacctgct 6480 aacttgctaa caagttgtca aacagtatct aagtgtgaac tcttgaaatg aagtttcata 6540 aactaatttc gagaggagca cgtgatatga ttgactgcag ccggccactc atctaagctc 6600 attagctagc caatcggaac gatccaaacc cactataaat aacctagcta aaatgtacac 6660 ccctatcttc gttttccgaa gaaacccccc atccacccct tctcctcctt tcctcttttg 6720 ccgaggggag ctctcgagaa cacctgacct 
cgtactcccc tcacatgctt tatggacctg 6780 gcgggaaccc tgggctcaac tatctccgag ctcagggttc tctcccggga cagcatgcca 6840 aacctgctaa cttgctaaca agttgtcaaa cagtatctaa gtgtgaactc ttgaa 6895 // ID LOOPERN4B_DR repbase; DNA; ZEB; 511 BP. XX AC . XX DT 27-SEP-2008 (Rel. 7.06, Created) DT 27-SEP-2008 (Rel. 13.09, Last updated, Version 2) XX DE A nonautonomous DNA transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW nonautonomous DNA transposon; LOOPERN4_DR; LOOPERN4B_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-511 RA Jurka J.; RT "Nonautonomous DNA transposons from zebrafish."; RL Repbase Reports 8(9), 937-937 (2008). XX DR [1] (Consensus) XX SQ Sequence 511 BP; 155 A; 93 C; 100 G; 163 T; 0 other; ttaaagggga cctattatgc ccctttttac aagatgtaaa ataagtctct gatgtcccta 60 gagtgtgtat gtgaagtttc agctcaaaat accacacaaa taatgtttta taactctttg 120 aaactgaccc ttttaggctt tgatcctaat tgtgccgttt tggtgactgt cgctttaaat 180 tcaaatgaga ttgtgctctt ttcaaaagag ggcggagcta caaatgcctg tgtgtcagca 240 tagtggcaga ttcaaaaaca agactaacgt cctatgctaa tgagggagag atggtcacta 300 gtgggcgggg ctttccccct ctgatgacac gtacaaaggg agaatgtcaa tcaaagtgtt 360 tctgcagact gtttttatca agtgtgatta taaaaaataa taattaaata catttttacc 420 attagaagct ggttatattc acacactgtt gccacacaac tgtgtttaaa ccccttataa 480 aagtgatttt tgcataatag gtccccttta a 511 // ID LDR1 repbase; DNA; ZEB; 4963 BP. XX AC AL591172; XX DT 04-MAR-2002 (Rel. 4, Created) DT 04-MAR-2002 (Rel. 4, Last updated, Version 1) XX DE LINE Danio rerio 1. XX KW LINE; retrotransposon; non-LTR; LDR1; Clone dZ48C11 (AL591172). XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. 
XX RN [1] RP 1-4963 RA Jekosch K.; RT "LDR1: LINE-like element from Dano rerio."; RL Repbase Reports 2(2), 15-15 (2002). XX DR [1] (Consensus) XX CC Putative novel non-LTR LINE-like retrotransposon. XX SQ Sequence 4963 BP; 1628 A; 764 C; 1017 G; 1554 T; 0 other; atgcatggaa cggctaaata tttaaaaata atatcctgga atataaatgg ttctcataat 60 ccagttaaaa gaagaaaatg gttaggttac cttaaatcta aagatgtgga cattgcttta 120 attcaagaaa cgcatatgat gggtacagag gctgaaaaac ttaaacgtga ctgggtggga 180 caggtgttta ataattcata taatagtaag aggaatgggg tagcaatatt ggtgcataag 240 agggtaaatt ttgtcatgat taaacaaaaa aaaggatgag gagggaagat ttatatggtt 300 ggaagccatg gttgatgatc aaaaagttaa tatttgcaat atttattctc caaataagga 360 ggatagtgta ttttttcata cggttaataa gataattgga gtacaagcag gtaatcagtt 420 aatagttgca ggttatttta atcaggtgca agatgcctat ctggatagaa caacctacca 480 taaaaatatg cccagagaca gattagctat acaattaatg atggaagatt tggggttggt 540 ggacatatgg aggcttgtca atcctagaga aagagagagt ataccttctt ttcacatagt 600 cataaatctt acccaagaat agattacttt ctggtttctg gtgatttagt cgagtcagta 660 gtagactgta agataggcgt gattgctttg ttagatcacg caccagtgga gatgacatta 720 gacaccaatt ctaggacaat aaaacaaaac agatgaagat tcaatatatc cttgcttcag 780 gacttaaatt ttagtacaaa attgggagct gacttaaagg aattttcgag attaatgttg 840 gtaccacaga gaggctggga acagtatggg aatcatcaaa agcttatgtc aggggtaacg 900 caatacaata tgctagcttg gtaaaaagac tcaacaagga aaaagttaag gacatttagg 960 ccagaattaa agttttagac aggcaattat cactaaagtt tacagatagt attttaaaac 1020 aagtctgcaa tctgaaatat cagcttaatg atatttaaaa taggaaagca tagtatgcaa 1080 tgtttaggat gtgcacagct ttctatgaaa gtggagaaaa ggctgataaa ttgttagcaa 1140 ggcagttgaa acaaaaggat gctagtttct taatttcagc ttttaaaaat aaaaaaaatg 1200 aagtggtgac tgcaaacatg gacattaata atgtctttga aaagctttat aataaattat 1260 atgaagcaga gtcatcccca gactgtacta aatatagaga ttttttctcc aaaattacac 1320 ttcccacctt gtcctcagac cagcttgaga tgttagatgc gccaatagaa gaatctgggc 1380 tgcaataatg tcaataaagg ccgctaagtg agcaggttta gatggctttc ctgccgaata 1440 ttataagaag tatattgaca ttgttgcacc aatattggaa 
ggggtgtata aagaaacctt 1500 gttactggag caaatgcccc caacatttaa tgatgcgcta attacgttaa ttcttaaaaa 1560 ggataaggat ctttatgatc ctgggagtca tagaccagtt agtttagaaa atgttgattg 1620 taaaatttta tctaaagtat tagcattgag gttggagggc attttatcca atatcgtata 1680 tatcgaccag gtacgtttta taaaagggag atcttcttct gataatcttc ggtcactact 1740 tcatctcatc tggcaaagcc gcaatgagaa tgttccagtc gctgcttttt gactagatgc 1800 gatgaaggca tttgatagag tagaatgggg ctatttaact tatacgttac aaatgtttgg 1860 ctttgggcca acttttttta agtgggtcaa ggtgctatat tccggcccac gtgcagctgt 1920 tcttacaaat ggcattattt ctcctttctt taaattaaag cgaggcacca gacaagggga 1980 ccccctgtct cctttgcttt ttactttgtt tctggagccc ttagcagttg caatcaggaa 2040 tgacataaga gtgaatggtg tccatttagg agagagggaa tataagtgtt ttttatatgc 2100 tgatgatatt cttctcttgc tttcaaatcc aagtacatct atacctgctg tgatggatac 2160 tattgaacat ttttctcaaa tatcaggtta caaagttaat tgggtaaaat ctgaggtaat 2220 gccagtgtct gtgggatgtt cgttggcgga tgtgagtgct ttctccttta aatggatatc 2280 aactgggatg aagtatttag gtattaggct ctcaagggag ttgtatgaag ttgttcagat 2340 gaatataacc cgtatgcttc aaaatgttag tacaaacttt gataaatgga aagtgttaaa 2400 tttgtctttg tgggggaagg ttaatgcaat taaaatgatg gtatcatcaa ggattaatta 2460 tatctctata atgatccctt tgaagtttcc tttatacatc tttaagaaat acaatcagct 2520 agttaaggac tttctgtggg agggaagaag cccagaataa gtatgaaaaa tatgtttacc 2580 actagaataa aggggggttt ggcattgcca aatatagagc tttataatac tgcatttgag 2640 atgattaaaa tatgtaaaca ctggtcaggt gataatgtag agggtataag ctggatcgaa 2700 ataaagaaaa cgctaacttt cccattcagt gttattgatg ctttatctca gaaatcttta 2760 tattctatta tgaatgggga agttaaccct atactggaac actcatagca ggtctggaaa 2820 aaaatacata agatgtttaa tttgtcccat tataaacaat gcttttcttc attatggaat 2880 aaccctgcta ttaagatata tgcaccattg tggaggatag tagcatctgt gtgcaagtcc 2940 aatataaact aaggttagct gatgctgtgc agcatcttct tttgtttaaa agtaactgtg 3000 tagttacaaa tacagtgaaa atctgtttgc tagttatacg tacatgaaca cattagtgca 3060 ttgaatgact ttacgttttt gtgagctaaa ctgggctttt gtgaactaag cattatatca 3120 gttgatgtaa tatgtttttt tttctgtaaa tttaaacaat cctataaact 
ttcgtatttt 3180 tacagaaaat tcctgacaac cactgccggt atttttcagt aaatttaaca gatttttttt 3240 agcagtgtaa ataaattaga gctaaattat aaatgttgtg ttgattgaac taaacaacat 3300 taccttacct ttatttaact aacttaagtt cactcaatta aatattcttt ctttaaggca 3360 gaaatattct ttctttcaaa gtctaagaca gtgggcctcc cttttgacac agcttatcca 3420 ttggcgtccc cctcctccca aacacgcacg cacacacacg cgcacacgca cacacgaaaa 3480 attccggatg ttttcctgct acctcatcat gcttggtttc aaggtgtcgc cgaagtttag 3540 cgggactcat agccaaaata tcttgaaaaa ccacacataa aagtcgtggt tcatcagctg 3600 gtcctgtcca cgtaaatcct aaactcaaat actgatcatc atattgtcgt cttttgggtt 3660 taagccctga acttgaaggc tttgaatcgg ggggtctcag aaactgatcc attgtattgg 3720 caggcatata cctgtattgg cgggcttgcc aaacagaagc aaattcacag ctatggccat 3780 ctgggtaaaa aaggttttct tatttttgac gtataatttt ttttagcttt gtttaattaa 3840 aatgtttaaa aataataaaa atacatataa taattaaact taataattat gtttttatta 3900 tataatattt ttcccgcgca tcccctaaca tgctctggcg cccaccaggg aatgcgcctc 3960 acactttgaa aacccctgtt ttaaggtaac ttaactcggt tacgtggaac ctgttgacat 4020 aacaaagtta attaaagcca gcatatcatt tttttgagtg taggaaagaa aacagtctat 4080 tggaaggatt ggtgtaaaaa aaaaggcctg aaaacagttg atgatttata tggacaaggt 4140 gcactgtatt catttcagga gttgaaagac aaatttaatc tagtagataa aggggatttc 4200 tggaaatata tacagctgca tagtagtata agaactgtgg gatataagcc aggagcagaa 4260 gaaaatgttt tattagggtt tctaaatatg ccaaagtcga tgcaaaccac atcttttgtt 4320 tataagattg ctgcatgtat ggaaaaagtg atcatttaaa aattatctgg gagaaagacc 4380 tggaggtgga atttgaagag ggtgaatggg aggcagtagt ttctggtcgt gggggtactg 4440 tgagagatgt taggagtaaa ctcatacatt acaagataat taatcaatat tactagacac 4500 cagtaagact gcataggata ggattaaagg aaaataatca ctgttggaaa tgtggtcatt 4560 ctgtgggcac ttttttacat ttaatgtgga gctgtcattt ggtggctcca ttctggacaa 4620 gagttattca aaacctagag aaatggctag gacaaccttt accttattcc ccaagagtct 4680 gtcttcttgg tgatacatcc actttacaga acggaatatc taaaacacag gctggactag 4740 tcgtcgcagg atatattatt gctgtgagac tggtgctgcg aaattggaag aactcagaca 4800 ctccctcttt taaagattgg attgagctga tgacttctaa tgcatcatat atgaacgtat 
4860 gttggcaaga cttcaggatt ccacccatac ctttaatcag aaatggggta gctttttgca 4920 atatttggag agcacataaa aagaaaattg aaaagttgtc aat 4963 // ID Tc1-1_DR repbase; DNA; ZEB; 1625 BP. XX AC . XX DT 07-DEC-2004 (Rel. 9.11, Created) DT 07-DEC-2004 (Rel. 9.11, Last updated, Version 1) XX DE Tc1-1_DR, an ancient Tc1 transposon reconstructed from its DE defective copies present in the zebrafish genome. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Prince; TXr; KW TC1; Tc1-1_DR; TC1/mariner superfamily; transposase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1625 RA Kapitonov V.V. and Jurka J.; RT "Tc1-1_DR, an ancient family of Tc1 transposons identified in RT zebrafish genome."; RL Repbase Reports 4(11), 303-303 (2004). XX DR [1] (Consensus) XX CC Tc1-1_DR is an ancient family of Tc1 transposons that was CC active in the zebrafish genome more than 10 million years ago. CC The zebrafish genome harbors several thousand copies of CC Tc1-1_DR elements that are ~85% identical to their consensus CC sequence; all copies are severely damaged by mutations. The CC Tc1-1_DR consensus sequence encodes the 340-aa Tc1-1_DRp CC transposase (pos. 376-1395). Tc1-1_DR elements are CC characterized by TA target site duplications and 210-bp CC terminal inverted repeats. The Tc1-1_DR consensus sequence is CC ~90% identical to Tc1 elements present in frogs (TXr from CC Xenopus laevis, Prince from Rana pipiens). Presumably, CC multiple independent events of horizontal transfer were CC involved in the evolution of these transposons in vertebrates.
XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Tc1-1_DRp" FT /translation="MPRSKEIQKQMRKKIIEIYQSGKGYKAISKALGLQRT FT TVRAIIYKWQKHGTVENLPRSGRPTKITPRAQRQLIQEVTKDPTTTSKELQ FT ASLASVKVSVHDSTIRKRLGKNGLHGRVPRRKPLLSKKNIKARLSFARKHL FT DDPQDFWENTLWTDETKVELFGRCVSHYVWRKSNTAFQKKNIIPTVKCGGG FT SVMVWGCFAASGPGRLAVINGTMNSAVYQNILKENVRPSVSDLKLKRTWVL FT QQDNDPKHTSKSTSEWLKKNKMKTLEWPSQSPDLNPIEMLWHDLKKTVHAQ FT KPSNVAELQQFCKDEWAKIPPQRCNRLIASYRKCLIAVVAAKGGPTSY" XX SQ Sequence 1625 BP; 550 A; 326 C; 326 G; 423 T; 0 other; cagtggtgtg aaaaagtgtt tgccccttac tgatttttta tttttttgca tgtttgtcac 60 actttaatgt ttcagatcat caaacaaatt taaatattag tcaaagataa cacaagtaaa 120 cacatcatgc agtttttaaa tgaaggtttt tattattaag ggaaaacaaa atccaaaact 180 acatagccct gtgtgaaaaa gtgtttgccc cctgttaaaa cataacttaa ctctggttta 240 tcacacctga gttcaatttc tctagccaca cccaggcctg attactgcca cacctgttcg 300 caatcaagaa atcacttaaa taggacctgc ctgacaaagt gaagtagacc aaaagatcct 360 caaaagctag acatcatgcc gagatccaaa gaaattcaaa aacaaatgag aaagaaaata 420 attgagatct accagtctgg aaaaggttat aaagccattt ctaaagcttt gggactgcag 480 cgaaccacag tgagagccat tatctacaaa tggcaaaaac atggaacagt ggagaacctt 540 cccaggagtg gccggccgac caaaattacc ccaagagcgc agcgacaact catccaagag 600 gtcacaaaag accccacaac aacatccaaa gaactgcagg cctcacttgc ctcagttaag 660 gtgagtgttc atgactccac cataagaaag agactgggca aaaatggttt gcatggcaga 720 gttccaagac gaaaaccact gctgagcaaa aagaacataa aggctcgtct cagttttgcc 780 agaaaacatc ttgatgatcc ccaagacttt tgggaaaata ctctgtggac tgacgagaca 840 aaagttgaac tttttggaag gtgtgtgtcc cattatgtgt ggcgtaaaag taacaccgca 900 tttcagaaaa agaacatcat accaacagta aaatgtggtg gtggtagtgt gatggtctgg 960 ggctgttttg ctgcttcagg acctggaaga cttgctgtga taaatggaac catgaattct 1020 gctgtgtacc aaaatatcct gaaggagaat gtccggccat ctgttagtga cctcaagctg 1080 aagcgaactt gggttctgca gcaggacaat gatccaaagc acaccagcaa gtccacttct 1140 gaatggctga agaaaaacaa aatgaagact ttggagtggc ctagtcaaag tcctgacctg 1200 aatccaattg agatgctgtg gcatgacctt aaaaagacag ttcatgctca aaaaccctcc 1260 aatgtggctg 
aattacaaca attctgcaaa gatgagtggg ccaaaattcc tccacagcgc 1320 tgtaacagac tcattgcaag ttatcgaaaa tgcttgattg cagttgttgc tgctaagggt 1380 ggcccaacca gttattaggt ttaggggcaa acactttttc acacagggct atgtagtttt 1440 gtattttgtt ttcccttaat aataaaaacc ttattttgaa aactgcatga tgtgtttact 1500 tgtgttaata tctttgaata tatgactaat atttaaatta gtttgatgat ctgaaacatt 1560 aaagtgtgac aaacatgcaa aaataagata aatcagtaag ggacaaacac tttttcacac 1620 cactg 1625 // ID ENSPM-1N_DR repbase; DNA; ZEB; 1438 BP. XX AC . XX DT 26-SEP-2008 (Rel. 13.09, Created) DT 27-SEP-2008 (Rel. 13.09, Last updated, Version 4) XX DE DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; Nonautonomous; KW ENSPM-1N_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1438 RA Jurka J.; RT "EnSpm-type families from zebrafish."; RL Repbase Reports 8(9), 929-929 (2008). XX DR [1] (Consensus) XX SQ Sequence 1438 BP; 480 A; 235 C; 203 G; 517 T; 3 other; ggggtgcgtt tcccaaaacc atcgttagcc aactaaggtc gcaagttccg tcgttacaaa 60 catagtttgt tgatttgccg tttcccaaat ccgtcgctcc aacgaacatt cgcaaactgc 120 gtcgcaaact tgagcgctcg caactacacc tctggagctg tagttagaaa catagttcct 180 ggctgtgttc tattcccact tatcccccct atgccctatt catttagaac attctaacat 240 tcaaagttgg aattattaaa aataaaaaag cattaaggtc atctctctta ggtgtaattt 300 gctttcaaac tatttttaca gttcagtttt agcgatcttc atgtttacaa ttgtgctccc 360 ttcgcagtgc actttgaaaa cattgatgtc attttgaaca cagcctcatg gacgaaagct 420 ataggtgacc tattatttaa aagcggaatt tatgttacgt ctccaaagct tgtgaaaaca 480 aaaatatagc atacgttaat tcttttaatt tatagtaggt tatttattaa gtatctgtac 540 tgtatatgac atgggcctgc tggttagaac tttctgcagt ggtttacatg tgtcaaattg 600 taaaagtaga cttttctaaa aaaaaaaaaa taataataaa taaataaaca atatcctact 660 tamttcttat tantactaat aataataata ataataataa taataataat aataataata 720 ataataataa taataataat aataatcatc atcatcatta tcattattaa tattattatt 780 aaattatata 
tttttttatt tctaatacat aataataaaa ataataatta ttatcatcat 840 catcatcatc atcatcctca tgatcatcat tagtattatt aatattatta ttaaattagg 900 atttttttca tttcctaata tataataaat attacatttc attttataat aaatagcctc 960 attatttatt tatttatatg atttatatat gatattagaa tacgtgttag cttttgtaag 1020 tgattttatg ttttagaata gaatatgttc atsttctcaa taatatttgt aaaggaaaca 1080 caggccctag gctctatatg tgccatttac atatatttca gttaatatgg aaagcgagtg 1140 catgttttta ccaaaactaa tattggattt tatttttaaa tgcgtgtgcg tgtaaaacaa 1200 tataatttgc acaaagaaat gatggggttc ttctctaaag aagtagttac tccaccccgt 1260 ttagagcgtc attatgggcg ttttacgtta taactaacat ggttcaagcg atggatctgc 1320 gacagagaaa ctacgcgttt tgggaaacac tcgtcactac atcgttcttt tcccaaacga 1380 tgcatcgtac tatgatagtt cagccgtgag ttacgtcgtt gtttgggaaa cgcacccc 1438 // ID EnSpm-3_DR repbase; DNA; ZEB; 9673 BP. XX AC . XX DT 31-JUL-2008 (Rel. 13.07, Created) DT 31-JUL-2008 (Rel. 13.07, Last updated, Version 1) XX DE EnSpm-3_DR is an autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; EnSpm-3_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9673 RA Kapitonov V.V. and Jurka J.; RT "Zebrafish En/Spm DNA transposons."; RL Repbase Reports 8(7), 751-751 (2008). XX DR [1] (Consensus) XX CC EnSpm-3_DR is a young family of autonomous En/Spm DNA transposons. CC The consensus sequence was derived from a multiple alignment of CC several copies of EnSpm-3_DR that are less than 5% divergent from CC each other. EnSpm-3_DR transposons are characterized by 2-bp CC target-site duplications and imperfect 18-bp terminal inverted CC repeats (1 mismatch). See also commentary on EnSpm-2_DR. XX FH Key Location/Qualifiers FT CDS 2115..4736 FT /product="EnSpm-3_DRp" FT /note="En/Spm transposase."
FT /translation="MQCKNCKFSTSSEDVLLKHYQLHHHRISNWPCLYTEC FT VCAFKTAGALRSHLSKSHHNTDRVSRDLSTFNCGLCEFHEICTEKRFFLHL FT QNHLKRKETIQCPFKGCGFKTNNRPTFSSHRSRNHRNQTLSDRQSQIVSDS FT LGVSEVLCTTQNISEFDEEDIIESVEDVNVHTLERKIASLLLCMQTVLHVS FT KSATQTIFEELKSILLFSKSHALHTIETIVTKHNINIESVVIKEIADSVCL FT TNPLFASISEKGTLSTDYKRNQYFKRNFFVIEPTEYLYEHSHKEVFVHVPV FT IRLLETLLNQDSFLNNIEFTHKHLPGQYSSFKDGKYYRESGFVTEDDVKLS FT LAFYVDEFEICNPLGTSRKIHKITAVYWVVLNLPANLRSTLSSIQLAVLGK FT SIDVKKFGFDKFLEPLIKELKSLEHEGLFVEALGHYIKPTVFCVCADNLGA FT HGLAGFQENFNVEKFCRFCLISRDQISTVKPCDFPLRTVDQHDLFVEQLKQ FT SAVQSVNGIKSECALSKHLRYFHPVTGFPPDILHDFFEGVIPVELSLCLRD FT LISKGFITLEGVNHSIRTFPYKYSDRVNKPKTILKTSLAKGSIGGNGHENW FT TLLRLLPLMIGDRIPEHEPSWDILMDLKEIVEIVLSNSLSDETLCYLSFKF FT SDHRLLLTSTFPDYALKPKHHFIEHYPELTKCFGPLVTLWTMRFESKHSFF FT KKVARDAHNMKNVLLTLSMKHQQMIAYHLDAQSLLKSDLHVEKLDVVSISL FT LDATLRHAVQTKFPQLHTVSLSRDVCLHGTRYAKNMIISAGHCNGQPEFFR FT IESMLIHSAKVFFITKRVSAWYLEHLRSYELVESHYTDMVIFDSGDLNGYH FT PLIPYSVGSKVFVTLRTYLQH" XX SQ Sequence 9673 BP; 3096 A; 1594 C; 1704 G; 3279 T; 0 other; cactgtaaaa aataaaattt caatttgttc agcacactgt actttttctg tcatatatac 60 taagaattgt gaataactac tgatctacat aaaaatatat gttatgtcaa taatattcgc 120 taaaatgtta tgtctacatg tttaccagag ttttctggac atacattttt gtgttgaaca 180 ggtatgttga tatttaaata aattgcgcat gcgtgacgca acgtttgaaa acggcgccaa 240 ccctccccca tattgaccat tcgtttcgct cctcatctcc attttgtctt ggtgccatat 300 cggacttcac gcatattttc gactcttcga ggaactaaac attaacttgt tcctgtaggt 360 gagtaacaac acttattaac actaattcta tgcatgcaat gaattttact atattcacaa 420 cctgacggtt tgtttttagc gacgagaatt attgcatatg ctgaagtttc aaatgcagtg 480 ggtaattaat gtggcctata gtttcagcta agtccaaact cataaatgtt tccgtgtatt 540 catgacataa gtggcggatt taggcagagt caatcgctca gggaggcatg gagaccccgc 600 gcgacatcct tcccgcccac gcccgcacct tcgagagggc ggaggaccgt gagttcggca 660 cggtcgtcgt ttgcctaacg ccagttaagc ttgtgccgct ccttaagaca ttacttggaa 720 aaaaacggaa actgcttttc tgtgccgttt aagtttaaaa tgtttcatgt taacactaaa 780 attacatcat tatgcttaac tgttaatcac tgataaaaga tggctaatat 
gggaccatag 840 tattaacatt tacgttaggg aattcagagg ttttgtttgt ttttaggtga aacgttttca 900 tttctctggc gagttttcac atttttctca ttttacgtgc ttctccgtgt gtgccgccta 960 gtgatgggtc gttcttgaac gattcgttca ttttgaacga atcttttgta tgactcgata 1020 acgacgagtc ctctctagga gtgatttgtc cactcgcgca tgcgcacatt tgtgcaggtg 1080 gaggaaaaga ttagttcatt tatcgagtcc tctatctggt ttaagtcatt cgttcatcac 1140 gttaaagaca taaggcaatg aaatgcaatc agagccggaa atttttttaa atcttctttc 1200 gagtcctcgg cgttgagtca tctctcttta catgctgtca cgtgatgaac gaacgagtca 1260 aaaaccagat gactcgaaag gtgaactaat ggctcctttc ggttcagact gtgctttggt 1320 taagcttata tgggtctgtc tgtgtgacgt agacgaaccc ctcaaatttg aagacctgtc 1380 agaatagctg aactcacatt atcatagaca aaagactagg taaacaaacg attattttct 1440 ttttcttata gcattcaagt tatgacttgt ttgtagtgtg atcaacgtct gggctagttg 1500 tagatgcgtt tggaataacc tgtatcattt taataatatt ttggcaaatt gatggaaatg 1560 acttgaaaaa aagatttgtt catctcaacg aatgagactc aaaggtccaa atcagtaata 1620 tgatccgaac ttcccatcac taagagcgag tgagagaccg taagttgtac ttataaattt 1680 gtgaagcata tataaagcac agatattcag attatattag ataaagtaat atgtaaatgt 1740 agtttatatt ttttatttat aaaatattgc cgaaaacgtt atttaatcct gtcaaattaa 1800 tataaatagt tcctattagg tttaaaatag tcttaatttc aaagcatata ttattactaa 1860 ctatattaca ttatttgtgg actcctctgt atacatcaat atcatatatt gatattgata 1920 tcattgtctg ctcccaaaca tgtttcatgt taaaatatgg gggcataaac ttgaaaaatg 1980 tttaaaaaat tttaaatgtg aagctttttc aaatatattt tttgtaagaa ttatatttta 2040 tttaaagttg tgatttaaca atacaaggtt ataactgatt atttctgttt tcaggatgat 2100 ctcctcaaga cttcatgcag tgtaaaaact gcaaattcag cacttcaagt gaagatgtcc 2160 ttctgaagca ctatcagcta catcatcata gaatttccaa ctggccctgt ctttacacgg 2220 agtgtgtttg tgcttttaaa actgcaggtg ctttacgatc ccacttatct aaatcacacc 2280 acaacactga tagagtcagt cgagatcttt caacttttaa ttgtggacta tgtgaatttc 2340 atgagatttg tactgaaaaa agattttttc ttcacctaca aaatcatctg aaacgtaaag 2400 aaacaattca gtgtcccttt aaaggatgtg gatttaaaac aaacaaccgc ccaaccttta 2460 gctcgcatag aagcagaaat catagaaatc aaaccttaag tgacagacag tctcagatag 
2520 tttcagatag tctcggagtc tcagaagttc tttgtacaac acagaatata agtgaatttg 2580 atgaagagga tataatagag tctgttgaag atgtaaatgt tcatacactt gaacgcaaga 2640 tcgcctcact tttattgtgt atgcaaactg ttttgcatgt ctcaaaaagt gctactcaga 2700 caatttttga ggaacttaaa agtattttgt tgttctcaaa atctcatgct cttcatacaa 2760 tagaaacaat tgtaacaaag cacaatatta acattgaaag tgttgtaatc aaggaaattg 2820 cagattctgt ttgtctaaca aatccacttt ttgcatcaat ttctgaaaag ggcactttgt 2880 ctactgacta taaacgaaat caatatttca aaaggaactt ttttgtaatc gaacctactg 2940 aatatcttta tgagcattct cataaagaag tgtttgttca tgttccagtc attcggttgc 3000 ttgaaacctt gttaaatcaa gacagctttt taaataacat tgaatttaca cataaacatc 3060 tccctggaca atacagctca tttaaagatg gaaagtacta cagggaaagt ggatttgtta 3120 cagaagatga tgttaaacta agtttagcct tttatgtgga tgagtttgaa atttgcaacc 3180 ctcttggaac atctcgaaaa atccataaaa tcactgctgt gtactgggtg gtcttaaatt 3240 tacctgcaaa tttaagatct actttatcat caatccagtt agctgtttta ggaaaaagta 3300 ttgatgttaa aaaatttgga tttgacaaat ttcttgaacc tttgataaaa gagttaaaat 3360 ctctggagca tgaaggtttg tttgtggaag ctttaggaca ttatataaaa ccaactgtat 3420 tctgtgtgtg tgccgataat cttggagcac atggtcttgc tggttttcag gaaaatttta 3480 atgtagaaaa attctgtcga ttctgtttga ttagtcgtga tcagatttca actgtaaaac 3540 catgtgactt tcctttgaga actgtggatc aacatgattt atttgtagaa cagcttaagc 3600 agagtgctgt tcagagtgtt aatggtataa agagtgagtg tgcattgagc aaacacttaa 3660 gatactttca tcctgtaact ggatttcccc cggacatttt acatgatttc tttgaagggg 3720 tcatccctgt ggagttatct ttgtgcctca gagacttaat ttccaaaggt ttcattactc 3780 ttgaaggagt aaatcactcc attagaacat ttccttacaa gtactctgac agggtcaaca 3840 aaccaaagac aattctaaaa acaagtcttg ctaaaggatc aatcggagga aatggacacg 3900 aaaattggac gttattgcgc ttacttcctc tgatgattgg ggatcgtatc ccagagcatg 3960 agccatcatg ggacatatta atggacctaa aagaaatagt tgagattgtt ttgtcaaaca 4020 gtctctctga tgaaactctg tgttacttgt catttaaatt ttctgaccac cgcttgcttc 4080 ttacttccac ttttccggac tatgcattaa agcctaagca tcactttatc gaacactacc 4140 cagaactaac taaatgtttt ggacctttag tgactttgtg gaccatgcgc tttgagtcta 4200 
agcactcttt cttcaagaag gttgcacgtg atgcccacaa catgaaaaat gtacttctca 4260 ctctttccat gaaacatcaa cagatgattg cataccattt ggatgcacaa agccttttaa 4320 agtcagactt gcatgttgaa aagttggatg tggttagcat atcattgttg gatgcaaccc 4380 tgaggcatgc tgtacaaaca aagttcccac agttgcacac tgtgtcactg tccagagacg 4440 tttgtcttca tggaactaga tatgccaaga acatgatcat atcggcagga cactgcaatg 4500 gacagcctga gttcttcaga atagaaagca tgttgatcca ttctgccaaa gtgttcttta 4560 taacaaaaag ggtttctgcc tggtatttag aacatttgag atcttatgaa cttgttgaaa 4620 gccactatac tgacatggtt atctttgact ctggtgacct aaatggctat cacccattaa 4680 ttccatacag tgtgggatca aaagtgttcg tgaccctgag gacctatttg cagcattaaa 4740 tgtcctttac aatggtatgt accacattaa ctaattaatt tattaattgc ttaaatttag 4800 taattaaaat tggcttcttg tttatttaca gcctttgcta ctacgagtca tcatttcctc 4860 cactgaagcc cggcgagtcc agcttcctga agtgcctgaa tcagtggaat ctctcatcac 4920 tattcttcaa gagaagctgc aattacaagg acagttttct cttaagtttg aggatgctga 4980 ttttggcaat gcactttgca acctgtctga catctcagaa ttgcccagtg gaaaagcagt 5040 cttgcatatt cagtggtgca agtcatcagc ttatgaaagc agtagccttc catcagtttc 5100 atcacttgat actgctagtc tcgactctga agaatccttg ccaagcactt caggctctgt 5160 gcaaaactat ttacgtactg cctcagaatg gccctcgcca ttccccatac cagcgctgtc 5220 atttgatgtg gagctaaaac taagacgagg aaacgaggca tttgaaaaaa caaaaatagg 5280 cattgatgtg actagagaca tgaaaataga gattcttgac aaaatagtgc agacagtttt 5340 tgacataaag gcataccctg acaatcagga aattgaatcc attgcatctg ctttggtttt 5400 aaaatatcct tgccttaagg agcctggcaa aggaaagggt tttgagggat ggttgatcag 5460 catcaaaaac aagctaaaca attatagggc aaagttgcga gaggcaggtt gcaatgaagt 5520 aattgttaac agaaagcgaa acgatgatgc cagtggtcgg aggagtttca ctctgaaaaa 5580 ggcaaagcgt ggagaagtca atcatgtacc ggaacatcca tgcaaccaca ctgacacttc 5640 acttgaagag caaagagttt ttttggtaga ggaaaccaaa aaggcaagaa gagacatggc 5700 agccataagt gaaaaaatgg aactaacatt ttcccttaga agaaaagaaa tggtccaaga 5760 gcagccaatg attgtagagg ttcaggagag gtggcctgca ctcttttttc aagaacaggt 5820 aaagataatt tttttcttac ttattaggct ttaagtttac agaatgcaag atgctctgtt 5880 tacagaaaaa 
attgattgct gtaaatctgg ttggtttgtt ttaaatagat ctgtgaggaa 5940 tttttccgca tcaccaacaa agacctacta ggagtcttca tggcagccac tgacaagtac 6000 acaccaaagc ttctgaaatt atacagagcc agaaaaggag cgtttggaca tgaaatggaa 6060 gagctcctgg aaagacttga tgaaagggta agttaataat tttcattatt tacatgtttt 6120 tgatctatgt agttgatgat atttcagttg ttattctgtg actttttggt tgtatctttt 6180 ttatgttttt gcagacaaca gaaattgtta atcacagaag gactgctgct ttggagggcc 6240 tgcctttgtt tcttcgagag aaacacacta accttttcaa gaaatgtaaa gtaggtatca 6300 aacactttaa atggtttaaa gcatgagtgt ccaaacttgc ttctgtattg ctggtgtctt 6360 gaaaagttga gtgccaatct atttggagct ttcatgaaaa cttttgatta gctgagttaa 6420 aatgtttaat ttaggtatca ttaacagaaa ttacaaatct tctgacctta aggtataaga 6480 agtaaatttc atgattcaca agtatcttgt tatttttaaa agattgatat caggttctat 6540 attagtacca gtttagacct gctcactgat ttatgcaggt agttaaggga aagaacatgt 6600 aaaagagatg caagagaaac tgttgctttg tctgttatta cataaaatta gagttgtgcc 6660 gagagtctga gacgttattc ttgtcaacca aaatggcagc tgttttattc agagtacaac 6720 aaatcagcat ttatgtattt attcatttat gacagtattt taaaataaat cacttaaaat 6780 atttaaagtc caattactgc attaatgttg catatgaata ctgatgaatc cttgttattt 6840 acatgttgtc tgtttaattg ttacattcgc ttttgcttaa tgatttgaat tataacacta 6900 tgtcttccta cagacaaact acacattaca tagcagtttt atgatcataa ctatttgatg 6960 taactggtca ttttgcttaa ctgcaattaa gtttgcatag tcattgtatt atgctggttt 7020 gttattttta cagaatgttc atatttctga aagataatgg tatatatttt taactgaata 7080 aataactttg ttacaattat ttaaatgtgt gtgttgttat agccctataa taaacaagta 7140 cttgcaaaat tttttaaatc tctataatca ccattaattg gaatatttta tttgacaatt 7200 aatcgtcacc taaatttcat aagcatgaca gctatatttg caatataaca tcttgcttgt 7260 ttgtagataa agtactataa tgactttcta tgctcaaata ttcttattgt gttttcacag 7320 gaaactgaag atggaacaaa gggcgtatca gttggcattc tctatgttac ggaagaggac 7380 tcccgggcag catctccagt gatccagaac attgctgttg tactggaaga ggttgttgtt 7440 ttggaagaca ttccagatac ctcaagtgct gtagcatacc tctttggcct tctttatgca 7500 ctcaactttt cctatcccaa agaactccgg tacacttttg acacaattca gaatgttttc 7560 atggagcttg ggactggatg 
tacgcaacgt gtgctttctc ttaaaaacaa gatgttaaac 7620 taatactgta aacagtgttg tgtcaatggt ctgatgttac aaagacctaa tgtacaattt 7680 gagtactaaa tctgtttcat gagagccact gtttaggcac tgtgttcagt taaatgggta 7740 acacttcata ataactacac actatgaatc atttgttaag cattagcaaa tagttagttc 7800 attatttgtt aagtattaac tctaataggc attaataagc actttataaa tatagccaca 7860 aatgctatat tcttaaattg taattaattg cattttcata ctttgttaat gattcttttt 7920 tttattacat aagtattgca ttatttacaa atcagttatt taagaatagg tgttggtttt 7980 ttccagatca ttcagaatgt aaaagtacat tattaataaa ctattcaaat aacgttcata 8040 tatcttatta ttcaggcata tactattagt ttatttatat attaataaat gctttattaa 8100 ctcgacttca tccagttttg tgacctaaag tgaggactat ttatgcttta taaatccctt 8160 ataaatgaca attaaatgct cagttatatt ccgaacaggg aaaaagagca tgaactattt 8220 aattcatttc tatttattta aagatacaca aatgaaacta tcaaatgaaa aaaaaaatct 8280 ttgcaacttt atctaaaatg aagttactgt acagtttaaa ctttactaaa taacactgaa 8340 atgttatatc ataatatatt gttttatcca attttattca attgtaatta ttgctaattg 8400 tattttgaca tttctgttat tcggcgatgt ttaaagttta cagtaatttt atttttttag 8460 attagattgc aaacatttat tgttcatttg atactcagct tttaattgtc atttataaga 8520 gatttataaa gcaaaaatag tctcacttta gattaggtca caaaaatgga tgaagttgag 8580 ttaaaaaaag catttattaa catacatagt aaccattact atatgcctga ataataagat 8640 atataattgt taatttgaat agttccatta ataatttact acctcatgct gaatgatctt 8700 aaaaaccacc agctacactt aaatacaaat ggtttgtaaa taatgcaata ctgaattcag 8760 taaggaacaa ttaataaaaa gcaaagtatg aaatacaatc attaagcaca ttttgagctt 8820 ataacagcat ttgtagctgt atttataaag tgcttactaa cgtcttataa tgtagagtta 8880 atgcttaaca gataattaat taactatttg ttaatgcttt gctaatgatt cgttttgtgt 8940 agttattatg aagagttacc gttaaaggtt ttaaaattaa aaggtttaca ttttgtcaag 9000 acacccaagc acttaacttt tgttttggtt taaatgttta tggaaactgg tgtccaaatc 9060 aacccttttg tttataattt gtgcataaaa gcattcaaag caaatttagt ttgtgtcaaa 9120 tgcaatgttt aaattgttaa tttctgtgct gttattgcag ctaaaatgtt aaagctagca 9180 tatgtgcctg caaattagca gaaaagtttg tatactgcaa agtttgtaat ttgaagcact 9240 tttcatctta aaatatttac aaatgtttgt 
gctatttcct attgtaaagc gtgttatctg 9300 ttgctttcag tggaaataaa ttgaagaagt agtaagtttg tgtttgtcat ttgtaagtag 9360 aatgattaaa acaaaatgat atgctaacta gtactaatta cgtattttgc aaatttgaaa 9420 ataaacatcc actaaacaga aaaaaattaa ttgactcaat agcagtaact gttgctttag 9480 ttgatgtgac aagaatatct ttattagtat aatactataa tcaatattgc atgaacaaag 9540 aaaattatgt tagctcaact aaatattgtt actgagaaga actaaaaaac agttgttgag 9600 acaactcaaa gttcttactg catcaagttg ccttattttt ttatgtttgc tcaacatttt 9660 tttttttaca gtg 9673 // ID Gypsy8-I_DR repbase; DNA; ZEB; 6655 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE An internal portion of the Gypsy8_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW GYPSY superfamily; Gypsy8-I_DR; Gypsy8-LTR_DR; Gypsy8_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6655 RA Kapitonov V.V. and Jurka J.; RT "Gypsy8_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 321-321 (2004). XX DR [1] (Consensus) XX CC Gypsy8-I_DR is an internal portion of the Gypsy8_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as Gypsy8-LTR_DR. CC Gypsy8_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 622-aa gag-like CC Gypsy8_DR-1p (pos. 93-1958) and the 1557-aa pol Gypsy8_DR-2p (pos. CC 1985-6655). The primer-binding site (PBS) is complementary to CC the Arg-tRNA.
XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy8_DR-1p" FT /translation="MDVIEREDVKIENAVIISGLTLTETDEVVESYLLRFG FT SIRQNFLIDDPQSVFHHNAVVEFSHNSAICNIEPQLPLTIVSPTDMSVIFR FT IHKLSTAYSTATSSDTAKQLKSQEDKNVESQQDIIKQNSMKMNGVTHLDDA FT SHREDCGLENLKSSTKVLTSVTNTPQRTGSFKPSALKIDNNNPAKVNMSTT FT SPHSLTEANSVSVRIPISSMHPPDIQRVIMEHVVKMSETVPQHSASFRLKA FT FSGRMPRPGNEPDFDTWRTSVDYLLNELSLSESHKMQKILDSLLPPASDVI FT KHVSPNAPASECLRLLESVYGSVEDGDELLAKFINTLQDPGEKSSAYLHRL FT YVLLCTTIRRGGIAESERDRYLLKQFCRGCWDNALIVELQLERIKADPLSF FT AELAVLLRTAEEKKTSKEERMRKHLGLGRPSPTPLKLRTITHQQSAHFNNS FT LEVDATNIPAFDNPKQKVSKQKSKTQCPETSEADALKKEIMALQSQITAIK FT TAADQEARERTEASELQQLKGQIAELKVQLATSGAQRKQFQKSPQQSNNPS FT DFGVSRERNERQKNTELRTNRPRPWYCFRCGDDGHLAIYCENAPNPLRVEE FT KRQKLREKQAEWDLRNGGATVPLN" FT CDS 0..0 FT /product="Gypsy8_DR-2p" FT /translation="MGTKRKQSPNNQQALNKHTDTRNSLKLPAGLVGMKCT FT ARVKIEEKEVNCLLDTGSQVTTVPMSFYNRYLSRHPMQPLNHLLEVEGANG FT QAVPYLGYVELTLKFPQEFLGSEAEVPTLALVVPDLTQTPQILIGTNTLDV FT LYAGHAQTVKPRVMSRYQGYQAVIRVLEARLQQAGVNVLGQVKVKGDMPEV FT IAAGSTVVLNGCVKVIGPLSETCVTIEPSVSSSLPGGLLVASSLHSLPAKR FT NVQIPIVLRNETQIDLTIPPKAVLAEVHAVQSVIEGRKPVNASVSKVTNPA FT QNRIIPDFGDSPLSNDWKEKITDLVNSMQDVFALHDLDYGHTNKVKHHIHL FT SDSTPFKQRARPIHPQDVDAVRRHLKELLDANIIRESESSFASPIVVVRKK FT NNDVRLCIDFRKLNSQTVKDAYALPNLEEAFSVLSGSKWFSVLDLKSGFYQ FT IEMEEADKSKTAFVCPLGFYEFNRMPQGVTNAPSTFQRLMERCMGDLNRKD FT VLVFIDDLIVFSKTLEEHKAKLLQVLTRLKEYGLKLSPKKCKFFQTSVQYL FT GHIVSQNGVETDPTKIEALKTWPRPKNLKELKSFLGFAGYYRRFVQDFSKI FT TRPLNDLTIGYPPLQKNQKRNPTKSTPYLDPKEQFGERWNQDCQWAFNTII FT EKLTSAPVLGYADPRLPYVLTTDASTVGLGAALYQEQEGRMRVIAFASRGL FT TKSEAKYPAHKLEFLALKWAVTAKFSDYLYGGEFTVITDSNPLTYILTSAK FT LDATSYRWLSSLSTFNFKIQYRAGKRNLDADGLSRRPHDEQIDDFASQKER FT ERIKQFTLHHLAETEQQVILPDAVKALCERHQVYQSCGDSNLPYSQLTVVE FT SLSQSVDAVPQEFQQEAGGLPVLPQMSEEELKECQRTDPVLEKVIRHLDSG FT KKPHGKVEPAEVALWLREWDRFEFKNGILFRKRQDPRGVLYQLALPKKLRG FT NVLKNLHNDMGHLGIERTLDLARARFYWPKMAITVEEKVKTCERCVKRKTP FT PERAAPLVNITTSRPLELVCIDYLSLEPDRSNTKDILVITDHFTKYAVAVP FT 
TRNQKAETVAKCLWDNFFIHYGFPERILSDQGPDFESRIIKELCGIAGIQK FT VRTTPYHPRGNPVERFNRTLLQMLGTLEDKQKTYWKDFVKPLVHAYNCTRN FT DTTGFSPYELMFGRQPRLPVDIAFGLPATGSSPSHSIYVRNLKDRLEESYR FT IATENASKLARRNKKRFDERVVTSFLEVGDRVLVRNVKLRGKHKLADKWEK FT EIYVVLKKAGDLPVYTVSPEGRDGPLRTLHRDLLLPCGFLQESMPEPVKPK FT PPRRPRTRANTSAREPDTMTESSDSEDDSMDHYSRRHLPKVESRILFNPRH FT VKPSRDRPIAELSSKTRVVKSKINDCPAIEIPQENLPYLPEDENCPMSEPE FT RENTPVMDSVRPDMLRNVPVVNDQELLEQRDELEILSEIDDEADQRNIHSG FT QAVIDQVERNTLRRSQRHREPPQRLQYSQLGNPLSLVIQSLLQGLSTAVTA FT SLEESDCPREASLLMQKMFPSAAVTQPKRCRGTCIDSRRGE" XX SQ Sequence 6655 BP; 2188 A; 1425 C; 1423 G; 1619 T; 0 other; atcttggcga gccagccagg agcgagaaag cagcagcttt tgaggtgaat aattattgaa 60 atacgcatat atagtgaaaa tataacatta caatggatgt catagaacga gaagacgtta 120 aaatagaaaa tgcagtgatt attagtggtt taaccctaac tgaaacagat gaagtcgtag 180 aatcctacct tttgagattt ggttctatac ggcaaaactt tctgattgat gatccgcagt 240 ctgtatttca tcataatgca gttgttgagt tttcacacaa ctccgccatt tgcaacattg 300 agcctcaatt gcctttgact atcgtaagcc ctactgatat gagcgttata ttccgcatac 360 acaaattgag tactgcttac agcacagcca catctagcga cacagcaaag caactgaagt 420 cacaagagga taaaaatgta gaatcccaac aagatatcat taagcaaaat tcaatgaaaa 480 tgaatggagt gacacatttg gatgatgcat cacacagaga agattgtgga cttgaaaatt 540 taaaatccag cacaaaagtc ttgacaagtg taactaacac ccctcagcga actggaagtt 600 tcaagccatc tgctctaaaa attgataata acaacccagc caaggtgaac atgagtacta 660 catcgcccca tagtttaaca gaagccaatt ctgtatctgt gagaattcct atatcctcta 720 tgcacccccc agacattcag agagtaatta tggaacatgt tgtgaagatg agtgaaacag 780 tgccccaaca cagtgcttct ttccgcttaa aagccttttc tggacggatg cctcgtccag 840 gtaatgagcc tgacttcgac acatggcgga caagcgttga ctatttgttg aatgagttat 900 ccctttctga gtcacataaa atgcaaaaaa ttctagacag cctgttaccg cctgcttcag 960 atgtcattaa gcatgtgagc cccaatgctc cagcatcaga gtgtctgagg ttactagagt 1020 ctgtttacgg ttcagtggaa gatggagatg aattattagc aaagttcata aacactctgc 1080 aggatcctgg tgaaaaatca tctgcctatc ttcatagatt gtatgtgctc ttgtgcacta 1140 ccattaggcg tggagggatt gcggagagtg aacgagaccg ttatctcctg 
aaacagttct 1200 gccgtggctg ttgggacaat gccttgatag ttgaattgca gctagagaga ataaaagctg 1260 atccactctc ctttgctgag ttagcagtac tcttaagaac agctgaggag aagaaaactt 1320 caaaagaaga gagaatgaga aagcatcttg gtttgggcag accctcacca accccactca 1380 aattaagaac aataactcac caacagtctg ctcactttaa taactcactt gaagtggatg 1440 caactaatat tccagcattt gacaatccaa aacagaaagt ctctaaacag aaaagtaaaa 1500 cccagtgtcc tgaaacatct gaagctgatg ctttgaaaaa ggagattatg gctctccaaa 1560 gccaaatcac tgccattaaa acagcagctg accaggaagc aagggagaga actgaggcaa 1620 gtgagcttca gcaactaaaa ggacagatag cagagcttaa agtccaactt gccacttctg 1680 gagcacagag aaagcagttt cagaaatccc ctcaacagag taataatcct agtgattttg 1740 gtgtcagtcg agagagaaat gaaaggcaaa aaaacactga attaagaacc aatcgaccca 1800 gaccatggta ttgctttcgc tgtggagatg atggtcatct tgccatttac tgtgaaaatg 1860 caccaaaccc attaagagtt gaagaaaaga gacaaaaatt aagagagaaa caagctgagt 1920 gggatcttag aaatggagga gccacagtgc ctttaaacta aaatcagtct ctatcgcagg 1980 gcggatgggg actaagagaa aacaaagccc aaataatcaa caagcactta ataagcatac 2040 agataccagg aactccttaa agttaccagc cggactagtg ggaatgaagt gcactgctag 2100 agtcaaaatt gaagaaaagg aagtgaattg cttgctagac acagggtctc aagtcacaac 2160 agtccccatg tctttctaca accgctacct gtcacggcat cctatgcagc cgttgaatca 2220 tctgttagag gttgaagggg caaacggcca agctgttcct taccttggat atgttgaact 2280 gactctaaag tttccacaag agtttttagg atctgaggcc gaagttccaa cattggccct 2340 agttgtccca gacctgacac aaacacccca aatccttatt ggcactaaca ccctagatgt 2400 cttatatgct ggtcatgctc aaacagtcaa gcccagagtt atgtcacgtt atcaagggta 2460 tcaagctgtg ataagagttc tagaagcaag actgcagcag gctggtgtga atgtcctggg 2520 ccaagtgaaa gtaaaaggag acatgcctga agtgatagca gctggaagta ctgtagttct 2580 taatggatgc gtcaaagtta ttgggccact ctcagagact tgtgttacaa ttgaaccctc 2640 agtatcgtca tctttgcctg gtggattact tgtggcaagc agtttgcatt ctctacctgc 2700 aaaacgcaat gtccaaatac caatagtgct aagaaatgag acacaaattg atttaactat 2760 tcctccaaaa gcagtattgg ccgaagtaca tgctgtacaa agtgtgattg agggaagaaa 2820 accagtaaat gcttctgtga gcaaagttac aaatcctgca caaaatagaa tcatccctga 
2880 ctttggtgac tctccgttat caaatgattg gaaggaaaag ataactgatc ttgtaaactc 2940 catgcaggat gtatttgcac ttcatgactt ggattatggc cacacaaaca aagtaaaaca 3000 ccatatccat ctcagtgata gcaccccatt caagcagcgt gctcggccta tccatcccca 3060 ggatgtcgat gctgtaagac ggcatcttaa agaactcctt gatgcaaaca tcatcagaga 3120 atctgaatcc tcttttgctt ctccaattgt agtagtaaga aagaaaaaca atgatgtacg 3180 cctctgcatt gacttcagaa agttaaactc gcaaactgta aaagatgcat atgccctgcc 3240 taatttggag gaagcctttt ctgttctatc tggctccaaa tggttttcag ttctcgactt 3300 aaaatcaggc ttttatcaaa ttgagatgga agaggctgat aaatcaaaga ctgcattcgt 3360 ctgtccttta ggattctatg aattcaacag aatgcctcaa ggcgtcacta atgcaccaag 3420 tacatttcag aggctgatgg agcgatgcat gggtgatctt aacagaaaag acgtcctagt 3480 tttcatagac gaccttattg tcttttccaa gacattagaa gagcacaaag ccaaactctt 3540 gcaagtcctg acacgactaa aagaatacgg attaaagctt tctcccaaga agtgcaagtt 3600 tttccaaaca tcagtccagt acttaggcca catagtctct cagaatggtg ttgaaacaga 3660 tccaaccaaa attgaagctc tcaaaacctg gccaagacct aaaaacctta aagaactgaa 3720 atcttttctt ggatttgcag gatactacag aaggttcgtt caagacttct caaaaatcac 3780 aagacccctt aacgacctta ctattggata tcccccactg cagaaaaatc agaaacgaaa 3840 ccccacaaaa agtacacctt acctggatcc taaagaacag tttggagagc gatggaacca 3900 ggattgtcag tgggcattta acacaatcat agagaagttg acctctgctc cagtcctagg 3960 atacgcagat cctagactcc cctatgtgtt gaccactgat gccagcactg ttggacttgg 4020 agcagctctt tatcaagagc aggaaggtcg aatgagggtg attgcctttg caagtagggg 4080 actaactaaa agtgaagcaa agtaccccgc tcacaaatta gagttccttg cactcaagtg 4140 ggcagtcaca gccaaattca gtgactatct gtatggagga gagttcactg tgattacaga 4200 cagcaaccca ctcacctaca tattaacatc tgcaaaactc gatgcaacca gttacagatg 4260 gctgtccagc ctgtcaacat tcaattttaa gatccagtat cgtgcaggca aaagaaatct 4320 agatgcagat ggactctcaa gacggcccca tgatgaacag attgatgatt tcgcctctca 4380 gaaagaacgt gaaagaatca aacaattcac tctccatcac ttagctgaga cagaacaaca 4440 agttatcttg cccgatgcag taaaagccct ctgtgaacga caccaagtat atcagagctg 4500 tggtgattct aacctaccgt attctcaact tactgtggta gaatcactgt cccaaagtgt 4560 
tgatgcagta ccccaagaat tccagcagga ggcaggaggc cttccagtac ttccccaaat 4620 gtcagaggaa gagttaaaag aatgtcaaag aactgatcca gtgcttgaaa aagtaattag 4680 gcaccttgat tctggaaaga aacctcatgg gaaagtagag cctgcagaag tagctttgtg 4740 gctaagagag tgggaccgct ttgagttcaa gaatggaatc ttatttagaa agagacagga 4800 cccaagaggc gtattgtatc agttggcctt gcccaaaaag ctcagaggaa atgtattaaa 4860 aaatctgcac aatgacatgg ggcatctcgg aattgaaagg actttggacc tggctagagc 4920 ccgtttttac tggccaaaaa tggcaataac tgtggaagag aaagtaaaga cctgtgagcg 4980 atgtgtaaaa cgaaagaccc ctcctgagcg ggctgcaccc ctggtgaata tcacaaccag 5040 cagacccctt gaactagtct gcatagacta cttgtcacta gagcctgatc gaagcaacac 5100 taaagacatc cttgtaatta ctgatcattt caccaagtat gctgttgcag tgcccactag 5160 aaaccagaaa gctgaaactg tggccaaatg cttatgggac aactttttca tccattacgg 5220 gtttcctgaa agaattctga gtgatcaggg gcccgacttt gagtcaagaa taatcaaaga 5280 actgtgtggc atagcaggaa tacaaaaagt aagaactacg ccttaccatc ccagaggcaa 5340 ccctgtagaa cgtttcaaca gaacattgct ccaaatgtta gggaccctgg aggacaaaca 5400 aaagacctat tggaaagact ttgttaaacc attagtccat gcttacaatt gcactcgtaa 5460 tgacacaaca ggattctcac cttacgagct gatgtttggg agacagcctc ggctgccagt 5520 agacatagca tttggtttac ctgctactgg gtcatcccca tctcattcca tttatgtgag 5580 aaatctgaaa gatcgcttag aggaaagcta tagaatagct actgagaatg cctcaaaatt 5640 agcaagaagg aacaagaaac gatttgatga gcgagtagtc acctcattcc ttgaggttgg 5700 agatcgtgtt ctagtacgga atgtcaagct aagaggaaag cataaattgg ctgataagtg 5760 ggagaaagaa atctatgttg tgttaaagaa ggcgggagac ttacctgttt atactgttag 5820 tccagaaggc agggatggcc cacttcgcac actgcatcga gacttgctgc tgccctgtgg 5880 attcttacaa gaaagcatgc ctgagccagt aaaacctaaa ccaccccgca gacctagaac 5940 tcgagcaaac actagtgcaa gagaacctga tactatgact gagagttctg actctgagga 6000 tgattcaatg gatcactact cacgtagaca cttaccaaaa gtagagagca gaatcctctt 6060 caatccaaga catgttaagc cttcaagaga caggccgatt gcagaacttt ctagtaaaac 6120 aagagtcgtg aaaagtaaaa tcaatgactg tcctgcaata gagataccac aagagaactt 6180 accttactta cctgaagatg aaaactgtcc catgagtgaa cctgaaagag agaacacacc 6240 tgtaatggat 
tctgttagac cagatatgtt aagaaacgta cctgtagtaa atgatcaaga 6300 actattggaa cagagagatg aactggaaat attgagtgaa atagatgatg aagctgatca 6360 gagaaatatt cacagtggcc aggcagtgat cgaccaggtt gaaagaaata cattgagacg 6420 ttcgcaaaga caccgtgagc caccacaaag gctacagtac tctcaattag ggaatcctct 6480 ttctctggtt atacaatcct tactacaagg tctcagtaca gcagtcactg catctttaga 6540 ggagtctgat tgccccagag aagcctctct tttaatgcag aaaatgttcc cttctgctgc 6600 cgttacgcag cctaaaagat gcagagggac ctgcatagat tccaggcggg gagaa 6655 // ID DIRS-N1_DR repbase; DNA; ZEB; 6632 BP. XX AC . XX DT 31-OCT-2008 (Rel. 13.1, Created) DT 31-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; Nonautonomous; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-N1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6632 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(10), 1273-1273 (2008). XX DR [1] (Consensus) XX CC Members are 98% identical to the consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX SQ Sequence 6632 BP; 1838 A; 2035 C; 956 G; 1802 T; 1 other; aagtgaagtt tattcataaa ctaatttcga gaggatcacg tgcttatgat ttatcacgtc 60 cagtctcgca tttgctaatc actaatcttc caatcatatg agccctaagg caccataaat 120 agtctaagtt ttcagttcac tttatcttcg tttggaagaa gcagcttcac cgtagctagc 180 tccgttgaag aaccaactct aaaccagcat ggacaactaa agcagcaaca agctacaatc 240 ttcaggatct tcatttggaa gaacgctctg ctctgctcat caactacaaa agctacaagt 300 gctacaagct acaaaagctt caagctacaa tctacaatat acaatctaaa agcttcaatc 360 tacaagctac aatatacaag ctacagcaac aacaacaaca actactattg ctaccatttc 420 aaacaactac aactccaaca atttcatcaa cagcttaaac aacaacacca aaaccttcca 480 cttcaaagag cctccacaac gcatctgctg tgtcttcaac cttatttccc agcaagatat 540 aagccaaaac tccaaagcta ttgcaataaa acggaaccct caccttctcc tagcagtaca 600 catgctatcc atcgtttaac agattgtttg ctggaatgtg ttcagcagag cagcagctcc 660 agccacaaac aatgttgaat ttgatcaaca aaatggccgc caggcttttt gcaccttttg 720 catggcctga ctgaaactcc ttaagccaat agctgtaaag ataagcgtct ccatccaatg 780 agctaaaaga ctggagatgg tcccgccctc tctcttgact ctgttgcaaa gaccttatga 840 atgaataggc cagacacgct tgtaaattta gttaaaaaaa aaaaaaaaaa aaaaaaaaaa 900 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaataaaa ttccatttga catttaactt 960 ctgaaaagat tattacccaa ccagtaaata ttaacaagct attgaaaaac attatgccat 1020 caagcacaga agtccaattg taatacgcca gcgactcatg tgttgattta caagcgtcat 1080 ataagattat atagcaagtc agtcggatgt gaaaacggca aagaaacaaa cttcccactt 1140 gactgttata aatacaaatt taaacttcac tatcacttca catgctcagt cttcaagggg 1200 taccagtttt ccaggaccac cagactagcc atatcccatg tcctcagctc tccattcatc 1260 tgtcatcccg ttaggagacg aacacccacc cctaccatcc ttgcacatcg tatcaatgaa 1320 gtcatcggac catgcattat caaaagcgct tccatttaca gccgcccatg cccttattaa 1380 gatgcctcgc atggatataa actaaaacaa ttttcatgtc tgccatactt gtattatttt 1440 gcaaagattt aatttctgat tctctttgtt ttttctattt tagagatcat gactgctggc 1500 tccagaagct gaccctcatc caacctatta cactccacat cctcttcaca aaggatacat 1560 tccatttttc actttatcca atctgtccta acctgcagtc tgcaacctcc gaacctgaaa 1620 ccgtcgatta tcttattaaa aagaaatcga 
caacaacttc atgattggaa ctccaaggct 1680 cctcctttta atatctcgcg cattagcccc attggtgtcg ctactcaaaa atgttctggc 1740 aaaaacgcct tgtagtcgac ctttcgtccc aacacaattg ctcctttcct agcattaaca 1800 gcactatccc attggacgaa tatttactta actaccacga cattgatcaa gagatctctc 1860 tcataaagat agccagtcgt aacacctggc tagccaaaga agacatttcc cctaccttta 1920 aaatcatgcc aatccaccca gacttctggc accttttggc atttattggc gaaatcaatt 1980 ctatttttcg gatgcaaaag cagcccaaaa aatatttgac ataatgtcag aagctttatg 2040 ctggattctg tctaatattt acgaaatacc atacctcatt caccttctag acgattttct 2100 tatttccccc ccgtcatctc ctccagccaa acacctagcg atcacccagc aggttttcgc 2160 tgatctcgga aggacccagt acttcaatcg aatttctggg tattaatcta gactcgcata 2220 aattccaagc atcccttcct aaagagaaaa tcaatcgaat aatttctcta tcccaaattt 2280 tccttgaaaa acagatgtgc acacaacgag aactcctacc aattctcggc catctaaatt 2340 tcgctatgcg cattattcct ttcatttccc acctccatca attatccacc acagttcatg 2400 gtttagaaga aacaattttt ctctccaaac ccagtcgcga tgaactttgc ttatggatct 2460 ctttccttaa gcaatggaac ggctgttcct ttttctacag cgacttgatt gcatccccta 2520 ttgacatcaa cacatacaca gacgctgccc ctccaatagg ttacagtggc taatataaag 2580 gacactggtt tgtctcaaca tggccaaccc aattgtaatt ccattccaaa gaccaatgtt 2640 tttcagccct cttcaaattc taccctatca tcgcagcagc catcctgtgg ggggacgaat 2700 ggtctacatc tagcattctc tttcactgcg ataacgaagc tacagtgcat tgcattaaca 2760 aagggcgctc ccactcccaa tcgcttatgc catttttaag acaccttatc tggatatctg 2820 ctaaaaaaca atttatcatg attgctgaac atgtacctgg ttgcaaaaac caaatcgctg 2880 actctctctc tctctcattt ctctttgcag atattctggc aactagcccc ggaagcagac 2940 cctcacccaa cgccggtccc tccttattca gtaacgatat ttccataaac cacccacttc 3000 ataatcttca tcaaacttct ctatctctta tcctacaagc aatagctccc agaaccctca 3060 atgcatacct cacagcatgg aattcgttca aacaattcca taccatgcta gacaacccat 3120 tacactcaaa atccttacct catgcatcta caccctctgt aaaggttaca tttcctccca 3180 tacagcccgc accttagatg ctatgttcaa tctagcattt ttttgggggg tttcttaaat 3240 gttctgaatt aacagttaca tccaaattta accctctact ccaccccacc atctcagatc 3300 tagctttgca agacagggaa actatctctt tccttatcaa 
acaaagcaaa acagattaat 3360 tccagagagg acactctatc ctcattttca atattccttc acctacacgc ccattccaaa 3420 ccctcttagc ctagaaaatc tcaagaggct aacccactgg ccccgctttt tactgatgac 3480 gctaaccgtc cagtatctcg attctggttt caaaaacacc ttaaagaaat ctttcgccta 3540 tcaggttttt tcccagagcc cttttccagc cactcattca ggattggcgc agccactaca 3600 acgggctctc acaccatcag atccagaccc ttggtcgctg gtcttctgaa gttttcaaat 3660 cttatatacg tctcagtaaa taccacctca agaaagccca acaggcttta accaaccccc 3720 aagcacctcc tcacacggct ccaactcaaa aaggtgtcta acagagcctc gactctcgca 3780 agagtcatcc aaaccaaacc ccgcagaagc ccaattgtcc acccatggca cccttacatg 3840 tttgccaagg actccctcaa gggccattgc agttagcatc ttctgcaatc accaatagct 3900 atttacagat gcagacgttt gtatccacat gttctttgcg catttactgc gtttgtacat 3960 atgtattata tttgtgcttg ttcccttctt ccaacttttc caaatctttt atccttataa 4020 ccctactttt cagcctctcg ttcctttctt ctctacactc cactcccaac taatctattt 4080 tacactctac ctctaatgat cttgtttctc ctgtctagac tcttcagaca cccgaagggg 4140 cccccttgag ctccaactct cgcaggattc ggccatagcg gccacagtcc tttcttcttc 4200 acgatgttac ctattctttt tccgcctaga ctctttctcc aagctccaaa ccccgcaggg 4260 gtccgctcaa gctaaaactt cttcactcta cttcaactac ctctcttctc cttatgactc 4320 accctctagc tctgaccccc acaggggtcg ctccgagccc caatctctcg caagagttat 4380 ccaaatcctc tttcccactc actgatatct atctcctcct atccgcaatc taccctcaag 4440 ctctgacctc cacaaaggtc gctcagagct ccaactctca caagagttac tctcttccac 4500 tcactgctac ctatctcccc ctatctacac tctaccccca agctctgacc tccacagagg 4560 tcgcttcaag ctccaactct cgcaagagtt actctcttcc actcattgct cctcccagct 4620 acactctacc ctcaagctca aacctccgca gaggtcactc ctatctacac cctaccctca 4680 agctcaaacc tccgcagagg tcactcctat ctacacccta ccttcaagct caaacctccg 4740 cagaggtcgc tcctagctac actctaccct caagctcaaa cctccgcaga ggtcactcct 4800 atctacactc taccctcaag ctcaaacctc cgcagaggtc actcctatct acaccctacc 4860 ctcaagctca aacctccgca gaggtcactc ctatctacac cctaccttca agctcaaacc 4920 tccgcagagg tcgctcctat ctacacccta ccctcaagct caaacctccg cagaggtcac 4980 tcctatctac accctacctt caagctcaaa cctccgcaga ggtcgctcct 
atctacactc 5040 taccctcaag ctctaacctc cgcagaggtc actcctatct acaccctacc ctcaagctca 5100 aacctccgca gaggtcactc ctatctacac cctaccttca agctcaaacc tccgcagagg 5160 tcgctcctat ctacwcccta ccctcaagct ctaacctccg cagaggtcac tcctatctac 5220 accctaccct caagctcaaa cctccgcaga ggtcactcct atctacaccc taccttcaag 5280 ctcaaacctc cgcagaggtc gctcctatct acaccctacc ttcaagctca aacctccgca 5340 gaggtcgctc ctatctacac cctaccttca agctcaaacc tccgcagagg tcgctcctat 5400 ctactcccta ccctcaagct ctaacctccg cagaggtcac tcctatctac accctaccct 5460 caagctctaa cctccgcaga ggccgctcta aactcaaact ctcacaaaag ctccatcttt 5520 cgtaagagct actctcctcc actctctgct ccataccttc actctactct ggcccccgca 5580 ggggtcactt agagctccaa ctcccacaag attcactcaa agttctcttt tccactgctt 5640 caaaccacct attctcccct tcttcttaac cccgaaaacc atgacccccg caggggttcc 5700 ttcaaactct aactccagca agagttatta gaatcctctt ttcaccctta ttaatcctat 5760 ctaattaatg cccatgcatt taacctcttt ccttctatta tatccagcag ccggatatag 5820 ctctaaattt cctgcctttt ggggggtttt ttcttcgaat acgcggctgc tgtcccgagc 5880 gattaatttc tgcattttgg ggagttctga gatccaccga gctcaggctc ccttcttgct 5940 ctgccaacgg gaggaagccc cgggctcgag gagcccttga gctcggggct ctctcccggg 6000 acagcatgcc aaataagctt tgtaaatcat cagctaagtg tgaactcttg aagtgaagtt 6060 tattcataaa ctaatttcga gaggatcacg tgcttatgat ttatcacgtc cagtctcgca 6120 tttgctaatc actaatcttc caatcatatg agccctaagg caccataaat agtctaagtt 6180 ttcagttcac tttatcttcg tttggaagaa aacccccctc ctcccctatt ctcctccttt 6240 acctgtaatt gggcggcacg gcggtccagt ggttagcact gtgaaccaca cagcaagaat 6300 actgccggtc ctagttcgat aggaccggtg agtgtttctg tggggagttt gtatgtcctt 6360 cccgtgtccg cgtgggtttt ccccgggctc tccggtttcc tcccaccatc caaagacatt 6420 caacatactt aacaatcaag ctggtctaat tccttacgtt cccttagcta cagcggcagg 6480 ggagttctga gatccaccga gctcaggctc ccttcttgct ctgccaacgg gaggaagccc 6540 cgggctcgag gagcccttga gctcggggct ctctcccggg acagcatgcc aaataagctt 6600 tgtaaatcat cagctaagtg tgaactcttg aa 6632 // ID RTE-1_DR repbase; DNA; ZEB; 4083 BP. XX AC . XX DT 29-APR-2005 (Rel. 
10.04, Created) DT 29-APR-2005 (Rel. 10.04, Last updated, Version 1) XX DE RTE-1_DR is a non-LTR retrotransposon - a consensus sequence. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW Interspersed repeat; reverse transcriptase; endonuclease; KW RTE-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4083 RA Kapitonov V.V. and Jurka J.; RT "RTE-1_DR non-LTR retrotransposon from zebrafish genome."; RL Repbase Reports 5(4), 94-94 (2005). XX DR [1] (Consensus) XX CC RTE-1_DR is a non-LTR retrotransposon that belongs to the RTE CC clade. CC Copies of RTE-1_DR are ~1% divergent from the consensus sequence. CC RTE-1_DR encodes a 1049-aa protein composed of endonuclease and CC reverse transcriptase domains. XX FH Key Location/Qualifiers FT CDS 891..4037 FT /product="RTE-1_DR" FT /translation="MLTGLSEDLRGIDDSRKTAVINNELLRLNVDIAALQE FT TRLADSGTLKEKDYTFYWQGRAPDEPRQHGVAFAVKNNLLSMVEPGRNGTE FT RLLTLRLNTTTGPLTLVSVYAPTLNATLETKDKFYGNLTSVINNIPDKEQL FT VLLGDFNARVGANHESWPSCLGKFGIGKMNENGQRLLELCAFHNLCIANSY FT FQTKPQHKVSWRHPRSKHWHQLDLILVRRSAINCVLHVRSYHSADCDTDHS FT LVCCKIRLNPKRFHRLRKQGNPRINVSRMSQPDLTQKFAETLVKELDTAQT FT GDSALEMWESLRNTMQRTALATFGKRTTKTHDWFEAKASTMIPCIEAKRAA FT LTEYKRSPTQKNLQILRSTRSKAQHIARYCANEYWQELSNDIQKAAIAGNI FT RGMYDGIKKALGPTQCKTAPLKSSTGEIISDKQQQMERWVEHYSDLYSRQN FT TVSSAALDVIKCLPTMEELDEEPTAEELRKAIDKLASGKAPGSDGIPPDLL FT KQCKDSLLHPLHKALCQCWKEGAVPQDMRDAKIITLYKNKGERSDCNNYRG FT ISLLSIVGKVFAKVILARLQKLAERVYPESQCGFRAERSTVDMIFSLRQLQ FT EKCREQQMPLYISFIDLTKAFDLVSRDGLFKILPKIGCPPKLQSLIESFHT FT DMKGTIQFNGSCSEPFSISSGVKQGCVLAPTLFGIFFALLLRHAFGSATEG FT IYLRTRSDGKLFNLSRLKAKTKVRETLIRDMLFADDAAVTTHTQEELQSLM FT TRFSMACKDFGLTISLKKTNVLSQDTATPPTITVDDYQLDVVHQFTYLGST FT ITDNLSLDAELDRRIGKAASTLARLTTRVWTNHRLTTATKMAVYNACVIST FT LLYGSETWTTYARQERRLNTFHLRGLRRILGITWQDKVSNVKVLTRAGLPS FT MYTMLRQRRLRWLGHVCRMEDGRIPKDILYGELSSGKRTTGRPFLRFKDVV FT
KRDMKALDINTKSWEDLAADRLKWRCTLTKQLKSGEKNMMRASEDKRVHRK FT VQSVSQGATYQCDCCGKKCLSRIGLFSHQRRCLRQSVMPQYKS" XX SQ Sequence 4083 BP; 1141 A; 1112 C; 993 G; 837 T; 0 other; ggttccaagg ttgtcatagc cgggggcggt aatggggata agctcccact atctataaag 60 tacccctata cggcgtgcgt ctcaaatagc ctctgacaac caagtccagc tcctggcctt 120 caagtgtggt ttagctacca aacccggcgg aactgttttc actgacagga gaaggggcgc 180 aggcgggtca ctggcgcctt acaaccagtt gcttcgggga gatgatattc gttagcctgg 240 gaaggcagat catctagggg aaggcaaccc tgttttcaaa cctccgctgc cttgcggcta 300 tatccattca tggaaaaggc ttcaggagta aacctcgagg aaaaatccgg agtcggagtc 360 cctaaggcag tttaacgctg tttgcagcct cactctggca actcctgcga cggcgccgat 420 accaaactgt agcagccctg ctgttccttt ggatttgtcg acaacgtgga gaggggggac 480 ccgctacatg ggcaacagcc tgtcctccat aatacattgc cctggctagt atccgatctc 540 gcacaccctg gagaggacac tccagcctcg ctagcactct ggcgtggata caacacgggg 600 agcagtagtt taccggttat aagccacagc tcggttggcg tagagcaagg cgccagggac 660 tgctcccgac ggtgggaggg atcttcgggt cccactggac agttaccgcc cgcctcaagc 720 tgggcagccc ccagtcaatt aggtactgcc ccgccacagt ctgctttcct cattgggtgc 780 atggggaata ggagcatttc gaacagcaga ctgcaaccat cgcaccagac aataaaataa 840 cacagagaaa gagaccagct ctaaaactgg gatgctggaa tgtccgtaca atgcttacgg 900 gcctctccga ggacttacgg ggcattgacg actcacgaaa aacagctgtc attaacaacg 960 aactactgag gctaaacgta gacatcgctg ctctacagga aacacgacta gcagactcag 1020 gaactctaaa agaaaaagac tacaccttct actggcaggg aagggcccca gacgagccca 1080 gacagcatgg cgtggctttt gctgtgaaga acaacttact gagcatggta gaaccaggca 1140 gaaatggtac agaacgactt cttaccctcc gcctcaacac caccacaggc cctctcactc 1200 ttgtcagcgt gtacgctcca actctgaacg caacactgga aactaaagat aagttttatg 1260 ggaacctaac atctgtcatt aacaacatcc ctgataagga acaactcgta cttctgggcg 1320 atttcaatgc cagagtgggt gcaaaccacg aatcatggcc ctcgtgccta ggcaaatttg 1380 gcattggaaa aatgaacgag aacggccaac gcctgctcga gctttgcgct tttcacaacc 1440 tgtgcatcgc caactcatac ttccagacca agccccagca taaagtctcc tggcggcatc 1500 cgcggtcaaa acactggcac cagctggacc tcatcttagt tcgccgctca gctatcaact 1560 
gcgtcctgca cgtacgctct tatcacagtg ctgattgcga cacagaccac tccttagtgt 1620 gctgcaagat caggttaaac ccaaaaaggt ttcaccgttt aaggaaacaa gggaatcctc 1680 gcatcaacgt cagcaggatg tcgcagcctg atctgacgca gaaatttgca gaaacccttg 1740 tgaaagaact tgacaccgca cagacaggtg attctgccct ggaaatgtgg gaatcactac 1800 gaaacacaat gcaacgcact gccctggcaa cttttggaaa gaggaccaca aagacgcatg 1860 actggtttga agcaaaggcc tctacgatga tcccatgcat agaagccaag cgtgcggccc 1920 tgacagaata caagcggtca ccaactcaga agaaccttca aattctcaga tcaactagga 1980 gcaaggctca acacattgcc agatattgcg caaacgagta ttggcaagag ctcagcaacg 2040 acatccagaa agcagccata gcggggaaca taagaggcat gtacgacggc attaagaaag 2100 cgctaggccc cacccagtgc aaaacggcac cccttaagtc atctactggg gaaataatct 2160 ccgacaagca acaacagatg gagagatggg tggaacacta ctccgacctc tactctagac 2220 agaacacggt gtcctccgca gcactagacg tcattaaatg cctgccaacc atggaagaac 2280 ttgacgagga gccaacagca gaagagctca gaaaggctat cgataaactg gcctcaggca 2340 aagcccctgg cagcgacggg attcctccag acttgctgaa acagtgcaag gattccctac 2400 tgcaccctct tcacaaagcc ctctgtcagt gttggaaaga aggggccgta ccgcaggata 2460 tgagggatgc taagatcatc accctctaca aaaataaggg tgagagaagt gattgcaaca 2520 actacagagg catctccctt cttagcatcg ttggaaaagt atttgctaag gtcatcttgg 2580 cccgactgca gaagctggct gaacgtgttt acccggagtc acagtgtggt tttcgcgccg 2640 aacggtcaac ggtagacatg attttctccc tcagacaact gcaggagaag tgtagagaac 2700 agcagatgcc cctatacatc tcctttattg acctcaccaa agcctttgac ctggtcagta 2760 gagacggact ttttaaaatc ctccccaaga ttggctgccc accaaaactg cagagtttga 2820 ttgaatcttt ccacacagat atgaagggaa caatccagtt caacggcagt tgctctgagc 2880 ctttcagtat aagcagtggc gtcaagcaag gctgcgttct tgcccccaca ctgttcggaa 2940 ttttctttgc cctgctccta aggcatgcct ttggttcagc aacggaagga atctacctcc 3000 gcaccaggtc agatggcaag ctatttaatc tttctcgcct gaaagccaag acaaaggtac 3060 gcgagacact gattagagac atgctttttg ctgacgacgc tgcagtcacc acacacaccc 3120 aggaagaact acagtcgctg atgacccgtt tttccatggc ctgcaaagac tttgggctga 3180 ccatcagttt gaaaaaaaca aatgtcttga gccaggacac tgccactcca ccaaccatca 3240 cagtagatga 
ttaccagctc gatgtcgtcc accagttcac gtacttgggc tccaccatca 3300 ccgataacct ctccctggat gctgaacttg acaggaggat cgggaaggca gcctctactc 3360 tagcccgcct gacaacccga gtgtggacaa accataggct gacaactgca acaaagatgg 3420 cagtgtacaa tgcttgcgtc atcagcactc tgctgtatgg gagtgagaca tggaccacct 3480 atgcaagaca ggagaggaga ctgaacacct tccacctaag aggtctgcgt cgcattctgg 3540 gcattacctg gcaggacaaa gtctccaacg tcaaagtctt gactcgagcc ggccttccca 3600 gcatgtatac catgctccga caacgtcgcc tgcgctggct tggccatgtg tgccgtatgg 3660 aggatgggag aatcccaaag gatatccttt acggagaact ctcatctggg aagagaacaa 3720 caggacgccc atttctgaga tttaaagatg ttgtgaagag ggacatgaag gcccttgaca 3780 taaacaccaa gtcctgggaa gacctcgcag cagaccgcct gaaatggagg tgcaccctga 3840 ccaaacagct caagtcaggt gagaagaaca tgatgcgtgc gtcagaggac aagcgagttc 3900 accgaaaggt gcagagcgtc agccagggag ccacctatca atgcgactgc tgtggtaaaa 3960 aatgcctctc ccgtattggt ctcttcagcc accaacgacg ctgtcttaga caatcagtca 4020 tgcctcaata taagagctag gatacgtcat ccatggtcaa cactgaccga cggaggccta 4080 cta 4083 // ID BRSATI repbase; DNA; ZEB; 186 BP. XX AC M89944; XX DT 22-DEC-1995 (Rel. 2, Created) DT 22-DEC-1995 (Rel. 2, Last updated, Version 1) XX DE Zebrafish satellite type I DNA. XX KW Repetitive DNA; satellite type I; BRSATI. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-186 RA Ekker M., Fritz A. and Westerfield M.; RT "Identification of two families of satellite-like repetitive DNA RT sequences from the zebrafish (Brachydanio rerio)."; RL Genomics 13, 1169-1173 (1992). XX DR GenBank; M89944; Positions 1 186. XX SQ Sequence 186 BP; 56 A; 36 C; 30 G; 64 T; 0 other; gatccagcca taaaatgcat cattcttttt tgttttagac aacatttcat gcactgttaa 60 acatgttaaa gcaagttgca agtgaaaatc tatgtctctg actgagtttg cattactgtg 120 atttgacctc tctgctggct gagataagct cattttcaac gtccaattca gaatgtaata 180 aaacct 186 // ID DIRS-1_DR repbase; DNA; ZEB; 5654 BP. XX AC .
XX DT 26-SEP-2008 (Rel. 13.1, Created) DT 04-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; reverse transcriptase RNase H; KW phage integrase; DIRS-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5654 RA Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(10), 1267-1267 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 5654 BP; 1066 A; 1734 C; 1551 G; 1295 T; 8 other; ggcacgagta ttgcgtcaca tgcttggggg tccaacatgt tgatgcggtg ctcgcggaca 60 gttcttgccc gcattgcgat ggcttgactg ttgcgcaact gaggtcgcgg tgagctcttt 120 tattggggca agtcacccct gctgctcccc gccaagcggt tagcactcgg gcagacctga 180 gggttacagt gggggctaat ccgtcgcctg cggaccctcg ctcctctcca gtgcgctctg 240 tttctgcttc gagtgcgagc gctggtccat cctccgggct ggccgcgctt gcattagatg 300 acaccgggga cgtgacgtcc atcgctgcat cggagggcgg gttgacatgc tccgaggaag 360 atccggaccc cctgccaccc tccggggttg tcagtatggt tactggaatc ggacatgtta 420 gccgtgttat cccgggctgc ttcggccgtg gggctggaga tggcttgtcc cccagaaccg 480 cggccggacc gcttagacgg gtgctacgtg ggggtgaaga aaggcgaagc cttctagacc 540 cccccgtccc cttcttcccg gaggtacaca gagagctcac gcaggcatgg agggcacctt 600 tttctgcccg gtcctcgtgc gctcccgctc tcacctccct ccgtgatgga gcggccaggg 660 gatatgaggc gattccccgg gkggagcgtg ctagccgytc cactcgtgga gattcgccca 720 aactcccttc caaggcgtgc aggttatctg agtctctcat ggccagagcc tacactgctg 780 cgggtcaggc cgcctccgct ctgcacgcta cggccaccta tcaacgctac caagccgaga 840 agctggccga gatgcaggag ggaggttctc ccccgggctt gctgcatgag ctccgcacgg 900 caaccgatta tgctctcatg acccccaaat cggctgctca tgccctgggg aagacgatgg 960 ccacattagt ggtccagaaa cgccaccttt ggctgagctt 
ggctgatatg agtgacgttg 1020 acaaagctcg ctttcttaac gctcccatat cccaggccgg cctgttcggc gacaccgttg 1080 ggggttcacc caggaattct ccgcggtgaa agagcagtcc aaggcgatgg gtgaagtcat 1140 ctatcagcgg gctcagaaaa ctgctcctcc cgctgcccct tccgcttggc gctcctcgcc 1200 gagggcgtcc acccgcggct tcaacattcc gcttccgctg cccgctccac cggccaagcg 1260 gcaacgccga gcctcccacg ggctgcgacg ccaccgcccc agggcgccgt taagtcggta 1320 aacggacccc gaagcgttcc tgggacgggc cattcggaga agaggggacc tgctctttcc 1380 tcggtggggg gccgaggatt attaatcagt ccacaacgac tctaaactca tcgctgccgg 1440 cctccgggcc agcggcaacc aaattttcaa aagagcagtt tcctctttct ccggatgcgc 1500 aaacccgagc actgccagtc tgggacgctc cgccttccag ctcgcagcgc cgggacccct 1560 cgcctcaggc cctcagagtg cagcagaacg gactcctttc tctcactctg gcctcaccgc 1620 gggatccagg gaggaaggta agagaaattc tcttattttc agctcttcct cgggacgcwc 1680 tgcttcccgg gatgagcact cccatcccga gctgcccctc cgctggcacg tcagcgatcg 1740 ctccgatggt gccattagcg cgcgctctgc tggcttggtt agcgccgctc agcgcgttgc 1800 ggtggctcat acggacggtc agactcggct atgcgattca gttcgctatt cgccctccca 1860 agttcacggg tgtccttttc acgagggtga tccccgagag cgcccctgtc ttgcgagagg 1920 agattgctgt cctcctggcg aaggatgcaa tcgagcaggt ccctccagcc gagatgaggt 1980 ccgggtttta cagcccgtat ttcatcgtgc ccaaaaarag cggggggtta aggccaatcc 2040 taaatctatg cctttcagaa tgctcacgca gaaacgcttg attcagagcg tccgtccaaa 2100 ggattggttt gcagccatag acctgaagga cgcatgtttt cacgtctcct ttcttccacg 2160 ccaccgccct tttctccggt tcgcgttcga aggacgagcg tggcaataca aagtcctccc 2220 cttcgggctc tctctgtctc cacgggtttt caccaagctc gcggagggtg ccctagcgcc 2280 cctgcgcctt gcgggcatcc gcatactcag ctatcgtgac gattggctca ttctagcctc 2340 gttccgcgat cagctgatta tgcacagaga caaagtgctt cggcacctcg accagttggg 2400 gtttcaggtc aaccgagaga agagcaaact ttgccctgtg cagaggatct cttatctcgg 2460 gctggagctg gattcggtcg ctatggttgc gcgcctctcc gaggagcgcg ccaggctgat 2520 gctttcctgt gtaaacgagc tccacaggaa gatagtggtc ccactgaaat gttttcagag 2580 gctcctgggg catatggcat ccgcagccgc ggtcacgccg ctcggtttgc ttcatatgag 2640 accacttcgg cgttggcttt gcgatcgagt ccccagacgg gcatggcgcg 
cgggcacgca 2700 ccgggtgcgc gtcactccgc tgtgtctccg caccctcagc ccctggacgg atctggtttt 2760 tctacgggcc ggagtgcccc tagggctagt atccaggcat gttgtcgtaa cgacagatgc 2820 ctccagcatg ggctgggggg ccgtgtacaa cgagcatgca gccgcgggtt cgtggtccgg 2880 accccgcctg cattggcata tcaactgcct ggagctgttg gcagtgtatc tagctctccg 2940 ccgcttttta ccggtgctgg agcggaaaca catgctggtc aggacggaca gcatggcgac 3000 ggtggcctat atcaaccgta tggggggtat acgctctcgc cgcatgtctc agctcgctcg 3060 ccgtctgctc ctttggagtc acacgcggct gaaatcgctg cgtgccatcc acattccggg 3120 cgagctcaac cgtgcagcca gtgcgctctc acggcagcta gtgtcccgag gggagtggag 3180 actccacccc gattcggtcc agctgatatg ggcgcgcttc ggggaagccc agatcgatct 3240 gtttgcttcc accgagaacg cacattgcca gctgttttat tccctgaccg aggcccccct 3300 cggcacggat gcactggctc acagctggcc atcgggcacg cgcaaatatg cgtttccccc 3360 agtgagccta atcgcacaaa ctttgtgcaa agtcagggag gacgaggagc aggtcttgtt 3420 tgttgcgccc ctctggccca accggacctg ggtttcggag ctcacactcc tcgtggcggc 3480 ccctccttgg cgcattcccc taagggagga cctcctctct cagggacggg gcaccatctg 3540 gcacccacgc ccagatctct ggaacctcca cgtgtggtcc atagacggag cgcggaagac 3600 ttaggtgact taccgcccgc ggtacttaac accatcactc aggctagagc accctctacg 3660 aggcatgcct acgccctgaa gtggagtcta ttctctgagt ggtgcgcttc tcgccgagaa 3720 gacccccgaa cttgccagat tagcattgtg ttatccttcc ttcaggataa gctggagcgc 3780 gggctgtcac cctccacact gaaggtttac gtggctgcga tctccgctca tcatgacgcg 3840 gtagatggca acacgctcgg gaagcatgat ctaatcatcc gattcctcag aggcgcgcgg 3900 cggttaaatc cgtcccgccc ccctctcatg ccctcttggg atctctctct agtcctagcg 3960 ggtctgcaga gagatccgtt cgagccactc gagtcagtat ctcttaaaat tctgtcatta 4020 aagacagctc tgctgatcgc attggcgtca ttcaagagag ttggggatct ggaggcattt 4080 tcggtcagcg aatcgtgcct tgaattcggg cccggttact ctcacgttgt cctgagaccc 4140 cggcctggct atgtgcccaa ggttcctacc accccattta gagatcaggt ggtgaacctg 4200 caagcgctgc cttcggagga ggcaggctca acccactcac tgctttgtcc ckttcgcgct 4260 ttgcgccttt acgtggaacg aacgcaaaat gtaagatcat gtgaacagct ctttatctgt 4320 tacggtggtc grcagaaggg aagtgccgta tcaaaacaga ggttggccca ctggttagtt 
4380 gatgccatcg ccctcgctta tcaatgccag ggcgagccgc gccctcctaa cgtgagagcg 4440 cactctacwa gagstgtcgc ttcctcatgg gcgttatcac gcggcgcctc tctcacagat 4500 atctgcagag ctgcgggttg ggcgacacct aacacattcg cgaggttcta taatctttga 4560 gtggagccag ttttctccca gttattggta accccaatca atcgggggga attaagctcg 4620 gtgtcacaaa cgcttgctgc gccatgctcc ctaacccgga gatgcgtgcg ctttattcta 4680 ctctgctagt aagtttccct tctcaggcga accctagttc ctccgaggcc cccatcatcg 4740 actcagcgga ggagtcgaat gcatggctca gtgtgcggtt ggtacgccca tttggtctac 4800 acgcatattg aggatcagct atgtgcatcc ccacttggtg atgccatatg cattattacc 4860 acggtgtgtt cccccttatc aggcggtccc gtgtcttccc taaccgctaa ccagcttatc 4920 atatgtagca ctccccctca ttagggctag tccatatgcc tccttaccat caggtctccc 4980 cttctgggta gaaggtggtc tccaccgcgt cctccccctg cggggactga cgcttcccaa 5040 cgtactgtcg tatttccaaa aatccctagt ctatattagc taggtaaaag cacacttaca 5100 cccccgatta acttaaacat ttattcccat gtaatggtaa tatgttgggc cgaggggacg 5160 ttggaaggtt gcgctcttgg tgatgtcagt gcgctcacgc tttgcttggc aaactacaca 5220 tcaggagcgt gacgggcttg gttgccgtgg cgctttccat acagtctccc aaagtctgtt 5280 tatacagaca cacgtcgaag ttcccatatg aaggggaacg tccaggttac gtacgtaacc 5340 cacgttcccc gaatagggaa cggagacgtg tgtctccact gccacagcac ctgagtctcc 5400 agctgggagc tgagcgatcg gctcttcagc aggcaaaatt ctgacgagct aactcacgac 5460 taatatagcc ctaattggct catcgtttca gctgtggagc taagctccgc caattcaatt 5520 ggcatttcat tggcccgttt ttatatcttc agaaaagatt ggtcgtctaa agcactccca 5580 aagtctgttt atacagacac acgtctccgt tccctattcg gggaacgtgg gttacgtacg 5640 taacctggac gttc 5654 // ID SINE3-1a repbase; DNA; ZEB; 570 BP. XX AC . XX DT 22-OCT-2004 (Rel. 9.09, Created) DT 22-OCT-2004 (Rel. 9.09, Last updated, Version 1) XX DE SINE3-1a is a SINE retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; SINE; 5S rRNA; KW polIII; SINE3-1; SINE3-1a; conserved. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RA Kapitonov V.V. and Jurka J.; RT "SINE3-1, a novel class of SINE elements that utilize 5S rRNA."; RL Repbase Reports 2(6), 22-22 (2002). XX RN [2] RA Kapitonov V.V. and Jurka J.; RT "A novel class of SINE elements derived from 5S rRNA."; RL Mol. Biol. Evol. 20, 694-702 (2003). XX RN [3] RP 1-570 RA Jurka J.; RT "Direct submission."; RL Direct submission (September 2004). XX DR [3] (Consensus) XX CC SINE3-1a is a consensus sequence of a SINE3-1 subfamily. CC The SINE3-1a and SINE3-1 consensus sequences are 84% identical CC to each other. XX SQ Sequence 570 BP; 124 A; 149 C; 155 G; 141 T; 1 other; cagctagctc tctgcaactc tcacatggtc gcccactgaa gctaagcagg gctgcgcccg 60 gtcagtacct ggatgggaga ccacatggga aagctaggtt gctgccggaa gtggtgttag 120 tgaggccagc agggggcgcc caacctgcgg tctgtgtggg tcctaatgcc ccagtatagt 180 gacggggacn ctatactgct cagtgagcgc cgtctttcgg atgagacgtt aaaccgaggt 240 cctgactctc tgtggtcgtt aaaaatccca ggatgtcctt cgaaaagagt aggggtttaa 300 ccccggcatc ctggccaaat ctgcccactg gcctctgtcc atcatggcct cctaaccatc 360 cccatatcta attggcttca tcactgtctc ctctccacca atcagctggt gtgtggtgtg 420 cggtctggcg caaaatggct gccgtcgcgt catccaggtg gatgctgcac actggtggtg 480 gatgaggaga ttccccccaa tgtgtaaagc gctttgagtg cccagaaaag cgctatataa 540 atgtaaggaa ttattattat tattattatt 570 // ID Gypsy10-I_DR repbase; DNA; ZEB; 6147 BP. XX AC . XX DT 07-JAN-2005 (Rel. 10, Created) DT 07-JAN-2005 (Rel. 10, Last updated, Version 1) XX DE An internal portion of the Gypsy10_DR LTR retrotransposon - a DE consensus sequence. XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW GYPSY superfamily; Gypsy10-I_DR; Gypsy10-LTR_DR; Gypsy10_DR; KW endogenous retrovirus; gag; integrase; reverse transcriptase.
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6147 RA Kapitonov V.V. and Jurka J.; RT "Gypsy10_DR, an LTR retrotransposon from zebrafish."; RL Repbase Reports 4(12), 313-313 (2004). XX DR [1] (Consensus) XX CC Gypsy10-I_DR is an internal portion of the Gypsy10_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy10-LTR_DR. Gypsy10_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one 1988-aa CC polyprotein composed of gag, protease, reverse transcriptase, CC and integrase domains (pos. 142-6105). XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy10-I_DR1p" FT /translation="MSFRSDPQLQSELLEWCREAVIDPAHALLLTGVPQNT FT ETACIEKVAESVKLFGRVRVRDSKGGATPTTLLVLCECREVVDPTNCPEEL FT HPTDGEEAWMIILATERESAHVAPAEFADKLSKFLKDEGKSMTDVHALFSP FT QSANPSSPESIIRAVGEILEKTVRPSSDGSAYRRLRTFSGIVPTPVGEETM FT DHWIEQAKLMISECECSEKEKRRRIVESLKGPALEIIKAVRMSCPDAEAVK FT YVEALESTFGSSESGEDLYFAFRLLRQCPGESLSAFLRRMEKSLTKVIQKG FT GLTSANADKARVEQLIRGAVESDMMLLQLRLRERKENPPTFLSLLNEVREA FT EEIEATRRKITATAKPIHLQEESICPTVVHELKAEIQELRAQIKSDRSNIA FT SSMMIENRVKSRQLSPTDSKEVVKETEVQVLKKQVQQLQQQLAIMSVGQSN FT STSHAQLPQTPVSSSSRHQLSKPKADYFCYRCGEEGHIATKCKAPENATLV FT INKLVRSLKRAKGTKSDPNSHAEDRACFSKKSQIHSCESKGLPKGLVGPAS FT TIAVKVGGHPCCALWDSGSQVTIVFDSWYSEHLSNVPILPLSGLSIWGLSS FT SNYPYKGYIVVDVTFPVALTGAEETVTILALVCPDPKGPQQFPLIIGTNAS FT FFQRMTNGEGTNITRSAHSLRIQTHLDTTSFAVDQPEGQVRWKGPGMLNVP FT SQGERYASCKIESDKPLRKDIFLIETSPVDSLPAGLLVSPVVFSSSAVDVN FT NFRILIQNETSKELSVPPGTVVAQIFPTDTVTVAHGVTESNDQINPDVFNF FT GDSNIPTAWEKRLRLKLAERKNVFSTQEWDVGLAKGVTHQIRLHDPHPFRE FT RSRRIAPADIDDVRRHLKDLLAAGIIEESRSPYASPIVIVRKKSGAVRMCI FT DYRTLNSRTTPDQYVTPRIDDALDCLAGSKWFSVLDLRSGYYQIAMSEEDK FT EKTAFICPLGFYQFQRMPQGITGAPATFQRLMEKVVGDMHLLQVIVYLDDL FT IVFGSTLEEHEERLMKVLDRLEEWGLKVSIDKCQFCQPKVKYVGHIVSAAG FT 
IAPDPEKVAAVTQWKEPTDLKSLRSFLGFCGFYRRFVKNYSSIVRPLTELT FT KGYPPAKGKSKVEGKKYFKETDQFGERWDNACKQAFQEIIKCLTQAPVLAF FT ADPSLPYVLHIDASLSGIGAVLNQEHSDGLRPVAFASRKLSASEQRYHIHQ FT LEFLALKWAVVDKFHDYLYGVPFVVRTDNNPLTYVLTSAKLNATGHRWLAA FT LATYNFSLQYKPGKHNIDADVLSRYPAEPATSFSWTEIPQSGVKAICQLSN FT LSWSDEKSRLIDQLGVSPGSIPAVYSCPALLDVCHLEQLTHADLKLSQEQD FT PVIGKVKQDIARNKPLTPIKGSDPTLTLLQRQGPKLVIRNNLLYRVTKNQS FT GKEKVQLVLPEKYHLTVLRSLHDDSGHLGVEKTTELLRDRFYWPRMTSDIE FT QYIKNCGRCITRKTLPQKSAPLSHITSSGPLDLVCIDFLSLEPDSKGIANV FT LVITDHFTRYAQAFPTKDQRAVTVAKVLVEKFFVHYGLPSRIHSDQGRDFE FT SRLIQELLGMLGIRKSRTTPYHPQGDPQPERFNRTLLSMLGTLEPAQKSKW FT SQHITQLVHAYNCTKNEATGYSPYQLLFGREARLPIDICFGISPAGEKGVT FT HLQYVEKMKAELQQAYQLAAETSLKAHQRNKKLYDTRVKPQLLTVGDRVLI FT RNLAVKGKNKLQDRWNSLPYVVVEKFKDLPVYKLRPERGMGAIRTMHRDHL FT LPVGENVRFSKPNDSNPSTQSPVTRAQSGKRVQKEKKVENVEQVRDENHET FT SESEDDNLCYYYPKLIPALRTLPTPQIAVEPEIPAEYGPELNSECEARKDT FT VEEEAREVLNQPAEAVDDQGVGDNDHRNVPKPGSEPAVCRKSTREVKPVIK FT LSYDDLGRPTDKPLTMVHRGMVVHIEDLSKTRKSCNTVWCHPMAQCSQCVP FT TAPGPIVRTVIQF" XX SQ Sequence 6147 BP; 1843 A; 1324 C; 1415 G; 1565 T; 0 other; caaattgggg gctcgtccgg gatacactct caccttttcc tgatacacta agaagagtaa 60 attaaacacc ttaatacacg ttaagaacac tgaagaggaa aaactgacac catcctgtaa 120 ctaactgctt attattgaga aatgtctttc agaagtgacc cacaactgca gagtgaactt 180 cttgagtggt gcagagaagc agtaatcgac ccagctcatg cgttgttgct gacaggagta 240 ccacaaaaca ctgagactgc ttgcattgaa aaagtggccg agagtgtgaa gctttttgga 300 cgagtacggg ttcgagactc aaaaggtgga gccactccga caactctgtt agtgctgtgt 360 gaatgcagag aagttgtaga tcccactaat tgccctgagg aattgcatcc tacagatggt 420 gaggaagcct ggatgattat cttagctacg gaaagagaat cagctcatgt tgctccagca 480 gagtttgctg acaagctctc caagtttctg aaggatgaag gcaagtctat gactgatgta 540 catgctctat tttctccaca gagtgcaaat cccagttcac ctgagtcaat aattcgtgca 600 gtgggtgaaa ttcttgaaaa aacagtaaga ccatcgagtg atggaagtgc ctaccgtcgt 660 ttacgtacct tctctggtat tgttccaacc cctgtaggag aagaaactat ggatcactgg 720 attgaacaag ctaagttgat gatatctgaa tgtgagtgct ctgaaaagga gaaacggaga 780 agaattgtgg 
aaagtttaaa gggaccagcc ctggaaatca ttaaagctgt ccgtatgtca 840 tgtcctgatg ctgaagcggt gaaatatgtg gaagctctgg aaagtacttt tggatcttct 900 gagtctggag aagacttata ctttgctttt cggcttctta gacagtgtcc tggtgagtca 960 ctttctgctt ttctgagaag aatggagaaa tcactgacta aagtcatcca aaaaggggga 1020 ctgacttctg ctaatgctga taaggctaga gtagagcaat tgattcgagg agctgttgaa 1080 tctgacatga tgctattgca gttgcgatta agggagcgga aagaaaatcc accaactttc 1140 ctgagtctgt taaatgaggt tcgtgaggct gaggagatag aagccactcg acgcaagata 1200 actgctactg caaagcccat acacttgcag gaagaaagca tttgtcccac tgttgtacat 1260 gaacttaaag cagaaattca agaattgagg gctcagataa aaagtgatcg ttcaaatatt 1320 gcttcatcca tgatgatcga gaacagagtg aaatcacgcc aattaagccc cacagactca 1380 aaggaagtag tcaaagagac tgaggttcag gtgttgaaaa aacaagtaca gcagttacaa 1440 caacagctag ctataatgag tgttggtcaa agtaattcaa caagccatgc ccagttacca 1500 caaacccctg tttcaagttc atcgcgacac cagctttcaa aacccaaagc tgactacttt 1560 tgttaccgat gtggtgaaga gggacacatc gcaactaaat gtaaagcccc tgaaaatgcc 1620 actcttgtaa tcaacaagct agtacgttcc ttaaaaaggg ctaaaggaac aaagagtgat 1680 ccgaacagcc atgctgaaga tagagcttgt ttttcaaaaa agagccagat acacagctgt 1740 gagtcaaagg gtcttcccaa gggtttagtt ggaccagcgt ctaccattgc agtgaaagta 1800 ggaggacatc catgctgtgc tctttgggac agcgggtctc aagtcactat tgttttcgac 1860 tcctggtatt ctgaacacct gtcaaatgtg cccatacttc ctctttctgg cctatccatc 1920 tggggcctaa gttcatccaa ttatccctat aaaggataca ttgtagttga tgtcacattc 1980 cctgttgctc ttactggtgc agaagaaaca gtcactatcc ttgctttagt ctgcccagac 2040 cctaaaggac cacagcagtt tccattaatc atcggaacca atgctagctt cttccagcga 2100 atgacaaatg gtgaaggtac taatatcaca cgcagtgccc actcactcag aattcaaaca 2160 cacctggaca ccacatcttt tgctgtcgac caacctgagg gccaagtgag atggaaaggc 2220 ccaggtatgc ttaatgtccc atcacaaggt gagcgatatg cttcatgcaa aattgagtct 2280 gacaaacctt tgagaaaaga catctttctc attgaaacct ctcctgttga ttcccttcct 2340 gctggactgc ttgtttcccc tgtcgttttt tcttcatcag cagtggatgt aaacaatttc 2400 aggatcttga ttcaaaatga gaccagcaaa gaactctcag ttcctccagg gactgtagtt 2460 gctcaaatat tccctacaga 
tacagtcact gttgctcatg gagttacaga gtctaacgat 2520 cagattaatc ctgacgtgtt taactttggt gattcgaaca tacctacagc ttgggagaaa 2580 agattacgct tgaagctggc tgagcgaaag aatgtgtttt cgacacagga atgggatgta 2640 ggcttggcta aaggagtcac ccatcaaatt agactgcatg atcctcaccc attcagagag 2700 cgttcaaggc gcattgcccc agccgatatc gatgatgtca gaaggcatct gaaagatctt 2760 ctagctgctg gtatcattga ggagtccaga agcccatatg catcgcctat agtaatagtg 2820 cgcaaaaaga gtggtgctgt gagaatgtgc attgattacc ggactctaaa cagtcgcact 2880 acacccgatc aatatgtcac ccctcgcatt gatgatgcat tagactgcct agcgggaagc 2940 aaatggtttt cagttttgga tttgcgaagc ggctattacc aaattgcgat gtctgaggaa 3000 gacaaagaaa aaactgcatt catttgccca ctggggttct accagtttca acgtatgcca 3060 caaggaatca ctggggcccc agcgacgttt caaagattaa tggagaaagt ggttggagat 3120 atgcatctat tacaagtgat tgtctatctc gatgacctca ttgtctttgg aagcacactg 3180 gaagagcatg aggagcgatt gatgaaagtc ctcgaccgac tggaggaatg ggggctgaaa 3240 gtgtctatcg acaagtgcca gttctgtcag ccaaaagtca agtatgtggg acacattgtt 3300 tctgctgcag gaatagctcc agaccccgag aaagtagctg cggtgactca gtggaaagag 3360 cctactgacc tgaaatcttt aagatctttc cttggatttt gtggatttta ccgccgtttc 3420 gttaagaatt actcctccat tgtaagacct ctgacagagt taacgaaagg ttacccacct 3480 gcaaaaggaa agagtaaggt ggaaggaaag aagtacttca aggagactga tcaatttggt 3540 gagcgttggg acaacgcatg taaacaagct tttcaagaga taatcaagtg tttaactcag 3600 gcacctgtac ttgcctttgc tgatccatcc ctaccatacg tactccacat tgatgcaagt 3660 ctgagcggga ttggtgctgt gctgaatcaa gaacactctg atggacttcg accagttgct 3720 tttgctagca gaaagttgag tgcttcagaa cagagatatc acatacacca gctagagttc 3780 cttgcattaa agtgggctgt tgtagacaag tttcacgatt acctgtacgg agttccattt 3840 gtcgtgagaa ctgataacaa tcctctaact tacgtactga caagtgcaaa gttgaatgca 3900 actggtcata gatggctagc agccttggca acatataact tcagcctgca gtataagcca 3960 ggcaaacaca acattgatgc tgacgtgctt tcccgttatc ctgcggaacc tgccacttcc 4020 ttctcttgga ctgaaattcc acagtctgga gtaaaagcta tttgtcagtt gtccaacttg 4080 tcttggagtg atgaaaagtc cagactgata gaccagttag gtgtttcacc tggtagcatc 4140 cccgctgttt actcttgtcc tgcgttgctt 
gatgtttgtc acctagaaca gttgactcat 4200 gctgacctaa aattgtcaca agaacaggat cctgttattg gcaaagtaaa acaagacatt 4260 gcacgaaaca agccactcac ccctataaaa ggttctgatc ctaccctcac tctcctgcaa 4320 cgccaaggtc ccaaacttgt cattcgaaat aatctgttgt acagagtcac caaaaaccaa 4380 agtggaaaag agaaagttca actagtgttg ccagagaaat accatttaac agtgctgcgg 4440 tctctgcatg atgattctgg tcacctagga gtagagaaga ccacagagtt actgagagat 4500 cgtttttact ggccacgtat gaccagtgac attgagcaat acatcaaaaa ttgtggtcgt 4560 tgtattacac gcaagacctt acctcaaaag tctgccccat taagccacat taccagcagt 4620 ggtccactag acttggtttg tatcgacttc ttgtcccttg agcctgatag taagggtatt 4680 gctaatgtgc tagtaataac tgaccacttc acccgctatg cacaagcatt tcctactaaa 4740 gaccagcgag ctgtgacagt tgccaaagtg ttggtagaaa agttttttgt tcattacgga 4800 ttgccctcac gcattcattc cgatcaagga agagactttg aaagccggtt gatacaggaa 4860 cttctgggaa tgttggggat ccgcaaatca cggaccacac cttaccaccc acaaggtgac 4920 ccacagccag aacgctttaa ccgtaccctt ctgtcaatgc taggtactct ggagccagcc 4980 caaaaaagta agtggagtca acacatcact cagcttgtac acgcttataa ttgtacaaaa 5040 aacgaggcta caggctactc cccttatcag ttgctttttg gaagagaggc tcgtctgccc 5100 atagacatat gttttggcat ttcacctgct ggtgagaaag gagtgactca tctgcaatac 5160 gttgagaaaa tgaaagctga actgcaacaa gcatatcagc tggcggctga gacttcgttg 5220 aaggctcatc aaagaaacaa gaagctctat gacacaagag tgaaacctca actgttgact 5280 gttggagaca gagtgctcat tcgaaatctt gctgtaaaag gaaagaacaa actccaggac 5340 agatggaatt ccttaccata tgtggttgtg gagaagttta aggacttacc ggtctataaa 5400 ctgaggcctg agcgtggaat gggagcaata aggacaatgc atcgagacca cttgttacct 5460 gttggagaga atgtgagatt cagtaagccg aatgactcca atccctcaac acagtcacct 5520 gtaacaagag cacaatcagg aaaaagagta caaaaggaaa agaaagttga aaatgttgag 5580 caagtaagag atgaaaacca tgagacatct gagagtgagg atgacaacct ttgttattac 5640 tacccaaagt taattcctgc tctgagaact ctgccaacac ctcaaattgc tgtggaacct 5700 gaaatacctg ctgaatatgg tccagaactg aattcagagt gtgaagcgag aaaagataca 5760 gtggaggagg aggcgagaga ggttctcaat cagcctgctg aagctgtgga tgatcaaggt 5820 gtaggagaca atgaccacag aaatgttccc aaaccggggt 
ctgaacctgc tgtgtgtcgt 5880 aagtctacga gagaagtgaa accagtgata aaactgagtt atgatgattt gggccgaccc 5940 actgataaac cattaactat ggttcaccga gggatggtag tacacattga agatttgtca 6000 aagacccgaa agagctgcaa cacagtttgg tgtcacccca tggctcagtg ttcccagtgt 6060 gtccctacag cccctggccc tattgtcaga acagtaattc aattttaaat gtctcatgag 6120 ggcatgagaa gtttagaagg gggagga 6147 // ID CR1-1_DR repbase; DNA; ZEB; 4985 BP. XX AC . XX DT 02-MAY-2002 (Rel. 7.04, Created) DT 02-MAY-2002 (Rel. 7.04, Last updated, Version 1) XX DE CR1-1_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW AP endonuclease; CR1 clad; CR1-1_DR; CR1DR1; ORF1; ORF2; KW Non-LTR retrotransposon; reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 311-4303 RA Jekosch K.; RT "CR1DR1: CR1-like repeat from Danio rerio."; RL Repbase Reports 2(2), 7-7 (2002). XX RN [2] RP 1-4985 RA Kapitonov V.V. and Jurka J.; RT "CR1-1_DR, a family of CR1-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(4), 6-6 (2002). XX DR [2] (Consensus) XX CC CR1-1_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish recently. The consensus sequence encodes CC two proteins, CR1-1_DR1p (position 311-1216) and CR1-1_DR2p CC (positions 1220-4300). CC The 1027-aa CR1-1_DR2p protein is composed of AP endonuclease CC (aa positions 72-225) and reverse transcriptase (aa positions CC 559-796). The 302-aa CR1-1_DR1p protein is distantly related to CC proteins encoded by ORF1 in known CR1-like elements from chicken, CC pufferfish and turtle. CC The consensus sequence [2] was reconstructed from 10 copies CC that are ~1% divergent from it. CC Approximately 1000 copies of CR1-1_DR are present in the CC zebrafish CC genome. 
XX FH Key Location/Qualifiers FT CDS 311..1216 FT /product="CR1-1_DR1p" FT /translation="MSLPSLSLCAGEASMEALELELEEVESQIRALVVRRS FT RLREQLLVVPNAKAVSSPKVRGNYNHIIPSTSTPRPSLSRPSAPGARLSQA FT SFTPTPGYHGAWVQPRKVLPRSRGRTSPPVFEISTENRFSPLRESGPDVAI FT IGDSIVRHVRAASSKGNKVRTFCFPGARVRNISTQIPTILGAAESPGAVVL FT HVGTNDTGLRQSEILKKDFRSLIETVRRTSPATQIIVSGPLPTYRRGNERF FT SRLLALNEWLITWCKEQKLLFANNWNLFWERPRLFRPDGLHPSRAGAELLS FT DNISRLLRTI" FT CDS 0..0 FT /product="CR1-1_DR2p" FT /translation="CVSGACLDNSVFIRCLQNKQEPSVSVAFSTQICVLLR FT DRKPKTLSDRIANHSNLVSIKCISETSVVKTTKLAGKNSQNSHYSHLDSCS FT PHLNISNAYLANPIETVSVPRIIRLRNKRTVCSRKNLVRIKPEKPVESENT FT NFVKLGLLNIRSLAPKALIINEIITENNLNALCLTETWLKQNDYISLNEAT FT PPGFLYKHEARQTGRGGGVASIFSDFLNIKQRNGLMFSSFEVLSLNVQLPD FT TIQKPMLSLALITIYRPPGPYVKFLKEFSDFISDLLVKTDKMLIVGDFNIH FT IDDANDTLGLAFMDLIHSLGIKQNVVGPTHRLKHTLDLILSYGIEVIDVDI FT IPQSDDITDHYLLLYKLCLPEISKPAPILRPSRTIVPSTKDEFINNLPDLS FT LFRNAPANSNDLDVVTSSMDAIFTSTLNTVAPIKLKKAREIKTIPWYNSHT FT RALKTATRALERKWKKTNLEVFRIAYKDSMSSYRRALKSARTEHLRKLIEN FT NHNNPRFLFNTISKLANNRSSLEQTTPPQISSDDFMNFFSNKIEGFRQKIG FT DAKLSAPAYTPNPVNISLNHNNNLHCFKIIEHEELVKIINSSKPATCMLDS FT IPTKLLKELLPAIGEPLLNIINSSLSIGHVPNSYKLAVIKPIIKKPQLDTN FT NLANYRPISNLPFMSKILEKVVSTQLCSFLQTNNIFEVFQSGFRAHHSTET FT ALVKITNDLLLAADRGCVSLLVLLDLSAAFDTIDHNILINRLKSTGVQGQA FT LQWFKSYLTDRYQFVNLNGQPSQICPVKYGVPQGSVLGPLLFTIYMLPLGD FT IIRRHGISFHCYADDTQLYISTKPDETSELSKLTECIKDIKDWMTNNFLLL FT NSDKTELLLIGPKSCTQQISQLNLQLEGYKVSFSSTIKDLGVILDSNLTFK FT NHISHVTKTAFFHLRNIAKLRNMLSISDAEKLVHAFMTSRLDYCNALFAGC FT PASSINKLQLVQNAAARVLTRSRKYDHITPILSSLHWLPVKFRIEFKILLL FT TYKALNNLAPVYLTNLLSRYKPTRSLRSQNSGLLVVPRIAKSSKGGRAFSF FT MAPTLWNSLPDNVRGSDTLSQFKTRLKTYLFSKAYTQCIT" XX SQ Sequence 4985 BP; 1464 A; 1202 C; 916 G; 1403 T; 0 other; cgtcactggc gtcactgtct ccgttcggtc acatactgcg tgtgcttgaa gtttggactt 60 gctatttact cgcaatttaa atcttaatca taaaattctt cctcattcgc taagtatttg 120 tctctcctac ttagggtgac caaaccctta atacatttat acaaacaaaa ctgctttaaa 180 aacggtctgt ccctcgagca tccgcctgtt gtttgtagct ttagcctgct agcgccgctg 240 
gtcagctaaa gctaccgacc tcttttacca tacacttttg acttactggc tttgctcttt 300 accccgtaaa atgtcgcttc cgtctctgtc cttgtgtgca ggagaagcat cgatggaggc 360 gttggagctg gagctggaag aagtggagtc ccagatccgc gcgctggtgg tgagacggtc 420 gcggctacgg gaacaactcc ttgttgtacc taatgctaag gccgtctcat cacctaaggt 480 acgtggaaat tacaaccaca tcattccctc tacctcaacc ccgcgtcctt ctctgtccag 540 gcccagcgca cccggggcgc ggctcagcca ggcgtcgttc acgccgacac ccggctacca 600 cggcgcctgg gtgcagccgc gcaaggtgct tcccagatcc cggggcagaa cgtctcctcc 660 tgtgttcgag atctccacgg agaaccgctt ctcccctctc cgcgagtcgg gtcccgatgt 720 ggccatcatc ggtgactcga tcgttcgtca cgtccgtgcc gcctcctcaa aaggtaataa 780 agtacgtact ttctgctttc ctggtgcccg tgtgagaaat atttctacac agattccaac 840 catcctgggc gctgccgaga gccctggtgc cgttgtcctc cacgtgggga caaacgacac 900 cgggctccgg cagtcggaga tcctgaagaa ggacttcagg agcctgatcg agacggttcg 960 acgcacctcg cccgccacgc agatcatcgt ttctgggccg cttcctacct accgccgagg 1020 aaatgaaagg ttcagtagac ttttagctct gaatgaatgg ctaataacat ggtgtaaaga 1080 acagaaattg ctctttgcta ataactggaa tcttttctgg gagcgtccta ggctcttccg 1140 tcctgacggc ctgcacccca gtcgagccgg agctgaactc ctgtcggaca acatctccag 1200 attacttcgc accatctgac tagcaggtaa aaattcacaa aattcacact atagccacct 1260 agactcttgt tcaccccact taaacatcag taacgcatat ctggcgaatc ctatagagac 1320 tgtgtctgtt cctcgtatta ttagattaag aaataaacgt actgtgtgct ccagaaaaaa 1380 tctagtaaga atcaaaccag aaaaaccagt agaaagtgaa aatacaaatt tcgtaaaact 1440 tggtctccta aacatcaggt cacttgcacc taaagcactt atcattaatg aaataataac 1500 agaaaacaat cttaatgcac tctgtctcac tgaaacctgg ctgaaacaaa atgactatat 1560 tagcttaaat gaagcaactc ctccaggatt cttatataaa catgaggctc gtcaaactgg 1620 tcgtggtggt ggagttgcat caatctttag tgatttcctt aatattaaac agagaaacgg 1680 acttatgttt agctcctttg aagtattatc gcttaatgtt cagcttccag atactataca 1740 aaaacctatg ttatctctcg ctttaatcac catatataga cccccaggac cctatgtcaa 1800 atttctaaaa gaattttctg attttatttc tgacttacta gtcaaaactg ataaaatgct 1860 aattgtaggt gactttaaca tccacataga tgacgctaat gatacattag ggctcgcgtt 1920 tatggattta atacactcac 
ttgggataaa gcaaaacgtt gtgggtccaa cccatcgctt 1980 aaagcataca ttagatctaa ttctgtctta tggaatcgag gttattgacg tagacattat 2040 accacaaagt gatgatatta cagatcacta cctcttacta tataagctgt gtttacctga 2100 aatcagcaaa cccgctccaa tactccgccc tagtagaact attgttccgt caactaaaga 2160 tgaatttata aataacttac ctgatctttc tctatttcgt aatgcacccg caaactcaaa 2220 tgatcttgat gtagtaacca gcagtatgga tgccatcttt actagcacac taaatactgt 2280 ggcacccatc aaattaaaaa aggctagaga gattaaaact ataccatggt ataatagtca 2340 tactcgtgcg ctcaaaacag caacccgtgc cctggaacgt aaatggaaaa aaactaattt 2400 agaggtcttt agaattgcgt acaaagacag tatgtccagc tataggaggg ctctaaaatc 2460 tgccaggacc gagcacctgc gcaaactgat agaaaataat cataacaatc ctagattttt 2520 atttaacacc atctctaaat tagcaaataa tcggtcatcc ttggaacaaa ctactccacc 2580 gcaaattagt agtgatgact tcatgaattt tttcagtaat aaaatagaag gctttagaca 2640 gaaaatagga gatgccaaac tttctgcacc ggcttatact ccaaatcctg taaatatttc 2700 attaaatcat aataataacc tacactgctt caaaatcata gaacatgaag agttagtaaa 2760 aattataaat agctctaaac cagctacgtg tatgctggac tcaattccaa caaaattact 2820 gaaagagctg ctacctgcta taggagaacc tcttcttaac attatcaact cttctttatc 2880 tataggccat gttccaaact cttacaagct agctgttatt aagcctatta ttaagaaacc 2940 gcaactagac accaacaact tagctaacta taggcctatt tcaaatcttc catttatgtc 3000 taaaatacta gaaaaagttg tttccactca attatgctct tttctgcaga cgaacaatat 3060 ttttgaagtg tttcagtcag gtttcagggc tcaccacagt acagaaaccg ccttagtgaa 3120 aataaccaac gatttactct tagctgctga ccgagggtgc gtctcgctat tagttttact 3180 cgatcttagt gcggcatttg ataccattga ccacaatatc ctcataaatc gcttaaagtc 3240 tacaggtgtc cagggacagg ctctacaatg gtttaagtca tacttaactg accgctacca 3300 gtttgtgaat cttaatggac agccttcaca aatctgccca gtaaagtatg gggtgcctca 3360 aggatcagtt ttaggccctt tactgtttac aatttacatg ctacctctgg gagacattat 3420 tagaagacat gggatcagct ttcactgcta tgcagatgat actcaattat atatttcaac 3480 taaacctgac gagacgtctg aactttctaa actaactgag tgtatcaaag acatcaaaga 3540 ctggatgacc aacaattttc ttctcttaaa ctcagacaaa acagaattat tacttattgg 3600 gcctaaatct tgcacacagc agatctcgca 
actcaattta caattagagg gatacaaagt 3660 tagctttagc tctactataa aagatctggg tgtcatatta gacagcaatc taacttttaa 3720 aaaccatata tcccatgtca caaaaactgc cttctttcat ctgagaaata tcgctaaatt 3780 acgaaatatg ctatccatct cagatgcaga aaagctagtc catgctttta tgacttcgag 3840 actggattac tgtaatgctc tatttgctgg ctgcccagca tcctctatta acaaacttca 3900 attagtacaa aatgcagcag ccagagttct gaccaggtct agaaaatatg atcatataac 3960 cccaatttta tcctccttac actggctgcc tgttaagttt cgtattgaat ttaaaatatt 4020 acttctcacc tataaagctc taaataatct agctcctgtt tatctaacca accttctgtc 4080 tcgctacaaa ccaactcgct ctttaagatc tcaaaattca gggcttctgg tagtacctag 4140 aatagcaaaa tcaagtaaag gaggtcgagc cttctctttc atggctccta cactctggaa 4200 tagccttcct gataacgtcc gaggctcaga cacactctcc cagttcaaaa ctagattaaa 4260 gacctatctg tttagtaaag catacactca atgcatcacc tagcgggttc cacacaggct 4320 tctgcatctt gcttatatac actatgaaca gcagctacgc taattattct ctttattctc 4380 tattttcacc tggggatact catcccgagg tcctcagatt atgcggagtc actgattgga 4440 tccaagacca gcgacgtgat gatcccaagg attccatatc cgggaccagg ccatatcctg 4500 agctgctgct gcgctgatgg tcgtggggag tggagaacat gagtctgatt ccagcgacgc 4560 tccagggaca gacgagtctt cgctgaggcc atcttccagc ctaaaccacg gcgaatgaag 4620 ctctgcacaa gacttttggc cagcggagaa attaaaatgg tcgtgcccaa ctgagtctgg 4680 ttctctcaag gttttttttc ttcactccca tcaggtgaag ttttttttcc ctctccgctg 4740 tcgccactgc ctcgcatggt tcaggattgg tagagctacg catcgatgaa tttgctcttc 4800 agtgtttgaa ctctcagtaa tgattaaatc acactgaact gagctaaact gaactgaact 4860 gaacttaaac actaaaacct gaaccacact gttccagtta ctatgaccat ttatgtgaag 4920 ctgctttgac acaatctaca ttgtaaaagc gctatacaaa taaagctgaa ttgaattgaa 4980 ttgaa 4985 // ID Gypsy-165-I_DR repbase; DNA; ZEB; 4366 BP. XX AC . XX DT 17-NOV-2008 (Rel. 13.12, Created) DT 17-NOV-2008 (Rel. 13.12, Last updated, Version 1) XX DE An internal portion of the Gypsy-165_DR LTR retrotransposon - a DE consensus sequence. 
XX KW Gypsy; LTR Retrotransposon; Transposable Element; KW endogenous retrovirus; Interspersed repeat; reverse transcriptase; KW gag; GYPSY superfamily; integrase; Gypsy-165-I_DR; KW Gypsy-165-LTR_DR; Gypsy-165_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-4366 RA Dib M.R. and Naveira H.F.; RT "Gypsy-165_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 8(12), 2162-2162 (2008). XX DR [1] (Consensus) XX CC Gypsy-165-I_DR is an internal portion of the Gypsy-165_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its long CC terminal repeat is deposited in Repbase as Gypsy-165-LTR_DR. CC Gypsy-165_DR is characterized by 4-bp target site duplications. CC The internal portion encodes two proteins: the 344-aa gag CC Gypsy30_DR1p (pos. 18-1049) and 1131-aa polyprotein (pos. CC 959-4351, conceptual translation) composed of the protease, CC reverse transcriptase, and integrase domains. XX FH Key Location/Qualifiers FT CDS 18..1049 FT /product="Gypsy165-I_DR_1p" FT /note="Gag-protein." FT /translation="MDPEALSTKELLAVIGSHEASFQRHEEVLRRQEEVMT FT KHSELLADVTSSIRQLFQSLPGVSSPASPAAPPLSTNSPPIAPVAAAEPRL FT PPPKPFSGDPSSCQGFLTQCSLTFELQPSSFPTDRSKIAYIITLLTDKALS FT WASAAWESQPSYCQSYTAFEKEFKKVFNHPVSGQEASKRLLTLRQGPRSAA FT DFAIEFRTIAAGSGWNDEALRVCFLGGLAESIQDEMATREPAKDLESLIDM FT AIRLDIRLRERRMTRGRASHSQTPVHKPASPVHAPPVRLLPVNEQAPEPPE FT DMQLGRSRLSPSERDRRMRERRCLYCGMSGHFRSTCPELSGNEGPHRVTGG FT L" FT CDS 959..4351 FT /product="Gypsy165-I_DR_2p" FT /note="Polyprotein." 
FT /translation="MPLLWYVWSFSIHLSRTIGKRGSPPSYRRTVMGKIRV FT PPANTVLALKAILSWESHQFPVQAMIDSGAAGNFIDLSLAKKLKIPTHLLP FT HPQSVTALDGRPLEPGKVTEATQSLKLTIAKHQQEETFYLIDSPEYPVILG FT HPWLHRHNPHINWSTGSILDWSPSCHFTCFTHSLSAPHPEPQDSVDLSQVP FT AVYHKFRAVFSKSRATSLPPHRPYDCAIDLLPGSSPPRGRVFSLSPPEQAA FT MNAYIQESLATGIIRASTSPAGAGFFFVGKKDGGLRPCIDYRGLNKITIRN FT RYPLPLMATAFELLQGASIFTKLDLRNAYHLVRIRQGDEWKTAFNTPTGHY FT EYLVMPFGLTNAPAVFQALINDVLRDMLNIFVFVYLDDILIFSKSMEEHEG FT HVSRVLQRLLENHLFVKPEKCEFHVSLTKFLGYIVTPGHLEMDPSKIKAVL FT NWPIPTTVKEVQRFVGFANFYRKFIRNFSSVVAPLTALTKGGGVKIEWGPK FT AAAAFQDLKDRFTSAPILSIPNPDIPFMVEVDASDVGVGAILSQRNEDGKL FT HPCAFMSRRLSNAERNYHVGDRELLAVKLALEEWRHWLEGARHPFQVLTDH FT KNLEYLQQAKQLNPRQARWSLFFNRFQFILTYRPGSKNLKPDALSRAYSPE FT THEKPVSSIIPSSRIVAPLRWDLEQVVRKAQTKEPDPGNGPLGALYVPRAV FT RAQVLQWGHESVLTCHPGSSRTLEFLRRRFWWPSIKEDVKGYVEACQVCCQ FT GKSSHQRPQGLLHPLPVPHRPWSHLSLDFITGLPLSQGNTVILVVVDRFSK FT AARFIPLPKLPSAKETAELIISHVFRVFGIPQDIVSDRGPQFLSRFWGAFC FT RLFGTTASLSSGFHPESNGQTERVNQDLETTLRCMAANNPTTWSSYIMWAE FT YAHNTLKSSATGLSPFECQFGYSPPLFPEKEVQVGVPSAQHFVRRCRRTWR FT RARSALLRTSLRYQHQANRRRRRPPTFRVGQRVWLATKNLPLRVESRKLSQ FT RFIGPFRIARKVNPVSYRLYLPRSLRINPTFHVSLLKPVLSSPFAPPRRPP FT PPPRIIDGQPAYTVHRILDSRRVQNSLQYLVDWEGYGPEERSWIPAKDILD FT PSLIREFHTQRPGCSGRNVRSRS" XX SQ Sequence 4366 BP; 1016 A; 1224 C; 988 G; 1138 T; 0 other; gaaacaccga gtcagacatg gacccagagg cactcagtac caaagaactg ttggcagtga 60 ttggcagtca tgaagcttcg ttccagcgtc atgaagaggt tctccgtcgt caagaggagg 120 tgatgacaaa acactcagag ctccttgctg atgtcacgtc gtccattcgg caactctttc 180 aaagccttcc tggggtttct tctcctgctt ctcctgctgc accaccccta agtaccaaca 240 gtcctcccat agctccagta gctgctgcag agcccagact tccacccccc aagccattct 300 ctggagatcc tagttcctgt cagggatttc tcacccagtg ctctctcact tttgagcttc 360 agccctcaag ttttcccact gaccgctcaa agatcgctta tattatcacc cttctgactg 420 acaaggcatt gtcttgggcc tctgcggcat gggagtccca gcctagttat tgccagtcat 480 acacagcctt tgagaaggaa ttcaagaagg tgttcaatca cccagttagt ggacaggagg 540 cttccaaacg tctccttact ctccgtcaag gtcctcgcag cgctgcagac ttcgccattg 
600 aatttcgaac tattgcagca ggtagtggat ggaatgacga ggccttaaga gtctgctttc 660 tgggcggatt agctgaatcc attcaagatg agatggccac ccgggaacca gccaaagacc 720 tagaatccct tattgatatg gccattcgcc ttgatattcg cttgagagaa cggagaatga 780 ctcgaggcag agcatcccat tcccaaactc ctgttcacaa acctgcatct ccagttcatg 840 cgccaccagt cagacttctc ccagtcaatg aacaagctcc cgagcctcca gaagatatgc 900 agctaggtcg ttccagactc tctcccagtg aaagggacag acggatgagg gagcgacgat 960 gcctttactg tggtatgtct ggtcattttc gatccacttg tccagaacta tcgggaaacg 1020 agggtcccca ccgagttaca ggaggactgt gatggggaaa ataagagttc ctcctgccaa 1080 caccgttcta gctctcaaag ctattttgtc ctgggagagt catcagttcc cagtccaggc 1140 aatgatcgat tcaggggccg caggtaattt catagatctc tccttggcca agaaacttaa 1200 gattcctacc caccttctcc ctcatcccca gtcagtaact gctttggatg gtagacccct 1260 tgaacccggc aaagtaactg aggccactca gtccctgaag cttaccattg ctaaacatca 1320 gcaggaggag actttctacc ttattgactc tcccgagtat ccggtcattc taggtcatcc 1380 ctggttgcac agacataatc cccatatcaa ctggtctact ggttccattc tagattggag 1440 tccttcatgt cacttcacct gttttaccca tagcctctct gcccctcatc ccgagcctca 1500 agattctgta gatctgtctc aagttcccgc tgtctatcat aagtttaggg cagtattcag 1560 taagtctcga gccacctctt tgccacctca ccgcccatac gactgtgcaa ttgaccttct 1620 ccccggttcc tctcctccta gaggcagagt cttctcccta tctccccctg aacaggctgc 1680 tatgaatgct tacatccaag agtccctggc aactggcatc atccgagcct ccacttcccc 1740 tgctggtgct ggcttcttct ttgtggggaa gaaggatggg gggcttaggc cttgtatcga 1800 ttaccgaggt cttaacaaga taaccattcg gaatcgatat cccctgcctc ttatggctac 1860 tgcctttgag ctgctgcagg gagcttccat ttttaccaag ctcgaccttc gcaatgccta 1920 ccatctggtg cggatacggc aaggagatga atggaagact gcttttaaca cccccacagg 1980 ccactatgaa tacctggtga tgcctttcgg ccttaccaat gcccctgccg tgttccaggc 2040 acttatcaac gacgtcctcc gagacatgtt aaatatattt gtattcgttt atctggacga 2100 tatacttata ttttccaagt ccatggagga gcacgagggc catgtcagca gggttctcca 2160 aagactcctt gaaaaccatc tctttgtcaa gccagaaaaa tgtgagtttc atgtttccct 2220 gactaagttt cttgggtaca ttgtcacccc tggtcacctg gagatggacc ctagtaagat 2280 taaagctgtt 
ctcaactggc ctattccaac cacagtaaaa gaggtgcaac ggtttgtggg 2340 ctttgcaaac ttttacagga agtttatcag gaatttcagc tcagttgtgg ctcccttgac 2400 agcactgaca aagggaggag gagtcaagat tgaatggggt cctaaagcag cggctgcctt 2460 ccaggatctc aaggatcgat tcacctcagc tcccatactc tctatcccta atccagacat 2520 accctttatg gtagaggtag atgcctcaga tgtgggtgta ggagccattt tatcacagag 2580 gaatgaggat ggaaaactac acccctgtgc tttcatgtca cgtcgcctgt ctaatgccga 2640 gcgcaactac cacgtggggg accgagagct gcttgctgtt aagttggcct tggaagaatg 2700 gcgccattgg cttgagggcg ctcgacatcc tttccaggta cttacagacc ataagaacct 2760 agaatatctc cagcaggcca agcaactgaa ccctcggcag gctcgatggt ctctgttttt 2820 caacagattt cagttcatcc tgacttatag acccggttcc aagaacctta agcctgatgc 2880 cttgtcccga gcctactctc cagagacaca tgaaaaacct gtttcttcta ttattcctag 2940 ttcaaggatt gttgcccccc tcagatggga tctggaacaa gtggtccgta aggctcaaac 3000 caaagaacct gatccaggga atggaccgtt gggggctcta tatgtccctc gagcagtgcg 3060 agctcaggtc ttgcagtggg gtcatgagtc tgtattgacc tgccacccag gtagttcccg 3120 tactttagaa ttcctccgac gtcgcttttg gtggccttcc ataaaggaag atgttaaagg 3180 ttatgtggag gcctgccaag tatgttgtca gggaaaatca tcacaccagc gacctcaggg 3240 actgctccat cccttacctg ttccccacag gccttggtca cacctttccc tggatttcat 3300 tacaggactt ccactctccc agggcaacac ggtcatattg gttgtggtgg accgattttc 3360 caaggctgcc cggttcattc ctctgcccaa gttgccatcg gctaaagaga ctgctgagct 3420 cataataagc catgttttca gagtttttgg cattccccaa gacattgttt ctgaccgagg 3480 tccacaattt ctgtccagat tctggggggc tttttgcaga ctcttcggaa ccactgccag 3540 cctatcatcc gggttccatc ctgagtctaa tggtcagaca gaacgagtta accaagattt 3600 ggaaaccact ctacggtgca tggcagccaa taaccccacc acttggtcat cttacataat 3660 gtgggctgaa tatgcccaca acaccctcaa gtcctcagcc accggacttt ccccttttga 3720 atgccaattt ggttattccc ctccattgtt tcctgagaaa gaggtccagg tgggagttcc 3780 ctcagcccag cactttgttc gacgctgtcg acgaacctgg aggagagcta gaagcgctct 3840 ccttcgaacc tccctgagat accaacatca ggctaatcgc cgtcgtcgaa ggcctcctac 3900 tttccgggtt ggccagagag tctggttggc cactaagaac cttccactcc gggttgagtc 3960 gagaaaattg tcccagaggt 
tcatcgggcc atttagaata gccaggaaag ttaaccctgt 4020 ttcttatcgt ttgtatcttc ctcgttcact tagaattaat cccacatttc atgtctcttt 4080 attaaaacct gtcttgtctt ctccctttgc cccccctcgc agaccccctc cacctcccag 4140 gatcattgac ggccagccag cctacacggt ccaccggata ctggattcca ggagggtcca 4200 gaactcactt cagtatctgg ttgactggga gggctacggg ccagaggagc gctcctggat 4260 tcctgccaaa gacatcttgg accctagttt aatccgggag tttcataccc agaggccagg 4320 gtgttctggt aggaacgtca ggagccgttc ctaaaggagg gggtcc 4366 // ID LOOPERN7_DR repbase; DNA; ZEB; 1290 BP. XX AC . XX DT 09-JAN-2009 (Rel. 14.01, Created) DT 09-JAN-2009 (Rel. 14.01, Last updated, Version 1) XX DE Nonautonomous DNA transposon - a consensus. XX KW piggyBac; DNA transposon; Transposable Element; Nonautonomous; KW LOOPERN7_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-1290 RA Jurka J.; RT "A nonautonomous piggyBac-like DNA transposon from zebrafish."; RL Repbase Reports 9(1), 1-1 (2009). XX DR [1] (Consensus) XX CC 87% identity to consensus. CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX SQ Sequence 1290 BP; 396 A; 262 C; 235 G; 396 T; 1 other; aggtgcagta ggtgatctgc cagaatgcta gcattagcat aatagctttt agaaatttta 60 cgaccctccc ctgccgtcca aagccacgcc tccttaagtc atgaacacgt gaacgcgcac 120 tgattagata taacactaga tcatgtcatt caccagttag aaaactctat ggagacacgc 180 actttatcaa gcgtagttgt tgacaggcca gmgggtgcag gaattatatt tccccaaaac 240 tgtctcacag gctgtcactg ttcaaaaatg aacaaatgtg tgaatataaa tcaaacttca 300 cctcagtaaa gctggttagg cggaagagca catttactta tgttttgtta ttaaactaaa 360 taaaagttag atcagaataa aggttagatc tgctaaaaac agaaatatcc tgtcatcttt 420 ttacattaca cttatgttta cgttaccaag acaacgagtg acttttcatt ttcagttgtg 480 ctcgctgatc ctcgtacatc gcttgttttc agctcctttt acacttgaac aactaaacga 540 atgctcaggt ctatttaacc gactatatta tcgatgctgt aaaaggtact ccatgaactt 600 gaaaatagcc aatcttttgc cggacgtttt cagtggctga acaacacctc tgtgcatata 660 aacccattca taaacaagaa cacagtctac agcacataag ctttgcgtgg ctaaaatagt 720 tttaaaacat acctgtctaa aagaaatact tcagccgtgg tgtcaccctt cagatcgttg 780 ttttgaactg cataacgtca tttccctcca gcgtgcaaaa gtaattccga tattgattca 840 gggtttagaa aagttttgat tctgcatttt tggcagaagc cctttcagaa acaatctttc 900 ttttcatgca tacaaggcca cgttggtctt tcagaaagaa gttcaggttg atgcagagtt 960 tgcaggatga cgtctctctg tcagctgtag atgcgtgctt cacgtgtgcg cgcacgagtg 1020 acgtatctgt ctgcttaaag aggctgcgca gaaattcaaa tttaaatttg ttgacagaca 1080 gtttgagata cctgtcgtaa ttgagttatt tgtcggtccg acaatattta attggatgaa 1140 cattttttat gttttatgcc ttatccagaa tataaaaata catataaata catttagatc 1200 atttacttta atcattacta ttggaatgtg aagagacttt caaccagcac aacaaaaaag 1260 tgtttctgaa gacaatcacc tactgcacct 1290 // ID Polinton-1N1_DR repbase; DNA; ZEB; 15469 BP. XX AC . XX DT 15-MAR-2006 (Rel. 11.02, Created) DT 15-MAR-2006 (Rel. 11.02, Last updated, Version 1) XX DE A family of nonautonomous Polinton DNA transposons - a consensus DE sequence. XX KW Polinton; DNA transposon; Transposable Element; Nonautonomous; KW Interspersed repeat; Polinton-1_DR; Polinton-1N1_DR. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-15469 RA Kapitonov V.V. and Jurka J.; RT "Self-synthesizing DNA transposons in eukaryotes."; RL Proc. Natl. Acad. Sci. USA 103(12), 4540-4545 (2006). XX DR [1] (Consensus) XX CC This nonautonomous transposon is characterized by ~7.3-kb CC terminal inverted repeats and 6-bp target site duplications. The CC consensus sequence was built based on multiple alignment of CC several copies that are >90% identical to each other. This CC nonautonomous family was derived from the autonomous CC Polinton-1_DR. XX SQ Sequence 15469 BP; 5466 A; 2295 C; 2295 G; 5413 T; 0 other; agagagaatt atgatggagc acctgtcaaa aagtttgatg gattgtagcc ccgcccccta 60 aggcgggact tccggtctaa gccccgcccc ttttatataa actttatagt tccggtaatt 120 agattcttat cggtttaata taaaataata gttccggtaa ttagattttt atcggtttaa 180 tataaaataa taattccggt aattagattt ttatcagact aattaaatta taattccggt 240 aattagattt ttatcagact aattaaatta ataggctatt tccaggtaat gaaaatattg 300 gtaattaaat tctaatcaaa cagaaataaa attattatta taataataac ggtaattaaa 360 ttactgttat tagaaccatt attttatatt tttagttatt tgactaaata ttatattaca 420 atatattata ttatattaag ttatataaaa ttaatatttt attaaataaa attatgtatt 480 gtattataat atattatatt aagttatatt aaatcattat tttattaact tatattgtaa 540 ttataccata ttatataata ttattatatt atattattat tataatattt atattagata 600 ttaaattaaa ttatattata ttatatatca ttattattct attctattct attctattat 660 attatattat attatattat attatattat attataatta ataaactatt aacataaaca 720 tttctttatg ttcataccga tgtttcctct gtgtctcagt ctgggcctga gcgtgtgagt 780 gagtgatgat aaggaaaact ttggataaca tcccgtcaac tgcttgtgaa agatagattt 840 atatcagatg tagaatatgt atatctgatt gtatgtgttg ggtgtctccc actaactact 900 atatgttgtt aggtgattag aaagtgtgct gatatctata tatccacatg cagtcaactg 960 tatgcatatg tatgtgtttg tgtgtgcgta cgtgcgtgcg tgcgtgcatg tgtgcgcgct 1020 tgagcgaact tgggtctggc tttgtatgct
tgaatccaac gacacagtat ttactaaaaa 1080 tactcattaa taggttatgt tgtttttcac attattacta caataataca tattttaaga 1140 cacacaaaca ttttgatgta tctttgtcgg agtattaaat taaaatatgt ttacaatatt 1200 ttttgtcgcc ttagcccctt tttaattcct gggtggacac agcggaatga accgccaact 1260 taaccagcaa ggttttgcgc agcgtatgcc cttccacccg caacccatct ctgggaaaat 1320 gtttacaata ttaatccagt aaataacaca tcttaatctt tattgacaat aaattgtcac 1380 accttttaaa acaacgttat ctaaaatgta atattctaaa atgttgaaaa acacagatgt 1440 aaattcaagt tgtctttata attaatgaaa acttttatta tgaaaaacaa tttaaaatga 1500 aagtttttcc tttgactaac atgttttcaa tttatttaca tgcatgtcct gtaatttatg 1560 gcacacattg taatacagtt aatccaaagc ctaatgtttt gagcactggt catcactggt 1620 gttggtttaa aaaatcaaca taaaatattt actaaaaagg acttatcact ctttagacat 1680 tattgttttt tctgtttcaa aagataaata aaatgtaatt ttattacaat aaattgtagt 1740 aattgcatta ctagacaact tagttactga attttaaatt caaactgttt ttaattcatt 1800 gcaaatactt gatcatataa agagaaatat gttaattata ctgaataatg gaataacact 1860 gatatttatt ttaagaagaa aattggctct cagtttatct aaaaacagta cagctccagc 1920 ccttagttaa atctacttaa tgttttatag taaaattttc tagtttaact actgaatcca 1980 ctatgtacat gttcatacaa tagtgtcaaa tctagtaagt ttacaaattt gtcaaaatta 2040 gtcaagttta aaagtagttc ttgaaagtag actgattact tttttacagg gtaaatgatg 2100 cttttcttct accaatagat ttaatcaaga ttattgtcac tacacacaat ttgttgggag 2160 tgtatatgtt tctttaaaca atgtttttta ttggcaaaca cattttcaaa tgatcaaact 2220 aaaatctaac ggaatgtttt tcactctgaa tataaaagct ttatctgtgc atttcacttg 2280 gaaaagtgat gtttaaaata ctattcatta aacttactgc atgactggag ttttattttt 2340 tacaaaccaa tagcattgac ctaattcatg cttttgattt tccactctgt agcagttttt 2400 caacacaaaa tgtaaacgtt ttatgcatgt gctaaaaatc tgtaattttg caccactgtc 2460 agagtttgta tgtaacaaag catacatgtt tttaaaactg tcagttttca aaagttactt 2520 aacgttgtct cctttcttct ggagttttta ggaacaacaa tgtttcattt gtgggggaaa 2580 gaaaaaaaaa gtgatgtttt gaaatacatt ggttacacca atatggtgta aaggttagaa 2640 gttggggttt catccagacg gccggggctc aacgcctgat tttttaatgc tttttaacct 2700 ctaaagtgtc ccaagtaaca ttaataacaa cacttagcca 
caattatttg tatcgcagtt 2760 gaaggctaca cttaatactg ccagcaaaac acaataataa caaagagtaa acactttctt 2820 attttaagaa tactttgcag taaactttaa tgcattagta agtttatttt atccaccttc 2880 tgatgcagaa tgtttttgaa acatttaatt atgtttttat gaccacgggc atctcaccaa 2940 aatatttgac ctaattcaag taaccagtgc tgcagttttt aatcacaaaa tgtctttaca 3000 aacttttttg cgtatgtgtt aaaagtctaa ttttacacaa ctcttatagc gtccgttaca 3060 atgcatacag taaatgtcac aataaccatt tctactgcca attatctttt ggtgtctaat 3120 tcttcacaat attttttcac gagaactatg aatctacaaa ggcaaggcaa ggcaagacaa 3180 ggcaagttta tttctatagc acatttcata cacagtggta attcaaagtg ctttacataa 3240 acaggaataa aagagacaac tataagaaaa ataaaaacaa ataatgaaaa atttaaaaaa 3300 atgataaaaa cagatcaaat gtgttaaaac aagttataaa agaatgaaaa gaagagaaaa 3360 acatattagt gtgatctgtc aatagtgcga tctgtctaca aaactaaaag aatagttgcc 3420 actaaatctg cattaatatc ttgttcaaca gtgtctaatg tactgtgtat taaaactgta 3480 ttcttttgaa ctgtcatgaa agtttgcgtt gtgaatgttt tggttttaaa ccagagacaa 3540 ccaacaatat tatgtatacc ttttgttcaa atactttttc aaatgaaaga taaattatga 3600 aattgtatgt cagtgcatct ttaaaactaa attgggcaaa aaaaaattag gttgtctttc 3660 taaactgact acaaaaccgt tatacatgag aactgtgtgt tattgggttg acttaaacgt 3720 taaccaattt taataaatgt tctacagttt atacatttgc tttaaacaag tgaccagtcc 3780 ttgacacact atgcaacatt ctaattttct aaatatttaa atatacattt ggcataaata 3840 cagggcttta atcagttttg acaatgtttc tatgatttgt ttcacaccat catgttttgc 3900 acctagctca atttttaact tttttacagg aaaggcatag acacttgagt tgataaactt 3960 ttatttcaag taaatataca ctgaaataga gtagtacata acaaacaaac atggattgca 4020 ttaccataat aaagacaaat taactaagaa aaaaaaaata catttataca gtcatacatt 4080 gaagaaaaaa cagggtacac cagaacaata aagacattac tgaatattac caaacaacaa 4140 agaaatgctg ctaaaagcat tctgtcatgt cctagtatgt ggctagtcat aacttccaca 4200 atctcagcaa tatttgttat tttgttttgt gaaaccagtg acacacaaat atctgttata 4260 tatgtggaca tacatagaac ctgcgttacc ccccatctca tcactcaatt cagtcagcac 4320 ctccagaatt tttgtattgg ttgcacagac agcaccattg ttatgaaaac acaggtgtgt 4380 caaatgattt atcgtatcgg cccatgagtt taggtccctc tcacaacgtc 
catttctctc 4440 ttgagatttt tcacacgttt gccatctcct gtacaatcat gtttagaatc acacgcatca 4500 ttaatgatct tctccagaag aaaaaccata tggtccctat aatgctgttt aaacatgttg 4560 tccacaatag accttatttg acaatcaaaa acagggttgc actcgtactc gggcctttca 4620 tgcgcattat cgtcagtatt ttgagaacaa tagcaaccaa ccatctttgc atggtggctg 4680 tcagaaaaaa atctgaaaat atttttaaga aaacgttttt tctttaatca caatacaaag 4740 tttgacccat aatctgtgta gatcatataa agtagtacct taaagatgcg gcctgtgtcc 4800 ttaaacagcg atagttcagc cctaatagtt cagtgcaaac aagaccaaag taggggctgt 4860 ggtcctcaaa gatagtacct ttttatgtct ttaatcttct ttttactcct gatcaattgg 4920 ttcttgtaga aagctttagg atcagtgact gtagccagaa gcattactcc ttggtagcta 4980 gaagtaaaaa tgggtgcttt atctacacat tcaaacaggc ccgccctcac tcacggggag 5040 gtactatgaa catctgctat ggtgtcataa aactcatgat agttgacatc ttgtctttta 5100 tttaagacat aggcgtgatg atgtagggat acctatttta aaaataaaaa taaagataca 5160 gagtcatacc gtaaatatat tcacaatgaa agattttaga gaactaatat tatagatttt 5220 ttacatttca tggaagtttt aggttgacaa aatataatgc acagctgaag tcatctcaaa 5280 actcattgcc aaaatggaca atacaattaa tttgaatgtt tttaagcaaa tatgtacagt 5340 cttttcaaca ttattctgca gtttgaattg ttcaatacaa ttgtttattt ggtttcctat 5400 ttgggctcaa agcagtcaca tatttaactg aaatttgcat ctttcatgat agtgttcttt 5460 ttcaaagaat gattgagaag aaataggatt acatctacac ttgcatattc tttttagaat 5520 ttaagtgaac tatacttgta agaaaataca attaaattta tgaaataacc catattccat 5580 aacttattac aatgcctttg aggtcaagaa agagatcaag tttaggagcc attacctcat 5640 aattggttat tagataaaag ttattttaaa aagtcaagtt tggttaaaaa cctaggcttt 5700 cttaaatttg ttctgttaaa acataaagaa aattttaaaa atcaagttat gcattatacg 5760 ttctggagat atataaacat acactttacc atcagataat ctacgaataa caatattcac 5820 caattttata caaagggtac caaaaccgtt tgcttatgtt tgttgcagag tagcaataag 5880 attttttttt tttttttttt tttttatgtt atgtgatact aaatctattt tctggcctat 5940 aattttagtg atatacttat tataaagttg aaataaaaag ctgatacatt tagttaaaca 6000 tcatgtgctt gacattttgg taaatagtta aagcttctga ttgtactaat tgaatattta 6060 gcattttgaa aggtaaatat gtgtcagggt atgtaaagct gacacctcta acttgtttgg 
6120 ttgagatttg tggaatgttt ccacaaattg aaatattaaa gtgtttttct tattcactat 6180 tcagcaactt cgatgcaaca ttgtaaaagt gttagaaata tacaaaagta catgtcatgc 6240 tgttttgtaa aaacagacag tcatatcagt aaagaactgc acttcctgag acaaacattt 6300 tgttttgctg taaatcttta atattttaat gtatcgggca ggatgtaagc atacaactta 6360 catataactg actttttgta cgacaatctt tcgatgaaca tgtttacagt atattgcaac 6420 attttagagt ggtctttgaa atagttacaa tatcacagat gtaaagtact cgaacaatgt 6480 ttgaacaatg ttagctgatt gctgtactta agtactactt ttatgatttg taccctactt 6540 gagtacagtt aaaattaatg tttactttta cttcattact tttttaaaaa cagtattttt 6600 tttcacaact ttcaatttta tttattctca aaaaaagttc ctcactccta acaaaacgac 6660 aaagctgctt aaattaatgt tatttactat gtaattgaca ttcatttcaa tattccatca 6720 aactggtcag acatgctctt cgattgttta gaagtcatgt cctgtcacat tgagctctac 6780 aatacacatc aaccccagta agtctgctat cagtacactt ctgttaagca gttcataaga 6840 aaatatacat tcagtccatt gtctgtgtac acattcacat aattttattt agatgccgca 6900 cagaaataaa tttaaacctc acttaagtat atttaaactt atacaaacag aaagtgttta 6960 gttatttgtc ctacagtatg cttggtcatt gtttttcgcg tacaatttta ttacagtacg 7020 tgttctaagt acatttggtt agtagttatt ttgacaccct ttgattaatt gggcacatta 7080 tttttctaca taatgttata gacacatgta ttaagacaga gaataaatta gtttgagtcg 7140 acttaactga aatgaactgc gaacatgtca aatctttata aaagacggta atcttataat 7200 aacacaaatt taactgctta ctgcagtgag aaaaatatta aaatgaaatc acaaaaacat 7260 aaaaagtata catgcagcta gacaaaagac ctgaatactg tcaatattta catgtttact 7320 gaattctttt cagtatagag tatactgtaa aagtataatt tcaggtgtga cagcagtaaa 7380 cattaagcac ataagtagct aactataaca tttttattta tacatcctgt caacttatga 7440 gaatagtagt agttcactta tacttgtagt agataaatag tttgtgctgg aaatgatgca 7500 atgaaatgat gcatttatgt ctgcacttaa accatccata tgagaactgt cggtgctgca 7560 tgctcaatgc ttcaacattt tcaccacagg tctgactttc caaccaattt agcgttataa 7620 tttttagtga tacagtgtcc atattactct agtgtcttga aacttgcaag agtacagcta 7680 gtgtggcctt ggttaactag tagtacacaa agcagttacc acaagggata gtgtgttatt 7740 gaaaacatgt catgcaattt gtagaaacca atttagaaaa caactgaagt aaccgctgag 7800 
cattttatct cagaataagt gcagtgttag attacctgtg gattttgaaa atgtttataa 7860 tttaataaaa attataaaca ttattaaaaa attataaaaa taattgtaat aaaaataaca 7920 aaagcttcaa acattattca aacatgtaat gaacttacac agtcatatgg tcaaagtata 7980 cttgttgcta atcaaatata gcacataaaa atacattaaa tacccaagag aatatgctgt 8040 agtacaaatt atgtacaatt caaatatacc aaatttaata tttaaaaata tggcatttca 8100 agatagcaaa taattaatta cattgaatta tacaggctct ttgtctagct gcatgtatac 8160 tttttatgtt tttgtgattt cattttaata tttttctcac tgcagtaagc agttaaattt 8220 gtgttattat aagattaccg tcttttataa agatttgaca tgttcgcagt tcatttcagt 8280 taagtcgact caaactaatt tattctctgt cttaatacat gtgtctataa cattatgtag 8340 aaaaataatg tgcccaatta atcaaagggt gtcaaaataa ctactaacca aatgtactta 8400 gaacacgtac tgtaataaaa ttgtacgcga aaaacaatga ccaagcatac tgtaggacaa 8460 ataactaaac actttctgtt tgtataagtt taaatatact taagtgaggt ttaaatttat 8520 ttctgtgcgg catctaaata aaattatgtg aatgtgtaca cagacaatgg actgaatgta 8580 tattttctta tgaactgctt aacagaagtg tactgatagc agacttactg gggttgatgt 8640 gtattgtaga gctcaatgtg acaggacatg acttctaaac aatcgaagag catgtctgac 8700 cagtttgatg gaatattgaa atgaatgtca attacatagt aaataacatt aatttaagca 8760 gctttgtcgt tttgttagga gtgaggaact ttttttgaga ataaataaaa ttgaaagttg 8820 tgaaaaaaaa atactgtttt taaaaaagta atgaagtaaa agtaaacatt aattttaact 8880 gtactcaagt agggtacaaa tcataaaagt agtacttaag tacagcaatc agctaacatt 8940 gttcaaacat tgttcgagta ctttacatct gtgatattgt aactatttca aagaccactc 9000 taaaaatgtt gcaatatact gtaaacatgt tcatcgaaag attgtcgtac aaaaagtcag 9060 ttatatgtaa gttgtatgct tacatcctgc ccgatacatt aaaatattaa agatttacag 9120 caaaacaaaa tgtttgtctc aggaagtgca gttctttact gatatgactg tctgttttta 9180 caaaacagca tgacatgtac ttttgtatat ttctaacact tttacaatgt tgcatcgaag 9240 ttgctgaata gtgaataaga aaaacacttt aatatttcaa tttgtggaaa cattccacaa 9300 atctcaacca aacaagttag aggtgtcagc tttacatacc ctgacacata tttacctttc 9360 aaaatgctaa atattcaatt agtacaatca gaagctttaa ctatttacca aaatgtcaag 9420 cacatgatgt ttaactaaat gtatcagctt tttatttcaa ctttataata agtatatcac 9480 taaaattata 
ggccagaaaa tagatttagt atcacataac ataaaaaaaa aaaaaaaaaa 9540 aaaatcttat tgctactctg caacaaacat aagcaaacgg ttttggtacc ctttgtataa 9600 aattggtgaa tattgttatt cgtagattat ctgatggtaa agtgtatgtt tatatatctc 9660 cagaacgtat aatgcataac ttgattttta aaattttctt tatgttttaa cagaacaaat 9720 ttaagaaagc ctaggttttt aaccaaactt gactttttaa aataactttt atctaataac 9780 caattatgag gtaatggctc ctaaacttga tctctttctt gacctcaaag gcattgtaat 9840 aagttatgga atatgggtta tttcataaat ttaattgtat tttcttacaa gtatagttca 9900 cttaaattct aaaaagaata tgcaagtgta gatgtaatcc tatttcttct caatcattct 9960 ttgaaaaaga acactatcat gaaagatgca aatttcagtt aaatatgtga ctgctttgag 10020 cccaaatagg aaaccaaata aacaattgta ttgaacaatt caaactgcag aataatgttg 10080 aaaagactgt acatatttgc ttaaaaacat tcaaattaat tgtattgtcc attttggcaa 10140 tgagttttga gatgacttca gctgtgcatt atattttgtc aacctaaaac ttccatgaaa 10200 tgtaaaaaat ctataatatt agttctctaa aatctttcat tgtgaatata tttacggtat 10260 gactctgtat ctttattttt atttttaaaa taggtatccc tacatcatca cgcctatgtc 10320 ttaaataaaa gacaagatgt caactatcat gagttttatg acaccatagc agatgttcat 10380 agtacctccc cgtgagtgag ggcgggcctg tttgaatgtg tagataaagc acccattttt 10440 acttctagct accaaggagt aatgcttctg gctacagtca ctgatcctaa agctttctac 10500 aagaaccaat tgatcaggag taaaaagaag attaaagaca taaaaaggta ctatctttga 10560 ggaccacagc ccctactttg gtcttgtttg cactgaacta ttagggctga actatcgctg 10620 tttaaggaca caggccgcat ctttaaggta ctactttata tgatctacac agattatggg 10680 tcaaactttg tattgtgatt aaagaaaaaa cgttttctta aaaatatttt cagatttttt 10740 tctgacagcc accatgcaaa gatggttggt tgctattgtt ctcaaaatac tgacgataat 10800 gcgcatgaaa ggcccgagta cgagtgcaac cctgtttttg attgtcaaat aaggtctatt 10860 gtggacaaca tgtttaaaca gcattatagg gaccatatgg tttttcttct ggagaagatc 10920 attaatgatg cgtgtgattc taaacatgat tgtacaggag atggcaaacg tgtgaaaaat 10980 ctcaagagag aaatggacgt tgtgagaggg acctaaactc atgggccgat acgataaatc 11040 atttgacaca cctgtgtttt cataacaatg gtgctgtctg tgcaaccaat acaaaaattc 11100 tggaggtgct gactgaattg agtgatgaga tggggggtaa cgcaggttct atgtatgtcc 11160 
acatatataa cagatatttg tgtgtcactg gtttcacaaa acaaaataac aaatattgct 11220 gagattgtgg aagttatgac tagccacata ctaggacatg acagaatgct tttagcagca 11280 tttctttgtt gtttggtaat attcagtaat gtctttattg ttctggtgta ccctgttttt 11340 tcttcaatgt atgactgtat aaatgtattt tttttttctt agttaatttg tctttattat 11400 ggtaatgcaa tccatgtttg tttgttatgt actactctat ttcagtgtat atttacttga 11460 aataaaagtt tatcaactca agtgtctatg cctttcctgt aaaaaagtta aaaattgagc 11520 taggtgcaaa acatgatggt gtgaaacaaa tcatagaaac attgtcaaaa ctgattaaag 11580 ccctgtattt atgccaaatg tatatttaaa tatttagaaa attagaatgt tgcatagtgt 11640 gtcaaggact ggtcacttgt ttaaagcaaa tgtataaact gtagaacatt tattaaaatt 11700 ggttaacgtt taagtcaacc caataacaca cagttctcat gtataacggt tttgtagtca 11760 gtttagaaag acaacctaaa ttttttttgc ccaatttagt tttaaagatg cactgacata 11820 caatttcata atttatcttt catttgaaaa agtatttgaa caaaaggtat acataatatt 11880 gttggttgtc tctggtttaa aaccaaaaca ttcacaacgc aaactttcat gacagttcaa 11940 aagaatacag ttttaataca cagtacatta gacactgttg aacaagatat taatgcagat 12000 ttagtggcaa ctattctttt agttttgtag acagatcgca ctattgacag atcacactaa 12060 tatgtttttc tcttcttttc attcttttat aacttgtttt aacacatttg atctgttttt 12120 atcatttttt tatatttttc attatttgtt tttatttttc ttatagttgt ctcttttatt 12180 cctgtttatg taaagcactt tgaattacca ctgtgtatga aatgtgctat agaaataaac 12240 ttgccttgtc ttgccttgcc ttgcctttgt agattcatag ttctcgtgaa aaaaatattg 12300 tgaagaatta gacaccaaaa gataattggc agtagaaatg gttattgtga catttactgt 12360 atgcattgta acggacgcta taagagttgt gtaaaattag acttttaaca catacgcaaa 12420 aaagtttgta aagacatttt gtgattaaaa actgcagcac tggttacttg aattaggtca 12480 aatattttgg tgagatgccc gtggtcataa aaacataatt aaatgtttca aaaacattct 12540 gcatcagaag gtggataaaa taaacttact aatgcattaa agtttactgc aaagtattct 12600 taaaataaga aagtgtttac tctttgttat tattgtgttt tgctggcagt attaagtgta 12660 gccttcaact gcgatacaaa taattgtggc taagtgttgt tattaatgtt acttgggaca 12720 ctttagaggt taaaaagcat taaaaaatca ggcgttgagc cctggccgtc tggatgaaac 12780 cccgaacttc taacctttac accatattgg tgtaaccaat gtatttcaaa 
acatcacttt 12840 tttttttctt tcccccacaa atgaaacatt gttgttccta aaaactccag aagaaaggag 12900 acaacgttaa gtaacttttg aaaactgaca gttttaaaaa catgtatgct ttgttacata 12960 caaactctga cagtggtgca aaattacaga tttttagcac atgcataaaa cgtttacatt 13020 ttgtgttgaa aaactgctac agagtggaaa atcaaaagca tgaattaggt caatgctatt 13080 ggtttgtaaa aaataaaact ccagtcatgc agtaagttta atgaatagta ttttaaacat 13140 cacttttcca agtgaaatgc acagataaag cttttatatt cagagtgaaa aacattccgt 13200 tagattttag tttgatcatt tgaaaatgtg tttgccaata aaaaacattg tttaaagaaa 13260 catatacact cccaacaaat tgtgtgtagt gacaataatc ttgattaaat ctattggtag 13320 aagaaaagca tcatttaccc tgtaaaaaag taatcagtct actttcaaga actactttta 13380 aacttgacta attttgacaa atttgtaaac ttactagatt tgacactatt gtatgaacat 13440 gtacatagtg gattcagtag ttaaactaga aaattttact ataaaacatt aagtagattt 13500 aactaagggc tggagctgta ctgtttttag ataaactgag agccaatttt cttcttaaaa 13560 taaatatcag tgttattcca ttattcagta taattaacat atttctcttt atatgatcaa 13620 gtatttgcaa tgaattaaaa acagtttgaa tttaaaattc agtaactaag ttgtctagta 13680 atgcaattac tacaatttat tgtaataaaa ttacatttta tttatctttt gaaacagaaa 13740 aaacaataat gtctaaagag tgataagtcc tttttagtaa atattttatg ttgatttttt 13800 aaaccaacac cagtgatgac cagtgctcaa aacattaggc tttggattaa ctgtattaca 13860 atgtgtgcca taaattacag gacatgcatg taaataaatt gaaaacatgt tagtcaaagg 13920 aaaaactttc attttaaatt gtttttcata ataaaagttt tcattaatta taaagacaac 13980 ttgaatttac atctgtgttt ttcaacattt tagaatatta cattttagat aacgttgttt 14040 taaaaggtgt gacaatttat tgtcaataaa gattaagatg tgttatttac tggattaata 14100 ttgtaaacat tttcccagag atgggttgcg ggtggaaggg catacgctgc gcaaaacctt 14160 gctggttaag ttggcggttc attccgctgt gtccacccag gaattaaaaa ggggctaagg 14220 cgacaaaaaa tattgtaaac atattttaat ttaatactcc gacaaagata catcaaaatg 14280 tttgtgtgtc ttaaaatatg tattattgta gtaataatgt gaaaaacaac ataacctatt 14340 aatgagtatt tttagtaaat actgtgtcgt tggattcaag catacaaagc cagacccaag 14400 ttcgctcaag cgcgcacaca tgcacgcacg cacgcacgca cgcacgtacg cacacacaaa 14460 cacatacata tgcatacagt tgactgcatg 
tggatatata gatatcagca cactttctaa 14520 tcacctaaca acatatagta gttagtggga gacacccaac acatacaatc agatatacat 14580 attctacatc tgatataaat ctatctttca caagcagttg acgggatgtt atccaaagtt 14640 ttccttatca tcactcactc acacgctcag gcccagactg agacacagag gaaacatcgg 14700 tatgaacata aagaaatgtt tatgttaata gtttattaat tataatataa tataatataa 14760 tataatataa tataatataa tataatataa tataatataa tataatataa tagaatagaa 14820 tagaataata atgatatata atataatata atttaattta atatctaata taaatattat 14880 aataataata taatataata atattatata atatggtata attacaatat aagttaataa 14940 aataatgatt taatataact taatataata tattataata caatacataa ttttatttaa 15000 taaaatatta attttatata acttaatata atataatata ttgtaatata ataatataat 15060 ttagtcaaat aactaaaaat ataaaataat ggttctaata acagtaattt aattaccgtt 15120 attattataa taataatttt atttctgttt gattagaatt taattaccaa tattttcatt 15180 acctggaaat agcctattaa tttaattaga ctgataaaaa tctaattacc ggaattatta 15240 ttttatatta aaccgataaa aatctaatta ccggaattat tattttatat taaaccgata 15300 agaatctaat taccggaact attattttat attaaaccga taagaatcta attaccggaa 15360 ctataaagtt tatataaaag gggcggggct tagaccggaa gtcccgcctt agggggcggg 15420 gctacaatcc atcaaacttt ttgacaggtg ctccatcata attctctct 15469 // ID Mariner-N5_DR repbase; DNA; ZEB; 8969 BP. XX AC . XX DT 24-OCT-2008 (Rel. 13.1, Created) DT 24-OCT-2008 (Rel. 13.1, Last updated, Version 1) XX DE Putative Mariner-type non-autonomous DNA transposon - consensus. XX KW Mariner/Tc1; DNA transposon; Transposable Element; Nonautonomous; KW TSD TA; Mariner-N5_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-8969 RA Bao W. and Jurka J.; RT "Mariner-type DNA transposons from zebrafish."; RL Repbase Reports 8(10), 1614-1614 (2008). XX DR [1] (Consensus) XX CC The TSD is highly (>90%) TA specific. 
CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. XX SQ Sequence 8969 BP; 2983 A; 1473 C; 1512 G; 2999 T; 2 other; cagtaggtga gagttcaaag tggctattaa cgccgcggct ggtaccgccg ccgtttgaaa 60 taggtgccgc tctgttttta gtgtatgacc gcagtttgtg aatgagccgc cagggggcgc 120 aaagggacgg gatgcgaacg gacagaaata gatcatacag ctactgtgct tgtaaatgat 180 attaaacaat aagtcaaagc attataattc cttcattaat catttaaaat ttgttaacac 240 gctttattgt aaacttttta agtgcaaaga ggattcgttt tggaaaatag aattgaagaa 300 gtaaaagaca actaaaatga aagcacaaac atttagtaca aataagtagt cttaaataaa 360 taaataaata aataaataaa taaataaata aataaataaa taaataaata aagcagccta 420 aatatagaaa gatgttcagt gtttgctatt ttgataggac taagacaaga cattttcatt 480 tatgttttat ttctgttttt atttccattt tcgttgttga ttatatttta ctatcttatt 540 tatgtctcaa caattgtaca taaacgaata ataaacgaat aatgaaaata aatgaataat 600 gaaaataggc ctaaataaat tatttattta aagacattgc cgtacagtac gtgaaacact 660 gagaaaaata aattgaccta ccttaagcag atcactaaaa gtataataaa attatttttg 720 ccgcaggaca taaattaagt cttgcaggtt tatttttaat aatttagttt cagttctgtt 780 catttttttt ttcaaaacgt caatgttctc ctttaaagta gctataactg ttgttttgtt 840 tttttgtttt ttttttttac ttatggctca gtatgttttg tattttaact tcatattcag 900 tcaccttgca aatgtgtgtg aaatataatg ccgcataacc gcggaaaaag aagaaaaaac 960 tgtcgaaggt tagagctaat tcatcaaaat aagctatatt aaaatacaac atttatgtta 1020 atacaggcag agtgtgaaca aactcaaggc taagatatgc gcttgccttt taatgcattt 1080 gcggcctttt tataggctac aggcaactat gtataataat gttttttttt ttcttgtttt 1140 tttttttgta tggtaaagga caatacacat tatgttattg atgtaaactt aggctaaaac 1200 ttaccattga taaacgtgac ctacctcaag cagatcactt aaagtataaa aaaataaagt 1260 aggctatata tatatattct taataaactt cacagccaca aatataaaaa caacttgtat 1320 taaaactggg cgaggattag aaagaatacg atgcttgtta tatataattt aaattttgat 1380 tccaatttct gcaagcacat tttttcactc gctactctgc cagaaacttc gccgccggag 1440 agcacctcaa taatgtaaac atttgtttaa agatggagcg caagatgtgc cgagtggcaa 1500 catgtgaaga acgatggcaa agctaagtgt gattattgta ataaactaag 
ttataaagca 1560 gagtcccgac caacaggact aagcatatgc ggttggctca ttcttcacga atatagaatt 1620 aaaataatgg cgcgttaggt tcactaagca gcgcattctc tccgcctcat cgtttaacca 1680 gcgggacacg ctgctgaagc aggagagaca ctccgtcatt cataatctta gctgatgtat 1740 cggagtcgaa agaatatatc aaagaggaat tttcttttaa cagtcccgat atgtttaatc 1800 tttttatttt attttttttt ttcgtgtcat ttctcttcag caaattagat attttttaaa 1860 gctgcttgat tcttttaaaa ccgaaagcag ggcccatagt gtttttccat aagatactcc 1920 tttatgaatg ttttaatttt tttattaaag tttttaatgt gctttttgat agttttattt 1980 tatttcaagc agcatctaaa tgcgaattta tcctcaaaac ttggcatgag agcgatgcaa 2040 gtcatgtgtg gcataattgc cttttttccc tgcgcgtgcg agatcaattt ctgatctttt 2100 tgcaactaaa atgaaagcat aaacattcag tacaaataag tagtcttaaa taaataaata 2160 aataaataaa taaataaata aataaataaa taaataaata aagcagtctt ttaacctaaa 2220 tatagagaga tgttcagtgt ttgctatttt gacaggacta agacaagaca ttttcattta 2280 tgtttttatt tctgttttta cttamatttt cgttgttgat tatagtttac tatctcattt 2340 tatgtctcaa caattataga taataataaa cgaataatga aaattaatga ataatgaaaa 2400 taggcctaga tatattattt atttaaagac attgccgaca ttgagcttaa gttgaatgca 2460 gcttcatcaa aagcagaaaa aaataaaata atttgaccta ttttaagcag attataataa 2520 aaatattttt gccgcaggac ataaattaag tctagcaggt ttgttttttt aatactttag 2580 tttcagttca gttttttttt ttcaaaacgt caatgttctc ctttaaagta gctataactg 2640 ttgttttgtt ttgttttttt acttaatggc tcagtatgtt ttgtatttga atttgacgcg 2700 tcagaaaaaa aaaaccgctg aatctctgca aatatttggt taacttataa aacaaatgta 2760 tggtttagtg atgttagtaa tgcatagaaa aatgcaaagc agaattatta aagttattat 2820 ttcattgatc tgaacataac catttataat ataattacta acccgtcgtc taaaatatta 2880 aaataataaa atagtctatt attttattca tggctagagc ttgattaaat tgattaacaa 2940 gccatgtgct tttgtttgtt tgttctcggt actgggtata cgcaaaacat tttctaaaat 3000 gtcaagcagt atttgtctat agagcagcat tttgctaatt ataatagtcc tttatattta 3060 tatacatgtg tttttgcata ttcgttattt cacttgatgc attcgttatt taatgtttga 3120 atactgaaat aaaaagccat cgtttaaaag gttatttcac catcaaaatg tagcctgtta 3180 tatgtatcat taacgtttta tataaaactg ctgtgcagtg taaatgggac cttaatacag 
3240 tttaaagcga agttgtctgt aaggatgatg ggcatggcat gtctattgtt gtggctcgcg 3300 ttgtgaatat tagaaaataa ctgtcgggtg aaggctttat ggtcgtgcgg aattgtctgc 3360 tgctgtcatg gctcagcagg caacacccta ttcagctagg ctatgttcgg tgtttgtatg 3420 tacaaattcg ttttggcgaa tgctagagcg taatagagat attttaaata acaagtatag 3480 ttcagtccag aagttgctgt gcatagaaac attcattttc ataaggcccc atttgcggtt 3540 tggttttaaa acgcataggt tttgctacgg ttacgccatc cgtcctcggg agttttggat 3600 tttgtgtaac catgtttgtg gaaaacactt gagggtggag acatacccct tcccccgtct 3660 cataaccaaa agcttgtctt tcaggtgtta atgggcatcg agaccgaagt catgtcgcat 3720 ttccactgtc ggcctatagc tcgcagcgcg taacgcaaac ccagccccca gaacgtcccc 3780 cgagtgttgg catcgtgcat ccgttaataa ctcggtagaa aatgtatttt ataacatcct 3840 catttagaca cagtcaaaca ttcagcccaa atgcactgga gagacctgaa acgtgttccc 3900 aaaaatgctc tgtttgcacg atttgattat tatcatcata tccaaatact ttataaataa 3960 tattcttaac cttctctggt gtttattgtt tttagaaaat atctaaaatc ttttaaaaga 4020 aaatatcttg taaaataaaa gtttgatttg gctttaacgg ttttatttat gtataggcct 4080 acctaaactg ttatagcttg cttttgcttg ttttatattg gtttatttat ttaatgcgat 4140 tttattatat ccgataatgt aattctttaa attatatttc aatgtggctt aggcttatca 4200 agatatgaca cttgttttga gtctacaaaa ggtaatttgt aaagtgttat gtctttctgt 4260 tgtattcata tggtgtatta tctgcaaaat aaataacaaa aagaaagaag ccgctagcag 4320 catccacaga gctgtgaaga gagcgctctt ttgctcggtc tctcacacac aacctttatg 4380 tctgctgtgt ctttaatatg gtgtgtagat tattatgcgt tattgaaaca tcaaggggga 4440 cgcagcaaaa tcacttataa attaaaattg tattattcta tgcaacagcc ttacatttaa 4500 aacggaaaca taagggcgat tttaagcaaa aagacctcag cctataagtc tagatgtttt 4560 cttttccttt ttatttattt gtttatttgt ttggcgcatg tgtgcgatag gaaaactata 4620 gtgtaattac ggcaagccat ctttcccaga ctcggggcaa attcctttca ctccgtcccg 4680 aagccccagc tggccctcct gccatccact gcgtaaaaca tatgctggat aagttgacgg 4740 ttcattccgc tgtggcgatc acagactaat aaagggacta aaccgaaaag aaaatgaatg 4800 aagaatgaat gaataaatta ataataataa aataaaaata aggtaataat aaacacaaat 4860 gccgcgtaag cggccttttt tttagagatg gtacatttga attgcctctt cccacttccc 4920 
ccacctcgaa attgaaagtt gcaaagagaa ctcaatagca aagatcatag cccattaaca 4980 catcaacaca tagtccattt acacatcaac acagtgcaag tttagcggaa gaatgtcaaa 5040 tcagaatcac gcagatgaag cacaccagct actactatga aactattaaa attgttttgt 5100 cataaatttg cttatttgca gcagcttttt ttcgcgcaac agaaaagcgg cgttgggaag 5160 ccgtcaatga ataaaaacag ggtcaatagt gaaaatcggc ttttcattgt atccctatta 5220 cctatatcaa tgcattgaag gccttttatg gtgtcatctc gatagaatag cctaagttat 5280 aaataattta aagaacaaag ttataaaaac tcacattaca taaataaaca attaaaaata 5340 tgtttaggtt gtctatttaa ttgccctcgt gatgatggtg tgctttatca ttatgtgcgt 5400 tttacttata actaacctga tttaaatgat ggatctgcga cagagaaact aggggttttg 5460 ggggacatgc atcactatat cctttttttc cccaaactgt gcatagttca gctgcaagtt 5520 actgtacgtc tttttttgga aaagcacccc atgtccgcta gtgccgcgcg cctctttctc 5580 tgggcaaaaa tgagttgatt tttaaaataa taaatatata gattaattaa attttcagca 5640 aaagagacat ttatttcata tttcactatg catttcatct gatctttata gcttaattaa 5700 acacatattg tcaaaaccca cacataggcc tattgtatga aaatgaaaag ggcatttttt 5760 gttattatta cactactatt tgttatttgt tgcctaggct attttcacat ttgaggtgct 5820 ttattggcat gacaagtaac tgtacattcg ttttgccaaa gcagtgcgcg tctcacaaac 5880 aagacagtgc aaaaagggca gtagtgcaaa caaatatgaa aataacatat aggctaataa 5940 aaaaatgatt aataaaaata aaaatagaat aaaaaataaa aatataataa taaaaataaa 6000 aatagatttt tgataacatt aaacaggata aaggtaataa tagtagccta ttatcaaaag 6060 taagtccaca taaagagttc cagtatttgt gtgttggaga caggagcggg tattgttggg 6120 tgaacacaca gcagcacaca cactccccca gtaagatggg ttctgttcga cagggactca 6180 aagttgtttt gtatgtgttg gattatcggg tagaatttct cccggatctc ggagtattta 6240 gtgcactcgg gatatgcagc tccgtctcaa tcagctgctg agggcacagc cgcttctcca 6300 ccggcagcca tgtttttttt tgtgcgggcc cgtctttacg gctagctgat gtccgctgcc 6360 cgcttctaac atgttagaaa atactgcaaa tcaagctgtc ggtatctccc cgcggttgaa 6420 attgttcgtt attccgaaaa aaagaagaag aaatatcagc gcgcagtagt tgctttctag 6480 attatttagg cctatattca gctgttttaa tcttgcaatt ctgataatta gagataatgt 6540 ctgttcaact tgtctgtcat ttatttctca tttgctactt attttattaa tttgttgatg 6600 ttaatatact 
acatagtctt gtatatgcat atctaaaatc ctttggcatt caaatttgct 6660 ttgaacttga aagagagaaa gaatcggatt ttacctgtca gagcacattg catatgcttt 6720 ttttgaaata acatttattt aacaaatatt ttagcacatc gaaagtaatt aaattacctg 6780 tcttttgatt aaaaccttta tgcaaaaaga tcagaaaaaa aggcaatatg cgacacatga 6840 catgcatcgc tctcatgcca cgttttgagg ataaattcgc ttttaggtgc tgctttgaat 6900 aaaataaaac tattaaaaag cacattaaag aatactcata aaataaaacc ttgatagagg 6960 agtatcttat gaaaaaacac caattgacgg gctctgcttt cggttttaaa agaatcaagc 7020 agctttaaaa atataatttg ctgaagagaa atgacaggga aaaaaagatt gaacatatcg 7080 ggactgtcaa aagaacattc ctctttgata tattctttcg actccgacac atcagcaaag 7140 attatgaacg acggagtgtc ctccactgct tcagtggcgt gtcccgctgg ttataaacga 7200 ggaggcggag agaatgcgct gcttagtttc ggctcgatat atttaaataa aaactaacaa 7260 cgagctgctt ataacgctca aaaagttaaa ctcaaatgca caacccttgt tggttgcacc 7320 tgcaggatgc acaatgaacc taacgcgcca taattttaat tctatattcg tgaagaatga 7380 gccaaccgca tatgcttagt cagtcctgtt ggtcgggact ctgctttata acttattagt 7440 ttattacaat aatcacactt agctttgcca ttgttcttca catatagcca ctcaacacat 7500 cttgcggtcc atctttaatg tttagattat tgaaggtgct attgacgttt tgagaaaaaa 7560 attaatagta ctgaaactaa attattaaaa ataaacttgc gggacttcat ttatgtcctg 7620 cggcaacatt atttttatta cactttaagt gatctgcttg aggtaggtca cgtttatcaa 7680 tggtaagttt tagcctaagt ttacatcaat aacataatgt gcattgtcct ttaccataca 7740 aaaaaacaac accagttatg tatatagcaa atatttttaa atgcattaaa aggcaagcac 7800 atatcttagc cgtttgttca cactctgcct gtattaacat aaatgctgta ttttaatata 7860 gcttattttg attaattagc tttaactgac agccttttct tctttttccg cggttatgcg 7920 gcattatatt tcacacgcat atgcaaggtg actgaatatg aaaataaaat acaaaacata 7980 ctgagccatt aagtaaaaaa acaaaacaaa acaacagtta tactttaaag cagaacattg 8040 acgttttgaa acaaaaaaac ggaactgaaa ctaaattatt aaaaataaac ctgccagact 8100 taatttatgt aatattttca ttattcattt attttcatta ttcgtttatt attatgtact 8160 ataattgaaa cataaaatga gatcgtaaat ataatcaaca acgaaaatgt aaataaaaac 8220 gtaaattaaa aatataaatg aacatgtcta gtcttagttc tatcaaaata gcaaacacta 8280 catctctcta tatttagact 
aaaagactgc twatttagct ttatttattt atttatttat 8340 ttatttaaga ctaattattt gtactgaatg tttgtgcttt cattttagtt gtctttcatt 8400 tcttcaattc tattttccaa aacgaatcct ctttgcactt aataagttta caataaacca 8460 ggtaaaaaaa aaattaaatg taagttgagg tgatcgcaag ttcacaactg tgaggtgatg 8520 tgctctatcg gcaggataat ctatattatt ttgctgtttt tcatatgcgc acgtttaaga 8580 tttgagatta taacattgcc atgatagtca aagaaatcaa aacattattt tcaccccaac 8640 gcaatttaat cgggcactga aggctaataa cttaattctt atattagctt tttgacttct 8700 caccatgtat cttttctaca gcatagtata gtgtaagagc atgtctatta ccagaaaaac 8760 taaattaaaa ttatatttta attgaacaca tcacatcaca tacagtaagc tacgcctaca 8820 gaacagtctg tttagcattg tgaaaaggca aagctgaata tctgtggtta aagtgccacc 8880 cagcggtcaa atgctgctgg cgcatcaagc accgccgtcg ccgcggcatg aatggcggta 8940 caaggaacac attgaagtag taacatctg 8969 // ID REX1-1_DR repbase; DNA; ZEB; 3829 BP. XX AC . XX DT 05-JUN-2002 (Rel. 7.05, Created) DT 05-JUN-2002 (Rel. 7.05, Last updated, Version 1) XX DE REX1-1_DR is a CR1-like non-LTR retrotransposon - a consensus. XX KW CR1; Non-LTR Retrotransposon; Transposable Element; KW reverse transcriptase; CR1 clade; AP endonuclease; KW REX1 subclade ORF2; REX1-1_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-3829 RA Kapitonov V.V. and Jurka J.; RT "REX1-1_DR, a family of CR1-like non-LTR retrotransposons in RT zebrafish."; RL Repbase Reports 2(5), 29-29 (2002). XX DR [1] (Consensus) XX CC REX1-1_DR is a family of CR1-like non-LTR retrotransposons and it CC was active in zebrafish a few million years ago. The consensus CC sequence encodes one protein, REX1-1_DR1p (position 723-3311). CC The 863-aa REX1-1_DR1p protein is composed of the AP endonuclease CC (positions 1-200) and reverse transcriptase. CC REX1-1_DR copies are ~9% divergent from the consensus sequence. 
CC Approximately 1000 copies of REX1-1_DR are present in the CC zebrafish CC genome. XX FH Key Location/Qualifiers FT CDS 723..3311 FT /product="REX1-1_DR1p" FT /translation="HGRLKAKLKLIPHRLSLPSIFLANVQSLVNKMDEIRL FT RINHSKRLWNCNVMIFTETWLNSGIPDNAVFLTEHNTFRADRTADDFDQRH FT CSANLEFLMVKCRPFYLPREFTFTIVTAAYIPPDADAKLVMNELPAISKQQ FT TAHPEATFIVAEDFNHSNLKTVLPKIHQKDFCHKSGNKTLDHVHTNMAEAY FT VMNPPPPLGSIRSPFLFLTPKYSLLINRVKPSVRTIKVWPAGVDSTLQDRF FT QHTDWSIFASQANYGPYIDIISYTSSVLEYITTAIDSVTTQKQISTYPNQK FT PWMNKEVCLLLKARNTAFRSGDAQAYSTSRANLKRGIKKAKHCYKLKLEEH FT FSNSDPRCMWQGIQAISNYKPSQTTSTATNVSFLNELNDFYARFESDNKEA FT YTRITSSTDHSPITLTSSEVYTALSQINVCKAAGPDGIPGHVLKACAEQLA FT GVFTDIFNLSLNLAAVPTCFKTTSIVPVPKHCSPTCLNDYRPVALTPIIMK FT CFERLVLAHLKDSLPSTLDPHQFAYRGNRSTEDADSIALHSVLTHLDNKNI FT YARMLFVDFRSAFNTVIPSKLMIKLRDLDIDTSLCNWVMDFLTNRPQNVRS FT GHICSTTVTLNTGVPQGCVLSPFLYSLFTINCRPVNRSNTIIKFADDTTVI FT GLISNNDETAYREEIQHLATWCTDNNLLLNTNKTKELIVDFRKGRTGSHDP FT IHINGMAVEPVSSFKFLGTHISKDLSWTTNTSSLVKKAHQRLFFLRQLKKN FT QLSSAVLVNFYRCTIESILTNCVTVWYGSCSVAERKALQRVVKTAQRITGT FT TLPAIEDIQKKHCLRRARSILKDTSHPAHRLFSLLLSGRRFRLPRTKTSRL FT RNSFFPRAPF" XX SQ Sequence 3829 BP; 1107 A; 1022 C; 729 G; 971 T; 0 other; cacaacacct catattgaca gaaaaacaca gaattgttga catttttgca gatttacgta 60 ttacaaaaga aagactaaaa taccacacgg tcttaagtat ttagaccctt tgctgtaaca 120 cttatatatt taactcatgg gcggttcatt tcttctgatt atccttgaga ttattcttta 180 tttatgtcca gctgtgtttg attatactga ttggacttga ttaggaaaac cacacaccta 240 cacaatgcaa atcagagcga atgaaaatca tgaggtcaaa agaactgctt tgaagatctc 300 agaaacataa ctgtggcaag gcacagatct ggccaaggtg acaaaaatat tgctgcactt 360 aaggtgccta agagcaaagt ggcctctata atccttaaat gaaacacttt gggatgacca 420 gaacccttcc tataggcaag atggtggcgc atacacatta cgaggctgag cgtctctcca 480 gtttctgcag ttttgcagta ttattcctgc ttatttcggg tctgttcgtg cagaacagtg 540 gtgcctttac atcgtacacc cgacaggaga tcttggatat ttgtttgtgc attccggaca 600 gttttattag caatcttcga ctcatccctg agattgccag aacacccgag gctgagcggc 660 ccggctggcc gggcggatgt gcttaaaggc ggcgtcgaga cggtaaacaa 
aggcgggggt 720 agcacggcag gctaaaagct aagctaaagc taataccaca ccggctctct ttacccagca 780 tctttctcgc caatgtacag tcactggtga acaaaatgga tgagattcga ctgcgcataa 840 accacagcaa aagactatgg aactgtaatg tcatgatttt cacagaaaca tggctaaaca 900 gcgggatacc agacaatgct gtatttttaa ctgagcataa cacatttcga gcagacagaa 960 cggcggatga cttcgatcag agacattgct ctgctaacct ggaatttcta atggttaaat 1020 gtagaccgtt ttatctacca agggagttca cattcaccat tgtaactgct gcttatattc 1080 ctcctgatgc tgatgccaag cttgttatga atgaacttcc agccatcagc aaacaacaga 1140 ctgctcaccc ggaggcaact tttattgttg cggaagattt taatcactca aacttaaaga 1200 cagtgcttcc caaaattcat caaaaagatt tctgccacaa aagtggaaac aaaaccttgg 1260 accatgtaca cacaaacatg gctgaagcct atgttatgaa cccccctccc ccacttgggt 1320 caatcagatc accttttttg ttcctcacgc ccaagtactc actcctcatc aaccgtgtga 1380 agccatcagt gagaaccatc aaagtgtggc cagcgggggt agactccaca ctccaggaca 1440 ggttccaaca cacagactgg agtatattcg cttcccaggc caactatggc ccttacatag 1500 acataattag ttacacttcc tcagttctgg aatacatcac caccgccata gacagtgtta 1560 caacccagaa acagatcagt acatacccga atcagaagcc atggatgaac aaggaggtgt 1620 gcctcctgct gaaggcacgc aacactgcct tcagatcagg ggatgcacag gcctacagca 1680 cttccagggc taatctgaaa aggggcatca aaaaggccaa gcactgttac aagctaaagc 1740 tagaggagca cttttccaac tctgatcctc ggtgcatgtg gcagggcatc caggccatca 1800 gcaactacaa acccagccag actacatcca cagccacaaa tgtctccttc ctgaacgagc 1860 taaatgactt ttatgctcgc tttgaaagtg acaataaaga agcctacacc aggatcactt 1920 cctcaaccga ccactcacct atcacactca cctcctcaga agtctacacc gcactgagtc 1980 agatcaatgt gtgtaaggct gctggaccag acggtatccc tgggcacgtc ctcaaagcat 2040 gtgcagaaca gctcgctggg gtattcacag acattttcaa cctgtcactt aacctagcag 2100 ctgtgccaac atgctttaaa accacctcta ttgtgccagt gcccaaacac tgcagcccaa 2160 catgcctgaa tgactaccgc cctgtagcac tcacacccat catcatgaag tgcttcgagc 2220 ggttggtcct ggcacatctg aaagactctc tgccatccac actggaccca catcagtttg 2280 cctaccgtgg caacaggagc acagaagatg cagactccat agcactgcac tctgtactca 2340 cacacctgga caataaaaac atttatgcac gaatgctgtt tgttgacttc cgctcagcat 2400 
tcaacactgt cataccctcc aagttaatga tcaaacttag agacctggat atcgacacgt 2460 ctctctgcaa ctgggttatg gactttctga ctaacagacc tcagaatgtt agatcaggcc 2520 acatctgctc caccaccgtc acactcaaca ctggtgtacc acagggctgt gtgctgagcc 2580 ccttcctcta ctcccttttt accatcaact gtaggcctgt gaacagatcc aacaccatca 2640 tcaaatttgc agatgacacc acagtgattg gtctaatcag caacaatgat gagacggcct 2700 acagggagga gatacagcat ctggccactt ggtgcacaga caataatctg ctccttaaca 2760 ccaacaagac caaggagctc attgtggact tcaggaaggg acgaacaggc tcacatgatc 2820 caatccacat caatgggatg gccgttgagc ctgtctcatc cttcaagttc ctggggaccc 2880 acatctcaaa ggacctttcc tggaccacca acacctccag tctggtcaag aaggctcacc 2940 agcgcctatt tttcttaagg caacttaaga agaaccagct ttcatcagcc gtcttggtga 3000 acttctaccg ctgcacaata gaaagcatcc tgaccaactg cgtcacagtc tggtatggaa 3060 gctgctctgt tgctgagcgt aaggcactgc agcgggtggt gaaaactgcc caacgcatca 3120 cagggaccac actgccagcc atagaggaca tccagaagaa acactgtctg cgccgagcac 3180 gcagtattct taaggacacc tctcaccctg ctcacagact gttttcactc ctgctttccg 3240 gcaggcgctt caggctcccc cggacaaaaa cgagcagact gaggaacagc tttttcccca 3300 gagctccctt ttgaactctg cccctcactg actcttttgc cccaccccaa tacaccccca 3360 ctctcctcta acttatactc ctcacaatca ctgcactatt taacatttgc acatttaaaa 3420 tttgcacata ttcattgcac tacattgcac tgattcactt atttgaactg tacacaccca 3480 ctgcacatgg acatttgtaa ttatgtacac acccactgta catatacatt tgtaattatg 3540 tttatttatc tgcacacttc tgattattaa tagcaacctg tacatatatt catttattgt 3600 aaatctctgt tcatagctaa tacaacctgt atataatgtt catagtacat ccatctgtaa 3660 atatcaccat agtttttcta taactgcact ttataactta tacccgtatc ctgcacttgc 3720 tgctattgca ctgctggtta gacctaaact gcatttcgtt gccttgtact tgtacatgtg 3780 taatgacaat aaagttgaat ctaatctaat ctaatctaat ctaatctaa 3829 // ID EnSpm-4_DR repbase; DNA; ZEB; 9197 BP. XX AC . XX DT 31-JUL-2008 (Rel. 13.07, Created) DT 31-JUL-2008 (Rel. 13.07, Last updated, Version 1) XX DE EnSpm-4_DR is an autonomous DNA transposon - a consensus. XX KW EnSpm; DNA transposon; Transposable Element; KW Autonomous DNA transposon; EnSpm-4_DR. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-9197 RA Kapitonov V.V. and Jurka J.; RT "Zebrafish En/Spm DNA transposons."; RL Repbase Reports 8(7), 752-752 (2008). XX DR [1] (Consensus) XX CC EnSpm-4_DR is a young family of autonomous En/Spm DNA CC transposons. The consensus sequence was derived based on multiple CC alignment of several copies of EnSpm-4_DR that are less than 2% CC divergent from each other. EnSpm-4_DR transposons are CC characterized by 2-bp target-site duplications and imperfect CC 11-bp terminal inverted repeats (1 mismatch). See also commentary CC on EnSpm-2_DR. XX FH Key Location/Qualifiers FT CDS 6136..8571 FT /product="EnSpm-4_DRp" FT /note="En/Spm transposase." FT /translation="MLKCYICRTLHDAPSSLIQHLKFFHGLYPGKKFVLVC FT AQEGCSRQFKSFKGFKMHLNTCHYTTDLDASSDVMQVPHQSDFGEQSSQHN FT SSVMDQDAINEPSTSLMSKDQAKDMCASIIAKLKGSGVANSVVLSVVESME FT EYVDEIHANLEEQVLSALPAENQIRSAVKDVFSNAFNPFSDLNTNSKMTKY FT FSEKWGVVEPIEIHLGVRFDSKRNKKSGVYEQVPVNDTFIYVPLLKTLEFI FT FKNPEVCSHINKPPATDCNLYQDFCDGKYYKHHTLYSMSQNALQIQVYYDD FT FETANPLGSKHGVHKLGCLYFTVRNLPPRLNSSLMNIHLISLFHSQDSKKY FT GIDKILCPFVEDVKVLEQHGMKVSFTEQPLYGTIAQVTGDNLGLNSILGYV FT ESFSGNYFCRMCLADKGLAQTMFSENDPRMVLRSRLTNEEHYNYLCENPRE FT TSCFGLKRNSIFNSLSYFSVSDNFVLDIMHDVLEGVAQYEIKLLFGYLNQN FT FISNENILQRVYAFNYGFMDKKNRPTRINLSSSGNSIGLNASQTLCLSRNL FT PLIFGDVVPEGDRHWHLLLLLIHIVNIIFSPSITDGMIVFLKHLIREHHQL FT FSELYPQNNLIPKHHFMIHYPECIRQIGPLVHVWSMRYEAKHKFFKSSLKN FT FKNITKSLAMKHQIAVAYHWESLFTKGIESGPVKSKKLTDVDNGHLIAEHF FT RIDMLSEVNITSWIKHEDVEFHTGLVVCTGVVEELPVFNKIVCIFLRNEAY FT FLVTEMETSFVEHLHAFEVTENMHNVSVVMPHDLRFFKPFDVQMAYGADSL FT FYVVPDCCIV" XX SQ Sequence 9197 BP; 3025 A; 1502 C; 1729 G; 2941 T; 0 other; cacagcaaat tcattagtgt taaattttga gtgttgaggt aattcagagt aaagtgttat 60 aattctagag ttagataaag ataaaataac cctctcagtg ttggaagctg atttgcctcc 120 ttgtcgcaca gagttacatg tatctaacgc taatatagta ttgactacta
catccgggaa 180 tttatgcagc aatatccctg agggattttt ccatacgcca ttttcttctg ccacgaggag 240 caatgagaag cagcaactgg tgagtaaaag tacttaaaat gtttatacat agacatttga 300 cataaaaatt gttatatata cgttagtttt aaatatcata tgtaattttc catcatttat 360 ttagttctgt actatctatg ttacattaaa gaaactaaag ctgtcttaac cgctcctctt 420 ggttaaaata cgtgaaaata cggtttgtga ctgatttgaa ttgctttcct taaacattaa 480 cgcaatagtt aataaagttt tactgatgtt taacaagata acgttatatt tttattcccg 540 tgcgcgggcc cgcgcggttc ggaaaactaa cgttacctcc agctccctca cgccataccc 600 gggcactttt cacacacttt tgctggtaac agcaaaagaa aggtccacgc gtgtgaggga 660 gctgcttcgt ttatttatgt aacacagcca tacgtcgtgc tttttttgct tgcttgattc 720 aacatcaaaa gaaaaaacgc ccgttgtata gcactaacaa gtgccctttc tagataattt 780 aacacctgcc attcgccatt ttcttccatc gcgagcagga gtggtggtga ggttcacgtg 840 ctatttctgc gtgaggcgat tattctgact aaattcacga cagaaagttt tagttaatga 900 tcataattta tcaaatttag aacgaacgtt actcgtagca tgtctttgtg tgagtgtgtg 960 tgcgtgcttg cgtgtgtaat ttagcttcgt taactaacgt aaagttacat cttttttata 1020 ggaatcaaca gcttcttgag gctcaacgtt cagatatttc tccttcgggt aagtgggagt 1080 tacatgtgaa gttataaatt tagactcgag agttatgaat tctgccaatg ttgatagaaa 1140 atgccatttg cagataagct aattgctttc agaagtcatg ctagaaagtt tacgaagact 1200 agttgtatgt ttaactatct ataatttatt aaacgtgttg tcttaagttc ataggatatt 1260 ggggcaccta aaaggtacaa atataacagt taataaaaag caatttacag ttgaccagtt 1320 attttttatt actcttttgt caagcttgtc gaattgtcag tttagcgaag gtgacaagtt 1380 tatttttgct ttagtgcgtt tgttaaacag ggtagaacga tattgccgtc atcatgacaa 1440 gtgtgaaagg cagtatgtac agttagaagc ctttcaagcc ttaaaagaac gattgaatac 1500 acaacaactt cagtttcaga aggtactttt ttatggcaaa gcataagaaa gcataatata 1560 atctttctga atgttatttt gtaaattaaa ttaggtattt atcatttaga tatctaataa 1620 aaattgtact gtatgtccct tttttttcct ctgtagacgc atgttgttga tacaaaaatg 1680 ataattaagg tgcagtatga aaatcgcaag aaatacatca aattacaggc tgctgatttt 1740 gatgaattca tttctcaagg taaaattccc aataacaatt tgagtaagta ataacatgta 1800 ctaggttacg gaaccccgga agggacatag tggaggagaa aaaattggct ggttaaaaaa 1860 
aaaaaaaaaa aaaatgggtg aaagaagaaa aaaaatgggt gggaggaaaa tatatattta 1920 tattataaat atatatttat atttatttaa tatatatata tatattaagt ttttgcatta 1980 tctcgcaaaa atgtttctcc acaaacactt cctgttcact tcactcacgt aaaccctccc 2040 gttttttgcc gaaattctcc catattttac cattctattc cactttcttt aatatgcaat 2100 atatatgcaa cctctgaacc agtgaatgca tgtataagcg ctgactgaca gacgcgctct 2160 gtacaataaa ctgattccag atcagcgtct gtacgtgcca tttaaaggga cacttctgtt 2220 atcaaacaag tgagcagcta atgaatctca tgttttgttt tgtttgataa cagaggtgtc 2280 actgtaagaa tgcacagatc tataatgaaa gcagtccatt actttacaat ctgtttgtgc 2340 tcatttaaga aaaaatttgc tgtttaaaat aaaacatgta ggacggagat gtttacatat 2400 tattcagtgt atttgtctgt ttaaacatac acccgctcct ctgtaaatgg cgtgtacaga 2460 agctggtctg gcatttgctg attcagaggt tgcatataaa aggtgacgat cgacgcacgt 2520 attaatgtag agaaagtggg atagaatggt aaaatacagg agaatttcag caaatcggga 2580 gggtttacat gattgaagtg cacagaaagt gtatgtggag aaacagtttg cgagataacg 2640 caaaacattt ttgcgtggga acacaaaact ttgtgagaga acgcaaaaac ttaaaaaata 2700 tatattttcc tcctacccat tttttctttc acccattttt ttttttttca cccagccaat 2760 tttatttctc ctccactacg tcccttccgg ggttccgtac taggtggagg taatttggta 2820 atttaaaaaa atgtaacatg tttctcttta tcatcattta gtgagagaaa agttctccat 2880 tcctgctagc aacattacag tggaagatga ctctggcacg gaggtggatg aaactgtgtt 2940 tgcagaattg tctgcagtgg cagggatttg ctttgttgtg aaggacggtt tggatcacgg 3000 taaaaaataa atagtagtgg tcattttaga taatttaatt cactctgtat tcatcttacg 3060 tccattttag tggcatagtt aaccaaatgc taaaaaaaaa aactgccagc aattcctcac 3120 tagtgtgtaa agacttttac tctgttttga agaaaaggtc tcattagttg gcattggtta 3180 atatgtaaat taaaataaat gttaataatt aaaatattct ttatttagtc tttattataa 3240 aaagattacc agagtaacaa gttaccaaga ttacaaatat ggtaacacat gtatttgtca 3300 ttattactat tggataattt tactaataaa attttcaata aattattact gatatttgat 3360 atctgatata tccataagta tactgcaaat aagggtaatt gctttttata tgttttgttt 3420 tattcagaca catctcggtc atcaactcca tctgcaccgc tgtcatacag tgggagttcc 3480 ctctctgttt tgagtagtgg cagtgacagt gacttgtcaa gacaacccaa acgaatgaaa 3540 attgatgaag 
agccattaca gagtgctttg gccaaagatg taagtttatc gtctgaagtg 3600 tgcaatgggt actaagtgta aaacaaaaaa ctatataggg cagggcgata attcggtatc 3660 gataattatc acaatatgta tttttttcga taaaacaata agtgttcgat aatatttatg 3720 cagtatgcgt aaggctgcgc aggcattttg cagcctgcat ttccagatgc cacacgcagt 3780 acaagcttac agccatacag tattagttgc acttggaggg gtaagaaaaa aataaggtaa 3840 aaaaatgaag atgctaataa cattacagac ccaaacaagt gttggtaagc cagctcaact 3900 gcccagcaga gtttcatcat ttttgaaagc agtgccttac gagaaaaaaa ccctggaaaa 3960 aaaaaaaaaa aaaaaaaaaa aaaaatatat atatatatat ataattttac aaaatgtctt 4020 aaatcagctg cactaagcgg ttggcattct gaaaatcttt aattaatata tataagattt 4080 gacatttatc gtgataatta tcgatataga ctgatatgaa ataaattatc gtgatatgat 4140 gattttccca acaagcatta ctcttgttgg cttaataaaa caattaacag tctaatttac 4200 attttagctt ataaagcaga tcctccaaac aaaatcagga gggacaaaag tgttggaaga 4260 atatgacgaa actgggacat tgtgtgacag tacaagaagg caaatggtta acattcttgc 4320 tgcctacatg gttgaaatgg aagggtaagt ttgttgtatt aatattaggc agttaattgc 4380 tataaagctt tttttctctt aagttttttt cgtgtacttt ttcatattgc gcaagtatta 4440 aaatcaattt ttatttattt ttgcaggaga atccctcagc ggagtactaa agaaaaatat 4500 gcattgggaa ttgtcacctt gttccctgca ttaaaagatc ccttttcaac aaaaggatat 4560 gtaagttatg tatttatttt aacatattga tttaaaatgt atttatcatt aagaaaaggt 4620 aatgaatttg cttgacactt ttaaggactg tcacaataat tagtgcatgt gcgtgtgttg 4680 tatttgttaa attatgttag tgtggattaa ctaatgaaaa agacaacagt agcaacatta 4740 atttgaaaca ttttaataaa atattataca taatatttat actatgaaaa agttatgact 4800 gtgggattcc ttctatttgc ttataataaa gttaaaccaa cttatgtctc tgtttttgat 4860 taggaacatt tttatgatgg ccaaagtggc tctggatttc ttgcgtggag gttaaagaca 4920 attcaacgaa agactaaaat tgagttcaga gagtttaaaa tacagaatgc aggagcaggt 4980 ggtccaaccc agtaaaggga gctgccttcg gctgctgatc agttggatga agaacgttgt 5040 aaagagttaa tttccttaat gaaccacacc actgaccgag aaactgtcct gcaaaagatg 5100 aaggagacct tctgctatag acagcgcctt atctacaatc ctgacgagtc gcacaacatc 5160 ctcacagtgt ttccaaggct gttggacacg aaagggctgg taagtgatga aatgatttcc 5220 gtgaactaca attcatatct 
gtaaagcatg agatggtcag taatactttt ttgttgttct 5280 gttcctgcag atagatcaag attttagcct cctatttggt gcagaaacag ctgccaaact 5340 gcttgagaag tggcctacct tctataagga aaaggtgaac agagaagcag agagacttac 5400 taccacctca gtgctccaaa gcttgctgaa ttcagccagg aatctgtaca atgatgagtc 5460 ttccgaggat catcgaggta tgtcaaagaa aaatttaata tctaacagag agctctaatg 5520 tgaatttatg aaaaacagct aataagggaa aataataaat tgttctattg tttcttacag 5580 agtgggacag tgatatggca tcttttttgc tgctcctgca ccttctacca cctcagcctt 5640 ctaaaaagaa aaaacagaag atcagtgcag ctcaggccat ggaacatcta gttgtgtttc 5700 acaaggtttg tgcagagcta cagtttgtga ttaactgggc aagataagct taattaattt 5760 agtgcataat ctaagattat catgatgatc tattttttcc tccagtcaaa caacagtttt 5820 gaagaacact tcgcaaaaca ggagggacat cgccaaccat acctccttgc ttcaggaatg 5880 cacaagagcg ccatcagcaa ttactttatt gcaatggaca agatgatcat cccatgccag 5940 ggaaccacct cgttggcagc cattgatgaa ctgtttaaag cacacttcgt tttcagtgta 6000 agctatgatg atgcactcag caacatgtac acattcctcc agacaacagt ctacggtgta 6060 gatgttgaca ccactaaaga aagtccaaag gtgaaggagt tacgagcaaa gttcatgaac 6120 agaaactaaa agactatgtt aaagtgctac atttgcagaa cattgcatga tgcacccagt 6180 tcattaattc agcaccttaa gttttttcat gggttatatc ctggcaaaaa gtttgttctt 6240 gtttgtgcac aagaaggatg ctcaaggcag tttaaaagtt ttaagggttt taaaatgcat 6300 ttaaatactt gtcattatac tacagatctt gatgcaagca gtgatgttat gcaagtacca 6360 catcagtcag actttggtga acagagctca caacacaact cctctgtaat ggaccaagat 6420 gcaatcaacg aaccatcaac atctcttatg tcaaaagacc aagcaaaaga tatgtgcgcg 6480 tcaattattg caaagttaaa gggcagtggc gttgcgaaca gtgtagtgtt atctgttgtt 6540 gaaagtatgg aggagtatgt tgatgaaatt catgcaaatc ttgaagaaca agtgctcagt 6600 gctttacctg ctgaaaatca aataagaagt gcagtaaaag atgtctttag caatgctttc 6660 aatccattta gtgacttaaa cacaaattcc aaaatgacaa aatacttcag tgaaaaatgg 6720 ggtgttgttg agccaattga gattcattta ggagtgagat ttgattcaaa aagaaacaaa 6780 aaatctggag tatatgaaca ggttccagta aatgacactt tcatttatgt acccctgtta 6840 aaaacgctag aatttatttt caaaaatcca gaagtatgta gtcatattaa taaacctcct 6900 gcaacagatt gtaacttata ccaagacttc 
tgtgatggaa aatactacaa gcatcacaca 6960 ctgtattcta tgtcacaaaa tgctttgcaa attcaagttt attatgatga ctttgaaacc 7020 gcaaaccctc ttgggtcaaa acatggggtt cacaagcttg gatgtttata ttttacagtc 7080 cgaaatttac caccacgttt aaattcgtct ttgatgaaca ttcacctcat ctctttgttt 7140 cattcccaag attccaaaaa atatggcatt gacaaaattc tttgtccatt tgttgaagat 7200 gtaaaagtgc tagaacaaca tggaatgaaa gtgtcattta ctgaacaacc tctttatggt 7260 acaattgctc aagtaacagg ggacaattta ggtctgaact caatccttgg ttatgtggaa 7320 tctttctctg gaaactactt ttgcagaatg tgtcttgctg acaaaggatt ggctcaaaca 7380 atgtttagtg aaaatgatcc acgtatggtt ttgcgcagca gattgacaaa tgaggagcat 7440 tacaattatc tttgtgagaa tccgagggaa acgtcatgtt ttggcttgaa acggaacagt 7500 atattcaatt ctttgtcata cttcagtgtt tcagataatt ttgttttaga tatcatgcac 7560 gatgtcttag agggcgtggc acaatatgag attaagttgt tgtttggtta tttgaatcag 7620 aacttcattt ctaatgaaaa catactccag cgtgtatatg cattcaatta tggtttcatg 7680 gacaaaaaga accgtccaac acgcataaac ctgtctagta gtggcaacag tattggactt 7740 aacgctagtc aaacattatg ccttagtaga aacctcccac taatcttcgg tgatgtggtc 7800 ccagaaggtg acagacactg gcatttactt ctgcttttaa tccacatagt aaacataata 7860 ttttccccaa gtattacaga tggaatgatt gtatttctaa aacatcttat tcgagagcat 7920 caccagctat tcagtgaatt gtatccccaa aataatttga taccaaaaca ccatttcatg 7980 attcactacc ctgagtgtat acgccaaatt ggtcctttag ttcatgtttg gagtatgcga 8040 tatgaagcaa aacacaaatt ttttaagtcc agtttgaaaa atttcaagaa cataactaag 8100 tcccttgcga tgaaacacca gatagctgtt gcataccatt gggagtcact ctttacaaaa 8160 gggattgaat ctgggcctgt taagtcaaag aaactgactg atgttgacaa tggtcatttg 8220 attgcagaac attttcggat tgatatgtta agtgaagtaa atatcactag ttggattaaa 8280 catgaagacg ttgagtttca cacaggtctt gttgtttgca caggtgttgt tgaagaattg 8340 ccagtgttca acaaaattgt ttgtatattt ctgaggaatg aagcttattt tttggtaacg 8400 gaaatggaga cctcatttgt ggaacattta catgcatttg aagttactga aaacatgcat 8460 aatgtttcag ttgttatgcc tcatgactta agattcttta agccttttga tgtacaaatg 8520 gcttatggtg cagactcttt gttttatgtt gtaccagact gctgcattgt gtagattgca 8580 agataatgtt ttaagttttg tttcaggaga ttttatgtac 
aaggtcattg tacagagtgt 8640 tttaagtgta atgtaaaacc atgttaaaga aaataaaatg tattcattgc agcacatatt 8700 tattgtatga gtggtgaatt gaaatgatac attgtaatta tagaattaac gacagggcat 8760 aatgtaaata caaatgtaaa atatgtcaaa tgaacactgg tgaagtgttg aaaatttaac 8820 ctaagagtgt taacacaaaa cactgagtgg tgttcatatg atcatttttg gtgttaatat 8880 gttacactat aagagtttgc aaactaacac tctcggagtg ttgactatgt taacaccatc 8940 aaaataacac tggtaaagtg ttgaaaaatt tacctaagag tgttaaaaca taacactgac 9000 tggagttcag ataatccttt gtggtgttaa tattttacac tataagagtt tggtaattaa 9060 cactttctga gtgttgatga cgttaactct ttcaaaagtg ttatttcaac acttttccag 9120 tggttcccat ataaactctg agaaagtgtt aaatttaact ctaaggtagt taaatctaca 9180 atctaaaatt tgcagtg 9197 // ID DIRS-6_DR repbase; DNA; ZEB; 6526 BP. XX AC . XX DT 23-NOV-2008 (Rel. 13.11, Created) DT 23-NOV-2008 (Rel. 13.11, Last updated, Version 1) XX DE DIRS-like LTR retrotransposon family - a consensus. XX KW DIRS; LTR Retrotransposon; Transposable Element; KW reverse transcriptase RNase H; phage integrase; DIRS-6_DR. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-6526 RA Bao W. and Jurka J.; RT "Families of DIRS-like endogenous retroviruses in zebrafish."; RL Repbase Reports 8(11), 1737-1737 (2008). XX DR [1] (Consensus) XX CC This sequence was derived from sequence data generated by the CC Danio rerio Sequencing Group at the Sanger Institute. 
XX FH Key Location/Qualifiers FT CDS join(1874..2869,2746..3183,3084..3524,3439..3906) FT /product="DIRS-6_DR_1p" FT /translation="EFSSKKPLIIGLSAPHNSIFPSINSTIPSDEYSLNYH FT DIDQAISLISLAGRNAWLAKIDISSAFKIMPIHPDFWHLFGIHWRSQFYFA FT VRLTFGCKSSPKIFDMLSEALCWILSNNYEIPHIIHLLDDFLLISPPSSPP FT AKHLSITQKVFENLGIPLAEEKTAGPSTSIEFLGINLDSNKFQASLPKEKI FT DRIISLSQIFLEKQSCTKRELLSILGHFNFAMRIIPQGRPFITHLLQLSSS FT VPGLEDTIYLSKPSRNELSLWISFLKQWNGCSFFYSDLISSPVDINLFTDA FT APSVGFGGFYQGHWFASTWPPQMLSLPRNQQSSALSNSTPRCPLSRVRRLL FT PRSLVCFNVAPADAQSTQKSAIICAFELYPIVAAALLWGDEWSASSILVHC FT DNEATVYCINKGRSHALPIMPLLRRLVWTAAKKQFIMTARHVPVCKNQIAD FT SLSRFLFQKFRLLVPEADQHPTPVPLYSQXILPFSLSLSFPEISALGTGSR FT PASNTCSTLFTNXIAINHPLHRLHETSISLILHAVAPRTLESYLTAWKSYK FT YFHTLYQIQFPDFSLLTITSFISHLHTAKNIQASSIKSYLSGVQFFHKLIY FT GATSEAISNAQTSLLIKGIQKSPPPPSFTGPPQKPFLTLKLPSSLKVSRNH FT PPPLPDTRLPITLNILAKCIRTLRKGYLSLHTARTLDAMFTLAFFGFLRCS FT EMAITSNFNPAIHPTISDLTLLNAETLAFFIKQSKTDQTSKGHFIYIFNIF FT SPTQPFQTLLAFLHSRRAKESDPHAPLFY*" XX SQ Sequence 6526 BP; 1734 A; 1893 C; 1098 G; 1797 T; 4 other; gtgaagttta tgtataaaca aatttcgaga ggatcacgtg cttatgattg ctagcagctg 60 acccgcatta tccaattcac tacgatccaa tcagatgact cctaaactac tataaatacc 120 ctggggttta ttccattgct atcttcgttt tgaagaggca gcttcaccgt agctagctcc 180 gttgaagaac caactctgta ccagcatgga caaccaaagc agctaccagc taccatctac 240 cagctaccat ctaccagcta caatctacca gctaccatct acaaattacc agctataatc 300 tacaagctac tagctataat ttacaagcta caatctacag cagcagcagc aacaactgca 360 gaaacaacaa atgcagcaac aacaacaact actactatta ctacttcaaa acaactccaa 420 caattgcatc aacaacttca acaaaacatc aaaccttcca cttcaacgca tctgctgtgt 480 cttcaaccat attctccagc aaatgacagc caaaactcca aagccattgc aatgaaactg 540 aaccattacc tttttcctag cggtgcacat gctatgcaca tgaagggaag ctttgcaata 600 taactttacc aaacatttag actaaaacat ttataaatct tttgaataac aagactagcg 660 taaacaatga ttttaactaa agatacttgc ctcttgcccg tccactgatc cataacattt 720 aacagcttgt ttgctggaat gttcagcaga gcaacagctc aagtcataaa caatggtgaa 780 cttgatcaac aaaatggccg ccgggctttt tcacctttgg 
cacgcttgac tggctctcct 840 ttaagccaat agctgtaagg aaaagcgtca ccatccaata agctctctgg agaaggtccc 900 gccctctctc ttgactctgt tgcaagtctt atgaacgaat ttgggccgga cccgcttgtg 960 aatgaagtta agataccaac aaatgttttt catggattaa agtaaagcca attactaaca 1020 ttccattcac agttttacta agtggtgaca tcgcccttga agtcttatag agttcagagt 1080 ttgatgttag caatataaaa aaaataataa taataaaaat aaaaaaaaaa aaaataaata 1140 aaataaaaaa taaaaaataa ataaatttat acacagagca ctgaggtcta attacactag 1200 accagtkact catttattca tacactagtg tcttatgtaa gcttataatg gcaagtcagt 1260 tggatgtgaa acagcaacaa aacaaacttc ccgattggct gctatgaata ggaggacatt 1320 tctgacaacg gatcttcaat aaacacttcc aggaccacca gacaaaccat accccttatc 1380 ctcagctctc catccatttg ccattctgag gggagatgaa tgctcacctc taccatcctt 1440 gcacgtagcg acatcaagct atcagaccat gcattaataa aaagaccatg acatttggct 1500 gcagaacctg accctcatcc aacccattac attttatatg ctcttcaaat acaatcacac 1560 tctacataaa ggatgcattt ccactcacat ggctcacatc ctcgatgcca tatttctcaa 1620 attgctttga attgcattgt caccctgata ctaacttctg atttcaggtc taacccaagg 1680 atctaacccg ggcgtttcag ctctcccttc cttaacctta ttctgctcca atctacaatc 1740 tgcgactacc gaacctgcaa cagtgaattc gttaatcaaa aaagaaaaga attgataata 1800 aattcatgat agtaccattc ttgccccact gttcagcgtt tcttgtatta gtcctattga 1860 cattgcgact tgagaattct ctagcaaaaa acccttgatt attggtctct ccgctcctca 1920 caattctata tttcctagca ttaacagcac cattccatca gacgaatact cgctgaatta 1980 tcatgatata gaccaagcaa tttcgctcat cagtttagcc ggtcgcaatg cttggcttgc 2040 aaaaatcgac atttcatccg cctttaaaat catgccaatt cacccagact tctggcacct 2100 ttttggcatt cattggcgct cacaatttta ctttgcagtc cgattaactt tcggctgcaa 2160 aagtagtcct aaaattttcg atatgctttc agaagcatta tgctggattt tatctaataa 2220 ctacgaaatc ccgcatatca tccatctcct cgatgatttt ctcctcattt ccccaccttc 2280 ttcacctcca gctaaacacc tatcgatcac ccaaaaggtt ttcgaaaatc taggcatccc 2340 cctcgcagag gagaaaacag ccggtcccag cacttccata gaatttctgg gcatcaactt 2400 agattcgaat aaatttcaag catcccttcc caaagaaaag atcgaccgta taatctcttt 2460 atctcaaatt ttcctcgaga aacaatcatg caccaaacgc gaactgctat 
ctattctcgg 2520 gcatttcaat ttcgcgatgc gcataattcc ccaaggccgc ccgtttatta ctcacctcct 2580 tcagctctcc tcctcggtcc ccggtttaga agataccata tatctttcta aacccagtcg 2640 caatgaactc agcttatgga tctccttcct taagcaatgg aacggctgtt cctttttcta 2700 tagcgaccta atttcttccc cggtggatat taatttattc actgacgctg ccccctcagt 2760 cgggttcggc ggcttttacc aaggtcactg gtttgcttca acgtggcccc cgcagatgct 2820 cagtctaccc agaaatcagc aatcatctgc gctttcgaac tctaccccat agtcgcagca 2880 gcgcttttat ggggagatga atggtccgcc tctagcattc tcgttcattg cgacaacgaa 2940 gccaccgttt attgcattaa taaaggacgc tcgcacgcac ttccaattat gcccttacta 3000 agacgcctcg tttggacggc agccaaaaag caattcatta tgactgctag acatgttcca 3060 gtttgcaaaa atcaaattgc tgattctctc tctcgctttc ttttccagaa atttcggctc 3120 ttggtaccgg aagcagacca gcatccaaca cctgttccac tctattcaca aatkatattg 3180 ccataaacca cccattacac cgcctccatg aaacttccat atctctgatc cttcacgctg 3240 tcgcaccaag gacccttgag tcatatctca cagcatggaa atcttataaa tatttccata 3300 ccctgtatca aatacagttc ccagattttt cattgcttac catcacctca tttatttccc 3360 acctccatac agccaaaaat atccaggcaa gctccattaa aagctacctt agtggggtcc 3420 aattttttca taaattgatt tacggggcca cctcagaagc catttctaac gctcaaactt 3480 ccctcctcat taaaggtatc cagaaatcac cccccccccc ttcctgacac tagactaccc 3540 attacactca atattttagc aaagtgcata cgcacactgc gcaaaggata cctctcactc 3600 cacacagccc gtacactaga cgccatgttt accctggctt tttttggctt cctgaggtgt 3660 tcagaaatgg ctatcacatc caatttcaat cccgcaatac accccactat atctgatcta 3720 acactgctga atgctgaaac actcgccttt ttcattaagc aaagcaaaac agatcagaca 3780 agcaaagggc attttattta catatttaac attttttccc ccacacagcc attccaaact 3840 ctcctagctt ttctccattc aaggagagca aaggagtctg acccacatgc cccacttttt 3900 tactgatgac gctaaccgtc cagtaacccg tttttggttt caaaaacatc tgaaagaagt 3960 ccttcgcctt tcgggcgttt cccccgaatt gtattccagc cattcattta gaattggcgc 4020 agccaccaca gcagctcaca aaggtctatc ctcgcaccaa atccagaccc taggccgctg 4080 gtcctccgat gccttcaaag cttacattcg cctcagccga tcccacctca agacagccca 4140 gctagcccta atcagctaaa ttccaaactc caactagggg tacgagctcg accctaagta 
4200 ggctggagta cgcgtgtgtg tgtgcttgtg tatgggtgcg catgggaatg tgtgtgattg 4260 ctcgcgcaaa tacagcgtgc gtttgcacac gtatacatgc ttgtttgcac gcggatatgt 4320 atataggtgc gggcgtacgt ctaagtatgc atatgtacgt gcaagcaaat gagtatgtga 4380 gcctaggggc attacatgtt gtgcaggttg catgtatgct cttaactttc ctttcaaacc 4440 tactatgctc cttccctcac catgctctga cccccgcagg ggtccaatcc gagtttcgac 4500 tctcgcgagt cacccctagc ccatccccgg ccaccccttc actaccacct ccagtaagag 4560 cctgcactac cccctccctc ttcccgactc ttactgcccc ccccccccca tagctctgac 4620 ccccgcaggg gtcgcgctga gcttcgactc tcgcaagagt caccatttgc gccgccccac 4680 ccaggccatt agaaatggta tgtacgcgca tgttatacgc atacatccga tgtatacctg 4740 ttccgatgcg tgtgcatgca catgccgaaa caggaataga cgcacgtgca tgctcacgca 4800 cgcaggttat tccctctctc cttttcttcc tttggcgtcg agttcctccg ccctctcttt 4860 tcttcctttc tcataggcca tcaatcctca gctctaactc ccgcaggagt gactaaaacg 4920 agcttcgact ctcgcaagag tcccgccccg gccacctcct gctgcagtct ccaccgcccc 4980 ctccctttcc cagacttcag caggagaaat gcccacagca gctctgaccc ccgcaggggt 5040 cgctccgagc ttcgactctc gcaagagtca atcaccgccc atgcccggcc cttaccatgt 5100 aataaccata tatattcata tatatttata tatgcatata tactcctata ttctcctttt 5160 cttctttctg gcgtcgagat tctccgcctg gcaattattt gccccttctt tcttccttac 5220 agcgtgagtg cttccgctac ttttaccctt ctctgatctt cccccttatc tccttaccct 5280 cctatttcct caatttcttt tccagcgttg agttctccgc taactgtctt tctttcagcg 5340 tcactctttc gctatgttct tcctttcgct taggccctca atcctcagct ctagctcccg 5400 caggggcgac taaaacgagc ttcaactcct gccggagttc aggccctggc cacctcccac 5460 cctctcatgc aagagtctcc accgcccaaa tcttcccgac tcttgcttga gcaatctaac 5520 cccagctccc acccccgcag gggtgtatat atttgagcct tgactcccgc ggagtcaatc 5580 cagccctgcc ctcaccccgc ctaggctcta gcagtctact ttgctctgct cccktagaag 5640 cttctcttcc taagctatca acgctatatc cagcagccgg atatggcttt gttacacctg 5700 ctttttgggg ggctcttcaa tacgcggctg ctgtcccgag ctattcaaag gcatttttgg 5760 ggagttctcg agatctacct gagctcaaac tcccctcccg ctttgcaacg ggagggagcc 5820 ctgggctcga ggatctcatg agctcagggc tctctcccgg gacagcatgc caaacaagct 5880 
tttataatta atcatcagct aagtgtgaac tcttgaagtg aagtttatgt ataaacaaat 5940 ttcgagagga tcacgtgctt atgattgcta gcagctgacc cgcattatcc aattcactac 6000 gatccaatca gatgactcct aaactactat aaataccctg gggtttattc cattgctatc 6060 ttcgttttga agagtccccc cttcctcccc tacgcctcct tcttagacgg gtgacacggt 6120 ggcccagtgc ctagcactgt cgcctcacag caagaatgtc tctggttcct ggctttacca 6180 aaactagcag acatttctgt gcggagtttt gcacttctcc ccgtgctcac gtgggtttcc 6240 cccgggttcc ccggtttcct cccaccgcct aaaaacatgc aattaagtca attgaacaak 6300 ccaaattgtc aacatagaca cgctcctagt tagtaattag tttcaagagc attcacttgc 6360 tacagcaggg gagttctcga gatctacctg agctcaaact cccctcccgc tttgcaacgg 6420 gagggagccc tgggctcgag gatctcatga gctcagggct ctctcccggg acagcatgcc 6480 aaacaagctt ttataattaa tcatcagcta agtgtgaact cttgaa 6526 // ID L1-4_DR repbase; DNA; ZEB; 5548 BP. XX AC AL807749; XX DT 04-AUG-2002 (Rel. 7.07, Created) DT 04-AUG-2002 (Rel. 7.07, Last updated, Version 1) XX DE L1-4_DR is a non-LTR retrotransposon from the L1 clade. XX KW L1 clade; L1-4_DR; Non-LTR retrotransposon; endonuclease; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5548 RA Kapitonov V.V. and Jurka J.; RT "L1-4_DR, a family of non-LTR L1-like retrotransposons from RT zebrafish."; RL Repbase Reports 2(7), 24-24 (2002). XX DR Genbank; AL807749; Positions 100605 95058. XX CC L1-4_DR is a family of L1-like non-LTR retrotransposons. This CC family was active recently (no stop codons in ORF1; CC a few stop codons in ORF2). CC It encodes two proteins: CC the 459-aa L1-4_DR1p (positions 171-1547) and the 1294-aa CC L1-4_DR2p CC (positions 1577-5461, a conceptual translation). CC These proteins are most similar to the corresponding CC proteins encoded by other L1-like elements. 
L1-4_DR1p is a CC putative CC RNA/DNA binding protein, and L1-4_DR2p is composed of the CC AP endonuclease (aa positions 1-200) and reverse transcriptase CC domains. CC KRN*K*KNKSFF*KWMKIRLVSMIVLSRCFIHVCVTLCLKKPKSREFWEKLFENFDTSNIWKNVRSILKS CC PALENLDFMLRHNCIMTEIIFKKIGVSQDDLCKVCLEKKEGVLHLFLNCKKLSDFMKMLKTMVCNFLYDE CC NIILEEWDVLFGFNGKTKNKFALNYMLTLARYTIWKRRNIMKQKKKEIPLVLLYKQIVTEEIMVIYDYCK CC MYEKMDIFEKCIRKNNPYIVQTWTGFKVFLPGDF. XX FH Key Location/Qualifiers FT CDS 171..1547 FT /product="L1-4_DR1p" FT /translation="MDTLVKELELKMDYGSMDKATPENNNSRDSNNGIDDN FT DTWATVVARRRKTKSDTSREGSGEMQQTKGKANAEHLENSQQSSHRNEMIN FT KLKSQARFQRQYKKETTLTMTVKDPENITVTMIIKAVEDKTGIGKLFGLRK FT KSNFDYELTMENETDCDHLMDGLMINQQFCEVSKLCATERMVSFLNLPNYI FT QDSEIIQKLVDWGVSPILPLRRRYHPGTTVADGTRFIRVKFPKEVMSLPYN FT VKFDTEEGPKYFRVIHDQQIKTCRLCGSAEHEKKDCPQFVCRECLEQGHFT FT RDCKAPRCQGCKKTILWCRCESDEEETGVMETNKQMEKSSNEEREEEVQEL FT PPDDLNEQEEEQAMSEEDEGDTADTEAQDLMKDDGHDMEEEQGAAGENTES FT RIEEEVSDDDDNEEINIGTKDRTIDSINRRRRKTVQLNIQQVLKKQKLRKE FT AKAKLKTERTDLRF" XX SQ Sequence 5548 BP; 2269 A; 553 C; 1134 G; 1592 T; 0 other; tgctttcaag gaagtgtgag gtggcagtag ggagagaaag gctctcccat tttgatttgc 60 tttattgttt tttttcttga ttttgcttaa attaaattgt attaattttg tttagtttta 120 gttaaacccc agacagtgtt atctgtttgg ggttaaaacg ttttgaaagg atggacactt 180 tggtaaagga actggaacta aaaatggact atggcagcat ggacaaagct actcctgaaa 240 acaacaattc aagagattca aacaacggca tcgacgataa tgacacatgg gcaactgttg 300 tggcaagaag gaggaaaact aaatcagaca caagtagaga aggaagtgga gaaatgcaac 360 aaactaaagg taaagcaaat gctgaacact tggaaaacag ccaacaatca agtcatagaa 420 atgagatgat aaacaaactg aaaagtcaag ctagatttca gcgacaatat aaaaaagaaa 480 caactctgac aatgactgtg aaagatcctg aaaatatcac tgtaacgatg attataaagg 540 ctgtggaaga taagactgga attggaaaat tgtttggact gaggaaaaaa tccaattttg 600 actatgaact tactatggaa aatgaaacgg actgtgatca cttaatggat ggactaatga 660 ttaaccaaca attttgtgaa gtatcaaaac tctgcgcaac tgagagaatg gtttcttttt 720 tgaacttacc caactatatt caagatagtg aaatcatcca aaagctggtg gactggggag 780 tttctccaat 
tctcccactg agaagaagat atcatccagg aacaactgtg gctgatggaa 840 caaggtttat cagagtgaaa tttccaaaag aagttatgag tcttccttac aatgtaaagt 900 ttgatacaga ggaaggacca aaatatttta gagtgataca tgatcagcag ataaaaacat 960 gcagattatg tggaagtgct gaacatgaaa aaaaagactg cccacaattt gtgtgtagag 1020 aatgtctgga gcaggggcat tttacgcggg actgtaaagc cccacggtgc caaggctgca 1080 aaaagacaat attgtggtgc agatgtgaat cggatgagga agagactgga gttatggaaa 1140 caaataaaca aatggagaaa tcaagcaatg aagaacggga agaggaagta caggaattac 1200 caccggatga tttgaatgaa caagaagagg agcaggccat gagtgaagag gatgaaggag 1260 atacagcaga cactgaggca caagatctaa tgaaagacga tggacacgac atggaagaag 1320 aacaaggagc agcaggtgaa aatacagaaa gcagaattga ggaagaggta agcgatgatg 1380 atgataatga agaaataaat attgggacta aagacagaac aatagacagc ataaacagaa 1440 gacgcagaaa aactgtacaa ttaaatattc agcaagtgct taagaaacaa aaattacgaa 1500 aggaagcaaa agcaaaacta aaaactgaaa gaactgatct aagattttag atcataaaaa 1560 agactacaaa tgattgatgg attgtttcat ttggattttc ttttgtttgg atggcttgtt 1620 ttcttttata tttctaatga acaacttatg tttggtttca attaatgtaa gagggctgtc 1680 atccaaagtg aagtttgaaa atgtaattgc tttaacaaaa aaatgtgatg ttatatgtat 1740 acaagagact ggatggaatg aaaacattgt taatgattta aaaaaatgtt gggatgggga 1800 aatattgtat aataatgacc caaataagaa aaaaggcatg gcaatattaa ttagaagggg 1860 aataggatat acatttgatg ttttatttaa agataattat ggaaggattt taactattaa 1920 aattatgaat aaggatgaag aaataagaat atgtaatata catgctccaa atgaagactt 1980 ggaaagagtt acttttttta aagatctaag tgttttaatg agtggatgga ataatgttat 2040 tgttttagga gattttaata ctgttttaga aagaatagat gtagatgatc atatggtgtt 2100 tagagcagat gttggaagaa gagaactgaa acacatgatt gaaaaacata aatatgtaga 2160 tgtatggaga gagagaaata gagccaaaag agaatactca agaaggcagt gggtgaatac 2220 agttttaaaa caaagcagat tagactatgt tttatgtaca agaaatgtag aatcttttat 2280 ttcaaatatt ttttacaaga tttttagctg tagtgaccat gattttctgt atgtaatgat 2340 ggatttcagt ggagttgaaa gaggaccagg tgtatgggtg tttaatacag agcttttaaa 2400 gaatgatttt tataaaattg aaatggaaaa cattattatt aatagtgtga atgatgagtt 2460 atatgatgaa gaaataagtg 
tgtggtggga caatgtaaaa ttagaggcca aaagattttc 2520 aatagaatgt tcaaagaaaa tgcagaaagc caaaagagct aaagaaagac aattaaacaa 2580 agaatgggag aatgaaatgg aaaagataac agaaggaaat atggatatta ggagaatagt 2640 gatattagaa gagaaactga aaaaactaga agaggaaaaa tgtatgggag ctagaataag 2700 aagcaagata aaaaatacag tggaaggaga aagaagtaca aagttctttt atgatctaga 2760 aaaaacacga caaaaagcag atttgataaa gaatgtctca acaaaagaga aaactgtcaa 2820 agataaagaa agtattttaa gaacagttaa agatttctat gaaactttgt ttaaagcaaa 2880 aggagttcat gaagaagata aggatttttt attgaatcaa ataaaggtta aagtaagcga 2940 agaggataaa aaactgtgtg atagtgatat aactgaagag gagatcaatg aagctataac 3000 acaattaagt aatgggaaaa gccctggttt agatggtttg tcatctgaat tttataagac 3060 ttttaaagat gttttaattc caattttaaa agatcttttt attgctattt ttaaaaaagg 3120 acagttgagt gagagtatga agaaaggaat gattaaaatt atttataaaa ataaaggtga 3180 taaagattat ttgcaaaatt atagaccttt aagtatgctt aatacagatt ataaaatatt 3240 agcaaagatt ttagcaaaca gacttaaaaa ggtagttccc actcttatta ctactaacca 3300 ggcttatggt gttataggta gagatatagc agacacagta acaagcatca gagatttaat 3360 ctggtacata aaagaaaaaa aagatgaagg atttttattc agcatagatc tagaaaaggc 3420 ttttgataga gttgagcata gctatttatt tgacataata cagaaatttg gctttggtga 3480 gaattttatt aagtggataa aatgtttttt atacagatat ttttagttgt tttaaaataa 3540 atggattttt aaccgactac atggagattt ctagatctat aagacaagga tgtcctttat 3600 cagcgttatt atacacatta gttgctgaac cattaggctt agctataaat ggagaaaaga 3660 aaattaaagg gtttaaaata gaaatcaata gaacagagca gaaaatttac cagtatgctg 3720 atgataccac tctattttta aaagatttta aaagtgttgg aaaagctatg gaaatatttg 3780 ataaatattg tcgaggatcg gtagcaaaag taaataaaga aaaaactgaa tatatgaaga 3840 tgggaaaagt agacgttcaa caaggaaatt gggaatataa agaacaaaaa aaatacataa 3900 atatcttagg cattacactg ggatatgatg aaaataaaac tagagaaata atttgggatg 3960 aacttataaa taaaatggaa aaaagattat gtttttggaa acagagagta ttgtttttaa 4020 aaggaaaagt actggtatta aattctcttt ttctatctaa gatgtggtat gttttaagtg 4080 ttgttagtct acctacgtgg gtgtataaga aattaaaaac tatgatttta aactttttat 4140 gggatgataa accatctaaa attgcatata 
acactatcat tggaaaggtg gatgagggag 4200 gactaagact tatagatcca tggataagaa taaaaagcat gagaattaaa acattaaaaa 4260 agtttttaaa tgaagacaat attctatgga aaagcataat gagttatttt attaataaat 4320 gtggacaaat aagagatgat ttcttatgga tggcatttaa agaccgcatg atagaaaata 4380 ttcctgagtt ttatgaagag ttgttgagaa catggaaatg cttttataat aatatacaaa 4440 ctgagattga agggagaaaa ctttatttac agcaacccct atttttaaat cagaacaata 4500 aaagcaaaaa gcaaatgttt tatgagaact ggtatgcagt gggttttaga caagtaaaag 4560 acattttata tgaaataaaa cctgggtttt taccgactca agcaataata gatacactgg 4620 aggaaataga agatgtagat gataaagaaa aaattgaaga tcaatacaag aagttaagat 4680 tagcattacc agatcactgg atcaaaacta ttgaagagaa tgaaaaaaga aactgaaaat 4740 agaaaaataa aagttttttt taaaaatgga tgaagataag attagtatca atgattgtcc 4800 tatcaagatg ttttatacat gtttgtgtaa cactgtgttt aaaaaaacct aaatcaagag 4860 aattttggga aaagttattt gaaaactttg atacttcaaa tatatggaaa aatgtaagat 4920 caattttaaa aagtccagca ttggaaaact tagattttat gttaagacac aactgcataa 4980 tgacagagat tatctttaaa aagattgggg tatcacaaga tgatttgtgt aaagtgtgtt 5040 tggaaaaaaa ggaaggcgtg ttacacctat ttttaaattg taaaaagttg agtgatttta 5100 tgaagatgtt gaaaacaatg gtatgcaatt ttctgtatga tgaaaacatt attttagaag 5160 aatgggatac actgttttat ttggttttaa tgggaaaaca aaaaataagt ttgctcttaa 5220 ttatatgttg actcttgcaa gatatacaat atggaaaaga agaaatatta tgaaacaaaa 5280 gaaaaaagaa attccattgg ttttgttgta taaacagatt gtgactgagg aaataatggt 5340 tatatatgac tattgcaaaa tgtatgaaaa gatggacatt tttgaaaaat gtattagaaa 5400 aaataatcca tatattgtac aaacttggac tggttttaaa gttttcttac ctggagattt 5460 ttaaatattt tatcttttaa atatttgtat gtataaatgt catgattgtt gatgatgtat 5520 tttttaagaa agaaaaaaaa aaaaaaaa 5548 // ID CR1DR2 repbase; DNA; ZEB; 2898 BP. XX AC AL591213; XX DT 04-MAR-2002 (Rel. 4, Created) DT 04-MAR-2002 (Rel. 4, Last updated, Version 1) XX DE CR1 Danio rerio 2. XX KW retrotransposon; non-LTR; CR1; CR1DR2. 
XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-2898 RA Jekosch K.; RT "CR1DR2: CR1-like repeat from Danio rerio."; RL Repbase Reports 2(2), 8-8 (2002). XX DR [1] (Consensus) XX CC Putative novel CR1-like non-LTR retrotransposon with one open CC reading frame (pos 1-2898). XX SQ Sequence 2898 BP; 749 A; 860 C; 474 G; 815 T; 0 other; atgtgttttc taattcctgt tgttactaac actcgcaaaa cacgggaggt gcgctgcagg 60 cgtaatcctc acaaccttcg ttcaatacat gtatctacta tttcacaact ctctctctcc 120 gtgggcctct ggaattgtca atcagctgtt aacaaggctg attttattac ctccatagct 180 acatattctg actataatct catggctcta actgaaacct ggttgaggcc ggaggacact 240 gctacacatg ctactctttc tgctaatttc tctttttccc acactcctcg tcagacaggg 300 agagggggtg ggactggact actaatttcc aaagaatgga aatttactct gataccgtcc 360 ctgccaacaa tcagctcctt tgaattccat gcagtcacca ttatccaccc cttctacata 420 aatgtggttg tcatctaccg cccaccaggt aaattaggtc acttcctaga tgaactggat 480 gttcttctct catctttttc taattttgcc actcccttat tggtgctagg tgacttcaac 540 atttacgttg acaaaccgca agctgcagac tttcagactt tgcttgcctc ttttgaccta 600 aaaagagcac ctacttctgc tacccacaaa tcaggtaatc agctagacct tatttacaca 660 cgacactgct tcactgatca aacaatagta actccactac aaatatctga tcatttcctt 720 ctgtctctca acatccacat tactcctgag ccgccacaca ctcctacact ggttaccttt 780 cgcagaaacc tacgatctct ctcacccaat agactatcca ccattgtttc agactctctt 840 cctccatctc gcaaactcac tgcacttgat tcgaacagtg ccactaatac actctgctcc 900 acactagcat catctctaga ccgattatgt cctcttgcat ccaggccagc ccgtgccagt 960 cctcctgcac cctggctctc ggatgctctc cgtgagcatc gctcaaaact tcgggctgcg 1020 gagagaattt ggcggaaaac taaaaatcct gcacatctct taacatacca aactcttctg 1080 tcctctttct cagctgaggt tacttctgca aagcagacgt attaccgtct gaaaatcaac 1140 aatgccacta atcctcgcct actttttaaa acattttcct ccctcctcta tcctcctcct 1200 ccacccgcat cctccacact tactactgat gactttgcta cattcttctg caccaaaact 1260 gcaaaaatca gtgctcaatt tgctgcacct 
acaacaaaca cgcaagatac aacaccaaca 1320 ccacacacac tcacctcttt ttctcagctc tctgagtctg aggtgtccaa acttgtgcta 1380 tctagccatg caaccacctg tccactcgat cccattccct ctcatctctt gcaagccatc 1440 tctcctgcag tcataccaac actgactcac ataattaaca catctcttga ctctggttta 1500 ttccccacta catttaagca ggctagggta accccactgc taaagaaacc caacctggac 1560 catacgctac ttgaaaacta cagaccagta tccctgcttc cattcatggc caagattctg 1620 gagaaagtag tgttcaatca agttctggac tttcttactc aaaacaatct catggacaac 1680 aagcaatccg gctttaagaa aggccactca actgagactg ccctgctctc ggtcgtggag 1740 gatctcagac tggctaaagc agactctaaa tcatcagtcc tcattttgct ggacttgtca 1800 gctgcttttg acactgtcaa ccaccagatc ctgctatcta cgcttgagtc actgggcgtt 1860 gcgggcactg ttatacaatg gttcagatct tacctctctg acaggtcatt cagggtgtct 1920 tggaggggag aggtgtccaa cctacagcat ctaaacactg gggtacctca aggctctgtt 1980 cttgggccac ttctcttctc catctacaca tcatctctag gaccagtcat ccagagacat 2040 ggattctcct accactgcta tgctgatgat acccagctat acctctcttt tcatcctgat 2100 gatccctcgg ttccagctcg tatctcagcc tgcctgttgg atatttcaca ctggatgaaa 2160 gatcatcatc ttcagctgaa cctcgcaaaa acggaaatgc ttgtagtttc tgccaacccg 2220 actctacacc ataacttttc aatccagatg gatggggcaa ccattactgc atccaaaatg 2280 gtgaaaagcc ttggagtaac gattgatgac caactaaact tctctgacca catttctaga 2340 actgctcgat cgtgcagatt tgcactctat aacatcagaa agatccgacc cttcttatct 2400 gaacatgcag ctcaactcct tgttcaagct cttgttctct ccaaactgga ttactgcaac 2460 tctctactag ctgggcttcc agctaactct atcaagcctc ttcaactgct ccagaatgca 2520 gcagcacgag ttgtcttcaa tgaacctaaa cgagcacatg tcactccgct gctagtccgt 2580 ttgcactggc tgccagttgc tgctcgcatc aaattcaaaa ctctgatgtt tgcctacaaa 2640 gtgacttctg gcctagcacc ttcgtatctg cactcacttc tgcagatcta tgtgccctcc 2700 agaaacttgc gttctgtgaa tgaacgtcgc ctcgtggttc cagcccaaag aggaaaaaaa 2760 tcactttcgc gaacgctcac gctcaatctg cccagttggt ggaatgaact ccctaactgc 2820 atcagaacag cagagtcact cgctattttc aagaaacgac taaaaactca actatttagt 2880 ctccacttca cttcctaa 2898 // ID DANA repbase; DNA; ZEB; 394 BP. XX AC L42295; XX DT 22-DEC-1995 (Rel. 
1.11, Created) DT 22-DEC-1995 (Rel. 1.11, Last updated, Version 1) XX DE Danio rerio (clone DANA-m1) DANA retroposon. XX KW MSAT; Satellite; Simple Repeat; SINE; retroposon; KW Repetitive element; microsatellite; DANA. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-394 RA Izsvak Z., Ivics Z., Garcia-Estefania D., Fahrenkrug C.S. RA and Hackett B.P.; RT "DANA elements: a family of composite, tRNA-derived short RT interspersed DNA elements associated with mutational activities RT in zebrafish (Danio rerio)."; RL Proc. Natl. Acad. Sci. U.S.A 93(3), 1077-1081 (1996). XX DR GenBank; L42295; Positions 1 394. XX CC misc_signal 1..64 CC /note="tRNA-related region" CC misc_signal 11..22 CC /note="Pol III promoter A box; putative" CC misc_feature 55..64 CC /note="Pol III promoter B box; putative" CC repeat_region 231..250 CC /note="TG dinucleotide" CC /rpt_family="microsatellite" CC polyA_signal 368..373 CC /note="putative" CC DANA is the first SINE isolated from zebrafish (Danio rerio) CC exhibiting CC all the hallmarks of these tRNA-derived elements. DANA is unique CC in CC its clearly defined substructure of distinct cassettes. In CC contrast to CC generic SINE elements, DANA appears to have been assembled by CC insertions of short sequences into a progenitor, tRNA-derived CC element. CC Once associated with each other, these subunits were amplified as CC a CC new transposable element with such a remarkable success that CC DANA-related sequences comprise approximately 10% of the modern CC zebrafish genome. At least some of the sequences comprised by the CC full-length element were capable of movement, forming a new group CC of CC mobile, composite transposons, one of which caused an insertional CC mutation in the zebrafish no tail gene. 
Being present only in the CC genus Danio, and estimated to be as old as the genus itself, DANA CC may CC have played a role in Danio speciation by massive amplification CC and CC genome-wide dispersion. There are extensive DNA polymorphisms CC between CC zebrafish populations and strains detected by PCR amplification CC using CC primers specific to DANA, suggesting that the DANA element will CC be CC useful as a molecular tool for genetic and phylogenetic analyses. XX SQ Sequence 394 BP; 90 A; 81 C; 119 G; 104 T; 0 other; ggcgacgcag tggcgcagtg ggaagtgctg tcgcctcaca gcaagaagct cgctggttcg 60 agcctcggtt aacaaggttt gaacgagcct cgctcagttg gcgtttctgt gtggagtttg 120 catgttctcc ctgcgttcca tgggtttcct ccgggtgctc cgttccccca cagtcctaag 180 gcatgtggta caggtgaatt gggtaggcta aattgtccgt agtgtatgag tgtgtgtgaa 240 tgtgtgtgtg gatgtttccc agcgatgggt tgcggctgga aggcgatccg ctaaaactaa 300 aaaaaaaact tgattaaaaa cttgctggat aagttggcgg ttcattccgc tgtggtgacc 360 ctggattaat aaagggacta agccaaaaag aaaa 394 // ID Gypsy-27-I_DR repbase; DNA; ZEB; 5177 BP. XX AC . XX DT 11-FEB-2005 (Rel. 10.01, Created) DT 11-FEB-2005 (Rel. 10.01, Last updated, Version 1) XX DE An internal portion of the Gypsy-27_DR LTR retrotransposon - a DE consensus sequence. XX KW GYPSY superfamily; Gypsy-27-I_DR; Gypsy-27-LTR_DR; Gypsy-27_DR; KW LTR retrotransposon; endogenous retrovirus; gag; integrase; KW reverse transcriptase. XX OS Danio rerio OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; OC Cypriniformes; Cyprinidae; Danio. XX RN [1] RP 1-5177 RA Kapitonov V.V. and Jurka J.; RT "Gypsy-27_DR, a family of LTR retrotransposons from zebrafish."; RL Repbase Reports 5(1), 27-27 (2005). XX DR [1] (Consensus) XX CC Gypsy-27-I_DR is an internal portion of the Gypsy-27_DR LTR CC retrotransposon that belongs to the Gypsy superfamily. Its CC long terminal repeat is deposited in Repbase as CC Gypsy-27-LTR_DR. 
Gypsy-27_DR is characterized by 4-bp target CC site duplications. The internal portion encodes one CC polyprotein: the 1632-aa Gypsy-27_DR1p (pos. 71-4966) composed CC of the gag, protease, reverse transcriptase, and integrase CC domains. The consensus sequence was built from 5 copies less CC than 1% diverged from the consensus sequence. XX FH Key Location/Qualifiers FT CDS 0..0 FT /product="Gypsy-27_DR1p" FT /translation="MEAENFLINFPDAAEGSCEGPSHNSPVTPQIDRSLLD FT EDTGMEQPFIGAPILSSESRTDDLAVLVQAIDSLKSVFVDTVDRQKKWLQD FT STELCTHEVPKKTLTVVEKAHSEFLDVLSDRMKEFTKHTIEAIQWNSKQTH FT NEFKAMFDEANSSMFSSFKRALEESRVGEKTLQEEVCSLRKELLAMQTSLD FT SFMQQRISLDTKVDEKVLVSGSQVCSNTSSHVDSSESISQPALSNLTCGRI FT PPIKMVFPTFGGVNDESDPVIYLERCNDFLALRPMSNSEILATMRSVLHAS FT ARDWWETVRFKINSWESFQKAFLAVFLPEDYQDVLEEKVRNRLQGTNESVR FT DFAFSFQALFKRWKKEATDTEVLKALLKAMNPCYASQLRGRAQTVEELVRL FT GTQLERDYDHQIKYNQLQLPKSSFLDQHLSKKSKGVEVKSNLPPEQSLTSL FT LCWRCHENHSPGSCSKFKAVLSGKQTGGQFQGTKEFQPQNQQRGMLSAYDT FT QRFVKKDVSSGRKLDKRDGIKILRQLVIPMMVRNCSGKALVDTGATYTLLN FT VDLWNKIKEPNERLDVWHEGPLYLANGDATVPLGQKVLDFYLQDLHFQVPT FT VVLPAQNLVYSMVLGLDFIALTGLQLNIKDQLYSFSDDPQGRVFPFQPPIT FT PGDSWKCCKPPGNYNPSASLYSAIPPVQFVCTANECVDKSDVGESGSVLME FT LLQSKIKESNLSERESHVLFTLLYDHPEVCTPNLGRTHLVQHKIAVSTDVV FT VAQKPYRLPIHKKEIVKEQIDDMLNQDIIQPSHSPWASPIVLVPKKDGGQR FT FCVDYRKLNAVSESDAFPLPTVNEILESLSGSGIFSTLDLNSGYWQVSMHP FT DSMAKTAFVSPFGLYEFKVLPFGLKNAPATFQRLMNRVLADYLGQCCLVYL FT DDIVVYSANFHQHVLDLQKVLRCLQRAGLTLKLPKCHFCLTEIKFLGHIVT FT TDGVKADPAKTEAIQNFPVPTNLKELQRFLGMSGWYHRYVQNFSDIAEPLN FT ALKKKGVRFQWTAECQVAFDCLKRHLSSPPVLGHPNHAHTFVVYTDASSTG FT LGAVLAQRPSTFGASEEVLAYASRTLTSAEKNYSTTERECLAVVWAVERWR FT HYLEGKSFIVVTDHASLLWVFNTTKTNSRLIRWALRLQEFEFILEYRKGKL FT NSAPDALSRIDVPDSCPMVASYVPKQSTESMVSLFPLCDEDIWIAQQQDVE FT IQRIYQSLAEGKQSDEGSGSEFVILEDKVYRKVSNPTKGTHFQIYVPQTLR FT EILLEAYHSNPLSGHFGRYKTQKRLMQVAFWPNMWRDVSDFVKNCTSCQQN FT KPECRKPAGKLQQTEVKEPWEMLGVDLMGPLPRSTLGNTQLLVVVDYYSHW FT VEMFPLRKATAGVIAQTLRKEVLTRWGVPKFLLSDRGPQFTSEILKDLCSR FT 
WGVVQKLTTAYHPQTNFTERVNQVIKVMISSYVFGEHNRWDHYLPELRYAI FT NSAVQESTGYSPAELLLHRNLRGPFELVLEPHQTGLRVLKDLQEVVKRNVR FT RAKEKQKRLYDARRRDVHFTRNDRVWMRAHPLSKASQAFAAKFAARWIGPY FT RIVEKLGPVNYRIVREDNGEDLRTVHVCNLKPAFPSAGELDRRERERVLKI FT FVEESEDEEFLGFE" XX SQ Sequence 5177 BP; 1481 A; 944 C; 1190 G; 1562 T; 0 other; gatttggcgc ccaacgtggg gccctgagat atgctaaata tttagtggtt tttgtaattt 60 ttgcttaata atggaagctg aaaactttct tattaatttt cctgatgcag cagaaggtag 120 ttgtgagggc ccatctcata acagtcctgt tactccacag attgaccgat cattattaga 180 tgaggacact gggatggaac agccatttat tggggctccg atcctctcct ctgaaagtcg 240 gacagacgat ttagcagttt tagtccaggc aattgattct cttaaaagtg tatttgtaga 300 tacagtagac cggcagaaaa aatggcttca ggacagtact gaactttgta ctcatgaagt 360 tcctaagaaa acattaactg tggttgaaaa agcacattct gaatttttag acgttctttc 420 tgaccgtatg aaagagttta ctaaacatac aattgaagct atacagtgga actcgaaaca 480 gacacacaat gaatttaagg caatgtttga tgaggctaac tcttcaatgt tttcttcctt 540 taagagagct ttggaagaat ctagggtagg agaaaaaaca ttacaggaag aagtatgcag 600 cttgaggaag gagctacttg ccatgcaaac ttcgcttgat tcctttatgc agcagaggat 660 ctcacttgac actaaagtag atgaaaaggt tctggtttct gggtctcagg tatgttcaaa 720 tacatcttca catgtagact ctagtgagtc cattagtcaa cctgctcttt caaatttgac 780 ttgtgggcgt attccgccta ttaaaatggt atttcctaca tttggaggag taaatgatga 840 gtctgatcct gttatttatt tggagagatg taatgacttc ttagctctaa gaccaatgtc 900 aaacagtgag atacttgcca ccatgcgtag tgttctgcat gcctcagccc gagattggtg 960 ggagacagta agatttaaaa ttaattcttg ggaaagtttt cagaaggctt ttttagcggt 1020 tttcctccca gaggattatc aggatgtgct cgaagaaaaa gtgcgtaatc gactgcaagg 1080 gacaaatgaa agtgttcgag actttgcttt ttcatttcaa gctttattta aacgctggaa 1140 aaaagaagct actgatactg aagttttgaa agctctttta aaagcgatga atccttgtta 1200 tgctagtcaa cttcgtggcc gtgcacagac tgtagaggaa ctggtgagat taggaacaca 1260 attagagaga gattatgatc atcaaataaa gtacaatcag ttacagttac ctaagagttc 1320 ttttcttgac cagcatttgt ctaaaaaatc aaaaggggta gaagtaaaat caaacttacc 1380 tccagaacaa agtttgacta gtttgctttg ttggcgatgt catgaaaatc attccccagg 1440 ttcttgttcc 
aagtttaagg ctgtcctgag tggaaagcaa actggtggac aatttcaagg 1500 aactaaagag ttccaacctc aaaatcagca acgaggtatg ctgtctgctt atgatactca 1560 gcgctttgta aaaaaagatg tgtcatctgg taggaagctg gataagagag atggtattaa 1620 aattcttcgt cagttggtta tccccatgat ggttcggaac tgtagtggta aagcacttgt 1680 tgatacagga gctacttata cattgctcaa cgtcgattta tggaataaga taaaagaacc 1740 caatgagcga ctagatgtat ggcatgaggg accactttat ctagctaatg gcgatgctac 1800 tgttcctctt ggccaaaaag tactggactt ctatttgcag gatttgcatt tccaagttcc 1860 aactgtagtt ctgccagctc agaaccttgt gtattcaatg gttttgggtt tagattttat 1920 cgcactgact gggttgcaac tgaacattaa agatcaactg tacagttttt cagatgatcc 1980 acaaggtcgg gttttccctt tccaaccgcc tattacacca ggggactctt ggaagtgttg 2040 taaaccacct ggtaactata atccatctgc ttctttgtat tctgctatac ctcctgttca 2100 gtttgtatgt acagcaaacg aatgtgtgga taagagtgat gtgggagagt ctggttctgt 2160 attgatggag cttttgcaaa gtaaaatcaa ggagagcaat ctttcagaac gtgagtctca 2220 tgttctgttt actcttttgt atgaccatcc tgaagtttgt actcctaatt tgggtagaac 2280 ccatcttgtc cagcataaga ttgcggtttc aacagatgtt gttgtagctc aaaaaccata 2340 cagactaccc atacacaaaa aggagattgt aaaggaacaa attgatgata tgctcaatca 2400 agatatcatc cagccatctc attcaccctg ggcatctcct attgtgttag ttcctaaaaa 2460 agatggaggg cagagatttt gtgttgatta tcgtaaactg aatgcagtca gtgaaagcga 2520 cgcatttcct cttcccacag tgaatgagat cttggagtcc ctttctgggt cagggatatt 2580 tagtactctg gatttaaata gtggatattg gcaggtgtcc atgcatccag atagtatggc 2640 caagaccgct ttcgtctcac cttttggcct atatgagttt aaggtgttac cttttggcct 2700 gaaaaatgca ccggccacat ttcaaaggct tatgaataga gtcttggctg actatctggg 2760 gcagtgttgt ttagtgtatc tagatgacat agttgtctat tcggctaact ttcaccaaca 2820 tgtcctagac ctccagaaag ttttgagatg tttgcagaga gcaggactaa ctctcaaact 2880 tccaaaatgt catttctgtt taacagagat caagtttctt ggccatattg tgactactga 2940 tggtgtaaag gcagaccctg ctaaaacaga agctattcag aattttccag tcccaacaaa 3000 tttaaaggaa ctccaacggt ttctgggaat gagtgggtgg taccataggt acgttcaaaa 3060 tttttcagat attgccgaac cacttaatgc tctaaagaag aaaggagttc gttttcagtg 3120 gacagctgag tgccaggtag 
cgtttgactg cctcaaaagg catctttcct caccacctgt 3180 acttggacac ccaaatcatg cccatacatt tgtagtttat actgatgcca gttcaaccgg 3240 cttgggtgct gttcttgctc aacgaccctc cacttttggt gcatctgagg aggtgttggc 3300 gtatgctagc cgtactctga catcagcaga gaaaaactat tctacaactg agcgggagtg 3360 tttagcagtg gtttgggctg tagaacggtg gcggcattat ttggagggaa aatccttcat 3420 tgtagtcact gatcatgcct ctcttctttg ggtgttcaac actacaaaaa caaattcccg 3480 actgattcgc tgggctttga gactacagga gtttgagttt attcttgagt atcgtaaggg 3540 gaagcttaat agcgccccag acgctttgtc tcggattgac gttcctgatt cttgtccaat 3600 ggtagcttcc tatgttccta agcaaagtac agagagtatg gtgtcactat ttcctctttg 3660 tgatgaggat atctggattg cacagcaaca agatgtagaa atccaaagga tatatcagag 3720 cttagcagaa ggaaagcaga gtgatgaggg atctggctct gagtttgtca ttttagagga 3780 taaggtctat aggaaggtgt ctaatcctac aaagggaact cattttcaga tttatgttcc 3840 acagactctc agagaaattt tgttggaggc atatcactcc aatccattga gtggtcattt 3900 tgggcgttat aaaactcaaa aaaggttaat gcaagttgcc ttttggccta atatgtggag 3960 agatgtatct gactttgtga agaactgtac tagttgtcag cagaataaac cagagtgtcg 4020 taagcctgct ggaaaacttc agcaaactga agtgaaggag ccttgggaaa tgcttggtgt 4080 ggaccttatg gggccactgc ctcgtagtac cctgggtaac actcaattgt tagttgttgt 4140 ggactattac agtcactggg ttgagatgtt tcctcttcgc aaagctactg ctggagtaat 4200 tgcccagaca cttaggaagg aagtactgac tcgatggggt gttccaaagt tcctactatc 4260 tgacagggga cctcaattta catctgagat tttgaaggat ctgtgtagca gatggggagt 4320 ggtacaaaaa ttaacaacag catatcatcc ccaaactaat tttacggaac gtgtgaatca 4380 ggtaattaaa gtcatgatct cctcttatgt gtttggtgag cataatcggt gggatcatta 4440 cttaccagag ttgagatatg ccatcaactc tgctgttcag gagagtactg gatactcccc 4500 agcagagtta ttattgcaca gaaatctgag aggacctttt gaacttgtgt tagagcctca 4560 ccagactggt cttagggttc ttaaagactt gcaggaagtg gtaaaaagaa atgtgcgtcg 4620 ggctaaggaa aaacagaagc gtttgtatga tgcgagacga agagatgtgc atttcacaag 4680 aaatgataga gtttggatga gagcacatcc tctttctaag gcttctcaag catttgcagc 4740 taaattcgca gctagatgga ttgggcctta tcgaattgtg gaaaagcttg gtccggtgaa 4800 ctaccggatt gttcgggagg ataatgggga 
agatttacgt acagttcatg tatgcaattt 4860 aaaacctgca tttccctccg caggggaatt agatcgtaga gagagagaaa gggtcctgaa 4920 aatttttgtc gaagagtcag aagatgagga atttctcgga tttgaatagt cctgagatcc 4980 agtaatcact taactgattt ccatatttat agatcagatt atttggtgag tagagttatt 5040 ggtatggaaa gctgttgagt gtctttttat tgctgcataa tttttaggtc ttaaaaaacc 5100 cctaaagatt tcttgtaagt aacttataga tcacaacctc tttaagtttg actgatcttt 5160 ttccatgggg gggagag 5177 //