ID BRIDGE2_FR repbase; DNA; FUG; 618 BP. XX AC . XX DT 28-JUL-1999 (Rel. 3.1, Created) DT 28-JUL-1999 (Rel. 3.1, Last updated, Version 1) XX DE BRIDGE2_FR is a non-LTR retrotransposon - an incomplete consensus DE sequence. XX KW Non-LTR retrotransposon; LINE; reverse transcriptase; KW BRIDGE superfamily; ORF1; BRIDGE2_FR. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RP 1-618 RA Kapitonov V.V. and Jurka J.; RT "BRIDGE2_FR."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [1] (Consensus) XX CC BRIDGE2_FR is a non-LTR retrotransposon that belongs to the CC BRIDGE superfamily. Its consensus sequence has been recovered CC from numerous GSS sequences. The consensus sequence can CC be expanded when additional DNA from the fugu genomae will be CC available. BRIDGE2_FR is a young retrotransposon since its CC copies are, on average, 92% identical with the consensus CC sequence. BRIDGE1_FR and BRIDGE2_FR are significantly CC similar on the protein level only. XX SQ Sequence 618 BP; 181 A; 157 C; 154 G; 125 T; 1 other; atcttacaga aagaacaacc ttaacaccgg cccaaatctg caccatgctg gacctgtgtt 60 tgaacaccac ctatttccaa tacagagaag gcttctacag gcagaaacat ggctgtgcca 120 tgggctcacc agtatccccc atagttgcca atctatacat ggagaaggtg gaatcccagg 180 ccctgacatc cttcacagga actgcgccaa gccactggtt caggtatgtg gatgacacct 240 gggtcaaaat tcaaacacaa gaattggaag cgttctccga tcacctcaac aaaacagacg 300 agcatgtaaa attcacccgg gaagaggtaa aaggaaacag tctggccttt ctggactgcg 360 cagtcaagat cactgaggac agaaatcagt agtcagaaac acagtctgtc agtccatcca 420 ggtcctcacc atgtggttca cagagagcag accagttggt ggtctccagg gcatcccgca 480 gacagtccat ggcaccatct gaccatttcc taacagtcct cacagtgact gattgctgct 540 gaaccacagg tgtgtatgat ggggagggca gmaggagatt atgatcagac ttgcggagcg 600 gagggagtgc agtggaag 618 // ID KenoFr1 repbase; DNA; FUG; 5443 BP. XX AC AB111947; XX DT 02-JUN-2009 (Rel. 14.06, Created) DT 02-JUN-2009 (Rel. 14.06, Last updated, Version 1) XX DE Takifugu rubripes retrotransposon KenoFr1 DNA, complete sequence. XX KW Tx1; Non-LTR Retrotransposon; Transposable Element; KENOFR1. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RP 1-5443 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol Biol Evol 21(2), 207-217 (2004). XX DR EMBL/GenBank/DDBJ; AB111947; Positions 1 5443. XX FH Key Location/Qualifiers FT CDS 125..1387 FT /product="KenoFr1_1p" FT /translation="MSANYAGLRRHRSVRFYFKDEQVKISRLDFSRKFVQN FT LLNFQPDDINCILTLPANKGYDVSFCSAALLREFWTRFENVKSQFSAFAVE FT KLTDNAHKTVIVRMFNETVNAEDICIWLARFCTVKGQATKVRDEDGIWNCA FT WRVPIQQWEDPQGFQGLRHLPQMIVLGNNRGYIHYQGQPKLCRKCGEQGHL FT AEACPVIVCGKCRAVGHSFEECTTGRKCNLCGATDHLFRDCPLSFANKLKA FT RKKQEEKGPTENEVRTVLEGNSNLPPKPVIGGEESSEVGQREGPGPHPVEA FT DMELGRSHQADSGVRPASTDDDSTRSLHIVEEEQSSTSLPDAQPSSKRTAS FT ELSNSGAESASEKRGRAESEHHSPPVDQDSGSSSSSSNECSFLNVALQSTP FT LRKRADFAFRRTDSQSLPPGCRGIFR*" FT CDS 1494..4958 FT /product="KenoFr1_2p" FT /note="endonuclease and reverse transcriptase." FT /translation="MTLNISTINVRSVRSRVRAQSVLSFLSSFKSDVFLLQ FT ECGLPFLNHYRQWEEMWPQTSLWSGSNENRNDGVAILIKNPQVLVKGSTVV FT KNGRALLTHLTFMGQDFKILNIYGFNEKNDRYDLLEDLQSHMLGRVPLVVG FT GDFNCILSRKDRRRTGEDFKVDKTSVLLQGICRDFKLQDCFKTMHPREEGF FT TWFSGDGTRASRIDYVFTRDCPATDARLTPVFFSDHLMLSCTLSLPSGVTS FT GSGLWKLNCSLLEDRELVRQYREQYKEWQTLQDFYDTRAHWWEMVKGRTRT FT FFRQAGKEKKNRETRRMMGLQKRLQRYFNLNQQGIDFNEEIKQVKKEMSVL FT SEIQSKGVILRSKEREIEEGEKCTRYFFKKIINKGGTITKLTKENGCTTET FT IDEIKETIESFYEKLYKEKHVQIDTMNEILKFLNKTVNNSVLLSQDFTLLE FT LNNSLCSFKTGKSPGQDGLPVEFYLTFWDLLAPDLITVFMEFEGLDRLPDS FT FRVGIVTLLHKQKDKTELKNWRPITLLNVDCKLFSRLLATRMSTFLEEVID FT PDQACAIPGRKITDSLVLIRDTICYARDRNMRLVVLNLDFEKAFDRVSHQY FT LFQVLQKMGFPERFVAWVGLLYRDITSRILFNGHLTKAVDINCGVRQGCPL FT SALLYVICIEPLAQILRRDKRINGVQVPGSGGLETRCILYMDDVNVLCTDL FT LSVNRTLDLTDWFGRASGSKLNKEKTRAQFHGPWTATDMTGLPLTVTQTDQ FT KILGVKFDREGGGKTNWPDMVGKVRQKLGFWGLRGLTMEGKVLITKAVILP FT LLLLISSVFFPPRSVILELERAIFYFLWGSKMEKLKRAIMKKTKEKGGKDV FT PDLHLFLGARFTAIHVGIVTAPSKNPKTAAMARFWMGTYLRKLKIEPVNLS FT VPVSFNLPTAYSFIQRFLVHFNVENEELHILTNHRSLISVVQEREPVSPVR FT GLALGEPSTVWRNVNHPALPNRLRDLSWMVAHEILPVRSVMHSRGMAAHAT FT CPRPGCGAPESVRHLLWECSAAKDQWAMAGSLKFPYLPAREVLTAQLVLYG FT VSPTVIPPKDFAKQWLTLAAIKDATWTSRNLLVRKHMQIPPVAVIRMAAAM FT VQGAGTAGGRPRTPPQRRIASVPIRTKEPELHSKGQGSSGLALRVR*" XX SQ Sequence 5443 BP; 1592 A; 1134 C; 1322 G; 1395 T; 0 other; ctgagtgttt ttcttaaccg tcttatttgt atttatcttt taagtagcac gtttcctttt 60 aacttacttt taaaattatc ttttctttta caagctttta tcctcgcttg tggtgcctcc 120 agccatgtcg gctaattatg ccggcctgcg gaggcaccgc agtgtcaggt tttattttaa 180 agatgagcaa gttaaaatat cacgtttaga cttctcacgt aagtttgtcc agaacctttt 240 gaactttcag cccgacgata taaattgtat tttgactctg ccggcaaaca aaggatatga 300 cgtaagtttt tgctctgcag ctctgctccg tgagttttgg acccgttttg aaaatgttaa 360 aagccaattt tctgcatttg ctgttgaaaa actgactgac aatgcccata aaacggttat 420 tgtcagaatg tttaatgaaa cagtgaatgc agaagacatc tgtatctggt tggctagatt 480 ctgcactgtt aaaggccagg caaccaaggt aagagacgag gatggcatct ggaactgtgc 540 ttggagggtc cccattcaac aatgggagga cccccaaggc ttccagggcc tgagacactt 600 gccgcaaatg attgttctgg gtaataacag aggctatatc cactaccagg gtcaaccaaa 660 actctgccgc aaatgcggcg agcaggggca tctggcagag gcttgtccag ttattgtgtg 720 tgggaaatgc agagcagttg gacattcctt tgaggagtgt accactggaa ggaaatgtaa 780 cctttgtgga gcaacagacc atcttttccg agactgtcct ttgtctttcg ccaacaaact 840 gaaagctagg aaaaaacagg aggaaaaggg gccaacagaa aatgaggtgc gaactgtttt 900 ggagggaaat tcaaatctcc ctccaaaacc tgtgattgga ggagaggagt ccagtgaggt 960 tgggcagagg gaggggcctg gaccccaccc agttgaagct gacatggagt tggggaggag 1020 ccatcaggca gacagtgggg ttcggcccgc ctctaccgat gatgactcca ctaggagtct 1080 ccatattgtg gaggaggagc agagctccac ttccctccca gatgcccaac ctagttcaaa 1140 aagaacagca tctgaactgt ccaactctgg ggctgaatca gcctcagaga agagggggag 1200 agctgaatct gaacatcaca gtccccctgt ggatcaggac agtggctcct cttccagttc 1260 atctaatgaa tgttcctttc taaatgtagc actacagtca actccactca gaaagcgcgc 1320 agactttgct ttcaggagga cagattctca aagcttaccc cctggatgta gggggatctt 1380 tcgatgagaa cctgttacag cacaagcaac aaggcaataa aaatgtgctt gttacttatt 1440 cttaagatac ctgcttcttt taaaatgttg tacttgtcat tttaaacata ctcatgactc 1500 tcaacatttc cactatcaat gtgagaagtg tgaggtcaag agttagagcc caaagtgttt 1560 tatccttttt aagttccttt aagtctgatg tgtttttatt acaggaatgt ggcttaccat 1620 ttttaaacca ctaccgccag tgggaggaga tgtggccaca gacatccctt tggagtggat 1680 caaatgaaaa cagaaatgat ggagtggcca ttttaattaa aaacccccag gttctggtaa 1740 aaggtagcac tgtggtgaaa aatggtcggg cgcttttaac acatttgact tttatgggac 1800 aggattttaa aatcttaaat atttatggct ttaatgaaaa aaatgacagg tatgaccttt 1860 tagaagactt gcagtcccac atgctaggta gggtaccttt agtagtaggg ggagatttta 1920 actgtatttt aagtaggaaa gataggagaa gaacagggga agattttaaa gtagacaaaa 1980 catcagtttt attacagggc atatgcaggg attttaaact tcaagactgt tttaaaacca 2040 tgcatcccag ggaggagggc ttcacctggt tcagtggtga tggcaccaga gcctcccgca 2100 tagattatgt ctttacacgg gactgcccag caactgatgc tagactaaca cctgtcttct 2160 tctccgatca cctcatgctc tcctgcaccc tttcactgcc ttcaggtgtg acatcaggaa 2220 gtggtctgtg gaaactcaac tgctccctct tggaagatag ggagttagtt agacagtaca 2280 gggagcagta caaagagtgg cagacccttc aggacttcta cgacacacga gcacactggt 2340 gggagatggt gaagggaagg acccggactt tctttaggca ggcaggtaag gaaaaaaaga 2400 atagggaaac cagacgcatg atgggactgc agaagcgact acagcgctat tttaacctta 2460 accaacaagg tattgatttt aatgaagaaa ttaaacaagt aaaaaaagaa atgtcggttt 2520 tatcagaaat tcaaagcaaa ggtgtcattt taagaagcaa ggaacgggaa attgaggagg 2580 gggaaaagtg cacaaggtat ttcttcaaga aaatcataaa caaagggggg accatcacaa 2640 aattgactaa agaaaatggg tgcacaaccg aaactataga tgaaattaaa gagacaattg 2700 aaagctttta tgaaaaattg tataaagaaa aacacgttca aattgacacc atgaatgaaa 2760 ttttaaaatt cctaaataaa acagtgaaca acagcgtgct tttatcccag gattttaccc 2820 ttttagagtt aaataattct ctatgtagtt ttaaaactgg gaagtcccct ggacaagatg 2880 gacttcctgt agaattttat ttgacttttt gggacctttt agcacctgac ttaatcactg 2940 tttttatgga atttgaagga ctcgaccgac ttcctgacag ttttagagta gggatagtga 3000 ctctccttca caaacaaaaa gacaagacag aactaaagaa ctggagacct atcactcttt 3060 taaatgttga ctgtaaactt tttagtagac ttttagcaac acgtatgtcc acgtttttag 3120 aagaggtgat tgacccggat caagcctgtg ccattccggg gaggaagatc accgacagcc 3180 tcgtactgat ccgagacacc atctgttatg cgagagacag aaacatgagg ctagtagttt 3240 taaacttaga ttttgagaaa gcctttgatc gggtctcgca ccagtacctt ttccaggtac 3300 tgcaaaaaat ggggttccca gaaagatttg tagcttgggt gggtctgctg taccgggaca 3360 tcaccagcag aattctgttt aatgggcatc tgacaaaagc agtggacatt aactgcggtg 3420 tccgtcaggg gtgtccgtta tcggccctcc tctacgttat ctgcattgaa ccactggcac 3480 agatcttgag aagggacaaa cgaatcaatg gggtgcaagt gccggggagc ggaggactgg 3540 aaacgagatg tattttatat atggacgacg tgaatgtttt atgcactgac cttttatctg 3600 ttaacaggac gctggacttg actgactggt tcggacgggc ctctgggtca aaactaaaca 3660 aagaaaagac acgagcccag tttcacggac catggacagc gacagacatg acaggactcc 3720 ccctgaccgt gacacagact gaccaaaaaa tacttggtgt taaatttgat cgggaggggg 3780 gagggaaaac aaattggccg gacatggtag ggaaagtcag acaaaaacta ggattctggg 3840 gacttagagg actgactatg gaagggaagg ttttaatcac caaagcagtg attttacctt 3900 tgcttttact gatcagttct gtttttttcc ctcccagaag tgtgatttta gaactggaga 3960 gagccatctt ttacttcctg tgggggtcca aaatggaaaa actgaagaga gcaatcatga 4020 aaaaaacaaa agaaaaagga ggaaaagacg tcccggacct gcacctgttc ttaggggcca 4080 ggttcacagc tatacatgtg ggcatagtca cagccccatc caaaaaccca aagacggcgg 4140 caatggcacg cttttggatg gggacgtacc tcagaaaact aaaaattgaa cctgtaaacc 4200 tctcggtgcc tgtgtccttt aacctaccga cggcatatag ttttatccag aggtttttag 4260 tgcattttaa tgtggaaaat gaggagttgc atattttaac taaccaccgc tctttgattt 4320 ctgttgtgca ggagcgcgag ccagtgagtc cagtgcgcgg cctcgcatta ggtgagccct 4380 caactgtttg gcgcaatgtg aaccaccctg ctctcccaaa cagactccgg gacctgtcgt 4440 ggatggtggc tcatgagatc ctcccggtca ggtccgtcat gcactcccga ggcatggcgg 4500 cacacgcaac ctgcccccgg cctggttgtg gcgcgcctga gtcggtgagg catctcctct 4560 gggagtgcag tgctgccaaa gaccagtggg caatggccgg ctccttgaaa ttcccgtact 4620 tgccagcaag ggaggtcctt acagcacaac tggtgctgta tggggtgagc ccaactgtaa 4680 taccaccgaa ggactttgcc aagcagtggc tcaccctagc cgccataaaa gacgccacat 4740 ggacctccag aaacttgctg gtaaggaagc acatgcagat cccccccgtg gcggtgatcc 4800 ggatggctgc agcaatggtt caaggggccg gaactgcggg cggcaggcct aggacaccgc 4860 cacaaagaag aatcgcctct gtgcccattc ggacgaagga gccggagctc cacagcaaag 4920 gtcaaggcag cagcggcctg gctctccggg tgcggtaggc gggaaggagc agcagggcga 4980 ggatctccgc tgaccaccaa cagtatcccg agagatcaaa tgtttggagg agcaggctga 5040 gtgagacttg gtagggactc accacgctcc tgcacaaaga tggatatgtt tgaactgatg 5100 cactcccttg tacggacttt taaagaagct ttcaaaggac acatgaacac taaatgtaac 5160 acctttaacc aatggaaatg gctattttac ctacaagact gttttaaaaa aattattcta 5220 aatgtatata tctttttctt aaaaatggtt ttatagtcct cacaatttgg atattttatt 5280 ggtatatatt gaggtgtata tgtgaacctt ttactcaaag tttgaaaatg tctcaactat 5340 ttaatatttt attggcatat ttgaggtgta tatgtgaacc ttttactcaa agccagaaaa 5400 tgtaaagatg attgagtgcc aataaactaa gaaaaagaaa aaa 5443 // ID BRIDGE1_FR repbase; DNA; FUG; 1430 BP. XX AC . XX DT 28-JUL-1999 (Rel. 3.1, Created) DT 28-JUL-1999 (Rel. 3.1, Last updated, Version 1) XX DE BRIDGE1_FR is a non-LTR retrotransposon - an incomplete consensus DE sequence. XX KW Non-LTR retrotransposon; LINE; reverse transcriptase; KW BRIDGE superfamily; ORF1; BRIDGE1_FR. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RP 1-1430 RA Kapitonov V.V. and Jurka J.; RT "BRIDGE1_FR."; RL Direct Submission to Repbase Update (JUL-1999). XX DR [1] (Consensus) XX CC BRIDGE1_FR is a non-LTR retrotransposon that belongs to the CC BRIDGE superfamily. Its consensus sequence has been recovered CC from numerous (~50) GSS sequences. The consensus sequence can CC be expanded when additional DNA from the fugu genomae will be CC available. BRIDGE1_FR is a young retrotransposon since its CC copies are, on average, 95% identical with the consensus CC sequence. XX SQ Sequence 1430 BP; 515 A; 239 C; 258 G; 418 T; 0 other; ttggcaattt ttaccggggt ttcaatttaa aaaaattttt ttcaaagttt tttggaacct 60 tccggttgga aacccccctt ttggggggat tccccccccg gaattgggga atttgattgg 120 aaaagttctg gaaacattta aaaatcattt tatttcagtc aaggaaaatt taaatatttc 180 attaccagaa ataaaagcat tacaggagct acaacataaa aagaatatag ttattaaacc 240 agcagacaaa gggtctgcaa ttgtcattct ggaccgggaa caatatattt ttgaagtaga 300 gcgacaactt aatgacacgg aatattataa aaagttagaa aaaccaattt atttagatac 360 aattcccatg gtagctacaa ttttagacac attaaagaga aagaaattca tttgcgaaaa 420 acaaaggaag tatttagttg gggatgtaga acctagggaa aggagattct atattcttcc 480 caaaattcac aaggagccag aaaagtggac aattccatat gaactgcccc cgggaaggcc 540 aatcgtctct gactgtggga gcgagacata tttcactgcg gaattcttag atttttacct 600 gaatccgctg tccacaaaac atcctgcctt tgtcagagac acttatcatt tcatagagat 660 tataaaaacc ttgaagattc ctgcagattc ctatttcttc tctatggatg tggatagtct 720 atatactaat atccccattg aatctggcat caactgtatt agaaatatat ttgaaaaata 780 tcctgacccc aagagaccag atgaagagct cttacaatta ttagaaataa atttgagcag 840 aaatgatttc atatttaatg gtaaatacta tttacaaacc aaaggaactg ctatgggcaa 900 aaggtttgcc ccggcatatg cgaatatttt catggcaaac tgggaagaag aggcactatt 960 gaaatgtaaa cataaaccag ctcactatct acggtattta gatgatgttt ggggaatatg 1020 gacaggatct gaactacagt ttagggaatt tgttgagacc ttaaattccc acgatccctc 1080 tattaaactt aaaacagaaa ttaataaaca ttcaattaat tttttagata caacagtttt 1140 caaagggcca gattttctaa aaactgccaa attggacatt aaagtatatt ttaaaagcac 1200 tgatacccat gcattgcttc ataaaacaag ttttcatcca aaacatacat ttaaaggcat 1260 agtaaaatct caaattatta gatttaagcg aatttgcaca agggatgagg aattcaagga 1320 agcagtaaat gttttattta aggcccttcg taaaagaggg tattccaaac cctttttgaa 1380 acactgcttt aaatctgttc aaaaaagaaa aaatggggga gatcgtgggg 1430 // ID Expander repbase; DNA; FUG; 3362 BP. XX AC . XX DT 13-AUG-1999 (Rel. 4.07, Created) DT 20-JUL-2009 (Rel. 4.07, Last updated, Version 2) XX DE EXPANDER is a non-LTR retrotransposon from RTE superfamily - a DE consensus sequence. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; ORF1; KW reverse transcriptase; endonuclease; LINE; RTE superfamily; KW EXPANDER. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RP 1-3362 RA Kapitonov V.V. and Jurka J.; RT "EXPANDER."; RL Direct Submission to Repbase Update (31-JUL-1999). XX DR [1] (Consensus) XX CC EXPANDER is a non-LTR retrotransposon that belongs to the RTE CC superfamily. Its consensus sequence has been recovered from CC numerous (~200) GSS sequences. Putatively, EXPANDER constitutes CC ~1% of the fugu genome. Its copies are, on average, 92% identical CC with the consensus sequence. EXPANDER has one open reading frames CC encoding endonuclease and reverse transcriptase (ORF1: 158-3337). CC Its 3' UTR ends with several tandem copies of 4 bp-long repeat CC (GATG), analogously to other RTE-like elements. XX SQ Sequence 3362 BP; 633 A; 849 C; 1142 G; 738 T; 0 other; cccctcctgg gctaccacct tatcgtggtg gaggggtttg cgtgtcccaa tgatcctagg 60 agctccgttg tctggggctt tatgcccctg gtagggccac ccatggcaaa caggtcctag 120 gtgagggacc agacaaagcg tagcccagga ccccctaatg atgataaaca acattggtgc 180 caagtttccc ttgcccggat gcgggtcacc ggggccccct cctggagcca ggcctggggg 240 tggggcacgt tggcgagcgc ctggtggccg gacctctgcc catggagtcc ggccgggcgc 300 agcccgaaga ggcaacatgg gacccccttc ccgtgggctc accacctgca ggaggggcca 360 agggggtcgg gtgcattgtg ttttgggtag cagccggagg cagggacctt ggcggtctga 420 ttcccggctg cacaaactgg ctctagggac atggaatgtc acctctctgg tgggaaagga 480 gcctgagttg gtgcgcgagg ttgagaagtt ccgactagat atagttggcc tcacctcgac 540 gcacggcaag ggctctggaa ccagtcttct tgagaggggt tggactctct accactctgg 600 agttgccgat ggtgagaggc gacgagcagg ggtggcaatt ctggttgctc cccagctcag 660 tgcctgtgta ttggagttta ccccggtgga tgagagggta gcctcccttc gccttcgggt 720 ggggggacgg atcctgactg ttgtttgtgc ctatggtcca aacagcagtt cagcgtatcc 780 accctttttg gagtccttag agggagtgct ggagagtgcc ccttctgggg gctccctcgt 840 cctcctgggt gacttcaatg ctcacgttgg caatgacagt gtgacctgga gaggtgtgat 900 tgggaagaac ggcccccctg atctgaaccc gagtggtgtt ttgttattgg acttctgtgc 960 tcgtctcaga ttgtccataa cgaacacctt gttcagacat aaaggcgtcc acatgtgcac 1020 ttggcaccag gacgccttag gccgcagatc gatgatcgac tttgtggttg tgtcatcgga 1080 tttgcggccg catgttctgg acactcgggt gaagagaggg gcggagctgt caactgatca 1140 ccacctggtg gtgagttggc tccgatggtg gggaaggatg ccggacagac ctggcaggcc 1200 caaacgtgtt gtgagggtct gctgggaacg cctggcagag tcccctgtca gaaggagctt 1260 caactcacac ctccgggaga gctttgacca tgtcccgggg gaggcggggg acattgagtc 1320 cgagtggacc atgttccgtg cctccattgt tgaggcggct gaccggtgct gtggccgcaa 1380 ggtggttggt gcctgtcgtg gcggcaatgc ccgaacccgc tggtggacac cagcggtgag 1440 ggatgccgtc aagctgaaga aggagtcgta tcgggcctta ctggcctgtg ggactcctga 1500 ggcagcagat gggtaccggc gtgccaagcg gagtgcagct acggcggttg ccgaggcaaa 1560 gactcgggca tgggaagagt tcggtgaggc catggagaac gactttcgga cggcctcgaa 1620 aaggttctgg accaccatcc ggcgtctgag gagggggaag cagtgcactg tcaacactgt 1680 gtatagtggt gatggtgtgc tgctgacctc aactcgggat gttgtggatc ggtggaagga 1740 atacttcgag gacctcctca atcccaccaa cacgccttcc agtgaggaag tagggcctgg 1800 ggacctggag atgggctctc gtatctccgg ggctgaagtt gccgaggtag ttaaaaaact 1860 cctcggtggc aaggccccgg gggtggatga gatccgccca gagtccctta aggctctgga 1920 tgttgtaggg ctgtcttggt tgactcgact ctgcaacatc gcgtggacat cgggggcagt 1980 gcccctggat tggcagaccg gggtggtagt ccctcttttt aagaaggggg accggagggt 2040 gtgttccaac tataggggga tcacactcct cagcctccct gggaaggtct attcaagggt 2100 actggagagg aggatccgcc ggatagtcga acctcggatt caggaggaac aatgtggttt 2160 tcgtcctggg cgtggaacag tggaccagct ctacaccctc agcagggtct ttgagggtgc 2220 atgggagttt gcccaaccag tccacatgtg ttttgtggac ttggagaagg cattcgaccg 2280 tgtccctcgg ggggtcctgt ggggggtcct ccgagagtat ggggtgtcgg gcccgctgat 2340 acgggccgtc cgctccctgt acgatcggtg ccagagtttg gtccgaattg ctggcagtaa 2400 gtcgaactcg tttccggtga gggttggact ccgccagggc tgccctttgt caccgattct 2460 gttcataatt tttatggaca gaatttctag gtgcagtcat ggtgtggagg gggtccggtt 2520 tggtgacctc aggatctcgt ctctgctttt tgcggatgat gtggtcctgt tggcttcatc 2580 ggcccgtgac ctccaactat cactggatcg gttcgccgcc gcctgtgaag cggctgggat 2640 gagaatcagc acctccaaat ccgaggccat ggttctcaac cggaaaaagg tggagtgcct 2700 tctccgggta aaggaggaga tcctgcccca agtggaggag ttcaagtacc tcggggtctt 2760 gttcacgagt gagggaagaa tggagcggga gatcgacagg cggatcggtg cggcgtccgc 2820 agtaatgcgg actctgcacc ggtccgtagt ggtgaagaga gagctgagcc gaaaggcgaa 2880 gctctcgatt taccggtcga tcttcgttcc taccctcacc tatggtcatg agctttgggt 2940 aatgaccgaa agaacaagat cacgggtaca agcggctgaa atgagcttcc tccgtagggt 3000 ggctgggctc tcccttagag atagggtgag aagctctgcc atccgggagg agctcggagt 3060 agagccgctg ctcctccgcg ttgagaggag ccagatgagg tggcttgggc atctagttag 3120 gatgccccct ggacgcctcc ctggtgaggt gttcagggca tgtccctccg gtaggagacc 3180 cccgggaaga cccaggacac gttggagaga ctatgtctct cgactggcct gggaacgcct 3240 ggggatcccc ccggatgagc tggaagaagt agctggggag agggaagtct gggcttctct 3300 ccttaggctg ctgcccccgc gacccgaccc cggataagcg gtagaggatg gatggatgga 3360 tg 3362 // ID EXPANDER2 repbase; DNA; FUG; 3369 BP. XX AC . XX DT 08-FEB-2002 (Rel. 7.01, Created) DT 08-FEB-2002 (Rel. 7.01, Last updated, Version 1) XX DE non-LTR retrotransposon; RTE superfamily; REX3_FR2; EXPANDER2. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; KW RTE superfamily; REX3_FR2; EXPANDER2. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RA Smit A.F.; RT "Initial survey of interspersed repeats in Takifugu rubripes."; RL Repbase Reports 2(1), 15-15 (2002). XX DR [1] (Consensus) XX CC EXPANDER2 is a sister lineage to EXPANDER/REX3_FR, to which it CC is 80% similar at the DNA level over the coding region. A full CC coding open reading frame is present from 172-3348. CC On average, copies are 6% diverged from the consensus, but CC younger elements are present. XX SQ Sequence 3369 BP; 653 A; 850 C; 1181 G; 681 T; 4 other; tcctcctccg tgaaccatca ccttatcgtg gtggaggagt ttgcgtaccc taatgagcct 60 gggagctatg ctgtccgggg cagtttgccc ctggtagggt ctcccaaggc agattggtcc 120 taggtgaagg gtcagacaaa gaatggttca caagacccat catggaacat cnaaaaggga 180 agccgcgtac ccggcccgga gggttaccgg ggccccaccc tggagccagg cctggggctg 240 gggctcgatg gcgagcgcct ggtggccggg cctacgtcca tggggcccgg ccgggcccag 300 cccgaaccag ctatgtggac actcccgtct ccagtggacc caccacccgc aggaggagca 360 tgaagggtcc ggtgcagtgt ggattgggcg gcggaccaag gcgggggcct tggcggtccg 420 atcctcggtt atggaaattg gctttcggaa catggaacgt cacctctctg gcggggaaag 480 agctggagct tgtgggggag gttgagcgct accgactaga tatggtcggc ctcacctcca 540 cacatagcgt cggctcggga acccaagtcc ttgagggggg ttggacactc ttctacgctg 600 gagttgctca gggtgagaga cggagagckg gggtgggctt tttgcttgcc cccagactct 660 ctagttcgac gttggggttc tcgcggtcga gcgaaagggt cgcttccctg cgccttcggg 720 tcggggaacg ggttctgacg gttgtttgtg cgtatgcacc gaacaatagt tcagagtacc 780 cgcccttttt ggaggaccta ggacggacgc tggacagtgt cccgactggg gactccatcg 840 tgctgctggg agacttcaac gctcacgtgg gcaacgacag cgtgacctgg aaaggcgtga 900 ttgggaggaa cggtctgccc gatcagaacc agagtggtgt ncagttattg gacttctgtg 960 ctagtcgcag tttggccata actaacacca tgttcgagca taaggttgtt catcggtgct 1020 cttggcacca tgacggcctg ggccgtaggt caatgattga cttcatagtc gtgtcagctg 1080 atctgcggcc atatgttttg gacactcggg tgaagagagg agcagagctg tcaactgatc 1140 actacctggt ggtgagttgg atcaggtggg gggggaagcc accgcgcgga cctggcaggc 1200 ccaaacgctt agtgagggtc tgttgggaac gcttggtgga ggaacccgtc aggctggtct 1260 tcaactccca cctccgacag agctttgatc gcgttccggg ggcggtaggg gacattgagt 1320 ctgaatgggc catgttccgc tccgccattg tcgaggcggc tgtcgcgagc tgcggctgca 1380 aggccgctgg tgctggtcgt ggcggtaatc cccgtacccg atggtggaca ccagaggtga 1440 ggggagccgt caggctgaag aaagaggcct acaggtcatg gctggtctgt gggtctccgg 1500 aagcagctga ccggtacagg ttggctaagc gggctgcggc cgagacggtc gcggaggcaa 1560 aaactcgggc gtgggaggag ttcggtgagg ctatggagga agactttcgg tcggcaccga 1620 gaagattctg gcaaactgtc cggcgcctta ggggcggcag gcggcaactc gctcacaccg 1680 tgctaggtgt gcgtggggag ctgctgacgt ctcctggggc aattatccgg cggtggaagg 1740 aatacttccg ggagctcctc aatcctacca acacgtatcc ncaaggagga acagagtcgg 1800 gagaccagga ggtggaccat ccaatttccg gagcggaagt tgccgaggta gtgaaacagc 1860 tgcctggcgg cggagctccg ggagcggatg agatccgccc ggggtatctt aaggctctgg 1920 atgttgtagg gctgtcctgg ttgacacgcc tctgcaacat tgcgtggaca tcgggggcag 1980 tgcccttgga ctggcagacc ggggtggtgg tccctatttt caagagtggg gaccagaggg 2040 tgtgttccaa ctataggggg atcacactcc tcagcctccc tgggaaagtc tatgccaggg 2100 tgctggaaaa gaggattaga tcgatagtcg aacctctgat cgaggaggaa caatgcgggt 2160 ttcgccccgg tcgtggaacc acggatcagc tctttaccct tgctggggtg cttgagggga 2220 gttgggagtt tgcccaacca gtccacatgt gctttgtgga cttggaaaag gcctatgacc 2280 gggtccccag gagcatcctg tggggggtgc tccgagagta tggggtggat ggtcccttga 2340 taagggccgt ccagtccctg taccgaagga gcaggagctt ggtccggata gccggttgta 2400 agtcggactc gttcccagtg agggttggac tccgccaggg ctgccctttg tcaccggttc 2460 tgttcataac ttttatggac agaatttcta ggcgcagccg gggagtggag ggtgtcgagt 2520 tcggtgggcg gaagatctcg tcgctgcttt ttgcggatga cgtggttctt ctggcaccat 2580 cgagcaaggg cctccaacac atgctggggc ggttcgcgac cgagtgtgaa gcggcaggga 2640 tgcggatcag cacctccaag tcagaggcca tggtcctcgc tcggaaaaag gtagagtgcc 2700 ttctccgggt cggggaggag gtcctgcctc aggtggagga gttcaagtat ctcgggatct 2760 tgttcacgag tgagggtagg atggaacggg agatcgacag gcggatcgga gcggcgtcag 2820 cagtgatgcg ggcgcttaac cgatctgtcg tggtgaagaa ggagctgagc cagaaggcaa 2880 agctctcgat ttaccggtcg atctacgtcc cagtcctcac ctatggccat caacgttggg 2940 tgatgaccga aagaacgaga tcgcggatac aagcggccga aatgagtttc ctccgcaggg 3000 tggccgggct cagccttaga gatagggtga gaagctcgga catccgggag gggctcggag 3060 tagaaccgct gcttctccac atcgagagga gtcagctggg gtggctcggg catctggcca 3120 ggatgccttc cggacgcctc cctttagagg tgttccggac atgtcccacc gggaggaggc 3180 ctcgtggccg gcccaggact aggtggaggg attacatctc tcgcctggct tgggagcggc 3240 tgggggtccc cccggaggag ctgatggaag tggccgggga gagggctgtc tgggcatccc 3300 tcctgaagct gctgcccccg cgacccggat ccggataagc gggagaaaac gaaacgaaac 3360 gaaacgaaa 3369 // ID EXPANDER repbase; DNA; FUG; 3362 BP. XX AC . XX DT 13-AUG-1999 (Rel. 4.07, Created) DT 13-AUG-1999 (Rel. 4.07, Last updated, Version 1) XX DE EXPANDER is a non-LTR retrotransposon from RTE superfamily - a DE consensus sequence. XX KW RTE; Non-LTR Retrotransposon; Transposable Element; ORF1; KW reverse transcriptase; endonuclease; LINE; RTE superfamily; KW EXPANDER. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RP 1-3362 RA Kapitonov V.V. and Jurka J.; RT "EXPANDER."; RL Direct Submission to Repbase Update (31-JUL-1999). XX DR [1] (Consensus) XX CC EXPANDER is a non-LTR retrotransposon that belongs to the CC RTE superfamily. Its consensus sequence has been recovered CC from numerous (~200) GSS sequences. Putatively, EXPANDER CC constitutes ~1% of the fugu genome. CC Its copies are, on average, 92% identical with the consensus CC sequence. EXPANDER has one open reading frames encoding CC endonuclease and reverse transcriptase (ORF1: 158-3337). CC Its 3' UTR ends with several tandem copies of 4 bp-long repeat CC (GATG), analogously to other RTE-like elements. XX SQ Sequence 3362 BP; 633 A; 849 C; 1142 G; 738 T; 0 other; cccctcctgg gctaccacct tatcgtggtg gaggggtttg cgtgtcccaa tgatcctagg 60 agctccgttg tctggggctt tatgcccctg gtagggccac ccatggcaaa caggtcctag 120 gtgagggacc agacaaagcg tagcccagga ccccctaatg atgataaaca acattggtgc 180 caagtttccc ttgcccggat gcgggtcacc ggggccccct cctggagcca ggcctggggg 240 tggggcacgt tggcgagcgc ctggtggccg gacctctgcc catggagtcc ggccgggcgc 300 agcccgaaga ggcaacatgg gacccccttc ccgtgggctc accacctgca ggaggggcca 360 agggggtcgg gtgcattgtg ttttgggtag cagccggagg cagggacctt ggcggtctga 420 ttcccggctg cacaaactgg ctctagggac atggaatgtc acctctctgg tgggaaagga 480 gcctgagttg gtgcgcgagg ttgagaagtt ccgactagat atagttggcc tcacctcgac 540 gcacggcaag ggctctggaa ccagtcttct tgagaggggt tggactctct accactctgg 600 agttgccgat ggtgagaggc gacgagcagg ggtggcaatt ctggttgctc cccagctcag 660 tgcctgtgta ttggagttta ccccggtgga tgagagggta gcctcccttc gccttcgggt 720 ggggggacgg atcctgactg ttgtttgtgc ctatggtcca aacagcagtt cagcgtatcc 780 accctttttg gagtccttag agggagtgct ggagagtgcc ccttctgggg gctccctcgt 840 cctcctgggt gacttcaatg ctcacgttgg caatgacagt gtgacctgga gaggtgtgat 900 tgggaagaac ggcccccctg atctgaaccc gagtggtgtt ttgttattgg acttctgtgc 960 tcgtctcaga ttgtccataa cgaacacctt gttcagacat aaaggcgtcc acatgtgcac 1020 ttggcaccag gacgccttag gccgcagatc gatgatcgac tttgtggttg tgtcatcgga 1080 tttgcggccg catgttctgg acactcgggt gaagagaggg gcggagctgt caactgatca 1140 ccacctggtg gtgagttggc tccgatggtg gggaaggatg ccggacagac ctggcaggcc 1200 caaacgtgtt gtgagggtct gctgggaacg cctggcagag tcccctgtca gaaggagctt 1260 caactcacac ctccgggaga gctttgacca tgtcccgggg gaggcggggg acattgagtc 1320 cgagtggacc atgttccgtg cctccattgt tgaggcggct gaccggtgct gtggccgcaa 1380 ggtggttggt gcctgtcgtg gcggcaatgc ccgaacccgc tggtggacac cagcggtgag 1440 ggatgccgtc aagctgaaga aggagtcgta tcgggcctta ctggcctgtg ggactcctga 1500 ggcagcagat gggtaccggc gtgccaagcg gagtgcagct acggcggttg ccgaggcaaa 1560 gactcgggca tgggaagagt tcggtgaggc catggagaac gactttcgga cggcctcgaa 1620 aaggttctgg accaccatcc ggcgtctgag gagggggaag cagtgcactg tcaacactgt 1680 gtatagtggt gatggtgtgc tgctgacctc aactcgggat gttgtggatc ggtggaagga 1740 atacttcgag gacctcctca atcccaccaa cacgccttcc agtgaggaag tagggcctgg 1800 ggacctggag atgggctctc gtatctccgg ggctgaagtt gccgaggtag ttaaaaaact 1860 cctcggtggc aaggccccgg gggtggatga gatccgccca gagtccctta aggctctgga 1920 tgttgtaggg ctgtcttggt tgactcgact ctgcaacatc gcgtggacat cgggggcagt 1980 gcccctggat tggcagaccg gggtggtagt ccctcttttt aagaaggggg accggagggt 2040 gtgttccaac tataggggga tcacactcct cagcctccct gggaaggtct attcaagggt 2100 actggagagg aggatccgcc ggatagtcga acctcggatt caggaggaac aatgtggttt 2160 tcgtcctggg cgtggaacag tggaccagct ctacaccctc agcagggtct ttgagggtgc 2220 atgggagttt gcccaaccag tccacatgtg ttttgtggac ttggagaagg cattcgaccg 2280 tgtccctcgg ggggtcctgt ggggggtcct ccgagagtat ggggtgtcgg gcccgctgat 2340 acgggccgtc cgctccctgt acgatcggtg ccagagtttg gtccgaattg ctggcagtaa 2400 gtcgaactcg tttccggtga gggttggact ccgccagggc tgccctttgt caccgattct 2460 gttcataatt tttatggaca gaatttctag gtgcagtcat ggtgtggagg gggtccggtt 2520 tggtgacctc aggatctcgt ctctgctttt tgcggatgat gtggtcctgt tggcttcatc 2580 ggcccgtgac ctccaactat cactggatcg gttcgccgcc gcctgtgaag cggctgggat 2640 gagaatcagc acctccaaat ccgaggccat ggttctcaac cggaaaaagg tggagtgcct 2700 tctccgggta aaggaggaga tcctgcccca agtggaggag ttcaagtacc tcggggtctt 2760 gttcacgagt gagggaagaa tggagcggga gatcgacagg cggatcggtg cggcgtccgc 2820 agtaatgcgg actctgcacc ggtccgtagt ggtgaagaga gagctgagcc gaaaggcgaa 2880 gctctcgatt taccggtcga tcttcgttcc taccctcacc tatggtcatg agctttgggt 2940 aatgaccgaa agaacaagat cacgggtaca agcggctgaa atgagcttcc tccgtagggt 3000 ggctgggctc tcccttagag atagggtgag aagctctgcc atccgggagg agctcggagt 3060 agagccgctg ctcctccgcg ttgagaggag ccagatgagg tggcttgggc atctagttag 3120 gatgccccct ggacgcctcc ctggtgaggt gttcagggca tgtccctccg gtaggagacc 3180 cccgggaaga cccaggacac gttggagaga ctatgtctct cgactggcct gggaacgcct 3240 ggggatcccc ccggatgagc tggaagaagt agctggggag agggaagtct gggcttctct 3300 ccttaggctg ctgcccccgc gacccgaccc cggataagcg gtagaggatg gatggatgga 3360 tg 3362 // ID LINE_FR repbase; DNA; FUG; 4585 BP. XX AC AF086712; XX DT 20-JUL-1999 (Rel. 3, Created) DT 20-JUL-1999 (Rel. 3, Last updated, Version 1) XX DE Fugu rubripes CR1-like LINE retrotransposon - complete cds. XX KW LINE_FR. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RP 1-4585 RA Poulter R., Butler M. and Ormandy J.; RT "A LINE element from the pufferfish (fugu) Fugu rubripes which RT shows similarity to the CR1 family of non-LTR retrotransposons."; RL Gene 227(2), 169-179 (1999). XX RN [2] RP 1-4585 RA Poulter T.R., Butler I.M. and Ormandy J.; RT "Direct Submission."; RL Direct Submission to Repbase Update (24-AUG-1998)Biochemistry, RL University of Otago, Cumberland Street, Dunedin, New Zealand. XX DR GenBank; AF086712; Positions 1 4585. XX CC Putative nucleic acid binding protein and reverse CC transcriptase-like CC protein genes. XX SQ Sequence 4585 BP; 1202 A; 1083 C; 995 G; 1305 T; 0 other; gtattgcgtc acttcctgtc ttctcaatcg ttcgttggtt gcactgtgca tggggtgttg 60 tattcactta atattgaaca cttggtgtgt tgtattcact taatattgaa cacttggtgt 120 gttgtattca ctttacttaa aattttgaac acttgggaca ttctagtcac ctgataacaa 180 cagtgagaaa caacccatat taaacaaata cagggctcca ccgattatca gcgttagcat 240 ggcctcaccg tctgtttcac ccggtgtctt tgtctgttca gcgtgtgaaa tgtttagtta 300 ctcctctgcc tcctttagtg aagggaatag gtgcagaaag tgtagtttat ttatggctat 360 ggaggcgaga cttagcgagc ttgagacgcg gttccgcagc ttggagttag ctggagttgc 420 gtcaggtagc caggagaagc tagctgctgc ggagccgcct agcgtagcta cagctagcgg 480 tcccccggca gcagccgagc agccggctag ccagggcggc tgggtgacgg ttcgtaggaa 540 gcgtagccca aaacaaaggc ccacggtgca ccaccaaccg cttcccgtgg ctaaccgctt 600 ttccccactc ggcgacacac ccgctgagaa accgaccctg gtaattggcg actctgtttt 660 gcgctacgtg aagccgactc cagcgaccat agttaagtgc attccggggg ccagagcggg 720 cgacatagaa gcaaatttac ggctgctggc gagacgtaat cgtaaatttg gtaaagttat 780 tattcacgtc ggagccaacg acacccggct tcgtcagtcg gaggtcacca aaattaactt 840 ggagtcggtg tgtaactacg caaaaacgat gtcggactcc gtagcattct ctggtcccct 900 ccccaatctg gccagcgagg agatgtttag ccgcatgtcg tcgcttcgtc gctggctgtc 960 acggtggtgc cccgaaaacc aggtggcctt tatagacaat tggagcactt tttggggaaa 1020 acctggtctg attaggagag acggtgtcca tcccacacgg gatggtgctt ctctcatttc 1080 tagtaatttg gctaatttta ttagacccaa agtgacctga caatccaggg tccagaccag 1140 gatgcagagt tgtagtctta cacacctctc tgctgcttcc ttagaaccct catccaccaa 1200 caataacata tttaacacta tagaggtagt ctctgttcca cggttaaaag ttcaccaagc 1260 acagagcagg ggagcggtca atcaccataa tcttattaaa attaatacca aagcacaagt 1320 tggagaaact aatatcacaa ttaagtgtgg actgttaaat attagatctc ttttgtgtaa 1380 atccctgtta gtgcacgacc tgatagcgga tcatcacatt gatttatttt gtcttactga 1440 gacctggctt caggaggagg agtatgttag cttaaatgaa tctactcctc ctacccatct 1500 taattatcat attcctcgtg ttactggtcg aggaggggga gtggcagcaa tctatcactc 1560 caagttatta attaatccca gaccaaaaca tggcttcagt tcatttgaaa gcctgactct 1620 tggcatcact catctgaact ggaggacaga aaagccactt ctgtttgtag ttgtatatcg 1680 gccccctgct gggccacatt cagagttcct gtctgagttc tctgacttct tatctgactt 1740 ggtccttaga acggacaaag tcattatcat tggagacttt aacatccata tggacgttat 1800 aaatgacagc tttagaaatg gcttcatttc attacttgag tcagttggtt tcctccagca 1860 gataaaccaa ccaactcaca gctttaacca caccctagat ctagttctga cttatggtgt 1920 tgaggtagaa catgtgtcag tgttccctca gaaccccctc ctgtcagacc attctttgat 1980 cacttttaca tttatgatta aggattcttc tatgctcaga acacagtctt actatagcag 2040 atgtctttca gataatgctg tagctaagtt taaggaagcg atccctgtgc tgatcccagg 2100 accaccgtgt gtttccccag ggatcaatca ttacaatctt agccctgctg aggttgactc 2160 tattgctgaa ggtgcagcaa cctcactgag aatcacgctt gattctgttg cccccctgaa 2220 aaagaaaata gtaaatcaga ggaggtgtgc cccctggtat aattcacata tcaggaccct 2280 caagcagaaa gtgcgaagac tggaaaggaa gtggcattct tgtaaaatag acagctacca 2340 tgtagcctgg aaagactgtc tattagttta caaaaaggcc cttcgcaagg ctagaacagc 2400 ttatttttct tctttgattg aggaaaataa gagcaacccc aggtttcttt tcagcactgt 2460 ggccaaatta actaagagtc acagtgtttt agatccacgt atcccttctt cccttagtgg 2520 tgaagacttc atgagcttct tcactgataa agttctagct atcagagaaa aagctaacca 2580 ggccatccca acaactggac catcaccaga tgtgctgact gtgggaacat acagggtctc 2640 caacgagccc ttaaactcct tcagccctat atatttttct gaggcgtcat cgctaattca 2700 gaaatccaag accaccacgt gtcttttaga tcccatccca acacacctgt tgaaggatgt 2760 tttaccattg ataggcagtt ctatcctgga ccagatcaat ggttctttag tgtcaggtta 2820 tgtaccccgg tcctacaagg tggcagtgat taagccgttg cttaaaaaac catcactgga 2880 tcctgatgtc ttagcaaatt ataggccaat atccaacctt ccttttatct ctaaagttct 2940 agagaaggtg gtggtgactc agttactgga gcacctgcag aggaacagcc tgtttgagat 3000 gtttcagtca ggctttagag ctcaccacag cacagaaaca gcacttctta aagtcactaa 3060 tgatcttctc atagcttccg atcatggact ggtctctatg ctggttctgc tggacctcag 3120 tgctgctttt gatacagttg atcacagcat cctgttacag agactggaac atgtgattgg 3180 gattaaaggg acagcactag actggtttag atcatattta tctgatagat accagtttgc 3240 tcatgtccat ggtgttccct cctcatacag tagggttagc catggagttc ctcaaggttc 3300 tgtacttgga ccaatcctct tcaccttgta catgcttccc ttagggaaca ttattcggca 3360 gcatgggata aattttcatt gttatgctga tgacactcag ctttatttat ccatgaaacc 3420 agaggagaca gagaagttag tgaagctcca gacctgtctt aaagacataa agtcctggat 3480 gtcttcaaat ttcctcctcc ttaacccagg aaaaactgag gtcatggtgt ttggtcctga 3540 acctctcagg gatagattag atcacatgat cactctagat ggtatctcat taacatctag 3600 tctctctgtg aggaatctag gagtaacttt tgatcaaaat ctctccttca actcacacat 3660 taaattagtc tctagaagtg ccttttttca cttgaggaac atcacaaaga tcaggaaact 3720 actgacacgg catgatgctg aaaagttagt ccatgcattt gttacttcca ggctggacta 3780 ttgtaattct ttattatcag ggtgtccaaa caactcttta agaagcctcc aggtgatcca 3840 aaatgctgca gccagagttc tgacaggtat tgacaaaaga gatcacataa ctcctgtact 3900 ggcgtctctt cattggctgc ccattaaatt tagaataatt tttaaaactc ttcttctgac 3960 ctacaaggtc ctcagatgcc tagctccatc ctacctggaa gagctagtga caccttatca 4020 gcccaataga ccgctccgct ctcagaatgc tggtctactt gtggttccca gagtttctag 4080 ggttagaatg gggggccgag catttagcta ccaggccccc ctgctatgga accagctccc 4140 tgtccaggta cgggaggctg actccatcgc tacttttaag atcagactta aaacctacct 4200 ctttgaaaaa gcttattgtt actaattctg gagttccagt tactatcata gacagacaaa 4260 taatcatact tagggggtcg tctaatcatt aggttaacat cttagtcatg ctgttatagg 4320 ccaaggctgc cggggtccag aaacatgatc acctgacagg cctctgtcac cccactgggt 4380 catggtttcc tctttaccca gccggccatc agcaggaggg tccccctaca tgagcctggt 4440 cctgctcaag gtttcttcct gttaaagggg agtttttcct tgccactgtt gcttgtctgg 4500 ggtcaggccc tgggattctg gaaagcgcct tgaaacaatt ttgattgtat aagacgctat 4560 ataaataaag attgattgat tgatt 4585 // ID SENKUSHA1 repbase; DNA; FUG; 2224 BP. XX AC . XX DT 08-FEB-2002 (Rel. 7.01, Created) DT 08-FEB-2002 (Rel. 7.01, Last updated, Version 1) XX DE DNA transposon; Harbinger/PIF-IS5 class; SENKUSHA1. XX KW Harbinger; DNA transposon; Transposable Element; KW Harbinger/PIF-IS5 class; SENKUSHA1. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RA Smit A.F.; RT "Initial survey of interspersed repeats in Takifugu rubripes."; RL Repbase Reports 2(1), 45-45 (2002). XX DR [1] (Consensus) XX CC SENKUSHA1 is the first Harbinger type DNA transposon reported in CC vertebrates. SENKUSHA means harbinger/forerunner in Japanese. CC This consensus may still contain internal deletions. The ORF at CC pos 796-1716 encodes a protein (fragment) that has 48% CC similarity to the Harbinger, and somewhat lower similarity to CC the CANDYSTRIPE and TURMOIL transposases. CC Copies are about 5% diverged from the consensus. The Fugu genome CC contains multiple closely related elements (e.g. SENKUSHA2). XX SQ Sequence 2224 BP; 638 A; 476 C; 466 G; 622 T; 22 other; ggctgtttcc gaaaccacac actcgttccc tattcactac tcactacatg ggggacatgg 60 attgagtgaa ctacgtagcg cactcaattt aaagttagca ttcggacact acggcgtcat 120 ttacggcgca cgtaaagtga cgtcatggtg tcgcataata attacgaacc ggtcgccggg 180 aaaagtggcc aggtccattt aattatttta acccatccga aacaacattc agcggtcgtt 240 ttatgaaaat gcaatatttt accctataat taactttata taataaaata aaatattaat 300 gtcaataata atanaataat aattacttat tacctaaaat aattagtgtc ccttgacatt 360 ttcaagccaa gacgtgatgg cacattctca agtcatattc cgctgtagtg aaaacattta 420 ataaagcaat ttttcatcaa gaaatggatc cagagctact tttcatcttt gtaaatattc 480 ttttactgag atcctgtcgc ctggtgagga acagacgatt ccgaagaaca attttaagtg 540 aggggcgggg gatcagtgaa gattgattga tttttttttt tttacacatt tatgttgtaa 600 tattttaaga tgatcatatt ggacccctac tgaaaanaat tgtaaactac acattgattt 660 cttatttccc aaaaatttca tatatcaaac cgcacagtag aaattacgtt aaaaaaaact 720 tttttaccta cagaacganc acccatcaca ttattgtgtg aggaacaggc atgttccact 780 gctggcgatg tacatcgatg gccaagctga cctccgccca gacttccgtc tctctcggtc 840 gacggtggcc aaactcattg atgtcctgcg ctcaccgttt gaccatggct ggggtctgga 900 ggtggaggtt ttagtcntcc ttttctggct ggcaggcgca acctcctacc gtgtggtgtc 960 aagagnnttc tcaattccac gctccacagt caatgacatc gttcacnggg tctcagataa 1020 agtactggcc ctgaaaaaat gtccaattat attttctaag ctggacaagc tgacaacaat 1080 tgctgcaggc tttcagcgcc cggctggatc accagcttct gccaaagtgg tgggcagcat 1140 agatggatgc cacnttagga tcaagccccc tggtgcagat gctcagtgtt actttaacag 1200 aaaactgttt tattccattc agcttcaagc ngtgtgtgac caccagngcc tgtttatgga 1260 catnttcact ggatacccag gctctgtcca tgactcgaga gtcctgaaaa acagccctct 1320 ntatgttcag ggactgtacc cacctgaggg ttactgtatc ctaggggacg gtggttatcc 1380 ctgtatgtcc aggcctgtag cgttgatnac accatncaga gagcangtga ggcatatgat 1440 ggcggccagg ttcaacagac atcatgcaaa agctcgatct gtcatagaaa gggcnttggg 1500 gattatgaag accagatggc gggcaatatt ttncaaaacc ttggaggtga agcctgcctt 1560 tgccaccaag gttattgcat gctgcaccat tttgcacaat gtctgcctga ggaatggtga 1620 cagcatggaa ccctcagaat cccaggaaaa ggtggtcctt ctccacccag cngcggatca 1680 aactttggac gtcatcgtca ctccctaaag aaatcagttc aatcgatgag tctgcagtat 1740 gtttatatca ctccaacaat atcttaaatc tccatttaaa gtgtgacaac tctagtanag 1800 tatgattaaa atggtatgaa agagcaacga gctaactaat gcttataatc gcataaattc 1860 taatgtcttt aaggcaaaaa catattanac aggaagcanc aatgtgtcac taaagtgaaa 1920 gaatttgacg ctttagtatg aaatgtgaaa caaacagcgg acaatacata catttgtaag 1980 tcatctctcc ttgtttaccc gccattgata gattgtcacg tgataccgta aaggctgatg 2040 ggatacattt tccgagttag ggtgcatcgg ttgtacacta cttttcgcag tgcattgtgg 2100 gatacattga gtgcactaca tagggtacag cgatgctcac taacanttcg gacactattt 2160 caaaatggcg tccacactat tgagtgcact atgtagggta tagggggtgg tttcggaaac 2220 agcc 2224 // ID L1-1_FR repbase; DNA; FUG; 5431 BP. XX AC AB111947; XX DT 28-JUN-2005 (Rel. 10.06, Created) DT 07-JUL-2005 (Rel. 10.06, Last updated, Version 1) XX DE Takifugu rubripes retrotransposon KenoFr1. XX KW L1; Non-LTR Retrotransposon; Transposable Element; KENOFR1; KW L1-1_FR. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RP 1-5431 RA Kojima K.K. and Fujiwara H.; RT "Cross-genome screening of novel sequence-specific non-LTR RT retrotransposons: various multicopy RNA genes and microsatellites RT are selected as targets."; RL Mol Biol Evol 21(2), 207-217 (2004). XX DR Genbank; AB111947; Positions 1 5431. XX FH Key Location/Qualifiers FT CDS 74..1384 FT /product="L1-1_FR_1p" FT /note="gag-like protein" FT /translation="NYLFFYKLLSSLVVPPAMSANYAGLRRHRSVRFYFKD FT EQVKISRLDFSRKFVQNLLNFQPDDINCILTLPANKGYDVSFCSAALLREF FT WTRFENVKSQFSAFAVEKLTDNAHKTVIVRMFNETVNAEDICIWLARFCTV FT KGQATKVRDEDGIWNCAWRVPIQQWEDPQGFQGLRHLPQMIVLGNNRGYIH FT YQGQPKLCRKCGEQGHLAEACPVIVCGKCRAVGHSFEECTTGRKCNLCGAT FT DHLFRDCPLSFANKLKARKKQEEKGPTENEVRTVLEGNSNLPPKPVIGGEE FT SSEVGQREGPGPHPVEADMELGRSHQADSGVRPASTDDDSTRSLHIVEEEQ FT SSTSLPDAQPSSKRTASELSNSGAESASEKRGRAESEHHSPPVDQDSGSSS FT SSSNECSFLNVALQSTPLRKRADFAFRRTDSQSLPPGCRGIFR" FT CDS 1464..4955 FT /product="L1-1_FR_2p" FT /note="reverse transcriptase" FT /translation="NVVLVILNILMTLNISTINVRSVRSRVRAQSVLSFLS FT SFKSDVFLLQECGLPFLNHYRQWEEMWPQTSLWSGSNENRNDGVAILIKNP FT QVLVKGSTVVKNGRALLTHLTFMGQDFKILNIYGFNEKNDRYDLLEDLQSH FT MLGRVPLVVGGDFNCILSRKDRRRTGEDFKVDKTSVLLQGICRDFKLQDCF FT KTMHPREEGFTWFSGDGTRASRIDYVFTRDCPATDARLTPVFFSDHLMLSC FT TLSLPSGVTSGSGLWKLNCSLLEDRELVRQYREQYKEWQTLQDFYDTRAHW FT WEMVKGRTRTFFRQAGKEKKNRETRRMMGLQKRLQRYFNLNQQGIDFNEEI FT KQVKKEMSVLSEIQSKGVILRSKEREIEEGEKCTRYFFKKIINKGGTITKL FT TKENGCTTETIDEIKETIESFYEKLYKEKHVQIDTMNEILKFLNKTVNNSV FT LLSQDFTLLELNNSLCSFKTGKSPGQDGLPVEFYLTFWDLLAPDLITVFME FT FEGLDRLPDSFRVGIVTLLHKQKDKTELKNWRPITLLNVDCKLFSRLLATR FT MSTFLEEVIDPDQACAIPGRKITDSLVLIRDTICYARDRNMRLVVLNLDFE FT KAFDRVSHQYLFQVLQKMGFPERFVAWVGLLYRDITSRILFNGHLTKAVDI FT NCGVRQGCPLSALLYVICIEPLAQILRRDKRINGVQVPGSGGLETRCILYM FT DDVNVLCTDLLSVNRTLDLTDWFGRASGSKLNKEKTRAQFHGPWTATDMTG FT LPLTVTQTDQKILGVKFDREGGGKTNWPDMVGKVRQKLGFWGLRGLTMEGK FT VLITKAVILPLLLLISSVFFPPRSVILELERAIFYFLWGSKMEKLKRAIMK FT KTKEKGGKDVPDLHLFLGARFTAIHVGIVTAPSKNPKTAAMARFWMGTYLR FT KLKIEPVNLSVPVSFNLPTAYSFIQRFLVHFNVENEELHILTNHRSLISVV FT QEREPVSPVRGLALGEPSTVWRNVNHPALPNRLRDLSWMVAHEILPVRSVM FT HSRGMAAHATCPRPGCGAPESVRHLLWECSAAKDQWAMAGSLKFPYLPARE FT VLTAQLVLYGVSPTVIPPKDFAKQWLTLAAIKDATWTSRNLLVRKHMQIPP FT VAVIRMAAAMVQGAGTAGGRPRTPPQRRIASVPIRTKEPELHSKGQGSSGL FT ALRVR" XX SQ Sequence 5431 BP; 1581 A; 1134 C; 1321 G; 1395 T; 0 other; ctgagtgttt ttcttaaccg tcttatttgt atttatcttt taagtagcac gtttcctttt 60 aacttacttt taaaattatc ttttctttta caagctttta tcctcgcttg tggtgcctcc 120 agccatgtcg gctaattatg ccggcctgcg gaggcaccgc agtgtcaggt tttattttaa 180 agatgagcaa gttaaaatat cacgtttaga cttctcacgt aagtttgtcc agaacctttt 240 gaactttcag cccgacgata taaattgtat tttgactctg ccggcaaaca aaggatatga 300 cgtaagtttt tgctctgcag ctctgctccg tgagttttgg acccgttttg aaaatgttaa 360 aagccaattt tctgcatttg ctgttgaaaa actgactgac aatgcccata aaacggttat 420 tgtcagaatg tttaatgaaa cagtgaatgc agaagacatc tgtatctggt tggctagatt 480 ctgcactgtt aaaggccagg caaccaaggt aagagacgag gatggcatct ggaactgtgc 540 ttggagggtc cccattcaac aatgggagga cccccaaggc ttccagggcc tgagacactt 600 gccgcaaatg attgttctgg gtaataacag aggctatatc cactaccagg gtcaaccaaa 660 actctgccgc aaatgcggcg agcaggggca tctggcagag gcttgtccag ttattgtgtg 720 tgggaaatgc agagcagttg gacattcctt tgaggagtgt accactggaa ggaaatgtaa 780 cctttgtgga gcaacagacc atcttttccg agactgtcct ttgtctttcg ccaacaaact 840 gaaagctagg aaaaaacagg aggaaaaggg gccaacagaa aatgaggtgc gaactgtttt 900 ggagggaaat tcaaatctcc ctccaaaacc tgtgattgga ggagaggagt ccagtgaggt 960 tgggcagagg gaggggcctg gaccccaccc agttgaagct gacatggagt tggggaggag 1020 ccatcaggca gacagtgggg ttcggcccgc ctctaccgat gatgactcca ctaggagtct 1080 ccatattgtg gaggaggagc agagctccac ttccctccca gatgcccaac ctagttcaaa 1140 aagaacagca tctgaactgt ccaactctgg ggctgaatca gcctcagaga agagggggag 1200 agctgaatct gaacatcaca gtccccctgt ggatcaggac agtggctcct cttccagttc 1260 atctaatgaa tgttcctttc taaatgtagc actacagtca actccactca gaaagcgcgc 1320 agactttgct ttcaggagga cagattctca aagcttaccc cctggatgta gggggatctt 1380 tcgatgagaa cctgttacag cacaagcaac aaggcaataa aaatgtgctt gttacttatt 1440 cttaagatac ctgcttcttt taaaatgttg tacttgtcat tttaaacata ctcatgactc 1500 tcaacatttc cactatcaat gtgagaagtg tgaggtcaag agttagagcc caaagtgttt 1560 tatccttttt aagttccttt aagtctgatg tgtttttatt acaggaatgt ggcttaccat 1620 ttttaaacca ctaccgccag tgggaggaga tgtggccaca gacatccctt tggagtggat 1680 caaatgaaaa cagaaatgat ggagtggcca ttttaattaa aaacccccag gttctggtaa 1740 aaggtagcac tgtggtgaaa aatggtcggg cgcttttaac acatttgact tttatgggac 1800 aggattttaa aatcttaaat atttatggct ttaatgaaaa aaatgacagg tatgaccttt 1860 tagaagactt gcagtcccac atgctaggta gggtaccttt agtagtaggg ggagatttta 1920 actgtatttt aagtaggaaa gataggagaa gaacagggga agattttaaa gtagacaaaa 1980 catcagtttt attacagggc atatgcaggg attttaaact tcaagactgt tttaaaacca 2040 tgcatcccag ggaggagggc ttcacctggt tcagtggtga tggcaccaga gcctcccgca 2100 tagattatgt ctttacacgg gactgcccag caactgatgc tagactaaca cctgtcttct 2160 tctccgatca cctcatgctc tcctgcaccc tttcactgcc ttcaggtgtg acatcaggaa 2220 gtggtctgtg gaaactcaac tgctccctct tggaagatag ggagttagtt agacagtaca 2280 gggagcagta caaagagtgg cagacccttc aggacttcta cgacacacga gcacactggt 2340 gggagatggt gaagggaagg acccggactt tctttaggca ggcaggtaag gaaaaaaaga 2400 atagggaaac cagacgcatg atgggactgc agaagcgact acagcgctat tttaacctta 2460 accaacaagg tattgatttt aatgaagaaa ttaaacaagt aaaaaaagaa atgtcggttt 2520 tatcagaaat tcaaagcaaa ggtgtcattt taagaagcaa ggaacgggaa attgaggagg 2580 gggaaaagtg cacaaggtat ttcttcaaga aaatcataaa caaagggggg accatcacaa 2640 aattgactaa agaaaatggg tgcacaaccg aaactataga tgaaattaaa gagacaattg 2700 aaagctttta tgaaaaattg tataaagaaa aacacgttca aattgacacc atgaatgaaa 2760 ttttaaaatt cctaaataaa acagtgaaca acagcgtgct tttatcccag gattttaccc 2820 ttttagagtt aaataattct ctatgtagtt ttaaaactgg gaagtcccct ggacaagatg 2880 gacttcctgt agaattttat ttgacttttt gggacctttt agcacctgac ttaatcactg 2940 tttttatgga atttgaagga ctcgaccgac ttcctgacag ttttagagta gggatagtga 3000 ctctccttca caaacaaaaa gacaagacag aactaaagaa ctggagacct atcactcttt 3060 taaatgttga ctgtaaactt tttagtagac ttttagcaac acgtatgtcc acgtttttag 3120 aagaggtgat tgacccggat caagcctgtg ccattccggg gaggaagatc accgacagcc 3180 tcgtactgat ccgagacacc atctgttatg cgagagacag aaacatgagg ctagtagttt 3240 taaacttaga ttttgagaaa gcctttgatc gggtctcgca ccagtacctt ttccaggtac 3300 tgcaaaaaat ggggttccca gaaagatttg tagcttgggt gggtctgctg taccgggaca 3360 tcaccagcag aattctgttt aatgggcatc tgacaaaagc agtggacatt aactgcggtg 3420 tccgtcaggg gtgtccgtta tcggccctcc tctacgttat ctgcattgaa ccactggcac 3480 agatcttgag aagggacaaa cgaatcaatg gggtgcaagt gccggggagc ggaggactgg 3540 aaacgagatg tattttatat atggacgacg tgaatgtttt atgcactgac cttttatctg 3600 ttaacaggac gctggacttg actgactggt tcggacgggc ctctgggtca aaactaaaca 3660 aagaaaagac acgagcccag tttcacggac catggacagc gacagacatg acaggactcc 3720 ccctgaccgt gacacagact gaccaaaaaa tacttggtgt taaatttgat cgggaggggg 3780 gagggaaaac aaattggccg gacatggtag ggaaagtcag acaaaaacta ggattctggg 3840 gacttagagg actgactatg gaagggaagg ttttaatcac caaagcagtg attttacctt 3900 tgcttttact gatcagttct gtttttttcc ctcccagaag tgtgatttta gaactggaga 3960 gagccatctt ttacttcctg tgggggtcca aaatggaaaa actgaagaga gcaatcatga 4020 aaaaaacaaa agaaaaagga ggaaaagacg tcccggacct gcacctgttc ttaggggcca 4080 ggttcacagc tatacatgtg ggcatagtca cagccccatc caaaaaccca aagacggcgg 4140 caatggcacg cttttggatg gggacgtacc tcagaaaact aaaaattgaa cctgtaaacc 4200 tctcggtgcc tgtgtccttt aacctaccga cggcatatag ttttatccag aggtttttag 4260 tgcattttaa tgtggaaaat gaggagttgc atattttaac taaccaccgc tctttgattt 4320 ctgttgtgca ggagcgcgag ccagtgagtc cagtgcgcgg cctcgcatta ggtgagccct 4380 caactgtttg gcgcaatgtg aaccaccctg ctctcccaaa cagactccgg gacctgtcgt 4440 ggatggtggc tcatgagatc ctcccggtca ggtccgtcat gcactcccga ggcatggcgg 4500 cacacgcaac ctgcccccgg cctggttgtg gcgcgcctga gtcggtgagg catctcctct 4560 gggagtgcag tgctgccaaa gaccagtggg caatggccgg ctccttgaaa ttcccgtact 4620 tgccagcaag ggaggtcctt acagcacaac tggtgctgta tggggtgagc ccaactgtaa 4680 taccaccgaa ggactttgcc aagcagtggc tcaccctagc cgccataaaa gacgccacat 4740 ggacctccag aaacttgctg gtaaggaagc acatgcagat cccccccgtg gcggtgatcc 4800 ggatggctgc agcaatggtt caaggggccg gaactgcggg cggcaggcct aggacaccgc 4860 cacaaagaag aatcgcctct gtgcccattc ggacgaagga gccggagctc cacagcaaag 4920 gtcaaggcag cagcggcctg gctctccggg tgcggtaggc gggaaggagc agcagggcga 4980 ggatctccgc tgaccaccaa cagtatcccg agagatcaaa tgtttggagg agcaggctga 5040 gtgagacttg gtagggactc accacgctcc tgcacaaaga tggatatgtt tgaactgatg 5100 cactcccttg tacggacttt taaagaagct ttcaaaggac acatgaacac taaatgtaac 5160 acctttaacc aatggaaatg gctattttac ctacaagact gttttaaaaa aattattcta 5220 aatgtatata tctttttctt aaaaatggtt ttatagtcct cacaatttgg atattttatt 5280 ggtatatatt gaggtgtata tgtgaacctt ttactcaaag tttgaaaatg tctcaactat 5340 ttaatatttt attggcatat ttgaggtgta tatgtgaacc ttttactcaa agccagaaaa 5400 tgtaaagatg attgagtgcc aataaactaa g 5431 // ID DONG_FR repbase; DNA; FUG; 3621 BP. XX AC . XX DT 08-FEB-2002 (Rel. 7.01, Created) DT 08-FEB-2002 (Rel. 7.01, Last updated, Version 1) XX DE Non-LTR retrotransposon; site-specific LINE; R4/Dong superfamily; DE REX6; DONG_FR. XX KW R4; Non-LTR Retrotransposon; Transposable Element; KW site-specific LINE; R4/Dong superfamily; REX6; DONG_FR. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RA Volff N.J., Korting C., Froschauer A., Sweeney K. and Schartl M.; RT "Non-LTR retrotransposons encoding a restriction enzyme-like RT endonuclease in vertebrates."; RL J Mol Evol 52(4), 351-360 (2001). XX RN [2] RA Smit A.F.; RT "Initial survey of interspersed repeats in Takifugu rubripes."; RL Direct Submission to Repbase Update (28-NOV-2001). XX DR [2] (Consensus) XX CC This is the first site-specific LINE reported in vertebrates. CC The consensus has a full length ORF from bp 130-3540. CC Comparison of the encoded product with figure 1 of reference [1] CC showed DONG_FR and REX6 to be the same elements. CC On average, copies are 5% diverged from the consensus, but some CC are >99% identical to consensus. XX SQ Sequence 3621 BP; 1125 A; 919 C; 973 G; 596 T; 8 other; ttctatgcgc cttatgcgac tggataggcc agtggtttac gccgctgact ttggtgcgga 60 aggttgtcgg ttcgaatcca ggcgagccct taggcaaggc tccttacgca tatatgccta 120 cacctcggta tgagcggaac mwtaactgat agagtcatac cggctcggac gtcgcccggg 180 tcaacaaggt ccgcgtcagg ggtcggggaa ccagggccac ctgatgtcaa attggctact 240 ggaacaaggc attcatggag cagggcagag aacgtggtac tgatggagtg ttactacggg 300 agtaacccca gcgagagagg gtacatgcag aggatgtggg agaaatgggt gcttcgaaac 360 cccacatcct cactgactaa gaagcagctc ttagctcagt gttcgaatat ccgcaataag 420 aagctgctat cacagctaga gattgacgag gcacggcgnt gtgcctcacc cacggtacaa 480 atatgctacg gcaaggggga gccaggacgc caggtcagcn tgggggtgat ttcatcatca 540 ccccccaata tcgagattgg gtacaaagcc ccaatgacag acggactcgg cacaagggca 600 gctgacctga gagagagaat catgaagtcc tgggggaact caactacctc cctgccaagg 660 ctaacacaca aagtaccaga ccaatctcta ctggaggaca tgaacacggc actgtcaacc 720 atccctacca ctaccatcac tgagaccaat cagctgatgt atgcagcggc aacagtaatc 780 ctacagatgc ttggctataa gatgaagagc atgaacagcc agaaggagca aatggcccca 840 tggaggagaa ggctagaggc aaaaatcatg gcgacacgga gagaagtcag ccttctaaca 900 gagctgagta gaggcgtgaa tctaaggaca gagcngccca agaaatataa caaactgtcc 960 acaactgagg cactggagac tgctaagcaa aggctcacag ccctggctac ccgactaaag 1020 aggtacacaa gagaagtaga ggcaaggaga ataaacaagg tgttctccac caatccagcn 1080 aaggtctact ctcaatggca gggcaacaag atgacaacag acccccccag ggctgagact 1140 gagcaatact ggaagagtat ctgggagaaa gaggcaacgc acaacactan tgcccaatgg 1200 ctgcaagacc tacagacaga gcacagccaa cttccagaac aagacccagt agtcatcacc 1260 ttagcagaca tccaaacaag agtgtccaaa atgaagagct ggacagcacc agggcccgat 1320 aagatccacg cctactggct taagaagctg actgcactcc atgaacgcct ggcagcacag 1380 atgaaccagc tgctaacatc agggaaccac ccagagtggc taacccaggg ccggacagtc 1440 ctcataatga aggaccccca gaagggcaca ataccatcca actaccggcc cataacctgc 1500 ctcagcacca catggaagct cctatcaggc atcatagcgg ctaagatcag taggcacatg 1560 gatcaataca tgagcagagc acagaaaggc ataggcaaca acaccagagg tgccaagcac 1620 cagctactgg tcgacagggc aatcgcccag gactgtagga cgcggcacac caacctgtgc 1680 actgcctgga ttgattacaa gaaagcctat gactcaatgc cgcacacatg gatactggag 1740 tgcctaaagc tgtataacat caacaggaca ctaagagagt tcatccagaa ctccatgaag 1800 ctgtggaaca cgactctgga ggccaactca aagccaattg cgcgggtgag catcagatgc 1860 ggcatatatc aaggagatgc cctgtccccc ctgctgttct gcataggcct aaaccccctc 1920 agccagatca tcacaaagag tggctatggg taccagttcc gaagtggaac aaccgtcagc 1980 cacctcctct acatggatga catcaagctg tatgccaaga acgagcgtga catcgactcc 2040 ctgattcacc tcactaggat ctacagcaaa gacatcggga tgtcattcgg actagataaa 2100 tgtgggcgga tgatatctag aagaggaaag gtgatcgcaa ctgatggggt tgaactacct 2160 gaagggaaca tcacagatgt gcaggacagt tacaaatacc tggggatccc gcaggcaaat 2220 ggtaaccatg aggaggcagc taggaggtca gccacagcca aatacctaca gaggttaagg 2280 caggtcctga aaagtcagct gaatggtaag aataagatcc aggccataaa cacctacgcc 2340 ctgccagtaa tcagataccc tgctggcata ataccctggc cactggaaga gatacaagcc 2400 actgacatca agacaaggaa gctgaatggt aagcataaga tccaggccat aaacacctac 2460 gccctgccag taatcagata ccctgctggc ataataccct ggccactgga agagatacaa 2520 gccactgaca tcaagacaag gaagctcctc accatgcacg gagggtttca ccctaagtcc 2580 agcgtcctga ggctgtacac gaagcgaaag gaagggggcc gaggactagt aagtgtccga 2640 actactgtcc aggaggaaac aacaagcctc cgagaataca tcaagaagat ggcccccact 2700 gaccgactgc tgagtgaatg cctcaggcaa caaaagccca ccaaggagga ggagcctgag 2760 gggctatcat ggaaggacaa gcccctgcat ggnatgtacc accgacaaat tgaggaagtg 2820 gctgatatcg agaaaacata ccagtggctg gaaaaggccg gactgaaaga cagcacagag 2880 gcactactca tggctgcaca agaacaggcc ctgagcacca gagcaataga ggccagggtc 2940 taccatacca gacaagaccc caggtgcagg ctgtgtggag acgcccctga gacagtccag 3000 cacatcacag cagggtgcaa gatgttggca ggcaaggcat acatggagcg gcataaccag 3060 gtggctggca tagtgtacag gaacatctgc actgagtatg gactggaggt cccagggtcc 3120 aggtgggaga cacccccgaa agtgctggag aacaagcagg ccaagatcct gtgggacttc 3180 cagatccaga ctgacaagat ggtggtggcc aaccagcctg acatagtggt ggtggataaa 3240 caccagaaga cagtggtggt gatagatgta gcaatcccaa gtgatagcaa catcaggaag 3300 aaggaacacg agaagctgga gaagtaccaa gggctgaagg aggagatgga gagaatgtgg 3360 ggcatgaagg caacagtggt cccagtagtg attgggacac taggggcagt aacacccaag 3420 ctgagtagat ggctccaaca gataccagga accacatcag agatctctgt ccagaagagc 3480 gcagtcctag gaacagctaa gatcctgcgc agaaccctca gactcccagg cctctggtag 3540 aggacccgag tctgaaggaa ggaggcaccg cccaggaggg cgaggaagag attttttttt 3600 atatatatat atatatatat a 3621 // ID BRIDGE1_FR repbase; DNA; FUG; 4209 BP. XX AC . XX DT 28-JUL-1999 (Rel. 3.1, Created) DT 09-AUG-1999 (Rel. 3.1, Last updated, Version 2) XX DE BRIDGE1_FR is a non-LTR retrotransposon - a consensus sequence. XX KW Non-LTR retrotransposon; LINE; reverse transcriptase; ORF2; KW BRIDGE superfamily; ORF1; BRIDGE1_FR; ORF3. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RP 2713-4135 RA Kapitonov V.V. and Jurka J.; RT "BRIDGE1_FR."; RL Direct Submission to Repbase Update (JUL-1999). XX RN [2] RP 1-4209 RA Kapitonov V.V. and Jurka J.; RT "BRIDGE1_FR."; RL Direct Submission to Repbase Update (AUG-1999). XX DR [1] (Consensus) XX CC BRIDGE1_FR is a non-LTR retrotransposon that belongs to the CC BRIDGE superfamily. Its consensus sequence has been recovered CC from numerous (~120) GSS sequences. CC BRIDGE1_FR is a young retrotransposon since its CC copies are, on average, 95% identical with the consensus CC sequence. BRIDGE1_FR has three overlaped open reading frames: CC ORF1 (position 684-1556), ORF2 (1307-2512 or 1466-2512) and CC ORF3 (2455-4206). ORF3 encodes reverse transcriptase similar CC to these encoded by the Penelope non-LTR retrotransposon in CC drosophila and BRIDGE2_FR in fugu. XX SQ Sequence 4209 BP; 1345 A; 849 C; 902 G; 1113 T; 0 other; gggggaattg ggttcccccg cgtggcggcc gtttagaaat agtgttcccc cggggtgcag 60 ggatttgata cccctaatct taccctaacc ctaaatccta acccaaagat taaatattac 120 atttgatatc aaactaattt aatttggcat ttagggggtt aaaatggtag ggtgaaaaaa 180 tgaaaaaaga aaaaacatcg cgtatatagt gttttttaaa aaaaagcgat gccacagcag 240 acactatata agggactcga gttcgcaaaa agggaccatt tgcttttcaa ccagcgtatg 300 gaacacatcg tccacagcca atacagcgag ccagcttttg gttatttagt gtgccctacc 360 tgaagaagac ccataggggg tcgaaacgtc gttttttggc acactcttcc ttagttattt 420 aatacattta ttgatttttg gttttatttt acttaccttt taagaaaagg gagaaattaa 480 aagaaaaaaa tacttaccct taaaaaaata aaaaaaaaaa caaaataaac aaaaaaaaca 540 ttaaaatact tacctttttc ctttcctatt ttttggaaag cgcttttatt tcaaaaaact 600 ttctaaaata aataactttc ctattttttg gaaagcactt ttagttcaaa aaactttcta 660 aaataaataa cttaccttga tttatggagc cgggttggac ggaggtgcgc tatggcaggg 720 gtcgccagcg tgcccatcgt caacaacaag gatttcaggg ttatgggagg tcggttggga 780 ggaaggaccg tgctttccct gtctcgtatg ggaggcaggg tacattcccc taccctatcc 840 ggccagtgct tcctccacgg atcgcccatt accctgttcc tcaacagcgt acatatgcag 900 cagtggttcg gcaagggtac ccgcaagcat cccaaagacg ggtgtttcca agggccagta 960 ggccgggggt ccaataccaa taccaatacc aaccagctga cccccagttt gggaaactgg 1020 tacgcaaact acattcagtg atcaaaacag tccaccacct acaaaatgtg gctccaaagg 1080 aggggaaacc tgaaccccgg atgatttctc ggatggtgga tatcctgaca gatatgatta 1140 agcctgctgc tcccacacag cgcacaaaag acttgatttc tgggaatgct aaaaattggg 1200 gttataatac ctacctgatt ttgatggagc actacgaagc aaacctagca gaatatctgg 1260 atgaagtatc tggcctcctc acgccagaat ggagggcagc ctttgaggtg gctgttcgtt 1320 gggctcgacg aaacttacct cgtattacac gggacgagat agaccatgcc caggcattgg 1380 taacagctcg agtggacact atggtcccgg cagtggttca ggtcccgcag cagaccacag 1440 tcagtacaga cacgagggac ctggcatgtg ggcattcacc acgacagact gttgtcacag 1500 ctgcaaccca gtgttgtgga cagtgtatag ctcaagtggt tctagggacg agatagaagg 1560 aagggctgag actccattac agccgacaac caagagagcg gtcgttacta tgactgactc 1620 tactggccag cattcggact ggcactttga tcccccatac gtagagcctc ctcgggagct 1680 tagagtggga cgtaagtgcc taaggaagac caaacatttt gtcccaaatg aggacgactt 1740 actacaggat gacccagaag atttgatggg gagggggaca cccgttggag acgccacact 1800 cctggaccta agcggtcttt ttgtggaatc tcaaaatgag gaagctgagg tcgggaccgc 1860 tagacacgag atggttgttc aggctccccg agggccctcc ttatttgagg aggatagctt 1920 tgaggactcc ttcgaccatt tcgtagaggc gggccctcca atgtttcagg tgttgagaca 1980 ccctaatacc caaaggaaat tgattgattg ggacttggtg gttagaaaga aatggttgat 2040 cattggggat tcaaatctct gcagtttccc agattttgcc atcaagaacc tacagattga 2100 tagttatcca ggggcccact tccgccatgc acaggctctt attgaaaaaa ctgaaccacc 2160 tttagatttg gtagtggaaa agattgtact gtcctttggg atcaacagta gggggaatag 2220 ctcgaaggga actacggtaa aaaacgtcca ggctgccctt cgggttgcaa agaaaaaatt 2280 tccctatgca gagatgtggg ttcctctgct caatttctcg aataatcttc ctgaggagga 2340 aagggagaac ctccaacttc tcaatcacca cctggagagg aacatgccct tcattcctct 2400 tctgccggga agattctttc gcacagggac agataatgtt cattggacag ttgaaacagg 2460 aagggctatg tttcaccatt ggatggagtt tttaaacttc accacccctt gagtctggaa 2520 cagagaggga tagatggacc acagtcctct cattctcagg aagtggttgt tctagctaaa 2580 aatttcaaac tttttagacc acaacaggac ctgttaagca aagggttgtc gttcattcct 2640 tcttggaacg taggcaggga tcaaaaggta cagctccaat tggacattca aaattatcac 2700 cgtaagatta aattggcaac ttattacaga gattcaacta aaaaactata ttcaaagttt 2760 attggaacct cagattggac accaccattg ggggaattgc caccggtaat tggagaattg 2820 attagaaaag atctggaaac atttaaaaat cattttattt cagtcaagga aaatctaaat 2880 atttcattac cagaaataaa agcattacag gagctacaac ataaaaagaa tatagttatt 2940 aaaccagcag acaaagggtc tgcaattgtc attctggacc gggaacaata tatttttgaa 3000 gtagagcgac aacttaatga cacggaatat tataaaaagt tagaaaaacc aatttattta 3060 gatacaattc ccatggtagc tacaatttta gacacattaa agagaaagaa attcatttgc 3120 gaaaaacaaa ggaagtattt agttggggat gtagaaccta gggaaaggag attctatatt 3180 cttcccaaaa ttcacaagga gccagaaaag tggacaattc catatgaact gcccccggga 3240 aggccaatcg tctctgactg tgggagcgag acatatttca ctgcggaatt cttagatttt 3300 tacctgaatc cgctgtccac aaaacatcct gcctttgtca gagacactta tcatttcata 3360 gagattataa aaaccttgaa gattcctgca gattcctatt tcttctctat ggatgtggat 3420 agtctatata ctaatatccc cattgaatct ggcatcaact gtattagaaa tatatttgaa 3480 aaatatcctg accccaagag accagatgaa gagctcttac aattattaga aataaatttg 3540 agcagaaatg atttcatatt taatggtaaa tactatttac aaaccaaagg aactgctatg 3600 ggcaaaaggt ttgccccggc atatgcgaat attttcatgg caaactggga agaagaggca 3660 ctattgaaat gtaaacataa accagctcac tatctacggt atttagatga tgtttgggga 3720 atatggacag gatctgaact acagtttagg gaatttgttg agaccttaaa ttcccacgat 3780 ccctctatta aacttaaaac agaaattaat aaacattcaa ttaatttttt agatacaaca 3840 gttttcaaag ggccagattt tctaaaaact gccaaattgg acattaaagt atattttaaa 3900 agcactgata cccatgcatt gcttcataaa acaagttttc atccaaaaca tacatttaaa 3960 ggcatagtaa aatctcaaat tattagattt aagcgaattt gcacaaggga tgaggaattc 4020 aaggaagcag taaatgtttt atttaaggcc cttcgtaaaa gagggtattc caaacccttt 4080 ttgaaacact gctttaaatc tgttcaaaaa agaaaaaatg ggggagatcg tggggatttg 4140 atccctctta ttacaacata ttcttcggtt aataaatctc taaacatagg tttgaaaaca 4200 atttgaaaa 4209 // ID TC1_FR1 repbase; DNA; FUG; 1220 BP. XX AC . XX DT 08-FEB-2002 (Rel. 7.01, Created) DT 08-FEB-2002 (Rel. 7.01, Last updated, Version 1) XX DE DNA transposon; Pogo-Tc1-IS630 class; Tc1-family; TC1_FR1. XX KW Mariner/Tc1; DNA transposon; Transposable Element; KW Pogo-Tc1-IS630 class; Tc1-family; TC1_FR1. XX OS Takifugu rubripes OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei; OC Acanthomorpha; Acanthopterygii; Percomorpha; Tetraodontiformes; OC Tetradontoidea; Tetraodontidae; Takifugu. XX RN [1] RA Smit A.F.; RT "Initial survey of interspersed repeats in Takifugu rubripes."; RL Repbase Reports 2(1), 54-54 (2002). XX DR [1] (Consensus) XX CC TC1_FR1 to 5 are TC1-like DNA transposon that are spread through CC the phylogenetic tree of the Tc1-family. CC Complete open reading frame from bp 148-1188. Its product is very CC (83%) similar to the transposase of an unreported Tc1 family in CC Danio rerio (not Tzf or Tdr1). CC On average, copies are 13% diverged from the consensus. XX SQ Sequence 1220 BP; 355 A; 269 C; 303 G; 290 T; 3 other; cactcaccgg ccactttatt aggtncacct gttcaattgc ttgttaacac aaatagctaa 60 tcagccaatc acatggccgc aactcaatgc atttaggcat gtagacgtgg tcaagacaac 120 ttgctgaagt tcaaaccgag catcagaatg gggaagaaag gggatttaag tgactttgaa 180 cgtggcatgg ttgttggtgc cagacgggct ggtctgagta tttcagaaac tgctgatcta 240 ctgggatttt cacgcacaac catctctagg gtttacagag aatggtccga aaaagagaaa 300 atatccagtg agcggcagtt gtgtggacga aaatgccttg ttgatgtgag aggtcagagg 360 agaatgggca gactggttcn agatgataga aaggcaacag taactcaaat aaccactcgt 420 tacaaccaag gaatgcagaa tancatctct gaacgcacaa cacgtggaac cttgaagaag 480 atgggctaca gcagcagaag accacaccgg gtgccactcc tgtcagctaa gaacaggaaa 540 gtgaggctac aattcgcaca ggctcaccaa aattggacaa tagaagattg gaaaaacgtt 600 gcctggtctg acgagtctcg atttcagctg cgacattcag atggtcgggt cagaatttgg 660 cgtaaacaac atgaaagcat ggatccatcc tgccttgtat caacggttca ggctgctgct 720 ggtggtgtaa tggtgtgggg gatattttct tggcacactt tgggcccctt agtaccaact 780 gagcatcgtt taaacgccgc agcctacctg agtattgttg ctgaccatgt ccatcccttt 840 atgaccacag tggacccatc ttctgatggc tacttccagc aggataatgc accatgtcac 900 aaagctcaga tcatctcaaa ctggtttctg gaacatgaca atgagttcac tgtactccag 960 tggcctccac agtcaccaga tctcaatcca atagagcacc tttgggatgt ggtggaacgg 1020 gagattcgca tcatagatgt gcagccgaca aatctgcagc aactgcgtga tgctatcatg 1080 tcaatatgga ccaaaatctc tgaggaatgt ttccaacacc ttgttgaaag tatgccacga 1140 agaattaagg cagttctgaa ggcaaaaggg ggtccaacct tttactagca aggtgtacct 1200 aataaagtgg ccggtgagtg 1220 //